Author of the publication

Anatomy of high-performance matrix multiplication.

, and . ACM Trans. Math. Softw., 34 (3): 12:1-12:25 (2008)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Designing Linear Algebra Algorithms by Transformation: Mechanizing the Expert Developer., , , and . VECPAR, volume 7851 of Lecture Notes in Computer Science, page 362-378. Springer, (2012)Rapid Development of High-Performance Linear Algebra Libraries., , , , , , and . PARA, volume 3732 of Lecture Notes in Computer Science, page 376-384. Springer, (2004)A Case for Malleable Thread-Level Linear Algebra Libraries: The LU Factorization With Partial Pivoting., , , , and . IEEE Access, (2019)Scalable parallelization of FLAME code via the workqueuing model., , , and . ACM Trans. Math. Softw., 34 (2): 10:1-10:29 (2008)Using desktop computers to solve large-scale dense linear algebra problems., , , and . J. Supercomput., 58 (2): 145-150 (2011)Global Combine Algorithms for 2-D Meshes with Wormhole Routing., , , and . J. Parallel Distributed Comput., 24 (2): 191-201 (1995)Performance and Scalability of Finite Element Analysis for Distributed Parallel Computation., , and . J. Parallel Distributed Comput., 21 (2): 202-212 (1994)Collective communication: theory, practice, and experience., , , and . Concurr. Comput. Pract. Exp., 19 (13): 1749-1783 (2007)All-to-All., and . Encyclopedia of Parallel Computing, Springer, (2011)Efficient Communication Primitives on Mesh Architectures with Hardware Routing., , , and . PPSC, page 943-948. SIAM, (1993)