Author of the publication

TensorFlow at Scale: Performance and productivity analysis of distributed training with Horovod, MLSL, and Cray PE ML.

, , , , and . Concurr. Comput. Pract. Exp., (2019)


Other publications of authors with the same name

Maximizing concave piecewise affine functions on the unitary group., , and . Optim. Lett., 10 (4): 655-665 (2016)
ASTRA-SIM: Enabling SW/HW Co-Design Exploration for Distributed DL Training Platforms., , , and . ISPASS, page 81-92. IEEE, (2020)
Extending the BT NAS parallel benchmark to exascale computing., , and . SC, page 94. IEEE/ACM, (2012)
Automatic target prediction and subtle gaze guidance for improved spatial information recall., and . SAP, page 99-106. ACM, (2015)
Sensor-based Methodological Observations for Studying Online Learning., , , , , , and . SmartLearn@IUI, page 25-30. ACM, (2017)
Exploring Shared-Memory Optimizations for an Unstructured Mesh CFD Application on Modern Parallel Systems., , , , , , , , , and . IPDPS, page 723-732. IEEE Computer Society, (2015)
High Performance Non-uniform FFT on Modern X86-based Multi-core Systems., , , , , , , , , and . IPDPS, page 449-460. IEEE Computer Society, (2012)
Automated Personalized Feedback in Introductory Java Programming MOOCs., , , and . ICDE, page 1259-1270. IEEE Computer Society, (2017)
Mystique: Enabling Accurate and Scalable Generation of Production AI Benchmarks., , , , , , , and . ISCA, page 37:1-37:13. ACM, (2023)
Themis: a network bandwidth-aware collective scheduling policy for distributed training of DL models., , , , and . ISCA, page 581-596. ACM, (2022)