Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Zero-CPU Collection with Direct Telemetry Access., , , , , , and . CoRR, (2021)Zero-CPU Collection with Direct Telemetry Access., , , , , , and . HotNets, page 108-115. ACM, (2021)Direct Telemetry Access., , , , , , and . CoRR, (2022)Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models., , , , , , and . CoRR, (2024)Optimal Kernel Orchestration for Tensor Programs with Korch., , , , , , , , , and 1 other author(s). ASPLOS (3), page 755-769. ACM, (2024)SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification., , , , , , , , , and . CoRR, (2023)FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning., , , , , and . CoRR, (2024)Direct Telemetry Access., , , , , and . SIGCOMM, page 832-849. ACM, (2023)Towards Efficient Generative Large Language Model Serving: A Survey from Algorithms to Systems., , , , , , and . CoRR, (2023)SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification., , , , , , , , , and 5 other author(s). ASPLOS (3), page 932-949. ACM, (2024)