Author of the publication

Knowledge Distillation of Large Language Models

, , , and . (2023). arXiv:2306.08543. Comment: 20 pages, 12 figures.

Other publications of authors with the same name

EVA2.0: Investigating Open-domain Chinese Dialogue Systems with Large-scale Pre-training. , , , , , , , , , and 1 other author(s). Mach. Intell. Res., 20 (2): 207-219 (April 2023)

EVA: An Open-Domain Chinese Dialogue System with Large-Scale Generative Pre-Training. , , , , , , , , , and 4 other author(s). CoRR (2021)

Structured Prompting: Scaling In-Context Learning to 1,000 Examples. , , , , , and . CoRR (2022)

CPM: A Large-scale Generative Chinese Pre-trained Language Model. , , , , , , , , , and 15 other author(s). CoRR (2020)

Pre-Training to Learn in Context. , , , and . ACL (1), pages 4849-4870. Association for Computational Linguistics (2023)

CPM: A large-scale generative Chinese Pre-trained language model. , , , , , , , , , and 15 other author(s). AI Open (2021)

CUGE: A Chinese Language Understanding and Generation Evaluation Benchmark. , , , , , , , , , and 25 other author(s). CoRR (2021)

Towards Optimal Learning of Language Models. , , , , , and . CoRR (2024)

Train No Evil: Selective Masking for Task-Guided Pre-Training. , , , , and . EMNLP (1), pages 6966-6974. Association for Computational Linguistics (2020)

Learning Instructions with Unlabeled Data for Zero-Shot Cross-Task Generalization. , , , and . EMNLP, pages 1617-1634. Association for Computational Linguistics (2022)