Author of the publication

Continual Pre-training of Language Models

The Eleventh International Conference on Learning Representations (2023)

Other publications of authors with the same name

Energy Consumption of IT System in Cloud Data Center: Architecture, Factors and Prediction. NPC, volume 11783 of Lecture Notes in Computer Science, pages 311-315. Springer (2019)
Selecting Large Language Model to Fine-tune via Rectified Scaling Law. CoRR (2024)
An Adaptive Flow Table Adjustment Algorithm for SDN. HPCC/SmartCity/DSS, pages 1779-1784. IEEE (2019)
Dynamic service migration in ultra-dense multi-access edge computing network for high-mobility scenarios. EURASIP J. Wirel. Commun. Netw., 2020 (1): 191 (2020)
CMG: A Class-Mixed Generation Approach to Out-of-Distribution Detection. ECML/PKDD (4), volume 13716 of Lecture Notes in Computer Science, pages 502-518. Springer (2022)
Adapting a Language Model While Preserving its General Knowledge. EMNLP, pages 10177-10188. Association for Computational Linguistics (2022)
Continual Learning of Language Models. CoRR (2023)
FLatS: Principled Out-of-Distribution Detection with Feature-Based Likelihood Ratio Score. EMNLP, pages 8956-8963. Association for Computational Linguistics (2023)
Mutation Relation Extraction and Genes Network Analysis in Colon Cancer. ICSAI, pages 1085-1092. IEEE (2018)
Particle swarm optimized neural networks based local tracking control scheme of unknown nonlinear interconnected systems. Neural Networks (2021)