Author of the publication

Bert: Pre-training of deep bidirectional transformers for language understanding

, , , and . arXiv preprint arXiv:1810.04805, (2018)

Other publications of authors with the same name

PaLM: Scaling Language Modeling with Pathways., , , , , , , , , and 57 other author(s). CoRR, (2022)

Pre-Computable Multi-Layer Neural Network Language Models., , and . EMNLP, page 256-260. The Association for Computational Linguistics, (2015)

Zero-Shot Entity Linking by Reading Entity Descriptions., , , , , and . ACL (1), page 3449-3460. Association for Computational Linguistics, (2019)

Unsupervised Morphology Rivals Supervised Morphology for Arabic MT., , , , and . ACL (2), page 322-327. The Association for Computer Linguistics, (2012)

PaLM: Scaling Language Modeling with Pathways., , , , , , , , , and 57 other author(s). J. Mach. Learn. Res., (2023)

Language Model Pre-training for Hierarchical Document Representations., , , and . CoRR, (2019)

Detecting Interrogative Utterances with Recurrent Neural Networks., , and . CoRR, (2015)

PaLM 2 Technical Report., , , , , , , , , and 43 other author(s). CoRR, (2023)

Automatic Tune Set Generation for Machine Translation with Limited Indomain Data., , , , and . EAMT, page 161-168. European Association for Machine Translation, (2012)

System Combination Using Discriminative Cross-Adaptation., , , and . IJCNLP, page 667-675. The Association for Computer Linguistics, (2011)