Author of the publication

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

, , , , and . (2022). cite arxiv:2201.02177. Comment: Correspondence to alethea@openai.com. Code available at: https://github.com/openai/grok.


Other publications of authors with the same name

Vulnerability Analysis of High Dimensional Complex Systems., , and . SSS, volume 6366 of Lecture Notes in Computer Science, pages 560-572. Springer, (2010)

Evaluating Large Language Models Trained on Code., , , , , , , , , and 48 other author(s). CoRR, (2021)

Solving Quantitative Reasoning Problems with Language Models., , , , , , , , , and 4 other author(s). NeurIPS, (2022)

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models, , , , , , , , , and 441 other author(s). (2022). cite arxiv:2206.04615. Comment: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench.

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets., , , , and . CoRR, (2022)

Exploring Length Generalization in Large Language Models., , , , , , , , , and . NeurIPS, (2022)

PaLM: Scaling Language Modeling with Pathways., , , , , , , , , and 57 other author(s). J. Mach. Learn. Res., (2023)

PaLM: Scaling Language Modeling with Pathways., , , , , , , , , and 57 other author(s). CoRR, (2022)