Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

A. Power, Y. Burda, H. Edwards, I. Babuschkin, and V. Misra. (2022)cite arxiv:2201.02177Comment: Correspondence to alethea@openai.com. Code available at: https://github.com/openai/grok.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Manoj Misra

Sidhant Misra

Ashok Misra

Pranob Misra

Hara Prasanna Misra

Other publications of authors with the same name

Vulnerability Analysis of High Dimensional Complex Systems.V. Misra, D. Harmon, and Y. Bar-Yam. SSS, volume 6366 of Lecture Notes in Computer Science, page 560-572. Springer, (2010)Evaluating Large Language Models Trained on Code.M. Chen, J. Tworek, H. Jun, Q. Yuan, H. de Oliveira Pinto, J. Kaplan, H. Edwards, Y. Burda, N. Joseph, G. Brockman and 48 other author(s). CoRR, (2021)Solving Quantitative Reasoning Problems with Language Models.A. Lewkowycz, A. Andreassen, D. Dohan, E. Dyer, H. Michalewski, V. Ramasesh, A. Slone, C. Anil, I. Schlag, T. Gutman-Solo and 4 other author(s). NeurIPS, (2022)Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language modelsA. Srivastava, A. Rastogi, A. Rao, A. Shoeb, A. Abid, A. Fisch, A. Brown, A. Santoro, A. Gupta, A. Garriga-Alonso and 441 other author(s). (2022)cite arxiv:2206.04615Comment: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench.Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets.A. Power, Y. Burda, H. Edwards, I. Babuschkin, and V. Misra. CoRR, (2022)Exploring Length Generalization in Large Language Models.C. Anil, Y. Wu, A. Andreassen, A. Lewkowycz, V. Misra, V. Ramasesh, A. Slone, G. Gur-Ari, E. Dyer, and B. Neyshabur. NeurIPS, (2022)PaLM: Scaling Language Modeling with Pathways.A. Chowdhery, S. Narang, J. Devlin, M. Bosma, G. Mishra, A. Roberts, P. Barham, H. Chung, C. Sutton, S. Gehrmann and 57 other author(s). J. Mach. Learn. Res., (2023)PaLM: Scaling Language Modeling with Pathways.A. Chowdhery, S. Narang, J. Devlin, M. Bosma, G. Mishra, A. Roberts, P. Barham, H. Chung, C. Sutton, S. Gehrmann and 57 other author(s). CoRR, (2022)

BibSonomy

Disambiguation of "Misra, Vedant"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

Please choose a person to relate this publication to

Manoj Misra

Sidhant Misra

Ashok Misra

Pranob Misra

Hara Prasanna Misra

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Misra, Vedant"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets

Please choose a person to relate this publication to

Manoj Misra

Sidhant Misra

Ashok Misra

Pranob Misra

Hara Prasanna Misra

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets