Shared computational principles for language processing in humans and deep language models

Abstract

Departing from traditional linguistic models, advances in deep learning have resulted in a new family of predictive (autoregressive) deep language models (DLMs). Using a self-supervised next-word prediction task, these models generate appropriate linguistic responses in a given context. In the current study, nine participants listened to a 30-min podcast while their brain responses were recorded using electrocorticography (ECoG). We provide empirical evidence that the human brain and autoregressive DLMs share three fundamental computational principles as they process the same natural narrative: (1) both are engaged in continuous next-word prediction before word onset; (2) both match their pre-onset predictions to the incoming word to calculate post-onset surprise; (3) both rely on contextual embeddings to represent words in natural contexts. Together, our findings suggest that autoregressive DLMs provide a new and biologically feasible computational framework for studying the neural basis of language.

Deep language models have revolutionized natural language processing. The paper discovers three computational principles shared between deep language models and the human brain, which can transform our understanding of the neural basis of language.
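
As a rough illustration of principles (1) and (2), the sketch below predicts the next word from context and then scores the word that actually arrives by its surprise, computed as the negative log probability of that word. It is a toy bigram counter, not the deep language models or the analysis used in the study. The function names and the example sentence are assumptions made for illustration only. Principle (3), contextual embeddings, is not covered here.

```python
import math
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count how often each word follows each preceding word."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def next_word_distribution(counts, prev):
    """Pre-onset prediction: probability of each candidate next word."""
    total = sum(counts[prev].values())
    return {word: c / total for word, c in counts[prev].items()}


def surprisal_bits(counts, prev, actual):
    """Post-onset surprise: -log2 of the probability given to the actual word."""
    p = next_word_distribution(counts, prev).get(actual, 0.0)
    return math.inf if p == 0.0 else -math.log2(p)


# Hypothetical toy corpus, chosen only to make the counts easy to check.
tokens = "the dog chased the cat and the dog barked".split()
model = train_bigram(tokens)

prediction = next_word_distribution(model, "the")
print(prediction)                           # "dog" is more probable than "cat"
print(surprisal_bits(model, "the", "dog"))  # lower surprise (expected word)
print(surprisal_bits(model, "the", "cat"))  # higher surprise (less expected word)
```

A word that never followed the context in the toy corpus gets infinite surprise here. A real language model avoids this by assigning nonzero probability to every word in its vocabulary.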

Details

Title

Shared computational principles for language processing in humans and deep language models

Author

Goldstein, Ariel¹; Zada, Zaid²; Buchnik, Eliav³; Schain, Mariano³; Price, Amy²; Aubrey, Bobbi⁴; Nastase, Samuel A²; Feder, Amir³; Emanuel, Dotan³; Cohen, Alon³; Jansen, Aren³; Gazula, Harshvardhan²; Choe, Gina⁴; Rao, Aditi⁴; Kim, Catherine⁴; Casto, Colton²; Fanda, Lora⁵; Doyle, Werner⁵; Friedman, Daniel⁵; Dugan, Patricia⁵; Melloni, Lucia⁶; Reichart, Roi⁷; Devore, Sasha⁵; Flinker, Adeen⁵; Hasenfratz, Liat²; Levy, Omer⁸; Hassidim, Avinatan³; Brenner, Michael⁹; Matias, Yossi³; Norman, Kenneth A²; Devinsky, Orrin⁵; Hasson, Uri¹

¹ Princeton University, Department of Psychology and the Neuroscience Institute, Princeton, USA (GRID:grid.16750.35) (ISNI:0000 0001 2097 5006); Google Research, Mountain View, USA (GRID:grid.420451.6) (ISNI:0000 0004 0635 6729)
² Princeton University, Department of Psychology and the Neuroscience Institute, Princeton, USA (GRID:grid.16750.35) (ISNI:0000 0001 2097 5006)
³ Google Research, Mountain View, USA (GRID:grid.420451.6) (ISNI:0000 0004 0635 6729)
⁴ Princeton University, Department of Psychology and the Neuroscience Institute, Princeton, USA (GRID:grid.16750.35) (ISNI:0000 0001 2097 5006); New York University Grossman School of Medicine, New York, USA (GRID:grid.240324.3) (ISNI:0000 0001 2109 4251)
⁵ New York University Grossman School of Medicine, New York, USA (GRID:grid.240324.3) (ISNI:0000 0001 2109 4251)
⁶ Max Planck Institute for Empirical Aesthetics, Frankfurt, Germany (GRID:grid.461782.e) (ISNI:0000 0004 1795 8610)
⁷ Israel Institute of Technology, Faculty of Industrial Engineering and Management, Technion, Haifa, Israel (GRID:grid.6451.6) (ISNI:0000 0001 2110 2151)
⁸ Tel Aviv University, Blavatnik School of Computer Science, Tel Aviv, Israel (GRID:grid.12136.37) (ISNI:0000 0004 1937 0546)
⁹ Google Research, Mountain View, USA (GRID:grid.420451.6) (ISNI:0000 0004 0635 6729); Harvard University, School of Engineering and Applied Science, Cambridge, USA (GRID:grid.38142.3c) (ISNI:0000 0004 1936 754X)

Pages

369-380

Publication year

2022

Publication date

Mar 2022

Publisher

Nature Publishing Group

ISSN

1097-6256

e-ISSN

1546-1726

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1038/s41593-022-01026-4

ProQuest document ID

2637586611

© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
