Abstract

Departing from traditional linguistic models, advances in deep learning have resulted in a new type of predictive (autoregressive) deep language models (DLMs). Using a self-supervised next-word prediction task, these models generate appropriate linguistic responses in a given context. In the current study, nine participants listened to a 30-min podcast while their brain responses were recorded using electrocorticography (ECoG). We provide empirical evidence that the human brain and autoregressive DLMs share three fundamental computational principles as they process the same natural narrative: (1) both are engaged in continuous next-word prediction before word onset; (2) both match their pre-onset predictions to the incoming word to calculate post-onset surprise; (3) both rely on contextual embeddings to represent words in natural contexts. Together, our findings suggest that autoregressive DLMs provide a new and biologically feasible computational framework for studying the neural basis of language.

Deep language models have revolutionized natural language processing. The paper discovers three computational principles shared between deep language models and the human brain, which can transform our understanding of the neural basis of language.

Details

Title
Shared computational principles for language processing in humans and deep language models
Author
Goldstein, Ariel 1   VIAFID ORCID Logo  ; Zada Zaid 2   VIAFID ORCID Logo  ; Buchnik Eliav 3 ; Schain Mariano 3 ; Price, Amy 2   VIAFID ORCID Logo  ; Aubrey Bobbi 4 ; Nastase, Samuel A 2   VIAFID ORCID Logo  ; Feder, Amir 3 ; Dotan, Emanuel 3 ; Cohen, Alon 3 ; Jansen Aren 3 ; Gazula Harshvardhan 2 ; Choe, Gina 4 ; Rao, Aditi 4 ; Kim, Catherine 4 ; Colton, Casto 2 ; Fanda Lora 5   VIAFID ORCID Logo  ; Doyle, Werner 5 ; Friedman, Daniel 5 ; Dugan, Patricia 5 ; Melloni, Lucia 6   VIAFID ORCID Logo  ; Reichart Roi 7 ; Devore Sasha 5 ; Flinker Adeen 5 ; Hasenfratz Liat 2 ; Levy, Omer 8   VIAFID ORCID Logo  ; Avinatan, Hassidim 3 ; Brenner, Michael 9 ; Matias Yossi 3 ; Norman, Kenneth A 2   VIAFID ORCID Logo  ; Devinsky Orrin 5 ; Hasson Uri 1   VIAFID ORCID Logo 

 Princeton University, Department of Psychology and the Neuroscience Institute, Princeton, USA (GRID:grid.16750.35) (ISNI:0000 0001 2097 5006); Google Research, Mountain View, USA (GRID:grid.420451.6) (ISNI:0000 0004 0635 6729) 
 Princeton University, Department of Psychology and the Neuroscience Institute, Princeton, USA (GRID:grid.16750.35) (ISNI:0000 0001 2097 5006) 
 Google Research, Mountain View, USA (GRID:grid.420451.6) (ISNI:0000 0004 0635 6729) 
 Princeton University, Department of Psychology and the Neuroscience Institute, Princeton, USA (GRID:grid.16750.35) (ISNI:0000 0001 2097 5006); New York University Grossman School of Medicine, New York, USA (GRID:grid.240324.3) (ISNI:0000 0001 2109 4251) 
 New York University Grossman School of Medicine, New York, USA (GRID:grid.240324.3) (ISNI:0000 0001 2109 4251) 
 Max Planck Institute for Empirical Aesthetics, Frankfurt, Germany (GRID:grid.461782.e) (ISNI:0000 0004 1795 8610) 
 Israel Institute of Technology, Faculty of Industrial Engineering and Management, Technion, Haifa, Israel (GRID:grid.6451.6) (ISNI:0000000121102151) 
 Tel Aviv University, Blavatnik School of Computer Science, Tel Aviv, Israel (GRID:grid.12136.37) (ISNI:0000 0004 1937 0546) 
 Google Research, Mountain View, USA (GRID:grid.420451.6) (ISNI:0000 0004 0635 6729); Harvard University, School of Engineering and Applied Science, Cambridge, USA (GRID:grid.38142.3c) (ISNI:000000041936754X) 
Pages
369-380
Publication year
2022
Publication date
Mar 2022
Publisher
Nature Publishing Group
ISSN
10976256
e-ISSN
15461726
Source type
Scholarly Journal
Language of publication
English
ProQuest document ID
2637586611
Copyright
© The Author(s) 2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.