The relational processing limits of classic and contemporary neural network 
models of language processing

Puebla, Guillermo; Martin, Andrea E.; Doumas, Leonidas A. A.

doi:10.1080/23273798.2020.1821906

Item

ITEM ACTIONSEXPORT

Add to Basket

Local TagsRelease HistoryDetailsSummary

Released

Journal Article

The relational processing limits of classic and contemporary neural network models of language processing

MPS-Authors

/persons/resource/persons198520

Martin, Andrea E.
Language and Computation in Neural Systems, MPI for Psycholinguistics, Max Planck Society;
Donders Institute for Brain, Cognition and Behaviour, External Organizations;

External Resource

No external resources are shared

Fulltext (restricted access)

There are currently no full texts shared for your IP range.

Fulltext (public)

Puebla_Martin_Doumas_2021_Relational processing limits of....pdf
(Publisher version), 3MB

Supplementary Material (public)

Puebla_Martin_Doumas_2020suppl_Relational processing limits of classic and ...pdf
(Supplementary material), 121KB

Citation

Puebla, G., Martin, A. E., & Doumas, L. A. A. (2021). The relational processing limits of classic and contemporary neural network models of language processing. Language, Cognition and Neuroscience, 36(2), 240-254. doi:10.1080/23273798.2020.1821906.

Cite as: https://hdl.handle.net/21.11116/0000-0007-38D2-D

Abstract

Whether neural networks can capture relational knowledge is a matter of long-standing controversy. Recently, some researchers have argued that (1) classic connectionist models can handle relational structure and (2) the success of deep learning approaches to natural language processing suggests that structured representations are unnecessary to model human language. We tested the Story Gestalt model, a classic connectionist model of text comprehension, and a Sequence-to-Sequence with Attention model, a modern deep learning architecture for natural language processing. Both models were trained to answer questions about stories based on abstract thematic roles. Two simulations varied the statistical structure of new stories while keeping their relational structure intact. The performance of each model fell below chance at least under one manipulation. We argue that both models fail our tests because they can't perform dynamic binding. These results cast doubts on the suitability of traditional neural networks for explaining relational reasoning and language processing phenomena.