eScholarship
Open Access Publications from the University of California

Different kinds of cognitive plausibility: why are transformers better than RNNs at predicting N400 amplitude?

Creative Commons 'BY' version 4.0 license
Abstract

Despite being designed for performance rather than cognitive plausibility, transformer language models have been found to be better at predicting metrics used to assess human language comprehension than language models with other architectures, such as recurrent neural networks. Based on how well they predict the N400, a neural signal associated with processing difficulty, we propose and provide evidence for one possible explanation: their predictions are affected by the preceding context in a way analogous to the effect of semantic facilitation in humans.
