Revisiting the poverty of the stimulus: hierarchical generalization without a hierarchical bias in recurrent neural networks

McCoy, R. Thomas; Frank, Robert; Linzen, Tal

arXiv:1802.09091 (cs)

[Submitted on 25 Feb 2018 (v1), last revised 8 Jun 2018 (this version, v3)]

Abstract: Syntactic rules in natural language typically need to make reference to hierarchical sentence structure. However, the simple examples that language learners receive are often equally compatible with linear rules. Children consistently ignore these linear explanations and settle instead on the correct hierarchical one. This fact has motivated the proposal that the learner's hypothesis space is constrained to include only hierarchical rules. We examine this proposal using recurrent neural networks (RNNs), which are not constrained in such a way. We simulate the acquisition of question formation, a hierarchical transformation, in a fragment of English. We find that some RNN architectures tend to learn the hierarchical rule, suggesting that hierarchical cues within the language, combined with the implicit architectural biases inherent in certain RNNs, may be sufficient to induce hierarchical generalizations. The likelihood of acquiring the hierarchical generalization increased when the language included an additional cue to hierarchy in the form of subject-verb agreement, underscoring the role of cues to hierarchy in the learner's input.
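
The contrast between a linear and a hierarchical rule for question formation can be made concrete with a small sketch. This is an illustration only, not the paper's RNN simulation: the nested-list sentence encoding, the toy sentences, the auxiliary set, and the function names `move_first` and `move_main` are all assumptions introduced here. It shows that both rules agree on a simple sentence and diverge once the subject contains a relative clause with its own auxiliary.

```python
# Illustrative sketch only: toy encoding and sentences are assumptions,
# not material taken from the abstract.
AUXILIARIES = {"can", "does", "will"}


def flatten(tree):
    """Return the words of a nested-list tree in left-to-right order."""
    if isinstance(tree, str):
        return [tree]
    return [word for child in tree for word in flatten(child)]


def move_first(tree):
    """Linear rule: front the first auxiliary in the word string."""
    words = flatten(tree)
    i = next(k for k, w in enumerate(words) if w in AUXILIARIES)
    return " ".join([words[i]] + words[:i] + words[i + 1:])


def move_main(tree):
    """Hierarchical rule: front the auxiliary that is a direct child of the root."""
    i = next(
        k for k, child in enumerate(tree)
        if isinstance(child, str) and child in AUXILIARIES
    )
    rest = tree[:i] + tree[i + 1:]
    return " ".join([tree[i]] + flatten(rest))


# Root layout assumed here: [subject phrase, main auxiliary, verb phrase].
SIMPLE = [["the", "walrus"], "does", ["laugh"]]
AMBIGUOUS = [["the", "walrus", ["that", "can", "swim"]], "does", ["laugh"]]

for sentence in (SIMPLE, AMBIGUOUS):
    print(move_first(sentence), "|", move_main(sentence))
```

On the simple sentence both functions return the same question, which is why such examples do not distinguish the two rules. On the sentence with a relative clause, only `move_main` fronts the main-clause auxiliary.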

Comments:	Proceedings of the 40th Annual Conference of the Cognitive Science Society; 10 pages
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1802.09091 [cs.CL]
	(or arXiv:1802.09091v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1802.09091

Submission history

From: Tom McCoy
[v1] Sun, 25 Feb 2018 21:52:37 UTC (186 KB)
[v2] Wed, 28 Feb 2018 05:11:18 UTC (186 KB)
[v3] Fri, 8 Jun 2018 04:20:31 UTC (697 KB)
