A Cross-Linguistic Pressure for Uniform Information Density in Word Order

Clark, Thomas Hikaru; Meister, Clara; Pimentel, Tiago; Hahn, Michael; Cotterell, Ryan; Futrell, Richard; Levy, Roger

arXiv:2306.03734 (cs)

[Submitted on 6 Jun 2023 (v1), last revised 9 Jul 2023 (this version, v2)]

Abstract:While natural languages differ widely in both canonical word order and word order flexibility, their word orders still follow shared cross-linguistic statistical patterns, often attributed to functional pressures. In the effort to identify these pressures, prior work has compared real and counterfactual word orders. Yet one functional pressure has been overlooked in such investigations: the uniform information density (UID) hypothesis, which holds that information should be spread evenly throughout an utterance. Here, we ask whether a pressure for UID may have influenced word order patterns cross-linguistically. To this end, we use computational models to test whether real orders lead to greater information uniformity than counterfactual orders. In our empirical study of 10 typologically diverse languages, we find that: (i) among SVO languages, real word orders consistently have greater uniformity than reverse word orders, and (ii) only linguistically implausible counterfactual orders consistently exceed the uniformity of real orders. These findings are compatible with a pressure for information uniformity in the development and usage of natural languages.
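As a rough illustration of the comparison described in the abstract, the sketch below scores a "real" and a reversed toy corpus by the variance of per-token surprisal, where lower variance means more uniform information. Everything here is an assumption made for illustration: the abstract does not specify the uniformity measure or the models used, so surprisal variance, the add-one-smoothed bigram model, the toy sentences, and all function names are hypothetical stand-ins rather than the authors' method.

```python
import math
from collections import Counter

def train_bigram(sentences):
    """Count contexts and bigrams, using "<s>" as a sentence-start symbol."""
    context_counts, bigram_counts, vocab = Counter(), Counter(), set()
    for sent in sentences:
        prev = "<s>"
        for tok in sent:
            context_counts[prev] += 1
            bigram_counts[(prev, tok)] += 1
            vocab.add(tok)
            prev = tok
    return context_counts, bigram_counts, vocab

def surprisals(sent, model):
    """Per-token surprisal in bits under an add-one-smoothed bigram model."""
    context_counts, bigram_counts, vocab = model
    out, prev = [], "<s>"
    for tok in sent:
        p = (bigram_counts[(prev, tok)] + 1) / (context_counts[prev] + len(vocab))
        out.append(-math.log2(p))
        prev = tok
    return out

def surprisal_variance(values):
    """Population variance of surprisal; 0 means perfectly uniform information."""
    mean = sum(values) / len(values)
    return sum((x - mean) ** 2 for x in values) / len(values)

def mean_variance(corpus):
    """Average per-sentence surprisal variance.

    Toy setup: the model is trained and evaluated on the same sentences,
    which a real study would avoid by using held-out data.
    """
    model = train_bigram(corpus)
    return sum(surprisal_variance(surprisals(s, model)) for s in corpus) / len(corpus)

# Hypothetical toy corpus and its word-reversed counterfactual variant.
real = [
    ["the", "dog", "chased", "the", "cat"],
    ["the", "cat", "saw", "a", "bird"],
    ["a", "bird", "chased", "the", "dog"],
]
reverse = [list(reversed(s)) for s in real]

real_var = mean_variance(real)
reverse_var = mean_variance(reverse)
print(f"real: {real_var:.3f}  reverse: {reverse_var:.3f}")
```

On a corpus this small the two numbers carry no evidential weight; the point is only the shape of the comparison, in which each word-order variant gets its own model and its own uniformity score.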

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2306.03734 [cs.CL]
	(or arXiv:2306.03734v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2306.03734

Submission history

From: Thomas Clark
[v1] Tue, 6 Jun 2023 14:52:15 UTC (4,349 KB)
[v2] Sun, 9 Jul 2023 17:17:39 UTC (4,349 KB)
