Large-scale study of speech acts' development using automatic labelling

M Nikolaus, J Maes, J Auguste, L Prevot… - Proceedings of the …, 2021 - escholarship.org
Proceedings of the Annual Meeting of the Cognitive Science Society, 2021
Studies of children's language use in the wild (e.g., in the context of child-caregiver social interaction) have been slowed by the time- and resource-consuming task of hand-annotating utterances for communicative intents/speech acts. Existing studies have typically focused on rather small samples of children, raising the question of how their findings generalize both to larger and more representative populations and to a richer set of interaction contexts. Here we propose a simple automatic model for speech act labeling in early childhood based on the INCA-A coding scheme (Ninio et al., 1994). After validating the model against ground truth labels, we automatically annotated the entire English-language data from the CHILDES corpus. The major theoretical result was that earlier findings generalize quite well at a large scale. Our model will be shared with the community so that researchers can use it with their data to investigate various questions related to language use in both typical and atypical populations of children.