Investigating model performance in language identification: beyond simple error statistics

Styles, Suzy J.; Chua, Victoria Y. H.; Woon, Fei Ting; Liu, Hexin; Perera, Leibny Paola Garcia; Khudanpur, Sanjeev; Khong, Andy W. H.; Dauwels, Justin

Electrical Engineering and Systems Science > Audio and Speech Processing

arXiv:2305.18925 (eess)

[Submitted on 30 May 2023]

Title:Investigating model performance in language identification: beyond simple error statistics

Authors:Suzy J. Styles, Victoria Y. H. Chua, Fei Ting Woon, Hexin Liu, Leibny Paola Garcia Perera, Sanjeev Khudanpur, Andy W. H. Khong, Justin Dauwels

View PDF

Abstract:Language development experts need tools that can automatically identify languages from fluent, conversational speech, and provide reliable estimates of usage rates at the level of an individual recording. However, language identification systems are typically evaluated on metrics such as equal error rate and balanced accuracy, applied at the level of an entire speech corpus. These overview metrics do not provide information about model performance at the level of individual speakers, recordings, or units of speech with different linguistic characteristics. Overview statistics may therefore mask systematic errors in model performance for some subsets of the data, and consequently, have worse performance on data derived from some subsets of human speakers, creating a kind of algorithmic bias. In the current paper, we investigate how well a number of language identification systems perform on individual recordings and speech units with different linguistic properties in the MERLIon CCS Challenge. The Challenge dataset features accented English-Mandarin code-switched child-directed speech.

Comments:	Accepted to Interspeech 2023, 5 pages, 5 figures
Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL); Sound (cs.SD)
Cite as:	arXiv:2305.18925 [eess.AS]
	(or arXiv:2305.18925v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2305.18925

Submission history

From: Hexin Liu [view email]
[v1] Tue, 30 May 2023 10:32:53 UTC (5,216 KB)

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Investigating model performance in language identification: beyond simple error statistics

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Audio and Speech Processing

Title:Investigating model performance in language identification: beyond simple error statistics

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators