Identification of primary and collateral tracks in stuttered speech

Riad, Rachid; Bachoud-Lévi, Anne-Catherine; Rudzicz, Frank; Dupoux, Emmanuel

arXiv:2003.01018 (cs)

[Submitted on 2 Mar 2020]


Abstract: Disfluent speech has previously been addressed from two main perspectives: the clinical perspective, which focuses on diagnosis, and the Natural Language Processing (NLP) perspective, which aims to model these events and detect them for downstream tasks. In addition, previous work has often used different metrics depending on whether the input features are text or speech, making it difficult to compare contributions. Here, we introduce a new evaluation framework for disfluency detection inspired by both the clinical and NLP perspectives, together with the theory of performance from Clark (1996), which distinguishes between primary and collateral tracks. We introduce a novel forced-aligned disfluency dataset from a corpus of semi-directed interviews, and present baseline results directly comparing the performance of text-based features (word and span information) and speech-based features (acoustic-prosodic information). Finally, we introduce new audio features inspired by the word-based span features. We show experimentally that these features outperform the baselines for speech-based predictions on the present dataset.

Comments:	To be published in LREC 2020
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2003.01018 [cs.CL]
	(or arXiv:2003.01018v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2003.01018

Submission history

From: Rachid Riad
[v1] Mon, 2 Mar 2020 16:50:33 UTC (1,848 KB)
