Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method

Gao, Chenyang; Gu, Yue; Marsic, Ivan

Computer Science > Sound

arXiv:2110.10593 (cs)

[Submitted on 20 Oct 2021 (v1), last revised 21 Mar 2022 (this version, v2)]

Title:Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method

Authors:Chenyang Gao, Yue Gu, Ivan Marsic

View PDF

Abstract:Speech separation has been studied in time domain because of lower latency and higher performance compared to time-frequency domain. The masking-based method has been mostly used in time domain, and the other common method (mapping-based) has been inadequately studied. We investigate the use of the mapping-based method in the time domain and show that it can perform better on a large training set than the masking-based method. We also investigate the frequent label-switching problem in permutation invariant training (PIT), which results in suboptimal training because the labels selected by PIT differ across training epochs. Our experiment results showed that PIT works well in a shallow separation model, and the label switching occurs for a deeper model. We inferred that layer decoupling may be the reason for the frequent label switching. Therefore, we propose a training strategy based on progressive learning. This approach significantly reduced inconsistent label assignment without added computational complexity or training corpus. By combining this training strategy with the mapping-based method, we significantly improved the separation performance compared to the baseline.

Comments:	Submitted to Interspeech 2022
Subjects:	Sound (cs.SD); Machine Learning (cs.LG); Audio and Speech Processing (eess.AS)
Cite as:	arXiv:2110.10593 [cs.SD]
	(or arXiv:2110.10593v2 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2110.10593

Submission history

From: Chenyang Gao [view email]
[v1] Wed, 20 Oct 2021 14:42:50 UTC (317 KB)
[v2] Mon, 21 Mar 2022 14:55:17 UTC (776 KB)

Computer Science > Sound

Title:Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:Progressive Learning for Stabilizing Label Selection in Speech Separation with Mapping-based Method

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators