The results show that our proposed method can obtain a large gain on children's speech, with relative ~20% WER reduction compared to the baseline, and also no ...
The range of the pitch frequency mainly lies between 70 Hz to 255 Hz for the adult speakers whereas for children's pitch frequency ranges usually from 200 Hz to ...
Prosody Usage Optimization for Children Speech Recognition with Zero Resource Children Speech. Li C., Qian Y. Expand. Publication type: Proceedings Article.
People also ask
What is prosody of speech recognition?
What is prosody modification?
In this paper, we present our efforts towards developing a children's ASR system in Punjabi which a low-resourced language.
Feb 23, 2024 · One promising approach is to align vocal-tract parameters between adults and children through children-specific data augmentation, referred here to as ...
Jan 1, 2022 · In this paper, we have presented our efforts towards developing a noise-robust children's speech recognition system in zero-resource conditions.
Missing: Optimization | Show results with:Optimization
Prosody Usage Optimization for Children Speech Recognition with Zero Resource Children Speech · Chenda LiY. Qian. Computer Science. INTERSPEECH. 2019. TLDR.
It is known that school-aged children with cochlear implants show deficits in voice emotion recognition relative to normally-hearing peers.
Missing: Optimization Zero
Li C, Qian Y (2019) Prosody usage optimization for children speech recognition with zero resource children speech. In Interspeech 3446–3450. https://doi.org ...
Jul 20, 2022 · Li C, Qian Y (2019) Prosody usage optimization for children speech recognition with zero resource children speech. In Interspeech 3446–3450.