default search action
Speech Communication, Volume 157
Volume 157, 2024
- Nan Li, Longbiao Wang, Meng Ge, Masashi Unoki, Sheng Li, Jianwu Dang:
Robust voice activity detection using an auditory-inspired masked modulation encoder based convolutional attention network. 103024 - Stefano Bannò, Marco Matassoni:
Back to grammar: Using grammatical error correction to automatically assess L2 speaking proficiency. 103025 - Yunqi C. Zhang, Yusuke Hioka, C. T. Justine Hui, Catherine I. Watson:
Performance of single-channel speech enhancement algorithms on Mandarin listeners with different immersion conditions in New Zealand English. 103026 - Wei-Cheng Lin, Carlos Busso:
Deep temporal clustering features for speech emotion recognition. 103027 - Zhipeng Chen, Xinheng Wang, Lun Xie, Haijie Yuan, Hang Pan:
LPIPS-AttnWav2Lip: Generic audio-driven lip synchronization for talking head generation in the wild. 103028 - Ingy Emara, Nabil H. Shaker:
The impact of non-native English speakers' phonological and prosodic features on automatic speech recognition accuracy. 103038 - Paavo Alku, Manila Kodali, Laura Laaksonen, Sudarsana Reddy Kadiri:
AVID: A speech database for machine learning studies on vocal intensity. 103039 - Yagnavajjula Madhu Keerthana, Mittapalle Kiran Reddy, Paavo Alku, K. Sreenivasa Rao, Pabitra Mitra:
Automatic classification of neurological voice disorders using wavelet scattering features. 103040 - Simon Stone, Peter Birkholz:
Monophthong vocal tract shapes are sufficient for articulatory synthesis of German primary diphthongs. 103041
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.