19th Interspeech 2018: Hyderabad, India
- B. Yegnanarayana:
19th Annual Conference of the International Speech Communication Association, Interspeech 2018, Hyderabad, India, September 2-6, 2018. ISCA 2018
ISCA Medal Talk
- Bishnu S. Atal:
From Vocoders to Code-Excited Linear Prediction: Learning How We Hear What We Hear. 1
End-to-End Speech Recognition
- Shigeki Karita, Shinji Watanabe, Tomoharu Iwata, Atsunori Ogawa, Marc Delcroix:
Semi-Supervised End-to-End Speech Recognition. 2-6
- Albert Zeyer, Kazuki Irie, Ralf Schlüter, Hermann Ney:
Improved Training of End-to-end Attention Models for Speech Recognition. 7-11
- Hossein Hadian, Hossein Sameti, Daniel Povey, Sanjeev Khudanpur:
End-to-end Speech Recognition Using Lattice-free MMI. 12-16
- Stefan Braun, Daniel Neil, Jithendar Anumula, Enea Ceolini, Shih-Chii Liu:
Multi-channel Attention for End-to-End Speech Recognition. 17-21
- Titouan Parcollet, Ying Zhang, Mohamed Morchid, Chiheb Trabelsi, Georges Linarès, Renato de Mori, Yoshua Bengio:
Quaternion Convolutional Neural Networks for End-to-End Automatic Speech Recognition. 22-26
- Ruoming Pang, Tara N. Sainath, Rohit Prabhavalkar, Suyog Gupta, Yonghui Wu, Shuyuan Zhang, Chung-Cheng Chiu:
Compression of End-to-End Models. 27-31
Prosody Modeling and Generation
- Zack Hodari, Oliver Watts, Srikanth Ronanki, Simon King:
Learning Interpretable Control Dimensions for Speech Synthesis by Using External Data. 32-36
- Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa:
Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects. 37-41
- Guan-Ting Liou, Chen-Yu Chiang, Yih-Ru Wang, Sin-Horng Chen:
An Exploration of Local Speaking Rate Variations in Mandarin Read Speech. 42-46
- Yibin Zheng, Jianhua Tao, Zhengqi Wen, Ya Li:
BLSTM-CRF Based End-to-End Prosodic Boundary Prediction with Context Sensitive Embeddings in a Text-to-Speech Front-End. 47-51
- Berrak Sisman, Haizhou Li:
Wavelet Analysis of Speaker Dependent and Independent Prosody for Voice Conversion. 52-56
- Rui Liu, Feilong Bao, Guanglai Gao, Hui Zhang, Yonghe Wang:
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model. 57-61
Speaker Verification I
- Lanhua You, Wu Guo, Yan Song, Sheng Zhang:
Improved Supervised Locality Preserving Projection for I-vector Based Speaker Verification. 62-66
- Ziqiang Shi, Huibin Lin, Liu Liu, Rujie Liu:
Double Joint Bayesian Modeling of DNN Local I-Vector for Text Dependent Speaker Verification with Random Digit Strings. 67-71
- Anna Silnova, Niko Brümmer, Daniel Garcia-Romero, David Snyder, Lukás Burget:
Fast Variational Bayes for Heavy-tailed PLDA Applied to i-vectors and x-vectors. 72-76
- Massimiliano Todisco, Héctor Delgado, Kong-Aik Lee, Md. Sahidullah, Nicholas W. D. Evans, Tomi Kinnunen, Junichi Yamagishi:
Integrated Presentation Attack Detection and Automatic Speaker Verification: Common Features and Gaussian Back-end Fusion. 77-81
- Luciana Ferrer, Mitchell McLaren:
A Generalization of PLDA for Joint Modeling of Speaker Identity and Multiple Nuisance Conditions. 82-86
- Nanxin Chen, Jesús Villalba, Najim Dehak:
An Investigation of Non-linear i-vectors for Speaker Verification. 87-91
Spoken Term Detection
- Dhananjay Ram, Lesly Miculicich, Hervé Bourlard:
CNN Based Query by Example Spoken Term Detection. 92-96
- Yougen Yuan, Cheung-Chi Leung, Lei Xie, Hongjie Chen, Bin Ma, Haizhou Li:
Learning Acoustic Word Embeddings with Temporal Context for Query-by-Example Speech Search. 97-101
- Ziwei Zhu, Zhiyong Wu, Runnan Li, Helen Meng, Lianhong Cai:
Siamese Recurrent Auto-Encoder Representation for Query-by-Example Spoken Term Detection. 102-106
- Wei Li, Brian Mak:
Fast Derivation of Cross-lingual Document Vectors from Self-attentive Neural Machine Translation Model. 107-111
- Laxmi Pandey, Karan Nathwani:
LSTM Based Attentive Fusion of Spectral and Prosodic Information for Keyword Spotting in Hindi Language. 112-116
- Ravi Shankar, Vikram C. M., S. R. Mahadeva Prasanna:
Spoken Keyword Detection Using Joint DTW-CNN. 117-121
The INTERSPEECH 2018 Computational Paralinguistics ChallengE (ComParE): Atypical & Self-Assessed Affect, Crying & Heart Beats 1
- Björn W. Schuller, Stefan Steidl, Anton Batliner, Peter B. Marschik, Harald Baumeister, Fengquan Dong, Simone Hantke, Florian B. Pokorny, Eva-Maria Rathner, Katrin D. Bartl-Pokorny, Christa Einspieler, Dajie Zhang, Alice Baird, Shahin Amiriparian, Kun Qian, Zhao Ren, Maximilian Schmitt, Panagiotis Tzirakis, Stefanos Zafeiriou:
The INTERSPEECH 2018 Computational Paralinguistics Challenge: Atypical & Self-Assessed Affect, Crying & Heart Beats. 122-126
- Ahmed Imtiaz Humayun, Md. Tauhiduzzaman Khan, Shabnam Ghaffarzadegan, Zhe Feng, Taufiq Hasan:
An Ensemble of Transfer, Semi-supervised and Supervised Learning Methods for Pathological Heart Sound Classification. 127-131
- Mehmet Ali Tugtekin Turan, Engin Erzin:
Monitoring Infant's Emotional Cry in Domestic Environments Using the Capsule Network Architecture. 132-136
- Mark A. Huckvale:
Neural Network Architecture That Combines Temporal and Summative Features for Infant Cry Classification in the Interspeech 2018 Computational Paralinguistics Challenge. 137-141
- Zixing Zhang, Jing Han, Kun Qian, Björn W. Schuller:
Evolving Learning for Analysing Mood-Related Infant Vocalisation. 142-146
- Johannes Wagner, Dominik Schiller, Andreas Seiderer, Elisabeth André:
Deep Learning in Paralinguistic Recognition Tasks: Are Hand-crafted Features Still Relevant? 147-151
- Danqing Luo, Yuexian Zou, Dongyan Huang:
Investigation on Joint Representation Learning for Robust Feature Extraction in Speech Emotion Recognition. 152-156
- Soo Jin Park, Amber Afshan, Zhi Ming Chua, Abeer Alwan:
Using Voice Quality Supervectors for Affect Identification. 157-161
- Dengke Tang, Junlin Zeng, Ming Li:
An End-to-End Deep Learning Framework for Speech Emotion Recognition of Atypical Individuals. 162-166
Show and Tell 1
- Alexander Koller, Timo Baumann, Arne Köhn:
DialogOS: Simple and Extensible Dialogue Modeling. 167-168
- Franck Dernoncourt, Trung Bui, Walter Chang:
A Framework for Speech Recognition Benchmarking. 169-170
- Takayuki Arai:
Flexible Tongue Housed in a Static Model of the Vocal Tract With Jaws, Lips and Teeth. 171-172
- Lani Mathew, K. Gopakumar:
Voice Analysis Using Acoustic and Throat Microphones for Speech Therapy. 173-174
- Manny Rayner, Nikos Tsourakis, Jan Stanek:
A Robust Context-Dependent Speech-to-Speech Phraselator Toolkit for Alexa. 175-176
Speech Segments and Voice Quality
- RaviShankar Prasad, Sudarsana Reddy Kadiri, Suryakanth V. Gangashetty, Bayya Yegnanarayana:
Discriminating Nasals and Approximants in English Language Using Zero Time Windowing. 177-181
- Phil Howson, Alexei Kochetov:
Gestural Lenition of Rhotics Captures Variation in Brazilian Portuguese. 182-186
- RaviShankar Prasad, Bayya Yegnanarayana:
Identification and Classification of Fricatives in Speech Using Zero Time Windowing Method. 187-191
- Nattanun Chanchaochai, Christopher Cieri, Japhet Debrah, Hongwei Ding, Yue Jiang, Sishi Liao, Mark Liberman, Jonathan Wright, Jiahong Yuan, Juhong Zhan, Yuqing Zhan:
GlobalTIMIT: Acoustic-Phonetic Datasets for the World's Languages. 192-196
- Anne Hermes, Doris Mücke, Bastian Auris, Rachid Ridouane:
Structural Effects on Properties of Consonantal Gestures in Tashlhiyt. 197-201
- Alexei Kochetov, Matthew Faytak, Kiranpreet Nara:
The Retroflex-dental Contrast in Punjabi Stops and Nasals: A Principal Component Analysis of Ultrasound Images. 202-206
- Yang Yue, Fang Hu:
Vowels and Diphthongs in Hangzhou Wu Chinese Dialect. 207-211
- Mahesh M, Jeena J. Prakash, Hema A. Murthy:
Resyllabification in Indian Languages and Its Implications in Text-to-speech Systems. 212-216
- Andy Murphy, Irena Yanushevskaya, Ailbhe Ní Chasaide, Christer Gobl:
Voice Source Contribution to Prominence Perception: Rd Implementation. 217-221
- Christer Gobl, Andy Murphy, Irena Yanushevskaya, Ailbhe Ní Chasaide:
On the Relationship between Glottal Pulse Shape and Its Spectrum: Correlations of Open Quotient, Pulse Skew and Peak Flow with Source Harmonic Amplitudes. 222-226
- Vincent Hughes, Philip Harrison, Paul Foulkes, Peter French, Colleen Kavanagh, Eugenia San Segundo Fernández:
The Individual and the System: Assessing the Stability of the Output of a Semi-automatic Forensic Voice Comparison System. 227-231
- Sudarsana Reddy Kadiri, Bayya Yegnanarayana:
Breathy to Tense Voice Discrimination using Zero-Time Windowing Cepstral Coefficients (ZTWCCs). 232-236
- Pamir Gogoi, Sishir Kalita, Parismita Gogoi, Ratree Wayland, Priyankoo Sarmah, S. R. Mahadeva Prasanna:
Analysis of Breathiness in Contextual Vowel of Voiceless Nasals in Mizo. 237-241
Speaker State and Trait
- Yijia Xu, Mark Hasegawa-Johnson, Nancy McElwain:
Infant Emotional Outbursts Detection in Infant-parent Spoken Interactions. 242-246
- Jaejin Cho, Raghavendra Pappagari, Purva Kulkarni, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Deep Neural Networks for Emotion Recognition Combining Audio and Transcripts. 247-251
- Srinivas Parthasarathy, Carlos Busso:
Preference-Learning with Qualitative Agreement for Sentence Level Emotional Annotations. 252-256
- Siddique Latif, Rajib Rana, Shahzad Younis, Junaid Qadir, Julien Epps:
Transfer Learning for Improving Speech Emotion Classification Accuracy. 257-261
- Patrick Meyer, Eric Buschermöhle, Tim Fingscheidt:
What Do Classifiers Actually Learn? a Case Study on Emotion Recognition Datasets. 262-266
- Eva-Maria Rathner, Yannik Terhorst, Nicholas Cummins, Björn W. Schuller, Harald Baumeister:
State of Mind: Classification through Self-reported Affect and Word Use in Speech. 267-271
- Ziping Zhao, Yu Zheng, Zixing Zhang, Haishuai Wang, Yiqin Zhao, Chao Li:
Exploring Spatio-Temporal Representations by Integrating Attention-based Bidirectional-LSTM-RNNs and FCNs for Speech Emotion Recognition. 272-276
- Pegah Ghahremani, Phani Sankar Nidadavolu, Nanxin Chen, Jesús Villalba, Daniel Povey, Sanjeev Khudanpur, Najim Dehak:
End-to-end Deep Neural Network Age Estimation. 277-281
- Rajat Hebbar, Krishna Somandepalli, Shrikanth S. Narayanan:
Improving Gender Identification in Movie Audio Using Cross-Domain Data. 282-286
- Selen Hande Kabil, Hannah Muckenhirn, Mathew Magimai-Doss:
On Learning to Identify Genders from Raw Speech Signal Using CNNs. 287-291
- Jilt Sebastian, Manoj Kumar, Pavan Kumar D. S., Mathew Magimai-Doss, Hema A. Murthy, Shrikanth S. Narayanan:
Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech. 292-296
- James R. Williamson, Thomas F. Quatieri, Adam C. Lammert, Katherine Mitchell, Katherine Finkelstein, Nicole Ekon, Caitlin Dillon, Robert Kenefick, Kristin Heaton:
The Effect of Exposure to High Altitude and Heat on Speech Articulatory Coordination. 297-301
Deep Learning for Source Separation and Pitch Tracking
- Lianwu Chen, Meng Yu, Yanmin Qian, Dan Su, Dong Yu:
Permutation Invariant Training of Generative Adversarial Network for Monaural Speech Separation. 302-306
- Jun Wang, Jie Chen, Dan Su, Lianwu Chen, Meng Yu, Yanmin Qian, Dong Yu:
Deep Extractor Network for Target Speaker Recovery from Single Channel Speech Mixtures. 307-311
- Weipeng He, Petr Motlícek, Jean-Marc Odobez:
Joint Localization and Classification of Multiple Sound Sources Using a Multi-task Neural Network. 312-316
- Shuai Yang, Zhiyong Wu, Binbin Shen, Helen Meng:
Detection of Glottal Closure Instants from Speech Signals: A Convolutional Neural Network Based Method. 317-321
- Zhong-Qiu Wang, Xueliang Zhang, DeLiang Wang:
Robust TDOA Estimation Based on Time-Frequency Masking and Deep Neural Networks. 322-326
- Akihiro Kato, Tomi Kinnunen:
Waveform to Single Sinusoid Regression to Estimate the F0 Contour from Noisy Speech Using Recurrent Deep Neural Networks. 327-331
- Paul Magron, Konstantinos Drossos, Stylianos Ioannis Mimilakis, Tuomas Virtanen:
Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation. 332-336
- Kanru Hua:
Nebula: F0 Estimation and Voicing Detection by Modeling the Statistical Properties of Feature Extractors. 337-341
- Yi Luo, Nima Mesgarani:
Real-time Single-channel Dereverberation and Separation with Time-domain Audio Separation Network. 342-346
- Rajath Kumar, Yi Luo, Nima Mesgarani:
Music Source Activity Detection and Separation Using Deep Attractor Network. 347-351
- Longfei Yang, Yanlu Xie, Jinsong Zhang:
Improving Mandarin Tone Recognition Using Convolutional Bidirectional Long Short-Term Memory with Attention. 352-356
Acoustic Analysis-Synthesis of Speech Disorders
- Rob van Son, Catherine Middag, Kris Demuynck:
Vowel Space as a Tool to Evaluate Articulation Problems. 357-361
- Véronique Delvaux, Kathy Huet, Myriam Piccaluga, Sophie van Malderen, Bernard Harmegnies:
Towards a Better Characterization of Parkinsonian Speech: A Multidimensional Acoustic Study. 362-366
- Sishir Kalita, S. R. Mahadeva Prasanna, Samarendra Dandapat:
Self-similarity Matrix Based Intelligibility Assessment of Cleft Lip and Palate Speech. 367-371
- Akhilesh Kumar Dubey, S. R. Mahadeva Prasanna, Samarendra Dandapat:
Pitch-Adaptive Front-end Feature for Hypernasality Detection. 372-376
- Raquel Norel, Mary Pietrowicz, Carla Agurto, Shay Rishoni, Guillermo A. Cecchi:
Detection of Amyotrophic Lateral Sclerosis (ALS) via Acoustic Analysis. 377-381
- Vikram C. M., S. R. Mahadeva Prasanna, Ajish K. Abraham, Pushpavathi M, Girish K. S:
Detection of Glottal Activity Errors in Production of Stop Consonants in Children with Cleft Lip and Palate. 382-386
ASR Systems and Technologies
- Anuroop Sriram, Heewoo Jun, Sanjeev Satheesh, Adam Coates:
Cold Fusion: Training Seq2Seq Models Together with Language Models. 387-391
- Kazuki Irie, Zhihong Lei, Liuhui Deng, Ralf Schlüter, Hermann Ney:
Investigation on Estimation of Sentence Probability by Combining Forward, Backward and Bi-directional LSTM-RNNs. 392-395
- Thomas Zenkel, Ramon Sanabria, Florian Metze, Alex Waibel:
Subword and Crossword Units for CTC Acoustic Models. 396-400
- Tomohiro Tanaka, Ryo Masumura, Hirokazu Masataki, Yushi Aono:
Neural Error Corrective Language Models for Automatic Speech Recognition. 401-405
- Mohammad Sadegh Rasooli, Sarangarajan Parthasarathy:
Entity-Aware Language Model as an Unsupervised Reranker. 406-410
- Iksoo Choi, Jinhwan Park, Wonyong Sung:
Character-level Language Modeling with Gated Hierarchical Recurrent Neural Networks. 411-415
Deception, Personality, and Culture Attribute
- Sarah Ita Levitan, Angel Maredia, Julia Hirschberg:
Acoustic-Prosodic Indicators of Deception and Trust in Interview Dialogues. 416-420
- Guozhen An, Sarah Ita Levitan, Julia Hirschberg, Rivka Levitan:
Deep Personality Recognition for Deception Detection. 421-425
- Hansjörg Mixdorff, Albert Rilliard, Tan Lee, Matthew K. H. Ma, Angelika Hönemann:
Cross-cultural (A)symmetries in Audio-visual Attitude Perception. 426-430
- Fasih Haider, Fahim A. Salim, Owen Conlan, Saturnino Luz:
An Active Feature Transformation Method for Attitude Recognition of Video Bloggers. 431-435
- Fu-Sheng Tsai, Hao-Chun Yang, Wei-Wen Chang, Chi-Chun Lee:
Automatic Assessment of Individual Culture Attribute of Power Distance Using a Social Context-Enhanced Prosodic Network Representation. 436-440
- Sudarsana Reddy Kadiri, Bayya Yegnanarayana:
Analysis and Detection of Phonation Modes in Singing Voice using Excitation Source Features and Single Frequency Filtering Cepstral Coefficients (SFFCC). 441-445
Automatic Detection and Recognition of Voice and Speech Disorders
- Huiyi Wu, John J. Soraghan, Anja Lowit, Gaetano Di Caterina:
A Deep Learning Method for Pathological Voice Detection Using Convolutional Deep Belief Networks. 446-450
- Chitralekha Bhat, Biswajit Das, Bhavik Vachhani, Sunil Kumar Kopparapu:
Dysarthric Speech Recognition Using Time-delay Neural Network Based Denoising Autoencoder. 451-455
- Juan Camilo Vásquez-Correa, Tomás Arias-Vergara, Juan Rafael Orozco-Arroyave, Elmar Nöth:
A Multitask Learning Approach to Assess the Dysarthria Severity in Patients with Parkinson's Disease. 456-460
- Jason Lilley, Erin L. Crowgey, H. Timothy Bunnell:
The Use of Machine Learning and Phonetic Endophenotypes to Discover Genetic Variants Associated with Speech Sound Disorder. 461-465
- Meredith Moore, Hemanth Venkateswara, Sethuraman Panchanathan:
Whistle-blowing ASRs: Evaluating the Need for More Inclusive Speech Recognition Systems. 466-470
- Bhavik Vachhani, Chitralekha Bhat, Sunil Kumar Kopparapu:
Data Augmentation Using Healthy Speech for Dysarthric Speech Recognition. 471-475
Voice Conversion
- Shaojin Ding, Guanlong Zhao, Christopher Liberatore, Ricardo Gutierrez-Osuna:
Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function. 476-480
- Shaojin Ding, Christopher Liberatore, Ricardo Gutierrez-Osuna:
Learning Structured Dictionaries for Exemplar-based Voice Conversion. 481-485
- Yu-Huai Peng, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, Hsin-Min Wang:
Exemplar-Based Spectral Detail Compensation for Voice Conversion. 486-490
- G. Nisha Meenakshi, Prasanta Kumar Ghosh:
Whispered Speech to Neutral Speech Conversion Using Bidirectional LSTMs. 491-495
- Songxiang Liu, Jinghua Zhong, Lifa Sun, Xixin Wu, Xunying Liu, Helen Meng:
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance. 496-500
- Ju-Chieh Chou, Cheng-chieh Yeh, Hung-yi Lee, Lin-Shan Lee:
Multi-target Voice Conversion without Parallel Data by Adversarially Learning Disentangled Audio Representations. 501-505
The INTERSPEECH 2018 Computational Paralinguistics ChallengE (ComParE): Atypical & Self-Assessed Affect, Crying & Heart Beats 2
- Cristina Gorrostieta, Richard Brutti, Kye Taylor, Avi Shapiro, Joseph Moran, Ali Azarbayejani, John Kane:
Attention-based Sequence Classification for Affect Detection. 506-510
- Zafi Sherhan Syed, Julien Schroeter, Kirill A. Sidorov, A. David Marshall:
Computational Paralinguistics: Automatic Assessment of Emotions, Mood and Behavioural State from Acoustics of Speech. 511-515
- Sai Krishna Rallabandi, Bhavya Karki, Carla Viegas, Eric Nyberg, Alan W. Black:
Investigating Utterance Level Representations for Detecting Intent from Acoustics. 516-520
- Heysem Kaya, Dmitrii Fedotov, Ali Yesilkanat, Oxana Verkholyak, Yang Zhang, Alexey Karpov:
LSTM Based Cross-corpus and Cross-task Acoustic Emotion Recognition. 521-525
- Bogdan Vlasenko, Jilt Sebastian, Pavan Kumar D. S., Mathew Magimai-Doss:
Implementing Fusion Techniques for the Classification of Paralinguistic Information. 526-530
- Gábor Gosztolya, Tamás Grósz, László Tóth:
General Utterance-Level Feature Extraction for Classifying Crying Sounds, Atypical & Self-Assessed Affect and Heart Beats. 531-535
- Bo-Hao Su, Sung-Lin Yeh, Ming-Ya Ko, Huan-Yu Chen, Shun-Chang Zhong, Jeng-Lin Li, Chi-Chun Lee:
Self-Assessed Affect Recognition Using Fusion of Attentional BLSTM and Static Acoustic Features. 536-540
- Claude Montacié, Marie-José Caraty:
Vocalic, Lexical and Prosodic Cues for the INTERSPEECH 2018 Self-Assessed Affect Challenge. 541-545
Show and Tell 2
- Anand P. A, Chiranjeevi Yarra, N. K. Kausthubha, Prasanta Kumar Ghosh:
Intonation tutor by SPIRE (In-SPIRE): An Online Tool for an Automatic Feedback to the Second Language Learners in Learning Intonation. 546-547
- Keelan Evanini, Veronika Timpe-Laughlin, Eugene Tsuprun, Ian Blood, Jeremy Lee, James V. Bruno, Vikram Ramanarayanan, Patrick L. Lange, David Suendermann-Oeft:
Game-based Spoken Dialog Language Learning Applications for Young Students. 548-549
- Alexander Sorin, Slava Shechtman, Zvi Kons, Ron Hoory, Shay Ben-David, Joe Pavitt, Shai Rozenberg, Carmel Rabinovitz, Tal Drory:
The IBM Virtual Voice Creator. 550-551
- Gayathri G, N. Mohana, Radhika Pal, Hema A. Murthy:
Mobile Application for Learning Languages for the Unlettered. 552-553
- Haihua Xu, Van Tung Pham, Zin Tun Kyaw, Zhi Hao Lim, Eng Siong Chng, Haizhou Li:
Mandarin-English Code-switching Speech Recognition. 554-555
Spoken Dialogue Systems and Conversational Analysis
- Joo-Kyung Kim, Young-Bum Kim:
Joint Learning of Domain Classification and Out-of-Domain Detection with Dynamic Class Weighting for Satisficing False Acceptance Rates. 556-560
- Sankar Mukherjee, Thierry Legou, Leonardo Lancia, Pauline M. Hilt, Alice Tomassini, Luciano Fadiga, Alessandro D'Ausilio, Leonardo Badino, Noël Nguyen:
Analyzing Vocal Tract Movements During Speech Accommodation. 561-565
- Yujiang Li, Xuemin Zhao, Weiqun Xu, Yonghong Yan:
Cross-Lingual Multi-Task Neural Architecture for Spoken Language Understanding. 566-570
- Grant P. Strimel, Kanthashree Mysore Sathyendra, Stanislav Peshterliev:
Statistical Model Compression for Small-Footprint Natural Language Understanding. 571-575
- Norbert Braunschweiler, Alexandros Papangelis:
Comparison of an End-to-end Trainable Dialogue System with a Modular Statistical Dialogue System. 576-580
- Megan M. Willi, Stephanie A. Borrie, Tyson S. Barrett, Ming Tu, Visar Berisha:
A Discriminative Acoustic-Prosodic Approach for Measuring Local Entrainment. 581-585
- Matthew Roddy, Gabriel Skantze, Naomi Harte:
Investigating Speech Features for Continuous Turn-Taking Prediction Using LSTMs. 586-590
- Ivan Kraljevski, Diane Hirschfeld:
Classification of Correction Turns in Multilingual Dialogue Corpus. 591-595
- Chetan Naik, Arpit Gupta, Hancheng Ge, Lambert Mathias, Ruhi Sarikaya:
Contextual Slot Carryover for Disparate Schemas. 596-600
- Vincent Renkens, Hugo Van hamme:
Capsule Networks for Low Resource Spoken Language Understanding. 601-605
- Padmasundari, Srinivas Bangalore:
Intent Discovery Through Unsupervised Semantic Text Clustering. 606-610
- Yulun Du, Alan W. Black, Louis-Philippe Morency, Maxine Eskénazi:
Multimodal Polynomial Fusion for Detecting Driver Distraction. 611-615
- Koji Inoue, Divesh Lala, Katsuya Takanashi, Tatsuya Kawahara:
Engagement Recognition in Spoken Dialogue via Neural Network by Aggregating Different Annotators' Models. 616-620
- Tuarik Buanzur, Margaret Zellers, Saudah Namyalo, Alena Witzlack-Makarevich:
A First Investigation of the Timing of Turn-taking in Ruuli. 621-625
Spoofing Detection
- Yuanjun Zhao, Roberto Togneri, Victor Sreeram:
Spoofing Detection Using Adaptive Weighting Framework and Clustering Analysis. 626-630
- Sarfaraz Jelil, Sishir Kalita, S. R. Mahadeva Prasanna, Rohit Sinha:
Exploration of Compressed ILPR Features for Replay Attack Detection. 631-635
- Tharshini Gunendradasan, Buddhi Wickramasinghe, Phu Ngoc Le, Eliathamby Ambikairajah, Julien Epps:
Detection of Replay-Spoofing Attacks Using Frequency Modulation Features. 636-640
- Madhu R. Kamble, Hemlata Tak, Hemant A. Patil:
Effectiveness of Speech Demodulation-Based Features for Replay Detection. 641-645
- Madhu R. Kamble, Hemant A. Patil:
Novel Variable Length Energy Separation Algorithm Using Instantaneous Amplitude Features for Replay Detection. 646-650
- Ji-Chen Yang, Changhuai You, Qianhua He:
Feature with Complementarity of Statistics and Principal Information for Spoofing Detection. 651-655
- Dongbo Li, Longbiao Wang, Jianwu Dang, Meng Liu, Zeyan Oo, Seiichi Nakagawa, Haotian Guan, Xiangang Li:
Multiple Phase Information Combination for Replay Attacks Detection. 656-660
- Buddhi Wickramasinghe, Saad Irtza, Eliathamby Ambikairajah, Julien Epps:
Frequency Domain Linear Prediction Features for Replay Spoofing Attack Detection. 661-665
- Hardik B. Sailor, Madhu R. Kamble, Hemant A. Patil:
Auditory Filterbank Learning for Temporal Modulation Features in Replay Spoof Speech Detection. 666-670
- Kaavya Sriskandaraja, Vidhyasaharan Sethu, Eliathamby Ambikairajah:
Deep Siamese Architecture Based Replay Detection for Secure Voice Biometric. 671-675
- Alejandro Gómez Alanís, Antonio M. Peinado, José A. González, Ángel M. Gómez:
A Deep Identity Representation for Noise Robust Spoofing Detection. 676-680
- Francis Tom, Mohit Jain, Prasenjit Dey:
End-To-End Audio Replay Attack Detection Using Deep Convolutional Networks with Attention. 681-685
- M. S. Saranya, Hema A. Murthy:
Decision-level Feature Switching as a Paradigm for Replay Attack Detection. 686-690
- Gajan Suthokumar, Vidhyasaharan Sethu, Chamith Wijenayake, Eliathamby Ambikairajah:
Modulation Dynamic Features for the Detection of Replay Attacks. 691-695
Speech Analysis and Representation
- Erfan Loweimi, Jon Barker, Thomas Hain:
On the Usefulness of the Speech Phase Spectrum for Pitch Extraction. 696-700
- Manu Airaksinen, Lauri Juvela, Okko Räsänen, Paavo Alku:
Time-regularized Linear Prediction for Noise-robust Extraction of the Spectral Envelope of Speech. 701-705
- Hardik B. Sailor, Hemant A. Patil:
Auditory Filterbank Learning Using ConvRBM for Infant Cry Classification. 706-710
- Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of Dynamic Features in INCA and Temporal Context-INCA. 711-715
- Rong Gong, Xavier Serra:
Singing Voice Phoneme Segmentation by Hierarchically Inferring Syllable and Phoneme Onset Positions. 716-720
- Prasad Tapkir, Hemant A. Patil:
Novel Empirical Mode Decomposition Cepstral Features for Replay Spoof Detection. 721-725
- Hemlata Tak, Hemant A. Patil:
Novel Linear Frequency Residual Cepstral Features for Replay Attack Detection. 726-730
- Kumud Tripathi, K. Sreenivasa Rao:
Analysis of sparse representation based feature on speech mode classification. 731-735
- Jitendra Kumar Dhiman, Neeraj Sharma, Chandra Sekhar Seelamantula:
Multicomponent 2-D AM-FM Modeling of Speech Spectrograms. 736-740
- Abhilash Sainathan, Sunil Rudresh, Chandra Sekhar Seelamantula:
An Optimization Framework for Recovery of Speech from Phase-Encoded Spectrograms. 741-745
- Wei Xia, John H. L. Hansen:
Speaker Recognition with Nonlinear Distortion: Clipping Analysis and Impact. 746-750
- Madhusudan Singh, Debadatta Pati:
Linear Prediction Residual based Short-term Cepstral Features for Replay Attacks Detection. 751-755
- Surbhi Sakshi, Avinash Kumar, Gayadhar Pradhan:
Analysis of Variational Mode Functions for Robust Detection of Vowels. 756-760
Sequence Models for ASR
- Chao Weng, Jia Cui, Guangsen Wang, Jun Wang, Chengzhu Yu, Dan Su, Dong Yu:
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition. 761-765
- Eugen Beck, Mirko Hannemann, Patrick Dötsch, Ralf Schlüter, Hermann Ney:
Segmental Encoder-Decoder Models for Large Vocabulary Automatic Speech Recognition. 766-770
- Shiliang Zhang, Ming Lei:
Acoustic Modeling with DFSMN-CTC and Joint CTC-CE Learning. 771-775
- Jaesung Bae, Dae-Shik Kim:
End-to-End Speech Command Recognition with Capsule Network. 776-780
- Neil Zeghidour, Nicolas Usunier, Gabriel Synnaeve, Ronan Collobert, Emmanuel Dupoux:
End-to-End Speech Recognition from the Raw Waveform. 781-785
- Chengzhu Yu, Chunlei Zhang, Chao Weng, Jia Cui, Dong Yu:
A Multistage Training Framework for Acoustic-to-Word Model. 786-790
- Shiyu Zhou, Linhao Dong, Shuang Xu, Bo Xu:
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese. 791-795
- Kyu Jeong Han, Akshay Chandrashekaran, Jungsuk Kim, Ian R. Lane:
Densely Connected Networks for Conversational Speech Recognition. 796-800
- Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Kazuya Takeda:
Multi-Head Decoder for End-to-End Speech Recognition. 801-805
- Takuma Mori, Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Compressing End-to-end ASR Networks by Tensor-Train Decomposition. 806-810
- Yu-An Chung, James R. Glass:
Speech2Vec: A Sequence-to-Sequence Framework for Learning Word Embeddings from Speech. 811-815
- Linhao Dong, Shiyu Zhou, Wei Chen, Bo Xu:
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin. 816-820
Source Separation and Spatial Analysis
- Disong Wang, Yuexian Zou:
Joint Noise and Reverberation Adaptive Learning for Robust Speaker DOA Estimation with an Acoustic Vector Sensor. 821-825
- Hong Liu, Haipeng Lan, Bing Yang, Cheng Pang:
Multiple Concurrent Sound Source Tracking Based on Observation-Guided Adaptive Particle Filter. 826-830
- Gurunath Reddy M., K. Sreenivasa Rao, Partha Pratim Das:
Harmonic-Percussive Source Separation of Polyphonic Music by Suppressing Impulsive Noise Events. 831-835
- Enea Ceolini, Jithendar Anumula, Adrian E. G. Huber, Ilya Kiselev, Shih-Chii Liu:
Speaker Activity Detection and Minimum Variance Beamforming for Source Separation. 836-840
- Xiaoke Qi, Jianhua Tao:
Sparsity-Constrained Weight Mapping for Head-Related Transfer Functions Individualization from Anthropometric Features. 841-845
- Dheeraj Sai D. V. L. N, Kishor K. S, Sri Rama Murty Kodukula:
Speech Source Separation Using ICA in Constant Q Transform Domain. 846-850
- Lu Yin, Ziteng Wang, Risheng Xia, Junfeng Li, Yonghong Yan:
Multi-talker Speech Separation Based on Permutation Invariant Training and Beamforming. 851-855
- Paul Magron, Tuomas Virtanen:
Expectation-Maximization Algorithms for Itakura-Saito Nonnegative Matrix Factorization. 856-860
- Girija Ramesan Karthik, Parth Suresh, Prasanta Kumar Ghosh:
Subband Weighting for Binaural Speech Source Localization. 861-865
Plenary Talk-1
- Jacqueline Vaissière:
Universal Tendencies for Cross-Linguistic Prosodic Tendencies: A Review and Some New Proposals. 866
Acoustic Model Adaptation
- Ondrej Klejch, Joachim Fainberg, Peter Bell:
Learning to Adapt: A Meta-learning Approach for Speaker Adaptation. 867-871
- Yu Wang, Chao Zhang, Mark J. F. Gales, Philip C. Woodland:
Speaker Adaptation and Adaptive Training for Jointly Optimised Tandem Systems. 872-876
- Markus Kitza, Ralf Schlüter, Hermann Ney:
Comparison of BLSTM-Layer-Specific Affine Transformations for Speaker Adaptation. 877-881
- Rini A. Sharon, Sandeep Reddy Kothinti, Srinivasan Umesh:
Correlational Networks for Speaker Normalization in Automatic Speech Recognition. 882-886
- Andros Tjandra, Sakriani Sakti, Satoshi Nakamura:
Machine Speech Chain with One-shot Speaker Adaptation. 887-891
- Khe Chai Sim, Arun Narayanan, Ananya Misra, Anshuman Tripathi, Golan Pundak, Tara N. Sainath, Parisa Haghani, Bo Li, Michiel Bacchiani:
Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition. 892-896
Statistical Parametric Speech Synthesis
- Moquan Wan, Gilles Degottex, Mark J. F. Gales:
Waveform-Based Speaker Representations for Speech Synthesis. 897-901
- Tomoya Yanagita, Sakriani Sakti, Satoshi Nakamura:
Incremental TTS for Japanese Language. 902-906
- Ruibo Fu, Jianhua Tao, Yibin Zheng, Zhengqi Wen:
Transfer Learning Based Progressive Neural Networks for Acoustic Modeling in Statistical Parametric Speech Synthesis. 907-911
- Min-Jae Hwang, Eunwoo Song, Jin-Seob Kim, Hong-Goo Kang:
A Unified Framework for the Generation of Glottal Signals in Deep Learning-based Parametric Speech Synthesis Systems. 912-916
- Joun Yeop Lee, Sung Jun Cheon, Byoung Jin Choi, Nam Soo Kim, Eunwoo Song:
Acoustic Modeling Using Adversarially Trained Variational Recurrent Neural Network for Speech Synthesis. 917-921
- Yibin Zheng, Jianhua Tao, Zhengqi Wen, Ruibo Fu:
On the Application and Compression of Deep Time Delay Neural Network for Embedded Statistical Parametric Speech Synthesis. 922-926
Emotion Modeling
- Efthymios Tzinis, Georgios Paraskevopoulos, Christos Baziotis, Alexandros Potamianos:
Integrating Recurrence Dynamics for Speech Emotion Recognition. 927-931
- Wenjing Han, Huabin Ruan, Xiaomin Chen, Zhixiang Wang, Haifeng Li, Björn W. Schuller:
Towards Temporal Modelling of Categorical Speech Emotion Recognition. 932-936
- John W. Kim, Rif A. Saurous:
Emotion Recognition from Human Speech Using Temporal Information and Deep Learning. 937-940
- Kusha Sridhar, Srinivas Parthasarathy, Carlos Busso:
Role of Regularization in the Prediction of Valence from Speech. 941-945
- Karttikeya Mangalam, Tanaya Guha:
Learning Spontaneity to Improve Emotion Recognition in Speech. 946-950
- Reza Lotfian, Carlos Busso:
Predicting Categorical Emotions by Jointly Learning Primary and Secondary Emotions through Multitask Learning. 951-955
Models of Speech Perception
- Tiphaine Caudrelier, Pascal Perrier, Jean-Luc Schwartz, Amélie Rochet-Capellan:
Picture Naming or Word Reading: Does the Modality Affect Speech Motor Adaptation and Its Transfer? 956-960 - Yufan Du, Yi Shen, Hongying Yang, Xihong Wu, Jing Chen:
Measuring the Band Importance Function for Mandarin Chinese with a Bayesian Adaptive Procedure. 961-965 - Elnaz Shafaei-Bajestan, R. Harald Baayen:
Wide Learning for Auditory Comprehension. 966-970 - Louis ten Bosch, Mirjam Ernestus, Lou Boves:
Analyzing Reaction Time Sequences from Human Participants in Auditory Experiments. 971-975 - Jasper Ooster, Rainer Huber, Bernd T. Meyer:
Prediction of Perceived Speech Quality Using Deep Machine Listening. 976-980 - Paul Kranzusch, Rainer Huber, Melanie Krüger, Birger Kollmeier, Bernd T. Meyer:
Prediction of Subjective Listening Effort from Acoustic Data with Non-Intrusive Deep Models. 981-985
Multimodal Dialogue Systems
- Margarita Kotti, Vassilios Diakoloukas, Alexandros Papangelis, Michail Lagoudakis, Yannis Stylianou:
A Case Study on the Importance of Belief State Representation for Dialogue Policy Management. 986-990 - Kohei Hara, Koji Inoue, Katsuya Takanashi, Tatsuya Kawahara:
Prediction of Turn-taking Using Multitask Learning with Prediction of Backchannels and Fillers. 991-995 - Chandrakant Bothe, Sven Magg, Cornelius Weber, Stefan Wermter:
Conversational Analysis Using Utterance-level Attention-based Bidirectional Recurrent Neural Networks. 996-1000 - Yasuhito Ohsugi, Daisuke Saito, Nobuaki Minematsu:
A Comparative Study of Statistical Conversion of Face to Voice Based on Their Subjective Impressions. 1001-1005 - Ming-Hsiang Su, Chung-Hsien Wu, Kun-Yi Huang, Qian-Bei Hong, Huai-Hung Huang:
Follow-up Question Generation Using Pattern-based Seq2seq with a Small Corpus for Interview Coaching. 1006-1010 - Alessandra Cervone, Evgeny A. Stepanov, Giuseppe Riccardi:
Coherence Models for Dialogue. 1011-1015
Speech Recognition for Indian Languages
- K. E. Manjunath, K. Sreenivasa Rao, Dinesh Babu Jayagopi, V. Ramasubramanian:
Indian Languages ASR: A Multilingual Phone Recognition Framework with IPA Based Common Phone-set, Predicted Articulatory Features and Feature Fusion. 1016-1020 - Agha Ali Raza, Awais Athar, Shan Randhawa, Zain Tariq, Muhammad Bilal Saleem, Haris Bin Zia, Umar Saif, Roni Rosenfeld:
Rapid Collection of Spontaneous Speech Corpora Using Telephonic Community Forums. 1021-1025 - Savitha Murthy, Dinkar Sitaram, Sunayana Sitaram:
Effect of TTS Generated Audio on OOV Detection and Word Error Rate in ASR for Low-resource Languages. 1026-1030 - Tanvina Patel, Krishna D. N, Noor Fathima, Nisar Shah, Mahima C, Deepak Kumar, Anuroop Iyengar:
Development of Large Vocabulary Speech Recognition System with Keyword Search for Manipuri. 1031-1035 - Abhishek Dey, Biswajit Dev Sarma, Wendy Lalhminghlui, Lalnunsiami Ngente, Parismita Gogoi, Priyankoo Sarmah, S. R. Mahadeva Prasanna, Rohit Sinha, S. R. Nirmala:
Robust Mizo Continuous Speech Recognition. 1036-1040 - Maharajan Chellapriyadharshini, Anoop Toffy, Srinivasa Raghavan K. M., V. Ramasubramanian:
Semi-supervised and Active-learning Scenarios: Efficient Acoustic Model Refinement for a Low Resource Indian Language. 1041-1045 - Debadatta Dash, Myung Jong Kim, Kristin Teplansky, Jun Wang:
Automatic Speech Recognition with Articulatory Information and a Unified Dictionary for Hindi, Marathi, Bengali and Oriya. 1046-1050
Show and Tell 3
- Aku Rouhe, Reima Karhila, Aija Elg, Minnaleena Toivola, Peter Smit, Anna-Riikka Smolander, Mikko Kurimo:
Captaina: Integrated Pronunciation Practice and Data Collection Portal. 1051-1052 - Umesh Sachdev, Rajagopal Jayaraman, Zainab Millwala:
auMina™ - Enterprise Speech Analytics. 1053-1054 - Annam Naresh, Rushabh Gandhi, Mallikarjuna Rao Bellamkonda, Mithun Das Gupta:
HoloCompanion: An MR Friend for EveryOne. 1055-1056 - Umesh Sachdev, Rajagopal Jayaraman, Zainab Millwala:
akeira™ - Virtual Assistant. 1057-1058 - Srihari Maruthachalam, Sidharth Aggarwal, Mari Ganesh Kumar, Mriganka Sur, Hema A. Murthy:
Brain-Computer Interface using Electroencephalogram Signatures of Eye Blinks. 1059-1060
Speaker Verification II
- Moez Ajili, Jean-François Bonastre, Solange Rossato:
Voice Comparison and Rhythm: Behavioral Differences between Target and Non-target Comparisons. 1061-1065 - Longting Xu, Kong-Aik Lee, Haizhou Li, Zhen Yang:
Co-whitening of I-vectors for Short and Long Duration Speaker Verification. 1066-1070 - Fahimeh Bahmaninezhad, John H. L. Hansen:
Compensation for Domain Mismatch in Text-independent Speaker Recognition. 1071-1075 - Ziqiang Shi, Liu Liu, Huibin Lin, Rujie Liu:
Joint Learning of J-Vector Extractor and Joint Bayesian Model for Text Dependent Speaker Verification. 1076-1080 - Ziqiang Shi, Huibin Lin, Liu Liu, Rujie Liu:
Latent Factor Analysis of Deep Bottleneck Features for Speaker Verification with Random Digit Strings. 1081-1085 - Joon Son Chung, Arsha Nagrani, Andrew Zisserman:
VoxCeleb2: Deep Speaker Recognition. 1086-1090 - Shreyas Ramoji, Sriram Ganapathy:
Supervised I-vector Modeling - Theory and Applications. 1091-1095 - Evgeny Dmitriev, Yulia Kim, Anastasia Matveeva, Claude Montacié, Yannick Boulard, Yadviga Sinyavskaya, Yulia Zhukova, Adam Zarazinski, Egor Akhanov, Ilya I. Viksnin, Andrei A. Shlykov, Maria Usova:
LOCUST - Longitudinal Corpus and Toolset for Speaker Verification. 1096-1100 - Srikanth R. Madikeri, Subhadeep Dey, Petr Motlícek:
Analysis of Language Dependent Front-End for Speaker Recognition. 1101-1105 - Mahesh Kumar Nandwana, Julien van Hout, Mitchell McLaren, Allen R. Stauffer, Colleen Richey, Aaron Lawson, Martin Graciarena:
Robust Speaker Recognition from Distant Speech under Real Reverberant Environments Using Speaker Embeddings. 1106-1110 - Phani Sankar Nidadavolu, Cheng-I Lai, Jesús Villalba, Najim Dehak:
Investigation on Bandwidth Extension for Speaker Recognition. 1111-1115 - Hannah Muckenhirn, Mathew Magimai-Doss, Sébastien Marcel:
On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs. 1116-1120 - Rajath Kumar, Vaishnavi Yeruva, Sriram Ganapathy:
On Convolutional LSTM Modeling for Joint Wake-Word Detection and Text Dependent Speaker Verification. 1121-1125 - Zhongxin Bai, Xiao-Lei Zhang, Jingdong Chen:
Cosine Metric Learning for Speaker Verification in the I-vector Space. 1126-1130 - Arindam Jati, Panayiotis G. Georgiou:
An Unsupervised Neural Prediction Framework for Learning Speaker Embeddings Using Recurrent Neural Networks. 1131-1135
Novel Approaches to Enhancement
- Ashutosh Pandey, DeLiang Wang:
A New Framework for Supervised Speech Enhancement in the Time Domain. 1136-1140 - Jishnu Sadasivan, Subhadip Mukherjee, Chandra Sekhar Seelamantula:
Speech Enhancement Using the Minimum-probability-of-error Criterion. 1141-1145 - Pavlos Papadopoulos, Colin Vaz, Shrikanth S. Narayanan:
Exploring the Relationship between Conic Affinity of NMF Dictionaries and Speech Enhancement Metrics. 1146-1150 - Yun Liu, Hui Zhang, Xueliang Zhang:
Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation. 1151-1155 - Nagapuri Srinivas, Gayadhar Pradhan, Syed Shahnawazuddin:
Enhancement of Noisy Speech Signal by Non-Local Means Estimation of Variational Mode Functions. 1156-1160 - Priya Pallavi, Ch. V. Rama Rao:
Phase-locked Loop (PLL) Based Phase Estimation in Single Channel Speech Enhancement. 1161-1164 - Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Fred Juang:
Cycle-Consistent Speech Enhancement. 1165-1169 - Aviv Gabbay, Asaph Shamir, Shmuel Peleg:
Visual Speech Enhancement. 1170-1174 - Saketh Sharma, Nitya Tiwari, Prem C. Pandey:
Implementation of Digital Hearing Aid as a Smartphone Application. 1175-1179 - Ching Hua Lee, Bhaskar D. Rao, Harinath Garudadri:
Bone-Conduction Sensor Assisted Noise Estimation for Improved Speech Enhancement. 1180-1184 - Pramod B. Bachhav, Massimiliano Todisco, Nicholas W. D. Evans:
Artificial Bandwidth Extension with Memory Inclusion Using Semi-supervised Stacked Auto-encoders. 1185-1189 - Soumi Maiti, Joey Ching, Michael I. Mandel:
Large Vocabulary Concatenative Resynthesis. 1190-1194 - Ali Raza Syed, Viet Anh Trinh, Michael I. Mandel:
Concatenative Resynthesis with Improved Training Signals for Speech Enhancement. 1195-1199
Syllabification, Rhythm, and Voice Activity Detection
- Okko Räsänen, Shreyas Seshadri, Marisa Casillas:
Comparison of Syllabification Algorithms and Training Strategies for Robust Word Count Estimation across Different Languages and Recording Conditions. 1200-1204 - Matthew C. Kelley, Benjamin V. Tucker:
A Comparison of Input Types to a Deep Neural Network-based Forced Aligner. 1205-1209 - Youngmoon Jung, Younggwan Kim, Yeunju Choi, Hoirin Kim:
Joint Learning Using Denoising Variational Autoencoders for Voice Activity Detection. 1210-1214 - Nauman Dawalatabad, Jom Kuriakose, Chellu Chandra Sekhar, Hema A. Murthy:
Information Bottleneck Based Percussion Instrument Diarization System for Taniavartanam Segments of Carnatic Music Concerts. 1215-1219 - Debayan Ghosh, R. Muralishankar, Sanjeev Gurugopinath:
Robust Voice Activity Detection Using Frequency Domain Long-Term Differential Entropy. 1220-1224 - Sri Harish Reddy Mallidi, Roland Maas, Kyle Goehner, Ariya Rastrow, Spyros Matsoukas, Björn Hoffmeister:
Device-directed Utterance Detection. 1225-1228 - Rohit M. A., Preeti Rao:
Acoustic-Prosodic Features of Tabla Bol Recitation and Correspondence with the Tabla Imitation. 1229-1233 - Teun F. Krikke, Frank Broz, David Lane:
Who Said That? A Comparative Study of Non-negative Matrix Factorization Techniques. 1234-1238 - Sourish Chaudhuri, Joseph Roth, Daniel P. W. Ellis, Andrew C. Gallagher, Liat Kaver, Radhika Marvin, Caroline Pantofaru, Nathan Reale, Loretta Guarino Reid, Kevin W. Wilson, Zhonghua Xi:
AVA-Speech: A Densely Labeled Dataset of Speech Activity in Movies. 1239-1243 - Fei Tao, Carlos Busso:
Audiovisual Speech Activity Detection with Advanced Long Short-Term Memory. 1244-1248 - Pramit Saha, Praneeth Srungarapu, Sidney S. Fels:
Towards Automatic Speech Identification from Vocal Tract Shape Dynamics in Real-time MRI. 1249-1253
Selected Topics in Neural Speech Processing
- Kaiyu Shi, Kai Yu:
Structured Word Embedding for Low Memory Neural Network Language Model. 1254-1258 - Ryo Masumura, Tomohiro Tanaka, Atsushi Ando, Hirokazu Masataki, Yushi Aono:
Role Play Dialogue Aware Language Models Based on Conditional Hierarchical Recurrent Encoder-Decoder. 1259-1263 - Samuel Myer, Vikrant Singh Tomar:
Efficient Keyword Spotting Using Time Delay Neural Networks. 1264-1268 - Tsukasa Yoshida, Takafumi Moriya, Kazuho Watanabe, Yusuke Shinohara, Yoshikazu Yamaguchi, Yushi Aono:
Automatic DNN Node Pruning Using Mixture Distribution-based Group Regularization. 1269-1273 - Raffaele Tavarone, Leonardo Badino:
Conditional-Computation-Based Recurrent Neural Networks for Computationally Efficient Acoustic Modelling. 1274-1278 - Antonios Anastasopoulos, David Chiang:
Leveraging Translations for Speech Transcription in Low-resource Settings. 1279-1283 - Antoine Bruguier, Heiga Zen, Arkady Arkhangorodsky:
Sequence-to-sequence Neural Network Model with 2D Attention for Learning Japanese Pitch Accents. 1284-1287 - Sahar Ghannay, Yannick Estève, Nathalie Camelin:
Task Specific Sentence Embeddings for ASR Error Detection. 1288-1292 - Jan Niehues, Ngoc-Quan Pham, Thanh-Le Ha, Matthias Sperber, Alex Waibel:
Low-Latency Neural Speech Translation. 1293-1297 - Sameer Bansal, Herman Kamper, Karen Livescu, Adam Lopez, Sharon Goldwater:
Low-Resource Speech-to-Text Translation. 1298-1302 - Ferdinand Brasser, Tommaso Frassetto, Korbinian Riedhammer, Ahmad-Reza Sadeghi, Thomas Schneider, Christian Weinert:
VoiceGuard: Secure and Private Speech Processing. 1303-1307
Perspective Talk-1
- Dilek Hakkani-Tür:
Deep Learning based Situated Goal-oriented Dialogue Systems. 1308
Dereverberation
- Chenxing Li, Tieqiang Wang, Shuang Xu, Bo Xu:
Single-channel Speech Dereverberation via Generative Adversarial Training. 1309-1313 - Wolfgang Mack, Soumitro Chakrabarty, Fabian-Robert Stöter, Sebastian Braun, Bernd Edler, Emanuël A. P. Habets:
Single-Channel Dereverberation Using Direct MMSE Optimization and Bidirectional LSTM Networks. 1314-1318 - Ina Kodrasi, Hervé Bourlard:
Single-channel Late Reverberation Power Spectral Density Estimation Using Denoising Autoencoders. 1319-1323 - Nikhil Mohanan, Rajbabu Velmurugan, Preeti Rao:
A Non-convolutive NMF Model for Speech Dereverberation. 1324-1328 - Peter Guzewich, Stephen A. Zahorian, Xiao Chen, Hao Zhang:
Cross-Corpora Convolutional Deep Neural Network Dereverberation Preprocessing for Speaker Verification and Speech Enhancement. 1329-1333 - Ladislav Mosner, Oldrich Plchot, Pavel Matejka, Ondrej Novotný, Jan Cernocký:
Dereverberation and Beamforming in Robust Far-Field Speaker Recognition. 1334-1338
Audio Events and Acoustic Scenes
- Yun Wang, Juncheng Li, Florian Metze:
Comparing the Max and Noisy-Or Pooling Functions in Multiple Instance Learning for Weakly Supervised Sequence Learning Tasks. 1339-1343 - Weiran Wang, Chieh-Chi Kao, Chao Wang:
A Simple Model for Detection of Rare Sound Events. 1344-1348 - Teng Zhang, Kailai Zhang, Ji Wu:
Temporal Transformer Networks for Acoustic Scene Classification. 1349-1353 - Xugang Lu, Peng Shen, Sheng Li, Yu Tsao, Hisashi Kawai:
Temporal Attentive Pooling for Acoustic Event Detection. 1354-1357 - Chieh-Chi Kao, Weiran Wang, Ming Sun, Chao Wang:
R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection. 1358-1362 - Constantinos Papayiannis, Justice Amoh, Viktor Rozgic, Shiva Sundaram, Chao Wang:
Detecting Media Sound Presence in Acoustic Scenes. 1363-1367
Speaker Diarization
- Pierre-Alexandre Broux, Florent Desnous, Anthony Larcher, Simon Petitrenaud, Jean Carrive, Sylvain Meignier:
S4D: Speaker Diarization Toolkit in Python. 1368-1372 - Tae Jin Park, Panayiotis G. Georgiou:
Multimodal Speaker Segmentation and Diarization Using Lexical and Acoustic Cues via Sequence to Sequence Neural Networks. 1373-1377 - Nikolaos Flemotomos, Pavlos Papadopoulos, James Gibson, Shrikanth S. Narayanan:
Combined Speaker Clustering and Role Recognition in Conversational Speech. 1378-1382 - Adrien Le Franc, Eric Riebling, Julien Karadayi, Yun Wang, Camila Scaff, Florian Metze, Alejandrina Cristià:
The ACLEW DiViMe: An Easy-to-use Diarization Tool. 1383-1387 - Evdokia Kazimirova, Andrey Belyaev:
Automatic Detection of Multi-speaker Fragments with High Time Resolution. 1388-1392 - Ruiqing Yin, Hervé Bredin, Claude Barras:
Neural Speech Turn Segmentation and Affinity Propagation for Speaker Diarization. 1393-1397
Phonation
- Minghui Zhang, Fang Hu:
Pitch or Phonation: On the Glottalization in Tone Productions in the Ruokeng Hui Chinese Dialect. 1398-1402 - Marc Antony Hullebus, Stephen J. Tobin, Adamantios I. Gafos:
Speaker-specific Structure in German Voiceless Stop Voice Onset Times. 1403-1407 - Kätlin Aare, Pärtel Lippus, Marcin Wlodarczak, Mattias Heldner:
Creak in the Respiratory Cycle. 1408-1412 - Cuiling Zhang, Bin Li, Si Chen, Yike Yang:
Acoustic Analysis of Whispery Voice Disguise in Mandarin Chinese. 1413-1416 - Dieter Maurer, Christian d'Heureuse, Heidy Suter, Volker Dellwo, Daniel Friedrichs, Thayabaran Kathiresan:
The Zurich Corpus of Vowel and Voice Quality, Version 1.0. 1417-1421 - Joshua Penney, Felicity Cox, Anita Szakay:
Weighting of Coda Voicing Cues: Glottalisation and Vowel Duration. 1422-1426
Cognition and Brain Studies
- Bin Zhao, Jinfeng Huang, Gaoyan Zhang, Jianwu Dang, Minbo Chen, Yingjian Fu, Longbiao Wang:
Revealing Spatiotemporal Brain Dynamics of Speech Production Based on EEG and Eye Movement. 1427-1431 - Natalie Boll-Avetisyan, Jessie S. Nixon, Tomas O. Lentz, Liquan Liu, Sandrien van Ommen, Çagri Çöltekin, Jacolien van Rij:
Neural Response Development During Distributional Learning. 1432-1436 - Akshay Raj Maggu, Wenqing Zong, Vina Law, Patrick C. M. Wong:
Learning Two Tone Languages Enhances the Brainstem Encoding of Lexical Tones. 1437-1441 - Daniel Williams, Paola Escudero, Adamantios I. Gafos:
Perceptual Sensitivity to Spectral Change in Australian English Close Front Vowels: An Electroencephalographic Investigation. 1442-1446 - Jessie S. Nixon:
Effective Acoustic Cue Learning Is Not Just Statistical, It Is Discriminative. 1447-1451 - Kimberley Mulder, Louis ten Bosch, Lou Boves:
Analyzing EEG Signals in Auditory Speech Comprehension Using Temporal Response Functions and Generalized Additive Models. 1452-1456
Deep Neural Networks: How Can We Interpret What They Learned?
- Louis ten Bosch, Lou Boves:
Information Encoding by Deep Neural Networks: What Can We Learn? 1457-1461 - Wei-Ning Hsu, James R. Glass:
Scalable Factorized Hierarchical Variational Autoencoder Training. 1462-1466 - Lyan Verwimp, Hugo Van hamme, Vincent Renkens, Patrick Wambacq:
State Gradients for RNN Memory Analysis. 1467-1471 - Linxue Bai, Philip Weber, Peter Jancovic, Martin J. Russell:
Exploring How Phone Classification Neural Networks Learn Phonetic Information by Visualising and Interpreting Bottleneck Features. 1472-1476 - Jeroen Zegers, Hugo Van hamme:
Memory Time Span in LSTMs for Multi-Speaker Source Separation. 1477-1481 - Odette Scharenborg, Sebastian Tiesmeyer, Mark Hasegawa-Johnson, Najim Dehak:
Visualizing Phoneme Category Adaptation in Deep Neural Networks. 1482-1486
Show and Tell 4
- G. R. Kasthuri, Prabha Ramanathan, Hema A. Murthy, Namita Jacob, Anil Prabhakar:
Early Vocabulary Development Through Picture-based Software Solutions. 1487-1488 - Kamini Sabu, Kanhaiya Kumar, Preeti Rao:
Automatic Detection of Expressiveness in Oral Reading. 1489-1490 - Madhab Pal, Rajib Roy, Soma Khan, Milton Samirakshma Bepari, Joyanta Basu:
PannoMulloKathan: Voice Enabled Mobile App for Agricultural Commodity Price Dissemination in Bengali Language. 1491-1492 - Alp Öktem, Mireia Farrús, Antonio Bonafonte:
Visualizing Punctuation Restoration in Speech Transcripts with Prosograph. 1493-1494 - Mithul Mathivanan, Kinnera Saranu, Abhishek Pandey, Jithendra Vepa:
CACTAS - Collaborative Audio Categorization and Transcription for ASR Systems. 1495-1496
Speech and Singing Production
- Benjamin Parrell, Vikram Ramanarayanan, Srikantan S. Nagarajan, John F. Houde:
FACTS: A Hierarchical Task-based Control Model of Speech Incorporating Sensory Feedback. 1497-1501 - William F. Katz, Patrick Reidy, Divya Prabhakaran:
Sensorimotor Response to Tongue Displacement Imagery by Talkers with Parkinson's Disease. 1502-1506 - Chitralekha Gupta, Haizhou Li, Ye Wang:
Automatic Pronunciation Evaluation of Singing. 1507-1511 - Rachel E. Bouserhal, Philippe Chabot, Milton Sarria Paja, Patrick Cardinal, Jérémie Voix:
Classification of Nonverbal Human Produced Audio Events: A Pilot Study. 1512-1516 - Lorenzo Spreafico, Michael Pucher, Anna Matosova:
UltraFit: A Speaker-friendly Headset for Ultrasound Recordings in Speech Science. 1517-1520 - Elísabet Eir Cortes, Marcin Wlodarczak, Juraj Simko:
Articulatory Consequences of Vocal Effort Elicitation Method. 1521-1525 - Anne Hermes, Jane Mertens, Doris Mücke:
Age-related Effects on Sensorimotor Control of Speech Production. 1526-1530 - Maida Percival, Alexei Kochetov, Yoonjung Kang:
An Ultrasound Study of Gemination in Coronal Stops in Eastern Oromo. 1531-1535 - Protima Nomo Sudro, Sishir Kalita, S. R. Mahadeva Prasanna:
Processing Transition Regions of Glottal Stop Substituted /S/ for Intelligibility Enhancement of Cleft Palate Speech. 1536-1540 - Abinay Reddy Naini, M. V. Achuth Rao, G. Nisha Meenakshi, Prasanta Kumar Ghosh:
Reconstructing Neutral Speech from Tracheoesophageal Speech. 1541-1545 - Keiko Ochi, Koichi Mori, Naomi Sakai:
Automatic Evaluation of Soft Articulatory Contact for Stuttering Treatment. 1546-1550 - Juntae Kim, Heejin Choi, Jinuk Park, Minsoo Hahn, Sang-Jin Kim, Jong-Jin Kim:
Korean Singing Voice Synthesis Based on an LSTM Recurrent Neural Network. 1551-1555 - Xuanda Chen, Ziyu Xiong, Jian Hu:
The Trajectory of Voice Onset Time with Vocal Aging. 1556-1560
Robust Speech Recognition
- Jon Barker, Shinji Watanabe, Emmanuel Vincent, Jan Trmal:
The Fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, Task and Baselines. 1561-1565 - Colleen Richey, María Auxiliadora Barrios, Zeb Armstrong, Chris Bartels, Horacio Franco, Martin Graciarena, Aaron Lawson, Mahesh Kumar Nandwana, Allen R. Stauffer, Julien van Hout, Paul Gamble, Jeffrey Hetherly, Cory Stephenson, Karl Ni:
Voices Obscured in Complex Environmental Settings (VOiCES) Corpus. 1566-1570 - Szu-Jui Chen, Aswin Shanmugam Subramanian, Hainan Xu, Shinji Watanabe:
Building State-of-the-art Distant Speech Recognition Using the CHiME-4 Challenge with a Setup of Speech Enhancement Baseline. 1571-1575 - Wei-Ning Hsu, Hao Tang, James R. Glass:
Unsupervised Adaptation with Interpretable Disentangled Representations for Distant Conversational Speech Recognition. 1576-1580 - Ke Wang, Junbo Zhang, Sining Sun, Yujun Wang, Fei Xiang, Lei Xie:
Investigating Generative Adversarial Networks Based Speech Dereverberation for Robust Speech Recognition. 1581-1585 - Xuankai Chang, Yanmin Qian, Dong Yu:
Monaural Multi-Talker Speech Recognition with Attention Mechanism and Gated Convolutional Networks. 1586-1590 - Cong-Thanh Do, Yannis Stylianou:
Weighting Time-Frequency Representation of Speech Using Auditory Saliency for Automatic Speech Recognition. 1591-1595 - Pegah Ghahremani, Hossein Hadian, Hang Lv, Daniel Povey, Sanjeev Khudanpur:
Acoustic Modeling from Frequency Domain Representations of Speech. 1596-1600 - Ishwar Chandra Yadav, Avinash Kumar, Syed Shahnawazuddin, Gayadhar Pradhan:
Non-Uniform Spectral Smoothing for Robust Children's Speech Recognition. 1601-1605 - Aaron Nicolson, Kuldip K. Paliwal:
Bidirectional Long-Short Term Memory Network-based Estimation of Reliable Spectral Component Locations. 1606-1610 - Lili Guo, Longbiao Wang, Jianwu Dang, Linjuan Zhang, Haotian Guan, Xiangang Li:
Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network. 1611-1615 - Viet Anh Trinh, Brian McFee, Michael I. Mandel:
Bubble Cooperative Networks for Identifying Important Speech Cues. 1616-1620
Applications in Education and Learning
- Jian Cheng:
Real-Time Scoring of an Oral Reading Assessment on Mobile Devices. 1621-1625 - Konstantinos Kyriakopoulos, Kate M. Knill, Mark J. F. Gales:
A Deep Learning Approach to Assessing Non-native Pronunciation of English Using Phone Distances. 1626-1630 - Yujia Xiao, Frank K. Soong, Wenping Hu:
Paired Phone-Posteriors Approach to ESL Pronunciation Quality Assessment. 1631-1635 - Ming Tu, Anna Grabek, Julie Liss, Visar Berisha:
Investigating the Role of L1 in Automatic Pronunciation Evaluation of L2 Speech. 1636-1640 - Kate M. Knill, Mark J. F. Gales, Konstantinos Kyriakopoulos, Andrey Malinin, Anton Ragni, Yu Wang, Andrew Caines:
Impact of ASR Performance on Free Speaking Language Assessment. 1641-1645 - Yoon Seok Hong, Kyung Seo Ki, Gahgene Gweon:
Automatic Miscue Detection Using RNN Based Models with Data Augmentation. 1646-1650 - Yusuke Inoue, Suguru Kabashima, Daisuke Saito, Nobuaki Minematsu, Kumi Kanamura, Yutaka Yamauchi:
A Study of Objective Measurement of Comprehensibility through Native Speakers' Shadowing of Learners' Utterances. 1651-1655 - Dean Luo, Chunxiao Zhang, Linzhong Xia, Lixin Wang:
Factorized Deep Neural Network Adaptation for Automatic Scoring of L2 Speech in English Speaking Tests. 1656-1660 - Gary Yeung, Abeer Alwan:
On the Difficulties of Automatic Speech Recognition for Kindergarten-Aged Children. 1661-1665 - Mauro Nicolao, Michiel Sanders, Thomas Hain:
Improved Acoustic Modelling for Automatic Literacy Assessment of Children. 1666-1670
Integrating Speech Science and Technology for Clinical Applications
- Mostafa Ali Shahin, Beena Ahmed, Jim X. Ji, Kirrie J. Ballard:
Anomaly Detection Approach for Pronunciation Verification of Disordered Speech Using Speech Attribute Features. 1671-1675 - Amber Afshan, Jinxi Guo, Soo Jin Park, Vijay Ravi, Jonathan Flint, Abeer Alwan:
Effectiveness of Voice Quality Features in Detecting Depression. 1676-1680 - Prasanna V. Kothalkar, Johanna Rudolph, Christine Dollaghan, Jennifer McGlothlin, Thomas F. Campbell, John H. L. Hansen:
Fusing Text-dependent Word-level i-Vector Models to Screen 'at Risk' Child Speech. 1681-1685 - Ram Charan Chandra Shekar, Hussnain Ali, John H. L. Hansen:
Testing Paradigms for Assistive Hearing Devices in Diverse Acoustic Environments. 1686-1690 - Tsuyoki Ujiro, Hiroki Tanaka, Hiroyoshi Adachi, Hiroaki Kazui, Manabu Ikeda, Takashi Kudo, Satoshi Nakamura:
Detection of Dementia from Responses to Atypical Questions Asked by Embodied Conversational Agents. 1691-1695 - Wang Zhang, Xiangquan Gui, Tianqi Wang, Manwa L. Ng, Feng Yang, Lan Wang, Nan Yan:
Acoustic Features Associated with Sustained Vowel and Continuous Speech Productions by Chinese Children with Functional Articulation Disorders. 1696-1700 - Vikram C. M., Ayush Tripathi, Sishir Kalita, S. R. Mahadeva Prasanna:
Estimation of Hypernasality Scores from Cleft Lip and Palate Speech. 1701-1705 - Tifani Warnita, Nakamasa Inoue, Koichi Shinoda:
Detecting Alzheimer's Disease Using Gated Convolutional Neural Network from Audio Data. 1706-1710 - Andrea Bandini, Jordan R. Green, Brian Richburg, Yana Yunusova:
Automatic Detection of Orofacial Impairment in Stroke. 1711-1715 - Tuka Al Hanai, Mohammad M. Ghassemi, James R. Glass:
Detecting Depression with Audio/Text Sequence Modeling of Interviews. 1716-1720
Speaker Characterization and Analysis
- Yu-Wun Wang, Hen-Hsen Huang, Kuan-Yu Chen, Hsin-Hsi Chen:
Discourse Marker Detection for Hesitation Events on Mandarin Conversation. 1721-1725 - Puyang Geng, Wentao Gu, Hiroya Fujisaki:
Acoustic and Perceptual Characteristics of Mandarin Speech in Homosexual and Heterosexual Male Speakers. 1726-1730 - Atsushi Ando, Reine Asakawa, Ryo Masumura, Hosana Kamiyama, Satoshi Kobashikawa, Yushi Aono:
Automatic Question Detection from Acoustic and Phonetic Features Using Feature-wise Pre-training. 1731-1735 - Fasih Haider, Saturnino Luz, Carl Vogel, Nick Campbell:
Improving Response Time of Active Speaker Detection Using Visual Prosody Information Prior to Articulation. 1736-1740 - Bekir Berker Türker, Engin Erzin, Yücel Yemez, T. Metin Sezgin:
Audio-Visual Prediction of Head-Nod and Turn-Taking Events in Dyadic Interactions. 1741-1745 - Haoran Wu, Yuya Chiba, Takashi Nose, Akinori Ito:
Analyzing Effect of Physical Expression on English Proficiency for Multimodal Computer-Assisted Language Learning. 1746-1750 - Sri Harsha Dumpala, Ashish Panda, Sunil Kumar Kopparapu:
Analysis of the Effect of Speech-Laugh on Speaker Recognition System. 1751-1755 - Jennifer Sloboda, Adam C. Lammert, James R. Williamson, Christopher J. Smalt, Daryush D. Mehta, C. O. L. Ian Curry, Kristin Heaton, Jeff Palmer, Thomas F. Quatieri:
Vocal Biomarkers for Cognitive Performance Estimation in a Working Memory Task. 1756-1760 - Guozhen An, Rivka Levitan:
Lexical and Acoustic Deep Learning Model for Personality Recognition. 1761-1765
Perspective Talk-2
- Bhuvana Ramabhadran:
Open Problems in Speech Recognition. 1766
Plenary Talk-2
- Hervé Bourlard:
Evolution of Neural Network Architectures for Speech Recognition. 1767
Novel Neural Network Architectures for Acoustic Modelling
- Jinyu Li, Changliang Liu, Yifan Gong:
Layer Trajectory LSTM. 1768-1772 - Chao Zhang, Philip C. Woodland:
Semi-tied Units for Efficient Gating in LSTM and Highway Networks. 1773-1777 - Max W. Y. Lam, Shoukang Hu, Xurong Xie, Shansong Liu, Jianwei Yu, Rongfeng Su, Xunying Liu, Helen Meng:
Gaussian Process Neural Networks for Speech Recognition. 1778-1782 - Jian Tang, Yan Song, Lirong Dai, Ian McLoughlin:
Acoustic Modeling with Densely Connected Residual Network for Multichannel Speech Recognition. 1783-1787 - Jie Li, Xiaorui Wang, Yuanyuan Zhao, Yan Li:
Gated Recurrent Unit Based Acoustic Modeling with Future Context. 1788-1792 - Gaofeng Cheng, Daniel Povey, Lu Huang, Ji Xu, Sanjeev Khudanpur, Yonghong Yan:
Output-Gate Projected Gated Recurrent Unit for Speech Recognition. 1793-1797
Language Identification
- Seyed Omid Sadjadi, Timothée Kheyrkhah, Craig S. Greenberg, Elliot Singer, Douglas A. Reynolds, Lisa P. Mason, Jaime Hernandez-Cordero:
Performance Analysis of the 2017 NIST Language Recognition Evaluation. 1798-1802 - Lukás Mateju, Petr Cerva, Jindrich Zdánský, Radek Safarík:
Using Deep Neural Networks for Identification of Slavic Languages from Acoustic Signal. 1803-1807 - Hagai Taitelbaum, Ehud Ben-Reuven, Jacob Goldberger:
Adding New Classes without Access to the Original Training Data with Applications to Language Identification. 1808-1812 - Peng Shen, Xugang Lu, Sheng Li, Hisashi Kawai:
Feature Representation of Short Utterances Based on Knowledge Distillation for Spoken Language Identification. 1813-1817 - Sarith Fernando, Vidhyasaharan Sethu, Eliathamby Ambikairajah:
Sub-band Envelope Features Using Frequency Domain Linear Prediction for Short Duration Language Identification. 1818-1822 - Peter Sibbern Frederiksen, Jesús Villalba, Shinji Watanabe, Zheng-Hua Tan, Najim Dehak:
Effectiveness of Single-Channel BLSTM Enhancement for Language Identification. 1823-1827
Production of Prosody
- Erica Gold:
Articulation Rate as a Speaker Discriminant in British English. 1828-1832 - Jenny Yu, Katharina Zahner:
Truncation and Compression in Southern German and Australian English. 1833-1837 - Heini Kallio, Antti Suni, Päivi Virkkunen, Juraj Simko:
Prominence-based Evaluation of L2 Prosody. 1838-1842 - Rachid Ridouane, Giuseppina Turco, Julien Meyer:
Length Contrast and Covarying Features: Whistled Speech as a Case Study. 1843-1847 - Eleanor Chodroff, Jennifer S. Cole:
Information Structure, Affect and Prenuclear Prominence in American English. 1848-1852 - John S. Novak III, Robert V. Kenyon:
Effects of User Controlled Speech Rate on Intelligibility in Noisy Environments. 1853-1857
Speech Intelligibility and Quality
- Kazuhiro Kondo, Kazuya Taira, Yosuke Kobayashi:
Binaural Speech Intelligibility Estimation Using Deep Neural Networks. 1858-1862 - Katsuhiko Yamamoto, Toshio Irino, Narumi Ohashi, Shoko Araki, Keisuke Kinoshita, Tomohiro Nakatani:
Multi-resolution Gammachirp Envelope Distortion Index for Intelligibility Prediction of Noisy Speech. 1863-1867 - P. V. Muhammed Shifas, Vassilis Tsiaras, Yannis Stylianou:
Speech Intelligibility Enhancement Based on a Non-causal Wavenet-like Model. 1868-1872 - Szu-Wei Fu, Yu Tsao, Hsin-Te Hwang, Hsin-Min Wang:
Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model Based on BLSTM. 1873-1877 - Rohith Aralikatti, Dilip Kumar Margam, Tanay Sharma, Abhinav Thanda, Shankar M. Venkatesan:
Global SNR Estimation of Speech Signals Using Entropy and Uncertainty Estimates from Dropout Networks. 1878-1882 - Gabriel Mittag, Sebastian Möller:
Detecting Packet-Loss Concealment Using Formant Features and Decision Tree Learning. 1883-1887
Integrating Speech Science and Technology for Clinical Applications
- Aciel Eshky, Manuel Sam Ribeiro, Joanne Cleland, Korin Richmond, Zoe Roxburgh, James M. Scobbie, Alan Wrench:
UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions. 1888-1892 - Bahman Mirheidari, Daniel Blackburn, Traci Walker, Annalena Venneri, Markus Reuber, Heidi Christensen:
Detecting Signs of Dementia Using Word Vector Representations. 1893-1897 - Matthew Perez, Wenyu Jin, Duc Le, Noelle Carlozzi, Praveen Dayalu, Angela Roberts, Emily Mower Provost:
Classification of Huntington Disease Using Acoustic and Lexical Features. 1898-1902 - Soheil Khorram, Mimansa Jaiswal, John Gideon, Melvin G. McInnis, Emily Mower Provost:
The PRIORI Emotion Dataset: Linking Mood to Emotion Detected In-the-Wild. 1903-1907 - Nikolaos Flemotomos, Victor R. Martinez, James Gibson, David C. Atkins, Torrey A. Creed, Shrikanth S. Narayanan:
Language Features for Automated Evaluation of Cognitive Behavior Psychotherapy Sessions. 1908-1912 - Kwanghoon An, Myung Jong Kim, Kristin Teplansky, Jordan R. Green, Thomas F. Campbell, Yana Yunusova, Daragh Heitzman, Jun Wang:
Automatic Early Detection of Amyotrophic Lateral Sclerosis from Intelligible Speech Using Convolutional Neural Networks. 1913-1917
Speech Technologies for Code-Switching in Multilingual Communities
- Preeti Rao, Mugdha Pandya, Kamini Sabu, Kanhaiya Kumar, Nandini Bondale:
A Study of Lexical and Prosodic Cues to Segmentation in a Hindi-English Code-switched Discourse. 1918-1922 - Emre Yilmaz, Astik Biswas, Ewald van der Westhuizen, Febe de Wet, Thomas Niesler:
Building a Unified Code-Switching ASR System for South African Languages. 1923-1927 - Pengcheng Guo, Haihua Xu, Lei Xie, Eng Siong Chng:
Study of Semi-supervised Approaches to Improving English-Mandarin Code-Switching Speech Recognition. 1928-1932 - Emre Yilmaz, Henk van den Heuvel, David A. van Leeuwen:
Acoustic and Textual Data Augmentation for Improved ASR of Code-Switching Speech. 1933-1937 - Victor Soto, Nishmar Cestero, Julia Hirschberg:
The Role of Cognate Words, POS Tags and Entrainment in Code-Switching. 1938-1942 - Brij Mohan Lal Srivastava, Sunayana Sitaram:
Homophone Identification and Merging for Code-switched Speech Recognition. 1943-1947 - Anju Leela Thomas, Anusha Prakash, Arun Baby, Hema A. Murthy:
Code-switching in Indic Speech Synthesisers. 1948-1952 - Ganji Sreeram, Rohit Sinha:
A Novel Approach for Effective Recognition of the Code-Switched Data on Monolingual Language Model. 1953-1957
Show and Tell 5
- Ramya Viswanathan, Periyasamy Paramasivam, Jithendra Vepa:
Hierarchical Accent Determination and Application in a Large Scale ASR System. 1958-1959 - Vikram Ramanarayanan, David Pautler, Patrick L. Lange, Eugene Tsuprun, Rutuja Ubale, Keelan Evanini, David Suendermann-Oeft:
Toward Scalable Dialog Technology for Conversational Language Learning: Case Study of the TOEFL® MOOC. 1960-1961 - João Freitas, Jorge Ribeiro, Daan Baldewijns, Sara Oliveira, Daniela Braga:
Machine Learning Powered Data Platform for High-Quality Speech and NLP Workflows. 1962-1963 - Raphael Cohen, Orgad Keller, Jason Levy, Russell Levy, Micha Breakstone, Amit Ashkenazi:
Fully Automatic Speaker Separation System, with Automatic Enrolling of Recurrent Speakers. 1964-1965 - Madhavaraj Ayyavu, Shiva Kumar H. R., A. G. Ramakrishnan:
Online Speech Translation System for Tamil. 1966-1967
Voice Conversion and Speech Synthesis
- Nirmesh J. Shah, Maulik C. Madhavi, Hemant A. Patil:
Unsupervised Vocal Tract Length Warped Posterior Features for Non-Parallel Voice Conversion. 1968-1972 - Cong Zhou, Michael Horgan, Vivek Kumar, Cristina Vasco, Dan Darcy:
Voice Conversion with Conditional SampleRNN. 1973-1977 - Berrak Sisman, Mingyang Zhang, Haizhou Li:
A Voice Conversion Framework with Tandem Feature Sparse Representation and Speaker-Adapted WaveNet Vocoder. 1978-1982 - Li-Juan Liu, Zhen-Hua Ling, Yuan Jiang, Ming Zhou, Li-Rong Dai:
WaveNet Vocoder with Limited Training Data for Voice Conversion. 1983-1987 - Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Hayashi, Patrick Lumban Tobing, Tomoki Toda:
Collapsed Speech Segment Detection and Suppression for WaveNet Vocoder. 1988-1992 - Kuan Chen, Bo Chen, Jiahao Lai, Kai Yu:
High-quality Voice Conversion Using Spectrogram-Based WaveNet Vocoder. 1993-1997 - Antonio Bonafonte, Santiago Pascual, Georgina Dorca:
Spanish Statistical Parametric Speech Synthesis Using a Neural Vocoder. 1998-2001 - Monika Podsiadlo, Victor Ungureanu:
Experiments with Training Corpora for Statistical Text-to-speech Systems. 2002-2006 - Yu Gu, Yongguo Kang:
Multi-task WaveNet: A Multi-task Generative Model for Statistical Parametric Speech Synthesis without Fundamental Frequency Conditions. 2007-2011 - Lauri Juvela, Vassilis Tsiaras, Bajibabu Bollepalli, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speaker-independent Raw Waveform Model for Glottal Excitation. 2012-2016 - Yang Cui, Xi Wang, Lei He, Frank K. Soong:
A New Glottal Neural Vocoder for Speech Synthesis. 2017-2021 - Oliver Watts, Cassia Valentini-Botinhao, Felipe Espic, Simon King:
Exemplar-based Speech Waveform Generation. 2022-2026 - Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda, Toshio Irino:
Frequency Domain Variants of Velvet Noise and Their Application to Speech Processing and Synthesis. 2027-2031
Extracting Information from Audio
- Pei-Hung Chung, Kuan Tung, Ching-Lun Tai, Hung-yi Lee:
Joint Learning of Interactive Spoken Content Retrieval and Trainable User Simulator. 2032-2036 - Changhao Shan, Junbo Zhang, Yujun Wang, Lei Xie:
Attention-based End-to-End Models for Small-Footprint Keyword Spotting. 2037-2041 - Ragesh Rajan M, Ashwin Vijayakumar, Deepu Vijayasenan:
Prediction of Aesthetic Elements in Karnatic Music: A Machine Learning Approach. 2042-2046 - Wenda Chen, Mark Hasegawa-Johnson, Nancy F. Chen:
Topic and Keyword Identification for Low-resourced Speech Using Cross-Language Transfer Learning. 2047-2051 - Matthew Wiesner, Chunxi Liu, Lucas Ondel, Craig Harman, Vimal Manohar, Jan Trmal, Zhongqiang Huang, Najim Dehak, Sanjeev Khudanpur:
Automatic Speech Recognition and Topic Identification from Speech for Almost-Zero-Resource Languages. 2052-2056 - Bo Xiao, Nicholas Monath, Shankar Ananthakrishnan, Abishek Ravi:
Play Duration Based User-Entity Affinity Modeling in Spoken Dialog System. 2057-2061 - Shi-wook Lee, Kazuyo Tanaka, Yoshiaki Itoh:
Empirical Analysis of Score Fusion Application to Combined Neural Networks for Open Vocabulary Spoken Term Detection. 2062-2066 - Afsaneh Asaei, Dhananjay Ram, Hervé Bourlard:
Phonological Posterior Hashing for Query by Example Spoken Term Detection. 2067-2071 - Maren Kucza, Jan Niehues, Thomas Zenkel, Alex Waibel, Sebastian Stüker:
Term Extraction via Neural Sequence Labeling a Comparative Evaluation of Strategies Using Recurrent Neural Networks. 2072-2076 - Anjuli Kannan, Kai Chen, Diana Jaunzeikare, Alvin Rajkomar:
Semi-supervised Learning for Information Extraction from Dialogue. 2077-2081 - Youhyun Shin, Kang Min Yoo, Sang-goo Lee:
Slot Filling with Delexicalized Sentence Generation. 2082-2086 - Deepanway Ghosal, Maheshkumar H. Kolekar:
Music Genre Recognition Using Deep Neural Networks and Transfer Learning. 2087-2091 - Siddharth Sigtia, Rob Haynes, Hywel Richards, Erik Marchi, John Bridle:
Efficient Voice Trigger Detection for Low Resource Hardware. 2092-2096
Signal Analysis for the Natural, Biological and Social Sciences
- Qiguang Lin, Yiwen Shao:
A Novel Normalization Method for Autocorrelation Function for Pitch Detection and for Speech Activity Detection. 2097-2101 - T. V. Ananthapadmanabha, A. G. Ramakrishnan:
Estimation of the Vocal Tract Length of Vowel Sounds Based on the Frequency of the Significant Spectral Valley. 2102-2106 - Ivan Himawan, Michael Towsey, Bradley Law, Paul Roe:
Deep Learning Techniques for Koala Activity Detection. 2107-2111 - Jindrich Matousek, Daniel Tihelka:
Glottal Closure Instant Detection from Speech Signal Using Voting Classifier and Recursive Feature Elimination. 2112-2116 - Midia Yousefi, Navid Shokouhi, John H. L. Hansen:
Assessing Speaker Engagement in 2-Person Debates: Overlap Detection in United States Presidential Debates. 2117-2121 - Arjun Pankajakshan, Anshul Thakur, Daksh Thapar, Padmanabhan Rajan, Aditya Nigam:
All-Conv Net for Bird Activity Detection: Significance of Learned Pooling. 2122-2126 - Anshul Thakur, Vinayak Abrol, Pulkit Sharma, Padmanabhan Rajan:
Deep Convex Representations: Feature Representations for Bioacoustics Classification. 2127-2131 - Hirak Dasgupta, Prem C. Pandey, K. S. Nataraj:
Detection of Glottal Excitation Epochs in Speech Signal Using Hilbert Envelope. 2132-2136 - Hong Zhang:
Analyzing Thai Tone Distribution through Functional Data Analysis. 2137-2141 - Danny Merkx, Odette Scharenborg:
Articulatory Feature Classification Using Convolutional Neural Networks. 2142-2146 - Shoufeng Lin:
A New Frequency Coverage Metric and a New Subband Encoding Model, with an Application in Pitch Estimation. 2147-2151 - B. Ganga Gowri, Soman K. P, D. Govind:
Improved Epoch Extraction from Telephonic Speech Using Chebfun and Zero Frequency Filtering. 2152-2156
Speech Prosody
- Arne Köhn, Timo Baumann, Oskar Dörfler:
An Empirical Analysis of the Correlation of Syntax and Prosody. 2157-2161 - Timo Baumann, Hussein Hussein, Burkhard Meyer-Sickendiek:
Analysing the Focus of a Hierarchical Attention Network: the Importance of Enjambments When Classifying Post-modern Poetry. 2162-2166 - Daniil Kocharov, Alla Menshikova:
Language-Dependent Melody Embeddings. 2167-2170 - Yuan Jia, Xiaoxiao Ma:
Stress Distribution of Given Information in Chinese Reading Texts. 2171-2175 - Vera Cabarrão, Fernando Batista, Helena Moniz, Isabel Trancoso, Ana Isabel Mata:
Acoustic-prosodic Entrainment in Structural Metadata Events. 2176-2180 - Marija Tabain, Richard Beare, Andrew Butcher:
Formant Measures of Vowels Adjacent to Alveolar and Retroflex Consonants in Arrernte: Stressed and Unstressed Position. 2181-2185 - Quy-Thao Truong, Tsuneo Kato, Seiichi Yamamoto:
Automatic Assessment of L2 English Word Prosody Using Weighted Distances of F0 and Intensity Contours. 2186-2190 - Olga Maxwell, Elinor Payne, Rosey Billington:
Homogeneity vs Heterogeneity in Indian English: Investigating Influences of L1 on f0 Range. 2191-2195 - Yixin Zhang, Tianzhu Geng, Jinsong Zhang:
Emotional Prosody Perception in Mandarin-speaking Congenital Amusics. 2196-2200 - Takaaki Shochi, Jean-Luc Rouas, Marine Guerry, Donna Erickson:
Cultural Differences in Pattern Matching: Multisensory Recognition of Socio-affective Prosody. 2201-2205
Perspective Talk-3
- Nima Mesgarani:
Speech Processing in the Human Brain Meets Deep Learning. 2206
Recurrent Neural Models for ASR
- Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai:
ESPnet: End-to-End Speech Processing Toolkit. 2207-2211 - Zhehuai Chen, Justin Luitjens, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
A GPU-based WFST Decoder with Exact Lattice Generation. 2212-2216 - Anton Ragni, Mark J. F. Gales:
Automatic Speech Recognition System Development in the "Wild". 2217-2221 - Leonid Velikovich, Ian Williams, Justin Scheiner, Petar S. Aleksic, Pedro J. Moreno, Michael Riley:
Semantic Lattice Processing in Contextual Automatic Speech Recognition for Google Assistant. 2222-2226 - Ian Williams, Anjuli Kannan, Petar S. Aleksic, David Rybach, Tara N. Sainath:
Contextual Speech Recognition in End-to-end Neural Network Systems Using Beam Search. 2227-2231 - Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara:
Forward-Backward Attention Decoder. 2232-2236
Speaker Verification Using Neural Network Methods I
- Sarthak Yadav, Atul Rai:
Learning Discriminative Features for Speaker Identification and Verification. 2237-2241 - Sergey Novoselov, Vadim Shchemelinin, Andrey Shulipa, Alexander Kozlov, Ivan Kremnev:
Triplet Loss Based Cosine Similarity Metric Learning for Text-independent Speaker Recognition. 2242-2246 - Yi Liu, Liang He, Jia Liu, Michael T. Johnson:
Speaker Embedding Extraction with Phonetic Information. 2247-2251 - Koji Okabe, Takafumi Koshinaka, Koichi Shinoda:
Attentive Statistics Pooling for Deep Speaker Embedding. 2252-2256 - Nam Le, Jean-Marc Odobez:
Robust and Discriminative Speaker Embedding via Intra-Class Distance Variance Regularization. 2257-2261 - Na Li, Deyi Tuo, Dan Su, Zhifeng Li, Dong Yu:
Deep Discriminative Embeddings for Duration Robust Speaker Verification. 2262-2266
Speech Perception in Adverse Conditions
- Olympia Simantiraki, Martin Cooke, Simon King:
Impact of Different Speech Types on Listening Effort. 2267-2271 - Moïra-Phoebé Huet, Christophe Micheyl, Etienne Gaudrain, Etienne Parizet:
Who Are You Listening to? Towards a Dynamic Measure of Auditory Attention to Speech-on-speech. 2272-2275 - Jeesun Kim, Sonya Karisma, Vincent Aubanel, Chris Davis:
Investigating the Role of Familiar Face and Voice Cues in Speech Processing in Noise. 2276-2279 - Odette Scharenborg, Martha A. Larson:
The Conversation Continues: the Effect of Lyrics and Music Complexity of Background Music on Spoken-Word Recognition. 2280-2284 - Julien Meyer, Fanny Meunier, Laure Dentel, Noelia Do Carmo Blanco, Frédéric Sèbe:
Loud and Shouted Speech Perception at Variable Distances in a Forest. 2285-2289 - Noelia Do Carmo Blanco, Julien Meyer, Michel Hoen, Fanny Meunier:
Phoneme Resistance and Phoneme Confusion in Noise: Impact of Dyslexia. 2290-2294
Measuring Pitch and Articulation
- Albert Haque, Michelle Guo, Prateek Verma:
Conditional End-to-End Audio Transforms. 2295-2299 - Gunnam Aneeja, Sudarsana Reddy Kadiri, Bayya Yegnanarayana:
Detection of Glottal Closure Instants in Degraded Speech Using Single Frequency Filtering Analysis. 2300-2304 - Loren Lugosch, Vikrant Singh Tomar:
Tone Recognition Using Lifters and CTC. 2305-2309 - Vikram C. M., S. R. Mahadeva Prasanna:
Epoch Extraction from Pathological Children Speech Using Single Pole Filtering Approach. 2310-2314 - Balamurali B. T., Jer-Ming Chen:
Automated Classification of Vowel-Gesture Parameters Using External Broadband Excitation. 2315-2318 - Sudarsana Reddy Kadiri, Bayya Yegnanarayana:
Estimation of Fundamental Frequency from Singing Voice Using Harmonics of Impulse-like Excitation Source. 2319-2323
Speech and Language Analytics for Mental Health
- Jochen Weiner, Miguel Angrick, Srinivasan Umesh, Tanja Schultz:
Investigating the Effect of Audio Duration on Dementia Detection Using Acoustic Features. 2324-2328 - Yun-Shao Lin, Susan Shur-Fen Gau, Chi-Chun Lee:
An Interlocutor-Modulated Attentional LSTM for Differentiating between Subgroups of Autism Spectrum Disorder. 2329-2333 - Shahin Amiriparian, Alice Baird, Sahib Julka, Alyssa Alcorn, Sandra Ottl, Suncica Petrovic, Eloise Ainger, Nicholas Cummins, Björn W. Schuller:
Recognition of Echolalic Autistic Child Vocalisations Utilising Convolutional Recurrent Neural Networks. 2334-2338 - Sandeep Nallan Chakravarthula, Brian R. Baucom, Panayiotis G. Georgiou:
Modeling Interpersonal Influence of Verbal Behavior in Couples Therapy Dyadic Interactions. 2339-2343 - Anil Ramakrishna, Timothy Greer, David C. Atkins, Shrikanth S. Narayanan:
Computational Modeling of Conversational Humor in Psychotherapy. 2344-2348 - Nicanor García, Juan Camilo Vásquez-Correa, Juan Rafael Orozco-Arroyave, Elmar Nöth:
Multimodal I-vectors to Detect and Evaluate Parkinson's Disease. 2349-2353
Spoken CALL Shared Task, Second Edition
- Claudia Baur, Andrew Caines, Cathy Chua, Johanna Gerlach, Mengjie Qian, Manny Rayner, Martin J. Russell, Helmer Strik, Xizi Wei:
Overview of the 2018 Spoken CALL Shared Task. 2354-2358 - Dominik Jülg, Mario Kunstek, Cem Philipp Freimoser, Kay Berkling, Mengjie Qian:
The CSU-K Rule-Based System for the 2nd Edition Spoken CALL Shared Task. 2359-2363 - Huy Nguyen, Lei Chen, Ramon Prieto, Chuan Wang, Yang Liu:
Liulishuo's System for the Spoken CALL Shared Task 2018. 2364-2368 - Mohammad A. Ateeq, Abualsoud Hanani, Aziz Qaroush:
An Optimization Based Approach for Solving Spoken CALL Shared Task. 2369-2373 - Mengjie Qian, Xizi Wei, Peter Jancovic, Martin J. Russell:
The University of Birmingham 2018 Spoken CALL Shared Task Systems. 2374-2378 - Keelan Evanini, Matthew Mulholland, Rutuja Ubale, Yao Qian, Robert A. Pugh, Vikram Ramanarayanan, Aoife Cahill:
Improvements to an Automated Content Scoring System for Spoken CALL Responses: the ETS Submission to the Second Spoken CALL Shared Task. 2379-2383
Show and Tell 6
- Nagendra Kumar Goel, Mousmita Sarma, Tejendra Kushwah, Dharmesh Agarwal, Zikra Iqbal, Surbhi Chauhan:
Extracting Speaker's Gender, Accent, Age and Emotional State from Speech. 2384-2385 - B. H. V. S. Narayanamurthy, J. V. Satyanarayana, Bayya Yegnanarayana:
Determining Speaker Location from Speech in a Practical Environment. 2386-2387 - Tanvina Patel, Krishna D. N, Noor Fathima, Nisar Shah, Mahima C, Deepak Kumar, Anuroop Iyengar:
An Automatic Speech Transcription System for Manipuri Language. 2388-2389 - Chiranjeevi Yarra, Anand P. A, N. K. Kausthubha, Prasanta Kumar Ghosh:
SPIRE-SST: An Automatic Web-based Self-learning Tool for Syllable Stress Tutoring (SST) to the Second Language Learners. 2390-2391 - Kishalay Chakraborty, Senjam Shantirani Devi, Sanjeevan Devnath, S. R. Mahadeva Prasanna, Priyankoo Sarmah:
Glotto Vibrato Graph: A Device and Method for Recording, Analysis and Visualization of Glottal Activity. 2392-2393
Adjusting to Speaker, Accent, and Domain
- Adithya Renduchintala, Shuoyang Ding, Matthew Wiesner, Shinji Watanabe:
Multi-Modal Data Augmentation for End-to-end ASR. 2394-2398 - Takafumi Moriya, Sei Ueno, Yusuke Shinohara, Marc Delcroix, Yoshikazu Yamaguchi, Yushi Aono:
Multi-task Learning with Augmentation Strategy for Acoustic-to-word Attention-based Encoder-decoder Speech Recognition. 2399-2403 - Sining Sun, Ching-Feng Yeh, Mari Ostendorf, Mei-Yuh Hwang, Lei Xie:
Training Augmentation with Adversarial Examples for Robust Speech Recognition. 2404-2408 - Takashi Fukuda, Raul Fernandez, Andrew Rosenberg, Samuel Thomas, Bhuvana Ramabhadran, Alexander Sorin, Gakuto Kurata:
Data Augmentation Improves Recognition of Foreign Accented Speech. 2409-2413 - Natalia A. Tomashenko, Yuri Y. Khokhlov, Yannick Estève:
Speaker Adaptive Training and Mixup Regularization for Neural Network Acoustic Models in Automatic Speech Recognition. 2414-2418 - Markus Müller, Sebastian Stüker, Alex Waibel:
Neural Language Codes for Multilingual Acoustic Models. 2419-2423 - Sei Ueno, Takafumi Moriya, Masato Mimura, Shinsuke Sakai, Yusuke Shinohara, Yoshikazu Yamaguchi, Yushi Aono, Tatsuya Kawahara:
Encoder Transfer for Attention-based Acoustic-to-word Speech Recognition. 2424-2428 - Ke Wang, Junbo Zhang, Yujun Wang, Lei Xie:
Empirical Evaluation of Speaker Adaptation on DNN Based Acoustic Model. 2429-2433 - Amit Das, Mark Hasegawa-Johnson:
Improving DNNs Trained with Non-Native Transcriptions Using Knowledge Distillation and Target Interpolation. 2434-2438 - Siyuan Feng, Tan Lee:
Improving Cross-Lingual Knowledge Transferability Using Multilingual TDNN-BLSTM with Language-Dependent Pre-Final Layer. 2439-2443 - Marc Delcroix, Shinji Watanabe, Atsunori Ogawa, Shigeki Karita, Tomohiro Nakatani:
Auxiliary Feature Based Adaptation of End-to-end ASR Systems. 2444-2448 - Shahram Ghorbani, John H. L. Hansen:
Leveraging Native Language Information for Improved Accented Speech Recognition. 2449-2453 - Abhinav Jain, Minali Upreti, Preethi Jyothi:
Improved Accented Speech Recognition Using Accent Embeddings and Multi-task Learning. 2454-2458 - Sibo Tong, Philip N. Garner, Hervé Bourlard:
Fast Language Adaptation Using Phonological Information. 2459-2463
Speech Synthesis Paradigms and Methods
- Hiroki Murakami, Sunao Hara, Masanobu Abe, Masaaki Sato, Shogo Minagi:
Naturalness Improvement Algorithm for Reconstructed Glossectomy Patient's Speech Using Spectral Differential Modification in Voice Conversion. 2464-2468 - Satoshi Tamura, Kento Horio, Hajime Endo, Satoru Hayamizu, Tomoki Toda:
Audio-visual Voice Conversion Using Deep Canonical Correlation Analysis for Deep Bottleneck Features. 2469-2473 - Pallavi Baljekar, Sai Krishna Rallabandi, Alan W. Black:
An Investigation of Convolution Attention Based Models for Multilingual Speech Synthesis of Indian Languages. 2474-2478 - Danny Websdale, Sarah Taylor, Ben Milner:
The Effect of Real-Time Constraints on Automatic Speech Animation. 2479-2483 - David Greenwood, Iain A. Matthews, Stephen D. Laycock:
Joint Learning of Facial Expression and Head Pose from Speech. 2484-2488 - Kévin Vythelingum, Yannick Estève, Olivier Rosec:
Acoustic-dependent Phonemic Transcription for Text-to-speech Synthesis. 2489-2493 - Hieu-Thi Luong, Junichi Yamagishi:
Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation. 2494-2498 - Fumiaki Taguchi, Tokihiko Kaburagi:
Articulatory-to-speech Conversion Using Bi-directional Long Short-term Memory. 2499-2503 - Keisuke Tanihara, Shogo Yonekura, Yasuo Kuniyoshi:
Implementation of Respiration in Articulatory Synthesis Using a Pressure-Volume Lung Model. 2504-2508 - Xiao Zhou, Zhen-Hua Ling, Zhi-Ping Zhou, Li-Rong Dai:
Learning and Modeling Unit Embeddings for Improving HMM-based Unit Selection Speech Synthesis. 2509-2513 - Ruibo Fu, Jianhua Tao, Yibin Zheng, Zhengqi Wen:
Deep Metric Learning for the Target Cost in Unit-Selection Speech Synthesizer. 2514-2518 - Kentaro Sone, Toru Nakashika:
DNN-based Speech Synthesis for Small Data Sets Considering Bidirectional Speech-Text Conversion. 2519-2523 - Branislav Gerazov, Gérard Bailly, Yi Xu:
A Weighted Superposition of Functional Contours Model for Modelling Contextual Prominence of Elementary Prosodic Contours. 2524-2528 - Toru Nakashika:
LSTBM: A Novel Sequence Representation of Speech Spectra Using Restricted Boltzmann Machine with Long Short-Term Memory. 2529-2533
Second Language Acquisition and Code-switching
- Barbara E. Bullock, Gualberto A. Guzmán, Jacqueline Serigos, Almeida Jacqueline Toribio:
Should Code-switching Models Be Asymmetric? 2534-2538 - Kimiko Tsukada, Yu Rong:
Cross-language Perception of Mandarin Lexical Tones by Mongolian-speaking Bilinguals in the Inner Mongolia Autonomous Region, China. 2539-2543 - Lionel Fontan, Maxime Le Coz, Sylvain Detey:
Automatically Measuring L2 Speech Fluency without the Need of ASR: A Proof-of-concept Study with Japanese Learners of French. 2544-2548 - Yue Sun, Win Thuzar Kyaw, Jinsong Zhang, Yoshinori Sagisaka:
Analysis of L2 Learners' Progress of Distinguishing Mandarin Tone 2 and Tone 3. 2549-2553 - Xu Li, Shaoguang Mao, Xixin Wu, Kun Li, Xunying Liu, Helen Meng:
Unsupervised Discovery of Non-native Phonetic Patterns in L2 English Speech for Mispronunciation Detection and Diagnosis. 2554-2558 - Lei Wang, Jie Cui, Ying Chen:
Wuxi Speakers' Production and Perception of Coda Nasals in Mandarin. 2559-2562 - Natalia Dyrenko, Robert Fuchs:
The Diphthongs of Formal Nigerian English: A Preliminary Acoustic Analysis. 2563-2567 - Chris Davis, Jeesun Kim:
Characterizing Rhythm Differences between Strong and Weak Accented L2 Speech. 2568-2572 - Eva Fringi, Martin J. Russell:
Analysis of Phone Errors Attributable to Phonological Effects Associated With Language Acquisition Through Bottleneck Feature Visualisations. 2573-2577 - Jacques C. Koreman:
Category Similarity in Multilingual Pronunciation Training. 2578-2582 - Alejandrina Cristià, Shobhana Ganesh, Marisa Casillas, Sriram Ganapathy:
Talker Diarization in the Wild: the Case of Child-centered Daylong Audio-recordings. 2583-2587 - Zixing Zhang, Alejandrina Cristià, Anne S. Warlaumont, Björn W. Schuller:
Automated Classification of Children's Linguistic versus Non-Linguistic Vocalisations. 2588-2592 - Jiahong Yuan, Qiusi Dong, Fei Wu, Huan Luan, Xiaofei Yang, Hui Lin, Yang Liu:
Pitch Characteristics of L2 English Speech by Chinese Speakers: A Large-scale Study. 2593-2597
Topics in Speech Recognition
- Saurabh Garg, Tanmay Parekh, Preethi Jyothi:
Dual Language Models for Code Switched Speech Recognition. 2598-2602 - Astik Biswas, Febe de Wet, Ewald van der Westhuizen, Emre Yilmaz, Thomas Niesler:
Multilingual Neural Network Acoustic Modelling for ASR of Under-Resourced English-isiZulu Code-Switched Speech. 2603-2607 - Raghav Menon, Herman Kamper, John A. Quinn, Thomas Niesler:
Fast ASR-free and Almost Zero-resource Keyword Spotting Using DTW and CNNs for Humanitarian Monitoring. 2608-2612 - Meng Yu, Xuan Ji, Yi Gao, Lianwu Chen, Jie Chen, Jimeng Zheng, Dan Su, Dong Yu:
Text-Dependent Speech Enhancement for Small-Footprint Robust Keyword Detection. 2613-2617 - Di He, Boon Pang Lim, Xuesong Yang, Mark Hasegawa-Johnson, Deming Chen:
Improved ASR for Under-resourced Languages through Multi-task Learning with Acoustic Landmarks. 2618-2622 - Nick K. Chibuye, Todd Rosenstock, Brian DeRenzi:
Cross-language Phoneme Mapping for Low-resource Languages: An Exploration of Benefits and Trade-offs. 2623-2627 - Máté Ákos Tündik, György Szaszák, Gábor Gosztolya, András Beke:
User-centric Evaluation of Automatic Punctuation in ASR Closed Captioning. 2628-2632 - Piotr Zelasko, Piotr Szymanski, Jan Mizgajski, Adrian Szymczak, Yishay Carmiel, Najim Dehak:
Punctuation Prediction Model for Conversational Speech. 2633-2637 - Martin Karafiát, Murali Karthick Baskar, Igor Szöke, Vladimír Malenovský, Karel Veselý, Frantisek Grézl, Lukás Burget, Jan Cernocký:
BUT OpenSAT 2017 Speech Recognition System. 2638-2642 - Li Liu, Thomas Hueber, Gang Feng, Denis Beautemps:
Visual Recognition of Continuous Cued Speech Using a Tandem CNN-HMM Approach. 2643-2647 - Kwanchiva Thangthai, Richard W. Harvey:
Building Large-vocabulary Speaker-independent Lipreading Systems. 2648-2652 - Vishwa Gupta, Gilles Boulianne:
CRIM's System for the MGB-3 English Multi-Genre Broadcast Media Transcription. 2653-2657 - Rachid Riad, Corentin Dancette, Julien Karadayi, Neil Zeghidour, Thomas Schatz, Emmanuel Dupoux:
Sampling Strategies in Siamese Networks for Unsupervised Speech Representation Learning. 2658-2662 - Mengzhe Chen, Shiliang Zhang, Ming Lei, Yong Liu, Haitao Yao, Jie Gao:
Compact Feedforward Sequential Memory Networks for Small-footprint Keyword Spotting. 2663-2667
Zero-resource Speech Recognition
- Enno Hermann, Sharon Goldwater:
Multilingual Bottleneck Features for Subword Modeling in Zero-resource Languages. 2668-2672 - Siyuan Feng, Tan Lee:
Exploiting Speaker and Phonetic Diversity of Mismatched Language Resources for Unsupervised Subword Modeling. 2673-2677 - Pierre Godard, Marcely Zanon Boito, Lucas Ondel, Alexandre Berard, François Yvon, Aline Villavicencio, Laurent Besacier:
Unsupervised Word Segmentation from Speech with Attention. 2678-2682 - Nils Holzenberger, Mingxing Du, Julien Karadayi, Rachid Riad, Emmanuel Dupoux:
Learning Word Embeddings: Unsupervised Methods for Fixed-size Representations of Variable-length Speech Segments. 2683-2687 - Thomas Glarner, Patrick Hanebrink, Janek Ebbers, Reinhold Haeb-Umbach:
Full Bayesian Hidden Markov Model Variational Autoencoder for Acoustic Unit Discovery. 2688-2692 - Benjamin Milde, Chris Biemann:
Unspeech: Unsupervised Speech Context Embeddings. 2693-2697
Spatial and Phase Cues for Source Separation and Speech Recognition
- Yuan Gong, Christian Poellabauer:
Impact of Aliasing on Deep CNN-Based End-to-End Acoustic Models. 2698-2702 - Sunit Sivasankaran, Emmanuel Vincent, Dominique Fohr:
Keyword Based Speaker Localization: Localizing a Target Speaker in a Multi-speaker Environment. 2703-2707 - Zhong-Qiu Wang, Jonathan Le Roux, DeLiang Wang, John R. Hershey:
End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction. 2708-2712 - Naoya Takahashi, Purvi Agrawal, Nabarun Goswami, Yuki Mitsufuji:
PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation. 2713-2717 - Zhong-Qiu Wang, DeLiang Wang:
Integrating Spectral and Spatial Features for Multi-Channel Speaker Separation. 2718-2722 - Mandar Gogate, Ahsan Adeel, Ricard Marxer, Jon Barker, Amir Hussain:
DNN Driven Speaker Independent Audio-Visual Mask Estimation for Speech Separation. 2723-2727
Dialectal Variation
- Ioana Vasilescu, Nidia Hernández, Bianca Vieru, Lori Lamel:
Exploring Temporal Reduction in Dialectal Spanish: A Large-scale Study of Lenition of Voiced Stops and Coda-s. 2728-2732 - Phil Rose:
Dialect-geographical Acoustic-Tonetics: Five Disyllabic Tone Sandhi Patterns in Cognate Words from the Wu Dialects of Zhèjiāng Province. 2733-2737 - Adrian Leemann, Stephan Schmid, Dieter Studer-Joho, Marie-José Kolly:
Regional Variation of /r/ in Swiss German Dialects. 2738-2742 - Kate Earnshaw, Erica Gold:
Variation in the FACE Vowel across West Yorkshire: Implications for Forensic Speaker Comparisons. 2743-2747 - Erica Gold, Sula Ross, Kate Earnshaw:
The 'West Yorkshire Regional English Database': Investigations into the Generalizability of Reference Populations for Forensic Speaker Comparison Casework. 2748-2752 - Jane Wottawa, Djegdjiga Amazouz, Martine Adda-Decker, Lori Lamel:
Studying Vowel Variation in French-Algerian Arabic Code-switched Speech. 2753-2757
Spoken Corpora and Annotation
- John H. L. Hansen, Abhijeet Sangwan, Aditya Joglekar, Ahmet Emin Bulut, Lakshmish Kaushik, Chengzhu Yu:
Fearless Steps: Apollo-11 Corpus Advancements for Speech Technologies from Earth to the Moon. 2758-2762 - Manoj Kumar, Pooja Chebolu, So Hyun Kim, Kassandra Martinez, Catherine Lord, Shrikanth S. Narayanan:
A Knowledge Driven Structural Segmentation Approach for Play-Talk Classification During Autism Assessment. 2763-2767 - Jesin James, Li Tian, Catherine Inez Watson:
An Open Source Emotional Speech Corpus for Human Robot Interaction Applications. 2768-2772 - Itshak Lapidot, Héctor Delgado, Massimiliano Todisco, Nicholas W. D. Evans, Jean-François Bonastre:
Speech Database and Protocol Validation Using Waveform Entropy. 2773-2777 - Lucas D. Terissi, Gonzalo D. Sad, Mauricio Cerda, Slim Ouni, Rodrigo Galvez, Juan Carlos Gómez, Bernard Girau, Nancy Hitschfeld-Kahler:
A French-Spanish Multimodal Speech Communication Corpus Incorporating Acoustic Data, Facial, Hands and Arms Gestures Information. 2778-2782 - Guanlong Zhao, Sinem Sonsaat, Alif Silpachai, Ivana Lucic, Evgeny Chukharev-Hudilainen, John Levis, Ricardo Gutierrez-Osuna:
L2-ARCTIC: A Non-native English Speech Corpus. 2783-2787
The First DIHARD Speech Diarization Challenge
- Zbynek Zajíc, Marie Kunesová, Jan Zelinka, Marek Hrúz:
ZCU-NTIS Speaker Diarization System for the DIHARD 2018 Challenge. 2788-2792 - Lei Sun, Jun Du, Chao Jiang, Xueyang Zhang, Shan He, Bing Yin, Chin-Hui Lee:
Speaker Diarization with Enhancing Speech for the First DIHARD Challenge. 2793-2797 - Mireia Díez, Federico Landini, Lukás Burget, Johan Rohdin, Anna Silnova, Katerina Zmolíková, Ondrej Novotný, Karel Veselý, Ondrej Glembek, Oldrich Plchot, Ladislav Mosner, Pavel Matejka:
BUT System for DIHARD Speech Diarization Challenge 2018. 2798-2802 - Ignacio Viñals, Pablo Gimeno, Alfonso Ortega, Antonio Miguel, Eduardo Lleida:
Estimation of the Number of Speakers with Variational Bayesian PLDA in the DIHARD Diarization Challenge. 2803-2807 - Gregory Sell, David Snyder, Alan McCree, Daniel Garcia-Romero, Jesús Villalba, Matthew Maciejewski, Vimal Manohar, Najim Dehak, Daniel Povey, Shinji Watanabe, Sanjeev Khudanpur:
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge. 2808-2812 - Jose Patino, Héctor Delgado, Nicholas W. D. Evans:
The EURECOM Submission to the First DIHARD Challenge. 2813-2817 - Valter Akira Miasato Filho, Diego Augusto Silva, Luis Gustavo Depra Cuozzo:
Joint Discriminative Embedding Learning, Speech Activity and Overlap Detection for the DIHARD Speaker Diarization Challenge. 2818-2822
Text Analysis, Multilingual Issues and Evaluation in Speech Synthesis
- Jinfu Ni, Yoshinori Shiga, Hisashi Kawai:
Multilingual Grapheme-to-Phoneme Conversion with Global Character Vectors. 2823-2827 - Somnath Roy, Shakuntala Mahanta:
A Hybrid Approach to Grapheme to Phoneme Conversion in Assamese. 2828-2832 - Seyed Hamidreza Mohammadi, Taehwan Kim:
Investigation of Using Disentangled and Interpretable Representations for One-shot Cross-lingual Voice Conversion. 2833-2837 - Avashna Govender, Simon King:
Using Pupillometry to Measure the Cognitive Load of Synthetic Speech. 2838-2842 - Avashna Govender, Simon King:
Measuring the Cognitive Load of Synthetic Speech Using a Dual Task Paradigm. 2843-2847 - Iroro Orife:
Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text. 2848-2852 - Peixin Chen, Wu Guo, Zhi Chen, Jian Sun, Lanhua You:
Gated Convolutional Neural Network for Sentence Matching. 2853-2857 - Dravyansh Sharma:
On Training and Evaluation of Grapheme-to-Phoneme Mappings with Limited Data. 2858-2862 - Alice Baird, Emilia Parada-Cabaleiro, Simone Hantke, Felix Burkhardt, Nicholas Cummins, Björn W. Schuller:
The Perception and Analysis of the Likeability and Human Likeness of Synthesized Speech. 2863-2867 - Yosi Mass, Slava Shechtman, Moran Mordechay, Ron Hoory, Oren Sar Shalom, Guy Lev, David Konopnicki:
Word Emphasis Prediction for Expressive Text to Speech. 2868-2872 - Kai-Zhan Lee, Erica Cooper, Julia Hirschberg:
A Comparison of Speaker-based and Utterance-based Data Selection for Text-to-Speech Synthesis. 2873-2877 - Markus Toman, Geoffrey S. Meltzner, Rupal Patel:
Data Requirements, Selection and Augmentation for DNN-based Speech Synthesis from Crowdsourced Data. 2878-2882
Neural Network Training Strategies for ASR
- Karel Veselý, Carlos Segura, Igor Szöke, Jordi Luque, Jan Cernocký:
Lightly Supervised vs. Semi-supervised Training of Acoustic Model on Luxembourgish for Low-resource Automatic Speech Recognition. 2883-2887 - Wenjie Li, Gaofeng Cheng, Fengpei Ge, Pengyuan Zhang, Yonghong Yan:
Investigation on the Combination of Batch Normalization and Dropout in BLSTM-based Acoustic Modeling for ASR. 2888-2892 - Masayuki Suzuki, Tohru Nagano, Gakuto Kurata, Samuel Thomas:
Inference-Invariant Transformation of Batch Normalization for Domain Adaptation of Acoustic Models. 2893-2897 - Yanhua Long, Hong Ye, Yijie Li, Jiaen Liang:
Active Learning for LF-MMI Trained Neural Networks in ASR. 2898-2902 - Ivan Medennikov, Yuri Y. Khokhlov, Aleksei Romanenko, Dmitry Popov, Natalia A. Tomashenko, Ivan Sorokin, Alexander Zatvornitskiy:
An Investigation of Mixup Training Strategies for Acoustic Models in ASR. 2903-2907 - Purvi Agrawal, Sriram Ganapathy:
Comparison of Unsupervised Modulation Filter Learning Methods for ASR. 2908-2912 - Suyoun Kim, Michael L. Seltzer, Jinyu Li, Rui Zhao:
Improved Training for Online End-to-end Speech Recognition Systems. 2913-2917 - Adnan Haider, Philip C. Woodland:
Combining Natural Gradient with Hessian Free Methods for Sequence Training. 2918-2922 - Naoyuki Kanda, Yusuke Fujita, Kenji Nagamatsu:
Lattice-free State-level Minimum Bayes Risk Training of Acoustic Models. 2923-2927 - Hao Tang, Wei-Ning Hsu, François Grondin, James R. Glass:
A Study of Enhancement, Augmentation and Autoencoder Methods for Domain Adaptation in Distant Speech Recognition. 2928-2932 - Andreas Søeborg Kirkedal, Yeon-Jun Kim:
Multilingual Deep Neural Network Training Using Cyclical Learning Rate. 2933-2937
Application of ASR in Medical Practice
- Jianwei Yu, Xurong Xie, Shansong Liu, Shoukang Hu, Max W. Y. Lam, Xixin Wu, Ka Ho Wong, Xunying Liu, Helen Meng:
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus. 2938-2942 - Imed Laaridh, Corinne Fredouille, Alain Ghio, Muriel Lalain, Virginie Woisard:
Automatic Evaluation of Speech Intelligibility Based on I-vectors in the Context of Head and Neck Cancers. 2943-2947 - Myung Jong Kim, Beiming Cao, Kwanghoon An, Jun Wang:
Dysarthric Speech Recognition Using Convolutional LSTM Neural Network. 2948-2952 - Imed Laaridh, Julien Tardieu, Cynthia Magnen, Pascal Gaillard, Jérôme Farinas, Julien Pinquier:
Perceptual and Automatic Evaluations of the Intelligibility of Speech Degraded by Noise Induced Hearing Loss Simulation. 2953-2957 - Emre Yilmaz, Vikramjit Mitra, Chris Bartels, Horacio Franco:
Articulatory Features for ASR of Pathological Speech. 2958-2962 - M. Joana Correia, Bhiksha Raj, Isabel Trancoso, Francisco Teixeira:
Mining Multimodal Repositories for Speech Affecting Diseases. 2963-2967 - Zhen Qin, Tom Ko, Guangjian Tian:
Long Distance Voice Channel Diagnosis Using Deep Neural Networks. 2968-2971 - Chung-Cheng Chiu, Anshuman Tripathi, Katherine Chou, Chris Co, Navdeep Jaitly, Diana Jaunzeikare, Anjuli Kannan, Patrick Nguyen, Hasim Sak, Ananth Sankar, Justin Tansuwan, Nathan Wan, Yonghui Wu, Xuedong Zhang:
Speech Recognition for Medical Conversations. 2972-2976
Source and Supra-segmentals
- Chadi Farah, Stephane Roman, Mariapaola D'Imperio:
Prosodic Focus Acquisition in French Early Cochlear Implanted Children. 2977-2981 - Nassima Fezza:
The Role of Temporal Variation in Narrative Organization. 2982-2986 - Tiina Murtola, Jarmo Malinen:
Interaction Mechanisms between Glottal Source and Vocal Tract in Pitch Glides. 2987-2991 - Astha Singh, G. Nisha Meenakshi, Prasanta Kumar Ghosh:
Relating Articulatory Motions in Different Speaking Rates. 2992-2996 - João Cabral:
Estimation of the Asymmetry Parameter of the Glottal Flow Waveform Using the Electroglottographic Signal. 2997-3001 - Tanumay Mandal, K. Sreenivasa Rao, Sanjay Kumar Gupta:
Classification of Disorders in Vocal Folds Using Electroglottographic Signal. 3002-3006 - M. V. Achuth Rao, Rahul Krishnamurthy, Pebbili Gopikishore, Veeramani Priyadharshini, Prasanta Kumar Ghosh:
Automatic Glottis Localization and Segmentation in Stroboscopic Videos Using Deep Neural Network. 3007-3011 - Toshiko Isei-Jaakkola, Keiko Ochi, Keikichi Hirose:
Respiratory and Respiratory Muscular Control in JL1's and JL2's Text Reading Utilizing 4-RSTs and a Soft Respiratory Mask with a Two-Way Bulb. 3012-3016 - Lixia Hao, Wei Zhang, Yanlu Xie, Jinsong Zhang:
A Preliminary Study on Tonal Coarticulation in Continuous Speech. 3017-3021
Plenary Talk-3
- Helen Meng:
Speech and Language Processing for Learning and Wellbeing. 3022
Distant ASR
- Sriram Ganapathy, Madhumita Harish:
Far-Field Speech Recognition Using Multivariate Autoregressive Models. 3023-3027 - Chanwoo Kim, Ehsan Variani, Arun Narayanan, Michiel Bacchiani:
Efficient Implementation of the Room Simulator for Training Deep Neural Network Acoustic Models. 3028-3032 - Xiaofei Wang, Ruizhi Li, Hynek Hermansky:
Stream Attention for Distributed Multi-Microphone Speech Recognition. 3033-3037 - Takuya Yoshioka, Hakan Erdogan, Zhuo Chen, Xiong Xiao, Fil Alleva:
Recognizing Overlapped Speech in Meetings: A Multichannel Separation Approach Using Neural Networks. 3038-3042 - Lukas Drude, Christoph Böddeker, Jahn Heymann, Reinhold Haeb-Umbach, Keisuke Kinoshita, Marc Delcroix, Tomohiro Nakatani:
Integrating Neural Network Based Beamforming and Weighted Prediction Error Dereverberation. 3043-3047 - Suliang Bu, Yunxin Zhao, Mei-Yuh Hwang, Sining Sun:
A Probability Weighted Beamformer for Noise Robust ASR. 3048-3052
Expressive Speech Synthesis
- Masaki Yokoyama, Tomohiro Nagata, Hiroki Mori:
Effects of Dimensional Input on Paralinguistic Information Perceived from Synthesized Dialogue Speech with Neural Network. 3053-3056 - Shereen Oraby, Lena Reed, Sharath T. S., Shubhangi Tandon, Marilyn A. Walker:
Neural MultiVoice Models for Expressing Novel Personalities in Dialog. 3057-3061 - Igor Jauk, Jaime Lorenzo-Trueba, Junichi Yamagishi, Antonio Bonafonte:
Expressive Speech Synthesis Using Sentiment Embeddings. 3062-3066 - Kei Akuzawa, Yusuke Iwasawa, Yutaka Matsuo:
Expressive Speech Synthesis via Modeling Expressions with Variational Autoencoder. 3067-3071 - Xixin Wu, Yuewen Cao, Mu Wang, Songxiang Liu, Shiyin Kang, Zhiyong Wu, Xunying Liu, Dan Su, Dong Yu, Helen Meng:
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis. 3072-3076 - Hao Li, Yongguo Kang, Zhenyu Wang:
EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System. 3077-3081
Representation Learning for Emotion
- Jing Han, Zixing Zhang, Maximilian Schmitt, Zhao Ren, Fabien Ringeval, Björn W. Schuller:
Bags in Bag: Generating Context-Aware Bags for Tracking Emotions from Speech. 3082-3086 - Pengcheng Li, Yan Song, Ian McLoughlin, Wu Guo, Lirong Dai:
An Attention Pooling Based Representation Learning Method for Speech Emotion Recognition. 3087-3091 - Zixiaofan Yang, Julia Hirschberg:
Predicting Arousal and Valence from Waveforms and Spectrograms Using Deep Neural Networks. 3092-3096 - Mousmita Sarma, Pegah Ghahremani, Daniel Povey, Nagendra Kumar Goel, Kandarpa Kumar Sarma, Najim Dehak:
Emotion Identification from Raw Speech Signals Using DNNs. 3097-3101 - Jeng-Lin Li, Chi-Chun Lee:
Encoding Individual Acoustic Features Using Dyad-Augmented Deep Variational Representations for Dialog-level Emotion Recognition. 3102-3106 - Siddique Latif, Rajib Rana, Junaid Qadir, Julien Epps:
Variational Autoencoders for Learning Latent Representations of Speech Emotion: A Preliminary Study. 3107-3111
Articulatory Information, Modeling and Inversion
- Théo Biasutto-Lervat, Slim Ouni:
Phoneme-to-Articulatory Mapping Using Bidirectional Gated RNN. 3112-3116 - Zhihua Su, Jianguo Wei, Qiang Fang, Jianrong Wang, Kiyoshi Honda:
Tongue Segmentation with Geometrically Constrained Snake Model. 3117-3121 - Aravind Illa, Prasanta Kumar Ghosh:
Low Resource Acoustic-to-articulatory Inversion Using Bi-directional Long Short Term Memory. 3122-3126 - Chandana Srinivasan, Chiranjeevi Yarra, Ritu Aggarwal, Sanjeev Kumar Mittal, N. K. Kausthubha, Raseena K. T, Astha Singh, Prasanta Kumar Ghosh:
Automatic Visual Augmentation for Concatenation Based Synthesized Articulatory Videos from Real-time MRI Data for Spoken Language Training. 3127-3131 - C. A. Valliappan, Renuka Mannem, Prasanta Kumar Ghosh:
Air-Tissue Boundary Segmentation in Real-Time Magnetic Resonance Imaging Video Using Semantic Segmentation with Fully Convolutional Networks. 3132-3136 - Nadee Seneviratne, Ganesh Sivaraman, Vikramjit Mitra, Carol Y. Espy-Wilson:
Noise Robust Acoustic to Articulatory Speech Inversion. 3137-3141
Novel Paradigms for Direct Synthesis Based on Speech-Related Biosignals
- Farzaneh Ahmadi, Tomoki Toda:
Designing a Pneumatic Bionic Voice Prosthesis - A Statistical Approach for Source Excitation Generation. 3142-3146 - Bastian Schnell, Philip N. Garner:
A Neural Model to Predict Parameters for a Generalized Command Response Model of Intonation. 3147-3151 - Beiming Cao, Myung Jong Kim, Jun R. Wang, Jan P. H. van Santen, Ted Mau, Jun Wang:
Articulation-to-Speech Synthesis Using Articulatory Flesh Point Sensors' Orientation Information. 3152-3156 - Neil Shah, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of Generative Adversarial Network for Non-Audible Murmur-to-Whisper Speech Conversion. 3157-3161 - Lorenz Diener, Tanja Schultz:
Investigating Objective Intelligibility in Real-Time EMG-to-Speech Conversion. 3162-3166 - Michael Wand, Tanja Schultz, Jürgen Schmidhuber:
Domain-Adversarial Training for Session Independent EMG-based Speech Recognition. 3167-3171 - László Tóth, Gábor Gosztolya, Tamás Grósz, Alexandra Markó, Tamás Gábor Csapó:
Multi-Task Learning of Speech Recognition and Speech Synthesis Parameters for Ultrasound-based Silent Speech Interfaces. 3172-3176
Low Resource Speech Recognition Challenge for Indian Languages
- Jeena J. Prakash, Rajan Golda Brunet, Hema A. Murthy:
Transcription Correction for Indian Languages Using Acoustic Signatures. 3177-3181 - Bhargav Pulugundla, Murali Karthick Baskar, Santosh Kesiraju, Ekaterina Egorova, Martin Karafiát, Lukás Burget, Jan Cernocký:
BUT System for Low Resource Indian Language ASR. 3182-3186 - Hardik B. Sailor, Maddala Venkata Siva Krishna, Diksha Chhabra, Ankur T. Patil, Madhu R. Kamble, Hemant A. Patil:
DA-IICT/IIITV System for Low Resource Speech Recognition Challenge 2018. 3187-3191 - Hari Krishna Vydana, Krishna Gurugubelli, Vishnu Vidyadhara Raju Vegesna, Anil Kumar Vuppala:
An Exploration towards Joint Acoustic Modeling for Indian Languages: IIIT-H Submission for Low Resource Speech Recognition Challenge for Indian Languages, INTERSPEECH 2018. 3192-3196 - Noor Fathima, Tanvina Patel, Mahima C, Anuroop Iyengar:
TDNN-based Multilingual Speech Recognition System for Low Resource Indian Languages. 3197-3201 - Vishwas M. Shetty, Rini A. Sharon, Basil Abraham, Tejaswi Seeram, Anusha Prakash, Nithya Ravi, Srinivasan Umesh:
Articulatory and Stacked Bottleneck Features for Low Resource Speech Recognition. 3202-3206 - Jayadev Billa:
ISI ASR System for the Low Resource Speech Recognition Challenge for Indian Languages. 3207-3211
Show and Tell 7
- Gregory P. Finley, Erik Edwards, Amanda Robinson, Najmeh Sadoughi, James Fone, Mark Miller, David Suendermann-Oeft, Michael Brenndoerfer, Nico Axtmann:
An Automated Assistant for Medical Scribes. 3212-3213 - Abhishek Dey, Abhash Deka, Siddika Imani, Barsha Deka, Rohit Sinha, S. R. Mahadeva Prasanna, Priyankoo Sarmah, K. Samudravijaya, S. R. Nirmala:
AGROASSAM: A Web Based Assamese Speech Recognition Application for Retrieving Agricultural Commodity Price and Weather Information. 3214-3215 - Dan Aharon:
Voice-powered Solutions with Cloud AI. 3216 - Ganesh Sivaraman, Parav Nagarsheth, Elie Khoury:
Speech Synthesis in the Wild. 3217-3218
Deep Enhancement
- Shuai Nie, Shan Liang, Bin Liu, Yaping Zhang, Wenju Liu, Jianhua Tao:
Deep Noise Tracking Network: A Hybrid Signal Processing/Deep Learning Approach to Speech Enhancement. 3219-3223 - Zhiheng Ouyang, Hongjiang Yu, Wei-Ping Zhu, Benoît Champagne:
A Deep Neural Network Based Harmonic Noise Model for Speech Enhancement. 3224-3228 - Ke Tan, DeLiang Wang:
A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement. 3229-3233 - Zhong-Qiu Wang, DeLiang Wang:
All-Neural Multi-Channel Speech Enhancement. 3234-3238 - Hao Zhang, DeLiang Wang:
Deep Learning for Acoustic Echo Cancellation in Noisy and Double-Talk Scenarios. 3239-3243 - Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman:
The Conversation: Deep Audio-Visual Speech Enhancement. 3244-3248 - Aswin Shanmugam Subramanian, Szu-Jui Chen, Shinji Watanabe:
Student-Teacher Learning for BLSTM Mask-based Speech Enhancement. 3249-3253 - Pavan Karjol, Prasanta Kumar Ghosh:
Speech Enhancement Using Deep Mixture of Experts Based on Hard Expectation Maximization. 3254-3258 - Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Fred Juang:
Adversarial Feature-Mapping for Speech Enhancement. 3259-3263 - Deepak Baby, Sarah Verhulst:
Biophysically-inspired Features Improve the Generalizability of Neural Network-based Speech Enhancement Systems. 3264-3268 - Li Chai, Jun Du, Chin-Hui Lee:
Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement. 3269-3273 - Yangyang Xia, Richard M. Stern:
A Priori SNR Estimation Based on a Recurrent Neural Network for Robust Speech Enhancement. 3274-3278
Acoustic Scenes and Rare Events
- Shao-Yen Tseng, Juncheng Li, Yun Wang, Florian Metze, Joseph Szurley, Samarjit Das:
Multiple Instance Deep Learning for Weakly Supervised Small-Footprint Audio Event Detection. 3279-3283 - Liwen Zhang, Jiqing Han, Shiwen Deng:
Unsupervised Temporal Feature Learning Based on Sparse Coding Embedded BoAW for Acoustic Event Recognition. 3284-3288 - Teng Zhang, Kailai Zhang, Ji Wu:
Data Independent Sequence Augmentation Method for Acoustic Scene Classification. 3289-3293 - Hongwei Song, Jiqing Han, Shiwen Deng:
A Compact and Discriminative Feature Based on Auditory Summary Statistics for Acoustic Scene Classification. 3294-3298 - Pulkit Sharma, Vinayak Abrol, Anshul Thakur:
ASe: Acoustic Scene Embedding Using Deep Archetypal Analysis and GMM. 3299-3303 - Hangting Chen, Pengyuan Zhang, Haichuan Bai, Qingsheng Yuan, Xiuguo Bao, Yonghong Yan:
Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling. 3304-3308 - Pankaj Joshi, Digvijaysingh Gautam, Ganesh Ramakrishnan, Preethi Jyothi:
Time Aggregation Operators for Multi-label Audio Event Detection. 3309-3313 - Ian McLoughlin, Yan Song, Lam Dang Pham, Ramaswamy Palaniappan, Huy Phan, Yue Lang:
Early Detection of Continuous and Partial Audio Events Using CNN. 3314-3318 - Manjunath Mulimani, Shashidhar G. Koolagudi:
Robust Acoustic Event Classification Using Bag-of-Visual-Words. 3319-3322 - Shefali Waldekar, Goutam Saha:
Wavelet Transform Based Mel-scaled Features for Acoustic Scene Classification. 3323-3327 - Teng Zhang, Kailai Zhang, Ji Wu:
Multi-modal Attention Mechanisms in LSTM and Its Application to Acoustic Scene Classification. 3328-3332
Language Modeling
- Anirudh Raju, Behnam Hedayatnia, Linda Liu, Ankur Gandhe, Chandra Khatri, Angeliki Metallinou, Anu Venkatesh, Ariya Rastrow:
Contextual Language Model Adaptation for Conversational Agents. 3333-3337 - Oscar Chen, Anton Ragni, Mark J. F. Gales, Xie Chen:
Active Memory Networks for Language Modeling. 3338-3342 - Yerbolat Khassanov, Eng Siong Chng:
Unsupervised and Efficient Vocabulary Expansion for Recurrent Neural Network Language Models in ASR. 3343-3347 - Yike Zhang, Pengyuan Zhang, Yonghong Yan:
Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition. 3348-3352 - Yue Deng, Yilin Shen, KaWai Chen, Hongxia Jin:
Training Recurrent Neural Network through Moment Matching for NLP Applications. 3353-3357 - Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Investigation on LSTM Recurrent N-gram Language Models for Speech Recognition. 3358-3362 - Chih Chi Hu, Bing Liu, John Shen, Ian R. Lane:
Online Incremental Learning for Speaker-Adaptive Language Models. 3363-3367 - Jesús Andrés-Ferrer, Nathan Bodenstab, Paul Vozila:
Efficient Language Model Adaptation with Noise Contrastive Estimation and Kullback-Leibler Regularization. 3368-3372 - Ke Li, Hainan Xu, Yiming Wang, Daniel Povey, Sanjeev Khudanpur:
Recurrent Neural Network Language Model Adaptation for Conversational Speech Recognition. 3373-3377 - Michael Levit, Sarangarajan Parthasarathy, Shuangyu Chang:
What to Expect from Expected Kneser-Ney Smoothing. 3378-3382 - Karel Benes, Santosh Kesiraju, Lukás Burget:
i-Vectors in Language Modeling: An Efficient Way of Domain Adaptation for Feed-Forward Models. 3383-3387
Speech Pathology, Depression, and Medical Applications
- Eva-Maria Rathner, Julia Djamali, Yannik Terhorst, Björn W. Schuller, Nicholas Cummins, Gudrun Salamon, Christina Hunger-Schoppe, Harald Baumeister:
How Did You like 2017? Detection of Language Markers of Depression and Narcissism in Personal Narratives. 3388-3392 - Zhaocheng Huang, Julien Epps, Dale Joachim, Michael Chen:
Depression Detection from Short Utterances via Diverse Smartphones in Natural Environmental Conditions. 3393-3397 - Yasin Özkanca, Cenk Demiroglu, Asli Besirli, Selime Celik:
Multi-Lingual Depression-Level Assessment from Conversational Speech Using Acoustic and Text Features. 3398-3402 - N. P. Narendra, Paavo Alku:
Dysarthric Speech Classification Using Glottal Features Computed from Non-words, Words and Sentences. 3403-3407 - Gábor Gosztolya, Anita Bagi, Szilvia Szalóki, István Szendi, Ildikó Hoffmann:
Identifying Schizophrenia Based on Temporal Parameters in Spontaneous Speech. 3408-3412 - Karan Singla, Zhuohao Chen, Nikolaos Flemotomos, James Gibson, Dogan Can, David C. Atkins, Shrikanth S. Narayanan:
Using Prosodic and Lexical Information for Learning Utterance-level Behaviors in Psychotherapy. 3413-3417 - Ying Qin, Tan Lee, Siyuan Feng, Anthony Pak-Hin Kong:
Automatic Speech Assessment for People with Aphasia Using TDNN-BLSTM with Multi-Task Learning. 3418-3422 - Md. Nasir, Brian R. Baucom, Shrikanth S. Narayanan, Panayiotis G. Georgiou:
Towards an Unsupervised Entrainment Distance in Conversational Speech Using Deep Neural Networks. 3423-3427 - Francisco Teixeira, Alberto Abad, Isabel Trancoso:
Patient Privacy in Paralinguistic Tasks. 3428-3432 - Sadeen Alharbi, Madina Hasan, Anthony J. H. Simons, Shelagh Brumfitt, Phil D. Green:
A Lightly Supervised Approach to Detect Stuttering in Children's Speech. 3433-3437 - Jeng-Lin Li, Yi-Ming Weng, Chip-Jin Ng, Chi-Chun Lee:
Learning Conditional Acoustic Latent Representation with Gender and Age Attributes for Automatic Pain Level Recognition. 3438-3442
Perspective Talk-4
- Sriram Ganapathy:
Speaker and Language Recognition - From Laboratory Technologies to the Wild. 3443
Spoken Language Understanding
- Yu Wang, Abhishek Patel, Yilin Shen, Hongxia Jin:
A Deep Reinforcement Learning Based Multimodal Coaching Model (DCM) for Slot Filling in Spoken Language Understanding(SLU). 3444-3448 - Frédéric Béchet, Christian Raymond:
Is ATIS Too Shallow to Go Deeper for Benchmarking Spoken Language Understanding Models? 3449-3453 - Avik Ray, Yilin Shen, Hongxia Jin:
Robust Spoken Language Understanding via Paraphrasing. 3454-3458 - Chia-Hsuan Li, Szu-Lin Wu, Chi-Liang Liu, Hung-yi Lee:
Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension. 3459-3463 - Yilin Shen, Xiangyu Zeng, Yu Wang, Hongxia Jin:
User Information Augmented Semantic Frame Parsing Using Progressive Neural Networks. 3464-3468 - Raghav Gupta, Abhinav Rastogi, Dilek Hakkani-Tür:
An Efficient Approach to Encoding Context for Spoken Language Understanding. 3469-3473
Source Separation from Monaural Input
- Jeffrey Hetherly, Paul Gamble, Maria Alejandra Barrios, Cory Stephenson, Karl Ni:
Deep Speech Denoising with Vector Space Projections. 3474-3478 - Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
A Shifted Delta Coefficient Objective for Monaural Speech Separation Using Multi-task Learning. 3479-3483 - Ke Tan, DeLiang Wang:
A Two-Stage Approach to Noisy Cochannel Speech Separation with Gated Residual Networks. 3484-3488 - Laxmi Pandey, Anurendra Kumar, Vinay P. Namboodiri:
Monoaural Audio Source Separation Using Variational Autoencoders. 3489-3493 - Arpita Gang, Pravesh Biyani, Akshay Soni:
Towards Automated Single Channel Source Separation Using Neural Networks. 3494-3498 - Hakan Erdogan, Takuya Yoshioka:
Investigations on Data Augmentation and Loss Functions for Deep Learning Based Speech-Background Separation. 3499-3503
Multimodal Systems
- Simone Hantke, Christoph Stemp, Björn W. Schuller:
Annotator Trustability-based Cooperative Learning Solutions for Intelligent Audio Analysis. 3504-3508 - Rongfeng Su, Xunying Liu, Lan Wang:
Semi-supervised Cross-domain Visual Feature Learning for Audio-Visual Broadcast Speech Transcription. 3509-3513 - Triantafyllos Afouras, Joon Son Chung, Andrew Zisserman:
Deep Lip Reading: A Comparison of Models and an Online Application. 3514-3518 - Ajay Srinivasamurthy, Petr Motlícek, Mittul Singh, Youssef Oualil, Matthias Kleinert, Heiko Ehr, Hartmut Helmke:
Iterative Learning of Speech Recognition Models for Air Traffic Control. 3519-3523 - Leda Sari, Mark Hasegawa-Johnson, Kumaran S, Georg Stemmer, Krishnakumar N. Nair:
Speaker Adaptive Audio-Visual Fusion for the Open-Vocabulary Section of AVICAR. 3524-3528 - Marek Hrúz, Ales Prazák, Michal Busta:
Multimodal Name Recognition in Live TV Subtitling. 3529-3532
Coding
- Tom Bäckström, Johannes Fischer, Sneha Das:
Dithered Quantization for Frequency-Domain Speech and Audio Coding. 3533-3537 - Sneha Das, Tom Bäckström:
Postfiltering with Complex Spectral Correlations for Speech and Audio Coding. 3538-3542 - Sneha Das, Tom Bäckström:
Postfiltering Using Log-Magnitude Spectrum for Speech and Audio Coding. 3543-3547 - Arijit Biswas, Per Hedelin, Lars F. Villemoes, Vinay Melkote:
Temporal Noise Shaping with Companding. 3548-3552 - Yaxing Li, Eshete Derb Emiru, Shengwu Xiong, Anna Zhu, Pengfei Duan, Yichang Li:
Multi-frame Quantization of LSF Parameters Using a Deep Autoencoder and Pyramid Vector Quantizer. 3553-3557 - Yaxing Li, Shan Xu, Shengwu Xiong, Anna Zhu, Pengfei Duan, Yueming Ding:
Multi-frame Coding of LSF Parameters Using Block-Constrained Trellis Coded Vector Quantization. 3558-3562
Speaker Verification Using Neural Network Methods II
- Heewoong Park, Sukhyun Cho, Kyubyong Park, Namju Kim, Jonghun Park:
Training Utterance-level Embedding Networks for Speaker Identification and Verification. 3563-3567 - Mahesh Kumar Nandwana, Mitchell McLaren, Diego Castán, Julien van Hout, Aaron Lawson:
Analysis of Complementary Information Sources in the Speaker Embeddings Framework. 3568-3572 - Yingke Zhu, Tom Ko, David Snyder, Brian Mak, Daniel Povey:
Self-Attentive Speaker Embeddings for Text-Independent Speaker Verification. 3573-3577 - Zhifu Gao, Yan Song, Ian McLoughlin, Wu Guo, Lirong Dai:
An Improved Deep Embedding Learning Method for Short Duration Speaker Verification. 3578-3582 - Jee-weon Jung, Hee-Soo Heo, Il-Ho Yang, Hye-jin Shim, Ha-Jin Yu:
Avoiding Speaker Overfitting in End-to-End DNNs Using Raw Waveform for Text-Independent Speaker Verification. 3583-3587 - Gautam Bhattacharya, Jahangir Alam, Vishwa Gupta, Patrick Kenny:
Deeply Fused Speaker Embeddings for Text-Independent Speaker Verification. 3588-3592 - Md. Hafizur Rahman, Ivan Himawan, Mitchell McLaren, Clinton Fookes, Sridha Sridharan:
Employing Phonetic Information in DNN Speaker Embeddings to Improve Speaker Recognition Performance. 3593-3597 - Subhadeep Dey, Srikanth R. Madikeri, Petr Motlícek:
End-to-end Text-dependent Speaker Verification Using Novel Distance Measures. 3598-3602 - Harishchandra Dubey, Abhijeet Sangwan, John H. L. Hansen:
Robust Speaker Clustering using Mixtures of von Mises-Fisher Distributions for Naturalistic Audio Streams. 3603-3607 - Huan Song, Megan M. Willi, Jayaraman J. Thiagarajan, Visar Berisha, Andreas Spanias:
Triplet Network with Attention for Speaker Diarization. 3608-3612 - Jiacen Zhang, Nakamasa Inoue, Koichi Shinoda:
I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification. 3613-3617 - Weicheng Cai, Jinkun Chen, Ming Li:
Analysis of Length Normalization in End-to-End Speaker Verification System. 3618-3622 - Zili Huang, Shuai Wang, Kai Yu:
Angular Softmax for Short-Duration Text-independent Speaker Verification. 3623-3627 - Ruifang Ji, Xinyuan Cai, Bo Xu:
An End-to-End Text-Independent Speaker Identification System on Short Utterances. 3628-3632 - Wenhao Ding, Liang He:
MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks. 3633-3637
Emotion Recognition and Analysis
- Emilia Parada-Cabaleiro, Giovanni Costantini, Anton Batliner, Alice Baird, Björn W. Schuller:
Categorical vs Dimensional Perception of Italian Emotional Speech. 3638-3642 - Xingfeng Li, Masato Akagi:
A Three-Layer Emotion Perception Model for Valence and Arousal-Based Detection from Multilingual Speech. 3643-3647 - Brecht Desplanques, Kris Demuynck:
Cross-lingual Speech Emotion Recognition through Factor Analysis. 3648-3652 - Jian Cheng, Jared Bernstein, Elizabeth Rosenfeld, Peter W. Foltz, Alex S. Cohen, Terje B. Holmlund, Brita Elvevåg:
Modeling Self-Reported and Observed Affect from Speech. 3653-3657 - Che-Wei Huang, Shrikanth S. Narayanan:
Stochastic Shake-Shake Regularization for Affective Learning from Speech. 3658-3662 - Anderson R. Avila, Md. Jahangir Alam, Douglas D. O'Shaughnessy, Tiago H. Falk:
Investigating Speech Enhancement and Perceptual Quality for Speech Emotion Recognition. 3663-3667 - Mia Atcheson, Vidhyasaharan Sethu, Julien Epps:
Demonstrating and Modelling Systematic Time-varying Annotator Disagreement in Continuous Emotion Annotation. 3668-3672 - Jian Huang, Ya Li, Jianhua Tao, Zhen Lian:
Speech Emotion Recognition from Variable-Length Inputs with Triplet Loss Function. 3673-3677 - Xiaotong Zhang, Xingliang Cheng, Mingxing Xu, Thomas Fang Zheng:
Imbalance Learning-based Framework for Fear Recognition in the MediaEval Emotional Impact of Movies Task. 3678-3682 - Xi Ma, Zhiyong Wu, Jia Jia, Mingxing Xu, Helen Meng, Lianhong Cai:
Emotion Recognition from Variable-Length Speech Segments Using Deep Learning on Spectrograms. 3683-3687 - Promod Yenigalla, Abhay Kumar, Suraj Tripathi, Chirag Singh, Sibsambhu Kar, Jithendra Vepa:
Speech Emotion Recognition Using Spectrogram & Phoneme Embedding. 3688-3692 - Saurabh Sahu, Rahul Gupta, Carol Y. Espy-Wilson:
On Enhancing Speech Emotion Recognition Using Generative Adversarial Networks. 3693-3697 - Srinivas Parthasarathy, Carlos Busso:
Ladder Networks for Emotion Recognition: Using Unsupervised Auxiliary Tasks to Improve Predictions of Emotional Attributes. 3698-3702
Acoustic Modelling
- Mingkun Huang, Yongbin You, Zhehuai Chen, Yanmin Qian, Kai Yu:
Knowledge Distillation for Sequence Model. 3703-3707 - Sheng Li, Xugang Lu, Ryoichi Takashima, Peng Shen, Tatsuya Kawahara, Hisashi Kawai:
Improving CTC-based Acoustic Model with Very Deep Residual Time-delay Neural Networks. 3708-3712 - Jinxi Guo, Ning Xu, Xin Chen, Yang Shi, Kaiyuan Xu, Abeer Alwan:
Filter Sampling and Combination CNN (FSC-CNN): A Compact CNN Model for Small-footprint ASR Acoustic Modeling Using Raw Waveforms. 3713-3717 - Mirco Ravanelli, Dmitriy Serdyuk, Yoshua Bengio:
Twin Regularization for Online Speech Recognition. 3718-3722 - Matthias Sperber, Jan Niehues, Graham Neubig, Sebastian Stüker, Alex Waibel:
Self-Attentional Acoustic Models. 3723-3727 - Jinhwan Park, Iksoo Choi, Yoonho Boo, Wonyong Sung:
Hierarchical Recurrent Neural Networks for Acoustic Modeling. 3728-3732 - Antoine Bruguier, Anton Bakhtin, Dravyansh Sharma:
Dictionary Augmented Sequence-to-Sequence Neural Network for Grapheme to Phoneme Prediction. 3733-3737 - Ankit Raj, Shakti P. Rath, Jithendra Vepa:
Leveraging Second-Order Log-Linear Model for Improved Deep Learning Based ASR Performance. 3738-3742 - Daniel Povey, Gaofeng Cheng, Yiming Wang, Ke Li, Hainan Xu, Mahsa Yarmohammadi, Sanjeev Khudanpur:
Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks. 3743-3747 - Da-Rong Liu, Kuan-Yu Chen, Hung-yi Lee, Lin-Shan Lee:
Completely Unsupervised Phoneme Recognition by Adversarially Learning Mapping Relationships from Audio Embeddings. 3748-3752 - Mengjie Qian, Linxue Bai, Peter Jancovic, Martin J. Russell:
Phone Recognition Using a Non-Linear Manifold with Broad Phone Class Dependent DNNs. 3753-3757 - Ehsan Hosseini-Asl, Yingbo Zhou, Caiming Xiong, Richard Socher:
A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation. 3758-3762
Speech and Speaker Perception
- Chong Cao, Wei Wei, Wei Wang, Yanlu Xie, Jinsong Zhang:
Interactions between Vowels and Nasal Codas in Mandarin Speakers' Perception of Nasal Finals. 3763-3767 - Qinglin Meng, Nengheng Zheng, Ambika Prasad Mishra, Jacinta Dan Luo, Jan W. H. Schnupp:
Weighting Pitch Contour and Loudness Contour in Mandarin Tone Perception in Cochlear Implant Listeners. 3768-3771 - Filip Nenadic, Louis ten Bosch, Benjamin V. Tucker:
Implementing DIANA to Model Isolated Auditory Word Recognition in English. 3772-3776 - Bhamini Sharma:
Effects of Homophone Density on Spoken Word Recognition in Mandarin Chinese. 3777-3780 - Hui Xie, Biao Zeng, Rui Wang:
Visual Timing Information in Audiovisual Speech Perception: Evidence from Lexical Tone Contour. 3781-3785 - Marie-Lou Barnaud, Julien Diard, Pierre Bessière, Jean-Luc Schwartz:
COSMO SylPhon: A Bayesian Perceptuo-motor Model to Assess Phonological Learning. 3786-3790 - Akshay Raj Maggu, Patrick C. M. Wong, Hanjun Liu, Francis C. K. Wong:
Experience-dependent Influence of Music and Language on Lexical Pitch Learning Is Not Additive. 3791-3794 - Volker Dellwo, Thayabaran Kathiresan, Elisa Pellegrino, Lei He, Sandra Schwab, Dieter Maurer:
Influences of Fundamental Oscillation on Speaker Identification in Vocalic Utterances by Humans and Computers. 3795-3799