Mathew Magimai-Doss
2020 – today
- 2024
- [j23] Vishal Kumar, Vinayak Abrol, Mathew Magimai-Doss: On the Quantization of Neural Models for Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4226-4236 (2024)
- [c127] Neha Tarigopula, Preyas Garg, Skanda Muralidhar, Sandrine Tornay, Dinesh Babu Jayagopi, Mathew Magimai-Doss: Content-Based Objective Evaluation of Artificially Generated Sign Language Videos. ICASSP 2024: 3815-3819
- [c126] Sevada Hovsepyan, Mathew Magimai-Doss: Syllable Level Features for Parkinson's Disease Detection from Speech. ICASSP 2024: 11416-11420
- [c125] Bogdan Vlasenko, Sargam Vyas, Mathew Magimai-Doss: Comparing data-Driven and Handcrafted Features for Dimensional Emotion Recognition. ICASSP 2024: 11841-11845
- [i10] Gasser Elbanna, Zohreh Mostaani, Mathew Magimai-Doss: Predicting Heart Activity from Speech using Data-driven and Knowledge-based features. CoRR abs/2406.06341 (2024)
- [i9] Eklavya Sarkar, Mathew Magimai-Doss: On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis. CoRR abs/2407.16417 (2024)
- [i8] Maryam Naderi, Enno Hermann, Alexandre Nanchen, Sevada Hovsepyan, Mathew Magimai-Doss: Towards interfacing large language models with ASR systems using confidence measures and prompting. CoRR abs/2407.21414 (2024)
- [i7] Karl El Hajal, Ajinkya Kulkarni, Enno Hermann, Mathew Magimai-Doss: SSL-TTS: Leveraging Self-Supervised Embeddings and kNN Retrieval for Zero-Shot Multi-speaker TTS. CoRR abs/2408.10771 (2024)
- [i6] Imen Ben Mahmoud, Eklavya Sarkar, Marta B. Manser, Mathew Magimai-Doss: Feature Representations for Automatic Meerkat Vocalization Classification. CoRR abs/2408.15296 (2024)
- 2023
- [c124] Tilak Purohit, Sarthak Yadav, Bogdan Vlasenko, S. Pavankumar Dubagunta, Mathew Magimai-Doss: Towards Learning Emotion Information from Short Segments of Speech. ICASSP 2023: 1-5
- [c123] Enno Hermann, Mathew Magimai-Doss: Few-shot Dysarthric Speech Recognition with Text-to-Speech Data Augmentation. INTERSPEECH 2023: 156-160
- [c122] Eklavya Sarkar, Mathew Magimai-Doss: Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers? INTERSPEECH 2023: 1189-1193
- [c121] Tilak Purohit, Bogdan Vlasenko, Mathew Magimai-Doss: Implicit phonetic information modeling for speech emotion recognition. INTERSPEECH 2023: 1883-1887
- [c120] Timothy Piton, Enno Hermann, Angela Pasqualotto, Marjolaine Cohen, Mathew Magimai-Doss, Daphne Bavelier: Using Commercial ASR Solutions to Assess Reading Skills in Children: A Case Report. INTERSPEECH 2023: 4573-4577
- [i5] Eklavya Sarkar, Mathew Magimai-Doss: Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers? CoRR abs/2305.14035 (2023)
- 2022
- [j22] S. Pavankumar Dubagunta, Rob J. J. H. van Son, Mathew Magimai-Doss: Adjustable deterministic pseudonymization of speech. Comput. Speech Lang. 72: 101284 (2022)
- [c119] Zohreh Mostaani, RaviShankar Prasad, Bogdan Vlasenko, Mathew Magimai-Doss: Modeling of Pre-Trained Neural Network Embeddings Learned From Raw Waveform for COVID-19 Infection Detection. ICASSP 2022: 8482-8486
- [c118] S. Pavankumar Dubagunta, Edoardo Moneta, Eleni Theocharopoulos, Mathew Magimai-Doss: Towards Automatic Prediction of Non-Expert Perceived Speech Fluency Ratings. ICMI Companion 2022: 7-11
- [c117] Neha Tarigopula, Sandrine Tornay, Skanda Muralidhar, Mathew Magimai-Doss: Towards Accessible Sign Language Assessment and Learning. ICMI 2022: 626-631
- [c116] Zohreh Mostaani, Mathew Magimai-Doss: On Breathing Pattern Information in Synthetic Speech. INTERSPEECH 2022: 2768-2772
- [c115] Eklavya Sarkar, RaviShankar Prasad, Mathew Magimai-Doss: Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering. INTERSPEECH 2022: 4626-4630
- [c114] Sarthak Yadav, Tilak Purohit, Zohreh Mostaani, Bogdan Vlasenko, Mathew Magimai-Doss: Comparing Biosignal and Acoustic feature Representation for Continuous Emotion Recognition. MuSe @ ACM Multimedia 2022: 37-45
- [i4] Tilak Purohit, Imen Ben Mahmoud, Bogdan Vlasenko, Mathew Magimai-Doss: Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track. CoRR abs/2206.11968 (2022)
- 2021
- [j21] Venkata Srikanth Nallanthighal, Zohreh Mostaani, Aki Härmä, Helmer Strik, Mathew Magimai-Doss: Deep learning architectures for estimating breathing signal and respiratory parameters from speech recordings. Neural Networks 141: 211-224 (2021)
- [j20] Jilt Sebastian, Mriganka Sur, Hema A. Murthy, Mathew Magimai-Doss: Signal-to-signal neural networks for improved spike estimation from calcium imaging data. PLoS Comput. Biol. 17(3) (2021)
- [j19] Julian Fritsch, Mathew Magimai-Doss: Utterance Verification-Based Dysarthric Speech Intelligibility Assessment Using Phonetic Posterior Features. IEEE Signal Process. Lett. 28: 224-228 (2021)
- [j18] Alejandro Gómez Alanís, José Andrés González López, S. Pavankumar Dubagunta, Antonio M. Peinado, Mathew Magimai-Doss: On Joint Optimization of Automatic Speaker Verification and Anti-Spoofing in the Embedding Space. IEEE Trans. Inf. Forensics Secur. 16: 1579-1593 (2021)
- [c113] Bence Mark Halpern, Julian Fritsch, Enno Hermann, Rob van Son, Odette Scharenborg, Mathew Magimai-Doss: An Objective Evaluation Framework for Pathological Speech Synthesis. ITG Conference on Speech Communication 2021: 1-5
- [c112] Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik, Mathew Magimai-Doss: Phoneme Based Respiratory Analysis of Read Speech. EUSIPCO 2021: 191-195
- [c111] Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Härmä, Helmer Strik, Mathew Magimai-Doss: On The Relationship Between Speech-Based Breathing Signal Prediction Evaluation Measures and Breathing Parameters Estimation. ICASSP 2021: 1345-1349
- [c110] Esaú Villatoro-Tello, Gabriela Ramírez-de-la-Rosa, Daniel Gática-Pérez, Mathew Magimai-Doss, Héctor Jiménez-Salazar: Approximating the Mental Lexicon from Clinical Interviews as a Support Tool for Depression Detection. ICMI 2021: 557-566
- [c109] Juan Camilo Vásquez-Correa, Julian Fritsch, Juan Rafael Orozco-Arroyave, Elmar Nöth, Mathew Magimai-Doss: On Modeling Glottal Source Information for Phonation Assessment in Parkinson's Disease. Interspeech 2021: 26-30
- [c108] RaviShankar Prasad, Mathew Magimai-Doss: Identification of F1 and F2 in Speech Using Modified Zero Frequency Filtering. Interspeech 2021: 56-60
- [c107] Esaú Villatoro-Tello, S. Pavankumar Dubagunta, Julian Fritsch, Gabriela Ramírez-de-la-Rosa, Petr Motlícek, Mathew Magimai-Doss: Late Fusion of the Available Lexicon and Raw Waveform-Based Acoustic Modeling for Depression and Dementia Recognition. Interspeech 2021: 1927-1931
- [c106] Enno Hermann, Mathew Magimai-Doss: Handling Acoustic Variation in Dysarthric Speech Recognition Systems Through Model Combination. Interspeech 2021: 4788-4792
- [c105] Bogdan Vlasenko, RaviShankar Prasad, Mathew Magimai-Doss: Fusion of Acoustic and Linguistic Information using Supervised Autoencoder for Improved Emotion Recognition. MuSe @ ACM Multimedia 2021: 51-59
- [i3] Bence Mark Halpern, Julian Fritsch, Enno Hermann, Rob van Son, Odette Scharenborg, Mathew Magimai-Doss: An Objective Evaluation Framework for Pathological Speech Synthesis. CoRR abs/2107.00308 (2021)
- 2020
- [c104] RaviShankar Prasad, Gürkan Yilmaz, Olivier Chételat, Mathew Magimai-Doss: Detection Of S1 And S2 Locations In Phonocardiogram Signals Using Zero Frequency Filter. ICASSP 2020: 1254-1258
- [c103] Enno Hermann, Mathew Magimai-Doss: Dysarthric Speech Recognition with Lattice-Free MMI. ICASSP 2020: 6109-6113
- [c102] Sandrine Tornay, Marzieh Razavi, Mathew Magimai-Doss: Towards Multilingual Sign Language Recognition. ICASSP 2020: 6309-6313
- [c101] Julian Fritsch, S. Pavankumar Dubagunta, Mathew Magimai-Doss: Estimating the Degree of Sleepiness by Integrating Articulatory Feature Knowledge in Raw Waveform Based CNNs. ICASSP 2020: 6534-6538
- [c100] Sandrine Tornay, Necati Cihan Camgöz, Richard Bowden, Mathew Magimai-Doss: A Phonology-based Approach for Isolated Sign Production Assessment in Sign Language. ICMI Companion 2020: 102-106
- [c99] Nicholas Cummins, Yilin Pan, Zhao Ren, Julian Fritsch, Venkata Srikanth Nallanthighal, Heidi Christensen, Daniel Blackburn, Björn W. Schuller, Mathew Magimai-Doss, Helmer Strik, Aki Härmä: A Comparison of Acoustic and Linguistics Methodologies for Alzheimer's Dementia Recognition. INTERSPEECH 2020: 2182-2186
- [c98] Sandrine Tornay, Oya Aran, Mathew Magimai-Doss: An HMM Approach with Inherent Model Selection for Sign Language and Gesture Recognition. LREC 2020: 6049-6056
2010 – 2019
- 2019
- [j17] Sandrine Tornay, Mathew Magimai-Doss: Subunits Inference and Lexicon Development Based on Pairwise Comparison of Utterances and Signs. Inf. 10(10): 298 (2019)
- [j16] Dimitri Palaz, Mathew Magimai-Doss, Ronan Collobert: End-to-end acoustic modeling using convolutional neural networks for HMM-based automatic speech recognition. Speech Commun. 108: 15-32 (2019)
- [c97] Sandrine Tornay, Marzieh Razavi, Necati Cihan Camgöz, Richard Bowden, Mathew Magimai-Doss: HMM-based Approaches to Model Multichannel Information in Sign Language Inspired from Articulatory Features-based Speech Processing. ICASSP 2019: 2817-2821
- [c96] S. Pavankumar Dubagunta, Selen Hande Kabil, Mathew Magimai-Doss: Improving Children Speech Recognition through Feature Learning from Raw Speech Signal. ICASSP 2019: 5736-5740
- [c95] S. Pavankumar Dubagunta, Mathew Magimai-Doss: Segment-level Training of ANNs Based on Acoustic Confidence Measures for Hybrid HMM/ANN Speech Recognition. ICASSP 2019: 6435-6439
- [c94] S. Pavankumar Dubagunta, Bogdan Vlasenko, Mathew Magimai-Doss: Learning Voice Source Related Information for Depression Detection. ICASSP 2019: 6525-6529
- [c93] Hannah Muckenhirn, Vinayak Abrol, Mathew Magimai-Doss, Sébastien Marcel: Understanding and Visualizing Raw Waveform-Based CNNs. INTERSPEECH 2019: 2345-2349
- [c92] S. Pavankumar Dubagunta, Mathew Magimai-Doss: Using Speech Production Knowledge for Raw Waveform Modelling Based Styrian Dialect Identification. INTERSPEECH 2019: 2383-2387
- 2018
- [j15] Marzieh Razavi, Ramya Rasipuram, Mathew Magimai-Doss: Towards weakly supervised acoustic subword unit discovery and lexicon development using hidden Markov models. Speech Commun. 96: 168-183 (2018)
- [c91] Hannah Muckenhirn, Mathew Magimai-Doss, Sébastien Marcel: Towards Directly Modeling Raw Speech Signal for Speaker Verification Using CNNs. ICASSP 2018: 4884-4888
- [c90] Selen Hande Kabil, Hannah Muckenhirn, Mathew Magimai-Doss: On Learning to Identify Genders from Raw Speech Signal Using CNNs. INTERSPEECH 2018: 287-291
- [c89] Jilt Sebastian, Manoj Kumar, Pavan Kumar D. S., Mathew Magimai-Doss, Hema A. Murthy, Shrikanth S. Narayanan: Denoising and Raw-waveform Networks for Weakly-Supervised Gender Identification on Noisy Speech. INTERSPEECH 2018: 292-296
- [c88] Bogdan Vlasenko, Jilt Sebastian, Pavan Kumar D. S., Mathew Magimai-Doss: Implementing Fusion Techniques for the Classification of Paralinguistic Information. INTERSPEECH 2018: 526-530
- [c87] Hannah Muckenhirn, Mathew Magimai-Doss, Sébastien Marcel: On Learning Vocal Tract System Related Speaker Discriminative Information from Raw Signal Using CNNs. INTERSPEECH 2018: 1116-1120
- [c86] Sarah Ebling, Necati Cihan Camgöz, Penny Boyes Braem, Katja Tissi, Sandra Sidler-Miserez, Stephanie Stoll, Simon Hadfield, Tobias Haug, Richard Bowden, Sandrine Tornay, Marzieh Razavi, Mathew Magimai-Doss: SMILE Swiss German Sign Language Dataset. LREC 2018
- 2017
- [j14] Marzieh Razavi, Mathew Magimai-Doss: A Posterior-Based Multistream Formulation for G2P Conversion. IEEE Signal Process. Lett. 24(4): 475-479 (2017)
- [j13] Hannah Muckenhirn, Pavel Korshunov, Mathew Magimai-Doss, Sébastien Marcel: Long-Term Spectral Statistics for Voice Presentation Attack Detection. IEEE ACM Trans. Audio Speech Lang. Process. 25(11): 2098-2111 (2017)
- [c85] Hannah Muckenhirn, Mathew Magimai-Doss, Sébastien Marcel: End-to-End convolutional neural network-based voice presentation attack detection. IJCB 2017: 335-341
- 2016
- [j12] Ramya Rasipuram, Mathew Magimai-Doss: Articulatory feature based continuous speech recognition using probabilistic lexical modeling. Comput. Speech Lang. 36: 233-259 (2016)
- [j11] Marzieh Razavi, Ramya Rasipuram, Mathew Magimai-Doss: Acoustic data-driven grapheme-to-phoneme conversion in the probabilistic lexical modeling framework. Speech Commun. 80: 1-21 (2016)
- [c84] Hannah Muckenhirn, Mathew Magimai-Doss, Sébastien Marcel: Presentation Attack Detection Using Long-Term Spectral Statistics for Trustworthy Speaker Verification. BIOSIG 2016: 123-134
- [c83] Ramya Rasipuram, Milos Cernak, Mathew Magimai-Doss: HMM-Based Non-Native Accent Assessment Using Posterior Features. INTERSPEECH 2016: 3137-3141
- [c82] Marzieh Razavi, Mathew Magimai-Doss: Improving Under-Resourced Language ASR Through Latent Subword Unit Space Discovery. INTERSPEECH 2016: 3873-3877
- 2015
- [j10] Ramya Rasipuram, Mathew Magimai-Doss: Acoustic and lexical resource constrained ASR using language-independent acoustic model and language-dependent probabilistic lexical model. Speech Commun. 68: 23-40 (2015)
- [c81] Dimitri Palaz, Mathew Magimai-Doss, Ronan Collobert: Convolutional Neural Networks-based continuous speech recognition using raw speech signal. ICASSP 2015: 4295-4299
- [c80] Marzieh Razavi, Mathew Magimai-Doss: An HMM-based formalism for automatic subword unit derivation and pronunciation generation. ICASSP 2015: 4639-4643
- [c79] Raphael Ullmann, Mathew Magimai-Doss, Hervé Bourlard: Objective speech intelligibility assessment through comparison of phoneme class conditional probability sequences. ICASSP 2015: 4924-4928
- [c78] Ramya Rasipuram, Marzieh Razavi, Mathew Magimai-Doss: Integrated pronunciation learning for automatic speech recognition using probabilistic lexical modeling. ICASSP 2015: 5176-5180
- [c77] Dimitri Palaz, Mathew Magimai-Doss, Ronan Collobert: Analysis of CNN-based speech recognition system using raw speech as input. INTERSPEECH 2015: 11-15
- [c76] Ramya Rasipuram, Milos Cernak, Alexandre Nanchen, Mathew Magimai-Doss: Automatic accentedness evaluation of non-native speech using phonetic and sub-phonetic posterior probabilities. INTERSPEECH 2015: 648-652
- [c75] Raphael Ullmann, Ramya Rasipuram, Mathew Magimai-Doss, Hervé Bourlard: Objective intelligibility assessment of text-to-speech systems through utterance verification. INTERSPEECH 2015: 3501-3505
- [c74] Dimitri Palaz, Mathew Magimai-Doss, Ronan Collobert: Learning linearly separable features for speech recognition using convolutional neural networks. ICLR (Workshop) 2015
- 2014
- [j9] Weifeng Li, Longbiao Wang, Yicong Zhou, John Dines, Mathew Magimai-Doss, Hervé Bourlard, Qingmin Liao: Feature mapping of multiple beamformed sources for robust overlapping speech recognition using a microphone array. IEEE ACM Trans. Audio Speech Lang. Process. 22(12): 2244-2255 (2014)
- [c73] Dimitri Palaz, Mathew Magimai-Doss, Ronan Collobert: Joint phoneme segmentation inference and classification using CRFs. GlobalSIP 2014: 587-591
- [c72] Marzieh Razavi, Ramya Rasipuram, Mathew Magimai-Doss: On modeling context-dependent clustered states: Comparing HMM/GMM, hybrid HMM/ANN and KL-HMM approaches. ICASSP 2014: 7659-7663
- [c71] Marzieh Razavi, Mathew Magimai-Doss: On recognition of non-native speech using probabilistic lexical model. INTERSPEECH 2014: 26-30
- 2013
- [j8] Sunder Ram Krishnan, Mathew Magimai-Doss, Chandra Sekhar Seelamantula: A Savitzky-Golay Filtering Perspective of Dynamic Feature Computation. IEEE Signal Process. Lett. 20(3): 281-284 (2013)
- [j7] David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, Mathew Magimai-Doss: Applying Multi- and Cross-Lingual Stochastic Phone Space Transformations to Non-Native Speech Recognition. IEEE Trans. Speech Audio Process. 21(8): 1713-1726 (2013)
- [c70] Ramya Rasipuram, Marzieh Razavi, Mathew Magimai-Doss: Probabilistic lexical modeling and unsupervised training for zero-resourced ASR. ASRU 2013: 446-451
- [c69] Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel, Dietrich Klakow: A probabilistic framework for multiple speaker localization. ICASSP 2013: 3962-3966
- [c68] Ramya Rasipuram, Peter Bell, Mathew Magimai-Doss: Grapheme and multilingual posterior features for under-resourced speech recognition: A study on Scottish Gaelic. ICASSP 2013: 7334-7338
- [c67] Ramya Rasipuram, Mathew Magimai-Doss: Improving grapheme-based ASR by probabilistic lexical modeling approach. INTERSPEECH 2013: 505-509
- [c66] Dimitri Palaz, Ronan Collobert, Mathew Magimai-Doss: Estimating phoneme class conditional probabilities from raw speech signal using convolutional neural networks. INTERSPEECH 2013: 1766-1770
- [i2] Dimitri Palaz, Ronan Collobert, Mathew Magimai-Doss: Estimating Phoneme Class Conditional Probabilities from Raw Speech Signal using Convolutional Neural Networks. CoRR abs/1304.1018 (2013)
- [i1] Dimitri Palaz, Ronan Collobert, Mathew Magimai-Doss: End-to-end Phoneme Sequence Recognition using Convolutional Neural Networks. CoRR abs/1312.2137 (2013)
- 2012
- [j6] Shajith Ikbal, Hemant Misra, Hynek Hermansky, Mathew Magimai-Doss: Phase AutoCorrelation (PAC) features for noise robust speech recognition. Speech Commun. 54(7): 867-880 (2012)
- [j5] Anindya Roy, Mathew Magimai-Doss, Sébastien Marcel: A Fast Parts-Based Approach to Speaker Verification Using Boosted Slice Classifiers. IEEE Trans. Inf. Forensics Secur. 7(1): 241-254 (2012)
- [c65] Youssef Oualil, Friedrich Faubel, Mathew Magimai-Doss, Dietrich Klakow: A TDOA Gaussian mixture model for improving acoustic source tracking. EUSIPCO 2012: 1339-1343
- [c64] Ramya Rasipuram, Mathew Magimai-Doss: Acoustic data-driven grapheme-to-phoneme conversion using KL-HMM. ICASSP 2012: 4841-4844
- [c63] Serena Soldo, Mathew Magimai-Doss, Hervé Bourlard: Template-based ASR using posterior features and synthetic references: comparing different TTS systems. SAPA@INTERSPEECH 2012: 52-57
- [c62] Youssef Oualil, Mathew Magimai-Doss, Friedrich Faubel, Dietrich Klakow: Joint detection and localization of multiple speakers using a probabilistic interpretation of the steered response power. SAPA@INTERSPEECH 2012: 68-73
- [c61] Yang Sun, Mathew M. Doss, Jort F. Gemmeke, Bert Cranen, Louis ten Bosch, Lou Boves: Combination of Sparse Classification and Multilayer Perceptron for Noise-robust ASR. INTERSPEECH 2012: 310-313
- [c60] Ramya Rasipuram, Mathew Magimai-Doss: Combining Acoustic Data Driven G2P and Letter-to-Sound Rules for Under Resource Lexicon Generation. INTERSPEECH 2012: 1820-1823
- [c59] Yang Sun, Bert Cranen, Jort F. Gemmeke, Louis ten Bosch, Lou Boves, Mathew M. Doss: Using Sparse Classification Outputs as Feature Observations for Noise-robust ASR. INTERSPEECH 2012: 2142-2145
- [c58] Serena Soldo, Mathew Magimai-Doss, Hervé Bourlard: Synthetic References for Template-based ASR using posterior features. INTERSPEECH 2012: 2146-2149
- [c57] Anindya Roy, Mathew Magimai-Doss, Sébastien Marcel: Boosting localized binary features for speech recognition. MLSLP 2012: 18-21
- 2011
- [j4] Joel Pinto, Garimella S. V. S. Sivaram, Mathew Magimai-Doss, Hynek Hermansky, Hervé Bourlard: Analysis of MLP-Based Hierarchical Phoneme Posterior Probability Estimator. IEEE Trans. Speech Audio Process. 19(2): 225-241 (2011)
- [j3] Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Suman V. Ravuri, Wen Wang: Transcribing Mandarin Broadcast Speech Using Multi-Layer Perceptron Acoustic Features. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2439-2450 (2011)
- [j2] Sree Hari Krishnan Parthasarathi, Daniel Gatica-Perez, Hervé Bourlard, Mathew Magimai-Doss: Privacy-Sensitive Audio Features for Speech/Nonspeech Detection. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2538-2551 (2011)
- [c56] David Imseng, Ramya Rasipuram, Mathew Magimai-Doss: Fast and flexible Kullback-Leibler divergence based acoustic modeling for non-native speech recognition. ASRU 2011: 348-353
- [c55] Ramya Rasipuram, Mathew Magimai-Doss: Improving Articulatory Feature and Phoneme Recognition Using Multitask Learning. ICANN (1) 2011: 299-306
- [c54] Serena Soldo, Mathew Magimai-Doss, Joel Pinto, Hervé Bourlard: Posterior features for template-based ASR. ICASSP 2011: 4864-4867
- [c53] Anindya Roy, Mathew Magimai-Doss, Sébastien Marcel: Phoneme recognition using Boosted Binary Features. ICASSP 2011: 4868-4871
- [c52] David Imseng, Hervé Bourlard, Mathew Magimai-Doss, John Dines: Language dependent universal phoneme posterior estimation for mixed language speech recognition. ICASSP 2011: 5012-5015
- [c51] Ramya Rasipuram, Mathew Magimai-Doss: Integrating articulatory features using Kullback-Leibler divergence based acoustic model for phoneme recognition. ICASSP 2011: 5192-5195
- [c50] Anindya Roy, Mathew Magimai-Doss, Sébastien Marcel: Fast speaker verification on mobile phone data using boosted slice classifiers. IJCB 2011: 1-6
- [c49] Mathew Magimai-Doss, Ramya Rasipuram, Guillermo Aradilla, Hervé Bourlard: Grapheme-Based Automatic Speech Recognition Using KL-HMM. INTERSPEECH 2011: 445-448
- [c48] David Imseng, Hervé Bourlard, John Dines, Philip N. Garner, Mathew Magimai-Doss: Improving Non-Native ASR Through Stochastic Multilingual Phoneme Space Transformations. INTERSPEECH 2011: 537-540
- [c47] Joel Pinto, Mathew Magimai-Doss, Hervé Bourlard: Hierarchical Tandem Features for ASR in Mandarin. INTERSPEECH 2011: 1241-1244
- [c46] Fabio Valente, Mathew Magimai-Doss, Wen Wang: Analysis and Comparison of Recent MLP Features for LVCSR Systems. INTERSPEECH 2011: 1245-1248
- 2010
- [c45] Anindya Roy, Mathew Magimai-Doss, Sébastien Marcel: Boosted binary features for noise-robust speaker verification. ICASSP 2010: 4442-4445
- [c44] Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard, Daniel Gatica-Perez: Evaluating the robustness of privacy-sensitive audio features for speech detection in personal audio log scenarios. ICASSP 2010: 4474-4477
- [c43] David Imseng, Hervé Bourlard, Mathew Magimai-Doss: Towards mixed language speech recognition systems. INTERSPEECH 2010: 278-281
- [c42] Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Suman V. Ravuri, Wen Wang: A comparative large scale study of MLP features for Mandarin ASR. INTERSPEECH 2010: 2630-2633
- [c41] David Imseng, Mathew Magimai-Doss, Hervé Bourlard: Hierarchical multilayer perceptron based language identification. INTERSPEECH 2010: 2722-2725
2000 – 2009
- 2009
- [c40] Joel Pinto, Mathew Magimai-Doss, Hervé Bourlard: MLP based hierarchical system for task adaptation in ASR. ASRU 2009: 365-370
- [c39] Joel Pinto, Garimella S. V. S. Sivaram, Hynek Hermansky, Mathew Magimai-Doss: Volterra series for analyzing MLP based phoneme posterior estimator. ICASSP 2009: 1813-1816
- [c38] Guillermo Aradilla, Hervé Bourlard, Mathew Magimai-Doss: Posterior features applied to speech recognition tasks with user-defined vocabulary. ICASSP 2009: 3809-3812
- [c37] Weifeng Li, John Dines, Mathew Magimai-Doss, Hervé Bourlard: Non-linear mapping for multi-channel speech separation and robust overlapping speech recognition. ICASSP 2009: 3921-3924
- [c36] Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Daniel Gatica-Perez, Hervé Bourlard: Speaker change detection with privacy-preserving audio cues. ICMI 2009: 343-346
- [c35] Sree Hari Krishnan Parthasarathi, Mathew Magimai-Doss, Hervé Bourlard, Daniel Gatica-Perez: Investigating privacy-sensitive features for speech detection in multiparty conversations. INTERSPEECH 2009: 2243-2246
- [c34] Fabio Valente, Mathew Magimai-Doss, Christian Plahl, Suman V. Ravuri: Hierarchical processing of the modulation spectrum for GALE Mandarin LVCSR system. INTERSPEECH 2009: 2963-2966
- 2008
- [c33] Weifeng Li, Mathew Magimai-Doss, John Dines, Hervé Bourlard: MLP-based log spectral energy mapping for robust overlapping speech recognition. EUSIPCO 2008: 1-5
- [c32] Tamara Tosic, Mathew Magimai-Doss, Hynek Hermansky: Using comparison of parallel phoneme probability streams for OOV word detection. EUSIPCO 2008: 1-5
- [c31] Joel Pinto, B. Yegnanarayana, Hynek Hermansky, Mathew Magimai-Doss: Exploiting contextual information for improved phoneme recognition. ICASSP 2008: 4449-4452
- [c30] Guillermo Aradilla, Hervé Bourlard, Mathew Magimai-Doss: Using KL-based acoustic models in a large vocabulary recognition task. INTERSPEECH 2008: 928-931
- [c29] Weifeng Li, John Dines, Mathew Magimai-Doss, Hervé Bourlard: Neural network based regression for robust overlapping speech recognition using microphone arrays. INTERSPEECH 2008: 2012-2015
- [c28] Weifeng Li, Ken'ichi Kumatani, John Dines, Mathew Magimai-Doss, Hervé Bourlard: A Neural Network Based Regression Approach for Recognizing Simultaneous Speech. MLMI 2008: 110-118
- 2007
- [c27] Özgür Çetin, Mathew Magimai-Doss, Karen Livescu, Arthur Kantor, Simon King, Chris D. Bartels, Joe Frankel: Monolingual and crosslingual comparison of tandem features derived from articulatory and phone MLPs. ASRU 2007: 36-41
- [c26] Andreas Stolcke, Xavier Anguera, Kofi Boakye, Özgür Çetin, Adam Janin, Mathew Magimai-Doss, Chuck Wooters, Jing Zheng: The SRI-ICSI Spring 2007 Meeting and Lecture Recognition System. CLEAR 2007: 450-463
- [c25] Mathew Magimai-Doss, Dilek Hakkani-Tür, Özgür Çetin, Elizabeth Shriberg, James G. Fung, Nikki Mirghafori: Entropy Based Classifier Combination for Sentence Segmentation. ICASSP (4) 2007: 189-192
- [c24] Octavian Cheng, John Dines, Mathew Magimai-Doss: A Generalized Dynamic Composition Algorithm of Weighted Finite State Transducers for Large Vocabulary Speech Recognition. ICASSP (4) 2007: 345-348
- [c23] Karen Livescu, Özgür Çetin, Mark Hasegawa-Johnson, Simon King, Chris D. Bartels, Nash M. Borges, Arthur Kantor, Partha Lal, Lisa Yung, Ari Bezman, Stephen Dawson-Haggerty, Bronwyn Woods, Joe Frankel, Mathew Magimai-Doss, Kate Saenko: Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop. ICASSP (4) 2007: 621-624
- [c22] Özgür Çetin, Arthur Kantor, Simon King, Chris D. Bartels, Mathew Magimai-Doss, Joe Frankel, Karen Livescu: An Articulatory Feature-Based Tandem Approach and Factored Observation Modeling. ICASSP (4) 2007: 645-648
- [c21] Karen Livescu, Ari Bezman, Nash M. Borges, Lisa Yung, Özgür Çetin, Joe Frankel, Simon King, Mathew Magimai-Doss, Xuemin Chi, Lisa Lavoie: Manual Transcription of Conversational Speech at the Articulatory Feature Level. ICASSP (4) 2007: 953-956
- [c20] Evgeny Matusov, Dustin Hillard, Mathew Magimai-Doss, Dilek Hakkani-Tür, Mari Ostendorf, Hermann Ney: Improving speech translation with automatic boundary prediction. INTERSPEECH 2007: 2449-2452
- [c19] Joe Frankel, Mathew Magimai-Doss, Simon King, Karen Livescu, Özgür Çetin: Articulatory feature classifiers trained on 2000 hours of telephone speech. INTERSPEECH 2007: 2485-2488
- [c18] James G. Fung, Dilek Hakkani-Tür, Mathew Magimai-Doss, Elizabeth Shriberg, Sébastien Cuendet, Nikki Mirghafori: Cross-linguistic analysis of prosodic features for sentence segmentation. INTERSPEECH 2007: 2585-2588
- [c17] John Dines, Mathew Magimai-Doss: A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems. MLMI 2007: 215-226
- 2006
- [c16] Guillaume Lathoud, Mathew Magimai-Doss, Hervé Bourlard: Threshold Selection for Unsupervised Detection, With an Application to Microphone Arrays. ICASSP (3) 2006: 285-288
- [c15] Darren Moore, John Dines, Mathew Magimai-Doss, Jithendra Vepa, Octavian Cheng, Thomas Hain: Juicer: A Weighted Finite-State Transducer Speech Decoder. MLMI 2006: 285-296
- 2005
- [c14] Guillaume Lathoud, Mathew Magimai-Doss: A sector-based, frequency-domain approach to detection and localization of multiple speakers. ICASSP (3) 2005: 265-268
- [c13] Shajith Ikbal, Hervé Bourlard, Mathew Magimai-Doss: HMM/ANN Based Spectral Peak Location Estimation for Noise Robust Speech Recognition. ICASSP (1) 2005: 453-456
- [c12] Guillaume Lathoud, Mathew Magimai-Doss, Bertrand Mesot: A spectrogram model for enhanced source localization and noise-robust ASR. INTERSPEECH 2005: 2345-2348
- 2004
- [j1] Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard: Speech recognition with auxiliary information. IEEE Trans. Speech Audio Process. 12(3): 189-203 (2004)
- [c11] Mathew Magimai-Doss, Samy Bengio, Hervé Bourlard: Joint decoding for phoneme-grapheme continuous speech recognition. ICASSP (1) 2004: 177-180
- [c10] Mathew Magimai-Doss, Shajith Ikbal, Todd A. Stephenson, Hervé Bourlard: Modeling auxiliary features in tandem systems. INTERSPEECH 2004: 1501-1504
- [c9] Shajith Ikbal, Mathew Magimai-Doss, Hemant Misra, Hervé Bourlard: Spectro-temporal activity pattern (STAP) features for noise robust ASR. INTERSPEECH 2004: 2109-2112
- [c8] Mathew Magimai-Doss, Hervé Bourlard: On the Adequacy of Baseform Pronunciations and Pronunciation Variants. MLMI 2004: 209-222
- 2003
- [c7] Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard: Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks. ICASSP (1) 2003: 20-23
- [c6] B. Yegnanarayana, S. R. Mahadeva Prasanna, Mathew Magimai-Doss: Enhancement of speech in multispeaker environment. INTERSPEECH 2003: 581-584
- [c5] Mathew Magimai-Doss, Todd A. Stephenson, Hervé Bourlard: Using pitch frequency information in speech recognition. INTERSPEECH 2003: 2525-2528
- 2002
- [c4] Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard: Mixed Bayesian Networks with Auxiliary Variables for Automatic Speech Recognition. ICPR (4) 2002: 293-
- [c3] Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard: Auxiliary variables in conditional Gaussian mixtures for automatic speech recognition. INTERSPEECH 2002: 2665-2668
- [c2] Todd A. Stephenson, Jaume Escofet, Mathew Magimai-Doss, Hervé Bourlard: Dynamic Bayesian network based speech recognition with pitch and energy as auxiliary variables. NNSP 2002: 637-646
- 2001
- [c1] Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard: Modeling auxiliary information in Bayesian network based ASR. INTERSPEECH 2001: 2765-2768
last updated on 2024-10-23 20:35 CEST by the dblp team
all metadata released as open data under CC0 1.0 license