Jesús Villalba 0001
Person information
- affiliation: Johns Hopkins University, Center for Language and Speech Processing, Baltimore, MD, USA
- affiliation: University of Zaragoza, Spain
2020 – today
- 2024
- [j15]Deming Li, Ankur A. Butala, Laureano Moro-Velázquez, Trevor Meyer, Esther S. Oh, Chelsey Motley, Jesús Villalba, Najim Dehak:
Automating the analysis of eye movement for different neurodegenerative disorders. Comput. Biol. Medicine 170: 107951 (2024) - [j14]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Piotr Zelasko, Najim Dehak:
Time-Domain Speech Super-Resolution With GAN Based Modeling for Telephony Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1736-1749 (2024) - [j13]Magdalena Rybicka, Jesús Villalba, Thomas Thebaud, Najim Dehak, Konrad Kowalczyk:
End-to-End Neural Speaker Diarization With Non-Autoregressive Attractors. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3960-3973 (2024) - [j12]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Slowness Regularized Contrastive Predictive Coding for Acoustic Unit Discovery. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4277-4287 (2024) - [c92]Jesús Villalba:
Towards Speech Processing Robust to Adversarial Deceptions. Odyssey 2024 - [c91]Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak:
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification. Odyssey 2024: 165-171 - [c90]Anna Favaro, Najim Dehak, Thomas Thebaud, Jesús Villalba, Esther S. Oh, Laureano Moro-Velázquez:
Discovering Invariant Patterns of Cognitive Decline Via an Automated Analysis of the Cookie Thief Picture Description Task. Odyssey 2024: 201-208 - [i32]Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak:
Unraveling Adversarial Examples against Speaker Identification - Techniques for Attack Detection and Victim Model Classification. CoRR abs/2402.19355 (2024) - [i31]Henry Li Xinyuan, Sonal Joshi, Thomas Thebaud, Jesús Villalba, Najim Dehak, Sanjeev Khudanpur:
Clean Label Attacks against SLU Systems. CoRR abs/2409.08985 (2024) - 2023
- [j11]Anna Favaro, Yi-Ting Tsai, Ankur A. Butala, Thomas Thebaud, Jesús Villalba, Najim Dehak, Laureano Moro-Velázquez:
Interpretable speech features vs. DNN embeddings: What to use in the automatic assessment of Parkinson's disease in multi-lingual scenarios. Comput. Biol. Medicine 166: 107559 (2023) - [c89]Maliha Jahan, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak, Jesús Villalba:
Model-Based Fairness Metric for Speaker Verification. ASRU 2023: 1-7 - [c88]Martin Sustek, Sonal Joshi, Henry Li, Thomas Thebaud, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Joint Energy-Based Model for Robust Speech Classification System Against Dirty-Label Backdoor Poisoning Attacks. ASRU 2023: 1-8 - [c87]Thomas Thebaud, Sonal Joshi, Henry Li, Martin Sustek, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Clustering Unsupervised Representations as Defense Against Poisoning Attacks on Speech Commands Classification System. ASRU 2023: 1-8 - [c86]Saurabhchand Bhati, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Segmental SpeechCLIP: Utilizing Pretrained Image-text Models for Audio-Visual Learning. INTERSPEECH 2023: 431-435 - [c85]Jesús Villalba, Jonas Borgstrom, Maliha Jahan, Saurabh Kataria, Leibny Paola García, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Language Recognition in Low Resource African Languages: The JHU-MIT Submission for NIST LRE22. INTERSPEECH 2023: 521-525 - [c84]Helin Wang, Thomas Thebaud, Jesús Villalba, Myra Sydnor, Becky Lammers, Najim Dehak, Laureano Moro-Velázquez:
DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model. INTERSPEECH 2023: 1548-1552 - [c83]Anna Favaro, Tianyu Cao, Thomas Thebaud, Jesús Villalba, Ankur A. Butala, Najim Dehak, Laureano Moro-Velázquez:
Do Phonatory Features Display Robustness to Characterize Parkinsonian Speech Across Corpora? INTERSPEECH 2023: 2388-2392 - [c82]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition. INTERSPEECH 2023: 4688-4692 - [i30]Martin Sustek, Samik Sadhu, Lukás Burget, Hynek Hermansky, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Stabilized training of joint energy-based models and their practical applications. CoRR abs/2303.04187 (2023) - [i29]Saurabhchand Bhati, Jesús Villalba, Laureano Moro-Velázquez, Thomas Thebaud, Najim Dehak:
Leveraging Pretrained Image-text Models for Improving Audio-Visual Learning. CoRR abs/2309.04628 (2023) - 2022
- [j10]Jaejin Cho, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Non-Contrastive Self-Supervised Learning for Utterance-Level Information Extraction From Speech. IEEE J. Sel. Top. Signal Process. 16(6): 1284-1295 (2022) - [j9]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Unsupervised Speech Segmentation and Variable Rate Representation Learning Using Segmental Contrastive Predictive Coding. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2002-2014 (2022) - [c81]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification. INTERSPEECH 2022: 615-619 - [c80]Jaejin Cho, Raghavendra Pappagari, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Non-contrastive self-supervised learning of utterance-level speech representations. INTERSPEECH 2022: 4028-4032 - [c79]Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Defense against Adversarial Attacks on Hybrid Speech Recognition System using Adversarial Fine-tuning with Denoiser. INTERSPEECH 2022: 5035-5039 - [c78]Yiwen Shao, Jesús Villalba, Sonal Joshi, Saurabh Kataria, Sanjeev Khudanpur, Najim Dehak:
Chunking Defense for Adversarial Attacks on ASR. INTERSPEECH 2022: 5045-5049 - [c77]Sonal Joshi, Saurabh Kataria, Jesús Villalba, Najim Dehak:
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification. INTERSPEECH 2022: 5060-5064 - [c76]Magdalena Rybicka, Jesús Villalba, Najim Dehak, Konrad Kowalczyk:
End-to-End Neural Speaker Diarization with an Iterative Refinement of Non-Autoregressive Attention-based Attractors. INTERSPEECH 2022: 5090-5094 - [c75]Jesús Villalba, Bengt J. Borgstrom, Saurabh Kataria, Magdalena Rybicka, Carlos D. Castillo, Jaejin Cho, L. Paola García-Perera, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Cross-Lingual and Cross-Source Audio-Visual Speaker Recognition: The JHU-MIT System for NIST SRE21. Odyssey 2022: 213-220 - [c74]Jesús Villalba, Bengt J. Borgstrom, Saurabh Kataria, Jaejin Cho, Pedro A. Torres-Carrasquillo, Najim Dehak:
Advances in Speaker Recognition for Multilingual Conversational Telephone Speech: The JHU-MIT System for NIST SRE20 CTS Challenge. Odyssey 2022: 338-345 - [c73]Tianyu Cao, Laureano Moro-Velázquez, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Vsameter: Evaluation of a New Open-Source Tool to Measure Vowel Space Area and Related Metrics. SLT 2022: 517-524 - [c72]Anna Favaro, Chelsie Motley, Tianyu Cao, Miguel Iglesias, Ankur A. Butala, Esther S. Oh, Robert D. Stevens, Jesús Villalba, Najim Dehak, Laureano Moro-Velázquez:
A Multi-Modal Array of Interpretable Features to Evaluate Language and Speech Patterns in Different Neurological Disorders. SLT 2022: 532-539 - [i28]Saurabh Kataria, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker verification. CoRR abs/2203.16614 (2022) - [i27]Sonal Joshi, Saurabh Kataria, Jesús Villalba, Najim Dehak:
AdvEst: Adversarial Perturbation Estimation to Classify and Detect Adversarial Attacks against Speaker Identification. CoRR abs/2204.03848 (2022) - [i26]Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesús Villalba, Sanjeev Khudanpur, Najim Dehak:
Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser. CoRR abs/2204.03851 (2022) - [i25]Jaejin Cho, Raghavendra Pappagari, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Non-Contrastive Self-Supervised Learning of Utterance-Level Speech Representations. CoRR abs/2208.05413 (2022) - [i24]Jaejin Cho, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Non-Contrastive Self-supervised Learning for Utterance-Level Information Extraction from Speech. CoRR abs/2208.05445 (2022) - 2021
- [j8]Nanxin Chen, Shinji Watanabe, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Non-Autoregressive Transformer for Speech Recognition. IEEE Signal Process. Lett. 28: 121-125 (2021) - [j7]Sonal Joshi, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Study of Pre-Processing Defenses Against Adversarial Attacks on State-of-the-Art Speaker Recognition Systems. IEEE Trans. Inf. Forensics Secur. 16: 4811-4826 (2021) - [c71]Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Beyond Isolated Utterances: Conversational Emotion Recognition. ASRU 2021: 39-46 - [c70]Nanxin Chen, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Focus on the Present: A Regularization Method for the ASR Source-Target Attention Layer. ICASSP 2021: 5994-5998 - [c69]Raghavendra Pappagari, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
CopyPaste: An Augmentation Method for Speech Emotion Recognition. ICASSP 2021: 6324-6328 - [c68]Jaejin Cho, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Improving Reconstruction Loss Based Speaker Embedding in Unsupervised and Semi-Supervised Scenarios. ICASSP 2021: 6733-6737 - [c67]Saurabh Kataria, Jesús Villalba, Najim Dehak:
Perceptual Loss Based Speech Denoising with an Ensemble of Audio Pattern Recognition and Self-Supervised Models. ICASSP 2021: 7118-7122 - [c66]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation. Interspeech 2021: 366-370 - [c65]Magdalena Rybicka, Jesús Villalba, Piotr Zelasko, Najim Dehak, Konrad Kowalczyk:
Spine2Net: SpineNet with Res2Net and Time-Squeeze-and-Excitation Blocks for Speaker Recognition. Interspeech 2021: 496-500 - [c64]Saurabh Kataria, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Deep Feature CycleGANs: Speaker Identity Preserving Non-Parallel Microphone-Telephone Domain Adaptation for Speaker Verification. Interspeech 2021: 1079-1083 - [c63]Nanxin Chen, Piotr Zelasko, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Align-Denoise: Single-Pass Non-Autoregressive Speech Recognition. Interspeech 2021: 3770-3774 - [c62]Raghavendra Pappagari, Jaejin Cho, Sonal Joshi, Laureano Moro-Velázquez, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Automatic Detection and Assessment of Alzheimer Disease Using Speech and Language Technologies in Low-Resource Scenarios. Interspeech 2021: 3825-3829 - [c61]Jesús Villalba, Sonal Joshi, Piotr Zelasko, Najim Dehak:
Representation Learning to Classify and Detect Adversarial Attacks Against Speaker and Speech Recognition Systems. Interspeech 2021: 4304-4308 - [c60]Aviad Shtrosberg, Jesús Villalba, Najim Dehak, Azaria Cohen, Bar Ben-Yair:
Invariant Representation Learning for Robust Far-Field Speaker Recognition. SLSP 2021: 97-110 - [i23]Sonal Joshi, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Adversarial Attacks and Defenses for Speaker Identification Systems. CoRR abs/2101.08909 (2021) - [i22]Piotr Zelasko, Sonal Joshi, Yiwen Shao, Jesús Villalba, Jan Trmal, Najim Dehak, Sanjeev Khudanpur:
Adversarial Attacks and Defenses for Speech Recognition Systems. CoRR abs/2103.17122 (2021) - [i21]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Segmental Contrastive Predictive Coding for Unsupervised Word Segmentation. CoRR abs/2106.02170 (2021) - [i20]Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Laureano Moro-Velázquez, Najim Dehak:
Beyond Isolated Utterances: Conversational Emotion Recognition. CoRR abs/2109.06112 (2021) - [i19]Jaejin Cho, Jesús Villalba, Najim Dehak:
The JHU submission to VoxSRC-21: Track 3. CoRR abs/2109.13425 (2021) - [i18]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
Unsupervised Speech Segmentation and Variable Rate Representation Learning using Segmental Contrastive Predictive Coding. CoRR abs/2110.02345 (2021) - 2020
- [j6]Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Leibny Paola García-Perera, Fred Richardson, Réda Dehak, Pedro A. Torres-Carrasquillo, Najim Dehak:
State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and Speakers in the Wild evaluations. Comput. Speech Lang. 60 (2020) - [c59]Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
Using X-Vectors to Automatically Detect Parkinson's Disease from Speech. ICASSP 2020: 1155-1159 - [c58]Raghavendra Pappagari, Tianzi Wang, Jesús Villalba, Nanxin Chen, Najim Dehak:
X-Vectors Meet Emotions: A Study On Dependencies Between Emotion and Speaker Recognition. ICASSP 2020: 7169-7173 - [c57]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Nanxin Chen, L. Paola García-Perera, Najim Dehak:
Feature Enhancement with Deep Feature Losses for Speaker Verification. ICASSP 2020: 7584-7588 - [c56]Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, L. Paola García-Perera, Najim Dehak:
Unsupervised Feature Enhancement for Speaker Verification. ICASSP 2020: 7599-7603 - [c55]Jaejin Cho, Piotr Zelasko, Jesús Villalba, Shinji Watanabe, Najim Dehak:
Learning Speaker Embedding from Text-to-Speech. INTERSPEECH 2020: 3256-3260 - [c54]Jesús Villalba, Yuekai Zhang, Najim Dehak:
x-Vectors Meet Adversarial Attacks: Benchmarking Adversarial Robustness in Speaker Verification. INTERSPEECH 2020: 4233-4237 - [c53]Yuekai Zhang, Ziyan Jiang, Jesús Villalba, Najim Dehak:
Black-Box Attacks on Spoofing Countermeasures Using Transferability of Adversarial Examples. INTERSPEECH 2020: 4238-4242 - [c52]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery. INTERSPEECH 2020: 4876-4880 - [c51]Jesús Antonio Villalba López, Daniel Garcia-Romero, Nanxin Chen, Gregory Sell, Jonas Borgstrom, Alan McCree, Leibny Paola García-Perera, Saurabh Kataria, Phani Sankar Nidadavolu, Pedro Torres-Carrasquillo, Najim Dehak:
Advances in Speaker Recognition for Telephone and Audio-Visual Data: the JHU-MIT Submission for NIST SRE19. Odyssey 2020: 273-280 - [c50]Leibny Paola García-Perera, Jesús Villalba, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak:
Speaker Detection in the Wild: Lessons Learned from JSALT 2019. Odyssey 2020: 415-422 - [c49]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Analysis of Deep Feature Loss Based Enhancement for Speaker Verification. Odyssey 2020: 459-466 - [i17]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Analysis of Deep Feature Loss based Enhancement for Speaker Verification. CoRR abs/2002.00139 (2020) - [i16]Raghavendra Pappagari, Tianzi Wang, Jesús Villalba, Nanxin Chen, Najim Dehak:
x-vectors meet emotions: A study on dependencies between emotion and speaker recognition. CoRR abs/2002.05039 (2020) - [i15]Phani Sankar Nidadavolu, Saurabh Kataria, L. Paola García-Perera, Jesús Villalba, Najim Dehak:
Single Channel Far Field Feature Enhancement For Speaker Verification In The Wild. CoRR abs/2005.08331 (2020) - [i14]Saurabhchand Bhati, Jesús Villalba, Piotr Zelasko, Najim Dehak:
Self-Expressing Autoencoders for Unsupervised Spoken Term Discovery. CoRR abs/2007.13033 (2020) - [i13]Jaejin Cho, Piotr Zelasko, Jesús Villalba, Shinji Watanabe, Najim Dehak:
Learning Speaker Embedding from Text-to-Speech. CoRR abs/2010.11221 (2020) - [i12]Saurabh Kataria, Jesús Villalba, Najim Dehak:
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models. CoRR abs/2010.11860 (2020) - [i11]Raghavendra Pappagari, Jesús Villalba, Piotr Zelasko, Laureano Moro-Velázquez, Najim Dehak:
CopyPaste: An Augmentation Method for Speech Emotion Recognition. CoRR abs/2010.14602 (2020) - [i10]Nanxin Chen, Piotr Zelasko, Jesús Villalba, Najim Dehak:
Focus on the present: a regularization method for the ASR source-target attention layer. CoRR abs/2011.01210 (2020) - [i9]Desh Raj, Jesús Villalba, Daniel Povey, Sanjeev Khudanpur:
Frustratingly Easy Noise-aware Training of Acoustic Models. CoRR abs/2011.02090 (2020)
2010 – 2019
- 2019
- [j5]Laureano Moro-Velázquez, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Jesús Villalba, Jan Rusz, Stefanie Shattuck-Hufnagel, Najim Dehak:
A forced gaussians based methodology for the differential evaluation of Parkinson's Disease by means of speech processing. Biomed. Signal Process. Control. 48: 205-220 (2019) - [c48]Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, Najim Dehak:
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-Gans. ASRU 2019: 710-717 - [c47]Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Hierarchical Transformers for Long Document Classification. ASRU 2019: 838-844 - [c46]Saurabhchand Bhati, Chunxi Liu, Jesús Villalba, Jan Trmal, Sanjeev Khudanpur, Najim Dehak:
Bottom-Up Unsupervised Word Discovery via Acoustic Units. GlobalSIP 2019: 1-5 - [c45]Saurabhchand Bhati, Laureano Moro-Velázquez, Jesús Villalba, Najim Dehak:
LSTM Siamese Network for Parkinson's Disease Detection from Speech. GlobalSIP 2019: 1-5 - [c44]Phani Sankar Nidadavolu, Vicente Iglesias, Jesús Villalba, Najim Dehak:
Investigation on Neural Bandwidth Extension of Telephone Speech for Improved Speaker Recognition. ICASSP 2019: 6111-6115 - [c43]Jaejin Cho, Shinji Watanabe, Takaaki Hori, Murali Karthick Baskar, Hirofumi Inaguma, Jesús Villalba, Najim Dehak:
Language Model Integration Based on Memory Control for Sequence to Sequence Speech Recognition. ICASSP 2019: 6191-6195 - [c42]Phani Sankar Nidadavolu, Jesús Villalba, Najim Dehak:
Cycle-GANs for Domain Adaptation of Acoustic Features for Speaker Recognition. ICASSP 2019: 6206-6210 - [c41]Cheng-I Lai, Nanxin Chen, Jesús Villalba, Najim Dehak:
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual Networks. INTERSPEECH 2019: 1013-1017 - [c40]Jesús Villalba, Nanxin Chen, David Snyder, Daniel Garcia-Romero, Alan McCree, Gregory Sell, Jonas Borgstrom, Fred Richardson, Suwon Shon, François Grondin, Réda Dehak, Leibny Paola García-Perera, Daniel Povey, Pedro A. Torres-Carrasquillo, Sanjeev Khudanpur, Najim Dehak:
State-of-the-Art Speaker Recognition for Telephone and Video Speech: The JHU-MIT Submission for NIST SRE18. INTERSPEECH 2019: 1488-1492 - [c39]David Snyder, Jesús Villalba, Nanxin Chen, Daniel Povey, Gregory Sell, Najim Dehak, Sanjeev Khudanpur:
The JHU Speaker Recognition System for the VOiCES 2019 Challenge. INTERSPEECH 2019: 2468-2472 - [c38]Nanxin Chen, Jesús Villalba, Najim Dehak:
Tied Mixture of Factor Analyzers Layer to Combine Frame Level Representations in Neural Speaker Embeddings. INTERSPEECH 2019: 2948-2952 - [i8]Cheng-I Lai, Nanxin Chen, Jesús Villalba, Najim Dehak:
ASSERT: Anti-Spoofing with Squeeze-Excitation and Residual neTworks. CoRR abs/1904.01120 (2019) - [i7]Raghavendra Pappagari, Piotr Zelasko, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Hierarchical Transformers for Long Document Classification. CoRR abs/1910.10781 (2019) - [i6]Saurabh Kataria, Phani Sankar Nidadavolu, Jesús Villalba, Nanxin Chen, Paola García, Najim Dehak:
Feature Enhancement with Deep Feature Losses for Speaker Verification. CoRR abs/1910.11905 (2019) - [i5]Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, Najim Dehak:
Low-Resource Domain Adaptation for Speaker Recognition Using Cycle-GANs. CoRR abs/1910.11909 (2019) - [i4]Phani Sankar Nidadavolu, Saurabh Kataria, Jesús Villalba, L. Paola García-Perera, Najim Dehak:
Unsupervised Feature Enhancement for speaker verification. CoRR abs/1910.11915 (2019) - [i3]Nanxin Chen, Shinji Watanabe, Jesús Villalba, Najim Dehak:
Listen and Fill in the Missing Letters: Non-Autoregressive Transformer for Speech Recognition. CoRR abs/1911.04908 (2019) - [i2]Paola García, Jesús Villalba, Hervé Bredin, Jun Du, Diego Castán, Alejandrina Cristià, Latané Bullock, Ling Guo, Koji Okabe, Phani Sankar Nidadavolu, Saurabh Kataria, Sizhu Chen, Léo Galmant, Marvin Lavechin, Lei Sun, Marie-Philippe Gill, Bar Ben-Yair, Sajjad Abdoli, Xin Wang, Wassim Bouaziz, Hadrien Titeux, Emmanuel Dupoux, Kong Aik Lee, Najim Dehak:
Speaker detection in the wild: Lessons learned from JSALT 2019. CoRR abs/1912.00938 (2019) - 2018
- [j4]Laureano Moro-Velázquez, Jorge Andrés Gómez García, Juan Ignacio Godino-Llorente, Jesús Villalba, Juan Rafael Orozco-Arroyave, Najim Dehak:
Analysis of speaker recognition methodologies and the influence of kinetic changes to automatically detect Parkinson's Disease. Appl. Soft Comput. 62: 649-666 (2018) - [c37]Zili Huang, L. Paola García-Perera, Jesús Villalba, Daniel Povey, Najim Dehak:
JHU Diarization System Description. IberSPEECH 2018: 236-239 - [c36]Nanxin Chen, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Measuring Uncertainty in Deep Regression Models: The Case of Age Estimation from Speech. ICASSP 2018: 4939-4943 - [c35]Raghavendra Pappagari, Jesús Villalba, Najim Dehak:
Joint Verification-Identification in end-to-end Multi-Scale CNN Framework for Topic Identification. ICASSP 2018: 6199-6203 - [c34]Nanxin Chen, Jesús Villalba, Najim Dehak:
An Investigation of Non-linear i-vectors for Speaker Verification. INTERSPEECH 2018: 87-91 - [c33]Jaejin Cho, Raghavendra Pappagari, Purva Kulkarni, Jesús Villalba, Yishay Carmiel, Najim Dehak:
Deep Neural Networks for Emotion Recognition Combining Audio and Transcripts. INTERSPEECH 2018: 247-251 - [c32]Pegah Ghahremani, Phani Sankar Nidadavolu, Nanxin Chen, Jesús Villalba, Daniel Povey, Sanjeev Khudanpur, Najim Dehak:
End-to-end Deep Neural Network Age Estimation. INTERSPEECH 2018: 277-281 - [c31]Phani Sankar Nidadavolu, Cheng-I Lai, Jesús Villalba, Najim Dehak:
Investigation on Bandwidth Extension for Speaker Recognition. INTERSPEECH 2018: 1111-1115 - [c30]Peter Sibbern Frederiksen, Jesús Villalba, Shinji Watanabe, Zheng-Hua Tan, Najim Dehak:
Effectiveness of Single-Channel BLSTM Enhancement for Language Identification. INTERSPEECH 2018: 1823-1827 - [c29]Gregory Sell, David Snyder, Alan McCree, Daniel Garcia-Romero, Jesús Villalba, Matthew Maciejewski, Vimal Manohar, Najim Dehak, Daniel Povey, Shinji Watanabe, Sanjeev Khudanpur:
Diarization is Hard: Some Experiences and Lessons Learned for the JHU Team in the Inaugural DIHARD Challenge. INTERSPEECH 2018: 2808-2812 - [c28]Fred Richardson, Pedro A. Torres-Carrasquillo, Jonas Borgstrom, Douglas E. Sturim, Youngjune Gwon, Jesús Villalba, Jan Trmal, Nanxin Chen, Réda Dehak, Najim Dehak:
The MIT Lincoln Laboratory / JHU / EPITA-LSE LRE17 System. Odyssey 2018: 54-59 - [c27]Jesús Antonio Villalba López, Niko Brummer, Najim Dehak:
End-to-End versus Embedding Neural Networks for Language Recognition in Mismatched Conditions. Odyssey 2018: 112-119 - [i1]Jaejin Cho, Shinji Watanabe, Takaaki Hori, Murali Karthick Baskar, Hirofumi Inaguma, Jesús Villalba, Najim Dehak:
Language model integration based on memory control for sequence to sequence speech recognition. CoRR abs/1811.02162 (2018) - 2017
- [c26]Jesús Villalba, Niko Brümmer, Najim Dehak:
Tied Variational Autoencoder Backends for i-Vector Speaker Recognition. INTERSPEECH 2017: 1004-1008 - [c25]Ignacio Viñals, Alfonso Ortega, Jesús Antonio Villalba López, Antonio Miguel, Eduardo Lleida:
Domain Adaptation of PLDA Models in Broadcast Diarization by Means of Unsupervised Speaker Clustering. INTERSPEECH 2017: 2829-2833 - 2016
- [j3]Jesús Antonio Villalba López, Alfonso Ortega, Antonio Miguel, Eduardo Lleida:
Analysis of speech quality measures for the task of estimating the reliability of speaker verification decisions. Speech Commun. 78: 42-61 (2016) - [j2]Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida:
Bayesian Networks to Model the Variability of Speaker Verification Scores in Adverse Environments. IEEE ACM Trans. Audio Speech Lang. Process. 24(12): 2327-2340 (2016) - [c24]Ignacio Viñals, Jesús Antonio Villalba López, Alfonso Ortega, Antonio Miguel, Eduardo Lleida:
Bottleneck Based Front-End for Diarization Systems. IberSPEECH 2016: 276-286 - 2015
- [c23]Jesús Antonio Villalba López, Alfonso Ortega, Antonio Miguel, Eduardo Lleida:
Variational Bayesian PLDA for speaker diarization in the MGB challenge. ASRU 2015: 667-674 - [c22]Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida:
Spoofing detection with DNN and one-class SVM for the ASVspoof 2015 challenge. INTERSPEECH 2015: 2067-2071 - 2014
- [c21]David Martínez González, Jesús Antonio Villalba López, Eduardo Lleida-Solano, Alfonso Ortega Giménez:
Unsupervised Accent Modeling for Language Identification. IberSPEECH 2014: 49-58 - [c20]Jesús Antonio Villalba López, Eduardo Lleida:
Unsupervised Training of PLDA with Variational Bayes. IberSPEECH 2014: 69-78 - [c19]Jesús Antonio Villalba López, Eduardo Lleida:
Unsupervised adaptation of PLDA by using variational Bayes methods. ICASSP 2014: 744-748 - [c18]Antonio Miguel, Jesús Antonio Villalba López, Alfonso Ortega, Eduardo Lleida, Carlos Vaquero:
Factor analysis with sampling methods for text dependent speaker recognition. INTERSPEECH 2014: 1342-1346 - 2013
- [c17]Diego Castán, Alfonso Ortega, Jesús Antonio Villalba López, Antonio Miguel, Eduardo Lleida:
Segmentation-by-classification system based on factor analysis. ICASSP 2013: 783-787 - [c16]Jesús Antonio Villalba López, Eduardo Lleida:
Handling i-vectors from different recording conditions using multi-channel simplified PLDA in speaker recognition. ICASSP 2013: 6763-6767 - [c15]Jesús Antonio Villalba López, Mireia Díez, Amparo Varona, Eduardo Lleida:
Handling recordings acquired simultaneously over multiple channels with PLDA. INTERSPEECH 2013: 2509-2513 - [c14]Jesús Antonio Villalba López, Eduardo Lleida, Alfonso Ortega, Antonio Miguel:
A new Bayesian network to assess the reliability of speaker verification decisions. INTERSPEECH 2013: 3132-3136 - [c13]Jesús Antonio Villalba López, Eduardo Lleida, Alfonso Ortega, Antonio Miguel:
The I3a speaker recognition system for NIST SRE12: post-evaluation analysis. INTERSPEECH 2013: 3679-3683 - 2012
- [c12]Jesús Antonio Villalba López, Eduardo Lleida, Alfonso Ortega, Antonio Miguel:
Reliability Estimation of the Speaker Verification Decisions Using Bayesian Networks to Combine Information from Multiple Speech Quality Measures. IberSPEECH 2012: 1-10 - [c11]David Martínez González, Eduardo Lleida, Alfonso Ortega, Antonio Miguel, Jesús Antonio Villalba López:
Voice Pathology Detection on the Saarbrücken Voice Database with Calibration and Fusion of Scores Using MultiFocal Toolkit. IberSPEECH 2012: 99-109 - [c10]Luis Javier Rodríguez-Fuentes, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel, Alberto Abad, David Martínez González, Jesús Antonio Villalba López, Alfonso Ortega, Eduardo Lleida:
The BLZ Submission to the NIST 2011 LRE: Data Collection, System Development and Performance. INTERSPEECH 2012: 38-41 - [c9]Jesús Antonio Villalba López, Eduardo Lleida:
Bayesian adaptation of PLDA based speaker recognition to domains with scarce development data. Odyssey 2012: 47-54 - 2011
- [j1]Carlos Vaquero Avilés Casco, Jesús Antonio Villalba López, Alfonso Ortega Giménez, Eduardo Lleida-Solano:
Speaker Verification On Summed-Channel Conditions With Confidence Measures. Computación y Sistemas 15(1) (2011) - [c8]Luis Javier Rodríguez, Mikel Peñagarikano, Amparo Varona, Mireia Díez, Germán Bordel, David Martínez González, Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida, Alberto Abad, Oscar Koller, Isabel Trancoso, Paula Lopez-Otero, Laura Docío Fernández, Carmen García-Mateo, Rahim Saeidi, Mehdi Soufifar, Tomi Kinnunen, Torbjørn Svendsen, Pasi Fränti:
Multi-site heterogeneous system fusions for the Albayzin 2010 Language Recognition Evaluation. ASRU 2011: 377-382 - [c7]Jesús Antonio Villalba López, Eduardo Lleida:
Detecting Replay Attacks from Far-Field Recordings on Speaker Verification Systems. BIOID 2011: 274-285 - [c6]Jesús Villalba, Eduardo Lleida:
Preventing replay attacks on speaker verification systems. ICCST 2011: 1-8 - [c5]Diego Castán, Carlos Vaquero, Alfonso Ortega, David Martínez González, Jesús Antonio Villalba López, Eduardo Lleida:
Hierarchical Audio Segmentation with HMM and Factor Analysis in Broadcast News Domain. INTERSPEECH 2011: 421-424 - [c4]Jesús Antonio Villalba López, Niko Brümmer:
Towards Fully Bayesian Speaker Recognition: Integrating Out the Between-Speaker Covariance. INTERSPEECH 2011: 505-508 - [c3]David Martínez González, Jesús Antonio Villalba López, Antonio Miguel, Alfonso Ortega, Eduardo Lleida:
I3A Language Recognition System for Albayzin 2010 LRE. INTERSPEECH 2011: 849-852 - 2010
- [c2]Dayana Ribas, Jesús Antonio Villalba López, Eduardo Lleida, José R. Calvo:
Speaker Verification in Noisy Environment Using Missing Feature Approach. CIARP 2010: 220-227 - [c1]Carlos Vaquero, Alfonso Ortega, Jesús Antonio Villalba López, Antonio Miguel, Eduardo Lleida:
Confidence measures for speaker segmentation and their relation to speaker verification. INTERSPEECH 2010: 2310-2313
last updated on 2024-10-23 20:31 CEST by the dblp team
all metadata released as open data under CC0 1.0 license