default search action
Pedro J. Moreno 0001
Person information
- affiliation: Google. Inc., Mountain View, CA, USA
Other persons with the same name
- Pedro J. Moreno 0002 — University of Cádiz, Agrifood Excellence International Campus, Spain
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c91]Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. ICASSP 2024: 11816-11820 - [c90]Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara N. Sainath, Françoise Beaufays, Pedro Moreno Mengibar:
Improving Speech Recognition for African American English with Audio Classification. ICASSP 2024: 12356-12360 - [c89]Weiran Wang, Rohit Prabhavalkar, Haozhe Shan, Zhong Meng, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Chengjian Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Speech Recognition Models with Time Reduction. NAACL-HLT 2024: 6206-6217 - [i23]Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. CoRR abs/2402.17184 (2024) - [i22]Tsendsuren Munkhdalai, Youzheng Chen, Khe Chai Sim, Fadi Biadsy, Tara N. Sainath, Pedro Moreno Mengibar:
Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models. CoRR abs/2403.19709 (2024) - [i21]Dongseong Hwang, Weiran Wang, Zhuoyuan Huo, Khe Chai Sim, Pedro Moreno Mengibar:
TransformerFAM: Feedback attention is working memory. CoRR abs/2404.09173 (2024) - 2023
- [c88]Hillary Ngai, Rohan Agrawal, Neeraj Gaur, W. Ronny Huang, Parisa Haghani, Pedro Moreno Mengibar:
Audio-Adapterfusion: A Task-Id-Free Approach for Efficient and Non-Destructive Multi-Task Speech Recognition. ASRU 2023: 1-8 - [c87]Kartik Audhkhasi, Brian Farris, Bhuvana Ramabhadran, Pedro J. Moreno:
Modular Conformer Training for Flexible End-to-End ASR. ICASSP 2023: 1-5 - [c86]Tongzhou Chen, Cyril Allauzen, Yinghui Huang, Daniel S. Park, David Rybach, W. Ronny Huang, Rodrigo Cabrera, Kartik Audhkhasi, Bhuvana Ramabhadran, Pedro J. Moreno, Michael Riley:
Large-Scale Language Model Rescoring on Long-Form Data. ICASSP 2023: 1-5 - [c85]Zhouyuan Huo, Khe Chai Sim, Dongseong Hwang, Tsendsuren Munkhdalai, Tara N. Sainath, Pedro Moreno Mengibar:
Re-investigating the Efficient Transfer Learning of Speech Foundation Model using Feature Fusion Methods. INTERSPEECH 2023: 556-560 - [c84]Qiujia Li, Bo Li, Dongseong Hwang, Tara N. Sainath, Pedro Moreno Mengibar:
Modular Domain Adaptation for Conformer-Based Streaming ASR. INTERSPEECH 2023: 3357-3361 - [i20]Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara N. Sainath, Pedro J. Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu:
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages. CoRR abs/2303.01037 (2023) - [i19]Qiujia Li, Bo Li, Dongseong Hwang, Tara N. Sainath, Pedro Moreno Mengibar:
Modular Domain Adaptation for Conformer-Based Streaming ASR. CoRR abs/2305.13408 (2023) - [i18]Tongzhou Chen, Cyril Allauzen, Yinghui Huang, Daniel S. Park, David Rybach, W. Ronny Huang, Rodrigo Cabrera, Kartik Audhkhasi, Bhuvana Ramabhadran, Pedro J. Moreno, Michael Riley:
Large-scale Language Model Rescoring on Long-form Data. CoRR abs/2306.08133 (2023) - [i17]Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara N. Sainath, Françoise Beaufays, Pedro Moreno Mengibar:
Improving Speech Recognition for African American English With Audio Classification. CoRR abs/2309.09996 (2023) - [i16]Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Models for Short Search Queries. CoRR abs/2309.12963 (2023) - [i15]Weiran Wang, Zelin Wu, Diamantino Caseiro, Tsendsuren Munkhdalai, Khe Chai Sim, Pat Rondon, Golan Pundak, Gan Song, Rohit Prabhavalkar, Zhong Meng, Ding Zhao, Tara N. Sainath, Pedro Moreno Mengibar:
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm. CoRR abs/2310.00178 (2023) - [i14]Hillary Ngai, Rohan Agrawal, Neeraj Gaur, W. Ronny Huang, Parisa Haghani, Pedro Moreno Mengibar:
Audio-AdapterFusion: A Task-ID-free Approach for Efficient and Non-Destructive Multi-task Speech Recognition. CoRR abs/2310.13015 (2023) - 2022
- [j13]Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Pedro J. Moreno:
Ask2Mask: Guided Data Selection for Masked Speech Modeling. IEEE J. Sel. Top. Signal Process. 16(6): 1357-1366 (2022) - [c83]Neeraj Gaur, Tongzhou Chen, Ehsan Variani, Parisa Haghani, Bhuvana Ramabhadran, Pedro J. Moreno:
Multilingual Second-Pass Rescoring for Automatic Speech Recognition Systems. ICASSP 2022: 6407-6411 - [c82]Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro J. Moreno, Gary Wang:
Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses. ICASSP 2022: 7677-7681 - [c81]Kartik Audhkhasi, Yinghui Huang, Bhuvana Ramabhadran, Pedro J. Moreno:
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition. INTERSPEECH 2022: 1026-1030 - [c80]Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Jesse Emond, Yinghui Huang, Pedro J. Moreno:
Non-Parallel Voice Conversion for ASR Augmentation. INTERSPEECH 2022: 3408-3412 - [c79]Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro J. Moreno, Ankur Bapna, Heiga Zen:
MAESTRO: Matched Speech Text Representations through Modality Matching. INTERSPEECH 2022: 4093-4097 - [c78]Fadi Biadsy, Youzheng Chen, Xia Zhang, Oleg Rybakov, Andrew Rosenberg, Pedro J. Moreno:
A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization. INTERSPEECH 2022: 5125-5129 - [c77]Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park:
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR. SLT 2022: 23-30 - [c76]Zhehuai Chen, Ankur Bapna, Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Pedro J. Moreno, Nanxin Chen:
Maestro-U: Leveraging Joint Speech-Text Representation Learning for Zero Supervised Speech ASR. SLT 2022: 68-75 - [c75]Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. SLT 2022: 197-204 - [i13]Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Pedro J. Moreno:
Ask2Mask: Guided Data Selection for Masked Speech Modeling. CoRR abs/2202.12719 (2022) - [i12]Fadi Biadsy, Youzheng Chen, Xia Zhang, Oleg Rybakov, Andrew Rosenberg, Pedro J. Moreno:
A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization. CoRR abs/2203.12559 (2022) - [i11]Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro J. Moreno, Ankur Bapna, Heiga Zen:
MAESTRO: Matched Speech Text Representations through Modality Matching. CoRR abs/2204.03409 (2022) - [i10]Kartik Audhkhasi, Yinghui Huang, Bhuvana Ramabhadran, Pedro J. Moreno:
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition. CoRR abs/2209.06096 (2022) - [i9]Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Yinghui Huang, Jesse Emond, Pedro Moreno Mengibar:
Non-Parallel Voice Conversion for ASR Augmentation. CoRR abs/2209.06987 (2022) - [i8]Zhehuai Chen, Ankur Bapna, Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Pedro J. Moreno, Nanxin Chen:
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR. CoRR abs/2210.10027 (2022) - [i7]Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park:
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR. CoRR abs/2210.10879 (2022) - [i6]Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. CoRR abs/2210.17049 (2022) - 2021
- [c74]Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro J. Moreno:
Injecting Text in Self-Supervised Speech Pretraining. ASRU 2021: 251-258 - [c73]Neeraj Gaur, Brian Farris, Parisa Haghani, Isabel Leal, Pedro J. Moreno, Manasa Prasad, Bhuvana Ramabhadran, Yun Zhu:
Mixture of Informed Experts for Multilingual Speech Recognition. ICASSP 2021: 6234-6238 - [c72]Rohan Doshi, Youzheng Chen, Liyang Jiang, Xia Zhang, Fadi Biadsy, Bhuvana Ramabhadran, Fang Chu, Andrew Rosenberg, Pedro J. Moreno:
Extending Parrotron: An End-to-End, Speech Conversion and Speech Recognition Model for Atypical Speech. ICASSP 2021: 6988-6992 - [c71]Zhehuai Chen, Andrew Rosenberg, Yu Zhang, Heiga Zen, Mohammadreza Ghodsi, Yinghui Huang, Jesse Emond, Gary Wang, Bhuvana Ramabhadran, Pedro J. Moreno:
Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation. Interspeech 2021: 736-740 - [c70]Kartik Audhkhasi, Tongzhou Chen, Bhuvana Ramabhadran, Pedro J. Moreno:
Mixture Model Attention: Flexible Streaming and Non-Streaming Automatic Speech Recognition. Interspeech 2021: 1812-1816 - [c69]Isabel Leal, Neeraj Gaur, Parisa Haghani, Brian Farris, Pedro J. Moreno, Manasa Prasad, Bhuvana Ramabhadran, Yun Zhu:
Self-Adaptive Distillation for Multilingual Speech Recognition: Leveraging Student Independence. Interspeech 2021: 2556-2560 - [c68]Zhehuai Chen, Bhuvana Ramabhadran, Fadi Biadsy, Xia Zhang, Youzheng Chen, Liyang Jiang, Fang Chu, Rohan Doshi, Pedro J. Moreno:
Conformer Parrotron: A Faster and Stronger End-to-End Speech Conversion and Recognition Model for Atypical Speech. Interspeech 2021: 4828-4832 - [i5]Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro J. Moreno:
Injecting Text in Self-Supervised Speech Pretraining. CoRR abs/2108.12226 (2021) - 2020
- [c67]Gary Wang, Andrew Rosenberg, Zhehuai Chen, Yu Zhang, Bhuvana Ramabhadran, Yonghui Wu, Pedro J. Moreno:
Improving Speech Recognition Using Consistent Predictions on Synthesized Speech. ICASSP 2020: 7029-7033 - [c66]Ehsan Variani, Tongzhou Chen, James Apfel, Bhuvana Ramabhadran, Seungji Lee, Pedro J. Moreno:
Neural Oracle Search on N-BEST Hypotheses. ICASSP 2020: 7824-7828 - [c65]Zhehuai Chen, Andrew Rosenberg, Yu Zhang, Gary Wang, Bhuvana Ramabhadran, Pedro J. Moreno:
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection. INTERSPEECH 2020: 556-560 - [c64]Gary Wang, Andrew Rosenberg, Zhehuai Chen, Yu Zhang, Bhuvana Ramabhadran, Pedro J. Moreno:
SCADA: Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR. INTERSPEECH 2020: 2832-2836 - [c63]Yun Zhu, Parisa Haghani, Anshuman Tripathi, Bhuvana Ramabhadran, Brian Farris, Hainan Xu, Han Lu, Hasim Sak, Isabel Leal, Neeraj Gaur, Pedro J. Moreno, Qian Zhang:
Multilingual Speech Recognition with Self-Attention Structured Parameterization. INTERSPEECH 2020: 4741-4745
2010 – 2019
- 2019
- [c62]Austin Waters, Neeraj Gaur, Parisa Haghani, Pedro J. Moreno, Zhongdi Qu:
Leveraging Language ID in Multilingual End-to-End Speech Recognition. ASRU 2019: 928-935 - [c61]Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro J. Moreno, Yonghui Wu, Zelin Wu:
Speech Recognition with Augmented Synthesized Speech. ASRU 2019: 996-1002 - [c60]Fadi Biadsy, Ron J. Weiss, Pedro J. Moreno, Dimitri Kanvesky, Ye Jia:
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. INTERSPEECH 2019: 4115-4119 - [i4]Fadi Biadsy, Ron J. Weiss, Pedro J. Moreno, Dimitri Kanvesky, Ye Jia:
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. CoRR abs/1904.04169 (2019) - [i3]Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro J. Moreno, Yonghui Wu, Zelin Wu:
Speech Recognition with Augmented Synthesized Speech. CoRR abs/1909.11699 (2019) - 2018
- [c59]Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition with a Single End-to-End Model. ICASSP 2018: 4904-4908 - [c58]Asa Oines, Eugene Weinstein, Pedro J. Moreno:
Hybrid Lstm-Fsmn Networks for Acoustic Modeling. ICASSP 2018: 5844-5848 - [c57]Min Ma, Shankar Kumar, Fadi Biadsy, Michael Nirschl, Tomas Vykruta, Pedro J. Moreno:
Modeling Non-Linguistic Contextual Signals in LSTM Language Models Via Domain Adaptation. ICASSP 2018: 6094-6098 - [c56]Leonid Velikovich, Ian Williams, Justin Scheiner, Petar S. Aleksic, Pedro J. Moreno, Michael Riley:
Semantic Lattice Processing in Contextual Automatic Speech Recognition for Google Assistant. INTERSPEECH 2018: 2222-2226 - [c55]Jesse Emond, Bhuvana Ramabhadran, Brian Roark, Pedro J. Moreno, Min Ma:
Transliteration Based Approaches to Improve Code-Switched Speech Recognition Performance. SLT 2018: 448-455 - [c54]Parisa Haghani, Arun Narayanan, Michiel Bacchiani, Galen Chuang, Neeraj Gaur, Pedro J. Moreno, Rohit Prabhavalkar, Zhongdi Qu, Austin Waters:
From Audio to Semantics: Approaches to End-to-End Spoken Language Understanding. SLT 2018: 720-726 - [i2]Parisa Haghani, Arun Narayanan, Michiel Bacchiani, Galen Chuang, Neeraj Gaur, Pedro J. Moreno, Rohit Prabhavalkar, Zhongdi Qu, Austin Waters:
From Audio to Semantics: Approaches to end-to-end spoken language understanding. CoRR abs/1809.09190 (2018) - 2017
- [c53]Zhongdi Qu, Parisa Haghani, Eugene Weinstein, Pedro J. Moreno:
Syllable-based acoustic modeling with CTC-SMBR-LSTM. ASRU 2017: 173-177 - [p1]Michiel Bacchiani, Françoise Beaufays, Alexander Gruenstein, Pedro J. Moreno, Johan Schalkwyk, Trevor Strohman, Heiga Zen:
Speech Research at Google to Enable Universal Speech Interfaces. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 385-399 - [i1]Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition With A Single End-To-End Model. CoRR abs/1711.01694 (2017) - 2016
- [j12]Ignacio López-Moreno, Javier Gonzalez-Dominguez, David Martinez, Oldrich Plchot, Joaquin Gonzalez-Rodriguez, Pedro J. Moreno:
On the use of deep feedforward neural networks for automatic language identification. Comput. Speech Lang. 40: 46-59 (2016) - [c52]Victor Soto, Olivier Siohan, Mohamed Elfeky, Pedro J. Moreno:
Selection and combination of hypotheses for dialectal speech recognition. ICASSP 2016: 5845-5849 - [c51]Felix de Chaumont Quitry, Asa Oines, Pedro J. Moreno, Eugene Weinstein:
High quality agreement-based semi-supervised training data for acoustic modeling. SLT 2016: 592-596 - [c50]Mohamed Elfeky, Meysam Bastani, Xavier Velez, Pedro J. Moreno, Austin Waters:
Towards acoustic model unification across dialects. SLT 2016: 624-628 - 2015
- [j11]Javier Gonzalez-Dominguez, David Eustis, Ignacio López-Moreno, Andrew W. Senior, Françoise Beaufays, Pedro J. Moreno:
A Real-Time End-to-End Multilingual Speech Recognition Architecture. IEEE J. Sel. Top. Signal Process. 9(4): 749-759 (2015) - [j10]Javier Gonzalez-Dominguez, Ignacio López-Moreno, Pedro J. Moreno, Joaquín González-Rodríguez:
Frame-by-frame language identification in short utterances using deep neural networks. Neural Networks 64: 49-58 (2015) - [c49]Petar S. Aleksic, Cyril Allauzen, David Elson, Aleksandar Kracun, Diego Melendo Casado, Pedro J. Moreno:
Improved recognition of contact names in voice commands. ICASSP 2015: 5172-5175 - [c48]Mohamed G. Elfeky, Pedro J. Moreno, Victor Soto:
Multi-Dialectical Languages Effect on Speech Recognition: Too Much Choice Can Hurt. ICNLSP 2015: 1-8 - [c47]Petar S. Aleksic, Mohammadreza Ghodsi, Assaf Hurwitz Michaely, Cyril Allauzen, Keith B. Hall, Brian Roark, David Rybach, Pedro J. Moreno:
Bringing contextual information to google speech recognition. INTERSPEECH 2015: 468-472 - 2014
- [c46]Ignacio López-Moreno, Javier Gonzalez-Dominguez, Oldrich Plchot, David Martinez, Joaquin Gonzalez-Rodriguez, Pedro J. Moreno:
Automatic language identification using deep neural networks. ICASSP 2014: 5337-5341 - [c45]Erik McDermott, Georg Heigold, Pedro J. Moreno, Andrew W. Senior, Michiel Bacchiani:
Asynchronous stochastic optimization for sequence training of deep neural networks: towards big data. INTERSPEECH 2014: 1224-1228 - [c44]Olga Kapralova, John Alex, Eugene Weinstein, Pedro J. Moreno, Olivier Siohan:
A big data approach to acoustic model training corpus selection. INTERSPEECH 2014: 2083-2087 - [c43]Javier Gonzalez-Dominguez, Ignacio López-Moreno, Hasim Sak, Joaquin Gonzalez-Rodriguez, Pedro J. Moreno:
Automatic language identification using long short-term memory recurrent neural networks. INTERSPEECH 2014: 2155-2159 - [c42]Fadi Biadsy, Keith B. Hall, Pedro J. Moreno, Brian Roark:
Backoff inspired features for maximum entropy language models. INTERSPEECH 2014: 2645-2649 - 2012
- [c41]Fadi Biadsy, Pedro J. Moreno, Martin Jansche:
Google's cross-dialect Arabic voice search. ICASSP 2012: 4441-4444 - 2011
- [c40]Yun-Hsuan Sung, Martin Jansche, Pedro J. Moreno:
Deploying Google Search by Voice in Cantonese. INTERSPEECH 2011: 2865-2868 - 2010
- [j9]Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
Efficient and Robust Music Identification With Weighted Finite-State Transducers. IEEE Trans. Speech Audio Process. 18(1): 197-207 (2010) - [c39]Etienne Barnard, Johan Schalkwyk, Charl Johannes van Heerden, Pedro J. Moreno:
Voice search for development. INTERSPEECH 2010: 282-285 - [c38]Jiulong Shan, Genqing Wu, Zhihong Hu, Xiliu Tang, Martin Jansche, Pedro J. Moreno:
Search by voice in Mandarin Chinese. INTERSPEECH 2010: 354-357 - [c37]Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu, Pedro J. Moreno, Mike LeBeau:
Building transcribed speech corpora quickly and cheaply for many languages. INTERSPEECH 2010: 1914-1917 - [c36]Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
Discriminative Topic Segmentation of Text and Speech. AISTATS 2010: 533-540
2000 – 2009
- 2009
- [j8]Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
General suffix automaton construction algorithm and space bounds. Theor. Comput. Sci. 410(37): 3553-3562 (2009) - [c35]Mehmet Emre Sargin, Hrishikesh B. Aradhye, Pedro J. Moreno, Ming Zhao:
Audiovisual celebrity recognition in unconstrained web videos. ICASSP 2009: 1977-1980 - [c34]Pedro J. Moreno, Christopher Alberti:
A factor automaton approach for the forced alignment of long speech recordings. ICASSP 2009: 4869-4872 - [c33]Christopher Alberti, Michiel Bacchiani, Ari Bezman, Ciprian Chelba, Anastassia Drofa, Hank Liao, Pedro J. Moreno, Ted Power, Arnaud Sahuguet, Maria Shugrina, Olivier Siohan:
An audio indexing system for election video material. ICASSP 2009: 4873-4876 - [c32]Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
A new quality measure for topic segmentation of text and speech. INTERSPEECH 2009: 2743-2746 - 2007
- [j7]Gustavo Carneiro, Antoni B. Chan, Pedro J. Moreno, Nuno Vasconcelos:
Supervised Learning of Semantic Classes for Image Annotation and Retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 29(3): 394-410 (2007) - [j6]Nikhil Rasiwasia, Pedro J. Moreno, Nuno Vasconcelos:
Bridging the Gap: Query by Semantic Example. IEEE Trans. Multim. 9(5): 923-938 (2007) - [c31]Eugene Weinstein, Pedro J. Moreno:
Music Identification with Weighted Finite-State Transducers. ICASSP (2) 2007: 689-692 - [c30]Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
Robust Music Identification, Detection, and Analysis. ISMIR 2007: 135-138 - [c29]Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
Factor Automata of Automata and Applications. CIAA 2007: 168-179 - 2006
- [c28]Nikhil Rasiwasia, Nuno Vasconcelos, Pedro J. Moreno:
Query by Semantic Example. CIVR 2006: 51-60 - 2005
- [j5]Beth Logan, Jean-Manuel Van Thong, Pedro J. Moreno:
Approaches to reduce the effects of OOV queries on indexed spoken audio. IEEE Trans. Multim. 7(5): 899-906 (2005) - 2004
- [c27]Nuno Vasconcelos, Purdy Ho, Pedro J. Moreno:
The Kullback-Leibler Kernel as a Framework for Discriminant and Localized Representations for Visual Recognition. ECCV (3) 2004: 430-441 - [c26]Beth Logan, A. Kositsky, Pedro J. Moreno:
Semantic analysis of song lyrics. ICME 2004: 827-830 - [c25]J. Marston, G. MacCarthy, Beth Logan, Pedro J. Moreno, Jean-Manuel Van Thong:
News Tuner: a simple interface for searching and browsing radio archives. ICME 2004: 1531-1534 - [c24]Purdy Ho, Pedro J. Moreno:
SVM kernel adaptation in speaker classification and verification. INTERSPEECH 2004: 1413-1416 - 2003
- [c23]Pedro J. Moreno, Purdy Ho:
A new SVM approach to speaker identification and verification using probabilistic distance kernels. INTERSPEECH 2003: 2965-2968 - [c22]Pedro J. Moreno, Purdy Ho, Nuno Vasconcelos:
A Kullback-Leibler Divergence Based Kernel for SVM Classification in Multimedia Applications. NIPS 2003: 1385-1392 - 2002
- [j4]Pedro J. Moreno, Jean-Manuel Van Thong, Beth Logan, Gareth J. F. Jones:
From Multimedia Retrieval to Knowledge Management. Computer 35(4): 58-66 (2002) - [j3]Jean-Manuel Van Thong, Pedro J. Moreno, Beth Logan, Blair Fidler, K. Maffey, M. Moores:
Speechbot: an experimental speech-based search engine for multimedia content on the web. IEEE Trans. Multim. 4(1): 88-96 (2002) - 2001
- [c21]Pedro J. Moreno, Beth Logan, Bhiksha Raj:
A boosting approach for confidence scoring. INTERSPEECH 2001: 2109-2112 - [c20]David M. Blei, Pedro J. Moreno:
Topic Segmentation with an Aspect Hidden Markov Model. SIGIR 2001: 343-348 - 2000
- [c19]Pedro J. Moreno, Ryan Rifkin:
Using the Fisher kernel method for Web audio classification. ICASSP 2000: 2417-2420 - [c18]Beth Logan, Pedro J. Moreno, Jean-Manuel Van Thong, Edward W. D. Whittaker:
An experimental study of an audio indexing system for the web. INTERSPEECH 2000: 676-679 - [c17]David Goddeau, Anna Litvinova, Beth Logan, Pedro J. Moreno, Michael J. Swain, Jean-Manuel Van Thong:
SpeechBot: a Speech Recognition based Audio Indexing System for the Web. RIAO 2000: 106-115
1990 – 1999
- 1999
- [c16]Philip Clarkson, Pedro J. Moreno:
On the use of support vector machines for phonetic classification. ICASSP 1999: 585-588 - [c15]Brian S. Eberman, Blair Fidler, Robert A. Iannucci, Christopher F. Joerg, Leonidas I. Kontothanassis, David E. Kovalcin, Pedro J. Moreno, Michael J. Swain, Jean-Manuel Van Thong:
Indexing Multimedia for the Internet. VISUAL 1999: 195-202 - 1998
- [j2]Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
Data-driven environmental compensation for speech recognition: A unified approach. Speech Commun. 24(4): 267-285 (1998) - [c14]Beth Logan, Pedro J. Moreno:
Factorial HMMs for acoustic modeling. ICASSP 1998: 813-816 - [c13]Pedro J. Moreno, Christopher F. Joerg, Jean-Manuel Van Thong, Oren Glickman:
A recursive algorithm for the forced alignment of very long audio segments. ICSLP 1998 - 1997
- [c12]Brian S. Eberman, Pedro J. Moreno:
Delta vector taylor series environment compensation for speaker recognition. EUROSPEECH 1997: 2335-2338 - [c11]Pedro J. Moreno, Brian S. Eberman:
A new algorithm for robust speech recognition: the delta vector taylor series approach. EUROSPEECH 1997: 2599-2602 - 1996
- [c10]Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
A vector Taylor series approach for environment-independent speech recognition. ICASSP 1996: 733-736 - [c9]Bhiksha Raj, Evandro Bacci Gouvêa, Pedro J. Moreno, Richard M. Stern:
Cepstral compensation by polynomial approximation for environment-independent speech recognition. ICSLP 1996: 2340-2343 - 1995
- [c8]Pedro J. Moreno, Bhiksha Raj, Evandro B. Gouvêa, Richard M. Stern:
Multivariate-Gaussian-based cepstral normalization for robust speech recognition. ICASSP 1995: 137-140 - [c7]Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
A unified approach for robust speech recognition. EUROSPEECH 1995: 481-484 - 1994
- [c6]Fu-Hua Liu, Richard M. Stern, Alejandro Acero, Pedro J. Moreno:
Environment normalization for robust speech recognition using direct cepstral comparison. ICASSP (2) 1994: 61-64 - [c5]Pedro J. Moreno, Richard M. Stern:
Sources of degradation of speech recognition in the telephone network. ICASSP (1) 1994: 109-112 - [c4]Richard M. Stern, Fu-Hua Liu, Pedro J. Moreno, Alejandro Acero:
Signal processing for robust speech recognition. ICSLP 1994: 1027-1030 - [c3]Fu-Hua Liu, Pedro J. Moreno, Richard M. Stern, Alejandro Acero:
Signal Processing for Robust Speech Recognition. HLT 1994 - 1992
- [j1]David B. Roe, Pedro J. Moreno, Richard Sproat, Fernando C. N. Pereira, Michael Riley, Alejandro Macarrón:
A spoken language translator for restricted-domain context-free languages. Speech Commun. 11(2-3): 311-319 (1992) - [c2]David B. Roe, Fernando C. N. Pereira, Richard Sproat, Michael D. Riley, Pedro J. Moreno, Alejandro Macarrón:
Efficient grammar processing for a spoken language translation system. ICASSP 1992: 213-216 - 1991
- [c1]David B. Roe, Fernando Pereira, Richard Sproat, Michael D. Riley, Pedro J. Moreno, Alejandro Macarrón:
Toward a spoken language translator for restricted-domain context-free languages. EUROSPEECH 1991: 1063-1066
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:16 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint