default search action

combined dblp search
author search
venue search
publication search

ask others

Pedro J. Moreno 0001

Pedro Moreno Mengibar

> Home > Persons

Person information

affiliation: Google. Inc., Mountain View, CA, USA

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c91]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PrabhavalkarMWS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PrabhavalkarMWS24
Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. ICASSP 2024: 11816-11820
[c90]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GargHSSCAMKWMHS24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GargHSSCAMKWMHS24
Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara N. Sainath, Françoise Beaufays, Pedro Moreno Mengibar:
Improving Speech Recognition for African American English with Audio Classification. ICASSP 2024: 12356-12360
[c89]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/naacl/WangPSMHLS0QCSZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/WangPSMHLS0QCSZ24
Weiran Wang, Rohit Prabhavalkar, Haozhe Shan, Zhong Meng, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Chengjian Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Speech Recognition Models with Time Reduction. NAACL-HLT 2024: 6206-6217
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-17184
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-17184
Rohit Prabhavalkar, Zhong Meng, Weiran Wang, Adam Stooke, Xingyu Cai, Yanzhang He, Arun Narayanan, Dongseong Hwang, Tara N. Sainath, Pedro J. Moreno:
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models. CoRR abs/2402.17184 (2024)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-19709
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-19709
Tsendsuren Munkhdalai, Youzheng Chen, Khe Chai Sim, Fadi Biadsy, Tara N. Sainath, Pedro Moreno Mengibar:
Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models. CoRR abs/2403.19709 (2024)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-09173
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-09173
Dongseong Hwang, Weiran Wang, Zhuoyuan Huo, Khe Chai Sim, Pedro Moreno Mengibar:
TransformerFAM: Feedback attention is working memory. CoRR abs/2404.09173 (2024)
2023
[c88]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/NgaiAGHHM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/NgaiAGHHM23
Hillary Ngai, Rohan Agrawal, Neeraj Gaur, W. Ronny Huang, Parisa Haghani, Pedro Moreno Mengibar:
Audio-Adapterfusion: A Task-Id-Free Approach for Efficient and Non-Destructive Multi-Task Speech Recognition. ASRU 2023: 1-8
[c87]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AudhkhasiFRM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AudhkhasiFRM23
Kartik Audhkhasi, Brian Farris, Bhuvana Ramabhadran, Pedro J. Moreno:
Modular Conformer Training for Flexible End-to-End ASR. ICASSP 2023: 1-5
[c86]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenAHPRHCARMR23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenAHPRHCARMR23
Tongzhou Chen, Cyril Allauzen, Yinghui Huang, Daniel S. Park, David Rybach, W. Ronny Huang, Rodrigo Cabrera, Kartik Audhkhasi, Bhuvana Ramabhadran, Pedro J. Moreno, Michael Riley:
Large-Scale Language Model Rescoring on Long-Form Data. ICASSP 2023: 1-5
[c85]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuoSHMSM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuoSHMSM23
Zhouyuan Huo, Khe Chai Sim, Dongseong Hwang, Tsendsuren Munkhdalai, Tara N. Sainath, Pedro Moreno Mengibar:
Re-investigating the Efficient Transfer Learning of Speech Foundation Model using Feature Fusion Methods. INTERSPEECH 2023: 556-560
[c84]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Li0HSM23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Li0HSM23
Qiujia Li, Bo Li, Dongseong Hwang, Tara N. Sainath, Pedro Moreno Mengibar:
Modular Domain Adaptation for Conformer-Based Streaming ASR. INTERSPEECH 2023: 3357-3361
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-01037
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-01037
Yu Zhang, Wei Han, James Qin, Yongqiang Wang, Ankur Bapna, Zhehuai Chen, Nanxin Chen, Bo Li, Vera Axelrod, Gary Wang, Zhong Meng, Ke Hu, Andrew Rosenberg, Rohit Prabhavalkar, Daniel S. Park, Parisa Haghani, Jason Riesa, Ginger Perng, Hagen Soltau, Trevor Strohman, Bhuvana Ramabhadran, Tara N. Sainath, Pedro J. Moreno, Chung-Cheng Chiu, Johan Schalkwyk, Françoise Beaufays, Yonghui Wu:
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages. CoRR abs/2303.01037 (2023)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13408
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13408
Qiujia Li, Bo Li, Dongseong Hwang, Tara N. Sainath, Pedro Moreno Mengibar:
Modular Domain Adaptation for Conformer-Based Streaming ASR. CoRR abs/2305.13408 (2023)
[i18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-08133
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-08133
Tongzhou Chen, Cyril Allauzen, Yinghui Huang, Daniel S. Park, David Rybach, W. Ronny Huang, Rodrigo Cabrera, Kartik Audhkhasi, Bhuvana Ramabhadran, Pedro J. Moreno, Michael Riley:
Large-scale Language Model Rescoring on Long-form Data. CoRR abs/2306.08133 (2023)
[i17]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-09996
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-09996
Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara N. Sainath, Françoise Beaufays, Pedro Moreno Mengibar:
Improving Speech Recognition for African American English With Audio Classification. CoRR abs/2309.09996 (2023)
[i16]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-12963
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-12963
Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Models for Short Search Queries. CoRR abs/2309.12963 (2023)
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-00178
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-00178
Weiran Wang, Zelin Wu, Diamantino Caseiro, Tsendsuren Munkhdalai, Khe Chai Sim, Pat Rondon, Golan Pundak, Gan Song, Rohit Prabhavalkar, Zhong Meng, Ding Zhao, Tara N. Sainath, Pedro Moreno Mengibar:
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm. CoRR abs/2310.00178 (2023)
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-13015
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-13015
Hillary Ngai, Rohan Agrawal, Neeraj Gaur, W. Ronny Huang, Parisa Haghani, Pedro Moreno Mengibar:
Audio-AdapterFusion: A Task-ID-free Approach for Efficient and Non-Destructive Multi-task Speech Recognition. CoRR abs/2310.13015 (2023)
2022
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/BaskarRRZM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/BaskarRRZM22
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Pedro J. Moreno:
Ask2Mask: Guided Data Selection for Masked Speech Modeling. IEEE J. Sel. Top. Signal Process. 16(6): 1357-1366 (2022)
[c83]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GaurCVHRM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GaurCVHRM22
Neeraj Gaur, Tongzhou Chen, Ehsan Variani, Parisa Haghani, Bhuvana Ramabhadran, Pedro J. Moreno:
Multilingual Second-Pass Rescoring for Automatic Speech Recognition Systems. ICASSP 2022: 6407-6411
[c82]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenZRRMW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenZRRMW22
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro J. Moreno, Gary Wang:
Tts4pretrain 2.0: Advancing the use of Text and Speech in ASR Pretraining with Consistency and Contrastive Losses. ICASSP 2022: 7677-7681
[c81]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AudhkhasiHRM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AudhkhasiHRM22
Kartik Audhkhasi, Yinghui Huang, Bhuvana Ramabhadran, Pedro J. Moreno:
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition. INTERSPEECH 2022: 1026-1030
[c80]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangRRBEHM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangRRBEHM22
Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Jesse Emond, Yinghui Huang, Pedro J. Moreno:
Non-Parallel Voice Conversion for ASR Augmentation. INTERSPEECH 2022: 3408-3412
[c79]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenZRRMBZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenZRRMBZ22
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro J. Moreno, Ankur Bapna, Heiga Zen:
MAESTRO: Matched Speech Text Representations through Modality Matching. INTERSPEECH 2022: 4093-4097
[c78]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BiadsyCZRRM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BiadsyCZRRM22
Fadi Biadsy, Youzheng Chen, Xia Zhang, Oleg Rybakov, Andrew Rosenberg, Pedro J. Moreno:
A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization. INTERSPEECH 2022: 5125-5129
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/WangCRCWRMLP22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/WangCRCWRMLP22
Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park:
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR. SLT 2022: 23-30
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ChenBRZRMC22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ChenBRZRMC22
Zhehuai Chen, Ankur Bapna, Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Pedro J. Moreno, Nanxin Chen:
Maestro-U: Leveraging Joint Speech-Text Representation Learning for Zero Supervised Speech ASR. SLT 2022: 68-75
[c75]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MengCPZWAESRHVHM22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MengCPZWAESRHVHM22
Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. SLT 2022: 197-204
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2202-12719
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-12719
Murali Karthick Baskar, Andrew Rosenberg, Bhuvana Ramabhadran, Yu Zhang, Pedro J. Moreno:
Ask2Mask: Guided Data Selection for Masked Speech Modeling. CoRR abs/2202.12719 (2022)
[i12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-12559
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-12559
Fadi Biadsy, Youzheng Chen, Xia Zhang, Oleg Rybakov, Andrew Rosenberg, Pedro J. Moreno:
A Scalable Model Specialization Framework for Training and Inference using Submodels and its Application to Speech Model Personalization. CoRR abs/2203.12559 (2022)
[i11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-03409
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-03409
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Pedro J. Moreno, Ankur Bapna, Heiga Zen:
MAESTRO: Matched Speech Text Representations through Modality Matching. CoRR abs/2204.03409 (2022)
[i10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-06096
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-06096
Kartik Audhkhasi, Yinghui Huang, Bhuvana Ramabhadran, Pedro J. Moreno:
Analysis of Self-Attention Head Diversity for Conformer-based Automatic Speech Recognition. CoRR abs/2209.06096 (2022)
[i9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-06987
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-06987
Gary Wang, Andrew Rosenberg, Bhuvana Ramabhadran, Fadi Biadsy, Yinghui Huang, Jesse Emond, Pedro Moreno Mengibar:
Non-Parallel Voice Conversion for ASR Augmentation. CoRR abs/2209.06987 (2022)
[i8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-10027
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-10027
Zhehuai Chen, Ankur Bapna, Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Pedro J. Moreno, Nanxin Chen:
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR. CoRR abs/2210.10027 (2022)
[i7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-10879
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-10879
Gary Wang, Ekin D. Cubuk, Andrew Rosenberg, Shuyang Cheng, Ron J. Weiss, Bhuvana Ramabhadran, Pedro J. Moreno, Quoc V. Le, Daniel S. Park:
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR. CoRR abs/2210.10879 (2022)
[i6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-17049
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-17049
Zhong Meng, Tongzhou Chen, Rohit Prabhavalkar, Yu Zhang, Gary Wang, Kartik Audhkhasi, Jesse Emond, Trevor Strohman, Bhuvana Ramabhadran, W. Ronny Huang, Ehsan Variani, Yinghui Huang, Pedro J. Moreno:
Modular Hybrid Autoregressive Transducer. CoRR abs/2210.17049 (2022)
2021
[c74]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ChenZRRWM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ChenZRRWM21
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro J. Moreno:
Injecting Text in Self-Supervised Speech Pretraining. ASRU 2021: 251-258
[c73]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GaurFHLMPRZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GaurFHLMPRZ21
Neeraj Gaur, Brian Farris, Parisa Haghani, Isabel Leal, Pedro J. Moreno, Manasa Prasad, Bhuvana Ramabhadran, Yun Zhu:
Mixture of Informed Experts for Multilingual Speech Recognition. ICASSP 2021: 6234-6238
[c72]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DoshiCJZBRCRM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DoshiCJZBRCRM21
Rohan Doshi, Youzheng Chen, Liyang Jiang, Xia Zhang, Fadi Biadsy, Bhuvana Ramabhadran, Fang Chu, Andrew Rosenberg, Pedro J. Moreno:
Extending Parrotron: An End-to-End, Speech Conversion and Speech Recognition Model for Atypical Speech. ICASSP 2021: 6988-6992
[c71]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenRZZGHEWRM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenRZZGHEWRM21
Zhehuai Chen, Andrew Rosenberg, Yu Zhang, Heiga Zen, Mohammadreza Ghodsi, Yinghui Huang, Jesse Emond, Gary Wang, Bhuvana Ramabhadran, Pedro J. Moreno:
Semi-Supervision in ASR: Sequential MixMatch and Factorized TTS-Based Augmentation. Interspeech 2021: 736-740
[c70]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AudhkhasiCRM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AudhkhasiCRM21
Kartik Audhkhasi, Tongzhou Chen, Bhuvana Ramabhadran, Pedro J. Moreno:
Mixture Model Attention: Flexible Streaming and Non-Streaming Automatic Speech Recognition. Interspeech 2021: 1812-1816
[c69]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LealGHFMPRZ21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LealGHFMPRZ21
Isabel Leal, Neeraj Gaur, Parisa Haghani, Brian Farris, Pedro J. Moreno, Manasa Prasad, Bhuvana Ramabhadran, Yun Zhu:
Self-Adaptive Distillation for Multilingual Speech Recognition: Leveraging Student Independence. Interspeech 2021: 2556-2560
[c68]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenRBZCJCDM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenRBZCJCDM21
Zhehuai Chen, Bhuvana Ramabhadran, Fadi Biadsy, Xia Zhang, Youzheng Chen, Liyang Jiang, Fang Chu, Rohan Doshi, Pedro J. Moreno:
Conformer Parrotron: A Faster and Stronger End-to-End Speech Conversion and Recognition Model for Atypical Speech. Interspeech 2021: 4828-4832
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2108-12226
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-12226
Zhehuai Chen, Yu Zhang, Andrew Rosenberg, Bhuvana Ramabhadran, Gary Wang, Pedro J. Moreno:
Injecting Text in Self-Supervised Speech Pretraining. CoRR abs/2108.12226 (2021)
2020
[c67]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangRCZRWM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangRCZRWM20
Gary Wang, Andrew Rosenberg, Zhehuai Chen, Yu Zhang, Bhuvana Ramabhadran, Yonghui Wu, Pedro J. Moreno:
Improving Speech Recognition Using Consistent Predictions on Synthesized Speech. ICASSP 2020: 7029-7033
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/VarianiCARLM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/VarianiCARLM20
Ehsan Variani, Tongzhou Chen, James Apfel, Bhuvana Ramabhadran, Seungji Lee, Pedro J. Moreno:
Neural Oracle Search on N-BEST Hypotheses. ICASSP 2020: 7824-7828
[c65]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenR0WRM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenR0WRM20
Zhehuai Chen, Andrew Rosenberg, Yu Zhang, Gary Wang, Bhuvana Ramabhadran, Pedro J. Moreno:
Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection. INTERSPEECH 2020: 556-560
[c64]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangRCZRM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangRCZRM20
Gary Wang, Andrew Rosenberg, Zhehuai Chen, Yu Zhang, Bhuvana Ramabhadran, Pedro J. Moreno:
SCADA: Stochastic, Consistent and Adversarial Data Augmentation to Improve ASR. INTERSPEECH 2020: 2832-2836
[c63]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhuHTRFXLSLGMZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhuHTRFXLSLGMZ20
Yun Zhu, Parisa Haghani, Anshuman Tripathi, Bhuvana Ramabhadran, Brian Farris, Hainan Xu, Han Lu, Hasim Sak, Isabel Leal, Neeraj Gaur, Pedro J. Moreno, Qian Zhang:
Multilingual Speech Recognition with Self-Attention Structured Parameterization. INTERSPEECH 2020: 4741-4745

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c62]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WatersGHMQ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WatersGHMQ19
Austin Waters, Neeraj Gaur, Parisa Haghani, Pedro J. Moreno, Zhongdi Qu:
Leveraging Language ID in Multilingual End-to-End Speech Recognition. ASRU 2019: 928-935
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/RosenbergZRJMWW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/RosenbergZRJMWW19
Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro J. Moreno, Yonghui Wu, Zelin Wu:
Speech Recognition with Augmented Synthesized Speech. ASRU 2019: 996-1002
[c60]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BiadsyWMKJ19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BiadsyWMKJ19
Fadi Biadsy, Ron J. Weiss, Pedro J. Moreno, Dimitri Kanvesky, Ye Jia:
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. INTERSPEECH 2019: 4115-4119
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-04169
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-04169
Fadi Biadsy, Ron J. Weiss, Pedro J. Moreno, Dimitri Kanvesky, Ye Jia:
Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation. CoRR abs/1904.04169 (2019)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-11699
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-11699
Andrew Rosenberg, Yu Zhang, Bhuvana Ramabhadran, Ye Jia, Pedro J. Moreno, Yonghui Wu, Zelin Wu:
Speech Recognition with Augmented Synthesized Speech. CoRR abs/1909.11699 (2019)
2018
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ToshniwalSWLMWR18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ToshniwalSWLMWR18
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition with a Single End-to-End Model. ICASSP 2018: 4904-4908
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/OinesWM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/OinesWM18
Asa Oines, Eugene Weinstein, Pedro J. Moreno:
Hybrid Lstm-Fsmn Networks for Acoustic Modeling. ICASSP 2018: 5844-5848
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MaKBNVM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MaKBNVM18
Min Ma, Shankar Kumar, Fadi Biadsy, Michael Nirschl, Tomas Vykruta, Pedro J. Moreno:
Modeling Non-Linguistic Contextual Signals in LSTM Language Models Via Domain Adaptation. ICASSP 2018: 6094-6098
[c56]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/VelikovichWSAMR18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/VelikovichWSAMR18
Leonid Velikovich, Ian Williams, Justin Scheiner, Petar S. Aleksic, Pedro J. Moreno, Michael Riley:
Semantic Lattice Processing in Contextual Automatic Speech Recognition for Google Assistant. INTERSPEECH 2018: 2222-2226
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/EmondRRMM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/EmondRRMM18
Jesse Emond, Bhuvana Ramabhadran, Brian Roark, Pedro J. Moreno, Min Ma:
Transliteration Based Approaches to Improve Code-Switched Speech Recognition Performance. SLT 2018: 448-455
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/HaghaniNBCGMPQW18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/HaghaniNBCGMPQW18
Parisa Haghani, Arun Narayanan, Michiel Bacchiani, Galen Chuang, Neeraj Gaur, Pedro J. Moreno, Rohit Prabhavalkar, Zhongdi Qu, Austin Waters:
From Audio to Semantics: Approaches to End-to-End Spoken Language Understanding. SLT 2018: 720-726
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1809-09190
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-09190
Parisa Haghani, Arun Narayanan, Michiel Bacchiani, Galen Chuang, Neeraj Gaur, Pedro J. Moreno, Rohit Prabhavalkar, Zhongdi Qu, Austin Waters:
From Audio to Semantics: Approaches to end-to-end spoken language understanding. CoRR abs/1809.09190 (2018)
2017
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/QuHWM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/QuHWM17
Zhongdi Qu, Parisa Haghani, Eugene Weinstein, Pedro J. Moreno:
Syllable-based acoustic modeling with CTC-SMBR-LSTM. ASRU 2017: 173-177
[p1]
- view
  authority control:
- export record
  dblp key:
  - books/sp/17/BacchianiBGMSSZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/sp/17/BacchianiBGMSSZ17
Michiel Bacchiani, Françoise Beaufays, Alexander Gruenstein, Pedro J. Moreno, Johan Schalkwyk, Trevor Strohman, Heiga Zen:
Speech Research at Google to Enable Universal Speech Interfaces. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 385-399
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1711-01694
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-01694
Shubham Toshniwal, Tara N. Sainath, Ron J. Weiss, Bo Li, Pedro J. Moreno, Eugene Weinstein, Kanishka Rao:
Multilingual Speech Recognition With A Single End-To-End Model. CoRR abs/1711.01694 (2017)
2016
[j12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/csl/Lopez-MorenoGMP16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/Lopez-MorenoGMP16
Ignacio López-Moreno, Javier Gonzalez-Dominguez, David Martinez, Oldrich Plchot, Joaquin Gonzalez-Rodriguez, Pedro J. Moreno:
On the use of deep feedforward neural networks for automatic language identification. Comput. Speech Lang. 40: 46-59 (2016)
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SotoSEM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SotoSEM16
Victor Soto, Olivier Siohan, Mohamed Elfeky, Pedro J. Moreno:
Selection and combination of hypotheses for dialectal speech recognition. ICASSP 2016: 5845-5849
[c51]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/QuitryOMW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/QuitryOMW16
Felix de Chaumont Quitry, Asa Oines, Pedro J. Moreno, Eugene Weinstein:
High quality agreement-based semi-supervised training data for acoustic modeling. SLT 2016: 592-596
[c50]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ElfekyBVMW16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ElfekyBVMW16
Mohamed Elfeky, Meysam Bastani, Xavier Velez, Pedro J. Moreno, Austin Waters:
Towards acoustic model unification across dialects. SLT 2016: 624-628
2015
[j11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/jstsp/Gonzalez-Dominguez15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/Gonzalez-Dominguez15
Javier Gonzalez-Dominguez, David Eustis, Ignacio López-Moreno, Andrew W. Senior, Françoise Beaufays, Pedro J. Moreno:
A Real-Time End-to-End Multilingual Speech Recognition Architecture. IEEE J. Sel. Top. Signal Process. 9(4): 749-759 (2015)
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/nn/Gonzalez-Dominguez15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/nn/Gonzalez-Dominguez15
Javier Gonzalez-Dominguez, Ignacio López-Moreno, Pedro J. Moreno, Joaquín González-Rodríguez:
Frame-by-frame language identification in short utterances using deep neural networks. Neural Networks 64: 49-58 (2015)
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AleksicAEKCM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AleksicAEKCM15
Petar S. Aleksic, Cyril Allauzen, David Elson, Aleksandar Kracun, Diego Melendo Casado, Pedro J. Moreno:
Improved recognition of contact names in voice commands. ICASSP 2015: 5172-5175
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/icnlsp/ElfekyMS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icnlsp/ElfekyMS15
Mohamed G. Elfeky, Pedro J. Moreno, Victor Soto:
Multi-Dialectical Languages Effect on Speech Recognition: Too Much Choice Can Hurt. ICNLSP 2015: 1-8
[c47]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AleksicGMAHRRM15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AleksicGMAHRRM15
Petar S. Aleksic, Mohammadreza Ghodsi, Assaf Hurwitz Michaely, Cyril Allauzen, Keith B. Hall, Brian Roark, David Rybach, Pedro J. Moreno:
Bringing contextual information to google speech recognition. INTERSPEECH 2015: 468-472
2014
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Lopez-MorenoGPMGM14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Lopez-MorenoGPMGM14
Ignacio López-Moreno, Javier Gonzalez-Dominguez, Oldrich Plchot, David Martinez, Joaquin Gonzalez-Rodriguez, Pedro J. Moreno:
Automatic language identification using deep neural networks. ICASSP 2014: 5337-5341
[c45]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/McDermottHMSB14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/McDermottHMSB14
Erik McDermott, Georg Heigold, Pedro J. Moreno, Andrew W. Senior, Michiel Bacchiani:
Asynchronous stochastic optimization for sequence training of deep neural networks: towards big data. INTERSPEECH 2014: 1224-1228
[c44]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KapralovaAWMS14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KapralovaAWMS14
Olga Kapralova, John Alex, Eugene Weinstein, Pedro J. Moreno, Olivier Siohan:
A big data approach to acoustic model training corpus selection. INTERSPEECH 2014: 2083-2087
[c43]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Gonzalez-DominguezLSGM14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Gonzalez-DominguezLSGM14
Javier Gonzalez-Dominguez, Ignacio López-Moreno, Hasim Sak, Joaquin Gonzalez-Rodriguez, Pedro J. Moreno:
Automatic language identification using long short-term memory recurrent neural networks. INTERSPEECH 2014: 2155-2159
[c42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BiadsyHMR14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BiadsyHMR14
Fadi Biadsy, Keith B. Hall, Pedro J. Moreno, Brian Roark:
Backoff inspired features for maximum entropy language models. INTERSPEECH 2014: 2645-2649
2012
[c41]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/BiadsyMJ12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BiadsyMJ12
Fadi Biadsy, Pedro J. Moreno, Martin Jansche:
Google's cross-dialect Arabic voice search. ICASSP 2012: 4441-4444
2011
[c40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SungJM11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SungJM11
Yun-Hsuan Sung, Martin Jansche, Pedro J. Moreno:
Deploying Google Search by Voice in Cantonese. INTERSPEECH 2011: 2865-2868
2010
[j9]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/MohriMW10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/MohriMW10
Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
Efficient and Robust Music Identification With Weighted Finite-State Transducers. IEEE Trans. Speech Audio Process. 18(1): 197-207 (2010)
[c39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BarnardSHM10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BarnardSHM10
Etienne Barnard, Johan Schalkwyk, Charl Johannes van Heerden, Pedro J. Moreno:
Voice search for development. INTERSPEECH 2010: 282-285
[c38]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShanWHTJM10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShanWHTJM10
Jiulong Shan, Genqing Wu, Zhihong Hu, Xiliu Tang, Martin Jansche, Pedro J. Moreno:
Search by voice in Mandarin Chinese. INTERSPEECH 2010: 354-357
[c37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HughesNHVML10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HughesNHVML10
Thad Hughes, Kaisuke Nakajima, Linne Ha, Atul Vasu, Pedro J. Moreno, Mike LeBeau:
Building transcribed speech corpora quickly and cheaply for many languages. INTERSPEECH 2010: 1914-1917
[c36]
- view
  - electronic edition @ mlr.press (open access)
  - no references & citations available
- export record
  dblp key:
  - journals/jmlr/MohriMW10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jmlr/MohriMW10
Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
Discriminative Topic Segmentation of Text and Speech. AISTATS 2010: 533-540

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[j8]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/tcs/MohriMW09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tcs/MohriMW09
Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
General suffix automaton construction algorithm and space bounds. Theor. Comput. Sci. 410(37): 3553-3562 (2009)
[c35]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SarginAMZ09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SarginAMZ09
Mehmet Emre Sargin, Hrishikesh B. Aradhye, Pedro J. Moreno, Ming Zhao:
Audiovisual celebrity recognition in unconstrained web videos. ICASSP 2009: 1977-1980
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MorenoA09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MorenoA09
Pedro J. Moreno, Christopher Alberti:
A factor automaton approach for the forced alignment of long speech recordings. ICASSP 2009: 4869-4872
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AlbertiBBCDLMPSSS09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AlbertiBBCDLMPSSS09
Christopher Alberti, Michiel Bacchiani, Ari Bezman, Ciprian Chelba, Anastassia Drofa, Hank Liao, Pedro J. Moreno, Ted Power, Arnaud Sahuguet, Maria Shugrina, Olivier Siohan:
An audio indexing system for election video material. ICASSP 2009: 4873-4876
[c32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MohriMW09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MohriMW09
Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
A new quality measure for topic segmentation of text and speech. INTERSPEECH 2009: 2743-2746
2007
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/pami/CarneiroCMV07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/pami/CarneiroCMV07
Gustavo Carneiro, Antoni B. Chan, Pedro J. Moreno, Nuno Vasconcelos:
Supervised Learning of Semantic Classes for Image Annotation and Retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 29(3): 394-410 (2007)
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/RasiwasiaMV07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/RasiwasiaMV07
Nikhil Rasiwasia, Pedro J. Moreno, Nuno Vasconcelos:
Bridging the Gap: Query by Semantic Example. IEEE Trans. Multim. 9(5): 923-938 (2007)
[c31]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WeinsteinM07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WeinsteinM07
Eugene Weinstein, Pedro J. Moreno:
Music Identification with Weighted Finite-State Transducers. ICASSP (2) 2007: 689-692
[c30]
- view
  - electronic edition @ ismir.net (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/ismir/MohriMW07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ismir/MohriMW07
Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
Robust Music Identification, Detection, and Analysis. ISMIR 2007: 135-138
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/wia/MohriMW07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wia/MohriMW07
Mehryar Mohri, Pedro J. Moreno, Eugene Weinstein:
Factor Automata of Automata and Applications. CIAA 2007: 168-179
2006
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/civr/RasiwasiaVM06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/civr/RasiwasiaVM06
Nikhil Rasiwasia, Nuno Vasconcelos, Pedro J. Moreno:
Query by Semantic Example. CIVR 2006: 51-60
2005
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/LoganTM05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/LoganTM05
Beth Logan, Jean-Manuel Van Thong, Pedro J. Moreno:
Approaches to reduce the effects of OOV queries on indexed spoken audio. IEEE Trans. Multim. 7(5): 899-906 (2005)
2004
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/eccv/VasconcelosHM04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/eccv/VasconcelosHM04
Nuno Vasconcelos, Purdy Ho, Pedro J. Moreno:
The Kullback-Leibler Kernel as a Framework for Discriminant and Localized Representations for Visual Recognition. ECCV (3) 2004: 430-441
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/LoganKM04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/LoganKM04
Beth Logan, A. Kositsky, Pedro J. Moreno:
Semantic analysis of song lyrics. ICME 2004: 827-830
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icmcs/MarstonMLMT04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/MarstonMLMT04
J. Marston, G. MacCarthy, Beth Logan, Pedro J. Moreno, Jean-Manuel Van Thong:
News Tuner: a simple interface for searching and browsing radio archives. ICME 2004: 1531-1534
[c24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HoM04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HoM04
Purdy Ho, Pedro J. Moreno:
SVM kernel adaptation in speaker classification and verification. INTERSPEECH 2004: 1413-1416
2003
[c23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MorenoH03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MorenoH03
Pedro J. Moreno, Purdy Ho:
A new SVM approach to speaker identification and verification using probabilistic distance kernels. INTERSPEECH 2003: 2965-2968
[c22]
- view
- export record
  dblp key:
  - conf/nips/MorenoHV03
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/MorenoHV03
Pedro J. Moreno, Purdy Ho, Nuno Vasconcelos:
A Kullback-Leibler Divergence Based Kernel for SVM Classification in Multimedia Applications. NIPS 2003: 1385-1392
2002
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/computer/MorenoTLJ02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/computer/MorenoTLJ02
Pedro J. Moreno, Jean-Manuel Van Thong, Beth Logan, Gareth J. F. Jones:
From Multimedia Retrieval to Knowledge Management. Computer 35(4): 58-66 (2002)
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/ThongMLFMM02
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/ThongMLFMM02
Jean-Manuel Van Thong, Pedro J. Moreno, Beth Logan, Blair Fidler, K. Maffey, M. Moores:
Speechbot: an experimental speech-based search engine for multimedia content on the web. IEEE Trans. Multim. 4(1): 88-96 (2002)
2001
[c21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MorenoLR01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MorenoLR01
Pedro J. Moreno, Beth Logan, Bhiksha Raj:
A boosting approach for confidence scoring. INTERSPEECH 2001: 2109-2112
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/sigir/BleiM01
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/sigir/BleiM01
David M. Blei, Pedro J. Moreno:
Topic Segmentation with an Aspect Hidden Markov Model. SIGIR 2001: 343-348
2000
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MorenoR00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MorenoR00
Pedro J. Moreno, Ryan Rifkin:
Using the Fisher kernel method for Web audio classification. ICASSP 2000: 2417-2420
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LoganMTW00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LoganMTW00
Beth Logan, Pedro J. Moreno, Jean-Manuel Van Thong, Edward W. D. Whittaker:
An experimental study of an audio indexing system for the web. INTERSPEECH 2000: 676-679
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/riao/GoddeauLLMST00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/riao/GoddeauLLMST00
David Goddeau, Anna Litvinova, Beth Logan, Pedro J. Moreno, Michael J. Swain, Jean-Manuel Van Thong:
SpeechBot: a Speech Recognition based Audio Indexing System for the Web. RIAO 2000: 106-115

1990 – 1999

see FAQ

What is the meaning of the colors in the publication lists?

1999
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ClarksonM99
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ClarksonM99
Philip Clarkson, Pedro J. Moreno:
On the use of support vector machines for phonetic classification. ICASSP 1999: 585-588
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/visual/EbermanFIJKKMST99
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/visual/EbermanFIJKKMST99
Brian S. Eberman, Blair Fidler, Robert A. Iannucci, Christopher F. Joerg, Leonidas I. Kontothanassis, David E. Kovalcin, Pedro J. Moreno, Michael J. Swain, Jean-Manuel Van Thong:
Indexing Multimedia for the Internet. VISUAL 1999: 195-202
1998
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/MorenoRS98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/MorenoRS98
Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
Data-driven environmental compensation for speech recognition: A unified approach. Speech Commun. 24(4): 267-285 (1998)
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LoganM98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LoganM98
Beth Logan, Pedro J. Moreno:
Factorial HMMs for acoustic modeling. ICASSP 1998: 813-816
[c13]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MorenoJTG98
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MorenoJTG98
Pedro J. Moreno, Christopher F. Joerg, Jean-Manuel Van Thong, Oren Glickman:
A recursive algorithm for the forced alignment of very long audio segments. ICSLP 1998
1997
[c12]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/EbermanM97
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/EbermanM97
Brian S. Eberman, Pedro J. Moreno:
Delta vector taylor series environment compensation for speaker recognition. EUROSPEECH 1997: 2335-2338
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MorenoE97
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MorenoE97
Pedro J. Moreno, Brian S. Eberman:
A new algorithm for robust speech recognition: the delta vector taylor series approach. EUROSPEECH 1997: 2599-2602
1996
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MorenoRS96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MorenoRS96
Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
A vector Taylor series approach for environment-independent speech recognition. ICASSP 1996: 733-736
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RajGMS96
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RajGMS96
Bhiksha Raj, Evandro Bacci Gouvêa, Pedro J. Moreno, Richard M. Stern:
Cepstral compensation by polynomial approximation for environment-independent speech recognition. ICSLP 1996: 2340-2343
1995
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MorenoRGS95
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MorenoRGS95
Pedro J. Moreno, Bhiksha Raj, Evandro B. Gouvêa, Richard M. Stern:
Multivariate-Gaussian-based cepstral normalization for robust speech recognition. ICASSP 1995: 137-140
[c7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MorenoRS95
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MorenoRS95
Pedro J. Moreno, Bhiksha Raj, Richard M. Stern:
A unified approach for robust speech recognition. EUROSPEECH 1995: 481-484
1994
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiuSAM94
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiuSAM94
Fu-Hua Liu, Richard M. Stern, Alejandro Acero, Pedro J. Moreno:
Environment normalization for robust speech recognition using direct cepstral comparison. ICASSP (2) 1994: 61-64
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MorenoS94
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MorenoS94
Pedro J. Moreno, Richard M. Stern:
Sources of degradation of speech recognition in the telephone network. ICASSP (1) 1994: 109-112
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SternLMA94
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SternLMA94
Richard M. Stern, Fu-Hua Liu, Pedro J. Moreno, Alejandro Acero:
Signal processing for robust speech recognition. ICSLP 1994: 1027-1030
[c3]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/naacl/LiuMSA94
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/naacl/LiuMSA94
Fu-Hua Liu, Pedro J. Moreno, Richard M. Stern, Alejandro Acero:
Signal Processing for Robust Speech Recognition. HLT 1994
1992
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/speech/RoeMSPRM92
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/speech/RoeMSPRM92
David B. Roe, Pedro J. Moreno, Richard Sproat, Fernando C. N. Pereira, Michael Riley, Alejandro Macarrón:
A spoken language translator for restricted-domain context-free languages. Speech Commun. 11(2-3): 311-319 (1992)
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RoePSRMM92
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RoePSRMM92
David B. Roe, Fernando C. N. Pereira, Richard Sproat, Michael D. Riley, Pedro J. Moreno, Alejandro Macarrón:
Efficient grammar processing for a spoken language translation system. ICASSP 1992: 213-216
1991
[c1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/RoePSRMM91
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/RoePSRMM91
David B. Roe, Fernando Pereira, Richard Sproat, Michael D. Riley, Pedro J. Moreno, Alejandro Macarrón:
Toward a spoken language translator for restricted-domain context-free languages. EUROSPEECH 1991: 1063-1066

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.