


Shaojin Ding
2020 – today
- 2024
- [c22] Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal:
USM-Lite: Quantization and Sparsity Aware Fine-Tuning for Speech Recognition with Universal Speech Models. ICASSP 2024: 10756-10760
- [c21] David Qiu, David Rim, Shaojin Ding, Oleg Rybakov, Yanzhang He:
Rand: Robustness Aware Norm Decay for Quantized Neural Networks. SLT 2024: 1023-1030
- 2023
- [c20] Xingyu Cai, David Qiu, Shaojin Ding, Dongseong Hwang, Weiran Wang, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He:
Efficient Cascaded Streaming ASR System Via Frame Rate Reduction. ASRU 2023: 1-8
- [c19] David Qiu, Shaojin Ding, Yanzhang He:
The Role of Feature Correlation on Quantized Neural Networks. ASRU 2023: 1-7
- [c18] Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw:
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models. ICASSP 2023: 1-5
- [c17] Tom O'Malley, Shaojin Ding, Arun Narayanan, Quan Wang, Rajeev Rikhye, Qiao Liang, Yanzhang He, Ian McGraw:
Conditional Conformer: Improving Speaker Modulation For Single And Multi-User Speech Enhancement. ICASSP 2023: 1-5
- [c16] Weiran Wang, Ding Zhao, Shaojin Ding, Hao Zhang, Shuo-Yiin Chang, David Rybach, Tara N. Sainath, Yanzhang He, Ian McGraw, Shankar Kumar:
Multi-Output RNN-T Joint Networks for Multi-Task Learning of ASR and Auxiliary Tasks. ICASSP 2023: 1-5
- [c15] Oleg Rybakov, Phoenix Meadowlark, Shaojin Ding, David Qiu, Jian Li, David Rim, Yanzhang He:
2-bit Conformer quantization for automatic speech recognition. INTERSPEECH 2023: 4908-4912
- [i11] Steven M. Hernandez, Ding Zhao, Shaojin Ding, Antoine Bruguier, Rohit Prabhavalkar, Tara N. Sainath, Yanzhang He, Ian McGraw:
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models. CoRR abs/2303.08343 (2023)
- [i10] David Qiu, David Rim, Shaojin Ding, Oleg Rybakov, Yanzhang He:
RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models. CoRR abs/2305.15536 (2023)
- [i9] Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Shivani Agrawal, Zhonglin Han, Jian Li, Amir Yazdanbakhsh:
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models. CoRR abs/2312.08553 (2023)
- 2022
- [j4] Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna:
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning. Comput. Speech Lang. 72: 101302 (2022)
- [c14] Mu Yang, Shaojin Ding, Tianlong Chen, Tong Wang, Zhangyang Wang:
Towards Lifelong Learning of Multilingual Text-to-Speech Synthesis. ICASSP 2022: 8022-8026
- [c13] Shaojin Ding, Tianlong Chen, Zhangyang Wang:
Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable. ICLR 2022
- [c12] Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman:
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes. INTERSPEECH 2022: 1706-1710
- [c11] Shaojin Ding, Phoenix Meadowlark, Yanzhang He, Lukasz Lew, Shivani Agrawal, Oleg Rybakov:
4-bit Conformer with Native Quantization Aware Training for Speech Recognition. INTERSPEECH 2022: 1711-1715
- [c10] Shaojin Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw:
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition. INTERSPEECH 2022: 3744-3748
- [i8] Shaojin Ding, Phoenix Meadowlark, Yanzhang He, Lukasz Lew, Shivani Agrawal, Oleg Rybakov:
4-bit Conformer with Native Quantization Aware Training for Speech Recognition. CoRR abs/2203.15952 (2022)
- [i7] Shaojin Ding, Rajeev Rikhye, Qiao Liang, Yanzhang He, Quan Wang, Arun Narayanan, Tom O'Malley, Ian McGraw:
Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition. CoRR abs/2204.03793 (2022)
- [i6] Shaojin Ding, Weiran Wang, Ding Zhao, Tara N. Sainath, Yanzhang He, Robert David, Rami Botros, Xin Wang, Rina Panigrahy, Qiao Liang, Dongseong Hwang, Ian McGraw, Rohit Prabhavalkar, Trevor Strohman:
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes. CoRR abs/2204.06164 (2022)
- 2021
- [j3] Guanlong Zhao, Shaojin Ding, Ricardo Gutierrez-Osuna:
Converting Foreign Accent Speech Without a Reference. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2367-2381 (2021)
- [c9] Shaojin Ding, Ye Jia, Ke Hu, Quan Wang:
Textual Echo Cancellation. ASRU 2021: 548-555
- [i5] Mu Yang, Shaojin Ding, Tianlong Chen, Tong Wang, Zhangyang Wang:
Towards Lifelong Learning of Multilingual Text-To-Speech Synthesis. CoRR abs/2110.04482 (2021)
- 2020
- [j2] Shaojin Ding, Guanlong Zhao, Christopher Liberatore, Ricardo Gutierrez-Osuna:
Learning Structured Sparse Representations for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 28: 343-354 (2020)
- [c8] Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna:
Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition. INTERSPEECH 2020: 776-780
- [c7] Shaojin Ding, Tianlong Chen, Xinyu Gong, Weiwei Zha, Zhangyang Wang:
AutoSpeech: Neural Architecture Search for Speaker Recognition. INTERSPEECH 2020: 916-920
- [c6] Shaojin Ding, Quan Wang, Shuo-Yiin Chang, Li Wan, Ignacio López-Moreno:
Personal VAD: Speaker-Conditioned Voice Activity Detection. Odyssey 2020: 433-439
- [i4] Shaojin Ding, Tianlong Chen, Xinyu Gong, Weiwei Zha, Zhangyang Wang:
AutoSpeech: Neural Architecture Search for Speaker Recognition. CoRR abs/2005.03215 (2020)
- [i3] Shaojin Ding, Ye Jia, Ke Hu, Quan Wang:
Textual Echo Cancellation. CoRR abs/2008.06006 (2020)
2010 – 2019
- 2019
- [j1] Shaojin Ding, Christopher Liberatore, Sinem Sonsaat, Ivana Lucic, Alif Silpachai, Guanlong Zhao, Evgeny Chukharev-Hudilainen, John Levis, Ricardo Gutierrez-Osuna:
Golden speaker builder - An interactive tool for pronunciation training. Speech Commun. 115: 51-66 (2019)
- [c5] Tianlong Chen, Shaojin Ding, Jingyi Xie, Ye Yuan, Wuyang Chen, Yang Yang, Zhou Ren, Zhangyang Wang:
ABD-Net: Attentive but Diverse Person Re-Identification. ICCV 2019: 8350-8360
- [c4] Shaojin Ding, Ricardo Gutierrez-Osuna:
Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion. INTERSPEECH 2019: 724-728
- [c3] Guanlong Zhao, Shaojin Ding, Ricardo Gutierrez-Osuna:
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams. INTERSPEECH 2019: 2843-2847
- [i2] Tianlong Chen, Shaojin Ding, Jingyi Xie, Ye Yuan, Wuyang Chen, Yang Yang, Zhou Ren, Zhangyang Wang:
ABD-Net: Attentive but Diverse Person Re-Identification. CoRR abs/1908.01114 (2019)
- [i1] Shaojin Ding, Quan Wang, Shuo-Yiin Chang, Li Wan, Ignacio López-Moreno:
Personal VAD: Speaker-Conditioned Voice Activity Detection. CoRR abs/1908.04284 (2019)
- 2018
- [c2] Shaojin Ding, Guanlong Zhao, Christopher Liberatore, Ricardo Gutierrez-Osuna:
Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function. INTERSPEECH 2018: 476-480
- [c1] Shaojin Ding, Christopher Liberatore, Ricardo Gutierrez-Osuna:
Learning Structured Dictionaries for Exemplar-based Voice Conversion. INTERSPEECH 2018: 481-485
last updated on 2025-03-04 22:22 CET by the dblp team
all metadata released as open data under CC0 1.0 license