default search action

combined dblp search
author search
venue search
publication search

ask others

Jinyu Li 0001

Jin-Yu Li 0001

> Home > Persons

Person information

affiliation: Microsoft Corporation, Redmond, WA, USA
affiliation (PhD): Georgia Institute of Technology, Center for Signal and Image Processing, Atlanta, GA, USA
affiliation: University of Science and Technology of China, iFlytek Speech Lab, Hefei, China

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j20]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GongWLLZCQ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GongWLLZCQ24
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian:
Advanced Long-Content Speech Recognition With Factorized Neural Transducer. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1803-1815 (2024)
[j19]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ZhangCZWRLYGDLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ZhangCZWRLYGDLW24
Ziqiang Zhang, Sanyuan Chen, Long Zhou, Yu Wu, Shuo Ren, Shujie Liu, Zhuoyuan Yao, Xun Gong, Li-Rong Dai, Jinyu Li, Furu Wei:
SpeechLM: Enhanced Speech Pre-Training With Unpaired Textual Data. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2177-2187 (2024)
[j18]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangTCKECTLLY24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangTCKECTLLY24
Xiaofei Wang, Manthan Thakker, Zhuo Chen, Naoyuki Kanda, Sefik Emre Eskimez, Sanyuan Chen, Min Tang, Shujie Liu, Jinyu Li, Takuya Yoshioka:
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3355-3364 (2024)
[j17]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangZZWLGCLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangZZWLGCLW24
Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei:
VioLA: Conditional Language Models for Speech Recognition, Synthesis, and Translation. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3709-3716 (2024)
[j16]
- view
  authority control:
- export record
  dblp key:
  - journals/tmm/ZhuZZLJZDJLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tmm/ZhuZZLJZDJLW24
Qiushi Zhu, Long Zhou, Ziqiang Zhang, Shujie Liu, Binxing Jiao, Jie Zhang, Li-Rong Dai, Daxin Jiang, Jinyu Li, Furu Wei:
VatLM: Visual-Audio-Text Pre-Training With Unified Masked Prediction for Speech Representation Learning. IEEE Trans. Multim. 26: 1055-1064 (2024)
[c180]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/emnlp/HuZ0CMHPL0SLW24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/HuZ0CMHPL0SLW24
Shujie Hu, Long Zhou, Shujie Liu, Sanyuan Chen, Lingwei Meng, Hongkun Hao, Jing Pan, Xunying Liu, Jinyu Li, Sunit Sivasankaran, Linquan Liu, Furu Wei:
WavLLM: Towards Robust and Adaptive Speech Large Language Model. EMNLP (Findings) 2024: 4552-4572
[c179]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/PapiWCXK0G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/PapiWCXK0G24
Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Naoyuki Kanda, Jinyu Li, Yashesh Gaur:
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation. ICASSP 2024: 10381-10385
[c178]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangK0CWX0Y24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangK0CWX0Y24
Mu Yang, Naoyuki Kanda, Xiaofei Wang, Junkun Chen, Peidong Wang, Jian Xue, Jinyu Li, Takuya Yoshioka:
Diarist: Streaming Speech Translation with Speaker Diarization. ICASSP 2024: 10866-10870
[c177]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Wang024a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Wang024a
Yiming Wang, Jinyu Li:
Residualtransformer: Residual Low-Rank Learning With Weight-Sharing For Transformer Layers. ICASSP 2024: 11161-11165
[c176]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuKY00024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuKY00024
Jian Wu, Naoyuki Kanda, Takuya Yoshioka, Rui Zhao, Zhuo Chen, Jinyu Li:
T-SOT FNT: Streaming Multi-Talker ASR with Text-Only Domain Adaptation Capability. ICASSP 2024: 11531-11535
[c175]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/icml/JuWS0XYLLST000024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icml/JuWS0XYLLST000024
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Eric Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiangyang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. ICML 2024
[c174]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/LeQWZ00YQ0024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/LeQWZ00YQ0024
Chenyang Le, Yao Qian, Dongmei Wang, Long Zhou, Shujie Liu, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Michael Zeng:
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation. NeurIPS 2024
[c173]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZhangQZ0WWYQ00Z24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangQZ0WWYQ00Z24
Leying Zhang, Yao Qian, Long Zhou, Shujie Liu, Dongmei Wang, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Lei He, Sheng Zhao, Michael Zeng:
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations. NeurIPS 2024
[i127]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2401-00246
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-00246
Hongkun Hao, Long Zhou, Shujie Liu, Jinyu Li, Shujie Hu, Rui Wang, Furu Wei:
Boosting Large Language Model for Speech Synthesis: An Empirical Study. CoRR abs/2401.00246 (2024)
[i126]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-03100
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-03100
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao:
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models. CoRR abs/2403.03100 (2024)
[i125]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-13423
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-13423
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian:
Advanced Long-Content Speech Recognition With Factorized Neural Transducer. CoRR abs/2403.13423 (2024)
[i124]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-00656
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-00656
Shujie Hu, Long Zhou, Shujie Liu, Sanyuan Chen, Hongkun Hao, Jing Pan, Xunying Liu, Jinyu Li, Sunit Sivasankaran, Linquan Liu, Furu Wei:
WavLLM: Towards Robust and Adaptive Speech Large Language Model. CoRR abs/2404.00656 (2024)
[i123]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-03204
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-03204
Detai Xin, Xu Tan, Kai Shen, Zeqian Ju, Dongchao Yang, Yuancheng Wang, Shinnosuke Takamichi, Hiroshi Saruwatari, Shujie Liu, Jinyu Li, Sheng Zhao:
RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis. CoRR abs/2404.03204 (2024)
[i122]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-06690
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-06690
Leying Zhang, Yao Qian, Long Zhou, Shujie Liu, Dongmei Wang, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Lei He, Sheng Zhao, Michael Zeng:
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations. CoRR abs/2404.06690 (2024)
[i121]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2405-17809
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17809
Chenyang Le, Yao Qian, Dongmei Wang, Long Zhou, Shujie Liu, Xiaofei Wang, Midia Yousefi, Yanmin Qian, Jinyu Li, Sheng Zhao, Michael Zeng:
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation. CoRR abs/2405.17809 (2024)
[i120]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-05370
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-05370
Sanyuan Chen, Shujie Liu, Long Zhou, Yanqing Liu, Xu Tan, Jinyu Li, Sheng Zhao, Yao Qian, Furu Wei:
VALL-E 2: Neural Codec Language Models are Human Parity Zero-Shot Text to Speech Synthesizers. CoRR abs/2406.05370 (2024)
[i119]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-05699
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-05699
Xiaofei Wang, Sefik Emre Eskimez, Manthan Thakker, Hemin Yang, Zirun Zhu, Min Tang, Yufei Xia, Jinzhu Li, Sheng Zhao, Jinyu Li, Naoyuki Kanda:
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS. CoRR abs/2406.05699 (2024)
[i118]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-07855
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-07855
Bing Han, Long Zhou, Shujie Liu, Sanyuan Chen, Lingwei Meng, Yanming Qian, Yanqing Liu, Sheng Zhao, Jinyu Li, Furu Wei:
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment. CoRR abs/2406.07855 (2024)
[i117]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10276
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10276
Peidong Wang, Jian Xue, Jinyu Li, Junkun Chen, Aswin Shanmugam Subramanian:
Soft Language Identification for Language-Agnostic Many-to-One End-to-End Speech Translation. CoRR abs/2406.10276 (2024)
[i116]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-08551
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-08551
Lingwei Meng, Long Zhou, Shujie Liu, Sanyuan Chen, Bing Han, Shujie Hu, Yanqing Liu, Jinyu Li, Sheng Zhao, Xixin Wu, Helen Meng, Furu Wei:
Autoregressive Speech Synthesis without Vector Quantization. CoRR abs/2407.08551 (2024)
[i115]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-12229
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-12229
Haibin Wu, Xiaofei Wang, Sefik Emre Eskimez, Manthan Thakker, Daniel Tompkins, Chung-Hsien Tsai, Canrun Li, Zhen Xiao, Sheng Zhao, Jinyu Li, Naoyuki Kanda:
Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech. CoRR abs/2407.12229 (2024)
[i114]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-04016
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-04016
Jiaqi Li, Dongmei Wang, Xiaofei Wang, Yao Qian, Long Zhou, Shujie Liu, Midia Yousefi, Canrun Li, Chung-Hsien Tsai, Zhen Xiao, Yanqing Liu, Junkun Chen, Sheng Zhao, Jinyu Li, Zhizheng Wu, Michael Zeng:
Investigating Neural Audio Codecs for Speech Language Model-Based Speech Generation. CoRR abs/2409.04016 (2024)
[i113]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-13913
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-13913
Sunit Sivasankaran, Eric Sun, Jin-Yu Li, Yan Huang, Jing Pan:
Target word activity detector: An approach to obtain ASR word boundaries without lexicon. CoRR abs/2409.13913 (2024)
[i112]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-05146
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-05146
Rui Zhao, Jinyu Li, Ruchao Fan, Matt Post:
CTC-GMM: CTC guided modality matching for fast and accurate streaming speech translation. CoRR abs/2410.05146 (2024)
2023
[c172]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ChenXWPL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ChenXWPL23
Junkun Chen, Jian Xue, Peidong Wang, Jing Pan, Jinyu Li:
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach. ASRU 2023: 1-7
[c171]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiWLL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiWLL23
Yuang Li, Yu Wu, Jinyu Li, Shujie Liu:
Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition. ASRU 2023: 1-8
[c170]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/PapiWCXLG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/PapiWCXLG23
Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Jinyu Li, Yashesh Gaur:
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments. ASRU 2023: 1-8
[c169]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SunLHZZXWLLLG23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SunLHZZXWLLLG23
Eric Sun, Jinyu Li, Yuxuan Hu, Yimeng Zhu, Long Zhou, Jian Xue, Peidong Wang, Linquan Liu, Shujie Liu, Edward Lin, Yifan Gong:
Building High-Accuracy Multilingual ASR With Gated Language Experts and Curriculum Training. ASRU 2023: 1-7
[c168]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WuGCZZWLLRLW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WuGCZZWLLRLW23
Jian Wu, Yashesh Gaur, Zhuo Chen, Long Zhou, Yimeng Zhu, Tianrui Wang, Jinyu Li, Shujie Liu, Bo Ren, Linquan Liu, Yu Wu:
On Decoder-Only Architecture For Speech-to-Text and Large Language Model Integration. ASRU 2023: 1-8
[c167]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/XueWLS23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/XueWLS23
Jian Xue, Peidong Wang, Jinyu Li, Eric Sun:
A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability. ASRU 2023: 1-7
[c166]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenKWWWYLSE23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenKWWWYLSE23
Zhuo Chen, Naoyuki Kanda, Jian Wu, Yu Wu, Xiaofei Wang, Takuya Yoshioka, Jinyu Li, Sunit Sivasankaran, Sefik Emre Eskimez:
Speech Separation with Large-Scale Self-Supervised Learning. ICASSP 2023: 1-5
[c165]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FanWGL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FanWGL23
Ruchao Fan, Yiming Wang, Yashesh Gaur, Jinyu Li:
CTCBERT: Advancing Hidden-Unit Bert with CTC Objectives. ICASSP 2023: 1-5
[c164]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/GongWLLZCQ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GongWLLZCQ23
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian:
LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer. ICASSP 2023: 1-5
[c163]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangCKWWLYWW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangCKWWLYWW23
Zili Huang, Zhuo Chen, Naoyuki Kanda, Jian Wu, Yiming Wang, Jinyu Li, Takuya Yoshioka, Xiaofei Wang, Peidong Wang:
Self-Supervised Learning with Bi-Label Masked Speech Prediction for Streaming Multi-Talker Speech Recognition. ICASSP 2023: 1-5
[c162]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KandaWWCLY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KandaWWCLY23
Naoyuki Kanda, Jian Wu, Xiaofei Wang, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Vararray Meets T-Sot: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition. ICASSP 2023: 1-5
[c161]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLLZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLLZ23
Xiaoqiang Wang, Yanqing Liu, Jinyu Li, Sheng Zhao:
Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation. ICASSP 2023: 1-5
[c160]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WeiZZCLHLW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WeiZZCLHLW23
Kun Wei, Long Zhou, Ziqiang Zhang, Liping Chen, Shujie Liu, Lei He, Jinyu Li, Furu Wei:
Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation. ICASSP 2023: 1-5
[c159]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WuCHXL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WuCHXL23
Jian Wu, Zhuo Chen, Min Hu, Xiong Xiao, Jinyu Li:
Speaker Change Detection For Transformer Transducer ASR. ICASSP 2023: 1-5
[c158]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YangKWWSCLY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YangKWWSCLY23
Muqiao Yang, Naoyuki Kanda, Xiaofei Wang, Jian Wu, Sunit Sivasankaran, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Simulating Realistic Speech Overlaps Improves Multi-Talker ASR. ICASSP 2023: 1-5
[c157]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoXPML23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoXPML23
Rui Zhao, Jian Xue, Partha Parthasarathy, Veljko Miljanic, Jinyu Li:
Fast and Accurate Factorized Neural Transducer for Text Adaption of End-to-End Speech Recognition Models. ICASSP 2023: 1-5
[c156]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangSX0ZG0023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangSX0ZG0023
Peidong Wang, Eric Sun, Jian Xue, Yu Wu, Long Zhou, Yashesh Gaur, Shujie Liu, Jinyu Li:
LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers. INTERSPEECH 2023: 57-61
[c155]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Li00023
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Li00023
Yuang Li, Yu Wu, Jinyu Li, Shujie Liu:
Accelerating Transducers through Adjacent Token Merging. INTERSPEECH 2023: 1379-1383
[i111]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2301-02111
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-02111
Chengyi Wang, Sanyuan Chen, Yu Wu, Ziqiang Zhang, Long Zhou, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei:
Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers. CoRR abs/2301.02111 (2023)
[i110]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-08549
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-08549
Jian Wu, Zhuo Chen, Min Hu, Xiong Xiao, Jinyu Li:
Speaker Change Detection for Transformer Transducer ASR. CoRR abs/2302.08549 (2023)
[i109]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2302-11192
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-11192
Xiaoqiang Wang, Yanqing Liu, Jinyu Li, Sheng Zhao:
Improving Contextual Spelling Correction by External Acoustics Attention and Semantic Aware Data Augmentation. CoRR abs/2302.11192 (2023)
[i108]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-00786
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-00786
Eric Sun, Jinyu Li, Yuxuan Hu, Yimeng Zhu, Long Zhou, Jian Xue, Peidong Wang, Linquan Liu, Shujie Liu, Edward Lin, Yifan Gong:
Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training. CoRR abs/2303.00786 (2023)
[i107]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2303-03926
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-03926
Ziqiang Zhang, Long Zhou, Chengyi Wang, Sanyuan Chen, Yu Wu, Shujie Liu, Zhuo Chen, Yanqing Liu, Huaming Wang, Jinyu Li, Lei He, Sheng Zhao, Furu Wei:
Speak Foreign Languages with Your Own Voice: Cross-Lingual Neural Codec Language Modeling. CoRR abs/2303.03926 (2023)
[i106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-16107
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-16107
Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei:
VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation. CoRR abs/2305.16107 (2023)
[i105]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-16007
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-16007
Yuang Li, Yu Wu, Jinyu Li, Shujie Liu:
Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition. CoRR abs/2306.16007 (2023)
[i104]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2306-16009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-16009
Yuang Li, Yu Wu, Jinyu Li, Shujie Liu:
Accelerating Transducers through Adjacent Token Merging. CoRR abs/2306.16009 (2023)
[i103]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-03354
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-03354
Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Jinyu Li, Yashesh Gaur:
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments. CoRR abs/2307.03354 (2023)
[i102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-03917
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-03917
Jian Wu, Yashesh Gaur, Zhuo Chen, Long Zhou, Yimeng Zhu, Tianrui Wang, Jinyu Li, Shujie Liu, Bo Ren, Linquan Liu, Yu Wu:
On decoder-only architecture for speech-to-text and large language model integration. CoRR abs/2307.03917 (2023)
[i101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-06873
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06873
Xiaofei Wang, Manthan Thakker, Zhuo Chen, Naoyuki Kanda, Sefik Emre Eskimez, Sanyuan Chen, Min Tang, Shujie Liu, Jinyu Li, Takuya Yoshioka:
SpeechX: Neural Codec Language Model as a Versatile Speech Transformer. CoRR abs/2308.06873 (2023)
[i100]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-08007
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08007
Mu Yang, Naoyuki Kanda, Xiaofei Wang, Junkun Chen, Peidong Wang, Jian Xue, Jinyu Li, Takuya Yoshioka:
DiariST: Streaming Speech Translation with Speaker Diarization. CoRR abs/2309.08007 (2023)
[i99]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-08131
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-08131
Jian Wu, Naoyuki Kanda, Takuya Yoshioka, Rui Zhao, Zhuo Chen, Jinyu Li:
t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability. CoRR abs/2309.08131 (2023)
[i98]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-02489
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-02489
Yiming Wang, Jinyu Li:
ResidualTransformer: Residual Low-rank Learning with Weight-sharing for Transformer Layers. CoRR abs/2310.02489 (2023)
[i97]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-04399
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-04399
Junkun Chen, Jian Xue, Peidong Wang, Jing Pan, Jinyu Li:
Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach. CoRR abs/2310.04399 (2023)
[i96]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2310-14806
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-14806
Sara Papi, Peidong Wang, Junkun Chen, Jian Xue, Naoyuki Kanda, Jinyu Li, Yashesh Gaur:
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation. CoRR abs/2310.14806 (2023)
[i95]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-02248
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-02248
Jing Pan, Jian Wu, Yashesh Gaur, Sunit Sivasankaran, Zhuo Chen, Shujie Liu, Jinyu Li:
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning. CoRR abs/2311.02248 (2023)
2022
[j15]
- view
  authority control:
- export record
  dblp key:
  - journals/jstsp/ChenWCWLCLKYXWZ22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/jstsp/ChenWCWLCLKYXWZ22
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Xiangzhan Yu, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1505-1518 (2022)
[j14]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangLLMZK22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangLLMZK22
Xiaoqiang Wang, Yanqing Liu, Jinyu Li, Veljko Miljanic, Sheng Zhao, Hosam Khalil:
Towards Contextual Spelling Correction for Customization of End-to-End Speech Recognition Systems. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3089-3097 (2022)
[c154]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/AoWZ0RW0KLZWQ0W22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/AoWZ0RW0KLZWQ0W22
Junyi Ao, Rui Wang, Long Zhou, Chengyi Wang, Shuo Ren, Yu Wu, Shujie Liu, Tom Ko, Qing Li, Yu Zhang, Zhihua Wei, Yao Qian, Jinyu Li, Furu Wei:
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing. ACL (1) 2022: 5723-5738
[c153]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/ZhangZA0D0W22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhangZA0D0W22
Ziqiang Zhang, Long Zhou, Junyi Ao, Shujie Liu, Lirong Dai, Jinyu Li, Furu Wei:
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training. EMNLP 2022: 1663-1676
[c152]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangCWYWML22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangCWYWML22
Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li:
Continuous Speech Separation with Recurrent Selective Attention Network. ICASSP 2022: 6017-6021
[c151]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangQWWWLYLW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangQWWWLYLW22
Heming Wang, Yao Qian, Xiaofei Wang, Yiming Wang, Chengyi Wang, Shujie Liu, Takuya Yoshioka, Jinyu Li, DeLiang Wang:
Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction. ICASSP 2022: 6062-6066
[c150]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenWWCCLWQWLY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenWWCCLWQWLY22
Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu:
Unispeech-Sat: Universal Speech Representation Learning With Speaker Aware Pre-Training. ICASSP 2022: 6152-6156
[c149]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhouLSL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhouLSL22
Long Zhou, Jinyu Li, Eric Sun, Shujie Liu:
A Configurable Multilingual Model is All You Need to Recognize All Languages. ICASSP 2022: 6422-6426
[c148]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangWCLLQY22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangWCLLQY22
Chengyi Wang, Yu Wu, Sanyuan Chen, Shujie Liu, Jinyu Li, Yao Qian, Zhenglu Yang:
Improving Self-Supervised Learning for Speech Recognition with Intermediate Layer Supervision. ICASSP 2022: 7092-7096
[c147]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLWQWW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLWQWW22
Yiming Wang, Jinyu Li, Heming Wang, Yao Qian, Chengyi Wang, Yu Wu:
Wav2vec-Switch: Contrastive Learning from Original-Noisy Speech Pairs for Robust Speech Recognition. ICASSP 2022: 7097-7101
[c146]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuLG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuLG22
Liang Lu, Jinyu Li, Yifan Gong:
Endpoint Detection for Streaming End-to-End Multi-Talker ASR. ICASSP 2022: 7312-7316
[c145]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/RajLCGL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/RajLCGL22
Desh Raj, Liang Lu, Zhuo Chen, Yashesh Gaur, Jinyu Li:
Continuous Streaming Multi-Talker ASR with Dual-Path Transducers. ICASSP 2022: 7317-7321
[c144]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YeMLG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YeMLG22
Guoli Ye, Vadim Mazalov, Jinyu Li, Yifan Gong:
Have Best of Both Worlds: Two-Pass Hybrid and E2E Cascading Framework for Speech Recognition. ICASSP 2022: 7432-7436
[c143]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenMPL22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenMPL22
Xie Chen, Zhong Meng, Sarangarajan Parthasarathy, Jinyu Li:
Factorized Neural Transducer for Efficient Language Model Adaptation. ICASSP 2022: 8132-8136
[c142]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Kanda0WXMWG00Y22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kanda0WXMWG00Y22
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. INTERSPEECH 2022: 521-525
[c141]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengGK0CW022
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengGK0CW022
Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li, Xie Chen, Yu Wu, Yifan Gong:
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition. INTERSPEECH 2022: 2608-2612
[c140]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangWWC00W22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangWWC00W22
Chengyi Wang, Yiming Wang, Yu Wu, Sanyuan Chen, Jinyu Li, Shujie Liu, Furu Wei:
Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training. INTERSPEECH 2022: 2643-2647
[c139]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AoZZ00K00QW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AoZZ00K00QW22
Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. INTERSPEECH 2022: 2658-2662
[c138]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XueW0PG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XueW0PG22
Jian Xue, Peidong Wang, Jinyu Li, Matt Post, Yashesh Gaur:
Large-Scale Streaming End-to-End Speech Translation with Neural Transducers. INTERSPEECH 2022: 3263-3267
[c137]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Chen0000WL00YW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Chen0000WL00YW22
Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Zhuo Chen, Peidong Wang, Gang Liu, Jinyu Li, Jian Wu, Xiangzhan Yu, Furu Wei:
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? INTERSPEECH 2022: 3699-3703
[c136]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KandaWWXMWGC0Y22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KandaWWXMWGC0Y22
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. INTERSPEECH 2022: 3774-3778
[c135]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Zhang0K00EYXMQW22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Zhang0K00EYXMQW22
Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-wise Permutation Invariant Training. INTERSPEECH 2022: 5383-5387
[c134]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/GaurKXSWALG22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/GaurKXSWALG22
Yashesh Gaur, Nick Kibre, Jian Xue, Kangyuan Shu, Yuhui Wang, Issac Alphanso, Jinyu Li, Yifan Gong:
Streaming, Fast and Accurate on-Device Inverse Text Normalization for Automatic Speech Recognition. SLT 2022: 237-244
[i94]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2201-09979
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-09979
Liang Lu, Jinyu Li, Yifan Gong:
Endpoint Detection for Streaming End-to-End Multi-talker ASR. CoRR abs/2201.09979 (2022)
[i93]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2202-00842
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-00842
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Multi-Talker ASR with Token-Level Serialized Output Training. CoRR abs/2202.00842 (2022)
[i92]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-00888
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-00888
Xiaoqiang Wang, Yanqing Liu, Jinyu Li, Veljko Miljanic, Sheng Zhao, Hosam Khalil:
Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems. CoRR abs/2203.00888 (2022)
[i91]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-16685
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-16685
Naoyuki Kanda, Jian Wu, Yu Wu, Xiong Xiao, Zhong Meng, Xiaofei Wang, Yashesh Gaur, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings. CoRR abs/2203.16685 (2022)
[i90]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2203-17113
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-17113
Junyi Ao, Ziqiang Zhang, Long Zhou, Shujie Liu, Haizhou Li, Tom Ko, Lirong Dai, Jinyu Li, Yao Qian, Furu Wei:
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data. CoRR abs/2203.17113 (2022)
[i89]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-05352
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-05352
Jian Xue, Peidong Wang, Jinyu Li, Matt Post, Yashesh Gaur:
Large-Scale Streaming End-to-End Speech Translation with Neural Transducers. CoRR abs/2204.05352 (2022)
[i88]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-12765
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-12765
Sanyuan Chen, Yu Wu, Chengyi Wang, Shujie Liu, Zhuo Chen, Peidong Wang, Gang Liu, Jinyu Li, Jian Wu, Xiangzhan Yu, Furu Wei:
Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition? CoRR abs/2204.12765 (2022)
[i87]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2204-12777
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-12777
Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Takuya Yoshioka, Shujie Liu, Jinyu Li, Xiangzhan Yu:
Ultra Fast Speech Separation Model with Teacher Student Learning. CoRR abs/2204.12777 (2022)
[i86]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-05777
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-05777
Ziqiang Zhang, Junyi Ao, Long Zhou, Shujie Liu, Furu Wei, Jinyu Li:
The YiTrans End-to-End Speech Translation System for IWSLT 2022 Offline Shared Task. CoRR abs/2206.05777 (2022)
[i85]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2206-10125
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-10125
Chengyi Wang, Yiming Wang, Yu Wu, Sanyuan Chen, Jinyu Li, Shujie Liu, Furu Wei:
Supervision-Guided Codebooks for Masked Prediction in Speech Pre-training. CoRR abs/2206.10125 (2022)
[i84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-04974
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-04974
Naoyuki Kanda, Jian Wu, Xiaofei Wang, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition. CoRR abs/2209.04974 (2022)
[i83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-03730
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-03730
Ziqiang Zhang, Long Zhou, Junyi Ao, Shujie Liu, Lirong Dai, Jinyu Li, Furu Wei:
SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training. CoRR abs/2210.03730 (2022)
[i82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-08603
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-08603
Ruchao Fan, Yiming Wang, Yashesh Gaur, Jinyu Li:
CTCBERT: Advancing Hidden-unit BERT with CTC Objectives. CoRR abs/2210.08603 (2022)
[i81]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-08665
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-08665
Ruchao Fan, Guoli Ye, Yashesh Gaur, Jinyu Li:
Acoustic-aware Non-autoregressive Spell Correction with Mask Sample Decoding. CoRR abs/2210.08665 (2022)
[i80]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-15715
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-15715
Muqiao Yang, Naoyuki Kanda, Xiaofei Wang, Jian Wu, Sunit Sivasankaran, Zhuo Chen, Jinyu Li, Takuya Yoshioka:
Simulating realistic speech overlaps improves multi-talker ASR. CoRR abs/2210.15715 (2022)
[i79]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2210-17027
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-17027
Kun Wei, Long Zhou, Ziqiang Zhang, Liping Chen, Shujie Liu, Lei He, Jinyu Li, Furu Wei:
Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation. CoRR abs/2210.17027 (2022)
[i78]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-02499
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-02499
Jian Xue, Peidong Wang, Jinyu Li, Eric Sun:
A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability. CoRR abs/2211.02499 (2022)
[i77]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-02809
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-02809
Peidong Wang, Eric Sun, Jian Xue, Yu Wu, Long Zhou, Yashesh Gaur, Shujie Liu, Jinyu Li:
LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers. CoRR abs/2211.02809 (2022)
[i76]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-03721
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-03721
Yashesh Gaur, Nick Kibre, Jian Xue, Kangyuan Shu, Yuhui Wang, Issac Alphonso, Jinyu Li, Yifan Gong:
Streaming, fast and accurate on-device Inverse Text Normalization for Automatic Speech Recognition. CoRR abs/2211.03721 (2022)
[i75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-05172
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-05172
Zhuo Chen, Naoyuki Kanda, Jian Wu, Yu Wu, Xiaofei Wang, Takuya Yoshioka, Jinyu Li, Sunit Sivasankaran, Sefik Emre Eskimez:
Speech separation with large-scale self-supervised learning. CoRR abs/2211.05172 (2022)
[i74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-05564
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-05564
Zili Huang, Zhuo Chen, Naoyuki Kanda, Jian Wu, Yiming Wang, Jinyu Li, Takuya Yoshioka, Xiaofei Wang, Peidong Wang:
Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition. CoRR abs/2211.05564 (2022)
[i73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-09412
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-09412
Xun Gong, Yu Wu, Jinyu Li, Shujie Liu, Rui Zhao, Xie Chen, Yanmin Qian:
LongFNT: Long-form Speech Recognition with Factorized Neural Transducer. CoRR abs/2211.09412 (2022)
[i72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-11275
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-11275
Qiu-Shi Zhu, Long Zhou, Ziqiang Zhang, Shujie Liu, Binxing Jiao, Jie Zhang, Lirong Dai, Daxin Jiang, Jinyu Li, Furu Wei:
VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning. CoRR abs/2211.11275 (2022)
[i71]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2212-01992
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-01992
Rui Zhao, Jian Xue, Partha Parthasarathy, Veljko Miljanic, Jinyu Li:
Fast and accurate factorized neural transducer for text adaption of end-to-end speech recognition models. CoRR abs/2212.01992 (2022)
2021
[j13]
- view
  authority control:
- export record
  dblp key:
  - journals/spl/LuKLG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/spl/LuKLG21
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong:
Streaming End-to-End Multi-Talker Speech Recognition. IEEE Signal Process. Lett. 28: 803-807 (2021)
[j12]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/WangCWLG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/WangCWLG21
Peidong Wang, Zhuo Chen, DeLiang Wang, Jinyu Li, Yifan Gong:
Speaker Separation Using Speaker Inventories and Estimated Speech. IEEE ACM Trans. Audio Speech Lang. Process. 29: 537-546 (2021)
[c133]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhaoXLWHG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhaoXLWHG21
Rui Zhao, Jian Xue, Jinyu Li, Wenning Wei, Lei He, Yifan Gong:
On Addressing Practical Challenges for RNN-Transducer. ASRU 2021: 526-533
[c132]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenWCW0Y00021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenWCW0Y00021
Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Jinyu Li, Takuya Yoshioka, Chengyi Wang, Shujie Liu, Ming Zhou:
Continuous Speech Separation with Conformer. ICASSP 2021: 5749-5753
[c131]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoK0ZYC0L0W0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoK0ZYC0L0W0021
Xiong Xiao, Naoyuki Kanda, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka, Sanyuan Chen, Yong Zhao, Gang Liu, Yu Wu, Jian Wu, Shujie Liu, Jinyu Li, Yifan Gong:
Microsoft Speaker Diarization System for the Voxceleb Speaker Recognition Challenge 2020. ICASSP 2021: 5824-5828
[c130]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Chen0W0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Chen0W0021
Xie Chen, Yu Wu, Zhenghao Wang, Shujie Liu, Jinyu Li:
Developing Real-Time Streaming Transformer Transducer for Speech Recognition on Large-Scale Dataset. ICASSP 2021: 5904-5908
[c129]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Chen00Y00Y21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Chen00Y00Y21
Sanyuan Chen, Yu Wu, Zhuo Chen, Takuya Yoshioka, Shujie Liu, Jin-Yu Li, Xiangzhan Yu:
Don't Shoot Butterfly with Rifles: Multi-Channel Continuous Speech Separation with Early Exit Transformer. ICASSP 2021: 6139-6143
[c128]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WongDKGPPS0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WongDKGPPS0021
Jeremy Heng Meng Wong, Dimitrios Dimitriadis, Ken'ichi Kumatani, Yashesh Gaur, George Polovets, Partha Parthasarathy, Eric Sun, Jinyu Li, Yifan Gong:
Ensemble Combination between Different Time Segmentations. ICASSP 2021: 6768-6772
[c127]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengKGPSLC0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengKGPSLC0021
Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong:
Internal Language Model Training for Domain-Adaptive End-To-End Speech Recognition. ICASSP 2021: 7338-7342
[c126]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/DengZMCLL0H21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/DengZMCLL0H21
Yan Deng, Rui Zhao, Zhong Meng, Xie Chen, Bing Liu, Jinyu Li, Yifan Gong, Lei He:
Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS. Interspeech 2021: 751-755
[c125]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0028Y0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0028Y0021
Yan Huang, Guoli Ye, Jinyu Li, Yifan Gong:
Rapid Speaker Adaptation for Conformer Transducer: Attention and Bias Are All You Need. Interspeech 2021: 1309-1313
[c124]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JoshiDSM0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JoshiDSM0021
Vikas Joshi, Amit Das, Eric Sun, Rupesh R. Mehta, Jinyu Li, Yifan Gong:
Multiple Softmax Architecture for Streaming Multilingual End-to-End ASR Systems. Interspeech 2021: 1767-1771
[c123]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001K0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001K0021
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong:
Streaming Multi-Talker Speech Recognition with Joint Speaker Identification. Interspeech 2021: 1782-1786
[c122]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WangLZ021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WangLZ021
Xiaoqiang Wang, Yanqing Liu, Sheng Zhao, Jinyu Li:
A Light-Weight Contextual Spelling Correction Model for Customizing Transducer-Based Speech Recognition Systems. Interspeech 2021: 1982-1986
[c121]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Meng0K0CYSL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Meng0K0CYSL021
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong:
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition. Interspeech 2021: 2596-2600
[c120]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenWCWY00Y21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenWCWY00Y21
Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Takuya Yoshioka, Shujie Liu, Jinyu Li, Xiangzhan Yu:
Ultra Fast Speech Separation Model with Teacher Student Learning. Interspeech 2021: 3026-3030
[c119]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuCCWYK0L21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuCCWYK0L21
Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li:
Investigation of Practical Aspects of Single Channel Speech Separation for ASR. Interspeech 2021: 3066-3070
[c118]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0001MKL021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0001MKL021
Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong:
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer. Interspeech 2021: 3435-3439
[c117]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Sun0MWX0021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Sun0MWX0021
Eric Sun, Jinyu Li, Zhong Meng, Yu Wu, Jian Xue, Shujie Liu, Yifan Gong:
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions. Interspeech 2021: 3470-3474
[c116]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/MengPSGKLCZLG21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/MengPSGKLCZLG21
Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong:
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition. SLT 2021: 243-250
[c115]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/GhorbaniGSL21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/GhorbaniGSL21
Shahram Ghorbani, Yashesh Gaur, Yu Shi, Jinyu Li:
Listen, Look and Deliberate: Visual Context-Aware Speech Recognition Using Pre-Trained Text-Video Representations. SLT 2021: 621-628
[c114]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LiLHLYZDKBQ0C21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiLHLYZDKBQ0C21
Chenda Li, Yi Luo, Cong Han, Jinyu Li, Takuya Yoshioka, Tianyan Zhou, Marc Delcroix, Keisuke Kinoshita, Christoph Böddeker, Yanmin Qian, Shinji Watanabe, Zhuo Chen:
Dual-Path RNN for Long Recording Speech Separation. SLT 2021: 865-872
[c113]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/RajDCEHH0DYLKLW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/RajDCEHH0DYLKLW21
Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Maokui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis. SLT 2021: 897-904
[i70]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2102-01380
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-01380
Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong:
Internal Language Model Training for Domain-Adaptive End-to-End Speech Recognition. CoRR abs/2102.01380 (2021)
[i69]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2104-02109
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-02109
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong:
Streaming Multi-talker Speech Recognition with Joint Speaker Identification. CoRR abs/2104.02109 (2021)
[i68]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2105-00858
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-00858
Rui Zhao, Jian Xue, Jinyu Li, Wenning Wei, Lei He, Yifan Gong:
On Addressing Practical Challenges for RNN-Transducer. CoRR abs/2105.00858 (2021)
[i67]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2106-02302
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-02302
Zhong Meng, Yu Wu, Naoyuki Kanda, Liang Lu, Xie Chen, Guoli Ye, Eric Sun, Jinyu Li, Yifan Gong:
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition. CoRR abs/2106.02302 (2021)
[i66]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-01922
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-01922
Jian Wu, Zhuo Chen, Sanyuan Chen, Yu Wu, Takuya Yoshioka, Naoyuki Kanda, Shujie Liu, Jinyu Li:
Investigation of Practical Aspects of Single Channel Speech Separation for ASR. CoRR abs/2107.01922 (2021)
[i65]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2107-05876
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-05876
Long Zhou, Jinyu Li, Eric Sun, Shujie Liu:
A Configurable Multilingual Model is All You Need to Recognize All Languages. CoRR abs/2107.05876 (2021)
[i64]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2108-07493
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-07493
Xiaoqiang Wang, Yanqing Liu, Sheng Zhao, Jinyu Li:
A Light-weight contextual spelling correction model for customizing transducer-based speech recognition systems. CoRR abs/2108.07493 (2021)
[i63]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2109-08555
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-08555
Desh Raj, Liang Lu, Zhuo Chen, Yashesh Gaur, Jinyu Li:
Continuous Streaming Multi-Talker ASR with Dual-path Transducers. CoRR abs/2109.08555 (2021)
[i62]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-01500
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-01500
Xie Chen, Zhong Meng, Sarangarajan Parthasarathy, Jinyu Li:
Factorized Neural Transducer for Efficient Language Model Adaptation. CoRR abs/2110.01500 (2021)
[i61]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-04891
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04891
Guoli Ye, Vadim Mazalov, Jinyu Li, Yifan Gong:
Have best of both worlds: two-pass hybrid and E2E cascading framework for speech recognition. CoRR abs/2110.04891 (2021)
[i60]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-04934
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-04934
Yiming Wang, Jinyu Li, Heming Wang, Yao Qian, Chengyi Wang, Yu Wu:
Wav2vec-Switch: Contrastive Learning from Original-noisy Speech Pairs for Robust Speech Recognition. CoRR abs/2110.04934 (2021)
[i59]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05354
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05354
Zhong Meng, Yashesh Gaur, Naoyuki Kanda, Jinyu Li, Xie Chen, Yu Wu, Yifan Gong:
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition. CoRR abs/2110.05354 (2021)
[i58]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-05752
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-05752
Sanyuan Chen, Yu Wu, Chengyi Wang, Zhengyang Chen, Zhuo Chen, Shujie Liu, Jian Wu, Yao Qian, Furu Wei, Jinyu Li, Xiangzhan Yu:
UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training. CoRR abs/2110.05752 (2021)
[i57]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-07205
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-07205
Junyi Ao, Rui Wang, Long Zhou, Shujie Liu, Shuo Ren, Yu Wu, Tom Ko, Qing Li, Yu Zhang, Zhihua Wei, Yao Qian, Jinyu Li, Furu Wei:
SpeechT5: Unified-Modal Encoder-Decoder Pre-training for Spoken Language Processing. CoRR abs/2110.07205 (2021)
[i56]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-13900
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-13900
Sanyuan Chen, Chengyi Wang, Zhengyang Chen, Yu Wu, Shujie Liu, Zhuo Chen, Jinyu Li, Naoyuki Kanda, Takuya Yoshioka, Xiong Xiao, Jian Wu, Long Zhou, Shuo Ren, Yanmin Qian, Yao Qian, Jian Wu, Michael Zeng, Furu Wei:
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing. CoRR abs/2110.13900 (2021)
[i55]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-14142
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-14142
Wangyou Zhang, Zhuo Chen, Naoyuki Kanda, Shujie Liu, Jinyu Li, Sefik Emre Eskimez, Takuya Yoshioka, Xiong Xiao, Zhong Meng, Yanmin Qian, Furu Wei:
Separating Long-Form Speech with Group-Wise Permutation Invariant Training. CoRR abs/2110.14142 (2021)
[i54]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-14838
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-14838
Yixuan Zhang, Zhuo Chen, Jian Wu, Takuya Yoshioka, Peidong Wang, Zhong Meng, Jinyu Li:
Continuous Speech Separation with Recurrent Selective Attention Network. CoRR abs/2110.14838 (2021)
[i53]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2110-15430
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-15430
Heming Wang, Yao Qian, Xiaofei Wang, Yiming Wang, Chengyi Wang, Shujie Liu, Takuya Yoshioka, Jinyu Li, DeLiang Wang:
Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction. CoRR abs/2110.15430 (2021)
[i52]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2111-01690
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-01690
Jinyu Li:
Recent Advances in End-to-End Automatic Speech Recognition. CoRR abs/2111.01690 (2021)
[i51]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-05826
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-05826
Ken'ichi Kumatani, Dimitrios Dimitriadis, Yashesh Gaur, Robert Gmyr, Sefik Emre Eskimez, Jinyu Li, Michael Zeng:
Sequence-level self-learning with multiple hypotheses. CoRR abs/2112.05826 (2021)
[i50]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2112-08778
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2112-08778
Chengyi Wang, Yu Wu, Sanyuan Chen, Shujie Liu, Jinyu Li, Yao Qian, Zhenglu Yang:
Self-Supervised Learning for speech recognition with Intermediate layer supervision. CoRR abs/2112.08778 (2021)
2020
[c112]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/InagumaGLLG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/InagumaGLLG20
Hirofumi Inaguma, Yashesh Gaur, Liang Lu, Jinyu Li, Yifan Gong:
Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR. ICASSP 2020: 6064-6068
[c111]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuZLLG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuZLLG20
Hu Hu, Rui Zhao, Jinyu Li, Liang Lu, Yifan Gong:
Exploring Pre-Training with Alignments for RNN Transducer Based End-to-End Speech Recognition. ICASSP 2020: 7079-7083
[c110]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenYLZMLWXL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenYLZMLWXL20
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Xiong Xiao, Jinyu Li:
Continuous Speech Separation: Dataset and Analysis. ICASSP 2020: 7284-7288
[c109]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengHLL0GL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengHLL0GL20
Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee:
L-Vector: Neural Label Embedding for Domain Adaptation. ICASSP 2020: 7389-7393
[c108]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0028HWGLG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0028HWGLG20
Yan Huang, Lei He, Wenning Wei, William Gale, Jinyu Li, Yifan Gong:
Using Personalized Speech Synthesis and Neural Language Generator for Rapid Speaker Adaptation. ICASSP 2020: 7399-7403
[c107]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiZSWDMG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiZSWDMG20
Jinyu Li, Rui Zhao, Eric Sun, Jeremy Heng Meng Wong, Amit Das, Zhong Meng, Yifan Gong:
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model. ICASSP 2020: 7699-7703
[c106]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Li0G0Z020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Li0G0Z020
Jinyu Li, Yu Wu, Yashesh Gaur, Chengyi Wang, Rui Zhao, Shujie Liu:
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition. INTERSPEECH 2020: 1-5
[c105]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuCLYTLLX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuCLYTLLX20
Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie:
An End-to-End Architecture of Online Multi-Channel Speech Separation. INTERSPEECH 2020: 81-85
[c104]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/00020DL0LRYZ020
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/00020DL0LRYZ020
Chengyi Wang, Yu Wu, Yujiao Du, Jinyu Li, Shujie Liu, Liang Lu, Shuo Ren, Guoli Ye, Sheng Zhao, Ming Zhou:
Semantic Mask for Transformer Based End-to-End Speech Recognition. INTERSPEECH 2020: 971-975
[c103]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0028LHWGG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0028LHWGG20
Yan Huang, Jinyu Li, Lei He, Wenning Wei, William Gale, Yifan Gong:
Rapid RNN-T Adaptation Using Personalized Speech Synthesis and Neural Language Generator. INTERSPEECH 2020: 1256-1260
[c102]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WongGZLSLG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WongGZLSLG20
Jeremy Heng Meng Wong, Yashesh Gaur, Rui Zhao, Liang Lu, Eric Sun, Jinyu Li, Yifan Gong:
Combination of End-to-End and Hybrid Models for Speech Recognition. INTERSPEECH 2020: 1783-1787
[c101]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/00020L0LYZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/00020L0LYZ20
Chengyi Wang, Yu Wu, Liang Lu, Shujie Liu, Jinyu Li, Guoli Ye, Ming Zhou:
Low Latency End-to-End Streaming Speech Recognition with a Scout Network. INTERSPEECH 2020: 2112-2116
[c100]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/JoshiZMKL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/JoshiZMKL20
Vikas Joshi, Rui Zhao, Rupesh R. Mehta, Kshitiz Kumar, Jinyu Li:
Transfer Learning Approaches for Streaming End-to-End Speech Recognition System. INTERSPEECH 2020: 2152-2156
[c99]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiZMLWPMWHZG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiZMLWPMWHZG20
Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong:
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability. INTERSPEECH 2020: 3590-3594
[c98]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KumataniDGGELZ20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KumataniDGGELZ20
Ken'ichi Kumatani, Dimitrios Dimitriadis, Yashesh Gaur, Robert Gmyr, Sefik Emre Eskimez, Jinyu Li, Michael Zeng:
Sequence-Level Self-Learning with Multiple Hypotheses. INTERSPEECH 2020: 3775-3779
[c97]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuLLG20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuLLG20
Liang Lu, Changliang Liu, Jinyu Li, Yifan Gong:
Exploring Transformers for Large-Scale Speech Recognition. INTERSPEECH 2020: 5041-5045
[i49]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-01795
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-01795
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Character-Aware Attention-Based End-to-End Speech Recognition. CoRR abs/2001.01795 (2020)
[i48]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-01798
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-01798
Zhong Meng, Jinyu Li, Yashesh Gaur, Yifan Gong:
Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition. CoRR abs/2001.01798 (2020)
[i47]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2001-11482
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-11482
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li:
Continuous speech separation: dataset and analysis. CoRR abs/2001.11482 (2020)
[i46]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2003-07482
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-07482
Jinyu Li, Rui Zhao, Eric Sun, Jeremy Heng Meng Wong, Amit Das, Zhong Meng, Yifan Gong:
High-Accuracy and Low-Latency Speech Recognition with Two-Head Contextual Layer Trajectory LSTM Model. CoRR abs/2003.07482 (2020)
[i45]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-05009
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-05009
Hirofumi Inaguma, Yashesh Gaur, Liang Lu, Jinyu Li, Yifan Gong:
Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR. CoRR abs/2004.05009 (2020)
[i44]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2004-13480
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-13480
Zhong Meng, Hu Hu, Jinyu Li, Changliang Liu, Yan Huang, Yifan Gong, Chin-Hui Lee:
L-Vector: Neural Label Embedding for Domain Adaptation. CoRR abs/2004.13480 (2020)
[i43]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-00572
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-00572
Hu Hu, Rui Zhao, Jinyu Li, Liang Lu, Yifan Gong:
Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition. CoRR abs/2005.00572 (2020)
[i42]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-09684
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-09684
Liang Lu, Changliang Liu, Jinyu Li, Yifan Gong:
Exploring Transformers for Large-Scale Speech Recognition. CoRR abs/2005.09684 (2020)
[i41]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2005-14327
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-14327
Jinyu Li, Yu Wu, Yashesh Gaur, Chengyi Wang, Rui Zhao, Shujie Liu:
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition. CoRR abs/2005.14327 (2020)
[i40]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2007-15188
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-15188
Jinyu Li, Rui Zhao, Zhong Meng, Yanqing Liu, Wenning Wei, Sarangarajan Parthasarathy, Vadim Mazalov, Zhenghao Wang, Lei He, Sheng Zhao, Yifan Gong:
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability. CoRR abs/2007.15188 (2020)
[i39]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-05086
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-05086
Vikas Joshi, Rui Zhao, Rupesh R. Mehta, Kshitiz Kumar, Jinyu Li:
Transfer Learning Approaches for Streaming End-to-End Speech Recognition System. CoRR abs/2008.05086 (2020)
[i38]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-05773
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-05773
Sanyuan Chen, Yu Wu, Zhuo Chen, Jinyu Li, Chengyi Wang, Shujie Liu, Ming Zhou:
Continuous Speech Separation with Conformer. CoRR abs/2008.05773 (2020)
[i37]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2008-06580
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-06580
Peter Bell, Joachim Fainberg, Ondrej Klejch, Jinyu Li, Steve Renals, Pawel Swietojanski:
Adaptation Algorithms for Speech Recognition: An Overview. CoRR abs/2008.06580 (2020)
[i36]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2009-03141
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-03141
Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie:
An End-to-end Architecture of Online Multi-channel Speech Separation. CoRR abs/2009.03141 (2020)
[i35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-10556
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-10556
Peidong Wang, Zhuo Chen, DeLiang Wang, Jinyu Li, Yifan Gong:
Speaker Separation Using Speaker Inventories and Estimated Speech. CoRR abs/2010.10556 (2020)
[i34]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11395
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11395
Xie Chen, Yu Wu, Zhenghao Wang, Shujie Liu, Jinyu Li:
Developing Real-time Streaming Transformer Transducer for Speech Recognition on Large-scale Dataset. CoRR abs/2010.11395 (2020)
[i33]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-11458
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-11458
Xiong Xiao, Naoyuki Kanda, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka, Sanyuan Chen, Yong Zhao, Gang Liu, Yu Wu, Jian Wu, Shujie Liu, Jinyu Li, Yifan Gong:
Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020. CoRR abs/2010.11458 (2020)
[i32]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-12180
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-12180
Sanyuan Chen, Yu Wu, Zhuo Chen, Takuya Yoshioka, Shujie Liu, Jinyu Li:
Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer. CoRR abs/2010.12180 (2020)
[i31]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2010-12673
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-12673
Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong:
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer. CoRR abs/2010.12673 (2020)
[i30]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-01991
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-01991
Zhong Meng, Sarangarajan Parthasarathy, Eric Sun, Yashesh Gaur, Naoyuki Kanda, Liang Lu, Xie Chen, Rui Zhao, Jinyu Li, Yifan Gong:
Internal Language Model Estimation for Domain-Adaptive End-to-End Speech Recognition. CoRR abs/2011.01991 (2020)
[i29]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02014
Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Mao-Kui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis. CoRR abs/2011.02014 (2020)
[i28]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-04084
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-04084
Shahram Ghorbani, Yashesh Gaur, Yu Shi, Jinyu Li:
Listen, Look and Deliberate: Visual context-aware speech recognition using pre-trained text-video representations. CoRR abs/2011.04084 (2020)
[i27]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-2011-13148
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-13148
Liang Lu, Naoyuki Kanda, Jinyu Li, Yifan Gong:
Streaming end-to-end multi-talker speech recognition. CoRR abs/2011.13148 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j11]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/DasLYZG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/DasLYZG19
Amit Das, Jinyu Li, Guoli Ye, Rui Zhao, Yifan Gong:
Advancing Acoustic-to-Word CTC Model With Attention and Mixed-Units. IEEE ACM Trans. Audio Speech Lang. Process. 27(12): 1880-1892 (2019)
[c96]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiZHG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiZHG19
Jinyu Li, Rui Zhao, Hu Hu, Yifan Gong:
Improving RNN Transducer Modeling for End-to-End Speech Recognition. ASRU 2019: 114-121
[c95]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/WangCXMYZLL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/WangCXMYZLL19
Peidong Wang, Zhuo Chen, Xiong Xiao, Zhong Meng, Takuya Yoshioka, Tianyan Zhou, Liang Lu, Jinyu Li:
Speech Separation Using Speaker Inventory. ASRU 2019: 230-236
[c94]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MengLGG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MengLGG19
Zhong Meng, Jinyu Li, Yashesh Gaur, Yifan Gong:
Domain Adaptation via Teacher-Student Learning for End-to-End Speech Recognition. ASRU 2019: 268-275
[c93]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ZhouZLGW19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ZhouZLGW19
Tianyan Zhou, Yong Zhao, Jinyu Li, Yifan Gong, Jian Wu:
CNN with Phonetic Attention for Text-Independent Speaker Verification. ASRU 2019: 718-725
[c92]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MengGLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MengGLG19
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Character-Aware Attention-Based End-to-End Speech Recognition. ASRU 2019: 949-955
[c91]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DasLLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DasLLG19
Amit Das, Jinyu Li, Changliang Liu, Yifan Gong:
Universal Acoustic Modeling Using Neural Mixture Models. ICASSP 2019: 5681-5685
[c90]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengLG19
Zhong Meng, Jinyu Li, Yifan Gong:
Adversarial Speaker Adaptation. ICASSP 2019: 5721-5725
[c89]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiLYZG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiLYZG19
Ke Li, Jinyu Li, Guoli Ye, Rui Zhao, Yifan Gong:
Towards Code-switching ASR for End-to-end CTC Models. ICASSP 2019: 6076-6080
[c88]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengZLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengZLG19
Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong:
Adversarial Speaker Verification. ICASSP 2019: 6216-6220
[c87]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengLZG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengLZG19
Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong:
Conditional Teacher-student Learning. ICASSP 2019: 6445-6449
[c86]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiLLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiLLG19
Jinyu Li, Liang Lu, Changliang Liu, Yifan Gong:
Improving Layer Trajectory LSTM with Future Context Frames. ICASSP 2019: 6550-6554
[c85]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengLG19a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengLG19a
Zhong Meng, Jinyu Li, Yifan Gong:
Attentive Adversarial Learning for Domain-invariant Training. ICASSP 2019: 6740-6744
[c84]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengGLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengGLG19
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Speaker Adaptation for Attention-Based End-to-End Speech Recognition. INTERSPEECH 2019: 241-245
[c83]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SunLG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SunLG19
Eric Sun, Jinyu Li, Yifan Gong:
Layer Trajectory BLSTM. INTERSPEECH 2019: 1403-1407
[c82]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GaurLMG19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GaurLMG19
Yashesh Gaur, Jinyu Li, Zhong Meng, Yifan Gong:
Acoustic-to-Phrase Models for Speech Recognition. INTERSPEECH 2019: 2240-2244
[i26]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1901-01239
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1901-01239
Ke Li, Jinyu Li, Yong Zhao, Kshitiz Kumar, Yifan Gong:
Speaker Adaptation for End-to-End CTC Models. CoRR abs/1901.01239 (2019)
[i25]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-12399
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-12399
Zhong Meng, Jinyu Li, Yong Zhao, Yifan Gong:
Conditional Teacher-Student Learning. CoRR abs/1904.12399 (2019)
[i24]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-12400
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-12400
Zhong Meng, Jinyu Li, Yifan Gong:
Attentive Adversarial Learning for Domain-Invariant Training. CoRR abs/1904.12400 (2019)
[i23]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-12406
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-12406
Zhong Meng, Yong Zhao, Jinyu Li, Yifan Gong:
Adversarial Speaker Verification. CoRR abs/1904.12406 (2019)
[i22]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1904-12407
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-12407
Zhong Meng, Jinyu Li, Yifan Gong:
Adversarial Speaker Adaptation. CoRR abs/1904.12407 (2019)
[i21]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1909-12415
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-12415
Jinyu Li, Rui Zhao, Hu Hu, Yifan Gong:
Improving RNN Transducer Modeling for End-to-End Speech Recognition. CoRR abs/1909.12415 (2019)
[i20]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1911-03762
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-03762
Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong:
Speaker Adaptation for Attention-Based End-to-End Speech Recognition. CoRR abs/1911.03762 (2019)
[i19]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1912-03010
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-03010
Chengyi Wang, Yu Wu, Yujiao Du, Jinyu Li, Shujie Liu, Liang Lu, Shuo Ren, Guoli Ye, Sheng Zhao, Ming Zhou:
Semantic Mask for Transformer based End-to-End Speech Recognition. CoRR abs/1912.03010 (2019)
2018
[j10]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/ChenDLX18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/ChenDLX18
Zhehuai Chen, Jasha Droppo, Jinyu Li, Wayne Xiong:
Progressive Joint Modeling in Unsupervised Single-Channel Overlapped Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 26(1): 184-196 (2018)
[c81]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DasLZG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DasLZG18
Amit Das, Jinyu Li, Rui Zhao, Yifan Gong:
Advancing Connectionist Temporal Classification with Attention Modeling. ICASSP 2018: 4769-4773
[c80]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiZCLXYG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiZCLXYG18
Jinyu Li, Rui Zhao, Zhuo Chen, Changliang Liu, Xiong Xiao, Guoli Ye, Yifan Gong:
Developing Far-Field Speaker System Via Teacher-Student Learning. ICASSP 2018: 5699-5703
[c79]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiYDZG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiYDZG18
Jinyu Li, Guoli Ye, Amit Das, Rui Zhao, Yifan Gong:
Advancing Acoustic-to-Word CTC Model. ICASSP 2018: 5794-5798
[c78]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengLGJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengLGJ18
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation. ICASSP 2018: 5949-5953
[c77]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MengLCZMGJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MengLCZMGJ18
Zhong Meng, Jinyu Li, Zhuo Chen, Yang Zhao, Vadim Mazalov, Yifan Gong, Biing-Hwang Juang:
Speaker-Invariant Training Via Adversarial Learning. ICASSP 2018: 5969-5973
[c76]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoLZCG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoLZCG18
Yong Zhao, Jinyu Li, Shi-Xiong Zhang, Liping Chen, Yifan Gong:
Domain and Speaker Adaptation for Cortana Speech Recognition. ICASSP 2018: 5984-5988
[c75]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengLGJ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengLGJ18
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Fred Juang:
Cycle-Consistent Speech Enhancement. INTERSPEECH 2018: 1165-1169
[c74]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiLG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiLG18
Jinyu Li, Changliang Liu, Yifan Gong:
Layer Trajectory LSTM. INTERSPEECH 2018: 1768-1772
[c73]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KimSLZ18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KimSLZ18
Suyoun Kim, Michael L. Seltzer, Jinyu Li, Rui Zhao:
Improved Training for Online End-to-end Speech Recognition Systems. INTERSPEECH 2018: 2913-2917
[c72]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MengLGJ18a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MengLGJ18a
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Fred Juang:
Adversarial Feature-Mapping for Speech Enhancement. INTERSPEECH 2018: 3259-3263
[c71]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LiLLG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiLLG18
Jinyu Li, Liang Lu, Changliang Liu, Yifan Gong:
Exploring Layer Trajectory LSTM with Depth Processing Units and Attention. SLT 2018: 456-462
[c70]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LiLZKG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiLZKG18
Ke Li, Jinyu Li, Yong Zhao, Kshitiz Kumar, Yifan Gong:
Speaker Adaptation for End-to-End CTC Models. SLT 2018: 542-549
[c69]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ChenXYELG18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ChenXYELG18
Zhuo Chen, Xiong Xiao, Takuya Yoshioka, Hakan Erdogan, Jinyu Li, Yifan Gong:
Multi-Channel Overlapped Speech Recognition with Location Guided Speech Extraction Network. SLT 2018: 558-565
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-05563
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-05563
Amit Das, Jinyu Li, Rui Zhao, Yifan Gong:
Advancing Connectionist Temporal Classification With Attention Modeling. CoRR abs/1803.05563 (2018)
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-05566
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-05566
Jinyu Li, Guoli Ye, Amit Das, Rui Zhao, Yifan Gong:
Advancing Acoustic-to-Word CTC Model. CoRR abs/1803.05566 (2018)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1803-10924
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1803-10924
Zhuo Chen, Jinyu Li, Xiong Xiao, Takuya Yoshioka, Huaming Wang, Zhenghao Wang, Yifan Gong:
Cracking the cocktail party problem by multi-beam deep attractor network. CoRR abs/1803.10924 (2018)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-00644
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00644
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation. CoRR abs/1804.00644 (2018)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-00732
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-00732
Zhong Meng, Jinyu Li, Zhuo Chen, Yong Zhao, Vadim Mazalov, Yifan Gong, Biing-Hwang Juang:
Speaker-Invariant Training via Adversarial Learning. CoRR abs/1804.00732 (2018)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-05166
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-05166
Jinyu Li, Rui Zhao, Zhuo Chen, Changliang Liu, Xiong Xiao, Guoli Ye, Yifan Gong:
Developing Far-Field Speaker System Via Teacher-Student Learning. CoRR abs/1804.05166 (2018)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1804-09298
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1804-09298
Dong Yu, Jinyu Li:
Recent Progresses in Deep Learning based Acoustic Models (Updated). CoRR abs/1804.09298 (2018)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1808-09522
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1808-09522
Jinyu Li, Changliang Liu, Yifan Gong:
Layer Trajectory LSTM. CoRR abs/1808.09522 (2018)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1809-02251
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-02251
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Adversarial Feature-Mapping for Speech Enhancement. CoRR abs/1809.02251 (2018)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1809-02253
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-02253
Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang Juang:
Cycle-Consistent Speech Enhancement. CoRR abs/1809.02253 (2018)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1812-11928
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1812-11928
Amit Das, Jinyu Li, Guoli Ye, Rui Zhao, Yifan Gong:
Advancing Acoustic-to-Word CTC Model with Attention and Mixed-Units. CoRR abs/1812.11928 (2018)
2017
[j9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/ieeejas/YuL17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ieeejas/YuL17
Dong Yu, Jinyu Li:
Recent progresses in deep learning based acoustic models. IEEE CAA J. Autom. Sinica 4(3): 396-409 (2017)
[c68]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiYZDG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiYZDG17
Jinyu Li, Guoli Ye, Rui Zhao, Jasha Droppo, Yifan Gong:
Acoustic-to-word model without OOV. ASRU 2017: 111-117
[c67]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/MengCMLG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/MengCMLG17
Zhong Meng, Zhuo Chen, Vadim Mazalov, Jinyu Li, Yifan Gong:
Unsupervised adaptation with domain separation networks for robust speech recognition. ASRU 2017: 214-221
[c66]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/ChenLXYWWG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/ChenLXYWWG17
Zhuo Chen, Jinyu Li, Xiong Xiao, Takuya Yoshioka, Huaming Wang, Zhenghao Wang, Yifan Gong:
Cracking the cocktail party problem by multi-beam deep attractor network. ASRU 2017: 437-444
[c65]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiHG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiHG17
Jinyu Li, Yan Huang, Yifan Gong:
Improved cepstra minimum-mean-square-error noise reduction algorithm for robust speech recognition. ICASSP 2017: 4865-4869
[c64]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoLKG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoLKG17
Yong Zhao, Jinyu Li, Kshitiz Kumar, Yifan Gong:
Extended low-rank plus diagonal adaptation for deep and recurrent neural networks. ICASSP 2017: 5040-5044
[c63]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiSWZG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiSWZG17
Jinyu Li, Michael L. Seltzer, Xi Wang, Rui Zhao, Yifan Gong:
Large-Scale Domain Adaptation via Teacher-Student Learning. INTERSPEECH 2017: 2386-2390
[c62]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenHLG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenHLG17
Zhuo Chen, Yan Huang, Jinyu Li, Yifan Gong:
Improving Mask Learning Based Speech Enhancement System with Restoration Layers and Residual Connection. INTERSPEECH 2017: 3632-3636
[p1]
- view
  authority control:
- export record
  dblp key:
  - books/sp/17/GongHKLLYZZZ17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/books/sp/17/GongHKLLYZZZ17
Yifan Gong, Yan Huang, Kshitiz Kumar, Jinyu Li, Chaojun Liu, Guoli Ye, Shi-Xiong Zhang, Yong Zhao, Rui Zhao:
Challenges in and Solutions to Deep Learning Network Acoustic Modeling in Speech Recognition Products at Microsoft. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 401-417
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/ZhangCZLG17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhangCZLG17
Shi-Xiong Zhang, Zhuo Chen, Yong Zhao, Jinyu Li, Yifan Gong:
End-to-End Attention based Text-Dependent Speaker Verification. CoRR abs/1701.00562 (2017)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/ChenDLX17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ChenDLX17
Zhehuai Chen, Jasha Droppo, Jinyu Li, Wayne Xiong:
Progressive Joint Modeling in Unsupervised Single-channel Overlapped Speech Recognition. CoRR abs/1707.07048 (2017)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1708-05466
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1708-05466
Jinyu Li, Michael L. Seltzer, Xi Wang, Rui Zhao, Yifan Gong:
Large-Scale Domain Adaptation via Teacher-Student Learning. CoRR abs/1708.05466 (2017)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-02212
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-02212
Suyoun Kim, Michael L. Seltzer, Jinyu Li, Rui Zhao:
Improved training for online end-to-end speech recognition systems. CoRR abs/1711.02212 (2017)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-08010
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-08010
Zhong Meng, Zhuo Chen, Vadim Mazalov, Jinyu Li, Yifan Gong:
Unsupervised Adaptation with Domain Separation Networks for Robust Speech Recognition. CoRR abs/1711.08010 (2017)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1711-10136
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-10136
Jinyu Li, Guoli Ye, Rui Zhao, Jasha Droppo, Yifan Gong:
Acoustic-To-Word Model Without OOV. CoRR abs/1711.10136 (2017)
2016
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SwietojanskiLR16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SwietojanskiLR16
Pawel Swietojanski, Jinyu Li, Steve Renals:
Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation. IEEE ACM Trans. Audio Speech Lang. Process. 24(8): 1450-1463 (2016)
[c61]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MiaoLWZG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MiaoLWZG16
Yajie Miao, Jinyu Li, Yongqiang Wang, Shi-Xiong Zhang, Yifan Gong:
Simplifying long short-term memory acoustic models for fast training and decoding. ICASSP 2016: 2284-2288
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiMZG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiMZG16
Jinyu Li, Abdelrahman Mohamed, Geoffrey Zweig, Yifan Gong:
Exploring multidimensional lstms for large vocabulary ASR. ICASSP 2016: 4940-4944
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoLG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoLG16
Yong Zhao, Jinyu Li, Yifan Gong:
Low-rank plus diagonal adaptation for deep neural networks. ICASSP 2016: 5005-5009
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhangZLLG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhangZLLG16
Shi-Xiong Zhang, Rui Zhao, Chaojun Liu, Jinyu Li, Yifan Gong:
Recurrent support vector machines for speech recognition. ICASSP 2016: 5885-5889
[c57]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuXDSYLZ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuXDSYLZ16
Dong Yu, Wayne Xiong, Jasha Droppo, Andreas Stolcke, Guoli Ye, Jinyu Li, Geoffrey Zweig:
Deep Convolutional Neural Networks with Layer-Wise Context Expansion and Attention. INTERSPEECH 2016: 17-21
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ZhangCZLG16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ZhangCZLG16
Shi-Xiong Zhang, Zhuo Chen, Yong Zhao, Jinyu Li, Yifan Gong:
End-to-End attention based text-dependent speaker verification. SLT 2016: 171-178
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/SwietojanskiLR16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/SwietojanskiLR16
Pawel Swietojanski, Jinyu Li, Steve Renals:
Learning Hidden Unit Contributions for Unsupervised Acoustic Model Adaptation. CoRR abs/1601.02828 (2016)
2015
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiMZG15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiMZG15
Jinyu Li, Abdelrahman Mohamed, Geoffrey Zweig, Yifan Gong:
LSTM time and frequency recurrence for automatic speech recognition. ASRU 2015: 187-191
[c54]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZhaoLXG15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZhaoLXG15
Yong Zhao, Jinyu Li, Jian Xue, Yifan Gong:
Investigating online low-footprint speaker adaptation using generalized linear regression and click-through data. ICASSP 2015: 4310-4314
[c53]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WangLG15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangLG15
Yongqiang Wang, Jinyu Li, Yifan Gong:
Small-footprint high-performance deep neural network-based speech recognition using split-VQ. ICASSP 2015: 4984-4988
[c52]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangLG15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangLG15
Jui-Ting Huang, Jinyu Li, Yifan Gong:
An analysis of convolutional neural networks for speech recognition. ICASSP 2015: 4989-4993
[c51]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangSCLWL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangSCLWL15
Zhen Huang, Sabato Marco Siniscalchi, I-Fan Chen, Jinyu Li, Jiadong Wu, Chin-Hui Lee:
Maximum a posteriori adaptation of network parameters in deep models. INTERSPEECH 2015: 1076-1080
[c50]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiuLG15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiuLG15
Changliang Liu, Jinyu Li, Yifan Gong:
SVD-based universal DNN modeling for multiple scenarios. INTERSPEECH 2015: 3269-3273
[c49]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangLSCWL15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangLSCWL15
Zhen Huang, Jinyu Li, Sabato Marco Siniscalchi, I-Fan Chen, Ji Wu, Chin-Hui Lee:
Rapid adaptation for deep neural networks through multi-task learning. INTERSPEECH 2015: 3625-3629
2014
[j7]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiDGH14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiDGH14
Jinyu Li, Li Deng, Yifan Gong, Reinhold Haeb-Umbach:
An Overview of Noise-Robust Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(4): 745-777 (2014)
[c48]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoLCL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoLCL14
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li:
Feature compensation using linear combination of speaker and environment dependent correction vectors. ICASSP 2014: 1720-1724
[c47]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiHG14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiHG14
Jinyu Li, Jui-Ting Huang, Yifan Gong:
Factorized adaptation for deep neural network. ICASSP 2014: 5537-5541
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XueLYSG14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XueLYSG14
Jian Xue, Jinyu Li, Dong Yu, Mike Seltzer, Yifan Gong:
Singular value decomposition based low-footprint speaker adaptation and personalization for deep neural network. ICASSP 2014: 6359-6363
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SwietojanskiLH14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SwietojanskiLH14
Pawel Swietojanski, Jinyu Li, Jui-Ting Huang:
Investigation of maxout networks for speech recognition. ICASSP 2014: 7649-7653
[c44]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangLWL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangLWL14
Zhen Huang, Jinyu Li, Chao Weng, Chin-Hui Lee:
Beyond cross-entropy: towards better frame-level objective functions for deep neural network training in automatic speech recognition. INTERSPEECH 2014: 1214-1218
[c43]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiZHG14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiZHG14
Jinyu Li, Rui Zhao, Jui-Ting Huang, Yifan Gong:
Learning small-size DNN with output-distribution-based criteria. INTERSPEECH 2014: 1910-1914
[c42]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ZhaoLG14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ZhaoLG14
Rui Zhao, Jinyu Li, Yifan Gong:
Variable-component deep neural network for robust speech recognition. INTERSPEECH 2014: 2719-2723
[c41]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HuangLSCWL14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HuangLSCWL14
Zhen Huang, Jinyu Li, Sabato Marco Siniscalchi, I-Fan Chen, Chao Weng, Chin-Hui Lee:
Feature space maximum a posteriori linear regression for adaptation of deep neural networks. INTERSPEECH 2014: 2992-2996
[c40]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/ZhaoLG14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/ZhaoLG14
Rui Zhao, Jinyu Li, Yifan Gong:
Variable-activation and variable-input deep neural network for robust speech recognition. SLT 2014: 542-547
2013
[j6]
- view
  authority control:
- export record
  dblp key:
  - journals/iet-spr/SiniscalchiLL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/iet-spr/SiniscalchiLL13
Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
Model-based margin estimation for hidden Markov model learning and generalisation. IET Signal Process. 7(8): 704-709 (2013)
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/SiniscalchiLL13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/SiniscalchiLL13
Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
Hermitian Polynomial for Speaker Adaptation of Connectionist Speech Recognition Systems. IEEE Trans. Speech Audio Process. 21(10): 2152-2161 (2013)
[c39]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HuangLYDG13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HuangLYDG13
Jui-Ting Huang, Jinyu Li, Dong Yu, Li Deng, Yifan Gong:
Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers. ICASSP 2013: 7304-7308
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/DengLHYYSSZHWGA13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/DengLHYYSSZHWGA13
Li Deng, Jinyu Li, Jui-Ting Huang, Kaisheng Yao, Dong Yu, Frank Seide, Michael L. Seltzer, Geoffrey Zweig, Xiaodong He, Jason D. Williams, Yifan Gong, Alex Acero:
Recent advances in deep learning for speech research at Microsoft. ICASSP 2013: 8604-8608
[c37]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XueLG13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XueLG13
Jian Xue, Jinyu Li, Yifan Gong:
Restructuring of deep neural network acoustic models with singular value decomposition. INTERSPEECH 2013: 2365-2369
[c36]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WieslerLX13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WieslerLX13
Simon Wiesler, Jinyu Li, Jian Xue:
Investigations on hessian-free optimization for cross-entropy training of deep neural networks. INTERSPEECH 2013: 3317-3321
[c35]
- view
  - electronic edition @ arxiv.org (open access)
  - details & citations
- export record
  dblp key:
  - journals/corr/abs-1301-3605
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1301-3605
Dong Yu, Michael L. Seltzer, Jinyu Li, Jui-Ting Huang, Frank Seide:
Feature Learning in Deep Neural Networks - A Study on Speech Recognition Tasks. ICLR 2013
2012
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoLCL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoLCL12
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li:
Lasso environment model combination for robust speech recognition. ICASSP 2012: 4305-4308
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiSG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiSG12
Jinyu Li, Michael L. Seltzer, Yifan Gong:
Improvements to VTS feature enhancement. ICASSP 2012: 4677-4680
[c32]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiSG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiSG12
Jinyu Li, Michael L. Seltzer, Yifan Gong:
Efficient VTS Adaptation Using Jacobian Approximation. INTERSPEECH 2012: 1906-1909
[c31]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SiniscalchiLL12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SiniscalchiLL12
Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
Hermitian based Hidden Activation Functions for Adaptation of Hybrid HMM/ANN Models. INTERSPEECH 2012: 2590-2593
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LiYHG12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiYHG12
Jinyu Li, Dong Yu, Jui-Ting Huang, Yifan Gong:
Improving wideband speech recognition using mixed-bandwidth training data in CD-DNN-HMM. SLT 2012: 131-136
2011
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/YuLD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/YuLD11
Dong Yu, Jinyu Li, Li Deng:
Calibration of Confidence Measures in Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2461-2473 (2011)
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/XiaoLCL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoLCL11
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li:
Maximum likelihood adaptation of histogram equalization with constraint for robust speech recognition. ICASSP 2011: 5480-5483
[c28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/XiaoLSL11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/XiaoLSL11
Xiong Xiao, Jinyu Li, Chng Eng Siong, Haizhou Li:
Feature Normalization Using Structured Full Transforms for Robust Speech Recognition. INTERSPEECH 2011: 693-696
2010
[j3]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/XiaoLCLL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/XiaoLCLL10
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition. IEEE Trans. Speech Audio Process. 18(6): 1158-1169 (2010)
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuWLD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuWLD10
Dong Yu, Shizhen Wang, Jinyu Li, Li Deng:
Word confidence calibration using a maximum entropy model with constraints on confidence and word distributions. ICASSP 2010: 4446-4449
[c26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiTL10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiTL10
Jinyu Li, Yu Tsao, Chin-Hui Lee:
Shrinkage model adaptation in automatic speech recognition. INTERSPEECH 2010: 1656-1659
[c25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiYGD10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiYGD10
Jinyu Li, Dong Yu, Yifan Gong, Li Deng:
Unscented transform with online distortion estimation for HMM adaptation. INTERSPEECH 2010: 1660-1663

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[j2]
- view
  authority control:
- export record
  dblp key:
  - journals/csl/LiDYGA09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/csl/LiDYGA09
Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero:
A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions. Comput. Speech Lang. 23(3): 389-405 (2009)
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/XiaoLCLL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/XiaoLCLL09
Xiong Xiao, Jinyu Li, Engsiong Chng, Haizhou Li, Chin-Hui Lee:
A study on hidden Markov model's generalization capability for speech recognition. ASRU 2009: 255-260
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TsaoLL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TsaoLL09
Yu Tsao, Jinyu Li, Chin-Hui Lee:
Ensemble speaker and speaking environment modeling approach with advanced online estimation process. ICASSP 2009: 3833-3836
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MatsudaTLNL09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MatsudaTLNL09
Shigeki Matsuda, Yu Tsao, Jinyu Li, Satoshi Nakamura, Chin-Hui Lee:
A study on soft margin estimation of linear regression parameters for speaker adaptation. INTERSPEECH 2009: 1603-1606
[c21]
- view
  authority control:
- export record
  dblp key:
  - conf/iucs/TsaoLLN09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iucs/TsaoLLN09
Yu Tsao, Jinyu Li, Chin-Hui Lee, Satoshi Nakamura:
Soft margin estimation on improving environment structures for ensemble speaker and speaking environment modeling. IUCS 2009: 404-408
2008
[b1]
- view
  - electronic edition via handle.net
  - details & citations
  authority control:
- export record
  dblp key:
  - phd/basesearch/Li08a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/basesearch/Li08a
Jinyu Li:
Soft margin estimation for automatic speech recognition. Georgia Institute of Technology, Atlanta, GA, USA, 2008
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiDYGA08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiDYGA08
Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero:
HMM adaptation using a phase-sensitive acoustic distortion model for environment-robust speech recognition. ICASSP 2008: 4069-4072
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiDYWGA08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiDYWGA08
Jinyu Li, Li Deng, Dong Yu, Jian Wu, Yifan Gong, Alex Acero:
Adaptation of compressed HMM parameters for resource-constrained speech recognition. ICASSP 2008: 4333-4336
[c18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiYLW08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiYLW08
Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang:
Soft margin estimation with various separation levels for LVCSR. INTERSPEECH 2008: 269-272
[c17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiL08
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiL08
Jinyu Li, Chin-Hui Lee:
On a generalization of margin-based discriminative training to robust speech recognition. INTERSPEECH 2008: 1992-1995
2007
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LiYL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LiYL07
Jinyu Li, Ming Yuan, Chin-Hui Lee:
Approximate Test Risk Bound Minimization Through Soft Margin Estimation. IEEE Trans. Speech Audio Process. 15(8): 2393-2404 (2007)
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiDYGA07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiDYGA07
Jinyu Li, Li Deng, Dong Yu, Yifan Gong, Alex Acero:
High-performance hmm adaptation with joint compensation of additive and convolutive distortions via Vector Taylor Series. ASRU 2007: 65-70
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LiYLW07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiYLW07
Jinyu Li, Zhi-Jie Yan, Chin-Hui Lee, Ren-Hua Wang:
A study on soft margin estimation for LVCSR. ASRU 2007: 268-271
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiSL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiSL07
Jinyu Li, Sabato Marco Siniscalchi, Chin-Hui Lee:
Approximate Test Risk Minimization Through Soft Margin Estimation. ICASSP (4) 2007: 653-656
[c13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiL07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiL07
Jinyu Li, Chin-Hui Lee:
Soft margin feature extraction for automatic speech recognition. INTERSPEECH 2007: 30-33
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/BrombergQHLMMMMSTW07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/BrombergQHLMMMMSTW07
Ilana Bromberg, Qian Qian, Jun Hou, Jinyu Li, Chengyuan Ma, Brett Matthews, Antonio Moreno-Daniel, Jeremy Morris, Sabato Marco Siniscalchi, Yu Tsao, Yu Wang:
Detection-based ASR in the automatic speech attribute transcription project. INTERSPEECH 2007: 1829-1832
2006
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiYL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiYL06
Jinyu Li, Ming Yuan, Chin-Hui Lee:
Soft margin estimation of hidden Markov model parameters. INTERSPEECH 2006
[c10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SiniscalchiLL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SiniscalchiLL06
Sabato Marco Siniscalchi, Jinyu Li, Chin-Hui Lee:
A study on lattice rescoring with knowledge scores for automatic speech recognition. INTERSPEECH 2006
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/odyssey/LiYLMTZL06
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/odyssey/LiYLMTZL06
Jinyu Li, Sibel Yaman, Chin-Hui Lee, Bin Ma, Rong Tong, Donglai Zhu, Haizhou Li:
Language Recognition Based on Score Distribution Feature Vectors and Discriminative Classifier Fusion. Odyssey 2006: 1-5
2005
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiTL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiTL05
Jinyu Li, Yu Tsao, Chin-Hui Lee:
A Study on Knowledge Source Integration for Candidate Rescoring in Automatic Speech Recognition. ICASSP (1) 2005: 837-840
[c7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TsaoLL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TsaoLL05
Yu Tsao, Jinyu Li, Chin-Hui Lee:
A study on separation between acoustic models and its applications. INTERSPEECH 2005: 1109-1112
[c6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiL05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiL05
Jinyu Li, Chin-Hui Lee:
On designing and evaluating speech event detectors. INTERSPEECH 2005: 3365-3368
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/wirn/SiniscalchiLPVCGS05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wirn/SiniscalchiLPVCGS05
Sabato Marco Siniscalchi, Jinyu Li, Giovanni Pilato, Giorgio Vassallo, Mark A. Clements, Antonio Gentile, Filippo Sorbello:
Application of EalphaNets to Feature Recognition of Articulation Manner in Knowledge-Based Automatic Speech Recognition. WIRN/NAIS 2005: 140-146
2004
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiLWD04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiLWD04
Jin-Yu Li, Bo Liu, Ren-Hua Wang, Li-Rong Dai:
A complexity reduction of ETSI advanced front-end for DSR. ICASSP (1) 2004: 61-64
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiLW04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiLW04
Xiao-Bing Li, Jin-Yu Li, Ren-Hua Wang:
Dimensionality reduction using MCE-optimized LDA transformation. ICASSP (1) 2004: 137-140
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/iscslp/LiuDLW04
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iscslp/LiuDLW04
Bo Liu, Li-Rong Dai, Jin-Yu Li, Ren-Hua Wang:
Double Gaussian based feature normalization for robust speech recognition. ISCSLP 2004: 253-256
2000
[c1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LiLW00
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LiLW00
Jinyu Li, Xin Luo, Ren-Hua Wang:
A novel search algorithm for LSF VQ. INTERSPEECH 2000: 194-197

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.