default search action

combined dblp search
author search
venue search
publication search

ask others

Yi Luo 0004

> Home > Persons

Person information

affiliation: Tencent AI Lab, Shenzhen, China
affiliation (PhD 2021): Columbia University, Department of Electrical Engineering, New York, NY, USA

Other persons with the same name

see FAQ

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[j8]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/GuL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/GuL24
Rongzhi Gu, Yi Luo:
ReZero: Region-Customizable Sound Extraction. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2576-2589 (2024)
[j7]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/tismir/UhlichFHTWRCMLLYGSSHSM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tismir/UhlichFHTWRCMLLYGSSHSM24
Stefan Uhlich, Giorgio Fabbro, Masato Hirano, Shusuke Takahashi, Gordon Wichern, Jonathan Le Roux, Dipam Chakraborty, Sharada Mohanty, Kai Li, Yi Luo, Jianwei Yu, Rongzhi Gu, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Mikhail Sukhovei, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Cinematic Demixing Track. Trans. Int. Soc. Music. Inf. Retr. 7(1): 44-62 (2024)
[j6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/tismir/FabbroULCRLGRHRSDLYCMSSHGHKLDZLM24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/tismir/FabbroULCRLGRHRSDLYCMSSHGHKLDZLM24
Giorgio Fabbro, Stefan Uhlich, Chieh-Hsin Lai, Woosung Choi, Marco A. Martínez Ramírez, Wei-Hsiang Liao, Igor Gadelha, Geraldo Ramos, Eddie Hsu, Hugo Rodrigues, Fabian-Robert Stöter, Alexandre Défossez, Yi Luo, Jianwei Yu, Dipam Chakraborty, Sharada P. Mohanty, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Nabarun Goswami, Tatsuya Harada, Minseok Kim, Jun Hyung Lee, Yuanliang Dong, Xinran Zhang, Jiafeng Liu, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Music Demixing Track. Trans. Int. Soc. Music. Inf. Retr. 7(1): 63-84 (2024)
[c29]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/XuCYHWZLLG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/XuCYHWZLLG24
Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shi-Xiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu:
SECap: Speech Emotion Captioning with Large Language Model. AAAI 2024: 19323-19331
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0004G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0004G24
Yi Luo, Rongzhi Gu:
Improving Music Source Separation with Simo Stereo Band-Split Rnn. ICASSP 2024: 426-430
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuoG24a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuoG24a
Yi Luo, Rongzhi Gu:
Fast Random Approximation of Multi-Channel Room Impulse Response. ICASSP Workshops 2024: 449-454
[c26]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/FanGLP24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/FanGLP24
Jingjie Fan, Rongzhi Gu, Yi Luo, Cong Pang:
A Unified Geometry-Aware Source Localization and Separation Framework for AD-HOC Microphone Array. ICASSP Workshops 2024: 725-729
[i28]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2404-04947
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-04947
Yi Luo, Jianwei Yu, Hangting Chen, Rongzhi Gu, Chao Weng:
Gull: A Generative Multifunctional Audio Codec. CoRR abs/2404.04947 (2024)
2023
[j5]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LuoY23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LuoY23
Yi Luo, Jianwei Yu:
Music Source Separation With Band-Split RNN. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1893-1901 (2023)
[c25]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuCLGLW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuCLGLW23
Jianwei Yu, Hangting Chen, Yi Luo, Rongzhi Gu, Weihua Li, Chao Weng:
TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge. ICASSP 2023: 1-2
[c24]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/YuL23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/YuL23
Jianwei Yu, Yi Luo:
Efficient Monaural Speech Enhancement with Universal Sample Rate Band-Split RNN. ICASSP 2023: 1-5
[c23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YuC0GW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YuC0GW23
Jianwei Yu, Hangting Chen, Yi Luo, Rongzhi Gu, Chao Weng:
High Fidelity Speech Enhancement with Band-split RNN. INTERSPEECH 2023: 2483-2487
[c22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ChenY0GLLW23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ChenY0GLLW23
Hangting Chen, Jianwei Yu, Yi Luo, Rongzhi Gu, Weihua Li, Zhuocheng Lu, Chao Weng:
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression. INTERSPEECH 2023: 2523-2527
[c21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0004Y23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0004Y23
Yi Luo, Jianwei Yu:
FRA-RIR: Fast Random Approximation of the Image-source Method. INTERSPEECH 2023: 3884-3888
[i27]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2304-08052
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-08052
Yi Luo, Rongzhi Gu:
Fast Random Approximation of Multi-channel Room Impulse Response. CoRR abs/2304.08052 (2023)
[i26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-06979
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06979
Giorgio Fabbro, Stefan Uhlich, Chieh-Hsin Lai, Woosung Choi, Marco A. Martínez Ramírez, Wei-Hsiang Liao, Igor Gadelha, Geraldo Ramos, Eddie Hsu, Hugo Rodrigues, Fabian-Robert Stöter, Alexandre Défossez, Yi Luo, Jianwei Yu, Dipam Chakraborty, Sharada P. Mohanty, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Nabarun Goswami, Tatsuya Harada, Minseok Kim, Jun Hyung Lee, Yuanliang Dong, Xinran Zhang, Jiafeng Liu, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Music Demixing Track. CoRR abs/2308.06979 (2023)
[i25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-06981
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-06981
Stefan Uhlich, Giorgio Fabbro, Masato Hirano, Shusuke Takahashi, Gordon Wichern, Jonathan Le Roux, Dipam Chakraborty, Sharada P. Mohanty, Kai Li, Yi Luo, Jianwei Yu, Rongzhi Gu, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Mikhail Sukhovei, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Cinematic Demixing Track. CoRR abs/2308.06981 (2023)
[i24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-11053
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-11053
Hangting Chen, Jianwei Yu, Yi Luo, Rongzhi Gu, Weihua Li, Zhuocheng Lu, Chao Weng:
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression. CoRR abs/2308.11053 (2023)
[i23]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-16892
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-16892
Rongzhi Gu, Yi Luo:
ReZero: Region-customizable Sound Extraction. CoRR abs/2308.16892 (2023)
[i22]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2309-13905
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13905
Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data. CoRR abs/2309.13905 (2023)
[i21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-10381
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-10381
Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shi-Xiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu:
SECap: Speech Emotion Captioning with Large Language Model. CoRR abs/2312.10381 (2023)
2022
[j4]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/Luo22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Luo22
Yi Luo:
A Time-Domain Real-Valued Generalized Wiener Filter for Multi-Channel Neural Separation Systems. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3008-3019 (2022)
[i20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2208-04101
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-04101
Yi Luo, Jianwei Yu:
FRA-RIR: Fast Random Approximation of the Image-source Method. CoRR abs/2208.04101 (2022)
[i19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2209-15174
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-15174
Yi Luo, Jianwei Yu:
Music Source Separation with Band-split RNN. CoRR abs/2209.15174 (2022)
2021
[b1]
- view
  authority control:
- export record
  dblp key:
  - phd/us/Luo21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/us/Luo21
Yi Luo:
End-to-end Speech Separation with Neural Networks. Columbia University, USA, 2021
[j3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/LuoHM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LuoHM21
Yi Luo, Cong Han, Nima Mesgarani:
Group Communication With Context Codec for Lightweight Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1752-1761 (2021)
[c20]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuoCHLZM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuoCHLZM21
Yi Luo, Zhuo Chen, Cong Han, Chenda Li, Tianyan Zhou, Nima Mesgarani:
Rethinking The Separation Layers In Speech Separation Networks. ICASSP 2021: 1-5
[c19]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LiCLHZKD0Q21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiCLHZKD0Q21
Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. ICASSP 2021: 5739-5743
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/HanLLZK0DEHMC21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/HanLLZK0DEHMC21
Cong Han, Yi Luo, Chenda Li, Tianyan Zhou, Keisuke Kinoshita, Shinji Watanabe, Marc Delcroix, Hakan Erdogan, John R. Hershey, Nima Mesgarani, Zhuo Chen:
Continuous Speech Separation Using Speaker Inventory for Long Recording. Interspeech 2021: 3036-3040
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/0004HM21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/0004HM21
Yi Luo, Cong Han, Nima Mesgarani:
Distortion-Controlled Training for end-to-end Reverberant Speech Separation with Auxiliary Autoencoding Loss. SLT 2021: 825-832
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/LiLHLYZDKBQ0C21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/LiLHLYZDKBQ0C21
Chenda Li, Yi Luo, Cong Han, Jinyu Li, Takuya Yoshioka, Tianyan Zhou, Marc Delcroix, Keisuke Kinoshita, Christoph Böddeker, Yanmin Qian, Shinji Watanabe, Zhuo Chen:
Dual-Path RNN for Long Recording Speech Separation. SLT 2021: 865-872
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/slt/RajDCEHH0DYLKLW21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/slt/RajDCEHH0DYLKLW21
Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Maokui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis. SLT 2021: 897-904
[i18]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2102-11634
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2102-11634
Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. CoRR abs/2102.11634 (2021)
2020
[c14]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuoCY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuoCY20
Yi Luo, Zhuo Chen, Takuya Yoshioka:
Dual-Path RNN: Efficient Long Sequence Modeling for Time-Domain Single-Channel Speech Separation. ICASSP 2020: 46-50
[c13]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuoCMY20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuoCMY20
Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka:
End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation. ICASSP 2020: 6394-6398
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/HanLM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HanLM20
Cong Han, Yi Luo, Nima Mesgarani:
Real-Time Binaural Speech Separation with Preserved Spatial Cues. ICASSP 2020: 6404-6408
[c11]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenYLZMLWXL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenYLZMLWXL20
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Xiong Xiao, Jinyu Li:
Continuous Speech Separation: Dataset and Analysis. ICASSP 2020: 7284-7288
[c10]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/WuCLYTLLX20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/WuCLYTLLX20
Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie:
An End-to-End Architecture of Online Multi-Channel Speech Separation. INTERSPEECH 2020: 81-85
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuoM20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuoM20
Yi Luo, Nima Mesgarani:
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss. INTERSPEECH 2020: 2622-2626
[i17]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2001-11482
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-11482
Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li:
Continuous speech separation: dataset and analysis. CoRR abs/2001.11482 (2020)
[i16]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2002-06637
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2002-06637
Cong Han, Yi Luo, Nima Mesgarani:
Real-time binaural speech separation with preserved spatial cues. CoRR abs/2002.06637 (2020)
[i15]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2003-12326
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2003-12326
Yi Luo, Nima Mesgarani:
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss. CoRR abs/2003.12326 (2020)
[i14]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-03141
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-03141
Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie:
An End-to-end Architecture of Online Multi-channel Speech Separation. CoRR abs/2009.03141 (2020)
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-02014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-02014
Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Mao-Kui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis. CoRR abs/2011.02014 (2020)
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-08397
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-08397
Yi Luo, Cong Han, Nima Mesgarani:
Ultra-Lightweight Speech Separation via Group Communication. CoRR abs/2011.08397 (2020)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-08400
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-08400
Yi Luo, Zhuo Chen, Cong Han, Chenda Li, Tianyan Zhou, Nima Mesgarani:
Rethinking the Separation Layers in Speech Separation Networks. CoRR abs/2011.08400 (2020)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-07291
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-07291
Yi Luo, Cong Han, Nima Mesgarani:
Group Communication with Context Codec for Ultra-Lightweight Source Separation. CoRR abs/2012.07291 (2020)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2012-09727
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-09727
Cong Han, Yi Luo, Chenda Li, Tianyan Zhou, Keisuke Kinoshita, Shinji Watanabe, Marc Delcroix, Hakan Erdogan, John R. Hershey, Nima Mesgarani, Zhuo Chen:
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording. CoRR abs/2012.09727 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[j2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/taslp/LuoM19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LuoM19
Yi Luo, Nima Mesgarani:
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 27(8): 1256-1266 (2019)
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/LuoHMCL19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/LuoHMCL19
Yi Luo, Cong Han, Nima Mesgarani, Enea Ceolini, Shih-Chii Liu:
FaSNet: Low-Latency Adaptive Beamforming for Multi-Microphone Audio Processing. ASRU 2019: 260-267
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/Han0M19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/Han0M19
Cong Han, Yi Luo, Nima Mesgarani:
Online Deep Attractor Network for Real-time Single-channel Speech Separation. ICASSP 2019: 361-365
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0004M19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0004M19
Yi Luo, Nima Mesgarani:
Augmented Time-frequency Mask Estimation in Cluster-based Source Separation Algorithms. ICASSP 2019: 710-714
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1909-13387
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-13387
Yi Luo, Enea Ceolini, Cong Han, Shih-Chii Liu, Nima Mesgarani:
FaSNet: Low-latency Adaptive Beamforming for Multi-microphone Audio Processing. CoRR abs/1909.13387 (2019)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-06379
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-06379
Yi Luo, Zhuo Chen, Takuya Yoshioka:
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation. CoRR abs/1910.06379 (2019)
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1910-14104
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-14104
Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka:
End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation. CoRR abs/1910.14104 (2019)
2018
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/LuoCM18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/LuoCM18
Yi Luo, Zhuo Chen, Nima Mesgarani:
Speaker-Independent Speech Separation With Deep Attractor Network. IEEE ACM Trans. Audio Speech Lang. Process. 26(4): 787-796 (2018)
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0004M18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0004M18
Yi Luo, Nima Mesgarani:
TaSNet: Time-Domain Audio Separation Network for Real-Time, Single-Channel Speech Separation. ICASSP 2018: 696-700
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0004M18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0004M18
Yi Luo, Nima Mesgarani:
Real-time Single-channel Dereverberation and Separation with Time-domain Audio Separation Network. INTERSPEECH 2018: 342-346
[c3]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Kumar0M18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Kumar0M18
Rajath Kumar, Yi Luo, Nima Mesgarani:
Music Source Activity Detection and Separation Using Deep Attractor Network. INTERSPEECH 2018: 347-351
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1809-07454
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1809-07454
Yi Luo, Nima Mesgarani:
TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation. CoRR abs/1809.07454 (2018)
2017
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/LuoCHRM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LuoCHRM17
Yi Luo, Zhuo Chen, John R. Hershey, Jonathan Le Roux, Nima Mesgarani:
Deep clustering and conventional networks for music separation: Stronger together. ICASSP 2017: 61-65
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ChenLM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ChenLM17
Zhuo Chen, Yi Luo, Nima Mesgarani:
Deep attractor network for single-microphone speaker separation. ICASSP 2017: 246-250
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/ChenLM17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ChenLM17
Zhuo Chen, Yi Luo, Nima Mesgarani:
Speaker-independent Speech Separation with Deep Attractor Network. CoRR abs/1707.03634 (2017)
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1711-00541
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1711-00541
Yi Luo, Nima Mesgarani:
TasNet: time-domain audio separation network for real-time, single-channel speech separation. CoRR abs/1711.00541 (2017)
2016
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/LuoCHRM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/LuoCHRM16
Yi Luo, Zhuo Chen, John R. Hershey, Jonathan Le Roux, Nima Mesgarani:
Deep Clustering and Conventional Networks for Music Separation: Stronger Together. CoRR abs/1611.06265 (2016)
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/ChenLM16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/ChenLM16
Zhuo Chen, Yi Luo, Nima Mesgarani:
Deep attractor network for single-microphone speaker separation. CoRR abs/1611.08930 (2016)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.