default search action
Yi Luo 0004
Person information
- affiliation: Tencent AI Lab, Shenzhen, China
- affiliation (PhD 2021): Columbia University, Department of Electrical Engineering, New York, NY, USA
Other persons with the same name
- Yi Luo — disambiguation page
- Yi Luo 0001 — CNRS, Laboratory Le2i, Dijon, France (and 1 more)
- Yi Luo 0002 — University of Kentucky, Lexington, KY, USA
- Yi Luo 0003 — KTH Royal Institute of Technology, School of Biotechnology, Stockholm, Sweden (and 2 more)
- Yi Luo 0005 — University of British Columbia, Department of Electrical and Computer Engineering, Vancouver, BC, Canada
- Yi Luo 0006 — Chongqing University, Chongqing University, China
- Yi Luo 0007 — China Southern Power Grid Co Ltd, Guangzhou, China (and 1 more)
- Yi Luo 0008 — University of Arizona, Tucson, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j8]Rongzhi Gu, Yi Luo:
ReZero: Region-Customizable Sound Extraction. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2576-2589 (2024) - [j7]Stefan Uhlich, Giorgio Fabbro, Masato Hirano, Shusuke Takahashi, Gordon Wichern, Jonathan Le Roux, Dipam Chakraborty, Sharada Mohanty, Kai Li, Yi Luo, Jianwei Yu, Rongzhi Gu, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Mikhail Sukhovei, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Cinematic Demixing Track. Trans. Int. Soc. Music. Inf. Retr. 7(1): 44-62 (2024) - [j6]Giorgio Fabbro, Stefan Uhlich, Chieh-Hsin Lai, Woosung Choi, Marco A. Martínez Ramírez, Wei-Hsiang Liao, Igor Gadelha, Geraldo Ramos, Eddie Hsu, Hugo Rodrigues, Fabian-Robert Stöter, Alexandre Défossez, Yi Luo, Jianwei Yu, Dipam Chakraborty, Sharada P. Mohanty, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Nabarun Goswami, Tatsuya Harada, Minseok Kim, Jun Hyung Lee, Yuanliang Dong, Xinran Zhang, Jiafeng Liu, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Music Demixing Track. Trans. Int. Soc. Music. Inf. Retr. 7(1): 63-84 (2024) - [c29]Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shi-Xiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu:
SECap: Speech Emotion Captioning with Large Language Model. AAAI 2024: 19323-19331 - [c28]Yi Luo, Rongzhi Gu:
Improving Music Source Separation with Simo Stereo Band-Split Rnn. ICASSP 2024: 426-430 - [c27]Yi Luo, Rongzhi Gu:
Fast Random Approximation of Multi-Channel Room Impulse Response. ICASSP Workshops 2024: 449-454 - [c26]Jingjie Fan, Rongzhi Gu, Yi Luo, Cong Pang:
A Unified Geometry-Aware Source Localization and Separation Framework for AD-HOC Microphone Array. ICASSP Workshops 2024: 725-729 - [i28]Yi Luo, Jianwei Yu, Hangting Chen, Rongzhi Gu, Chao Weng:
Gull: A Generative Multifunctional Audio Codec. CoRR abs/2404.04947 (2024) - 2023
- [j5]Yi Luo, Jianwei Yu:
Music Source Separation With Band-Split RNN. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1893-1901 (2023) - [c25]Jianwei Yu, Hangting Chen, Yi Luo, Rongzhi Gu, Weihua Li, Chao Weng:
TSpeech-AI System Description to the 5th Deep Noise Suppression (DNS) Challenge. ICASSP 2023: 1-2 - [c24]Jianwei Yu, Yi Luo:
Efficient Monaural Speech Enhancement with Universal Sample Rate Band-Split RNN. ICASSP 2023: 1-5 - [c23]Jianwei Yu, Hangting Chen, Yi Luo, Rongzhi Gu, Chao Weng:
High Fidelity Speech Enhancement with Band-split RNN. INTERSPEECH 2023: 2483-2487 - [c22]Hangting Chen, Jianwei Yu, Yi Luo, Rongzhi Gu, Weihua Li, Zhuocheng Lu, Chao Weng:
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression. INTERSPEECH 2023: 2523-2527 - [c21]Yi Luo, Jianwei Yu:
FRA-RIR: Fast Random Approximation of the Image-source Method. INTERSPEECH 2023: 3884-3888 - [i27]Yi Luo, Rongzhi Gu:
Fast Random Approximation of Multi-channel Room Impulse Response. CoRR abs/2304.08052 (2023) - [i26]Giorgio Fabbro, Stefan Uhlich, Chieh-Hsin Lai, Woosung Choi, Marco A. Martínez Ramírez, Wei-Hsiang Liao, Igor Gadelha, Geraldo Ramos, Eddie Hsu, Hugo Rodrigues, Fabian-Robert Stöter, Alexandre Défossez, Yi Luo, Jianwei Yu, Dipam Chakraborty, Sharada P. Mohanty, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Nabarun Goswami, Tatsuya Harada, Minseok Kim, Jun Hyung Lee, Yuanliang Dong, Xinran Zhang, Jiafeng Liu, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Music Demixing Track. CoRR abs/2308.06979 (2023) - [i25]Stefan Uhlich, Giorgio Fabbro, Masato Hirano, Shusuke Takahashi, Gordon Wichern, Jonathan Le Roux, Dipam Chakraborty, Sharada P. Mohanty, Kai Li, Yi Luo, Jianwei Yu, Rongzhi Gu, Roman A. Solovyev, Alexander L. Stempkovskiy, Tatiana Habruseva, Mikhail Sukhovei, Yuki Mitsufuji:
The Sound Demixing Challenge 2023 - Cinematic Demixing Track. CoRR abs/2308.06981 (2023) - [i24]Hangting Chen, Jianwei Yu, Yi Luo, Rongzhi Gu, Weihua Li, Zhuocheng Lu, Chao Weng:
Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression. CoRR abs/2308.11053 (2023) - [i23]Rongzhi Gu, Yi Luo:
ReZero: Region-customizable Sound Extraction. CoRR abs/2308.16892 (2023) - [i22]Jianwei Yu, Hangting Chen, Yanyao Bian, Xiang Li, Yi Luo, Jinchuan Tian, Mengyang Liu, Jiayi Jiang, Shuai Wang:
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data. CoRR abs/2309.13905 (2023) - [i21]Yaoxun Xu, Hangting Chen, Jianwei Yu, Qiaochu Huang, Zhiyong Wu, Shi-Xiong Zhang, Guangzhi Li, Yi Luo, Rongzhi Gu:
SECap: Speech Emotion Captioning with Large Language Model. CoRR abs/2312.10381 (2023) - 2022
- [j4]Yi Luo:
A Time-Domain Real-Valued Generalized Wiener Filter for Multi-Channel Neural Separation Systems. IEEE ACM Trans. Audio Speech Lang. Process. 30: 3008-3019 (2022) - [i20]Yi Luo, Jianwei Yu:
FRA-RIR: Fast Random Approximation of the Image-source Method. CoRR abs/2208.04101 (2022) - [i19]Yi Luo, Jianwei Yu:
Music Source Separation with Band-split RNN. CoRR abs/2209.15174 (2022) - 2021
- [b1]Yi Luo:
End-to-end Speech Separation with Neural Networks. Columbia University, USA, 2021 - [j3]Yi Luo, Cong Han, Nima Mesgarani:
Group Communication With Context Codec for Lightweight Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1752-1761 (2021) - [c20]Yi Luo, Zhuo Chen, Cong Han, Chenda Li, Tianyan Zhou, Nima Mesgarani:
Rethinking The Separation Layers In Speech Separation Networks. ICASSP 2021: 1-5 - [c19]Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. ICASSP 2021: 5739-5743 - [c18]Cong Han, Yi Luo, Chenda Li, Tianyan Zhou, Keisuke Kinoshita, Shinji Watanabe, Marc Delcroix, Hakan Erdogan, John R. Hershey, Nima Mesgarani, Zhuo Chen:
Continuous Speech Separation Using Speaker Inventory for Long Recording. Interspeech 2021: 3036-3040 - [c17]Yi Luo, Cong Han, Nima Mesgarani:
Distortion-Controlled Training for end-to-end Reverberant Speech Separation with Auxiliary Autoencoding Loss. SLT 2021: 825-832 - [c16]Chenda Li, Yi Luo, Cong Han, Jinyu Li, Takuya Yoshioka, Tianyan Zhou, Marc Delcroix, Keisuke Kinoshita, Christoph Böddeker, Yanmin Qian, Shinji Watanabe, Zhuo Chen:
Dual-Path RNN for Long Recording Speech Separation. SLT 2021: 865-872 - [c15]Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Maokui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis. SLT 2021: 897-904 - [i18]Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian:
Dual-Path Modeling for Long Recording Speech Separation in Meetings. CoRR abs/2102.11634 (2021) - 2020
- [c14]Yi Luo, Zhuo Chen, Takuya Yoshioka:
Dual-Path RNN: Efficient Long Sequence Modeling for Time-Domain Single-Channel Speech Separation. ICASSP 2020: 46-50 - [c13]Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka:
End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation. ICASSP 2020: 6394-6398 - [c12]Cong Han, Yi Luo, Nima Mesgarani:
Real-Time Binaural Speech Separation with Preserved Spatial Cues. ICASSP 2020: 6404-6408 - [c11]Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Xiong Xiao, Jinyu Li:
Continuous Speech Separation: Dataset and Analysis. ICASSP 2020: 7284-7288 - [c10]Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie:
An End-to-End Architecture of Online Multi-Channel Speech Separation. INTERSPEECH 2020: 81-85 - [c9]Yi Luo, Nima Mesgarani:
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss. INTERSPEECH 2020: 2622-2626 - [i17]Zhuo Chen, Takuya Yoshioka, Liang Lu, Tianyan Zhou, Zhong Meng, Yi Luo, Jian Wu, Jinyu Li:
Continuous speech separation: dataset and analysis. CoRR abs/2001.11482 (2020) - [i16]Cong Han, Yi Luo, Nima Mesgarani:
Real-time binaural speech separation with preserved spatial cues. CoRR abs/2002.06637 (2020) - [i15]Yi Luo, Nima Mesgarani:
Separating Varying Numbers of Sources with Auxiliary Autoencoding Loss. CoRR abs/2003.12326 (2020) - [i14]Jian Wu, Zhuo Chen, Jinyu Li, Takuya Yoshioka, Zhili Tan, Ed Lin, Yi Luo, Lei Xie:
An End-to-end Architecture of Online Multi-channel Speech Separation. CoRR abs/2009.03141 (2020) - [i13]Desh Raj, Pavel Denisov, Zhuo Chen, Hakan Erdogan, Zili Huang, Mao-Kui He, Shinji Watanabe, Jun Du, Takuya Yoshioka, Yi Luo, Naoyuki Kanda, Jinyu Li, Scott Wisdom, John R. Hershey:
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis. CoRR abs/2011.02014 (2020) - [i12]Yi Luo, Cong Han, Nima Mesgarani:
Ultra-Lightweight Speech Separation via Group Communication. CoRR abs/2011.08397 (2020) - [i11]Yi Luo, Zhuo Chen, Cong Han, Chenda Li, Tianyan Zhou, Nima Mesgarani:
Rethinking the Separation Layers in Speech Separation Networks. CoRR abs/2011.08400 (2020) - [i10]Yi Luo, Cong Han, Nima Mesgarani:
Group Communication with Context Codec for Ultra-Lightweight Source Separation. CoRR abs/2012.07291 (2020) - [i9]Cong Han, Yi Luo, Chenda Li, Tianyan Zhou, Keisuke Kinoshita, Shinji Watanabe, Marc Delcroix, Hakan Erdogan, John R. Hershey, Nima Mesgarani, Zhuo Chen:
Continuous Speech Separation Using Speaker Inventory for Long Multi-talker Recording. CoRR abs/2012.09727 (2020)
2010 – 2019
- 2019
- [j2]Yi Luo, Nima Mesgarani:
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation. IEEE ACM Trans. Audio Speech Lang. Process. 27(8): 1256-1266 (2019) - [c8]Yi Luo, Cong Han, Nima Mesgarani, Enea Ceolini, Shih-Chii Liu:
FaSNet: Low-Latency Adaptive Beamforming for Multi-Microphone Audio Processing. ASRU 2019: 260-267 - [c7]Cong Han, Yi Luo, Nima Mesgarani:
Online Deep Attractor Network for Real-time Single-channel Speech Separation. ICASSP 2019: 361-365 - [c6]Yi Luo, Nima Mesgarani:
Augmented Time-frequency Mask Estimation in Cluster-based Source Separation Algorithms. ICASSP 2019: 710-714 - [i8]Yi Luo, Enea Ceolini, Cong Han, Shih-Chii Liu, Nima Mesgarani:
FaSNet: Low-latency Adaptive Beamforming for Multi-microphone Audio Processing. CoRR abs/1909.13387 (2019) - [i7]Yi Luo, Zhuo Chen, Takuya Yoshioka:
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation. CoRR abs/1910.06379 (2019) - [i6]Yi Luo, Zhuo Chen, Nima Mesgarani, Takuya Yoshioka:
End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation. CoRR abs/1910.14104 (2019) - 2018
- [j1]Yi Luo, Zhuo Chen, Nima Mesgarani:
Speaker-Independent Speech Separation With Deep Attractor Network. IEEE ACM Trans. Audio Speech Lang. Process. 26(4): 787-796 (2018) - [c5]Yi Luo, Nima Mesgarani:
TaSNet: Time-Domain Audio Separation Network for Real-Time, Single-Channel Speech Separation. ICASSP 2018: 696-700 - [c4]Yi Luo, Nima Mesgarani:
Real-time Single-channel Dereverberation and Separation with Time-domain Audio Separation Network. INTERSPEECH 2018: 342-346 - [c3]Rajath Kumar, Yi Luo, Nima Mesgarani:
Music Source Activity Detection and Separation Using Deep Attractor Network. INTERSPEECH 2018: 347-351 - [i5]Yi Luo, Nima Mesgarani:
TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation. CoRR abs/1809.07454 (2018) - 2017
- [c2]Yi Luo, Zhuo Chen, John R. Hershey, Jonathan Le Roux, Nima Mesgarani:
Deep clustering and conventional networks for music separation: Stronger together. ICASSP 2017: 61-65 - [c1]Zhuo Chen, Yi Luo, Nima Mesgarani:
Deep attractor network for single-microphone speaker separation. ICASSP 2017: 246-250 - [i4]Zhuo Chen, Yi Luo, Nima Mesgarani:
Speaker-independent Speech Separation with Deep Attractor Network. CoRR abs/1707.03634 (2017) - [i3]Yi Luo, Nima Mesgarani:
TasNet: time-domain audio separation network for real-time, single-channel speech separation. CoRR abs/1711.00541 (2017) - 2016
- [i2]Yi Luo, Zhuo Chen, John R. Hershey, Jonathan Le Roux, Nima Mesgarani:
Deep Clustering and Conventional Networks for Music Separation: Stronger Together. CoRR abs/1611.06265 (2016) - [i1]Zhuo Chen, Yi Luo, Nima Mesgarani:
Deep attractor network for single-microphone speaker separation. CoRR abs/1611.08930 (2016)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-20 00:37 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint