default search action
Guanlong Zhao
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c13]Guanlong Zhao, Yongqiang Wang, Jason Pelecanos, Yu Zhang, Hank Liao, Yiling Huang, Han Lu, Quan Wang:
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models. ICASSP 2024: 11801-11805 - [i10]Quan Wang, Yiling Huang, Guanlong Zhao, Evan Clark, Wei Xia, Hank Liao:
DiarizationLM: Speaker Diarization Post-Processing with Large Language Models. CoRR abs/2401.03506 (2024) - 2023
- [c12]Beltrán Labrador, Guanlong Zhao, Ignacio López-Moreno, Angelo Scorza Scarpati, Liam Fowl, Quan Wang:
Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting. ICASSP 2023: 1-5 - [c11]Guanlong Zhao, Quan Wang, Han Lu, Yiling Huang, Ignacio López-Moreno:
Augmenting Transformer-Transducer Based Speaker Change Detection with Token-Level Training Loss. ICASSP 2023: 1-5 - [i9]Guanlong Zhao, Yongqiang Wang, Jason Pelecanos, Yu Zhang, Hank Liao, Yiling Huang, Han Lu, Quan Wang:
USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models. CoRR abs/2309.08023 (2023) - [i8]Yiling Huang, Weiran Wang, Guanlong Zhao, Hank Liao, Wei Xia, Quan Wang:
Towards Word-Level End-to-End Neural Speaker Diarization with Auxiliary Network. CoRR abs/2309.08489 (2023) - [i7]Beltrán Labrador, Pai Zhu, Guanlong Zhao, Angelo Scorza Scarpati, Quan Wang, Alicia Lozano-Diez, Alex Park, Ignacio López-Moreno:
Personalizing Keyword Spotting with Speaker Information. CoRR abs/2311.03419 (2023) - 2022
- [j5]Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna:
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning. Comput. Speech Lang. 72: 101302 (2022) - [i6]Quan Wang, Yiling Huang, Han Lu, Guanlong Zhao, Ignacio López-Moreno:
Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering. CoRR abs/2210.13690 (2022) - [i5]Beltrán Labrador, Guanlong Zhao, Ignacio López-Moreno, Angelo Scorza Scarpati, Liam Fowl, Quan Wang:
Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting. CoRR abs/2211.06478 (2022) - [i4]Guanlong Zhao, Quan Wang, Han Lu, Yiling Huang, Ignacio López-Moreno:
Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss. CoRR abs/2211.06482 (2022) - 2021
- [j4]Guanlong Zhao, Shaojin Ding, Ricardo Gutierrez-Osuna:
Converting Foreign Accent Speech Without a Reference. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2367-2381 (2021) - [c10]Alif Silpachai, Ivana Rehman, Taylor Anne Barriuso, John Levis, Evgeny Chukharev-Hudilainen, Guanlong Zhao, Ricardo Gutierrez-Osuna:
Effects of Voice Type and Task on L2 Learners' Awareness of Pronunciation Errors. Interspeech 2021: 1952-1956 - [c9]Adam Hair, Guanlong Zhao, Beena Ahmed, Kirrie J. Ballard, Ricardo Gutierrez-Osuna:
Assessing Posterior-Based Mispronunciation Detection on Field-Collected Recordings from Child Speech Therapy Sessions. Interspeech 2021: 2936-2940 - 2020
- [j3]Shaojin Ding, Guanlong Zhao, Christopher Liberatore, Ricardo Gutierrez-Osuna:
Learning Structured Sparse Representations for Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 28: 343-354 (2020) - [c8]Shaojin Ding, Guanlong Zhao, Ricardo Gutierrez-Osuna:
Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition. INTERSPEECH 2020: 776-780 - [c7]Anurag Das, Guanlong Zhao, John Levis, Evgeny Chukharev-Hudilainen, Ricardo Gutierrez-Osuna:
Understanding the Effect of Voice Quality and Accent on Talker Similarity. INTERSPEECH 2020: 1763-1767 - [i3]Arindrima Datta, Guanlong Zhao, Bhuvana Ramabhadran, Eugene Weinstein:
LSTM Acoustic Models Learn to Align and Pronounce with Graphemes. CoRR abs/2008.06121 (2020)
2010 – 2019
- 2019
- [j2]Shaojin Ding, Christopher Liberatore, Sinem Sonsaat, Ivana Lucic, Alif Silpachai, Guanlong Zhao, Evgeny Chukharev-Hudilainen, John Levis, Ricardo Gutierrez-Osuna:
Golden speaker builder - An interactive tool for pronunciation training. Speech Commun. 115: 51-66 (2019) - [j1]Guanlong Zhao, Ricardo Gutierrez-Osuna:
Using Phonetic Posteriorgram Based Frame Pairing for Segmental Accent Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 27(10): 1649-1660 (2019) - [c6]Guanlong Zhao, Shaojin Ding, Ricardo Gutierrez-Osuna:
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams. INTERSPEECH 2019: 2843-2847 - 2018
- [c5]Christopher Liberatore, Guanlong Zhao, Ricardo Gutierrez-Osuna:
Voice Conversion Through Residual Warping in a Sparse, Anchor-Based Representation of Speech. ICASSP 2018: 5284-5288 - [c4]Guanlong Zhao, Sinem Sonsaat, John Levis, Evgeny Chukharev-Hudilainen, Ricardo Gutierrez-Osuna:
Accent Conversion Using Phonetic Posteriorgrams. ICASSP 2018: 5314-5318 - [c3]Shaojin Ding, Guanlong Zhao, Christopher Liberatore, Ricardo Gutierrez-Osuna:
Improving Sparse Representations in Exemplar-Based Voice Conversion with a Phoneme-Selective Objective Function. INTERSPEECH 2018: 476-480 - [c2]Guanlong Zhao, Sinem Sonsaat, Alif Silpachai, Ivana Lucic, Evgeny Chukharev-Hudilainen, John Levis, Ricardo Gutierrez-Osuna:
L2-ARCTIC: A Non-native English Speech Corpus. INTERSPEECH 2018: 2783-2787 - [i2]Yu Liu, Guanlong Zhao:
PAD-Net: A Perception-Aided Single Image Dehazing Network. CoRR abs/1805.03146 (2018) - [i1]Yu Liu, Guanlong Zhao, Boyuan Gong, Yang Li, Ritu Raj, Niraj Goel, Satya Kesav, Sandeep Gottimukkala, Zhangyang Wang, Wenqi Ren, Dacheng Tao:
Improved Techniques for Learning to Dehaze and Beyond: A Collective Study. CoRR abs/1807.00202 (2018) - 2017
- [c1]Guanlong Zhao, Ricardo Gutierrez-Osuna:
Exemplar selection methods in voice conversion. ICASSP 2017: 5525-5529
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:19 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint