Hung-Shin Lee
2020 – today
- 2024
- [i19]Chien-Chun Wang, Li-Wei Chen, Hung-Shin Lee, Berlin Chen, Hsin-Min Wang:
Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation. CoRR abs/2409.01545 (2024)
- [i18]Li-Wei Chen, Hung-Shin Lee, Chen-Chi Chang:
VoxHakka: A Dialectally Diverse Multi-speaker Text-to-Speech System for Taiwanese Hakka. CoRR abs/2409.01548 (2024)
- [i17]Chen-Chi Chang, Ching-Yuan Chen, Hung-Shin Lee, Chih-Cheng Lee:
Benchmarking Cognitive Domains for LLMs: Insights from Taiwanese Hakka Culture. CoRR abs/2409.01556 (2024)
- [i16]Yao-Fei Cheng, Li-Wei Chen, Hung-Shin Lee, Hsin-Min Wang:
Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages. CoRR abs/2409.08872 (2024)
- [i15]Chien-Chun Wang, Li-Wei Chen, Cheng-Kang Chou, Hung-Shin Lee, Berlin Chen, Hsin-Min Wang:
Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition. CoRR abs/2409.12386 (2024)
- 2023
- [j3]Chin-Yi Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Multi-Target Extractor and Detector for Unknown-Number Speaker Diarization. IEEE Signal Process. Lett. 30: 638-642 (2023)
- [c39]Li-Wei Chen, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
A Training and Inference Strategy Using Noisy and Enhanced Speech as Target for Speech Enhancement without Clean Speech. INTERSPEECH 2023: 2473-2477
- [i14]Li-Wei Chen, Kai-Chen Cheng, Hung-Shin Lee:
The North System for Formosa Speech Recognition Challenge 2023. CoRR abs/2310.03443 (2023)
- 2022
- [c38]Hung-Shin Lee, Pin-Tuan Huang, Yao-Fei Cheng, Hsin-Min Wang:
Chain-based Discriminative Autoencoders for Speech Recognition. INTERSPEECH 2022: 2078-2082
- [c37]Fan-Lin Wang, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks. INTERSPEECH 2022: 5343-5347
- [c36]Hung-Shin Lee, Pin-Yuan Chen, Yao-Fei Cheng, Yu Tsao, Hsin-Min Wang:
Speech-enhanced and Noise-aware Networks for Robust Speech Recognition. ISCSLP 2022: 145-149
- [i13]Hung-Shin Lee, Pin-Tuan Huang, Yao-Fei Cheng, Hsin-Min Wang:
Chain-based Discriminative Autoencoders for Speech Recognition. CoRR abs/2203.13687 (2022)
- [i12]Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao, Hsin-Min Wang:
Speech-enhanced and Noise-aware Networks for Robust Speech Recognition. CoRR abs/2203.13696 (2022)
- [i11]Hung-Shin Lee, Yu Tsao, Shyh-Kang Jeng, Hsin-Min Wang:
Subspace-based Representation and Learning for Phonotactic Spoken Language Recognition. CoRR abs/2203.15576 (2022)
- [i10]Chin-Yi Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Multi-Target Filter and Detector for Speaker Diarization. CoRR abs/2203.16007 (2022)
- [i9]Fan-Lin Wang, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks. CoRR abs/2203.16040 (2022)
- [i8]Yu-Huai Peng, Hung-Shin Lee, Pin-Tuan Huang, Hsin-Min Wang:
Generation of Speaker Representations Using Heterogeneous Training Batch Assembly. CoRR abs/2203.16646 (2022)
- [i7]Chiang-Lin Tai, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
Filter-based Discriminative Autoencoders for Children Speech Recognition. CoRR abs/2204.00164 (2022)
- [i6]Li-Wei Chen, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
A Teacher-student Framework for Unsupervised Speech Enhancement Using Noise Remixing Training and Two-stage Inference. CoRR abs/2210.15368 (2022)
- [i5]Fan-Lin Wang, Yao-Fei Cheng, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
CasNet: Investigating Channel Robustness for Speech Separation. CoRR abs/2210.15370 (2022)
- 2021
- [c35]Yu-Huai Peng, Hung-Shin Lee, Pin-Tuan Huang, Hsin-Min Wang:
Generation of Speaker Representations Using Heterogeneous Training Batch Assembly. APSIPA ASC 2021: 719-724
- [c34]Chung-En Sun, Yi-Wei Chen, Hung-Shin Lee, Yen-Hsing Chen, Hsin-Min Wang:
Melody Harmonization Using Orderless NADE, Chord Balancing, and Blocked Gibbs Sampling. ICASSP 2021: 4145-4149
- [c33]Yao-Fei Cheng, Hung-Shin Lee, Hsin-Min Wang:
AlloST: Low-Resource Speech Translation Without Source Transcription. Interspeech 2021: 2252-2256
- [c32]Fan-Lin Wang, Yu-Huai Peng, Hung-Shin Lee, Hsin-Min Wang:
Dual-Path Filter Network: Speaker-Aware Modeling for Speech Separation. Interspeech 2021: 3061-3065
- [c31]Yi-Chiao Wu, Cheng-Hung Hu, Hung-Shin Lee, Yu-Huai Peng, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda:
Relational Data Selection for Data Augmentation of Speaker-Dependent Multi-Band MelGAN Vocoder. Interspeech 2021: 3630-3634
- [c30]Yi-Wei Chen, Hung-Shin Lee, Yen-Hsing Chen, Hsin-Min Wang:
SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours. ISMIR 2021: 105-112
- [i4]Yao-Fei Cheng, Hung-Shin Lee, Hsin-Min Wang:
AlloST: Low-resource Speech Translation without Source Transcription. CoRR abs/2105.00171 (2021)
- [i3]Yi-Wei Chen, Hung-Shin Lee, Yen-Hsing Chen, Hsin-Min Wang:
SurpriseNet: Melody Harmonization Conditioning on User-controlled Surprise Contours. CoRR abs/2108.00378 (2021)
- 2020
- [j2]Hung-Shin Lee, Yu Tsao, Shyh-Kang Jeng, Hsin-Min Wang:
Subspace-Based Representation and Learning for Phonotactic Spoken Language Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 28: 3065-3079 (2020)
- [c29]Yu-Huai Peng, Cheng-Hung Hu, Alexander Chao-Fu Kang, Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao, Hsin-Min Wang:
The Academia Sinica Systems of Voice Conversion for VCC2020. Blizzard Challenge / Voice Conversion Challenge 2020
- [c28]Hao Yen, Pin-Jui Ku, Ming-Chi Yen, Hung-Shin Lee, Hsin-Min Wang:
Joint Training of Guided Learning and Mean Teacher Models for Sound Event Detection. DCASE 2020: 235-239
- [c27]Pin-Yuan Chen, Chia-Hua Wu, Hung-Shin Lee, Shao-Kang Tsao, Ming-Tat Ko, Hsin-Min Wang:
Using Taigi Dramas with Mandarin Chinese Subtitles to Improve Taigi Speech Recognition. O-COCOSDA 2020: 71-76
- [i2]Yu-Huai Peng, Cheng-Hung Hu, Alexander Chao-Fu Kang, Hung-Shin Lee, Pin-Yuan Chen, Yu Tsao, Hsin-Min Wang:
The Academia Sinica Systems of Voice Conversion for VCC2020. CoRR abs/2010.02669 (2020)
- [i1]Chung-En Sun, Yi-Wei Chen, Hung-Shin Lee, Yen-Hsing Chen, Hsin-Min Wang:
Melody Harmonization Using Orderless NADE, Chord Balancing, and Blocked Gibbs Sampling. CoRR abs/2010.13468 (2020)
2010 – 2019
- 2019
- [c26]Yueh-Ting Lee, Xuan-Bo Chen, Hung-Shin Lee, Jyh-Shing Roger Jang, Hsin-Min Wang:
Multi-task Learning for Acoustic Modeling Using Articulatory Attributes. APSIPA 2019: 855-861
- [c25]Shang-Bao Luo, Hung-Shin Lee, Kuan-Yu Chen, Hsin-Min Wang:
Spoken Multiple-Choice Question Answering Using Multimodal Convolutional Neural Networks. ASRU 2019: 772-778
- [c24]Pin-Tuan Huang, Hung-Shin Lee, Syu-Siang Wang, Kuan-Yu Chen, Yu Tsao, Hsin-Min Wang:
Exploring the Encoder Layers of Discriminative Autoencoders for LVCSR. INTERSPEECH 2019: 1631-1635
- 2018
- [c23]Yi-Ying Kao, Hsiang-Ping Hsu, Chien-Feng Liao, Yu Tsao, Hao-Chun Yang, Jeng-Lin Li, Chi-Chun Lee, Hung-Shin Lee, Hsin-Min Wang:
Automatic Detection of Speech Under Cold Using Discriminative Autoencoders and Strength Modeling with Multiple Sub-Dictionary Generation. IWAENC 2018: 416-420
- 2017
- [j1]Chia-Lung Wu, Hsiang-Ping Hsu, Yu-Ding Lu, Yu Tsao, Hung-Shin Lee, Hsin-Min Wang:
A Replay Spoofing Detection System Based on Discriminative Autoencoders. Int. J. Comput. Linguistics Chin. Lang. Process. 22(2) (2017)
- [c22]Hung-Shin Lee, Yu-Ding Lu, Chin-Cheng Hsu, Yu Tsao, Hsin-Min Wang, Shyh-Kang Jeng:
Discriminative autoencoders for speaker verification. ICASSP 2017: 5375-5379
- [c21]Ming-Han Yang, Hung-Shin Lee, Yu-Ding Lu, Kuan-Yu Chen, Yu Tsao, Berlin Chen, Hsin-Min Wang:
Discriminative Autoencoders for Acoustic Modeling. INTERSPEECH 2017: 3557-3561
- [c20]Yu-Ding Lu, Hung-Shin Lee, Yu Tsao, Hsin-Min Wang:
基於鑑別式自編碼解碼器之錄音回放攻擊偵測系統 (A Replay Spoofing Detection System Based on Discriminative Autoencoders) [In Chinese]. ROCLING 2017: 114-115
- [c19]Cheng-Jo Ray Chang, Hung-Shin Lee, Hsin-Min Wang, Jyh-Shing Roger Jang:
基於i-vector與PLDA並使用GMM-HMM強制對位之自動語者分段標記系統 (Speaker Diarization based on I-vector PLDA Scoring and using GMM-HMM Forced Alignment) [In Chinese]. ROCLING 2017: 119-135
- 2016
- [c18]Hung-Shin Lee, Yu Tsao, Chi-Chun Lee, Hsin-Min Wang, Wei-Cheng Lin, Wei-Chen Chen, Shan-Wen Hsiao, Shyh-Kang Jeng:
Minimization of Regression and Ranking Losses with Shallow Neural Networks on Automatic Sincerity Evaluation. INTERSPEECH 2016: 2031-2035
- 2015
- [c17]Shih-Hung Liu, Hung-Shin Lee, Hsiao-Tsung Hung, Kuan-Yu Chen, Berlin Chen, Hsin-Min Wang, Hsu-Chun Yen, Wen-Lian Hsu:
Incorporating proximity information in relevance language modeling for extractive speech summarization. APSIPA 2015: 401-407
- 2014
- [c16]Hung-Shin Lee, Yu Tso, Yun-Fan Chang, Hsin-Min Wang, Shyh-Kang Jeng:
Speaker verification using kernel-based binary classifiers with binary operation derived features. ICASSP 2014: 1660-1664
- [c15]Kuan-Yu Chen, Hung-Shin Lee, Hsin-Min Wang, Berlin Chen, Hsin-Hsi Chen:
I-vector based language modeling for spoken document retrieval. ICASSP 2014: 7083-7088
- [c14]How Jing, Ting-Yao Hu, Hung-Shin Lee, Wei-Chen Chen, Chi-Chun Lee, Yu Tsao, Hsin-Min Wang:
Ensemble of machine learning algorithms for cognitive and physical speaker load detection. INTERSPEECH 2014: 447-451
- [c13]Hung-Shin Lee, Yu Tsao, Hsin-Min Wang, Shyh-Kang Jeng:
Clustering-based i-vector formulation for speaker recognition. INTERSPEECH 2014: 1101-1105
- 2013
- [c12]Kuan-Yu Chen, Hung-Shin Lee, Chung-Han Lee, Hsin-Min Wang, Hsin-Hsi Chen:
A Study of Language Modeling for Chinese Spelling Check. SIGHAN@IJCNLP 2013: 79-83
- [c11]Hung-Shin Lee, Yu-Chin Shih, Hsin-Min Wang, Shyh-Kang Jeng:
Subspace-based phonotactic language recognition using multivariate dynamic linear models. ICASSP 2013: 6870-6874
- 2012
- [c10]Yu-Chin Shih, Hung-Shin Lee, Hsin-Min Wang, Shyh-Kang Jeng:
Subspace-Based Feature Representation and Learning for Language Recognition. INTERSPEECH 2012: 2061-2064
- 2011
- [c9]Ju-Chiang Wang, Hung-Shin Lee, Hsin-Min Wang, Shyh-Kang Jeng:
Learning the Similarity of Audio Music in Bag-of-frames Representation from Tagged Music Data. ISMIR 2011: 85-90
- 2010
- [c8]Hung-Shin Lee, Hsin-Min Wang, Berlin Chen:
A Discriminative and Heteroscedastic Linear Feature Transformation for Multiclass Classification. ICPR 2010: 690-693
- [c7]Meng-Sung Wu, Hung-Shin Lee, Hsin-Min Wang:
Exploiting semantic associative information in topic modeling. SLT 2010: 384-388
2000 – 2009
- 2009
- [c6]Hung-Shin Lee, Berlin Chen:
Generalized likelihood ratio discriminant analysis. ASRU 2009: 158-163
- [c5]Hung-Shin Lee, Berlin Chen:
Empirical error rate minimization based linear discriminant analysis. ICASSP 2009: 1801-1804
- [c4]Hung-Shin Lee, Berlin Chen:
相似度比率式鑑別分析應用於大詞彙連續語音辨識 (Likelihood Ratio Based Discriminant Analysis for Large Vocabulary Continuous Speech Recognition) [In Chinese]. ROCLING 2009
- 2008
- [c3]Hung-Shin Lee, Berlin Chen:
Linear discriminant feature extraction using weighted classification confusion information. INTERSPEECH 2008: 2254-2257
- [c2]Hung-Shin Lee, Berlin Chen:
Improved Linear Discriminant Analysis Considering Empirical Pairwise Classification Error Rates. ISCSLP 2008: 149-152
- 2007
- [c1]Shih-Hung Liu, Fang-Hui Chu, Shih-Hsiang Lin, Hung-Shin Lee, Berlin Chen:
Training data selection for improving discriminative training of acoustic models. ASRU 2007: 284-289
last updated on 2024-10-18 19:30 CEST by the dblp team
all metadata released as open data under CC0 1.0 license