Minglun Han

2020 – today

2024
[c7]
persistent URL: https://dblp.org/rec/conf/icassp/NiHC00L024
Ziyi Ni, Minglun Han, Feilong Chen, Linghui Meng, Jing Shi, Pin Lv, Bo Xu:
ViLaS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition. ICASSP 2024: 11366-11370
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2407-04675
Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li, Xiaoyang Li, Zeyang Li, Zehua Lin, Rui Liu, Shouda Liu, Lu Lu, Yizhou Lu, Jingting Ma, Shengtao Ma, Yulin Pei, Chen Shen, Tian Tan, Xiaogang Tian, Ming Tu, Bo Wang, Hao Wang, Yuping Wang, Yuxuan Wang, Hanzhang Xia, Rui Xia, Shuangyi Xie, Hongmin Xu, Meng Yang, Bihong Zhang, Jun Zhang, Wanyi Zhang, Yang Zhang, Yawei Zhang, Yijie Zheng, Ming Zou:
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition. CoRR abs/2407.04675 (2024)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-08680
Minglun Han, Ye Bai, Chen Shen, Youjia Huang, Mingkun Huang, Zehua Lin, Linhao Dong, Lu Lu, Yuxuan Wang:
NEST-RQ: Next Token Prediction for Speech Self-Supervised Pre-Training. CoRR abs/2409.08680 (2024)
2023
[j1]
persistent URL: https://dblp.org/rec/journals/ijautcomp/ChenZHCSXX23
Feilong Chen, Duzhen Zhang, Minglun Han, Xiuyi Chen, Jing Shi, Shuang Xu, Bo Xu:
VLP: A Survey on Vision-language Pre-training. Int. J. Autom. Comput. 20(1): 38-56 (2023)
[c6]
persistent URL: https://dblp.org/rec/conf/aaai/WangZHWZX23
Qingyu Wang, Tielin Zhang, Minglun Han, Yi Wang, Duzhen Zhang, Bo Xu:
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition. AAAI 2023: 102-109
[c5]
persistent URL: https://dblp.org/rec/conf/icassp/HuCWHNSXX23
Zefa Hu, Xiuyi Chen, Haoran Wu, Minglun Han, Ziyi Ni, Jing Shi, Shuang Xu, Bo Xu:
Matching-Based Term Semantics Pre-Training for Spoken Patient Query Understanding. ICASSP 2023: 1-5
[c4]
persistent URL: https://dblp.org/rec/conf/interspeech/HanC0X023
Minglun Han, Feilong Chen, Jing Shi, Shuang Xu, Bo Xu:
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation. INTERSPEECH 2023: 1364-1368
[c3]
persistent URL: https://dblp.org/rec/conf/interspeech/ChenH0X023
Feilong Chen, Minglun Han, Jing Shi, Shuang Xu, Bo Xu:
Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers. INTERSPEECH 2023: 3447-3451
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2301-13003
Minglun Han, Feilong Chen, Jing Shi, Shuang Xu, Bo Xu:
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation. CoRR abs/2301.13003 (2023)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2302-01194
Qingyu Wang, Tielin Zhang, Minglun Han, Yi Wang, Duzhen Zhang, Bo Xu:
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition. CoRR abs/2302.01194 (2023)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2303-01341
Zefa Hu, Xiuyi Chen, Haoran Wu, Minglun Han, Ziyi Ni, Jing Shi, Shuang Xu, Bo Xu:
Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding. CoRR abs/2303.01341 (2023)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-04160
Feilong Chen, Minglun Han, Haozhi Zhao, Qingyang Zhang, Jing Shi, Shuang Xu, Bo Xu:
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages. CoRR abs/2305.04160 (2023)
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2305-19972
Minglun Han, Feilong Chen, Ziyi Ni, Linghui Meng, Jing Shi, Shuang Xu, Bo Xu:
ViLaS: Integrating Vision and Language into Automatic Speech Recognition. CoRR abs/2305.19972 (2023)
2022
[c2]
persistent URL: https://dblp.org/rec/conf/icassp/HanDLCZMX22
Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu:
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection. ICASSP 2022: 8532-8536
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2201-12806
Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu:
Improving End-to-End Contextual Speech Recognition with Fine-grained Contextual Knowledge Selection. CoRR abs/2201.12806 (2022)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2202-09061
Feilong Chen, Duzhen Zhang, Minglun Han, Xiuyi Chen, Jing Shi, Shuang Xu, Bo Xu:
VLP: A Survey on Vision-Language Pre-training. CoRR abs/2202.09061 (2022)
2021
[c1]
persistent URL: https://dblp.org/rec/conf/icassp/HanDZX21
Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu:
Cif-Based Collaborative Decoding for End-to-End Contextual Speech Recognition. ICASSP 2021: 6528-6532
2020
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-2012-09466
Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu:
CIF-based Collaborative Decoding for End-to-end Contextual Speech Recognition. CoRR abs/2012.09466 (2020)
