default search action
Xubo Liu
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j4]Haohe Liu, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Qiao Tian, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation With Self-Supervised Pretraining. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2871-2883 (2024) - [j3]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Towards Generating Diverse Audio Captions via Adversarial Training. IEEE ACM Trans. Audio Speech Lang. Process. 32: 3311-3323 (2024) - [c39]Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning Temporal Resolution in Spectrogram for Audio Classification. AAAI 2024: 13873-13881 - [c38]Qiushi Huang, Xubo Liu, Tom Ko, Bo Wu, Wenwu Wang, Yu Zhang, Lilian Tang:
Selective Prompting Tuning for Personalized Conversations with LLMs. ACL (Findings) 2024: 16212-16226 - [c37]Xumeng Liu, Wenya Guo, Ying Zhang, Xubo Liu, Yu Zhao, Shenglong Yu, Xiaojie Yuan:
Look before You Leap: Dual Logical Verification for Knowledge-based Visual Question Generation. LREC/COLING 2024: 10802-10812 - [c36]Junqi Zhao, Xubo Liu, Jinzheng Zhao, Yi Yuan, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder. EUSIPCO 2024: 1-5 - [c35]Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Retrieval-Augmented Text-to-Audio Generation. ICASSP 2024: 581-585 - [c34]Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang:
First-Shot Unsupervised Anomalous Sound Detection with Unknown Anomalies Estimated by Metadata-Assisted Audio Generation. ICASSP 2024: 1271-1275 - [c33]Yuzhuo Liu, Xubo Liu, Yan Zhao, Yuanyuan Wang, Rui Xia, Pingchuan Tain, Yuxuan Wang:
Audio Prompt Tuning for Universal Sound Separation. ICASSP 2024: 1446-1450 - [c32]Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-Modal Perception for Interactive-Enhanced Audio-Visual Video Parsing. ICASSP 2024: 8421-8425 - [c31]Ruxue Yan, Wenya Guo, Xubo Liu, Xumeng Liu, Ying Zhang, Xiaojie Yuan:
Tracking-forced Referring Video Object Segmentation. ACM Multimedia 2024: 5356-5364 - [i50]Yi Yuan, Zhuo Chen, Xubo Liu, Haohe Liu, Xuenan Xu, Dongya Jia, Yuanzhe Chen, Mark D. Plumbley, Wenwu Wang:
T-CLAP: Temporal-Enhanced Contrastive Language-Audio Pretraining. CoRR abs/2404.17806 (2024) - [i49]Qixin Deng, Qikai Yang, Ruibin Yuan, Yipeng Huang, Yi Wang, Xubo Liu, Zeyue Tian, Jiahao Pan, Ge Zhang, Hanfeng Lin, Yizhi Li, Yinghao Ma, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenwu Wang, Guangyu Xia, Wei Xue, Yike Guo:
ComposerX: Multi-Agent Symbolic Music Composition with LLMs. CoRR abs/2404.18081 (2024) - [i48]Meng Cui, Xubo Liu, Haohe Liu, Jinzheng Zhao, Daoliang Li, Wenwu Wang:
Fish Tracking, Counting, and Behaviour Analysis in Digital Aquaculture: A Comprehensive Review. CoRR abs/2406.17800 (2024) - [i47]Qiushi Huang, Xubo Liu, Tom Ko, Bo Wu, Wenwu Wang, Yu Zhang, Lilian Tang:
Selective Prompting Tuning for Personalized Conversations with LLMs. CoRR abs/2406.18187 (2024) - [i46]Qiushi Huang, Shuai Fu, Xubo Liu, Wenwu Wang, Tom Ko, Yu Zhang, Lilian Tang:
Learning Retrieval Augmentation for Personalized Dialogue Generation. CoRR abs/2406.18847 (2024) - [i45]Yi Yuan, Dongya Jia, Xiaobin Zhuang, Yuanzhe Chen, Zhengxi Liu, Zhuo Chen, Yuping Wang, Yuxuan Wang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Improving Audio Generation with Visual Enhanced Caption. CoRR abs/2407.04416 (2024) - [i44]Feiyang Xiao, Jian Guan, Qiaoxi Zhu, Xubo Liu, Wenbo Wang, Shuhan Qi, Kejia Zhang, Jianyuan Sun, Wenwu Wang:
A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining. CoRR abs/2407.04936 (2024) - [i43]Junqi Zhao, Xubo Liu, Jinzheng Zhao, Yi Yuan, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
Universal Sound Separation with Self-Supervised Audio Masked Autoencoder. CoRR abs/2407.11745 (2024) - [i42]Yi Yuan, Xubo Liu, Haohe Liu, Mark D. Plumbley, Wenwu Wang:
FlowSep: Language-Queried Sound Separation with Rectified Flow Matching. CoRR abs/2409.07614 (2024) - [i41]Xiaoyu Bie, Xubo Liu, Gaël Richard:
Learning Source Disentanglement in Neural Audio Codec. CoRR abs/2409.11228 (2024) - 2023
- [c30]Qiushi Huang, Yu Zhang, Tom Ko, Xubo Liu, Bo Wu, Wenwu Wang, H. Lilian Tang:
Personalized Dialogue Generation with Persona-Adaptive Attention. AAAI 2023: 12916-12923 - [c29]Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech RecognitionWith Synthetic Supervision. CVPR 2023: 18806-18815 - [c28]Qiushi Huang, Shuai Fu, Xubo Liu, Wenwu Wang, Tom Ko, Yu Zhang, Lilian Tang:
Learning Retrieval Augmentation for Personalized Dialogue Generation. EMNLP 2023: 2523-2540 - [c27]Özkan Çayli, Xubo Liu, Volkan Kiliç, Wenwu Wang:
Knowledge Distillation for Efficient Audio-Visual Video Captioning. EUSIPCO 2023: 745-749 - [c26]Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Leveraging Pre-Trained AudioLDM for Sound Generation: A Benchmark Study. EUSIPCO 2023: 765-769 - [c25]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-Ends for Efficient Audio Classification. ICASSP 2023: 1-5 - [c24]Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang, Mark D. Plumbley:
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. ICML 2023: 21450-21474 - [c23]Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang:
Adapting Language-Audio Models as Few-Shot Audio Learners. INTERSPEECH 2023: 276-280 - [c22]Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. INTERSPEECH 2023: 2838-2842 - [c21]Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. INTERSPEECH 2023: 3799-3803 - [c20]Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kiliç, Mark D. Plumbley, Wenwu Wang:
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. INTERSPEECH 2023: 4164-4168 - [i40]Haohe Liu, Zehua Chen, Yi Yuan, Xinhao Mei, Xubo Liu, Danilo P. Mandic, Wenwu Wang, Mark D. Plumbley:
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models. CoRR abs/2301.12503 (2023) - [i39]Yi Yuan, Haohe Liu, Jinhua Liang, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Leveraging Pre-trained AudioLDM for Text to Sound Generation: A Benchmark Study. CoRR abs/2303.03857 (2023) - [i38]Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision. CoRR abs/2303.17200 (2023) - [i37]Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Mark D. Plumbley, Wenwu Wang:
Latent Diffusion Model Based Foley Sound Generation System For DCASE Challenge 2023 Task 7. CoRR abs/2305.15905 (2023) - [i36]Jinhua Liang, Xubo Liu, Haohe Liu, Huy Phan, Emmanouil Benetos, Mark D. Plumbley, Wenwu Wang:
Adapting Language-Audio Models as Few-Shot Audio Learners. CoRR abs/2305.17719 (2023) - [i35]Jianyuan Sun, Xubo Liu, Xinhao Mei, Volkan Kiliç, Mark D. Plumbley, Wenwu Wang:
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning. CoRR abs/2305.18753 (2023) - [i34]Yi Yuan, Haohe Liu, Xubo Liu, Xiyuan Kang, Peipei Wu, Mark D. Plumbley, Wenwu Wang:
Text-Driven Foley Sound Generation With Latent Diffusion Model. CoRR abs/2306.10359 (2023) - [i33]Xubo Liu, Zhongkai Zhu, Haohe Liu, Yi Yuan, Meng Cui, Qiushi Huang, Jinhua Liang, Yin Cao, Qiuqiang Kong, Mark D. Plumbley, Wenwu Wang:
WavJourney: Compositional Audio Creation with Large Language Models. CoRR abs/2307.14335 (2023) - [i32]Xubo Liu, Qiuqiang Kong, Yan Zhao, Haohe Liu, Yi Yuan, Yuzhuo Liu, Rui Xia, Yuxuan Wang, Mark D. Plumbley, Wenwu Wang:
Separate Anything You Describe. CoRR abs/2308.05037 (2023) - [i31]Haohe Liu, Qiao Tian, Yi Yuan, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Yuping Wang, Wenwu Wang, Yuxuan Wang, Mark D. Plumbley:
AudioLDM 2: Learning Holistic Audio Generation with Self-supervised Pretraining. CoRR abs/2308.05734 (2023) - [i30]Meng Cui, Xubo Liu, Haohe Liu, Zhuangzhuang Du, Tao Chen, Guoping Lian, Daoliang Li, Wenwu Wang:
Multimodal Fish Feeding Intensity Assessment in Aquaculture. CoRR abs/2309.05058 (2023) - [i29]Yi Yuan, Haohe Liu, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Retrieval-Augmented Text-to-Audio Generation. CoRR abs/2309.08051 (2023) - [i28]Feiyang Xiao, Qiaoxi Zhu, Jian Guan, Xubo Liu, Haohe Liu, Kejia Zhang, Wenwu Wang:
Synth-AC: Enhancing Audio Captioning with Synthetic Supervision. CoRR abs/2309.09705 (2023) - [i27]Yaru Chen, Ruohao Guo, Xubo Liu, Peipei Wu, Guangyao Li, Zhenbo Li, Wenwu Wang:
CM-PIE: Cross-modal perception for interactive-enhanced audio-visual video parsing. CoRR abs/2310.07517 (2023) - [i26]Hejing Zhang, Qiaoxi Zhu, Jian Guan, Haohe Liu, Feiyang Xiao, Jiantong Tian, Xinhao Mei, Xubo Liu, Wenwu Wang:
First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation. CoRR abs/2310.14173 (2023) - [i25]Yuzhuo Liu, Xubo Liu, Yan Zhao, Yuanyuan Wang, Rui Xia, Pingchuan Tain, Yuxuan Wang:
Audio Prompt Tuning for Universal Sound Separation. CoRR abs/2311.18399 (2023) - 2022
- [j2]Xinhao Mei, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Automated audio captioning: an overview of recent progress and new challenges. EURASIP J. Audio Speech Music. Process. 2022(1): 26 (2022) - [c19]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-Level Metric Learning for Few-Shot Bioacoustic Event Detection. DCASE 2022 - [c18]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning for On-Ddevice Environmental Sound Classification. DCASE 2022 - [c17]Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Deep Neural Decision Forest for Acoustic Scene Classification. EUSIPCO 2022: 772-776 - [c16]Jinzheng Zhao, Peipei Wu, Shidrokh Goudarzi, Xubo Liu, Jianyuan Sun, Yong Xu, Wenwu Wang:
Visually Assisted Self-supervised Audio Speaker Localization and Tracking. EUSIPCO 2022: 787-791 - [c15]Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kilic, Wenwu Wang:
Leveraging Pre-trained BERT for Audio Captioning. EUSIPCO 2022: 1145-1149 - [c14]Yunxiang Liu, Jianlin Zhu, Xubo Liu, Xinxin Yuan:
Path Planning based on Astar Algorithm in Automatic Driving. ICACS 2022: 7:1-7:4 - [c13]Jinzheng Zhao, Peipei Wu, Xubo Liu, Yong Xu, Lyudmila Mihaylova, Simon J. Godsill, Wenwu Wang:
Audio-Visual Tracking of Multiple Speakers Via a PMBM Filter. ICASSP 2022: 5068-5072 - [c12]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Diverse Audio Captioning Via Adversarial Training. ICASSP 2022: 8882-8886 - [c11]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. INTERSPEECH 2022: 1801-1805 - [c10]Jinzheng Zhao, Peipei Wu, Xubo Liu, Shidrokh Goudarzi, Haohe Liu, Yong Xu, Wenwu Wang:
Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter. INTERSPEECH 2022: 3704-3708 - [c9]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
On Metric Learning for Audio-Text Cross-Modal Retrieval. INTERSPEECH 2022: 4142-4146 - [c8]Haohe Liu, Woosung Choi, Xubo Liu, Qiuqiang Kong, Qiao Tian, DeLiang Wang:
Neural Vocoder is All You Need for Speech Super-resolution. INTERSPEECH 2022: 4227-4231 - [c7]Haohe Liu, Xubo Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang:
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration. INTERSPEECH 2022: 4232-4236 - [c6]Meng Cui, Xubo Liu, Jinzheng Zhao, Jianyuan Sun, Guoping Lian, Tao Chen, Mark D. Plumbley, Daoliang Li, Wenwu Wang:
Fish Feeding Intensity Assessment in Aquaculture: A New Audio Dataset AFFIA3K and a Deep Learning Algorithm. MLSP 2022: 1-6 - [i24]Xubo Liu, Xinhao Mei, Qiushi Huang, Jianyuan Sun, Jinzheng Zhao, Haohe Liu, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Leveraging Pre-trained BERT for Audio Captioning. CoRR abs/2203.02838 (2022) - [i23]Jianyuan Sun, Xubo Liu, Xinhao Mei, Jinzheng Zhao, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Deep Neural Decision Forest for Acoustic Scene Classification. CoRR abs/2203.03436 (2022) - [i22]Haohe Liu, Woosung Choi, Xubo Liu, Qiuqiang Kong, Qiao Tian, DeLiang Wang:
Neural Vocoder is All You Need for Speech Super-resolution. CoRR abs/2203.14941 (2022) - [i21]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Separate What You Describe: Language-Queried Audio Source Separation. CoRR abs/2203.15147 (2022) - [i20]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
On Metric Learning for Audio-Text Cross-Modal Retrieval. CoRR abs/2203.15537 (2022) - [i19]Haohe Liu, Xubo Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang:
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration. CoRR abs/2204.05841 (2022) - [i18]Xinhao Mei, Xubo Liu, Mark D. Plumbley, Wenwu Wang:
Automated Audio Captioning: an Overview of Recent Progress and New Challenges. CoRR abs/2205.05949 (2022) - [i17]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning For On-Device Environmental Sound Classification. CoRR abs/2207.07429 (2022) - [i16]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Segment-level Metric Learning for Few-shot Bioacoustic Event Detection. CoRR abs/2207.07773 (2022) - [i15]Haohe Liu, Xubo Liu, Xinhao Mei, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Surrey System for DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection with Segment-level Metric Learning. CoRR abs/2207.10547 (2022) - [i14]Arshdeep Singh, James A. King, Xubo Liu, Wenwu Wang, Mark D. Plumbley:
Low-complexity CNNs for Acoustic Scene Classification. CoRR abs/2208.01555 (2022) - [i13]Xubo Liu, Haohe Liu, Qiuqiang Kong, Xinhao Mei, Mark D. Plumbley, Wenwu Wang:
Simple Pooling Front-ends For Efficient Audio Classification. CoRR abs/2210.00943 (2022) - [i12]Haohe Liu, Xubo Liu, Qiuqiang Kong, Wenwu Wang, Mark D. Plumbley:
Learning the Spectrogram Temporal Resolution for Audio Classification. CoRR abs/2210.01719 (2022) - [i11]Jianyuan Sun, Xubo Liu, Xinhao Mei, Mark D. Plumbley, Volkan Kilic, Wenwu Wang:
Automated Audio Captioning via Fusion of Low- and High- Dimensional Features. CoRR abs/2210.05037 (2022) - [i10]Qiushi Huang, Yu Zhang, Tom Ko, Xubo Liu, Bo Wu, Wenwu Wang, H. Lilian Tang:
Personalized Dialogue Generation with Persona-Adaptive Attention. CoRR abs/2210.15088 (2022) - [i9]Xubo Liu, Qiushi Huang, Xinhao Mei, Haohe Liu, Qiuqiang Kong, Jianyuan Sun, Shengchen Li, Tom Ko, Yu Zhang, H. Lilian Tang, Mark D. Plumbley, Volkan Kiliç, Wenwu Wang:
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention. CoRR abs/2210.16428 (2022) - [i8]Haohe Liu, Qiuqiang Kong, Xubo Liu, Xinhao Mei, Wenwu Wang, Mark D. Plumbley:
Ontology-aware Learning and Evaluation for Audio Tagging. CoRR abs/2211.12195 (2022) - [i7]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Towards Generating Diverse Audio Captions via Adversarial Training. CoRR abs/2212.02033 (2022) - 2021
- [c5]Xubo Liu, Qiushi Huang, Xinhao Mei, Tom Ko, H. Lilian Tang, Mark D. Plumbley, Wenwu Wang:
CL4AC: A Contrastive Loss for Audio Captioning. DCASE 2021: 196-200 - [c4]Xinhao Mei, Qiushi Huang, Xubo Liu, Gengyun Chen, Jingqian Wu, Yusong Wu, Jinzheng Zhao, Shengchen Li, Tom Ko, H. Lilian Tang, Xi Shao, Mark D. Plumbley, Wenwu Wang:
An Encoder-Decoder Based Audio Captioning System with Transfer and Reinforcement Learning. DCASE 2021: 206-210 - [c3]Xinhao Mei, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Audio Captioning Transformer. DCASE 2021: 211-215 - [c2]Qiushi Huang, Tom Ko, H. Lilian Tang, Xubo Liu, Bo Wu:
Token-Level Supervised Contrastive Learning for Punctuation Restoration. Interspeech 2021: 2012-2016 - [c1]Xubo Liu, Turab Iqbal, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning. MLSP 2021: 1-6 - [i6]Qiushi Huang, Tom Ko, H. Lilian Tang, Xubo Liu, Bo Wu:
Token-Level Supervised Contrastive Learning for Punctuation Restoration. CoRR abs/2107.09099 (2021) - [i5]Xinhao Mei, Xubo Liu, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Audio Captioning Transformer. CoRR abs/2107.09817 (2021) - [i4]Xubo Liu, Qiushi Huang, Xinhao Mei, Tom Ko, H. Lilian Tang, Mark D. Plumbley, Wenwu Wang:
CL4AC: A Contrastive Loss for Audio Captioning. CoRR abs/2107.09990 (2021) - [i3]Xubo Liu, Turab Iqbal, Jinzheng Zhao, Qiushi Huang, Mark D. Plumbley, Wenwu Wang:
Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning. CoRR abs/2107.09998 (2021) - [i2]Xinhao Mei, Qiushi Huang, Xubo Liu, Gengyun Chen, Jingqian Wu, Yusong Wu, Jinzheng Zhao, Shengchen Li, Tom Ko, H. Lilian Tang, Xi Shao, Mark D. Plumbley, Wenwu Wang:
An Encoder-Decoder Based Audio Captioning System With Transfer and Reinforcement Learning. CoRR abs/2108.02752 (2021) - [i1]Xinhao Mei, Xubo Liu, Jianyuan Sun, Mark D. Plumbley, Wenwu Wang:
Diverse Audio Captioning via Adversarial Training. CoRR abs/2110.06691 (2021)
2010 – 2019
- 2019
- [j1]Zhenbao Liu, Xubo Liu, Jie Chen, Chen Fang:
Altitude Control for Variable Load Quadrotor via Learning Rate Based Robust Sliding Mode Controller. IEEE Access 7: 9736-9744 (2019)
Coauthor Index
aka: H. Lilian Tang
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-07 21:28 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint