Qing Li

Cited by

	All	Since 2019
Citations	2126	1979
h-index	21	19
i10-index	23	22

840

420

210

630

2016201720182019202020212022202320248 55 60 98 152 216 296 348 834

Public access

View all

10 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Siyuan HuangBeijing Institute for General Artificial Intelligence (BIGAI)Verified email at ucla.edu
song-chun zhuProfessor of Statistics and Computer Science, UCLAVerified email at stat.ucla.edu
Xiaojian MaUniversity of California, Los AngelesVerified email at cs.ucla.edu
Jiebo Luo, Fellow of NAI/ACM/AAAI/IE...Albert Arendt Hopeman Professor of Engineering, University of RochesterVerified email at cs.rochester.edu
Danna GurariAssistant Professor, University of Colorado Boulder - Director of Image and Video Computing GroupVerified email at colorado.edu
Yining HongUniverisy of California, Los AngelesVerified email at cs.ucla.edu
Yixin ChenUniversity of California, Los AngelesVerified email at g.ucla.edu
Baoxiong JiaPh.D. in Computer Science, UCLAVerified email at cs.ucla.edu
Zhaofan QiuAI Research, JD.COMVerified email at mail.ustc.edu.cn
Ting YaoHiDream.ai, previously JD.com and Microsoft ResearchVerified email at hidream.ai
Ying Nian WuUCLA Department of StatisticsVerified email at stat.ucla.edu
yuntao duBIGAIVerified email at bigai.ai
Yixin ZhuAssistant Professor, Peking UniversityVerified email at pku.edu.cn
Jiangyong HuangPeking UniversityVerified email at pku.edu.cn
Jianfei CaiProfessor of Data Science & AI, Monash UniversityVerified email at monash.edu
Zilong Zheng (郑子隆)UCLA CS PhDVerified email at ucla.edu
Chong-Wah NgoSingapore Management UniversityVerified email at smu.edu.sg
Ran (Steven) GongThe AI InstituteVerified email at g.ucla.edu

Qing Li

BIGAI

Verified email at ucla.edu - Homepage

Multimodal Learning Neural-Symbolic Learning Embodied AI


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Vizwiz grand challenge: Answering visual questions from blind people D Gurari, Q Li, AJ Stangl, A Guo, C Lin, K Grauman, J Luo, JP Bigham Proceedings of the IEEE conference on computer vision and pattern …, 2018	768	2018
Action recognition by learning deep multi-granular spatio-temporal video representation Q Li, Z Qiu, T Yao, T Mei, Y Rui, J Luo Proceedings of the 2016 ACM on international conference on multimedia …, 2016	159	2016
Vqa-e: Explaining, elaborating, and enhancing your answers for visual questions Q Li, Q Tao, S Joty, J Cai, J Luo Proceedings of the European Conference on Computer Vision (ECCV), 552-567, 2018	126	2018
Vizwiz-priv: A dataset for recognizing the presence and purpose of private visual information in images taken by blind people D Gurari, Q Li, C Lin, Y Zhao, A Guo, A Stangl, JP Bigham Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	109	2019
Sqa3d: Situated question answering in 3d scenes X Ma, S Yong, Z Zheng, Q Li, Y Liang, SC Zhu, S Huang arXiv preprint arXiv:2210.07474, 2022	96	2022
Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning Q Li, S Huang, Y Hong, Y Chen, YN Wu, SC Zhu ICML, 2020	94	2020
3d-vista: Pre-trained transformer for 3d vision and text alignment Z Zhu, X Ma, Y Chen, Z Deng, S Huang, Q Li Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	79	2023
Tell-and-answer: Towards explainable visual question answering using attributes and captions Q Li, J Fu, D Yu, T Mei, J Luo EMNLP, 2018	71	2018
An embodied generalist agent in 3d world J Huang, S Yong, X Ma, X Linghu, P Li, Y Wang, Q Li, SC Zhu, B Jia, ... arXiv preprint arXiv:2311.12871, 2023	66	2023
Why does a visual question have different answers? N Bhattacharya, Q Li, D Gurari Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	65	2019
Learning by fixing: Solving math word problems with weak supervision Y Hong, Q Li, D Ciao, S Huang, SC Zhu Proceedings of the AAAI conference on artificial intelligence 35 (6), 4959-4967, 2021	61	2021
Parameter-efficient fine-tuning for pre-trained vision models: A survey Y Xin, S Luo, H Zhou, J Du, X Liu, Y Fan, Q Li, Y Du arXiv preprint arXiv:2402.02242, 2024	53	2024
Yourefit: Embodied reference understanding with language and gesture Y Chen, Q Li, D Kong, YL Kei, SC Zhu, T Gao, Y Zhu, S Huang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	40	2021
A Competence-aware Curriculum for Visual Concepts Learning via Question Answering Q Li, S Huang, Y Hong, SC Zhu ECCV, 2020	38	2020
Vireo@ trecvid 2017: Video-to-text, ad-hoc video search and video hyperlinking PA Nguyen, Q Li, ZQ Cheng, YJ Lu, H Zhang, X Wu, CW Ngo IEEE, 2017	37	2017
Smart: A situation model for algebra story problems via attributed grammar Y Hong, Q Li, R Gong, D Ciao, S Huang, SC Zhu Proceedings of the AAAI conference on artificial intelligence 35 (14), 13009 …, 2021	36	2021
Vlgrammar: Grounded grammar induction of vision and language Y Hong, Q Li, SC Zhu, S Huang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021	32	2021
Sceneverse: Scaling 3d vision-language learning for grounded scene understanding B Jia, Y Chen, H Yu, Y Wang, X Niu, T Liu, Q Li, S Huang European Conference on Computer Vision, 289-310, 2025	27	2025
Learning hierarchical video representation for action recognition Q Li, Z Qiu, T Yao, T Mei, Y Rui, J Luo International Journal of Multimedia Information Retrieval 6, 85-98, 2017	25	2017
Towards a unified foundation model: Jointly pre-training transformers on unpaired images and text Q Li, B Gong, Y Cui, D Kondratyuk, X Du, MH Yang, M Brown arXiv preprint arXiv:2112.07074, 2021	24	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors