default search action
Peng Jin 0001
Person information
- affiliation: Peking University, School of Electronic and Computer Engineering, Shenzhen, China
Other persons with the same name
- Peng Jin — disambiguation page
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c12]Zesen Cheng, Kehan Li, Peng Jin, Siheng Li, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen:
Parallel Vertex Diffusion for Unified Visual Grounding. AAAI 2024: 1326-1334 - [c11]Peng Jin, Ryuichi Takanobu, Wancai Zhang, Xiaochun Cao, Li Yuan:
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding. CVPR 2024: 13700-13710 - [c10]Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Xing Zhou, Munan Ning, Li Yuan:
Repaint123: Fast and High-Quality One Image to 3D Generation with Progressive Controllable Repainting. ECCV (25) 2024: 303-320 - [c9]Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen:
Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation. ECCV (25) 2024: 392-409 - [i19]Zesen Cheng, Kehan Li, Hao Li, Peng Jin, Chang Liu, Xiawu Zheng, Rongrong Ji, Jie Chen:
Instance Brownian Bridge as Texts for Open-vocabulary Video Instance Segmentation. CoRR abs/2401.09732 (2024) - [i18]Bin Lin, Zhenyu Tang, Yang Ye, Jiaxi Cui, Bin Zhu, Peng Jin, Junwu Zhang, Munan Ning, Li Yuan:
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models. CoRR abs/2401.15947 (2024) - [i17]Bin Zhu, Peng Jin, Munan Ning, Bin Lin, Jinfa Huang, Qi Song, Jiaxi Cui, Junwu Zhang, Zhenyu Tang, Mingjun Pan, Xing Zhou, Li Yuan:
LLMBind: A Unified Modality-Task Integration Framework. CoRR abs/2402.14891 (2024) - [i16]Zhongwei Wan, Ziang Wu, Che Liu, Jinfa Huang, Zhihong Zhu, Peng Jin, Longyue Wang, Li Yuan:
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference. CoRR abs/2406.18139 (2024) - [i15]Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen:
Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation. CoRR abs/2407.10528 (2024) - 2023
- [j1]Hao Li, Jinfa Huang, Peng Jin, Guoli Song, Qi Wu, Jie Chen:
Weakly-Supervised 3D Spatial Reasoning for Text-Based Visual Question Answering. IEEE Trans. Image Process. 32: 3367-3382 (2023) - [c8]Peng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen:
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning. CVPR 2023: 2472-2482 - [c7]Kehan Li, Yian Zhao, Zhennan Wang, Zesen Cheng, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen:
Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation. ICCV 2023: 666-676 - [c6]Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Xiangyang Ji, Chang Liu, Li Yuan, Jie Chen:
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model. ICCV 2023: 2470-2481 - [c5]Zesen Cheng, Peng Jin, Hao Li, Kehan Li, Siheng Li, Xiangyang Ji, Chang Liu, Jie Chen:
WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation. IJCAI 2023: 636-644 - [c4]Peng Jin, Hao Li, Zesen Cheng, Jinfa Huang, Zhennan Wang, Li Yuan, Chang Liu, Jie Chen:
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment. IJCAI 2023: 938-946 - [c3]Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen:
TG-VQA: Ternary Game of Video Question Answering. IJCAI 2023: 1044-1052 - [c2]Peng Jin, Yang Wu, Yanbo Fan, Zhongqian Sun, Wei Yang, Li Yuan:
Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs. NeurIPS 2023 - [i14]Zesen Cheng, Kehan Li, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen:
Parallel Vertex Diffusion for Unified Visual Grounding. CoRR abs/2303.07216 (2023) - [i13]Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Xiangyang Ji, Chang Liu, Li Yuan, Jie Chen:
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model. CoRR abs/2303.09867 (2023) - [i12]Kehan Li, Yian Zhao, Zhennan Wang, Zesen Cheng, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen:
Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation. CoRR abs/2303.13399 (2023) - [i11]Peng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen:
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning. CoRR abs/2303.14369 (2023) - [i10]Hao Li, Peng Jin, Zesen Cheng, Songyang Zhang, Kai Chen, Zhennan Wang, Chang Liu, Jie Chen:
TG-VQA: Ternary Game of Video Question Answering. CoRR abs/2305.10049 (2023) - [i9]Peng Jin, Hao Li, Zesen Cheng, Jinfa Huang, Zhennan Wang, Li Yuan, Chang Liu, Jie Chen:
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment. CoRR abs/2305.12218 (2023) - [i8]Zesen Cheng, Peng Jin, Hao Li, Kehan Li, Siheng Li, Xiangyang Ji, Chang Liu, Jie Chen:
WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation. CoRR abs/2306.10750 (2023) - [i7]Peng Jin, Yang Wu, Yanbo Fan, Zhongqian Sun, Yang Wei, Li Yuan:
Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs. CoRR abs/2311.01015 (2023) - [i6]Peng Jin, Ryuichi Takanobu, Caiwan Zhang, Xiaochun Cao, Li Yuan:
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding. CoRR abs/2311.08046 (2023) - [i5]Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan:
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection. CoRR abs/2311.10122 (2023) - [i4]Hao Li, Curise Jia, Peng Jin, Zesen Cheng, Kehan Li, Jialu Sui, Chang Liu, Li Yuan:
FreestyleRet: Retrieving Images from Style-Diversified Queries. CoRR abs/2312.02428 (2023) - [i3]Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Munan Ning, Li Yuan:
Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting. CoRR abs/2312.13271 (2023) - 2022
- [c1]Peng Jin, Jinfa Huang, Fenglin Liu, Xian Wu, Shen Ge, Guoli Song, David A. Clifton, Jie Chen:
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations. NeurIPS 2022 - [i2]Hao Li, Jinfa Huang, Peng Jin, Guoli Song, Qi Wu, Jie Chen:
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering. CoRR abs/2209.10326 (2022) - [i1]Peng Jin, Jinfa Huang, Fenglin Liu, Xian Wu, Shen Ge, Guoli Song, David A. Clifton, Jie Chen:
Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations. CoRR abs/2211.11427 (2022)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-07 20:36 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint