Yaya Shi

2020 – today

2024
[j4]
Jiabo Ye, Junfeng Tian, Ming Yan, Haiyang Xu, Qinghao Ye, Yaya Shi, Xiaoshan Yang, Xuwu Wang, Ji Zhang, Liang He, Xin Lin:
UniQRNet: Unifying Referring Expression Grounding and Segmentation with QRNet. ACM Trans. Multim. Comput. Commun. Appl. 20(8): 246:1-246:28 (2024)
[c7]
Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training. LREC/COLING 2024: 14664-14675
[c6]
Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval. LREC/COLING 2024: 17031-17041
[c5]
Anwen Hu, Yaya Shi, Haiyang Xu, Jiabo Ye, Qinghao Ye, Ming Yan, Chenliang Li, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model. ACM Multimedia 2024: 6929-6938
[i11]
Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Unifying Latent and Lexicon Representations for Effective Video-Text Retrieval. CoRR abs/2402.16769 (2024)
[i10]
Haowei Liu, Yaya Shi, Haiyang Xu, Chunfeng Yuan, Qinghao Ye, Chenliang Li, Ming Yan, Ji Zhang, Fei Huang, Bing Li, Weiming Hu:
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training. CoRR abs/2403.00249 (2024)
[i9]
Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu:
MIBench: Evaluating Multimodal Large Language Models over Multiple Images. CoRR abs/2407.15272 (2024)
2023
[j3]
Yaya Shi, Haiyang Xu, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha:
Learning Video-Text Aligned Representations for Video Captioning. ACM Trans. Multim. Comput. Commun. Appl. 19(2): 63:1-63:21 (2023)
[c4]
Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, Yuanhong Xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou:
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video. ICML 2023: 38728-38748
[c3]
Yaya Shi, Haowei Liu, Haiyang Xu, Zongyang Ma, Qinghao Ye, Anwen Hu, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha:
Learning Semantics-Grounded Vocabulary Representation for Video-Text Retrieval. ACM Multimedia 2023: 4460-4470
[i8]
Haiyang Xu, Qinghao Ye, Ming Yan, Yaya Shi, Jiabo Ye, Yuanhong Xu, Chenliang Li, Bin Bi, Qi Qian, Wei Wang, Guohai Xu, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou:
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video. CoRR abs/2302.00402 (2023)
[i7]
Qinghao Ye, Haiyang Xu, Guohai Xu, Jiabo Ye, Ming Yan, Yiyang Zhou, Junyang Wang, Anwen Hu, Pengcheng Shi, Yaya Shi, Chenliang Li, Yuanhong Xu, Hehong Chen, Junfeng Tian, Qian Qi, Ji Zhang, Fei Huang:
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality. CoRR abs/2304.14178 (2023)
[i6]
Haiyang Xu, Qinghao Ye, Xuan Wu, Ming Yan, Yuan Miao, Jiabo Ye, Guohai Xu, Anwen Hu, Yaya Shi, Guangwei Xu, Chenliang Li, Qi Qian, Maofei Que, Ji Zhang, Xiao Zeng, Fei Huang:
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks. CoRR abs/2306.04362 (2023)
[i5]
Anwen Hu, Yaya Shi, Haiyang Xu, Jiabo Ye, Qinghao Ye, Ming Yan, Chenliang Li, Qi Qian, Ji Zhang, Fei Huang:
mPLUG-PaperOwl: Scientific Diagram Analysis with the Multimodal Large Language Model. CoRR abs/2311.18248 (2023)
2022
[j2]
Zhenbang Li, Yaya Shi, Jin Gao, Shaoru Wang, Bing Li, Pengpeng Liang, Weiming Hu:
A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking. IEEE Trans. Circuits Syst. Video Technol. 32(6): 3880-3894 (2022)
[c2]
Yaya Shi, Xu Yang, Haiyang Xu, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha:
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching. CVPR 2022: 17908-17917
2021
[i4]
Zhenbang Li, Yaya Shi, Jin Gao, Shaoru Wang, Bing Li, Pengpeng Liang, Weiming Hu:
A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking. CoRR abs/2105.02480 (2021)
[i3]
Yaya Shi, Xu Yang, Haiyang Xu, Chunfeng Yuan, Bing Li, Weiming Hu, Zheng-Jun Zha:
EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching. CoRR abs/2111.08919 (2021)
2020
[c1]
Ziqi Zhang, Yaya Shi, Chunfeng Yuan, Bing Li, Peijin Wang, Weiming Hu, Zheng-Jun Zha:
Object Relational Graph With Teacher-Recommended Learning for Video Captioning. CVPR 2020: 13275-13285
[i2]
Ziqi Zhang, Yaya Shi, Chunfeng Yuan, Bing Li, Peijin Wang, Weiming Hu, Zhengjun Zha:
Object Relational Graph with Teacher-Recommended Learning for Video Captioning. CoRR abs/2002.11566 (2020)

2010 – 2019

2019
[i1]
Ziqi Zhang, Yaya Shi, Jiutong Wei, Chunfeng Yuan, Bing Li, Weiming Hu:
VATEX Captioning Challenge 2019: Multi-modal Information Fusion and Multi-stage Training Strategy for Video Captioning. CoRR abs/1910.05752 (2019)
2018
[j1]
Yaya Shi, Fujun Niu, Chengsong Yang, Tao Che, Zhanju Lin, Jing Luo:
Permafrost Presence/Absence Mapping of the Qinghai-Tibet Plateau Based on Multi-Source Remote Sensing Data. Remote. Sens. 10(2): 309 (2018)
