TempCompass: Do Video LLMs Really Understand Videos?

TempCompass: Do Video LLMs Really Understand Videos? - arXiv

2024/03/01 · Based on TempCompass, we comprehensively evaluate 8 state-of-the-art (SOTA) Video LLMs and 3 Image LLMs, and reveal the discerning fact that ...

TempCompass: Do Video LLMs Really Understand Videos? - ACL ...

aclanthology.org › 2024.findings-acl.517

We also design an LLM-based approach to automatically and accurately evaluate the responses from Video LLMs. Based on TempCompass, we comprehensively evaluate 9 ...

[ACL 2024 Findings] "TempCompass: Do Video LLMs Really ... - GitHub

github.com › llyx97 › TempCompass

Conflicting Videos. We construct conflicting videos to prevent the models from taking advantage of single-frame bias and language priors.

TempCompass: Do Video LLMs Really Understand Videos? - arXiv

arxiv.org › html

This work proposes the TempCompass, a benchmark to comprehensively evaluate the temporal perception ability of Video LLMs.

[PDF] TempCompass: Do Video LLMs Really Understand Videos?

www.semanticscholar.org › paper

The TempCompass benchmark is proposed, which introduces a diversity of temporal aspects and task formats and comprehensively evaluate 8 state-of-the-art ...

People also search for

Video search LLM

Video-MME: the first-ever comprehensive evaluation benchmark of multi-modal LLMs in video analysis

LLMs Meet long video: Advancing long video comprehension with an interactive visual adapter in LLMs

MVBench: a comprehensive multi-modal video understanding benchmark

MovieChat: from dense token to sparse memory for long video understanding

Multimodal video understanding

TempCompass: Do Video LLMs Really Understand Videos?

www.researchgate.net › publication › 38...

2024/09/23 · Our analysis reveals significant performance variations based on question and chart types, highlighting both strengths and weaknesses of current ...

TempCompass - Yuanxin Liu

llyx97.github.io › tempcompass

However, existing benchmarks fail to provide comprehensive feedback on the temporal perception ability of Video LLMs. On the one hand, most of them are unable ...

TempCompass: Do Video LLMs Really Understand Videos? - X-MOL

www.x-mol.com › paper

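We also design an LLM-based approach to automatically and accurately evaluate the responses from Video LLMs. Based on TempCompass, we comprehensively evaluate 8 state-of-the-art (SOTA) Video LLMs and 3 Image LLMs, and ...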

TempCompass: Do Video LLMs Really Understand Videos?

www.aimodels.fyi › papers › arxiv › tem...

2024/06/03 · The paper proposes a new benchmark called TempCompass to evaluate the temporal reasoning capabilities of video large language models (VLLMs).

TempCompass: Do Video LLMs Really Understand Videos?

www.emergentmind.com › papers

2024/03/01 · TempCompass is a new benchmark designed to evaluate Video LLMs on their understanding of temporal aspects such as action, speed, direction, ...

People also search for

Video understanding models

VITATECS: a diagnostic dataset for temporal concept understanding of video-language models