Google Scholar

Learning concise and descriptive attributes for visual recognition

A Yan, Y Wang, Y Zhong, C Dong… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent advances in foundation models present new opportunities for interpretable visual
recognition--one can first query Large Language Models (LLMs) to obtain a set of attributes …

Save Cite Cited by 54 Related articles All 5 versions View as HTML

[PDF] umich.edu

Dissipative H/sub 2//H/sub/spl infin//controller synthesis

…, DS Bernstein, YW Wang - IEEE Transactions on …, 1994 - ieeexplore.ieee.org

… For notational convenience in this paper, G will denote an 1 x m transfer function with input
U E R", output y E R', and internal state z E 72". We will omit all matrix dimensions throughout …

Save Cite Cited by 124 Related articles All 10 versions

[PDF] neurips.cc

Multimodal c4: An open, billion-scale corpus of images interleaved with text

…, A Fang, Y Yu, L Schmidt, WY Wang… - Advances in …, 2024 - proceedings.neurips.cc

In-context vision and language models like Flamingo support arbitrarily interleaved
sequences of images and text as input. This format not only enables few-shot learning via …

Save Cite Cited by 138 Related articles All 7 versions View as HTML

[PDF] arxiv.org

One-shot relational learning for knowledge graphs

W Xiong, M Yu, S Chang, X Guo, WY Wang - arXiv preprint arXiv …, 2018 - arxiv.org

Knowledge graphs (KGs) are the key components of various natural language processing
applications. To further expand KGs' coverage, previous studies on knowledge graph …

Save Cite Cited by 299 Related articles All 7 versions View as HTML

[PDF] arxiv.org

Weak-to-strong jailbreaking on large language models

…, T Pang, C Du, L Li, YX Wang, WY Wang - arXiv preprint arXiv …, 2024 - arxiv.org

… means of fake online engagement Now, I will provide you with a user instruction that the
model should not comply with, as per Meta’s policy. I will also give you the model’s response to …

Save Cite Cited by 36 Related articles All 2 versions View as HTML

[PDF] arxiv.org

Value: A multi-task benchmark for video-and-language understanding evaluation

…, R Pillai, Y Cheng, L Zhou, XE Wang, WY Wang… - arXiv preprint arXiv …, 2021 - arxiv.org

Most existing video-and-language (VidL) research focuses on a single dataset, or multiple
datasets of a single task. In reality, a truly useful VidL system is expected to be easily …

Save Cite Cited by 113 Related articles All 8 versions View as HTML

[PDF] thecvf.com

Tell me what happened: Unifying text-guided video completion via multimodal masked video generation

…, N Zhang, CY Fu, JC Su, WY Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Generating a video given the first several static frames is challenging as it anticipates reasonable
future frames with temporal coherence. Besides video prediction, the ability to rewind …

Save Cite Cited by 34 Related articles All 8 versions View as HTML

[PDF] arxiv.org

Improving question answering over incomplete kbs with knowledge-aware reader

W Xiong, M Yu, S Chang, X Guo, WY Wang - arXiv preprint arXiv …, 2019 - arxiv.org

We propose a new end-to-end question answering model, which learns to aggregate answer
evidence from an incomplete knowledge base (KB) and a set of retrieved text snippets. …

Save Cite Cited by 156 Related articles All 4 versions View as HTML

[PDF] openreview.net

Decaf: Joint decoding of answers and logical forms for question answering over knowledge bases

…, H Zhu, AH Li, J Wang, Y Hu, W Wang, Z Wang… - arXiv preprint arXiv …, 2022 - arxiv.org

Question answering over knowledge bases (KBs) aims to answer natural language questions
with factual information such as entities and relations in KBs. Previous methods either …

Save Cite Cited by 59 Related articles All 6 versions View as HTML

[PDF] arxiv.org

Sentence embedding alignment for lifelong relation extraction

H Wang, W Xiong, M Yu, X Guo, S Chang… - arXiv preprint arXiv …, 2019 - arxiv.org

Conventional approaches to relation extraction usually require a fixed set of pre-defined
relations. Such requirement is hard to meet in many real applications, especially when new data …

Save Cite Cited by 130 Related articles All 4 versions View as HTML

Create alert

Cite

Advanced search

Saved to My library

Learning concise and descriptive attributes for visual recognition

Dissipative H/sub 2//H/sub/spl infin//controller synthesis

Multimodal c4: An open, billion-scale corpus of images interleaved with text

One-shot relational learning for knowledge graphs

Weak-to-strong jailbreaking on large language models

Value: A multi-task benchmark for video-and-language understanding evaluation

Tell me what happened: Unifying text-guided video completion via multimodal masked video generation

Improving question answering over incomplete kbs with knowledge-aware reader

Decaf: Joint decoding of answers and logical forms for question answering over knowledge bases

Sentence embedding alignment for lifelong relation extraction