Multi-Modal fusion with multi-level attention for Visual Dialog
www.sciencedirect.com › article › pii
We propose a novel visual dialog method, which focuses on both high-level and low-level information of the dialog history, the question, and the image.
We propose a novel visual dialog method, which focuses on both high-level and low-level information of the dialog history, the question, and the image.
In our approach, we introduce three low-level attention modules, the goal of which is to enhance the representation of words in the sentence of the dialog ...
Connected Papers is a visual tool to help researchers and applied scientists find academic papers relevant to their field of work.
Multi-Modal fusion with multi-level attention for Visual Dialog. Jingping Zhang, Qiang Wang, Yahong Han. Anthology ID: DBLP:journals/ipm/ZhangWH20; Volume: ...
A comprehensive survey of the recent achievements in the Visual Dialog task and many aspects of multimodal fusion research: Visual Co-reference Resolution, ...
This survey covers many aspects of multimodal fusion research: Visual Co-reference Resolution, Attention Mechanism, Graph Neural Networks, evaluation issues, ...
We propose a novel visual dialog method with multi-level attention. • Three high-level attention modules are devised to select important words.
Nov 1, 2021 · Visual dialog is a challenging vision-language task in which a series of questions visually grounded by a given image are answered. To resolve ...
May 30, 2023 · We discuss the classification and application of existing attention mechanisms in VQA tasks, analysis their shortcomings, and summarize current improvement ...