Weakly Supervised Relative Spatial Reasoning for Visual Question Answering.

Weakly Supervised Relative Spatial Reasoning for Visual Question ...

Sep 4, 2021 · In this work, we evaluate the faithfulness of V&L models to such geometric understanding, by formulating the prediction of pair-wise relative locations of ...

Scholarly articles for Weakly Supervised Relative Spatial Reasoning for Visual Question Answering.

scholar.google.com › citations

… relative spatial reasoning for visual question answering
Banerjee · Cited by 24

[PDF] Weakly Supervised Relative Spatial Reasoning for Visual Question ...

par.nsf.gov › servlets › purl

The most significant improvement is observed on the open-ended questions (2.21%). We can observe that weak-supervision and joint end-to-end training of SR and.

Weakly Supervised Relative Spatial Reasoning for Visual Question ...

www.semanticscholar.org › paper › Wea...

Two objectives are designed as proxies for 3D spatial reasoning (SR): object centroid estimation and relative position estimation. V&L is trained with ...

Weakly Supervised Relative Spatial Reasoning for Visual Question ...

ieeexplore.ieee.org › iel7

One such ability is spatial reasoning – understanding the geometry of the scene and spatial locations of objects in an image. Visual question answering (such ...

Weakly Supervised Relative Spatial Reasoning for Visual Question ...

www.computer.org › csdl › iccv

In this work, we evaluate the faithfulness of V&L models to such geometric understanding, by formulating the prediction of pair-wise relative locations of ...

Weakly-Supervised 3D Spatial Reasoning for Text-Based Visual ...

ieeexplore.ieee.org › document

May 31, 2023 · In this paper, we introduce 3D geometric information into the spatial reasoning process to capture the contextual knowledge of key objects step-by-step.

Weakly Supervised Relative Spatial Reasoning for Visual Question ...

bibbase.org › network › publication › ba...

Weakly Supervised Relative Spatial Reasoning for Visual Question Answering. Banerjee, P., Gokhale, T., Yang, Y., & Baral, C. In Proceedings of the IEEE/CVF ...

About Me - Knowledge in, Reason out.

pratyay-banerjee.github.io

We weakly-supervise transformer-based VQA systems using two novel, unit normalized 3D-vision guided tasks, Centroid Estimation and Relative Position Estimation.

[PDF] Weakly-Supervised 3D Spatial Reasoning for Text-Based Visual ...

www.jdl.link › doc › 20231204_W...

Dec 3, 2023 · Abstract—Text-based Visual Question Answering (TextVQA) aims to produce correct answers for given questions about the.

Pratyay Banerjee - Google Scholar

scholar.google.com › citations

Weakly Supervised Relative Spatial Reasoning for Visual Question Answering. P Banerjee, T Gokhale, Y Yang, C Baral. Proceedings of the IEEE/CVF International ...