default search action
Vineet Gandhi
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j5]Neil Kumar Shah, Neha Sahipjohn, Vishal Tambrahalli, Ramanathan Subramanian, Vineet Gandhi:
StethoSpeech: Speech Generation Through a Clinical Stethoscope Attached to the Skin. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 8(3): 123:1-123:21 (2024) - [c47]Neil Kumar Shah, Saiteja Kosgi, Vishal Tambrahalli, Neha Sahipjohn, Anil Nelakanti, Vineet Gandhi:
ParrotTTS: Text-to-speech synthesis exploiting disentangled self-supervised representations. EACL (Findings) 2024: 79-91 - [c46]Kawshik Sundar, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi:
Major Entity Identification: A Generalizable Alternative to Coreference Resolution. EMNLP 2024: 11679-11695 - [c45]Sudheer Achary, Rohit Girmaji, Adhiraj Anil Deshmukh, Vineet Gandhi:
Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings. WACV 2024: 4096-4104 - [i34]Darshana Saravanan, Naresh Manwani, Vineet Gandhi:
SARI: Simplistic Average and Robust Identification based Noisy Partial Label Learning. CoRR abs/2402.04835 (2024) - [i33]Darshana Saravanan, Darshan Singh S, Varun Gupta, Zeeshan Khan, Vineet Gandhi, Makarand Tapaswi:
VELOCITI: Can Video-Language Models Bind Semantic Concepts through Time? CoRR abs/2406.10889 (2024) - [i32]Kawshik Manikantan, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi:
Major Entity Identification: A Generalizable Alternative to Coreference Resolution. CoRR abs/2406.14654 (2024) - [i31]Neil Kumar Shah, Shirish Karande, Vineet Gandhi:
Towards Improving NAM-to-Speech Synthesis Intelligibility using Self-Supervised Speech Models. CoRR abs/2407.18541 (2024) - 2023
- [c44]Ritu Srivastava, Saiteja Kosgi, Sarath Sivaprasad, Neha Sahipjohn, Vineet Gandhi:
Adversarial Robustness of Mel Based Speaker Recognition Systems. APSIPA ASC 2023: 145-150 - [c43]Neha Sahipjohn, Neil Kumar Shah, Vishal Tambrahalli, Vineet Gandhi:
RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations. APSIPA ASC 2023: 1492-1499 - [c42]Kanishk Jain, Varun Chhangani, Amogh Tiwari, K. Madhava Krishna, Vineet Gandhi:
Ground then Navigate: Language-guided Navigation in Dynamic Scenes. ICRA 2023: 4113-4120 - [c41]Kanishk Jain, Shyamgopal Karthik, Vineet Gandhi:
Test-Time Amendment with a Coarse Classifier for Fine-Grained Classification. NeurIPS 2023 - [c40]Laksh Nanwani, Anmol Agarwal, Kanishk Jain, Raghav Prabhakar, Aaron Monis, Aditya Mathur, Krishna Murthy Jatavallabhula, A. H. Abdul Hafez, Vineet Gandhi, K. Madhava Krishna:
Instance-Level Semantic Maps for Vision Language Navigation. RO-MAN 2023: 507-512 - [c39]Rohit Girmaji, Sudheer Achary, Adhiraj Deshmukh, Vineet Gandhi:
Assessing active speaker detection algorithms through the lens of automated editing. IMX Workshops 2023: 123-130 - [c38]Jeet Vora, Swetanjal Dutta, Kanishk Jain, Shyamgopal Karthik, Vineet Gandhi:
Bringing Generalization to Deep Multi-View Pedestrian Detection. WACV (Workshops) 2023: 110-119 - [i30]Kanishk Jain, Shyamgopal Karthik, Vineet Gandhi:
Test-Time Amendment with a Coarse Classifier for Fine-Grained Classification. CoRR abs/2302.00368 (2023) - [i29]Saiteja Kosgi, Neil Kumar Shah, Vishal Tambrahalli, Neha Sherin, Vineet Gandhi:
ParrotTTS: Text-to-Speech synthesis by exploiting self-supervised representations. CoRR abs/2303.01261 (2023) - [i28]Neil Kumar Shah, Vishal Tambrahalli, Saiteja Kosgi, Niranjan Pedanekar, Vineet Gandhi:
MParrotTTS: Multilingual Multi-speaker Text to Speech Synthesis in Low Resource Setting. CoRR abs/2305.11926 (2023) - [i27]Laksh Nanwani, Anmol Agarwal, Kanishk Jain, Raghav Prabhakar, Aaron Monis, Aditya Mathur, Krishna Murthy Jatavallabhula, A. H. Abdul Hafez, Vineet Gandhi, K. Madhava Krishna:
Instance-Level Semantic Maps for Vision Language Navigation. CoRR abs/2305.12363 (2023) - [i26]Neha Sahipjohn, Neil Kumar Shah, Vishal Tambrahalli, Vineet Gandhi:
RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations. CoRR abs/2307.01233 (2023) - [i25]Sudheer Achary, Rohit Girmaji, Adhiraj Anil Deshmukh, Vineet Gandhi:
Real Time GAZED: Online Shot Selection and Editing of Virtual Cameras from Wide-Angle Monocular Video Recordings. CoRR abs/2311.15581 (2023) - 2022
- [c37]Kanishk Jain, Vineet Gandhi:
Comprehensive Multi-Modal Interactions for Referring Image Segmentation. ACL (Findings) 2022: 3427-3435 - [c36]Ritvik Agrawal, Shreyank Jyoti, Rohit Girmaji, Sarath Sivaprasad, Vineet Gandhi:
Does Audio help in deep Audio-Visual Saliency prediction models? ICMI 2022: 48-56 - [c35]Saransh Dave, Ritam Basu, Vineet Gandhi:
Cross-Domain Class-Contrastive Learning: Finding Lower Dimensional Representations for Improved Domain Generalization. ICVGIP 2022: 48:1-48:8 - [c34]Saiteja Kosgi, Sarath Sivaprasad, Niranjan Pedanekar, Anil Nelakanti, Vineet Gandhi:
Empathic Machines: Using Intermediate Features as Levers to Emulate Emotions in Text-To-Speech Systems. NAACL-HLT 2022: 336-347 - [c33]Rémi Ronfard, Vineet Gandhi, Laurent Boiron, Vaishnavi Ameya Murukutla:
The Prose Storyboard Language: A Tool for Annotating and Directing Movies. WICED@Eurographics/EuroVis 2022: 13-27 - [c32]Pratikkumar Bulani, Jayachandran S., Sarath Sivaprasad, Vineet Gandhi:
Framework to Computationally Analyze Kathakali Videos. WICED@Eurographics/EuroVis 2022: 29-36 - [i24]Kanishk Jain, Varun Chhangani, Amogh Tiwari, K. Madhava Krishna, Vineet Gandhi:
Ground then Navigate: Language-guided Navigation in Dynamic Scenes. CoRR abs/2209.11972 (2022) - 2021
- [c31]Shyamgopal Karthik, Ameya Prabhu, Puneet K. Dokania, Vineet Gandhi:
No Cost Likelihood Manipulation at Test Time for Making Better Mistakes in Deep Networks. ICLR 2021 - [c30]Sarath Sivaprasad, Saiteja Kosgi, Vineet Gandhi:
Emotional Prosody Control for Speech Generation. Interspeech 2021: 4653-4657 - [c29]Samyak Jain, Pradeep Yarlagadda, Shreyank Jyoti, Shyamgopal Karthik, Ramanathan Subramanian, Vineet Gandhi:
ViNet: Pushing the limits of Visual Modality for Audio-Visual Saliency Prediction. IROS 2021: 3520-3527 - [c28]Nivedita Rufus, Kanishk Jain, Unni Krishnan R. Nair, Vineet Gandhi, K. Madhava Krishna:
Grounding Linguistic Commands to Navigable Regions. IROS 2021: 8593-8600 - [c27]Sarath Sivaprasad, Ankur Singh, Naresh Manwani, Vineet Gandhi:
The Curious Case of Convex Neural Networks. ECML/PKDD (1) 2021: 738-754 - [i23]Shyamgopal Karthik, Ameya Prabhu, Puneet K. Dokania, Vineet Gandhi:
No Cost Likelihood Manipulation at Test Time for Making Better Mistakes in Deep Networks. CoRR abs/2104.00795 (2021) - [i22]Kanishk Jain, Vineet Gandhi:
Comprehensive Multi-Modal Interactions for Referring Image Segmentation. CoRR abs/2104.10412 (2021) - [i21]Vineet Gandhi, Jan Cech, Radu Horaud:
High-Resolution Depth Maps Based on TOF-Stereo Fusion. CoRR abs/2107.14688 (2021) - [i20]Jeet Vora, Swetanjal Dutta, Shyamgopal Karthik, Vineet Gandhi:
Bringing Generalization to Deep Multi-view Detection. CoRR abs/2109.12227 (2021) - [i19]Sarath Sivaprasad, Akshay Goindani, Vaibhav Garg, Vineet Gandhi:
Reappraising Domain Generalization in Neural Networks. CoRR abs/2110.07981 (2021) - [i18]Sarath Sivaprasad, Saiteja Kosgi, Vineet Gandhi:
Emotional Prosody Control for Speech Generation. CoRR abs/2111.04730 (2021) - [i17]Nivedita Rufus, Kanishk Jain, Unni Krishnan R. Nair, Vineet Gandhi, K. Madhava Krishna:
Grounding Linguistic Commands to Navigable Regions. CoRR abs/2112.13031 (2021) - 2020
- [j4]Murtuza Bohra, Sajal Maheshwari, Vineet Gandhi:
TextureToMTF: predicting spatial frequency response in the wild. Signal Image Video Process. 14(6): 1163-1170 (2020) - [c26]K. L. Bhanu Moorthy, Moneish Kumar, Ramanathan Subramanian, Vineet Gandhi:
GAZED- Gaze-guided Cinematic Editing of Wide-Angle Monocular Video Recordings. CHI 2020: 1-11 - [c25]Nivedita Rufus, Unni Krishnan R. Nair, K. Madhava Krishna, Vineet Gandhi:
Cosine Meets Softmax: A Tough-to-beat Baseline for Visual Grounding. ECCV Workshops (2) 2020: 39-50 - [c24]Murtuza Bohra, Vineet Gandhi:
ColorArt: Suggesting Colorizations For Graphic Arts Using Optimal Color-Graph Matching. Graphics Interface 2020: 95-102 - [c23]Aasheesh Singh, Aditya Kamireddypalli, Vineet Gandhi, K. Madhava Krishna:
LiDAR guided Small obstacle Segmentation. IROS 2020: 8513-8520 - [c22]Navyasri Reddy, Samyak Jain, Pradeep Yarlagadda, Vineet Gandhi:
Tidying Deep Saliency Prediction Architectures. IROS 2020: 10241-10247 - [c21]Shyamgopal Karthik, Abhinav Moudgil, Vineet Gandhi:
Exploring 3 R's of Long-term Tracking: Re-detection, Recovery and Reliability. WACV 2020: 1000-1009 - [c20]Sudheer Achary, K. L. Bhanu Moorthy, Ashar Javed, Nikita Shravan, Vineet Gandhi, Anoop M. Namboodiri:
CineFilter: Unsupervised Filtering for Real Time Autonomous Camera Systems. WICED@Eurographics/EuroVis 2020: 27-33 - [c19]K. L. Bhanu Moorthy, Moneish Kumar, Ramanathan Subramanian, Vineet Gandhi:
GAZED - Gaze-guided Cinematic Editing of Wide-Angle Monocular Video Recordings. WICED@Eurographics/EuroVis 2020: 35-36 - [e2]Marc Christie, Hui-Yin Wu, Tsai-Yen Li, Vineet Gandhi:
9th Workshop on Intelligent Cinematography and Editing, WICED@Eurographics/EuroVis 2020, Norrköping, Sweden, May 25-29, 2020 [online only]. Eurographics Association 2020, ISBN 978-3-03868-127-4 [contents] - [i16]Navyasri Reddy, Samyak Jain, Pradeep Yarlagadda, Vineet Gandhi:
Tidying Deep Saliency Prediction Architectures. CoRR abs/2003.04942 (2020) - [i15]Aasheesh Singh, Aditya Kamireddypalli, Vineet Gandhi, K. Madhava Krishna:
LiDAR guided Small obstacle Segmentation. CoRR abs/2003.05970 (2020) - [i14]Shyamgopal Karthik, Ameya Prabhu, Vineet Gandhi:
Simple Unsupervised Multi-Object Tracking. CoRR abs/2006.02609 (2020) - [i13]Sarath Sivaprasad, Naresh Manwani, Vineet Gandhi:
The Curious Case of Convex Networks. CoRR abs/2006.05103 (2020) - [i12]Nivedita Rufus, Unni Krishnan R. Nair, K. Madhava Krishna, Vineet Gandhi:
Cosine meets Softmax: A tough-to-beat baseline for visual grounding. CoRR abs/2009.06066 (2020) - [i11]K. L. Bhanu Moorthy, Moneish Kumar, Ramanathan Subramanian, Vineet Gandhi:
GAZED- Gaze-guided Cinematic Editing of Wide-Angle Monocular Video Recordings. CoRR abs/2010.11886 (2020) - [i10]Samyak Jain, Pradeep Yarlagadda, Ramanathan Subramanian, Vineet Gandhi:
AViNet: Diving Deep into Audio-Visual Saliency Prediction. CoRR abs/2012.06170 (2020)
2010 – 2019
- 2019
- [c18]Aryaman Gupta, Kalpit C. Thakkar, Vineet Gandhi, P. J. Narayanan:
Nose, Eyes and Ears: Head Pose Estimation by Locating Facial Keypoints. ICASSP 2019: 1977-1981 - [c17]Syed Ashar Javed, Shreyas Saxena, Vineet Gandhi:
Learning Unsupervised Visual Grounding Through Semantic Self-Supervision. IJCAI 2019: 796-802 - [c16]Sriram N. N., Tirth Maniar, Jayaganesh Kalyanasundaram, Vineet Gandhi, Brojeshwar Bhowmick, K. Madhava Krishna:
Talk to the Vehicle: Language Conditioned Autonomous Navigation of Self Driving Cars. IROS 2019: 5284-5290 - [i9]Shyamgopal Karthik, Abhinav Moudgil, Vineet Gandhi:
Exploring 3 R's of Long-term Tracking: Re-detection, Recovery and Reliability. CoRR abs/1910.12273 (2019) - [i8]Sudheer Achary, Ashar Javed, Nikita Shravan, K. L. Bhanu Moorthy, Vineet Gandhi, Anoop M. Namboodiri:
CineFilter: Unsupervised Filtering for Real Time Autonomous Camera Systems. CoRR abs/1912.05636 (2019) - 2018
- [j3]Kranthi Kumar Rachavarapu, Moneish Kumar, Vineet Gandhi, Ramanathan Subramanian:
Watch to Edit: Video Retargeting using Gaze. Comput. Graph. Forum 37(2): 205-215 (2018) - [c15]Abhinav Moudgil, Vineet Gandhi:
Long-Term Visual Object Tracking Benchmark. ACCV (2) 2018: 629-645 - [c14]Pranjal Kumar Rai, Sajal Maheshwari, Vineet Gandhi:
Document Quality Estimation Using Spatial Frequency Response. ICASSP 2018: 1233-1237 - [c13]Vatsal Shah, Vineet Gandhi:
An Iterative Approach for Shadow Removal in Document Images. ICASSP 2018: 1892-1896 - [c12]Krishnam Gupta, Syed Ashar Javed, Vineet Gandhi, K. Madhava Krishna:
MergeNet: A Deep Net Architecture for Small Obstacle Discovery. ICRA 2018: 1-7 - [c11]Rahul Anand Sharma, Bharath Bhat, Vineet Gandhi, C. V. Jawahar:
Automated Top View Registration of Broadcast Football Videos. WACV 2018: 305-313 - [i7]Syed Ashar Javed, Shreyas Saxena, Vineet Gandhi:
Learning Unsupervised Visual Grounding Through Semantic Self-Supervision. CoRR abs/1803.06506 (2018) - [i6]Krishnam Gupta, Syed Ashar Javed, Vineet Gandhi, K. Madhava Krishna:
MergeNet: A Deep Net Architecture for Small Obstacle Discovery. CoRR abs/1803.06508 (2018) - [i5]Kranthi Kumar Rachavarapu, Moneish Kumar, Vineet Gandhi, Ramanathan Subramanian:
Watch to Edit: Video Retargeting using Gaze. CoRR abs/1807.03125 (2018) - [i4]Aryaman Gupta, Kalpit C. Thakkar, Vineet Gandhi, P. J. Narayanan:
Nose, eyes and ears: Head pose estimation by locating facial keypoints. CoRR abs/1812.00739 (2018) - 2017
- [j2]Moneish Kumar, Vineet Gandhi, Rémi Ronfard, Michael Gleicher:
Zooming On All Actors: Automatic Focus+Context Split Screen Video Generation. Comput. Graph. Forum 36(2): 455-465 (2017) - [j1]Rahul Anand Sharma, Vineet Gandhi, Visesh Chari, C. V. Jawahar:
Automatic analysis of broadcast football videos using contextual priors. Signal Image Video Process. 11(1): 171-178 (2017) - [c10]Krishnam Gupta, Sarthak Upadhyay, Vineet Gandhi, K. Madhava Krishna:
Small obstacle detection using stereo vision for autonomous ground vehicle. AIR 2017: 25:1-25:6 - [c9]Pranjal Kumar Rai, Sajal Maheshwari, Ishit Mehta, Parikshit Sakurikar, Vineet Gandhi:
Beyond OCRs for Document Blur Estimation. ICDAR 2017: 1101-1107 - [c8]Sheetal Reddy, Vineet Gandhi, K. Madhava Krishna:
3D Region Proposals For Selective Object Search. VISIGRAPP (5: VISAPP) 2017: 353-361 - [c7]Moneish Kumar, Vineet Gandhi, Rémi Ronfard, Michael Gleicher:
Zooming On All Actors: Automatic Focus+Context Split Screen Video Generation. WICED@Eurographics 2017: 43 - [e1]William H. Bares, Vineet Gandhi, Quentin Galvane, Rémi Ronfard:
6th Workshop on Intelligent Cinematography and Editing, WICED@Eurographics 2017, Lyon, France, April 24, 2017. Eurographics Association 2017, ISBN 978-3-03868-031-4 [contents] - [i3]Rahul Anand Sharma, Bharath Bhat, Vineet Gandhi, C. V. Jawahar:
Automated Top View Registration of Broadcast Football Videos. CoRR abs/1703.01437 (2017) - [i2]Abhinav Moudgil, Vineet Gandhi:
Long-Term Visual Object Tracking Benchmark. CoRR abs/1712.01358 (2017) - 2016
- [c6]Sajal Maheshwari, Pranjal Kumar Rai, Gopal Sharma, Vineet Gandhi:
Document blur detection using edge profile mining. ICVGIP 2016: 23:1-23:7 - 2015
- [c5]Rémi Ronfard, Benoît Encelle, Nicolas Sauret, Pierre-Antoine Champin, Thomas Steiner, Vineet Gandhi, Cyrille Migniot, Florent Thiery:
Capturing and Indexing Rehearsals: The Design and Usage of a Digital Archive of Performing Arts. Digital Heritage 2015: 533-540 - [c4]Vineet Gandhi, Rémi Ronfard:
A Computational Framework for Vertical Video Editing. WICED@Eurographics 2015: 31-37 - [i1]Rémi Ronfard, Vineet Gandhi, Laurent Boiron:
The Prose Storyboard Language: A Tool for Annotating and Directing Movies. CoRR abs/1508.07593 (2015) - 2014
- [b1]Vineet Gandhi:
Automatic Rush Generation with Application to Theatre Performances. (Généation Automatique de Prises de Vues Cinématographiques avec Applications aux Captations de Théâtre). University of Grenoble, France, 2014 - [c3]Vineet Gandhi, Rémi Ronfard, Michael Gleicher:
Multi-clip video editing from a single viewpoint. CVMP 2014: 9:1-9:10 - 2013
- [c2]Vineet Gandhi, Rémi Ronfard:
Detecting and Naming Actors in Movies Using Generative Appearance Models. CVPR 2013: 3706-3713 - 2012
- [c1]Vineet Gandhi, Jan Cech, Radu Horaud:
High-resolution depth maps based on TOF-stereo fusion. ICRA 2012: 4742-4749
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-11-15 19:31 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint