18th ECCV 2024: Milan, Italy - Part XLII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol: Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLII. Lecture Notes in Computer Science 15100, Springer 2025, ISBN 978-3-031-72945-4
- Dimity Miller, Niko Sünderhauf, Alex Kenna, Keita Mason: Open-Set Recognition in the Age of Vision-Language Models. 1-18
- Qing Su, Shihao Ji: Unsqueeze [CLS] Bottleneck to Learn Rich Representations. 19-37
- Shicai Wei, Yang Luo, Yuji Wang, Chunbo Luo: Robust Multimodal Learning via Representation Decoupling. 38-54
- Yasi Zhang, Peiyu Yu, Ying Nian Wu: Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models. 55-71
- Shuokang Huang, Kaihan Li, Di You, Yichong Chen, Arvin Lin, Siying Liu, Xiaohui Li, Julie A. McCann: WiMANS: A Benchmark Dataset for WiFi-Based Multi-user Activity Sensing. 72-91
- Hyunwoo Yu, Yubin Cho, Beoungwoo Kang, Seunghun Moon, Kyeongbo Kong, Suk-Ju Kang: Embedding-Free Transformer with Inference Spatial Reduction for Efficient Semantic Segmentation. 92-110
- Zhengfeng Lai, Haotian Zhang, Bowen Zhang, Wentao Wu, Haoping Bai, Aleksei Timofeev, Xianzhi Du, Zhe Gan, Jiulong Shan, Chen-Nee Chuah, Yinfei Yang, Meng Cao: VeCLIP: Improving CLIP Training via Visual-Enriched Captions. 111-127
- Manyuan Zhang, Guanglu Song, Xiaoyu Shi, Yu Liu, Hongsheng Li: Three Things We Need to Know About Transferring Stable Diffusion to Visual Dense Prediction Tasks. 128-145
- Yongjian Zhang, Longguang Wang, Kunhong Li, Yun Wang, Yulan Guo: Learning Representations from Foundation Models for Domain Generalized Stereo Matching. 146-162
- Jianxiong Tang, Jian-Huang Lai, Lingxiao Yang, Xiaohua Xie: Spike-Temporal Latent Representation for Energy-Efficient Event-to-Video Reconstruction. 163-179
- Qinji Yu, Yirui Wang, Ke Yan, Haoshen Li, Dazhou Guo, Li Zhang, Na Shen, Qifeng Wang, Xiaowei Ding, Le Lu, Xianghua Ye, Dakai Jin: Effective Lymph Nodes Detection in CT Scans Using Location Debiased Query Selection and Contrastive Query Representation in Transformer. 180-198
- Shuangkang Fang, Yufeng Wang, Yi-Hsuan Tsai, Yi Yang, Wenrui Ding, Shuchang Zhou, Ming-Hsuan Yang: Chat-Edit-3D: Interactive 3D Scene Editing via Text Prompts. 199-216
- Zeyu Xiao, Dachun Kai, Yueyi Zhang, Zheng-Jun Zha, Xiaoyan Sun, Zhiwei Xiong: Event-Adapted Video Super-Resolution. 217-235
- Sounak Mondal, Seoyoung Ahn, Zhibo Yang, Niranjan Balasubramanian, Dimitris Samaras, Gregory J. Zelinsky, Minh Hoai: Look Hear: Gaze Prediction for Speech-Directed Human Attention. 236-255
- Xiaoyong Lu, Songlin Du: Raising the Ceiling: Conflict-Free Local Feature Matching with Dynamic View Switching. 256-273
- Haibo Wang, Weifeng Ge: Q&A Prompts: Discovering Rich Visual Clues through Mining Question-Answer Prompts for VQA requiring Diverse World Knowledge. 274-292
- Mengnan Zhao, Lihe Zhang, Yuqiu Kong, Baocai Yin: Catastrophic Overfitting: A Potential Blessing in Disguise. 293-310
- Shengqi Xu, Run Sun, Yi Chang, Shuning Cao, Xueyao Xiao, Luxin Yan: Long-Range Turbulence Mitigation: A Large-Scale Dataset and A Coarse-to-Fine Framework. 311-329
- Yuwei Guo, Ceyuan Yang, Anyi Rao, Maneesh Agrawala, Dahua Lin, Bo Dai: SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models. 330-348
- Peiqi Jiao, Yuecong Min, Xilin Chen: Visual Alignment Pre-training for Sign Language Translation. 349-367
- Yiqi Lin, Conghui He, Alex Jinpeng Wang, Bin Wang, Weijia Li, Mike Zheng Shou: Parrot Captions Teach CLIP to Spot Text. 368-385
- Yihan Hu, Siqi Chai, Zhening Yang, Jingyu Qian, Kun Li, Wenxin Shao, Haichao Zhang, Wei Xu, Qiang Liu: Solving Motion Planning Tasks with a Scalable Generative Model. 386-404
- Yufei Zhan, Yousong Zhu, Zhiyang Chen, Fan Yang, Ming Tang, Jinqiao Wang: Griffon: Spelling Out All Object Locations at Any Granularity with Large Language Models. 405-422
- Huangbiao Xu, Xiao Ke, Yuezhou Li, Rui Xu, Huanqi Wu, Xiaofeng Lin, Wenzhong Guo: Vision-Language Action Knowledge Learning for Semantic-Aware Action Quality Assessment. 423-440
- Tao Chen, Xiruo Jiang, Gensheng Pei, Zeren Sun, Yucheng Wang, Yazhou Yao: Knowledge Transfer with Simulated Inter-image Erasing for Weakly Supervised Semantic Segmentation. 441-458
- EungGu Kang, Byeonghun Lee, Sunghoon Im, Kyong Hwan Jin: BurstM: Deep Burst Multi-scale SR Using Fourier Space with Optical Flow. 459-477
- Tao Huang, Guangqi Jiang, Yanjie Ze, Huazhe Xu: Diffusion Reward: Learning Rewards via Conditional Video Diffusion. 478-495