"visual grounding" Papers
54 papers found • Page 2 of 2
VidLA: Video-Language Alignment at Scale
Mamshad Nayeem Rizve, Fan Fei, Jayakrishnan Unnikrishnan et al.
CVPR 2024 • arXiv:2403.14870 • 8 citations
Visual Grounding for Object-Level Generalization in Reinforcement Learning
Haobin Jiang, Zongqing Lu
ECCV 2024 • arXiv:2408.01942 • 4 citations
Visual Relationship Transformation
Xiaoyu Xu, Jiayan Qiu, Baosheng Yu et al.
ECCV 2024
Zero-shot Referring Expression Comprehension via Structural Similarity Between Images and Captions
Zeyu Han, Fangrui Zhu, Qianru Lao et al.
CVPR 2024 • arXiv:2311.17048 • 21 citations