"visual grounding" Papers
7 papers found
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation
Yuxuan Wang, Yijun Liu, Fei Yu et al.
AAAI 2025, arXiv:2407.01081
7 citations
Progressive Multi-granular Alignments for Grounded Reasoning in Large Vision-Language Models
Quang-Hung Le, Long Hoang Dang, Ngan Hoang Le et al.
AAAI 2025, arXiv:2412.08125
3 citations
VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis
Chao Pang, Xingxing Weng, Jiang Wu et al.
AAAI 2025, arXiv:2403.20213
54 citations
Cycle-Consistency Learning for Captioning and Grounding
Ning Wang, Jiajun Deng, Mingbo Jia
AAAI 2024, arXiv:2312.15162
14 citations
GroundVLP: Harnessing Zero-Shot Visual Grounding from Vision-Language Pre-training and Open-Vocabulary Object Detection
Haozhan Shen, Tiancheng Zhao, Mingwei Zhu et al.
AAAI 2024, arXiv:2312.15043
26 citations
Parallel Vertex Diffusion for Unified Visual Grounding
Zesen Cheng, Kehan Li, Peng Jin et al.
AAAI 2024, arXiv:2303.07216
37 citations
Unifying Visual and Vision-Language Tracking via Contrastive Learning
AAAI 2024, arXiv:2401.11228
47 citations