"spatial understanding" Papers
3 papers found
Conference
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Fanqing Meng, Jin Wang, Chuanhao Li et al.
ICLR 2025arXiv:2408.02718
48
citations
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
Chan Hee Song, Valts Blukis, Jonathan Tremblay et al.
CVPR 2025arXiv:2411.16537
90
citations
See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model
Pengteng Li, Pinhao Song, Wuyang Li et al.
NEURIPS 2025oralarXiv:2509.16087
1
citations