Poster "spatial understanding" Papers
2 papers found
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models
Fanqing Meng, Jin Wang, Chuanhao Li et al.
ICLR 2025, arXiv:2408.02718
48 citations
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics
Chan Hee Song, Valts Blukis, Jonathan Tremblay et al.
CVPR 2025, arXiv:2411.16537
90 citations