"spatial comprehension" Papers
3 papers found
Conference
Do Large Language Models Truly Understand Geometric Structures?
Xiaofeng Wang, Yiming Wang, Wenhong Zhu et al.
ICLR 2025arXiv:2501.13773
9
citations
ScImage: How good are multimodal large language models at scientific text-to-image generation?
Leixin Zhang, Steffen Eger, Yinjie Cheng et al.
ICLR 2025arXiv:2412.02368
5
citations
Unleashing Hour-Scale Video Training for Long Video-Language Understanding
Jingyang Lin, Jialian Wu, Ximeng Sun et al.
NEURIPS 2025oralarXiv:2506.05332
10
citations