Does Spatial Cognition Emerge in Frontier Models?

51citations

arXiv:2410.06468

citations

#320

in ICLR 2025

of 3827 papers

Top Authors

Data Points

Top Authors

Santhosh Kumar Ramakrishnan Erik Wijmans Philipp Krähenbühl Vladlen Koltun

Topics

spatial cognition benchmark evaluation large language models large multimodal models cognitive science spatial attention spatial memory animal cognition

Abstract

Not yet. We present SPACE, a benchmark that systematically evaluates spatial cognition in frontier models. Our benchmark builds on decades of research in cognitive science. It evaluates large-scale mapping abilities that are brought to bear when an organism traverses physical environments, smaller-scale reasoning about object shapes and layouts, and cognitive infrastructure such as spatial attention and memory. For many tasks, we instantiate parallel presentations via text and images, allowing us to benchmark both large language models and large multimodal models. Results suggest that contemporary frontier models fall short of the spatial intelligence of animals, performing near chance level on a number of classic tests of animal cognition. Code and data are available: https://github.com/apple/ml-space-benchmark

Citation History

Jan 26, 2026

Jan 27, 2026

Feb 2, 2026

50+50

Feb 7, 2026

51+1

Feb 13, 2026