Spotlight "vision-language tasks" Papers
2 papers found
Conference
Head Pursuit: Probing Attention Specialization in Multimodal Transformers
Lorenzo Basile, Valentino Maiorca, Diego Doimo et al.
NeurIPS 2025, spotlight, arXiv:2510.21518
5 citations
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface
Hao Tang, Chen-Wei Xie, Haiyang Wang et al.
NeurIPS 2025, spotlight, arXiv:2503.01342
14 citations