Poster "multi-image understanding" Papers
3 papers found
Conference
Dynamic-VLM: Simple Dynamic Visual Token Compression for VideoLLM
Han Wang, Yuxiang Nie, Yongjie Ye et al.
ICCV 2025arXiv:2412.09530
15
citations
VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models
JIACHENG RUAN, Wenzhen Yuan, Xian Gao et al.
ICCV 2025arXiv:2503.07478
15
citations
Zooming from Context to Cue: Hierarchical Preference Optimization for Multi-Image MLLMs
Xudong Li, Mengdan Zhang, Peixian Chen et al.
NEURIPS 2025arXiv:2505.22396
2
citations