Oral "video comprehension" Papers
4 papers found
Conference
HoliTom: Holistic Token Merging for Fast Video Large Language Models
Kele Shao, Keda TAO, Can Qin et al.
NEURIPS 2025oralarXiv:2505.21334
20
citations
Seeing the Arrow of Time in Large Multimodal Models
Zihui (Sherry) Xue, Romy Luo, Kristen Grauman
NEURIPS 2025oralarXiv:2506.03340
6
citations
Temporal Reasoning Transfer from Text to Video
Lei Li, Yuanxin Liu, Linli Yao et al.
ICLR 2025oralarXiv:2410.06166
21
citations
Unhackable Temporal Reward for Scalable Video MLLMs
En Yu, Kangheng Lin, Liang Zhao et al.
ICLR 2025oralarXiv:2502.12081
22
citations