"video question-answering" Papers
6 papers found
Conference
Flexible Frame Selection for Efficient Video Reasoning
Shyamal Buch, Arsha Nagrani, Anurag Arnab et al.
CVPR 2025
10
citations
Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding
Weiyu Guo, Ziyang Chen, Shaoguang WANG et al.
NEURIPS 2025oralarXiv:2503.13139
18
citations
Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization
Pritam Sarkar, Ali Etemad
NEURIPS 2025oralarXiv:2504.12083
2
citations
Temporal Chain of Thought: Long-Video Understanding by Thinking in Frames
Anurag Arnab, Ahmet Iscen, Mathilde Caron et al.
NEURIPS 2025oralarXiv:2507.02001
9
citations
FunQA: Towards Surprising Video Comprehension
Binzhu Xie, Sicheng Zhang, Zitang Zhou et al.
ECCV 2024arXiv:2306.14899
36
citations
Learning Video Context as Interleaved Multimodal Sequences
Qinghong Lin, Pengchuan Zhang, Difei Gao et al.
ECCV 2024arXiv:2407.21757
12
citations