"keyframe selection" Papers
6 papers found
Conference
Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing
Yudong Liu, Jingwei Sun, Yueqian Lin et al.
ICCV 2025arXiv:2503.10742
7
citations
Logic-in-Frames: Dynamic Keyframe Search via Visual Semantic-Logical Verification for Long Video Understanding
Weiyu Guo, Ziyang Chen, Shaoguang WANG et al.
NEURIPS 2025oralarXiv:2503.13139
18
citations
Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
Hao Zhong, Muzhi Zhu, Zongze Du et al.
NEURIPS 2025oralarXiv:2505.20256
14
citations
Progress-Aware Video Frame Captioning
Zihui Xue, Joungbin An, Xitong Yang et al.
CVPR 2025arXiv:2412.02071
7
citations
Video Summarization with Large Language Models
Min Jung Lee, Dayoung Gong, Minsu Cho
CVPR 2025arXiv:2504.11199
11
citations
Reinforcement Learning Meets Visual Odometry
Nico Messikommer, Giovanni Cioffi, Mathias Gehrig et al.
ECCV 2024arXiv:2407.15626
14
citations