"video large multimodal models" Papers
4 papers found
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO
Daechul Ahn, Yura Choi, San Kim et al.
AAAI 2025 | paper | arXiv:2406.11280
3 citations
SafeVid: Toward Safety Aligned Video Large Multimodal Models
Yixu Wang, Jiaxin Song, Yifeng Gao et al.
NeurIPS 2025 | arXiv:2505.11926
4 citations
Unleashing Hour-Scale Video Training for Long Video-Language Understanding
Jingyang Lin, Jialian Wu, Ximeng Sun et al.
NeurIPS 2025 | oral | arXiv:2506.05332
10 citations
VideoICL: Confidence-based Iterative In-context Learning for Out-of-Distribution Video Understanding
Kangsan Kim, Geon Park, Youngwan Lee et al.
CVPR 2025 | arXiv:2412.02186
12 citations