"sequential visual data" Papers
2 papers found
Conference
Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis
Chaoyou Fu, Yuhan Dai, Yongdong Luo et al.
CVPR 2025highlightarXiv:2405.21075
917
citations
Data-efficient Large Vision Models through Sequential Autoregression
Zhiwei Hao, Jianyuan Guo, Chengcheng Wang et al.
ICML 2024arXiv:2402.04841
12
citations