Paper "video question answering" Papers
6 papers found
Conference
ALLVB: All-in-One Long Video Understanding Benchmark
Xichen Tan, Yuanjing Luo, Yunfan Ye et al.
AAAI 2025paperarXiv:2503.07298
6
citations
Assessing Modality Bias in Video Question Answering Benchmarks with Multimodal Large Language Models
Jean Park, Kuk Jin Jang, Basam Alasaly et al.
AAAI 2025paperarXiv:2408.12763
16
citations
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO
Daechul Ahn, Yura Choi, San Kim et al.
AAAI 2025paperarXiv:2406.11280
3
citations
BDIQA: A New Dataset for Video Question Answering to Explore Cognitive Reasoning through Theory of Mind
Yuanyuan Mao, Xin Lin, Qin Ni et al.
AAAI 2024paperarXiv:2402.07402
6
citations
MuLTI: Efficient Video-and-Language Understanding with Text-Guided MultiWay-Sampler and Multiple Choice Modeling
Jiaqi Xu, Bo Liu, Yunkuo Chen et al.
AAAI 2024paperarXiv:2303.05707
2
citations
YTCommentQA: Video Question Answerability in Instructional Videos
Saelyne Yang, Sunghyun Park, Yunseok Jang et al.
AAAI 2024paperarXiv:2401.17343
5
citations