"multimodal question answering" Papers
3 papers found
MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models
Wenbo Hu, Jia-Chen Gu, Zi-Yi Dou et al.
ICLR 2025, arXiv:2410.08182
30 citations
TCM-Ladder: A Benchmark for Multimodal Question Answering on Traditional Chinese Medicine
Jiacheng Xie, Yang Yu, Ziyang Zhang et al.
NeurIPS 2025, arXiv:2505.24063
3 citations
Can Language Beat Numerical Regression? Language-Based Multimodal Trajectory Prediction
Inhwan Bae, Junoh Lee, Hae-Gon Jeon
CVPR 2024, arXiv:2403.18447
59 citations