"multimodal evaluation" Papers
7 papers found
Conference
AHa-Bench: Benchmarking Audio Hallucinations in Large Audio-Language Models
Xize Cheng, Dongjie Fu, Chenyuhao Wen et al.
NEURIPS 2025
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
Yue Yang, Shuibo Zhang, Kaipeng Zhang et al.
ICLR 2025arXiv:2410.08695
17
citations
Enhancing Reward Models for High-quality Image Generation: Beyond Text-Image Alignment
ying ba, Tianyu Zhang, Yalong Bai et al.
ICCV 2025arXiv:2507.19002
6
citations
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Jiacheng Chen, Tianhao Liang, Sherman Siu et al.
ICLR 2025arXiv:2410.10563
30
citations
MetaMetrics: Calibrating Metrics for Generation Tasks Using Human Preferences
Genta Winata, David Anugraha, Lucky Susanto et al.
ICLR 2025arXiv:2410.02381
17
citations
On Large Multimodal Models as Open-World Image Classifiers
Alessandro Conti, Massimiliano Mancini, Enrico Fini et al.
ICCV 2025arXiv:2503.21851
3
citations
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
Kaining Ying, Fanqing Meng, Jin Wang et al.
ICML 2024arXiv:2404.16006
163
citations