"multimodal benchmark evaluation" Papers
2 papers found
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models
Junyan Ye, Baichuan Zhou, Zilong Huang et al.
ICLR 2025, arXiv:2410.09732
30 citations
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
Xing Han Lù, Zdeněk Kasner, Siva Reddy
ICML 2024 (spotlight), arXiv:2402.05930
121 citations