"multi-modal benchmarking" Papers
2 papers found
Conference
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark
Ronghao Dang, Yuqian Yuan, Wenqi Zhang et al.
CVPR 2025, arXiv:2501.05031
16 citations
MANTA: A Large-Scale Multi-View and Visual-Text Anomaly Detection Dataset for Tiny Objects
Lei Fan, Dongdong Fan, Zhiguang Hu et al.
CVPR 2025, arXiv:2412.04867
18 citations