"benchmark datasets" Papers

16 papers found

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

$F^3Set$: Towards Analyzing Fast, Frequent, and Fine-grained Events from Videos

Zhaoyu Liu, Kan Jiang, Murong Ma et al.

ICLR 2025oral

citations

Cherry-Picking in Time Series Forecasting: How to Select Datasets to Make Your Model Shine

Luis Roque, Vítor Cerqueira, Carlos Soares et al.

AAAI 2025paperarXiv:2412.14435

citations

ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval

Eric Xing, Pranavi Kolouju, Robert Pless et al.

CVPR 2025arXiv:2505.20764

citations

ILIAS: Instance-Level Image retrieval At Scale

Giorgos Kordopatis-Zilos, Vladan Stojnić, Anna Manko et al.

CVPR 2025arXiv:2502.11748

citations

Is Large-scale Pretraining the Secret to Good Domain Generalization?

Piotr Teterwak, Kuniaki Saito, Theodoros Tsiligkaridis et al.

ICLR 2025arXiv:2412.02856

citations

NoBOOM: Chemical Process Datasets for Industrial Anomaly Detection

Dennis Wagner, Fabian Hartung, Justus Arweiler et al.

NEURIPS 2025

ORBIT - Open Recommendation Benchmark for Reproducible Research with Hidden Tests

Jingyuan He, Jiongnan Liu, Vishan Oberoi et al.

NEURIPS 2025arXiv:2510.26095

PSBench: a large-scale benchmark for estimating the accuracy of protein complex structural models

Pawan Neupane, Jian Liu, Jianlin Cheng

NEURIPS 2025arXiv:2505.22674

The 3D-PC: a benchmark for visual perspective taking in humans and machines

Drew Linsley, Peisen Zhou, Alekh Ashok et al.

ICLR 2025arXiv:2406.04138

citations

Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection?

Rosario Leonardi, Antonino Furnari, Francesco Ragusa et al.

ECCV 2024arXiv:2312.02672

citations

Directly Denoising Diffusion Models

Dan Zhang, Jingjing Wang, Feng Luo

ICML 2024arXiv:2405.13540

citations

MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models

Xin Liu, Yichen Zhu, Jindong Gu et al.

ECCV 2024arXiv:2311.17600

199

citations

Navigating Text-to-Image Generative Bias across Indic Languages

Surbhi Mittal, Arnav Sudan, MAYANK VATSA et al.

ECCV 2024arXiv:2408.00283

citations

PACE: Pose Annotations in Cluttered Environments

Yang You, kai xiong, Zhening Yang et al.

ECCV 2024

citations

RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction

Yemin Yu, Luotian Yuan, Ying WEI et al.

AAAI 2024paperarXiv:2312.10900

citations

WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing

Shuokang Huang, Kaihan Li, Di You et al.

ECCV 2024arXiv:2402.09430

citations