"benchmark datasets" Papers
16 papers found
Conference
$F^3Set$: Towards Analyzing Fast, Frequent, and Fine-grained Events from Videos
Zhaoyu Liu, Kan Jiang, Murong Ma et al.
Cherry-Picking in Time Series Forecasting: How to Select Datasets to Make Your Model Shine
Luis Roque, Vítor Cerqueira, Carlos Soares et al.
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval
Eric Xing, Pranavi Kolouju, Robert Pless et al.
ILIAS: Instance-Level Image retrieval At Scale
Giorgos Kordopatis-Zilos, Vladan Stojnić, Anna Manko et al.
Is Large-scale Pretraining the Secret to Good Domain Generalization?
Piotr Teterwak, Kuniaki Saito, Theodoros Tsiligkaridis et al.
NoBOOM: Chemical Process Datasets for Industrial Anomaly Detection
Dennis Wagner, Fabian Hartung, Justus Arweiler et al.
ORBIT - Open Recommendation Benchmark for Reproducible Research with Hidden Tests
Jingyuan He, Jiongnan Liu, Vishan Oberoi et al.
PSBench: a large-scale benchmark for estimating the accuracy of protein complex structural models
Pawan Neupane, Jian Liu, Jianlin Cheng
The 3D-PC: a benchmark for visual perspective taking in humans and machines
Drew Linsley, Peisen Zhou, Alekh Ashok et al.
Are Synthetic Data Useful for Egocentric Hand-Object Interaction Detection?
Rosario Leonardi, Antonino Furnari, Francesco Ragusa et al.
Directly Denoising Diffusion Models
Dan Zhang, Jingjing Wang, Feng Luo
MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models
Xin Liu, Yichen Zhu, Jindong Gu et al.
Navigating Text-to-Image Generative Bias across Indic Languages
Surbhi Mittal, Arnav Sudan, MAYANK VATSA et al.
PACE: Pose Annotations in Cluttered Environments
Yang You, kai xiong, Zhening Yang et al.
RetroOOD: Understanding Out-of-Distribution Generalization in Retrosynthesis Prediction
Yemin Yu, Luotian Yuan, Ying WEI et al.
WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing
Shuokang Huang, Kaihan Li, Di You et al.