"evaluation metrics" Papers
9 papers found
Conference
A Reliable Cryptographic Framework for Empirical Machine Unlearning Evaluation
Yiwen Tu, Pingbang Hu, Jiaqi Ma
NEURIPS 2025arXiv:2404.11577
2
citations
CEB: Compositional Evaluation Benchmark for Fairness in Large Language Models
Song Wang, Peng Wang, Tong Zhou et al.
ICLR 2025arXiv:2407.02408
14
citations
Image Over Text: Transforming Formula Recognition Evaluation with Character Detection Matching
Bin Wang, Fan Wu, Linke Ouyang et al.
CVPR 2025arXiv:2409.03643
13
citations
MEGA-Bench: Scaling Multimodal Evaluation to over 500 Real-World Tasks
Jiacheng Chen, Tianhao Liang, Sherman Siu et al.
ICLR 2025arXiv:2410.10563
30
citations
Towards Unifying Evaluation of Counterfactual Explanations: Leveraging Large Language Models for Human-Centric Assessments
Marharyta Domnich, Julius Välja, Rasmus Moorits Veski et al.
AAAI 2025paperarXiv:2410.21131
8
citations
Challenges and Considerations in the Evaluation of Bayesian Causal Discovery
Amir Mohammad Karimi Mamaghan, Panagiotis Tigas, Karl Johansson et al.
ICML 2024arXiv:2406.03209
6
citations
Position: Quo Vadis, Unsupervised Time Series Anomaly Detection?
M. Saquib Sarfraz, Mei-Yen Chen, Lukas Layer et al.
ICML 2024arXiv:2405.02678
11
citations
Size-invariance Matters: Rethinking Metrics and Losses for Imbalanced Multi-object Salient Object Detection
Feiran Li, Qianqian Xu, Shilong Bao et al.
ICML 2024spotlightarXiv:2405.09782
14
citations
Time Weaver: A Conditional Time Series Generation Model
Sai Shankar Narasimhan, Shubhankar Agarwal, Oguzhan Akcin et al.
ICML 2024spotlightarXiv:2403.02682
37
citations