"data valuation" Papers

13 papers found

DataRater: Meta-Learned Dataset Curation

Dan Andrei Calian, Greg Farquhar, Iurii Kemaev et al.

NEURIPS 2025, arXiv:2505.17895
7 citations

DATE-LM: Benchmarking Data Attribution Evaluation for Large Language Models

Cathy Jiao, Yijun Pan, Emily Xiao et al.

NEURIPS 2025, arXiv:2507.09424

Efficient Top-m Data Values Identification for Data Selection

Xiaoqiang Lin, Xinyi Xu, See-Kiong Ng et al.

ICLR 2025

Faithful Group Shapley Value

Kiljae Lee, Ziqi Liu, Weijing Tang et al.

NEURIPS 2025, arXiv:2505.19013
2 citations

GMValuator: Similarity-based Data Valuation for Generative Models

Jiaxi Yang, Wenlong Deng, Benlin Liu et al.

ICLR 2025, arXiv:2304.10701
3 citations

Influence Guided Context Selection for Effective Retrieval-Augmented Generation

Jiale Deng, Yanyan Shen, Ziyuan Pei et al.

NEURIPS 2025, arXiv:2509.21359
2 citations

KAIROS: Scalable Model-Agnostic Data Valuation

Jiongli Zhu, Parjanya Prashant, Alex Cloninger et al.

NEURIPS 2025, arXiv:2506.23799

Regression-adjusted Monte Carlo Estimators for Shapley Values and Probabilistic Values

R. Teal Witter, Yurong Liu, Christopher Musco

NEURIPS 2025, arXiv:2506.11849
4 citations

SAVA: Scalable Learning-Agnostic Data Valuation

Samuel Kessler, Tam Le, Vu Nguyen

ICLR 2025, arXiv:2406.01130
1 citation

Shapley-Based Data Valuation for Weighted $k$-Nearest Neighbors

Guangyi Zhang, Qiyu Liu, Aristides Gionis

NEURIPS 2025

Distributionally Robust Data Valuation

Xiaoqiang Lin, Xinyi Xu, Zhaoxuan Wu et al.

ICML 2024

Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits

Jiachen Wang, Tianji Yang, James Zou et al.

ICML 2024, arXiv:2405.03875
23 citations

Scaling Laws for the Value of Individual Data Points in Machine Learning

Ian Covert, Wenlong Ji, Tatsunori Hashimoto et al.

ICML 2024, arXiv:2405.20456
11 citations