Poster "training data attribution" Papers
3 papers found
Conference
Better Training Data Attribution via Better Inverse Hessian-Vector Products
Andrew Wang, Elisa Nguyen, Runshi Yang et al.
NEURIPS 2025arXiv:2507.14740
4
citations
Explainable Reinforcement Learning from Human Feedback to Improve Alignment
Shicheng Liu, Siyuan Xu, Wenjie Qiu et al.
NEURIPS 2025arXiv:2512.13837
Distilled Datamodel with Reverse Gradient Matching
Jingwen Ye, Ruonan Yu, Songhua Liu et al.
CVPR 2024arXiv:2404.14006
3
citations