Poster by Chu Xu Papers
3 papers found
Conference
Mechanism Design for LLM Fine-tuning with Multiple Reward Models
Haoran Sun, Yurong Chen, Siwei Wang et al.
NEURIPS 2025arXiv:2405.16276
19
citations
MODEL SHAPLEY: Find Your Ideal Parameter Player via One Gradient Backpropagation
Chu Xu, Xinke Jiang, Rihong Qiu et al.
NEURIPS 2025
Stackelberg Self-Annotation: A Robust Approach to Data-Efficient LLM Alignment
Chu Xu, Zhixin Zhang, Tianyu Jia et al.
NEURIPS 2025arXiv:2502.18099
3
citations