Poster "multi-reward optimization" Papers
3 papers found
Iterative Foundation Model Fine-Tuning on Multiple Rewards
Pouya M. Ghari, Simone Sciabola, Ye Wang
NEURIPS 2025, arXiv:2511.00220
MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Chenglong Wang, Yang Gan, Hang Zhou et al.
NEURIPS 2025, arXiv:2510.21473
1 citation
Personalized Preference Fine-tuning of Diffusion Models
Meihua Dang, Anikait Singh, Linqi Zhou et al.
CVPR 2025, arXiv:2501.06655
15 citations