by MuRun Yang Papers
2 papers found
Conference
GRAM: A Generative Foundation Reward Model for Reward Generalization
Chenglong Wang, Yang Gan, Yifu Huo et al.
ICML 2025arXiv:2506.14175
14
citations
MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization
Chenglong Wang, Yang Gan, Hang Zhou et al.
NEURIPS 2025arXiv:2510.21473
1
citations