"reinforcement learning alignment" Papers
5 papers found
Conference
Improving Video Generation with Human Feedback
Jie Liu, Gongye Liu, Jiajun Liang et al.
NEURIPS 2025arXiv:2501.13918
127
citations
Measuring And Improving Engagement of Text-to-Image Generation Models
Varun Khurana, Yaman Singla, Jayakumar Subramanian et al.
ICLR 2025
2
citations
Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
Qingming LIU, Zhen Liu, Dinghuai Zhang et al.
NEURIPS 2025arXiv:2506.15684
2
citations
PurpCode: Reasoning for Safer Code Generation
Jiawei Liu, Nirav Diwan, Zhe Wang et al.
NEURIPS 2025arXiv:2507.19060
8
citations
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
Zongmeng Zhang, Yufeng Shi, Jinhua Zhu et al.
ICML 2024arXiv:2410.16843
2
citations