"ppo algorithm" Papers
2 papers found
Conference
MaFeRw: Query Rewriting with Multi-Aspect Feedbacks for Retrieval-Augmented Large Language Models
Yujing Wang, Hainan Zhang, Liang Pang et al.
AAAI 2025paperarXiv:2408.17072
8
citations
Breaking the Barrier: Enhanced Utility and Robustness in Smoothed DRL Agents
Chung-En Sun, Sicun Gao, Lily Weng
ICML 2024arXiv:2406.18062
6
citations