"language model self-improvement" Papers
3 papers found
Conference
Latent Principle Discovery for Language Model Self-Improvement
Keshav Ramji, Tahira Naseem, Ramón Astudillo
NEURIPS 2025oralarXiv:2505.16927
CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay
Natasha Butt, Blazej Manczak, Auke Wiggers et al.
ICML 2024arXiv:2402.04858
27
citations
RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Harrison Lee, Samrat Phatale, Hassan Mansoor et al.
ICML 2024arXiv:2309.00267
527
citations