"self-evolution" Papers
2 papers found
Conference
LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
Guanzheng Chen, Xin Li, Michael Qizhe Shieh et al.
ICLR 2025, arXiv:2502.13922
15 citations
TTRL: Test-Time Reinforcement Learning
Yuxin Zuo, Kaiyan Zhang, Li Sheng et al.
NeurIPS 2025, arXiv:2504.16084
129 citations