"iterative model improvement" Papers
2 papers found
Conference
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
Zaid Khan, Elias Stengel-Eskin, Jaemin Cho et al.
ICLR 2025arXiv:2410.06215
9
citations
AlphaZero-Like Tree-Search can Guide Large Language Model Decoding and Training
Ziyu Wan, Xidong Feng, Muning Wen et al.
ICML 2024arXiv:2309.17179
304
citations