"digital environments" Papers
2 papers found
Conference
RefactorBench: Evaluating Stateful Reasoning in Language Agents Through Code
Dhruv Gautam, Spandan Garg, Jinu Jang et al.
ICLR 2025arXiv:2503.07832
17
citations
BAGEL: Bootstrapping Agents by Guiding Exploration with Language
Shikhar Murty, Christopher Manning, Peter Shaw et al.
ICML 2024arXiv:2403.08140
29
citations