Poster "long-term planning" Papers
6 papers found
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning
Jiacheng Ye, Jiahui Gao, Shansan Gong et al.
ICLR 2025, arXiv:2410.14157
84 citations
Factorio Learning Environment
Jack Hopkins, Mart Bakler, Akbir Khan
NeurIPS 2025, arXiv:2503.09617
2 citations
Implicit Search via Discrete Diffusion: A Study on Chess
Jiacheng Ye, Zhenyu Wu, Jiahui Gao et al.
ICLR 2025, arXiv:2502.19805
14 citations
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
Zhaolin Gao, Wenhao Zhan, Jonathan Chang et al.
ICLR 2025, arXiv:2410.04612
18 citations
Highway Value Iteration Networks
Yuhui Wang, Weida Li, Francesco Faccio et al.
ICML 2024, arXiv:2406.03485
3 citations
MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation
Qian Huang, Jian Vora, Percy Liang et al.
ICML 2024, arXiv:2310.03302
168 citations