"reinforcement learning agents" Papers

12 papers found

Deep RL Needs Deep Behavior Analysis: Exploring Implicit Planning by Model-Free Agents in Open-Ended Environments

Riley Simmons-Edler, Ryan Badman, Felix Berg et al.

NEURIPS 2025, oral, arXiv:2506.06981, 3 citations

Ground-Compose-Reinforce: Grounding Language in Agentic Behaviours using Limited Data

Andrew Li, Toryn Klassen, Andrew Wang et al.

NEURIPS 2025, arXiv:2507.10741, 1 citation

Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization

Jian-Ting Guo, Yu-Cheng Chen, Ping-Chun Hsieh et al.

NEURIPS 2025, arXiv:2511.15055

OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code

Maxence Faldor, Jenny Zhang, Antoine Cully et al.

ICLR 2025, arXiv:2405.15568, 48 citations

Prevalence of Negative Transfer in Continual Reinforcement Learning: Analyses and a Simple Baseline

Hongjoon Ahn, Jinu Hyeon, Youngmin Oh et al.

ICLR 2025, 2 citations

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

Ibragim Badertdinov, Alexander Golubev, Maksim Nekrashevich et al.

NEURIPS 2025, arXiv:2505.20411, 33 citations

Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning

Jiuqi Wang, Ethan Blaser, Hadi Daneshmand et al.

ICLR 2025, oral, arXiv:2405.13861, 15 citations

DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design

Samuel Garcin, James Doran, Shangmin Guo et al.

ICML 2024, arXiv:2402.03479, 11 citations

Refining Minimax Regret for Unsupervised Environment Design

Michael Beukman, Samuel Coward, Michael Matthews et al.

ICML 2024, arXiv:2402.12284, 15 citations

Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL

Xiangyu Liu, Souradip Chakraborty, Yanchao Sun et al.

ICLR 2024, arXiv:2305.17342, 9 citations

Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks

Khurram Javed, Haseeb Shah, Richard Sutton et al.

ICML 2024, arXiv:2302.05326, 10 citations

Think Before You Act: Decision Transformers with Working Memory

Jikun Kang, Romain Laroche, Xingdi Yuan et al.

ICML 2024, arXiv:2305.16338