"reinforcement learning agents" Papers
12 papers found
Deep RL Needs Deep Behavior Analysis: Exploring Implicit Planning by Model-Free Agents in Open-Ended Environments
Riley Simmons-Edler, Ryan Badman, Felix Berg et al.
NEURIPS 2025, oral, arXiv:2506.06981
3 citations
Ground-Compose-Reinforce: Grounding Language in Agentic Behaviours using Limited Data
Andrew Li, Toryn Klassen, Andrew Wang et al.
NEURIPS 2025, arXiv:2507.10741
1 citation
Learning Human-Like RL Agents Through Trajectory Optimization With Action Quantization
Jian-Ting Guo, Yu-Cheng Chen, Ping-Chun Hsieh et al.
NEURIPS 2025, arXiv:2511.15055
OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code
Maxence Faldor, Jenny Zhang, Antoine Cully et al.
ICLR 2025, arXiv:2405.15568
48 citations
Prevalence of Negative Transfer in Continual Reinforcement Learning: Analyses and a Simple Baseline
Hongjoon Ahn, Jinu Hyeon, Youngmin Oh et al.
ICLR 2025
2 citations
SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents
Ibragim Badertdinov, Alexander Golubev, Maksim Nekrashevich et al.
NEURIPS 2025, arXiv:2505.20411
33 citations
Transformers Can Learn Temporal Difference Methods for In-Context Reinforcement Learning
Jiuqi Wang, Ethan Blaser, Hadi Daneshmand et al.
ICLR 2025, oral, arXiv:2405.13861
15 citations
DRED: Zero-Shot Transfer in Reinforcement Learning via Data-Regularised Environment Design
Samuel Garcin, James Doran, Shangmin Guo et al.
ICML 2024, arXiv:2402.03479
11 citations
Refining Minimax Regret for Unsupervised Environment Design
Michael Beukman, Samuel Coward, Michael Matthews et al.
ICML 2024, arXiv:2402.12284
15 citations
Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL
Xiangyu Liu, Souradip Chakraborty, Yanchao Sun et al.
ICLR 2024, arXiv:2305.17342
9 citations
Scalable Real-Time Recurrent Learning Using Columnar-Constructive Networks
Khurram Javed, Haseeb Shah, Richard Sutton et al.
ICML 2024, arXiv:2302.05326
10 citations
Think Before You Act: Decision Transformers with Working Memory
Jikun Kang, Romain Laroche, Xingdi Yuan et al.
ICML 2024, arXiv:2305.16338