"sample efficiency" Papers

72 papers found • Page 2 of 2

Hieros: Hierarchical Imagination on Structured State Space Sequence World Models

Paul Mattes, Rainer Schlosser, Ralf Herbrich

ICML 2024arXiv:2310.05167
8
citations

How Does Goal Relabeling Improve Sample Efficiency?

Sirui Zheng, Chenjia Bai, Zhuoran Yang et al.

ICML 2024

Hybrid Inverse Reinforcement Learning

Juntao Ren, Gokul Swamy, Steven Wu et al.

ICML 2024oralarXiv:2402.08848
29
citations

Learning to Play Atari in a World of Tokens

Pranav Agarwal, Sheldon Andrews, Samira Ebrahimi Kahou

ICML 2024arXiv:2406.01361
6
citations

Leaving the Nest: Going beyond Local Loss Functions for Predict-Then-Optimize

Sanket Shah, Bryan Wilder, Andrew Perrault et al.

AAAI 2024paperarXiv:2305.16830
20
citations

LLM-Empowered State Representation for Reinforcement Learning

Boyuan Wang, Yun Qu, Yuhang Jiang et al.

ICML 2024arXiv:2407.13237
24
citations

Model-based Reinforcement Learning for Parameterized Action Spaces

Renhao Zhang, Haotian Fu, Yilin Miao et al.

ICML 2024arXiv:2404.03037
8
citations

Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL

Yu Luo, Tianying Ji, Fuchun Sun et al.

ICML 2024arXiv:2405.18520
7
citations

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Michal Nauman, Michał Bortkiewicz, Piotr Milos et al.

ICML 2024arXiv:2403.00514
41
citations

Quality-Diversity with Limited Resources

Ren-Jian Wang, Ke Xue, Cong Guan et al.

ICML 2024arXiv:2406.03731
3
citations

Reflective Policy Optimization

Yaozhong Gan, yan renye, zhe wu et al.

ICML 2024arXiv:2406.03678
2
citations

Reinforcement Learning within Tree Search for Fast Macro Placement

Zijie Geng, Jie Wang, Ziyan Liu et al.

ICML 2024

Reward Shaping for Reinforcement Learning with An Assistant Reward Agent

Haozhe Ma, Kuankuan Sima, Thanh Vinh Vo et al.

ICML 2024

Rich-Observation Reinforcement Learning with Continuous Latent Dynamics

Yuda Song, Lili Wu, Dylan Foster et al.

ICML 2024arXiv:2405.19269
2
citations

Sample-Efficient Multiagent Reinforcement Learning with Reset Replay

Yaodong Yang, Guangyong Chen, Jianye Hao et al.

ICML 2024

Sample Efficient Myopic Exploration Through Multitask Reinforcement Learning with Diverse Tasks

Ziping Xu, Zifan Xu, Runxuan Jiang et al.

ICLR 2024arXiv:2403.01636
2
citations

SAPG: Split and Aggregate Policy Gradients

Jayesh Singla, Ananye Agarwal, Deepak Pathak

ICML 2024arXiv:2407.20230
13
citations

Seizing Serendipity: Exploiting the Value of Past Success in Off-Policy Actor-Critic

Tianying Ji, Yu Luo, Fuchun Sun et al.

ICML 2024arXiv:2306.02865
21
citations

SiT: Symmetry-invariant Transformers for Generalisation in Reinforcement Learning

Matthias Weissenbacher, Rishabh Agarwal, Yoshinobu Kawahara

ICML 2024arXiv:2406.15025
1
citations

Symmetric Replay Training: Enhancing Sample Efficiency in Deep Reinforcement Learning for Combinatorial Optimization

Hyeonah Kim, Minsu Kim, Sungsoo Ahn et al.

ICML 2024arXiv:2306.01276
9
citations

Uncertainty-Aware Reward-Free Exploration with General Function Approximation

Junkai Zhang, Weitong Zhang, Dongruo Zhou et al.

ICML 2024arXiv:2406.16255
5
citations

Value-Evolutionary-Based Reinforcement Learning

Pengyi Li, Jianye Hao, Hongyao Tang et al.

ICML 2024oral