"model-based reinforcement learning" Papers

33 papers found

Bootstrap Off-policy with World Model

Guojian Zhan, Likun Wang, Xiangteng Zhang et al.

NEURIPS 2025arXiv:2511.00423
1
citations

Differentiable Information Enhanced Model-Based Reinforcement Learning

Xiaoyuan Zhang, Xinyan Cai, Bo Liu et al.

AAAI 2025paperarXiv:2503.01178
3
citations

Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient

Wenlong Wang, Ivana Dusparic, Yucheng Shi et al.

ICLR 2025arXiv:2410.08893
3
citations

DyMoDreamer: World Modeling with Dynamic Modulation

Boxuan Zhang, Runqing Wang, Wei Xiao et al.

NEURIPS 2025oralarXiv:2509.24804

Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling

Jasmine Bayrooti, Carl Ek, Amanda Prorok

ICLR 2025arXiv:2410.04988
3
citations

FOSP: Fine-tuning Offline Safe Policy through World Models

Chenyang Cao, Yucheng Xin, Silang Wu et al.

ICLR 2025arXiv:2407.04942
3
citations

GLAM: Global-Local Variation Awareness in Mamba-based World Model

Qian He, Wenqi Liang, Chunhui Hao et al.

AAAI 2025paperarXiv:2501.11949
1
citations

Improving Model-Based Reinforcement Learning by Converging to Flatter Minima

Shrinivas Ramasubramanian, Benjamin Freed, Alexandre Capone et al.

NEURIPS 2025

Learning Transformer-based World Models with Contrastive Predictive Coding

Maxime Burchi, Radu Timofte

ICLR 2025oralarXiv:2503.04416
13
citations

Model-based Offline Reinforcement Learning with Lower Expectile Q-Learning

Kwanyoung Park, Youngwoon Lee

ICLR 2025arXiv:2407.00699
5
citations

Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds

Zhiyong Wang, Dongruo Zhou, John C.S. Lui et al.

ICLR 2025arXiv:2408.08994
10
citations

Neural Stochastic Differential Equations for Uncertainty-Aware Offline RL

Cevahir Koprulu, Franck Djeumou, ufuk topcu

ICLR 2025

On Rollouts in Model-Based Reinforcement Learning

Bernd Frauenknecht, Devdutt Subhasish, Friedrich Solowjow et al.

ICLR 2025arXiv:2501.16918
1
citations

Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2)

Zhenjie Yang, Xiaosong Jia, Qifeng Li et al.

NEURIPS 2025arXiv:2505.16394
21
citations

Risk-Sensitive Variational Actor-Critic: A Model-Based Approach

Alonso Granados, Mohammadreza Ebrahimi, Jason Pacheco

ICLR 2025
1
citations

SOMBRL: Scalable and Optimistic Model-Based RL

Bhavya, Lenart Treven, Carmelo Sferrazza et al.

NEURIPS 2025arXiv:2511.20066
3
citations

Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning

Dongsu Lee, Minhae Kwon

ICML 2025oralarXiv:2505.13144
3
citations

The World Is Bigger: A Computationally-Embedded Perspective on the Big World Hypothesis

Alex Lewandowski, Aditya Ramesh, Edan Meyer et al.

NEURIPS 2025spotlightarXiv:2512.23419

Towards Empowerment Gain through Causal Structure Learning in Model-Based Reinforcement Learning

Hongye Cao, Fan Feng, Meng Fang et al.

ICLR 2025
1
citations

Zero-shot Model-based Reinforcement Learning using Large Language Models

Abdelhakim Benechehab, Youssef Attia El Hili, Ambroise Odonnat et al.

ICLR 2025arXiv:2410.11711
5
citations

Coprocessor Actor Critic: A Model-Based Reinforcement Learning Approach For Adaptive Brain Stimulation

Michelle Pan, Mariah Schrum, Vivek Myers et al.

ICML 2024arXiv:2406.06714
2
citations

Dr. Strategy: Model-Based Generalist Agents with Strategic Dreaming

Hany Hamed, Subin Kim, Dongyeong Kim et al.

ICML 2024arXiv:2402.18866
6
citations

HarmonyDream: Task Harmonization Inside World Models

Haoyu Ma, Jialong Wu, Ningya Feng et al.

ICML 2024arXiv:2310.00344
18
citations

Learning Latent Dynamic Robust Representations for World Models

Ruixiang Sun, Hongyu Zang, Xin Li et al.

ICML 2024oralarXiv:2405.06263
12
citations

Learning to Play Atari in a World of Tokens

Pranav Agarwal, Sheldon Andrews, Samira Ebrahimi Kahou

ICML 2024arXiv:2406.01361
6
citations

Locality Sensitive Sparse Encoding for Learning World Models Online

Zichen Liu, Chao Du, Wee Sun Lee et al.

ICLR 2024arXiv:2401.13034
18
citations

Model-based Reinforcement Learning for Confounded POMDPs

Mao Hong, Zhengling Qi, Yanxun Xu

ICML 2024

Model-based Reinforcement Learning for Parameterized Action Spaces

Renhao Zhang, Haotian Fu, Yilin Miao et al.

ICML 2024arXiv:2404.03037
8
citations

Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL

Jiawei Huang, Niao He, Andreas Krause

ICML 2024arXiv:2402.05724
8
citations

SeMOPO: Learning High-quality Model and Policy from Low-quality Offline Visual Datasets

Shenghua Wan, Ziyuan Chen, Le Gan et al.

ICML 2024arXiv:2406.09486
1
citations

Sequence Compression Speeds Up Credit Assignment in Reinforcement Learning

Aditya A. Ramesh, Kenny Young, Louis Kirsch et al.

ICML 2024oralarXiv:2405.03878
2
citations

Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption

Chenlu Ye, Jiafan He, Quanquan Gu et al.

ICML 2024arXiv:2402.08991
10
citations

Trust the Model Where It Trusts Itself - Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption

Bernd Frauenknecht, Artur Eisele, Devdutt Subhasish et al.

ICML 2024arXiv:2405.19014
5
citations