"partial observability" Papers

27 papers found

COMBO: Compositional World Models for Embodied Multi-Agent Cooperation

Hongxin Zhang, Zeyuan Wang, Qiushi Lyu et al.

ICLR 2025arXiv:2404.10775
37
citations

DyWA: Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation

Jiangran Lyu, Ziming Li, Xuesong Shi et al.

ICCV 2025arXiv:2503.16806
14
citations

Exponential Topology-enabled Scalable Communication in Multi-agent Reinforcement Learning

Xinran Li, Xiaolu Wang, Chenjia Bai et al.

ICLR 2025arXiv:2502.19717
5
citations

Forecasting in Offline Reinforcement Learning for Non-stationary Environments

Suzan Ece Ada, Georg Martius, Emre Ugur et al.

NEURIPS 2025spotlightarXiv:2512.01987

Mixture of Attentions For Speculative Decoding

Matthieu Zimmer, Milan Gritta, Gerasimos Lampouras et al.

ICLR 2025arXiv:2410.03804
14
citations

Multi-Environment POMDPs: Discrete Model Uncertainty Under Partial Observability

Eline M. Bovy, Caleb Probine, Marnix Suilen et al.

NEURIPS 2025arXiv:2510.23744

On Evaluating Policies for Robust POMDPs

Merlijn Krale, Eline M. Bovy, Maris F. L. Galesloot et al.

NEURIPS 2025

On Minimizing Adversarial Counterfactual Error in Adversarial Reinforcement Learning

Roman Belaire, Arunesh Sinha, Pradeep Varakantham

ICLR 2025
1
citations

On Shallow Planning Under Partial Observability

Randy Lefebvre, Audrey Durand

AAAI 2025paperarXiv:2407.15820
2
citations

Predictive Coding Enhances Meta-RL To Achieve Interpretable Bayes-Optimal Belief Representation Under Partial Observability

Po-Chen Kuo, Han Hou, Will Dabney et al.

NEURIPS 2025arXiv:2510.22039

Quantifying Generalisation in Imitation Learning

Nathan Gavenski, Odinaldo Rodrigues

NEURIPS 2025arXiv:2509.24784

Real-World Reinforcement Learning of Active Perception Behaviors

Edward Hu, Jie Wang, Xingfang Yuan et al.

NEURIPS 2025arXiv:2512.01188

REVECA: Adaptive Planning and Trajectory-Based Validation in Cooperative Language Agents Using Information Relevance and Relative Proximity

SeungWon Seo, SeongRae Noh, Junhyeok Lee et al.

AAAI 2025paperarXiv:2405.16751
6
citations

Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives

Marius Belly, Nathanaël Fijalkow, Hugo Gimbert et al.

AAAI 2025paperarXiv:2412.12063
5
citations

Stabilizing LTI Systems under Partial Observability: Sample Complexity and Fundamental Limits

Ziyi Zhang, Yorie Nakahira, Guannan Qu

NEURIPS 2025
1
citations

Student-Informed Teacher Training

Nico Messikommer, Jiaxu Xing, Elie Aljalbout et al.

ICLR 2025arXiv:2412.09149
6
citations

To Distill or Decide? Understanding the Algorithmic Trade-off in Partially Observable RL

Yuda Song, Dhruv Rohatgi, Aarti Singh et al.

NEURIPS 2025spotlight
1
citations

Trajectory-Class-Aware Multi-Agent Reinforcement Learning

Hyungho Na, Kwanghyeon Lee, Sumin Lee et al.

ICLR 2025arXiv:2503.01440
1
citations

A Sparsity Principle for Partially Observable Causal Representation Learning

Danru Xu, Dingling Yao, Sébastien Lachapelle et al.

ICML 2024arXiv:2403.08335
22
citations

Constrained Bayesian Optimization under Partial Observations: Balanced Improvements and Provable Convergence

Shengbo Wang, Ke Li

AAAI 2024paperarXiv:2312.03212
19
citations

FoX: Formation-Aware Exploration in Multi-Agent Reinforcement Learning

Yonghyeon Jo, Sunwoo Lee, Junghyuk Yum et al.

AAAI 2024paperarXiv:2308.11272
16
citations

How to Explore with Belief: State Entropy Maximization in POMDPs

Riccardo Zamboni, Duilio Cirino, Marcello Restelli et al.

ICML 2024arXiv:2406.02295
6
citations

Learning the Causal Structure of Networked Dynamical Systems under Latent Nodes and Structured Noise

Augusto Santos, Diogo Rente, Rui Seabra et al.

AAAI 2024paperarXiv:2312.05974
7
citations

Learning to Play Atari in a World of Tokens

Pranav Agarwal, Sheldon Andrews, Samira Ebrahimi Kahou

ICML 2024arXiv:2406.01361
6
citations

Model-based Reinforcement Learning for Confounded POMDPs

Mao Hong, Zhengling Qi, Yanxun Xu

ICML 2024

Rethinking Transformers in Solving POMDPs

Chenhao Lu, Ruizhe Shi, Yuyao Liu et al.

ICML 2024arXiv:2405.17358
9
citations

Task Planning for Object Rearrangement in Multi-Room Environments

Karan Mirakhor, Sourav Ghosh, Dipanjan Das et al.

AAAI 2024paperarXiv:2406.00451
2
citations