Poster "markov decision process" Papers

10 papers found

AutoEdit: Automatic Hyperparameter Tuning for Image Editing

Chau Pham, Quan Dao, Mahesh Bhosale et al.

NEURIPS 2025arXiv:2509.15031
1
citations

Graph-Supported Dynamic Algorithm Configuration for Multi-Objective Combinatorial Optimization

Robbert Reijnen, Yaoxin Wu, Zaharah Bukhsh et al.

ICML 2025arXiv:2505.16471
1
citations

Multi-step Visual Reasoning with Visual Tokens Scaling and Verification

Tianyi Bai, Zengjie Hu, Fupeng Sun et al.

NEURIPS 2025arXiv:2506.07235
14
citations

Multivariate Dynamic Mediation Analysis under a Reinforcement Learning Framework

Lan Luo, Chengchun Shi, Jitao Wang et al.

NEURIPS 2025arXiv:2310.16203
2
citations

Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization

Timofei Gritsaev, Nikita Morozov, Sergey Samsonov et al.

ICLR 2025arXiv:2410.15474
5
citations

AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation

Zanlin Ni, Yulin Wang, Renping Zhou et al.

ECCV 2024arXiv:2409.00342
16
citations

Reinforcement Learning and Regret Bounds for Admission Control

Lucas Weber, Ana Busic, Jiamin ZHU

ICML 2024arXiv:2406.04766
1
citations

Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences

Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban et al.

ICML 2024arXiv:2403.01857
20
citations

RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning

Boning Li, Zhixuan Fang, Longbo Huang

ICML 2024arXiv:2403.04344
5
citations

Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error

Haoran Li, Zicheng Zhang, Wang Luo et al.

ICML 2024arXiv:2402.02165
3
citations