Poster "markov decision process" Papers
10 papers found
Conference
AutoEdit: Automatic Hyperparameter Tuning for Image Editing
Chau Pham, Quan Dao, Mahesh Bhosale et al.
NEURIPS 2025arXiv:2509.15031
1
citations
Graph-Supported Dynamic Algorithm Configuration for Multi-Objective Combinatorial Optimization
Robbert Reijnen, Yaoxin Wu, Zaharah Bukhsh et al.
ICML 2025arXiv:2505.16471
1
citations
Multi-step Visual Reasoning with Visual Tokens Scaling and Verification
Tianyi Bai, Zengjie Hu, Fupeng Sun et al.
NEURIPS 2025arXiv:2506.07235
14
citations
Multivariate Dynamic Mediation Analysis under a Reinforcement Learning Framework
Lan Luo, Chengchun Shi, Jitao Wang et al.
NEURIPS 2025arXiv:2310.16203
2
citations
Optimizing Backward Policies in GFlowNets via Trajectory Likelihood Maximization
Timofei Gritsaev, Nikita Morozov, Sergey Samsonov et al.
ICLR 2025arXiv:2410.15474
5
citations
AdaNAT: Exploring Adaptive Policy for Token-Based Image Generation
Zanlin Ni, Yulin Wang, Renping Zhou et al.
ECCV 2024arXiv:2409.00342
16
citations
Reinforcement Learning and Regret Bounds for Admission Control
Lucas Weber, Ana Busic, Jiamin ZHU
ICML 2024arXiv:2406.04766
1
citations
Reward Model Learning vs. Direct Policy Optimization: A Comparative Analysis of Learning from Human Preferences
Andi Nika, Debmalya Mandal, Parameswaran Kamalaruban et al.
ICML 2024arXiv:2403.01857
20
citations
RL-CFR: Improving Action Abstraction for Imperfect Information Extensive-Form Games with Reinforcement Learning
Boning Li, Zhixuan Fang, Longbo Huang
ICML 2024arXiv:2403.04344
5
citations
Towards Optimal Adversarial Robust Q-learning with Bellman Infinity-error
Haoran Li, Zicheng Zhang, Wang Luo et al.
ICML 2024arXiv:2402.02165
3
citations