"sequential decision making" Papers

24 papers found

Flow-based Variational Mutual Information: Fast and Flexible Approximations

Caleb Dahlke, Jason Pacheco

ICLR 2025
4
citations

Improved Regret and Contextual Linear Extension for Pandora's Box and Prophet Inequality

Junyan Liu, Ziyun Chen, Kun Wang et al.

NEURIPS 2025arXiv:2505.18828
1
citations

Interactive and Hybrid Imitation Learning: Provably Beating Behavior Cloning

Yichen Li, Chicheng Zhang

NEURIPS 2025arXiv:2412.07057

Markov Persuasion Processes: Learning to Persuade From Scratch

Francesco Bacchiocchi, Francesco Emanuele Stradi, Matteo Castiglioni et al.

NEURIPS 2025arXiv:2402.03077
9
citations

Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization

Zongkai Liu, Qian Lin, Chao Yu et al.

AAAI 2025paperarXiv:2412.07639
8
citations

Optimistic Query Routing in Clustering-based Approximate Maximum Inner Product Search

Sebastian Bruch, Aditya Krishnan, Franco Maria Nardini

NEURIPS 2025arXiv:2405.12207
3
citations

Reasoning in Visual Navigation of End-to-end Trained Agents: A Dynamical Systems Approach

Steeven JANNY, Hervé Poirier, Leonid Antsfeld et al.

CVPR 2025highlightarXiv:2503.08306
4
citations

Reinforcement learning with combinatorial actions for coupled restless bandits

Lily Xu, Bryan Wilder, Elias Khalil et al.

ICLR 2025arXiv:2503.01919
6
citations

Revelations: A Decidable Class of POMDPs with Omega-Regular Objectives

Marius Belly, Nathanaël Fijalkow, Hugo Gimbert et al.

AAAI 2025paperarXiv:2412.12063
5
citations

Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

Hyungjoo Chae, Seonghwan Kim, Junhee Cho et al.

NEURIPS 2025spotlightarXiv:2505.15277
8
citations

Accelerating Look-ahead in Bayesian Optimization: Multilevel Monte Carlo is All you Need

Shangda Yang, Vitaly Zankin, Maximilian Balandat et al.

ICML 2024arXiv:2402.02111
4
citations

A General Framework for Sequential Decision-Making under Adaptivity Constraints

Nuoya Xiong, Zhaoran Wang, Zhuoran Yang

ICML 2024arXiv:2306.14468
6
citations

Bayesian Exploration Networks

Mattie Fellows, Brandon Kaplowitz, Christian Schroeder de Witt et al.

ICML 2024arXiv:2308.13049
4
citations

Bayesian Optimization of Function Networks with Partial Evaluations

Poompol Buathong, Jiayue Wan, Raul Astudillo et al.

ICML 2024arXiv:2311.02146
8
citations

Combining Experimental and Historical Data for Policy Evaluation

Ting Li, Chengchun Shi, Qianglin Wen et al.

ICML 2024arXiv:2406.00317
4
citations

Generalization to New Sequential Decision Making Tasks with In-Context Learning

Sharath Chandra Raparthy, Eric Hambro, Robert Kirk et al.

ICML 2024arXiv:2312.03801
37
citations

Learning to Reach Goals via Diffusion

Vineet Jain, Siamak Ravanbakhsh

ICML 2024arXiv:2310.02505
11
citations

NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

Gengze Zhou, Yicong Hong, Qi Wu

AAAI 2024paperarXiv:2305.16986
283
citations

On Multi-Armed Bandit with Impatient Arms

Yuming Shao, Zhixuan Fang

ICML 2024

Pursuing Overall Welfare in Federated Learning through Sequential Decision Making

Seok-Ju Hahn, Gi-Soo Kim, Junghye Lee

ICML 2024arXiv:2405.20821
2
citations

Reinforcement Learning from Reachability Specifications: PAC Guarantees with Expected Conditional Distance

Jakub Svoboda, Suguman Bansal, Krishnendu Chatterjee

ICML 2024oral

Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making

Parand A. Alamdari, Toryn Q. Klassen, Elliot Creager et al.

ICML 2024arXiv:2312.04772
7
citations

Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills

Kolby Nottingham, Bodhisattwa Prasad Majumder, Bhavana Dalvi et al.

ICML 2024arXiv:2402.03244
12
citations

Zero-Shot Reinforcement Learning via Function Encoders

Tyler Ingebrand, Amy Zhang, Ufuk Topcu

ICML 2024arXiv:2401.17173
13
citations