Poster "continuous action spaces" Papers
3 papers found
Conference
Efficient and Near-Optimal Algorithm for Contextual Dueling Bandits with Offline Regression Oracles
Aadirupa Saha, Robert Schapire
NEURIPS 2025
Learning a Diffusion Model Policy from Rewards via Q-Score Matching
Michael Psenka, Alejandro Escontrela, Pieter Abbeel et al.
ICML 2024arXiv:2312.11752
70
citations
Run-Time Task Composition with Safety Semantics
Kevin Leahy, Makai Mann, Zachary Serlin
ICML 2024