1
citations
#2497
in NEURIPS 2025
of 5858 papers
5
Top Authors
7
Data Points
Topics
Abstract
We establish a lower bound on the eluder dimension of generalised linear model classes, showing that standard eluder dimension-based analysis cannot lead to first-order regret bounds. To address this, we introduce a localisation method for the eluder dimension; our analysis immediately recovers and improves on classic results for Bernoulli bandits, and allows for the first genuine first-order bounds for finite-horizon reinforcement learning tasks with bounded cumulative returns.
Citation History
Jan 25, 2026
0
Jan 27, 2026
0
Jan 27, 2026
0
Jan 31, 2026
0
Feb 13, 2026
1+1
Feb 13, 2026
1
Feb 13, 2026
1