Poster "sample complexity analysis" Papers

21 papers found

Finite-Time Bounds for Average-Reward Fitted Q-Iteration

Jongmin Lee, Ernest Ryu

NEURIPS 2025arXiv:2510.17391

FraPPE: Fast and Efficient Preference-Based Pure Exploration

Udvas Das, Apurv Shukla, Debabrota Basu

NEURIPS 2025arXiv:2508.16487

Learning Orthogonal Multi-Index Models: A Fine-Grained Information Exponent Analysis

Yunwei Ren, Jason Lee

NEURIPS 2025arXiv:2410.09678
5
citations

Linear Mixture Distributionally Robust Markov Decision Processes

Zhishuai Liu, Pan Xu

NEURIPS 2025arXiv:2505.18044
5
citations

Model-Free Offline Reinforcement Learning with Enhanced Robustness

Chi Zhang, Zain Ulabedeen Farhat, George Atia et al.

ICLR 2025
5
citations

Offline Actor-Critic for Average Reward MDPs

William Powell, Jeongyeol Kwon, Qiaomin Xie et al.

NEURIPS 2025
73
citations

Outcome-Based Online Reinforcement Learning: Algorithms and Fundamental Limits

Fan Chen, Zeyu Jia, Alexander Rakhlin et al.

NEURIPS 2025arXiv:2505.20268
4
citations

Preference Elicitation for Offline Reinforcement Learning

Alizée Pace, Bernhard Schölkopf, Gunnar Ratsch et al.

ICLR 2025arXiv:2406.18450
2
citations

Probably Approximately Precision and Recall Learning

Lee Cohen, Yishay Mansour, Shay Moran et al.

NEURIPS 2025arXiv:2411.13029
3
citations

Sharp Analysis for KL-Regularized Contextual Bandits and RLHF

Heyang Zhao, Chenlu Ye, Quanquan Gu et al.

NEURIPS 2025arXiv:2411.04625
16
citations

Benign Overfitting in Two-Layer ReLU Convolutional Neural Networks for XOR Data

Xuran Meng, Difan Zou, Yuan Cao

ICML 2024arXiv:2310.01975
10
citations

Federated Offline Reinforcement Learning: Collaborative Single-Policy Coverage Suffices

Jiin Woo, Laixi Shi, Gauri Joshi et al.

ICML 2024arXiv:2402.05876
9
citations

Federated Representation Learning in the Under-Parameterized Regime

Renpu Liu, Cong Shen, Jing Yang

ICML 2024arXiv:2406.04596
11
citations

Graphon Mean Field Games with a Representative Player: Analysis and Learning Algorithm

Fuzhong Zhou, Chenyu Zhang, Xu Chen et al.

ICML 2024arXiv:2405.08005
7
citations

Guarantees for Nonlinear Representation Learning: Non-identical Covariates, Dependent Data, Fewer Samples

Thomas T. Zhang, Bruce Lee, Ingvar Ziemann et al.

ICML 2024arXiv:2410.11227
2
citations

Learning Low-dimensional Latent Dynamics from High-dimensional Observations: Non-asymptotics and Lower Bounds

Yuyang Zhang, Shahriar Talebi, Na Li

ICML 2024arXiv:2405.06089
5
citations

On the sample complexity of conditional independence testing with Von Mises estimator with application to causal discovery

Fateme Jamshidi, Luca Ganassali, Negar Kiyavash

ICML 2024arXiv:2310.13553
6
citations

Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines

Yuchen Li, Alexandre Kirchmeyer, Aashay Mehta et al.

ICML 2024arXiv:2407.21046
5
citations

Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation

Yu Chen, XiangCheng Zhang, Siwei Wang et al.

ICML 2024arXiv:2402.18159
3
citations

Risk-Sensitive Reward-Free Reinforcement Learning with CVaR

Xinyi Ni, Guanlin Liu, Lifeng Lai

ICML 2024

Single-Trajectory Distributionally Robust Reinforcement Learning

Zhipeng Liang, Xiaoteng Ma, Jose Blanchet et al.

ICML 2024arXiv:2301.11721
17
citations