α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Kianté Brantley
Kianté Brantley
9
papers
123
total citations
papers (9)
Constrained episodic reinforcement learning in concave-convex and knapsack settings
NEURIPS 2020
arXiv
56
citations
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF
ICLR 2025
arXiv
18
citations
Accelerating RL for LLM Reasoning with Optimal Advantage Regression
NEURIPS 2025
arXiv
12
citations
$Q\sharp$: Provably Optimal Distributional RL for LLM Post-Training
NEURIPS 2025
arXiv
12
citations
LLMs Are In-Context Bandit Reinforcement Learners
COLM 2025
arXiv
12
citations
Value-Guided Search for Efficient Chain-of-Thought Reasoning
NEURIPS 2025
arXiv
7
citations
Adversarial Imitation Learning via Boosting
ICLR 2024
arXiv
6
citations
When is Transfer Learning Possible?
ICML 2024
0
citations
Coactive Learning for Large Language Models using Implicit User Feedback
ICML 2024
0
citations