α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
T. Sandholm
T. Sandholm
5
papers
95
total citations
papers (5)
Confronting Reward Model Overoptimization with Constrained RLHF
ICLR 2024
arXiv
75
citations
Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations
ICLR 2024
arXiv
12
citations
The Complexity of Symmetric Equilibria in Min-Max Optimization and Team Zero-Sum Games
NEURIPS 2025
arXiv
3
citations
Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence beyond the Minty Property
AAAI 2024
arXiv
3
citations
Mediator Interpretation and Faster Learning Algorithms for Linear Correlated Equilibria in General Sequential Games
ICLR 2024
2
citations