α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Alexander Bukharin
Alexander Bukharin
5
papers
195
total citations
papers (5)
HelpSteer2-Preference: Complementing Ratings with Preferences
ICLR 2025
arXiv
112
citations
HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages
NEURIPS 2025
arXiv
38
citations
Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms
NEURIPS 2023
arXiv
36
citations
Adversarial Training of Reward Models
COLM 2025
arXiv
7
citations
Deep Reinforcement Learning from Hierarchical Preference Design
ICML 2025
arXiv
2
citations