Banghua Zhu
8 papers | 1,812 total citations
Papers (8)
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
ICML 2024 | arXiv | 1,026 citations

From Crowdsourced Data to High-quality Benchmarks: Arena-Hard and Benchbuilder Pipeline
ICML 2025 | arXiv | 357 citations

Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
NeurIPS 2021 | arXiv | 318 citations

How to Evaluate Reward Models for RLHF
ICLR 2025 | arXiv | 58 citations

Iterative Data Smoothing: Mitigating Reward Overfitting and Overoptimization in RLHF
ICML 2024 | arXiv | 48 citations

The Effective Horizon Explains Deep RL Performance in Stochastic Environments
ICLR 2024 | arXiv | 5 citations

Towards Optimal Caching and Model Selection for Large Model Inference
NeurIPS 2023 | 0 citations

Doubly-Robust Self-Training
NeurIPS 2023 | 0 citations