α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Qining Zhang
Qining Zhang
2
papers
17
total citations
papers (2)
Zeroth-Order Policy Gradient for Reinforcement Learning from Human Feedback without Reward Inference
ICLR 2025
arXiv
10
citations
Fast and Regret Optimal Best Arm Identification: Fundamental Limits and Low-Complexity Algorithms
NEURIPS 2023
arXiv
7
citations