α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Hanze Dong
Hanze Dong
5
papers
426
total citations
papers (5)
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-constraint
ICML 2024
arXiv
312
citations
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
ICML 2025
arXiv
77
citations
Spurious Feature Diversification Improves Out-of-distribution Generalization
ICLR 2024
arXiv
34
citations
Faster Sampling via Stochastic Gradient Proximal Sampler
ICML 2024
arXiv
3
citations
Bayesian Invariant Risk Minimization
CVPR 2022
0
citations