α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Zhiwei He
Zhiwei He
4
papers
52
total citations
papers (4)
Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards
NEURIPS 2025
arXiv
18
citations
Improving Open-Ended Text Generation via Adaptive Decoding
ICML 2024
arXiv
18
citations
Weak-to-Strong Preference Optimization: Stealing Reward from Weak Aligned Model
ICLR 2025
arXiv
15
citations
UAWTrack: Universal 3D Single Object Tracking in Adverse Weather
AAAI 2025
1
citations