Arian Hosseini
3 papers, 80 total citations
Papers (3)
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models
ICLR 2025, arXiv, 43 citations

When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoning
COLM 2025, 24 citations

Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers
COLM 2025, arXiv, 13 citations