ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Zhiyu Mei

Zhiyu Mei

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 15, 2026, 12:40 AM AMS

3

papers

379

total citations

papers (3)

Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study

AREAL: A Large-Scale Asynchronous Reinforcement Learning System for Language Reasoning

NEURIPS 2025arXiv

SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores