Shihan Dou
5 papers · 130 total citations

Papers (5)
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
ICML 2024 · arXiv · 58 citations
RMB: Comprehensively Benchmarking Reward Models in LLM Alignment
ICLR 2025 · arXiv · 47 citations
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
ICML 2024 · arXiv · 21 citations
EvaLearn: Quantifying the Learning Capability and Efficiency of LLMs via Sequential Problem Solving
NeurIPS 2025 · arXiv · 4 citations
Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning
AAAI 2025 · 0 citations