ResearchAlpha Leak

Conferences Topics Top Authors Rankings Browse All

Home/Authors/Archit Sharma

Archit Sharma

2

Affiliations

Affiliations

GoogleStanford

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 15, 2026, 3:09 AM AMS

7

papers

7,479

total citations

papers (7)

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

NEURIPS 2023arXiv

Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data

Autonomous Reinforcement Learning via Subgoal Curricula

NEURIPS 2021arXiv

You Only Live Once: Single-Life Reinforcement Learning

NEURIPS 2022arXiv

When to Ask for Help: Proactive Interventions in Autonomous Reinforcement Learning

NEURIPS 2022arXiv

Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval

RLVF: Learning from Verbal Feedback without Overgeneralization