α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Anikait Singh
Anikait Singh
5
papers
712
total citations
papers (5)
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
COLM 2025
arXiv
318
citations
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning
NEURIPS 2023
arXiv
200
citations
Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data
ICML 2024
arXiv
179
citations
Personalized Preference Fine-tuning of Diffusion Models
CVPR 2025
arXiv
15
citations
ReDS: Offline RL With Heteroskedastic Datasets via Support Constraints
NEURIPS 2023
0
citations