α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Ashutosh Baheti
Ashutosh Baheti
1
papers
15
total citations
papers (1)
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models
ICLR 2024
arXiv
15
citations