α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Takashi Ishida
Takashi Ishida
1
papers
0
total citations
papers (1)
Off-Policy Corrected Reward Modeling for Reinforcement Learning from Human Feedback
COLM 2025
arXiv
0
citations