Jiayi Zhou
4 papers, 17 total citations
papers (4)
Safe RLHF-V: Safe Reinforcement Learning from Multi-modal Human Feedback
NEURIPS 2025, arXiv, 9 citations
Generative RLHF-V: Learning Principles from Multi-modal Human Preference
NEURIPS 2025, arXiv, 7 citations
InterMT: Multi-Turn Interleaved Preference Alignment with Human Feedback
NEURIPS 2025, arXiv, 1 citation
Safety Gymnasium: A Unified Safe Reinforcement Learning Benchmark
NEURIPS 2023, 0 citations