α
Research
Alpha Leak
Conferences
Topics
Top Authors
Rankings
Browse All
EN
中
Home
/
Authors
/
Rui Zheng
Rui Zheng
6
papers
205
total citations
papers (6)
SPA-VL: A Comprehensive Safety Preference Alignment Dataset for Vision Language Models
CVPR 2025
arXiv
68
citations
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
ICML 2024
arXiv
58
citations
RMB: Comprehensively benchmarking reward models in LLM alignment
ICLR 2025
arXiv
47
citations
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
ICML 2024
arXiv
21
citations
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs
ICLR 2025
arXiv
11
citations
Alleviating Shifted Distribution in Human Preference Alignment through Meta-Learning
AAAI 2025
0
citations