Shuang Qiu
9 papers · 344 total citations
Papers (9)

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
ICML 2024 · arXiv · 125 citations

Stylized Neural Painting
CVPR 2021 · arXiv · 108 citations

Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss
NeurIPS 2020 · arXiv · 56 citations

Online Preference Alignment for Language Models via Count-based Exploration
ICLR 2025 · arXiv · 20 citations

Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models
NeurIPS 2025 · arXiv · 18 citations

Forward KL Regularized Preference Optimization for Aligning Diffusion Policies
AAAI 2025 · arXiv · 8 citations

ROPO: Robust Preference Optimization for Large Language Models
ICML 2025 · arXiv · 7 citations

Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
ICML 2024 · arXiv · 1 citation

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation
NeurIPS 2023 · arXiv · 1 citation