ResearchAlpha Leak

Shuang Qiu

Topic trends: 32,543 papers · similarity ≥ 0.4 · year ≥ 2024 · Data sourced from Semantic Scholar

34,598 papers | Abstracts: 31,650 (91.5%) | Citations: 34,598 (100.0%) | arXiv: 26,074 (75.4%)

Built: Feb 14, 2026, 11:22 PM AMS

9 papers · 344 total citations

Papers (9)

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

Stylized Neural Painting

Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss

NeurIPS 2020 · arXiv

Online Preference Alignment for Language Models via Count-based Exploration

Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models

NeurIPS 2025 · arXiv

Forward KL Regularized Preference Optimization for Aligning Diffusion Policies

ROPO: Robust Preference Optimization for Large Language Models

Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

NeurIPS 2023 · arXiv