Poster by Jingyan Shen Papers
2 papers found
Conference
Conformal Tail Risk Control for Large Language Model Alignment
Catherine Chen, Jingyan Shen, Xinyu Yang et al.
ICML 2025arXiv:2502.20285
4
citations
Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay
Yifan Sun, Jingyan Shen, Yibin Wang et al.
NEURIPS 2025arXiv:2506.05316
20
citations