Poster "reinforcement fine-tuning" Papers
4 papers found
BlueSuffix: Reinforced Blue Teaming for Vision-Language Models Against Jailbreak Attacks
Yunhan Zhao, Xiang Zheng, Lin Luo et al.
ICLR 2025 | arXiv:2410.20971
20 citations
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics
Enshen Zhou, Jingkun An, Cheng Chi et al.
NeurIPS 2025 | arXiv:2506.04308
58 citations
SEEA-R1: Tree-Structured Reinforcement Fine-Tuning for Self-Evolving Embodied Agents
Wanxin Tian, Shijie Zhang, Kevin Zhang et al.
NeurIPS 2025 | arXiv:2506.21669
6 citations
Visual-RFT: Visual Reinforcement Fine-Tuning
Ziyu Liu, Zeyi Sun, Yuhang Zang et al.
ICCV 2025 | arXiv:2503.01785
357 citations