"reinforcement learning framework" Papers

11 papers found

Filters:reinforcement learning framework Clear all

Conference

AAAI 2025 (3,028)COLM 2025 (418)CVPR 2025 (2,873)ICCV 2025 (2,701)ICLR 2025 (3,827)ICML 2025 (3,340)ISMAR 2025 (229)NEURIPS 2025 (5,858)AAAI 2024 (2,289)CVPR 2024 (2,716)ECCV 2024 (2,387)ICLR 2024 (2,297)ICML 2024 (2,635)

Paper Type

poster (24,624)paper (8,558)oral (1,594)spotlight (1,421)highlight (975)

Critic-V: VLM Critics Help Catch VLM Errors in Multimodal Reasoning

Di Zhang, Jingdi Lei, Junxian Li et al.

CVPR 2025arXiv:2411.18203

citations

FineRS: Fine-grained Reasoning and Segmentation of Small Objects with Reinforcement Learning

Lu Zhang, Jiazuo Yu, Haomiao Xiong et al.

NEURIPS 2025arXiv:2510.21311

citations

Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval

Sheryl Hsu, Omar Khattab, Chelsea Finn et al.

ICLR 2025arXiv:2410.23214

citations

Multivariate Dynamic Mediation Analysis under a Reinforcement Learning Framework

Lan Luo, Chengchun Shi, Jitao Wang et al.

NEURIPS 2025arXiv:2310.16203

citations

Reinforced Context Order Recovery for Adaptive Reasoning and Planning

Long Ma, Fangwei Zhong, Yizhou Wang

NEURIPS 2025arXiv:2508.13070

citations

SC-Captioner: Improving Image Captioning with Self-Correction by Reinforcement Learning

Lin Zhang, Xianfang Zeng, Kangcong Li et al.

ICCV 2025arXiv:2508.06125

citations

Seeing is Believing? Mitigating OCR Hallucinations in Multimodal Large Language Models

zhentao he, Can Zhang, Ziheng Wu et al.

NEURIPS 2025arXiv:2506.20168

citations

Teaching Language Models to Evolve with Users: Dynamic Profile Modeling for Personalized Alignment

Weixiang Zhao, Xingyu Sui, Yulin Hu et al.

NEURIPS 2025arXiv:2505.15456

citations

Train on Pins and Test on Obstacles for Rectilinear Steiner Minimum Tree

Xingbo Du, Ruizhe Zhong, Junchi Yan

NEURIPS 2025

When Thinking Drifts: Evidential Grounding for Robust Video Reasoning

Romy Luo, Zihui (Sherry) Xue, Alex Dimakis et al.

NEURIPS 2025arXiv:2510.06077

citations

Dialogue for Prompting: A Policy-Gradient-Based Discrete Prompt Generation for Few-Shot Learning

Chengzhengxu Li, Xiaoming Liu, Yichen Wang et al.

AAAI 2024paperarXiv:2308.07272

citations