Poster "self-correction mechanisms" Papers
3 papers found
Conference
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
Mingyang Chen, Linzhuang Sun, Tianpeng Li et al.
NEURIPS 2025arXiv:2503.19470
57
citations
SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction
Ling Yang, Zhaochen Yu, Tianjun Zhang et al.
ICLR 2025arXiv:2410.09008
15
citations
Training Large Language Models for Retrieval-Augmented Question Answering through Backtracking Correction
Huawen Feng, ZekunYao, Junhao Zheng et al.
ICLR 2025
1
citations