"instruction-following abilities" Papers
2 papers found
Conference
Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages
Zui Chen, Tianqiao Liu, Tongqing et al.
ICLR 2025arXiv:2501.14002
12
citations
Self-Boosting Large Language Models with Synthetic Preference Data
Qingxiu Dong, Li Dong, Xingxing Zhang et al.
ICLR 2025arXiv:2410.06961
32
citations