"fine-tuning language models" Papers
2 papers found
Conference
Key-Point-Driven Data Synthesis with Its Enhancement on Mathematical Reasoning
Yiming Huang, Xiao Liu, Yeyun Gong et al.
AAAI 2025paperarXiv:2403.02333
65
citations
Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
Jihan Yao, Wenxuan Ding, Shangbin Feng et al.
ICLR 2025arXiv:2410.11055
4
citations