FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain
arXiv:2505.14826 · 6 citations · #862 of 3340 papers in ICML 2025
Abstract
Supervised fine-tuning (SFT) is the most common way of adapting large language models (LLMs) to a new domain. In this paper, we improve the efficiency of SFT by selecting an informative subset of training examples. Specifically, for a fixed budget of training examples, which determines the computational cost of fine-tuning, we select those that maximize information gain, as measured by the Fisher information matrix of the SFT objective. We approximate it efficiently by linearization at the last layer of the LLM. Our approach is computationally efficient, analyzable, and performs well empirically. We demonstrate this on several problems, with both quantitative results and LLM-as-a-judge evaluations.
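The abstract describes the selection rule only at a high level, but it can be illustrated with a small sketch: a greedy, D-optimal-design-style loop that repeatedly picks the candidate whose last-layer gradient embedding contributes the largest log-determinant increase (information gain) to the running Fisher information estimate. This is an assumption-laden illustration, not the authors' released implementation: the `grads` array of per-example last-layer gradient features, the regularizer `reg`, and the greedy log-det rule are all stand-ins chosen to make the idea concrete.

```python
import numpy as np

def greedy_fisher_selection(grads: np.ndarray, budget: int, reg: float = 1e-3):
    """Greedily pick `budget` examples whose last-layer gradient embeddings
    maximize the log-determinant of the accumulated Fisher information.

    grads : (n_examples, d) array of per-example last-layer gradient features
            (an assumed input; the paper linearizes at the LLM's last layer).
    Returns the list of selected example indices.
    """
    n, d = grads.shape
    A_inv = np.eye(d) / reg           # inverse of the regularized Fisher estimate
    selected = []
    remaining = set(range(n))

    for _ in range(min(budget, n)):
        best_idx, best_gain = None, -np.inf
        for i in remaining:
            g = grads[i]
            # Information gain of adding example i:
            # log det(A + g g^T) - log det(A) = log(1 + g^T A^{-1} g)
            gain = np.log1p(g @ A_inv @ g)
            if gain > best_gain:
                best_idx, best_gain = i, gain
        # Sherman-Morrison rank-1 update of A^{-1} after adding the winner.
        g = grads[best_idx]
        Ag = A_inv @ g
        A_inv -= np.outer(Ag, Ag) / (1.0 + g @ Ag)
        selected.append(best_idx)
        remaining.remove(best_idx)

    return selected
```

The rank-1 Sherman-Morrison update keeps each greedy step at O(d^2) instead of refactorizing the Fisher estimate, which is what makes this style of selection cheap relative to the cost of fine-tuning itself.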
Citation History
0 citations (Jan 28, 2026) → 6 citations (Feb 13, 2026)