Adaptive Learn-then-Test: Statistically Valid and Efficient Hyperparameter Selection

arXiv:2409.15844
ICML 2025 (#558 of 3340 papers)
10 citations

Abstract

We introduce adaptive learn-then-test (aLTT), an efficient hyperparameter selection procedure that provides finite-sample statistical guarantees on the population risk of AI models. Unlike the existing learn-then-test (LTT) technique, which relies on conventional p-value-based multiple hypothesis testing (MHT), aLTT implements sequential data-dependent MHT with early termination by leveraging e-processes. As a result, aLTT can reduce the number of testing rounds, making it particularly well-suited for scenarios in which testing is costly or presents safety risks. Apart from maintaining statistical validity, in applications such as online policy selection for offline reinforcement learning and prompt engineering, aLTT is shown to achieve the same performance as LTT while requiring only a fraction of the testing rounds.
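
The sequential testing idea in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm. It assumes losses bounded in [0, 1], a fixed betting fraction, a Bonferroni-style threshold for family-wise error control, and termination as soon as one candidate is certified. The names `sequential_e_test`, `loss_streams`, and `bet` are hypothetical. For each candidate, the null hypothesis is that its population risk exceeds `alpha`; the running product below is a nonnegative supermartingale under that null, so by Ville's inequality it crosses `k / delta` with probability at most `delta / k`.

```python
import random


def sequential_e_test(loss_streams, alpha, delta, bet=0.5, max_rounds=1000):
    """Sequentially test candidates and stop early once one is certified.

    loss_streams: list of zero-argument callables, each returning a loss in [0, 1]
                  for one hyperparameter candidate (hypothetical interface).
    alpha:        target risk level in (0, 1).
    delta:        allowed family-wise error probability.
    bet:          fixed betting fraction; must keep every factor nonnegative.
    Returns (set of certified candidate indices, number of rounds used).
    """
    assert 0.0 < alpha < 1.0 and 0.0 < delta < 1.0
    assert 0.0 <= bet <= 1.0 / (1.0 - alpha)

    k = len(loss_streams)
    threshold = k / delta          # Bonferroni-corrected rejection level
    e_values = [1.0] * k           # one e-process per candidate
    certified = set()
    rounds_used = 0

    for t in range(max_rounds):
        rounds_used = t + 1
        for i, draw_loss in enumerate(loss_streams):
            loss = draw_loss()
            # Under the null (risk > alpha) this factor has expectation <= 1.
            e_values[i] *= 1.0 + bet * (alpha - loss)
            if e_values[i] >= threshold:
                certified.add(i)
        if certified:
            # Early termination: enough evidence for at least one candidate.
            break

    return certified, rounds_used


if __name__ == "__main__":
    rng = random.Random(0)
    # Two synthetic candidates with Bernoulli losses (assumed risks 0.05 and 0.5).
    streams = [lambda p=p: float(rng.random() < p) for p in (0.05, 0.5)]
    print(sequential_e_test(streams, alpha=0.2, delta=0.1))
```

A fixed-sample p-value test would consume its whole testing budget before deciding, whereas the loop above stops at the first round in which some e-value crosses the threshold. The choice of stopping rule and betting fraction here is an assumption made for brevity.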

Citation History

Jan 28, 2026: 8
Feb 13, 2026: 10 (+2)