Mapping the Multiverse of Latent Representations

10
citations
#1012
in ICML 2024
of 2635 papers
3
Top Authors
4
Data Points

Abstract

Echoing recent calls to counter reliability and robustness concerns in machine learning viamultiverse analysis, we present PRESTO, a principled framework formapping the multiverseof machine-learning models that rely onlatent representations. Although such models enjoy widespread adoption, the variability in their embeddings remains poorly understood, resulting in unnecessary complexity and untrustworthy representations. Our framework usespersistent homologyto characterize the latent spaces arising from different combinations of diverse machine-learning methods, (hyper)parameter configurations, and datasets, allowing us to measure their pairwise(dis)similarityand statistically reason about theirdistributions. As we demonstrate both theoretically and empirically, our pipeline preserves desirable properties of collections of latent representations, and it can be leveraged to perform sensitivity analysis, detect anomalous embeddings, or efficiently and effectively navigate hyperparameter search spaces.

Citation History

Jan 28, 2026
0
Feb 13, 2026
10+10
Feb 13, 2026
10
Feb 13, 2026
10