Initial Guessing Bias: How Untrained Networks Favor Some Classes

8 citations

arXiv:2306.00809

#1170 of 2635 papers in ICML 2024

Top Authors

Emanuele Francazi, Aurelien Lucchi, Marco Baity-Jesi

Topics

initial guessing bias, classification problems, dataset preprocessing, activation functions, max-pooling layers, network depth analysis, node-permutation symmetry, self-averaging violation

Abstract

Understanding and controlling biasing effects in neural networks is crucial for ensuring accurate and fair model performance. In the context of classification problems, we provide a theoretical analysis demonstrating that the structure of a deep neural network (DNN) can condition the model to assign all predictions to the same class, even before the beginning of training, and in the absence of explicit biases. We prove that, besides dataset properties, the presence of this phenomenon, which we call Initial Guessing Bias (IGB), is influenced by model choices including dataset preprocessing methods, and architectural decisions, such as activation functions, max-pooling layers, and network depth. Our analysis of IGB provides information for architecture selection and model initialization. We also highlight theoretical consequences, such as the breakdown of node-permutation symmetry, the violation of self-averaging, and the non-trivial effects that depth has on the phenomenon.
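
The effect described in the abstract can be probed numerically. The following is a minimal sketch, not the authors' code: it builds an untrained, bias-free multilayer perceptron and measures what share of random inputs it assigns to each class. Every concrete choice here is an assumption made for illustration: zero-mean Gaussian inputs standing in for a standardized dataset, fan-in-scaled Gaussian weights, two classes, five hidden layers of width 256, and the helper names `class_fractions` and `mean_majority_fraction`. ReLU and tanh are compared only as two examples of the activation-function choice the abstract names.

```python
import numpy as np


def relu(z):
    return np.maximum(z, 0.0)


def class_fractions(activation, seed, depth=5, width=256,
                    n_classes=2, n_samples=2000, dim=100):
    """Fraction of inputs that one untrained MLP assigns to each class."""
    rng = np.random.default_rng(seed)
    # Assumption: zero-mean Gaussian inputs stand in for a standardized dataset.
    h = rng.standard_normal((n_samples, dim))
    fan_in = dim
    for _ in range(depth):
        # Assumption: Gaussian weights scaled by fan-in, no bias terms.
        w = rng.standard_normal((fan_in, width)) * np.sqrt(2.0 / fan_in)
        h = activation(h @ w)
        fan_in = width
    # Linear readout to class logits; the prediction is the argmax.
    w_out = rng.standard_normal((fan_in, n_classes)) / np.sqrt(fan_in)
    preds = (h @ w_out).argmax(axis=1)
    return np.bincount(preds, minlength=n_classes) / n_samples


def mean_majority_fraction(activation, n_seeds=10):
    """Share of the most-predicted class, averaged over initializations."""
    shares = [class_fractions(activation, seed).max() for seed in range(n_seeds)]
    return float(np.mean(shares))


if __name__ == "__main__":
    print("ReLU majority share:", mean_majority_fraction(relu))
    print("tanh majority share:", mean_majority_fraction(np.tanh))
```

Under these assumptions, an unbiased initial guess would put the majority share close to 1/2 for two classes. In this setup the tanh network should stay near that value, since the bias-free network is an odd function of a symmetric input distribution, while the ReLU network tends to sit well above it. Averaging over seeds matters because the favored class, and how strongly it is favored, changes from one initialization to the next.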

Citation History

[Chart: citation count over time, Jan 28, 2026 to Feb 13, 2026]