Poster "overparameterization benefits" Papers
2 papers found
Conference
On the Optimization and Generalization of Multi-head Attention
Christos Thrampoulidis, Rouzbeh Ghaderi, Hossein Taheri et al.
ICLR 2025arXiv:2310.12680
44
citations
Prediction Risk and Estimation Risk of the Ridgeless Least Squares Estimator under General Assumptions on Regression Errors
Sungyoon Lee, Sokbae Lee
ICLR 2025arXiv:2305.12883