"hessian analysis" Papers
3 papers found
From Low Rank Gradient Subspace Stabilization to Low-Rank Weights: Observations, Theories, and Applications
Ajay Jaiswal, Yifan Wang, Lu Yin et al.
ICML 2025, arXiv:2407.11239
20 citations
What Does It Mean to Be a Transformer? Insights from a Theoretical Hessian Analysis
Weronika Ormaniec, Felix Dangel, Sidak Pal Singh
ICLR 2025, arXiv:2410.10986
10 citations
Supervised Matrix Factorization: Local Landscape Analysis and Applications
Joowon Lee, Hanbaek Lyu, Weixin Yao
ICML 2024