Global Convergence of Four-Layer Matrix Factorization under Random Initialization
arXiv:2511.09925v2 Announce Type: replace-cross Abstract: Gradient descent dynamics on the deep matrix factorization problem is extensively studied as a simplified theoretical model for deep neural networks. Although the convergence theory for two-layer matrix factorization is well-established, no global convergence guarantee…
