Phase Diagram of Initial Condensation for Two-Layer Neural Networks

Zheng-An Chen , Yuqing Li , Tao Luo , Zhangchen Zhou , Zhi-Qin John Xu

CSIAM Trans. Appl. Math. ›› 2024, Vol. 5 ›› Issue (3) : 448 -514.

PDF (66KB)
CSIAM Trans. Appl. Math. ›› 2024, Vol. 5 ›› Issue (3) : 448 -514. DOI: 10.4208/csiam-am.SO-2023-0016
research-article

Phase Diagram of Initial Condensation for Two-Layer Neural Networks

Author information +
History +
PDF (66KB)

Abstract

The phenomenon of distinct behaviors exhibited by neural networks under varying scales of initialization remains an enigma in deep learning research. In this paper, based on the earlier work [Luo et al., J. Mach. Learn. Res., 22:1-47, 2021], we present a phase diagram of initial condensation for two-layer neural networks. Condensation is a phenomenon wherein the weight vectors of neural networks concentrate on isolated orientations during the training process, and it is a feature in non-linear learning process that enables neural networks to possess better generalization abilities. Our phase diagram serves to provide a comprehensive understanding of the dynamical regimes of neural networks and their dependence on the choice of hyperparameters related to initialization. Furthermore, we demonstrate in detail the underlying mechanisms by which small initialization leads to condensation at the initial training stage.

Keywords

Two-layer neural network / phase diagram / dynamical regime / condensation

Cite this article

Download citation ▾
Zheng-An Chen, Yuqing Li, Tao Luo, Zhangchen Zhou, Zhi-Qin John Xu. Phase Diagram of Initial Condensation for Two-Layer Neural Networks. CSIAM Trans. Appl. Math., 2024, 5(3): 448-514 DOI:10.4208/csiam-am.SO-2023-0016

登录浏览全文

4963

注册一个新账户 忘记密码

References

AI Summary AI Mindmap
PDF (66KB)

93

Accesses

0

Citation

Detail

Sections
Recommended

AI思维导图

/