Importance of initialization of weight matrices in deep learning neural networks

Abstract

The success of deep neural networks depends critically on how their weight matrices are initialized. This work reports improved learning in a six-layer deep neural network initialized with orthogonal weight matrices, compared to other commonly used initialization schemes. An analysis of the eigenvalue spectra of the optimized solutions suggests that the space of orthogonal weight matrices lies close to the manifold of learned states.
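
The abstract does not specify how the orthogonal matrices are constructed; a standard approach is to take the Q factor of the QR decomposition of a random Gaussian matrix. The following is a minimal sketch of that construction for a fully connected network; the function name and layer sizes are hypothetical and not taken from the paper.

    import numpy as np

    def orthogonal_init(fan_in, fan_out, gain=1.0, seed=None):
        """Return a (fan_in, fan_out) weight matrix whose rows or
        columns (whichever is shorter) are orthonormal, obtained from
        the QR decomposition of a random Gaussian matrix."""
        rng = np.random.default_rng(seed)
        a = rng.standard_normal((max(fan_in, fan_out), min(fan_in, fan_out)))
        q, r = np.linalg.qr(a)
        # Fix column signs so the result is uniform over orthogonal matrices.
        q *= np.sign(np.diag(r))
        if q.shape != (fan_in, fan_out):
            q = q.T
        return gain * q

    # Hypothetical six-layer network: one weight matrix per layer pair.
    layer_sizes = [784, 512, 512, 512, 512, 10]
    weights = [orthogonal_init(m, n) for m, n in zip(layer_sizes, layer_sizes[1:])]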