Simple techniques for improving deep neural network outcomes on commodity hardware

Authors

Nicholas Christopher A Colina; Carlos E Perez; Francis N. C. Paraan

Source

AIP Conf. Proc. (8th Jagna International Workshop) 1871, 040001 (2017). DOI: 10.1063/1.4996523

Abstract

We benchmark improvements in the performance of deep neural networks (DNN) on the MNIST data test upon implementing two simple modifications to the algorithm that have little overhead computational cost. First is GPU parallelization on a commodity graphics card, and second is initializing the DNN with random orthogonal weight matrices prior to optimization. Eigenspectra analysis of the weight matrices reveal that the initially orthogonal matrices remain nearly orthogonal after training. The probability distributions from which these orthogonal matrices are drawn are also shown to significantly affect the performance of these deep neural networks.

Structure and Dynamics Group

National Institute of Physics

University of the Philippines Diliman