Why Your Neural Network is Still Singular and What You Can Do About It

10 Pages Posted: 16 Apr 2019 Last revised: 11 Jun 2019

See all articles by Jakub Dworakowski

Jakub Dworakowski

Stanford University - Computer Science Department

Pablo Rodriguez Bertorello

Stanford University - Computer Science Department

Date Written: March 21, 2019

Abstract

We investigate the effects of neural network regularization techniques. First, we reason formally through the effect of dropout and training stochasticity on gradient descent. Then, we conduct classification experiments on the ImageNet data set, as well as regression experiments in the OneNow Reinforcement Learning data set. A network layer's weight matrix is quantified via Singular Value Decomposition and Conditioning ratios. Our regression network appeared to be well conditioned. However, we find that learning for large-scale classification applications is likely to be capped by poor conditioning. We propose approaches that may prove breakthroughs in learning, providing early evidence. We introduce a gradient perturbation layer, a method to maximize generalization experimentally. Our numerical analysis showcases the opportunity to introduce network circuitry compression, relying on the principal components of a layer's weights, when conditioning peaks. Generally, we propose conditioning as an objective function constraint.

Keywords: Neural Network, Gradient Descent, Singular Value Decomposition, Conditioning, Classification, Regression, ImageNet, Reinforcement Learning, Regularization

Suggested Citation

Dworakowski, Jakub and Rodriguez Bertorello, Pablo Martin, Why Your Neural Network is Still Singular and What You Can Do About It (March 21, 2019). Available at SSRN: https://ssrn.com/abstract=3360696 or http://dx.doi.org/10.2139/ssrn.3360696

Jakub Dworakowski

Stanford University - Computer Science Department ( email )

353 Serra Mall
Stanford, CA 94305
United States

Pablo Martin Rodriguez Bertorello (Contact Author)

Stanford University - Computer Science Department ( email )

353 Serra Mall
Stanford, CA 94305
United States

Do you have a job opening that you would like to promote on SSRN?

Paper statistics

Downloads
100
Abstract Views
957
Rank
572,226
PlumX Metrics