Content area

Abstract

We address the problem of estimating statistics of hidden units in a neural network using a method of analytic moment propagation. These statistics are useful for approximate whitening of the inputs in front of saturating non-linearities such as a sigmoid function. This is important for initialization of training and for reducing the accumulated scale and bias dependencies (compensating covariate shift), which presumably eases the learning. In batch normalization, which is currently a very widely applied technique, sample estimates of statistics of hidden units over a batch are used. The proposed estimation uses an analytic propagation of mean and variance of the training set through the network. The result depends on the network structure and its current weights but not on the specific batch input. The estimates are suitable for initialization and normalization, efficient to compute and independent of the batch size. The experimental verification well supports these claims. However, the method does not share the generalization properties of BN, to which our experiments give some additional insight.

Details

1009240
Title
Normalization of Neural Networks using Analytic Variance Propagation
Publication title
arXiv.org; Ithaca
Publication year
2018
Publication date
Mar 28, 2018
Section
Computer Science; Statistics
Publisher
Cornell University Library, arXiv.org
Source
arXiv.org
Place of publication
Ithaca
Country of publication
United States
University/institution
Cornell University Library arXiv.org
e-ISSN
2331-8422
Source type
Working Paper
Language of publication
English
Document type
Working Paper
Publication history
 
 
Online publication date
2018-03-29
Milestone dates
2018-03-28 (Submission v1)
Publication history
 
 
   First posting date
29 Mar 2018
ProQuest document ID
2071937723
Document URL
https://www.proquest.com/working-papers/normalization-neural-networks-using-analytic/docview/2071937723/se-2?accountid=208611
Full text outside of ProQuest
Copyright
© 2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Last updated
2019-04-12
Database
ProQuest One Academic