Abstract
A better understanding of pruning methods based on a ranking of weights according to their saliency in a trained network requires further information on the statistical properties of such saliencies. We focus on two-layer networks with either a linear or nonlinear output unit, and obtain analytic expressions for the distribution of saliencies and their logarithms. Our results reveal unexpected universal properties of the log-saliency distribution and suggest a novel algorithm for saliency-based weight ranking that avoids the numerical cost of second derivative evaluations.
Original language | English (US) |
---|---|
Pages (from-to) | 489-498 |
Number of pages | 10 |
Journal | International journal of neural systems |
Volume | 8 |
Issue number | 5-6 |
DOIs | |
State | Published - 1997 |
ASJC Scopus subject areas
- Computer Networks and Communications