Apparently most of the weights in deep learning networks aren't actually needed. From the article: "The lottery ticket hypothesis states that a randomly initialized, dense, feed-forward network contains a pool of subnetworks and among them only a subset are 'winning tickets' which can achieve the optimal performance when trained in isolation."
This seems to explain why networks can be pruned so aggressively after training with little loss in accuracy: most of the weights belong to subnetworks that never won the initialization lottery, so they're effectively dead weight left over from failed attempts.
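For intuition, here's a minimal sketch of the pruning step in PyTorch, using global magnitude pruning (roughly the technique the lottery ticket paper builds on); the model architecture and the 80% sparsity level are just placeholders, not anything specific from the article:

```python
import torch
import torch.nn as nn
import torch.nn.utils.prune as prune

# A small dense feed-forward network, as in the hypothesis statement.
model = nn.Sequential(
    nn.Linear(784, 300), nn.ReLU(),
    nn.Linear(300, 100), nn.ReLU(),
    nn.Linear(100, 10),
)

# ... train the model here as usual ...

# Globally remove the 80% of weights with the smallest magnitudes,
# keeping the top 20% as the candidate "winning ticket".
parameters_to_prune = [
    (module, "weight") for module in model if isinstance(module, nn.Linear)
]
prune.global_unstructured(
    parameters_to_prune,
    pruning_method=prune.L1Unstructured,
    amount=0.8,
)

# Report what fraction of weights survived.
total = sum(m.weight.nelement() for m, _ in parameters_to_prune)
nonzero = sum(int(torch.count_nonzero(m.weight)) for m, _ in parameters_to_prune)
print(f"remaining weights: {nonzero / total:.1%}")
```

The full lottery ticket procedure goes a step further: it rewinds the surviving weights to their original initialization and retrains, iterating the prune/rewind cycle to isolate the winning ticket.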
Very interesting, thanks.