Cargando…

Complexity control by gradient descent in deep networks

Overparametrized deep networks predict well, despite the lack of an explicit complexity control during training, such as an explicit regularization term. For exponential-type loss functions, we solve this puzzle by showing an effective regularization effect of gradient descent in terms of the normal...

Descripción completa

Detalles Bibliográficos
Autores principales:	Poggio, Tomaso, Liao, Qianli, Banburski, Andrzej
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Nature Publishing Group UK 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7039878/ https://www.ncbi.nlm.nih.gov/pubmed/32094327 http://dx.doi.org/10.1038/s41467-020-14663-9

Descripción
Sumario:	Overparametrized deep networks predict well, despite the lack of an explicit complexity control during training, such as an explicit regularization term. For exponential-type loss functions, we solve this puzzle by showing an effective regularization effect of gradient descent in terms of the normalized weights that are relevant for classification.

Complexity control by gradient descent in deep networks

Ejemplares similares