Cargando…

Enhancing the accuracies by performing pooling decisions adjacent to the output layer

Learning classification tasks of [Formula: see text] inputs typically consist of [Formula: see text] ) max-pooling (MP) operators along the entire feedforward deep architecture. Here we show, using the CIFAR-10 database, that pooling decisions adjacent to the last convolutional layer significantly e...

Descripción completa

Detalles Bibliográficos
Autores principales: Meir, Yuval, Tzach, Yarden, Gross, Ronit D., Tevet, Ofek, Vardi, Roni, Kanter, Ido
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10471572/
https://www.ncbi.nlm.nih.gov/pubmed/37652973
http://dx.doi.org/10.1038/s41598-023-40566-y
_version_ 1785099878838304768
author Meir, Yuval
Tzach, Yarden
Gross, Ronit D.
Tevet, Ofek
Vardi, Roni
Kanter, Ido
author_facet Meir, Yuval
Tzach, Yarden
Gross, Ronit D.
Tevet, Ofek
Vardi, Roni
Kanter, Ido
author_sort Meir, Yuval
collection PubMed
description Learning classification tasks of [Formula: see text] inputs typically consist of [Formula: see text] ) max-pooling (MP) operators along the entire feedforward deep architecture. Here we show, using the CIFAR-10 database, that pooling decisions adjacent to the last convolutional layer significantly enhance accuracies. In particular, average accuracies of the advanced-VGG with [Formula: see text] layers (A-VGGm) architectures are 0.936, 0.940, 0.954, 0.955, and 0.955 for m = 6, 8, 14, 13, and 16, respectively. The results indicate A-VGG8’s accuracy is superior to VGG16’s, and that the accuracies of A-VGG13 and A-VGG16 are equal, and comparable to that of Wide-ResNet16. In addition, replacing the three fully connected (FC) layers with one FC layer, A-VGG6 and A-VGG14, or with several linear activation FC layers, yielded similar accuracies. These significantly enhanced accuracies stem from training the most influential input–output routes, in comparison to the inferior routes selected following multiple MP decisions along the deep architecture. In addition, accuracies are sensitive to the order of the non-commutative MP and average pooling operators adjacent to the output layer, varying the number and location of training routes. The results call for the reexamination of previously proposed deep architectures and their accuracies by utilizing the proposed pooling strategy adjacent to the output layer.
format Online
Article
Text
id pubmed-10471572
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-104715722023-09-02 Enhancing the accuracies by performing pooling decisions adjacent to the output layer Meir, Yuval Tzach, Yarden Gross, Ronit D. Tevet, Ofek Vardi, Roni Kanter, Ido Sci Rep Article Learning classification tasks of [Formula: see text] inputs typically consist of [Formula: see text] ) max-pooling (MP) operators along the entire feedforward deep architecture. Here we show, using the CIFAR-10 database, that pooling decisions adjacent to the last convolutional layer significantly enhance accuracies. In particular, average accuracies of the advanced-VGG with [Formula: see text] layers (A-VGGm) architectures are 0.936, 0.940, 0.954, 0.955, and 0.955 for m = 6, 8, 14, 13, and 16, respectively. The results indicate A-VGG8’s accuracy is superior to VGG16’s, and that the accuracies of A-VGG13 and A-VGG16 are equal, and comparable to that of Wide-ResNet16. In addition, replacing the three fully connected (FC) layers with one FC layer, A-VGG6 and A-VGG14, or with several linear activation FC layers, yielded similar accuracies. These significantly enhanced accuracies stem from training the most influential input–output routes, in comparison to the inferior routes selected following multiple MP decisions along the deep architecture. In addition, accuracies are sensitive to the order of the non-commutative MP and average pooling operators adjacent to the output layer, varying the number and location of training routes. The results call for the reexamination of previously proposed deep architectures and their accuracies by utilizing the proposed pooling strategy adjacent to the output layer. Nature Publishing Group UK 2023-08-31 /pmc/articles/PMC10471572/ /pubmed/37652973 http://dx.doi.org/10.1038/s41598-023-40566-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Meir, Yuval
Tzach, Yarden
Gross, Ronit D.
Tevet, Ofek
Vardi, Roni
Kanter, Ido
Enhancing the accuracies by performing pooling decisions adjacent to the output layer
title Enhancing the accuracies by performing pooling decisions adjacent to the output layer
title_full Enhancing the accuracies by performing pooling decisions adjacent to the output layer
title_fullStr Enhancing the accuracies by performing pooling decisions adjacent to the output layer
title_full_unstemmed Enhancing the accuracies by performing pooling decisions adjacent to the output layer
title_short Enhancing the accuracies by performing pooling decisions adjacent to the output layer
title_sort enhancing the accuracies by performing pooling decisions adjacent to the output layer
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10471572/
https://www.ncbi.nlm.nih.gov/pubmed/37652973
http://dx.doi.org/10.1038/s41598-023-40566-y
work_keys_str_mv AT meiryuval enhancingtheaccuraciesbyperformingpoolingdecisionsadjacenttotheoutputlayer
AT tzachyarden enhancingtheaccuraciesbyperformingpoolingdecisionsadjacenttotheoutputlayer
AT grossronitd enhancingtheaccuraciesbyperformingpoolingdecisionsadjacenttotheoutputlayer
AT tevetofek enhancingtheaccuraciesbyperformingpoolingdecisionsadjacenttotheoutputlayer
AT vardironi enhancingtheaccuraciesbyperformingpoolingdecisionsadjacenttotheoutputlayer
AT kanterido enhancingtheaccuraciesbyperformingpoolingdecisionsadjacenttotheoutputlayer