Cargando…
Review of deep learning: concepts, CNN architectures, challenges, applications, future directions
In the last few years, the deep learning (DL) computing paradigm has been deemed the Gold Standard in the machine learning (ML) community. Moreover, it has gradually become the most widely used computational approach in the field of ML, thus achieving outstanding results on several complex cognitive...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8010506/ https://www.ncbi.nlm.nih.gov/pubmed/33816053 http://dx.doi.org/10.1186/s40537-021-00444-8 |
_version_ | 1783673079097982976 |
---|---|
author | Alzubaidi, Laith Zhang, Jinglan Humaidi, Amjad J. Al-Dujaili, Ayad Duan, Ye Al-Shamma, Omran Santamaría, J. Fadhel, Mohammed A. Al-Amidie, Muthana Farhan, Laith |
author_facet | Alzubaidi, Laith Zhang, Jinglan Humaidi, Amjad J. Al-Dujaili, Ayad Duan, Ye Al-Shamma, Omran Santamaría, J. Fadhel, Mohammed A. Al-Amidie, Muthana Farhan, Laith |
author_sort | Alzubaidi, Laith |
collection | PubMed |
description | In the last few years, the deep learning (DL) computing paradigm has been deemed the Gold Standard in the machine learning (ML) community. Moreover, it has gradually become the most widely used computational approach in the field of ML, thus achieving outstanding results on several complex cognitive tasks, matching or even beating those provided by human performance. One of the benefits of DL is the ability to learn massive amounts of data. The DL field has grown fast in the last few years and it has been extensively used to successfully address a wide range of traditional applications. More importantly, DL has outperformed well-known ML techniques in many domains, e.g., cybersecurity, natural language processing, bioinformatics, robotics and control, and medical information processing, among many others. Despite it has been contributed several works reviewing the State-of-the-Art on DL, all of them only tackled one aspect of the DL, which leads to an overall lack of knowledge about it. Therefore, in this contribution, we propose using a more holistic approach in order to provide a more suitable starting point from which to develop a full understanding of DL. Specifically, this review attempts to provide a more comprehensive survey of the most important aspects of DL and including those enhancements recently added to the field. In particular, this paper outlines the importance of DL, presents the types of DL techniques and networks. It then presents convolutional neural networks (CNNs) which the most utilized DL network type and describes the development of CNNs architectures together with their main features, e.g., starting with the AlexNet network and closing with the High-Resolution network (HR.Net). Finally, we further present the challenges and suggested solutions to help researchers understand the existing research gaps. It is followed by a list of the major DL applications. Computational tools including FPGA, GPU, and CPU are summarized along with a description of their influence on DL. The paper ends with the evolution matrix, benchmark datasets, and summary and conclusion. |
format | Online Article Text |
id | pubmed-8010506 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-80105062021-03-31 Review of deep learning: concepts, CNN architectures, challenges, applications, future directions Alzubaidi, Laith Zhang, Jinglan Humaidi, Amjad J. Al-Dujaili, Ayad Duan, Ye Al-Shamma, Omran Santamaría, J. Fadhel, Mohammed A. Al-Amidie, Muthana Farhan, Laith J Big Data Survey Paper In the last few years, the deep learning (DL) computing paradigm has been deemed the Gold Standard in the machine learning (ML) community. Moreover, it has gradually become the most widely used computational approach in the field of ML, thus achieving outstanding results on several complex cognitive tasks, matching or even beating those provided by human performance. One of the benefits of DL is the ability to learn massive amounts of data. The DL field has grown fast in the last few years and it has been extensively used to successfully address a wide range of traditional applications. More importantly, DL has outperformed well-known ML techniques in many domains, e.g., cybersecurity, natural language processing, bioinformatics, robotics and control, and medical information processing, among many others. Despite it has been contributed several works reviewing the State-of-the-Art on DL, all of them only tackled one aspect of the DL, which leads to an overall lack of knowledge about it. Therefore, in this contribution, we propose using a more holistic approach in order to provide a more suitable starting point from which to develop a full understanding of DL. Specifically, this review attempts to provide a more comprehensive survey of the most important aspects of DL and including those enhancements recently added to the field. In particular, this paper outlines the importance of DL, presents the types of DL techniques and networks. It then presents convolutional neural networks (CNNs) which the most utilized DL network type and describes the development of CNNs architectures together with their main features, e.g., starting with the AlexNet network and closing with the High-Resolution network (HR.Net). Finally, we further present the challenges and suggested solutions to help researchers understand the existing research gaps. It is followed by a list of the major DL applications. Computational tools including FPGA, GPU, and CPU are summarized along with a description of their influence on DL. The paper ends with the evolution matrix, benchmark datasets, and summary and conclusion. Springer International Publishing 2021-03-31 2021 /pmc/articles/PMC8010506/ /pubmed/33816053 http://dx.doi.org/10.1186/s40537-021-00444-8 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Survey Paper Alzubaidi, Laith Zhang, Jinglan Humaidi, Amjad J. Al-Dujaili, Ayad Duan, Ye Al-Shamma, Omran Santamaría, J. Fadhel, Mohammed A. Al-Amidie, Muthana Farhan, Laith Review of deep learning: concepts, CNN architectures, challenges, applications, future directions |
title | Review of deep learning: concepts, CNN architectures, challenges, applications, future directions |
title_full | Review of deep learning: concepts, CNN architectures, challenges, applications, future directions |
title_fullStr | Review of deep learning: concepts, CNN architectures, challenges, applications, future directions |
title_full_unstemmed | Review of deep learning: concepts, CNN architectures, challenges, applications, future directions |
title_short | Review of deep learning: concepts, CNN architectures, challenges, applications, future directions |
title_sort | review of deep learning: concepts, cnn architectures, challenges, applications, future directions |
topic | Survey Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8010506/ https://www.ncbi.nlm.nih.gov/pubmed/33816053 http://dx.doi.org/10.1186/s40537-021-00444-8 |
work_keys_str_mv | AT alzubaidilaith reviewofdeeplearningconceptscnnarchitectureschallengesapplicationsfuturedirections AT zhangjinglan reviewofdeeplearningconceptscnnarchitectureschallengesapplicationsfuturedirections AT humaidiamjadj reviewofdeeplearningconceptscnnarchitectureschallengesapplicationsfuturedirections AT aldujailiayad reviewofdeeplearningconceptscnnarchitectureschallengesapplicationsfuturedirections AT duanye reviewofdeeplearningconceptscnnarchitectureschallengesapplicationsfuturedirections AT alshammaomran reviewofdeeplearningconceptscnnarchitectureschallengesapplicationsfuturedirections AT santamariaj reviewofdeeplearningconceptscnnarchitectureschallengesapplicationsfuturedirections AT fadhelmohammeda reviewofdeeplearningconceptscnnarchitectureschallengesapplicationsfuturedirections AT alamidiemuthana reviewofdeeplearningconceptscnnarchitectureschallengesapplicationsfuturedirections AT farhanlaith reviewofdeeplearningconceptscnnarchitectureschallengesapplicationsfuturedirections |