Cargando…
Deep Learning in Mining Biological Data
Recent technological advancements in data acquisition tools allowed life scientists to acquire multimodal data from different biological application domains. Categorized in three broad types (i.e. images, signals, and sequences), these data are huge in amount and complex in nature. Mining such enorm...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7783296/ https://www.ncbi.nlm.nih.gov/pubmed/33425045 http://dx.doi.org/10.1007/s12559-020-09773-x |
_version_ | 1783632087185620992 |
---|---|
author | Mahmud, Mufti Kaiser, M. Shamim McGinnity, T. Martin Hussain, Amir |
author_facet | Mahmud, Mufti Kaiser, M. Shamim McGinnity, T. Martin Hussain, Amir |
author_sort | Mahmud, Mufti |
collection | PubMed |
description | Recent technological advancements in data acquisition tools allowed life scientists to acquire multimodal data from different biological application domains. Categorized in three broad types (i.e. images, signals, and sequences), these data are huge in amount and complex in nature. Mining such enormous amount of data for pattern recognition is a big challenge and requires sophisticated data-intensive machine learning techniques. Artificial neural network-based learning systems are well known for their pattern recognition capabilities, and lately their deep architectures—known as deep learning (DL)—have been successfully applied to solve many complex pattern recognition problems. To investigate how DL—especially its different architectures—has contributed and been utilized in the mining of biological data pertaining to those three types, a meta-analysis has been performed and the resulting resources have been critically analysed. Focusing on the use of DL to analyse patterns in data from diverse biological domains, this work investigates different DL architectures’ applications to these data. This is followed by an exploration of available open access data sources pertaining to the three data types along with popular open-source DL tools applicable to these data. Also, comparative investigations of these tools from qualitative, quantitative, and benchmarking perspectives are provided. Finally, some open research challenges in using DL to mine biological data are outlined and a number of possible future perspectives are put forward. |
format | Online Article Text |
id | pubmed-7783296 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-77832962021-01-05 Deep Learning in Mining Biological Data Mahmud, Mufti Kaiser, M. Shamim McGinnity, T. Martin Hussain, Amir Cognit Comput Article Recent technological advancements in data acquisition tools allowed life scientists to acquire multimodal data from different biological application domains. Categorized in three broad types (i.e. images, signals, and sequences), these data are huge in amount and complex in nature. Mining such enormous amount of data for pattern recognition is a big challenge and requires sophisticated data-intensive machine learning techniques. Artificial neural network-based learning systems are well known for their pattern recognition capabilities, and lately their deep architectures—known as deep learning (DL)—have been successfully applied to solve many complex pattern recognition problems. To investigate how DL—especially its different architectures—has contributed and been utilized in the mining of biological data pertaining to those three types, a meta-analysis has been performed and the resulting resources have been critically analysed. Focusing on the use of DL to analyse patterns in data from diverse biological domains, this work investigates different DL architectures’ applications to these data. This is followed by an exploration of available open access data sources pertaining to the three data types along with popular open-source DL tools applicable to these data. Also, comparative investigations of these tools from qualitative, quantitative, and benchmarking perspectives are provided. Finally, some open research challenges in using DL to mine biological data are outlined and a number of possible future perspectives are put forward. Springer US 2021-01-05 2021 /pmc/articles/PMC7783296/ /pubmed/33425045 http://dx.doi.org/10.1007/s12559-020-09773-x Text en © The Author(s) 2020 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Mahmud, Mufti Kaiser, M. Shamim McGinnity, T. Martin Hussain, Amir Deep Learning in Mining Biological Data |
title | Deep Learning in Mining Biological Data |
title_full | Deep Learning in Mining Biological Data |
title_fullStr | Deep Learning in Mining Biological Data |
title_full_unstemmed | Deep Learning in Mining Biological Data |
title_short | Deep Learning in Mining Biological Data |
title_sort | deep learning in mining biological data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7783296/ https://www.ncbi.nlm.nih.gov/pubmed/33425045 http://dx.doi.org/10.1007/s12559-020-09773-x |
work_keys_str_mv | AT mahmudmufti deeplearninginminingbiologicaldata AT kaisermshamim deeplearninginminingbiologicaldata AT mcginnitytmartin deeplearninginminingbiologicaldata AT hussainamir deeplearninginminingbiologicaldata |