Cargando…

Omics Data and Data Representations for Deep Learning-Based Predictive Modeling

Medical discoveries mainly depend on the capability to process and analyze biological datasets, which inundate the scientific community and are still expanding as the cost of next-generation sequencing technologies is decreasing. Deep learning (DL) is a viable method to exploit this massive data str...

Descripción completa

Detalles Bibliográficos
Autores principales: Tsimenidis, Stefanos, Vrochidou, Eleni, Papakostas, George A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9603455/
https://www.ncbi.nlm.nih.gov/pubmed/36293133
http://dx.doi.org/10.3390/ijms232012272
_version_ 1784817553412980736
author Tsimenidis, Stefanos
Vrochidou, Eleni
Papakostas, George A.
author_facet Tsimenidis, Stefanos
Vrochidou, Eleni
Papakostas, George A.
author_sort Tsimenidis, Stefanos
collection PubMed
description Medical discoveries mainly depend on the capability to process and analyze biological datasets, which inundate the scientific community and are still expanding as the cost of next-generation sequencing technologies is decreasing. Deep learning (DL) is a viable method to exploit this massive data stream since it has advanced quickly with there being successive innovations. However, an obstacle to scientific progress emerges: the difficulty of applying DL to biology, and this because both fields are evolving at a breakneck pace, thus making it hard for an individual to occupy the front lines of both of them. This paper aims to bridge the gap and help computer scientists bring their valuable expertise into the life sciences. This work provides an overview of the most common types of biological data and data representations that are used to train DL models, with additional information on the models themselves and the various tasks that are being tackled. This is the essential information a DL expert with no background in biology needs in order to participate in DL-based research projects in biomedicine, biotechnology, and drug discovery. Alternatively, this study could be also useful to researchers in biology to understand and utilize the power of DL to gain better insights into and extract important information from the omics data.
format Online
Article
Text
id pubmed-9603455
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96034552022-10-27 Omics Data and Data Representations for Deep Learning-Based Predictive Modeling Tsimenidis, Stefanos Vrochidou, Eleni Papakostas, George A. Int J Mol Sci Review Medical discoveries mainly depend on the capability to process and analyze biological datasets, which inundate the scientific community and are still expanding as the cost of next-generation sequencing technologies is decreasing. Deep learning (DL) is a viable method to exploit this massive data stream since it has advanced quickly with there being successive innovations. However, an obstacle to scientific progress emerges: the difficulty of applying DL to biology, and this because both fields are evolving at a breakneck pace, thus making it hard for an individual to occupy the front lines of both of them. This paper aims to bridge the gap and help computer scientists bring their valuable expertise into the life sciences. This work provides an overview of the most common types of biological data and data representations that are used to train DL models, with additional information on the models themselves and the various tasks that are being tackled. This is the essential information a DL expert with no background in biology needs in order to participate in DL-based research projects in biomedicine, biotechnology, and drug discovery. Alternatively, this study could be also useful to researchers in biology to understand and utilize the power of DL to gain better insights into and extract important information from the omics data. MDPI 2022-10-14 /pmc/articles/PMC9603455/ /pubmed/36293133 http://dx.doi.org/10.3390/ijms232012272 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Tsimenidis, Stefanos
Vrochidou, Eleni
Papakostas, George A.
Omics Data and Data Representations for Deep Learning-Based Predictive Modeling
title Omics Data and Data Representations for Deep Learning-Based Predictive Modeling
title_full Omics Data and Data Representations for Deep Learning-Based Predictive Modeling
title_fullStr Omics Data and Data Representations for Deep Learning-Based Predictive Modeling
title_full_unstemmed Omics Data and Data Representations for Deep Learning-Based Predictive Modeling
title_short Omics Data and Data Representations for Deep Learning-Based Predictive Modeling
title_sort omics data and data representations for deep learning-based predictive modeling
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9603455/
https://www.ncbi.nlm.nih.gov/pubmed/36293133
http://dx.doi.org/10.3390/ijms232012272
work_keys_str_mv AT tsimenidisstefanos omicsdataanddatarepresentationsfordeeplearningbasedpredictivemodeling
AT vrochidoueleni omicsdataanddatarepresentationsfordeeplearningbasedpredictivemodeling
AT papakostasgeorgea omicsdataanddatarepresentationsfordeeplearningbasedpredictivemodeling