Cargando…

Integration strategies of multi-omics data for machine learning analysis

Increased availability of high-throughput technologies has generated an ever-growing number of omics data that seek to portray many different but complementary biological layers including genomics, epigenomics, transcriptomics, proteomics, and metabolomics. New insight from these data have been obta...

Descripción completa

Detalles Bibliográficos
Autores principales: Picard, Milan, Scott-Boyer, Marie-Pier, Bodein, Antoine, Périn, Olivier, Droit, Arnaud
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8258788/
https://www.ncbi.nlm.nih.gov/pubmed/34285775
http://dx.doi.org/10.1016/j.csbj.2021.06.030
_version_ 1783718561960689664
author Picard, Milan
Scott-Boyer, Marie-Pier
Bodein, Antoine
Périn, Olivier
Droit, Arnaud
author_facet Picard, Milan
Scott-Boyer, Marie-Pier
Bodein, Antoine
Périn, Olivier
Droit, Arnaud
author_sort Picard, Milan
collection PubMed
description Increased availability of high-throughput technologies has generated an ever-growing number of omics data that seek to portray many different but complementary biological layers including genomics, epigenomics, transcriptomics, proteomics, and metabolomics. New insight from these data have been obtained by machine learning algorithms that have produced diagnostic and classification biomarkers. Most biomarkers obtained to date however only include one omic measurement at a time and thus do not take full advantage of recent multi-omics experiments that now capture the entire complexity of biological systems. Multi-omics data integration strategies are needed to combine the complementary knowledge brought by each omics layer. We have summarized the most recent data integration methods/ frameworks into five different integration strategies: early, mixed, intermediate, late and hierarchical. In this mini-review, we focus on challenges and existing multi-omics integration strategies by paying special attention to machine learning applications.
format Online
Article
Text
id pubmed-8258788
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-82587882021-07-19 Integration strategies of multi-omics data for machine learning analysis Picard, Milan Scott-Boyer, Marie-Pier Bodein, Antoine Périn, Olivier Droit, Arnaud Comput Struct Biotechnol J Review Article Increased availability of high-throughput technologies has generated an ever-growing number of omics data that seek to portray many different but complementary biological layers including genomics, epigenomics, transcriptomics, proteomics, and metabolomics. New insight from these data have been obtained by machine learning algorithms that have produced diagnostic and classification biomarkers. Most biomarkers obtained to date however only include one omic measurement at a time and thus do not take full advantage of recent multi-omics experiments that now capture the entire complexity of biological systems. Multi-omics data integration strategies are needed to combine the complementary knowledge brought by each omics layer. We have summarized the most recent data integration methods/ frameworks into five different integration strategies: early, mixed, intermediate, late and hierarchical. In this mini-review, we focus on challenges and existing multi-omics integration strategies by paying special attention to machine learning applications. Research Network of Computational and Structural Biotechnology 2021-06-22 /pmc/articles/PMC8258788/ /pubmed/34285775 http://dx.doi.org/10.1016/j.csbj.2021.06.030 Text en © 2021 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Review Article
Picard, Milan
Scott-Boyer, Marie-Pier
Bodein, Antoine
Périn, Olivier
Droit, Arnaud
Integration strategies of multi-omics data for machine learning analysis
title Integration strategies of multi-omics data for machine learning analysis
title_full Integration strategies of multi-omics data for machine learning analysis
title_fullStr Integration strategies of multi-omics data for machine learning analysis
title_full_unstemmed Integration strategies of multi-omics data for machine learning analysis
title_short Integration strategies of multi-omics data for machine learning analysis
title_sort integration strategies of multi-omics data for machine learning analysis
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8258788/
https://www.ncbi.nlm.nih.gov/pubmed/34285775
http://dx.doi.org/10.1016/j.csbj.2021.06.030
work_keys_str_mv AT picardmilan integrationstrategiesofmultiomicsdataformachinelearninganalysis
AT scottboyermariepier integrationstrategiesofmultiomicsdataformachinelearninganalysis
AT bodeinantoine integrationstrategiesofmultiomicsdataformachinelearninganalysis
AT perinolivier integrationstrategiesofmultiomicsdataformachinelearninganalysis
AT droitarnaud integrationstrategiesofmultiomicsdataformachinelearninganalysis