Cargando…

On fusion methods for knowledge discovery from multi-omics datasets

Recent years have witnessed the tendency of measuring a biological sample on multiple omics scales for a comprehensive understanding of how biological activities on varying levels are perturbed by genetic variants, environments, and their interactions. This new trend raises substantial challenges to...

Descripción completa

Detalles Bibliográficos
Autores principales: Baldwin, Edwin, Han, Jiali, Luo, Wenting, Zhou, Jin, An, Lingling, Liu, Jian, Zhang, Hao Helen, Li, Haiquan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7078495/
https://www.ncbi.nlm.nih.gov/pubmed/32206210
http://dx.doi.org/10.1016/j.csbj.2020.02.011
_version_ 1783507632851517440
author Baldwin, Edwin
Han, Jiali
Luo, Wenting
Zhou, Jin
An, Lingling
Liu, Jian
Zhang, Hao Helen
Li, Haiquan
author_facet Baldwin, Edwin
Han, Jiali
Luo, Wenting
Zhou, Jin
An, Lingling
Liu, Jian
Zhang, Hao Helen
Li, Haiquan
author_sort Baldwin, Edwin
collection PubMed
description Recent years have witnessed the tendency of measuring a biological sample on multiple omics scales for a comprehensive understanding of how biological activities on varying levels are perturbed by genetic variants, environments, and their interactions. This new trend raises substantial challenges to data integration and fusion, of which the latter is a specific type of integration that applies a uniform method in a scalable manner, to solve biological problems which the multi-omics measurements target. Fusion-based analysis has advanced rapidly in the past decade, thanks to application drivers and theoretical breakthroughs in mathematics, statistics, and computer science. We will briefly address these methods from methodological and mathematical perspectives and categorize them into three types of approaches: data fusion (a narrowed definition as compared to the general data fusion concept), model fusion, and mixed fusion. We will demonstrate at least one typical example in each specific category to exemplify the characteristics, principles, and applications of the methods in general, as well as discuss the gaps and potential issues for future studies.
format Online
Article
Text
id pubmed-7078495
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-70784952020-03-23 On fusion methods for knowledge discovery from multi-omics datasets Baldwin, Edwin Han, Jiali Luo, Wenting Zhou, Jin An, Lingling Liu, Jian Zhang, Hao Helen Li, Haiquan Comput Struct Biotechnol J Review Article Recent years have witnessed the tendency of measuring a biological sample on multiple omics scales for a comprehensive understanding of how biological activities on varying levels are perturbed by genetic variants, environments, and their interactions. This new trend raises substantial challenges to data integration and fusion, of which the latter is a specific type of integration that applies a uniform method in a scalable manner, to solve biological problems which the multi-omics measurements target. Fusion-based analysis has advanced rapidly in the past decade, thanks to application drivers and theoretical breakthroughs in mathematics, statistics, and computer science. We will briefly address these methods from methodological and mathematical perspectives and categorize them into three types of approaches: data fusion (a narrowed definition as compared to the general data fusion concept), model fusion, and mixed fusion. We will demonstrate at least one typical example in each specific category to exemplify the characteristics, principles, and applications of the methods in general, as well as discuss the gaps and potential issues for future studies. Research Network of Computational and Structural Biotechnology 2020-03-05 /pmc/articles/PMC7078495/ /pubmed/32206210 http://dx.doi.org/10.1016/j.csbj.2020.02.011 Text en © 2020 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Review Article
Baldwin, Edwin
Han, Jiali
Luo, Wenting
Zhou, Jin
An, Lingling
Liu, Jian
Zhang, Hao Helen
Li, Haiquan
On fusion methods for knowledge discovery from multi-omics datasets
title On fusion methods for knowledge discovery from multi-omics datasets
title_full On fusion methods for knowledge discovery from multi-omics datasets
title_fullStr On fusion methods for knowledge discovery from multi-omics datasets
title_full_unstemmed On fusion methods for knowledge discovery from multi-omics datasets
title_short On fusion methods for knowledge discovery from multi-omics datasets
title_sort on fusion methods for knowledge discovery from multi-omics datasets
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7078495/
https://www.ncbi.nlm.nih.gov/pubmed/32206210
http://dx.doi.org/10.1016/j.csbj.2020.02.011
work_keys_str_mv AT baldwinedwin onfusionmethodsforknowledgediscoveryfrommultiomicsdatasets
AT hanjiali onfusionmethodsforknowledgediscoveryfrommultiomicsdatasets
AT luowenting onfusionmethodsforknowledgediscoveryfrommultiomicsdatasets
AT zhoujin onfusionmethodsforknowledgediscoveryfrommultiomicsdatasets
AT anlingling onfusionmethodsforknowledgediscoveryfrommultiomicsdatasets
AT liujian onfusionmethodsforknowledgediscoveryfrommultiomicsdatasets
AT zhanghaohelen onfusionmethodsforknowledgediscoveryfrommultiomicsdatasets
AT lihaiquan onfusionmethodsforknowledgediscoveryfrommultiomicsdatasets