Cargando…

Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets

Independent component analysis (ICA) is a matrix factorization approach where the signals captured by each individual matrix factors are optimized to become as mutually independent as possible. Initially suggested for solving source blind separation problems in various fields, ICA was shown to be su...

Descripción completa

Detalles Bibliográficos
Autores principales: Sompairac, Nicolas, Nazarov, Petr V., Czerwinska, Urszula, Cantini, Laura, Biton, Anne, Molkenov, Askhat, Zhumadilov, Zhaxybay, Barillot, Emmanuel, Radvanyi, Francois, Gorban, Alexander, Kairov, Ulykbek, Zinovyev, Andrei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6771121/
https://www.ncbi.nlm.nih.gov/pubmed/31500324
http://dx.doi.org/10.3390/ijms20184414
_version_ 1783455631007547392
author Sompairac, Nicolas
Nazarov, Petr V.
Czerwinska, Urszula
Cantini, Laura
Biton, Anne
Molkenov, Askhat
Zhumadilov, Zhaxybay
Barillot, Emmanuel
Radvanyi, Francois
Gorban, Alexander
Kairov, Ulykbek
Zinovyev, Andrei
author_facet Sompairac, Nicolas
Nazarov, Petr V.
Czerwinska, Urszula
Cantini, Laura
Biton, Anne
Molkenov, Askhat
Zhumadilov, Zhaxybay
Barillot, Emmanuel
Radvanyi, Francois
Gorban, Alexander
Kairov, Ulykbek
Zinovyev, Andrei
author_sort Sompairac, Nicolas
collection PubMed
description Independent component analysis (ICA) is a matrix factorization approach where the signals captured by each individual matrix factors are optimized to become as mutually independent as possible. Initially suggested for solving source blind separation problems in various fields, ICA was shown to be successful in analyzing functional magnetic resonance imaging (fMRI) and other types of biomedical data. In the last twenty years, ICA became a part of the standard machine learning toolbox, together with other matrix factorization methods such as principal component analysis (PCA) and non-negative matrix factorization (NMF). Here, we review a number of recent works where ICA was shown to be a useful tool for unraveling the complexity of cancer biology from the analysis of different types of omics data, mainly collected for tumoral samples. Such works highlight the use of ICA in dimensionality reduction, deconvolution, data pre-processing, meta-analysis, and others applied to different data types (transcriptome, methylome, proteome, single-cell data). We particularly focus on the technical aspects of ICA application in omics studies such as using different protocols, determining the optimal number of components, assessing and improving reproducibility of the ICA results, and comparison with other popular matrix factorization techniques. We discuss the emerging ICA applications to the integrative analysis of multi-level omics datasets and introduce a conceptual view on ICA as a tool for defining functional subsystems of a complex biological system and their interactions under various conditions. Our review is accompanied by a Jupyter notebook which illustrates the discussed concepts and provides a practical tool for applying ICA to the analysis of cancer omics datasets.
format Online
Article
Text
id pubmed-6771121
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-67711212019-10-30 Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets Sompairac, Nicolas Nazarov, Petr V. Czerwinska, Urszula Cantini, Laura Biton, Anne Molkenov, Askhat Zhumadilov, Zhaxybay Barillot, Emmanuel Radvanyi, Francois Gorban, Alexander Kairov, Ulykbek Zinovyev, Andrei Int J Mol Sci Review Independent component analysis (ICA) is a matrix factorization approach where the signals captured by each individual matrix factors are optimized to become as mutually independent as possible. Initially suggested for solving source blind separation problems in various fields, ICA was shown to be successful in analyzing functional magnetic resonance imaging (fMRI) and other types of biomedical data. In the last twenty years, ICA became a part of the standard machine learning toolbox, together with other matrix factorization methods such as principal component analysis (PCA) and non-negative matrix factorization (NMF). Here, we review a number of recent works where ICA was shown to be a useful tool for unraveling the complexity of cancer biology from the analysis of different types of omics data, mainly collected for tumoral samples. Such works highlight the use of ICA in dimensionality reduction, deconvolution, data pre-processing, meta-analysis, and others applied to different data types (transcriptome, methylome, proteome, single-cell data). We particularly focus on the technical aspects of ICA application in omics studies such as using different protocols, determining the optimal number of components, assessing and improving reproducibility of the ICA results, and comparison with other popular matrix factorization techniques. We discuss the emerging ICA applications to the integrative analysis of multi-level omics datasets and introduce a conceptual view on ICA as a tool for defining functional subsystems of a complex biological system and their interactions under various conditions. Our review is accompanied by a Jupyter notebook which illustrates the discussed concepts and provides a practical tool for applying ICA to the analysis of cancer omics datasets. MDPI 2019-09-07 /pmc/articles/PMC6771121/ /pubmed/31500324 http://dx.doi.org/10.3390/ijms20184414 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Review
Sompairac, Nicolas
Nazarov, Petr V.
Czerwinska, Urszula
Cantini, Laura
Biton, Anne
Molkenov, Askhat
Zhumadilov, Zhaxybay
Barillot, Emmanuel
Radvanyi, Francois
Gorban, Alexander
Kairov, Ulykbek
Zinovyev, Andrei
Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets
title Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets
title_full Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets
title_fullStr Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets
title_full_unstemmed Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets
title_short Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets
title_sort independent component analysis for unraveling the complexity of cancer omics datasets
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6771121/
https://www.ncbi.nlm.nih.gov/pubmed/31500324
http://dx.doi.org/10.3390/ijms20184414
work_keys_str_mv AT sompairacnicolas independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT nazarovpetrv independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT czerwinskaurszula independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT cantinilaura independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT bitonanne independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT molkenovaskhat independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT zhumadilovzhaxybay independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT barillotemmanuel independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT radvanyifrancois independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT gorbanalexander independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT kairovulykbek independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets
AT zinovyevandrei independentcomponentanalysisforunravelingthecomplexityofcanceromicsdatasets