Cargando…

Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components

The recent development of high-throughput technology has allowed us to accumulate vast amounts of multi-omics data. Because even single omics data have a large number of variables, integrated analysis of multi-omics data suffers from problems such as computational instability and variable redundancy...

Descripción completa

Detalles Bibliográficos
Autores principales:	Park, Mira, Kim, Doyoen, Moon, Kwanyoung, Park, Taesung
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7663540/ https://www.ncbi.nlm.nih.gov/pubmed/33147797 http://dx.doi.org/10.3390/ijms21218202

_version_	1783609650513444864
author	Park, Mira Kim, Doyoen Moon, Kwanyoung Park, Taesung
author_facet	Park, Mira Kim, Doyoen Moon, Kwanyoung Park, Taesung
author_sort	Park, Mira
collection	PubMed
description	The recent development of high-throughput technology has allowed us to accumulate vast amounts of multi-omics data. Because even single omics data have a large number of variables, integrated analysis of multi-omics data suffers from problems such as computational instability and variable redundancy. Most multi-omics data analyses apply single supervised analysis, repeatedly, for dimensional reduction and variable selection. However, these approaches cannot avoid the problems of redundancy and collinearity of variables. In this study, we propose a novel approach using blockwise component analysis. This would solve the limitations of current methods by applying variable clustering and sparse principal component (sPC) analysis. Our approach consists of two stages. The first stage identifies homogeneous variable blocks, and then extracts sPCs, for each omics dataset. The second stage merges sPCs from each omics dataset, and then constructs a prediction model. We also propose a graphical method showing the results of sparse PCA and model fitting, simultaneously. We applied the proposed methodology to glioblastoma multiforme data from The Cancer Genome Atlas. The comparison with other existing approaches showed that our proposed methodology is more easily interpretable than other approaches, and has comparable predictive power, with a much smaller number of variables.
format	Online Article Text
id	pubmed-7663540
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-76635402020-11-14 Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components Park, Mira Kim, Doyoen Moon, Kwanyoung Park, Taesung Int J Mol Sci Article The recent development of high-throughput technology has allowed us to accumulate vast amounts of multi-omics data. Because even single omics data have a large number of variables, integrated analysis of multi-omics data suffers from problems such as computational instability and variable redundancy. Most multi-omics data analyses apply single supervised analysis, repeatedly, for dimensional reduction and variable selection. However, these approaches cannot avoid the problems of redundancy and collinearity of variables. In this study, we propose a novel approach using blockwise component analysis. This would solve the limitations of current methods by applying variable clustering and sparse principal component (sPC) analysis. Our approach consists of two stages. The first stage identifies homogeneous variable blocks, and then extracts sPCs, for each omics dataset. The second stage merges sPCs from each omics dataset, and then constructs a prediction model. We also propose a graphical method showing the results of sparse PCA and model fitting, simultaneously. We applied the proposed methodology to glioblastoma multiforme data from The Cancer Genome Atlas. The comparison with other existing approaches showed that our proposed methodology is more easily interpretable than other approaches, and has comparable predictive power, with a much smaller number of variables. MDPI 2020-11-02 /pmc/articles/PMC7663540/ /pubmed/33147797 http://dx.doi.org/10.3390/ijms21218202 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Park, Mira Kim, Doyoen Moon, Kwanyoung Park, Taesung Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components
title	Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components
title_full	Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components
title_fullStr	Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components
title_full_unstemmed	Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components
title_short	Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components
title_sort	integrative analysis of multi-omics data based on blockwise sparse principal components
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7663540/ https://www.ncbi.nlm.nih.gov/pubmed/33147797 http://dx.doi.org/10.3390/ijms21218202
work_keys_str_mv	AT parkmira integrativeanalysisofmultiomicsdatabasedonblockwisesparseprincipalcomponents AT kimdoyoen integrativeanalysisofmultiomicsdatabasedonblockwisesparseprincipalcomponents AT moonkwanyoung integrativeanalysisofmultiomicsdatabasedonblockwisesparseprincipalcomponents AT parktaesung integrativeanalysisofmultiomicsdatabasedonblockwisesparseprincipalcomponents

Integrative Analysis of Multi-Omics Data Based on Blockwise Sparse Principal Components

Ejemplares similares