Cargando…

Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data

BACKGROUND: Multiple co-inertia analysis (mCIA) is a multivariate analysis method that can assess relationships and trends in multiple datasets. Recently it has been used for integrative analysis of multiple high-dimensional -omics datasets. However, its estimated loading vectors are non-sparse, whi...

Descripción completa

Detalles Bibliográficos
Autores principales: Min, Eun Jeong, Long, Qi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7157996/
https://www.ncbi.nlm.nih.gov/pubmed/32293260
http://dx.doi.org/10.1186/s12859-020-3455-4
_version_ 1783522447686893568
author Min, Eun Jeong
Long, Qi
author_facet Min, Eun Jeong
Long, Qi
author_sort Min, Eun Jeong
collection PubMed
description BACKGROUND: Multiple co-inertia analysis (mCIA) is a multivariate analysis method that can assess relationships and trends in multiple datasets. Recently it has been used for integrative analysis of multiple high-dimensional -omics datasets. However, its estimated loading vectors are non-sparse, which presents challenges for identifying important features and interpreting analysis results. We propose two new mCIA methods: 1) a sparse mCIA method that produces sparse loading estimates and 2) a structured sparse mCIA method that further enables incorporation of structural information among variables such as those from functional genomics. RESULTS: Our extensive simulation studies demonstrate the superior performance of the sparse mCIA and structured sparse mCIA methods compared to the existing mCIA in terms of feature selection and estimation accuracy. Application to the integrative analysis of transcriptomics data and proteomics data from a cancer study identified biomarkers that are suggested in the literature related with cancer disease. CONCLUSION: Proposed sparse mCIA achieves simultaneous model estimation and feature selection and yields analysis results that are more interpretable than the existing mCIA. Furthermore, proposed structured sparse mCIA can effectively incorporate prior network information among genes, resulting in improved feature selection and enhanced interpretability.
format Online
Article
Text
id pubmed-7157996
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-71579962020-04-20 Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data Min, Eun Jeong Long, Qi BMC Bioinformatics Methodology Article BACKGROUND: Multiple co-inertia analysis (mCIA) is a multivariate analysis method that can assess relationships and trends in multiple datasets. Recently it has been used for integrative analysis of multiple high-dimensional -omics datasets. However, its estimated loading vectors are non-sparse, which presents challenges for identifying important features and interpreting analysis results. We propose two new mCIA methods: 1) a sparse mCIA method that produces sparse loading estimates and 2) a structured sparse mCIA method that further enables incorporation of structural information among variables such as those from functional genomics. RESULTS: Our extensive simulation studies demonstrate the superior performance of the sparse mCIA and structured sparse mCIA methods compared to the existing mCIA in terms of feature selection and estimation accuracy. Application to the integrative analysis of transcriptomics data and proteomics data from a cancer study identified biomarkers that are suggested in the literature related with cancer disease. CONCLUSION: Proposed sparse mCIA achieves simultaneous model estimation and feature selection and yields analysis results that are more interpretable than the existing mCIA. Furthermore, proposed structured sparse mCIA can effectively incorporate prior network information among genes, resulting in improved feature selection and enhanced interpretability. BioMed Central 2020-04-15 /pmc/articles/PMC7157996/ /pubmed/32293260 http://dx.doi.org/10.1186/s12859-020-3455-4 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology Article
Min, Eun Jeong
Long, Qi
Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data
title Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data
title_full Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data
title_fullStr Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data
title_full_unstemmed Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data
title_short Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data
title_sort sparse multiple co-inertia analysis with application to integrative analysis of multi -omics data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7157996/
https://www.ncbi.nlm.nih.gov/pubmed/32293260
http://dx.doi.org/10.1186/s12859-020-3455-4
work_keys_str_mv AT mineunjeong sparsemultiplecoinertiaanalysiswithapplicationtointegrativeanalysisofmultiomicsdata
AT longqi sparsemultiplecoinertiaanalysiswithapplicationtointegrativeanalysisofmultiomicsdata