Cargando…
Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data
BACKGROUND: Multiple co-inertia analysis (mCIA) is a multivariate analysis method that can assess relationships and trends in multiple datasets. Recently it has been used for integrative analysis of multiple high-dimensional -omics datasets. However, its estimated loading vectors are non-sparse, whi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7157996/ https://www.ncbi.nlm.nih.gov/pubmed/32293260 http://dx.doi.org/10.1186/s12859-020-3455-4 |
_version_ | 1783522447686893568 |
---|---|
author | Min, Eun Jeong Long, Qi |
author_facet | Min, Eun Jeong Long, Qi |
author_sort | Min, Eun Jeong |
collection | PubMed |
description | BACKGROUND: Multiple co-inertia analysis (mCIA) is a multivariate analysis method that can assess relationships and trends in multiple datasets. Recently it has been used for integrative analysis of multiple high-dimensional -omics datasets. However, its estimated loading vectors are non-sparse, which presents challenges for identifying important features and interpreting analysis results. We propose two new mCIA methods: 1) a sparse mCIA method that produces sparse loading estimates and 2) a structured sparse mCIA method that further enables incorporation of structural information among variables such as those from functional genomics. RESULTS: Our extensive simulation studies demonstrate the superior performance of the sparse mCIA and structured sparse mCIA methods compared to the existing mCIA in terms of feature selection and estimation accuracy. Application to the integrative analysis of transcriptomics data and proteomics data from a cancer study identified biomarkers that are suggested in the literature related with cancer disease. CONCLUSION: Proposed sparse mCIA achieves simultaneous model estimation and feature selection and yields analysis results that are more interpretable than the existing mCIA. Furthermore, proposed structured sparse mCIA can effectively incorporate prior network information among genes, resulting in improved feature selection and enhanced interpretability. |
format | Online Article Text |
id | pubmed-7157996 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-71579962020-04-20 Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data Min, Eun Jeong Long, Qi BMC Bioinformatics Methodology Article BACKGROUND: Multiple co-inertia analysis (mCIA) is a multivariate analysis method that can assess relationships and trends in multiple datasets. Recently it has been used for integrative analysis of multiple high-dimensional -omics datasets. However, its estimated loading vectors are non-sparse, which presents challenges for identifying important features and interpreting analysis results. We propose two new mCIA methods: 1) a sparse mCIA method that produces sparse loading estimates and 2) a structured sparse mCIA method that further enables incorporation of structural information among variables such as those from functional genomics. RESULTS: Our extensive simulation studies demonstrate the superior performance of the sparse mCIA and structured sparse mCIA methods compared to the existing mCIA in terms of feature selection and estimation accuracy. Application to the integrative analysis of transcriptomics data and proteomics data from a cancer study identified biomarkers that are suggested in the literature related with cancer disease. CONCLUSION: Proposed sparse mCIA achieves simultaneous model estimation and feature selection and yields analysis results that are more interpretable than the existing mCIA. Furthermore, proposed structured sparse mCIA can effectively incorporate prior network information among genes, resulting in improved feature selection and enhanced interpretability. BioMed Central 2020-04-15 /pmc/articles/PMC7157996/ /pubmed/32293260 http://dx.doi.org/10.1186/s12859-020-3455-4 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Article Min, Eun Jeong Long, Qi Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data |
title | Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data |
title_full | Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data |
title_fullStr | Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data |
title_full_unstemmed | Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data |
title_short | Sparse multiple co-Inertia analysis with application to integrative analysis of multi -Omics data |
title_sort | sparse multiple co-inertia analysis with application to integrative analysis of multi -omics data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7157996/ https://www.ncbi.nlm.nih.gov/pubmed/32293260 http://dx.doi.org/10.1186/s12859-020-3455-4 |
work_keys_str_mv | AT mineunjeong sparsemultiplecoinertiaanalysiswithapplicationtointegrativeanalysisofmultiomicsdata AT longqi sparsemultiplecoinertiaanalysiswithapplicationtointegrativeanalysisofmultiomicsdata |