Cargando…
Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification
BACKGROUND: Recent high throughput technologies have been applied for collecting heterogeneous biomedical omics datasets. Computational analysis of the multi-omics datasets could potentially reveal deep insights for a given disease. Most existing clustering methods by multi-omics data assume strong...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6929329/ https://www.ncbi.nlm.nih.gov/pubmed/31874642 http://dx.doi.org/10.1186/s12920-019-0633-1 |
_version_ | 1783482678850355200 |
---|---|
author | Guo, Yin Li, Huiran Cai, Menglan Li, Limin |
author_facet | Guo, Yin Li, Huiran Cai, Menglan Li, Limin |
author_sort | Guo, Yin |
collection | PubMed |
description | BACKGROUND: Recent high throughput technologies have been applied for collecting heterogeneous biomedical omics datasets. Computational analysis of the multi-omics datasets could potentially reveal deep insights for a given disease. Most existing clustering methods by multi-omics data assume strong consistency among different sources of datasets, and thus may lose efficacy when the consistency is relatively weak. Furthermore, they could not identify the conflicting parts for each view, which might be important in applications such as cancer subtype identification. METHODS: In this work, we propose an integrative subspace clustering method (ISC) by common and specific decomposition to identify clustering structures with multi-omics datasets. The main idea of our ISC method is that the original representations for the samples in each view could be reconstructed by the concatenation of a common part and a view-specific part in orthogonal subspaces. The problem can be formulated as a matrix decomposition problem and solved efficiently by our proposed algorithm. RESULTS: The experiments on simulation and text datasets show that our method outperforms other state-of-art methods. Our method is further evaluated by identifying cancer types using a colorectal dataset. We finally apply our method to cancer subtype identification for five cancers using TCGA datasets, and the survival analysis shows that the subtypes we found are significantly better than other compared methods. CONCLUSION: We conclude that our ISC model could not only discover the weak common information across views but also identify the view-specific information. |
format | Online Article Text |
id | pubmed-6929329 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69293292019-12-30 Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification Guo, Yin Li, Huiran Cai, Menglan Li, Limin BMC Med Genomics Research BACKGROUND: Recent high throughput technologies have been applied for collecting heterogeneous biomedical omics datasets. Computational analysis of the multi-omics datasets could potentially reveal deep insights for a given disease. Most existing clustering methods by multi-omics data assume strong consistency among different sources of datasets, and thus may lose efficacy when the consistency is relatively weak. Furthermore, they could not identify the conflicting parts for each view, which might be important in applications such as cancer subtype identification. METHODS: In this work, we propose an integrative subspace clustering method (ISC) by common and specific decomposition to identify clustering structures with multi-omics datasets. The main idea of our ISC method is that the original representations for the samples in each view could be reconstructed by the concatenation of a common part and a view-specific part in orthogonal subspaces. The problem can be formulated as a matrix decomposition problem and solved efficiently by our proposed algorithm. RESULTS: The experiments on simulation and text datasets show that our method outperforms other state-of-art methods. Our method is further evaluated by identifying cancer types using a colorectal dataset. We finally apply our method to cancer subtype identification for five cancers using TCGA datasets, and the survival analysis shows that the subtypes we found are significantly better than other compared methods. CONCLUSION: We conclude that our ISC model could not only discover the weak common information across views but also identify the view-specific information. BioMed Central 2019-12-24 /pmc/articles/PMC6929329/ /pubmed/31874642 http://dx.doi.org/10.1186/s12920-019-0633-1 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Guo, Yin Li, Huiran Cai, Menglan Li, Limin Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification |
title | Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification |
title_full | Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification |
title_fullStr | Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification |
title_full_unstemmed | Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification |
title_short | Integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification |
title_sort | integrative subspace clustering by common and specific decomposition for applications on cancer subtype identification |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6929329/ https://www.ncbi.nlm.nih.gov/pubmed/31874642 http://dx.doi.org/10.1186/s12920-019-0633-1 |
work_keys_str_mv | AT guoyin integrativesubspaceclusteringbycommonandspecificdecompositionforapplicationsoncancersubtypeidentification AT lihuiran integrativesubspaceclusteringbycommonandspecificdecompositionforapplicationsoncancersubtypeidentification AT caimenglan integrativesubspaceclusteringbycommonandspecificdecompositionforapplicationsoncancersubtypeidentification AT lilimin integrativesubspaceclusteringbycommonandspecificdecompositionforapplicationsoncancersubtypeidentification |