Cargando…
Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm
Integrative analyses of high-throughput ‘omic data, such as DNA methylation, DNA copy number alteration, mRNA and protein expression levels, have created unprecedented opportunities to understand the molecular basis of human disease. In particular, integrative analyses have been the cornerstone in t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411077/ https://www.ncbi.nlm.nih.gov/pubmed/28459819 http://dx.doi.org/10.1371/journal.pone.0176278 |
_version_ | 1783232783434383360 |
---|---|
author | Chalise, Prabhakar Fridley, Brooke L. |
author_facet | Chalise, Prabhakar Fridley, Brooke L. |
author_sort | Chalise, Prabhakar |
collection | PubMed |
description | Integrative analyses of high-throughput ‘omic data, such as DNA methylation, DNA copy number alteration, mRNA and protein expression levels, have created unprecedented opportunities to understand the molecular basis of human disease. In particular, integrative analyses have been the cornerstone in the study of cancer to determine molecular subtypes within a given cancer. As malignant tumors with similar morphological characteristics have been shown to exhibit entirely different molecular profiles, there has been significant interest in using multiple ‘omic data for the identification of novel molecular subtypes of disease, which could impact treatment decisions. Therefore, we have developed intNMF, an integrative approach for disease subtype classification based on non-negative matrix factorization. The proposed approach carries out integrative clustering of multiple high dimensional molecular data in a single comprehensive analysis utilizing the information across multiple biological levels assessed on the same individual. As intNMF does not assume any distributional form for the data, it has obvious advantages over other model based clustering methods which require specific distributional assumptions. Application of intNMF is illustrated using both simulated and real data from The Cancer Genome Atlas (TCGA). |
format | Online Article Text |
id | pubmed-5411077 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-54110772017-05-12 Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm Chalise, Prabhakar Fridley, Brooke L. PLoS One Research Article Integrative analyses of high-throughput ‘omic data, such as DNA methylation, DNA copy number alteration, mRNA and protein expression levels, have created unprecedented opportunities to understand the molecular basis of human disease. In particular, integrative analyses have been the cornerstone in the study of cancer to determine molecular subtypes within a given cancer. As malignant tumors with similar morphological characteristics have been shown to exhibit entirely different molecular profiles, there has been significant interest in using multiple ‘omic data for the identification of novel molecular subtypes of disease, which could impact treatment decisions. Therefore, we have developed intNMF, an integrative approach for disease subtype classification based on non-negative matrix factorization. The proposed approach carries out integrative clustering of multiple high dimensional molecular data in a single comprehensive analysis utilizing the information across multiple biological levels assessed on the same individual. As intNMF does not assume any distributional form for the data, it has obvious advantages over other model based clustering methods which require specific distributional assumptions. Application of intNMF is illustrated using both simulated and real data from The Cancer Genome Atlas (TCGA). Public Library of Science 2017-05-01 /pmc/articles/PMC5411077/ /pubmed/28459819 http://dx.doi.org/10.1371/journal.pone.0176278 Text en © 2017 Chalise, Fridley http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Chalise, Prabhakar Fridley, Brooke L. Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm |
title | Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm |
title_full | Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm |
title_fullStr | Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm |
title_full_unstemmed | Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm |
title_short | Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm |
title_sort | integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411077/ https://www.ncbi.nlm.nih.gov/pubmed/28459819 http://dx.doi.org/10.1371/journal.pone.0176278 |
work_keys_str_mv | AT chaliseprabhakar integrativeclusteringofmultilevelomicdatabasedonnonnegativematrixfactorizationalgorithm AT fridleybrookel integrativeclusteringofmultilevelomicdatabasedonnonnegativematrixfactorizationalgorithm |