Cargando…

Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm

Integrative analyses of high-throughput ‘omic data, such as DNA methylation, DNA copy number alteration, mRNA and protein expression levels, have created unprecedented opportunities to understand the molecular basis of human disease. In particular, integrative analyses have been the cornerstone in t...

Descripción completa

Detalles Bibliográficos
Autores principales: Chalise, Prabhakar, Fridley, Brooke L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411077/
https://www.ncbi.nlm.nih.gov/pubmed/28459819
http://dx.doi.org/10.1371/journal.pone.0176278
_version_ 1783232783434383360
author Chalise, Prabhakar
Fridley, Brooke L.
author_facet Chalise, Prabhakar
Fridley, Brooke L.
author_sort Chalise, Prabhakar
collection PubMed
description Integrative analyses of high-throughput ‘omic data, such as DNA methylation, DNA copy number alteration, mRNA and protein expression levels, have created unprecedented opportunities to understand the molecular basis of human disease. In particular, integrative analyses have been the cornerstone in the study of cancer to determine molecular subtypes within a given cancer. As malignant tumors with similar morphological characteristics have been shown to exhibit entirely different molecular profiles, there has been significant interest in using multiple ‘omic data for the identification of novel molecular subtypes of disease, which could impact treatment decisions. Therefore, we have developed intNMF, an integrative approach for disease subtype classification based on non-negative matrix factorization. The proposed approach carries out integrative clustering of multiple high dimensional molecular data in a single comprehensive analysis utilizing the information across multiple biological levels assessed on the same individual. As intNMF does not assume any distributional form for the data, it has obvious advantages over other model based clustering methods which require specific distributional assumptions. Application of intNMF is illustrated using both simulated and real data from The Cancer Genome Atlas (TCGA).
format Online
Article
Text
id pubmed-5411077
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54110772017-05-12 Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm Chalise, Prabhakar Fridley, Brooke L. PLoS One Research Article Integrative analyses of high-throughput ‘omic data, such as DNA methylation, DNA copy number alteration, mRNA and protein expression levels, have created unprecedented opportunities to understand the molecular basis of human disease. In particular, integrative analyses have been the cornerstone in the study of cancer to determine molecular subtypes within a given cancer. As malignant tumors with similar morphological characteristics have been shown to exhibit entirely different molecular profiles, there has been significant interest in using multiple ‘omic data for the identification of novel molecular subtypes of disease, which could impact treatment decisions. Therefore, we have developed intNMF, an integrative approach for disease subtype classification based on non-negative matrix factorization. The proposed approach carries out integrative clustering of multiple high dimensional molecular data in a single comprehensive analysis utilizing the information across multiple biological levels assessed on the same individual. As intNMF does not assume any distributional form for the data, it has obvious advantages over other model based clustering methods which require specific distributional assumptions. Application of intNMF is illustrated using both simulated and real data from The Cancer Genome Atlas (TCGA). Public Library of Science 2017-05-01 /pmc/articles/PMC5411077/ /pubmed/28459819 http://dx.doi.org/10.1371/journal.pone.0176278 Text en © 2017 Chalise, Fridley http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Chalise, Prabhakar
Fridley, Brooke L.
Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm
title Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm
title_full Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm
title_fullStr Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm
title_full_unstemmed Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm
title_short Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm
title_sort integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411077/
https://www.ncbi.nlm.nih.gov/pubmed/28459819
http://dx.doi.org/10.1371/journal.pone.0176278
work_keys_str_mv AT chaliseprabhakar integrativeclusteringofmultilevelomicdatabasedonnonnegativematrixfactorizationalgorithm
AT fridleybrookel integrativeclusteringofmultilevelomicdatabasedonnonnegativematrixfactorizationalgorithm