Cargando…

Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering

BACKGROUND: Two-way hierarchical clustering, with results visualized as heatmaps, has served as the method of choice for exploring structure in large matrices of expression data since the advent of microarrays. While it has delivered important insights, including a typology of breast cancer subtypes...

Descripción completa

Detalles Bibliográficos
Autores principales: Mefford, Dwain, Mefford, Joel A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2996978/
https://www.ncbi.nlm.nih.gov/pubmed/20731868
http://dx.doi.org/10.1186/1471-2164-11-482
_version_ 1782193248706494464
author Mefford, Dwain
Mefford, Joel A
author_facet Mefford, Dwain
Mefford, Joel A
author_sort Mefford, Dwain
collection PubMed
description BACKGROUND: Two-way hierarchical clustering, with results visualized as heatmaps, has served as the method of choice for exploring structure in large matrices of expression data since the advent of microarrays. While it has delivered important insights, including a typology of breast cancer subtypes, it suffers from instability in the face of gene or sample selection, and an inability to detect small sets that may be dominated by larger sets such as the estrogen-related genes in breast cancer. The rank-based partitioning algorithm introduced in this paper addresses several of these limitations. It delivers results comparable to two-way hierarchical clustering, and much more. Applied systematically across a range of parameter settings, it enumerates all the partition-inducing gene sets in a matrix of expression values. RESULTS: Applied to four large breast cancer datasets, this alternative exploratory method detects more than thirty sets of co-regulated genes, many of which are conserved across experiments and across platforms. Many of these sets are readily identified in biological terms, e.g., "estrogen", "erbb2", and 8p11-12, and several are clinically significant as prognostic of either increased survival ("adipose", "stromal"...) or diminished survival ("proliferation", "immune/interferon", "histone",...). Of special interest are the sets that effectively factor "immune response" and "stromal signalling". CONCLUSION: The gene sets induced by the enumeration include many of the sets reported in the literature. In this regard these inventories confirm and consolidate findings from microarray-based work on breast cancer over the last decade. But, the enumerations also identify gene sets that have not been studied as of yet, some of which are prognostic of survival. The sets induced are robust, biologically meaningful, and serve to reveal a finer structure in existing breast cancer microarrays.
format Text
id pubmed-2996978
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-29969782010-12-07 Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering Mefford, Dwain Mefford, Joel A BMC Genomics Methodology Article BACKGROUND: Two-way hierarchical clustering, with results visualized as heatmaps, has served as the method of choice for exploring structure in large matrices of expression data since the advent of microarrays. While it has delivered important insights, including a typology of breast cancer subtypes, it suffers from instability in the face of gene or sample selection, and an inability to detect small sets that may be dominated by larger sets such as the estrogen-related genes in breast cancer. The rank-based partitioning algorithm introduced in this paper addresses several of these limitations. It delivers results comparable to two-way hierarchical clustering, and much more. Applied systematically across a range of parameter settings, it enumerates all the partition-inducing gene sets in a matrix of expression values. RESULTS: Applied to four large breast cancer datasets, this alternative exploratory method detects more than thirty sets of co-regulated genes, many of which are conserved across experiments and across platforms. Many of these sets are readily identified in biological terms, e.g., "estrogen", "erbb2", and 8p11-12, and several are clinically significant as prognostic of either increased survival ("adipose", "stromal"...) or diminished survival ("proliferation", "immune/interferon", "histone",...). Of special interest are the sets that effectively factor "immune response" and "stromal signalling". CONCLUSION: The gene sets induced by the enumeration include many of the sets reported in the literature. In this regard these inventories confirm and consolidate findings from microarray-based work on breast cancer over the last decade. But, the enumerations also identify gene sets that have not been studied as of yet, some of which are prognostic of survival. The sets induced are robust, biologically meaningful, and serve to reveal a finer structure in existing breast cancer microarrays. BioMed Central 2010-08-23 /pmc/articles/PMC2996978/ /pubmed/20731868 http://dx.doi.org/10.1186/1471-2164-11-482 Text en Copyright ©2010 Mefford and Mefford; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Mefford, Dwain
Mefford, Joel A
Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering
title Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering
title_full Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering
title_fullStr Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering
title_full_unstemmed Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering
title_short Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering
title_sort enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2996978/
https://www.ncbi.nlm.nih.gov/pubmed/20731868
http://dx.doi.org/10.1186/1471-2164-11-482
work_keys_str_mv AT mefforddwain enumeratingthegenesetsinbreastcanceradirectalternativetohierarchicalclustering
AT meffordjoela enumeratingthegenesetsinbreastcanceradirectalternativetohierarchicalclustering