Cargando…
Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering
BACKGROUND: Two-way hierarchical clustering, with results visualized as heatmaps, has served as the method of choice for exploring structure in large matrices of expression data since the advent of microarrays. While it has delivered important insights, including a typology of breast cancer subtypes...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2996978/ https://www.ncbi.nlm.nih.gov/pubmed/20731868 http://dx.doi.org/10.1186/1471-2164-11-482 |
_version_ | 1782193248706494464 |
---|---|
author | Mefford, Dwain Mefford, Joel A |
author_facet | Mefford, Dwain Mefford, Joel A |
author_sort | Mefford, Dwain |
collection | PubMed |
description | BACKGROUND: Two-way hierarchical clustering, with results visualized as heatmaps, has served as the method of choice for exploring structure in large matrices of expression data since the advent of microarrays. While it has delivered important insights, including a typology of breast cancer subtypes, it suffers from instability in the face of gene or sample selection, and an inability to detect small sets that may be dominated by larger sets such as the estrogen-related genes in breast cancer. The rank-based partitioning algorithm introduced in this paper addresses several of these limitations. It delivers results comparable to two-way hierarchical clustering, and much more. Applied systematically across a range of parameter settings, it enumerates all the partition-inducing gene sets in a matrix of expression values. RESULTS: Applied to four large breast cancer datasets, this alternative exploratory method detects more than thirty sets of co-regulated genes, many of which are conserved across experiments and across platforms. Many of these sets are readily identified in biological terms, e.g., "estrogen", "erbb2", and 8p11-12, and several are clinically significant as prognostic of either increased survival ("adipose", "stromal"...) or diminished survival ("proliferation", "immune/interferon", "histone",...). Of special interest are the sets that effectively factor "immune response" and "stromal signalling". CONCLUSION: The gene sets induced by the enumeration include many of the sets reported in the literature. In this regard these inventories confirm and consolidate findings from microarray-based work on breast cancer over the last decade. But, the enumerations also identify gene sets that have not been studied as of yet, some of which are prognostic of survival. The sets induced are robust, biologically meaningful, and serve to reveal a finer structure in existing breast cancer microarrays. |
format | Text |
id | pubmed-2996978 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-29969782010-12-07 Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering Mefford, Dwain Mefford, Joel A BMC Genomics Methodology Article BACKGROUND: Two-way hierarchical clustering, with results visualized as heatmaps, has served as the method of choice for exploring structure in large matrices of expression data since the advent of microarrays. While it has delivered important insights, including a typology of breast cancer subtypes, it suffers from instability in the face of gene or sample selection, and an inability to detect small sets that may be dominated by larger sets such as the estrogen-related genes in breast cancer. The rank-based partitioning algorithm introduced in this paper addresses several of these limitations. It delivers results comparable to two-way hierarchical clustering, and much more. Applied systematically across a range of parameter settings, it enumerates all the partition-inducing gene sets in a matrix of expression values. RESULTS: Applied to four large breast cancer datasets, this alternative exploratory method detects more than thirty sets of co-regulated genes, many of which are conserved across experiments and across platforms. Many of these sets are readily identified in biological terms, e.g., "estrogen", "erbb2", and 8p11-12, and several are clinically significant as prognostic of either increased survival ("adipose", "stromal"...) or diminished survival ("proliferation", "immune/interferon", "histone",...). Of special interest are the sets that effectively factor "immune response" and "stromal signalling". CONCLUSION: The gene sets induced by the enumeration include many of the sets reported in the literature. In this regard these inventories confirm and consolidate findings from microarray-based work on breast cancer over the last decade. But, the enumerations also identify gene sets that have not been studied as of yet, some of which are prognostic of survival. The sets induced are robust, biologically meaningful, and serve to reveal a finer structure in existing breast cancer microarrays. BioMed Central 2010-08-23 /pmc/articles/PMC2996978/ /pubmed/20731868 http://dx.doi.org/10.1186/1471-2164-11-482 Text en Copyright ©2010 Mefford and Mefford; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Mefford, Dwain Mefford, Joel A Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering |
title | Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering |
title_full | Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering |
title_fullStr | Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering |
title_full_unstemmed | Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering |
title_short | Enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering |
title_sort | enumerating the gene sets in breast cancer, a "direct" alternative to hierarchical clustering |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2996978/ https://www.ncbi.nlm.nih.gov/pubmed/20731868 http://dx.doi.org/10.1186/1471-2164-11-482 |
work_keys_str_mv | AT mefforddwain enumeratingthegenesetsinbreastcanceradirectalternativetohierarchicalclustering AT meffordjoela enumeratingthegenesetsinbreastcanceradirectalternativetohierarchicalclustering |