Cargando…
Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study
BACKGROUND: A method to evaluate and analyze the massive data generated by series of microarray experiments is of utmost importance to reveal the hidden patterns of gene expression. Because of the complexity and the high dimensionality of microarray gene expression profiles, the dimensional reductio...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2002
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC138792/ https://www.ncbi.nlm.nih.gov/pubmed/12445336 http://dx.doi.org/10.1186/1471-2105-3-36 |
_version_ | 1782120494040875008 |
---|---|
author | Wang, Junbai Delabie, Jan Aasheim, Hans Christian Smeland, Erlend Myklebost, Ola |
author_facet | Wang, Junbai Delabie, Jan Aasheim, Hans Christian Smeland, Erlend Myklebost, Ola |
author_sort | Wang, Junbai |
collection | PubMed |
description | BACKGROUND: A method to evaluate and analyze the massive data generated by series of microarray experiments is of utmost importance to reveal the hidden patterns of gene expression. Because of the complexity and the high dimensionality of microarray gene expression profiles, the dimensional reduction of raw expression data and the feature selections necessary for, for example, classification of disease samples remains a challenge. To solve the problem we propose a two-level analysis. First self-organizing map (SOM) is used. SOM is a vector quantization method that simplifies and reduces the dimensionality of original measurements and visualizes individual tumor sample in a SOM component plane. Next, hierarchical clustering and K-means clustering is used to identify patterns of gene expression useful for classification of samples. RESULTS: We tested the two-level analysis on public data from diffuse large B-cell lymphomas. The analysis easily distinguished major gene expression patterns without the need for supervision: a germinal center-related, a proliferation, an inflammatory and a plasma cell differentiation-related gene expression pattern. The first three patterns matched the patterns described in the original publication using supervised clustering analysis, whereas the fourth one was novel. CONCLUSIONS: Our study shows that by using SOM as an intermediate step to analyze genome-wide gene expression data, the gene expression patterns can more easily be revealed. The "expression display" by the SOM component plane summarises the complicated data in a way that allows the clinician to evaluate the classification options rather than giving a fixed diagnosis. |
format | Text |
id | pubmed-138792 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2002 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-1387922002-12-19 Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study Wang, Junbai Delabie, Jan Aasheim, Hans Christian Smeland, Erlend Myklebost, Ola BMC Bioinformatics Research article BACKGROUND: A method to evaluate and analyze the massive data generated by series of microarray experiments is of utmost importance to reveal the hidden patterns of gene expression. Because of the complexity and the high dimensionality of microarray gene expression profiles, the dimensional reduction of raw expression data and the feature selections necessary for, for example, classification of disease samples remains a challenge. To solve the problem we propose a two-level analysis. First self-organizing map (SOM) is used. SOM is a vector quantization method that simplifies and reduces the dimensionality of original measurements and visualizes individual tumor sample in a SOM component plane. Next, hierarchical clustering and K-means clustering is used to identify patterns of gene expression useful for classification of samples. RESULTS: We tested the two-level analysis on public data from diffuse large B-cell lymphomas. The analysis easily distinguished major gene expression patterns without the need for supervision: a germinal center-related, a proliferation, an inflammatory and a plasma cell differentiation-related gene expression pattern. The first three patterns matched the patterns described in the original publication using supervised clustering analysis, whereas the fourth one was novel. CONCLUSIONS: Our study shows that by using SOM as an intermediate step to analyze genome-wide gene expression data, the gene expression patterns can more easily be revealed. The "expression display" by the SOM component plane summarises the complicated data in a way that allows the clinician to evaluate the classification options rather than giving a fixed diagnosis. BioMed Central 2002-11-24 /pmc/articles/PMC138792/ /pubmed/12445336 http://dx.doi.org/10.1186/1471-2105-3-36 Text en Copyright ©2002 Wang et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. |
spellingShingle | Research article Wang, Junbai Delabie, Jan Aasheim, Hans Christian Smeland, Erlend Myklebost, Ola Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study |
title | Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study |
title_full | Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study |
title_fullStr | Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study |
title_full_unstemmed | Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study |
title_short | Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study |
title_sort | clustering of the som easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study |
topic | Research article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC138792/ https://www.ncbi.nlm.nih.gov/pubmed/12445336 http://dx.doi.org/10.1186/1471-2105-3-36 |
work_keys_str_mv | AT wangjunbai clusteringofthesomeasilyrevealsdistinctgeneexpressionpatternsresultsofareanalysisoflymphomastudy AT delabiejan clusteringofthesomeasilyrevealsdistinctgeneexpressionpatternsresultsofareanalysisoflymphomastudy AT aasheimhanschristian clusteringofthesomeasilyrevealsdistinctgeneexpressionpatternsresultsofareanalysisoflymphomastudy AT smelanderlend clusteringofthesomeasilyrevealsdistinctgeneexpressionpatternsresultsofareanalysisoflymphomastudy AT myklebostola clusteringofthesomeasilyrevealsdistinctgeneexpressionpatternsresultsofareanalysisoflymphomastudy |