Cargando…

Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study

BACKGROUND: A method to evaluate and analyze the massive data generated by series of microarray experiments is of utmost importance to reveal the hidden patterns of gene expression. Because of the complexity and the high dimensionality of microarray gene expression profiles, the dimensional reductio...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Junbai, Delabie, Jan, Aasheim, Hans Christian, Smeland, Erlend, Myklebost, Ola
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2002
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC138792/
https://www.ncbi.nlm.nih.gov/pubmed/12445336
http://dx.doi.org/10.1186/1471-2105-3-36
_version_ 1782120494040875008
author Wang, Junbai
Delabie, Jan
Aasheim, Hans Christian
Smeland, Erlend
Myklebost, Ola
author_facet Wang, Junbai
Delabie, Jan
Aasheim, Hans Christian
Smeland, Erlend
Myklebost, Ola
author_sort Wang, Junbai
collection PubMed
description BACKGROUND: A method to evaluate and analyze the massive data generated by series of microarray experiments is of utmost importance to reveal the hidden patterns of gene expression. Because of the complexity and the high dimensionality of microarray gene expression profiles, the dimensional reduction of raw expression data and the feature selections necessary for, for example, classification of disease samples remains a challenge. To solve the problem we propose a two-level analysis. First self-organizing map (SOM) is used. SOM is a vector quantization method that simplifies and reduces the dimensionality of original measurements and visualizes individual tumor sample in a SOM component plane. Next, hierarchical clustering and K-means clustering is used to identify patterns of gene expression useful for classification of samples. RESULTS: We tested the two-level analysis on public data from diffuse large B-cell lymphomas. The analysis easily distinguished major gene expression patterns without the need for supervision: a germinal center-related, a proliferation, an inflammatory and a plasma cell differentiation-related gene expression pattern. The first three patterns matched the patterns described in the original publication using supervised clustering analysis, whereas the fourth one was novel. CONCLUSIONS: Our study shows that by using SOM as an intermediate step to analyze genome-wide gene expression data, the gene expression patterns can more easily be revealed. The "expression display" by the SOM component plane summarises the complicated data in a way that allows the clinician to evaluate the classification options rather than giving a fixed diagnosis.
format Text
id pubmed-138792
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1387922002-12-19 Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study Wang, Junbai Delabie, Jan Aasheim, Hans Christian Smeland, Erlend Myklebost, Ola BMC Bioinformatics Research article BACKGROUND: A method to evaluate and analyze the massive data generated by series of microarray experiments is of utmost importance to reveal the hidden patterns of gene expression. Because of the complexity and the high dimensionality of microarray gene expression profiles, the dimensional reduction of raw expression data and the feature selections necessary for, for example, classification of disease samples remains a challenge. To solve the problem we propose a two-level analysis. First self-organizing map (SOM) is used. SOM is a vector quantization method that simplifies and reduces the dimensionality of original measurements and visualizes individual tumor sample in a SOM component plane. Next, hierarchical clustering and K-means clustering is used to identify patterns of gene expression useful for classification of samples. RESULTS: We tested the two-level analysis on public data from diffuse large B-cell lymphomas. The analysis easily distinguished major gene expression patterns without the need for supervision: a germinal center-related, a proliferation, an inflammatory and a plasma cell differentiation-related gene expression pattern. The first three patterns matched the patterns described in the original publication using supervised clustering analysis, whereas the fourth one was novel. CONCLUSIONS: Our study shows that by using SOM as an intermediate step to analyze genome-wide gene expression data, the gene expression patterns can more easily be revealed. The "expression display" by the SOM component plane summarises the complicated data in a way that allows the clinician to evaluate the classification options rather than giving a fixed diagnosis. BioMed Central 2002-11-24 /pmc/articles/PMC138792/ /pubmed/12445336 http://dx.doi.org/10.1186/1471-2105-3-36 Text en Copyright ©2002 Wang et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Research article
Wang, Junbai
Delabie, Jan
Aasheim, Hans Christian
Smeland, Erlend
Myklebost, Ola
Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study
title Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study
title_full Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study
title_fullStr Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study
title_full_unstemmed Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study
title_short Clustering of the SOM easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study
title_sort clustering of the som easily reveals distinct gene expression patterns: results of a reanalysis of lymphoma study
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC138792/
https://www.ncbi.nlm.nih.gov/pubmed/12445336
http://dx.doi.org/10.1186/1471-2105-3-36
work_keys_str_mv AT wangjunbai clusteringofthesomeasilyrevealsdistinctgeneexpressionpatternsresultsofareanalysisoflymphomastudy
AT delabiejan clusteringofthesomeasilyrevealsdistinctgeneexpressionpatternsresultsofareanalysisoflymphomastudy
AT aasheimhanschristian clusteringofthesomeasilyrevealsdistinctgeneexpressionpatternsresultsofareanalysisoflymphomastudy
AT smelanderlend clusteringofthesomeasilyrevealsdistinctgeneexpressionpatternsresultsofareanalysisoflymphomastudy
AT myklebostola clusteringofthesomeasilyrevealsdistinctgeneexpressionpatternsresultsofareanalysisoflymphomastudy