Cargando…

Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data

BACKGROUND: Using DNA microarrays, we have developed two novel models for tumor classification and target gene prediction. First, gene expression profiles are summarized by optimally selected Self-Organizing Maps (SOMs), followed by tumor sample classification by Fuzzy C-means clustering. Then, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Junbai, Bø, Trond Hellem, Jonassen, Inge, Myklebost, Ola, Hovig, Eivind
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2003
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC302113/
https://www.ncbi.nlm.nih.gov/pubmed/14651757
http://dx.doi.org/10.1186/1471-2105-4-60
_version_ 1782121093502337024
author Wang, Junbai
Bø, Trond Hellem
Jonassen, Inge
Myklebost, Ola
Hovig, Eivind
author_facet Wang, Junbai
Bø, Trond Hellem
Jonassen, Inge
Myklebost, Ola
Hovig, Eivind
author_sort Wang, Junbai
collection PubMed
description BACKGROUND: Using DNA microarrays, we have developed two novel models for tumor classification and target gene prediction. First, gene expression profiles are summarized by optimally selected Self-Organizing Maps (SOMs), followed by tumor sample classification by Fuzzy C-means clustering. Then, the prediction of marker genes is accomplished by either manual feature selection (visualizing the weighted/mean SOM component plane) or automatic feature selection (by pair-wise Fisher's linear discriminant). RESULTS: The proposed models were tested on four published datasets: (1) Leukemia (2) Colon cancer (3) Brain tumors and (4) NCI cancer cell lines. The models gave class prediction with markedly reduced error rates compared to other class prediction approaches, and the importance of feature selection on microarray data analysis was also emphasized. CONCLUSIONS: Our models identify marker genes with predictive potential, often better than other available methods in the literature. The models are potentially useful for medical diagnostics and may reveal some insights into cancer classification. Additionally, we illustrated two limitations in tumor classification from microarray data related to the biology underlying the data, in terms of (1) the class size of data, and (2) the internal structure of classes. These limitations are not specific for the classification models used.
format Text
id pubmed-302113
institution National Center for Biotechnology Information
language English
publishDate 2003
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3021132003-12-30 Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data Wang, Junbai Bø, Trond Hellem Jonassen, Inge Myklebost, Ola Hovig, Eivind BMC Bioinformatics Methodology Article BACKGROUND: Using DNA microarrays, we have developed two novel models for tumor classification and target gene prediction. First, gene expression profiles are summarized by optimally selected Self-Organizing Maps (SOMs), followed by tumor sample classification by Fuzzy C-means clustering. Then, the prediction of marker genes is accomplished by either manual feature selection (visualizing the weighted/mean SOM component plane) or automatic feature selection (by pair-wise Fisher's linear discriminant). RESULTS: The proposed models were tested on four published datasets: (1) Leukemia (2) Colon cancer (3) Brain tumors and (4) NCI cancer cell lines. The models gave class prediction with markedly reduced error rates compared to other class prediction approaches, and the importance of feature selection on microarray data analysis was also emphasized. CONCLUSIONS: Our models identify marker genes with predictive potential, often better than other available methods in the literature. The models are potentially useful for medical diagnostics and may reveal some insights into cancer classification. Additionally, we illustrated two limitations in tumor classification from microarray data related to the biology underlying the data, in terms of (1) the class size of data, and (2) the internal structure of classes. These limitations are not specific for the classification models used. BioMed Central 2003-12-02 /pmc/articles/PMC302113/ /pubmed/14651757 http://dx.doi.org/10.1186/1471-2105-4-60 Text en Copyright © 2003 Wang et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.
spellingShingle Methodology Article
Wang, Junbai
Bø, Trond Hellem
Jonassen, Inge
Myklebost, Ola
Hovig, Eivind
Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data
title Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data
title_full Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data
title_fullStr Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data
title_full_unstemmed Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data
title_short Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data
title_sort tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC302113/
https://www.ncbi.nlm.nih.gov/pubmed/14651757
http://dx.doi.org/10.1186/1471-2105-4-60
work_keys_str_mv AT wangjunbai tumorclassificationandmarkergenepredictionbyfeatureselectionandfuzzycmeansclusteringusingmicroarraydata
AT bøtrondhellem tumorclassificationandmarkergenepredictionbyfeatureselectionandfuzzycmeansclusteringusingmicroarraydata
AT jonasseninge tumorclassificationandmarkergenepredictionbyfeatureselectionandfuzzycmeansclusteringusingmicroarraydata
AT myklebostola tumorclassificationandmarkergenepredictionbyfeatureselectionandfuzzycmeansclusteringusingmicroarraydata
AT hovigeivind tumorclassificationandmarkergenepredictionbyfeatureselectionandfuzzycmeansclusteringusingmicroarraydata