Cargando…
Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data
BACKGROUND: Using DNA microarrays, we have developed two novel models for tumor classification and target gene prediction. First, gene expression profiles are summarized by optimally selected Self-Organizing Maps (SOMs), followed by tumor sample classification by Fuzzy C-means clustering. Then, the...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2003
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC302113/ https://www.ncbi.nlm.nih.gov/pubmed/14651757 http://dx.doi.org/10.1186/1471-2105-4-60 |
_version_ | 1782121093502337024 |
---|---|
author | Wang, Junbai Bø, Trond Hellem Jonassen, Inge Myklebost, Ola Hovig, Eivind |
author_facet | Wang, Junbai Bø, Trond Hellem Jonassen, Inge Myklebost, Ola Hovig, Eivind |
author_sort | Wang, Junbai |
collection | PubMed |
description | BACKGROUND: Using DNA microarrays, we have developed two novel models for tumor classification and target gene prediction. First, gene expression profiles are summarized by optimally selected Self-Organizing Maps (SOMs), followed by tumor sample classification by Fuzzy C-means clustering. Then, the prediction of marker genes is accomplished by either manual feature selection (visualizing the weighted/mean SOM component plane) or automatic feature selection (by pair-wise Fisher's linear discriminant). RESULTS: The proposed models were tested on four published datasets: (1) Leukemia (2) Colon cancer (3) Brain tumors and (4) NCI cancer cell lines. The models gave class prediction with markedly reduced error rates compared to other class prediction approaches, and the importance of feature selection on microarray data analysis was also emphasized. CONCLUSIONS: Our models identify marker genes with predictive potential, often better than other available methods in the literature. The models are potentially useful for medical diagnostics and may reveal some insights into cancer classification. Additionally, we illustrated two limitations in tumor classification from microarray data related to the biology underlying the data, in terms of (1) the class size of data, and (2) the internal structure of classes. These limitations are not specific for the classification models used. |
format | Text |
id | pubmed-302113 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2003 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-3021132003-12-30 Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data Wang, Junbai Bø, Trond Hellem Jonassen, Inge Myklebost, Ola Hovig, Eivind BMC Bioinformatics Methodology Article BACKGROUND: Using DNA microarrays, we have developed two novel models for tumor classification and target gene prediction. First, gene expression profiles are summarized by optimally selected Self-Organizing Maps (SOMs), followed by tumor sample classification by Fuzzy C-means clustering. Then, the prediction of marker genes is accomplished by either manual feature selection (visualizing the weighted/mean SOM component plane) or automatic feature selection (by pair-wise Fisher's linear discriminant). RESULTS: The proposed models were tested on four published datasets: (1) Leukemia (2) Colon cancer (3) Brain tumors and (4) NCI cancer cell lines. The models gave class prediction with markedly reduced error rates compared to other class prediction approaches, and the importance of feature selection on microarray data analysis was also emphasized. CONCLUSIONS: Our models identify marker genes with predictive potential, often better than other available methods in the literature. The models are potentially useful for medical diagnostics and may reveal some insights into cancer classification. Additionally, we illustrated two limitations in tumor classification from microarray data related to the biology underlying the data, in terms of (1) the class size of data, and (2) the internal structure of classes. These limitations are not specific for the classification models used. BioMed Central 2003-12-02 /pmc/articles/PMC302113/ /pubmed/14651757 http://dx.doi.org/10.1186/1471-2105-4-60 Text en Copyright © 2003 Wang et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL. |
spellingShingle | Methodology Article Wang, Junbai Bø, Trond Hellem Jonassen, Inge Myklebost, Ola Hovig, Eivind Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data |
title | Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data |
title_full | Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data |
title_fullStr | Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data |
title_full_unstemmed | Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data |
title_short | Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data |
title_sort | tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC302113/ https://www.ncbi.nlm.nih.gov/pubmed/14651757 http://dx.doi.org/10.1186/1471-2105-4-60 |
work_keys_str_mv | AT wangjunbai tumorclassificationandmarkergenepredictionbyfeatureselectionandfuzzycmeansclusteringusingmicroarraydata AT bøtrondhellem tumorclassificationandmarkergenepredictionbyfeatureselectionandfuzzycmeansclusteringusingmicroarraydata AT jonasseninge tumorclassificationandmarkergenepredictionbyfeatureselectionandfuzzycmeansclusteringusingmicroarraydata AT myklebostola tumorclassificationandmarkergenepredictionbyfeatureselectionandfuzzycmeansclusteringusingmicroarraydata AT hovigeivind tumorclassificationandmarkergenepredictionbyfeatureselectionandfuzzycmeansclusteringusingmicroarraydata |