Cargando…

Tumor classification ranking from microarray data

BACKGROUND: Gene expression profiles based on microarray data are recognized as potential diagnostic indices of cancer. Molecular tumor classifications resulted from these data and learning algorithms have advanced our understanding of genetic changes associated with cancer etiology and development....

Descripción completa

Detalles Bibliográficos
Autores principales:	Hewett, Rattikorn, Kijsanayothin, Phongphun
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2008
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2559886/ https://www.ncbi.nlm.nih.gov/pubmed/18831787 http://dx.doi.org/10.1186/1471-2164-9-S2-S21

_version_	1782159686651346944
author	Hewett, Rattikorn Kijsanayothin, Phongphun
author_facet	Hewett, Rattikorn Kijsanayothin, Phongphun
author_sort	Hewett, Rattikorn
collection	PubMed
description	BACKGROUND: Gene expression profiles based on microarray data are recognized as potential diagnostic indices of cancer. Molecular tumor classifications resulted from these data and learning algorithms have advanced our understanding of genetic changes associated with cancer etiology and development. However, classifications are not always perfect and in such cases the classification rankings (likelihoods of correct class predictions) can be useful for directing further research (e.g., by deriving inferences about predictive indicators or prioritizing future experiments). Classification ranking is a challenging problem, particularly for microarray data, where there is a huge number of possible regulated genes with no known rating function. This study investigates the possibility of making tumor classification more informative by using a method for classification ranking that requires no additional ranking analysis and maintains relatively good classification accuracy. RESULTS: Microarray data of 11 different types and subtypes of cancer were analyzed using MDR (Multi-Dimensional Ranker), a recently developed boosting-based ranking algorithm. The number of predictor genes in all of the resulting classification models was at most nine, a huge reduction from the more than 12 thousands genes in the majority of the expression samples. Compared to several other learning algorithms, MDR gives the greatest AUC (area under the ROC curve) for the classifications of prostate cancer, acute lymphoblastic leukemia (ALL) and four ALL subtypes: BCR-ABL, E2A-PBX1, MALL and TALL. SVM (Support Vector Machine) gives the highest AUC for the classifications of lung, lymphoma, and breast cancers, and two ALL subtypes: Hyperdiploid > 50 and TEL-AML1. MDR gives highly competitive results, producing the highest average AUC, 91.01%, and an average overall accuracy of 90.01% for cancer expression analysis. CONCLUSION: Using the classification rankings from MDR is a simple technique for obtaining effective and informative tumor classifications from cancer gene expression data. Further interpretation of the results obtained from MDR is required. MDR can also be used directly as a simple feature selection mechanism to identify genes relevant to tumor classification. MDR may be applicable to many other classification problems for microarray data.
format	Text
id	pubmed-2559886
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-25598862008-10-04 Tumor classification ranking from microarray data Hewett, Rattikorn Kijsanayothin, Phongphun BMC Genomics Research BACKGROUND: Gene expression profiles based on microarray data are recognized as potential diagnostic indices of cancer. Molecular tumor classifications resulted from these data and learning algorithms have advanced our understanding of genetic changes associated with cancer etiology and development. However, classifications are not always perfect and in such cases the classification rankings (likelihoods of correct class predictions) can be useful for directing further research (e.g., by deriving inferences about predictive indicators or prioritizing future experiments). Classification ranking is a challenging problem, particularly for microarray data, where there is a huge number of possible regulated genes with no known rating function. This study investigates the possibility of making tumor classification more informative by using a method for classification ranking that requires no additional ranking analysis and maintains relatively good classification accuracy. RESULTS: Microarray data of 11 different types and subtypes of cancer were analyzed using MDR (Multi-Dimensional Ranker), a recently developed boosting-based ranking algorithm. The number of predictor genes in all of the resulting classification models was at most nine, a huge reduction from the more than 12 thousands genes in the majority of the expression samples. Compared to several other learning algorithms, MDR gives the greatest AUC (area under the ROC curve) for the classifications of prostate cancer, acute lymphoblastic leukemia (ALL) and four ALL subtypes: BCR-ABL, E2A-PBX1, MALL and TALL. SVM (Support Vector Machine) gives the highest AUC for the classifications of lung, lymphoma, and breast cancers, and two ALL subtypes: Hyperdiploid > 50 and TEL-AML1. MDR gives highly competitive results, producing the highest average AUC, 91.01%, and an average overall accuracy of 90.01% for cancer expression analysis. CONCLUSION: Using the classification rankings from MDR is a simple technique for obtaining effective and informative tumor classifications from cancer gene expression data. Further interpretation of the results obtained from MDR is required. MDR can also be used directly as a simple feature selection mechanism to identify genes relevant to tumor classification. MDR may be applicable to many other classification problems for microarray data. BioMed Central 2008-09-16 /pmc/articles/PMC2559886/ /pubmed/18831787 http://dx.doi.org/10.1186/1471-2164-9-S2-S21 Text en Copyright © 2008 Hewett and Kijsanayothin; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Hewett, Rattikorn Kijsanayothin, Phongphun Tumor classification ranking from microarray data
title	Tumor classification ranking from microarray data
title_full	Tumor classification ranking from microarray data
title_fullStr	Tumor classification ranking from microarray data
title_full_unstemmed	Tumor classification ranking from microarray data
title_short	Tumor classification ranking from microarray data
title_sort	tumor classification ranking from microarray data
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2559886/ https://www.ncbi.nlm.nih.gov/pubmed/18831787 http://dx.doi.org/10.1186/1471-2164-9-S2-S21
work_keys_str_mv	AT hewettrattikorn tumorclassificationrankingfrommicroarraydata AT kijsanayothinphongphun tumorclassificationrankingfrommicroarraydata

Tumor classification ranking from microarray data

Ejemplares similares