Cargando…

Optimization Based Tumor Classification from Microarray Gene Expression Data

BACKGROUND: An important use of data obtained from microarray measurements is the classification of tumor types with respect to genes that are either up or down regulated in specific cancer types. A number of algorithms have been proposed to obtain such classifications. These algorithms usually requ...

Descripción completa

Detalles Bibliográficos
Autores principales: Dagliyan, Onur, Uney-Yuksektepe, Fadime, Kavakli, I. Halil, Turkay, Metin
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3033885/
https://www.ncbi.nlm.nih.gov/pubmed/21326602
http://dx.doi.org/10.1371/journal.pone.0014579
_version_ 1782197624856641536
author Dagliyan, Onur
Uney-Yuksektepe, Fadime
Kavakli, I. Halil
Turkay, Metin
author_facet Dagliyan, Onur
Uney-Yuksektepe, Fadime
Kavakli, I. Halil
Turkay, Metin
author_sort Dagliyan, Onur
collection PubMed
description BACKGROUND: An important use of data obtained from microarray measurements is the classification of tumor types with respect to genes that are either up or down regulated in specific cancer types. A number of algorithms have been proposed to obtain such classifications. These algorithms usually require parameter optimization to obtain accurate results depending on the type of data. Additionally, it is highly critical to find an optimal set of markers among those up or down regulated genes that can be clinically utilized to build assays for the diagnosis or to follow progression of specific cancer types. In this paper, we employ a mixed integer programming based classification algorithm named hyper-box enclosure method (HBE) for the classification of some cancer types with a minimal set of predictor genes. This optimization based method which is a user friendly and efficient classifier may allow the clinicians to diagnose and follow progression of certain cancer types. METHODOLOGY/PRINCIPAL FINDINGS: We apply HBE algorithm to some well known data sets such as leukemia, prostate cancer, diffuse large B-cell lymphoma (DLBCL), small round blue cell tumors (SRBCT) to find some predictor genes that can be utilized for diagnosis and prognosis in a robust manner with a high accuracy. Our approach does not require any modification or parameter optimization for each data set. Additionally, information gain attribute evaluator, relief attribute evaluator and correlation-based feature selection methods are employed for the gene selection. The results are compared with those from other studies and biological roles of selected genes in corresponding cancer type are described. CONCLUSIONS/SIGNIFICANCE: The performance of our algorithm overall was better than the other algorithms reported in the literature and classifiers found in WEKA data-mining package. Since it does not require a parameter optimization and it performs consistently very high prediction rate on different type of data sets, HBE method is an effective and consistent tool for cancer type prediction with a small number of gene markers.
format Text
id pubmed-3033885
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30338852011-02-15 Optimization Based Tumor Classification from Microarray Gene Expression Data Dagliyan, Onur Uney-Yuksektepe, Fadime Kavakli, I. Halil Turkay, Metin PLoS One Research Article BACKGROUND: An important use of data obtained from microarray measurements is the classification of tumor types with respect to genes that are either up or down regulated in specific cancer types. A number of algorithms have been proposed to obtain such classifications. These algorithms usually require parameter optimization to obtain accurate results depending on the type of data. Additionally, it is highly critical to find an optimal set of markers among those up or down regulated genes that can be clinically utilized to build assays for the diagnosis or to follow progression of specific cancer types. In this paper, we employ a mixed integer programming based classification algorithm named hyper-box enclosure method (HBE) for the classification of some cancer types with a minimal set of predictor genes. This optimization based method which is a user friendly and efficient classifier may allow the clinicians to diagnose and follow progression of certain cancer types. METHODOLOGY/PRINCIPAL FINDINGS: We apply HBE algorithm to some well known data sets such as leukemia, prostate cancer, diffuse large B-cell lymphoma (DLBCL), small round blue cell tumors (SRBCT) to find some predictor genes that can be utilized for diagnosis and prognosis in a robust manner with a high accuracy. Our approach does not require any modification or parameter optimization for each data set. Additionally, information gain attribute evaluator, relief attribute evaluator and correlation-based feature selection methods are employed for the gene selection. The results are compared with those from other studies and biological roles of selected genes in corresponding cancer type are described. CONCLUSIONS/SIGNIFICANCE: The performance of our algorithm overall was better than the other algorithms reported in the literature and classifiers found in WEKA data-mining package. Since it does not require a parameter optimization and it performs consistently very high prediction rate on different type of data sets, HBE method is an effective and consistent tool for cancer type prediction with a small number of gene markers. Public Library of Science 2011-02-04 /pmc/articles/PMC3033885/ /pubmed/21326602 http://dx.doi.org/10.1371/journal.pone.0014579 Text en Dagliyan et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Dagliyan, Onur
Uney-Yuksektepe, Fadime
Kavakli, I. Halil
Turkay, Metin
Optimization Based Tumor Classification from Microarray Gene Expression Data
title Optimization Based Tumor Classification from Microarray Gene Expression Data
title_full Optimization Based Tumor Classification from Microarray Gene Expression Data
title_fullStr Optimization Based Tumor Classification from Microarray Gene Expression Data
title_full_unstemmed Optimization Based Tumor Classification from Microarray Gene Expression Data
title_short Optimization Based Tumor Classification from Microarray Gene Expression Data
title_sort optimization based tumor classification from microarray gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3033885/
https://www.ncbi.nlm.nih.gov/pubmed/21326602
http://dx.doi.org/10.1371/journal.pone.0014579
work_keys_str_mv AT dagliyanonur optimizationbasedtumorclassificationfrommicroarraygeneexpressiondata
AT uneyyuksektepefadime optimizationbasedtumorclassificationfrommicroarraygeneexpressiondata
AT kavakliihalil optimizationbasedtumorclassificationfrommicroarraygeneexpressiondata
AT turkaymetin optimizationbasedtumorclassificationfrommicroarraygeneexpressiondata