Cargando…

A genetic programming approach to oral cancer prognosis

BACKGROUND: The potential of genetic programming (GP) on various fields has been attained in recent years. In bio-medical field, many researches in GP are focused on the recognition of cancerous cells and also on gene expression profiling data. In this research, the aim is to study the performance o...

Descripción completa

Detalles Bibliográficos
Autores principales: Tan, Mei Sze, Tan, Jing Wei, Chang, Siow-Wee, Yap, Hwa Jen, Abdul Kareem, Sameem, Zain, Rosnah Binti
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5036111/
https://www.ncbi.nlm.nih.gov/pubmed/27688975
http://dx.doi.org/10.7717/peerj.2482
_version_ 1782455498017079296
author Tan, Mei Sze
Tan, Jing Wei
Chang, Siow-Wee
Yap, Hwa Jen
Abdul Kareem, Sameem
Zain, Rosnah Binti
author_facet Tan, Mei Sze
Tan, Jing Wei
Chang, Siow-Wee
Yap, Hwa Jen
Abdul Kareem, Sameem
Zain, Rosnah Binti
author_sort Tan, Mei Sze
collection PubMed
description BACKGROUND: The potential of genetic programming (GP) on various fields has been attained in recent years. In bio-medical field, many researches in GP are focused on the recognition of cancerous cells and also on gene expression profiling data. In this research, the aim is to study the performance of GP on the survival prediction of a small sample size of oral cancer prognosis dataset, which is the first study in the field of oral cancer prognosis. METHOD: GP is applied on an oral cancer dataset that contains 31 cases collected from the Malaysia Oral Cancer Database and Tissue Bank System (MOCDTBS). The feature subsets that is automatically selected through GP were noted and the influences of this subset on the results of GP were recorded. In addition, a comparison between the GP performance and that of the Support Vector Machine (SVM) and logistic regression (LR) are also done in order to verify the predictive capabilities of the GP. RESULT: The result shows that GP performed the best (average accuracy of 83.87% and average AUROC of 0.8341) when the features selected are smoking, drinking, chewing, histological differentiation of SCC, and oncogene p63. In addition, based on the comparison results, we found that the GP outperformed the SVM and LR in oral cancer prognosis. DISCUSSION: Some of the features in the dataset are found to be statistically co-related. This is because the accuracy of the GP prediction drops when one of the feature in the best feature subset is excluded. Thus, GP provides an automatic feature selection function, which chooses features that are highly correlated to the prognosis of oral cancer. This makes GP an ideal prediction model for cancer clinical and genomic data that can be used to aid physicians in their decision making stage of diagnosis or prognosis.
format Online
Article
Text
id pubmed-5036111
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-50361112016-09-29 A genetic programming approach to oral cancer prognosis Tan, Mei Sze Tan, Jing Wei Chang, Siow-Wee Yap, Hwa Jen Abdul Kareem, Sameem Zain, Rosnah Binti PeerJ Computational Biology BACKGROUND: The potential of genetic programming (GP) on various fields has been attained in recent years. In bio-medical field, many researches in GP are focused on the recognition of cancerous cells and also on gene expression profiling data. In this research, the aim is to study the performance of GP on the survival prediction of a small sample size of oral cancer prognosis dataset, which is the first study in the field of oral cancer prognosis. METHOD: GP is applied on an oral cancer dataset that contains 31 cases collected from the Malaysia Oral Cancer Database and Tissue Bank System (MOCDTBS). The feature subsets that is automatically selected through GP were noted and the influences of this subset on the results of GP were recorded. In addition, a comparison between the GP performance and that of the Support Vector Machine (SVM) and logistic regression (LR) are also done in order to verify the predictive capabilities of the GP. RESULT: The result shows that GP performed the best (average accuracy of 83.87% and average AUROC of 0.8341) when the features selected are smoking, drinking, chewing, histological differentiation of SCC, and oncogene p63. In addition, based on the comparison results, we found that the GP outperformed the SVM and LR in oral cancer prognosis. DISCUSSION: Some of the features in the dataset are found to be statistically co-related. This is because the accuracy of the GP prediction drops when one of the feature in the best feature subset is excluded. Thus, GP provides an automatic feature selection function, which chooses features that are highly correlated to the prognosis of oral cancer. This makes GP an ideal prediction model for cancer clinical and genomic data that can be used to aid physicians in their decision making stage of diagnosis or prognosis. PeerJ Inc. 2016-09-21 /pmc/articles/PMC5036111/ /pubmed/27688975 http://dx.doi.org/10.7717/peerj.2482 Text en ©2016 Tan et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Computational Biology
Tan, Mei Sze
Tan, Jing Wei
Chang, Siow-Wee
Yap, Hwa Jen
Abdul Kareem, Sameem
Zain, Rosnah Binti
A genetic programming approach to oral cancer prognosis
title A genetic programming approach to oral cancer prognosis
title_full A genetic programming approach to oral cancer prognosis
title_fullStr A genetic programming approach to oral cancer prognosis
title_full_unstemmed A genetic programming approach to oral cancer prognosis
title_short A genetic programming approach to oral cancer prognosis
title_sort genetic programming approach to oral cancer prognosis
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5036111/
https://www.ncbi.nlm.nih.gov/pubmed/27688975
http://dx.doi.org/10.7717/peerj.2482
work_keys_str_mv AT tanmeisze ageneticprogrammingapproachtooralcancerprognosis
AT tanjingwei ageneticprogrammingapproachtooralcancerprognosis
AT changsiowwee ageneticprogrammingapproachtooralcancerprognosis
AT yaphwajen ageneticprogrammingapproachtooralcancerprognosis
AT abdulkareemsameem ageneticprogrammingapproachtooralcancerprognosis
AT zainrosnahbinti ageneticprogrammingapproachtooralcancerprognosis
AT tanmeisze geneticprogrammingapproachtooralcancerprognosis
AT tanjingwei geneticprogrammingapproachtooralcancerprognosis
AT changsiowwee geneticprogrammingapproachtooralcancerprognosis
AT yaphwajen geneticprogrammingapproachtooralcancerprognosis
AT abdulkareemsameem geneticprogrammingapproachtooralcancerprognosis
AT zainrosnahbinti geneticprogrammingapproachtooralcancerprognosis