Cargando…

Predicting Colorectal Cancer Recurrence and Patient Survival Using Supervised Machine Learning Approach: A South African Population-Based Study

Background: South Africa (SA) has the highest incidence of colorectal cancer (CRC) in Sub-Saharan Africa (SSA). However, there is limited research on CRC recurrence and survival in SA. CRC recurrence and overall survival are highly variable across studies. Accurate prediction of patients at risk can...

Descripción completa

Detalles Bibliográficos
Autores principales: Achilonu, Okechinyere J., Fabian, June, Bebington, Brendan, Singh, Elvira, Nimako, Gideon, Eijkemans, M. J. C., Musenge, Eustasius
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8292767/
https://www.ncbi.nlm.nih.gov/pubmed/34307286
http://dx.doi.org/10.3389/fpubh.2021.694306
_version_ 1783724890349633536
author Achilonu, Okechinyere J.
Fabian, June
Bebington, Brendan
Singh, Elvira
Nimako, Gideon
Eijkemans, M. J. C.
Musenge, Eustasius
author_facet Achilonu, Okechinyere J.
Fabian, June
Bebington, Brendan
Singh, Elvira
Nimako, Gideon
Eijkemans, M. J. C.
Musenge, Eustasius
author_sort Achilonu, Okechinyere J.
collection PubMed
description Background: South Africa (SA) has the highest incidence of colorectal cancer (CRC) in Sub-Saharan Africa (SSA). However, there is limited research on CRC recurrence and survival in SA. CRC recurrence and overall survival are highly variable across studies. Accurate prediction of patients at risk can enhance clinical expectations and decisions within the South African CRC patients population. We explored the feasibility of integrating statistical and machine learning (ML) algorithms to achieve higher predictive performance and interpretability in findings. Methods: We selected and compared six algorithms:- logistic regression (LR), naïve Bayes (NB), C5.0, random forest (RF), support vector machine (SVM) and artificial neural network (ANN). Commonly selected features based on OneR and information gain, within 10-fold cross-validation, were used for model development. The validity and stability of the predictive models were further assessed using simulated datasets. Results: The six algorithms achieved high discriminative accuracies (AUC-ROC). ANN achieved the highest AUC-ROC for recurrence (87.0%) and survival (82.0%), and other models showed comparable performance with ANN. We observed no statistical difference in the performance of the models. Features including radiological stage and patient's age, histology, and race are risk factors of CRC recurrence and patient survival, respectively. Conclusions: Based on other studies and what is known in the field, we have affirmed important predictive factors for recurrence and survival using rigorous procedures. Outcomes of this study can be generalised to CRC patient population elsewhere in SA and other SSA countries with similar patient profiles.
format Online
Article
Text
id pubmed-8292767
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-82927672021-07-22 Predicting Colorectal Cancer Recurrence and Patient Survival Using Supervised Machine Learning Approach: A South African Population-Based Study Achilonu, Okechinyere J. Fabian, June Bebington, Brendan Singh, Elvira Nimako, Gideon Eijkemans, M. J. C. Musenge, Eustasius Front Public Health Public Health Background: South Africa (SA) has the highest incidence of colorectal cancer (CRC) in Sub-Saharan Africa (SSA). However, there is limited research on CRC recurrence and survival in SA. CRC recurrence and overall survival are highly variable across studies. Accurate prediction of patients at risk can enhance clinical expectations and decisions within the South African CRC patients population. We explored the feasibility of integrating statistical and machine learning (ML) algorithms to achieve higher predictive performance and interpretability in findings. Methods: We selected and compared six algorithms:- logistic regression (LR), naïve Bayes (NB), C5.0, random forest (RF), support vector machine (SVM) and artificial neural network (ANN). Commonly selected features based on OneR and information gain, within 10-fold cross-validation, were used for model development. The validity and stability of the predictive models were further assessed using simulated datasets. Results: The six algorithms achieved high discriminative accuracies (AUC-ROC). ANN achieved the highest AUC-ROC for recurrence (87.0%) and survival (82.0%), and other models showed comparable performance with ANN. We observed no statistical difference in the performance of the models. Features including radiological stage and patient's age, histology, and race are risk factors of CRC recurrence and patient survival, respectively. Conclusions: Based on other studies and what is known in the field, we have affirmed important predictive factors for recurrence and survival using rigorous procedures. Outcomes of this study can be generalised to CRC patient population elsewhere in SA and other SSA countries with similar patient profiles. Frontiers Media S.A. 2021-07-07 /pmc/articles/PMC8292767/ /pubmed/34307286 http://dx.doi.org/10.3389/fpubh.2021.694306 Text en Copyright © 2021 Achilonu, Fabian, Bebington, Singh, Nimako, Eijkemans and Musenge. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Achilonu, Okechinyere J.
Fabian, June
Bebington, Brendan
Singh, Elvira
Nimako, Gideon
Eijkemans, M. J. C.
Musenge, Eustasius
Predicting Colorectal Cancer Recurrence and Patient Survival Using Supervised Machine Learning Approach: A South African Population-Based Study
title Predicting Colorectal Cancer Recurrence and Patient Survival Using Supervised Machine Learning Approach: A South African Population-Based Study
title_full Predicting Colorectal Cancer Recurrence and Patient Survival Using Supervised Machine Learning Approach: A South African Population-Based Study
title_fullStr Predicting Colorectal Cancer Recurrence and Patient Survival Using Supervised Machine Learning Approach: A South African Population-Based Study
title_full_unstemmed Predicting Colorectal Cancer Recurrence and Patient Survival Using Supervised Machine Learning Approach: A South African Population-Based Study
title_short Predicting Colorectal Cancer Recurrence and Patient Survival Using Supervised Machine Learning Approach: A South African Population-Based Study
title_sort predicting colorectal cancer recurrence and patient survival using supervised machine learning approach: a south african population-based study
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8292767/
https://www.ncbi.nlm.nih.gov/pubmed/34307286
http://dx.doi.org/10.3389/fpubh.2021.694306
work_keys_str_mv AT achilonuokechinyerej predictingcolorectalcancerrecurrenceandpatientsurvivalusingsupervisedmachinelearningapproachasouthafricanpopulationbasedstudy
AT fabianjune predictingcolorectalcancerrecurrenceandpatientsurvivalusingsupervisedmachinelearningapproachasouthafricanpopulationbasedstudy
AT bebingtonbrendan predictingcolorectalcancerrecurrenceandpatientsurvivalusingsupervisedmachinelearningapproachasouthafricanpopulationbasedstudy
AT singhelvira predictingcolorectalcancerrecurrenceandpatientsurvivalusingsupervisedmachinelearningapproachasouthafricanpopulationbasedstudy
AT nimakogideon predictingcolorectalcancerrecurrenceandpatientsurvivalusingsupervisedmachinelearningapproachasouthafricanpopulationbasedstudy
AT eijkemansmjc predictingcolorectalcancerrecurrenceandpatientsurvivalusingsupervisedmachinelearningapproachasouthafricanpopulationbasedstudy
AT musengeeustasius predictingcolorectalcancerrecurrenceandpatientsurvivalusingsupervisedmachinelearningapproachasouthafricanpopulationbasedstudy