Cargando…

An Integrated  Approach for Cancer Survival Prediction Using Data Mining Techniques

Ovarian cancer is the third most common gynecologic cancers worldwide. Advanced ovarian cancer patients bear a significant mortality rate. Survival estimation is essential for clinicians and patients to understand better and tolerate future outcomes. The present study intends to investigate differen...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaur, Ishleen, Doja, M. N., Ahmad, Tanvir, Ahmad, Musheer, Hussain, Amir, Nadeem, Ahmed, Abd El-Latif, Ahmed A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8727098/
https://www.ncbi.nlm.nih.gov/pubmed/34992648
http://dx.doi.org/10.1155/2021/6342226
_version_ 1784626442275913728
author Kaur, Ishleen
Doja, M. N.
Ahmad, Tanvir
Ahmad, Musheer
Hussain, Amir
Nadeem, Ahmed
Abd El-Latif, Ahmed A.
author_facet Kaur, Ishleen
Doja, M. N.
Ahmad, Tanvir
Ahmad, Musheer
Hussain, Amir
Nadeem, Ahmed
Abd El-Latif, Ahmed A.
author_sort Kaur, Ishleen
collection PubMed
description Ovarian cancer is the third most common gynecologic cancers worldwide. Advanced ovarian cancer patients bear a significant mortality rate. Survival estimation is essential for clinicians and patients to understand better and tolerate future outcomes. The present study intends to investigate different survival predictors available for cancer prognosis using data mining techniques. Dataset of 140 advanced ovarian cancer patients containing data from different data profiles (clinical, treatment, and overall life quality) has been collected and used to foresee cancer patients' survival. Attributes from each data profile have been processed accordingly. Clinical data has been prepared corresponding to missing values and outliers. Treatment data including varying time periods were created using sequence mining techniques to identify the treatments given to the patients. And lastly, different comorbidities were combined into a single factor by computing Charlson Comorbidity Index for each patient. After appropriate preprocessing, the integrated dataset is classified using appropriate machine learning algorithms. The proposed integrated model approach gave the highest accuracy of 76.4% using ensemble technique with sequential pattern mining including time intervals of 2 months between treatments. Thus, the treatment sequences and, most importantly, life quality attributes significantly contribute to the survival prediction of cancer patients.
format Online
Article
Text
id pubmed-8727098
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-87270982022-01-05 An Integrated  Approach for Cancer Survival Prediction Using Data Mining Techniques Kaur, Ishleen Doja, M. N. Ahmad, Tanvir Ahmad, Musheer Hussain, Amir Nadeem, Ahmed Abd El-Latif, Ahmed A. Comput Intell Neurosci Research Article Ovarian cancer is the third most common gynecologic cancers worldwide. Advanced ovarian cancer patients bear a significant mortality rate. Survival estimation is essential for clinicians and patients to understand better and tolerate future outcomes. The present study intends to investigate different survival predictors available for cancer prognosis using data mining techniques. Dataset of 140 advanced ovarian cancer patients containing data from different data profiles (clinical, treatment, and overall life quality) has been collected and used to foresee cancer patients' survival. Attributes from each data profile have been processed accordingly. Clinical data has been prepared corresponding to missing values and outliers. Treatment data including varying time periods were created using sequence mining techniques to identify the treatments given to the patients. And lastly, different comorbidities were combined into a single factor by computing Charlson Comorbidity Index for each patient. After appropriate preprocessing, the integrated dataset is classified using appropriate machine learning algorithms. The proposed integrated model approach gave the highest accuracy of 76.4% using ensemble technique with sequential pattern mining including time intervals of 2 months between treatments. Thus, the treatment sequences and, most importantly, life quality attributes significantly contribute to the survival prediction of cancer patients. Hindawi 2021-12-28 /pmc/articles/PMC8727098/ /pubmed/34992648 http://dx.doi.org/10.1155/2021/6342226 Text en Copyright © 2021 Ishleen Kaur et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kaur, Ishleen
Doja, M. N.
Ahmad, Tanvir
Ahmad, Musheer
Hussain, Amir
Nadeem, Ahmed
Abd El-Latif, Ahmed A.
An Integrated  Approach for Cancer Survival Prediction Using Data Mining Techniques
title An Integrated  Approach for Cancer Survival Prediction Using Data Mining Techniques
title_full An Integrated  Approach for Cancer Survival Prediction Using Data Mining Techniques
title_fullStr An Integrated  Approach for Cancer Survival Prediction Using Data Mining Techniques
title_full_unstemmed An Integrated  Approach for Cancer Survival Prediction Using Data Mining Techniques
title_short An Integrated  Approach for Cancer Survival Prediction Using Data Mining Techniques
title_sort integrated  approach for cancer survival prediction using data mining techniques
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8727098/
https://www.ncbi.nlm.nih.gov/pubmed/34992648
http://dx.doi.org/10.1155/2021/6342226
work_keys_str_mv AT kaurishleen anintegratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT dojamn anintegratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT ahmadtanvir anintegratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT ahmadmusheer anintegratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT hussainamir anintegratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT nadeemahmed anintegratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT abdellatifahmeda anintegratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT kaurishleen integratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT dojamn integratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT ahmadtanvir integratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT ahmadmusheer integratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT hussainamir integratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT nadeemahmed integratedapproachforcancersurvivalpredictionusingdataminingtechniques
AT abdellatifahmeda integratedapproachforcancersurvivalpredictionusingdataminingtechniques