Cargando…

Cervical cancer detection using K nearest neighbor imputer and stacked ensemble learningmodel

OBJECTIVE: Cervical cancer stands as a leading cause of mortality among women in developing nations. To ensure the reduction of its adverse consequences, the primary protocols to be adhered to involve early detection and treatment under the guidance of expert medical professionals. An effective appr...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Xiaoyuan, Aljrees, Turki, Umer, Muhammad, Saidani, Oumaima, Almuqren, Latifah, Mzoughi, Olfa, Ishaq, Abid, Ashraf, Imran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10548812/
https://www.ncbi.nlm.nih.gov/pubmed/37799501
http://dx.doi.org/10.1177/20552076231203802
_version_ 1785115354559676416
author Chen, Xiaoyuan
Aljrees, Turki
Umer, Muhammad
Saidani, Oumaima
Almuqren, Latifah
Mzoughi, Olfa
Ishaq, Abid
Ashraf, Imran
author_facet Chen, Xiaoyuan
Aljrees, Turki
Umer, Muhammad
Saidani, Oumaima
Almuqren, Latifah
Mzoughi, Olfa
Ishaq, Abid
Ashraf, Imran
author_sort Chen, Xiaoyuan
collection PubMed
description OBJECTIVE: Cervical cancer stands as a leading cause of mortality among women in developing nations. To ensure the reduction of its adverse consequences, the primary protocols to be adhered to involve early detection and treatment under the guidance of expert medical professionals. An effective approach for identifying this form of malignancy involves the examination of Pap smear images. However, in the context of automating cervical cancer detection, many of the existing datasets frequently exhibit missing data points, a factor that can substantially impact the effectiveness of machine learning models. METHODS: In response to these hurdles, this research introduces an automated system designed to predict cervical cancer with a dual focus: adeptly managing missing data while attaining remarkable accuracy. The system's core is built upon a stacked ensemble voting classifier model, which amalgamates three distinct machine learning models, all harmoniously integrated with the KNN Imputer to address the issue of missing values. RESULTS: The model put forth attains an accuracy of 99.41%, precision of 97.63%, recall of 95.96%, and an F1 score of 96.76% when incorporating the KNN imputation method. The investigation conducts a comparative analysis, contrasting the performance of this model with seven alternative machine learning algorithms in two scenarios: one where missing values are eliminated, and another employing KNN imputation. This study offers validation of the effectiveness of the proposed model in comparison to current state-of-the-art methodologies. CONCLUSIONS: This research delves into the challenge of handling missing data in the dataset utilized for cervical cancer detection. The findings have the potential to assist healthcare professionals in achieving early detection and enhancing the quality of care provided to individuals affected by cervical cancer.
format Online
Article
Text
id pubmed-10548812
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-105488122023-10-05 Cervical cancer detection using K nearest neighbor imputer and stacked ensemble learningmodel Chen, Xiaoyuan Aljrees, Turki Umer, Muhammad Saidani, Oumaima Almuqren, Latifah Mzoughi, Olfa Ishaq, Abid Ashraf, Imran Digit Health Original Research OBJECTIVE: Cervical cancer stands as a leading cause of mortality among women in developing nations. To ensure the reduction of its adverse consequences, the primary protocols to be adhered to involve early detection and treatment under the guidance of expert medical professionals. An effective approach for identifying this form of malignancy involves the examination of Pap smear images. However, in the context of automating cervical cancer detection, many of the existing datasets frequently exhibit missing data points, a factor that can substantially impact the effectiveness of machine learning models. METHODS: In response to these hurdles, this research introduces an automated system designed to predict cervical cancer with a dual focus: adeptly managing missing data while attaining remarkable accuracy. The system's core is built upon a stacked ensemble voting classifier model, which amalgamates three distinct machine learning models, all harmoniously integrated with the KNN Imputer to address the issue of missing values. RESULTS: The model put forth attains an accuracy of 99.41%, precision of 97.63%, recall of 95.96%, and an F1 score of 96.76% when incorporating the KNN imputation method. The investigation conducts a comparative analysis, contrasting the performance of this model with seven alternative machine learning algorithms in two scenarios: one where missing values are eliminated, and another employing KNN imputation. This study offers validation of the effectiveness of the proposed model in comparison to current state-of-the-art methodologies. CONCLUSIONS: This research delves into the challenge of handling missing data in the dataset utilized for cervical cancer detection. The findings have the potential to assist healthcare professionals in achieving early detection and enhancing the quality of care provided to individuals affected by cervical cancer. SAGE Publications 2023-10-03 /pmc/articles/PMC10548812/ /pubmed/37799501 http://dx.doi.org/10.1177/20552076231203802 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Chen, Xiaoyuan
Aljrees, Turki
Umer, Muhammad
Saidani, Oumaima
Almuqren, Latifah
Mzoughi, Olfa
Ishaq, Abid
Ashraf, Imran
Cervical cancer detection using K nearest neighbor imputer and stacked ensemble learningmodel
title Cervical cancer detection using K nearest neighbor imputer and stacked ensemble learningmodel
title_full Cervical cancer detection using K nearest neighbor imputer and stacked ensemble learningmodel
title_fullStr Cervical cancer detection using K nearest neighbor imputer and stacked ensemble learningmodel
title_full_unstemmed Cervical cancer detection using K nearest neighbor imputer and stacked ensemble learningmodel
title_short Cervical cancer detection using K nearest neighbor imputer and stacked ensemble learningmodel
title_sort cervical cancer detection using k nearest neighbor imputer and stacked ensemble learningmodel
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10548812/
https://www.ncbi.nlm.nih.gov/pubmed/37799501
http://dx.doi.org/10.1177/20552076231203802
work_keys_str_mv AT chenxiaoyuan cervicalcancerdetectionusingknearestneighborimputerandstackedensemblelearningmodel
AT aljreesturki cervicalcancerdetectionusingknearestneighborimputerandstackedensemblelearningmodel
AT umermuhammad cervicalcancerdetectionusingknearestneighborimputerandstackedensemblelearningmodel
AT saidanioumaima cervicalcancerdetectionusingknearestneighborimputerandstackedensemblelearningmodel
AT almuqrenlatifah cervicalcancerdetectionusingknearestneighborimputerandstackedensemblelearningmodel
AT mzoughiolfa cervicalcancerdetectionusingknearestneighborimputerandstackedensemblelearningmodel
AT ishaqabid cervicalcancerdetectionusingknearestneighborimputerandstackedensemblelearningmodel
AT ashrafimran cervicalcancerdetectionusingknearestneighborimputerandstackedensemblelearningmodel