Improving Prediction of Cervical Cancer Using KNN Imputed SMOTE Features and Multi-Model Ensemble Learning Approach

SIMPLE SUMMARY: This paper presents a cervical cancer detection approach in which the KNN Imputer technique fills in missing values and SMOTE-upsampled features are then used to train a multi-model ensemble learning approach. Results demonstrate that use of KNN Imputed SMOTE featu...

Bibliographic Details
Main authors: Karamti, Hanen, Alharthi, Raed, Anizi, Amira Al, Alhebshi, Reemah M., Eshmawi, Ala’ Abdulmajid, Alsubai, Shtwai, Umer, Muhammad
Format: Online Article Text
Language: English
Published: MDPI 2023
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10486648/
https://www.ncbi.nlm.nih.gov/pubmed/37686692
http://dx.doi.org/10.3390/cancers15174412
_version_ 1785103056454549504
author Karamti, Hanen
Alharthi, Raed
Anizi, Amira Al
Alhebshi, Reemah M.
Eshmawi, Ala’ Abdulmajid
Alsubai, Shtwai
Umer, Muhammad
author_sort Karamti, Hanen
collection PubMed
description SIMPLE SUMMARY: This paper presents a cervical cancer detection approach in which the KNN Imputer technique fills in missing values and SMOTE-upsampled features are then used to train a multi-model ensemble learning approach. Results demonstrate that KNN-imputed SMOTE features yield better results than the original features for classifying cancerous and normal patients.
ABSTRACT: Objective: Cervical cancer ranks among the top causes of death among females in developing countries. Early identification and treatment under expert medical guidance are the most important steps for minimizing the aftereffects of cervical cancer. One of the best methods for finding this type of malignancy is examining a Pap smear image. For automated detection of cervical cancer, the available datasets often have missing values, which can significantly affect the performance of machine learning models.
Methods: To address these challenges, this study proposes an automated system for predicting cervical cancer that handles missing values efficiently and uses SMOTE features to achieve high accuracy. The proposed system employs a stacked ensemble voting classifier that combines three machine learning models, along with the KNN Imputer for handling missing values and SMOTE up-sampled features for handling class imbalance.
Results: The proposed model achieves 99.99% accuracy, 99.99% precision, 99.99% recall, and 99.99% F1 score when using KNN-imputed SMOTE features. The study compares the performance of the proposed model with multiple other machine learning algorithms under four scenarios: with missing values removed, with KNN imputation, with SMOTE features, and with KNN-imputed SMOTE features. The study also validates the efficacy of the proposed model against existing state-of-the-art approaches.
Conclusions: This study investigates the issues of missing values and class imbalance in data collected for cervical cancer detection, and it might aid medical practitioners in timely detection and in providing cervical cancer patients with better care.
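The pipeline outlined in the abstract (impute missing values from nearest neighbours, oversample the minority class with SMOTE, then combine three models by voting) can be illustrated with a minimal standard-library sketch. This is not the authors' implementation: the toy data, the function names `knn_impute`, `smote`, and `majority_vote`, the distance over shared features, and the use of hard majority voting are all assumptions made for illustration. In practice, library implementations such as scikit-learn's `KNNImputer` and `VotingClassifier` and imbalanced-learn's `SMOTE` would normally be used instead.

```python
import math
import random
from collections import Counter


def knn_impute(rows, k=2):
    """Replace each None with the mean of that feature over the k nearest rows."""
    filled = [list(row) for row in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not None:
                continue
            candidates = []
            for m, other in enumerate(rows):
                if m == i or other[j] is None:
                    continue
                # Compare only the features observed in both rows.
                diffs = [(a - b) ** 2 for a, b in zip(row, other)
                         if a is not None and b is not None]
                if diffs:
                    distance = math.sqrt(sum(diffs) / len(diffs))
                    candidates.append((distance, other[j]))
            candidates.sort(key=lambda pair: pair[0])
            nearest = [feature for _, feature in candidates[:k]]
            if nearest:
                filled[i][j] = sum(nearest) / len(nearest)
    return filled


def smote(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between a minority
    sample and one of its k nearest minority neighbours."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        neighbours = sorted((p for p in minority if p is not base),
                            key=lambda p: math.dist(base, p))[:k]
        partner = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> partner
        synthetic.append([b + gap * (p - b) for b, p in zip(base, partner)])
    return synthetic


def majority_vote(predictions):
    """Hard voting: predictions holds one list of labels per model."""
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*predictions)]


# Hypothetical toy data: five majority-class rows (label 0) followed by
# three minority-class rows (label 1); None marks a missing value.
rows = [[1.0, 2.0], [1.2, None], [0.9, 2.2], [1.1, 1.8], [1.3, 2.1],
        [5.0, 8.0], [5.4, None], [5.2, 8.4]]
labels = [0, 0, 0, 0, 0, 1, 1, 1]

filled = knn_impute(rows, k=2)
minority = [row for row, label in zip(filled, labels) if label == 1]
synthetic = smote(minority, n_new=labels.count(0) - labels.count(1))

# Hypothetical label predictions from three models on three samples.
voted = majority_vote([[0, 1, 1], [0, 1, 0], [1, 1, 0]])
```

Imputation runs before oversampling here, matching the order given in the summary, so the synthetic minority samples are interpolated from complete rows.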
format Online
Article
Text
id pubmed-10486648
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-10486648 2023-09-09 Improving Prediction of Cervical Cancer Using KNN Imputed SMOTE Features and Multi-Model Ensemble Learning Approach Karamti, Hanen Alharthi, Raed Anizi, Amira Al Alhebshi, Reemah M. Eshmawi, Ala’ Abdulmajid Alsubai, Shtwai Umer, Muhammad Cancers (Basel) Article MDPI 2023-09-04 /pmc/articles/PMC10486648/ /pubmed/37686692 http://dx.doi.org/10.3390/cancers15174412 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title Improving Prediction of Cervical Cancer Using KNN Imputed SMOTE Features and Multi-Model Ensemble Learning Approach
topic Article