Cargando…

Exploring the dominant features and data-driven detection of polycystic ovary syndrome through modified stacking ensemble machine learning technique

Polycystic ovary syndrome (PCOS) is the most frequent endocrinological anomaly in reproductive women that causes persistent hormonal secretion disruption, leading to the formation of numerous cysts within the ovaries and serious health complications. But the real-world clinical detection technique f...

Descripción completa

Detalles Bibliográficos
Autores principales: Alam Suha, Sayma, Islam, Muhammad Nazrul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040521/
https://www.ncbi.nlm.nih.gov/pubmed/36994397
http://dx.doi.org/10.1016/j.heliyon.2023.e14518
_version_ 1784912490335830016
author Alam Suha, Sayma
Islam, Muhammad Nazrul
author_facet Alam Suha, Sayma
Islam, Muhammad Nazrul
author_sort Alam Suha, Sayma
collection PubMed
description Polycystic ovary syndrome (PCOS) is the most frequent endocrinological anomaly in reproductive women that causes persistent hormonal secretion disruption, leading to the formation of numerous cysts within the ovaries and serious health complications. But the real-world clinical detection technique for PCOS is very critical since the accuracy of interpretations being substantially dependent on the physician's expertise. Thus, an artificially intelligent PCOS prediction model might be a feasible additional technique to the error prone and time-consuming diagnostic technique. In this study, a modified ensemble machine learning (ML) classification approach is proposed utilizing state-of-the-art stacking technique for PCOS identification with patients' symptom data; employing five traditional ML models as base learners and then one bagging or boosting ensemble ML model as the meta-learner of the stacked model. Furthermore, three distinct types of feature selection strategies are applied to pick different sets of features with varied numbers and combinations of attributes. To evaluate and explore the dominant features necessary for predicting PCOS, the proposed technique with five variety of models and other ten types of classifiers is trained, tested and assessed utilizing different feature sets. As outcomes, the proposed stacking ensemble technique significantly enhances the accuracy in comparison to the other existing ML based techniques in case of all varieties of feature sets. However, among various models investigated to categorize PCOS and non-PCOS patients, the stacking ensemble model with ‘Gradient Boosting’ classifier as meta learner outperforms others with 95.7% accuracy while utilizing the top 25 features selected using Principal Component Analysis (PCA) feature selection technique.
format Online
Article
Text
id pubmed-10040521
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-100405212023-03-28 Exploring the dominant features and data-driven detection of polycystic ovary syndrome through modified stacking ensemble machine learning technique Alam Suha, Sayma Islam, Muhammad Nazrul Heliyon Research Article Polycystic ovary syndrome (PCOS) is the most frequent endocrinological anomaly in reproductive women that causes persistent hormonal secretion disruption, leading to the formation of numerous cysts within the ovaries and serious health complications. But the real-world clinical detection technique for PCOS is very critical since the accuracy of interpretations being substantially dependent on the physician's expertise. Thus, an artificially intelligent PCOS prediction model might be a feasible additional technique to the error prone and time-consuming diagnostic technique. In this study, a modified ensemble machine learning (ML) classification approach is proposed utilizing state-of-the-art stacking technique for PCOS identification with patients' symptom data; employing five traditional ML models as base learners and then one bagging or boosting ensemble ML model as the meta-learner of the stacked model. Furthermore, three distinct types of feature selection strategies are applied to pick different sets of features with varied numbers and combinations of attributes. To evaluate and explore the dominant features necessary for predicting PCOS, the proposed technique with five variety of models and other ten types of classifiers is trained, tested and assessed utilizing different feature sets. As outcomes, the proposed stacking ensemble technique significantly enhances the accuracy in comparison to the other existing ML based techniques in case of all varieties of feature sets. However, among various models investigated to categorize PCOS and non-PCOS patients, the stacking ensemble model with ‘Gradient Boosting’ classifier as meta learner outperforms others with 95.7% accuracy while utilizing the top 25 features selected using Principal Component Analysis (PCA) feature selection technique. Elsevier 2023-03-16 /pmc/articles/PMC10040521/ /pubmed/36994397 http://dx.doi.org/10.1016/j.heliyon.2023.e14518 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Alam Suha, Sayma
Islam, Muhammad Nazrul
Exploring the dominant features and data-driven detection of polycystic ovary syndrome through modified stacking ensemble machine learning technique
title Exploring the dominant features and data-driven detection of polycystic ovary syndrome through modified stacking ensemble machine learning technique
title_full Exploring the dominant features and data-driven detection of polycystic ovary syndrome through modified stacking ensemble machine learning technique
title_fullStr Exploring the dominant features and data-driven detection of polycystic ovary syndrome through modified stacking ensemble machine learning technique
title_full_unstemmed Exploring the dominant features and data-driven detection of polycystic ovary syndrome through modified stacking ensemble machine learning technique
title_short Exploring the dominant features and data-driven detection of polycystic ovary syndrome through modified stacking ensemble machine learning technique
title_sort exploring the dominant features and data-driven detection of polycystic ovary syndrome through modified stacking ensemble machine learning technique
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10040521/
https://www.ncbi.nlm.nih.gov/pubmed/36994397
http://dx.doi.org/10.1016/j.heliyon.2023.e14518
work_keys_str_mv AT alamsuhasayma exploringthedominantfeaturesanddatadrivendetectionofpolycysticovarysyndromethroughmodifiedstackingensemblemachinelearningtechnique
AT islammuhammadnazrul exploringthedominantfeaturesanddatadrivendetectionofpolycysticovarysyndromethroughmodifiedstackingensemblemachinelearningtechnique