Cargando…

Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach

Endometriosis is defined as the presence of estrogen-dependent endometrial-like tissue outside the uterine cavity. Despite extensive research, endometriosis is still an enigmatic disease and is challenging to diagnose and treat. A common clinical finding is the association of endometriosis with mult...

Descripción completa

Detalles Bibliográficos
Autores principales: Tore, Ulan, Abilgazym, Aibek, Asunsolo-del-Barco, Angel, Terzic, Milan, Yemenkhan, Yerden, Zollanvari, Amin, Sarria-Santamera, Antonio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10669733/
https://www.ncbi.nlm.nih.gov/pubmed/38002015
http://dx.doi.org/10.3390/biomedicines11113015
_version_ 1785139765761277952
author Tore, Ulan
Abilgazym, Aibek
Asunsolo-del-Barco, Angel
Terzic, Milan
Yemenkhan, Yerden
Zollanvari, Amin
Sarria-Santamera, Antonio
author_facet Tore, Ulan
Abilgazym, Aibek
Asunsolo-del-Barco, Angel
Terzic, Milan
Yemenkhan, Yerden
Zollanvari, Amin
Sarria-Santamera, Antonio
author_sort Tore, Ulan
collection PubMed
description Endometriosis is defined as the presence of estrogen-dependent endometrial-like tissue outside the uterine cavity. Despite extensive research, endometriosis is still an enigmatic disease and is challenging to diagnose and treat. A common clinical finding is the association of endometriosis with multiple diseases. We use a total of 627,566 clinically collected data from cases of endometriosis (0.82%) and controls (99.18%) to construct and evaluate predictive models. We develop a machine learning platform to construct diagnostic tools for endometriosis. The platform consists of logistic regression, decision tree, random forest, AdaBoost, and XGBoost for prediction, and uses Shapley Additive Explanation (SHAP) values to quantify the importance of features. In the model selection phase, the constructed XGBoost model performs better than other algorithms while achieving an area under the curve (AUC) of 0.725 on the test set during the evaluation phase, resulting in a specificity of 62.9% and a sensitivity of 68.6%. The model leads to a quite low positive predictive value of 1.5%, but a quite satisfactory negative predictive value of 99.58%. Moreover, the feature importance analysis points to age, infertility, uterine fibroids, anxiety, and allergic rhinitis as the top five most important features for predicting endometriosis. Although these results show the feasibility of using machine learning to improve the diagnosis of endometriosis, more research is required to improve the performance of predictive models for the diagnosis of endometriosis. This state of affairs is in part attributed to the complex nature of the condition and, at the same time, the administrative nature of our features. Should more informative features be used, we could possibly achieve a higher AUC for predicting endometriosis. As a result, we merely perceive the constructed predictive model as a tool to provide auxiliary information in clinical practice.
format Online
Article
Text
id pubmed-10669733
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106697332023-11-10 Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach Tore, Ulan Abilgazym, Aibek Asunsolo-del-Barco, Angel Terzic, Milan Yemenkhan, Yerden Zollanvari, Amin Sarria-Santamera, Antonio Biomedicines Article Endometriosis is defined as the presence of estrogen-dependent endometrial-like tissue outside the uterine cavity. Despite extensive research, endometriosis is still an enigmatic disease and is challenging to diagnose and treat. A common clinical finding is the association of endometriosis with multiple diseases. We use a total of 627,566 clinically collected data from cases of endometriosis (0.82%) and controls (99.18%) to construct and evaluate predictive models. We develop a machine learning platform to construct diagnostic tools for endometriosis. The platform consists of logistic regression, decision tree, random forest, AdaBoost, and XGBoost for prediction, and uses Shapley Additive Explanation (SHAP) values to quantify the importance of features. In the model selection phase, the constructed XGBoost model performs better than other algorithms while achieving an area under the curve (AUC) of 0.725 on the test set during the evaluation phase, resulting in a specificity of 62.9% and a sensitivity of 68.6%. The model leads to a quite low positive predictive value of 1.5%, but a quite satisfactory negative predictive value of 99.58%. Moreover, the feature importance analysis points to age, infertility, uterine fibroids, anxiety, and allergic rhinitis as the top five most important features for predicting endometriosis. Although these results show the feasibility of using machine learning to improve the diagnosis of endometriosis, more research is required to improve the performance of predictive models for the diagnosis of endometriosis. This state of affairs is in part attributed to the complex nature of the condition and, at the same time, the administrative nature of our features. Should more informative features be used, we could possibly achieve a higher AUC for predicting endometriosis. As a result, we merely perceive the constructed predictive model as a tool to provide auxiliary information in clinical practice. MDPI 2023-11-10 /pmc/articles/PMC10669733/ /pubmed/38002015 http://dx.doi.org/10.3390/biomedicines11113015 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tore, Ulan
Abilgazym, Aibek
Asunsolo-del-Barco, Angel
Terzic, Milan
Yemenkhan, Yerden
Zollanvari, Amin
Sarria-Santamera, Antonio
Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach
title Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach
title_full Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach
title_fullStr Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach
title_full_unstemmed Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach
title_short Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach
title_sort diagnosis of endometriosis based on comorbidities: a machine learning approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10669733/
https://www.ncbi.nlm.nih.gov/pubmed/38002015
http://dx.doi.org/10.3390/biomedicines11113015
work_keys_str_mv AT toreulan diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach
AT abilgazymaibek diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach
AT asunsolodelbarcoangel diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach
AT terzicmilan diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach
AT yemenkhanyerden diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach
AT zollanvariamin diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach
AT sarriasantameraantonio diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach