Cargando…
Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach
Endometriosis is defined as the presence of estrogen-dependent endometrial-like tissue outside the uterine cavity. Despite extensive research, endometriosis is still an enigmatic disease and is challenging to diagnose and treat. A common clinical finding is the association of endometriosis with mult...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10669733/ https://www.ncbi.nlm.nih.gov/pubmed/38002015 http://dx.doi.org/10.3390/biomedicines11113015 |
_version_ | 1785139765761277952 |
---|---|
author | Tore, Ulan Abilgazym, Aibek Asunsolo-del-Barco, Angel Terzic, Milan Yemenkhan, Yerden Zollanvari, Amin Sarria-Santamera, Antonio |
author_facet | Tore, Ulan Abilgazym, Aibek Asunsolo-del-Barco, Angel Terzic, Milan Yemenkhan, Yerden Zollanvari, Amin Sarria-Santamera, Antonio |
author_sort | Tore, Ulan |
collection | PubMed |
description | Endometriosis is defined as the presence of estrogen-dependent endometrial-like tissue outside the uterine cavity. Despite extensive research, endometriosis is still an enigmatic disease and is challenging to diagnose and treat. A common clinical finding is the association of endometriosis with multiple diseases. We use a total of 627,566 clinically collected data from cases of endometriosis (0.82%) and controls (99.18%) to construct and evaluate predictive models. We develop a machine learning platform to construct diagnostic tools for endometriosis. The platform consists of logistic regression, decision tree, random forest, AdaBoost, and XGBoost for prediction, and uses Shapley Additive Explanation (SHAP) values to quantify the importance of features. In the model selection phase, the constructed XGBoost model performs better than other algorithms while achieving an area under the curve (AUC) of 0.725 on the test set during the evaluation phase, resulting in a specificity of 62.9% and a sensitivity of 68.6%. The model leads to a quite low positive predictive value of 1.5%, but a quite satisfactory negative predictive value of 99.58%. Moreover, the feature importance analysis points to age, infertility, uterine fibroids, anxiety, and allergic rhinitis as the top five most important features for predicting endometriosis. Although these results show the feasibility of using machine learning to improve the diagnosis of endometriosis, more research is required to improve the performance of predictive models for the diagnosis of endometriosis. This state of affairs is in part attributed to the complex nature of the condition and, at the same time, the administrative nature of our features. Should more informative features be used, we could possibly achieve a higher AUC for predicting endometriosis. As a result, we merely perceive the constructed predictive model as a tool to provide auxiliary information in clinical practice. |
format | Online Article Text |
id | pubmed-10669733 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-106697332023-11-10 Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach Tore, Ulan Abilgazym, Aibek Asunsolo-del-Barco, Angel Terzic, Milan Yemenkhan, Yerden Zollanvari, Amin Sarria-Santamera, Antonio Biomedicines Article Endometriosis is defined as the presence of estrogen-dependent endometrial-like tissue outside the uterine cavity. Despite extensive research, endometriosis is still an enigmatic disease and is challenging to diagnose and treat. A common clinical finding is the association of endometriosis with multiple diseases. We use a total of 627,566 clinically collected data from cases of endometriosis (0.82%) and controls (99.18%) to construct and evaluate predictive models. We develop a machine learning platform to construct diagnostic tools for endometriosis. The platform consists of logistic regression, decision tree, random forest, AdaBoost, and XGBoost for prediction, and uses Shapley Additive Explanation (SHAP) values to quantify the importance of features. In the model selection phase, the constructed XGBoost model performs better than other algorithms while achieving an area under the curve (AUC) of 0.725 on the test set during the evaluation phase, resulting in a specificity of 62.9% and a sensitivity of 68.6%. The model leads to a quite low positive predictive value of 1.5%, but a quite satisfactory negative predictive value of 99.58%. Moreover, the feature importance analysis points to age, infertility, uterine fibroids, anxiety, and allergic rhinitis as the top five most important features for predicting endometriosis. Although these results show the feasibility of using machine learning to improve the diagnosis of endometriosis, more research is required to improve the performance of predictive models for the diagnosis of endometriosis. This state of affairs is in part attributed to the complex nature of the condition and, at the same time, the administrative nature of our features. Should more informative features be used, we could possibly achieve a higher AUC for predicting endometriosis. As a result, we merely perceive the constructed predictive model as a tool to provide auxiliary information in clinical practice. MDPI 2023-11-10 /pmc/articles/PMC10669733/ /pubmed/38002015 http://dx.doi.org/10.3390/biomedicines11113015 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Tore, Ulan Abilgazym, Aibek Asunsolo-del-Barco, Angel Terzic, Milan Yemenkhan, Yerden Zollanvari, Amin Sarria-Santamera, Antonio Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach |
title | Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach |
title_full | Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach |
title_fullStr | Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach |
title_full_unstemmed | Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach |
title_short | Diagnosis of Endometriosis Based on Comorbidities: A Machine Learning Approach |
title_sort | diagnosis of endometriosis based on comorbidities: a machine learning approach |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10669733/ https://www.ncbi.nlm.nih.gov/pubmed/38002015 http://dx.doi.org/10.3390/biomedicines11113015 |
work_keys_str_mv | AT toreulan diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach AT abilgazymaibek diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach AT asunsolodelbarcoangel diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach AT terzicmilan diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach AT yemenkhanyerden diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach AT zollanvariamin diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach AT sarriasantameraantonio diagnosisofendometriosisbasedoncomorbiditiesamachinelearningapproach |