Cargando…

Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study

BACKGROUND: Artificial intelligence and digital health care have substantially advanced to improve and enhance medical diagnosis and treatment during the prolonged period of the COVID-19 global pandemic. In this study, we discuss the development of prediction models for the self-diagnosis of polycys...

Descripción completa

Detalles Bibliográficos
Autores principales: Zigarelli, Angela, Jia, Ziyang, Lee, Hyunsun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8965679/
https://www.ncbi.nlm.nih.gov/pubmed/35289757
http://dx.doi.org/10.2196/29967
_version_ 1784678486031466496
author Zigarelli, Angela
Jia, Ziyang
Lee, Hyunsun
author_facet Zigarelli, Angela
Jia, Ziyang
Lee, Hyunsun
author_sort Zigarelli, Angela
collection PubMed
description BACKGROUND: Artificial intelligence and digital health care have substantially advanced to improve and enhance medical diagnosis and treatment during the prolonged period of the COVID-19 global pandemic. In this study, we discuss the development of prediction models for the self-diagnosis of polycystic ovary syndrome (PCOS) using machine learning techniques. OBJECTIVE: We aim to develop self-diagnostic prediction models for PCOS in potential patients and clinical providers. For potential patients, the prediction is based only on noninvasive measures such as anthropomorphic measures, symptoms, age, and other lifestyle factors so that the proposed prediction tool can be conveniently used without any laboratory or ultrasound test results. For clinical providers who can access patients’ medical test results, prediction models using all predictor variables can be adopted to help health providers diagnose patients with PCOS. We compare both prediction models using various error metrics. We call the former model the patient model and the latter, the provider model throughout this paper. METHODS: In this retrospective study, a publicly available data set of 541 women’s health information collected from 10 different hospitals in Kerala, India, including PCOS status, was acquired and used for analysis. We adopted the CatBoost method for classification, K-fold cross-validation for estimating the performance of models, and SHAP (Shapley Additive Explanations) values to explain the importance of each variable. In our subgroup study, we used k-means clustering and Principal Component Analysis to split the data set into 2 distinct BMI subgroups and compared the prediction results as well as the feature importance between the 2 subgroups. RESULTS: We achieved 81% to 82.5% prediction accuracy of PCOS status without any invasive measures in the patient models and achieved 87.5% to 90.1% prediction accuracy using both noninvasive and invasive predictor variables in the provider models. Among noninvasive measures, variables including acanthosis nigricans, acne, hirsutism, irregular menstrual cycle, length of menstrual cycle, weight gain, fast food consumption, and age were more important in the models. In medical test results, the numbers of follicles in the right and left ovaries and anti-Müllerian hormone were ranked highly in feature importance. We also reported more detailed results in a subgroup study. CONCLUSIONS: The proposed prediction models are ultimately expected to serve as a convenient digital platform with which users can acquire pre- or self-diagnosis and counsel for the risk of PCOS, with or without obtaining medical test results. It will enable women to conveniently access the platform at home without delay before they seek further medical care. Clinical providers can also use the proposed prediction tool to help diagnose PCOS in women.
format Online
Article
Text
id pubmed-8965679
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-89656792022-03-31 Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study Zigarelli, Angela Jia, Ziyang Lee, Hyunsun JMIR Form Res Early Reports BACKGROUND: Artificial intelligence and digital health care have substantially advanced to improve and enhance medical diagnosis and treatment during the prolonged period of the COVID-19 global pandemic. In this study, we discuss the development of prediction models for the self-diagnosis of polycystic ovary syndrome (PCOS) using machine learning techniques. OBJECTIVE: We aim to develop self-diagnostic prediction models for PCOS in potential patients and clinical providers. For potential patients, the prediction is based only on noninvasive measures such as anthropomorphic measures, symptoms, age, and other lifestyle factors so that the proposed prediction tool can be conveniently used without any laboratory or ultrasound test results. For clinical providers who can access patients’ medical test results, prediction models using all predictor variables can be adopted to help health providers diagnose patients with PCOS. We compare both prediction models using various error metrics. We call the former model the patient model and the latter, the provider model throughout this paper. METHODS: In this retrospective study, a publicly available data set of 541 women’s health information collected from 10 different hospitals in Kerala, India, including PCOS status, was acquired and used for analysis. We adopted the CatBoost method for classification, K-fold cross-validation for estimating the performance of models, and SHAP (Shapley Additive Explanations) values to explain the importance of each variable. In our subgroup study, we used k-means clustering and Principal Component Analysis to split the data set into 2 distinct BMI subgroups and compared the prediction results as well as the feature importance between the 2 subgroups. RESULTS: We achieved 81% to 82.5% prediction accuracy of PCOS status without any invasive measures in the patient models and achieved 87.5% to 90.1% prediction accuracy using both noninvasive and invasive predictor variables in the provider models. Among noninvasive measures, variables including acanthosis nigricans, acne, hirsutism, irregular menstrual cycle, length of menstrual cycle, weight gain, fast food consumption, and age were more important in the models. In medical test results, the numbers of follicles in the right and left ovaries and anti-Müllerian hormone were ranked highly in feature importance. We also reported more detailed results in a subgroup study. CONCLUSIONS: The proposed prediction models are ultimately expected to serve as a convenient digital platform with which users can acquire pre- or self-diagnosis and counsel for the risk of PCOS, with or without obtaining medical test results. It will enable women to conveniently access the platform at home without delay before they seek further medical care. Clinical providers can also use the proposed prediction tool to help diagnose PCOS in women. JMIR Publications 2022-03-15 /pmc/articles/PMC8965679/ /pubmed/35289757 http://dx.doi.org/10.2196/29967 Text en ©Angela Zigarelli, Ziyang Jia, Hyunsun Lee. Originally published in JMIR Formative Research (https://formative.jmir.org), 15.03.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included.
spellingShingle Early Reports
Zigarelli, Angela
Jia, Ziyang
Lee, Hyunsun
Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study
title Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study
title_full Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study
title_fullStr Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study
title_full_unstemmed Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study
title_short Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study
title_sort machine-aided self-diagnostic prediction models for polycystic ovary syndrome: observational study
topic Early Reports
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8965679/
https://www.ncbi.nlm.nih.gov/pubmed/35289757
http://dx.doi.org/10.2196/29967
work_keys_str_mv AT zigarelliangela machineaidedselfdiagnosticpredictionmodelsforpolycysticovarysyndromeobservationalstudy
AT jiaziyang machineaidedselfdiagnosticpredictionmodelsforpolycysticovarysyndromeobservationalstudy
AT leehyunsun machineaidedselfdiagnosticpredictionmodelsforpolycysticovarysyndromeobservationalstudy