Cargando…
Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study
BACKGROUND: Artificial intelligence and digital health care have substantially advanced to improve and enhance medical diagnosis and treatment during the prolonged period of the COVID-19 global pandemic. In this study, we discuss the development of prediction models for the self-diagnosis of polycys...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8965679/ https://www.ncbi.nlm.nih.gov/pubmed/35289757 http://dx.doi.org/10.2196/29967 |
_version_ | 1784678486031466496 |
---|---|
author | Zigarelli, Angela Jia, Ziyang Lee, Hyunsun |
author_facet | Zigarelli, Angela Jia, Ziyang Lee, Hyunsun |
author_sort | Zigarelli, Angela |
collection | PubMed |
description | BACKGROUND: Artificial intelligence and digital health care have substantially advanced to improve and enhance medical diagnosis and treatment during the prolonged period of the COVID-19 global pandemic. In this study, we discuss the development of prediction models for the self-diagnosis of polycystic ovary syndrome (PCOS) using machine learning techniques. OBJECTIVE: We aim to develop self-diagnostic prediction models for PCOS in potential patients and clinical providers. For potential patients, the prediction is based only on noninvasive measures such as anthropomorphic measures, symptoms, age, and other lifestyle factors so that the proposed prediction tool can be conveniently used without any laboratory or ultrasound test results. For clinical providers who can access patients’ medical test results, prediction models using all predictor variables can be adopted to help health providers diagnose patients with PCOS. We compare both prediction models using various error metrics. We call the former model the patient model and the latter, the provider model throughout this paper. METHODS: In this retrospective study, a publicly available data set of 541 women’s health information collected from 10 different hospitals in Kerala, India, including PCOS status, was acquired and used for analysis. We adopted the CatBoost method for classification, K-fold cross-validation for estimating the performance of models, and SHAP (Shapley Additive Explanations) values to explain the importance of each variable. In our subgroup study, we used k-means clustering and Principal Component Analysis to split the data set into 2 distinct BMI subgroups and compared the prediction results as well as the feature importance between the 2 subgroups. RESULTS: We achieved 81% to 82.5% prediction accuracy of PCOS status without any invasive measures in the patient models and achieved 87.5% to 90.1% prediction accuracy using both noninvasive and invasive predictor variables in the provider models. Among noninvasive measures, variables including acanthosis nigricans, acne, hirsutism, irregular menstrual cycle, length of menstrual cycle, weight gain, fast food consumption, and age were more important in the models. In medical test results, the numbers of follicles in the right and left ovaries and anti-Müllerian hormone were ranked highly in feature importance. We also reported more detailed results in a subgroup study. CONCLUSIONS: The proposed prediction models are ultimately expected to serve as a convenient digital platform with which users can acquire pre- or self-diagnosis and counsel for the risk of PCOS, with or without obtaining medical test results. It will enable women to conveniently access the platform at home without delay before they seek further medical care. Clinical providers can also use the proposed prediction tool to help diagnose PCOS in women. |
format | Online Article Text |
id | pubmed-8965679 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-89656792022-03-31 Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study Zigarelli, Angela Jia, Ziyang Lee, Hyunsun JMIR Form Res Early Reports BACKGROUND: Artificial intelligence and digital health care have substantially advanced to improve and enhance medical diagnosis and treatment during the prolonged period of the COVID-19 global pandemic. In this study, we discuss the development of prediction models for the self-diagnosis of polycystic ovary syndrome (PCOS) using machine learning techniques. OBJECTIVE: We aim to develop self-diagnostic prediction models for PCOS in potential patients and clinical providers. For potential patients, the prediction is based only on noninvasive measures such as anthropomorphic measures, symptoms, age, and other lifestyle factors so that the proposed prediction tool can be conveniently used without any laboratory or ultrasound test results. For clinical providers who can access patients’ medical test results, prediction models using all predictor variables can be adopted to help health providers diagnose patients with PCOS. We compare both prediction models using various error metrics. We call the former model the patient model and the latter, the provider model throughout this paper. METHODS: In this retrospective study, a publicly available data set of 541 women’s health information collected from 10 different hospitals in Kerala, India, including PCOS status, was acquired and used for analysis. We adopted the CatBoost method for classification, K-fold cross-validation for estimating the performance of models, and SHAP (Shapley Additive Explanations) values to explain the importance of each variable. In our subgroup study, we used k-means clustering and Principal Component Analysis to split the data set into 2 distinct BMI subgroups and compared the prediction results as well as the feature importance between the 2 subgroups. RESULTS: We achieved 81% to 82.5% prediction accuracy of PCOS status without any invasive measures in the patient models and achieved 87.5% to 90.1% prediction accuracy using both noninvasive and invasive predictor variables in the provider models. Among noninvasive measures, variables including acanthosis nigricans, acne, hirsutism, irregular menstrual cycle, length of menstrual cycle, weight gain, fast food consumption, and age were more important in the models. In medical test results, the numbers of follicles in the right and left ovaries and anti-Müllerian hormone were ranked highly in feature importance. We also reported more detailed results in a subgroup study. CONCLUSIONS: The proposed prediction models are ultimately expected to serve as a convenient digital platform with which users can acquire pre- or self-diagnosis and counsel for the risk of PCOS, with or without obtaining medical test results. It will enable women to conveniently access the platform at home without delay before they seek further medical care. Clinical providers can also use the proposed prediction tool to help diagnose PCOS in women. JMIR Publications 2022-03-15 /pmc/articles/PMC8965679/ /pubmed/35289757 http://dx.doi.org/10.2196/29967 Text en ©Angela Zigarelli, Ziyang Jia, Hyunsun Lee. Originally published in JMIR Formative Research (https://formative.jmir.org), 15.03.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Formative Research, is properly cited. The complete bibliographic information, a link to the original publication on https://formative.jmir.org, as well as this copyright and license information must be included. |
spellingShingle | Early Reports Zigarelli, Angela Jia, Ziyang Lee, Hyunsun Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study |
title | Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study |
title_full | Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study |
title_fullStr | Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study |
title_full_unstemmed | Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study |
title_short | Machine-Aided Self-diagnostic Prediction Models for Polycystic Ovary Syndrome: Observational Study |
title_sort | machine-aided self-diagnostic prediction models for polycystic ovary syndrome: observational study |
topic | Early Reports |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8965679/ https://www.ncbi.nlm.nih.gov/pubmed/35289757 http://dx.doi.org/10.2196/29967 |
work_keys_str_mv | AT zigarelliangela machineaidedselfdiagnosticpredictionmodelsforpolycysticovarysyndromeobservationalstudy AT jiaziyang machineaidedselfdiagnosticpredictionmodelsforpolycysticovarysyndromeobservationalstudy AT leehyunsun machineaidedselfdiagnosticpredictionmodelsforpolycysticovarysyndromeobservationalstudy |