Cargando…

SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records

Diabetes mellitus (DM) is a chronic disorder, characterized by impaired glucose metabolism. It is linked to increased risks of several diseases such as atrial fibrillation, cancer, and cardiovascular diseases. Therefore, DM prevention is essential. However, the traditional regression-based DM-onset...

Descripción completa

Detalles Bibliográficos
Autores principales: Nomura, Akihiro, Yamamoto, Sho, Hayakawa, Yuta, Taniguchi, Kouki, Higashitani, Takuya, Aono, Daisuke, Kometani, Mitsuhiro, Yoneda, Takashi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7208505/
http://dx.doi.org/10.1210/jendso/bvaa046.2194
_version_ 1783530860850446336
author Nomura, Akihiro
Yamamoto, Sho
Hayakawa, Yuta
Taniguchi, Kouki
Higashitani, Takuya
Aono, Daisuke
Kometani, Mitsuhiro
Yoneda, Takashi
author_facet Nomura, Akihiro
Yamamoto, Sho
Hayakawa, Yuta
Taniguchi, Kouki
Higashitani, Takuya
Aono, Daisuke
Kometani, Mitsuhiro
Yoneda, Takashi
author_sort Nomura, Akihiro
collection PubMed
description Diabetes mellitus (DM) is a chronic disorder, characterized by impaired glucose metabolism. It is linked to increased risks of several diseases such as atrial fibrillation, cancer, and cardiovascular diseases. Therefore, DM prevention is essential. However, the traditional regression-based DM-onset prediction methods are incapable of investigating future DM for generally healthy individuals without DM. Employing gradient-boosting decision trees, we developed a machine learning-based prediction model to identify the DM signatures, prior to the onset of DM. We employed the nationwide annual specific health checkup records, collected during the years 2008 to 2018, from Kanazawa city, Ishikawa, Japan. The data included the physical examinations, blood and urine tests, and participant questionnaires. Individuals without DM (at baseline), who underwent more than two annual health checkups during the said period, were included. The new cases of DM onset were recorded when the participants were diagnosed with DM in the annual check-ups. The dataset was divided into three subsets in a 6:2:2 ratio to constitute the training, tuning (internal validation), and testing datasets. Employing the testing dataset, the ability of our trained prediction model to calculate the area under the curve (AUC), precision, recall, F1 score, and overall accuracy was evaluated. Using a 1,000-iteration bootstrap method, every performance test resulted in a two-sided 95% confidence interval (CI). We included 509,153 annual health checkup records of 139,225 participants. Among them, 65,505 participants without DM were included, which constituted36,303 participants in the training dataset and 13,101 participants in each of the tuning and testing datasets. We identified a total of 4,696 new DM-onset patients (7.2%) in the study period. Our trained model predicted the future incidence of DM with the AUC, precision, recall, F1 score, and overall accuracy of 0.71 (0.69-0.72 with 95% CI), 75.3% (71.6-78.8), 42.2% (39.3-45.2), 54.1% (51.2-56.7), and 94.9% (94.5-95.2), respectively. In conclusion, the machine learning-based prediction model satisfactorily identified the DM onset prior to the actual incidence.
format Online
Article
Text
id pubmed-7208505
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-72085052020-05-13 SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records Nomura, Akihiro Yamamoto, Sho Hayakawa, Yuta Taniguchi, Kouki Higashitani, Takuya Aono, Daisuke Kometani, Mitsuhiro Yoneda, Takashi J Endocr Soc Diabetes Mellitus and Glucose Metabolism Diabetes mellitus (DM) is a chronic disorder, characterized by impaired glucose metabolism. It is linked to increased risks of several diseases such as atrial fibrillation, cancer, and cardiovascular diseases. Therefore, DM prevention is essential. However, the traditional regression-based DM-onset prediction methods are incapable of investigating future DM for generally healthy individuals without DM. Employing gradient-boosting decision trees, we developed a machine learning-based prediction model to identify the DM signatures, prior to the onset of DM. We employed the nationwide annual specific health checkup records, collected during the years 2008 to 2018, from Kanazawa city, Ishikawa, Japan. The data included the physical examinations, blood and urine tests, and participant questionnaires. Individuals without DM (at baseline), who underwent more than two annual health checkups during the said period, were included. The new cases of DM onset were recorded when the participants were diagnosed with DM in the annual check-ups. The dataset was divided into three subsets in a 6:2:2 ratio to constitute the training, tuning (internal validation), and testing datasets. Employing the testing dataset, the ability of our trained prediction model to calculate the area under the curve (AUC), precision, recall, F1 score, and overall accuracy was evaluated. Using a 1,000-iteration bootstrap method, every performance test resulted in a two-sided 95% confidence interval (CI). We included 509,153 annual health checkup records of 139,225 participants. Among them, 65,505 participants without DM were included, which constituted36,303 participants in the training dataset and 13,101 participants in each of the tuning and testing datasets. We identified a total of 4,696 new DM-onset patients (7.2%) in the study period. Our trained model predicted the future incidence of DM with the AUC, precision, recall, F1 score, and overall accuracy of 0.71 (0.69-0.72 with 95% CI), 75.3% (71.6-78.8), 42.2% (39.3-45.2), 54.1% (51.2-56.7), and 94.9% (94.5-95.2), respectively. In conclusion, the machine learning-based prediction model satisfactorily identified the DM onset prior to the actual incidence. Oxford University Press 2020-05-08 /pmc/articles/PMC7208505/ http://dx.doi.org/10.1210/jendso/bvaa046.2194 Text en © Endocrine Society 2020. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Diabetes Mellitus and Glucose Metabolism
Nomura, Akihiro
Yamamoto, Sho
Hayakawa, Yuta
Taniguchi, Kouki
Higashitani, Takuya
Aono, Daisuke
Kometani, Mitsuhiro
Yoneda, Takashi
SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records
title SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records
title_full SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records
title_fullStr SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records
title_full_unstemmed SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records
title_short SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records
title_sort sat-lb121 development of a machine-learning method for predicting new onset of diabetes mellitus: a retrospective analysis of 509,153 annual specific health checkup records
topic Diabetes Mellitus and Glucose Metabolism
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7208505/
http://dx.doi.org/10.1210/jendso/bvaa046.2194
work_keys_str_mv AT nomuraakihiro satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords
AT yamamotosho satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords
AT hayakawayuta satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords
AT taniguchikouki satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords
AT higashitanitakuya satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords
AT aonodaisuke satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords
AT kometanimitsuhiro satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords
AT yonedatakashi satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords