Cargando…
SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records
Diabetes mellitus (DM) is a chronic disorder, characterized by impaired glucose metabolism. It is linked to increased risks of several diseases such as atrial fibrillation, cancer, and cardiovascular diseases. Therefore, DM prevention is essential. However, the traditional regression-based DM-onset...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7208505/ http://dx.doi.org/10.1210/jendso/bvaa046.2194 |
_version_ | 1783530860850446336 |
---|---|
author | Nomura, Akihiro Yamamoto, Sho Hayakawa, Yuta Taniguchi, Kouki Higashitani, Takuya Aono, Daisuke Kometani, Mitsuhiro Yoneda, Takashi |
author_facet | Nomura, Akihiro Yamamoto, Sho Hayakawa, Yuta Taniguchi, Kouki Higashitani, Takuya Aono, Daisuke Kometani, Mitsuhiro Yoneda, Takashi |
author_sort | Nomura, Akihiro |
collection | PubMed |
description | Diabetes mellitus (DM) is a chronic disorder, characterized by impaired glucose metabolism. It is linked to increased risks of several diseases such as atrial fibrillation, cancer, and cardiovascular diseases. Therefore, DM prevention is essential. However, the traditional regression-based DM-onset prediction methods are incapable of investigating future DM for generally healthy individuals without DM. Employing gradient-boosting decision trees, we developed a machine learning-based prediction model to identify the DM signatures, prior to the onset of DM. We employed the nationwide annual specific health checkup records, collected during the years 2008 to 2018, from Kanazawa city, Ishikawa, Japan. The data included the physical examinations, blood and urine tests, and participant questionnaires. Individuals without DM (at baseline), who underwent more than two annual health checkups during the said period, were included. The new cases of DM onset were recorded when the participants were diagnosed with DM in the annual check-ups. The dataset was divided into three subsets in a 6:2:2 ratio to constitute the training, tuning (internal validation), and testing datasets. Employing the testing dataset, the ability of our trained prediction model to calculate the area under the curve (AUC), precision, recall, F1 score, and overall accuracy was evaluated. Using a 1,000-iteration bootstrap method, every performance test resulted in a two-sided 95% confidence interval (CI). We included 509,153 annual health checkup records of 139,225 participants. Among them, 65,505 participants without DM were included, which constituted36,303 participants in the training dataset and 13,101 participants in each of the tuning and testing datasets. We identified a total of 4,696 new DM-onset patients (7.2%) in the study period. Our trained model predicted the future incidence of DM with the AUC, precision, recall, F1 score, and overall accuracy of 0.71 (0.69-0.72 with 95% CI), 75.3% (71.6-78.8), 42.2% (39.3-45.2), 54.1% (51.2-56.7), and 94.9% (94.5-95.2), respectively. In conclusion, the machine learning-based prediction model satisfactorily identified the DM onset prior to the actual incidence. |
format | Online Article Text |
id | pubmed-7208505 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-72085052020-05-13 SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records Nomura, Akihiro Yamamoto, Sho Hayakawa, Yuta Taniguchi, Kouki Higashitani, Takuya Aono, Daisuke Kometani, Mitsuhiro Yoneda, Takashi J Endocr Soc Diabetes Mellitus and Glucose Metabolism Diabetes mellitus (DM) is a chronic disorder, characterized by impaired glucose metabolism. It is linked to increased risks of several diseases such as atrial fibrillation, cancer, and cardiovascular diseases. Therefore, DM prevention is essential. However, the traditional regression-based DM-onset prediction methods are incapable of investigating future DM for generally healthy individuals without DM. Employing gradient-boosting decision trees, we developed a machine learning-based prediction model to identify the DM signatures, prior to the onset of DM. We employed the nationwide annual specific health checkup records, collected during the years 2008 to 2018, from Kanazawa city, Ishikawa, Japan. The data included the physical examinations, blood and urine tests, and participant questionnaires. Individuals without DM (at baseline), who underwent more than two annual health checkups during the said period, were included. The new cases of DM onset were recorded when the participants were diagnosed with DM in the annual check-ups. The dataset was divided into three subsets in a 6:2:2 ratio to constitute the training, tuning (internal validation), and testing datasets. Employing the testing dataset, the ability of our trained prediction model to calculate the area under the curve (AUC), precision, recall, F1 score, and overall accuracy was evaluated. Using a 1,000-iteration bootstrap method, every performance test resulted in a two-sided 95% confidence interval (CI). We included 509,153 annual health checkup records of 139,225 participants. Among them, 65,505 participants without DM were included, which constituted36,303 participants in the training dataset and 13,101 participants in each of the tuning and testing datasets. We identified a total of 4,696 new DM-onset patients (7.2%) in the study period. Our trained model predicted the future incidence of DM with the AUC, precision, recall, F1 score, and overall accuracy of 0.71 (0.69-0.72 with 95% CI), 75.3% (71.6-78.8), 42.2% (39.3-45.2), 54.1% (51.2-56.7), and 94.9% (94.5-95.2), respectively. In conclusion, the machine learning-based prediction model satisfactorily identified the DM onset prior to the actual incidence. Oxford University Press 2020-05-08 /pmc/articles/PMC7208505/ http://dx.doi.org/10.1210/jendso/bvaa046.2194 Text en © Endocrine Society 2020. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Diabetes Mellitus and Glucose Metabolism Nomura, Akihiro Yamamoto, Sho Hayakawa, Yuta Taniguchi, Kouki Higashitani, Takuya Aono, Daisuke Kometani, Mitsuhiro Yoneda, Takashi SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records |
title | SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records |
title_full | SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records |
title_fullStr | SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records |
title_full_unstemmed | SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records |
title_short | SAT-LB121 Development of a Machine-Learning Method for Predicting New Onset of Diabetes Mellitus: A Retrospective Analysis of 509,153 Annual Specific Health Checkup Records |
title_sort | sat-lb121 development of a machine-learning method for predicting new onset of diabetes mellitus: a retrospective analysis of 509,153 annual specific health checkup records |
topic | Diabetes Mellitus and Glucose Metabolism |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7208505/ http://dx.doi.org/10.1210/jendso/bvaa046.2194 |
work_keys_str_mv | AT nomuraakihiro satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords AT yamamotosho satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords AT hayakawayuta satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords AT taniguchikouki satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords AT higashitanitakuya satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords AT aonodaisuke satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords AT kometanimitsuhiro satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords AT yonedatakashi satlb121developmentofamachinelearningmethodforpredictingnewonsetofdiabetesmellitusaretrospectiveanalysisof509153annualspecifichealthcheckuprecords |