Cargando…

Development and Validation of a Deep Learning Based Diabetes Prediction System Using a Nationwide Population-Based Cohort

BACKGROUND: Previously developed prediction models for type 2 diabetes mellitus (T2DM) have limited performance. We developed a deep learning (DL) based model using a cohort representative of the Korean population. METHODS: This study was conducted on the basis of the National Health Insurance Servi...

Descripción completa

Detalles Bibliográficos
Autores principales: Rhee, Sang Youl, Sung, Ji Min, Kim, Sunhee, Cho, In-Jeong, Lee, Sang-Eun, Chang, Hyuk-Jae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Diabetes Association 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8369223/
https://www.ncbi.nlm.nih.gov/pubmed/33631067
http://dx.doi.org/10.4093/dmj.2020.0081
_version_ 1783739247875850240
author Rhee, Sang Youl
Sung, Ji Min
Kim, Sunhee
Cho, In-Jeong
Lee, Sang-Eun
Chang, Hyuk-Jae
author_facet Rhee, Sang Youl
Sung, Ji Min
Kim, Sunhee
Cho, In-Jeong
Lee, Sang-Eun
Chang, Hyuk-Jae
author_sort Rhee, Sang Youl
collection PubMed
description BACKGROUND: Previously developed prediction models for type 2 diabetes mellitus (T2DM) have limited performance. We developed a deep learning (DL) based model using a cohort representative of the Korean population. METHODS: This study was conducted on the basis of the National Health Insurance Service-Health Screening (NHIS-HEALS) cohort of Korea. Overall, 335,302 subjects without T2DM at baseline were included. We developed the model based on 80% of the subjects, and verified the power in the remainder. Predictive models for T2DM were constructed using the recurrent neural network long short-term memory (RNN-LSTM) network and the Cox longitudinal summary model. The performance of both models over a 10-year period was compared using a time dependent area under the curve. RESULTS: During a mean follow-up of 10.4±1.7 years, the mean frequency of periodic health check-ups was 2.9±1.0 per subject. During the observation period, T2DM was newly observed in 8.7% of the subjects. The annual performance of the model created using the RNN-LSTM network was superior to that of the Cox model, and the risk factors for T2DM, derived using the two models were similar; however, certain results differed. CONCLUSION: The DL-based T2DM prediction model, constructed using a cohort representative of the population, performs better than the conventional model. After pilot tests, this model will be provided to all Korean national health screening recipients in the future.
format Online
Article
Text
id pubmed-8369223
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Korean Diabetes Association
record_format MEDLINE/PubMed
spelling pubmed-83692232021-08-26 Development and Validation of a Deep Learning Based Diabetes Prediction System Using a Nationwide Population-Based Cohort Rhee, Sang Youl Sung, Ji Min Kim, Sunhee Cho, In-Jeong Lee, Sang-Eun Chang, Hyuk-Jae Diabetes Metab J Original Article BACKGROUND: Previously developed prediction models for type 2 diabetes mellitus (T2DM) have limited performance. We developed a deep learning (DL) based model using a cohort representative of the Korean population. METHODS: This study was conducted on the basis of the National Health Insurance Service-Health Screening (NHIS-HEALS) cohort of Korea. Overall, 335,302 subjects without T2DM at baseline were included. We developed the model based on 80% of the subjects, and verified the power in the remainder. Predictive models for T2DM were constructed using the recurrent neural network long short-term memory (RNN-LSTM) network and the Cox longitudinal summary model. The performance of both models over a 10-year period was compared using a time dependent area under the curve. RESULTS: During a mean follow-up of 10.4±1.7 years, the mean frequency of periodic health check-ups was 2.9±1.0 per subject. During the observation period, T2DM was newly observed in 8.7% of the subjects. The annual performance of the model created using the RNN-LSTM network was superior to that of the Cox model, and the risk factors for T2DM, derived using the two models were similar; however, certain results differed. CONCLUSION: The DL-based T2DM prediction model, constructed using a cohort representative of the population, performs better than the conventional model. After pilot tests, this model will be provided to all Korean national health screening recipients in the future. Korean Diabetes Association 2021-07 2021-02-25 /pmc/articles/PMC8369223/ /pubmed/33631067 http://dx.doi.org/10.4093/dmj.2020.0081 Text en Copyright © 2021 Korean Diabetes Association https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Rhee, Sang Youl
Sung, Ji Min
Kim, Sunhee
Cho, In-Jeong
Lee, Sang-Eun
Chang, Hyuk-Jae
Development and Validation of a Deep Learning Based Diabetes Prediction System Using a Nationwide Population-Based Cohort
title Development and Validation of a Deep Learning Based Diabetes Prediction System Using a Nationwide Population-Based Cohort
title_full Development and Validation of a Deep Learning Based Diabetes Prediction System Using a Nationwide Population-Based Cohort
title_fullStr Development and Validation of a Deep Learning Based Diabetes Prediction System Using a Nationwide Population-Based Cohort
title_full_unstemmed Development and Validation of a Deep Learning Based Diabetes Prediction System Using a Nationwide Population-Based Cohort
title_short Development and Validation of a Deep Learning Based Diabetes Prediction System Using a Nationwide Population-Based Cohort
title_sort development and validation of a deep learning based diabetes prediction system using a nationwide population-based cohort
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8369223/
https://www.ncbi.nlm.nih.gov/pubmed/33631067
http://dx.doi.org/10.4093/dmj.2020.0081
work_keys_str_mv AT rheesangyoul developmentandvalidationofadeeplearningbaseddiabetespredictionsystemusinganationwidepopulationbasedcohort
AT sungjimin developmentandvalidationofadeeplearningbaseddiabetespredictionsystemusinganationwidepopulationbasedcohort
AT kimsunhee developmentandvalidationofadeeplearningbaseddiabetespredictionsystemusinganationwidepopulationbasedcohort
AT choinjeong developmentandvalidationofadeeplearningbaseddiabetespredictionsystemusinganationwidepopulationbasedcohort
AT leesangeun developmentandvalidationofadeeplearningbaseddiabetespredictionsystemusinganationwidepopulationbasedcohort
AT changhyukjae developmentandvalidationofadeeplearningbaseddiabetespredictionsystemusinganationwidepopulationbasedcohort