Cargando…

A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study

BACKGROUND: Type 1 diabetes mellitus (T1DM) is characterized by chronic insulin deficiency and consequent hyperglycemia. Patients with T1DM require long-term exogenous insulin therapy to regulate blood glucose levels and prevent the long-term complications of the disease. Currently, there are no eff...

Descripción completa

Detalles Bibliográficos
Autores principales: Oroojeni Mohammad Javad, Mahsa, Agboola, Stephen Olusegun, Jethwani, Kamal, Zeid, Abe, Kamarthi, Sagar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6737889/
https://www.ncbi.nlm.nih.gov/pubmed/31464196
http://dx.doi.org/10.2196/12905
_version_ 1783450741639217152
author Oroojeni Mohammad Javad, Mahsa
Agboola, Stephen Olusegun
Jethwani, Kamal
Zeid, Abe
Kamarthi, Sagar
author_facet Oroojeni Mohammad Javad, Mahsa
Agboola, Stephen Olusegun
Jethwani, Kamal
Zeid, Abe
Kamarthi, Sagar
author_sort Oroojeni Mohammad Javad, Mahsa
collection PubMed
description BACKGROUND: Type 1 diabetes mellitus (T1DM) is characterized by chronic insulin deficiency and consequent hyperglycemia. Patients with T1DM require long-term exogenous insulin therapy to regulate blood glucose levels and prevent the long-term complications of the disease. Currently, there are no effective algorithms that consider the unique characteristics of T1DM patients to automatically recommend personalized insulin dosage levels. OBJECTIVE: The objective of this study was to develop and validate a general reinforcement learning (RL) framework for the personalized treatment of T1DM using clinical data. METHODS: This research presents a model-free data-driven RL algorithm, namely Q-learning, that recommends insulin doses to regulate the blood glucose level of a T1DM patient, considering his or her state defined by glycated hemoglobin (HbA(1c)) levels, body mass index, engagement in physical activity, and alcohol usage. In this approach, the RL agent identifies the different states of the patient by exploring the patient’s responses when he or she is subjected to varying insulin doses. On the basis of the result of a treatment action at time step t, the RL agent receives a numeric reward, positive or negative. The reward is calculated as a function of the difference between the actual blood glucose level achieved in response to the insulin dose and the targeted HbA(1c) level. The RL agent was trained on 10 years of clinical data of patients treated at the Mass General Hospital. RESULTS: A total of 87 patients were included in the training set. The mean age of these patients was 53 years, 59% (51/87) were male, 86% (75/87) were white, and 47% (41/87) were married. The performance of the RL agent was evaluated on 60 test cases. RL agent–recommended insulin dosage interval includes the actual dose prescribed by the physician in 53 out of 60 cases (53/60, 88%). CONCLUSIONS: This exploratory study demonstrates that an RL algorithm can be used to recommend personalized insulin doses to achieve adequate glycemic control in patients with T1DM. However, further investigation in a larger sample of patients is needed to confirm these findings.
format Online
Article
Text
id pubmed-6737889
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-67378892019-09-23 A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study Oroojeni Mohammad Javad, Mahsa Agboola, Stephen Olusegun Jethwani, Kamal Zeid, Abe Kamarthi, Sagar JMIR Diabetes Original Paper BACKGROUND: Type 1 diabetes mellitus (T1DM) is characterized by chronic insulin deficiency and consequent hyperglycemia. Patients with T1DM require long-term exogenous insulin therapy to regulate blood glucose levels and prevent the long-term complications of the disease. Currently, there are no effective algorithms that consider the unique characteristics of T1DM patients to automatically recommend personalized insulin dosage levels. OBJECTIVE: The objective of this study was to develop and validate a general reinforcement learning (RL) framework for the personalized treatment of T1DM using clinical data. METHODS: This research presents a model-free data-driven RL algorithm, namely Q-learning, that recommends insulin doses to regulate the blood glucose level of a T1DM patient, considering his or her state defined by glycated hemoglobin (HbA(1c)) levels, body mass index, engagement in physical activity, and alcohol usage. In this approach, the RL agent identifies the different states of the patient by exploring the patient’s responses when he or she is subjected to varying insulin doses. On the basis of the result of a treatment action at time step t, the RL agent receives a numeric reward, positive or negative. The reward is calculated as a function of the difference between the actual blood glucose level achieved in response to the insulin dose and the targeted HbA(1c) level. The RL agent was trained on 10 years of clinical data of patients treated at the Mass General Hospital. RESULTS: A total of 87 patients were included in the training set. The mean age of these patients was 53 years, 59% (51/87) were male, 86% (75/87) were white, and 47% (41/87) were married. The performance of the RL agent was evaluated on 60 test cases. RL agent–recommended insulin dosage interval includes the actual dose prescribed by the physician in 53 out of 60 cases (53/60, 88%). CONCLUSIONS: This exploratory study demonstrates that an RL algorithm can be used to recommend personalized insulin doses to achieve adequate glycemic control in patients with T1DM. However, further investigation in a larger sample of patients is needed to confirm these findings. JMIR Publications 2019-08-28 /pmc/articles/PMC6737889/ /pubmed/31464196 http://dx.doi.org/10.2196/12905 Text en ©Mahsa Oroojeni Mohammad Javad, Stephen Olusegun Agboola, Kamal Jethwani, Abe Zeid, Sagar Kamarthi. Originally published in JMIR Diabetes (http://diabetes.jmir.org), 28.08.2019. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Diabetes, is properly cited. The complete bibliographic information, a link to the original publication on http://diabetes.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Oroojeni Mohammad Javad, Mahsa
Agboola, Stephen Olusegun
Jethwani, Kamal
Zeid, Abe
Kamarthi, Sagar
A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study
title A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study
title_full A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study
title_fullStr A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study
title_full_unstemmed A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study
title_short A Reinforcement Learning–Based Method for Management of Type 1 Diabetes: Exploratory Study
title_sort reinforcement learning–based method for management of type 1 diabetes: exploratory study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6737889/
https://www.ncbi.nlm.nih.gov/pubmed/31464196
http://dx.doi.org/10.2196/12905
work_keys_str_mv AT oroojenimohammadjavadmahsa areinforcementlearningbasedmethodformanagementoftype1diabetesexploratorystudy
AT agboolastephenolusegun areinforcementlearningbasedmethodformanagementoftype1diabetesexploratorystudy
AT jethwanikamal areinforcementlearningbasedmethodformanagementoftype1diabetesexploratorystudy
AT zeidabe areinforcementlearningbasedmethodformanagementoftype1diabetesexploratorystudy
AT kamarthisagar areinforcementlearningbasedmethodformanagementoftype1diabetesexploratorystudy
AT oroojenimohammadjavadmahsa reinforcementlearningbasedmethodformanagementoftype1diabetesexploratorystudy
AT agboolastephenolusegun reinforcementlearningbasedmethodformanagementoftype1diabetesexploratorystudy
AT jethwanikamal reinforcementlearningbasedmethodformanagementoftype1diabetesexploratorystudy
AT zeidabe reinforcementlearningbasedmethodformanagementoftype1diabetesexploratorystudy
AT kamarthisagar reinforcementlearningbasedmethodformanagementoftype1diabetesexploratorystudy