Cargando…
A Comparative Study of Natural Language Processing Algorithms Based on Cities Changing Diabetes Vulnerability Data
(1) Background: Poor adherence to management behaviors in Chinese Type 2 diabetes mellitus (T2DM) patients leads to an uncontrolled prognosis of diabetes, which results in significant economic costs for China. It is imperative to quickly locate vulnerability factors in the management behavior of pat...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9223144/ https://www.ncbi.nlm.nih.gov/pubmed/35742169 http://dx.doi.org/10.3390/healthcare10061119 |
_version_ | 1784733053661216768 |
---|---|
author | Wang, Siting Song, Fuman Qiao, Qinqun Liu, Yuanyuan Chen, Jiageng Ma, Jun |
author_facet | Wang, Siting Song, Fuman Qiao, Qinqun Liu, Yuanyuan Chen, Jiageng Ma, Jun |
author_sort | Wang, Siting |
collection | PubMed |
description | (1) Background: Poor adherence to management behaviors in Chinese Type 2 diabetes mellitus (T2DM) patients leads to an uncontrolled prognosis of diabetes, which results in significant economic costs for China. It is imperative to quickly locate vulnerability factors in the management behavior of patients with T2DM. (2) Methods: In this study, a thematic analysis of the collected interview materials was conducted to construct the themes of T2DM management vulnerability. We explored the applicability of the pre-trained models based on the evaluation metrics in text classification. (3) Results: We constructed 12 themes of vulnerability related to the health and well-being of people with T2DM in Tianjin. We considered that Bidirectional Encoder Representation from Transformers (BERT) performed better in this Natural Language Processing (NLP) task with a shorter completion time. With the splitting ratio of 6:3:1 and batch size of 64 for BERT, the test accuracy was 97.71%, the completion time was 10 min 24 s, and the macro-F1 score was 0.9752. (4) Conclusions: Our results proved the applicability of NLP techniques in this specific Chinese-language medical environment. We filled the knowledge gap in the application of NLP technologies in diabetes management. Our study provided strong support for using NLP techniques to rapidly locate vulnerability factors in T2DM management. |
format | Online Article Text |
id | pubmed-9223144 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-92231442022-06-24 A Comparative Study of Natural Language Processing Algorithms Based on Cities Changing Diabetes Vulnerability Data Wang, Siting Song, Fuman Qiao, Qinqun Liu, Yuanyuan Chen, Jiageng Ma, Jun Healthcare (Basel) Article (1) Background: Poor adherence to management behaviors in Chinese Type 2 diabetes mellitus (T2DM) patients leads to an uncontrolled prognosis of diabetes, which results in significant economic costs for China. It is imperative to quickly locate vulnerability factors in the management behavior of patients with T2DM. (2) Methods: In this study, a thematic analysis of the collected interview materials was conducted to construct the themes of T2DM management vulnerability. We explored the applicability of the pre-trained models based on the evaluation metrics in text classification. (3) Results: We constructed 12 themes of vulnerability related to the health and well-being of people with T2DM in Tianjin. We considered that Bidirectional Encoder Representation from Transformers (BERT) performed better in this Natural Language Processing (NLP) task with a shorter completion time. With the splitting ratio of 6:3:1 and batch size of 64 for BERT, the test accuracy was 97.71%, the completion time was 10 min 24 s, and the macro-F1 score was 0.9752. (4) Conclusions: Our results proved the applicability of NLP techniques in this specific Chinese-language medical environment. We filled the knowledge gap in the application of NLP technologies in diabetes management. Our study provided strong support for using NLP techniques to rapidly locate vulnerability factors in T2DM management. MDPI 2022-06-15 /pmc/articles/PMC9223144/ /pubmed/35742169 http://dx.doi.org/10.3390/healthcare10061119 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Wang, Siting Song, Fuman Qiao, Qinqun Liu, Yuanyuan Chen, Jiageng Ma, Jun A Comparative Study of Natural Language Processing Algorithms Based on Cities Changing Diabetes Vulnerability Data |
title | A Comparative Study of Natural Language Processing Algorithms Based on Cities Changing Diabetes Vulnerability Data |
title_full | A Comparative Study of Natural Language Processing Algorithms Based on Cities Changing Diabetes Vulnerability Data |
title_fullStr | A Comparative Study of Natural Language Processing Algorithms Based on Cities Changing Diabetes Vulnerability Data |
title_full_unstemmed | A Comparative Study of Natural Language Processing Algorithms Based on Cities Changing Diabetes Vulnerability Data |
title_short | A Comparative Study of Natural Language Processing Algorithms Based on Cities Changing Diabetes Vulnerability Data |
title_sort | comparative study of natural language processing algorithms based on cities changing diabetes vulnerability data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9223144/ https://www.ncbi.nlm.nih.gov/pubmed/35742169 http://dx.doi.org/10.3390/healthcare10061119 |
work_keys_str_mv | AT wangsiting acomparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT songfuman acomparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT qiaoqinqun acomparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT liuyuanyuan acomparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT chenjiageng acomparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT majun acomparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT wangsiting comparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT songfuman comparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT qiaoqinqun comparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT liuyuanyuan comparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT chenjiageng comparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata AT majun comparativestudyofnaturallanguageprocessingalgorithmsbasedoncitieschangingdiabetesvulnerabilitydata |