Cargando…

Development of predictive models for lymphedema by using blood tests and therapy data

Lymphedema is a disease that refers to tissue swelling caused by an accumulation of protein-rich fluid that is usually drained through the lymphatic system. Detection of lymphedema is often based on expensive diagnoses such as bioimpedance spectroscopy, shear wave elastography, computed tomography,...

Descripción completa

Detalles Bibliográficos
Autores principales: Trinh, Xuan-Tung, Chien, Pham Ngoc, Long, Nguyen-Van, Van Anh, Le Thi, Giang, Nguyen Ngan, Nam, Sun-Young, Myung, Yujin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10643602/
https://www.ncbi.nlm.nih.gov/pubmed/37957217
http://dx.doi.org/10.1038/s41598-023-46567-1
_version_ 1785147137535770624
author Trinh, Xuan-Tung
Chien, Pham Ngoc
Long, Nguyen-Van
Van Anh, Le Thi
Giang, Nguyen Ngan
Nam, Sun-Young
Myung, Yujin
author_facet Trinh, Xuan-Tung
Chien, Pham Ngoc
Long, Nguyen-Van
Van Anh, Le Thi
Giang, Nguyen Ngan
Nam, Sun-Young
Myung, Yujin
author_sort Trinh, Xuan-Tung
collection PubMed
description Lymphedema is a disease that refers to tissue swelling caused by an accumulation of protein-rich fluid that is usually drained through the lymphatic system. Detection of lymphedema is often based on expensive diagnoses such as bioimpedance spectroscopy, shear wave elastography, computed tomography, etc. In current machine learning models for lymphedema prediction, reliance on observable symptoms reported by patients introduces the possibility of errors in patient-input data. Moreover, these symptoms are often absent during the initial stages of lymphedema, creating challenges in its early detection. Identifying lymphedema before these observable symptoms manifest would greatly benefit patients by potentially minimizing the discomfort caused by these symptoms. In this study, we propose to use new data, such as complete blood count, serum, and therapy data, to develop predictive models for lymphedema. This approach aims to compensate for the limitations of using only observable symptoms data. We collected data from 2137 patients, including 356 patients with lymphedema and 1781 patients without lymphedema, with the lymphedema status of each patient confirmed by clinicians. The data for each patient included: (1) a complete blood count (CBC) test, (2) a serum test, and (3) therapy information. We used various machine learning algorithms (i.e. random forest, gradient boosting, decision tree, logistic regression, and artificial neural network) to develop predictive models on the training dataset (i.e. 80% of the data) and evaluated the models on the external validation dataset (i.e. 20% of the data). After selecting the best predictive models, we created a web application to aid medical doctors and clinicians in the rapid screening of lymphedema patients. A dataset of 2137 patients was assembled from Seoul National University Bundang Hospital. Predictive models based on the random forest algorithm exhibited satisfactory performance (balanced accuracy = 87.0 ± 0.7%, sensitivity = 84.3 ± 0.6%, specificity = 89.1 ± 1.5%, precision = 97.4 ± 0.7%, F1 score = 90.4 ± 0.4%, and AUC = 0.931 ± 0.007). We developed a web application to facilitate the swift screening of lymphedema among medical practitioners: https://snubhtxt.shinyapps.io/SNUBH_Lymphedema. Our study introduces a novel tool for the early detection of lymphedema and establishes the foundation for future investigations into predicting different stages of the condition.
format Online
Article
Text
id pubmed-10643602
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106436022023-11-13 Development of predictive models for lymphedema by using blood tests and therapy data Trinh, Xuan-Tung Chien, Pham Ngoc Long, Nguyen-Van Van Anh, Le Thi Giang, Nguyen Ngan Nam, Sun-Young Myung, Yujin Sci Rep Article Lymphedema is a disease that refers to tissue swelling caused by an accumulation of protein-rich fluid that is usually drained through the lymphatic system. Detection of lymphedema is often based on expensive diagnoses such as bioimpedance spectroscopy, shear wave elastography, computed tomography, etc. In current machine learning models for lymphedema prediction, reliance on observable symptoms reported by patients introduces the possibility of errors in patient-input data. Moreover, these symptoms are often absent during the initial stages of lymphedema, creating challenges in its early detection. Identifying lymphedema before these observable symptoms manifest would greatly benefit patients by potentially minimizing the discomfort caused by these symptoms. In this study, we propose to use new data, such as complete blood count, serum, and therapy data, to develop predictive models for lymphedema. This approach aims to compensate for the limitations of using only observable symptoms data. We collected data from 2137 patients, including 356 patients with lymphedema and 1781 patients without lymphedema, with the lymphedema status of each patient confirmed by clinicians. The data for each patient included: (1) a complete blood count (CBC) test, (2) a serum test, and (3) therapy information. We used various machine learning algorithms (i.e. random forest, gradient boosting, decision tree, logistic regression, and artificial neural network) to develop predictive models on the training dataset (i.e. 80% of the data) and evaluated the models on the external validation dataset (i.e. 20% of the data). After selecting the best predictive models, we created a web application to aid medical doctors and clinicians in the rapid screening of lymphedema patients. A dataset of 2137 patients was assembled from Seoul National University Bundang Hospital. Predictive models based on the random forest algorithm exhibited satisfactory performance (balanced accuracy = 87.0 ± 0.7%, sensitivity = 84.3 ± 0.6%, specificity = 89.1 ± 1.5%, precision = 97.4 ± 0.7%, F1 score = 90.4 ± 0.4%, and AUC = 0.931 ± 0.007). We developed a web application to facilitate the swift screening of lymphedema among medical practitioners: https://snubhtxt.shinyapps.io/SNUBH_Lymphedema. Our study introduces a novel tool for the early detection of lymphedema and establishes the foundation for future investigations into predicting different stages of the condition. Nature Publishing Group UK 2023-11-13 /pmc/articles/PMC10643602/ /pubmed/37957217 http://dx.doi.org/10.1038/s41598-023-46567-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Trinh, Xuan-Tung
Chien, Pham Ngoc
Long, Nguyen-Van
Van Anh, Le Thi
Giang, Nguyen Ngan
Nam, Sun-Young
Myung, Yujin
Development of predictive models for lymphedema by using blood tests and therapy data
title Development of predictive models for lymphedema by using blood tests and therapy data
title_full Development of predictive models for lymphedema by using blood tests and therapy data
title_fullStr Development of predictive models for lymphedema by using blood tests and therapy data
title_full_unstemmed Development of predictive models for lymphedema by using blood tests and therapy data
title_short Development of predictive models for lymphedema by using blood tests and therapy data
title_sort development of predictive models for lymphedema by using blood tests and therapy data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10643602/
https://www.ncbi.nlm.nih.gov/pubmed/37957217
http://dx.doi.org/10.1038/s41598-023-46567-1
work_keys_str_mv AT trinhxuantung developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata
AT chienphamngoc developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata
AT longnguyenvan developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata
AT vananhlethi developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata
AT giangnguyenngan developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata
AT namsunyoung developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata
AT myungyujin developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata