Cargando…
Development of predictive models for lymphedema by using blood tests and therapy data
Lymphedema is a disease that refers to tissue swelling caused by an accumulation of protein-rich fluid that is usually drained through the lymphatic system. Detection of lymphedema is often based on expensive diagnoses such as bioimpedance spectroscopy, shear wave elastography, computed tomography,...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10643602/ https://www.ncbi.nlm.nih.gov/pubmed/37957217 http://dx.doi.org/10.1038/s41598-023-46567-1 |
_version_ | 1785147137535770624 |
---|---|
author | Trinh, Xuan-Tung Chien, Pham Ngoc Long, Nguyen-Van Van Anh, Le Thi Giang, Nguyen Ngan Nam, Sun-Young Myung, Yujin |
author_facet | Trinh, Xuan-Tung Chien, Pham Ngoc Long, Nguyen-Van Van Anh, Le Thi Giang, Nguyen Ngan Nam, Sun-Young Myung, Yujin |
author_sort | Trinh, Xuan-Tung |
collection | PubMed |
description | Lymphedema is a disease that refers to tissue swelling caused by an accumulation of protein-rich fluid that is usually drained through the lymphatic system. Detection of lymphedema is often based on expensive diagnoses such as bioimpedance spectroscopy, shear wave elastography, computed tomography, etc. In current machine learning models for lymphedema prediction, reliance on observable symptoms reported by patients introduces the possibility of errors in patient-input data. Moreover, these symptoms are often absent during the initial stages of lymphedema, creating challenges in its early detection. Identifying lymphedema before these observable symptoms manifest would greatly benefit patients by potentially minimizing the discomfort caused by these symptoms. In this study, we propose to use new data, such as complete blood count, serum, and therapy data, to develop predictive models for lymphedema. This approach aims to compensate for the limitations of using only observable symptoms data. We collected data from 2137 patients, including 356 patients with lymphedema and 1781 patients without lymphedema, with the lymphedema status of each patient confirmed by clinicians. The data for each patient included: (1) a complete blood count (CBC) test, (2) a serum test, and (3) therapy information. We used various machine learning algorithms (i.e. random forest, gradient boosting, decision tree, logistic regression, and artificial neural network) to develop predictive models on the training dataset (i.e. 80% of the data) and evaluated the models on the external validation dataset (i.e. 20% of the data). After selecting the best predictive models, we created a web application to aid medical doctors and clinicians in the rapid screening of lymphedema patients. A dataset of 2137 patients was assembled from Seoul National University Bundang Hospital. Predictive models based on the random forest algorithm exhibited satisfactory performance (balanced accuracy = 87.0 ± 0.7%, sensitivity = 84.3 ± 0.6%, specificity = 89.1 ± 1.5%, precision = 97.4 ± 0.7%, F1 score = 90.4 ± 0.4%, and AUC = 0.931 ± 0.007). We developed a web application to facilitate the swift screening of lymphedema among medical practitioners: https://snubhtxt.shinyapps.io/SNUBH_Lymphedema. Our study introduces a novel tool for the early detection of lymphedema and establishes the foundation for future investigations into predicting different stages of the condition. |
format | Online Article Text |
id | pubmed-10643602 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-106436022023-11-13 Development of predictive models for lymphedema by using blood tests and therapy data Trinh, Xuan-Tung Chien, Pham Ngoc Long, Nguyen-Van Van Anh, Le Thi Giang, Nguyen Ngan Nam, Sun-Young Myung, Yujin Sci Rep Article Lymphedema is a disease that refers to tissue swelling caused by an accumulation of protein-rich fluid that is usually drained through the lymphatic system. Detection of lymphedema is often based on expensive diagnoses such as bioimpedance spectroscopy, shear wave elastography, computed tomography, etc. In current machine learning models for lymphedema prediction, reliance on observable symptoms reported by patients introduces the possibility of errors in patient-input data. Moreover, these symptoms are often absent during the initial stages of lymphedema, creating challenges in its early detection. Identifying lymphedema before these observable symptoms manifest would greatly benefit patients by potentially minimizing the discomfort caused by these symptoms. In this study, we propose to use new data, such as complete blood count, serum, and therapy data, to develop predictive models for lymphedema. This approach aims to compensate for the limitations of using only observable symptoms data. We collected data from 2137 patients, including 356 patients with lymphedema and 1781 patients without lymphedema, with the lymphedema status of each patient confirmed by clinicians. The data for each patient included: (1) a complete blood count (CBC) test, (2) a serum test, and (3) therapy information. We used various machine learning algorithms (i.e. random forest, gradient boosting, decision tree, logistic regression, and artificial neural network) to develop predictive models on the training dataset (i.e. 80% of the data) and evaluated the models on the external validation dataset (i.e. 20% of the data). After selecting the best predictive models, we created a web application to aid medical doctors and clinicians in the rapid screening of lymphedema patients. A dataset of 2137 patients was assembled from Seoul National University Bundang Hospital. Predictive models based on the random forest algorithm exhibited satisfactory performance (balanced accuracy = 87.0 ± 0.7%, sensitivity = 84.3 ± 0.6%, specificity = 89.1 ± 1.5%, precision = 97.4 ± 0.7%, F1 score = 90.4 ± 0.4%, and AUC = 0.931 ± 0.007). We developed a web application to facilitate the swift screening of lymphedema among medical practitioners: https://snubhtxt.shinyapps.io/SNUBH_Lymphedema. Our study introduces a novel tool for the early detection of lymphedema and establishes the foundation for future investigations into predicting different stages of the condition. Nature Publishing Group UK 2023-11-13 /pmc/articles/PMC10643602/ /pubmed/37957217 http://dx.doi.org/10.1038/s41598-023-46567-1 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Trinh, Xuan-Tung Chien, Pham Ngoc Long, Nguyen-Van Van Anh, Le Thi Giang, Nguyen Ngan Nam, Sun-Young Myung, Yujin Development of predictive models for lymphedema by using blood tests and therapy data |
title | Development of predictive models for lymphedema by using blood tests and therapy data |
title_full | Development of predictive models for lymphedema by using blood tests and therapy data |
title_fullStr | Development of predictive models for lymphedema by using blood tests and therapy data |
title_full_unstemmed | Development of predictive models for lymphedema by using blood tests and therapy data |
title_short | Development of predictive models for lymphedema by using blood tests and therapy data |
title_sort | development of predictive models for lymphedema by using blood tests and therapy data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10643602/ https://www.ncbi.nlm.nih.gov/pubmed/37957217 http://dx.doi.org/10.1038/s41598-023-46567-1 |
work_keys_str_mv | AT trinhxuantung developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata AT chienphamngoc developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata AT longnguyenvan developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata AT vananhlethi developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata AT giangnguyenngan developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata AT namsunyoung developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata AT myungyujin developmentofpredictivemodelsforlymphedemabyusingbloodtestsandtherapydata |