Cargando…
Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
OBJECTIVE: Predictive models for the occurrence of cancer symptoms by using machine learning (ML) algorithms could be used to aid clinical decision-making in order to enhance the quality of cancer care. This study aimed to develop and validate a selection of classification models that used ML algori...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9579303/ https://www.ncbi.nlm.nih.gov/pubmed/36276882 http://dx.doi.org/10.1016/j.apjon.2022.100101 |
_version_ | 1784812150874701824 |
---|---|
author | Wu, Xiumei Guan, Qiongyao Cheng, Andy S.K. Guan, Changhe Su, Yan Jiang, Jingchi Zeng, Yingchun Zeng, Linghui Wang, Boran |
author_facet | Wu, Xiumei Guan, Qiongyao Cheng, Andy S.K. Guan, Changhe Su, Yan Jiang, Jingchi Zeng, Yingchun Zeng, Linghui Wang, Boran |
author_sort | Wu, Xiumei |
collection | PubMed |
description | OBJECTIVE: Predictive models for the occurrence of cancer symptoms by using machine learning (ML) algorithms could be used to aid clinical decision-making in order to enhance the quality of cancer care. This study aimed to develop and validate a selection of classification models that used ML algorithms to predict the occurrence of breast cancer-related lymphedema (BCRL) among Chinese women. METHODS: This was a retrospective cohort study of consecutive cases that had been diagnosed with breast cancer, stages I-IV. Forty-eight variables were grouped into five feature sets. Five classification models with ML algorithms were developed, and the models' performance and the variables’ relative importance were assessed accordingly. RESULTS: Of 370 eligible female participants, 91 had BCRL (24.6%). The mean age of this study sample was 49.89 (SD = 7.45). All participants had had breast cancer surgery, and more than half of them had had a modified radical mastectomy (n = 206, 55.5%). The mean follow-up time after breast cancer surgery was 28.73 months (SD = 11.71). Most of the tumors were either stage I (n = 49, 31.2%) or stage II (n = 252, 68.1%). More than half of the sample had had postoperative chemotherapy (n = 227, 61.4%). Overall, the logistic regression model achieved the best performance in terms of accuracy (91.6%), precision (82.1%), and recall (91.4%) for BCRL. Although this study included 48 predicting variables, we found that the five models required only 22 variables to achieve predictive performance. The most important variable was the number of positive lymph nodes, followed in descending order by the BCRL occurring on the same side as the surgery, a history of sentinel lymph node biopsy, a dietary preference for meat and fried food, and an exercise frequency of less than three times per week. These factors were the most influential predictors for enhancing the ML models’ performance. CONCLUSIONS: This study found that in the ML training dataset, the multilayer perceptron model and the logistic regression model were the best discrimination models for predicting the outcome of BCRL, and the k-nearest neighbors and support vector machine models demonstrated good calibration performance in the ML validation dataset. Future research will need to use large-sample datasets to establish a more robust ML model for predicting BCRL deeply and reliably. |
format | Online Article Text |
id | pubmed-9579303 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-95793032022-10-20 Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women Wu, Xiumei Guan, Qiongyao Cheng, Andy S.K. Guan, Changhe Su, Yan Jiang, Jingchi Zeng, Yingchun Zeng, Linghui Wang, Boran Asia Pac J Oncol Nurs Original Article OBJECTIVE: Predictive models for the occurrence of cancer symptoms by using machine learning (ML) algorithms could be used to aid clinical decision-making in order to enhance the quality of cancer care. This study aimed to develop and validate a selection of classification models that used ML algorithms to predict the occurrence of breast cancer-related lymphedema (BCRL) among Chinese women. METHODS: This was a retrospective cohort study of consecutive cases that had been diagnosed with breast cancer, stages I-IV. Forty-eight variables were grouped into five feature sets. Five classification models with ML algorithms were developed, and the models' performance and the variables’ relative importance were assessed accordingly. RESULTS: Of 370 eligible female participants, 91 had BCRL (24.6%). The mean age of this study sample was 49.89 (SD = 7.45). All participants had had breast cancer surgery, and more than half of them had had a modified radical mastectomy (n = 206, 55.5%). The mean follow-up time after breast cancer surgery was 28.73 months (SD = 11.71). Most of the tumors were either stage I (n = 49, 31.2%) or stage II (n = 252, 68.1%). More than half of the sample had had postoperative chemotherapy (n = 227, 61.4%). Overall, the logistic regression model achieved the best performance in terms of accuracy (91.6%), precision (82.1%), and recall (91.4%) for BCRL. Although this study included 48 predicting variables, we found that the five models required only 22 variables to achieve predictive performance. The most important variable was the number of positive lymph nodes, followed in descending order by the BCRL occurring on the same side as the surgery, a history of sentinel lymph node biopsy, a dietary preference for meat and fried food, and an exercise frequency of less than three times per week. These factors were the most influential predictors for enhancing the ML models’ performance. CONCLUSIONS: This study found that in the ML training dataset, the multilayer perceptron model and the logistic regression model were the best discrimination models for predicting the outcome of BCRL, and the k-nearest neighbors and support vector machine models demonstrated good calibration performance in the ML validation dataset. Future research will need to use large-sample datasets to establish a more robust ML model for predicting BCRL deeply and reliably. Elsevier 2022-06-09 /pmc/articles/PMC9579303/ /pubmed/36276882 http://dx.doi.org/10.1016/j.apjon.2022.100101 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Original Article Wu, Xiumei Guan, Qiongyao Cheng, Andy S.K. Guan, Changhe Su, Yan Jiang, Jingchi Zeng, Yingchun Zeng, Linghui Wang, Boran Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women |
title | Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women |
title_full | Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women |
title_fullStr | Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women |
title_full_unstemmed | Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women |
title_short | Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women |
title_sort | comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in chinese women |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9579303/ https://www.ncbi.nlm.nih.gov/pubmed/36276882 http://dx.doi.org/10.1016/j.apjon.2022.100101 |
work_keys_str_mv | AT wuxiumei comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT guanqiongyao comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT chengandysk comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT guanchanghe comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT suyan comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT jiangjingchi comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT zengyingchun comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT zenglinghui comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT wangboran comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen |