Cargando…

Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women

OBJECTIVE: Predictive models for the occurrence of cancer symptoms by using machine learning (ML) algorithms could be used to aid clinical decision-making in order to enhance the quality of cancer care. This study aimed to develop and validate a selection of classification models that used ML algori...

Descripción completa

Detalles Bibliográficos
Autores principales: Wu, Xiumei, Guan, Qiongyao, Cheng, Andy S.K., Guan, Changhe, Su, Yan, Jiang, Jingchi, Zeng, Yingchun, Zeng, Linghui, Wang, Boran
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9579303/
https://www.ncbi.nlm.nih.gov/pubmed/36276882
http://dx.doi.org/10.1016/j.apjon.2022.100101
_version_ 1784812150874701824
author Wu, Xiumei
Guan, Qiongyao
Cheng, Andy S.K.
Guan, Changhe
Su, Yan
Jiang, Jingchi
Zeng, Yingchun
Zeng, Linghui
Wang, Boran
author_facet Wu, Xiumei
Guan, Qiongyao
Cheng, Andy S.K.
Guan, Changhe
Su, Yan
Jiang, Jingchi
Zeng, Yingchun
Zeng, Linghui
Wang, Boran
author_sort Wu, Xiumei
collection PubMed
description OBJECTIVE: Predictive models for the occurrence of cancer symptoms by using machine learning (ML) algorithms could be used to aid clinical decision-making in order to enhance the quality of cancer care. This study aimed to develop and validate a selection of classification models that used ML algorithms to predict the occurrence of breast cancer-related lymphedema (BCRL) among Chinese women. METHODS: This was a retrospective cohort study of consecutive cases that had been diagnosed with breast cancer, stages I-IV. Forty-eight variables were grouped into five feature sets. Five classification models with ML algorithms were developed, and the models' performance and the variables’ relative importance were assessed accordingly. RESULTS: Of 370 eligible female participants, 91 had BCRL (24.6%). The mean age of this study sample was 49.89 (SD ​= ​7.45). All participants had had breast cancer surgery, and more than half of them had had a modified radical mastectomy (n ​= ​206, 55.5%). The mean follow-up time after breast cancer surgery was 28.73 months (SD ​= ​11.71). Most of the tumors were either stage I (n ​= ​49, 31.2%) or stage II (n ​= ​252, 68.1%). More than half of the sample had had postoperative chemotherapy (n ​= ​227, 61.4%). Overall, the logistic regression model achieved the best performance in terms of accuracy (91.6%), precision (82.1%), and recall (91.4%) for BCRL. Although this study included 48 predicting variables, we found that the five models required only 22 variables to achieve predictive performance. The most important variable was the number of positive lymph nodes, followed in descending order by the BCRL occurring on the same side as the surgery, a history of sentinel lymph node biopsy, a dietary preference for meat and fried food, and an exercise frequency of less than three times per week. These factors were the most influential predictors for enhancing the ML models’ performance. CONCLUSIONS: This study found that in the ML training dataset, the multilayer perceptron model and the logistic regression model were the best discrimination models for predicting the outcome of BCRL, and the k-nearest neighbors and support vector machine models demonstrated good calibration performance in the ML validation dataset. Future research will need to use large-sample datasets to establish a more robust ML model for predicting BCRL deeply and reliably.
format Online
Article
Text
id pubmed-9579303
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-95793032022-10-20 Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women Wu, Xiumei Guan, Qiongyao Cheng, Andy S.K. Guan, Changhe Su, Yan Jiang, Jingchi Zeng, Yingchun Zeng, Linghui Wang, Boran Asia Pac J Oncol Nurs Original Article OBJECTIVE: Predictive models for the occurrence of cancer symptoms by using machine learning (ML) algorithms could be used to aid clinical decision-making in order to enhance the quality of cancer care. This study aimed to develop and validate a selection of classification models that used ML algorithms to predict the occurrence of breast cancer-related lymphedema (BCRL) among Chinese women. METHODS: This was a retrospective cohort study of consecutive cases that had been diagnosed with breast cancer, stages I-IV. Forty-eight variables were grouped into five feature sets. Five classification models with ML algorithms were developed, and the models' performance and the variables’ relative importance were assessed accordingly. RESULTS: Of 370 eligible female participants, 91 had BCRL (24.6%). The mean age of this study sample was 49.89 (SD ​= ​7.45). All participants had had breast cancer surgery, and more than half of them had had a modified radical mastectomy (n ​= ​206, 55.5%). The mean follow-up time after breast cancer surgery was 28.73 months (SD ​= ​11.71). Most of the tumors were either stage I (n ​= ​49, 31.2%) or stage II (n ​= ​252, 68.1%). More than half of the sample had had postoperative chemotherapy (n ​= ​227, 61.4%). Overall, the logistic regression model achieved the best performance in terms of accuracy (91.6%), precision (82.1%), and recall (91.4%) for BCRL. Although this study included 48 predicting variables, we found that the five models required only 22 variables to achieve predictive performance. The most important variable was the number of positive lymph nodes, followed in descending order by the BCRL occurring on the same side as the surgery, a history of sentinel lymph node biopsy, a dietary preference for meat and fried food, and an exercise frequency of less than three times per week. These factors were the most influential predictors for enhancing the ML models’ performance. CONCLUSIONS: This study found that in the ML training dataset, the multilayer perceptron model and the logistic regression model were the best discrimination models for predicting the outcome of BCRL, and the k-nearest neighbors and support vector machine models demonstrated good calibration performance in the ML validation dataset. Future research will need to use large-sample datasets to establish a more robust ML model for predicting BCRL deeply and reliably. Elsevier 2022-06-09 /pmc/articles/PMC9579303/ /pubmed/36276882 http://dx.doi.org/10.1016/j.apjon.2022.100101 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Article
Wu, Xiumei
Guan, Qiongyao
Cheng, Andy S.K.
Guan, Changhe
Su, Yan
Jiang, Jingchi
Zeng, Yingchun
Zeng, Linghui
Wang, Boran
Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title_full Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title_fullStr Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title_full_unstemmed Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title_short Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title_sort comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in chinese women
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9579303/
https://www.ncbi.nlm.nih.gov/pubmed/36276882
http://dx.doi.org/10.1016/j.apjon.2022.100101
work_keys_str_mv AT wuxiumei comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen
AT guanqiongyao comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen
AT chengandysk comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen
AT guanchanghe comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen
AT suyan comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen
AT jiangjingchi comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen
AT zengyingchun comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen
AT zenglinghui comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen
AT wangboran comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen