Cargando…

Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women

OBJECTIVE: Predictive models for the occurrence of cancer symptoms by using machine learning (ML) algorithms could be used to aid clinical decision-making in order to enhance the quality of cancer care. This study aimed to develop and validate a selection of classification models that used ML algori...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wu, Xiumei, Guan, Qiongyao, Cheng, Andy S.K., Guan, Changhe, Su, Yan, Jiang, Jingchi, Zeng, Yingchun, Zeng, Linghui, Wang, Boran
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier 2022
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9579303/ https://www.ncbi.nlm.nih.gov/pubmed/36276882 http://dx.doi.org/10.1016/j.apjon.2022.100101

_version_	1784812150874701824
author	Wu, Xiumei Guan, Qiongyao Cheng, Andy S.K. Guan, Changhe Su, Yan Jiang, Jingchi Zeng, Yingchun Zeng, Linghui Wang, Boran
author_facet	Wu, Xiumei Guan, Qiongyao Cheng, Andy S.K. Guan, Changhe Su, Yan Jiang, Jingchi Zeng, Yingchun Zeng, Linghui Wang, Boran
author_sort	Wu, Xiumei
collection	PubMed
description	OBJECTIVE: Predictive models for the occurrence of cancer symptoms by using machine learning (ML) algorithms could be used to aid clinical decision-making in order to enhance the quality of cancer care. This study aimed to develop and validate a selection of classification models that used ML algorithms to predict the occurrence of breast cancer-related lymphedema (BCRL) among Chinese women. METHODS: This was a retrospective cohort study of consecutive cases that had been diagnosed with breast cancer, stages I-IV. Forty-eight variables were grouped into five feature sets. Five classification models with ML algorithms were developed, and the models' performance and the variables’ relative importance were assessed accordingly. RESULTS: Of 370 eligible female participants, 91 had BCRL (24.6%). The mean age of this study sample was 49.89 (SD = 7.45). All participants had had breast cancer surgery, and more than half of them had had a modified radical mastectomy (n = 206, 55.5%). The mean follow-up time after breast cancer surgery was 28.73 months (SD = 11.71). Most of the tumors were either stage I (n = 49, 31.2%) or stage II (n = 252, 68.1%). More than half of the sample had had postoperative chemotherapy (n = 227, 61.4%). Overall, the logistic regression model achieved the best performance in terms of accuracy (91.6%), precision (82.1%), and recall (91.4%) for BCRL. Although this study included 48 predicting variables, we found that the five models required only 22 variables to achieve predictive performance. The most important variable was the number of positive lymph nodes, followed in descending order by the BCRL occurring on the same side as the surgery, a history of sentinel lymph node biopsy, a dietary preference for meat and fried food, and an exercise frequency of less than three times per week. These factors were the most influential predictors for enhancing the ML models’ performance. CONCLUSIONS: This study found that in the ML training dataset, the multilayer perceptron model and the logistic regression model were the best discrimination models for predicting the outcome of BCRL, and the k-nearest neighbors and support vector machine models demonstrated good calibration performance in the ML validation dataset. Future research will need to use large-sample datasets to establish a more robust ML model for predicting BCRL deeply and reliably.
format	Online Article Text
id	pubmed-9579303
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-95793032022-10-20 Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women Wu, Xiumei Guan, Qiongyao Cheng, Andy S.K. Guan, Changhe Su, Yan Jiang, Jingchi Zeng, Yingchun Zeng, Linghui Wang, Boran Asia Pac J Oncol Nurs Original Article OBJECTIVE: Predictive models for the occurrence of cancer symptoms by using machine learning (ML) algorithms could be used to aid clinical decision-making in order to enhance the quality of cancer care. This study aimed to develop and validate a selection of classification models that used ML algorithms to predict the occurrence of breast cancer-related lymphedema (BCRL) among Chinese women. METHODS: This was a retrospective cohort study of consecutive cases that had been diagnosed with breast cancer, stages I-IV. Forty-eight variables were grouped into five feature sets. Five classification models with ML algorithms were developed, and the models' performance and the variables’ relative importance were assessed accordingly. RESULTS: Of 370 eligible female participants, 91 had BCRL (24.6%). The mean age of this study sample was 49.89 (SD = 7.45). All participants had had breast cancer surgery, and more than half of them had had a modified radical mastectomy (n = 206, 55.5%). The mean follow-up time after breast cancer surgery was 28.73 months (SD = 11.71). Most of the tumors were either stage I (n = 49, 31.2%) or stage II (n = 252, 68.1%). More than half of the sample had had postoperative chemotherapy (n = 227, 61.4%). Overall, the logistic regression model achieved the best performance in terms of accuracy (91.6%), precision (82.1%), and recall (91.4%) for BCRL. Although this study included 48 predicting variables, we found that the five models required only 22 variables to achieve predictive performance. The most important variable was the number of positive lymph nodes, followed in descending order by the BCRL occurring on the same side as the surgery, a history of sentinel lymph node biopsy, a dietary preference for meat and fried food, and an exercise frequency of less than three times per week. These factors were the most influential predictors for enhancing the ML models’ performance. CONCLUSIONS: This study found that in the ML training dataset, the multilayer perceptron model and the logistic regression model were the best discrimination models for predicting the outcome of BCRL, and the k-nearest neighbors and support vector machine models demonstrated good calibration performance in the ML validation dataset. Future research will need to use large-sample datasets to establish a more robust ML model for predicting BCRL deeply and reliably. Elsevier 2022-06-09 /pmc/articles/PMC9579303/ /pubmed/36276882 http://dx.doi.org/10.1016/j.apjon.2022.100101 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle	Original Article Wu, Xiumei Guan, Qiongyao Cheng, Andy S.K. Guan, Changhe Su, Yan Jiang, Jingchi Zeng, Yingchun Zeng, Linghui Wang, Boran Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title	Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title_full	Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title_fullStr	Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title_full_unstemmed	Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title_short	Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women
title_sort	comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in chinese women
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9579303/ https://www.ncbi.nlm.nih.gov/pubmed/36276882 http://dx.doi.org/10.1016/j.apjon.2022.100101
work_keys_str_mv	AT wuxiumei comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT guanqiongyao comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT chengandysk comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT guanchanghe comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT suyan comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT jiangjingchi comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT zengyingchun comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT zenglinghui comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen AT wangboran comparisonofmachinelearningmodelsforpredictingtheriskofbreastcancerrelatedlymphedemainchinesewomen

Comparison of machine learning models for predicting the risk of breast cancer-related lymphedema in Chinese women

Ejemplares similares