Cargando…

Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework

ABSTRACT: Background and purpose: Machine learning (ML) is applied for outcome prediction and treatment support. This study aims to develop different ML models to predict risk of axillary lymph node metastasis (LNM) in breast invasive micropapillary carcinoma (IMPC) and to explore the risk factors o...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Cong, Xiu, Yuting, Qiao, Kun, Yu, Xiao, Zhang, Shiyuan, Huang, Yuanxi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520536/
https://www.ncbi.nlm.nih.gov/pubmed/36185290
http://dx.doi.org/10.3389/fonc.2022.981059
_version_ 1784799647207784448
author Jiang, Cong
Xiu, Yuting
Qiao, Kun
Yu, Xiao
Zhang, Shiyuan
Huang, Yuanxi
author_facet Jiang, Cong
Xiu, Yuting
Qiao, Kun
Yu, Xiao
Zhang, Shiyuan
Huang, Yuanxi
author_sort Jiang, Cong
collection PubMed
description ABSTRACT: Background and purpose: Machine learning (ML) is applied for outcome prediction and treatment support. This study aims to develop different ML models to predict risk of axillary lymph node metastasis (LNM) in breast invasive micropapillary carcinoma (IMPC) and to explore the risk factors of LNM. METHODS: From the Surveillance, Epidemiology, and End Results (SEER) database and the records of our hospital, a total of 1547 patients diagnosed with breast IMPC were incorporated in this study. The ML model is built and the external validation is carried out. SHapley Additive exPlanations (SHAP) framework was applied to explain the optimal model; multivariable analysis was performed with logistic regression (LR); and nomograms were constructed according to the results of LR analysis. RESULTS: Age and tumor size were correlated with LNM in both cohorts. The luminal subtype is the most common in patients, with the tumor size <=20mm. Compared to other models, Xgboost was the best ML model with the biggest AUC of 0.813 (95% CI: 0.7994 - 0.8262) and the smallest Brier score of 0.186 (95% CI: 0.799-0.826). SHAP plots demonstrated that tumor size was the most vital risk factor for LNM. In both training and test sets, Xgboost had better AUC (0.761 vs 0.745; 0.813 vs 0.775; respectively), and it also achieved a smaller Brier score (0.202 vs 0.204; 0.186 vs 0.191; 0.220 vs 0.221; respectively) than the nomogram model based on LR in those three different sets. After adjusting for five most influential variables (tumor size, age, ER, HER-2, and PR), prediction score based on the Xgboost model was still correlated with LNM (adjusted OR:2.73, 95% CI: 1.30-5.71, P=0.008). CONCLUSIONS: The Xgboost model outperforms the traditional LR-based nomogram model in predicting the LNM of IMPC patients. Combined with SHAP, it can more intuitively reflect the influence of different variables on the LNM. The tumor size was the most important risk factor of LNM for breast IMPC patients. The prediction score obtained by the Xgboost model could be a good indicator for LNM.
format Online
Article
Text
id pubmed-9520536
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95205362022-09-30 Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework Jiang, Cong Xiu, Yuting Qiao, Kun Yu, Xiao Zhang, Shiyuan Huang, Yuanxi Front Oncol Oncology ABSTRACT: Background and purpose: Machine learning (ML) is applied for outcome prediction and treatment support. This study aims to develop different ML models to predict risk of axillary lymph node metastasis (LNM) in breast invasive micropapillary carcinoma (IMPC) and to explore the risk factors of LNM. METHODS: From the Surveillance, Epidemiology, and End Results (SEER) database and the records of our hospital, a total of 1547 patients diagnosed with breast IMPC were incorporated in this study. The ML model is built and the external validation is carried out. SHapley Additive exPlanations (SHAP) framework was applied to explain the optimal model; multivariable analysis was performed with logistic regression (LR); and nomograms were constructed according to the results of LR analysis. RESULTS: Age and tumor size were correlated with LNM in both cohorts. The luminal subtype is the most common in patients, with the tumor size <=20mm. Compared to other models, Xgboost was the best ML model with the biggest AUC of 0.813 (95% CI: 0.7994 - 0.8262) and the smallest Brier score of 0.186 (95% CI: 0.799-0.826). SHAP plots demonstrated that tumor size was the most vital risk factor for LNM. In both training and test sets, Xgboost had better AUC (0.761 vs 0.745; 0.813 vs 0.775; respectively), and it also achieved a smaller Brier score (0.202 vs 0.204; 0.186 vs 0.191; 0.220 vs 0.221; respectively) than the nomogram model based on LR in those three different sets. After adjusting for five most influential variables (tumor size, age, ER, HER-2, and PR), prediction score based on the Xgboost model was still correlated with LNM (adjusted OR:2.73, 95% CI: 1.30-5.71, P=0.008). CONCLUSIONS: The Xgboost model outperforms the traditional LR-based nomogram model in predicting the LNM of IMPC patients. Combined with SHAP, it can more intuitively reflect the influence of different variables on the LNM. The tumor size was the most important risk factor of LNM for breast IMPC patients. The prediction score obtained by the Xgboost model could be a good indicator for LNM. Frontiers Media S.A. 2022-09-15 /pmc/articles/PMC9520536/ /pubmed/36185290 http://dx.doi.org/10.3389/fonc.2022.981059 Text en Copyright © 2022 Jiang, Xiu, Qiao, Yu, Zhang and Huang https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Jiang, Cong
Xiu, Yuting
Qiao, Kun
Yu, Xiao
Zhang, Shiyuan
Huang, Yuanxi
Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework
title Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework
title_full Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework
title_fullStr Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework
title_full_unstemmed Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework
title_short Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework
title_sort prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and shapley additive explanations framework
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520536/
https://www.ncbi.nlm.nih.gov/pubmed/36185290
http://dx.doi.org/10.3389/fonc.2022.981059
work_keys_str_mv AT jiangcong predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework
AT xiuyuting predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework
AT qiaokun predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework
AT yuxiao predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework
AT zhangshiyuan predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework
AT huangyuanxi predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework