Cargando…
Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework
ABSTRACT: Background and purpose: Machine learning (ML) is applied for outcome prediction and treatment support. This study aims to develop different ML models to predict risk of axillary lymph node metastasis (LNM) in breast invasive micropapillary carcinoma (IMPC) and to explore the risk factors o...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520536/ https://www.ncbi.nlm.nih.gov/pubmed/36185290 http://dx.doi.org/10.3389/fonc.2022.981059 |
_version_ | 1784799647207784448 |
---|---|
author | Jiang, Cong Xiu, Yuting Qiao, Kun Yu, Xiao Zhang, Shiyuan Huang, Yuanxi |
author_facet | Jiang, Cong Xiu, Yuting Qiao, Kun Yu, Xiao Zhang, Shiyuan Huang, Yuanxi |
author_sort | Jiang, Cong |
collection | PubMed |
description | ABSTRACT: Background and purpose: Machine learning (ML) is applied for outcome prediction and treatment support. This study aims to develop different ML models to predict risk of axillary lymph node metastasis (LNM) in breast invasive micropapillary carcinoma (IMPC) and to explore the risk factors of LNM. METHODS: From the Surveillance, Epidemiology, and End Results (SEER) database and the records of our hospital, a total of 1547 patients diagnosed with breast IMPC were incorporated in this study. The ML model is built and the external validation is carried out. SHapley Additive exPlanations (SHAP) framework was applied to explain the optimal model; multivariable analysis was performed with logistic regression (LR); and nomograms were constructed according to the results of LR analysis. RESULTS: Age and tumor size were correlated with LNM in both cohorts. The luminal subtype is the most common in patients, with the tumor size <=20mm. Compared to other models, Xgboost was the best ML model with the biggest AUC of 0.813 (95% CI: 0.7994 - 0.8262) and the smallest Brier score of 0.186 (95% CI: 0.799-0.826). SHAP plots demonstrated that tumor size was the most vital risk factor for LNM. In both training and test sets, Xgboost had better AUC (0.761 vs 0.745; 0.813 vs 0.775; respectively), and it also achieved a smaller Brier score (0.202 vs 0.204; 0.186 vs 0.191; 0.220 vs 0.221; respectively) than the nomogram model based on LR in those three different sets. After adjusting for five most influential variables (tumor size, age, ER, HER-2, and PR), prediction score based on the Xgboost model was still correlated with LNM (adjusted OR:2.73, 95% CI: 1.30-5.71, P=0.008). CONCLUSIONS: The Xgboost model outperforms the traditional LR-based nomogram model in predicting the LNM of IMPC patients. Combined with SHAP, it can more intuitively reflect the influence of different variables on the LNM. The tumor size was the most important risk factor of LNM for breast IMPC patients. The prediction score obtained by the Xgboost model could be a good indicator for LNM. |
format | Online Article Text |
id | pubmed-9520536 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95205362022-09-30 Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework Jiang, Cong Xiu, Yuting Qiao, Kun Yu, Xiao Zhang, Shiyuan Huang, Yuanxi Front Oncol Oncology ABSTRACT: Background and purpose: Machine learning (ML) is applied for outcome prediction and treatment support. This study aims to develop different ML models to predict risk of axillary lymph node metastasis (LNM) in breast invasive micropapillary carcinoma (IMPC) and to explore the risk factors of LNM. METHODS: From the Surveillance, Epidemiology, and End Results (SEER) database and the records of our hospital, a total of 1547 patients diagnosed with breast IMPC were incorporated in this study. The ML model is built and the external validation is carried out. SHapley Additive exPlanations (SHAP) framework was applied to explain the optimal model; multivariable analysis was performed with logistic regression (LR); and nomograms were constructed according to the results of LR analysis. RESULTS: Age and tumor size were correlated with LNM in both cohorts. The luminal subtype is the most common in patients, with the tumor size <=20mm. Compared to other models, Xgboost was the best ML model with the biggest AUC of 0.813 (95% CI: 0.7994 - 0.8262) and the smallest Brier score of 0.186 (95% CI: 0.799-0.826). SHAP plots demonstrated that tumor size was the most vital risk factor for LNM. In both training and test sets, Xgboost had better AUC (0.761 vs 0.745; 0.813 vs 0.775; respectively), and it also achieved a smaller Brier score (0.202 vs 0.204; 0.186 vs 0.191; 0.220 vs 0.221; respectively) than the nomogram model based on LR in those three different sets. After adjusting for five most influential variables (tumor size, age, ER, HER-2, and PR), prediction score based on the Xgboost model was still correlated with LNM (adjusted OR:2.73, 95% CI: 1.30-5.71, P=0.008). CONCLUSIONS: The Xgboost model outperforms the traditional LR-based nomogram model in predicting the LNM of IMPC patients. Combined with SHAP, it can more intuitively reflect the influence of different variables on the LNM. The tumor size was the most important risk factor of LNM for breast IMPC patients. The prediction score obtained by the Xgboost model could be a good indicator for LNM. Frontiers Media S.A. 2022-09-15 /pmc/articles/PMC9520536/ /pubmed/36185290 http://dx.doi.org/10.3389/fonc.2022.981059 Text en Copyright © 2022 Jiang, Xiu, Qiao, Yu, Zhang and Huang https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Oncology Jiang, Cong Xiu, Yuting Qiao, Kun Yu, Xiao Zhang, Shiyuan Huang, Yuanxi Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework |
title | Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework |
title_full | Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework |
title_fullStr | Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework |
title_full_unstemmed | Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework |
title_short | Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework |
title_sort | prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and shapley additive explanations framework |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520536/ https://www.ncbi.nlm.nih.gov/pubmed/36185290 http://dx.doi.org/10.3389/fonc.2022.981059 |
work_keys_str_mv | AT jiangcong predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework AT xiuyuting predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework AT qiaokun predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework AT yuxiao predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework AT zhangshiyuan predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework AT huangyuanxi predictionoflymphnodemetastasisinpatientswithbreastinvasivemicropapillarycarcinomabasedonmachinelearningandshapleyadditiveexplanationsframework |