Cargando…
Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients
Purpose: Lymph node metastasis is a multifactorial event. Several scholars have developed nomograph models to predict the sentinel lymph nodes (SLN) metastasis before operation. According to the clinical and pathological characteristics of breast cancer patients, we use the new method to establish a...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6476951/ https://www.ncbi.nlm.nih.gov/pubmed/31041192 http://dx.doi.org/10.3389/fonc.2019.00282 |
_version_ | 1783412966696157184 |
---|---|
author | Liu, Chao Zhao, Zeyin Gu, Xi Sun, Lisha Chen, Guanglei Zhang, Hao Jiang, Yanlin Zhang, Yixiao Cui, Xiaoyu Liu, Caigang |
author_facet | Liu, Chao Zhao, Zeyin Gu, Xi Sun, Lisha Chen, Guanglei Zhang, Hao Jiang, Yanlin Zhang, Yixiao Cui, Xiaoyu Liu, Caigang |
author_sort | Liu, Chao |
collection | PubMed |
description | Purpose: Lymph node metastasis is a multifactorial event. Several scholars have developed nomograph models to predict the sentinel lymph nodes (SLN) metastasis before operation. According to the clinical and pathological characteristics of breast cancer patients, we use the new method to establish a more comprehensive model and add some new factors which have never been analyzed in the world and explored the prospect of its clinical application. Materials and methods: The clinicopathological data of 633 patients with breast cancer who underwent SLN examination from January 2011 to December 2014 were retrospectively analyzed. Because of the imbalance in data, we used smote algorithm to oversample the data to increase the balanced amount of data. Our study for the first time included the shape of the tumor and breast gland content. The location of the tumor was analyzed by the vector combining quadrant method, at the same time we use the method of simply using quadrant or vector for comparing. We also compared the predictive ability of building models through logistic regression and Bagged-Tree algorithm. The Bagged-Tree algorithm was used to categorize samples. The SMOTE-Bagged Tree algorithm and 5-fold cross-validation was used to established the prediction model. The clinical application value of the model in early breast cancer patients was evaluated by confusion matrix and the area under receiver operating characteristic (ROC) curve (AUC). Results: Our predictive model included 12 variables as follows: age, body mass index (BMI), quadrant, clock direction, the distance of tumor from the nipple, morphology of tumor molybdenum target, glandular content, tumor size, ER, PR, HER2, and Ki-67.Finally, our model obtained the AUC value of 0.801 and the accuracy of 70.3%.We used logistic regression to established the model, in the modeling and validation groups, the area under the curve (AUC) were 0.660 and 0.580.We used the vector combining quadrant method to analyze the original location of the tumor, which is more precise than simply using vector or quadrant (AUC 0.801 vs. 0.791 vs. 0.701, Accuracy 70.3 vs. 70.3 vs. 63.6%). Conclusions: Our model is more reliable and stable to assist doctors predict the SLN metastasis in breast cancer patients before operation. |
format | Online Article Text |
id | pubmed-6476951 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-64769512019-04-30 Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients Liu, Chao Zhao, Zeyin Gu, Xi Sun, Lisha Chen, Guanglei Zhang, Hao Jiang, Yanlin Zhang, Yixiao Cui, Xiaoyu Liu, Caigang Front Oncol Oncology Purpose: Lymph node metastasis is a multifactorial event. Several scholars have developed nomograph models to predict the sentinel lymph nodes (SLN) metastasis before operation. According to the clinical and pathological characteristics of breast cancer patients, we use the new method to establish a more comprehensive model and add some new factors which have never been analyzed in the world and explored the prospect of its clinical application. Materials and methods: The clinicopathological data of 633 patients with breast cancer who underwent SLN examination from January 2011 to December 2014 were retrospectively analyzed. Because of the imbalance in data, we used smote algorithm to oversample the data to increase the balanced amount of data. Our study for the first time included the shape of the tumor and breast gland content. The location of the tumor was analyzed by the vector combining quadrant method, at the same time we use the method of simply using quadrant or vector for comparing. We also compared the predictive ability of building models through logistic regression and Bagged-Tree algorithm. The Bagged-Tree algorithm was used to categorize samples. The SMOTE-Bagged Tree algorithm and 5-fold cross-validation was used to established the prediction model. The clinical application value of the model in early breast cancer patients was evaluated by confusion matrix and the area under receiver operating characteristic (ROC) curve (AUC). Results: Our predictive model included 12 variables as follows: age, body mass index (BMI), quadrant, clock direction, the distance of tumor from the nipple, morphology of tumor molybdenum target, glandular content, tumor size, ER, PR, HER2, and Ki-67.Finally, our model obtained the AUC value of 0.801 and the accuracy of 70.3%.We used logistic regression to established the model, in the modeling and validation groups, the area under the curve (AUC) were 0.660 and 0.580.We used the vector combining quadrant method to analyze the original location of the tumor, which is more precise than simply using vector or quadrant (AUC 0.801 vs. 0.791 vs. 0.701, Accuracy 70.3 vs. 70.3 vs. 63.6%). Conclusions: Our model is more reliable and stable to assist doctors predict the SLN metastasis in breast cancer patients before operation. Frontiers Media S.A. 2019-04-16 /pmc/articles/PMC6476951/ /pubmed/31041192 http://dx.doi.org/10.3389/fonc.2019.00282 Text en Copyright © 2019 Liu, Zhao, Gu, Sun, Chen, Zhang, Jiang, Zhang, Cui and Liu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Oncology Liu, Chao Zhao, Zeyin Gu, Xi Sun, Lisha Chen, Guanglei Zhang, Hao Jiang, Yanlin Zhang, Yixiao Cui, Xiaoyu Liu, Caigang Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients |
title | Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients |
title_full | Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients |
title_fullStr | Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients |
title_full_unstemmed | Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients |
title_short | Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients |
title_sort | establishment and verification of a bagged-trees-based model for prediction of sentinel lymph node metastasis for early breast cancer patients |
topic | Oncology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6476951/ https://www.ncbi.nlm.nih.gov/pubmed/31041192 http://dx.doi.org/10.3389/fonc.2019.00282 |
work_keys_str_mv | AT liuchao establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients AT zhaozeyin establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients AT guxi establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients AT sunlisha establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients AT chenguanglei establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients AT zhanghao establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients AT jiangyanlin establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients AT zhangyixiao establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients AT cuixiaoyu establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients AT liucaigang establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients |