Cargando…

Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients

Purpose: Lymph node metastasis is a multifactorial event. Several scholars have developed nomograph models to predict the sentinel lymph nodes (SLN) metastasis before operation. According to the clinical and pathological characteristics of breast cancer patients, we use the new method to establish a...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Chao, Zhao, Zeyin, Gu, Xi, Sun, Lisha, Chen, Guanglei, Zhang, Hao, Jiang, Yanlin, Zhang, Yixiao, Cui, Xiaoyu, Liu, Caigang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6476951/
https://www.ncbi.nlm.nih.gov/pubmed/31041192
http://dx.doi.org/10.3389/fonc.2019.00282
_version_ 1783412966696157184
author Liu, Chao
Zhao, Zeyin
Gu, Xi
Sun, Lisha
Chen, Guanglei
Zhang, Hao
Jiang, Yanlin
Zhang, Yixiao
Cui, Xiaoyu
Liu, Caigang
author_facet Liu, Chao
Zhao, Zeyin
Gu, Xi
Sun, Lisha
Chen, Guanglei
Zhang, Hao
Jiang, Yanlin
Zhang, Yixiao
Cui, Xiaoyu
Liu, Caigang
author_sort Liu, Chao
collection PubMed
description Purpose: Lymph node metastasis is a multifactorial event. Several scholars have developed nomograph models to predict the sentinel lymph nodes (SLN) metastasis before operation. According to the clinical and pathological characteristics of breast cancer patients, we use the new method to establish a more comprehensive model and add some new factors which have never been analyzed in the world and explored the prospect of its clinical application. Materials and methods: The clinicopathological data of 633 patients with breast cancer who underwent SLN examination from January 2011 to December 2014 were retrospectively analyzed. Because of the imbalance in data, we used smote algorithm to oversample the data to increase the balanced amount of data. Our study for the first time included the shape of the tumor and breast gland content. The location of the tumor was analyzed by the vector combining quadrant method, at the same time we use the method of simply using quadrant or vector for comparing. We also compared the predictive ability of building models through logistic regression and Bagged-Tree algorithm. The Bagged-Tree algorithm was used to categorize samples. The SMOTE-Bagged Tree algorithm and 5-fold cross-validation was used to established the prediction model. The clinical application value of the model in early breast cancer patients was evaluated by confusion matrix and the area under receiver operating characteristic (ROC) curve (AUC). Results: Our predictive model included 12 variables as follows: age, body mass index (BMI), quadrant, clock direction, the distance of tumor from the nipple, morphology of tumor molybdenum target, glandular content, tumor size, ER, PR, HER2, and Ki-67.Finally, our model obtained the AUC value of 0.801 and the accuracy of 70.3%.We used logistic regression to established the model, in the modeling and validation groups, the area under the curve (AUC) were 0.660 and 0.580.We used the vector combining quadrant method to analyze the original location of the tumor, which is more precise than simply using vector or quadrant (AUC 0.801 vs. 0.791 vs. 0.701, Accuracy 70.3 vs. 70.3 vs. 63.6%). Conclusions: Our model is more reliable and stable to assist doctors predict the SLN metastasis in breast cancer patients before operation.
format Online
Article
Text
id pubmed-6476951
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-64769512019-04-30 Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients Liu, Chao Zhao, Zeyin Gu, Xi Sun, Lisha Chen, Guanglei Zhang, Hao Jiang, Yanlin Zhang, Yixiao Cui, Xiaoyu Liu, Caigang Front Oncol Oncology Purpose: Lymph node metastasis is a multifactorial event. Several scholars have developed nomograph models to predict the sentinel lymph nodes (SLN) metastasis before operation. According to the clinical and pathological characteristics of breast cancer patients, we use the new method to establish a more comprehensive model and add some new factors which have never been analyzed in the world and explored the prospect of its clinical application. Materials and methods: The clinicopathological data of 633 patients with breast cancer who underwent SLN examination from January 2011 to December 2014 were retrospectively analyzed. Because of the imbalance in data, we used smote algorithm to oversample the data to increase the balanced amount of data. Our study for the first time included the shape of the tumor and breast gland content. The location of the tumor was analyzed by the vector combining quadrant method, at the same time we use the method of simply using quadrant or vector for comparing. We also compared the predictive ability of building models through logistic regression and Bagged-Tree algorithm. The Bagged-Tree algorithm was used to categorize samples. The SMOTE-Bagged Tree algorithm and 5-fold cross-validation was used to established the prediction model. The clinical application value of the model in early breast cancer patients was evaluated by confusion matrix and the area under receiver operating characteristic (ROC) curve (AUC). Results: Our predictive model included 12 variables as follows: age, body mass index (BMI), quadrant, clock direction, the distance of tumor from the nipple, morphology of tumor molybdenum target, glandular content, tumor size, ER, PR, HER2, and Ki-67.Finally, our model obtained the AUC value of 0.801 and the accuracy of 70.3%.We used logistic regression to established the model, in the modeling and validation groups, the area under the curve (AUC) were 0.660 and 0.580.We used the vector combining quadrant method to analyze the original location of the tumor, which is more precise than simply using vector or quadrant (AUC 0.801 vs. 0.791 vs. 0.701, Accuracy 70.3 vs. 70.3 vs. 63.6%). Conclusions: Our model is more reliable and stable to assist doctors predict the SLN metastasis in breast cancer patients before operation. Frontiers Media S.A. 2019-04-16 /pmc/articles/PMC6476951/ /pubmed/31041192 http://dx.doi.org/10.3389/fonc.2019.00282 Text en Copyright © 2019 Liu, Zhao, Gu, Sun, Chen, Zhang, Jiang, Zhang, Cui and Liu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Liu, Chao
Zhao, Zeyin
Gu, Xi
Sun, Lisha
Chen, Guanglei
Zhang, Hao
Jiang, Yanlin
Zhang, Yixiao
Cui, Xiaoyu
Liu, Caigang
Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients
title Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients
title_full Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients
title_fullStr Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients
title_full_unstemmed Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients
title_short Establishment and Verification of a Bagged-Trees-Based Model for Prediction of Sentinel Lymph Node Metastasis for Early Breast Cancer Patients
title_sort establishment and verification of a bagged-trees-based model for prediction of sentinel lymph node metastasis for early breast cancer patients
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6476951/
https://www.ncbi.nlm.nih.gov/pubmed/31041192
http://dx.doi.org/10.3389/fonc.2019.00282
work_keys_str_mv AT liuchao establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients
AT zhaozeyin establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients
AT guxi establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients
AT sunlisha establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients
AT chenguanglei establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients
AT zhanghao establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients
AT jiangyanlin establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients
AT zhangyixiao establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients
AT cuixiaoyu establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients
AT liucaigang establishmentandverificationofabaggedtreesbasedmodelforpredictionofsentinellymphnodemetastasisforearlybreastcancerpatients