Cargando…

Exploring risk factors for cervical lymph node metastasis in papillary thyroid microcarcinoma: construction of a novel population-based predictive model

BACKGROUND: Machine learning was a highly effective tool in model construction. We aim to establish a machine learning-based predictive model for predicting the cervical lymph node metastasis (LNM) in papillary thyroid microcarcinoma (PTMC). METHODS: We obtained data on PTMC from the SEER database,...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Yanling, Mao, Yaqian, Xu, Lizhen, Wen, Junping, Chen, Gang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9635156/
https://www.ncbi.nlm.nih.gov/pubmed/36329470
http://dx.doi.org/10.1186/s12902-022-01186-1
_version_ 1784824648235483136
author Huang, Yanling
Mao, Yaqian
Xu, Lizhen
Wen, Junping
Chen, Gang
author_facet Huang, Yanling
Mao, Yaqian
Xu, Lizhen
Wen, Junping
Chen, Gang
author_sort Huang, Yanling
collection PubMed
description BACKGROUND: Machine learning was a highly effective tool in model construction. We aim to establish a machine learning-based predictive model for predicting the cervical lymph node metastasis (LNM) in papillary thyroid microcarcinoma (PTMC). METHODS: We obtained data on PTMC from the SEER database, including 10 demographic and clinicopathological characteristics. Univariate and multivariate logistic regression (LR) analyses were applied to screen the risk factors for cervical LNM in PTMC. Risk factors with P < 0.05 in multivariate LR analysis were used as modeling variables. Five different machine learning (ML) algorithms including extreme gradient boosting (XGBoost), random forest (RF), adaptive boosting (AdaBoost), gaussian naive bayes (GNB) and multi-layer perceptron (MLP) and traditional regression analysis were used to construct the prediction model. Finally, the area under the receiver operating characteristic (AUROC) curve was used to compare the model performance. RESULTS: Through univariate and multivariate LR analysis, we screened out 9 independent risk factors most closely associated with cervical LNM in PTMC, including age, sex, race, marital status, region, histology, tumor size, and extrathyroidal extension (ETE) and multifocality. We used these risk factors to build an ML prediction model, in which the AUROC value of the XGBoost algorithm was higher than the other 4 ML algorithms and was the best ML model. We optimized the XGBoost algorithm through 10-fold cross-validation, and its best performance on the training set (AUROC: 0.809, 95%CI 0.800–0.818) was better than traditional LR analysis (AUROC: 0.780, 95%CI 0.772–0.787). CONCLUSIONS: ML algorithms have good predictive performance, especially the XGBoost algorithm. With the continuous development of artificial intelligence, ML algorithms have broad prospects in clinical prognosis prediction. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12902-022-01186-1.
format Online
Article
Text
id pubmed-9635156
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-96351562022-11-05 Exploring risk factors for cervical lymph node metastasis in papillary thyroid microcarcinoma: construction of a novel population-based predictive model Huang, Yanling Mao, Yaqian Xu, Lizhen Wen, Junping Chen, Gang BMC Endocr Disord Research BACKGROUND: Machine learning was a highly effective tool in model construction. We aim to establish a machine learning-based predictive model for predicting the cervical lymph node metastasis (LNM) in papillary thyroid microcarcinoma (PTMC). METHODS: We obtained data on PTMC from the SEER database, including 10 demographic and clinicopathological characteristics. Univariate and multivariate logistic regression (LR) analyses were applied to screen the risk factors for cervical LNM in PTMC. Risk factors with P < 0.05 in multivariate LR analysis were used as modeling variables. Five different machine learning (ML) algorithms including extreme gradient boosting (XGBoost), random forest (RF), adaptive boosting (AdaBoost), gaussian naive bayes (GNB) and multi-layer perceptron (MLP) and traditional regression analysis were used to construct the prediction model. Finally, the area under the receiver operating characteristic (AUROC) curve was used to compare the model performance. RESULTS: Through univariate and multivariate LR analysis, we screened out 9 independent risk factors most closely associated with cervical LNM in PTMC, including age, sex, race, marital status, region, histology, tumor size, and extrathyroidal extension (ETE) and multifocality. We used these risk factors to build an ML prediction model, in which the AUROC value of the XGBoost algorithm was higher than the other 4 ML algorithms and was the best ML model. We optimized the XGBoost algorithm through 10-fold cross-validation, and its best performance on the training set (AUROC: 0.809, 95%CI 0.800–0.818) was better than traditional LR analysis (AUROC: 0.780, 95%CI 0.772–0.787). CONCLUSIONS: ML algorithms have good predictive performance, especially the XGBoost algorithm. With the continuous development of artificial intelligence, ML algorithms have broad prospects in clinical prognosis prediction. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12902-022-01186-1. BioMed Central 2022-11-04 /pmc/articles/PMC9635156/ /pubmed/36329470 http://dx.doi.org/10.1186/s12902-022-01186-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Huang, Yanling
Mao, Yaqian
Xu, Lizhen
Wen, Junping
Chen, Gang
Exploring risk factors for cervical lymph node metastasis in papillary thyroid microcarcinoma: construction of a novel population-based predictive model
title Exploring risk factors for cervical lymph node metastasis in papillary thyroid microcarcinoma: construction of a novel population-based predictive model
title_full Exploring risk factors for cervical lymph node metastasis in papillary thyroid microcarcinoma: construction of a novel population-based predictive model
title_fullStr Exploring risk factors for cervical lymph node metastasis in papillary thyroid microcarcinoma: construction of a novel population-based predictive model
title_full_unstemmed Exploring risk factors for cervical lymph node metastasis in papillary thyroid microcarcinoma: construction of a novel population-based predictive model
title_short Exploring risk factors for cervical lymph node metastasis in papillary thyroid microcarcinoma: construction of a novel population-based predictive model
title_sort exploring risk factors for cervical lymph node metastasis in papillary thyroid microcarcinoma: construction of a novel population-based predictive model
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9635156/
https://www.ncbi.nlm.nih.gov/pubmed/36329470
http://dx.doi.org/10.1186/s12902-022-01186-1
work_keys_str_mv AT huangyanling exploringriskfactorsforcervicallymphnodemetastasisinpapillarythyroidmicrocarcinomaconstructionofanovelpopulationbasedpredictivemodel
AT maoyaqian exploringriskfactorsforcervicallymphnodemetastasisinpapillarythyroidmicrocarcinomaconstructionofanovelpopulationbasedpredictivemodel
AT xulizhen exploringriskfactorsforcervicallymphnodemetastasisinpapillarythyroidmicrocarcinomaconstructionofanovelpopulationbasedpredictivemodel
AT wenjunping exploringriskfactorsforcervicallymphnodemetastasisinpapillarythyroidmicrocarcinomaconstructionofanovelpopulationbasedpredictivemodel
AT chengang exploringriskfactorsforcervicallymphnodemetastasisinpapillarythyroidmicrocarcinomaconstructionofanovelpopulationbasedpredictivemodel