Cargando…

Machine learning for identifying benign and malignant of thyroid tumors: A retrospective study of 2,423 patients

Thyroid tumors, one of the common tumors in the endocrine system, while the discrimination between benign and malignant thyroid tumors remains insufficient. The aim of this study is to construct a diagnostic model of benign and malignant thyroid tumors, in order to provide an emerging auxiliary diag...

Descripción completa

Detalles Bibliográficos
Autores principales: Guo, Yuan-yuan, Li, Zhi-jie, Du, Chao, Gong, Jun, Liao, Pu, Zhang, Jia-xing, Shao, Cong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9515945/
https://www.ncbi.nlm.nih.gov/pubmed/36187616
http://dx.doi.org/10.3389/fpubh.2022.960740
_version_ 1784798604090671104
author Guo, Yuan-yuan
Li, Zhi-jie
Du, Chao
Gong, Jun
Liao, Pu
Zhang, Jia-xing
Shao, Cong
author_facet Guo, Yuan-yuan
Li, Zhi-jie
Du, Chao
Gong, Jun
Liao, Pu
Zhang, Jia-xing
Shao, Cong
author_sort Guo, Yuan-yuan
collection PubMed
description Thyroid tumors, one of the common tumors in the endocrine system, while the discrimination between benign and malignant thyroid tumors remains insufficient. The aim of this study is to construct a diagnostic model of benign and malignant thyroid tumors, in order to provide an emerging auxiliary diagnostic method for patients with thyroid tumors. The patients were selected from the Chongqing General Hospital (Chongqing, China) from July 2020 to September 2021. And peripheral blood, BRAFV600E gene, and demographic indicators were selected, including sex, age, BRAFV600E gene, lymphocyte count (Lymph#), neutrophil count (Neu#), neutrophil/lymphocyte ratio (NLR), platelet/lymphocyte ratio (PLR), red blood cell distribution width (RDW), platelets count (PLT), red blood cell distribution width—coefficient of variation (RDW–CV), alkaline phosphatase (ALP), and parathyroid hormone (PTH). First, feature selection was executed by univariate analysis combined with least absolute shrinkage and selection operator (LASSO) analysis. Afterward, we used machine learning algorithms to establish three types of models. The first model contains all predictors, the second model contains indicators after feature selection, and the third model contains patient peripheral blood indicators. The four machine learning algorithms include extreme gradient boosting (XGBoost), random forest (RF), light gradient boosting machine (LightGBM), and adaptive boosting (AdaBoost) which were used to build predictive models. A grid search algorithm was used to find the optimal parameters of the machine learning algorithms. A series of indicators, such as the area under the curve (AUC), were intended to determine the model performance. A total of 2,042 patients met the criteria and were enrolled in this study, and 12 variables were included. Sex, age, Lymph#, PLR, RDW, and BRAFV600E were identified as statistically significant indicators by univariate and LASSO analysis. Among the model we constructed, RF, XGBoost, LightGBM and AdaBoost with the AUC of 0.874 (95% CI, 0.841–0.906), 0.868 (95% CI, 0.834–0.901), 0.861 (95% CI, 0.826–0.895), and 0.837 (95% CI, 0.802–0.873) in the first model. With the AUC of 0.853 (95% CI, 0.818–0.888), 0.853 (95% CI, 0.818–0.889), 0.837 (95% CI, 0.800–0.873), and 0.832 (95% CI, 0.797–0.867) in the second model. With the AUC of 0.698 (95% CI, 0.651–0.745), 0.688 (95% CI, 0.639–0.736), 0.693 (95% CI, 0.645–0.741), and 0.666 (95% CI, 0.618–0.714) in the third model. Compared with the existing models, our study proposes a model incorporating novel biomarkers which could be a powerful and promising tool for predicting benign and malignant thyroid tumors.
format Online
Article
Text
id pubmed-9515945
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95159452022-09-29 Machine learning for identifying benign and malignant of thyroid tumors: A retrospective study of 2,423 patients Guo, Yuan-yuan Li, Zhi-jie Du, Chao Gong, Jun Liao, Pu Zhang, Jia-xing Shao, Cong Front Public Health Public Health Thyroid tumors, one of the common tumors in the endocrine system, while the discrimination between benign and malignant thyroid tumors remains insufficient. The aim of this study is to construct a diagnostic model of benign and malignant thyroid tumors, in order to provide an emerging auxiliary diagnostic method for patients with thyroid tumors. The patients were selected from the Chongqing General Hospital (Chongqing, China) from July 2020 to September 2021. And peripheral blood, BRAFV600E gene, and demographic indicators were selected, including sex, age, BRAFV600E gene, lymphocyte count (Lymph#), neutrophil count (Neu#), neutrophil/lymphocyte ratio (NLR), platelet/lymphocyte ratio (PLR), red blood cell distribution width (RDW), platelets count (PLT), red blood cell distribution width—coefficient of variation (RDW–CV), alkaline phosphatase (ALP), and parathyroid hormone (PTH). First, feature selection was executed by univariate analysis combined with least absolute shrinkage and selection operator (LASSO) analysis. Afterward, we used machine learning algorithms to establish three types of models. The first model contains all predictors, the second model contains indicators after feature selection, and the third model contains patient peripheral blood indicators. The four machine learning algorithms include extreme gradient boosting (XGBoost), random forest (RF), light gradient boosting machine (LightGBM), and adaptive boosting (AdaBoost) which were used to build predictive models. A grid search algorithm was used to find the optimal parameters of the machine learning algorithms. A series of indicators, such as the area under the curve (AUC), were intended to determine the model performance. A total of 2,042 patients met the criteria and were enrolled in this study, and 12 variables were included. Sex, age, Lymph#, PLR, RDW, and BRAFV600E were identified as statistically significant indicators by univariate and LASSO analysis. Among the model we constructed, RF, XGBoost, LightGBM and AdaBoost with the AUC of 0.874 (95% CI, 0.841–0.906), 0.868 (95% CI, 0.834–0.901), 0.861 (95% CI, 0.826–0.895), and 0.837 (95% CI, 0.802–0.873) in the first model. With the AUC of 0.853 (95% CI, 0.818–0.888), 0.853 (95% CI, 0.818–0.889), 0.837 (95% CI, 0.800–0.873), and 0.832 (95% CI, 0.797–0.867) in the second model. With the AUC of 0.698 (95% CI, 0.651–0.745), 0.688 (95% CI, 0.639–0.736), 0.693 (95% CI, 0.645–0.741), and 0.666 (95% CI, 0.618–0.714) in the third model. Compared with the existing models, our study proposes a model incorporating novel biomarkers which could be a powerful and promising tool for predicting benign and malignant thyroid tumors. Frontiers Media S.A. 2022-09-14 /pmc/articles/PMC9515945/ /pubmed/36187616 http://dx.doi.org/10.3389/fpubh.2022.960740 Text en Copyright © 2022 Guo, Li, Du, Gong, Liao, Zhang and Shao. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Guo, Yuan-yuan
Li, Zhi-jie
Du, Chao
Gong, Jun
Liao, Pu
Zhang, Jia-xing
Shao, Cong
Machine learning for identifying benign and malignant of thyroid tumors: A retrospective study of 2,423 patients
title Machine learning for identifying benign and malignant of thyroid tumors: A retrospective study of 2,423 patients
title_full Machine learning for identifying benign and malignant of thyroid tumors: A retrospective study of 2,423 patients
title_fullStr Machine learning for identifying benign and malignant of thyroid tumors: A retrospective study of 2,423 patients
title_full_unstemmed Machine learning for identifying benign and malignant of thyroid tumors: A retrospective study of 2,423 patients
title_short Machine learning for identifying benign and malignant of thyroid tumors: A retrospective study of 2,423 patients
title_sort machine learning for identifying benign and malignant of thyroid tumors: a retrospective study of 2,423 patients
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9515945/
https://www.ncbi.nlm.nih.gov/pubmed/36187616
http://dx.doi.org/10.3389/fpubh.2022.960740
work_keys_str_mv AT guoyuanyuan machinelearningforidentifyingbenignandmalignantofthyroidtumorsaretrospectivestudyof2423patients
AT lizhijie machinelearningforidentifyingbenignandmalignantofthyroidtumorsaretrospectivestudyof2423patients
AT duchao machinelearningforidentifyingbenignandmalignantofthyroidtumorsaretrospectivestudyof2423patients
AT gongjun machinelearningforidentifyingbenignandmalignantofthyroidtumorsaretrospectivestudyof2423patients
AT liaopu machinelearningforidentifyingbenignandmalignantofthyroidtumorsaretrospectivestudyof2423patients
AT zhangjiaxing machinelearningforidentifyingbenignandmalignantofthyroidtumorsaretrospectivestudyof2423patients
AT shaocong machinelearningforidentifyingbenignandmalignantofthyroidtumorsaretrospectivestudyof2423patients