Cargando…

Prediction of lung metastases in thyroid cancer using machine learning based on SEER database

PURPOSE: Lung metastasis (LM) is one of the most frequent distant metastases of thyroid cancer (TC). This study aimed to develop a machine learning algorithm model to predict lung metastasis of thyroid cancer for providing relative information in clinical decision‐making. METHODS: Data comprising of...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Wenfei, Wang, Shoufei, Ye, Ziheng, Xu, Peipei, Xia, Xiaotian, Guo, Minggao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9189456/
https://www.ncbi.nlm.nih.gov/pubmed/35191613
http://dx.doi.org/10.1002/cam4.4617
_version_ 1784725593387958272
author Liu, Wenfei
Wang, Shoufei
Ye, Ziheng
Xu, Peipei
Xia, Xiaotian
Guo, Minggao
author_facet Liu, Wenfei
Wang, Shoufei
Ye, Ziheng
Xu, Peipei
Xia, Xiaotian
Guo, Minggao
author_sort Liu, Wenfei
collection PubMed
description PURPOSE: Lung metastasis (LM) is one of the most frequent distant metastases of thyroid cancer (TC). This study aimed to develop a machine learning algorithm model to predict lung metastasis of thyroid cancer for providing relative information in clinical decision‐making. METHODS: Data comprising of demographic and clinicopathological characteristics of patients with thyroid cancer were extracted from the National Institutes of Health (NIH)’s Surveillance, Epidemiology, and End Results (SEER) database between 2010 and 2015, which is employed to develop six machine learning algorithm models support vector machine (SVM), logistic regression (LR), eXtreme gradient boosting (XGBoost), decision tree (DT), random forest (RF), and k‐nearest neighbor (KNN). Compared and evaluated models by the following indicators: accuracy, precision, recall rate, F1‐score, the area under the ROC curve (AUC) value and Brier score, and interpreted the association between clinicopathological characteristics and target variables based on the best model. RESULTS: Nine thousand nine hundred and fifty patients were selected, which including 212 patients (2.1%) with lung metastasis, and 9738 patients without lung metastasis (97.9%). Multivariate logistic regression showed that age, T stage, N stage, and histological type were independent factors in TC with LM. Evaluation indicators of the best model‐ RF were as following: accuracy (0.99), recall rate (0.88), precision (0.61), F1‐score (0.72), AUC value (0.99), and the Brier score (0.016). CONCLUSION: RF learning model performed better and can be applied to forecast lung metastasis of thyroid cancer, and offer valuable and significant reference for clinicians' decision‐making in advance.
format Online
Article
Text
id pubmed-9189456
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-91894562022-06-16 Prediction of lung metastases in thyroid cancer using machine learning based on SEER database Liu, Wenfei Wang, Shoufei Ye, Ziheng Xu, Peipei Xia, Xiaotian Guo, Minggao Cancer Med Research Articles PURPOSE: Lung metastasis (LM) is one of the most frequent distant metastases of thyroid cancer (TC). This study aimed to develop a machine learning algorithm model to predict lung metastasis of thyroid cancer for providing relative information in clinical decision‐making. METHODS: Data comprising of demographic and clinicopathological characteristics of patients with thyroid cancer were extracted from the National Institutes of Health (NIH)’s Surveillance, Epidemiology, and End Results (SEER) database between 2010 and 2015, which is employed to develop six machine learning algorithm models support vector machine (SVM), logistic regression (LR), eXtreme gradient boosting (XGBoost), decision tree (DT), random forest (RF), and k‐nearest neighbor (KNN). Compared and evaluated models by the following indicators: accuracy, precision, recall rate, F1‐score, the area under the ROC curve (AUC) value and Brier score, and interpreted the association between clinicopathological characteristics and target variables based on the best model. RESULTS: Nine thousand nine hundred and fifty patients were selected, which including 212 patients (2.1%) with lung metastasis, and 9738 patients without lung metastasis (97.9%). Multivariate logistic regression showed that age, T stage, N stage, and histological type were independent factors in TC with LM. Evaluation indicators of the best model‐ RF were as following: accuracy (0.99), recall rate (0.88), precision (0.61), F1‐score (0.72), AUC value (0.99), and the Brier score (0.016). CONCLUSION: RF learning model performed better and can be applied to forecast lung metastasis of thyroid cancer, and offer valuable and significant reference for clinicians' decision‐making in advance. John Wiley and Sons Inc. 2022-02-22 /pmc/articles/PMC9189456/ /pubmed/35191613 http://dx.doi.org/10.1002/cam4.4617 Text en © 2022 The Authors. Cancer Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Liu, Wenfei
Wang, Shoufei
Ye, Ziheng
Xu, Peipei
Xia, Xiaotian
Guo, Minggao
Prediction of lung metastases in thyroid cancer using machine learning based on SEER database
title Prediction of lung metastases in thyroid cancer using machine learning based on SEER database
title_full Prediction of lung metastases in thyroid cancer using machine learning based on SEER database
title_fullStr Prediction of lung metastases in thyroid cancer using machine learning based on SEER database
title_full_unstemmed Prediction of lung metastases in thyroid cancer using machine learning based on SEER database
title_short Prediction of lung metastases in thyroid cancer using machine learning based on SEER database
title_sort prediction of lung metastases in thyroid cancer using machine learning based on seer database
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9189456/
https://www.ncbi.nlm.nih.gov/pubmed/35191613
http://dx.doi.org/10.1002/cam4.4617
work_keys_str_mv AT liuwenfei predictionoflungmetastasesinthyroidcancerusingmachinelearningbasedonseerdatabase
AT wangshoufei predictionoflungmetastasesinthyroidcancerusingmachinelearningbasedonseerdatabase
AT yeziheng predictionoflungmetastasesinthyroidcancerusingmachinelearningbasedonseerdatabase
AT xupeipei predictionoflungmetastasesinthyroidcancerusingmachinelearningbasedonseerdatabase
AT xiaxiaotian predictionoflungmetastasesinthyroidcancerusingmachinelearningbasedonseerdatabase
AT guominggao predictionoflungmetastasesinthyroidcancerusingmachinelearningbasedonseerdatabase