Cargando…

Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules

OBJECTIVE: This study aimed at establishing a new model to predict malignant thyroid nodules using machine learning algorithms. METHODS: A retrospective study was performed on 274 patients with thyroid nodules who underwent fine-needle aspiration (FNA) cytology or surgery from October 2018 to 2020 i...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Wanying, Hong, Tao, Fang, Jianqiang, Liu, Wencai, Liu, Yuwen, He, Cunyu, Li, Xinxin, Xu, Chan, Wang, Bing, Chen, Yuanyuan, Sun, Chenyu, Li, Wenle, Kang, Wei, Yin, Chengliang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9774948/
https://www.ncbi.nlm.nih.gov/pubmed/36568189
http://dx.doi.org/10.3389/fonc.2022.968784
_version_ 1784855524014030848
author Li, Wanying
Hong, Tao
Fang, Jianqiang
Liu, Wencai
Liu, Yuwen
He, Cunyu
Li, Xinxin
Xu, Chan
Wang, Bing
Chen, Yuanyuan
Sun, Chenyu
Li, Wenle
Kang, Wei
Yin, Chengliang
author_facet Li, Wanying
Hong, Tao
Fang, Jianqiang
Liu, Wencai
Liu, Yuwen
He, Cunyu
Li, Xinxin
Xu, Chan
Wang, Bing
Chen, Yuanyuan
Sun, Chenyu
Li, Wenle
Kang, Wei
Yin, Chengliang
author_sort Li, Wanying
collection PubMed
description OBJECTIVE: This study aimed at establishing a new model to predict malignant thyroid nodules using machine learning algorithms. METHODS: A retrospective study was performed on 274 patients with thyroid nodules who underwent fine-needle aspiration (FNA) cytology or surgery from October 2018 to 2020 in Xianyang Central Hospital. The least absolute shrinkage and selection operator (lasso) regression analysis and logistic analysis were applied to screen and identified variables. Six machine learning algorithms, including Decision Tree (DT), Extreme Gradient Boosting (XGBoost), Gradient Boosting Machine (GBM), Naive Bayes Classifier (NBC), Random Forest (RF), and Logistic Regression (LR), were employed and compared in constructing the predictive model, coupled with preoperative clinical characteristics and ultrasound features. Internal validation was performed by using 10-fold cross-validation. The performance of the model was measured by the area under the receiver operating characteristic curve (AUC), accuracy, precision, recall, F1 score, Shapley additive explanations (SHAP) plot, feature importance, and correlation of features. The best cutoff value for risk stratification was identified by probability density function (PDF) and clinical utility curve (CUC). RESULTS: The malignant rate of thyroid nodules in the study cohort was 53.2%. The predictive models are constructed by age, margin, shape, echogenic foci, echogenicity, and lymph nodes. The XGBoost model was significantly superior to any one of the machine learning models, with an AUC value of 0.829. According to the PDF and CUC, we recommended that 51% probability be used as a threshold for determining the risk stratification of malignant nodules, where about 85.6% of patients with malignant nodules could be detected. Meanwhile, approximately 89.8% of unnecessary biopsy procedures would be saved. Finally, an online web risk calculator has been built to estimate the personal likelihood of malignant thyroid nodules based on the best-performing ML-ed model of XGBoost. CONCLUSIONS: Combining clinical characteristics and features of ultrasound images, ML algorithms can achieve reliable prediction of malignant thyroid nodules. The online web risk calculator based on the XGBoost model can easily identify in real-time the probability of malignant thyroid nodules, which can assist clinicians to formulate individualized management strategies for patients.
format Online
Article
Text
id pubmed-9774948
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-97749482022-12-23 Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules Li, Wanying Hong, Tao Fang, Jianqiang Liu, Wencai Liu, Yuwen He, Cunyu Li, Xinxin Xu, Chan Wang, Bing Chen, Yuanyuan Sun, Chenyu Li, Wenle Kang, Wei Yin, Chengliang Front Oncol Oncology OBJECTIVE: This study aimed at establishing a new model to predict malignant thyroid nodules using machine learning algorithms. METHODS: A retrospective study was performed on 274 patients with thyroid nodules who underwent fine-needle aspiration (FNA) cytology or surgery from October 2018 to 2020 in Xianyang Central Hospital. The least absolute shrinkage and selection operator (lasso) regression analysis and logistic analysis were applied to screen and identified variables. Six machine learning algorithms, including Decision Tree (DT), Extreme Gradient Boosting (XGBoost), Gradient Boosting Machine (GBM), Naive Bayes Classifier (NBC), Random Forest (RF), and Logistic Regression (LR), were employed and compared in constructing the predictive model, coupled with preoperative clinical characteristics and ultrasound features. Internal validation was performed by using 10-fold cross-validation. The performance of the model was measured by the area under the receiver operating characteristic curve (AUC), accuracy, precision, recall, F1 score, Shapley additive explanations (SHAP) plot, feature importance, and correlation of features. The best cutoff value for risk stratification was identified by probability density function (PDF) and clinical utility curve (CUC). RESULTS: The malignant rate of thyroid nodules in the study cohort was 53.2%. The predictive models are constructed by age, margin, shape, echogenic foci, echogenicity, and lymph nodes. The XGBoost model was significantly superior to any one of the machine learning models, with an AUC value of 0.829. According to the PDF and CUC, we recommended that 51% probability be used as a threshold for determining the risk stratification of malignant nodules, where about 85.6% of patients with malignant nodules could be detected. Meanwhile, approximately 89.8% of unnecessary biopsy procedures would be saved. Finally, an online web risk calculator has been built to estimate the personal likelihood of malignant thyroid nodules based on the best-performing ML-ed model of XGBoost. CONCLUSIONS: Combining clinical characteristics and features of ultrasound images, ML algorithms can achieve reliable prediction of malignant thyroid nodules. The online web risk calculator based on the XGBoost model can easily identify in real-time the probability of malignant thyroid nodules, which can assist clinicians to formulate individualized management strategies for patients. Frontiers Media S.A. 2022-12-08 /pmc/articles/PMC9774948/ /pubmed/36568189 http://dx.doi.org/10.3389/fonc.2022.968784 Text en Copyright © 2022 Li, Hong, Fang, Liu, Liu, He, Li, Xu, Wang, Chen, Sun, Li, Kang and Yin https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Li, Wanying
Hong, Tao
Fang, Jianqiang
Liu, Wencai
Liu, Yuwen
He, Cunyu
Li, Xinxin
Xu, Chan
Wang, Bing
Chen, Yuanyuan
Sun, Chenyu
Li, Wenle
Kang, Wei
Yin, Chengliang
Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules
title Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules
title_full Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules
title_fullStr Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules
title_full_unstemmed Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules
title_short Incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules
title_sort incorporation of a machine learning pathological diagnosis algorithm into the thyroid ultrasound imaging data improves the diagnosis risk of malignant thyroid nodules
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9774948/
https://www.ncbi.nlm.nih.gov/pubmed/36568189
http://dx.doi.org/10.3389/fonc.2022.968784
work_keys_str_mv AT liwanying incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT hongtao incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT fangjianqiang incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT liuwencai incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT liuyuwen incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT hecunyu incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT lixinxin incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT xuchan incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT wangbing incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT chenyuanyuan incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT sunchenyu incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT liwenle incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT kangwei incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules
AT yinchengliang incorporationofamachinelearningpathologicaldiagnosisalgorithmintothethyroidultrasoundimagingdataimprovesthediagnosisriskofmalignantthyroidnodules