Cargando…

Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data

BACKGROUND: Adrenocortical carcinoma (ACC) is a rare malignant tumor with a short life expectancy. It is important to identify patients at high risk so that doctors can adopt more aggressive regimens to treat their condition. Machine learning has the advantage of processing complicated data. To date...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Jun, Fang, Yu, Xu, Zhe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857757/
https://www.ncbi.nlm.nih.gov/pubmed/36684185
http://dx.doi.org/10.3389/fsurg.2022.966307
_version_ 1784873936367910912
author Tang, Jun
Fang, Yu
Xu, Zhe
author_facet Tang, Jun
Fang, Yu
Xu, Zhe
author_sort Tang, Jun
collection PubMed
description BACKGROUND: Adrenocortical carcinoma (ACC) is a rare malignant tumor with a short life expectancy. It is important to identify patients at high risk so that doctors can adopt more aggressive regimens to treat their condition. Machine learning has the advantage of processing complicated data. To date, there is no research that tries to use machine learning algorithms and big data to construct prognostic models for ACC patients. METHODS: Clinical data of patients with ACC were obtained from the Surveillance, Epidemiology, and End Results (SEER) database. These records were screened according to preset inclusion and exclusion criteria. The remaining data were applied to univariate survival analysis to select meaningful outcome-related candidates. Backpropagation artificial neural network (BP-ANN), random forest (RF), support vector machine (SVM), and naive Bayes classifier (NBC) were chosen as alternative algorithms. The acquired cases were grouped into a training set and a test set at a ratio of 8:2, and a 10-fold cross-validation method repeated 10 times was performed. Area under the receiver operating characteristic (AUROC) curves were used as indices of efficiency. RESULTS: The calculated 1-, 3-, 5-, and 10-year overall survival rates were 62.3%, 42.0%, 34.9%, and 26.1%, respectively. A total of 825 patients were included in the study. In the training set, the AUCs of BP-ANN, RF, SVM, and NBC for predicting 1-year survival status were 0.921, 0.885, 0.865, and 0.854; those for predicting 3-year survival status were 0.859, 0.865, 0.837, and 0.831; and those for 5-year survival status were 0.888, 0.872, 0.852, and 0.841, respectively. In the test set, AUCs of these four models for 1-year survival status were 0.899, 0.875, 0.886, and 0.862; those for 3-year survival status were 0.871, 0.858, 0.853, and 0.869; and those for 5-year survival status were 0.841, 0.783, 0.836, and 0.867, respectively. The consequences of the 10-fold cross-validation method repeated 10 times indicated that the mean values of 1-, 3-, and 5-year AUROCs of BP-ANN were 0.890, 0.847, and 0.854, respectively, which were better than those of other classifiers (P < 0.008). CONCLUSION: The model combined with BP-ANN and big data can precisely predict the survival status of ACC patients and has the potential for clinical application.
format Online
Article
Text
id pubmed-9857757
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-98577572023-01-21 Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data Tang, Jun Fang, Yu Xu, Zhe Front Surg Surgery BACKGROUND: Adrenocortical carcinoma (ACC) is a rare malignant tumor with a short life expectancy. It is important to identify patients at high risk so that doctors can adopt more aggressive regimens to treat their condition. Machine learning has the advantage of processing complicated data. To date, there is no research that tries to use machine learning algorithms and big data to construct prognostic models for ACC patients. METHODS: Clinical data of patients with ACC were obtained from the Surveillance, Epidemiology, and End Results (SEER) database. These records were screened according to preset inclusion and exclusion criteria. The remaining data were applied to univariate survival analysis to select meaningful outcome-related candidates. Backpropagation artificial neural network (BP-ANN), random forest (RF), support vector machine (SVM), and naive Bayes classifier (NBC) were chosen as alternative algorithms. The acquired cases were grouped into a training set and a test set at a ratio of 8:2, and a 10-fold cross-validation method repeated 10 times was performed. Area under the receiver operating characteristic (AUROC) curves were used as indices of efficiency. RESULTS: The calculated 1-, 3-, 5-, and 10-year overall survival rates were 62.3%, 42.0%, 34.9%, and 26.1%, respectively. A total of 825 patients were included in the study. In the training set, the AUCs of BP-ANN, RF, SVM, and NBC for predicting 1-year survival status were 0.921, 0.885, 0.865, and 0.854; those for predicting 3-year survival status were 0.859, 0.865, 0.837, and 0.831; and those for 5-year survival status were 0.888, 0.872, 0.852, and 0.841, respectively. In the test set, AUCs of these four models for 1-year survival status were 0.899, 0.875, 0.886, and 0.862; those for 3-year survival status were 0.871, 0.858, 0.853, and 0.869; and those for 5-year survival status were 0.841, 0.783, 0.836, and 0.867, respectively. The consequences of the 10-fold cross-validation method repeated 10 times indicated that the mean values of 1-, 3-, and 5-year AUROCs of BP-ANN were 0.890, 0.847, and 0.854, respectively, which were better than those of other classifiers (P < 0.008). CONCLUSION: The model combined with BP-ANN and big data can precisely predict the survival status of ACC patients and has the potential for clinical application. Frontiers Media S.A. 2023-01-06 /pmc/articles/PMC9857757/ /pubmed/36684185 http://dx.doi.org/10.3389/fsurg.2022.966307 Text en © 2023 Tang, Fang and Xu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (https://creativecommons.org/licenses/by/4.0/) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Surgery
Tang, Jun
Fang, Yu
Xu, Zhe
Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data
title Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data
title_full Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data
title_fullStr Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data
title_full_unstemmed Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data
title_short Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data
title_sort establishment of prognostic models of adrenocortical carcinoma using machine learning and big data
topic Surgery
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857757/
https://www.ncbi.nlm.nih.gov/pubmed/36684185
http://dx.doi.org/10.3389/fsurg.2022.966307
work_keys_str_mv AT tangjun establishmentofprognosticmodelsofadrenocorticalcarcinomausingmachinelearningandbigdata
AT fangyu establishmentofprognosticmodelsofadrenocorticalcarcinomausingmachinelearningandbigdata
AT xuzhe establishmentofprognosticmodelsofadrenocorticalcarcinomausingmachinelearningandbigdata