Cargando…
Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data
BACKGROUND: Adrenocortical carcinoma (ACC) is a rare malignant tumor with a short life expectancy. It is important to identify patients at high risk so that doctors can adopt more aggressive regimens to treat their condition. Machine learning has the advantage of processing complicated data. To date...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857757/ https://www.ncbi.nlm.nih.gov/pubmed/36684185 http://dx.doi.org/10.3389/fsurg.2022.966307 |
_version_ | 1784873936367910912 |
---|---|
author | Tang, Jun Fang, Yu Xu, Zhe |
author_facet | Tang, Jun Fang, Yu Xu, Zhe |
author_sort | Tang, Jun |
collection | PubMed |
description | BACKGROUND: Adrenocortical carcinoma (ACC) is a rare malignant tumor with a short life expectancy. It is important to identify patients at high risk so that doctors can adopt more aggressive regimens to treat their condition. Machine learning has the advantage of processing complicated data. To date, there is no research that tries to use machine learning algorithms and big data to construct prognostic models for ACC patients. METHODS: Clinical data of patients with ACC were obtained from the Surveillance, Epidemiology, and End Results (SEER) database. These records were screened according to preset inclusion and exclusion criteria. The remaining data were applied to univariate survival analysis to select meaningful outcome-related candidates. Backpropagation artificial neural network (BP-ANN), random forest (RF), support vector machine (SVM), and naive Bayes classifier (NBC) were chosen as alternative algorithms. The acquired cases were grouped into a training set and a test set at a ratio of 8:2, and a 10-fold cross-validation method repeated 10 times was performed. Area under the receiver operating characteristic (AUROC) curves were used as indices of efficiency. RESULTS: The calculated 1-, 3-, 5-, and 10-year overall survival rates were 62.3%, 42.0%, 34.9%, and 26.1%, respectively. A total of 825 patients were included in the study. In the training set, the AUCs of BP-ANN, RF, SVM, and NBC for predicting 1-year survival status were 0.921, 0.885, 0.865, and 0.854; those for predicting 3-year survival status were 0.859, 0.865, 0.837, and 0.831; and those for 5-year survival status were 0.888, 0.872, 0.852, and 0.841, respectively. In the test set, AUCs of these four models for 1-year survival status were 0.899, 0.875, 0.886, and 0.862; those for 3-year survival status were 0.871, 0.858, 0.853, and 0.869; and those for 5-year survival status were 0.841, 0.783, 0.836, and 0.867, respectively. The consequences of the 10-fold cross-validation method repeated 10 times indicated that the mean values of 1-, 3-, and 5-year AUROCs of BP-ANN were 0.890, 0.847, and 0.854, respectively, which were better than those of other classifiers (P < 0.008). CONCLUSION: The model combined with BP-ANN and big data can precisely predict the survival status of ACC patients and has the potential for clinical application. |
format | Online Article Text |
id | pubmed-9857757 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-98577572023-01-21 Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data Tang, Jun Fang, Yu Xu, Zhe Front Surg Surgery BACKGROUND: Adrenocortical carcinoma (ACC) is a rare malignant tumor with a short life expectancy. It is important to identify patients at high risk so that doctors can adopt more aggressive regimens to treat their condition. Machine learning has the advantage of processing complicated data. To date, there is no research that tries to use machine learning algorithms and big data to construct prognostic models for ACC patients. METHODS: Clinical data of patients with ACC were obtained from the Surveillance, Epidemiology, and End Results (SEER) database. These records were screened according to preset inclusion and exclusion criteria. The remaining data were applied to univariate survival analysis to select meaningful outcome-related candidates. Backpropagation artificial neural network (BP-ANN), random forest (RF), support vector machine (SVM), and naive Bayes classifier (NBC) were chosen as alternative algorithms. The acquired cases were grouped into a training set and a test set at a ratio of 8:2, and a 10-fold cross-validation method repeated 10 times was performed. Area under the receiver operating characteristic (AUROC) curves were used as indices of efficiency. RESULTS: The calculated 1-, 3-, 5-, and 10-year overall survival rates were 62.3%, 42.0%, 34.9%, and 26.1%, respectively. A total of 825 patients were included in the study. In the training set, the AUCs of BP-ANN, RF, SVM, and NBC for predicting 1-year survival status were 0.921, 0.885, 0.865, and 0.854; those for predicting 3-year survival status were 0.859, 0.865, 0.837, and 0.831; and those for 5-year survival status were 0.888, 0.872, 0.852, and 0.841, respectively. In the test set, AUCs of these four models for 1-year survival status were 0.899, 0.875, 0.886, and 0.862; those for 3-year survival status were 0.871, 0.858, 0.853, and 0.869; and those for 5-year survival status were 0.841, 0.783, 0.836, and 0.867, respectively. The consequences of the 10-fold cross-validation method repeated 10 times indicated that the mean values of 1-, 3-, and 5-year AUROCs of BP-ANN were 0.890, 0.847, and 0.854, respectively, which were better than those of other classifiers (P < 0.008). CONCLUSION: The model combined with BP-ANN and big data can precisely predict the survival status of ACC patients and has the potential for clinical application. Frontiers Media S.A. 2023-01-06 /pmc/articles/PMC9857757/ /pubmed/36684185 http://dx.doi.org/10.3389/fsurg.2022.966307 Text en © 2023 Tang, Fang and Xu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY) (https://creativecommons.org/licenses/by/4.0/) . The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Surgery Tang, Jun Fang, Yu Xu, Zhe Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data |
title | Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data |
title_full | Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data |
title_fullStr | Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data |
title_full_unstemmed | Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data |
title_short | Establishment of prognostic models of adrenocortical carcinoma using machine learning and big data |
title_sort | establishment of prognostic models of adrenocortical carcinoma using machine learning and big data |
topic | Surgery |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9857757/ https://www.ncbi.nlm.nih.gov/pubmed/36684185 http://dx.doi.org/10.3389/fsurg.2022.966307 |
work_keys_str_mv | AT tangjun establishmentofprognosticmodelsofadrenocorticalcarcinomausingmachinelearningandbigdata AT fangyu establishmentofprognosticmodelsofadrenocorticalcarcinomausingmachinelearningandbigdata AT xuzhe establishmentofprognosticmodelsofadrenocorticalcarcinomausingmachinelearningandbigdata |