Cargando…

Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network

Osteosarcoma accounts for 28% of primary bone malignancies in adults and up to 56% in children and adolescents (<20 years). However, early diagnosis and treatment are still inadequate, and new improvements are still needed. Missed diagnoses exist due to fewer traditional diagnostic methods, and c...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Sheng, Que, Yukang, Yang, Rui, He, Peng, Xu, Shenglin, Hu, Yong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10056981/
https://www.ncbi.nlm.nih.gov/pubmed/36983630
http://dx.doi.org/10.3390/jpm13030447
_version_ 1785016253390258176
author Li, Sheng
Que, Yukang
Yang, Rui
He, Peng
Xu, Shenglin
Hu, Yong
author_facet Li, Sheng
Que, Yukang
Yang, Rui
He, Peng
Xu, Shenglin
Hu, Yong
author_sort Li, Sheng
collection PubMed
description Osteosarcoma accounts for 28% of primary bone malignancies in adults and up to 56% in children and adolescents (<20 years). However, early diagnosis and treatment are still inadequate, and new improvements are still needed. Missed diagnoses exist due to fewer traditional diagnostic methods, and clinical symptoms are often already present before diagnosis. This study aimed to develop novel and efficient predictive models for the diagnosis of osteosarcoma and to identify potential targets for exploring osteosarcoma markers. First, osteosarcoma and normal tissue expression microarray datasets were downloaded from the Gene Expression Omnibus (GEO). Then we screened the differentially expressed genes (DEGs) in the osteosarcoma and normal groups in the training group. Next, in order to explore the biologically relevant role of DEGs, Metascape and enrichment analyses were also performed on DEGs. The “randomForest” and “neuralnet” packages in R software were used to select representative genes and construct diagnostic models for osteosarcoma. The next step is to validate the model of the artificial neural network. Then, we performed an immune infiltration analysis by using the training set data. Finally, we constructed a prognostic model using representative genes for prognostic analysis. The copy number of osteosarcoma was also analyzed. A random forest classifier identified nine representative genes (ANK1, TGFBR3, TNFRSF21, HSPB8, ITGA7, RHD, AASS, GREM2, NFASC). HSPB8, RHD, AASS, and NFASC were genes we identified that have not been previously reported to be associated with osteosarcoma. The osteosarcoma diagnostic model we constructed has good performance with areas under the curves (AUCs) of 1 and 0.987 in the training and validation groups, respectively. This study opens new horizons for the early diagnosis of osteosarcoma and provides representative markers for the future treatment of osteosarcoma. This is the first study to pioneer the establishment of a genetic diagnosis model for osteosarcoma and advance the development of osteosarcoma diagnosis and treatment.
format Online
Article
Text
id pubmed-10056981
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100569812023-03-30 Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network Li, Sheng Que, Yukang Yang, Rui He, Peng Xu, Shenglin Hu, Yong J Pers Med Article Osteosarcoma accounts for 28% of primary bone malignancies in adults and up to 56% in children and adolescents (<20 years). However, early diagnosis and treatment are still inadequate, and new improvements are still needed. Missed diagnoses exist due to fewer traditional diagnostic methods, and clinical symptoms are often already present before diagnosis. This study aimed to develop novel and efficient predictive models for the diagnosis of osteosarcoma and to identify potential targets for exploring osteosarcoma markers. First, osteosarcoma and normal tissue expression microarray datasets were downloaded from the Gene Expression Omnibus (GEO). Then we screened the differentially expressed genes (DEGs) in the osteosarcoma and normal groups in the training group. Next, in order to explore the biologically relevant role of DEGs, Metascape and enrichment analyses were also performed on DEGs. The “randomForest” and “neuralnet” packages in R software were used to select representative genes and construct diagnostic models for osteosarcoma. The next step is to validate the model of the artificial neural network. Then, we performed an immune infiltration analysis by using the training set data. Finally, we constructed a prognostic model using representative genes for prognostic analysis. The copy number of osteosarcoma was also analyzed. A random forest classifier identified nine representative genes (ANK1, TGFBR3, TNFRSF21, HSPB8, ITGA7, RHD, AASS, GREM2, NFASC). HSPB8, RHD, AASS, and NFASC were genes we identified that have not been previously reported to be associated with osteosarcoma. The osteosarcoma diagnostic model we constructed has good performance with areas under the curves (AUCs) of 1 and 0.987 in the training and validation groups, respectively. This study opens new horizons for the early diagnosis of osteosarcoma and provides representative markers for the future treatment of osteosarcoma. This is the first study to pioneer the establishment of a genetic diagnosis model for osteosarcoma and advance the development of osteosarcoma diagnosis and treatment. MDPI 2023-02-28 /pmc/articles/PMC10056981/ /pubmed/36983630 http://dx.doi.org/10.3390/jpm13030447 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Sheng
Que, Yukang
Yang, Rui
He, Peng
Xu, Shenglin
Hu, Yong
Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network
title Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network
title_full Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network
title_fullStr Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network
title_full_unstemmed Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network
title_short Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network
title_sort construction of osteosarcoma diagnosis model by random forest and artificial neural network
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10056981/
https://www.ncbi.nlm.nih.gov/pubmed/36983630
http://dx.doi.org/10.3390/jpm13030447
work_keys_str_mv AT lisheng constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork
AT queyukang constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork
AT yangrui constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork
AT hepeng constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork
AT xushenglin constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork
AT huyong constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork