Cargando…
Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network
Osteosarcoma accounts for 28% of primary bone malignancies in adults and up to 56% in children and adolescents (<20 years). However, early diagnosis and treatment are still inadequate, and new improvements are still needed. Missed diagnoses exist due to fewer traditional diagnostic methods, and c...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10056981/ https://www.ncbi.nlm.nih.gov/pubmed/36983630 http://dx.doi.org/10.3390/jpm13030447 |
_version_ | 1785016253390258176 |
---|---|
author | Li, Sheng Que, Yukang Yang, Rui He, Peng Xu, Shenglin Hu, Yong |
author_facet | Li, Sheng Que, Yukang Yang, Rui He, Peng Xu, Shenglin Hu, Yong |
author_sort | Li, Sheng |
collection | PubMed |
description | Osteosarcoma accounts for 28% of primary bone malignancies in adults and up to 56% in children and adolescents (<20 years). However, early diagnosis and treatment are still inadequate, and new improvements are still needed. Missed diagnoses exist due to fewer traditional diagnostic methods, and clinical symptoms are often already present before diagnosis. This study aimed to develop novel and efficient predictive models for the diagnosis of osteosarcoma and to identify potential targets for exploring osteosarcoma markers. First, osteosarcoma and normal tissue expression microarray datasets were downloaded from the Gene Expression Omnibus (GEO). Then we screened the differentially expressed genes (DEGs) in the osteosarcoma and normal groups in the training group. Next, in order to explore the biologically relevant role of DEGs, Metascape and enrichment analyses were also performed on DEGs. The “randomForest” and “neuralnet” packages in R software were used to select representative genes and construct diagnostic models for osteosarcoma. The next step is to validate the model of the artificial neural network. Then, we performed an immune infiltration analysis by using the training set data. Finally, we constructed a prognostic model using representative genes for prognostic analysis. The copy number of osteosarcoma was also analyzed. A random forest classifier identified nine representative genes (ANK1, TGFBR3, TNFRSF21, HSPB8, ITGA7, RHD, AASS, GREM2, NFASC). HSPB8, RHD, AASS, and NFASC were genes we identified that have not been previously reported to be associated with osteosarcoma. The osteosarcoma diagnostic model we constructed has good performance with areas under the curves (AUCs) of 1 and 0.987 in the training and validation groups, respectively. This study opens new horizons for the early diagnosis of osteosarcoma and provides representative markers for the future treatment of osteosarcoma. This is the first study to pioneer the establishment of a genetic diagnosis model for osteosarcoma and advance the development of osteosarcoma diagnosis and treatment. |
format | Online Article Text |
id | pubmed-10056981 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100569812023-03-30 Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network Li, Sheng Que, Yukang Yang, Rui He, Peng Xu, Shenglin Hu, Yong J Pers Med Article Osteosarcoma accounts for 28% of primary bone malignancies in adults and up to 56% in children and adolescents (<20 years). However, early diagnosis and treatment are still inadequate, and new improvements are still needed. Missed diagnoses exist due to fewer traditional diagnostic methods, and clinical symptoms are often already present before diagnosis. This study aimed to develop novel and efficient predictive models for the diagnosis of osteosarcoma and to identify potential targets for exploring osteosarcoma markers. First, osteosarcoma and normal tissue expression microarray datasets were downloaded from the Gene Expression Omnibus (GEO). Then we screened the differentially expressed genes (DEGs) in the osteosarcoma and normal groups in the training group. Next, in order to explore the biologically relevant role of DEGs, Metascape and enrichment analyses were also performed on DEGs. The “randomForest” and “neuralnet” packages in R software were used to select representative genes and construct diagnostic models for osteosarcoma. The next step is to validate the model of the artificial neural network. Then, we performed an immune infiltration analysis by using the training set data. Finally, we constructed a prognostic model using representative genes for prognostic analysis. The copy number of osteosarcoma was also analyzed. A random forest classifier identified nine representative genes (ANK1, TGFBR3, TNFRSF21, HSPB8, ITGA7, RHD, AASS, GREM2, NFASC). HSPB8, RHD, AASS, and NFASC were genes we identified that have not been previously reported to be associated with osteosarcoma. The osteosarcoma diagnostic model we constructed has good performance with areas under the curves (AUCs) of 1 and 0.987 in the training and validation groups, respectively. This study opens new horizons for the early diagnosis of osteosarcoma and provides representative markers for the future treatment of osteosarcoma. This is the first study to pioneer the establishment of a genetic diagnosis model for osteosarcoma and advance the development of osteosarcoma diagnosis and treatment. MDPI 2023-02-28 /pmc/articles/PMC10056981/ /pubmed/36983630 http://dx.doi.org/10.3390/jpm13030447 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Li, Sheng Que, Yukang Yang, Rui He, Peng Xu, Shenglin Hu, Yong Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network |
title | Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network |
title_full | Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network |
title_fullStr | Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network |
title_full_unstemmed | Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network |
title_short | Construction of Osteosarcoma Diagnosis Model by Random Forest and Artificial Neural Network |
title_sort | construction of osteosarcoma diagnosis model by random forest and artificial neural network |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10056981/ https://www.ncbi.nlm.nih.gov/pubmed/36983630 http://dx.doi.org/10.3390/jpm13030447 |
work_keys_str_mv | AT lisheng constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork AT queyukang constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork AT yangrui constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork AT hepeng constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork AT xushenglin constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork AT huyong constructionofosteosarcomadiagnosismodelbyrandomforestandartificialneuralnetwork |