Cargando…

A support vector machine classifier for the prediction of osteosarcoma metastasis with high accuracy

In this study, gene expression profiles of osteosarcoma (OS) were analyzed to identify critical genes associated with metastasis. Five gene expression datasets were screened and downloaded from Gene Expression Omnibus (GEO). Following assessment by MetaQC, the dataset GSE9508 was excluded for poor q...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Yunfei, Ma, Jun, Ye, Xiaojian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: D.A. Spandidos 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5627885/
https://www.ncbi.nlm.nih.gov/pubmed/28901446
http://dx.doi.org/10.3892/ijmm.2017.3126
_version_ 1783268789923610624
author He, Yunfei
Ma, Jun
Ye, Xiaojian
author_facet He, Yunfei
Ma, Jun
Ye, Xiaojian
author_sort He, Yunfei
collection PubMed
description In this study, gene expression profiles of osteosarcoma (OS) were analyzed to identify critical genes associated with metastasis. Five gene expression datasets were screened and downloaded from Gene Expression Omnibus (GEO). Following assessment by MetaQC, the dataset GSE9508 was excluded for poor quality. Subsequently, differentially expressed genes (DEGs) between metastatic and non-metastatic OS were identified using meta-analysis. A protein-protein interaction (PPI) network was constructed with information from Human Protein Reference Database (HPRD) for the DEGs. Betweenness centrality (BC) was calculated for each node in the network and top featured genes ranked by BC were selected out to construct support vector machine (SVM) classifier using the training set GSE21257, which was then validated using the other three independent datasets. Pathway enrichment analysis was performed for the featured genes using Fisher's exact test. A total of 353 DEGs were identified and a PPI network including 164 nodes and 272 edges was then constructed. The top 64 featured genes ranked by BC were included in the SVM classifier. The SVM classifier exhibited high prediction accuracies in all of the 4 datasets, with accuracies of 100, 100, 92.6 and 100%, respectively. Further analysis of the featured genes revealed that 11 Gene Ontology (GO) biological pathways and 5 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were significantly over-represented, including the regulation of cell proliferation, regulation of apoptosis, pathways in cancer, regulation of actin cytoskeleton and the TGF-β signaling pathway. On the whole, an SVM classifier with high prediction accuracy was constructed and validated, in which key genes associated with metastasis in OS were also revealed. These findings may promote the development of genetic diagnostic methods and may enhance our understanding of the molecular mechanisms underlying the metastasis of OS.
format Online
Article
Text
id pubmed-5627885
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher D.A. Spandidos
record_format MEDLINE/PubMed
spelling pubmed-56278852017-10-08 A support vector machine classifier for the prediction of osteosarcoma metastasis with high accuracy He, Yunfei Ma, Jun Ye, Xiaojian Int J Mol Med Articles In this study, gene expression profiles of osteosarcoma (OS) were analyzed to identify critical genes associated with metastasis. Five gene expression datasets were screened and downloaded from Gene Expression Omnibus (GEO). Following assessment by MetaQC, the dataset GSE9508 was excluded for poor quality. Subsequently, differentially expressed genes (DEGs) between metastatic and non-metastatic OS were identified using meta-analysis. A protein-protein interaction (PPI) network was constructed with information from Human Protein Reference Database (HPRD) for the DEGs. Betweenness centrality (BC) was calculated for each node in the network and top featured genes ranked by BC were selected out to construct support vector machine (SVM) classifier using the training set GSE21257, which was then validated using the other three independent datasets. Pathway enrichment analysis was performed for the featured genes using Fisher's exact test. A total of 353 DEGs were identified and a PPI network including 164 nodes and 272 edges was then constructed. The top 64 featured genes ranked by BC were included in the SVM classifier. The SVM classifier exhibited high prediction accuracies in all of the 4 datasets, with accuracies of 100, 100, 92.6 and 100%, respectively. Further analysis of the featured genes revealed that 11 Gene Ontology (GO) biological pathways and 5 Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were significantly over-represented, including the regulation of cell proliferation, regulation of apoptosis, pathways in cancer, regulation of actin cytoskeleton and the TGF-β signaling pathway. On the whole, an SVM classifier with high prediction accuracy was constructed and validated, in which key genes associated with metastasis in OS were also revealed. These findings may promote the development of genetic diagnostic methods and may enhance our understanding of the molecular mechanisms underlying the metastasis of OS. D.A. Spandidos 2017-11 2017-09-07 /pmc/articles/PMC5627885/ /pubmed/28901446 http://dx.doi.org/10.3892/ijmm.2017.3126 Text en Copyright: © He et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
spellingShingle Articles
He, Yunfei
Ma, Jun
Ye, Xiaojian
A support vector machine classifier for the prediction of osteosarcoma metastasis with high accuracy
title A support vector machine classifier for the prediction of osteosarcoma metastasis with high accuracy
title_full A support vector machine classifier for the prediction of osteosarcoma metastasis with high accuracy
title_fullStr A support vector machine classifier for the prediction of osteosarcoma metastasis with high accuracy
title_full_unstemmed A support vector machine classifier for the prediction of osteosarcoma metastasis with high accuracy
title_short A support vector machine classifier for the prediction of osteosarcoma metastasis with high accuracy
title_sort support vector machine classifier for the prediction of osteosarcoma metastasis with high accuracy
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5627885/
https://www.ncbi.nlm.nih.gov/pubmed/28901446
http://dx.doi.org/10.3892/ijmm.2017.3126
work_keys_str_mv AT heyunfei asupportvectormachineclassifierforthepredictionofosteosarcomametastasiswithhighaccuracy
AT majun asupportvectormachineclassifierforthepredictionofosteosarcomametastasiswithhighaccuracy
AT yexiaojian asupportvectormachineclassifierforthepredictionofosteosarcomametastasiswithhighaccuracy
AT heyunfei supportvectormachineclassifierforthepredictionofosteosarcomametastasiswithhighaccuracy
AT majun supportvectormachineclassifierforthepredictionofosteosarcomametastasiswithhighaccuracy
AT yexiaojian supportvectormachineclassifierforthepredictionofosteosarcomametastasiswithhighaccuracy