A 21-gene Support Vector Machine classifier and a 10-gene risk score system constructed for patients with gastric cancer

Gastric cancer (GC) ranks fifth in terms of incidence and third in terms of tumor mortality worldwide. The present study was designed to construct a Support Vector Machine (SVM) classifier and risk score system for GC. The GSE62254 (training set) and GSE26253 (validation set 2) datasets were downloa...

Bibliographic Details
Main authors: Jiang, Hui; Gu, Jiming; Du, Jun; Qi, Xiaowei; Qian, Chengjia; Fei, Bojian
Format: Online Article Text
Language: English
Published: D.A. Spandidos 2020
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6896370/
https://www.ncbi.nlm.nih.gov/pubmed/31939629
http://dx.doi.org/10.3892/mmr.2019.10841
author Jiang, Hui
Gu, Jiming
Du, Jun
Qi, Xiaowei
Qian, Chengjia
Fei, Bojian
collection PubMed
description Gastric cancer (GC) ranks fifth in terms of incidence and third in terms of tumor mortality worldwide. The present study was designed to construct a Support Vector Machine (SVM) classifier and risk score system for GC. The GSE62254 (training set) and GSE26253 (validation set 2) datasets were downloaded from the Gene Expression Omnibus database. Furthermore, the gene expression profile of GC (validation set 1) was obtained from The Cancer Genome Atlas database. Differentially expressed genes (DEGs) between recurrent and non-recurrent samples were determined using the limma package. The feature genes were selected using the caret package, and an SVM classifier was built using the e1071 package. Using the penalized package, the optimal predictive genes for constructing a risk score system were screened. Finally, stratification analysis of clinical factors and pathway enrichment analysis were performed using Gene Set Enrichment Analysis. A total of 239 DEGs were identified in GSE62254, among which 114 DEGs were significantly associated with both recurrence-free survival and overall survival. Subsequently, 21 feature genes were screened from the 114 DEGs, and an SVM classifier was built. A risk score system for survival prediction was constructed following the selection of 10 optimal genes: A-kinase anchoring protein 12; angiopoietin-like protein 1; cysteine-rich sequence 1; myeloid/lymphoid or mixed-lineage leukemia, translocated to chromosome 11; neuron navigator 3; neurobeachin; nephroblastoma overexpressed; pleiotrophin; tumor suppressor candidate 3; and zinc finger and SCAN domain containing 18. The stratification analysis revealed that pathological stage was an independent prognostic clinical factor in the high-risk group. Additionally, eight significant pathways were associated with the 10-gene signature. The SVM classifier and risk score system may be applied for classifying and predicting the prognosis of patients with GC, respectively.
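The abstract does not say how the risk score is computed or how patients are assigned to the high-risk group. A common convention for gene-signature scores is a coefficient-weighted sum of expression values, with the cohort median as the cutoff. The Python sketch below illustrates that convention only. The gene names, coefficients, expression values and median cutoff are all hypothetical assumptions, not values from the study, which used R packages.

```python
from statistics import median

# Hypothetical coefficients for illustration only; this record does not
# report the fitted coefficients of the 10-gene signature.
COEFFICIENTS = {"GENE_A": 0.40, "GENE_B": -0.25, "GENE_C": 0.10}


def risk_score(expression, coefficients=COEFFICIENTS):
    """Return the coefficient-weighted sum of expression over the signature genes."""
    return sum(coefficients[gene] * expression[gene] for gene in coefficients)


def stratify(scores):
    """Label samples 'high' if their score exceeds the cohort median, else 'low'.

    The median cutoff is an assumption; the abstract does not state the threshold.
    """
    cutoff = median(scores.values())
    return {
        sample: ("high" if value > cutoff else "low")
        for sample, value in scores.items()
    }


# Made-up expression values for three made-up samples.
samples = {
    "S1": {"GENE_A": 2.0, "GENE_B": 1.0, "GENE_C": 0.5},
    "S2": {"GENE_A": 0.5, "GENE_B": 2.0, "GENE_C": 1.0},
    "S3": {"GENE_A": 1.0, "GENE_B": 1.0, "GENE_C": 1.0},
}

scores = {name: risk_score(expr) for name, expr in samples.items()}
groups = stratify(scores)
```

With these made-up inputs, only S1 scores above the median and is labeled high-risk. A sample that sits exactly on the median is labeled low-risk because the comparison is strict.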
format Online
Article
Text
id pubmed-6896370
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher D.A. Spandidos
record_format MEDLINE/PubMed
spelling pubmed-6896370 2019-12-09 Mol Med Rep Articles D.A. Spandidos 2020-01 2019-11-21 /pmc/articles/PMC6896370/ /pubmed/31939629 http://dx.doi.org/10.3892/mmr.2019.10841 Text en Copyright: © Jiang et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
title A 21-gene Support Vector Machine classifier and a 10-gene risk score system constructed for patients with gastric cancer
topic Articles