Cargando…

Support vector machine classifier for prediction of the metastasis of colorectal cancer

Colorectal cancer (CRC) is one of the most common cancers and a major cause of mortality. The present study aimed to identify potential biomarkers for CRC metastasis and uncover the mechanisms underlying the etiology of the disease. The five datasets GSE68468, GSE62321, GSE22834, GSE14297 and GSE698...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhi, Jiajun, Sun, Jiwei, Wang, Zhongchuan, Ding, Wenjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: D.A. Spandidos 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5819940/
https://www.ncbi.nlm.nih.gov/pubmed/29328363
http://dx.doi.org/10.3892/ijmm.2018.3359
_version_ 1783301293283999744
author Zhi, Jiajun
Sun, Jiwei
Wang, Zhongchuan
Ding, Wenjun
author_facet Zhi, Jiajun
Sun, Jiwei
Wang, Zhongchuan
Ding, Wenjun
author_sort Zhi, Jiajun
collection PubMed
description Colorectal cancer (CRC) is one of the most common cancers and a major cause of mortality. The present study aimed to identify potential biomarkers for CRC metastasis and uncover the mechanisms underlying the etiology of the disease. The five datasets GSE68468, GSE62321, GSE22834, GSE14297 and GSE6988 were utilized in the study, all of which contained metastatic and non-metastatic CRC samples. Among them, three datasets were integrated via meta-analysis to identify the differentially expressed genes (DEGs) between the two types of samples. A protein-protein interaction (PPI) network was constructed for these DEGs. Candidate genes were then selected by the support vector machine (SVM) classifier based on the betweenness centrality (BC) algorithm. A CRC dataset from The Cancer Genome Atlas database was used to evaluate the accuracy of the SVM classifier. Pathway enrichment analysis was carried out for the SVM-classified gene signatures. In total, 358 DEGs were identified by meta-analysis. The top ten nodes in the PPI network with the highest BC values were selected, including cAMP responsive element binding protein 1 (CREB1), cullin 7 (CUL7) and signal sequence receptor 3 (SSR3). The optimal SVM classification model was established, which was able to precisely distinguish between the metastatic and non-metastatic samples. Based on this SVM classifier, 40 signature genes were identified, which were mainly enriched in protein processing in endoplasmic reticulum (e.g., SSR3), AMPK signaling pathway (e.g., CREB1) and ubiquitin mediated proteolysis (e.g., FBXO2, CUL7 and UBE2D3) pathways. In conclusion, the SVM-classified genes, including CREB1, CUL7 and SSR3, precisely distinguished the metastatic CRC samples from the non-metastatic ones. These genes have the potential to be used as biomarkers for the prognosis of metastatic CRC.
format Online
Article
Text
id pubmed-5819940
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher D.A. Spandidos
record_format MEDLINE/PubMed
spelling pubmed-58199402018-03-02 Support vector machine classifier for prediction of the metastasis of colorectal cancer Zhi, Jiajun Sun, Jiwei Wang, Zhongchuan Ding, Wenjun Int J Mol Med Articles Colorectal cancer (CRC) is one of the most common cancers and a major cause of mortality. The present study aimed to identify potential biomarkers for CRC metastasis and uncover the mechanisms underlying the etiology of the disease. The five datasets GSE68468, GSE62321, GSE22834, GSE14297 and GSE6988 were utilized in the study, all of which contained metastatic and non-metastatic CRC samples. Among them, three datasets were integrated via meta-analysis to identify the differentially expressed genes (DEGs) between the two types of samples. A protein-protein interaction (PPI) network was constructed for these DEGs. Candidate genes were then selected by the support vector machine (SVM) classifier based on the betweenness centrality (BC) algorithm. A CRC dataset from The Cancer Genome Atlas database was used to evaluate the accuracy of the SVM classifier. Pathway enrichment analysis was carried out for the SVM-classified gene signatures. In total, 358 DEGs were identified by meta-analysis. The top ten nodes in the PPI network with the highest BC values were selected, including cAMP responsive element binding protein 1 (CREB1), cullin 7 (CUL7) and signal sequence receptor 3 (SSR3). The optimal SVM classification model was established, which was able to precisely distinguish between the metastatic and non-metastatic samples. Based on this SVM classifier, 40 signature genes were identified, which were mainly enriched in protein processing in endoplasmic reticulum (e.g., SSR3), AMPK signaling pathway (e.g., CREB1) and ubiquitin mediated proteolysis (e.g., FBXO2, CUL7 and UBE2D3) pathways. In conclusion, the SVM-classified genes, including CREB1, CUL7 and SSR3, precisely distinguished the metastatic CRC samples from the non-metastatic ones. These genes have the potential to be used as biomarkers for the prognosis of metastatic CRC. D.A. Spandidos 2018-03 2018-01-02 /pmc/articles/PMC5819940/ /pubmed/29328363 http://dx.doi.org/10.3892/ijmm.2018.3359 Text en Copyright: © Zhi et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
spellingShingle Articles
Zhi, Jiajun
Sun, Jiwei
Wang, Zhongchuan
Ding, Wenjun
Support vector machine classifier for prediction of the metastasis of colorectal cancer
title Support vector machine classifier for prediction of the metastasis of colorectal cancer
title_full Support vector machine classifier for prediction of the metastasis of colorectal cancer
title_fullStr Support vector machine classifier for prediction of the metastasis of colorectal cancer
title_full_unstemmed Support vector machine classifier for prediction of the metastasis of colorectal cancer
title_short Support vector machine classifier for prediction of the metastasis of colorectal cancer
title_sort support vector machine classifier for prediction of the metastasis of colorectal cancer
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5819940/
https://www.ncbi.nlm.nih.gov/pubmed/29328363
http://dx.doi.org/10.3892/ijmm.2018.3359
work_keys_str_mv AT zhijiajun supportvectormachineclassifierforpredictionofthemetastasisofcolorectalcancer
AT sunjiwei supportvectormachineclassifierforpredictionofthemetastasisofcolorectalcancer
AT wangzhongchuan supportvectormachineclassifierforpredictionofthemetastasisofcolorectalcancer
AT dingwenjun supportvectormachineclassifierforpredictionofthemetastasisofcolorectalcancer