Cargando…

Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier

The aim of the present study was to identify risk genes in myocardial infarction. Microarray data GSE34198, containing data from the peripheral blood of 49 myocardial infarction samples and 48 corresponding control samples, were downloaded from the Gene Expression Omnibus database to screen the diff...

Descripción completa

Detalles Bibliográficos
Autor principal: Yang, Xiaoqiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: D.A. Spandidos 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5780094/
https://www.ncbi.nlm.nih.gov/pubmed/29138828
http://dx.doi.org/10.3892/mmr.2017.8044
_version_ 1783294679024926720
author Yang, Xiaoqiang
author_facet Yang, Xiaoqiang
author_sort Yang, Xiaoqiang
collection PubMed
description The aim of the present study was to identify risk genes in myocardial infarction. Microarray data GSE34198, containing data from the peripheral blood of 49 myocardial infarction samples and 48 corresponding control samples, were downloaded from the Gene Expression Omnibus database to screen the differentially expressed genes (DEGs). The DEGs were used to construct a protein-protein interaction (PPI) network of patient samples, from which the feature genes were identified using the neighboring score method. The recursive feature elimination (RFE) algorithm was employed to select the risk genes among feature genes, which were subsequently applied to perform a support vector machine (SVM) classifier to identify the specific signature in myocardial infarction samples. Another dataset, GSE61144, was also downloaded to verify the efficacy of the classifier. A total of 724 downregulated and 483 upregulated DEGs were screened in patient samples compared with control samples in the GSE34198 dataset. The PPI network of myocardial infarction was comprised of 1,083 nodes (genes) and 46,363 lines (connections). Using the neighborhood scoring method, the top 100 feature genes in myocardial infarction samples were identified as the disease feature genes, which distinguish the myocardial infarction samples from the control samples. The RFE algorithm screened 15 risk genes, which were employed to construct a SVM classifier with an average precision of 88% to the patient sample following visualization by a confusion matrix. The predictive precision of the classifier on another microarray dataset, GSE61144, was 0.92, with an average true positive of 0.9278 and an average false positive of 0.2361. A-kinase-anchoring protein 12 (AKAP12) and glycine receptor α2 (GLRA2) were two risk genes in the SVM classifier. Therefore, AKAP12 and GLRA2 exert potential roles in the development of myocardial infarction, potentially by influencing cardiac contractility and protecting against ischemia-reperfusion injury, which may provide clues in developing potential diagnostic biomarkers or therapeutic targets for myocardial infarction.
format Online
Article
Text
id pubmed-5780094
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher D.A. Spandidos
record_format MEDLINE/PubMed
spelling pubmed-57800942018-02-12 Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier Yang, Xiaoqiang Mol Med Rep Articles The aim of the present study was to identify risk genes in myocardial infarction. Microarray data GSE34198, containing data from the peripheral blood of 49 myocardial infarction samples and 48 corresponding control samples, were downloaded from the Gene Expression Omnibus database to screen the differentially expressed genes (DEGs). The DEGs were used to construct a protein-protein interaction (PPI) network of patient samples, from which the feature genes were identified using the neighboring score method. The recursive feature elimination (RFE) algorithm was employed to select the risk genes among feature genes, which were subsequently applied to perform a support vector machine (SVM) classifier to identify the specific signature in myocardial infarction samples. Another dataset, GSE61144, was also downloaded to verify the efficacy of the classifier. A total of 724 downregulated and 483 upregulated DEGs were screened in patient samples compared with control samples in the GSE34198 dataset. The PPI network of myocardial infarction was comprised of 1,083 nodes (genes) and 46,363 lines (connections). Using the neighborhood scoring method, the top 100 feature genes in myocardial infarction samples were identified as the disease feature genes, which distinguish the myocardial infarction samples from the control samples. The RFE algorithm screened 15 risk genes, which were employed to construct a SVM classifier with an average precision of 88% to the patient sample following visualization by a confusion matrix. The predictive precision of the classifier on another microarray dataset, GSE61144, was 0.92, with an average true positive of 0.9278 and an average false positive of 0.2361. A-kinase-anchoring protein 12 (AKAP12) and glycine receptor α2 (GLRA2) were two risk genes in the SVM classifier. Therefore, AKAP12 and GLRA2 exert potential roles in the development of myocardial infarction, potentially by influencing cardiac contractility and protecting against ischemia-reperfusion injury, which may provide clues in developing potential diagnostic biomarkers or therapeutic targets for myocardial infarction. D.A. Spandidos 2018-01 2017-11-14 /pmc/articles/PMC5780094/ /pubmed/29138828 http://dx.doi.org/10.3892/mmr.2017.8044 Text en Copyright: © Yang et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
spellingShingle Articles
Yang, Xiaoqiang
Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier
title Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier
title_full Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier
title_fullStr Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier
title_full_unstemmed Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier
title_short Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier
title_sort identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5780094/
https://www.ncbi.nlm.nih.gov/pubmed/29138828
http://dx.doi.org/10.3892/mmr.2017.8044
work_keys_str_mv AT yangxiaoqiang identificationofriskgenesassociatedwithmyocardialinfarctionbasedontherecursivefeatureeliminationalgorithmandsupportvectormachineclassifier