Cargando…
Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier
The aim of the present study was to identify risk genes in myocardial infarction. Microarray data GSE34198, containing data from the peripheral blood of 49 myocardial infarction samples and 48 corresponding control samples, were downloaded from the Gene Expression Omnibus database to screen the diff...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
D.A. Spandidos
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5780094/ https://www.ncbi.nlm.nih.gov/pubmed/29138828 http://dx.doi.org/10.3892/mmr.2017.8044 |
_version_ | 1783294679024926720 |
---|---|
author | Yang, Xiaoqiang |
author_facet | Yang, Xiaoqiang |
author_sort | Yang, Xiaoqiang |
collection | PubMed |
description | The aim of the present study was to identify risk genes in myocardial infarction. Microarray data GSE34198, containing data from the peripheral blood of 49 myocardial infarction samples and 48 corresponding control samples, were downloaded from the Gene Expression Omnibus database to screen the differentially expressed genes (DEGs). The DEGs were used to construct a protein-protein interaction (PPI) network of patient samples, from which the feature genes were identified using the neighboring score method. The recursive feature elimination (RFE) algorithm was employed to select the risk genes among feature genes, which were subsequently applied to perform a support vector machine (SVM) classifier to identify the specific signature in myocardial infarction samples. Another dataset, GSE61144, was also downloaded to verify the efficacy of the classifier. A total of 724 downregulated and 483 upregulated DEGs were screened in patient samples compared with control samples in the GSE34198 dataset. The PPI network of myocardial infarction was comprised of 1,083 nodes (genes) and 46,363 lines (connections). Using the neighborhood scoring method, the top 100 feature genes in myocardial infarction samples were identified as the disease feature genes, which distinguish the myocardial infarction samples from the control samples. The RFE algorithm screened 15 risk genes, which were employed to construct a SVM classifier with an average precision of 88% to the patient sample following visualization by a confusion matrix. The predictive precision of the classifier on another microarray dataset, GSE61144, was 0.92, with an average true positive of 0.9278 and an average false positive of 0.2361. A-kinase-anchoring protein 12 (AKAP12) and glycine receptor α2 (GLRA2) were two risk genes in the SVM classifier. Therefore, AKAP12 and GLRA2 exert potential roles in the development of myocardial infarction, potentially by influencing cardiac contractility and protecting against ischemia-reperfusion injury, which may provide clues in developing potential diagnostic biomarkers or therapeutic targets for myocardial infarction. |
format | Online Article Text |
id | pubmed-5780094 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | D.A. Spandidos |
record_format | MEDLINE/PubMed |
spelling | pubmed-57800942018-02-12 Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier Yang, Xiaoqiang Mol Med Rep Articles The aim of the present study was to identify risk genes in myocardial infarction. Microarray data GSE34198, containing data from the peripheral blood of 49 myocardial infarction samples and 48 corresponding control samples, were downloaded from the Gene Expression Omnibus database to screen the differentially expressed genes (DEGs). The DEGs were used to construct a protein-protein interaction (PPI) network of patient samples, from which the feature genes were identified using the neighboring score method. The recursive feature elimination (RFE) algorithm was employed to select the risk genes among feature genes, which were subsequently applied to perform a support vector machine (SVM) classifier to identify the specific signature in myocardial infarction samples. Another dataset, GSE61144, was also downloaded to verify the efficacy of the classifier. A total of 724 downregulated and 483 upregulated DEGs were screened in patient samples compared with control samples in the GSE34198 dataset. The PPI network of myocardial infarction was comprised of 1,083 nodes (genes) and 46,363 lines (connections). Using the neighborhood scoring method, the top 100 feature genes in myocardial infarction samples were identified as the disease feature genes, which distinguish the myocardial infarction samples from the control samples. The RFE algorithm screened 15 risk genes, which were employed to construct a SVM classifier with an average precision of 88% to the patient sample following visualization by a confusion matrix. The predictive precision of the classifier on another microarray dataset, GSE61144, was 0.92, with an average true positive of 0.9278 and an average false positive of 0.2361. A-kinase-anchoring protein 12 (AKAP12) and glycine receptor α2 (GLRA2) were two risk genes in the SVM classifier. Therefore, AKAP12 and GLRA2 exert potential roles in the development of myocardial infarction, potentially by influencing cardiac contractility and protecting against ischemia-reperfusion injury, which may provide clues in developing potential diagnostic biomarkers or therapeutic targets for myocardial infarction. D.A. Spandidos 2018-01 2017-11-14 /pmc/articles/PMC5780094/ /pubmed/29138828 http://dx.doi.org/10.3892/mmr.2017.8044 Text en Copyright: © Yang et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made. |
spellingShingle | Articles Yang, Xiaoqiang Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier |
title | Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier |
title_full | Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier |
title_fullStr | Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier |
title_full_unstemmed | Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier |
title_short | Identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier |
title_sort | identification of risk genes associated with myocardial infarction based on the recursive feature elimination algorithm and support vector machine classifier |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5780094/ https://www.ncbi.nlm.nih.gov/pubmed/29138828 http://dx.doi.org/10.3892/mmr.2017.8044 |
work_keys_str_mv | AT yangxiaoqiang identificationofriskgenesassociatedwithmyocardialinfarctionbasedontherecursivefeatureeliminationalgorithmandsupportvectormachineclassifier |