Cargando…

Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction

The present study aimed to identify genes associated with increased risk of myocardial infarction (MI) and construct an early diagnosis model based on support vector machine (SVM) learning. The gene expression profile data of GSE34198, containing 97 human blood samples including 49 patients with MI...

Descripción completa

Detalles Bibliográficos
Autores principales: Fang, Hong-Zhi, Hu, Dan-Li, Li, Qin, Tu, Su
Formato: Online Artículo Texto
Lenguaje:English
Publicado: D.A. Spandidos 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7411293/
https://www.ncbi.nlm.nih.gov/pubmed/32705275
http://dx.doi.org/10.3892/mmr.2020.11247
_version_ 1783568347145699328
author Fang, Hong-Zhi
Hu, Dan-Li
Li, Qin
Tu, Su
author_facet Fang, Hong-Zhi
Hu, Dan-Li
Li, Qin
Tu, Su
author_sort Fang, Hong-Zhi
collection PubMed
description The present study aimed to identify genes associated with increased risk of myocardial infarction (MI) and construct an early diagnosis model based on support vector machine (SVM) learning. The gene expression profile data of GSE34198, containing 97 human blood samples including 49 patients with MI and 48 healthy individuals, were obtained from the Gene Expression Omnibus database. Differentially expressed gene (DEG) screening, DEG enrichment analysis, protein-protein interaction (PPI) network investigation and clustering analysis were performed. The feature genes were identified using the neighboring score algorithm. Furthermore, a recursive feature elimination (RFE) algorithm was employed to screen risk factors among feature genes. The SVM prediction model was constructed and validated using the dataset GSE61144. A total of 1,207 DEGs (724 downregulated, 483 upregulated) between the two groups were identified. PPI analysis investigated 1,083 DEGs and 46,363 edges. In total, 87 genes were selected as candidate genes, and were primarily enriched in functions including ‘G-protein coupled receptor signaling’ or pathways such as ‘focal adhesion’. Furthermore, 15 genes with a high RFE score were selected to construct an SVM prediction model. The model's average accuracy was 86%. Data set verification showed that the predictive precision reached 0.92. High expression of the genes vascular endothelial growth factor A, A-kinase anchoring protein 12 and olfactory receptor 8D2 were potential risk factors for MI. The SVM early diagnosis model constructed by candidate genes could not only predict early MI, but also provide risk probability according to the severity of MI.
format Online
Article
Text
id pubmed-7411293
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher D.A. Spandidos
record_format MEDLINE/PubMed
spelling pubmed-74112932020-08-14 Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction Fang, Hong-Zhi Hu, Dan-Li Li, Qin Tu, Su Mol Med Rep Articles The present study aimed to identify genes associated with increased risk of myocardial infarction (MI) and construct an early diagnosis model based on support vector machine (SVM) learning. The gene expression profile data of GSE34198, containing 97 human blood samples including 49 patients with MI and 48 healthy individuals, were obtained from the Gene Expression Omnibus database. Differentially expressed gene (DEG) screening, DEG enrichment analysis, protein-protein interaction (PPI) network investigation and clustering analysis were performed. The feature genes were identified using the neighboring score algorithm. Furthermore, a recursive feature elimination (RFE) algorithm was employed to screen risk factors among feature genes. The SVM prediction model was constructed and validated using the dataset GSE61144. A total of 1,207 DEGs (724 downregulated, 483 upregulated) between the two groups were identified. PPI analysis investigated 1,083 DEGs and 46,363 edges. In total, 87 genes were selected as candidate genes, and were primarily enriched in functions including ‘G-protein coupled receptor signaling’ or pathways such as ‘focal adhesion’. Furthermore, 15 genes with a high RFE score were selected to construct an SVM prediction model. The model's average accuracy was 86%. Data set verification showed that the predictive precision reached 0.92. High expression of the genes vascular endothelial growth factor A, A-kinase anchoring protein 12 and olfactory receptor 8D2 were potential risk factors for MI. The SVM early diagnosis model constructed by candidate genes could not only predict early MI, but also provide risk probability according to the severity of MI. D.A. Spandidos 2020-09 2020-06-17 /pmc/articles/PMC7411293/ /pubmed/32705275 http://dx.doi.org/10.3892/mmr.2020.11247 Text en Copyright: © Fang et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
spellingShingle Articles
Fang, Hong-Zhi
Hu, Dan-Li
Li, Qin
Tu, Su
Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction
title Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction
title_full Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction
title_fullStr Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction
title_full_unstemmed Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction
title_short Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction
title_sort risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7411293/
https://www.ncbi.nlm.nih.gov/pubmed/32705275
http://dx.doi.org/10.3892/mmr.2020.11247
work_keys_str_mv AT fanghongzhi riskgeneidentificationandsupportvectormachinelearningtoconstructanearlydiagnosismodelofmyocardialinfarction
AT hudanli riskgeneidentificationandsupportvectormachinelearningtoconstructanearlydiagnosismodelofmyocardialinfarction
AT liqin riskgeneidentificationandsupportvectormachinelearningtoconstructanearlydiagnosismodelofmyocardialinfarction
AT tusu riskgeneidentificationandsupportvectormachinelearningtoconstructanearlydiagnosismodelofmyocardialinfarction