Cargando…
Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction
The present study aimed to identify genes associated with increased risk of myocardial infarction (MI) and construct an early diagnosis model based on support vector machine (SVM) learning. The gene expression profile data of GSE34198, containing 97 human blood samples including 49 patients with MI...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
D.A. Spandidos
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7411293/ https://www.ncbi.nlm.nih.gov/pubmed/32705275 http://dx.doi.org/10.3892/mmr.2020.11247 |
_version_ | 1783568347145699328 |
---|---|
author | Fang, Hong-Zhi Hu, Dan-Li Li, Qin Tu, Su |
author_facet | Fang, Hong-Zhi Hu, Dan-Li Li, Qin Tu, Su |
author_sort | Fang, Hong-Zhi |
collection | PubMed |
description | The present study aimed to identify genes associated with increased risk of myocardial infarction (MI) and construct an early diagnosis model based on support vector machine (SVM) learning. The gene expression profile data of GSE34198, containing 97 human blood samples including 49 patients with MI and 48 healthy individuals, were obtained from the Gene Expression Omnibus database. Differentially expressed gene (DEG) screening, DEG enrichment analysis, protein-protein interaction (PPI) network investigation and clustering analysis were performed. The feature genes were identified using the neighboring score algorithm. Furthermore, a recursive feature elimination (RFE) algorithm was employed to screen risk factors among feature genes. The SVM prediction model was constructed and validated using the dataset GSE61144. A total of 1,207 DEGs (724 downregulated, 483 upregulated) between the two groups were identified. PPI analysis investigated 1,083 DEGs and 46,363 edges. In total, 87 genes were selected as candidate genes, and were primarily enriched in functions including ‘G-protein coupled receptor signaling’ or pathways such as ‘focal adhesion’. Furthermore, 15 genes with a high RFE score were selected to construct an SVM prediction model. The model's average accuracy was 86%. Data set verification showed that the predictive precision reached 0.92. High expression of the genes vascular endothelial growth factor A, A-kinase anchoring protein 12 and olfactory receptor 8D2 were potential risk factors for MI. The SVM early diagnosis model constructed by candidate genes could not only predict early MI, but also provide risk probability according to the severity of MI. |
format | Online Article Text |
id | pubmed-7411293 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | D.A. Spandidos |
record_format | MEDLINE/PubMed |
spelling | pubmed-74112932020-08-14 Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction Fang, Hong-Zhi Hu, Dan-Li Li, Qin Tu, Su Mol Med Rep Articles The present study aimed to identify genes associated with increased risk of myocardial infarction (MI) and construct an early diagnosis model based on support vector machine (SVM) learning. The gene expression profile data of GSE34198, containing 97 human blood samples including 49 patients with MI and 48 healthy individuals, were obtained from the Gene Expression Omnibus database. Differentially expressed gene (DEG) screening, DEG enrichment analysis, protein-protein interaction (PPI) network investigation and clustering analysis were performed. The feature genes were identified using the neighboring score algorithm. Furthermore, a recursive feature elimination (RFE) algorithm was employed to screen risk factors among feature genes. The SVM prediction model was constructed and validated using the dataset GSE61144. A total of 1,207 DEGs (724 downregulated, 483 upregulated) between the two groups were identified. PPI analysis investigated 1,083 DEGs and 46,363 edges. In total, 87 genes were selected as candidate genes, and were primarily enriched in functions including ‘G-protein coupled receptor signaling’ or pathways such as ‘focal adhesion’. Furthermore, 15 genes with a high RFE score were selected to construct an SVM prediction model. The model's average accuracy was 86%. Data set verification showed that the predictive precision reached 0.92. High expression of the genes vascular endothelial growth factor A, A-kinase anchoring protein 12 and olfactory receptor 8D2 were potential risk factors for MI. The SVM early diagnosis model constructed by candidate genes could not only predict early MI, but also provide risk probability according to the severity of MI. D.A. Spandidos 2020-09 2020-06-17 /pmc/articles/PMC7411293/ /pubmed/32705275 http://dx.doi.org/10.3892/mmr.2020.11247 Text en Copyright: © Fang et al. This is an open access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which permits use and distribution in any medium, provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made. |
spellingShingle | Articles Fang, Hong-Zhi Hu, Dan-Li Li, Qin Tu, Su Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction |
title | Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction |
title_full | Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction |
title_fullStr | Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction |
title_full_unstemmed | Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction |
title_short | Risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction |
title_sort | risk gene identification and support vector machine learning to construct an early diagnosis model of myocardial infarction |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7411293/ https://www.ncbi.nlm.nih.gov/pubmed/32705275 http://dx.doi.org/10.3892/mmr.2020.11247 |
work_keys_str_mv | AT fanghongzhi riskgeneidentificationandsupportvectormachinelearningtoconstructanearlydiagnosismodelofmyocardialinfarction AT hudanli riskgeneidentificationandsupportvectormachinelearningtoconstructanearlydiagnosismodelofmyocardialinfarction AT liqin riskgeneidentificationandsupportvectormachinelearningtoconstructanearlydiagnosismodelofmyocardialinfarction AT tusu riskgeneidentificationandsupportvectormachinelearningtoconstructanearlydiagnosismodelofmyocardialinfarction |