Cargando…
Classification models using circulating neutrophil transcripts can detect unruptured intracranial aneurysm
BACKGROUND: Intracranial aneurysms (IAs) are dangerous because of their potential to rupture. We previously found significant RNA expression differences in circulating neutrophils between patients with and without unruptured IAs and trained machine learning models to predict presence of IA using 40...
Autores principales: | , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7565814/ https://www.ncbi.nlm.nih.gov/pubmed/33059716 http://dx.doi.org/10.1186/s12967-020-02550-2 |
_version_ | 1783596014782906368 |
---|---|
author | Poppenberg, Kerry E. Tutino, Vincent M. Li, Lu Waqas, Muhammad June, Armond Chaves, Lee Jiang, Kaiyu Jarvis, James N. Sun, Yijun Snyder, Kenneth V. Levy, Elad I. Siddiqui, Adnan H. Kolega, John Meng, Hui |
author_facet | Poppenberg, Kerry E. Tutino, Vincent M. Li, Lu Waqas, Muhammad June, Armond Chaves, Lee Jiang, Kaiyu Jarvis, James N. Sun, Yijun Snyder, Kenneth V. Levy, Elad I. Siddiqui, Adnan H. Kolega, John Meng, Hui |
author_sort | Poppenberg, Kerry E. |
collection | PubMed |
description | BACKGROUND: Intracranial aneurysms (IAs) are dangerous because of their potential to rupture. We previously found significant RNA expression differences in circulating neutrophils between patients with and without unruptured IAs and trained machine learning models to predict presence of IA using 40 neutrophil transcriptomes. Here, we aim to develop a predictive model for unruptured IA using neutrophil transcriptomes from a larger population and more robust machine learning methods. METHODS: Neutrophil RNA extracted from the blood of 134 patients (55 with IA, 79 IA-free controls) was subjected to next-generation RNA sequencing. In a randomly-selected training cohort (n = 94), the Least Absolute Shrinkage and Selection Operator (LASSO) selected transcripts, from which we constructed prediction models via 4 well-established supervised machine-learning algorithms (K-Nearest Neighbors, Random Forest, and Support Vector Machines with Gaussian and cubic kernels). We tested the models in the remaining samples (n = 40) and assessed model performance by receiver-operating-characteristic (ROC) curves. Real-time quantitative polymerase chain reaction (RT-qPCR) of 9 IA-associated genes was used to verify gene expression in a subset of 49 neutrophil RNA samples. We also examined the potential influence of demographics and comorbidities on model prediction. RESULTS: Feature selection using LASSO in the training cohort identified 37 IA-associated transcripts. Models trained using these transcripts had a maximum accuracy of 90% in the testing cohort. The testing performance across all methods had an average area under ROC curve (AUC) = 0.97, an improvement over our previous models. The Random Forest model performed best across both training and testing cohorts. RT-qPCR confirmed expression differences in 7 of 9 genes tested. Gene ontology and IPA network analyses performed on the 37 model genes reflected dysregulated inflammation, cell signaling, and apoptosis processes. In our data, demographics and comorbidities did not affect model performance. CONCLUSIONS: We improved upon our previous IA prediction models based on circulating neutrophil transcriptomes by increasing sample size and by implementing LASSO and more robust machine learning methods. Future studies are needed to validate these models in larger cohorts and further investigate effect of covariates. |
format | Online Article Text |
id | pubmed-7565814 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-75658142020-10-20 Classification models using circulating neutrophil transcripts can detect unruptured intracranial aneurysm Poppenberg, Kerry E. Tutino, Vincent M. Li, Lu Waqas, Muhammad June, Armond Chaves, Lee Jiang, Kaiyu Jarvis, James N. Sun, Yijun Snyder, Kenneth V. Levy, Elad I. Siddiqui, Adnan H. Kolega, John Meng, Hui J Transl Med Research BACKGROUND: Intracranial aneurysms (IAs) are dangerous because of their potential to rupture. We previously found significant RNA expression differences in circulating neutrophils between patients with and without unruptured IAs and trained machine learning models to predict presence of IA using 40 neutrophil transcriptomes. Here, we aim to develop a predictive model for unruptured IA using neutrophil transcriptomes from a larger population and more robust machine learning methods. METHODS: Neutrophil RNA extracted from the blood of 134 patients (55 with IA, 79 IA-free controls) was subjected to next-generation RNA sequencing. In a randomly-selected training cohort (n = 94), the Least Absolute Shrinkage and Selection Operator (LASSO) selected transcripts, from which we constructed prediction models via 4 well-established supervised machine-learning algorithms (K-Nearest Neighbors, Random Forest, and Support Vector Machines with Gaussian and cubic kernels). We tested the models in the remaining samples (n = 40) and assessed model performance by receiver-operating-characteristic (ROC) curves. Real-time quantitative polymerase chain reaction (RT-qPCR) of 9 IA-associated genes was used to verify gene expression in a subset of 49 neutrophil RNA samples. We also examined the potential influence of demographics and comorbidities on model prediction. RESULTS: Feature selection using LASSO in the training cohort identified 37 IA-associated transcripts. Models trained using these transcripts had a maximum accuracy of 90% in the testing cohort. The testing performance across all methods had an average area under ROC curve (AUC) = 0.97, an improvement over our previous models. The Random Forest model performed best across both training and testing cohorts. RT-qPCR confirmed expression differences in 7 of 9 genes tested. Gene ontology and IPA network analyses performed on the 37 model genes reflected dysregulated inflammation, cell signaling, and apoptosis processes. In our data, demographics and comorbidities did not affect model performance. CONCLUSIONS: We improved upon our previous IA prediction models based on circulating neutrophil transcriptomes by increasing sample size and by implementing LASSO and more robust machine learning methods. Future studies are needed to validate these models in larger cohorts and further investigate effect of covariates. BioMed Central 2020-10-15 /pmc/articles/PMC7565814/ /pubmed/33059716 http://dx.doi.org/10.1186/s12967-020-02550-2 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Poppenberg, Kerry E. Tutino, Vincent M. Li, Lu Waqas, Muhammad June, Armond Chaves, Lee Jiang, Kaiyu Jarvis, James N. Sun, Yijun Snyder, Kenneth V. Levy, Elad I. Siddiqui, Adnan H. Kolega, John Meng, Hui Classification models using circulating neutrophil transcripts can detect unruptured intracranial aneurysm |
title | Classification models using circulating neutrophil transcripts can detect unruptured intracranial aneurysm |
title_full | Classification models using circulating neutrophil transcripts can detect unruptured intracranial aneurysm |
title_fullStr | Classification models using circulating neutrophil transcripts can detect unruptured intracranial aneurysm |
title_full_unstemmed | Classification models using circulating neutrophil transcripts can detect unruptured intracranial aneurysm |
title_short | Classification models using circulating neutrophil transcripts can detect unruptured intracranial aneurysm |
title_sort | classification models using circulating neutrophil transcripts can detect unruptured intracranial aneurysm |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7565814/ https://www.ncbi.nlm.nih.gov/pubmed/33059716 http://dx.doi.org/10.1186/s12967-020-02550-2 |
work_keys_str_mv | AT poppenbergkerrye classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT tutinovincentm classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT lilu classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT waqasmuhammad classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT junearmond classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT chaveslee classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT jiangkaiyu classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT jarvisjamesn classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT sunyijun classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT snyderkennethv classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT levyeladi classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT siddiquiadnanh classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT kolegajohn classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm AT menghui classificationmodelsusingcirculatingneutrophiltranscriptscandetectunrupturedintracranialaneurysm |