Cargando…
An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets
BACKGROUND: Host immune response is coordinated by a variety of different specialized cell types that vary in time and location. While host immune response can be studied using conventional low-dimensional approaches, advances in transcriptomics analysis may provide a less biased view. Yet, leveragi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6704630/ https://www.ncbi.nlm.nih.gov/pubmed/31438843 http://dx.doi.org/10.1186/s12859-019-2994-z |
_version_ | 1783445538641805312 |
---|---|
author | Torang, Arezo Gupta, Paraag Klinke, David J. |
author_facet | Torang, Arezo Gupta, Paraag Klinke, David J. |
author_sort | Torang, Arezo |
collection | PubMed |
description | BACKGROUND: Host immune response is coordinated by a variety of different specialized cell types that vary in time and location. While host immune response can be studied using conventional low-dimensional approaches, advances in transcriptomics analysis may provide a less biased view. Yet, leveraging transcriptomics data to identify immune cell subtypes presents challenges for extracting informative gene signatures hidden within a high dimensional transcriptomics space characterized by low sample numbers with noisy and missing values. To address these challenges, we explore using machine learning methods to select gene subsets and estimate gene coefficients simultaneously. RESULTS: Elastic-net logistic regression, a type of machine learning, was used to construct separate classifiers for ten different types of immune cell and for five T helper cell subsets. The resulting classifiers were then used to develop gene signatures that best discriminate among immune cell types and T helper cell subsets using RNA-seq datasets. We validated the approach using single-cell RNA-seq (scRNA-seq) datasets, which gave consistent results. In addition, we classified cell types that were previously unannotated. Finally, we benchmarked the proposed gene signatures against other existing gene signatures. CONCLUSIONS: Developed classifiers can be used as priors in predicting the extent and functional orientation of the host immune response in diseases, such as cancer, where transcriptomic profiling of bulk tissue samples and single cells are routinely employed. Information that can provide insight into the mechanistic basis of disease and therapeutic response. The source code and documentation are available through GitHub: https://github.com/KlinkeLab/ImmClass2019. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2994-z) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6704630 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-67046302019-08-22 An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets Torang, Arezo Gupta, Paraag Klinke, David J. BMC Bioinformatics Research Article BACKGROUND: Host immune response is coordinated by a variety of different specialized cell types that vary in time and location. While host immune response can be studied using conventional low-dimensional approaches, advances in transcriptomics analysis may provide a less biased view. Yet, leveraging transcriptomics data to identify immune cell subtypes presents challenges for extracting informative gene signatures hidden within a high dimensional transcriptomics space characterized by low sample numbers with noisy and missing values. To address these challenges, we explore using machine learning methods to select gene subsets and estimate gene coefficients simultaneously. RESULTS: Elastic-net logistic regression, a type of machine learning, was used to construct separate classifiers for ten different types of immune cell and for five T helper cell subsets. The resulting classifiers were then used to develop gene signatures that best discriminate among immune cell types and T helper cell subsets using RNA-seq datasets. We validated the approach using single-cell RNA-seq (scRNA-seq) datasets, which gave consistent results. In addition, we classified cell types that were previously unannotated. Finally, we benchmarked the proposed gene signatures against other existing gene signatures. CONCLUSIONS: Developed classifiers can be used as priors in predicting the extent and functional orientation of the host immune response in diseases, such as cancer, where transcriptomic profiling of bulk tissue samples and single cells are routinely employed. Information that can provide insight into the mechanistic basis of disease and therapeutic response. The source code and documentation are available through GitHub: https://github.com/KlinkeLab/ImmClass2019. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2994-z) contains supplementary material, which is available to authorized users. BioMed Central 2019-08-22 /pmc/articles/PMC6704630/ /pubmed/31438843 http://dx.doi.org/10.1186/s12859-019-2994-z Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Torang, Arezo Gupta, Paraag Klinke, David J. An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets |
title | An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets |
title_full | An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets |
title_fullStr | An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets |
title_full_unstemmed | An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets |
title_short | An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets |
title_sort | elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and t helper cell subsets |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6704630/ https://www.ncbi.nlm.nih.gov/pubmed/31438843 http://dx.doi.org/10.1186/s12859-019-2994-z |
work_keys_str_mv | AT torangarezo anelasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets AT guptaparaag anelasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets AT klinkedavidj anelasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets AT torangarezo elasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets AT guptaparaag elasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets AT klinkedavidj elasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets |