Cargando…

An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets

BACKGROUND: Host immune response is coordinated by a variety of different specialized cell types that vary in time and location. While host immune response can be studied using conventional low-dimensional approaches, advances in transcriptomics analysis may provide a less biased view. Yet, leveragi...

Descripción completa

Detalles Bibliográficos
Autores principales: Torang, Arezo, Gupta, Paraag, Klinke, David J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6704630/
https://www.ncbi.nlm.nih.gov/pubmed/31438843
http://dx.doi.org/10.1186/s12859-019-2994-z
_version_ 1783445538641805312
author Torang, Arezo
Gupta, Paraag
Klinke, David J.
author_facet Torang, Arezo
Gupta, Paraag
Klinke, David J.
author_sort Torang, Arezo
collection PubMed
description BACKGROUND: Host immune response is coordinated by a variety of different specialized cell types that vary in time and location. While host immune response can be studied using conventional low-dimensional approaches, advances in transcriptomics analysis may provide a less biased view. Yet, leveraging transcriptomics data to identify immune cell subtypes presents challenges for extracting informative gene signatures hidden within a high dimensional transcriptomics space characterized by low sample numbers with noisy and missing values. To address these challenges, we explore using machine learning methods to select gene subsets and estimate gene coefficients simultaneously. RESULTS: Elastic-net logistic regression, a type of machine learning, was used to construct separate classifiers for ten different types of immune cell and for five T helper cell subsets. The resulting classifiers were then used to develop gene signatures that best discriminate among immune cell types and T helper cell subsets using RNA-seq datasets. We validated the approach using single-cell RNA-seq (scRNA-seq) datasets, which gave consistent results. In addition, we classified cell types that were previously unannotated. Finally, we benchmarked the proposed gene signatures against other existing gene signatures. CONCLUSIONS: Developed classifiers can be used as priors in predicting the extent and functional orientation of the host immune response in diseases, such as cancer, where transcriptomic profiling of bulk tissue samples and single cells are routinely employed. Information that can provide insight into the mechanistic basis of disease and therapeutic response. The source code and documentation are available through GitHub: https://github.com/KlinkeLab/ImmClass2019. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2994-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6704630
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-67046302019-08-22 An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets Torang, Arezo Gupta, Paraag Klinke, David J. BMC Bioinformatics Research Article BACKGROUND: Host immune response is coordinated by a variety of different specialized cell types that vary in time and location. While host immune response can be studied using conventional low-dimensional approaches, advances in transcriptomics analysis may provide a less biased view. Yet, leveraging transcriptomics data to identify immune cell subtypes presents challenges for extracting informative gene signatures hidden within a high dimensional transcriptomics space characterized by low sample numbers with noisy and missing values. To address these challenges, we explore using machine learning methods to select gene subsets and estimate gene coefficients simultaneously. RESULTS: Elastic-net logistic regression, a type of machine learning, was used to construct separate classifiers for ten different types of immune cell and for five T helper cell subsets. The resulting classifiers were then used to develop gene signatures that best discriminate among immune cell types and T helper cell subsets using RNA-seq datasets. We validated the approach using single-cell RNA-seq (scRNA-seq) datasets, which gave consistent results. In addition, we classified cell types that were previously unannotated. Finally, we benchmarked the proposed gene signatures against other existing gene signatures. CONCLUSIONS: Developed classifiers can be used as priors in predicting the extent and functional orientation of the host immune response in diseases, such as cancer, where transcriptomic profiling of bulk tissue samples and single cells are routinely employed. Information that can provide insight into the mechanistic basis of disease and therapeutic response. The source code and documentation are available through GitHub: https://github.com/KlinkeLab/ImmClass2019. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2994-z) contains supplementary material, which is available to authorized users. BioMed Central 2019-08-22 /pmc/articles/PMC6704630/ /pubmed/31438843 http://dx.doi.org/10.1186/s12859-019-2994-z Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Torang, Arezo
Gupta, Paraag
Klinke, David J.
An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets
title An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets
title_full An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets
title_fullStr An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets
title_full_unstemmed An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets
title_short An elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and T helper cell subsets
title_sort elastic-net logistic regression approach to generate classifiers and gene signatures for types of immune cells and t helper cell subsets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6704630/
https://www.ncbi.nlm.nih.gov/pubmed/31438843
http://dx.doi.org/10.1186/s12859-019-2994-z
work_keys_str_mv AT torangarezo anelasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets
AT guptaparaag anelasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets
AT klinkedavidj anelasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets
AT torangarezo elasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets
AT guptaparaag elasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets
AT klinkedavidj elasticnetlogisticregressionapproachtogenerateclassifiersandgenesignaturesfortypesofimmunecellsandthelpercellsubsets