Cargando…
TBIO-11. DEEP LEARNING-BASED SINGLE-CELL RNA SEQUENCING DIFFERENTIATION IDENTIFIES SIMPLE AND COMPLEX TRANSCRIPTIONAL NETWORKS FOR SUBPOPULATION CLASSIFICATION
BACKGROUND: Genomic assays capable of cellular resolution (i.e. scRNA-seq) are becoming ubiquitous in biomedical research. Machine learning, and the subtype known as Deep Learning, have broad application within scRNA-seq analytics. However, methods to facilitate the classification of cell population...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7715250/ http://dx.doi.org/10.1093/neuonc/noaa222.838 |
_version_ | 1783618911024971776 |
---|---|
author | Prince, Eric Hankinson, Todd |
author_facet | Prince, Eric Hankinson, Todd |
author_sort | Prince, Eric |
collection | PubMed |
description | BACKGROUND: Genomic assays capable of cellular resolution (i.e. scRNA-seq) are becoming ubiquitous in biomedical research. Machine learning, and the subtype known as Deep Learning, have broad application within scRNA-seq analytics. However, methods to facilitate the classification of cell populations are lacking. We present the novel computational framework HD Spot, which generates interpretable and robust Deep Learning classifiers that enable unbiased interrogation of linear and non-linear genomic signatures. METHODS: HD Spot is written in python and relies on Google’s TensorFlow2 deep learning framework. Four datasets of immune cells were obtained from the publicly available Seurat repository, generated using the 10X chromium platform. Data preprocessing used standard Seurat methodology. HD Spot generated optimized classifiers via a custom platform. Network interpretability was achieved using Shapley values. Ontology analysis was performed using Metascape. RESULTS: HD Spot identified meaningful ontologic signatures across all tested datasets. In the binary case of control versus IFN-B stimulated CD4+ T cells, gene ontologies reflected T(h0) and T(h2) T cell populations, congruent with T cell activation. In the 9-class case of PBMCs, HD Spot identified meaningful gene networks characteristic of the ground-truth populations using raw feature counts alone. When feature counts are processed into expression values, HD Spot demonstrates increased specificity of top genes and respective ontologies between subpopulations. CONCLUSION: This work introduces a broadly applicable computational tool for the advanced bioinformatician to decipher complex cellular heterogeneity (e.g., tumors) in an unbiased way. Additionally, HD Spot lowers the barrier for novice bioinformaticists to derive actionable insights from their data. |
format | Online Article Text |
id | pubmed-7715250 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-77152502020-12-09 TBIO-11. DEEP LEARNING-BASED SINGLE-CELL RNA SEQUENCING DIFFERENTIATION IDENTIFIES SIMPLE AND COMPLEX TRANSCRIPTIONAL NETWORKS FOR SUBPOPULATION CLASSIFICATION Prince, Eric Hankinson, Todd Neuro Oncol Tumor Biology (not fitting a specific disease category) BACKGROUND: Genomic assays capable of cellular resolution (i.e. scRNA-seq) are becoming ubiquitous in biomedical research. Machine learning, and the subtype known as Deep Learning, have broad application within scRNA-seq analytics. However, methods to facilitate the classification of cell populations are lacking. We present the novel computational framework HD Spot, which generates interpretable and robust Deep Learning classifiers that enable unbiased interrogation of linear and non-linear genomic signatures. METHODS: HD Spot is written in python and relies on Google’s TensorFlow2 deep learning framework. Four datasets of immune cells were obtained from the publicly available Seurat repository, generated using the 10X chromium platform. Data preprocessing used standard Seurat methodology. HD Spot generated optimized classifiers via a custom platform. Network interpretability was achieved using Shapley values. Ontology analysis was performed using Metascape. RESULTS: HD Spot identified meaningful ontologic signatures across all tested datasets. In the binary case of control versus IFN-B stimulated CD4+ T cells, gene ontologies reflected T(h0) and T(h2) T cell populations, congruent with T cell activation. In the 9-class case of PBMCs, HD Spot identified meaningful gene networks characteristic of the ground-truth populations using raw feature counts alone. When feature counts are processed into expression values, HD Spot demonstrates increased specificity of top genes and respective ontologies between subpopulations. CONCLUSION: This work introduces a broadly applicable computational tool for the advanced bioinformatician to decipher complex cellular heterogeneity (e.g., tumors) in an unbiased way. Additionally, HD Spot lowers the barrier for novice bioinformaticists to derive actionable insights from their data. Oxford University Press 2020-12-04 /pmc/articles/PMC7715250/ http://dx.doi.org/10.1093/neuonc/noaa222.838 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Neuro-Oncology. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Tumor Biology (not fitting a specific disease category) Prince, Eric Hankinson, Todd TBIO-11. DEEP LEARNING-BASED SINGLE-CELL RNA SEQUENCING DIFFERENTIATION IDENTIFIES SIMPLE AND COMPLEX TRANSCRIPTIONAL NETWORKS FOR SUBPOPULATION CLASSIFICATION |
title | TBIO-11. DEEP LEARNING-BASED SINGLE-CELL RNA SEQUENCING DIFFERENTIATION IDENTIFIES SIMPLE AND COMPLEX TRANSCRIPTIONAL NETWORKS FOR SUBPOPULATION CLASSIFICATION |
title_full | TBIO-11. DEEP LEARNING-BASED SINGLE-CELL RNA SEQUENCING DIFFERENTIATION IDENTIFIES SIMPLE AND COMPLEX TRANSCRIPTIONAL NETWORKS FOR SUBPOPULATION CLASSIFICATION |
title_fullStr | TBIO-11. DEEP LEARNING-BASED SINGLE-CELL RNA SEQUENCING DIFFERENTIATION IDENTIFIES SIMPLE AND COMPLEX TRANSCRIPTIONAL NETWORKS FOR SUBPOPULATION CLASSIFICATION |
title_full_unstemmed | TBIO-11. DEEP LEARNING-BASED SINGLE-CELL RNA SEQUENCING DIFFERENTIATION IDENTIFIES SIMPLE AND COMPLEX TRANSCRIPTIONAL NETWORKS FOR SUBPOPULATION CLASSIFICATION |
title_short | TBIO-11. DEEP LEARNING-BASED SINGLE-CELL RNA SEQUENCING DIFFERENTIATION IDENTIFIES SIMPLE AND COMPLEX TRANSCRIPTIONAL NETWORKS FOR SUBPOPULATION CLASSIFICATION |
title_sort | tbio-11. deep learning-based single-cell rna sequencing differentiation identifies simple and complex transcriptional networks for subpopulation classification |
topic | Tumor Biology (not fitting a specific disease category) |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7715250/ http://dx.doi.org/10.1093/neuonc/noaa222.838 |
work_keys_str_mv | AT princeeric tbio11deeplearningbasedsinglecellrnasequencingdifferentiationidentifiessimpleandcomplextranscriptionalnetworksforsubpopulationclassification AT hankinsontodd tbio11deeplearningbasedsinglecellrnasequencingdifferentiationidentifiessimpleandcomplextranscriptionalnetworksforsubpopulationclassification |