Cargando…
DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection
BACKGROUND: Single-cell sequencing provides detailed insights into biological processes including cell differentiation and identity. While providing deep cell-specific information, the method suffers from technical constraints, most notably a limited number of expressed genes per cell, which leads t...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10510283/ https://www.ncbi.nlm.nih.gov/pubmed/37730638 http://dx.doi.org/10.1186/s13059-023-03049-x |
_version_ | 1785107935137890304 |
---|---|
author | Hausmann, Fabian Ergen, Can Khatri, Robin Marouf, Mohamed Hänzelmann, Sonja Gagliani, Nicola Huber, Samuel Machart, Pierre Bonn, Stefan |
author_facet | Hausmann, Fabian Ergen, Can Khatri, Robin Marouf, Mohamed Hänzelmann, Sonja Gagliani, Nicola Huber, Samuel Machart, Pierre Bonn, Stefan |
author_sort | Hausmann, Fabian |
collection | PubMed |
description | BACKGROUND: Single-cell sequencing provides detailed insights into biological processes including cell differentiation and identity. While providing deep cell-specific information, the method suffers from technical constraints, most notably a limited number of expressed genes per cell, which leads to suboptimal clustering and cell type identification. RESULTS: Here, we present DISCERN, a novel deep generative network that precisely reconstructs missing single-cell gene expression using a reference dataset. DISCERN outperforms competing algorithms in expression inference resulting in greatly improved cell clustering, cell type and activity detection, and insights into the cellular regulation of disease. We show that DISCERN is robust against differences between batches and is able to keep biological differences between batches, which is a common problem for imputation and batch correction algorithms. We use DISCERN to detect two unseen COVID-19-associated T cell types, cytotoxic CD4(+) and CD8(+) Tc2 T helper cells, with a potential role in adverse disease outcome. We utilize T cell fraction information of patient blood to classify mild or severe COVID-19 with an AUROC of 80% that can serve as a biomarker of disease stage. DISCERN can be easily integrated into existing single-cell sequencing workflow. CONCLUSIONS: Thus, DISCERN is a flexible tool for reconstructing missing single-cell gene expression using a reference dataset and can easily be applied to a variety of data sets yielding novel insights, e.g., into disease mechanisms. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03049-x. |
format | Online Article Text |
id | pubmed-10510283 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-105102832023-09-21 DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection Hausmann, Fabian Ergen, Can Khatri, Robin Marouf, Mohamed Hänzelmann, Sonja Gagliani, Nicola Huber, Samuel Machart, Pierre Bonn, Stefan Genome Biol Research BACKGROUND: Single-cell sequencing provides detailed insights into biological processes including cell differentiation and identity. While providing deep cell-specific information, the method suffers from technical constraints, most notably a limited number of expressed genes per cell, which leads to suboptimal clustering and cell type identification. RESULTS: Here, we present DISCERN, a novel deep generative network that precisely reconstructs missing single-cell gene expression using a reference dataset. DISCERN outperforms competing algorithms in expression inference resulting in greatly improved cell clustering, cell type and activity detection, and insights into the cellular regulation of disease. We show that DISCERN is robust against differences between batches and is able to keep biological differences between batches, which is a common problem for imputation and batch correction algorithms. We use DISCERN to detect two unseen COVID-19-associated T cell types, cytotoxic CD4(+) and CD8(+) Tc2 T helper cells, with a potential role in adverse disease outcome. We utilize T cell fraction information of patient blood to classify mild or severe COVID-19 with an AUROC of 80% that can serve as a biomarker of disease stage. DISCERN can be easily integrated into existing single-cell sequencing workflow. CONCLUSIONS: Thus, DISCERN is a flexible tool for reconstructing missing single-cell gene expression using a reference dataset and can easily be applied to a variety of data sets yielding novel insights, e.g., into disease mechanisms. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03049-x. BioMed Central 2023-09-20 /pmc/articles/PMC10510283/ /pubmed/37730638 http://dx.doi.org/10.1186/s13059-023-03049-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Hausmann, Fabian Ergen, Can Khatri, Robin Marouf, Mohamed Hänzelmann, Sonja Gagliani, Nicola Huber, Samuel Machart, Pierre Bonn, Stefan DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection |
title | DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection |
title_full | DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection |
title_fullStr | DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection |
title_full_unstemmed | DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection |
title_short | DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection |
title_sort | discern: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10510283/ https://www.ncbi.nlm.nih.gov/pubmed/37730638 http://dx.doi.org/10.1186/s13059-023-03049-x |
work_keys_str_mv | AT hausmannfabian discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection AT ergencan discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection AT khatrirobin discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection AT maroufmohamed discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection AT hanzelmannsonja discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection AT gaglianinicola discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection AT hubersamuel discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection AT machartpierre discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection AT bonnstefan discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection |