Cargando…

DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection

BACKGROUND: Single-cell sequencing provides detailed insights into biological processes including cell differentiation and identity. While providing deep cell-specific information, the method suffers from technical constraints, most notably a limited number of expressed genes per cell, which leads t...

Descripción completa

Detalles Bibliográficos
Autores principales: Hausmann, Fabian, Ergen, Can, Khatri, Robin, Marouf, Mohamed, Hänzelmann, Sonja, Gagliani, Nicola, Huber, Samuel, Machart, Pierre, Bonn, Stefan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10510283/
https://www.ncbi.nlm.nih.gov/pubmed/37730638
http://dx.doi.org/10.1186/s13059-023-03049-x
_version_ 1785107935137890304
author Hausmann, Fabian
Ergen, Can
Khatri, Robin
Marouf, Mohamed
Hänzelmann, Sonja
Gagliani, Nicola
Huber, Samuel
Machart, Pierre
Bonn, Stefan
author_facet Hausmann, Fabian
Ergen, Can
Khatri, Robin
Marouf, Mohamed
Hänzelmann, Sonja
Gagliani, Nicola
Huber, Samuel
Machart, Pierre
Bonn, Stefan
author_sort Hausmann, Fabian
collection PubMed
description BACKGROUND: Single-cell sequencing provides detailed insights into biological processes including cell differentiation and identity. While providing deep cell-specific information, the method suffers from technical constraints, most notably a limited number of expressed genes per cell, which leads to suboptimal clustering and cell type identification. RESULTS: Here, we present DISCERN, a novel deep generative network that precisely reconstructs missing single-cell gene expression using a reference dataset. DISCERN outperforms competing algorithms in expression inference resulting in greatly improved cell clustering, cell type and activity detection, and insights into the cellular regulation of disease. We show that DISCERN is robust against differences between batches and is able to keep biological differences between batches, which is a common problem for imputation and batch correction algorithms. We use DISCERN to detect two unseen COVID-19-associated T cell types, cytotoxic CD4(+) and CD8(+) Tc2 T helper cells, with a potential role in adverse disease outcome. We utilize T cell fraction information of patient blood to classify mild or severe COVID-19 with an AUROC of 80% that can serve as a biomarker of disease stage. DISCERN can be easily integrated into existing single-cell sequencing workflow. CONCLUSIONS: Thus, DISCERN is a flexible tool for reconstructing missing single-cell gene expression using a reference dataset and can easily be applied to a variety of data sets yielding novel insights, e.g., into disease mechanisms. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03049-x.
format Online
Article
Text
id pubmed-10510283
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-105102832023-09-21 DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection Hausmann, Fabian Ergen, Can Khatri, Robin Marouf, Mohamed Hänzelmann, Sonja Gagliani, Nicola Huber, Samuel Machart, Pierre Bonn, Stefan Genome Biol Research BACKGROUND: Single-cell sequencing provides detailed insights into biological processes including cell differentiation and identity. While providing deep cell-specific information, the method suffers from technical constraints, most notably a limited number of expressed genes per cell, which leads to suboptimal clustering and cell type identification. RESULTS: Here, we present DISCERN, a novel deep generative network that precisely reconstructs missing single-cell gene expression using a reference dataset. DISCERN outperforms competing algorithms in expression inference resulting in greatly improved cell clustering, cell type and activity detection, and insights into the cellular regulation of disease. We show that DISCERN is robust against differences between batches and is able to keep biological differences between batches, which is a common problem for imputation and batch correction algorithms. We use DISCERN to detect two unseen COVID-19-associated T cell types, cytotoxic CD4(+) and CD8(+) Tc2 T helper cells, with a potential role in adverse disease outcome. We utilize T cell fraction information of patient blood to classify mild or severe COVID-19 with an AUROC of 80% that can serve as a biomarker of disease stage. DISCERN can be easily integrated into existing single-cell sequencing workflow. CONCLUSIONS: Thus, DISCERN is a flexible tool for reconstructing missing single-cell gene expression using a reference dataset and can easily be applied to a variety of data sets yielding novel insights, e.g., into disease mechanisms. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03049-x. BioMed Central 2023-09-20 /pmc/articles/PMC10510283/ /pubmed/37730638 http://dx.doi.org/10.1186/s13059-023-03049-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Hausmann, Fabian
Ergen, Can
Khatri, Robin
Marouf, Mohamed
Hänzelmann, Sonja
Gagliani, Nicola
Huber, Samuel
Machart, Pierre
Bonn, Stefan
DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection
title DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection
title_full DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection
title_fullStr DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection
title_full_unstemmed DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection
title_short DISCERN: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection
title_sort discern: deep single-cell expression reconstruction for improved cell clustering and cell subtype and state detection
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10510283/
https://www.ncbi.nlm.nih.gov/pubmed/37730638
http://dx.doi.org/10.1186/s13059-023-03049-x
work_keys_str_mv AT hausmannfabian discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection
AT ergencan discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection
AT khatrirobin discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection
AT maroufmohamed discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection
AT hanzelmannsonja discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection
AT gaglianinicola discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection
AT hubersamuel discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection
AT machartpierre discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection
AT bonnstefan discerndeepsinglecellexpressionreconstructionforimprovedcellclusteringandcellsubtypeandstatedetection