Cargando…

scAVENGERS: a genotype-based deconvolution of individuals in multiplexed single-cell ATAC-seq data without reference genotypes

Genetic differences inferred from sequencing reads can be used for demultiplexing of pooled single-cell RNA-seq (scRNA-seq) data across multiple donors without WGS-based reference genotypes. However, such methods could not be directly applied to single-cell ATAC-seq (scATAC-seq) data owing to the lo...

Descripción completa

Detalles Bibliográficos
Autores principales: Han, Seungbeom, Kim, Kyukwang, Park, Seongwan, Lee, Andrew J, Chun, Hyonho, Jung, Inkyung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9803874/
https://www.ncbi.nlm.nih.gov/pubmed/36601579
http://dx.doi.org/10.1093/nargab/lqac095
_version_ 1784861980603973632
author Han, Seungbeom
Kim, Kyukwang
Park, Seongwan
Lee, Andrew J
Chun, Hyonho
Jung, Inkyung
author_facet Han, Seungbeom
Kim, Kyukwang
Park, Seongwan
Lee, Andrew J
Chun, Hyonho
Jung, Inkyung
author_sort Han, Seungbeom
collection PubMed
description Genetic differences inferred from sequencing reads can be used for demultiplexing of pooled single-cell RNA-seq (scRNA-seq) data across multiple donors without WGS-based reference genotypes. However, such methods could not be directly applied to single-cell ATAC-seq (scATAC-seq) data owing to the lower read coverage for each variant compared to scRNA-seq. We propose a new software, scATAC-seq Variant-based EstimatioN for GEnotype ReSolving (scAVENGERS), which resolves this issue by calling more individual-specific germline variants and using an optimized mixture model for the scATAC-seq. The benchmark conducted with three synthetic multiplexed scATAC-seq datasets of peripheral blood mononuclear cells and prefrontal cortex tissues showed outstanding performance compared to existing methods in terms of accuracy, doublet detection, and a portion of donor-assigned cells. Furthermore, analyzing the effect of the improved sections provided insight into handling pooled single-cell data in the future. Our source code of the devised software is available at GitHub: https://github.com/kaistcbfg/scAVENGERS.
format Online
Article
Text
id pubmed-9803874
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98038742023-01-03 scAVENGERS: a genotype-based deconvolution of individuals in multiplexed single-cell ATAC-seq data without reference genotypes Han, Seungbeom Kim, Kyukwang Park, Seongwan Lee, Andrew J Chun, Hyonho Jung, Inkyung NAR Genom Bioinform High Throughput Sequencing Methods Genetic differences inferred from sequencing reads can be used for demultiplexing of pooled single-cell RNA-seq (scRNA-seq) data across multiple donors without WGS-based reference genotypes. However, such methods could not be directly applied to single-cell ATAC-seq (scATAC-seq) data owing to the lower read coverage for each variant compared to scRNA-seq. We propose a new software, scATAC-seq Variant-based EstimatioN for GEnotype ReSolving (scAVENGERS), which resolves this issue by calling more individual-specific germline variants and using an optimized mixture model for the scATAC-seq. The benchmark conducted with three synthetic multiplexed scATAC-seq datasets of peripheral blood mononuclear cells and prefrontal cortex tissues showed outstanding performance compared to existing methods in terms of accuracy, doublet detection, and a portion of donor-assigned cells. Furthermore, analyzing the effect of the improved sections provided insight into handling pooled single-cell data in the future. Our source code of the devised software is available at GitHub: https://github.com/kaistcbfg/scAVENGERS. Oxford University Press 2022-12-31 /pmc/articles/PMC9803874/ /pubmed/36601579 http://dx.doi.org/10.1093/nargab/lqac095 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle High Throughput Sequencing Methods
Han, Seungbeom
Kim, Kyukwang
Park, Seongwan
Lee, Andrew J
Chun, Hyonho
Jung, Inkyung
scAVENGERS: a genotype-based deconvolution of individuals in multiplexed single-cell ATAC-seq data without reference genotypes
title scAVENGERS: a genotype-based deconvolution of individuals in multiplexed single-cell ATAC-seq data without reference genotypes
title_full scAVENGERS: a genotype-based deconvolution of individuals in multiplexed single-cell ATAC-seq data without reference genotypes
title_fullStr scAVENGERS: a genotype-based deconvolution of individuals in multiplexed single-cell ATAC-seq data without reference genotypes
title_full_unstemmed scAVENGERS: a genotype-based deconvolution of individuals in multiplexed single-cell ATAC-seq data without reference genotypes
title_short scAVENGERS: a genotype-based deconvolution of individuals in multiplexed single-cell ATAC-seq data without reference genotypes
title_sort scavengers: a genotype-based deconvolution of individuals in multiplexed single-cell atac-seq data without reference genotypes
topic High Throughput Sequencing Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9803874/
https://www.ncbi.nlm.nih.gov/pubmed/36601579
http://dx.doi.org/10.1093/nargab/lqac095
work_keys_str_mv AT hanseungbeom scavengersagenotypebaseddeconvolutionofindividualsinmultiplexedsinglecellatacseqdatawithoutreferencegenotypes
AT kimkyukwang scavengersagenotypebaseddeconvolutionofindividualsinmultiplexedsinglecellatacseqdatawithoutreferencegenotypes
AT parkseongwan scavengersagenotypebaseddeconvolutionofindividualsinmultiplexedsinglecellatacseqdatawithoutreferencegenotypes
AT leeandrewj scavengersagenotypebaseddeconvolutionofindividualsinmultiplexedsinglecellatacseqdatawithoutreferencegenotypes
AT chunhyonho scavengersagenotypebaseddeconvolutionofindividualsinmultiplexedsinglecellatacseqdatawithoutreferencegenotypes
AT junginkyung scavengersagenotypebaseddeconvolutionofindividualsinmultiplexedsinglecellatacseqdatawithoutreferencegenotypes