Cargando…

Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq

Single-cell Assay Transposase Accessible Chromatin sequencing (scATAC-seq) has been widely used in profiling genome-wide chromatin accessibility in thousands of individual cells. However, compared with single-cell RNA-seq, the peaks of scATAC-seq are much sparser due to the lower copy numbers (diplo...

Descripción completa

Detalles Bibliográficos
Autores principales: Cui, Zhe, Cui, Ya, Gao, Yan, Jiang, Tao, Zang, Tianyi, Wang, Yadong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8056015/
https://www.ncbi.nlm.nih.gov/pubmed/33889181
http://dx.doi.org/10.3389/fgene.2021.658352
_version_ 1783680563740147712
author Cui, Zhe
Cui, Ya
Gao, Yan
Jiang, Tao
Zang, Tianyi
Wang, Yadong
author_facet Cui, Zhe
Cui, Ya
Gao, Yan
Jiang, Tao
Zang, Tianyi
Wang, Yadong
author_sort Cui, Zhe
collection PubMed
description Single-cell Assay Transposase Accessible Chromatin sequencing (scATAC-seq) has been widely used in profiling genome-wide chromatin accessibility in thousands of individual cells. However, compared with single-cell RNA-seq, the peaks of scATAC-seq are much sparser due to the lower copy numbers (diploid in humans) and the inherent missing signals, which makes it more challenging to classify cell type based on specific expressed gene or other canonical markers. Here, we present svmATAC, a support vector machine (SVM)-based method for accurately identifying cell types in scATAC-seq datasets by enhancing peak signal strength and imputing signals through patterns of co-accessibility. We applied svmATAC to several scATAC-seq data from human immune cells, human hematopoietic system cells, and peripheral blood mononuclear cells. The benchmark results showed that svmATAC is free of literature-based markers and robust across datasets in different libraries and platforms. The source code of svmATAC is available at https://github.com/mrcuizhe/svmATAC under the MIT license.
format Online
Article
Text
id pubmed-8056015
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-80560152021-04-21 Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq Cui, Zhe Cui, Ya Gao, Yan Jiang, Tao Zang, Tianyi Wang, Yadong Front Genet Genetics Single-cell Assay Transposase Accessible Chromatin sequencing (scATAC-seq) has been widely used in profiling genome-wide chromatin accessibility in thousands of individual cells. However, compared with single-cell RNA-seq, the peaks of scATAC-seq are much sparser due to the lower copy numbers (diploid in humans) and the inherent missing signals, which makes it more challenging to classify cell type based on specific expressed gene or other canonical markers. Here, we present svmATAC, a support vector machine (SVM)-based method for accurately identifying cell types in scATAC-seq datasets by enhancing peak signal strength and imputing signals through patterns of co-accessibility. We applied svmATAC to several scATAC-seq data from human immune cells, human hematopoietic system cells, and peripheral blood mononuclear cells. The benchmark results showed that svmATAC is free of literature-based markers and robust across datasets in different libraries and platforms. The source code of svmATAC is available at https://github.com/mrcuizhe/svmATAC under the MIT license. Frontiers Media S.A. 2021-04-06 /pmc/articles/PMC8056015/ /pubmed/33889181 http://dx.doi.org/10.3389/fgene.2021.658352 Text en Copyright © 2021 Cui, Cui, Gao, Jiang, Zang and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Cui, Zhe
Cui, Ya
Gao, Yan
Jiang, Tao
Zang, Tianyi
Wang, Yadong
Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq
title Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq
title_full Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq
title_fullStr Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq
title_full_unstemmed Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq
title_short Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq
title_sort enhancement and imputation of peak signal enables accurate cell-type classification in scatac-seq
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8056015/
https://www.ncbi.nlm.nih.gov/pubmed/33889181
http://dx.doi.org/10.3389/fgene.2021.658352
work_keys_str_mv AT cuizhe enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq
AT cuiya enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq
AT gaoyan enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq
AT jiangtao enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq
AT zangtianyi enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq
AT wangyadong enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq