Cargando…
Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq
Single-cell Assay Transposase Accessible Chromatin sequencing (scATAC-seq) has been widely used in profiling genome-wide chromatin accessibility in thousands of individual cells. However, compared with single-cell RNA-seq, the peaks of scATAC-seq are much sparser due to the lower copy numbers (diplo...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8056015/ https://www.ncbi.nlm.nih.gov/pubmed/33889181 http://dx.doi.org/10.3389/fgene.2021.658352 |
_version_ | 1783680563740147712 |
---|---|
author | Cui, Zhe Cui, Ya Gao, Yan Jiang, Tao Zang, Tianyi Wang, Yadong |
author_facet | Cui, Zhe Cui, Ya Gao, Yan Jiang, Tao Zang, Tianyi Wang, Yadong |
author_sort | Cui, Zhe |
collection | PubMed |
description | Single-cell Assay Transposase Accessible Chromatin sequencing (scATAC-seq) has been widely used in profiling genome-wide chromatin accessibility in thousands of individual cells. However, compared with single-cell RNA-seq, the peaks of scATAC-seq are much sparser due to the lower copy numbers (diploid in humans) and the inherent missing signals, which makes it more challenging to classify cell type based on specific expressed gene or other canonical markers. Here, we present svmATAC, a support vector machine (SVM)-based method for accurately identifying cell types in scATAC-seq datasets by enhancing peak signal strength and imputing signals through patterns of co-accessibility. We applied svmATAC to several scATAC-seq data from human immune cells, human hematopoietic system cells, and peripheral blood mononuclear cells. The benchmark results showed that svmATAC is free of literature-based markers and robust across datasets in different libraries and platforms. The source code of svmATAC is available at https://github.com/mrcuizhe/svmATAC under the MIT license. |
format | Online Article Text |
id | pubmed-8056015 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-80560152021-04-21 Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq Cui, Zhe Cui, Ya Gao, Yan Jiang, Tao Zang, Tianyi Wang, Yadong Front Genet Genetics Single-cell Assay Transposase Accessible Chromatin sequencing (scATAC-seq) has been widely used in profiling genome-wide chromatin accessibility in thousands of individual cells. However, compared with single-cell RNA-seq, the peaks of scATAC-seq are much sparser due to the lower copy numbers (diploid in humans) and the inherent missing signals, which makes it more challenging to classify cell type based on specific expressed gene or other canonical markers. Here, we present svmATAC, a support vector machine (SVM)-based method for accurately identifying cell types in scATAC-seq datasets by enhancing peak signal strength and imputing signals through patterns of co-accessibility. We applied svmATAC to several scATAC-seq data from human immune cells, human hematopoietic system cells, and peripheral blood mononuclear cells. The benchmark results showed that svmATAC is free of literature-based markers and robust across datasets in different libraries and platforms. The source code of svmATAC is available at https://github.com/mrcuizhe/svmATAC under the MIT license. Frontiers Media S.A. 2021-04-06 /pmc/articles/PMC8056015/ /pubmed/33889181 http://dx.doi.org/10.3389/fgene.2021.658352 Text en Copyright © 2021 Cui, Cui, Gao, Jiang, Zang and Wang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Cui, Zhe Cui, Ya Gao, Yan Jiang, Tao Zang, Tianyi Wang, Yadong Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq |
title | Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq |
title_full | Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq |
title_fullStr | Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq |
title_full_unstemmed | Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq |
title_short | Enhancement and Imputation of Peak Signal Enables Accurate Cell-Type Classification in scATAC-seq |
title_sort | enhancement and imputation of peak signal enables accurate cell-type classification in scatac-seq |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8056015/ https://www.ncbi.nlm.nih.gov/pubmed/33889181 http://dx.doi.org/10.3389/fgene.2021.658352 |
work_keys_str_mv | AT cuizhe enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq AT cuiya enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq AT gaoyan enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq AT jiangtao enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq AT zangtianyi enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq AT wangyadong enhancementandimputationofpeaksignalenablesaccuratecelltypeclassificationinscatacseq |