Cargando…

Deep learning-based enhancement of epigenomics data with AtacWorks

ATAC-seq is a widely-applied assay used to measure genome-wide chromatin accessibility; however, its ability to detect active regulatory regions can depend on the depth of sequencing coverage and the signal-to-noise ratio. Here we introduce AtacWorks, a deep learning toolkit to denoise sequencing co...

Descripción completa

Detalles Bibliográficos
Autores principales: Lal, Avantika, Chiang, Zachary D., Yakovenko, Nikolai, Duarte, Fabiana M., Israeli, Johnny, Buenrostro, Jason D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7940635/
https://www.ncbi.nlm.nih.gov/pubmed/33686069
http://dx.doi.org/10.1038/s41467-021-21765-5
_version_ 1783661981324017664
author Lal, Avantika
Chiang, Zachary D.
Yakovenko, Nikolai
Duarte, Fabiana M.
Israeli, Johnny
Buenrostro, Jason D.
author_facet Lal, Avantika
Chiang, Zachary D.
Yakovenko, Nikolai
Duarte, Fabiana M.
Israeli, Johnny
Buenrostro, Jason D.
author_sort Lal, Avantika
collection PubMed
description ATAC-seq is a widely-applied assay used to measure genome-wide chromatin accessibility; however, its ability to detect active regulatory regions can depend on the depth of sequencing coverage and the signal-to-noise ratio. Here we introduce AtacWorks, a deep learning toolkit to denoise sequencing coverage and identify regulatory peaks at base-pair resolution from low cell count, low-coverage, or low-quality ATAC-seq data. Models trained by AtacWorks can detect peaks from cell types not seen in the training data, and are generalizable across diverse sample preparations and experimental platforms. We demonstrate that AtacWorks enhances the sensitivity of single-cell experiments by producing results on par with those of conventional methods using ~10 times as many cells, and further show that this framework can be adapted to enable cross-modality inference of protein-DNA interactions. Finally, we establish that AtacWorks can enable new biological discoveries by identifying active regulatory regions associated with lineage priming in rare subpopulations of hematopoietic stem cells.
format Online
Article
Text
id pubmed-7940635
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-79406352021-03-28 Deep learning-based enhancement of epigenomics data with AtacWorks Lal, Avantika Chiang, Zachary D. Yakovenko, Nikolai Duarte, Fabiana M. Israeli, Johnny Buenrostro, Jason D. Nat Commun Article ATAC-seq is a widely-applied assay used to measure genome-wide chromatin accessibility; however, its ability to detect active regulatory regions can depend on the depth of sequencing coverage and the signal-to-noise ratio. Here we introduce AtacWorks, a deep learning toolkit to denoise sequencing coverage and identify regulatory peaks at base-pair resolution from low cell count, low-coverage, or low-quality ATAC-seq data. Models trained by AtacWorks can detect peaks from cell types not seen in the training data, and are generalizable across diverse sample preparations and experimental platforms. We demonstrate that AtacWorks enhances the sensitivity of single-cell experiments by producing results on par with those of conventional methods using ~10 times as many cells, and further show that this framework can be adapted to enable cross-modality inference of protein-DNA interactions. Finally, we establish that AtacWorks can enable new biological discoveries by identifying active regulatory regions associated with lineage priming in rare subpopulations of hematopoietic stem cells. Nature Publishing Group UK 2021-03-08 /pmc/articles/PMC7940635/ /pubmed/33686069 http://dx.doi.org/10.1038/s41467-021-21765-5 Text en © The Author(s) 2021 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Lal, Avantika
Chiang, Zachary D.
Yakovenko, Nikolai
Duarte, Fabiana M.
Israeli, Johnny
Buenrostro, Jason D.
Deep learning-based enhancement of epigenomics data with AtacWorks
title Deep learning-based enhancement of epigenomics data with AtacWorks
title_full Deep learning-based enhancement of epigenomics data with AtacWorks
title_fullStr Deep learning-based enhancement of epigenomics data with AtacWorks
title_full_unstemmed Deep learning-based enhancement of epigenomics data with AtacWorks
title_short Deep learning-based enhancement of epigenomics data with AtacWorks
title_sort deep learning-based enhancement of epigenomics data with atacworks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7940635/
https://www.ncbi.nlm.nih.gov/pubmed/33686069
http://dx.doi.org/10.1038/s41467-021-21765-5
work_keys_str_mv AT lalavantika deeplearningbasedenhancementofepigenomicsdatawithatacworks
AT chiangzacharyd deeplearningbasedenhancementofepigenomicsdatawithatacworks
AT yakovenkonikolai deeplearningbasedenhancementofepigenomicsdatawithatacworks
AT duartefabianam deeplearningbasedenhancementofepigenomicsdatawithatacworks
AT israelijohnny deeplearningbasedenhancementofepigenomicsdatawithatacworks
AT buenrostrojasond deeplearningbasedenhancementofepigenomicsdatawithatacworks