Cargando…
Model-based compound hypothesis testing for snATAC-seq data with PACS
Single nucleus ATAC-seq (snATAC-seq) experimental designs have become increasingly complex with multiple factors that might affect chromatin accessibility, including cell type, tissue of origin, sample location, batch, etc., whose compound effects are difficult to test by existing methods. In additi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10418058/ https://www.ncbi.nlm.nih.gov/pubmed/37577623 http://dx.doi.org/10.1101/2023.07.30.551108 |
_version_ | 1785088187928936448 |
---|---|
author | Miao, Zhen Wang, Jianqiao Park, Kernyu Kuang, Da Kim, Junhyong |
author_facet | Miao, Zhen Wang, Jianqiao Park, Kernyu Kuang, Da Kim, Junhyong |
author_sort | Miao, Zhen |
collection | PubMed |
description | Single nucleus ATAC-seq (snATAC-seq) experimental designs have become increasingly complex with multiple factors that might affect chromatin accessibility, including cell type, tissue of origin, sample location, batch, etc., whose compound effects are difficult to test by existing methods. In addition, current snATAC-seq data present statistical difficulties due to their sparsity and variations in individual sequence capture. To address these problems, we present a zero-adjusted statistical model, PACS, that can allow complex hypothesis testing of factors that affect accessibility while accounting for sparse and incomplete data. For differential accessibility analysis, PACS controls the false positive rate and achieves on average a 17% to 122% higher power than existing tools. We demonstrate the effectiveness of PACS through several analysis tasks including supervised cell type annotation, compound hypothesis testing, batch effect correction, and spatiotemporal modeling. We apply PACS to several datasets from a variety of tissues and show its ability to reveal previously undiscovered insights in snATAC-seq data. |
format | Online Article Text |
id | pubmed-10418058 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-104180582023-08-12 Model-based compound hypothesis testing for snATAC-seq data with PACS Miao, Zhen Wang, Jianqiao Park, Kernyu Kuang, Da Kim, Junhyong bioRxiv Article Single nucleus ATAC-seq (snATAC-seq) experimental designs have become increasingly complex with multiple factors that might affect chromatin accessibility, including cell type, tissue of origin, sample location, batch, etc., whose compound effects are difficult to test by existing methods. In addition, current snATAC-seq data present statistical difficulties due to their sparsity and variations in individual sequence capture. To address these problems, we present a zero-adjusted statistical model, PACS, that can allow complex hypothesis testing of factors that affect accessibility while accounting for sparse and incomplete data. For differential accessibility analysis, PACS controls the false positive rate and achieves on average a 17% to 122% higher power than existing tools. We demonstrate the effectiveness of PACS through several analysis tasks including supervised cell type annotation, compound hypothesis testing, batch effect correction, and spatiotemporal modeling. We apply PACS to several datasets from a variety of tissues and show its ability to reveal previously undiscovered insights in snATAC-seq data. Cold Spring Harbor Laboratory 2023-07-31 /pmc/articles/PMC10418058/ /pubmed/37577623 http://dx.doi.org/10.1101/2023.07.30.551108 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Miao, Zhen Wang, Jianqiao Park, Kernyu Kuang, Da Kim, Junhyong Model-based compound hypothesis testing for snATAC-seq data with PACS |
title | Model-based compound hypothesis testing for snATAC-seq data with PACS |
title_full | Model-based compound hypothesis testing for snATAC-seq data with PACS |
title_fullStr | Model-based compound hypothesis testing for snATAC-seq data with PACS |
title_full_unstemmed | Model-based compound hypothesis testing for snATAC-seq data with PACS |
title_short | Model-based compound hypothesis testing for snATAC-seq data with PACS |
title_sort | model-based compound hypothesis testing for snatac-seq data with pacs |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10418058/ https://www.ncbi.nlm.nih.gov/pubmed/37577623 http://dx.doi.org/10.1101/2023.07.30.551108 |
work_keys_str_mv | AT miaozhen modelbasedcompoundhypothesistestingforsnatacseqdatawithpacs AT wangjianqiao modelbasedcompoundhypothesistestingforsnatacseqdatawithpacs AT parkkernyu modelbasedcompoundhypothesistestingforsnatacseqdatawithpacs AT kuangda modelbasedcompoundhypothesistestingforsnatacseqdatawithpacs AT kimjunhyong modelbasedcompoundhypothesistestingforsnatacseqdatawithpacs |