Cargando…

Model-based compound hypothesis testing for snATAC-seq data with PACS

Single nucleus ATAC-seq (snATAC-seq) experimental designs have become increasingly complex with multiple factors that might affect chromatin accessibility, including cell type, tissue of origin, sample location, batch, etc., whose compound effects are difficult to test by existing methods. In additi...

Descripción completa

Detalles Bibliográficos
Autores principales: Miao, Zhen, Wang, Jianqiao, Park, Kernyu, Kuang, Da, Kim, Junhyong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10418058/
https://www.ncbi.nlm.nih.gov/pubmed/37577623
http://dx.doi.org/10.1101/2023.07.30.551108
_version_ 1785088187928936448
author Miao, Zhen
Wang, Jianqiao
Park, Kernyu
Kuang, Da
Kim, Junhyong
author_facet Miao, Zhen
Wang, Jianqiao
Park, Kernyu
Kuang, Da
Kim, Junhyong
author_sort Miao, Zhen
collection PubMed
description Single nucleus ATAC-seq (snATAC-seq) experimental designs have become increasingly complex with multiple factors that might affect chromatin accessibility, including cell type, tissue of origin, sample location, batch, etc., whose compound effects are difficult to test by existing methods. In addition, current snATAC-seq data present statistical difficulties due to their sparsity and variations in individual sequence capture. To address these problems, we present a zero-adjusted statistical model, PACS, that can allow complex hypothesis testing of factors that affect accessibility while accounting for sparse and incomplete data. For differential accessibility analysis, PACS controls the false positive rate and achieves on average a 17% to 122% higher power than existing tools. We demonstrate the effectiveness of PACS through several analysis tasks including supervised cell type annotation, compound hypothesis testing, batch effect correction, and spatiotemporal modeling. We apply PACS to several datasets from a variety of tissues and show its ability to reveal previously undiscovered insights in snATAC-seq data.
format Online
Article
Text
id pubmed-10418058
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-104180582023-08-12 Model-based compound hypothesis testing for snATAC-seq data with PACS Miao, Zhen Wang, Jianqiao Park, Kernyu Kuang, Da Kim, Junhyong bioRxiv Article Single nucleus ATAC-seq (snATAC-seq) experimental designs have become increasingly complex with multiple factors that might affect chromatin accessibility, including cell type, tissue of origin, sample location, batch, etc., whose compound effects are difficult to test by existing methods. In addition, current snATAC-seq data present statistical difficulties due to their sparsity and variations in individual sequence capture. To address these problems, we present a zero-adjusted statistical model, PACS, that can allow complex hypothesis testing of factors that affect accessibility while accounting for sparse and incomplete data. For differential accessibility analysis, PACS controls the false positive rate and achieves on average a 17% to 122% higher power than existing tools. We demonstrate the effectiveness of PACS through several analysis tasks including supervised cell type annotation, compound hypothesis testing, batch effect correction, and spatiotemporal modeling. We apply PACS to several datasets from a variety of tissues and show its ability to reveal previously undiscovered insights in snATAC-seq data. Cold Spring Harbor Laboratory 2023-07-31 /pmc/articles/PMC10418058/ /pubmed/37577623 http://dx.doi.org/10.1101/2023.07.30.551108 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Miao, Zhen
Wang, Jianqiao
Park, Kernyu
Kuang, Da
Kim, Junhyong
Model-based compound hypothesis testing for snATAC-seq data with PACS
title Model-based compound hypothesis testing for snATAC-seq data with PACS
title_full Model-based compound hypothesis testing for snATAC-seq data with PACS
title_fullStr Model-based compound hypothesis testing for snATAC-seq data with PACS
title_full_unstemmed Model-based compound hypothesis testing for snATAC-seq data with PACS
title_short Model-based compound hypothesis testing for snATAC-seq data with PACS
title_sort model-based compound hypothesis testing for snatac-seq data with pacs
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10418058/
https://www.ncbi.nlm.nih.gov/pubmed/37577623
http://dx.doi.org/10.1101/2023.07.30.551108
work_keys_str_mv AT miaozhen modelbasedcompoundhypothesistestingforsnatacseqdatawithpacs
AT wangjianqiao modelbasedcompoundhypothesistestingforsnatacseqdatawithpacs
AT parkkernyu modelbasedcompoundhypothesistestingforsnatacseqdatawithpacs
AT kuangda modelbasedcompoundhypothesistestingforsnatacseqdatawithpacs
AT kimjunhyong modelbasedcompoundhypothesistestingforsnatacseqdatawithpacs