Cargando…

Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets

While the methods available for single-cell ATAC-seq analysis are well optimized for clustering cell types, the question of how to integrate multiple scATAC-seq data sets and/or sequencing modalities is still open. We present an analysis framework that enables such integration across scATAC-seq data...

Descripción completa

Detalles Bibliográficos
Autores principales: Erbe, Rossin, Kessler, Michael D, Favorov, Alexander V, Easwaran, Hariharan, Gaykalova, Daria A, Fertig, Elana J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7337516/
https://www.ncbi.nlm.nih.gov/pubmed/32392348
http://dx.doi.org/10.1093/nar/gkaa349
Descripción
Sumario:While the methods available for single-cell ATAC-seq analysis are well optimized for clustering cell types, the question of how to integrate multiple scATAC-seq data sets and/or sequencing modalities is still open. We present an analysis framework that enables such integration across scATAC-seq data sets by applying the CoGAPS Matrix Factorization algorithm and the projectR transfer learning program to identify common regulatory patterns across scATAC-seq data sets. We additionally integrate our analysis with scRNA-seq data to identify orthogonal evidence for transcriptional regulators predicted by scATAC-seq analysis. Using publicly available scATAC-seq data, we find patterns that accurately characterize cell types both within and across data sets. Furthermore, we demonstrate that these patterns are both consistent with current biological understanding and reflective of novel regulatory biology.