Cargando…

Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets

While the methods available for single-cell ATAC-seq analysis are well optimized for clustering cell types, the question of how to integrate multiple scATAC-seq data sets and/or sequencing modalities is still open. We present an analysis framework that enables such integration across scATAC-seq data...

Descripción completa

Detalles Bibliográficos
Autores principales: Erbe, Rossin, Kessler, Michael D, Favorov, Alexander V, Easwaran, Hariharan, Gaykalova, Daria A, Fertig, Elana J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7337516/
https://www.ncbi.nlm.nih.gov/pubmed/32392348
http://dx.doi.org/10.1093/nar/gkaa349
_version_ 1783554530440380416
author Erbe, Rossin
Kessler, Michael D
Favorov, Alexander V
Easwaran, Hariharan
Gaykalova, Daria A
Fertig, Elana J
author_facet Erbe, Rossin
Kessler, Michael D
Favorov, Alexander V
Easwaran, Hariharan
Gaykalova, Daria A
Fertig, Elana J
author_sort Erbe, Rossin
collection PubMed
description While the methods available for single-cell ATAC-seq analysis are well optimized for clustering cell types, the question of how to integrate multiple scATAC-seq data sets and/or sequencing modalities is still open. We present an analysis framework that enables such integration across scATAC-seq data sets by applying the CoGAPS Matrix Factorization algorithm and the projectR transfer learning program to identify common regulatory patterns across scATAC-seq data sets. We additionally integrate our analysis with scRNA-seq data to identify orthogonal evidence for transcriptional regulators predicted by scATAC-seq analysis. Using publicly available scATAC-seq data, we find patterns that accurately characterize cell types both within and across data sets. Furthermore, we demonstrate that these patterns are both consistent with current biological understanding and reflective of novel regulatory biology.
format Online
Article
Text
id pubmed-7337516
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-73375162020-07-13 Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets Erbe, Rossin Kessler, Michael D Favorov, Alexander V Easwaran, Hariharan Gaykalova, Daria A Fertig, Elana J Nucleic Acids Res Methods Online While the methods available for single-cell ATAC-seq analysis are well optimized for clustering cell types, the question of how to integrate multiple scATAC-seq data sets and/or sequencing modalities is still open. We present an analysis framework that enables such integration across scATAC-seq data sets by applying the CoGAPS Matrix Factorization algorithm and the projectR transfer learning program to identify common regulatory patterns across scATAC-seq data sets. We additionally integrate our analysis with scRNA-seq data to identify orthogonal evidence for transcriptional regulators predicted by scATAC-seq analysis. Using publicly available scATAC-seq data, we find patterns that accurately characterize cell types both within and across data sets. Furthermore, we demonstrate that these patterns are both consistent with current biological understanding and reflective of novel regulatory biology. Oxford University Press 2020-07-09 2020-05-11 /pmc/articles/PMC7337516/ /pubmed/32392348 http://dx.doi.org/10.1093/nar/gkaa349 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Erbe, Rossin
Kessler, Michael D
Favorov, Alexander V
Easwaran, Hariharan
Gaykalova, Daria A
Fertig, Elana J
Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets
title Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets
title_full Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets
title_fullStr Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets
title_full_unstemmed Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets
title_short Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets
title_sort matrix factorization and transfer learning uncover regulatory biology across multiple single-cell atac-seq data sets
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7337516/
https://www.ncbi.nlm.nih.gov/pubmed/32392348
http://dx.doi.org/10.1093/nar/gkaa349
work_keys_str_mv AT erberossin matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets
AT kesslermichaeld matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets
AT favorovalexanderv matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets
AT easwaranhariharan matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets
AT gaykalovadariaa matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets
AT fertigelanaj matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets