Cargando…
Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets
While the methods available for single-cell ATAC-seq analysis are well optimized for clustering cell types, the question of how to integrate multiple scATAC-seq data sets and/or sequencing modalities is still open. We present an analysis framework that enables such integration across scATAC-seq data...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7337516/ https://www.ncbi.nlm.nih.gov/pubmed/32392348 http://dx.doi.org/10.1093/nar/gkaa349 |
_version_ | 1783554530440380416 |
---|---|
author | Erbe, Rossin Kessler, Michael D Favorov, Alexander V Easwaran, Hariharan Gaykalova, Daria A Fertig, Elana J |
author_facet | Erbe, Rossin Kessler, Michael D Favorov, Alexander V Easwaran, Hariharan Gaykalova, Daria A Fertig, Elana J |
author_sort | Erbe, Rossin |
collection | PubMed |
description | While the methods available for single-cell ATAC-seq analysis are well optimized for clustering cell types, the question of how to integrate multiple scATAC-seq data sets and/or sequencing modalities is still open. We present an analysis framework that enables such integration across scATAC-seq data sets by applying the CoGAPS Matrix Factorization algorithm and the projectR transfer learning program to identify common regulatory patterns across scATAC-seq data sets. We additionally integrate our analysis with scRNA-seq data to identify orthogonal evidence for transcriptional regulators predicted by scATAC-seq analysis. Using publicly available scATAC-seq data, we find patterns that accurately characterize cell types both within and across data sets. Furthermore, we demonstrate that these patterns are both consistent with current biological understanding and reflective of novel regulatory biology. |
format | Online Article Text |
id | pubmed-7337516 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-73375162020-07-13 Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets Erbe, Rossin Kessler, Michael D Favorov, Alexander V Easwaran, Hariharan Gaykalova, Daria A Fertig, Elana J Nucleic Acids Res Methods Online While the methods available for single-cell ATAC-seq analysis are well optimized for clustering cell types, the question of how to integrate multiple scATAC-seq data sets and/or sequencing modalities is still open. We present an analysis framework that enables such integration across scATAC-seq data sets by applying the CoGAPS Matrix Factorization algorithm and the projectR transfer learning program to identify common regulatory patterns across scATAC-seq data sets. We additionally integrate our analysis with scRNA-seq data to identify orthogonal evidence for transcriptional regulators predicted by scATAC-seq analysis. Using publicly available scATAC-seq data, we find patterns that accurately characterize cell types both within and across data sets. Furthermore, we demonstrate that these patterns are both consistent with current biological understanding and reflective of novel regulatory biology. Oxford University Press 2020-07-09 2020-05-11 /pmc/articles/PMC7337516/ /pubmed/32392348 http://dx.doi.org/10.1093/nar/gkaa349 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Erbe, Rossin Kessler, Michael D Favorov, Alexander V Easwaran, Hariharan Gaykalova, Daria A Fertig, Elana J Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets |
title | Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets |
title_full | Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets |
title_fullStr | Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets |
title_full_unstemmed | Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets |
title_short | Matrix factorization and transfer learning uncover regulatory biology across multiple single-cell ATAC-seq data sets |
title_sort | matrix factorization and transfer learning uncover regulatory biology across multiple single-cell atac-seq data sets |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7337516/ https://www.ncbi.nlm.nih.gov/pubmed/32392348 http://dx.doi.org/10.1093/nar/gkaa349 |
work_keys_str_mv | AT erberossin matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets AT kesslermichaeld matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets AT favorovalexanderv matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets AT easwaranhariharan matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets AT gaykalovadariaa matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets AT fertigelanaj matrixfactorizationandtransferlearninguncoverregulatorybiologyacrossmultiplesinglecellatacseqdatasets |