Cargando…
Effective and scalable single-cell data alignment with non-linear canonical correlation analysis
Data alignment is one of the first key steps in single cell analysis for integrating multiple datasets and performing joint analysis across studies. Data alignment is challenging in extremely large datasets, however, as the major of the current single cell data alignment methods are not computationa...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8887421/ https://www.ncbi.nlm.nih.gov/pubmed/34871454 http://dx.doi.org/10.1093/nar/gkab1147 |
_version_ | 1784660890330595328 |
---|---|
author | Hu, Jialu Chen, Mengjie Zhou, Xiang |
author_facet | Hu, Jialu Chen, Mengjie Zhou, Xiang |
author_sort | Hu, Jialu |
collection | PubMed |
description | Data alignment is one of the first key steps in single cell analysis for integrating multiple datasets and performing joint analysis across studies. Data alignment is challenging in extremely large datasets, however, as the major of the current single cell data alignment methods are not computationally efficient. Here, we present VIPCCA, a computational framework based on non-linear canonical correlation analysis for effective and scalable single cell data alignment. VIPCCA leverages both deep learning for effective single cell data modeling and variational inference for scalable computation, thus enabling powerful data alignment across multiple samples, multiple data platforms, and multiple data types. VIPCCA is accurate for a range of alignment tasks including alignment between single cell RNAseq and ATACseq datasets and can easily accommodate millions of cells, thereby providing researchers unique opportunities to tackle challenges emerging from large-scale single-cell atlas. |
format | Online Article Text |
id | pubmed-8887421 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-88874212022-03-02 Effective and scalable single-cell data alignment with non-linear canonical correlation analysis Hu, Jialu Chen, Mengjie Zhou, Xiang Nucleic Acids Res Methods Online Data alignment is one of the first key steps in single cell analysis for integrating multiple datasets and performing joint analysis across studies. Data alignment is challenging in extremely large datasets, however, as the major of the current single cell data alignment methods are not computationally efficient. Here, we present VIPCCA, a computational framework based on non-linear canonical correlation analysis for effective and scalable single cell data alignment. VIPCCA leverages both deep learning for effective single cell data modeling and variational inference for scalable computation, thus enabling powerful data alignment across multiple samples, multiple data platforms, and multiple data types. VIPCCA is accurate for a range of alignment tasks including alignment between single cell RNAseq and ATACseq datasets and can easily accommodate millions of cells, thereby providing researchers unique opportunities to tackle challenges emerging from large-scale single-cell atlas. Oxford University Press 2021-12-06 /pmc/articles/PMC8887421/ /pubmed/34871454 http://dx.doi.org/10.1093/nar/gkab1147 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Hu, Jialu Chen, Mengjie Zhou, Xiang Effective and scalable single-cell data alignment with non-linear canonical correlation analysis |
title | Effective and scalable single-cell data alignment with non-linear canonical correlation analysis |
title_full | Effective and scalable single-cell data alignment with non-linear canonical correlation analysis |
title_fullStr | Effective and scalable single-cell data alignment with non-linear canonical correlation analysis |
title_full_unstemmed | Effective and scalable single-cell data alignment with non-linear canonical correlation analysis |
title_short | Effective and scalable single-cell data alignment with non-linear canonical correlation analysis |
title_sort | effective and scalable single-cell data alignment with non-linear canonical correlation analysis |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8887421/ https://www.ncbi.nlm.nih.gov/pubmed/34871454 http://dx.doi.org/10.1093/nar/gkab1147 |
work_keys_str_mv | AT hujialu effectiveandscalablesinglecelldataalignmentwithnonlinearcanonicalcorrelationanalysis AT chenmengjie effectiveandscalablesinglecelldataalignmentwithnonlinearcanonicalcorrelationanalysis AT zhouxiang effectiveandscalablesinglecelldataalignmentwithnonlinearcanonicalcorrelationanalysis |