Cargando…

Effective and scalable single-cell data alignment with non-linear canonical correlation analysis

Data alignment is one of the first key steps in single cell analysis for integrating multiple datasets and performing joint analysis across studies. Data alignment is challenging in extremely large datasets, however, as the major of the current single cell data alignment methods are not computationa...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Jialu, Chen, Mengjie, Zhou, Xiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8887421/
https://www.ncbi.nlm.nih.gov/pubmed/34871454
http://dx.doi.org/10.1093/nar/gkab1147
_version_ 1784660890330595328
author Hu, Jialu
Chen, Mengjie
Zhou, Xiang
author_facet Hu, Jialu
Chen, Mengjie
Zhou, Xiang
author_sort Hu, Jialu
collection PubMed
description Data alignment is one of the first key steps in single cell analysis for integrating multiple datasets and performing joint analysis across studies. Data alignment is challenging in extremely large datasets, however, as the major of the current single cell data alignment methods are not computationally efficient. Here, we present VIPCCA, a computational framework based on non-linear canonical correlation analysis for effective and scalable single cell data alignment. VIPCCA leverages both deep learning for effective single cell data modeling and variational inference for scalable computation, thus enabling powerful data alignment across multiple samples, multiple data platforms, and multiple data types. VIPCCA is accurate for a range of alignment tasks including alignment between single cell RNAseq and ATACseq datasets and can easily accommodate millions of cells, thereby providing researchers unique opportunities to tackle challenges emerging from large-scale single-cell atlas.
format Online
Article
Text
id pubmed-8887421
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-88874212022-03-02 Effective and scalable single-cell data alignment with non-linear canonical correlation analysis Hu, Jialu Chen, Mengjie Zhou, Xiang Nucleic Acids Res Methods Online Data alignment is one of the first key steps in single cell analysis for integrating multiple datasets and performing joint analysis across studies. Data alignment is challenging in extremely large datasets, however, as the major of the current single cell data alignment methods are not computationally efficient. Here, we present VIPCCA, a computational framework based on non-linear canonical correlation analysis for effective and scalable single cell data alignment. VIPCCA leverages both deep learning for effective single cell data modeling and variational inference for scalable computation, thus enabling powerful data alignment across multiple samples, multiple data platforms, and multiple data types. VIPCCA is accurate for a range of alignment tasks including alignment between single cell RNAseq and ATACseq datasets and can easily accommodate millions of cells, thereby providing researchers unique opportunities to tackle challenges emerging from large-scale single-cell atlas. Oxford University Press 2021-12-06 /pmc/articles/PMC8887421/ /pubmed/34871454 http://dx.doi.org/10.1093/nar/gkab1147 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Hu, Jialu
Chen, Mengjie
Zhou, Xiang
Effective and scalable single-cell data alignment with non-linear canonical correlation analysis
title Effective and scalable single-cell data alignment with non-linear canonical correlation analysis
title_full Effective and scalable single-cell data alignment with non-linear canonical correlation analysis
title_fullStr Effective and scalable single-cell data alignment with non-linear canonical correlation analysis
title_full_unstemmed Effective and scalable single-cell data alignment with non-linear canonical correlation analysis
title_short Effective and scalable single-cell data alignment with non-linear canonical correlation analysis
title_sort effective and scalable single-cell data alignment with non-linear canonical correlation analysis
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8887421/
https://www.ncbi.nlm.nih.gov/pubmed/34871454
http://dx.doi.org/10.1093/nar/gkab1147
work_keys_str_mv AT hujialu effectiveandscalablesinglecelldataalignmentwithnonlinearcanonicalcorrelationanalysis
AT chenmengjie effectiveandscalablesinglecelldataalignmentwithnonlinearcanonicalcorrelationanalysis
AT zhouxiang effectiveandscalablesinglecelldataalignmentwithnonlinearcanonicalcorrelationanalysis