Cargando…

pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells

MOTIVATION: While conventional flow cytometry is limited to dozens of markers, new experimental and computational strategies, such as Infinity Flow, allow for the generation and imputation of hundreds of cell surface protein markers in millions of cells. Here, we describe an end-to-end analysis work...

Descripción completa

Detalles Bibliográficos
Autores principales: Ferchen, Kyle, Salomonis, Nathan, Grimes, H Leighton
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10166583/
https://www.ncbi.nlm.nih.gov/pubmed/37097893
http://dx.doi.org/10.1093/bioinformatics/btad287
_version_ 1785038473482207232
author Ferchen, Kyle
Salomonis, Nathan
Grimes, H Leighton
author_facet Ferchen, Kyle
Salomonis, Nathan
Grimes, H Leighton
author_sort Ferchen, Kyle
collection PubMed
description MOTIVATION: While conventional flow cytometry is limited to dozens of markers, new experimental and computational strategies, such as Infinity Flow, allow for the generation and imputation of hundreds of cell surface protein markers in millions of cells. Here, we describe an end-to-end analysis workflow for Infinity Flow data in Python. RESULTS: pyInfinityFlow enables the efficient analysis of millions of cells, without down-sampling, through direct integration with well-established Python packages for single-cell genomics analysis. pyInfinityFlow accurately identifies both common and extremely rare cell populations which are challenging to define from single-cell genomics studies alone. We demonstrate that this workflow can nominate novel markers to design new flow cytometry gating strategies for predicted cell populations. pyInfinityFlow can be extended to diverse cell discovery analyses with flexibility to adapt to diverse Infinity Flow experimental designs. AVAILABILITY AND IMPLEMENTATION: pyInfinityFlow is freely available in GitHub (https://github.com/KyleFerchen/pyInfinityFlow) and on PyPI (https://pypi.org/project/pyInfinityFlow/). Package documentation with tutorials on a test dataset is available by Read the Docs (pyinfinityflow.readthedocs.io). The scripts and data for reproducing the results are available at https://github.com/KyleFerchen/pyInfinityFlow/tree/main/analysis_scripts, along with the raw flow cytometry input data.
format Online
Article
Text
id pubmed-10166583
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101665832023-05-09 pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells Ferchen, Kyle Salomonis, Nathan Grimes, H Leighton Bioinformatics Applications Note MOTIVATION: While conventional flow cytometry is limited to dozens of markers, new experimental and computational strategies, such as Infinity Flow, allow for the generation and imputation of hundreds of cell surface protein markers in millions of cells. Here, we describe an end-to-end analysis workflow for Infinity Flow data in Python. RESULTS: pyInfinityFlow enables the efficient analysis of millions of cells, without down-sampling, through direct integration with well-established Python packages for single-cell genomics analysis. pyInfinityFlow accurately identifies both common and extremely rare cell populations which are challenging to define from single-cell genomics studies alone. We demonstrate that this workflow can nominate novel markers to design new flow cytometry gating strategies for predicted cell populations. pyInfinityFlow can be extended to diverse cell discovery analyses with flexibility to adapt to diverse Infinity Flow experimental designs. AVAILABILITY AND IMPLEMENTATION: pyInfinityFlow is freely available in GitHub (https://github.com/KyleFerchen/pyInfinityFlow) and on PyPI (https://pypi.org/project/pyInfinityFlow/). Package documentation with tutorials on a test dataset is available by Read the Docs (pyinfinityflow.readthedocs.io). The scripts and data for reproducing the results are available at https://github.com/KyleFerchen/pyInfinityFlow/tree/main/analysis_scripts, along with the raw flow cytometry input data. Oxford University Press 2023-04-25 /pmc/articles/PMC10166583/ /pubmed/37097893 http://dx.doi.org/10.1093/bioinformatics/btad287 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Note
Ferchen, Kyle
Salomonis, Nathan
Grimes, H Leighton
pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells
title pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells
title_full pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells
title_fullStr pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells
title_full_unstemmed pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells
title_short pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells
title_sort pyinfinityflow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells
topic Applications Note
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10166583/
https://www.ncbi.nlm.nih.gov/pubmed/37097893
http://dx.doi.org/10.1093/bioinformatics/btad287
work_keys_str_mv AT ferchenkyle pyinfinityflowoptimizedimputationandanalysisofhighdimensionalflowcytometrydataformillionsofcells
AT salomonisnathan pyinfinityflowoptimizedimputationandanalysisofhighdimensionalflowcytometrydataformillionsofcells
AT grimeshleighton pyinfinityflowoptimizedimputationandanalysisofhighdimensionalflowcytometrydataformillionsofcells