Cargando…
pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells
MOTIVATION: While conventional flow cytometry is limited to dozens of markers, new experimental and computational strategies, such as Infinity Flow, allow for the generation and imputation of hundreds of cell surface protein markers in millions of cells. Here, we describe an end-to-end analysis work...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10166583/ https://www.ncbi.nlm.nih.gov/pubmed/37097893 http://dx.doi.org/10.1093/bioinformatics/btad287 |
_version_ | 1785038473482207232 |
---|---|
author | Ferchen, Kyle Salomonis, Nathan Grimes, H Leighton |
author_facet | Ferchen, Kyle Salomonis, Nathan Grimes, H Leighton |
author_sort | Ferchen, Kyle |
collection | PubMed |
description | MOTIVATION: While conventional flow cytometry is limited to dozens of markers, new experimental and computational strategies, such as Infinity Flow, allow for the generation and imputation of hundreds of cell surface protein markers in millions of cells. Here, we describe an end-to-end analysis workflow for Infinity Flow data in Python. RESULTS: pyInfinityFlow enables the efficient analysis of millions of cells, without down-sampling, through direct integration with well-established Python packages for single-cell genomics analysis. pyInfinityFlow accurately identifies both common and extremely rare cell populations which are challenging to define from single-cell genomics studies alone. We demonstrate that this workflow can nominate novel markers to design new flow cytometry gating strategies for predicted cell populations. pyInfinityFlow can be extended to diverse cell discovery analyses with flexibility to adapt to diverse Infinity Flow experimental designs. AVAILABILITY AND IMPLEMENTATION: pyInfinityFlow is freely available in GitHub (https://github.com/KyleFerchen/pyInfinityFlow) and on PyPI (https://pypi.org/project/pyInfinityFlow/). Package documentation with tutorials on a test dataset is available by Read the Docs (pyinfinityflow.readthedocs.io). The scripts and data for reproducing the results are available at https://github.com/KyleFerchen/pyInfinityFlow/tree/main/analysis_scripts, along with the raw flow cytometry input data. |
format | Online Article Text |
id | pubmed-10166583 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101665832023-05-09 pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells Ferchen, Kyle Salomonis, Nathan Grimes, H Leighton Bioinformatics Applications Note MOTIVATION: While conventional flow cytometry is limited to dozens of markers, new experimental and computational strategies, such as Infinity Flow, allow for the generation and imputation of hundreds of cell surface protein markers in millions of cells. Here, we describe an end-to-end analysis workflow for Infinity Flow data in Python. RESULTS: pyInfinityFlow enables the efficient analysis of millions of cells, without down-sampling, through direct integration with well-established Python packages for single-cell genomics analysis. pyInfinityFlow accurately identifies both common and extremely rare cell populations which are challenging to define from single-cell genomics studies alone. We demonstrate that this workflow can nominate novel markers to design new flow cytometry gating strategies for predicted cell populations. pyInfinityFlow can be extended to diverse cell discovery analyses with flexibility to adapt to diverse Infinity Flow experimental designs. AVAILABILITY AND IMPLEMENTATION: pyInfinityFlow is freely available in GitHub (https://github.com/KyleFerchen/pyInfinityFlow) and on PyPI (https://pypi.org/project/pyInfinityFlow/). Package documentation with tutorials on a test dataset is available by Read the Docs (pyinfinityflow.readthedocs.io). The scripts and data for reproducing the results are available at https://github.com/KyleFerchen/pyInfinityFlow/tree/main/analysis_scripts, along with the raw flow cytometry input data. Oxford University Press 2023-04-25 /pmc/articles/PMC10166583/ /pubmed/37097893 http://dx.doi.org/10.1093/bioinformatics/btad287 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Note Ferchen, Kyle Salomonis, Nathan Grimes, H Leighton pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells |
title | pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells |
title_full | pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells |
title_fullStr | pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells |
title_full_unstemmed | pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells |
title_short | pyInfinityFlow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells |
title_sort | pyinfinityflow: optimized imputation and analysis of high-dimensional flow cytometry data for millions of cells |
topic | Applications Note |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10166583/ https://www.ncbi.nlm.nih.gov/pubmed/37097893 http://dx.doi.org/10.1093/bioinformatics/btad287 |
work_keys_str_mv | AT ferchenkyle pyinfinityflowoptimizedimputationandanalysisofhighdimensionalflowcytometrydataformillionsofcells AT salomonisnathan pyinfinityflowoptimizedimputationandanalysisofhighdimensionalflowcytometrydataformillionsofcells AT grimeshleighton pyinfinityflowoptimizedimputationandanalysisofhighdimensionalflowcytometrydataformillionsofcells |