Cargando…

SWIFT—Scalable Clustering for Automated Identification of Rare Cell Populations in Large, High-Dimensional Flow Cytometry Datasets, Part 2: Biological Evaluation

A multistage clustering and data processing method, SWIFT (detailed in a companion manuscript), has been developed to detect rare subpopulations in large, high-dimensional flow cytometry datasets. An iterative sampling procedure initially fits the data to multidimensional Gaussian distributions, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Mosmann, Tim R, Naim, Iftekhar, Rebhahn, Jonathan, Datta, Suprakash, Cavenaugh, James S, Weaver, Jason M, Sharma, Gaurav
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BlackWell Publishing Ltd 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4238823/
https://www.ncbi.nlm.nih.gov/pubmed/24532172
http://dx.doi.org/10.1002/cyto.a.22445
_version_ 1782345521576280064
author Mosmann, Tim R
Naim, Iftekhar
Rebhahn, Jonathan
Datta, Suprakash
Cavenaugh, James S
Weaver, Jason M
Sharma, Gaurav
author_facet Mosmann, Tim R
Naim, Iftekhar
Rebhahn, Jonathan
Datta, Suprakash
Cavenaugh, James S
Weaver, Jason M
Sharma, Gaurav
author_sort Mosmann, Tim R
collection PubMed
description A multistage clustering and data processing method, SWIFT (detailed in a companion manuscript), has been developed to detect rare subpopulations in large, high-dimensional flow cytometry datasets. An iterative sampling procedure initially fits the data to multidimensional Gaussian distributions, then splitting and merging stages use a criterion of unimodality to optimize the detection of rare subpopulations, to converge on a consistent cluster number, and to describe non-Gaussian distributions. Probabilistic assignment of cells to clusters, visualization, and manipulation of clusters by their cluster medians, facilitate application of expert knowledge using standard flow cytometry programs. The dual problems of rigorously comparing similar complex samples, and enumerating absent or very rare cell subpopulations in negative controls, were solved by assigning cells in multiple samples to a cluster template derived from a single or combined sample. Comparison of antigen-stimulated and control human peripheral blood cell samples demonstrated that SWIFT could identify biologically significant subpopulations, such as rare cytokine-producing influenza-specific T cells. A sensitivity of better than one part per million was attained in very large samples. Results were highly consistent on biological replicates, yet the analysis was sensitive enough to show that multiple samples from the same subject were more similar than samples from different subjects. A companion manuscript (Part 1) details the algorithmic development of SWIFT. © 2014 The Authors. Published by Wiley Periodicals Inc.
format Online
Article
Text
id pubmed-4238823
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BlackWell Publishing Ltd
record_format MEDLINE/PubMed
spelling pubmed-42388232014-11-28 SWIFT—Scalable Clustering for Automated Identification of Rare Cell Populations in Large, High-Dimensional Flow Cytometry Datasets, Part 2: Biological Evaluation Mosmann, Tim R Naim, Iftekhar Rebhahn, Jonathan Datta, Suprakash Cavenaugh, James S Weaver, Jason M Sharma, Gaurav Cytometry A Original Articles A multistage clustering and data processing method, SWIFT (detailed in a companion manuscript), has been developed to detect rare subpopulations in large, high-dimensional flow cytometry datasets. An iterative sampling procedure initially fits the data to multidimensional Gaussian distributions, then splitting and merging stages use a criterion of unimodality to optimize the detection of rare subpopulations, to converge on a consistent cluster number, and to describe non-Gaussian distributions. Probabilistic assignment of cells to clusters, visualization, and manipulation of clusters by their cluster medians, facilitate application of expert knowledge using standard flow cytometry programs. The dual problems of rigorously comparing similar complex samples, and enumerating absent or very rare cell subpopulations in negative controls, were solved by assigning cells in multiple samples to a cluster template derived from a single or combined sample. Comparison of antigen-stimulated and control human peripheral blood cell samples demonstrated that SWIFT could identify biologically significant subpopulations, such as rare cytokine-producing influenza-specific T cells. A sensitivity of better than one part per million was attained in very large samples. Results were highly consistent on biological replicates, yet the analysis was sensitive enough to show that multiple samples from the same subject were more similar than samples from different subjects. A companion manuscript (Part 1) details the algorithmic development of SWIFT. © 2014 The Authors. Published by Wiley Periodicals Inc. BlackWell Publishing Ltd 2014-05 2014-02-14 /pmc/articles/PMC4238823/ /pubmed/24532172 http://dx.doi.org/10.1002/cyto.a.22445 Text en © 2014 The Authors. Published by Wiley Periodicals Inc. http://creativecommons.org/licenses/by-nc/3.0/ This is an open access article under the terms of the Creative Commons Attribution-NonCommercial License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Original Articles
Mosmann, Tim R
Naim, Iftekhar
Rebhahn, Jonathan
Datta, Suprakash
Cavenaugh, James S
Weaver, Jason M
Sharma, Gaurav
SWIFT—Scalable Clustering for Automated Identification of Rare Cell Populations in Large, High-Dimensional Flow Cytometry Datasets, Part 2: Biological Evaluation
title SWIFT—Scalable Clustering for Automated Identification of Rare Cell Populations in Large, High-Dimensional Flow Cytometry Datasets, Part 2: Biological Evaluation
title_full SWIFT—Scalable Clustering for Automated Identification of Rare Cell Populations in Large, High-Dimensional Flow Cytometry Datasets, Part 2: Biological Evaluation
title_fullStr SWIFT—Scalable Clustering for Automated Identification of Rare Cell Populations in Large, High-Dimensional Flow Cytometry Datasets, Part 2: Biological Evaluation
title_full_unstemmed SWIFT—Scalable Clustering for Automated Identification of Rare Cell Populations in Large, High-Dimensional Flow Cytometry Datasets, Part 2: Biological Evaluation
title_short SWIFT—Scalable Clustering for Automated Identification of Rare Cell Populations in Large, High-Dimensional Flow Cytometry Datasets, Part 2: Biological Evaluation
title_sort swift—scalable clustering for automated identification of rare cell populations in large, high-dimensional flow cytometry datasets, part 2: biological evaluation
topic Original Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4238823/
https://www.ncbi.nlm.nih.gov/pubmed/24532172
http://dx.doi.org/10.1002/cyto.a.22445
work_keys_str_mv AT mosmanntimr swiftscalableclusteringforautomatedidentificationofrarecellpopulationsinlargehighdimensionalflowcytometrydatasetspart2biologicalevaluation
AT naimiftekhar swiftscalableclusteringforautomatedidentificationofrarecellpopulationsinlargehighdimensionalflowcytometrydatasetspart2biologicalevaluation
AT rebhahnjonathan swiftscalableclusteringforautomatedidentificationofrarecellpopulationsinlargehighdimensionalflowcytometrydatasetspart2biologicalevaluation
AT dattasuprakash swiftscalableclusteringforautomatedidentificationofrarecellpopulationsinlargehighdimensionalflowcytometrydatasetspart2biologicalevaluation
AT cavenaughjamess swiftscalableclusteringforautomatedidentificationofrarecellpopulationsinlargehighdimensionalflowcytometrydatasetspart2biologicalevaluation
AT weaverjasonm swiftscalableclusteringforautomatedidentificationofrarecellpopulationsinlargehighdimensionalflowcytometrydatasetspart2biologicalevaluation
AT sharmagaurav swiftscalableclusteringforautomatedidentificationofrarecellpopulationsinlargehighdimensionalflowcytometrydatasetspart2biologicalevaluation