Cargando…

Automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization

When examining datasets of any dimensionality, researchers frequently aim to identify individual subsets (clusters) of objects within the dataset. The ubiquity of multidimensional data has motivated the replacement of user-guided clustering with fully automated clustering. The fully automated method...

Descripción completa

Detalles Bibliográficos
Autores principales: Meehan, Stephen, Kolyagin, Gleb A., Parks, David, Youngyunpipatkul, Justin, Herzenberg, Leonore A., Walther, Guenther, Ghosn, Eliver E. B., Orlova, Darya Y.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6586874/
https://www.ncbi.nlm.nih.gov/pubmed/31240267
http://dx.doi.org/10.1038/s42003-019-0467-6
_version_ 1783428962357084160
author Meehan, Stephen
Kolyagin, Gleb A.
Parks, David
Youngyunpipatkul, Justin
Herzenberg, Leonore A.
Walther, Guenther
Ghosn, Eliver E. B.
Orlova, Darya Y.
author_facet Meehan, Stephen
Kolyagin, Gleb A.
Parks, David
Youngyunpipatkul, Justin
Herzenberg, Leonore A.
Walther, Guenther
Ghosn, Eliver E. B.
Orlova, Darya Y.
author_sort Meehan, Stephen
collection PubMed
description When examining datasets of any dimensionality, researchers frequently aim to identify individual subsets (clusters) of objects within the dataset. The ubiquity of multidimensional data has motivated the replacement of user-guided clustering with fully automated clustering. The fully automated methods are designed to make clustering more accurate, standardized and faster. However, the adoption of these methods is still limited by the lack of intuitive visualization and cluster matching methods that would allow users to readily interpret fully automatically generated clusters. To address these issues, we developed a fully automated subset identification and characterization (SIC) pipeline providing robust cluster matching and data visualization tools for high-dimensional flow/mass cytometry (and other) data. This pipeline automatically (and intuitively) generates two-dimensional representations of high-dimensional datasets that are safe from the curse of dimensionality. This new approach allows more robust and reproducible data analysis,+ facilitating the development of new gold standard practices across laboratories and institutions.
format Online
Article
Text
id pubmed-6586874
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-65868742019-06-25 Automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization Meehan, Stephen Kolyagin, Gleb A. Parks, David Youngyunpipatkul, Justin Herzenberg, Leonore A. Walther, Guenther Ghosn, Eliver E. B. Orlova, Darya Y. Commun Biol Article When examining datasets of any dimensionality, researchers frequently aim to identify individual subsets (clusters) of objects within the dataset. The ubiquity of multidimensional data has motivated the replacement of user-guided clustering with fully automated clustering. The fully automated methods are designed to make clustering more accurate, standardized and faster. However, the adoption of these methods is still limited by the lack of intuitive visualization and cluster matching methods that would allow users to readily interpret fully automatically generated clusters. To address these issues, we developed a fully automated subset identification and characterization (SIC) pipeline providing robust cluster matching and data visualization tools for high-dimensional flow/mass cytometry (and other) data. This pipeline automatically (and intuitively) generates two-dimensional representations of high-dimensional datasets that are safe from the curse of dimensionality. This new approach allows more robust and reproducible data analysis,+ facilitating the development of new gold standard practices across laboratories and institutions. Nature Publishing Group UK 2019-06-20 /pmc/articles/PMC6586874/ /pubmed/31240267 http://dx.doi.org/10.1038/s42003-019-0467-6 Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Meehan, Stephen
Kolyagin, Gleb A.
Parks, David
Youngyunpipatkul, Justin
Herzenberg, Leonore A.
Walther, Guenther
Ghosn, Eliver E. B.
Orlova, Darya Y.
Automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization
title Automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization
title_full Automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization
title_fullStr Automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization
title_full_unstemmed Automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization
title_short Automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization
title_sort automated subset identification and characterization pipeline for multidimensional flow and mass cytometry data clustering and visualization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6586874/
https://www.ncbi.nlm.nih.gov/pubmed/31240267
http://dx.doi.org/10.1038/s42003-019-0467-6
work_keys_str_mv AT meehanstephen automatedsubsetidentificationandcharacterizationpipelineformultidimensionalflowandmasscytometrydataclusteringandvisualization
AT kolyagingleba automatedsubsetidentificationandcharacterizationpipelineformultidimensionalflowandmasscytometrydataclusteringandvisualization
AT parksdavid automatedsubsetidentificationandcharacterizationpipelineformultidimensionalflowandmasscytometrydataclusteringandvisualization
AT youngyunpipatkuljustin automatedsubsetidentificationandcharacterizationpipelineformultidimensionalflowandmasscytometrydataclusteringandvisualization
AT herzenbergleonorea automatedsubsetidentificationandcharacterizationpipelineformultidimensionalflowandmasscytometrydataclusteringandvisualization
AT waltherguenther automatedsubsetidentificationandcharacterizationpipelineformultidimensionalflowandmasscytometrydataclusteringandvisualization
AT ghosnelivereb automatedsubsetidentificationandcharacterizationpipelineformultidimensionalflowandmasscytometrydataclusteringandvisualization
AT orlovadaryay automatedsubsetidentificationandcharacterizationpipelineformultidimensionalflowandmasscytometrydataclusteringandvisualization