Cargando…

A theorem proving approach for automatically synthesizing visualizations of flow cytometry data

BACKGROUND: Polychromatic flow cytometry is a popular technique that has wide usage in the medical sciences, especially for studying phenotypic properties of cells. The high-dimensionality of data generated by flow cytometry usually makes it difficult to visualize. The naive solution of simply plott...

Descripción completa

Detalles Bibliográficos
Autores principales: Raj, Sunny, Hussain, Faraz, Husein, Zubir, Torosdagli, Neslisah, Turgut, Damla, Deo, Narsingh, Pattanaik, Sumanta, Chang, Chung-Che (Jeff), Jha, Sumit Kumar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471952/
https://www.ncbi.nlm.nih.gov/pubmed/28617220
http://dx.doi.org/10.1186/s12859-017-1662-4
_version_ 1783244052551958528
author Raj, Sunny
Hussain, Faraz
Husein, Zubir
Torosdagli, Neslisah
Turgut, Damla
Deo, Narsingh
Pattanaik, Sumanta
Chang, Chung-Che (Jeff)
Jha, Sumit Kumar
author_facet Raj, Sunny
Hussain, Faraz
Husein, Zubir
Torosdagli, Neslisah
Turgut, Damla
Deo, Narsingh
Pattanaik, Sumanta
Chang, Chung-Che (Jeff)
Jha, Sumit Kumar
author_sort Raj, Sunny
collection PubMed
description BACKGROUND: Polychromatic flow cytometry is a popular technique that has wide usage in the medical sciences, especially for studying phenotypic properties of cells. The high-dimensionality of data generated by flow cytometry usually makes it difficult to visualize. The naive solution of simply plotting two-dimensional graphs for every combination of observables becomes impractical as the number of dimensions increases. A natural solution is to project the data from the original high dimensional space to a lower dimensional space while approximately preserving the overall relationship between the data points. The expert can then easily visualize and analyze this low-dimensional embedding of the original dataset. RESULTS: This paper describes a new method, SANJAY, for visualizing high-dimensional flow cytometry datasets. This technique uses a decision procedure to automatically synthesize two-dimensional and three-dimensional projections of the original high-dimensional data while trying to minimize distortion. We compare SANJAY to the popular multidimensional scaling (MDS) approach for visualization of small data sets drawn from a representative set of benchmarks, and our experiments show that SANJAY produces distortions that are 1.44 to 4.15 times smaller than those caused due to MDS. Our experimental results show that SANJAY also outperforms the Random Projections technique in terms of the distortions in the projections. CONCLUSIONS: We describe a new algorithmic technique that uses a symbolic decision procedure to automatically synthesize low-dimensional projections of flow cytometry data that typically have a high number of dimensions. Our algorithm is the first application, to our knowledge, of using automated theorem proving for automatically generating highly-accurate, low-dimensional visualizations of high-dimensional data.
format Online
Article
Text
id pubmed-5471952
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54719522017-06-19 A theorem proving approach for automatically synthesizing visualizations of flow cytometry data Raj, Sunny Hussain, Faraz Husein, Zubir Torosdagli, Neslisah Turgut, Damla Deo, Narsingh Pattanaik, Sumanta Chang, Chung-Che (Jeff) Jha, Sumit Kumar BMC Bioinformatics Research BACKGROUND: Polychromatic flow cytometry is a popular technique that has wide usage in the medical sciences, especially for studying phenotypic properties of cells. The high-dimensionality of data generated by flow cytometry usually makes it difficult to visualize. The naive solution of simply plotting two-dimensional graphs for every combination of observables becomes impractical as the number of dimensions increases. A natural solution is to project the data from the original high dimensional space to a lower dimensional space while approximately preserving the overall relationship between the data points. The expert can then easily visualize and analyze this low-dimensional embedding of the original dataset. RESULTS: This paper describes a new method, SANJAY, for visualizing high-dimensional flow cytometry datasets. This technique uses a decision procedure to automatically synthesize two-dimensional and three-dimensional projections of the original high-dimensional data while trying to minimize distortion. We compare SANJAY to the popular multidimensional scaling (MDS) approach for visualization of small data sets drawn from a representative set of benchmarks, and our experiments show that SANJAY produces distortions that are 1.44 to 4.15 times smaller than those caused due to MDS. Our experimental results show that SANJAY also outperforms the Random Projections technique in terms of the distortions in the projections. CONCLUSIONS: We describe a new algorithmic technique that uses a symbolic decision procedure to automatically synthesize low-dimensional projections of flow cytometry data that typically have a high number of dimensions. Our algorithm is the first application, to our knowledge, of using automated theorem proving for automatically generating highly-accurate, low-dimensional visualizations of high-dimensional data. BioMed Central 2017-06-07 /pmc/articles/PMC5471952/ /pubmed/28617220 http://dx.doi.org/10.1186/s12859-017-1662-4 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Raj, Sunny
Hussain, Faraz
Husein, Zubir
Torosdagli, Neslisah
Turgut, Damla
Deo, Narsingh
Pattanaik, Sumanta
Chang, Chung-Che (Jeff)
Jha, Sumit Kumar
A theorem proving approach for automatically synthesizing visualizations of flow cytometry data
title A theorem proving approach for automatically synthesizing visualizations of flow cytometry data
title_full A theorem proving approach for automatically synthesizing visualizations of flow cytometry data
title_fullStr A theorem proving approach for automatically synthesizing visualizations of flow cytometry data
title_full_unstemmed A theorem proving approach for automatically synthesizing visualizations of flow cytometry data
title_short A theorem proving approach for automatically synthesizing visualizations of flow cytometry data
title_sort theorem proving approach for automatically synthesizing visualizations of flow cytometry data
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5471952/
https://www.ncbi.nlm.nih.gov/pubmed/28617220
http://dx.doi.org/10.1186/s12859-017-1662-4
work_keys_str_mv AT rajsunny atheoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT hussainfaraz atheoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT huseinzubir atheoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT torosdaglineslisah atheoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT turgutdamla atheoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT deonarsingh atheoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT pattanaiksumanta atheoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT changchungchejeff atheoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT jhasumitkumar atheoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT rajsunny theoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT hussainfaraz theoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT huseinzubir theoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT torosdaglineslisah theoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT turgutdamla theoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT deonarsingh theoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT pattanaiksumanta theoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT changchungchejeff theoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata
AT jhasumitkumar theoremprovingapproachforautomaticallysynthesizingvisualizationsofflowcytometrydata