Cargando…

A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets

In this paper we propose a web-based approach for quick visualization of big data from brain magnetic resonance imaging (MRI) scans using a combination of an automated image capture and processing system, nonlinear embedding, and interactive data visualization tools. We draw upon thousands of MRI sc...

Descripción completa

Detalles Bibliográficos
Autores principales: Panta, Sandeep R., Wang, Runtang, Fries, Jill, Kalyanam, Ravi, Speer, Nicole, Banich, Marie, Kiehl, Kent, King, Margaret, Milham, Michael, Wager, Tor D., Turner, Jessica A., Plis, Sergey M., Calhoun, Vince D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4791544/
https://www.ncbi.nlm.nih.gov/pubmed/27014049
http://dx.doi.org/10.3389/fninf.2016.00009
_version_ 1782421110651879424
author Panta, Sandeep R.
Wang, Runtang
Fries, Jill
Kalyanam, Ravi
Speer, Nicole
Banich, Marie
Kiehl, Kent
King, Margaret
Milham, Michael
Wager, Tor D.
Turner, Jessica A.
Plis, Sergey M.
Calhoun, Vince D.
author_facet Panta, Sandeep R.
Wang, Runtang
Fries, Jill
Kalyanam, Ravi
Speer, Nicole
Banich, Marie
Kiehl, Kent
King, Margaret
Milham, Michael
Wager, Tor D.
Turner, Jessica A.
Plis, Sergey M.
Calhoun, Vince D.
author_sort Panta, Sandeep R.
collection PubMed
description In this paper we propose a web-based approach for quick visualization of big data from brain magnetic resonance imaging (MRI) scans using a combination of an automated image capture and processing system, nonlinear embedding, and interactive data visualization tools. We draw upon thousands of MRI scans captured via the COllaborative Imaging and Neuroinformatics Suite (COINS). We then interface the output of several analysis pipelines based on structural and functional data to a t-distributed stochastic neighbor embedding (t-SNE) algorithm which reduces the number of dimensions for each scan in the input data set to two dimensions while preserving the local structure of data sets. Finally, we interactively display the output of this approach via a web-page, based on data driven documents (D3) JavaScript library. Two distinct approaches were used to visualize the data. In the first approach, we computed multiple quality control (QC) values from pre-processed data, which were used as inputs to the t-SNE algorithm. This approach helps in assessing the quality of each data set relative to others. In the second case, computed variables of interest (e.g., brain volume or voxel values from segmented gray matter images) were used as inputs to the t-SNE algorithm. This approach helps in identifying interesting patterns in the data sets. We demonstrate these approaches using multiple examples from over 10,000 data sets including (1) quality control measures calculated from phantom data over time, (2) quality control data from human functional MRI data across various studies, scanners, sites, (3) volumetric and density measures from human structural MRI data across various studies, scanners and sites. Results from (1) and (2) show the potential of our approach to combine t-SNE data reduction with interactive color coding of variables of interest to quickly identify visually unique clusters of data (i.e., data sets with poor QC, clustering of data by site) quickly. Results from (3) demonstrate interesting patterns of gray matter and volume, and evaluate how they map onto variables including scanners, age, and gender. In sum, the proposed approach allows researchers to rapidly identify and extract meaningful information from big data sets. Such tools are becoming increasingly important as datasets grow larger.
format Online
Article
Text
id pubmed-4791544
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-47915442016-03-24 A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets Panta, Sandeep R. Wang, Runtang Fries, Jill Kalyanam, Ravi Speer, Nicole Banich, Marie Kiehl, Kent King, Margaret Milham, Michael Wager, Tor D. Turner, Jessica A. Plis, Sergey M. Calhoun, Vince D. Front Neuroinform Neuroscience In this paper we propose a web-based approach for quick visualization of big data from brain magnetic resonance imaging (MRI) scans using a combination of an automated image capture and processing system, nonlinear embedding, and interactive data visualization tools. We draw upon thousands of MRI scans captured via the COllaborative Imaging and Neuroinformatics Suite (COINS). We then interface the output of several analysis pipelines based on structural and functional data to a t-distributed stochastic neighbor embedding (t-SNE) algorithm which reduces the number of dimensions for each scan in the input data set to two dimensions while preserving the local structure of data sets. Finally, we interactively display the output of this approach via a web-page, based on data driven documents (D3) JavaScript library. Two distinct approaches were used to visualize the data. In the first approach, we computed multiple quality control (QC) values from pre-processed data, which were used as inputs to the t-SNE algorithm. This approach helps in assessing the quality of each data set relative to others. In the second case, computed variables of interest (e.g., brain volume or voxel values from segmented gray matter images) were used as inputs to the t-SNE algorithm. This approach helps in identifying interesting patterns in the data sets. We demonstrate these approaches using multiple examples from over 10,000 data sets including (1) quality control measures calculated from phantom data over time, (2) quality control data from human functional MRI data across various studies, scanners, sites, (3) volumetric and density measures from human structural MRI data across various studies, scanners and sites. Results from (1) and (2) show the potential of our approach to combine t-SNE data reduction with interactive color coding of variables of interest to quickly identify visually unique clusters of data (i.e., data sets with poor QC, clustering of data by site) quickly. Results from (3) demonstrate interesting patterns of gray matter and volume, and evaluate how they map onto variables including scanners, age, and gender. In sum, the proposed approach allows researchers to rapidly identify and extract meaningful information from big data sets. Such tools are becoming increasingly important as datasets grow larger. Frontiers Media S.A. 2016-03-15 /pmc/articles/PMC4791544/ /pubmed/27014049 http://dx.doi.org/10.3389/fninf.2016.00009 Text en Copyright © 2016 Panta, Wang, Fries, Kalyanam, Speer, Banich, Kiehl, King, Milham, Wager, Turner, Plis and Calhoun. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Panta, Sandeep R.
Wang, Runtang
Fries, Jill
Kalyanam, Ravi
Speer, Nicole
Banich, Marie
Kiehl, Kent
King, Margaret
Milham, Michael
Wager, Tor D.
Turner, Jessica A.
Plis, Sergey M.
Calhoun, Vince D.
A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title_full A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title_fullStr A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title_full_unstemmed A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title_short A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
title_sort tool for interactive data visualization: application to over 10,000 brain imaging and phantom mri data sets
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4791544/
https://www.ncbi.nlm.nih.gov/pubmed/27014049
http://dx.doi.org/10.3389/fninf.2016.00009
work_keys_str_mv AT pantasandeepr atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT wangruntang atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT friesjill atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT kalyanamravi atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT speernicole atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT banichmarie atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT kiehlkent atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT kingmargaret atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT milhammichael atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT wagertord atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT turnerjessicaa atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT plissergeym atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT calhounvinced atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT pantasandeepr toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT wangruntang toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT friesjill toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT kalyanamravi toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT speernicole toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT banichmarie toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT kiehlkent toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT kingmargaret toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT milhammichael toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT wagertord toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT turnerjessicaa toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT plissergeym toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets
AT calhounvinced toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets