Cargando…
A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets
In this paper we propose a web-based approach for quick visualization of big data from brain magnetic resonance imaging (MRI) scans using a combination of an automated image capture and processing system, nonlinear embedding, and interactive data visualization tools. We draw upon thousands of MRI sc...
Autores principales: | , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4791544/ https://www.ncbi.nlm.nih.gov/pubmed/27014049 http://dx.doi.org/10.3389/fninf.2016.00009 |
_version_ | 1782421110651879424 |
---|---|
author | Panta, Sandeep R. Wang, Runtang Fries, Jill Kalyanam, Ravi Speer, Nicole Banich, Marie Kiehl, Kent King, Margaret Milham, Michael Wager, Tor D. Turner, Jessica A. Plis, Sergey M. Calhoun, Vince D. |
author_facet | Panta, Sandeep R. Wang, Runtang Fries, Jill Kalyanam, Ravi Speer, Nicole Banich, Marie Kiehl, Kent King, Margaret Milham, Michael Wager, Tor D. Turner, Jessica A. Plis, Sergey M. Calhoun, Vince D. |
author_sort | Panta, Sandeep R. |
collection | PubMed |
description | In this paper we propose a web-based approach for quick visualization of big data from brain magnetic resonance imaging (MRI) scans using a combination of an automated image capture and processing system, nonlinear embedding, and interactive data visualization tools. We draw upon thousands of MRI scans captured via the COllaborative Imaging and Neuroinformatics Suite (COINS). We then interface the output of several analysis pipelines based on structural and functional data to a t-distributed stochastic neighbor embedding (t-SNE) algorithm which reduces the number of dimensions for each scan in the input data set to two dimensions while preserving the local structure of data sets. Finally, we interactively display the output of this approach via a web-page, based on data driven documents (D3) JavaScript library. Two distinct approaches were used to visualize the data. In the first approach, we computed multiple quality control (QC) values from pre-processed data, which were used as inputs to the t-SNE algorithm. This approach helps in assessing the quality of each data set relative to others. In the second case, computed variables of interest (e.g., brain volume or voxel values from segmented gray matter images) were used as inputs to the t-SNE algorithm. This approach helps in identifying interesting patterns in the data sets. We demonstrate these approaches using multiple examples from over 10,000 data sets including (1) quality control measures calculated from phantom data over time, (2) quality control data from human functional MRI data across various studies, scanners, sites, (3) volumetric and density measures from human structural MRI data across various studies, scanners and sites. Results from (1) and (2) show the potential of our approach to combine t-SNE data reduction with interactive color coding of variables of interest to quickly identify visually unique clusters of data (i.e., data sets with poor QC, clustering of data by site) quickly. Results from (3) demonstrate interesting patterns of gray matter and volume, and evaluate how they map onto variables including scanners, age, and gender. In sum, the proposed approach allows researchers to rapidly identify and extract meaningful information from big data sets. Such tools are becoming increasingly important as datasets grow larger. |
format | Online Article Text |
id | pubmed-4791544 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-47915442016-03-24 A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets Panta, Sandeep R. Wang, Runtang Fries, Jill Kalyanam, Ravi Speer, Nicole Banich, Marie Kiehl, Kent King, Margaret Milham, Michael Wager, Tor D. Turner, Jessica A. Plis, Sergey M. Calhoun, Vince D. Front Neuroinform Neuroscience In this paper we propose a web-based approach for quick visualization of big data from brain magnetic resonance imaging (MRI) scans using a combination of an automated image capture and processing system, nonlinear embedding, and interactive data visualization tools. We draw upon thousands of MRI scans captured via the COllaborative Imaging and Neuroinformatics Suite (COINS). We then interface the output of several analysis pipelines based on structural and functional data to a t-distributed stochastic neighbor embedding (t-SNE) algorithm which reduces the number of dimensions for each scan in the input data set to two dimensions while preserving the local structure of data sets. Finally, we interactively display the output of this approach via a web-page, based on data driven documents (D3) JavaScript library. Two distinct approaches were used to visualize the data. In the first approach, we computed multiple quality control (QC) values from pre-processed data, which were used as inputs to the t-SNE algorithm. This approach helps in assessing the quality of each data set relative to others. In the second case, computed variables of interest (e.g., brain volume or voxel values from segmented gray matter images) were used as inputs to the t-SNE algorithm. This approach helps in identifying interesting patterns in the data sets. We demonstrate these approaches using multiple examples from over 10,000 data sets including (1) quality control measures calculated from phantom data over time, (2) quality control data from human functional MRI data across various studies, scanners, sites, (3) volumetric and density measures from human structural MRI data across various studies, scanners and sites. Results from (1) and (2) show the potential of our approach to combine t-SNE data reduction with interactive color coding of variables of interest to quickly identify visually unique clusters of data (i.e., data sets with poor QC, clustering of data by site) quickly. Results from (3) demonstrate interesting patterns of gray matter and volume, and evaluate how they map onto variables including scanners, age, and gender. In sum, the proposed approach allows researchers to rapidly identify and extract meaningful information from big data sets. Such tools are becoming increasingly important as datasets grow larger. Frontiers Media S.A. 2016-03-15 /pmc/articles/PMC4791544/ /pubmed/27014049 http://dx.doi.org/10.3389/fninf.2016.00009 Text en Copyright © 2016 Panta, Wang, Fries, Kalyanam, Speer, Banich, Kiehl, King, Milham, Wager, Turner, Plis and Calhoun. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Panta, Sandeep R. Wang, Runtang Fries, Jill Kalyanam, Ravi Speer, Nicole Banich, Marie Kiehl, Kent King, Margaret Milham, Michael Wager, Tor D. Turner, Jessica A. Plis, Sergey M. Calhoun, Vince D. A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets |
title | A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets |
title_full | A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets |
title_fullStr | A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets |
title_full_unstemmed | A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets |
title_short | A Tool for Interactive Data Visualization: Application to Over 10,000 Brain Imaging and Phantom MRI Data Sets |
title_sort | tool for interactive data visualization: application to over 10,000 brain imaging and phantom mri data sets |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4791544/ https://www.ncbi.nlm.nih.gov/pubmed/27014049 http://dx.doi.org/10.3389/fninf.2016.00009 |
work_keys_str_mv | AT pantasandeepr atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT wangruntang atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT friesjill atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT kalyanamravi atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT speernicole atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT banichmarie atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT kiehlkent atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT kingmargaret atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT milhammichael atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT wagertord atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT turnerjessicaa atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT plissergeym atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT calhounvinced atoolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT pantasandeepr toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT wangruntang toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT friesjill toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT kalyanamravi toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT speernicole toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT banichmarie toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT kiehlkent toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT kingmargaret toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT milhammichael toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT wagertord toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT turnerjessicaa toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT plissergeym toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets AT calhounvinced toolforinteractivedatavisualizationapplicationtoover10000brainimagingandphantommridatasets |