Cargando…

Towards human-computer synergetic analysis of large-scale biological data

BACKGROUND: Advances in technology have led to the generation of massive amounts of complex and multifarious biological data in areas ranging from genomics to structural biology. The volume and complexity of such data leads to significant challenges in terms of its analysis, especially when one seek...

Descripción completa

Detalles Bibliográficos
Autores principales: Singh, Rahul, Yang, Hui, Dalziel, Ben, Asarnow, Daniel, Murad, William, Foote, David, Gormley, Matthew, Stillman, Jonathan, Fisher, Susan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3851181/
https://www.ncbi.nlm.nih.gov/pubmed/24267485
http://dx.doi.org/10.1186/1471-2105-14-S14-S10
_version_ 1782294241566785536
author Singh, Rahul
Yang, Hui
Dalziel, Ben
Asarnow, Daniel
Murad, William
Foote, David
Gormley, Matthew
Stillman, Jonathan
Fisher, Susan
author_facet Singh, Rahul
Yang, Hui
Dalziel, Ben
Asarnow, Daniel
Murad, William
Foote, David
Gormley, Matthew
Stillman, Jonathan
Fisher, Susan
author_sort Singh, Rahul
collection PubMed
description BACKGROUND: Advances in technology have led to the generation of massive amounts of complex and multifarious biological data in areas ranging from genomics to structural biology. The volume and complexity of such data leads to significant challenges in terms of its analysis, especially when one seeks to generate hypotheses or explore the underlying biological processes. At the state-of-the-art, the application of automated algorithms followed by perusal and analysis of the results by an expert continues to be the predominant paradigm for analyzing biological data. This paradigm works well in many problem domains. However, it also is limiting, since domain experts are forced to apply their instincts and expertise such as contextual reasoning, hypothesis formulation, and exploratory analysis after the algorithm has produced its results. In many areas where the organization and interaction of the biological processes is poorly understood and exploratory analysis is crucial, what is needed is to integrate domain expertise during the data analysis process and use it to drive the analysis itself. RESULTS: In context of the aforementioned background, the results presented in this paper describe advancements along two methodological directions. First, given the context of biological data, we utilize and extend a design approach called experiential computing from multimedia information system design. This paradigm combines information visualization and human-computer interaction with algorithms for exploratory analysis of large-scale and complex data. In the proposed approach, emphasis is laid on: (1) allowing users to directly visualize, interact, experience, and explore the data through interoperable visualization-based and algorithmic components, (2) supporting unified query and presentation spaces to facilitate experimentation and exploration, (3) providing external contextual information by assimilating relevant supplementary data, and (4) encouraging user-directed information visualization, data exploration, and hypotheses formulation. Second, to illustrate the proposed design paradigm and measure its efficacy, we describe two prototype web applications. The first, called XMAS (Experiential Microarray Analysis System) is designed for analysis of time-series transcriptional data. The second system, called PSPACE (Protein Space Explorer) is designed for holistic analysis of structural and structure-function relationships using interactive low-dimensional maps of the protein structure space. Both these systems promote and facilitate human-computer synergy, where cognitive elements such as domain knowledge, contextual reasoning, and purpose-driven exploration, are integrated with a host of powerful algorithmic operations that support large-scale data analysis, multifaceted data visualization, and multi-source information integration. CONCLUSIONS: The proposed design philosophy, combines visualization, algorithmic components and cognitive expertise into a seamless processing-analysis-exploration framework that facilitates sense-making, exploration, and discovery. Using XMAS, we present case studies that analyze transcriptional data from two highly complex domains: gene expression in the placenta during human pregnancy and reaction of marine organisms to heat stress. With PSPACE, we demonstrate how complex structure-function relationships can be explored. These results demonstrate the novelty, advantages, and distinctions of the proposed paradigm. Furthermore, the results also highlight how domain insights can be combined with algorithms to discover meaningful knowledge and formulate evidence-based hypotheses during the data analysis process. Finally, user studies against comparable systems indicate that both XMAS and PSPACE deliver results with better interpretability while placing lower cognitive loads on the users. XMAS is available at: http://tintin.sfsu.edu:8080/xmas. PSPACE is available at: http://pspace.info/.
format Online
Article
Text
id pubmed-3851181
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38511812013-12-13 Towards human-computer synergetic analysis of large-scale biological data Singh, Rahul Yang, Hui Dalziel, Ben Asarnow, Daniel Murad, William Foote, David Gormley, Matthew Stillman, Jonathan Fisher, Susan BMC Bioinformatics Proceedings BACKGROUND: Advances in technology have led to the generation of massive amounts of complex and multifarious biological data in areas ranging from genomics to structural biology. The volume and complexity of such data leads to significant challenges in terms of its analysis, especially when one seeks to generate hypotheses or explore the underlying biological processes. At the state-of-the-art, the application of automated algorithms followed by perusal and analysis of the results by an expert continues to be the predominant paradigm for analyzing biological data. This paradigm works well in many problem domains. However, it also is limiting, since domain experts are forced to apply their instincts and expertise such as contextual reasoning, hypothesis formulation, and exploratory analysis after the algorithm has produced its results. In many areas where the organization and interaction of the biological processes is poorly understood and exploratory analysis is crucial, what is needed is to integrate domain expertise during the data analysis process and use it to drive the analysis itself. RESULTS: In context of the aforementioned background, the results presented in this paper describe advancements along two methodological directions. First, given the context of biological data, we utilize and extend a design approach called experiential computing from multimedia information system design. This paradigm combines information visualization and human-computer interaction with algorithms for exploratory analysis of large-scale and complex data. In the proposed approach, emphasis is laid on: (1) allowing users to directly visualize, interact, experience, and explore the data through interoperable visualization-based and algorithmic components, (2) supporting unified query and presentation spaces to facilitate experimentation and exploration, (3) providing external contextual information by assimilating relevant supplementary data, and (4) encouraging user-directed information visualization, data exploration, and hypotheses formulation. Second, to illustrate the proposed design paradigm and measure its efficacy, we describe two prototype web applications. The first, called XMAS (Experiential Microarray Analysis System) is designed for analysis of time-series transcriptional data. The second system, called PSPACE (Protein Space Explorer) is designed for holistic analysis of structural and structure-function relationships using interactive low-dimensional maps of the protein structure space. Both these systems promote and facilitate human-computer synergy, where cognitive elements such as domain knowledge, contextual reasoning, and purpose-driven exploration, are integrated with a host of powerful algorithmic operations that support large-scale data analysis, multifaceted data visualization, and multi-source information integration. CONCLUSIONS: The proposed design philosophy, combines visualization, algorithmic components and cognitive expertise into a seamless processing-analysis-exploration framework that facilitates sense-making, exploration, and discovery. Using XMAS, we present case studies that analyze transcriptional data from two highly complex domains: gene expression in the placenta during human pregnancy and reaction of marine organisms to heat stress. With PSPACE, we demonstrate how complex structure-function relationships can be explored. These results demonstrate the novelty, advantages, and distinctions of the proposed paradigm. Furthermore, the results also highlight how domain insights can be combined with algorithms to discover meaningful knowledge and formulate evidence-based hypotheses during the data analysis process. Finally, user studies against comparable systems indicate that both XMAS and PSPACE deliver results with better interpretability while placing lower cognitive loads on the users. XMAS is available at: http://tintin.sfsu.edu:8080/xmas. PSPACE is available at: http://pspace.info/. BioMed Central 2013-10-09 /pmc/articles/PMC3851181/ /pubmed/24267485 http://dx.doi.org/10.1186/1471-2105-14-S14-S10 Text en Copyright © 2013 Singh et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Singh, Rahul
Yang, Hui
Dalziel, Ben
Asarnow, Daniel
Murad, William
Foote, David
Gormley, Matthew
Stillman, Jonathan
Fisher, Susan
Towards human-computer synergetic analysis of large-scale biological data
title Towards human-computer synergetic analysis of large-scale biological data
title_full Towards human-computer synergetic analysis of large-scale biological data
title_fullStr Towards human-computer synergetic analysis of large-scale biological data
title_full_unstemmed Towards human-computer synergetic analysis of large-scale biological data
title_short Towards human-computer synergetic analysis of large-scale biological data
title_sort towards human-computer synergetic analysis of large-scale biological data
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3851181/
https://www.ncbi.nlm.nih.gov/pubmed/24267485
http://dx.doi.org/10.1186/1471-2105-14-S14-S10
work_keys_str_mv AT singhrahul towardshumancomputersynergeticanalysisoflargescalebiologicaldata
AT yanghui towardshumancomputersynergeticanalysisoflargescalebiologicaldata
AT dalzielben towardshumancomputersynergeticanalysisoflargescalebiologicaldata
AT asarnowdaniel towardshumancomputersynergeticanalysisoflargescalebiologicaldata
AT muradwilliam towardshumancomputersynergeticanalysisoflargescalebiologicaldata
AT footedavid towardshumancomputersynergeticanalysisoflargescalebiologicaldata
AT gormleymatthew towardshumancomputersynergeticanalysisoflargescalebiologicaldata
AT stillmanjonathan towardshumancomputersynergeticanalysisoflargescalebiologicaldata
AT fishersusan towardshumancomputersynergeticanalysisoflargescalebiologicaldata