Cargando…

Language workbench user interfaces for data analysis

Biological data analysis is frequently performed with command line software. While this practice provides considerable flexibility for computationally savy individuals, such as investigators trained in bioinformatics, this also creates a barrier to the widespread use of data analysis software by inv...

Descripción completa

Detalles Bibliográficos
Autores principales: Benson, Victoria M., Campagne, Fabien
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349052/
https://www.ncbi.nlm.nih.gov/pubmed/25755929
http://dx.doi.org/10.7717/peerj.800
_version_ 1782359990398353408
author Benson, Victoria M.
Campagne, Fabien
author_facet Benson, Victoria M.
Campagne, Fabien
author_sort Benson, Victoria M.
collection PubMed
description Biological data analysis is frequently performed with command line software. While this practice provides considerable flexibility for computationally savy individuals, such as investigators trained in bioinformatics, this also creates a barrier to the widespread use of data analysis software by investigators trained as biologists and/or clinicians. Workflow systems such as Galaxy and Taverna have been developed to try and provide generic user interfaces that can wrap command line analysis software. These solutions are useful for problems that can be solved with workflows, and that do not require specialized user interfaces. However, some types of analyses can benefit from custom user interfaces. For instance, developing biomarker models from high-throughput data is a type of analysis that can be expressed more succinctly with specialized user interfaces. Here, we show how Language Workbench (LW) technology can be used to model the biomarker development and validation process. We developed a language that models the concepts of Dataset, Endpoint, Feature Selection Method and Classifier. These high-level language concepts map directly to abstractions that analysts who develop biomarker models are familiar with. We found that user interfaces developed in the Meta-Programming System (MPS) LW provide convenient means to configure a biomarker development project, to train models and view the validation statistics. We discuss several advantages of developing user interfaces for data analysis with a LW, including increased interface consistency, portability and extension by language composition. The language developed during this experiment is distributed as an MPS plugin (available at http://campagnelab.org/software/bdval-for-mps/).
format Online
Article
Text
id pubmed-4349052
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-43490522015-03-09 Language workbench user interfaces for data analysis Benson, Victoria M. Campagne, Fabien PeerJ Bioinformatics Biological data analysis is frequently performed with command line software. While this practice provides considerable flexibility for computationally savy individuals, such as investigators trained in bioinformatics, this also creates a barrier to the widespread use of data analysis software by investigators trained as biologists and/or clinicians. Workflow systems such as Galaxy and Taverna have been developed to try and provide generic user interfaces that can wrap command line analysis software. These solutions are useful for problems that can be solved with workflows, and that do not require specialized user interfaces. However, some types of analyses can benefit from custom user interfaces. For instance, developing biomarker models from high-throughput data is a type of analysis that can be expressed more succinctly with specialized user interfaces. Here, we show how Language Workbench (LW) technology can be used to model the biomarker development and validation process. We developed a language that models the concepts of Dataset, Endpoint, Feature Selection Method and Classifier. These high-level language concepts map directly to abstractions that analysts who develop biomarker models are familiar with. We found that user interfaces developed in the Meta-Programming System (MPS) LW provide convenient means to configure a biomarker development project, to train models and view the validation statistics. We discuss several advantages of developing user interfaces for data analysis with a LW, including increased interface consistency, portability and extension by language composition. The language developed during this experiment is distributed as an MPS plugin (available at http://campagnelab.org/software/bdval-for-mps/). PeerJ Inc. 2015-02-24 /pmc/articles/PMC4349052/ /pubmed/25755929 http://dx.doi.org/10.7717/peerj.800 Text en © 2015 Benson and Campagne http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Benson, Victoria M.
Campagne, Fabien
Language workbench user interfaces for data analysis
title Language workbench user interfaces for data analysis
title_full Language workbench user interfaces for data analysis
title_fullStr Language workbench user interfaces for data analysis
title_full_unstemmed Language workbench user interfaces for data analysis
title_short Language workbench user interfaces for data analysis
title_sort language workbench user interfaces for data analysis
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349052/
https://www.ncbi.nlm.nih.gov/pubmed/25755929
http://dx.doi.org/10.7717/peerj.800
work_keys_str_mv AT bensonvictoriam languageworkbenchuserinterfacesfordataanalysis
AT campagnefabien languageworkbenchuserinterfacesfordataanalysis