Cargando…

Improve your Galaxy text life: The Query Tabular Tool

Galaxy provides an accessible platform where multi-step data analysis workflows integrating disparate software can be run, even by researchers with limited programming expertise. Applications of such sophisticated workflows are many, including those which integrate software from different ‘omic doma...

Descripción completa

Detalles Bibliográficos
Autores principales: Johnson, James E., Kumar, Praveen, Easterly, Caleb, Esler, Mark, Mehta, Subina, Eschenlauer, Arthur C., Hegeman, Adrian D., Jagtap, Pratik D., Griffin, Timothy J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6248266/
https://www.ncbi.nlm.nih.gov/pubmed/30519459
http://dx.doi.org/10.12688/f1000research.16450.2
_version_ 1783372606566563840
author Johnson, James E.
Kumar, Praveen
Easterly, Caleb
Esler, Mark
Mehta, Subina
Eschenlauer, Arthur C.
Hegeman, Adrian D.
Jagtap, Pratik D.
Griffin, Timothy J.
author_facet Johnson, James E.
Kumar, Praveen
Easterly, Caleb
Esler, Mark
Mehta, Subina
Eschenlauer, Arthur C.
Hegeman, Adrian D.
Jagtap, Pratik D.
Griffin, Timothy J.
author_sort Johnson, James E.
collection PubMed
description Galaxy provides an accessible platform where multi-step data analysis workflows integrating disparate software can be run, even by researchers with limited programming expertise. Applications of such sophisticated workflows are many, including those which integrate software from different ‘omic domains (e.g. genomics, proteomics, metabolomics). In these complex workflows, intermediate outputs are often generated as tabular text files, which must be transformed into customized formats which are compatible with the next software tools in the pipeline. Consequently, many text manipulation steps are added to an already complex workflow, overly complicating the process. In some cases, limitations to existing text manipulation are such that desired analyses can only be carried out using highly sophisticated processing steps beyond the reach of even advanced users and developers. For users with some SQL knowledge, these text operations could be combined into single, concise query on a relational database. As a solution, we have developed the Query Tabular Galaxy tool, which leverages a SQLite database generated from tabular input data. This database can be queried and manipulated to produce transformed and customized tabular outputs compatible with downstream processing steps. Regular expressions can also be utilized for even more sophisticated manipulations, such as find and replace and other filtering actions. Using several Galaxy-based multi-omic workflows as an example, we demonstrate how the Query Tabular tool dramatically streamlines and simplifies the creation of multi-step analyses, efficiently enabling complicated textual manipulations and processing. This tool should find broad utility for users of the Galaxy platform seeking to develop and use sophisticated workflows involving text manipulation on tabular outputs.
format Online
Article
Text
id pubmed-6248266
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-62482662018-12-04 Improve your Galaxy text life: The Query Tabular Tool Johnson, James E. Kumar, Praveen Easterly, Caleb Esler, Mark Mehta, Subina Eschenlauer, Arthur C. Hegeman, Adrian D. Jagtap, Pratik D. Griffin, Timothy J. F1000Res Software Tool Article Galaxy provides an accessible platform where multi-step data analysis workflows integrating disparate software can be run, even by researchers with limited programming expertise. Applications of such sophisticated workflows are many, including those which integrate software from different ‘omic domains (e.g. genomics, proteomics, metabolomics). In these complex workflows, intermediate outputs are often generated as tabular text files, which must be transformed into customized formats which are compatible with the next software tools in the pipeline. Consequently, many text manipulation steps are added to an already complex workflow, overly complicating the process. In some cases, limitations to existing text manipulation are such that desired analyses can only be carried out using highly sophisticated processing steps beyond the reach of even advanced users and developers. For users with some SQL knowledge, these text operations could be combined into single, concise query on a relational database. As a solution, we have developed the Query Tabular Galaxy tool, which leverages a SQLite database generated from tabular input data. This database can be queried and manipulated to produce transformed and customized tabular outputs compatible with downstream processing steps. Regular expressions can also be utilized for even more sophisticated manipulations, such as find and replace and other filtering actions. Using several Galaxy-based multi-omic workflows as an example, we demonstrate how the Query Tabular tool dramatically streamlines and simplifies the creation of multi-step analyses, efficiently enabling complicated textual manipulations and processing. This tool should find broad utility for users of the Galaxy platform seeking to develop and use sophisticated workflows involving text manipulation on tabular outputs. F1000 Research Limited 2019-01-09 /pmc/articles/PMC6248266/ /pubmed/30519459 http://dx.doi.org/10.12688/f1000research.16450.2 Text en Copyright: © 2019 Johnson JE et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software Tool Article
Johnson, James E.
Kumar, Praveen
Easterly, Caleb
Esler, Mark
Mehta, Subina
Eschenlauer, Arthur C.
Hegeman, Adrian D.
Jagtap, Pratik D.
Griffin, Timothy J.
Improve your Galaxy text life: The Query Tabular Tool
title Improve your Galaxy text life: The Query Tabular Tool
title_full Improve your Galaxy text life: The Query Tabular Tool
title_fullStr Improve your Galaxy text life: The Query Tabular Tool
title_full_unstemmed Improve your Galaxy text life: The Query Tabular Tool
title_short Improve your Galaxy text life: The Query Tabular Tool
title_sort improve your galaxy text life: the query tabular tool
topic Software Tool Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6248266/
https://www.ncbi.nlm.nih.gov/pubmed/30519459
http://dx.doi.org/10.12688/f1000research.16450.2
work_keys_str_mv AT johnsonjamese improveyourgalaxytextlifethequerytabulartool
AT kumarpraveen improveyourgalaxytextlifethequerytabulartool
AT easterlycaleb improveyourgalaxytextlifethequerytabulartool
AT eslermark improveyourgalaxytextlifethequerytabulartool
AT mehtasubina improveyourgalaxytextlifethequerytabulartool
AT eschenlauerarthurc improveyourgalaxytextlifethequerytabulartool
AT hegemanadriand improveyourgalaxytextlifethequerytabulartool
AT jagtappratikd improveyourgalaxytextlifethequerytabulartool
AT griffintimothyj improveyourgalaxytextlifethequerytabulartool