Cargando…
Applications of the pipeline environment for visual informatics and genomics computations
BACKGROUND: Contemporary informatics and genomics research require efficient, flexible and robust management of large heterogeneous data, advanced computational tools, powerful visualization, reliable hardware infrastructure, interoperability of computational resources, and detailed data and analysi...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3199760/ https://www.ncbi.nlm.nih.gov/pubmed/21791102 http://dx.doi.org/10.1186/1471-2105-12-304 |
_version_ | 1782214590304616448 |
---|---|
author | Dinov, Ivo D Torri, Federica Macciardi, Fabio Petrosyan, Petros Liu, Zhizhong Zamanyan, Alen Eggert, Paul Pierce, Jonathan Genco, Alex Knowles, James A Clark, Andrew P Van Horn, John D Ames, Joseph Kesselman, Carl Toga, Arthur W |
author_facet | Dinov, Ivo D Torri, Federica Macciardi, Fabio Petrosyan, Petros Liu, Zhizhong Zamanyan, Alen Eggert, Paul Pierce, Jonathan Genco, Alex Knowles, James A Clark, Andrew P Van Horn, John D Ames, Joseph Kesselman, Carl Toga, Arthur W |
author_sort | Dinov, Ivo D |
collection | PubMed |
description | BACKGROUND: Contemporary informatics and genomics research require efficient, flexible and robust management of large heterogeneous data, advanced computational tools, powerful visualization, reliable hardware infrastructure, interoperability of computational resources, and detailed data and analysis-protocol provenance. The Pipeline is a client-server distributed computational environment that facilitates the visual graphical construction, execution, monitoring, validation and dissemination of advanced data analysis protocols. RESULTS: This paper reports on the applications of the LONI Pipeline environment to address two informatics challenges - graphical management of diverse genomics tools, and the interoperability of informatics software. Specifically, this manuscript presents the concrete details of deploying general informatics suites and individual software tools to new hardware infrastructures, the design, validation and execution of new visual analysis protocols via the Pipeline graphical interface, and integration of diverse informatics tools via the Pipeline eXtensible Markup Language syntax. We demonstrate each of these processes using several established informatics packages (e.g., miBLAST, EMBOSS, mrFAST, GWASS, MAQ, SAMtools, Bowtie) for basic local sequence alignment and search, molecular biology data analysis, and genome-wide association studies. These examples demonstrate the power of the Pipeline graphical workflow environment to enable integration of bioinformatics resources which provide a well-defined syntax for dynamic specification of the input/output parameters and the run-time execution controls. CONCLUSIONS: The LONI Pipeline environment http://pipeline.loni.ucla.edu provides a flexible graphical infrastructure for efficient biomedical computing and distributed informatics research. The interactive Pipeline resource manager enables the utilization and interoperability of diverse types of informatics resources. The Pipeline client-server model provides computational power to a broad spectrum of informatics investigators - experienced developers and novice users, user with or without access to advanced computational-resources (e.g., Grid, data), as well as basic and translational scientists. The open development, validation and dissemination of computational networks (pipeline workflows) facilitates the sharing of knowledge, tools, protocols and best practices, and enables the unbiased validation and replication of scientific findings by the entire community. |
format | Online Article Text |
id | pubmed-3199760 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31997602011-10-24 Applications of the pipeline environment for visual informatics and genomics computations Dinov, Ivo D Torri, Federica Macciardi, Fabio Petrosyan, Petros Liu, Zhizhong Zamanyan, Alen Eggert, Paul Pierce, Jonathan Genco, Alex Knowles, James A Clark, Andrew P Van Horn, John D Ames, Joseph Kesselman, Carl Toga, Arthur W BMC Bioinformatics Software BACKGROUND: Contemporary informatics and genomics research require efficient, flexible and robust management of large heterogeneous data, advanced computational tools, powerful visualization, reliable hardware infrastructure, interoperability of computational resources, and detailed data and analysis-protocol provenance. The Pipeline is a client-server distributed computational environment that facilitates the visual graphical construction, execution, monitoring, validation and dissemination of advanced data analysis protocols. RESULTS: This paper reports on the applications of the LONI Pipeline environment to address two informatics challenges - graphical management of diverse genomics tools, and the interoperability of informatics software. Specifically, this manuscript presents the concrete details of deploying general informatics suites and individual software tools to new hardware infrastructures, the design, validation and execution of new visual analysis protocols via the Pipeline graphical interface, and integration of diverse informatics tools via the Pipeline eXtensible Markup Language syntax. We demonstrate each of these processes using several established informatics packages (e.g., miBLAST, EMBOSS, mrFAST, GWASS, MAQ, SAMtools, Bowtie) for basic local sequence alignment and search, molecular biology data analysis, and genome-wide association studies. These examples demonstrate the power of the Pipeline graphical workflow environment to enable integration of bioinformatics resources which provide a well-defined syntax for dynamic specification of the input/output parameters and the run-time execution controls. CONCLUSIONS: The LONI Pipeline environment http://pipeline.loni.ucla.edu provides a flexible graphical infrastructure for efficient biomedical computing and distributed informatics research. The interactive Pipeline resource manager enables the utilization and interoperability of diverse types of informatics resources. The Pipeline client-server model provides computational power to a broad spectrum of informatics investigators - experienced developers and novice users, user with or without access to advanced computational-resources (e.g., Grid, data), as well as basic and translational scientists. The open development, validation and dissemination of computational networks (pipeline workflows) facilitates the sharing of knowledge, tools, protocols and best practices, and enables the unbiased validation and replication of scientific findings by the entire community. BioMed Central 2011-07-26 /pmc/articles/PMC3199760/ /pubmed/21791102 http://dx.doi.org/10.1186/1471-2105-12-304 Text en Copyright © 2011 Dinov et al; licensee BioMed Central Ltd. https://creativecommons.org/licenses/by/2.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0 (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Dinov, Ivo D Torri, Federica Macciardi, Fabio Petrosyan, Petros Liu, Zhizhong Zamanyan, Alen Eggert, Paul Pierce, Jonathan Genco, Alex Knowles, James A Clark, Andrew P Van Horn, John D Ames, Joseph Kesselman, Carl Toga, Arthur W Applications of the pipeline environment for visual informatics and genomics computations |
title | Applications of the pipeline environment for visual informatics and genomics computations |
title_full | Applications of the pipeline environment for visual informatics and genomics computations |
title_fullStr | Applications of the pipeline environment for visual informatics and genomics computations |
title_full_unstemmed | Applications of the pipeline environment for visual informatics and genomics computations |
title_short | Applications of the pipeline environment for visual informatics and genomics computations |
title_sort | applications of the pipeline environment for visual informatics and genomics computations |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3199760/ https://www.ncbi.nlm.nih.gov/pubmed/21791102 http://dx.doi.org/10.1186/1471-2105-12-304 |
work_keys_str_mv | AT dinovivod applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT torrifederica applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT macciardifabio applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT petrosyanpetros applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT liuzhizhong applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT zamanyanalen applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT eggertpaul applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT piercejonathan applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT gencoalex applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT knowlesjamesa applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT clarkandrewp applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT vanhornjohnd applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT amesjoseph applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT kesselmancarl applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations AT togaarthurw applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations |