Cargando…

Applications of the pipeline environment for visual informatics and genomics computations

BACKGROUND: Contemporary informatics and genomics research require efficient, flexible and robust management of large heterogeneous data, advanced computational tools, powerful visualization, reliable hardware infrastructure, interoperability of computational resources, and detailed data and analysi...

Descripción completa

Detalles Bibliográficos
Autores principales: Dinov, Ivo D, Torri, Federica, Macciardi, Fabio, Petrosyan, Petros, Liu, Zhizhong, Zamanyan, Alen, Eggert, Paul, Pierce, Jonathan, Genco, Alex, Knowles, James A, Clark, Andrew P, Van Horn, John D, Ames, Joseph, Kesselman, Carl, Toga, Arthur W
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3199760/
https://www.ncbi.nlm.nih.gov/pubmed/21791102
http://dx.doi.org/10.1186/1471-2105-12-304
_version_ 1782214590304616448
author Dinov, Ivo D
Torri, Federica
Macciardi, Fabio
Petrosyan, Petros
Liu, Zhizhong
Zamanyan, Alen
Eggert, Paul
Pierce, Jonathan
Genco, Alex
Knowles, James A
Clark, Andrew P
Van Horn, John D
Ames, Joseph
Kesselman, Carl
Toga, Arthur W
author_facet Dinov, Ivo D
Torri, Federica
Macciardi, Fabio
Petrosyan, Petros
Liu, Zhizhong
Zamanyan, Alen
Eggert, Paul
Pierce, Jonathan
Genco, Alex
Knowles, James A
Clark, Andrew P
Van Horn, John D
Ames, Joseph
Kesselman, Carl
Toga, Arthur W
author_sort Dinov, Ivo D
collection PubMed
description BACKGROUND: Contemporary informatics and genomics research require efficient, flexible and robust management of large heterogeneous data, advanced computational tools, powerful visualization, reliable hardware infrastructure, interoperability of computational resources, and detailed data and analysis-protocol provenance. The Pipeline is a client-server distributed computational environment that facilitates the visual graphical construction, execution, monitoring, validation and dissemination of advanced data analysis protocols. RESULTS: This paper reports on the applications of the LONI Pipeline environment to address two informatics challenges - graphical management of diverse genomics tools, and the interoperability of informatics software. Specifically, this manuscript presents the concrete details of deploying general informatics suites and individual software tools to new hardware infrastructures, the design, validation and execution of new visual analysis protocols via the Pipeline graphical interface, and integration of diverse informatics tools via the Pipeline eXtensible Markup Language syntax. We demonstrate each of these processes using several established informatics packages (e.g., miBLAST, EMBOSS, mrFAST, GWASS, MAQ, SAMtools, Bowtie) for basic local sequence alignment and search, molecular biology data analysis, and genome-wide association studies. These examples demonstrate the power of the Pipeline graphical workflow environment to enable integration of bioinformatics resources which provide a well-defined syntax for dynamic specification of the input/output parameters and the run-time execution controls. CONCLUSIONS: The LONI Pipeline environment http://pipeline.loni.ucla.edu provides a flexible graphical infrastructure for efficient biomedical computing and distributed informatics research. The interactive Pipeline resource manager enables the utilization and interoperability of diverse types of informatics resources. The Pipeline client-server model provides computational power to a broad spectrum of informatics investigators - experienced developers and novice users, user with or without access to advanced computational-resources (e.g., Grid, data), as well as basic and translational scientists. The open development, validation and dissemination of computational networks (pipeline workflows) facilitates the sharing of knowledge, tools, protocols and best practices, and enables the unbiased validation and replication of scientific findings by the entire community.
format Online
Article
Text
id pubmed-3199760
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-31997602011-10-24 Applications of the pipeline environment for visual informatics and genomics computations Dinov, Ivo D Torri, Federica Macciardi, Fabio Petrosyan, Petros Liu, Zhizhong Zamanyan, Alen Eggert, Paul Pierce, Jonathan Genco, Alex Knowles, James A Clark, Andrew P Van Horn, John D Ames, Joseph Kesselman, Carl Toga, Arthur W BMC Bioinformatics Software BACKGROUND: Contemporary informatics and genomics research require efficient, flexible and robust management of large heterogeneous data, advanced computational tools, powerful visualization, reliable hardware infrastructure, interoperability of computational resources, and detailed data and analysis-protocol provenance. The Pipeline is a client-server distributed computational environment that facilitates the visual graphical construction, execution, monitoring, validation and dissemination of advanced data analysis protocols. RESULTS: This paper reports on the applications of the LONI Pipeline environment to address two informatics challenges - graphical management of diverse genomics tools, and the interoperability of informatics software. Specifically, this manuscript presents the concrete details of deploying general informatics suites and individual software tools to new hardware infrastructures, the design, validation and execution of new visual analysis protocols via the Pipeline graphical interface, and integration of diverse informatics tools via the Pipeline eXtensible Markup Language syntax. We demonstrate each of these processes using several established informatics packages (e.g., miBLAST, EMBOSS, mrFAST, GWASS, MAQ, SAMtools, Bowtie) for basic local sequence alignment and search, molecular biology data analysis, and genome-wide association studies. These examples demonstrate the power of the Pipeline graphical workflow environment to enable integration of bioinformatics resources which provide a well-defined syntax for dynamic specification of the input/output parameters and the run-time execution controls. CONCLUSIONS: The LONI Pipeline environment http://pipeline.loni.ucla.edu provides a flexible graphical infrastructure for efficient biomedical computing and distributed informatics research. The interactive Pipeline resource manager enables the utilization and interoperability of diverse types of informatics resources. The Pipeline client-server model provides computational power to a broad spectrum of informatics investigators - experienced developers and novice users, user with or without access to advanced computational-resources (e.g., Grid, data), as well as basic and translational scientists. The open development, validation and dissemination of computational networks (pipeline workflows) facilitates the sharing of knowledge, tools, protocols and best practices, and enables the unbiased validation and replication of scientific findings by the entire community. BioMed Central 2011-07-26 /pmc/articles/PMC3199760/ /pubmed/21791102 http://dx.doi.org/10.1186/1471-2105-12-304 Text en Copyright © 2011 Dinov et al; licensee BioMed Central Ltd. https://creativecommons.org/licenses/by/2.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0 (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Dinov, Ivo D
Torri, Federica
Macciardi, Fabio
Petrosyan, Petros
Liu, Zhizhong
Zamanyan, Alen
Eggert, Paul
Pierce, Jonathan
Genco, Alex
Knowles, James A
Clark, Andrew P
Van Horn, John D
Ames, Joseph
Kesselman, Carl
Toga, Arthur W
Applications of the pipeline environment for visual informatics and genomics computations
title Applications of the pipeline environment for visual informatics and genomics computations
title_full Applications of the pipeline environment for visual informatics and genomics computations
title_fullStr Applications of the pipeline environment for visual informatics and genomics computations
title_full_unstemmed Applications of the pipeline environment for visual informatics and genomics computations
title_short Applications of the pipeline environment for visual informatics and genomics computations
title_sort applications of the pipeline environment for visual informatics and genomics computations
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3199760/
https://www.ncbi.nlm.nih.gov/pubmed/21791102
http://dx.doi.org/10.1186/1471-2105-12-304
work_keys_str_mv AT dinovivod applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT torrifederica applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT macciardifabio applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT petrosyanpetros applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT liuzhizhong applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT zamanyanalen applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT eggertpaul applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT piercejonathan applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT gencoalex applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT knowlesjamesa applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT clarkandrewp applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT vanhornjohnd applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT amesjoseph applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT kesselmancarl applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations
AT togaarthurw applicationsofthepipelineenvironmentforvisualinformaticsandgenomicscomputations