BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines

BACKGROUND: Automated bioinformatics workflows are more robust, easier to maintain, and produce more reproducible results when built with command-line utilities than with custom-coded scripts. Command-line utilities offer the further benefit of relieving bioinformatics developers of the need to learn the use of, or to interact d...

Bibliographic Details
Main Authors: Hernández, Yözen, Bernstein, Rocky, Pagan, Pedro, Vargas, Levy, McCaig, William, Ramrattan, Girish, Akther, Saymon, Larracuente, Amanda, Di, Lia, Vieira, Filipe G., Qiu, Wei-Gang
Format: Online Article Text
Language: English
Published: BioMed Central 2018
Subjects: Software
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5833151/
https://www.ncbi.nlm.nih.gov/pubmed/29499649
http://dx.doi.org/10.1186/s12859-018-2074-9
_version_ 1783303436776767488
author Hernández, Yözen
Bernstein, Rocky
Pagan, Pedro
Vargas, Levy
McCaig, William
Ramrattan, Girish
Akther, Saymon
Larracuente, Amanda
Di, Lia
Vieira, Filipe G.
Qiu, Wei-Gang
author_sort Hernández, Yözen
collection PubMed
description BACKGROUND: Automated bioinformatics workflows are more robust, easier to maintain, and produce more reproducible results when built with command-line utilities than with custom-coded scripts. Command-line utilities offer the further benefit of relieving bioinformatics developers of the need to learn the use of, or to interact directly with, biological software libraries. There is, however, a lack of command-line utilities that leverage popular Open Source biological software toolkits such as BioPerl (http://bioperl.org) to make many of the well-designed, robust, and routinely used biological classes available to a wider base of end users. RESULTS: Designed as standard utilities for UNIX-family operating systems, BpWrapper makes the functionality of some of the most popular BioPerl modules readily accessible on the command line to novice as well as experienced bioinformatics practitioners. The initial release of BpWrapper includes four utilities with concise command-line user interfaces, bioseq, bioaln, biotree, and biopop, specialized for manipulation of molecular sequences, sequence alignments, phylogenetic trees, and DNA polymorphisms, respectively. Over a hundred methods are currently available as command-line options, and new methods are easily incorporated. The performance of BpWrapper utilities lags behind that of precompiled utilities but is equivalent to that of other utilities based on BioPerl. BpWrapper has been tested on BioPerl Release 1.6, Perl versions 5.10.1 to 5.25.10, and operating systems including Apple macOS, Microsoft Windows, and GNU/Linux. Release code is available from the Comprehensive Perl Archive Network (CPAN) at https://metacpan.org/pod/Bio::BPWrapper. Source code is available on GitHub at https://github.com/bioperl/p5-bpwrapper.
CONCLUSIONS: BpWrapper improves on existing sequence utilities by following the design principles of Unix text utilities, such as a concise user interface, extensive command-line options, and standard input/output for serialized operations. Further, dozens of novel methods for manipulation of sequences, alignments, and phylogenetic trees, unavailable in existing utilities (e.g., EMBOSS, Newick Utilities, and FAST), are provided. Bioinformaticians should find BpWrapper useful for rapid prototyping of workflows on the command line, without creating custom scripts, for comparative genomics and other bioinformatics applications. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2074-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5833151
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5833151 2018-03-05 BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines Hernández, Yözen Bernstein, Rocky Pagan, Pedro Vargas, Levy McCaig, William Ramrattan, Girish Akther, Saymon Larracuente, Amanda Di, Lia Vieira, Filipe G. Qiu, Wei-Gang BMC Bioinformatics Software BioMed Central 2018-03-02 /pmc/articles/PMC5833151/ /pubmed/29499649 http://dx.doi.org/10.1186/s12859-018-2074-9 Text en © The Author(s). 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines
topic Software