Cargando…
BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines
BACKGROUND: Automated bioinformatics workflows are more robust, easier to maintain, and results more reproducible when built with command-line utilities than with custom-coded scripts. Command-line utilities further benefit by relieving bioinformatics developers to learn the use of, or to interact d...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5833151/ https://www.ncbi.nlm.nih.gov/pubmed/29499649 http://dx.doi.org/10.1186/s12859-018-2074-9 |
_version_ | 1783303436776767488 |
---|---|
author | Hernández, Yözen Bernstein, Rocky Pagan, Pedro Vargas, Levy McCaig, William Ramrattan, Girish Akther, Saymon Larracuente, Amanda Di, Lia Vieira, Filipe G. Qiu, Wei-Gang |
author_facet | Hernández, Yözen Bernstein, Rocky Pagan, Pedro Vargas, Levy McCaig, William Ramrattan, Girish Akther, Saymon Larracuente, Amanda Di, Lia Vieira, Filipe G. Qiu, Wei-Gang |
author_sort | Hernández, Yözen |
collection | PubMed |
description | BACKGROUND: Automated bioinformatics workflows are more robust, easier to maintain, and results more reproducible when built with command-line utilities than with custom-coded scripts. Command-line utilities further benefit by relieving bioinformatics developers to learn the use of, or to interact directly with, biological software libraries. There is however a lack of command-line utilities that leverage popular Open Source biological software toolkits such as BioPerl (http://bioperl.org) to make many of the well-designed, robust, and routinely used biological classes available for a wider base of end users. RESULTS: Designed as standard utilities for UNIX-family operating systems, BpWrapper makes functionality of some of the most popular BioPerl modules readily accessible on the command line to novice as well as to experienced bioinformatics practitioners. The initial release of BpWrapper includes four utilities with concise command-line user interfaces, bioseq, bioaln, biotree, and biopop, specialized for manipulation of molecular sequences, sequence alignments, phylogenetic trees, and DNA polymorphisms, respectively. Over a hundred methods are currently available as command-line options and new methods are easily incorporated. Performance of BpWrapper utilities lags that of precompiled utilities while equivalent to that of other utilities based on BioPerl. BpWrapper has been tested on BioPerl Release 1.6, Perl versions 5.10.1 to 5.25.10, and operating systems including Apple macOS, Microsoft Windows, and GNU/Linux. Release code is available from the Comprehensive Perl Archive Network (CPAN) at https://metacpan.org/pod/Bio::BPWrapper. Source code is available on GitHub at https://github.com/bioperl/p5-bpwrapper. CONCLUSIONS: BpWrapper improves on existing sequence utilities by following the design principles of Unix text utilities such including a concise user interface, extensive command-line options, and standard input/output for serialized operations. Further, dozens of novel methods for manipulation of sequences, alignments, and phylogenetic trees, unavailable in existing utilities (e.g., EMBOSS, Newick Utilities, and FAST), are provided. Bioinformaticians should find BpWrapper useful for rapid prototyping of workflows on the command-line without creating custom scripts for comparative genomics and other bioinformatics applications. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2074-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5833151 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-58331512018-03-05 BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines Hernández, Yözen Bernstein, Rocky Pagan, Pedro Vargas, Levy McCaig, William Ramrattan, Girish Akther, Saymon Larracuente, Amanda Di, Lia Vieira, Filipe G. Qiu, Wei-Gang BMC Bioinformatics Software BACKGROUND: Automated bioinformatics workflows are more robust, easier to maintain, and results more reproducible when built with command-line utilities than with custom-coded scripts. Command-line utilities further benefit by relieving bioinformatics developers to learn the use of, or to interact directly with, biological software libraries. There is however a lack of command-line utilities that leverage popular Open Source biological software toolkits such as BioPerl (http://bioperl.org) to make many of the well-designed, robust, and routinely used biological classes available for a wider base of end users. RESULTS: Designed as standard utilities for UNIX-family operating systems, BpWrapper makes functionality of some of the most popular BioPerl modules readily accessible on the command line to novice as well as to experienced bioinformatics practitioners. The initial release of BpWrapper includes four utilities with concise command-line user interfaces, bioseq, bioaln, biotree, and biopop, specialized for manipulation of molecular sequences, sequence alignments, phylogenetic trees, and DNA polymorphisms, respectively. Over a hundred methods are currently available as command-line options and new methods are easily incorporated. Performance of BpWrapper utilities lags that of precompiled utilities while equivalent to that of other utilities based on BioPerl. BpWrapper has been tested on BioPerl Release 1.6, Perl versions 5.10.1 to 5.25.10, and operating systems including Apple macOS, Microsoft Windows, and GNU/Linux. Release code is available from the Comprehensive Perl Archive Network (CPAN) at https://metacpan.org/pod/Bio::BPWrapper. Source code is available on GitHub at https://github.com/bioperl/p5-bpwrapper. CONCLUSIONS: BpWrapper improves on existing sequence utilities by following the design principles of Unix text utilities such including a concise user interface, extensive command-line options, and standard input/output for serialized operations. Further, dozens of novel methods for manipulation of sequences, alignments, and phylogenetic trees, unavailable in existing utilities (e.g., EMBOSS, Newick Utilities, and FAST), are provided. Bioinformaticians should find BpWrapper useful for rapid prototyping of workflows on the command-line without creating custom scripts for comparative genomics and other bioinformatics applications. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-018-2074-9) contains supplementary material, which is available to authorized users. BioMed Central 2018-03-02 /pmc/articles/PMC5833151/ /pubmed/29499649 http://dx.doi.org/10.1186/s12859-018-2074-9 Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Hernández, Yözen Bernstein, Rocky Pagan, Pedro Vargas, Levy McCaig, William Ramrattan, Girish Akther, Saymon Larracuente, Amanda Di, Lia Vieira, Filipe G. Qiu, Wei-Gang BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines |
title | BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines |
title_full | BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines |
title_fullStr | BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines |
title_full_unstemmed | BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines |
title_short | BpWrapper: BioPerl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines |
title_sort | bpwrapper: bioperl-based sequence and tree utilities for rapid prototyping of bioinformatics pipelines |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5833151/ https://www.ncbi.nlm.nih.gov/pubmed/29499649 http://dx.doi.org/10.1186/s12859-018-2074-9 |
work_keys_str_mv | AT hernandezyozen bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines AT bernsteinrocky bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines AT paganpedro bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines AT vargaslevy bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines AT mccaigwilliam bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines AT ramrattangirish bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines AT akthersaymon bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines AT larracuenteamanda bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines AT dilia bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines AT vieirafilipeg bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines AT qiuweigang bpwrapperbioperlbasedsequenceandtreeutilitiesforrapidprototypingofbioinformaticspipelines |