
BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data

BACKGROUND: Advances in sequencing efficiency have vastly increased the sizes of biological sequence databases, including many thousands of genome-sequenced species. The BLAST algorithm remains the main search engine for retrieving sequence information, and must consequently handle data on an unprec...


Bibliographic Details
Main authors: Neumann, Ralf Stefan; Kumar, Surendra; Haverkamp, Thomas Hendricus Augustus; Shalchian-Tabrizi, Kamran
Format: Online Article Text
Language: English
Published: BioMed Central 2014
Subjects: Software
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4062517/
https://www.ncbi.nlm.nih.gov/pubmed/24885091
http://dx.doi.org/10.1186/1471-2105-15-128
author Neumann, Ralf Stefan
Kumar, Surendra
Haverkamp, Thomas Hendricus Augustus
Shalchian-Tabrizi, Kamran
author_sort Neumann, Ralf Stefan
collection PubMed
description BACKGROUND: Advances in sequencing efficiency have vastly increased the sizes of biological sequence databases, including many thousands of genome-sequenced species. The BLAST algorithm remains the main search engine for retrieving sequence information, and must consequently handle data on an unprecedented scale. This has been possible due to high-performance computers and parallel processing. However, the raw BLAST output from contemporary searches involving thousands of queries becomes ill-suited for direct human processing. Few programs attempt to directly visualize and interpret BLAST output; those that do often provide a mere basic structuring of BLAST data. RESULTS: Here we present a bioinformatics application named BLASTGrabber suitable for high-throughput sequencing analysis. BLASTGrabber, being implemented as a Java application, is OS-independent and includes a user friendly graphical user interface. Text or XML-formatted BLAST output files can be directly imported, displayed and categorized based on BLAST statistics. Query names and FASTA headers can be analysed by text-mining. In addition to visualizing sequence alignments, BLAST data can be ordered as an interactive taxonomy tree. All modes of analysis support selection, export and storage of data. A Java interface-based plugin structure facilitates the addition of customized third party functionality. CONCLUSION: The BLASTGrabber application introduces new ways of visualizing and analysing massive BLAST output data by integrating taxonomy identification, text mining capabilities and generic multi-dimensional rendering of BLAST hits. The program aims at a non-expert audience in terms of computer skills; the combination of new functionalities makes the program flexible and useful for a broad range of operations.
format Online
Article
Text
id pubmed-4062517
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4062517
2014-06-19
BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data
Neumann, Ralf Stefan
Kumar, Surendra
Haverkamp, Thomas Hendricus Augustus
Shalchian-Tabrizi, Kamran
BMC Bioinformatics
Software
BioMed Central 2014-05-05
/pmc/articles/PMC4062517/
/pubmed/24885091
http://dx.doi.org/10.1186/1471-2105-15-128
Text en
Copyright © 2014 Neumann et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Neumann, Ralf Stefan
Kumar, Surendra
Haverkamp, Thomas Hendricus Augustus
Shalchian-Tabrizi, Kamran
BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data
title BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data
topic Software