Cargando…
BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data
BACKGROUND: Advances in sequencing efficiency have vastly increased the sizes of biological sequence databases, including many thousands of genome-sequenced species. The BLAST algorithm remains the main search engine for retrieving sequence information, and must consequently handle data on an unprec...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4062517/ https://www.ncbi.nlm.nih.gov/pubmed/24885091 http://dx.doi.org/10.1186/1471-2105-15-128 |
_version_ | 1782321666599157760 |
---|---|
author | Neumann, Ralf Stefan Kumar, Surendra Haverkamp, Thomas Hendricus Augustus Shalchian-Tabrizi, Kamran |
author_facet | Neumann, Ralf Stefan Kumar, Surendra Haverkamp, Thomas Hendricus Augustus Shalchian-Tabrizi, Kamran |
author_sort | Neumann, Ralf Stefan |
collection | PubMed |
description | BACKGROUND: Advances in sequencing efficiency have vastly increased the sizes of biological sequence databases, including many thousands of genome-sequenced species. The BLAST algorithm remains the main search engine for retrieving sequence information, and must consequently handle data on an unprecedented scale. This has been possible due to high-performance computers and parallel processing. However, the raw BLAST output from contemporary searches involving thousands of queries becomes ill-suited for direct human processing. Few programs attempt to directly visualize and interpret BLAST output; those that do often provide a mere basic structuring of BLAST data. RESULTS: Here we present a bioinformatics application named BLASTGrabber suitable for high-throughput sequencing analysis. BLASTGrabber, being implemented as a Java application, is OS-independent and includes a user friendly graphical user interface. Text or XML-formatted BLAST output files can be directly imported, displayed and categorized based on BLAST statistics. Query names and FASTA headers can be analysed by text-mining. In addition to visualizing sequence alignments, BLAST data can be ordered as an interactive taxonomy tree. All modes of analysis support selection, export and storage of data. A Java interface-based plugin structure facilitates the addition of customized third party functionality. CONCLUSION: The BLASTGrabber application introduces new ways of visualizing and analysing massive BLAST output data by integrating taxonomy identification, text mining capabilities and generic multi-dimensional rendering of BLAST hits. The program aims at a non-expert audience in terms of computer skills; the combination of new functionalities makes the program flexible and useful for a broad range of operations. |
format | Online Article Text |
id | pubmed-4062517 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40625172014-06-19 BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data Neumann, Ralf Stefan Kumar, Surendra Haverkamp, Thomas Hendricus Augustus Shalchian-Tabrizi, Kamran BMC Bioinformatics Software BACKGROUND: Advances in sequencing efficiency have vastly increased the sizes of biological sequence databases, including many thousands of genome-sequenced species. The BLAST algorithm remains the main search engine for retrieving sequence information, and must consequently handle data on an unprecedented scale. This has been possible due to high-performance computers and parallel processing. However, the raw BLAST output from contemporary searches involving thousands of queries becomes ill-suited for direct human processing. Few programs attempt to directly visualize and interpret BLAST output; those that do often provide a mere basic structuring of BLAST data. RESULTS: Here we present a bioinformatics application named BLASTGrabber suitable for high-throughput sequencing analysis. BLASTGrabber, being implemented as a Java application, is OS-independent and includes a user friendly graphical user interface. Text or XML-formatted BLAST output files can be directly imported, displayed and categorized based on BLAST statistics. Query names and FASTA headers can be analysed by text-mining. In addition to visualizing sequence alignments, BLAST data can be ordered as an interactive taxonomy tree. All modes of analysis support selection, export and storage of data. A Java interface-based plugin structure facilitates the addition of customized third party functionality. CONCLUSION: The BLASTGrabber application introduces new ways of visualizing and analysing massive BLAST output data by integrating taxonomy identification, text mining capabilities and generic multi-dimensional rendering of BLAST hits. The program aims at a non-expert audience in terms of computer skills; the combination of new functionalities makes the program flexible and useful for a broad range of operations. BioMed Central 2014-05-05 /pmc/articles/PMC4062517/ /pubmed/24885091 http://dx.doi.org/10.1186/1471-2105-15-128 Text en Copyright © 2014 Neumann et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Neumann, Ralf Stefan Kumar, Surendra Haverkamp, Thomas Hendricus Augustus Shalchian-Tabrizi, Kamran BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data |
title | BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data |
title_full | BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data |
title_fullStr | BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data |
title_full_unstemmed | BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data |
title_short | BLASTGrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive BLAST data |
title_sort | blastgrabber: a bioinformatic tool for visualization, analysis and sequence selection of massive blast data |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4062517/ https://www.ncbi.nlm.nih.gov/pubmed/24885091 http://dx.doi.org/10.1186/1471-2105-15-128 |
work_keys_str_mv | AT neumannralfstefan blastgrabberabioinformatictoolforvisualizationanalysisandsequenceselectionofmassiveblastdata AT kumarsurendra blastgrabberabioinformatictoolforvisualizationanalysisandsequenceselectionofmassiveblastdata AT haverkampthomashendricusaugustus blastgrabberabioinformatictoolforvisualizationanalysisandsequenceselectionofmassiveblastdata AT shalchiantabrizikamran blastgrabberabioinformatictoolforvisualizationanalysisandsequenceselectionofmassiveblastdata |