Cargando…
VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data
BACKGROUND: The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-b...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3364912/ https://www.ncbi.nlm.nih.gov/pubmed/22480257 http://dx.doi.org/10.1186/1471-2164-13-131 |
_version_ | 1782234605759234048 |
---|---|
author | Peterson, Elena S McCue, Lee Ann Schrimpe-Rutledge, Alexandra C Jensen, Jeffrey L Walker, Hyunjoo Kobold, Markus A Webb, Samantha R Payne, Samuel H Ansong, Charles Adkins, Joshua N Cannon, William R Webb-Robertson, Bobbie-Jo M |
author_facet | Peterson, Elena S McCue, Lee Ann Schrimpe-Rutledge, Alexandra C Jensen, Jeffrey L Walker, Hyunjoo Kobold, Markus A Webb, Samantha R Payne, Samuel H Ansong, Charles Adkins, Joshua N Cannon, William R Webb-Robertson, Bobbie-Jo M |
author_sort | Peterson, Elena S |
collection | PubMed |
description | BACKGROUND: The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates. RESULTS: VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data. CONCLUSIONS: VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at https://www.biopilot.org/docs/Software/Vespa.php. |
format | Online Article Text |
id | pubmed-3364912 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-33649122012-06-01 VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data Peterson, Elena S McCue, Lee Ann Schrimpe-Rutledge, Alexandra C Jensen, Jeffrey L Walker, Hyunjoo Kobold, Markus A Webb, Samantha R Payne, Samuel H Ansong, Charles Adkins, Joshua N Cannon, William R Webb-Robertson, Bobbie-Jo M BMC Genomics Software BACKGROUND: The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates. RESULTS: VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data. CONCLUSIONS: VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at https://www.biopilot.org/docs/Software/Vespa.php. BioMed Central 2012-04-05 /pmc/articles/PMC3364912/ /pubmed/22480257 http://dx.doi.org/10.1186/1471-2164-13-131 Text en Copyright ©2012 Peterson et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Peterson, Elena S McCue, Lee Ann Schrimpe-Rutledge, Alexandra C Jensen, Jeffrey L Walker, Hyunjoo Kobold, Markus A Webb, Samantha R Payne, Samuel H Ansong, Charles Adkins, Joshua N Cannon, William R Webb-Robertson, Bobbie-Jo M VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data |
title | VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data |
title_full | VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data |
title_fullStr | VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data |
title_full_unstemmed | VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data |
title_short | VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data |
title_sort | vespa: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3364912/ https://www.ncbi.nlm.nih.gov/pubmed/22480257 http://dx.doi.org/10.1186/1471-2164-13-131 |
work_keys_str_mv | AT petersonelenas vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT mccueleeann vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT schrimperutledgealexandrac vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT jensenjeffreyl vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT walkerhyunjoo vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT koboldmarkusa vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT webbsamanthar vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT paynesamuelh vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT ansongcharles vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT adkinsjoshuan vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT cannonwilliamr vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata AT webbrobertsonbobbiejom vespasoftwaretofacilitategenomicannotationofprokaryoticorganismsthroughintegrationofproteomicandtranscriptomicdata |