Cargando…

ParsEval: parallel comparison and analysis of gene structure annotations

BACKGROUND: Accurate gene structure annotation is a fundamental but somewhat elusive goal of genome projects, as witnessed by the fact that (model) genomes typically undergo several cycles of re-annotation. In many cases, it is not only different versions of annotations that need to be compared but...

Descripción completa

Detalles Bibliográficos
Autores principales: Standage, Daniel S, Brendel, Volker P
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3439248/
https://www.ncbi.nlm.nih.gov/pubmed/22852583
http://dx.doi.org/10.1186/1471-2105-13-187
_version_ 1782242963175243776
author Standage, Daniel S
Brendel, Volker P
author_facet Standage, Daniel S
Brendel, Volker P
author_sort Standage, Daniel S
collection PubMed
description BACKGROUND: Accurate gene structure annotation is a fundamental but somewhat elusive goal of genome projects, as witnessed by the fact that (model) genomes typically undergo several cycles of re-annotation. In many cases, it is not only different versions of annotations that need to be compared but also different sources of annotation of the same genome, derived from distinct gene prediction workflows. Such comparisons are of interest to annotation providers, prediction software developers, and end-users, who all need to assess what is common and what is different among distinct annotation sources. We developed ParsEval, a software application for pairwise comparison of sets of gene structure annotations. ParsEval calculates several statistics that highlight the similarities and differences between the two sets of annotations provided. These statistics are presented in an aggregate summary report, with additional details provided as individual reports specific to non-overlapping, gene-model-centric genomic loci. Genome browser styled graphics embedded in these reports help visualize the genomic context of the annotations. Output from ParsEval is both easily read and parsed, enabling systematic identification of problematic gene models for subsequent focused analysis. RESULTS: ParsEval is capable of analyzing annotations for large eukaryotic genomes on typical desktop or laptop hardware. In comparison to existing methods, ParsEval exhibits a considerable performance improvement, both in terms of runtime and memory consumption. Reports from ParsEval can provide relevant biological insights into the gene structure annotations being compared. CONCLUSIONS: Implemented in C, ParsEval provides the quickest and most feature-rich solution for genome annotation comparison to date. The source code is freely available (under an ISC license) at http://parseval.sourceforge.net/.
format Online
Article
Text
id pubmed-3439248
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-34392482012-09-12 ParsEval: parallel comparison and analysis of gene structure annotations Standage, Daniel S Brendel, Volker P BMC Bioinformatics Software BACKGROUND: Accurate gene structure annotation is a fundamental but somewhat elusive goal of genome projects, as witnessed by the fact that (model) genomes typically undergo several cycles of re-annotation. In many cases, it is not only different versions of annotations that need to be compared but also different sources of annotation of the same genome, derived from distinct gene prediction workflows. Such comparisons are of interest to annotation providers, prediction software developers, and end-users, who all need to assess what is common and what is different among distinct annotation sources. We developed ParsEval, a software application for pairwise comparison of sets of gene structure annotations. ParsEval calculates several statistics that highlight the similarities and differences between the two sets of annotations provided. These statistics are presented in an aggregate summary report, with additional details provided as individual reports specific to non-overlapping, gene-model-centric genomic loci. Genome browser styled graphics embedded in these reports help visualize the genomic context of the annotations. Output from ParsEval is both easily read and parsed, enabling systematic identification of problematic gene models for subsequent focused analysis. RESULTS: ParsEval is capable of analyzing annotations for large eukaryotic genomes on typical desktop or laptop hardware. In comparison to existing methods, ParsEval exhibits a considerable performance improvement, both in terms of runtime and memory consumption. Reports from ParsEval can provide relevant biological insights into the gene structure annotations being compared. CONCLUSIONS: Implemented in C, ParsEval provides the quickest and most feature-rich solution for genome annotation comparison to date. The source code is freely available (under an ISC license) at http://parseval.sourceforge.net/. BioMed Central 2012-08-01 /pmc/articles/PMC3439248/ /pubmed/22852583 http://dx.doi.org/10.1186/1471-2105-13-187 Text en Copyright ©2012 Standage and Brendel; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Standage, Daniel S
Brendel, Volker P
ParsEval: parallel comparison and analysis of gene structure annotations
title ParsEval: parallel comparison and analysis of gene structure annotations
title_full ParsEval: parallel comparison and analysis of gene structure annotations
title_fullStr ParsEval: parallel comparison and analysis of gene structure annotations
title_full_unstemmed ParsEval: parallel comparison and analysis of gene structure annotations
title_short ParsEval: parallel comparison and analysis of gene structure annotations
title_sort parseval: parallel comparison and analysis of gene structure annotations
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3439248/
https://www.ncbi.nlm.nih.gov/pubmed/22852583
http://dx.doi.org/10.1186/1471-2105-13-187
work_keys_str_mv AT standagedaniels parsevalparallelcomparisonandanalysisofgenestructureannotations
AT brendelvolkerp parsevalparallelcomparisonandanalysisofgenestructureannotations