Cargando…

gVolante for standardizing completeness assessment of genome and transcriptome assemblies

MOTIVATION: Along with the increasing accessibility to comprehensive sequence information, such as whole genomes and transcriptomes, the demand for assessing their quality has been multiplied. To this end, metrics based on sequence lengths, such as N50, have become a standard, but they only evaluate...

Descripción completa

Detalles Bibliográficos
Autores principales: Nishimura, Osamu, Hara, Yuichiro, Kuraku, Shigehiro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870689/
https://www.ncbi.nlm.nih.gov/pubmed/29036533
http://dx.doi.org/10.1093/bioinformatics/btx445
_version_ 1783309534386716672
author Nishimura, Osamu
Hara, Yuichiro
Kuraku, Shigehiro
author_facet Nishimura, Osamu
Hara, Yuichiro
Kuraku, Shigehiro
author_sort Nishimura, Osamu
collection PubMed
description MOTIVATION: Along with the increasing accessibility to comprehensive sequence information, such as whole genomes and transcriptomes, the demand for assessing their quality has been multiplied. To this end, metrics based on sequence lengths, such as N50, have become a standard, but they only evaluate one aspect of assembly quality. Conversely, analyzing the coverage of pre-selected reference protein-coding genes provides essential content-based quality assessment, but the currently available pipelines for this purpose, CEGMA and BUSCO, do not have a user-friendly interface to serve as a uniform environment for assembly completeness assessment. RESULTS: Here, we introduce a brand-new web server, gVolante, which provides an online tool for (i) on-demand completeness assessment of sequence sets by means of the previously developed pipelines CEGMA and BUSCO and (ii) browsing pre-computed completeness scores for publicly available data in its database section. Completeness assessments performed on gVolante report scores based on not just the coverage of reference genes but also on sequence lengths (e.g. N50 scaffold length), allowing quality control in multiple aspects. Using gVolante, one can compare the quality of original assemblies between their multiple versions (obtained through program choice and parameter tweaking, for example) and evaluate them in comparison to the scores of public resources found in the database section. AVAILABILITY AND IMPLEMENTATION: gVoalte is freely available at https://gvolante.riken.jp/.
format Online
Article
Text
id pubmed-5870689
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-58706892018-04-05 gVolante for standardizing completeness assessment of genome and transcriptome assemblies Nishimura, Osamu Hara, Yuichiro Kuraku, Shigehiro Bioinformatics Applications Notes MOTIVATION: Along with the increasing accessibility to comprehensive sequence information, such as whole genomes and transcriptomes, the demand for assessing their quality has been multiplied. To this end, metrics based on sequence lengths, such as N50, have become a standard, but they only evaluate one aspect of assembly quality. Conversely, analyzing the coverage of pre-selected reference protein-coding genes provides essential content-based quality assessment, but the currently available pipelines for this purpose, CEGMA and BUSCO, do not have a user-friendly interface to serve as a uniform environment for assembly completeness assessment. RESULTS: Here, we introduce a brand-new web server, gVolante, which provides an online tool for (i) on-demand completeness assessment of sequence sets by means of the previously developed pipelines CEGMA and BUSCO and (ii) browsing pre-computed completeness scores for publicly available data in its database section. Completeness assessments performed on gVolante report scores based on not just the coverage of reference genes but also on sequence lengths (e.g. N50 scaffold length), allowing quality control in multiple aspects. Using gVolante, one can compare the quality of original assemblies between their multiple versions (obtained through program choice and parameter tweaking, for example) and evaluate them in comparison to the scores of public resources found in the database section. AVAILABILITY AND IMPLEMENTATION: gVoalte is freely available at https://gvolante.riken.jp/. Oxford University Press 2017-11-15 2017-07-07 /pmc/articles/PMC5870689/ /pubmed/29036533 http://dx.doi.org/10.1093/bioinformatics/btx445 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Applications Notes
Nishimura, Osamu
Hara, Yuichiro
Kuraku, Shigehiro
gVolante for standardizing completeness assessment of genome and transcriptome assemblies
title gVolante for standardizing completeness assessment of genome and transcriptome assemblies
title_full gVolante for standardizing completeness assessment of genome and transcriptome assemblies
title_fullStr gVolante for standardizing completeness assessment of genome and transcriptome assemblies
title_full_unstemmed gVolante for standardizing completeness assessment of genome and transcriptome assemblies
title_short gVolante for standardizing completeness assessment of genome and transcriptome assemblies
title_sort gvolante for standardizing completeness assessment of genome and transcriptome assemblies
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5870689/
https://www.ncbi.nlm.nih.gov/pubmed/29036533
http://dx.doi.org/10.1093/bioinformatics/btx445
work_keys_str_mv AT nishimuraosamu gvolanteforstandardizingcompletenessassessmentofgenomeandtranscriptomeassemblies
AT harayuichiro gvolanteforstandardizingcompletenessassessmentofgenomeandtranscriptomeassemblies
AT kurakushigehiro gvolanteforstandardizingcompletenessassessmentofgenomeandtranscriptomeassemblies