Cargando…

A comparative evaluation of genome assembly reconciliation tools

BACKGROUND: The majority of eukaryotic genomes are unfinished due to the algorithmic challenges of assembling them. A variety of assembly and scaffolding tools are available, but it is not always obvious which tool or parameters to use for a specific genome size and complexity. It is, therefore, com...

Descripción completa

Detalles Bibliográficos
Autores principales: Alhakami, Hind, Mirebrahim, Hamid, Lonardi, Stefano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5436433/
https://www.ncbi.nlm.nih.gov/pubmed/28521789
http://dx.doi.org/10.1186/s13059-017-1213-3
_version_ 1783237404269740032
author Alhakami, Hind
Mirebrahim, Hamid
Lonardi, Stefano
author_facet Alhakami, Hind
Mirebrahim, Hamid
Lonardi, Stefano
author_sort Alhakami, Hind
collection PubMed
description BACKGROUND: The majority of eukaryotic genomes are unfinished due to the algorithmic challenges of assembling them. A variety of assembly and scaffolding tools are available, but it is not always obvious which tool or parameters to use for a specific genome size and complexity. It is, therefore, common practice to produce multiple assemblies using different assemblers and parameters, then select the best one for public release. A more compelling approach would allow one to merge multiple assemblies with the intent of producing a higher quality consensus assembly, which is the objective of assembly reconciliation. RESULTS: Several assembly reconciliation tools have been proposed in the literature, but their strengths and weaknesses have never been compared on a common dataset. We fill this need with this work, in which we report on an extensive comparative evaluation of several tools. Specifically, we evaluate contiguity, correctness, coverage, and the duplication ratio of the merged assembly compared to the individual assemblies provided as input. CONCLUSIONS: None of the tools we tested consistently improved the quality of the input GAGE and synthetic assemblies. Our experiments show an increase in contiguity in the consensus assembly when the original assemblies already have high quality. In terms of correctness, the quality of the results depends on the specific tool, as well as on the quality and the ranking of the input assemblies. In general, the number of misassemblies ranges from being comparable to the best of the input assembly to being comparable to the worst of the input assembly. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1213-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5436433
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54364332017-05-19 A comparative evaluation of genome assembly reconciliation tools Alhakami, Hind Mirebrahim, Hamid Lonardi, Stefano Genome Biol Research BACKGROUND: The majority of eukaryotic genomes are unfinished due to the algorithmic challenges of assembling them. A variety of assembly and scaffolding tools are available, but it is not always obvious which tool or parameters to use for a specific genome size and complexity. It is, therefore, common practice to produce multiple assemblies using different assemblers and parameters, then select the best one for public release. A more compelling approach would allow one to merge multiple assemblies with the intent of producing a higher quality consensus assembly, which is the objective of assembly reconciliation. RESULTS: Several assembly reconciliation tools have been proposed in the literature, but their strengths and weaknesses have never been compared on a common dataset. We fill this need with this work, in which we report on an extensive comparative evaluation of several tools. Specifically, we evaluate contiguity, correctness, coverage, and the duplication ratio of the merged assembly compared to the individual assemblies provided as input. CONCLUSIONS: None of the tools we tested consistently improved the quality of the input GAGE and synthetic assemblies. Our experiments show an increase in contiguity in the consensus assembly when the original assemblies already have high quality. In terms of correctness, the quality of the results depends on the specific tool, as well as on the quality and the ranking of the input assemblies. In general, the number of misassemblies ranges from being comparable to the best of the input assembly to being comparable to the worst of the input assembly. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1213-3) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-18 /pmc/articles/PMC5436433/ /pubmed/28521789 http://dx.doi.org/10.1186/s13059-017-1213-3 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Alhakami, Hind
Mirebrahim, Hamid
Lonardi, Stefano
A comparative evaluation of genome assembly reconciliation tools
title A comparative evaluation of genome assembly reconciliation tools
title_full A comparative evaluation of genome assembly reconciliation tools
title_fullStr A comparative evaluation of genome assembly reconciliation tools
title_full_unstemmed A comparative evaluation of genome assembly reconciliation tools
title_short A comparative evaluation of genome assembly reconciliation tools
title_sort comparative evaluation of genome assembly reconciliation tools
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5436433/
https://www.ncbi.nlm.nih.gov/pubmed/28521789
http://dx.doi.org/10.1186/s13059-017-1213-3
work_keys_str_mv AT alhakamihind acomparativeevaluationofgenomeassemblyreconciliationtools
AT mirebrahimhamid acomparativeevaluationofgenomeassemblyreconciliationtools
AT lonardistefano acomparativeevaluationofgenomeassemblyreconciliationtools
AT alhakamihind comparativeevaluationofgenomeassemblyreconciliationtools
AT mirebrahimhamid comparativeevaluationofgenomeassemblyreconciliationtools
AT lonardistefano comparativeevaluationofgenomeassemblyreconciliationtools