Cargando…
A comparative evaluation of genome assembly reconciliation tools
BACKGROUND: The majority of eukaryotic genomes are unfinished due to the algorithmic challenges of assembling them. A variety of assembly and scaffolding tools are available, but it is not always obvious which tool or parameters to use for a specific genome size and complexity. It is, therefore, com...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5436433/ https://www.ncbi.nlm.nih.gov/pubmed/28521789 http://dx.doi.org/10.1186/s13059-017-1213-3 |
_version_ | 1783237404269740032 |
---|---|
author | Alhakami, Hind Mirebrahim, Hamid Lonardi, Stefano |
author_facet | Alhakami, Hind Mirebrahim, Hamid Lonardi, Stefano |
author_sort | Alhakami, Hind |
collection | PubMed |
description | BACKGROUND: The majority of eukaryotic genomes are unfinished due to the algorithmic challenges of assembling them. A variety of assembly and scaffolding tools are available, but it is not always obvious which tool or parameters to use for a specific genome size and complexity. It is, therefore, common practice to produce multiple assemblies using different assemblers and parameters, then select the best one for public release. A more compelling approach would allow one to merge multiple assemblies with the intent of producing a higher quality consensus assembly, which is the objective of assembly reconciliation. RESULTS: Several assembly reconciliation tools have been proposed in the literature, but their strengths and weaknesses have never been compared on a common dataset. We fill this need with this work, in which we report on an extensive comparative evaluation of several tools. Specifically, we evaluate contiguity, correctness, coverage, and the duplication ratio of the merged assembly compared to the individual assemblies provided as input. CONCLUSIONS: None of the tools we tested consistently improved the quality of the input GAGE and synthetic assemblies. Our experiments show an increase in contiguity in the consensus assembly when the original assemblies already have high quality. In terms of correctness, the quality of the results depends on the specific tool, as well as on the quality and the ranking of the input assemblies. In general, the number of misassemblies ranges from being comparable to the best of the input assembly to being comparable to the worst of the input assembly. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1213-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5436433 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54364332017-05-19 A comparative evaluation of genome assembly reconciliation tools Alhakami, Hind Mirebrahim, Hamid Lonardi, Stefano Genome Biol Research BACKGROUND: The majority of eukaryotic genomes are unfinished due to the algorithmic challenges of assembling them. A variety of assembly and scaffolding tools are available, but it is not always obvious which tool or parameters to use for a specific genome size and complexity. It is, therefore, common practice to produce multiple assemblies using different assemblers and parameters, then select the best one for public release. A more compelling approach would allow one to merge multiple assemblies with the intent of producing a higher quality consensus assembly, which is the objective of assembly reconciliation. RESULTS: Several assembly reconciliation tools have been proposed in the literature, but their strengths and weaknesses have never been compared on a common dataset. We fill this need with this work, in which we report on an extensive comparative evaluation of several tools. Specifically, we evaluate contiguity, correctness, coverage, and the duplication ratio of the merged assembly compared to the individual assemblies provided as input. CONCLUSIONS: None of the tools we tested consistently improved the quality of the input GAGE and synthetic assemblies. Our experiments show an increase in contiguity in the consensus assembly when the original assemblies already have high quality. In terms of correctness, the quality of the results depends on the specific tool, as well as on the quality and the ranking of the input assemblies. In general, the number of misassemblies ranges from being comparable to the best of the input assembly to being comparable to the worst of the input assembly. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-017-1213-3) contains supplementary material, which is available to authorized users. BioMed Central 2017-05-18 /pmc/articles/PMC5436433/ /pubmed/28521789 http://dx.doi.org/10.1186/s13059-017-1213-3 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Alhakami, Hind Mirebrahim, Hamid Lonardi, Stefano A comparative evaluation of genome assembly reconciliation tools |
title | A comparative evaluation of genome assembly reconciliation tools |
title_full | A comparative evaluation of genome assembly reconciliation tools |
title_fullStr | A comparative evaluation of genome assembly reconciliation tools |
title_full_unstemmed | A comparative evaluation of genome assembly reconciliation tools |
title_short | A comparative evaluation of genome assembly reconciliation tools |
title_sort | comparative evaluation of genome assembly reconciliation tools |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5436433/ https://www.ncbi.nlm.nih.gov/pubmed/28521789 http://dx.doi.org/10.1186/s13059-017-1213-3 |
work_keys_str_mv | AT alhakamihind acomparativeevaluationofgenomeassemblyreconciliationtools AT mirebrahimhamid acomparativeevaluationofgenomeassemblyreconciliationtools AT lonardistefano acomparativeevaluationofgenomeassemblyreconciliationtools AT alhakamihind comparativeevaluationofgenomeassemblyreconciliationtools AT mirebrahimhamid comparativeevaluationofgenomeassemblyreconciliationtools AT lonardistefano comparativeevaluationofgenomeassemblyreconciliationtools |