Cargando…

CAMSA: a tool for comparative analysis and merging of scaffold assemblies

BACKGROUND: Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose positions and orientations along the genome are unknown. While there...

Descripción completa

Detalles Bibliográficos
Autores principales: Aganezov, Sergey S., Alekseyev, Max A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5731503/
https://www.ncbi.nlm.nih.gov/pubmed/29244014
http://dx.doi.org/10.1186/s12859-017-1919-y
_version_ 1783286523039318016
author Aganezov, Sergey S.
Alekseyev, Max A.
author_facet Aganezov, Sergey S.
Alekseyev, Max A.
author_sort Aganezov, Sergey S.
collection PubMed
description BACKGROUND: Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose positions and orientations along the genome are unknown. While there exists a number of methods for reconstruction of the genome from its scaffolds, utilizing various computational and wet-lab techniques, they often can produce only partial error-prone scaffold assemblies. It therefore becomes important to compare and merge scaffold assemblies produced by different methods, thus combining their advantages and highlighting present conflicts for further investigation. These tasks may be labor intensive if performed manually. RESULTS: We present CAMSA—a tool for comparative analysis and merging of two or more given scaffold assemblies. The tool (i) creates an extensive report with several comparative quality metrics; (ii) constructs the most confident merged scaffold assembly; and (iii) provides an interactive framework for a visual comparative analysis of the given assemblies. Among the CAMSA features, only scaffold merging can be evaluated in comparison to existing methods. Namely, it resembles the functionality of assembly reconciliation tools, although their primary targets are somewhat different. Our evaluations show that CAMSA produces merged assemblies of comparable or better quality than existing assembly reconciliation tools while being the fastest in terms of the total running time. CONCLUSIONS: CAMSA addresses the current deficiency of tools for automated comparison and analysis of multiple assemblies of the same set scaffolds. Since there exist numerous methods and techniques for scaffold assembly, identifying similarities and dissimilarities across assemblies produced by different methods is beneficial both for the developers of scaffold assembly algorithms and for the researchers focused on improving draft assemblies of specific organisms. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1919-y) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5731503
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57315032017-12-19 CAMSA: a tool for comparative analysis and merging of scaffold assemblies Aganezov, Sergey S. Alekseyev, Max A. BMC Bioinformatics Research BACKGROUND: Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose positions and orientations along the genome are unknown. While there exists a number of methods for reconstruction of the genome from its scaffolds, utilizing various computational and wet-lab techniques, they often can produce only partial error-prone scaffold assemblies. It therefore becomes important to compare and merge scaffold assemblies produced by different methods, thus combining their advantages and highlighting present conflicts for further investigation. These tasks may be labor intensive if performed manually. RESULTS: We present CAMSA—a tool for comparative analysis and merging of two or more given scaffold assemblies. The tool (i) creates an extensive report with several comparative quality metrics; (ii) constructs the most confident merged scaffold assembly; and (iii) provides an interactive framework for a visual comparative analysis of the given assemblies. Among the CAMSA features, only scaffold merging can be evaluated in comparison to existing methods. Namely, it resembles the functionality of assembly reconciliation tools, although their primary targets are somewhat different. Our evaluations show that CAMSA produces merged assemblies of comparable or better quality than existing assembly reconciliation tools while being the fastest in terms of the total running time. CONCLUSIONS: CAMSA addresses the current deficiency of tools for automated comparison and analysis of multiple assemblies of the same set scaffolds. Since there exist numerous methods and techniques for scaffold assembly, identifying similarities and dissimilarities across assemblies produced by different methods is beneficial both for the developers of scaffold assembly algorithms and for the researchers focused on improving draft assemblies of specific organisms. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1919-y) contains supplementary material, which is available to authorized users. BioMed Central 2017-12-06 /pmc/articles/PMC5731503/ /pubmed/29244014 http://dx.doi.org/10.1186/s12859-017-1919-y Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Aganezov, Sergey S.
Alekseyev, Max A.
CAMSA: a tool for comparative analysis and merging of scaffold assemblies
title CAMSA: a tool for comparative analysis and merging of scaffold assemblies
title_full CAMSA: a tool for comparative analysis and merging of scaffold assemblies
title_fullStr CAMSA: a tool for comparative analysis and merging of scaffold assemblies
title_full_unstemmed CAMSA: a tool for comparative analysis and merging of scaffold assemblies
title_short CAMSA: a tool for comparative analysis and merging of scaffold assemblies
title_sort camsa: a tool for comparative analysis and merging of scaffold assemblies
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5731503/
https://www.ncbi.nlm.nih.gov/pubmed/29244014
http://dx.doi.org/10.1186/s12859-017-1919-y
work_keys_str_mv AT aganezovsergeys camsaatoolforcomparativeanalysisandmergingofscaffoldassemblies
AT alekseyevmaxa camsaatoolforcomparativeanalysisandmergingofscaffoldassemblies