Cargando…
CAMSA: a tool for comparative analysis and merging of scaffold assemblies
BACKGROUND: Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose positions and orientations along the genome are unknown. While there...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5731503/ https://www.ncbi.nlm.nih.gov/pubmed/29244014 http://dx.doi.org/10.1186/s12859-017-1919-y |
_version_ | 1783286523039318016 |
---|---|
author | Aganezov, Sergey S. Alekseyev, Max A. |
author_facet | Aganezov, Sergey S. Alekseyev, Max A. |
author_sort | Aganezov, Sergey S. |
collection | PubMed |
description | BACKGROUND: Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose positions and orientations along the genome are unknown. While there exists a number of methods for reconstruction of the genome from its scaffolds, utilizing various computational and wet-lab techniques, they often can produce only partial error-prone scaffold assemblies. It therefore becomes important to compare and merge scaffold assemblies produced by different methods, thus combining their advantages and highlighting present conflicts for further investigation. These tasks may be labor intensive if performed manually. RESULTS: We present CAMSA—a tool for comparative analysis and merging of two or more given scaffold assemblies. The tool (i) creates an extensive report with several comparative quality metrics; (ii) constructs the most confident merged scaffold assembly; and (iii) provides an interactive framework for a visual comparative analysis of the given assemblies. Among the CAMSA features, only scaffold merging can be evaluated in comparison to existing methods. Namely, it resembles the functionality of assembly reconciliation tools, although their primary targets are somewhat different. Our evaluations show that CAMSA produces merged assemblies of comparable or better quality than existing assembly reconciliation tools while being the fastest in terms of the total running time. CONCLUSIONS: CAMSA addresses the current deficiency of tools for automated comparison and analysis of multiple assemblies of the same set scaffolds. Since there exist numerous methods and techniques for scaffold assembly, identifying similarities and dissimilarities across assemblies produced by different methods is beneficial both for the developers of scaffold assembly algorithms and for the researchers focused on improving draft assemblies of specific organisms. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1919-y) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5731503 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57315032017-12-19 CAMSA: a tool for comparative analysis and merging of scaffold assemblies Aganezov, Sergey S. Alekseyev, Max A. BMC Bioinformatics Research BACKGROUND: Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose positions and orientations along the genome are unknown. While there exists a number of methods for reconstruction of the genome from its scaffolds, utilizing various computational and wet-lab techniques, they often can produce only partial error-prone scaffold assemblies. It therefore becomes important to compare and merge scaffold assemblies produced by different methods, thus combining their advantages and highlighting present conflicts for further investigation. These tasks may be labor intensive if performed manually. RESULTS: We present CAMSA—a tool for comparative analysis and merging of two or more given scaffold assemblies. The tool (i) creates an extensive report with several comparative quality metrics; (ii) constructs the most confident merged scaffold assembly; and (iii) provides an interactive framework for a visual comparative analysis of the given assemblies. Among the CAMSA features, only scaffold merging can be evaluated in comparison to existing methods. Namely, it resembles the functionality of assembly reconciliation tools, although their primary targets are somewhat different. Our evaluations show that CAMSA produces merged assemblies of comparable or better quality than existing assembly reconciliation tools while being the fastest in terms of the total running time. CONCLUSIONS: CAMSA addresses the current deficiency of tools for automated comparison and analysis of multiple assemblies of the same set scaffolds. Since there exist numerous methods and techniques for scaffold assembly, identifying similarities and dissimilarities across assemblies produced by different methods is beneficial both for the developers of scaffold assembly algorithms and for the researchers focused on improving draft assemblies of specific organisms. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-017-1919-y) contains supplementary material, which is available to authorized users. BioMed Central 2017-12-06 /pmc/articles/PMC5731503/ /pubmed/29244014 http://dx.doi.org/10.1186/s12859-017-1919-y Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Aganezov, Sergey S. Alekseyev, Max A. CAMSA: a tool for comparative analysis and merging of scaffold assemblies |
title | CAMSA: a tool for comparative analysis and merging of scaffold assemblies |
title_full | CAMSA: a tool for comparative analysis and merging of scaffold assemblies |
title_fullStr | CAMSA: a tool for comparative analysis and merging of scaffold assemblies |
title_full_unstemmed | CAMSA: a tool for comparative analysis and merging of scaffold assemblies |
title_short | CAMSA: a tool for comparative analysis and merging of scaffold assemblies |
title_sort | camsa: a tool for comparative analysis and merging of scaffold assemblies |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5731503/ https://www.ncbi.nlm.nih.gov/pubmed/29244014 http://dx.doi.org/10.1186/s12859-017-1919-y |
work_keys_str_mv | AT aganezovsergeys camsaatoolforcomparativeanalysisandmergingofscaffoldassemblies AT alekseyevmaxa camsaatoolforcomparativeanalysisandmergingofscaffoldassemblies |