Cargando…

ARBitR: an overlap-aware genome assembly scaffolder for linked reads

SUMMARY: Linked genomic sequencing reads contain information that can be used to join sequences together into scaffolds in draft genome assemblies. Existing software for this purpose performs the scaffolding by joining sequences with a gap between them, not considering potential overlaps of contigs....

Descripción completa

Detalles Bibliográficos
Autores principales: Hiltunen, Markus, Ryberg, Martin, Johannesson, Hanna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352505/
https://www.ncbi.nlm.nih.gov/pubmed/33216122
http://dx.doi.org/10.1093/bioinformatics/btaa975
_version_ 1783736194215968768
author Hiltunen, Markus
Ryberg, Martin
Johannesson, Hanna
author_facet Hiltunen, Markus
Ryberg, Martin
Johannesson, Hanna
author_sort Hiltunen, Markus
collection PubMed
description SUMMARY: Linked genomic sequencing reads contain information that can be used to join sequences together into scaffolds in draft genome assemblies. Existing software for this purpose performs the scaffolding by joining sequences with a gap between them, not considering potential overlaps of contigs. We developed ARBitR to create scaffolds where overlaps are taken into account and show that it can accurately recreate regions where draft assemblies are broken. AVAILABILITY AND IMPLEMENTATION: ARBitR is written and implemented in Python3 for Unix-based operative systems. All source code is available at https://github.com/markhilt/ARBitR under the GNU General Public License v3. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8352505
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-83525052021-08-10 ARBitR: an overlap-aware genome assembly scaffolder for linked reads Hiltunen, Markus Ryberg, Martin Johannesson, Hanna Bioinformatics Applications Notes SUMMARY: Linked genomic sequencing reads contain information that can be used to join sequences together into scaffolds in draft genome assemblies. Existing software for this purpose performs the scaffolding by joining sequences with a gap between them, not considering potential overlaps of contigs. We developed ARBitR to create scaffolds where overlaps are taken into account and show that it can accurately recreate regions where draft assemblies are broken. AVAILABILITY AND IMPLEMENTATION: ARBitR is written and implemented in Python3 for Unix-based operative systems. All source code is available at https://github.com/markhilt/ARBitR under the GNU General Public License v3. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-11-20 /pmc/articles/PMC8352505/ /pubmed/33216122 http://dx.doi.org/10.1093/bioinformatics/btaa975 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Hiltunen, Markus
Ryberg, Martin
Johannesson, Hanna
ARBitR: an overlap-aware genome assembly scaffolder for linked reads
title ARBitR: an overlap-aware genome assembly scaffolder for linked reads
title_full ARBitR: an overlap-aware genome assembly scaffolder for linked reads
title_fullStr ARBitR: an overlap-aware genome assembly scaffolder for linked reads
title_full_unstemmed ARBitR: an overlap-aware genome assembly scaffolder for linked reads
title_short ARBitR: an overlap-aware genome assembly scaffolder for linked reads
title_sort arbitr: an overlap-aware genome assembly scaffolder for linked reads
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352505/
https://www.ncbi.nlm.nih.gov/pubmed/33216122
http://dx.doi.org/10.1093/bioinformatics/btaa975
work_keys_str_mv AT hiltunenmarkus arbitranoverlapawaregenomeassemblyscaffolderforlinkedreads
AT rybergmartin arbitranoverlapawaregenomeassemblyscaffolderforlinkedreads
AT johannessonhanna arbitranoverlapawaregenomeassemblyscaffolderforlinkedreads