Cargando…
ARBitR: an overlap-aware genome assembly scaffolder for linked reads
SUMMARY: Linked genomic sequencing reads contain information that can be used to join sequences together into scaffolds in draft genome assemblies. Existing software for this purpose performs the scaffolding by joining sequences with a gap between them, not considering potential overlaps of contigs....
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352505/ https://www.ncbi.nlm.nih.gov/pubmed/33216122 http://dx.doi.org/10.1093/bioinformatics/btaa975 |
_version_ | 1783736194215968768 |
---|---|
author | Hiltunen, Markus Ryberg, Martin Johannesson, Hanna |
author_facet | Hiltunen, Markus Ryberg, Martin Johannesson, Hanna |
author_sort | Hiltunen, Markus |
collection | PubMed |
description | SUMMARY: Linked genomic sequencing reads contain information that can be used to join sequences together into scaffolds in draft genome assemblies. Existing software for this purpose performs the scaffolding by joining sequences with a gap between them, not considering potential overlaps of contigs. We developed ARBitR to create scaffolds where overlaps are taken into account and show that it can accurately recreate regions where draft assemblies are broken. AVAILABILITY AND IMPLEMENTATION: ARBitR is written and implemented in Python3 for Unix-based operative systems. All source code is available at https://github.com/markhilt/ARBitR under the GNU General Public License v3. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8352505 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-83525052021-08-10 ARBitR: an overlap-aware genome assembly scaffolder for linked reads Hiltunen, Markus Ryberg, Martin Johannesson, Hanna Bioinformatics Applications Notes SUMMARY: Linked genomic sequencing reads contain information that can be used to join sequences together into scaffolds in draft genome assemblies. Existing software for this purpose performs the scaffolding by joining sequences with a gap between them, not considering potential overlaps of contigs. We developed ARBitR to create scaffolds where overlaps are taken into account and show that it can accurately recreate regions where draft assemblies are broken. AVAILABILITY AND IMPLEMENTATION: ARBitR is written and implemented in Python3 for Unix-based operative systems. All source code is available at https://github.com/markhilt/ARBitR under the GNU General Public License v3. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-11-20 /pmc/articles/PMC8352505/ /pubmed/33216122 http://dx.doi.org/10.1093/bioinformatics/btaa975 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Applications Notes Hiltunen, Markus Ryberg, Martin Johannesson, Hanna ARBitR: an overlap-aware genome assembly scaffolder for linked reads |
title | ARBitR: an overlap-aware genome assembly scaffolder for linked reads |
title_full | ARBitR: an overlap-aware genome assembly scaffolder for linked reads |
title_fullStr | ARBitR: an overlap-aware genome assembly scaffolder for linked reads |
title_full_unstemmed | ARBitR: an overlap-aware genome assembly scaffolder for linked reads |
title_short | ARBitR: an overlap-aware genome assembly scaffolder for linked reads |
title_sort | arbitr: an overlap-aware genome assembly scaffolder for linked reads |
topic | Applications Notes |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352505/ https://www.ncbi.nlm.nih.gov/pubmed/33216122 http://dx.doi.org/10.1093/bioinformatics/btaa975 |
work_keys_str_mv | AT hiltunenmarkus arbitranoverlapawaregenomeassemblyscaffolderforlinkedreads AT rybergmartin arbitranoverlapawaregenomeassemblyscaffolderforlinkedreads AT johannessonhanna arbitranoverlapawaregenomeassemblyscaffolderforlinkedreads |