Cargando…

ARCS: scaffolding genome drafts with linked reads

MOTIVATION: Sequencing of human genomes is now routine, and assembly of shotgun reads is increasingly feasible. However, assemblies often fail to inform about chromosome-scale structure due to a lack of linkage information over long stretches of DNA—a shortcoming that is being addressed by new seque...

Descripción completa

Detalles Bibliográficos
Autores principales: Yeo, Sarah, Coombe, Lauren, Warren, René L, Chu, Justin, Birol, Inanç
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030987/
https://www.ncbi.nlm.nih.gov/pubmed/29069293
http://dx.doi.org/10.1093/bioinformatics/btx675
_version_ 1783337233361666048
author Yeo, Sarah
Coombe, Lauren
Warren, René L
Chu, Justin
Birol, Inanç
author_facet Yeo, Sarah
Coombe, Lauren
Warren, René L
Chu, Justin
Birol, Inanç
author_sort Yeo, Sarah
collection PubMed
description MOTIVATION: Sequencing of human genomes is now routine, and assembly of shotgun reads is increasingly feasible. However, assemblies often fail to inform about chromosome-scale structure due to a lack of linkage information over long stretches of DNA—a shortcoming that is being addressed by new sequencing protocols, such as the GemCode and Chromium linked reads from 10 × Genomics. RESULTS: Here, we present ARCS, an application that utilizes the barcoding information contained in linked reads to further organize draft genomes into highly contiguous assemblies. We show how the contiguity of an ABySS H.sapiens genome assembly can be increased over six-fold, using moderate coverage (25-fold) Chromium data. We expect ARCS to have broad utility in harnessing the barcoding information contained in linked read data for connecting high-quality sequences in genome assembly drafts. AVAILABILITY AND IMPLEMENTATION: https://github.com/bcgsc/ARCS/ SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-6030987
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60309872018-07-10 ARCS: scaffolding genome drafts with linked reads Yeo, Sarah Coombe, Lauren Warren, René L Chu, Justin Birol, Inanç Bioinformatics Original Papers MOTIVATION: Sequencing of human genomes is now routine, and assembly of shotgun reads is increasingly feasible. However, assemblies often fail to inform about chromosome-scale structure due to a lack of linkage information over long stretches of DNA—a shortcoming that is being addressed by new sequencing protocols, such as the GemCode and Chromium linked reads from 10 × Genomics. RESULTS: Here, we present ARCS, an application that utilizes the barcoding information contained in linked reads to further organize draft genomes into highly contiguous assemblies. We show how the contiguity of an ABySS H.sapiens genome assembly can be increased over six-fold, using moderate coverage (25-fold) Chromium data. We expect ARCS to have broad utility in harnessing the barcoding information contained in linked read data for connecting high-quality sequences in genome assembly drafts. AVAILABILITY AND IMPLEMENTATION: https://github.com/bcgsc/ARCS/ SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2018-03-01 2017-10-23 /pmc/articles/PMC6030987/ /pubmed/29069293 http://dx.doi.org/10.1093/bioinformatics/btx675 Text en © The Author 2017. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Yeo, Sarah
Coombe, Lauren
Warren, René L
Chu, Justin
Birol, Inanç
ARCS: scaffolding genome drafts with linked reads
title ARCS: scaffolding genome drafts with linked reads
title_full ARCS: scaffolding genome drafts with linked reads
title_fullStr ARCS: scaffolding genome drafts with linked reads
title_full_unstemmed ARCS: scaffolding genome drafts with linked reads
title_short ARCS: scaffolding genome drafts with linked reads
title_sort arcs: scaffolding genome drafts with linked reads
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030987/
https://www.ncbi.nlm.nih.gov/pubmed/29069293
http://dx.doi.org/10.1093/bioinformatics/btx675
work_keys_str_mv AT yeosarah arcsscaffoldinggenomedraftswithlinkedreads
AT coombelauren arcsscaffoldinggenomedraftswithlinkedreads
AT warrenrenel arcsscaffoldinggenomedraftswithlinkedreads
AT chujustin arcsscaffoldinggenomedraftswithlinkedreads
AT birolinanc arcsscaffoldinggenomedraftswithlinkedreads