Cargando…

Genome assembly from synthetic long read clouds

Motivation: Despite rapid progress in sequencing technology, assembling de novo the genomes of new species as well as reconstructing complex metagenomes remains major technological challenges. New synthetic long read (SLR) technologies promise significant advances towards these goals; however, their...

Descripción completa

Detalles Bibliográficos
Autores principales: Kuleshov, Volodymyr, Snyder, Michael P., Batzoglou, Serafim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908351/
https://www.ncbi.nlm.nih.gov/pubmed/27307620
http://dx.doi.org/10.1093/bioinformatics/btw267
_version_ 1782437665648410624
author Kuleshov, Volodymyr
Snyder, Michael P.
Batzoglou, Serafim
author_facet Kuleshov, Volodymyr
Snyder, Michael P.
Batzoglou, Serafim
author_sort Kuleshov, Volodymyr
collection PubMed
description Motivation: Despite rapid progress in sequencing technology, assembling de novo the genomes of new species as well as reconstructing complex metagenomes remains major technological challenges. New synthetic long read (SLR) technologies promise significant advances towards these goals; however, their applicability is limited by high sequencing requirements and the inability of current assembly paradigms to cope with combinations of short and long reads. Results: Here, we introduce Architect, a new de novo scaffolder aimed at SLR technologies. Unlike previous assembly strategies, Architect does not require a costly subassembly step; instead it assembles genomes directly from the SLR’s underlying short reads, which we refer to as read clouds. This enables a 4- to 20-fold reduction in sequencing requirements and a 5-fold increase in assembly contiguity on both genomic and metagenomic datasets relative to state-of-the-art assembly strategies aimed directly at fully subassembled long reads. Availability and Implementation: Our source code is freely available at https://github.com/kuleshov/architect. Contact: kuleshov@stanford.edu
format Online
Article
Text
id pubmed-4908351
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-49083512016-06-17 Genome assembly from synthetic long read clouds Kuleshov, Volodymyr Snyder, Michael P. Batzoglou, Serafim Bioinformatics Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida Motivation: Despite rapid progress in sequencing technology, assembling de novo the genomes of new species as well as reconstructing complex metagenomes remains major technological challenges. New synthetic long read (SLR) technologies promise significant advances towards these goals; however, their applicability is limited by high sequencing requirements and the inability of current assembly paradigms to cope with combinations of short and long reads. Results: Here, we introduce Architect, a new de novo scaffolder aimed at SLR technologies. Unlike previous assembly strategies, Architect does not require a costly subassembly step; instead it assembles genomes directly from the SLR’s underlying short reads, which we refer to as read clouds. This enables a 4- to 20-fold reduction in sequencing requirements and a 5-fold increase in assembly contiguity on both genomic and metagenomic datasets relative to state-of-the-art assembly strategies aimed directly at fully subassembled long reads. Availability and Implementation: Our source code is freely available at https://github.com/kuleshov/architect. Contact: kuleshov@stanford.edu Oxford University Press 2016-06-15 2016-06-11 /pmc/articles/PMC4908351/ /pubmed/27307620 http://dx.doi.org/10.1093/bioinformatics/btw267 Text en © The Author 2016. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida
Kuleshov, Volodymyr
Snyder, Michael P.
Batzoglou, Serafim
Genome assembly from synthetic long read clouds
title Genome assembly from synthetic long read clouds
title_full Genome assembly from synthetic long read clouds
title_fullStr Genome assembly from synthetic long read clouds
title_full_unstemmed Genome assembly from synthetic long read clouds
title_short Genome assembly from synthetic long read clouds
title_sort genome assembly from synthetic long read clouds
topic Ismb 2016 Proceedings July 8 to July 12, 2016, Orlando, Florida
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908351/
https://www.ncbi.nlm.nih.gov/pubmed/27307620
http://dx.doi.org/10.1093/bioinformatics/btw267
work_keys_str_mv AT kuleshovvolodymyr genomeassemblyfromsyntheticlongreadclouds
AT snydermichaelp genomeassemblyfromsyntheticlongreadclouds
AT batzoglouserafim genomeassemblyfromsyntheticlongreadclouds