Cargando…

Fast and accurate de novo genome assembly from long uncorrected reads

The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequence...

Descripción completa

Detalles Bibliográficos
Autores principales: Vaser, Robert, Sović, Ivan, Nagarajan, Niranjan, Šikić, Mile
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411768/
https://www.ncbi.nlm.nih.gov/pubmed/28100585
http://dx.doi.org/10.1101/gr.214270.116
_version_ 1783232862036688896
author Vaser, Robert
Sović, Ivan
Nagarajan, Niranjan
Šikić, Mile
author_facet Vaser, Robert
Sović, Ivan
Nagarajan, Niranjan
Šikić, Mile
author_sort Vaser, Robert
collection PubMed
description The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequences can be generated efficiently with a SIMD-accelerated, partial-order alignment–based, stand-alone consensus module called Racon. Based on tests with PacBio and Oxford Nanopore data sets, we show that Racon coupled with miniasm enables consensus genomes with similar or better quality than state-of-the-art methods while being an order of magnitude faster.
format Online
Article
Text
id pubmed-5411768
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-54117682017-11-01 Fast and accurate de novo genome assembly from long uncorrected reads Vaser, Robert Sović, Ivan Nagarajan, Niranjan Šikić, Mile Genome Res Method The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequences can be generated efficiently with a SIMD-accelerated, partial-order alignment–based, stand-alone consensus module called Racon. Based on tests with PacBio and Oxford Nanopore data sets, we show that Racon coupled with miniasm enables consensus genomes with similar or better quality than state-of-the-art methods while being an order of magnitude faster. Cold Spring Harbor Laboratory Press 2017-05 /pmc/articles/PMC5411768/ /pubmed/28100585 http://dx.doi.org/10.1101/gr.214270.116 Text en © 2017 Vaser et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Vaser, Robert
Sović, Ivan
Nagarajan, Niranjan
Šikić, Mile
Fast and accurate de novo genome assembly from long uncorrected reads
title Fast and accurate de novo genome assembly from long uncorrected reads
title_full Fast and accurate de novo genome assembly from long uncorrected reads
title_fullStr Fast and accurate de novo genome assembly from long uncorrected reads
title_full_unstemmed Fast and accurate de novo genome assembly from long uncorrected reads
title_short Fast and accurate de novo genome assembly from long uncorrected reads
title_sort fast and accurate de novo genome assembly from long uncorrected reads
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411768/
https://www.ncbi.nlm.nih.gov/pubmed/28100585
http://dx.doi.org/10.1101/gr.214270.116
work_keys_str_mv AT vaserrobert fastandaccuratedenovogenomeassemblyfromlonguncorrectedreads
AT sovicivan fastandaccuratedenovogenomeassemblyfromlonguncorrectedreads
AT nagarajanniranjan fastandaccuratedenovogenomeassemblyfromlonguncorrectedreads
AT sikicmile fastandaccuratedenovogenomeassemblyfromlonguncorrectedreads