Cargando…
Fast and accurate de novo genome assembly from long uncorrected reads
The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequence...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411768/ https://www.ncbi.nlm.nih.gov/pubmed/28100585 http://dx.doi.org/10.1101/gr.214270.116 |
_version_ | 1783232862036688896 |
---|---|
author | Vaser, Robert Sović, Ivan Nagarajan, Niranjan Šikić, Mile |
author_facet | Vaser, Robert Sović, Ivan Nagarajan, Niranjan Šikić, Mile |
author_sort | Vaser, Robert |
collection | PubMed |
description | The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequences can be generated efficiently with a SIMD-accelerated, partial-order alignment–based, stand-alone consensus module called Racon. Based on tests with PacBio and Oxford Nanopore data sets, we show that Racon coupled with miniasm enables consensus genomes with similar or better quality than state-of-the-art methods while being an order of magnitude faster. |
format | Online Article Text |
id | pubmed-5411768 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-54117682017-11-01 Fast and accurate de novo genome assembly from long uncorrected reads Vaser, Robert Sović, Ivan Nagarajan, Niranjan Šikić, Mile Genome Res Method The assembly of long reads from Pacific Biosciences and Oxford Nanopore Technologies typically requires resource-intensive error-correction and consensus-generation steps to obtain high-quality assemblies. We show that the error-correction step can be omitted and that high-quality consensus sequences can be generated efficiently with a SIMD-accelerated, partial-order alignment–based, stand-alone consensus module called Racon. Based on tests with PacBio and Oxford Nanopore data sets, we show that Racon coupled with miniasm enables consensus genomes with similar or better quality than state-of-the-art methods while being an order of magnitude faster. Cold Spring Harbor Laboratory Press 2017-05 /pmc/articles/PMC5411768/ /pubmed/28100585 http://dx.doi.org/10.1101/gr.214270.116 Text en © 2017 Vaser et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
spellingShingle | Method Vaser, Robert Sović, Ivan Nagarajan, Niranjan Šikić, Mile Fast and accurate de novo genome assembly from long uncorrected reads |
title | Fast and accurate de novo genome assembly from long uncorrected reads |
title_full | Fast and accurate de novo genome assembly from long uncorrected reads |
title_fullStr | Fast and accurate de novo genome assembly from long uncorrected reads |
title_full_unstemmed | Fast and accurate de novo genome assembly from long uncorrected reads |
title_short | Fast and accurate de novo genome assembly from long uncorrected reads |
title_sort | fast and accurate de novo genome assembly from long uncorrected reads |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5411768/ https://www.ncbi.nlm.nih.gov/pubmed/28100585 http://dx.doi.org/10.1101/gr.214270.116 |
work_keys_str_mv | AT vaserrobert fastandaccuratedenovogenomeassemblyfromlonguncorrectedreads AT sovicivan fastandaccuratedenovogenomeassemblyfromlonguncorrectedreads AT nagarajanniranjan fastandaccuratedenovogenomeassemblyfromlonguncorrectedreads AT sikicmile fastandaccuratedenovogenomeassemblyfromlonguncorrectedreads |