Cargando…

Optimizing experimental design for genome sequencing and assembly with Oxford Nanopore Technologies

High quality reference genome sequences are the core of modern genomics. Oxford Nanopore Technologies (ONT) produces inexpensive DNA sequences, but has high error rates, which make sequence assembly and analysis difficult as genome size and complexity increases. Robust experimental design is necessa...

Descripción completa

Detalles Bibliográficos
Autores principales: Sutton, John M., Millwood, Joshua D., Case McCormack, A., Fierst, Janna L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: GigaScience Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9650304/
https://www.ncbi.nlm.nih.gov/pubmed/36824342
http://dx.doi.org/10.46471/gigabyte.27
_version_ 1784827984930144256
author Sutton, John M.
Millwood, Joshua D.
Case McCormack, A.
Fierst, Janna L.
author_facet Sutton, John M.
Millwood, Joshua D.
Case McCormack, A.
Fierst, Janna L.
author_sort Sutton, John M.
collection PubMed
description High quality reference genome sequences are the core of modern genomics. Oxford Nanopore Technologies (ONT) produces inexpensive DNA sequences, but has high error rates, which make sequence assembly and analysis difficult as genome size and complexity increases. Robust experimental design is necessary for ONT genome sequencing and assembly, but few studies have addressed eukaryotic organisms. Here, we present novel results using simulated and empirical ONT and DNA libraries to identify best practices for sequencing and assembly for several model species. We find that the unique error structure of ONT libraries causes errors to accumulate and assembly statistics plateau as sequence depth increases. High-quality assembled eukaryotic sequences require high-molecular-weight DNA extractions that increase sequence read length, and computational protocols that reduce error through pre-assembly correction and read selection. Our quantitative results will be helpful for researchers seeking guidance for de novo assembly projects.
format Online
Article
Text
id pubmed-9650304
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher GigaScience Press
record_format MEDLINE/PubMed
spelling pubmed-96503042023-02-22 Optimizing experimental design for genome sequencing and assembly with Oxford Nanopore Technologies Sutton, John M. Millwood, Joshua D. Case McCormack, A. Fierst, Janna L. GigaByte Technical Release High quality reference genome sequences are the core of modern genomics. Oxford Nanopore Technologies (ONT) produces inexpensive DNA sequences, but has high error rates, which make sequence assembly and analysis difficult as genome size and complexity increases. Robust experimental design is necessary for ONT genome sequencing and assembly, but few studies have addressed eukaryotic organisms. Here, we present novel results using simulated and empirical ONT and DNA libraries to identify best practices for sequencing and assembly for several model species. We find that the unique error structure of ONT libraries causes errors to accumulate and assembly statistics plateau as sequence depth increases. High-quality assembled eukaryotic sequences require high-molecular-weight DNA extractions that increase sequence read length, and computational protocols that reduce error through pre-assembly correction and read selection. Our quantitative results will be helpful for researchers seeking guidance for de novo assembly projects. GigaScience Press 2021-07-13 /pmc/articles/PMC9650304/ /pubmed/36824342 http://dx.doi.org/10.46471/gigabyte.27 Text en © The Author(s) 2021. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Technical Release
Sutton, John M.
Millwood, Joshua D.
Case McCormack, A.
Fierst, Janna L.
Optimizing experimental design for genome sequencing and assembly with Oxford Nanopore Technologies
title Optimizing experimental design for genome sequencing and assembly with Oxford Nanopore Technologies
title_full Optimizing experimental design for genome sequencing and assembly with Oxford Nanopore Technologies
title_fullStr Optimizing experimental design for genome sequencing and assembly with Oxford Nanopore Technologies
title_full_unstemmed Optimizing experimental design for genome sequencing and assembly with Oxford Nanopore Technologies
title_short Optimizing experimental design for genome sequencing and assembly with Oxford Nanopore Technologies
title_sort optimizing experimental design for genome sequencing and assembly with oxford nanopore technologies
topic Technical Release
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9650304/
https://www.ncbi.nlm.nih.gov/pubmed/36824342
http://dx.doi.org/10.46471/gigabyte.27
work_keys_str_mv AT suttonjohnm optimizingexperimentaldesignforgenomesequencingandassemblywithoxfordnanoporetechnologies
AT millwoodjoshuad optimizingexperimentaldesignforgenomesequencingandassemblywithoxfordnanoporetechnologies
AT casemccormacka optimizingexperimentaldesignforgenomesequencingandassemblywithoxfordnanoporetechnologies
AT fierstjannal optimizingexperimentaldesignforgenomesequencingandassemblywithoxfordnanoporetechnologies