Nanopore sequencing and assembly of a human genome with ultra-long reads

We report the sequencing and assembly of a reference genome for the human GM12878 Utah/Ceph cell line using the MinION (Oxford Nanopore Technologies) nanopore sequencer. 91.2 Gb of sequence data, representing ∼30× theoretical coverage, were produced. Reference-based alignment enabled detection of la...

Bibliographic Details
Main authors: Jain, Miten, Koren, Sergey, Miga, Karen H, Quick, Josh, Rand, Arthur C, Sasani, Thomas A, Tyson, John R, Beggs, Andrew D, Dilthey, Alexander T, Fiddes, Ian T, Malla, Sunir, Marriott, Hannah, Nieto, Tom, O'Grady, Justin, Olsen, Hugh E, Pedersen, Brent S, Rhie, Arang, Richardson, Hollian, Quinlan, Aaron R, Snutch, Terrance P, Tee, Louise, Paten, Benedict, Phillippy, Adam M, Simpson, Jared T, Loman, Nicholas J, Loose, Matthew
Format: Online Article Text
Language: English
Published: Nature Publishing Group US 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5889714/
https://www.ncbi.nlm.nih.gov/pubmed/29431738
http://dx.doi.org/10.1038/nbt.4060
author Jain, Miten
Koren, Sergey
Miga, Karen H
Quick, Josh
Rand, Arthur C
Sasani, Thomas A
Tyson, John R
Beggs, Andrew D
Dilthey, Alexander T
Fiddes, Ian T
Malla, Sunir
Marriott, Hannah
Nieto, Tom
O'Grady, Justin
Olsen, Hugh E
Pedersen, Brent S
Rhie, Arang
Richardson, Hollian
Quinlan, Aaron R
Snutch, Terrance P
Tee, Louise
Paten, Benedict
Phillippy, Adam M
Simpson, Jared T
Loman, Nicholas J
Loose, Matthew
author_sort Jain, Miten
collection PubMed
description We report the sequencing and assembly of a reference genome for the human GM12878 Utah/Ceph cell line using the MinION (Oxford Nanopore Technologies) nanopore sequencer. 91.2 Gb of sequence data, representing ∼30× theoretical coverage, were produced. Reference-based alignment enabled detection of large structural variants and epigenetic modifications. De novo assembly of nanopore reads alone yielded a contiguous assembly (NG50 ∼3 Mb). We developed a protocol to generate ultra-long reads (N50 > 100 kb, read lengths up to 882 kb). Incorporating an additional 5× coverage of these ultra-long reads more than doubled the assembly contiguity (NG50 ∼6.4 Mb). The final assembled genome was 2,867 million bases in size, covering 85.8% of the reference. Assembly accuracy, after incorporating complementary short-read sequencing data, exceeded 99.8%. Ultra-long reads enabled assembly and phasing of the 4-Mb major histocompatibility complex (MHC) locus in its entirety, measurement of telomere repeat length, and closure of gaps in the reference human genome assembly GRCh38. SUPPLEMENTARY INFORMATION: The online version of this article (doi:10.1038/nbt.4060) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5889714
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-5889714 2018-05-04 Nanopore sequencing and assembly of a human genome with ultra-long reads Jain, Miten Koren, Sergey Miga, Karen H Quick, Josh Rand, Arthur C Sasani, Thomas A Tyson, John R Beggs, Andrew D Dilthey, Alexander T Fiddes, Ian T Malla, Sunir Marriott, Hannah Nieto, Tom O'Grady, Justin Olsen, Hugh E Pedersen, Brent S Rhie, Arang Richardson, Hollian Quinlan, Aaron R Snutch, Terrance P Tee, Louise Paten, Benedict Phillippy, Adam M Simpson, Jared T Loman, Nicholas J Loose, Matthew Nat Biotechnol Article We report the sequencing and assembly of a reference genome for the human GM12878 Utah/Ceph cell line using the MinION (Oxford Nanopore Technologies) nanopore sequencer. 91.2 Gb of sequence data, representing ∼30× theoretical coverage, were produced. Reference-based alignment enabled detection of large structural variants and epigenetic modifications. De novo assembly of nanopore reads alone yielded a contiguous assembly (NG50 ∼3 Mb). We developed a protocol to generate ultra-long reads (N50 > 100 kb, read lengths up to 882 kb). Incorporating an additional 5× coverage of these ultra-long reads more than doubled the assembly contiguity (NG50 ∼6.4 Mb). The final assembled genome was 2,867 million bases in size, covering 85.8% of the reference. Assembly accuracy, after incorporating complementary short-read sequencing data, exceeded 99.8%. Ultra-long reads enabled assembly and phasing of the 4-Mb major histocompatibility complex (MHC) locus in its entirety, measurement of telomere repeat length, and closure of gaps in the reference human genome assembly GRCh38. SUPPLEMENTARY INFORMATION: The online version of this article (doi:10.1038/nbt.4060) contains supplementary material, which is available to authorized users.
Nature Publishing Group US 2018-01-29 2018 /pmc/articles/PMC5889714/ /pubmed/29431738 http://dx.doi.org/10.1038/nbt.4060 Text en © The Author(s) 2018 This work is licensed under a Creative Commons Attribution 4.0 International licence. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons licence, users will need to obtain permission from the licence holder to reproduce the material. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
title Nanopore sequencing and assembly of a human genome with ultra-long reads
topic Article