Cargando…
Nanopore sequencing and assembly of a human genome with ultra-long reads
We report the sequencing and assembly of a reference genome for the human GM12878 Utah/Ceph cell line using the MinION (Oxford Nanopore Technologies) nanopore sequencer. 91.2 Gb of sequence data, representing ∼30× theoretical coverage, were produced. Reference-based alignment enabled detection of la...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group US
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5889714/ https://www.ncbi.nlm.nih.gov/pubmed/29431738 http://dx.doi.org/10.1038/nbt.4060 |
_version_ | 1783312746028204032 |
---|---|
author | Jain, Miten Koren, Sergey Miga, Karen H Quick, Josh Rand, Arthur C Sasani, Thomas A Tyson, John R Beggs, Andrew D Dilthey, Alexander T Fiddes, Ian T Malla, Sunir Marriott, Hannah Nieto, Tom O'Grady, Justin Olsen, Hugh E Pedersen, Brent S Rhie, Arang Richardson, Hollian Quinlan, Aaron R Snutch, Terrance P Tee, Louise Paten, Benedict Phillippy, Adam M Simpson, Jared T Loman, Nicholas J Loose, Matthew |
author_facet | Jain, Miten Koren, Sergey Miga, Karen H Quick, Josh Rand, Arthur C Sasani, Thomas A Tyson, John R Beggs, Andrew D Dilthey, Alexander T Fiddes, Ian T Malla, Sunir Marriott, Hannah Nieto, Tom O'Grady, Justin Olsen, Hugh E Pedersen, Brent S Rhie, Arang Richardson, Hollian Quinlan, Aaron R Snutch, Terrance P Tee, Louise Paten, Benedict Phillippy, Adam M Simpson, Jared T Loman, Nicholas J Loose, Matthew |
author_sort | Jain, Miten |
collection | PubMed |
description | We report the sequencing and assembly of a reference genome for the human GM12878 Utah/Ceph cell line using the MinION (Oxford Nanopore Technologies) nanopore sequencer. 91.2 Gb of sequence data, representing ∼30× theoretical coverage, were produced. Reference-based alignment enabled detection of large structural variants and epigenetic modifications. De novo assembly of nanopore reads alone yielded a contiguous assembly (NG50 ∼3 Mb). We developed a protocol to generate ultra-long reads (N50 > 100 kb, read lengths up to 882 kb). Incorporating an additional 5× coverage of these ultra-long reads more than doubled the assembly contiguity (NG50 ∼6.4 Mb). The final assembled genome was 2,867 million bases in size, covering 85.8% of the reference. Assembly accuracy, after incorporating complementary short-read sequencing data, exceeded 99.8%. Ultra-long reads enabled assembly and phasing of the 4-Mb major histocompatibility complex (MHC) locus in its entirety, measurement of telomere repeat length, and closure of gaps in the reference human genome assembly GRCh38. SUPPLEMENTARY INFORMATION: The online version of this article (doi:10.1038/nbt.4060) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5889714 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group US |
record_format | MEDLINE/PubMed |
spelling | pubmed-58897142018-05-04 Nanopore sequencing and assembly of a human genome with ultra-long reads Jain, Miten Koren, Sergey Miga, Karen H Quick, Josh Rand, Arthur C Sasani, Thomas A Tyson, John R Beggs, Andrew D Dilthey, Alexander T Fiddes, Ian T Malla, Sunir Marriott, Hannah Nieto, Tom O'Grady, Justin Olsen, Hugh E Pedersen, Brent S Rhie, Arang Richardson, Hollian Quinlan, Aaron R Snutch, Terrance P Tee, Louise Paten, Benedict Phillippy, Adam M Simpson, Jared T Loman, Nicholas J Loose, Matthew Nat Biotechnol Article We report the sequencing and assembly of a reference genome for the human GM12878 Utah/Ceph cell line using the MinION (Oxford Nanopore Technologies) nanopore sequencer. 91.2 Gb of sequence data, representing ∼30× theoretical coverage, were produced. Reference-based alignment enabled detection of large structural variants and epigenetic modifications. De novo assembly of nanopore reads alone yielded a contiguous assembly (NG50 ∼3 Mb). We developed a protocol to generate ultra-long reads (N50 > 100 kb, read lengths up to 882 kb). Incorporating an additional 5× coverage of these ultra-long reads more than doubled the assembly contiguity (NG50 ∼6.4 Mb). The final assembled genome was 2,867 million bases in size, covering 85.8% of the reference. Assembly accuracy, after incorporating complementary short-read sequencing data, exceeded 99.8%. Ultra-long reads enabled assembly and phasing of the 4-Mb major histocompatibility complex (MHC) locus in its entirety, measurement of telomere repeat length, and closure of gaps in the reference human genome assembly GRCh38. SUPPLEMENTARY INFORMATION: The online version of this article (doi:10.1038/nbt.4060) contains supplementary material, which is available to authorized users. Nature Publishing Group US 2018-01-29 2018 /pmc/articles/PMC5889714/ /pubmed/29431738 http://dx.doi.org/10.1038/nbt.4060 Text en © The Author(s) 2018 This work is licensed under a Creative Commons Attribution 4.0 International licence. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons licence, users will need to obtain permission from the licence holder to reproduce the material. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Jain, Miten Koren, Sergey Miga, Karen H Quick, Josh Rand, Arthur C Sasani, Thomas A Tyson, John R Beggs, Andrew D Dilthey, Alexander T Fiddes, Ian T Malla, Sunir Marriott, Hannah Nieto, Tom O'Grady, Justin Olsen, Hugh E Pedersen, Brent S Rhie, Arang Richardson, Hollian Quinlan, Aaron R Snutch, Terrance P Tee, Louise Paten, Benedict Phillippy, Adam M Simpson, Jared T Loman, Nicholas J Loose, Matthew Nanopore sequencing and assembly of a human genome with ultra-long reads |
title | Nanopore sequencing and assembly of a human genome with ultra-long reads |
title_full | Nanopore sequencing and assembly of a human genome with ultra-long reads |
title_fullStr | Nanopore sequencing and assembly of a human genome with ultra-long reads |
title_full_unstemmed | Nanopore sequencing and assembly of a human genome with ultra-long reads |
title_short | Nanopore sequencing and assembly of a human genome with ultra-long reads |
title_sort | nanopore sequencing and assembly of a human genome with ultra-long reads |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5889714/ https://www.ncbi.nlm.nih.gov/pubmed/29431738 http://dx.doi.org/10.1038/nbt.4060 |
work_keys_str_mv | AT jainmiten nanoporesequencingandassemblyofahumangenomewithultralongreads AT korensergey nanoporesequencingandassemblyofahumangenomewithultralongreads AT migakarenh nanoporesequencingandassemblyofahumangenomewithultralongreads AT quickjosh nanoporesequencingandassemblyofahumangenomewithultralongreads AT randarthurc nanoporesequencingandassemblyofahumangenomewithultralongreads AT sasanithomasa nanoporesequencingandassemblyofahumangenomewithultralongreads AT tysonjohnr nanoporesequencingandassemblyofahumangenomewithultralongreads AT beggsandrewd nanoporesequencingandassemblyofahumangenomewithultralongreads AT diltheyalexandert nanoporesequencingandassemblyofahumangenomewithultralongreads AT fiddesiant nanoporesequencingandassemblyofahumangenomewithultralongreads AT mallasunir nanoporesequencingandassemblyofahumangenomewithultralongreads AT marriotthannah nanoporesequencingandassemblyofahumangenomewithultralongreads AT nietotom nanoporesequencingandassemblyofahumangenomewithultralongreads AT ogradyjustin nanoporesequencingandassemblyofahumangenomewithultralongreads AT olsenhughe nanoporesequencingandassemblyofahumangenomewithultralongreads AT pedersenbrents nanoporesequencingandassemblyofahumangenomewithultralongreads AT rhiearang nanoporesequencingandassemblyofahumangenomewithultralongreads AT richardsonhollian nanoporesequencingandassemblyofahumangenomewithultralongreads AT quinlanaaronr nanoporesequencingandassemblyofahumangenomewithultralongreads AT snutchterrancep nanoporesequencingandassemblyofahumangenomewithultralongreads AT teelouise nanoporesequencingandassemblyofahumangenomewithultralongreads AT patenbenedict nanoporesequencingandassemblyofahumangenomewithultralongreads AT phillippyadamm nanoporesequencingandassemblyofahumangenomewithultralongreads AT simpsonjaredt nanoporesequencingandassemblyofahumangenomewithultralongreads AT lomannicholasj nanoporesequencingandassemblyofahumangenomewithultralongreads AT loosematthew nanoporesequencingandassemblyofahumangenomewithultralongreads |