Cargando…

Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation

Long-read sequencing technologies substantially overcome the limitations of short-reads but to date have not been considered as feasible replacement at scale due to a combination of being too expensive, not scalable enough, or too error-prone. Here, we develop an efficient and scalable wet lab and c...

Descripción completa

Detalles Bibliográficos
Autores principales: Kolmogorov, Mikhail, Billingsley, Kimberley J., Mastoras, Mira, Meredith, Melissa, Monlong, Jean, Lorig-Roach, Ryan, Asri, Mobin, Jerez, Pilar Alvarez, Malik, Laksh, Dewan, Ramita, Reed, Xylena, Genner, Rylee M., Daida, Kensuke, Behera, Sairam, Shafin, Kishwar, Pesout, Trevor, Prabakaran, Jeshuwin, Carnevali, Paolo, Yang, Jianzhi, Rhie, Arang, Scholz, Sonja W., Traynor, Bryan J., Miga, Karen H., Jain, Miten, Timp, Winston, Phillippy, Adam M., Chaisson, Mark, Sedlazeck, Fritz J., Blauwendraat, Cornelis, Paten, Benedict
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882142/
https://www.ncbi.nlm.nih.gov/pubmed/36711673
http://dx.doi.org/10.1101/2023.01.12.523790
_version_ 1784879244096045056
author Kolmogorov, Mikhail
Billingsley, Kimberley J.
Mastoras, Mira
Meredith, Melissa
Monlong, Jean
Lorig-Roach, Ryan
Asri, Mobin
Jerez, Pilar Alvarez
Malik, Laksh
Dewan, Ramita
Reed, Xylena
Genner, Rylee M.
Daida, Kensuke
Behera, Sairam
Shafin, Kishwar
Pesout, Trevor
Prabakaran, Jeshuwin
Carnevali, Paolo
Yang, Jianzhi
Rhie, Arang
Scholz, Sonja W.
Traynor, Bryan J.
Miga, Karen H.
Jain, Miten
Timp, Winston
Phillippy, Adam M.
Chaisson, Mark
Sedlazeck, Fritz J.
Blauwendraat, Cornelis
Paten, Benedict
author_facet Kolmogorov, Mikhail
Billingsley, Kimberley J.
Mastoras, Mira
Meredith, Melissa
Monlong, Jean
Lorig-Roach, Ryan
Asri, Mobin
Jerez, Pilar Alvarez
Malik, Laksh
Dewan, Ramita
Reed, Xylena
Genner, Rylee M.
Daida, Kensuke
Behera, Sairam
Shafin, Kishwar
Pesout, Trevor
Prabakaran, Jeshuwin
Carnevali, Paolo
Yang, Jianzhi
Rhie, Arang
Scholz, Sonja W.
Traynor, Bryan J.
Miga, Karen H.
Jain, Miten
Timp, Winston
Phillippy, Adam M.
Chaisson, Mark
Sedlazeck, Fritz J.
Blauwendraat, Cornelis
Paten, Benedict
author_sort Kolmogorov, Mikhail
collection PubMed
description Long-read sequencing technologies substantially overcome the limitations of short-reads but to date have not been considered as feasible replacement at scale due to a combination of being too expensive, not scalable enough, or too error-prone. Here, we develop an efficient and scalable wet lab and computational protocol for Oxford Nanopore Technologies (ONT) long-read sequencing that seeks to provide a genuine alternative to short-reads for large-scale genomics projects. We applied our protocol to cell lines and brain tissue samples as part of a pilot project for the NIH Center for Alzheimer’s and Related Dementias (CARD). Using a single PromethION flow cell, we can detect SNPs with F1-score better than Illumina short-read sequencing. Small indel calling remains difficult within homopolymers and tandem repeats, but is comparable to Illumina calls elsewhere. Further, we can discover structural variants with F1-score comparable to state-of-the-art methods involving Pacific Biosciences HiFi sequencing and trio information (but at a lower cost and greater throughput). Using ONT-based phasing, we can then combine and phase small and structural variants at megabase scales. Our protocol also produces highly accurate, haplotype-specific methylation calls. Overall, this makes large-scale long-read sequencing projects feasible; the protocol is currently being used to sequence thousands of brain-based genomes as a part of the NIH CARD initiative. We provide the protocol and software as open-source integrated pipelines for generating phased variant calls and assemblies.
format Online
Article
Text
id pubmed-9882142
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-98821422023-01-28 Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation Kolmogorov, Mikhail Billingsley, Kimberley J. Mastoras, Mira Meredith, Melissa Monlong, Jean Lorig-Roach, Ryan Asri, Mobin Jerez, Pilar Alvarez Malik, Laksh Dewan, Ramita Reed, Xylena Genner, Rylee M. Daida, Kensuke Behera, Sairam Shafin, Kishwar Pesout, Trevor Prabakaran, Jeshuwin Carnevali, Paolo Yang, Jianzhi Rhie, Arang Scholz, Sonja W. Traynor, Bryan J. Miga, Karen H. Jain, Miten Timp, Winston Phillippy, Adam M. Chaisson, Mark Sedlazeck, Fritz J. Blauwendraat, Cornelis Paten, Benedict bioRxiv Article Long-read sequencing technologies substantially overcome the limitations of short-reads but to date have not been considered as feasible replacement at scale due to a combination of being too expensive, not scalable enough, or too error-prone. Here, we develop an efficient and scalable wet lab and computational protocol for Oxford Nanopore Technologies (ONT) long-read sequencing that seeks to provide a genuine alternative to short-reads for large-scale genomics projects. We applied our protocol to cell lines and brain tissue samples as part of a pilot project for the NIH Center for Alzheimer’s and Related Dementias (CARD). Using a single PromethION flow cell, we can detect SNPs with F1-score better than Illumina short-read sequencing. Small indel calling remains difficult within homopolymers and tandem repeats, but is comparable to Illumina calls elsewhere. Further, we can discover structural variants with F1-score comparable to state-of-the-art methods involving Pacific Biosciences HiFi sequencing and trio information (but at a lower cost and greater throughput). Using ONT-based phasing, we can then combine and phase small and structural variants at megabase scales. Our protocol also produces highly accurate, haplotype-specific methylation calls. Overall, this makes large-scale long-read sequencing projects feasible; the protocol is currently being used to sequence thousands of brain-based genomes as a part of the NIH CARD initiative. We provide the protocol and software as open-source integrated pipelines for generating phased variant calls and assemblies. Cold Spring Harbor Laboratory 2023-04-05 /pmc/articles/PMC9882142/ /pubmed/36711673 http://dx.doi.org/10.1101/2023.01.12.523790 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Kolmogorov, Mikhail
Billingsley, Kimberley J.
Mastoras, Mira
Meredith, Melissa
Monlong, Jean
Lorig-Roach, Ryan
Asri, Mobin
Jerez, Pilar Alvarez
Malik, Laksh
Dewan, Ramita
Reed, Xylena
Genner, Rylee M.
Daida, Kensuke
Behera, Sairam
Shafin, Kishwar
Pesout, Trevor
Prabakaran, Jeshuwin
Carnevali, Paolo
Yang, Jianzhi
Rhie, Arang
Scholz, Sonja W.
Traynor, Bryan J.
Miga, Karen H.
Jain, Miten
Timp, Winston
Phillippy, Adam M.
Chaisson, Mark
Sedlazeck, Fritz J.
Blauwendraat, Cornelis
Paten, Benedict
Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation
title Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation
title_full Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation
title_fullStr Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation
title_full_unstemmed Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation
title_short Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation
title_sort scalable nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882142/
https://www.ncbi.nlm.nih.gov/pubmed/36711673
http://dx.doi.org/10.1101/2023.01.12.523790
work_keys_str_mv AT kolmogorovmikhail scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT billingsleykimberleyj scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT mastorasmira scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT meredithmelissa scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT monlongjean scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT lorigroachryan scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT asrimobin scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT jerezpilaralvarez scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT maliklaksh scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT dewanramita scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT reedxylena scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT gennerryleem scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT daidakensuke scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT beherasairam scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT shafinkishwar scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT pesouttrevor scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT prabakaranjeshuwin scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT carnevalipaolo scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT yangjianzhi scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT rhiearang scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT scholzsonjaw scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT traynorbryanj scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT migakarenh scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT jainmiten scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT timpwinston scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT phillippyadamm scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT chaissonmark scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT sedlazeckfritzj scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT blauwendraatcornelis scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation
AT patenbenedict scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation