Cargando…
Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation
Long-read sequencing technologies substantially overcome the limitations of short-reads but to date have not been considered as feasible replacement at scale due to a combination of being too expensive, not scalable enough, or too error-prone. Here, we develop an efficient and scalable wet lab and c...
Autores principales: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882142/ https://www.ncbi.nlm.nih.gov/pubmed/36711673 http://dx.doi.org/10.1101/2023.01.12.523790 |
_version_ | 1784879244096045056 |
---|---|
author | Kolmogorov, Mikhail Billingsley, Kimberley J. Mastoras, Mira Meredith, Melissa Monlong, Jean Lorig-Roach, Ryan Asri, Mobin Jerez, Pilar Alvarez Malik, Laksh Dewan, Ramita Reed, Xylena Genner, Rylee M. Daida, Kensuke Behera, Sairam Shafin, Kishwar Pesout, Trevor Prabakaran, Jeshuwin Carnevali, Paolo Yang, Jianzhi Rhie, Arang Scholz, Sonja W. Traynor, Bryan J. Miga, Karen H. Jain, Miten Timp, Winston Phillippy, Adam M. Chaisson, Mark Sedlazeck, Fritz J. Blauwendraat, Cornelis Paten, Benedict |
author_facet | Kolmogorov, Mikhail Billingsley, Kimberley J. Mastoras, Mira Meredith, Melissa Monlong, Jean Lorig-Roach, Ryan Asri, Mobin Jerez, Pilar Alvarez Malik, Laksh Dewan, Ramita Reed, Xylena Genner, Rylee M. Daida, Kensuke Behera, Sairam Shafin, Kishwar Pesout, Trevor Prabakaran, Jeshuwin Carnevali, Paolo Yang, Jianzhi Rhie, Arang Scholz, Sonja W. Traynor, Bryan J. Miga, Karen H. Jain, Miten Timp, Winston Phillippy, Adam M. Chaisson, Mark Sedlazeck, Fritz J. Blauwendraat, Cornelis Paten, Benedict |
author_sort | Kolmogorov, Mikhail |
collection | PubMed |
description | Long-read sequencing technologies substantially overcome the limitations of short-reads but to date have not been considered as feasible replacement at scale due to a combination of being too expensive, not scalable enough, or too error-prone. Here, we develop an efficient and scalable wet lab and computational protocol for Oxford Nanopore Technologies (ONT) long-read sequencing that seeks to provide a genuine alternative to short-reads for large-scale genomics projects. We applied our protocol to cell lines and brain tissue samples as part of a pilot project for the NIH Center for Alzheimer’s and Related Dementias (CARD). Using a single PromethION flow cell, we can detect SNPs with F1-score better than Illumina short-read sequencing. Small indel calling remains difficult within homopolymers and tandem repeats, but is comparable to Illumina calls elsewhere. Further, we can discover structural variants with F1-score comparable to state-of-the-art methods involving Pacific Biosciences HiFi sequencing and trio information (but at a lower cost and greater throughput). Using ONT-based phasing, we can then combine and phase small and structural variants at megabase scales. Our protocol also produces highly accurate, haplotype-specific methylation calls. Overall, this makes large-scale long-read sequencing projects feasible; the protocol is currently being used to sequence thousands of brain-based genomes as a part of the NIH CARD initiative. We provide the protocol and software as open-source integrated pipelines for generating phased variant calls and assemblies. |
format | Online Article Text |
id | pubmed-9882142 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-98821422023-01-28 Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation Kolmogorov, Mikhail Billingsley, Kimberley J. Mastoras, Mira Meredith, Melissa Monlong, Jean Lorig-Roach, Ryan Asri, Mobin Jerez, Pilar Alvarez Malik, Laksh Dewan, Ramita Reed, Xylena Genner, Rylee M. Daida, Kensuke Behera, Sairam Shafin, Kishwar Pesout, Trevor Prabakaran, Jeshuwin Carnevali, Paolo Yang, Jianzhi Rhie, Arang Scholz, Sonja W. Traynor, Bryan J. Miga, Karen H. Jain, Miten Timp, Winston Phillippy, Adam M. Chaisson, Mark Sedlazeck, Fritz J. Blauwendraat, Cornelis Paten, Benedict bioRxiv Article Long-read sequencing technologies substantially overcome the limitations of short-reads but to date have not been considered as feasible replacement at scale due to a combination of being too expensive, not scalable enough, or too error-prone. Here, we develop an efficient and scalable wet lab and computational protocol for Oxford Nanopore Technologies (ONT) long-read sequencing that seeks to provide a genuine alternative to short-reads for large-scale genomics projects. We applied our protocol to cell lines and brain tissue samples as part of a pilot project for the NIH Center for Alzheimer’s and Related Dementias (CARD). Using a single PromethION flow cell, we can detect SNPs with F1-score better than Illumina short-read sequencing. Small indel calling remains difficult within homopolymers and tandem repeats, but is comparable to Illumina calls elsewhere. Further, we can discover structural variants with F1-score comparable to state-of-the-art methods involving Pacific Biosciences HiFi sequencing and trio information (but at a lower cost and greater throughput). Using ONT-based phasing, we can then combine and phase small and structural variants at megabase scales. Our protocol also produces highly accurate, haplotype-specific methylation calls. Overall, this makes large-scale long-read sequencing projects feasible; the protocol is currently being used to sequence thousands of brain-based genomes as a part of the NIH CARD initiative. We provide the protocol and software as open-source integrated pipelines for generating phased variant calls and assemblies. Cold Spring Harbor Laboratory 2023-04-05 /pmc/articles/PMC9882142/ /pubmed/36711673 http://dx.doi.org/10.1101/2023.01.12.523790 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0/) , which allows reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use. |
spellingShingle | Article Kolmogorov, Mikhail Billingsley, Kimberley J. Mastoras, Mira Meredith, Melissa Monlong, Jean Lorig-Roach, Ryan Asri, Mobin Jerez, Pilar Alvarez Malik, Laksh Dewan, Ramita Reed, Xylena Genner, Rylee M. Daida, Kensuke Behera, Sairam Shafin, Kishwar Pesout, Trevor Prabakaran, Jeshuwin Carnevali, Paolo Yang, Jianzhi Rhie, Arang Scholz, Sonja W. Traynor, Bryan J. Miga, Karen H. Jain, Miten Timp, Winston Phillippy, Adam M. Chaisson, Mark Sedlazeck, Fritz J. Blauwendraat, Cornelis Paten, Benedict Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation |
title | Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation |
title_full | Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation |
title_fullStr | Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation |
title_full_unstemmed | Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation |
title_short | Scalable Nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation |
title_sort | scalable nanopore sequencing of human genomes provides a comprehensive view of haplotype-resolved variation and methylation |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9882142/ https://www.ncbi.nlm.nih.gov/pubmed/36711673 http://dx.doi.org/10.1101/2023.01.12.523790 |
work_keys_str_mv | AT kolmogorovmikhail scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT billingsleykimberleyj scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT mastorasmira scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT meredithmelissa scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT monlongjean scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT lorigroachryan scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT asrimobin scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT jerezpilaralvarez scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT maliklaksh scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT dewanramita scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT reedxylena scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT gennerryleem scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT daidakensuke scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT beherasairam scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT shafinkishwar scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT pesouttrevor scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT prabakaranjeshuwin scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT carnevalipaolo scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT yangjianzhi scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT rhiearang scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT scholzsonjaw scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT traynorbryanj scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT migakarenh scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT jainmiten scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT timpwinston scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT phillippyadamm scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT chaissonmark scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT sedlazeckfritzj scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT blauwendraatcornelis scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation AT patenbenedict scalablenanoporesequencingofhumangenomesprovidesacomprehensiveviewofhaplotyperesolvedvariationandmethylation |