Cargando…

A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings

The reconstruction of individual haplotypes can facilitate the interpretation of disease risks; however, high costs and technical challenges still hinder their assessment in clinical settings. Second-generation sequencing is the gold standard for variant discovery but, due to the production of short...

Descripción completa

Detalles Bibliográficos
Autores principales: Maestri, Simone, Maturo, Maria Giovanna, Cosentino, Emanuela, Marcolungo, Luca, Iadarola, Barbara, Fortunati, Elisabetta, Rossato, Marzia, Delledonne, Massimo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7731377/
https://www.ncbi.nlm.nih.gov/pubmed/33271988
http://dx.doi.org/10.3390/ijms21239177
_version_ 1783621885667311616
author Maestri, Simone
Maturo, Maria Giovanna
Cosentino, Emanuela
Marcolungo, Luca
Iadarola, Barbara
Fortunati, Elisabetta
Rossato, Marzia
Delledonne, Massimo
author_facet Maestri, Simone
Maturo, Maria Giovanna
Cosentino, Emanuela
Marcolungo, Luca
Iadarola, Barbara
Fortunati, Elisabetta
Rossato, Marzia
Delledonne, Massimo
author_sort Maestri, Simone
collection PubMed
description The reconstruction of individual haplotypes can facilitate the interpretation of disease risks; however, high costs and technical challenges still hinder their assessment in clinical settings. Second-generation sequencing is the gold standard for variant discovery but, due to the production of short reads covering small genomic regions, allows only indirect haplotyping based on statistical methods. In contrast, third-generation methods such as the nanopore sequencing platform developed by Oxford Nanopore Technologies (ONT) generate long reads that can be used for direct haplotyping, with fewer drawbacks. However, robust standards for variant phasing in ONT-based target resequencing efforts are not yet available. In this study, we presented a streamlined proof-of-concept workflow for variant calling and phasing based on ONT data in a clinically relevant 12-kb region of the APOE locus, a hotspot for variants and haplotypes associated with aging-related diseases and longevity. Starting with sequencing data from simple amplicons of the target locus, we demonstrated that ONT data allow for reliable single-nucleotide variant (SNV) calling and phasing from as little as 60 reads, although the recognition of indels is less efficient. Even so, we identified the best combination of ONT read sets (600) and software (BWA/Minimap2 and HapCUT2) that enables full haplotype reconstruction when both SNVs and indels have been identified previously using a highly-accurate sequencing platform. In conclusion, we established a rapid and inexpensive workflow for variant phasing based on ONT long reads. This allowed for the analysis of multiple samples in parallel and can easily be implemented in routine clinical practice, including diagnostic testing.
format Online
Article
Text
id pubmed-7731377
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-77313772020-12-12 A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings Maestri, Simone Maturo, Maria Giovanna Cosentino, Emanuela Marcolungo, Luca Iadarola, Barbara Fortunati, Elisabetta Rossato, Marzia Delledonne, Massimo Int J Mol Sci Article The reconstruction of individual haplotypes can facilitate the interpretation of disease risks; however, high costs and technical challenges still hinder their assessment in clinical settings. Second-generation sequencing is the gold standard for variant discovery but, due to the production of short reads covering small genomic regions, allows only indirect haplotyping based on statistical methods. In contrast, third-generation methods such as the nanopore sequencing platform developed by Oxford Nanopore Technologies (ONT) generate long reads that can be used for direct haplotyping, with fewer drawbacks. However, robust standards for variant phasing in ONT-based target resequencing efforts are not yet available. In this study, we presented a streamlined proof-of-concept workflow for variant calling and phasing based on ONT data in a clinically relevant 12-kb region of the APOE locus, a hotspot for variants and haplotypes associated with aging-related diseases and longevity. Starting with sequencing data from simple amplicons of the target locus, we demonstrated that ONT data allow for reliable single-nucleotide variant (SNV) calling and phasing from as little as 60 reads, although the recognition of indels is less efficient. Even so, we identified the best combination of ONT read sets (600) and software (BWA/Minimap2 and HapCUT2) that enables full haplotype reconstruction when both SNVs and indels have been identified previously using a highly-accurate sequencing platform. In conclusion, we established a rapid and inexpensive workflow for variant phasing based on ONT long reads. This allowed for the analysis of multiple samples in parallel and can easily be implemented in routine clinical practice, including diagnostic testing. MDPI 2020-12-01 /pmc/articles/PMC7731377/ /pubmed/33271988 http://dx.doi.org/10.3390/ijms21239177 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Maestri, Simone
Maturo, Maria Giovanna
Cosentino, Emanuela
Marcolungo, Luca
Iadarola, Barbara
Fortunati, Elisabetta
Rossato, Marzia
Delledonne, Massimo
A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings
title A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings
title_full A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings
title_fullStr A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings
title_full_unstemmed A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings
title_short A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings
title_sort long-read sequencing approach for direct haplotype phasing in clinical settings
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7731377/
https://www.ncbi.nlm.nih.gov/pubmed/33271988
http://dx.doi.org/10.3390/ijms21239177
work_keys_str_mv AT maestrisimone alongreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT maturomariagiovanna alongreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT cosentinoemanuela alongreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT marcolungoluca alongreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT iadarolabarbara alongreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT fortunatielisabetta alongreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT rossatomarzia alongreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT delledonnemassimo alongreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT maestrisimone longreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT maturomariagiovanna longreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT cosentinoemanuela longreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT marcolungoluca longreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT iadarolabarbara longreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT fortunatielisabetta longreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT rossatomarzia longreadsequencingapproachfordirecthaplotypephasinginclinicalsettings
AT delledonnemassimo longreadsequencingapproachfordirecthaplotypephasinginclinicalsettings