Cargando…
Targeted Phasing of 2–200 Kilobase DNA Fragments with a Short-Read Sequencer and a Single-Tube Linked-Read Library Method
In the human genome, heterozygous sites are genomic positions with different alleles inherited from each parent. On average, there is a heterozygous site every 1–2 kilobases (kb). Resolving whether two alleles in neighboring heterozygous positions are physically linked—that is, phased—is possible wi...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028795/ https://www.ncbi.nlm.nih.gov/pubmed/36945366 http://dx.doi.org/10.1101/2023.03.05.531179 |
_version_ | 1784910022759677952 |
---|---|
author | Mikhaylova, Veronika Rzepka, Madison Kawamura, Tetsuya Xia, Yu Chang, Peter L. Zhou, Shiguo Pham, Long Modi, Naisarg Yao, Likun Perez-Agustin, Adrian Pagans, Sara Boles, T. Christian Lei, Ming Wang, Yong Garcia-Bassets, Ivan Chen, Zhoutao |
author_facet | Mikhaylova, Veronika Rzepka, Madison Kawamura, Tetsuya Xia, Yu Chang, Peter L. Zhou, Shiguo Pham, Long Modi, Naisarg Yao, Likun Perez-Agustin, Adrian Pagans, Sara Boles, T. Christian Lei, Ming Wang, Yong Garcia-Bassets, Ivan Chen, Zhoutao |
author_sort | Mikhaylova, Veronika |
collection | PubMed |
description | In the human genome, heterozygous sites are genomic positions with different alleles inherited from each parent. On average, there is a heterozygous site every 1–2 kilobases (kb). Resolving whether two alleles in neighboring heterozygous positions are physically linked—that is, phased—is possible with a short-read sequencer if the sequencing library captures long-range information. TELL-Seq is a library preparation method based on millions of barcoded micro-sized beads that enables instrument-free phasing of a whole human genome in a single PCR tube. TELL-Seq incorporates a unique molecular identifier (barcode) to the short reads generated from the same high-molecular-weight (HMW) DNA fragment (known as ‘linked-reads’). However, genome-scale TELL-Seq is not cost-effective for applications focusing on a single locus or a few loci. Here, we present an optimized TELL-Seq protocol that enables the cost-effective phasing of enriched loci (targets) of varying sizes, purity levels, and heterozygosity. Targeted TELL-Seq maximizes linked-read efficiency and library yield while minimizing input requirements, fragment collisions on microbeads, and sequencing burden. To validate the targeted protocol, we phased seven 180–200 kb loci enriched by CRISPR/Cas9-mediated excision coupled with pulse-field electrophoresis, four 20 kb loci enriched by CRISPR/Cas9-mediated protection from exonuclease digestion, and six 2–13 kb loci amplified by PCR. The selected targets have clinical and research relevance (BRCA1, BRCA2, MLH1, MSH2, MSH6, APC, PMS2, SCN5A-SCN10A, and PKI3CA). These analyses reveal that targeted TELL-Seq provides a reliable way of phasing allelic variants within targets (2–200 kb in length) with the low cost and high accuracy of short-read sequencing. |
format | Online Article Text |
id | pubmed-10028795 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-100287952023-03-22 Targeted Phasing of 2–200 Kilobase DNA Fragments with a Short-Read Sequencer and a Single-Tube Linked-Read Library Method Mikhaylova, Veronika Rzepka, Madison Kawamura, Tetsuya Xia, Yu Chang, Peter L. Zhou, Shiguo Pham, Long Modi, Naisarg Yao, Likun Perez-Agustin, Adrian Pagans, Sara Boles, T. Christian Lei, Ming Wang, Yong Garcia-Bassets, Ivan Chen, Zhoutao bioRxiv Article In the human genome, heterozygous sites are genomic positions with different alleles inherited from each parent. On average, there is a heterozygous site every 1–2 kilobases (kb). Resolving whether two alleles in neighboring heterozygous positions are physically linked—that is, phased—is possible with a short-read sequencer if the sequencing library captures long-range information. TELL-Seq is a library preparation method based on millions of barcoded micro-sized beads that enables instrument-free phasing of a whole human genome in a single PCR tube. TELL-Seq incorporates a unique molecular identifier (barcode) to the short reads generated from the same high-molecular-weight (HMW) DNA fragment (known as ‘linked-reads’). However, genome-scale TELL-Seq is not cost-effective for applications focusing on a single locus or a few loci. Here, we present an optimized TELL-Seq protocol that enables the cost-effective phasing of enriched loci (targets) of varying sizes, purity levels, and heterozygosity. Targeted TELL-Seq maximizes linked-read efficiency and library yield while minimizing input requirements, fragment collisions on microbeads, and sequencing burden. To validate the targeted protocol, we phased seven 180–200 kb loci enriched by CRISPR/Cas9-mediated excision coupled with pulse-field electrophoresis, four 20 kb loci enriched by CRISPR/Cas9-mediated protection from exonuclease digestion, and six 2–13 kb loci amplified by PCR. The selected targets have clinical and research relevance (BRCA1, BRCA2, MLH1, MSH2, MSH6, APC, PMS2, SCN5A-SCN10A, and PKI3CA). These analyses reveal that targeted TELL-Seq provides a reliable way of phasing allelic variants within targets (2–200 kb in length) with the low cost and high accuracy of short-read sequencing. Cold Spring Harbor Laboratory 2023-03-06 /pmc/articles/PMC10028795/ /pubmed/36945366 http://dx.doi.org/10.1101/2023.03.05.531179 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Mikhaylova, Veronika Rzepka, Madison Kawamura, Tetsuya Xia, Yu Chang, Peter L. Zhou, Shiguo Pham, Long Modi, Naisarg Yao, Likun Perez-Agustin, Adrian Pagans, Sara Boles, T. Christian Lei, Ming Wang, Yong Garcia-Bassets, Ivan Chen, Zhoutao Targeted Phasing of 2–200 Kilobase DNA Fragments with a Short-Read Sequencer and a Single-Tube Linked-Read Library Method |
title | Targeted Phasing of 2–200 Kilobase DNA Fragments with a Short-Read Sequencer and a Single-Tube Linked-Read Library Method |
title_full | Targeted Phasing of 2–200 Kilobase DNA Fragments with a Short-Read Sequencer and a Single-Tube Linked-Read Library Method |
title_fullStr | Targeted Phasing of 2–200 Kilobase DNA Fragments with a Short-Read Sequencer and a Single-Tube Linked-Read Library Method |
title_full_unstemmed | Targeted Phasing of 2–200 Kilobase DNA Fragments with a Short-Read Sequencer and a Single-Tube Linked-Read Library Method |
title_short | Targeted Phasing of 2–200 Kilobase DNA Fragments with a Short-Read Sequencer and a Single-Tube Linked-Read Library Method |
title_sort | targeted phasing of 2–200 kilobase dna fragments with a short-read sequencer and a single-tube linked-read library method |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028795/ https://www.ncbi.nlm.nih.gov/pubmed/36945366 http://dx.doi.org/10.1101/2023.03.05.531179 |
work_keys_str_mv | AT mikhaylovaveronika targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT rzepkamadison targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT kawamuratetsuya targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT xiayu targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT changpeterl targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT zhoushiguo targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT phamlong targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT modinaisarg targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT yaolikun targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT perezagustinadrian targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT paganssara targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT bolestchristian targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT leiming targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT wangyong targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT garciabassetsivan targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod AT chenzhoutao targetedphasingof2200kilobasednafragmentswithashortreadsequencerandasingletubelinkedreadlibrarymethod |