Cargando…

Targeted Phasing of 2–200 Kilobase DNA Fragments with a Short-Read Sequencer and a Single-Tube Linked-Read Library Method

In the human genome, heterozygous sites are genomic positions with different alleles inherited from each parent. On average, there is a heterozygous site every 1–2 kilobases (kb). Resolving whether two alleles in neighboring heterozygous positions are physically linked—that is, phased—is possible wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Mikhaylova, Veronika, Rzepka, Madison, Kawamura, Tetsuya, Xia, Yu, Chang, Peter L., Zhou, Shiguo, Pham, Long, Modi, Naisarg, Yao, Likun, Perez-Agustin, Adrian, Pagans, Sara, Boles, T. Christian, Lei, Ming, Wang, Yong, Garcia-Bassets, Ivan, Chen, Zhoutao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10028795/
https://www.ncbi.nlm.nih.gov/pubmed/36945366
http://dx.doi.org/10.1101/2023.03.05.531179
Descripción
Sumario:In the human genome, heterozygous sites are genomic positions with different alleles inherited from each parent. On average, there is a heterozygous site every 1–2 kilobases (kb). Resolving whether two alleles in neighboring heterozygous positions are physically linked—that is, phased—is possible with a short-read sequencer if the sequencing library captures long-range information. TELL-Seq is a library preparation method based on millions of barcoded micro-sized beads that enables instrument-free phasing of a whole human genome in a single PCR tube. TELL-Seq incorporates a unique molecular identifier (barcode) to the short reads generated from the same high-molecular-weight (HMW) DNA fragment (known as ‘linked-reads’). However, genome-scale TELL-Seq is not cost-effective for applications focusing on a single locus or a few loci. Here, we present an optimized TELL-Seq protocol that enables the cost-effective phasing of enriched loci (targets) of varying sizes, purity levels, and heterozygosity. Targeted TELL-Seq maximizes linked-read efficiency and library yield while minimizing input requirements, fragment collisions on microbeads, and sequencing burden. To validate the targeted protocol, we phased seven 180–200 kb loci enriched by CRISPR/Cas9-mediated excision coupled with pulse-field electrophoresis, four 20 kb loci enriched by CRISPR/Cas9-mediated protection from exonuclease digestion, and six 2–13 kb loci amplified by PCR. The selected targets have clinical and research relevance (BRCA1, BRCA2, MLH1, MSH2, MSH6, APC, PMS2, SCN5A-SCN10A, and PKI3CA). These analyses reveal that targeted TELL-Seq provides a reliable way of phasing allelic variants within targets (2–200 kb in length) with the low cost and high accuracy of short-read sequencing.