Cargando…

Targeted Long-Read Sequencing of a Locus Under Long-Term Balancing Selection in Capsella

Rapid advances in short-read DNA sequencing technologies have revolutionized population genomic studies, but there are genomic regions where this technology reaches its limits. Limitations mostly arise due to the difficulties in assembly or alignment to genomic regions of high sequence divergence an...

Descripción completa

Detalles Bibliográficos
Autores principales: Bachmann, Jörg A., Tedder, Andrew, Laenen, Benjamin, Steige, Kim A., Slotte, Tanja
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5873921/
https://www.ncbi.nlm.nih.gov/pubmed/29476024
http://dx.doi.org/10.1534/g3.117.300467
_version_ 1783310071084613632
author Bachmann, Jörg A.
Tedder, Andrew
Laenen, Benjamin
Steige, Kim A.
Slotte, Tanja
author_facet Bachmann, Jörg A.
Tedder, Andrew
Laenen, Benjamin
Steige, Kim A.
Slotte, Tanja
author_sort Bachmann, Jörg A.
collection PubMed
description Rapid advances in short-read DNA sequencing technologies have revolutionized population genomic studies, but there are genomic regions where this technology reaches its limits. Limitations mostly arise due to the difficulties in assembly or alignment to genomic regions of high sequence divergence and high repeat content, which are typical characteristics for loci under strong long-term balancing selection. Studying genetic diversity at such loci therefore remains challenging. Here, we investigate the feasibility and error rates associated with targeted long-read sequencing of a locus under balancing selection. For this purpose, we generated bacterial artificial chromosomes (BACs) containing the Brassicaceae S-locus, a region under strong negative frequency-dependent selection which has previously proven difficult to assemble in its entirety using short reads. We sequence S-locus BACs with single-molecule long-read sequencing technology and conduct de novo assembly of these S-locus haplotypes. By comparing repeated assemblies resulting from independent long-read sequencing runs on the same BAC clone we do not detect any structural errors, suggesting that reliable assemblies are generated, but we estimate an indel error rate of 5.7×10(−5). A similar error rate was estimated based on comparison of Illumina short-read sequences and BAC assemblies. Our results show that, until de novo assembly of multiple individuals using long-read sequencing becomes feasible, targeted long-read sequencing of loci under balancing selection is a viable option with low error rates for single nucleotide polymorphisms or structural variation. We further find that short-read sequencing is a valuable complement, allowing correction of the relatively high rate of indel errors that result from this approach.
format Online
Article
Text
id pubmed-5873921
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-58739212018-03-30 Targeted Long-Read Sequencing of a Locus Under Long-Term Balancing Selection in Capsella Bachmann, Jörg A. Tedder, Andrew Laenen, Benjamin Steige, Kim A. Slotte, Tanja G3 (Bethesda) Investigations Rapid advances in short-read DNA sequencing technologies have revolutionized population genomic studies, but there are genomic regions where this technology reaches its limits. Limitations mostly arise due to the difficulties in assembly or alignment to genomic regions of high sequence divergence and high repeat content, which are typical characteristics for loci under strong long-term balancing selection. Studying genetic diversity at such loci therefore remains challenging. Here, we investigate the feasibility and error rates associated with targeted long-read sequencing of a locus under balancing selection. For this purpose, we generated bacterial artificial chromosomes (BACs) containing the Brassicaceae S-locus, a region under strong negative frequency-dependent selection which has previously proven difficult to assemble in its entirety using short reads. We sequence S-locus BACs with single-molecule long-read sequencing technology and conduct de novo assembly of these S-locus haplotypes. By comparing repeated assemblies resulting from independent long-read sequencing runs on the same BAC clone we do not detect any structural errors, suggesting that reliable assemblies are generated, but we estimate an indel error rate of 5.7×10(−5). A similar error rate was estimated based on comparison of Illumina short-read sequences and BAC assemblies. Our results show that, until de novo assembly of multiple individuals using long-read sequencing becomes feasible, targeted long-read sequencing of loci under balancing selection is a viable option with low error rates for single nucleotide polymorphisms or structural variation. We further find that short-read sequencing is a valuable complement, allowing correction of the relatively high rate of indel errors that result from this approach. Genetics Society of America 2018-02-20 /pmc/articles/PMC5873921/ /pubmed/29476024 http://dx.doi.org/10.1534/g3.117.300467 Text en Copyright © 2018 Bachmann et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Bachmann, Jörg A.
Tedder, Andrew
Laenen, Benjamin
Steige, Kim A.
Slotte, Tanja
Targeted Long-Read Sequencing of a Locus Under Long-Term Balancing Selection in Capsella
title Targeted Long-Read Sequencing of a Locus Under Long-Term Balancing Selection in Capsella
title_full Targeted Long-Read Sequencing of a Locus Under Long-Term Balancing Selection in Capsella
title_fullStr Targeted Long-Read Sequencing of a Locus Under Long-Term Balancing Selection in Capsella
title_full_unstemmed Targeted Long-Read Sequencing of a Locus Under Long-Term Balancing Selection in Capsella
title_short Targeted Long-Read Sequencing of a Locus Under Long-Term Balancing Selection in Capsella
title_sort targeted long-read sequencing of a locus under long-term balancing selection in capsella
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5873921/
https://www.ncbi.nlm.nih.gov/pubmed/29476024
http://dx.doi.org/10.1534/g3.117.300467
work_keys_str_mv AT bachmannjorga targetedlongreadsequencingofalocusunderlongtermbalancingselectionincapsella
AT tedderandrew targetedlongreadsequencingofalocusunderlongtermbalancingselectionincapsella
AT laenenbenjamin targetedlongreadsequencingofalocusunderlongtermbalancingselectionincapsella
AT steigekima targetedlongreadsequencingofalocusunderlongtermbalancingselectionincapsella
AT slottetanja targetedlongreadsequencingofalocusunderlongtermbalancingselectionincapsella