Cargando…

CRISPR-Cas9-based repeat depletion for high-throughput genotyping of complex plant genomes

High-throughput genotyping enables the large-scale analysis of genetic diversity in population genomics and genome-wide association studies that combine the genotypic and phenotypic characterization of large collections of accessions. Sequencing-based approaches for genotyping are progressively repl...

Descripción completa

Detalles Bibliográficos
Autores principales: Rossato, Marzia, Marcolungo, Luca, De Antoni, Luca, Lopatriello, Giulia, Bellucci, Elisa, Cortinovis, Gaia, Frascarelli, Giulia, Nanni, Laura, Bitocchi, Elena, Di Vittori, Valerio, Vincenzi, Leonardo, Lucchini, Filippo, Bett, Kirstin E., Ramsay, Larissa, Konkin, David James, Delledonne, Massimo, Papa, Roberto
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10317117/
https://www.ncbi.nlm.nih.gov/pubmed/37127332
http://dx.doi.org/10.1101/gr.277628.122
_version_ 1785067837451141120
author Rossato, Marzia
Marcolungo, Luca
De Antoni, Luca
Lopatriello, Giulia
Bellucci, Elisa
Cortinovis, Gaia
Frascarelli, Giulia
Nanni, Laura
Bitocchi, Elena
Di Vittori, Valerio
Vincenzi, Leonardo
Lucchini, Filippo
Bett, Kirstin E.
Ramsay, Larissa
Konkin, David James
Delledonne, Massimo
Papa, Roberto
author_facet Rossato, Marzia
Marcolungo, Luca
De Antoni, Luca
Lopatriello, Giulia
Bellucci, Elisa
Cortinovis, Gaia
Frascarelli, Giulia
Nanni, Laura
Bitocchi, Elena
Di Vittori, Valerio
Vincenzi, Leonardo
Lucchini, Filippo
Bett, Kirstin E.
Ramsay, Larissa
Konkin, David James
Delledonne, Massimo
Papa, Roberto
author_sort Rossato, Marzia
collection PubMed
description High-throughput genotyping enables the large-scale analysis of genetic diversity in population genomics and genome-wide association studies that combine the genotypic and phenotypic characterization of large collections of accessions. Sequencing-based approaches for genotyping are progressively replacing traditional genotyping methods because of the lower ascertainment bias. However, genome-wide genotyping based on sequencing becomes expensive in species with large genomes and a high proportion of repetitive DNA. Here we describe the use of CRISPR-Cas9 technology to deplete repetitive elements in the 3.76-Gb genome of lentil (Lens culinaris), 84% consisting of repeats, thus concentrating the sequencing data on coding and regulatory regions (single-copy regions). We designed a custom set of 566,766 gRNAs targeting 2.9 Gbp of repeats and excluding repetitive regions overlapping annotated genes and putative regulatory elements based on ATAC-seq data. The novel depletion method removed ∼40% of reads mapping to repeats, increasing those mapping to single-copy regions by ∼2.6-fold. When analyzing 25 million fragments, this repeat-to-single-copy shift in the sequencing data increased the number of genotyped bases of ∼10-fold compared to nondepleted libraries. In the same condition, we were also able to identify ∼12-fold more genetic variants in the single-copy regions and increased the genotyping accuracy by rescuing thousands of heterozygous variants that otherwise would be missed because of low coverage. The method performed similarly regardless of the multiplexing level, type of library or genotypes, including different cultivars and a closely related species (L. orientalis). Our results showed that CRISPR-Cas9-driven repeat depletion focuses sequencing data on single-copy regions, thus improving high-density and genome-wide genotyping in large and repetitive genomes.
format Online
Article
Text
id pubmed-10317117
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-103171172023-07-04 CRISPR-Cas9-based repeat depletion for high-throughput genotyping of complex plant genomes Rossato, Marzia Marcolungo, Luca De Antoni, Luca Lopatriello, Giulia Bellucci, Elisa Cortinovis, Gaia Frascarelli, Giulia Nanni, Laura Bitocchi, Elena Di Vittori, Valerio Vincenzi, Leonardo Lucchini, Filippo Bett, Kirstin E. Ramsay, Larissa Konkin, David James Delledonne, Massimo Papa, Roberto Genome Res Methods High-throughput genotyping enables the large-scale analysis of genetic diversity in population genomics and genome-wide association studies that combine the genotypic and phenotypic characterization of large collections of accessions. Sequencing-based approaches for genotyping are progressively replacing traditional genotyping methods because of the lower ascertainment bias. However, genome-wide genotyping based on sequencing becomes expensive in species with large genomes and a high proportion of repetitive DNA. Here we describe the use of CRISPR-Cas9 technology to deplete repetitive elements in the 3.76-Gb genome of lentil (Lens culinaris), 84% consisting of repeats, thus concentrating the sequencing data on coding and regulatory regions (single-copy regions). We designed a custom set of 566,766 gRNAs targeting 2.9 Gbp of repeats and excluding repetitive regions overlapping annotated genes and putative regulatory elements based on ATAC-seq data. The novel depletion method removed ∼40% of reads mapping to repeats, increasing those mapping to single-copy regions by ∼2.6-fold. When analyzing 25 million fragments, this repeat-to-single-copy shift in the sequencing data increased the number of genotyped bases of ∼10-fold compared to nondepleted libraries. In the same condition, we were also able to identify ∼12-fold more genetic variants in the single-copy regions and increased the genotyping accuracy by rescuing thousands of heterozygous variants that otherwise would be missed because of low coverage. The method performed similarly regardless of the multiplexing level, type of library or genotypes, including different cultivars and a closely related species (L. orientalis). Our results showed that CRISPR-Cas9-driven repeat depletion focuses sequencing data on single-copy regions, thus improving high-density and genome-wide genotyping in large and repetitive genomes. Cold Spring Harbor Laboratory Press 2023-05 /pmc/articles/PMC10317117/ /pubmed/37127332 http://dx.doi.org/10.1101/gr.277628.122 Text en © 2023 Rossato et al.; Published by Cold Spring Harbor Laboratory Press https://creativecommons.org/licenses/by/4.0/This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Methods
Rossato, Marzia
Marcolungo, Luca
De Antoni, Luca
Lopatriello, Giulia
Bellucci, Elisa
Cortinovis, Gaia
Frascarelli, Giulia
Nanni, Laura
Bitocchi, Elena
Di Vittori, Valerio
Vincenzi, Leonardo
Lucchini, Filippo
Bett, Kirstin E.
Ramsay, Larissa
Konkin, David James
Delledonne, Massimo
Papa, Roberto
CRISPR-Cas9-based repeat depletion for high-throughput genotyping of complex plant genomes
title CRISPR-Cas9-based repeat depletion for high-throughput genotyping of complex plant genomes
title_full CRISPR-Cas9-based repeat depletion for high-throughput genotyping of complex plant genomes
title_fullStr CRISPR-Cas9-based repeat depletion for high-throughput genotyping of complex plant genomes
title_full_unstemmed CRISPR-Cas9-based repeat depletion for high-throughput genotyping of complex plant genomes
title_short CRISPR-Cas9-based repeat depletion for high-throughput genotyping of complex plant genomes
title_sort crispr-cas9-based repeat depletion for high-throughput genotyping of complex plant genomes
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10317117/
https://www.ncbi.nlm.nih.gov/pubmed/37127332
http://dx.doi.org/10.1101/gr.277628.122
work_keys_str_mv AT rossatomarzia crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT marcolungoluca crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT deantoniluca crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT lopatriellogiulia crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT belluccielisa crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT cortinovisgaia crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT frascarelligiulia crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT nannilaura crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT bitocchielena crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT divittorivalerio crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT vincenzileonardo crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT lucchinifilippo crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT bettkirstine crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT ramsaylarissa crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT konkindavidjames crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT delledonnemassimo crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes
AT paparoberto crisprcas9basedrepeatdepletionforhighthroughputgenotypingofcomplexplantgenomes