Cargando…

An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster

BACKGROUND: Genome complexity is largely linked to diversification and crop innovation. Examples of regions with duplicated genes with relevant roles in agricultural traits are found in many crops. In both duplicated and non-duplicated genes, much of the variability in agronomic traits is caused by...

Descripción completa

Detalles Bibliográficos
Autores principales: Fiol, Arnau, Jurado-Ruiz, Federico, López‑Girona, Elena, Aranzana, Maria José
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9419362/
https://www.ncbi.nlm.nih.gov/pubmed/36030243
http://dx.doi.org/10.1186/s13007-022-00937-4
_version_ 1784777159083032576
author Fiol, Arnau
Jurado-Ruiz, Federico
López‑Girona, Elena
Aranzana, Maria José
author_facet Fiol, Arnau
Jurado-Ruiz, Federico
López‑Girona, Elena
Aranzana, Maria José
author_sort Fiol, Arnau
collection PubMed
description BACKGROUND: Genome complexity is largely linked to diversification and crop innovation. Examples of regions with duplicated genes with relevant roles in agricultural traits are found in many crops. In both duplicated and non-duplicated genes, much of the variability in agronomic traits is caused by large as well as small and middle scale structural variants (SVs), which highlights the relevance of the identification and characterization of complex variability between genomes for plant breeding. RESULTS: Here we improve and demonstrate the use of CRISPR-Cas9 enrichment combined with long-read sequencing technology to resolve the MYB10 region in the linkage group 3 (LG3) of Japanese plum (Prunus salicina). This region, which has a length from 90 to 271 kb according to the P. salicina genomes available, is associated with fruit color variability in Prunus species. We demonstrate the high complexity of this region, with homology levels between Japanese plum varieties comparable to those between Prunus species. We cleaved MYB10 genes in five plum varieties using the Cas9 enzyme guided by a pool of crRNAs. The barcoded fragments were then pooled and sequenced in a single MinION Oxford Nanopore Technologies (ONT) run, yielding 194 Mb of sequence. The enrichment was confirmed by aligning the long reads to the plum reference genomes, with a mean read on-target value of 4.5% and a depth per sample of 11.9x. From the alignment, 3261 SNPs and 287 SVs were called and phased. A de novo assembly was constructed for each variety, which also allowed detection, at the haplotype level, of the variability in this region. CONCLUSIONS: CRISPR-Cas9 enrichment is a versatile and powerful tool for long-read targeted sequencing even on highly duplicated and/or polymorphic genomic regions, being especially useful when a reference genome is not available. Potential uses of this methodology as well as its limitations are further discussed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13007-022-00937-4.
format Online
Article
Text
id pubmed-9419362
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94193622022-08-28 An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster Fiol, Arnau Jurado-Ruiz, Federico López‑Girona, Elena Aranzana, Maria José Plant Methods Methodology BACKGROUND: Genome complexity is largely linked to diversification and crop innovation. Examples of regions with duplicated genes with relevant roles in agricultural traits are found in many crops. In both duplicated and non-duplicated genes, much of the variability in agronomic traits is caused by large as well as small and middle scale structural variants (SVs), which highlights the relevance of the identification and characterization of complex variability between genomes for plant breeding. RESULTS: Here we improve and demonstrate the use of CRISPR-Cas9 enrichment combined with long-read sequencing technology to resolve the MYB10 region in the linkage group 3 (LG3) of Japanese plum (Prunus salicina). This region, which has a length from 90 to 271 kb according to the P. salicina genomes available, is associated with fruit color variability in Prunus species. We demonstrate the high complexity of this region, with homology levels between Japanese plum varieties comparable to those between Prunus species. We cleaved MYB10 genes in five plum varieties using the Cas9 enzyme guided by a pool of crRNAs. The barcoded fragments were then pooled and sequenced in a single MinION Oxford Nanopore Technologies (ONT) run, yielding 194 Mb of sequence. The enrichment was confirmed by aligning the long reads to the plum reference genomes, with a mean read on-target value of 4.5% and a depth per sample of 11.9x. From the alignment, 3261 SNPs and 287 SVs were called and phased. A de novo assembly was constructed for each variety, which also allowed detection, at the haplotype level, of the variability in this region. CONCLUSIONS: CRISPR-Cas9 enrichment is a versatile and powerful tool for long-read targeted sequencing even on highly duplicated and/or polymorphic genomic regions, being especially useful when a reference genome is not available. Potential uses of this methodology as well as its limitations are further discussed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13007-022-00937-4. BioMed Central 2022-08-27 /pmc/articles/PMC9419362/ /pubmed/36030243 http://dx.doi.org/10.1186/s13007-022-00937-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Fiol, Arnau
Jurado-Ruiz, Federico
López‑Girona, Elena
Aranzana, Maria José
An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster
title An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster
title_full An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster
title_fullStr An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster
title_full_unstemmed An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster
title_short An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster
title_sort efficient crispr-cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. a case study in the prunus salicina lg3-myb10 genes cluster
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9419362/
https://www.ncbi.nlm.nih.gov/pubmed/36030243
http://dx.doi.org/10.1186/s13007-022-00937-4
work_keys_str_mv AT fiolarnau anefficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster
AT juradoruizfederico anefficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster
AT lopezgironaelena anefficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster
AT aranzanamariajose anefficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster
AT fiolarnau efficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster
AT juradoruizfederico efficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster
AT lopezgironaelena efficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster
AT aranzanamariajose efficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster