Cargando…
An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster
BACKGROUND: Genome complexity is largely linked to diversification and crop innovation. Examples of regions with duplicated genes with relevant roles in agricultural traits are found in many crops. In both duplicated and non-duplicated genes, much of the variability in agronomic traits is caused by...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9419362/ https://www.ncbi.nlm.nih.gov/pubmed/36030243 http://dx.doi.org/10.1186/s13007-022-00937-4 |
_version_ | 1784777159083032576 |
---|---|
author | Fiol, Arnau Jurado-Ruiz, Federico López‑Girona, Elena Aranzana, Maria José |
author_facet | Fiol, Arnau Jurado-Ruiz, Federico López‑Girona, Elena Aranzana, Maria José |
author_sort | Fiol, Arnau |
collection | PubMed |
description | BACKGROUND: Genome complexity is largely linked to diversification and crop innovation. Examples of regions with duplicated genes with relevant roles in agricultural traits are found in many crops. In both duplicated and non-duplicated genes, much of the variability in agronomic traits is caused by large as well as small and middle scale structural variants (SVs), which highlights the relevance of the identification and characterization of complex variability between genomes for plant breeding. RESULTS: Here we improve and demonstrate the use of CRISPR-Cas9 enrichment combined with long-read sequencing technology to resolve the MYB10 region in the linkage group 3 (LG3) of Japanese plum (Prunus salicina). This region, which has a length from 90 to 271 kb according to the P. salicina genomes available, is associated with fruit color variability in Prunus species. We demonstrate the high complexity of this region, with homology levels between Japanese plum varieties comparable to those between Prunus species. We cleaved MYB10 genes in five plum varieties using the Cas9 enzyme guided by a pool of crRNAs. The barcoded fragments were then pooled and sequenced in a single MinION Oxford Nanopore Technologies (ONT) run, yielding 194 Mb of sequence. The enrichment was confirmed by aligning the long reads to the plum reference genomes, with a mean read on-target value of 4.5% and a depth per sample of 11.9x. From the alignment, 3261 SNPs and 287 SVs were called and phased. A de novo assembly was constructed for each variety, which also allowed detection, at the haplotype level, of the variability in this region. CONCLUSIONS: CRISPR-Cas9 enrichment is a versatile and powerful tool for long-read targeted sequencing even on highly duplicated and/or polymorphic genomic regions, being especially useful when a reference genome is not available. Potential uses of this methodology as well as its limitations are further discussed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13007-022-00937-4. |
format | Online Article Text |
id | pubmed-9419362 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-94193622022-08-28 An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster Fiol, Arnau Jurado-Ruiz, Federico López‑Girona, Elena Aranzana, Maria José Plant Methods Methodology BACKGROUND: Genome complexity is largely linked to diversification and crop innovation. Examples of regions with duplicated genes with relevant roles in agricultural traits are found in many crops. In both duplicated and non-duplicated genes, much of the variability in agronomic traits is caused by large as well as small and middle scale structural variants (SVs), which highlights the relevance of the identification and characterization of complex variability between genomes for plant breeding. RESULTS: Here we improve and demonstrate the use of CRISPR-Cas9 enrichment combined with long-read sequencing technology to resolve the MYB10 region in the linkage group 3 (LG3) of Japanese plum (Prunus salicina). This region, which has a length from 90 to 271 kb according to the P. salicina genomes available, is associated with fruit color variability in Prunus species. We demonstrate the high complexity of this region, with homology levels between Japanese plum varieties comparable to those between Prunus species. We cleaved MYB10 genes in five plum varieties using the Cas9 enzyme guided by a pool of crRNAs. The barcoded fragments were then pooled and sequenced in a single MinION Oxford Nanopore Technologies (ONT) run, yielding 194 Mb of sequence. The enrichment was confirmed by aligning the long reads to the plum reference genomes, with a mean read on-target value of 4.5% and a depth per sample of 11.9x. From the alignment, 3261 SNPs and 287 SVs were called and phased. A de novo assembly was constructed for each variety, which also allowed detection, at the haplotype level, of the variability in this region. CONCLUSIONS: CRISPR-Cas9 enrichment is a versatile and powerful tool for long-read targeted sequencing even on highly duplicated and/or polymorphic genomic regions, being especially useful when a reference genome is not available. Potential uses of this methodology as well as its limitations are further discussed. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13007-022-00937-4. BioMed Central 2022-08-27 /pmc/articles/PMC9419362/ /pubmed/36030243 http://dx.doi.org/10.1186/s13007-022-00937-4 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Fiol, Arnau Jurado-Ruiz, Federico López‑Girona, Elena Aranzana, Maria José An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster |
title | An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster |
title_full | An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster |
title_fullStr | An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster |
title_full_unstemmed | An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster |
title_short | An efficient CRISPR-Cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. A case study in the Prunus salicina LG3-MYB10 genes cluster |
title_sort | efficient crispr-cas9 enrichment sequencing strategy for characterizing complex and highly duplicated genomic regions. a case study in the prunus salicina lg3-myb10 genes cluster |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9419362/ https://www.ncbi.nlm.nih.gov/pubmed/36030243 http://dx.doi.org/10.1186/s13007-022-00937-4 |
work_keys_str_mv | AT fiolarnau anefficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster AT juradoruizfederico anefficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster AT lopezgironaelena anefficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster AT aranzanamariajose anefficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster AT fiolarnau efficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster AT juradoruizfederico efficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster AT lopezgironaelena efficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster AT aranzanamariajose efficientcrisprcas9enrichmentsequencingstrategyforcharacterizingcomplexandhighlyduplicatedgenomicregionsacasestudyintheprunussalicinalg3myb10genescluster |