Cargando…
Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology
BACKGROUND: Structural variants (SVs), including deletions, insertions, duplications, and inversions, are relatively long genomic variations implicated in a diverse range of processes from human disease to ecology and evolution. Given their complex signatures, tendency to occur in repeated regions,...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8867729/ https://www.ncbi.nlm.nih.gov/pubmed/35197050 http://dx.doi.org/10.1186/s12915-022-01255-w |
_version_ | 1784656114363662336 |
---|---|
author | Lemay, Marc-André Sibbesen, Jonas A. Torkamaneh, Davoud Hamel, Jérémie Levesque, Roger C. Belzile, François |
author_facet | Lemay, Marc-André Sibbesen, Jonas A. Torkamaneh, Davoud Hamel, Jérémie Levesque, Roger C. Belzile, François |
author_sort | Lemay, Marc-André |
collection | PubMed |
description | BACKGROUND: Structural variants (SVs), including deletions, insertions, duplications, and inversions, are relatively long genomic variations implicated in a diverse range of processes from human disease to ecology and evolution. Given their complex signatures, tendency to occur in repeated regions, and large size, discovering SVs based on short reads is challenging compared to single-nucleotide variants. The increasing availability of long-read technologies has greatly facilitated SV discovery; however, these technologies remain too costly to apply routinely to population-level studies. Here, we combined short-read and long-read sequencing technologies to provide a comprehensive population-scale assessment of structural variation in a panel of Canadian soybean cultivars. RESULTS: We used Oxford Nanopore long-read sequencing data (~12× mean coverage) for 17 samples to both benchmark SV calls made from Illumina short-read data and predict SVs that were subsequently genotyped in a population of 102 samples using Illumina data. Benchmarking results show that variants discovered using Oxford Nanopore can be accurately genotyped from the Illumina data. We first use the genotyped deletions and insertions for population genetics analyses and show that results are comparable to those based on single-nucleotide variants. We observe that the population frequency and distribution within the genome of deletions and insertions are constrained by the location of genes. Gene Ontology and PFAM domain enrichment analyses also confirm previous reports that genes harboring high-frequency deletions and insertions are enriched for functions in defense response. Finally, we discover polymorphic transposable elements from the deletions and insertions and report evidence of the recent activity of a Stowaway MITE. CONCLUSIONS: We show that structural variants discovered using Oxford Nanopore data can be genotyped with high accuracy from Illumina data. Our results demonstrate that long-read and short-read sequencing technologies can be efficiently combined to enhance SV analysis in large populations, providing a reusable framework for their study in a wider range of samples and non-model species. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-022-01255-w. |
format | Online Article Text |
id | pubmed-8867729 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-88677292022-02-25 Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology Lemay, Marc-André Sibbesen, Jonas A. Torkamaneh, Davoud Hamel, Jérémie Levesque, Roger C. Belzile, François BMC Biol Research Article BACKGROUND: Structural variants (SVs), including deletions, insertions, duplications, and inversions, are relatively long genomic variations implicated in a diverse range of processes from human disease to ecology and evolution. Given their complex signatures, tendency to occur in repeated regions, and large size, discovering SVs based on short reads is challenging compared to single-nucleotide variants. The increasing availability of long-read technologies has greatly facilitated SV discovery; however, these technologies remain too costly to apply routinely to population-level studies. Here, we combined short-read and long-read sequencing technologies to provide a comprehensive population-scale assessment of structural variation in a panel of Canadian soybean cultivars. RESULTS: We used Oxford Nanopore long-read sequencing data (~12× mean coverage) for 17 samples to both benchmark SV calls made from Illumina short-read data and predict SVs that were subsequently genotyped in a population of 102 samples using Illumina data. Benchmarking results show that variants discovered using Oxford Nanopore can be accurately genotyped from the Illumina data. We first use the genotyped deletions and insertions for population genetics analyses and show that results are comparable to those based on single-nucleotide variants. We observe that the population frequency and distribution within the genome of deletions and insertions are constrained by the location of genes. Gene Ontology and PFAM domain enrichment analyses also confirm previous reports that genes harboring high-frequency deletions and insertions are enriched for functions in defense response. Finally, we discover polymorphic transposable elements from the deletions and insertions and report evidence of the recent activity of a Stowaway MITE. CONCLUSIONS: We show that structural variants discovered using Oxford Nanopore data can be genotyped with high accuracy from Illumina data. Our results demonstrate that long-read and short-read sequencing technologies can be efficiently combined to enhance SV analysis in large populations, providing a reusable framework for their study in a wider range of samples and non-model species. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-022-01255-w. BioMed Central 2022-02-23 /pmc/articles/PMC8867729/ /pubmed/35197050 http://dx.doi.org/10.1186/s12915-022-01255-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Lemay, Marc-André Sibbesen, Jonas A. Torkamaneh, Davoud Hamel, Jérémie Levesque, Roger C. Belzile, François Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology |
title | Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology |
title_full | Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology |
title_fullStr | Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology |
title_full_unstemmed | Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology |
title_short | Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology |
title_sort | combined use of oxford nanopore and illumina sequencing yields insights into soybean structural variation biology |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8867729/ https://www.ncbi.nlm.nih.gov/pubmed/35197050 http://dx.doi.org/10.1186/s12915-022-01255-w |
work_keys_str_mv | AT lemaymarcandre combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology AT sibbesenjonasa combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology AT torkamanehdavoud combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology AT hameljeremie combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology AT levesquerogerc combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology AT belzilefrancois combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology |