Cargando…

Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology

BACKGROUND: Structural variants (SVs), including deletions, insertions, duplications, and inversions, are relatively long genomic variations implicated in a diverse range of processes from human disease to ecology and evolution. Given their complex signatures, tendency to occur in repeated regions,...

Descripción completa

Detalles Bibliográficos
Autores principales: Lemay, Marc-André, Sibbesen, Jonas A., Torkamaneh, Davoud, Hamel, Jérémie, Levesque, Roger C., Belzile, François
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8867729/
https://www.ncbi.nlm.nih.gov/pubmed/35197050
http://dx.doi.org/10.1186/s12915-022-01255-w
_version_ 1784656114363662336
author Lemay, Marc-André
Sibbesen, Jonas A.
Torkamaneh, Davoud
Hamel, Jérémie
Levesque, Roger C.
Belzile, François
author_facet Lemay, Marc-André
Sibbesen, Jonas A.
Torkamaneh, Davoud
Hamel, Jérémie
Levesque, Roger C.
Belzile, François
author_sort Lemay, Marc-André
collection PubMed
description BACKGROUND: Structural variants (SVs), including deletions, insertions, duplications, and inversions, are relatively long genomic variations implicated in a diverse range of processes from human disease to ecology and evolution. Given their complex signatures, tendency to occur in repeated regions, and large size, discovering SVs based on short reads is challenging compared to single-nucleotide variants. The increasing availability of long-read technologies has greatly facilitated SV discovery; however, these technologies remain too costly to apply routinely to population-level studies. Here, we combined short-read and long-read sequencing technologies to provide a comprehensive population-scale assessment of structural variation in a panel of Canadian soybean cultivars. RESULTS: We used Oxford Nanopore long-read sequencing data (~12× mean coverage) for 17 samples to both benchmark SV calls made from Illumina short-read data and predict SVs that were subsequently genotyped in a population of 102 samples using Illumina data. Benchmarking results show that variants discovered using Oxford Nanopore can be accurately genotyped from the Illumina data. We first use the genotyped deletions and insertions for population genetics analyses and show that results are comparable to those based on single-nucleotide variants. We observe that the population frequency and distribution within the genome of deletions and insertions are constrained by the location of genes. Gene Ontology and PFAM domain enrichment analyses also confirm previous reports that genes harboring high-frequency deletions and insertions are enriched for functions in defense response. Finally, we discover polymorphic transposable elements from the deletions and insertions and report evidence of the recent activity of a Stowaway MITE. CONCLUSIONS: We show that structural variants discovered using Oxford Nanopore data can be genotyped with high accuracy from Illumina data. Our results demonstrate that long-read and short-read sequencing technologies can be efficiently combined to enhance SV analysis in large populations, providing a reusable framework for their study in a wider range of samples and non-model species. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-022-01255-w.
format Online
Article
Text
id pubmed-8867729
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-88677292022-02-25 Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology Lemay, Marc-André Sibbesen, Jonas A. Torkamaneh, Davoud Hamel, Jérémie Levesque, Roger C. Belzile, François BMC Biol Research Article BACKGROUND: Structural variants (SVs), including deletions, insertions, duplications, and inversions, are relatively long genomic variations implicated in a diverse range of processes from human disease to ecology and evolution. Given their complex signatures, tendency to occur in repeated regions, and large size, discovering SVs based on short reads is challenging compared to single-nucleotide variants. The increasing availability of long-read technologies has greatly facilitated SV discovery; however, these technologies remain too costly to apply routinely to population-level studies. Here, we combined short-read and long-read sequencing technologies to provide a comprehensive population-scale assessment of structural variation in a panel of Canadian soybean cultivars. RESULTS: We used Oxford Nanopore long-read sequencing data (~12× mean coverage) for 17 samples to both benchmark SV calls made from Illumina short-read data and predict SVs that were subsequently genotyped in a population of 102 samples using Illumina data. Benchmarking results show that variants discovered using Oxford Nanopore can be accurately genotyped from the Illumina data. We first use the genotyped deletions and insertions for population genetics analyses and show that results are comparable to those based on single-nucleotide variants. We observe that the population frequency and distribution within the genome of deletions and insertions are constrained by the location of genes. Gene Ontology and PFAM domain enrichment analyses also confirm previous reports that genes harboring high-frequency deletions and insertions are enriched for functions in defense response. Finally, we discover polymorphic transposable elements from the deletions and insertions and report evidence of the recent activity of a Stowaway MITE. CONCLUSIONS: We show that structural variants discovered using Oxford Nanopore data can be genotyped with high accuracy from Illumina data. Our results demonstrate that long-read and short-read sequencing technologies can be efficiently combined to enhance SV analysis in large populations, providing a reusable framework for their study in a wider range of samples and non-model species. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-022-01255-w. BioMed Central 2022-02-23 /pmc/articles/PMC8867729/ /pubmed/35197050 http://dx.doi.org/10.1186/s12915-022-01255-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Lemay, Marc-André
Sibbesen, Jonas A.
Torkamaneh, Davoud
Hamel, Jérémie
Levesque, Roger C.
Belzile, François
Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology
title Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology
title_full Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology
title_fullStr Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology
title_full_unstemmed Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology
title_short Combined use of Oxford Nanopore and Illumina sequencing yields insights into soybean structural variation biology
title_sort combined use of oxford nanopore and illumina sequencing yields insights into soybean structural variation biology
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8867729/
https://www.ncbi.nlm.nih.gov/pubmed/35197050
http://dx.doi.org/10.1186/s12915-022-01255-w
work_keys_str_mv AT lemaymarcandre combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology
AT sibbesenjonasa combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology
AT torkamanehdavoud combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology
AT hameljeremie combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology
AT levesquerogerc combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology
AT belzilefrancois combineduseofoxfordnanoporeandilluminasequencingyieldsinsightsintosoybeanstructuralvariationbiology