Cargando…

Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing

Genomic structural variants (SVs) are a major source of genetic and phenotypic variation but have not been investigated systematically in rainbow trout (Oncorhynchus mykiss), an important aquaculture species of cold freshwater. The objectives of this study were 1) to identify and validate high-confi...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Sixin, Gao, Guangtu, Layer, Ryan M., Thorgaard, Gary H., Wiens, Gregory D., Leeds, Timothy D., Martin, Kyle E., Palti, Yniv
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959816/
https://www.ncbi.nlm.nih.gov/pubmed/33732289
http://dx.doi.org/10.3389/fgene.2021.639355
_version_ 1783665033199222784
author Liu, Sixin
Gao, Guangtu
Layer, Ryan M.
Thorgaard, Gary H.
Wiens, Gregory D.
Leeds, Timothy D.
Martin, Kyle E.
Palti, Yniv
author_facet Liu, Sixin
Gao, Guangtu
Layer, Ryan M.
Thorgaard, Gary H.
Wiens, Gregory D.
Leeds, Timothy D.
Martin, Kyle E.
Palti, Yniv
author_sort Liu, Sixin
collection PubMed
description Genomic structural variants (SVs) are a major source of genetic and phenotypic variation but have not been investigated systematically in rainbow trout (Oncorhynchus mykiss), an important aquaculture species of cold freshwater. The objectives of this study were 1) to identify and validate high-confidence SVs in rainbow trout using whole-genome re-sequencing; and 2) to examine the contribution of transposable elements (TEs) to SVs in rainbow trout. A total of 96 rainbow trout, including 11 homozygous lines and 85 outbred fish from three breeding populations, were whole-genome sequenced with an average genome coverage of 17.2×. Putative SVs were identified using the program Smoove which integrates LUMPY and other associated tools into one package. After rigorous filtering, 13,863 high-confidence SVs were identified. Pacific Biosciences long-reads of Arlee, one of the homozygous lines used for SV detection, validated 98% (3,948 of 4,030) of the high-confidence SVs identified in the Arlee homozygous line. Based on principal component analysis, the 85 outbred fish clustered into three groups consistent with their populations of origin, further indicating that the high-confidence SVs identified in this study are robust. The repetitive DNA content of the high-confidence SV sequences was 86.5%, which is much higher than the 57.1% repetitive DNA content of the reference genome, and is also higher than the repetitive DNA content of Atlantic salmon SVs reported previously. TEs thus contribute substantially to SVs in rainbow trout as TEs make up the majority of repetitive sequences. Hundreds of the high-confidence SVs were annotated as exon-loss or gene-fusion variants, and may have phenotypic effects. The high-confidence SVs reported in this study provide a foundation for further rainbow trout SV studies.
format Online
Article
Text
id pubmed-7959816
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-79598162021-03-16 Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing Liu, Sixin Gao, Guangtu Layer, Ryan M. Thorgaard, Gary H. Wiens, Gregory D. Leeds, Timothy D. Martin, Kyle E. Palti, Yniv Front Genet Genetics Genomic structural variants (SVs) are a major source of genetic and phenotypic variation but have not been investigated systematically in rainbow trout (Oncorhynchus mykiss), an important aquaculture species of cold freshwater. The objectives of this study were 1) to identify and validate high-confidence SVs in rainbow trout using whole-genome re-sequencing; and 2) to examine the contribution of transposable elements (TEs) to SVs in rainbow trout. A total of 96 rainbow trout, including 11 homozygous lines and 85 outbred fish from three breeding populations, were whole-genome sequenced with an average genome coverage of 17.2×. Putative SVs were identified using the program Smoove which integrates LUMPY and other associated tools into one package. After rigorous filtering, 13,863 high-confidence SVs were identified. Pacific Biosciences long-reads of Arlee, one of the homozygous lines used for SV detection, validated 98% (3,948 of 4,030) of the high-confidence SVs identified in the Arlee homozygous line. Based on principal component analysis, the 85 outbred fish clustered into three groups consistent with their populations of origin, further indicating that the high-confidence SVs identified in this study are robust. The repetitive DNA content of the high-confidence SV sequences was 86.5%, which is much higher than the 57.1% repetitive DNA content of the reference genome, and is also higher than the repetitive DNA content of Atlantic salmon SVs reported previously. TEs thus contribute substantially to SVs in rainbow trout as TEs make up the majority of repetitive sequences. Hundreds of the high-confidence SVs were annotated as exon-loss or gene-fusion variants, and may have phenotypic effects. The high-confidence SVs reported in this study provide a foundation for further rainbow trout SV studies. Frontiers Media S.A. 2021-02-25 /pmc/articles/PMC7959816/ /pubmed/33732289 http://dx.doi.org/10.3389/fgene.2021.639355 Text en Copyright © 2021 Liu, Gao, Layer, Thorgaard, Wiens, Leeds, Martin and Palti. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Liu, Sixin
Gao, Guangtu
Layer, Ryan M.
Thorgaard, Gary H.
Wiens, Gregory D.
Leeds, Timothy D.
Martin, Kyle E.
Palti, Yniv
Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing
title Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing
title_full Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing
title_fullStr Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing
title_full_unstemmed Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing
title_short Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing
title_sort identification of high-confidence structural variants in domesticated rainbow trout using whole-genome sequencing
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959816/
https://www.ncbi.nlm.nih.gov/pubmed/33732289
http://dx.doi.org/10.3389/fgene.2021.639355
work_keys_str_mv AT liusixin identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing
AT gaoguangtu identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing
AT layerryanm identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing
AT thorgaardgaryh identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing
AT wiensgregoryd identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing
AT leedstimothyd identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing
AT martinkylee identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing
AT paltiyniv identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing