Cargando…
Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing
Genomic structural variants (SVs) are a major source of genetic and phenotypic variation but have not been investigated systematically in rainbow trout (Oncorhynchus mykiss), an important aquaculture species of cold freshwater. The objectives of this study were 1) to identify and validate high-confi...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959816/ https://www.ncbi.nlm.nih.gov/pubmed/33732289 http://dx.doi.org/10.3389/fgene.2021.639355 |
_version_ | 1783665033199222784 |
---|---|
author | Liu, Sixin Gao, Guangtu Layer, Ryan M. Thorgaard, Gary H. Wiens, Gregory D. Leeds, Timothy D. Martin, Kyle E. Palti, Yniv |
author_facet | Liu, Sixin Gao, Guangtu Layer, Ryan M. Thorgaard, Gary H. Wiens, Gregory D. Leeds, Timothy D. Martin, Kyle E. Palti, Yniv |
author_sort | Liu, Sixin |
collection | PubMed |
description | Genomic structural variants (SVs) are a major source of genetic and phenotypic variation but have not been investigated systematically in rainbow trout (Oncorhynchus mykiss), an important aquaculture species of cold freshwater. The objectives of this study were 1) to identify and validate high-confidence SVs in rainbow trout using whole-genome re-sequencing; and 2) to examine the contribution of transposable elements (TEs) to SVs in rainbow trout. A total of 96 rainbow trout, including 11 homozygous lines and 85 outbred fish from three breeding populations, were whole-genome sequenced with an average genome coverage of 17.2×. Putative SVs were identified using the program Smoove which integrates LUMPY and other associated tools into one package. After rigorous filtering, 13,863 high-confidence SVs were identified. Pacific Biosciences long-reads of Arlee, one of the homozygous lines used for SV detection, validated 98% (3,948 of 4,030) of the high-confidence SVs identified in the Arlee homozygous line. Based on principal component analysis, the 85 outbred fish clustered into three groups consistent with their populations of origin, further indicating that the high-confidence SVs identified in this study are robust. The repetitive DNA content of the high-confidence SV sequences was 86.5%, which is much higher than the 57.1% repetitive DNA content of the reference genome, and is also higher than the repetitive DNA content of Atlantic salmon SVs reported previously. TEs thus contribute substantially to SVs in rainbow trout as TEs make up the majority of repetitive sequences. Hundreds of the high-confidence SVs were annotated as exon-loss or gene-fusion variants, and may have phenotypic effects. The high-confidence SVs reported in this study provide a foundation for further rainbow trout SV studies. |
format | Online Article Text |
id | pubmed-7959816 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-79598162021-03-16 Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing Liu, Sixin Gao, Guangtu Layer, Ryan M. Thorgaard, Gary H. Wiens, Gregory D. Leeds, Timothy D. Martin, Kyle E. Palti, Yniv Front Genet Genetics Genomic structural variants (SVs) are a major source of genetic and phenotypic variation but have not been investigated systematically in rainbow trout (Oncorhynchus mykiss), an important aquaculture species of cold freshwater. The objectives of this study were 1) to identify and validate high-confidence SVs in rainbow trout using whole-genome re-sequencing; and 2) to examine the contribution of transposable elements (TEs) to SVs in rainbow trout. A total of 96 rainbow trout, including 11 homozygous lines and 85 outbred fish from three breeding populations, were whole-genome sequenced with an average genome coverage of 17.2×. Putative SVs were identified using the program Smoove which integrates LUMPY and other associated tools into one package. After rigorous filtering, 13,863 high-confidence SVs were identified. Pacific Biosciences long-reads of Arlee, one of the homozygous lines used for SV detection, validated 98% (3,948 of 4,030) of the high-confidence SVs identified in the Arlee homozygous line. Based on principal component analysis, the 85 outbred fish clustered into three groups consistent with their populations of origin, further indicating that the high-confidence SVs identified in this study are robust. The repetitive DNA content of the high-confidence SV sequences was 86.5%, which is much higher than the 57.1% repetitive DNA content of the reference genome, and is also higher than the repetitive DNA content of Atlantic salmon SVs reported previously. TEs thus contribute substantially to SVs in rainbow trout as TEs make up the majority of repetitive sequences. Hundreds of the high-confidence SVs were annotated as exon-loss or gene-fusion variants, and may have phenotypic effects. The high-confidence SVs reported in this study provide a foundation for further rainbow trout SV studies. Frontiers Media S.A. 2021-02-25 /pmc/articles/PMC7959816/ /pubmed/33732289 http://dx.doi.org/10.3389/fgene.2021.639355 Text en Copyright © 2021 Liu, Gao, Layer, Thorgaard, Wiens, Leeds, Martin and Palti. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Liu, Sixin Gao, Guangtu Layer, Ryan M. Thorgaard, Gary H. Wiens, Gregory D. Leeds, Timothy D. Martin, Kyle E. Palti, Yniv Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing |
title | Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing |
title_full | Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing |
title_fullStr | Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing |
title_full_unstemmed | Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing |
title_short | Identification of High-Confidence Structural Variants in Domesticated Rainbow Trout Using Whole-Genome Sequencing |
title_sort | identification of high-confidence structural variants in domesticated rainbow trout using whole-genome sequencing |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7959816/ https://www.ncbi.nlm.nih.gov/pubmed/33732289 http://dx.doi.org/10.3389/fgene.2021.639355 |
work_keys_str_mv | AT liusixin identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing AT gaoguangtu identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing AT layerryanm identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing AT thorgaardgaryh identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing AT wiensgregoryd identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing AT leedstimothyd identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing AT martinkylee identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing AT paltiyniv identificationofhighconfidencestructuralvariantsindomesticatedrainbowtroutusingwholegenomesequencing |