Cargando…

A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing

Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout (Oncorhynchus mykiss), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Guangtu, Nome, Torfinn, Pearse, Devon E., Moen, Thomas, Naish, Kerry A., Thorgaard, Gary H., Lien, Sigbjørn, Palti, Yniv
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5928233/
https://www.ncbi.nlm.nih.gov/pubmed/29740479
http://dx.doi.org/10.3389/fgene.2018.00147
_version_ 1783319203297624064
author Gao, Guangtu
Nome, Torfinn
Pearse, Devon E.
Moen, Thomas
Naish, Kerry A.
Thorgaard, Gary H.
Lien, Sigbjørn
Palti, Yniv
author_facet Gao, Guangtu
Nome, Torfinn
Pearse, Devon E.
Moen, Thomas
Naish, Kerry A.
Thorgaard, Gary H.
Lien, Sigbjørn
Palti, Yniv
author_sort Gao, Guangtu
collection PubMed
description Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout (Oncorhynchus mykiss), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup, followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and heterozygosity within each population. We also provide functional annotation based on the genome position of each SNP and evaluate the use of clonal lines for filtering of PSVs and MSVs. These SNPs form a new database, which provides an important resource for a new high density SNP array design and for other SNP genotyping platforms used for genetic and genomics studies of this iconic salmonid fish species.
format Online
Article
Text
id pubmed-5928233
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-59282332018-05-08 A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing Gao, Guangtu Nome, Torfinn Pearse, Devon E. Moen, Thomas Naish, Kerry A. Thorgaard, Gary H. Lien, Sigbjørn Palti, Yniv Front Genet Genetics Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout (Oncorhynchus mykiss), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup, followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and heterozygosity within each population. We also provide functional annotation based on the genome position of each SNP and evaluate the use of clonal lines for filtering of PSVs and MSVs. These SNPs form a new database, which provides an important resource for a new high density SNP array design and for other SNP genotyping platforms used for genetic and genomics studies of this iconic salmonid fish species. Frontiers Media S.A. 2018-04-24 /pmc/articles/PMC5928233/ /pubmed/29740479 http://dx.doi.org/10.3389/fgene.2018.00147 Text en Copyright © 2018 Gao, Nome, Pearse, Moen, Naish, Thorgaard, Lien and Palti. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Gao, Guangtu
Nome, Torfinn
Pearse, Devon E.
Moen, Thomas
Naish, Kerry A.
Thorgaard, Gary H.
Lien, Sigbjørn
Palti, Yniv
A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing
title A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing
title_full A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing
title_fullStr A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing
title_full_unstemmed A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing
title_short A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing
title_sort new single nucleotide polymorphism database for rainbow trout generated through whole genome resequencing
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5928233/
https://www.ncbi.nlm.nih.gov/pubmed/29740479
http://dx.doi.org/10.3389/fgene.2018.00147
work_keys_str_mv AT gaoguangtu anewsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT nometorfinn anewsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT pearsedevone anewsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT moenthomas anewsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT naishkerrya anewsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT thorgaardgaryh anewsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT liensigbjørn anewsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT paltiyniv anewsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT gaoguangtu newsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT nometorfinn newsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT pearsedevone newsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT moenthomas newsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT naishkerrya newsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT thorgaardgaryh newsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT liensigbjørn newsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing
AT paltiyniv newsinglenucleotidepolymorphismdatabaseforrainbowtroutgeneratedthroughwholegenomeresequencing