Cargando…

Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout

BACKGROUND: Coding/functional SNPs change the biological function of a gene and, therefore, could serve as “large-effect” genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body...

Descripción completa

Detalles Bibliográficos
Autores principales: Al-Tobasei, Rafet, Ali, Ali, Leeds, Timothy D., Liu, Sixin, Palti, Yniv, Kenney, Brett, Salem, Mohamed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5547479/
https://www.ncbi.nlm.nih.gov/pubmed/28784089
http://dx.doi.org/10.1186/s12864-017-3992-z
_version_ 1783255699027918848
author Al-Tobasei, Rafet
Ali, Ali
Leeds, Timothy D.
Liu, Sixin
Palti, Yniv
Kenney, Brett
Salem, Mohamed
author_facet Al-Tobasei, Rafet
Ali, Ali
Leeds, Timothy D.
Liu, Sixin
Palti, Yniv
Kenney, Brett
Salem, Mohamed
author_sort Al-Tobasei, Rafet
collection PubMed
description BACKGROUND: Coding/functional SNPs change the biological function of a gene and, therefore, could serve as “large-effect” genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, muscle yield, muscle fat content, shear force, and whiteness. Phenotypic data were collected for approximately 500 fish, representing 98 families (5 fish/family), from a growth-selected line, and the muscle transcriptome was sequenced from 22 families with divergent phenotypes (4 low- versus 4 high-ranked families per trait). RESULTS: GATK detected 59,112 putative SNPs; of these SNPs, 4798 showed allelic imbalances (>2.0 as an amplification and <0.5 as loss of heterozygosity). SAMtools detected 87,066 putative SNPs; and of them, 4962 had allelic imbalances between the low- and high-ranked families. Only 1829 SNPs with allelic imbalances were common between the two datasets, indicating significant differences in algorithms. The two datasets contained 7930 non-redundant SNPs of which 4439 mapped to 1498 protein-coding genes (with 6.4% non-synonymous SNPs) and 684 mapped to 295 lncRNAs. Validation of a subset of 92 SNPs revealed 1) 86.7–93.8% success rate in calling polymorphic SNPs and 2) 95.4% consistent matching between DNA and cDNA genotypes indicating a high rate of identifying SNPs with allelic imbalances. In addition, 4.64% SNPs revealed random monoallelic expression. Genome distribution of the SNPs with allelic imbalances exhibited high density for all five traits in several chromosomes, especially chromosome 9, 20 and 28. Most of the SNP-harboring genes were assigned to important growth-related metabolic pathways. CONCLUSION: These results demonstrate utility of RNA-Seq in assessing phenotype-associated allelic imbalances in pooled RNA-Seq samples. The SNPs identified in this study were included in a new SNP-Chip design (available from Affymetrix) for genomic and genetic analyses in rainbow trout. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3992-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5547479
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-55474792017-08-09 Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout Al-Tobasei, Rafet Ali, Ali Leeds, Timothy D. Liu, Sixin Palti, Yniv Kenney, Brett Salem, Mohamed BMC Genomics Research Article BACKGROUND: Coding/functional SNPs change the biological function of a gene and, therefore, could serve as “large-effect” genetic markers. In this study, we used two bioinformatics pipelines, GATK and SAMtools, for discovering coding/functional SNPs with allelic-imbalances associated with total body weight, muscle yield, muscle fat content, shear force, and whiteness. Phenotypic data were collected for approximately 500 fish, representing 98 families (5 fish/family), from a growth-selected line, and the muscle transcriptome was sequenced from 22 families with divergent phenotypes (4 low- versus 4 high-ranked families per trait). RESULTS: GATK detected 59,112 putative SNPs; of these SNPs, 4798 showed allelic imbalances (>2.0 as an amplification and <0.5 as loss of heterozygosity). SAMtools detected 87,066 putative SNPs; and of them, 4962 had allelic imbalances between the low- and high-ranked families. Only 1829 SNPs with allelic imbalances were common between the two datasets, indicating significant differences in algorithms. The two datasets contained 7930 non-redundant SNPs of which 4439 mapped to 1498 protein-coding genes (with 6.4% non-synonymous SNPs) and 684 mapped to 295 lncRNAs. Validation of a subset of 92 SNPs revealed 1) 86.7–93.8% success rate in calling polymorphic SNPs and 2) 95.4% consistent matching between DNA and cDNA genotypes indicating a high rate of identifying SNPs with allelic imbalances. In addition, 4.64% SNPs revealed random monoallelic expression. Genome distribution of the SNPs with allelic imbalances exhibited high density for all five traits in several chromosomes, especially chromosome 9, 20 and 28. Most of the SNP-harboring genes were assigned to important growth-related metabolic pathways. CONCLUSION: These results demonstrate utility of RNA-Seq in assessing phenotype-associated allelic imbalances in pooled RNA-Seq samples. The SNPs identified in this study were included in a new SNP-Chip design (available from Affymetrix) for genomic and genetic analyses in rainbow trout. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3992-z) contains supplementary material, which is available to authorized users. BioMed Central 2017-08-07 /pmc/articles/PMC5547479/ /pubmed/28784089 http://dx.doi.org/10.1186/s12864-017-3992-z Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Al-Tobasei, Rafet
Ali, Ali
Leeds, Timothy D.
Liu, Sixin
Palti, Yniv
Kenney, Brett
Salem, Mohamed
Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout
title Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout
title_full Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout
title_fullStr Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout
title_full_unstemmed Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout
title_short Identification of SNPs associated with muscle yield and quality traits using allelic-imbalance analyses of pooled RNA-Seq samples in rainbow trout
title_sort identification of snps associated with muscle yield and quality traits using allelic-imbalance analyses of pooled rna-seq samples in rainbow trout
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5547479/
https://www.ncbi.nlm.nih.gov/pubmed/28784089
http://dx.doi.org/10.1186/s12864-017-3992-z
work_keys_str_mv AT altobaseirafet identificationofsnpsassociatedwithmuscleyieldandqualitytraitsusingallelicimbalanceanalysesofpooledrnaseqsamplesinrainbowtrout
AT aliali identificationofsnpsassociatedwithmuscleyieldandqualitytraitsusingallelicimbalanceanalysesofpooledrnaseqsamplesinrainbowtrout
AT leedstimothyd identificationofsnpsassociatedwithmuscleyieldandqualitytraitsusingallelicimbalanceanalysesofpooledrnaseqsamplesinrainbowtrout
AT liusixin identificationofsnpsassociatedwithmuscleyieldandqualitytraitsusingallelicimbalanceanalysesofpooledrnaseqsamplesinrainbowtrout
AT paltiyniv identificationofsnpsassociatedwithmuscleyieldandqualitytraitsusingallelicimbalanceanalysesofpooledrnaseqsamplesinrainbowtrout
AT kenneybrett identificationofsnpsassociatedwithmuscleyieldandqualitytraitsusingallelicimbalanceanalysesofpooledrnaseqsamplesinrainbowtrout
AT salemmohamed identificationofsnpsassociatedwithmuscleyieldandqualitytraitsusingallelicimbalanceanalysesofpooledrnaseqsamplesinrainbowtrout