Cargando…

Using SNP Weights Derived From Gene Expression Modules to Improve GWAS Power for Feed Efficiency in Pigs

The “large p small n” problem has posed a significant challenge in the analysis and interpretation of genome-wide association studies (GWAS). The use of prior information to rank genomic regions and perform SNP selection could increase the power of GWAS. In this study, we propose the use of gene exp...

Descripción completa

Detalles Bibliográficos
Autores principales: Keel, Brittney N., Snelling, Warren M., Lindholm-Perry, Amanda K., Oliver, William T., Kuehn, Larry A., Rohrer, Gary A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6985563/
https://www.ncbi.nlm.nih.gov/pubmed/32038708
http://dx.doi.org/10.3389/fgene.2019.01339
_version_ 1783491829485797376
author Keel, Brittney N.
Snelling, Warren M.
Lindholm-Perry, Amanda K.
Oliver, William T.
Kuehn, Larry A.
Rohrer, Gary A.
author_facet Keel, Brittney N.
Snelling, Warren M.
Lindholm-Perry, Amanda K.
Oliver, William T.
Kuehn, Larry A.
Rohrer, Gary A.
author_sort Keel, Brittney N.
collection PubMed
description The “large p small n” problem has posed a significant challenge in the analysis and interpretation of genome-wide association studies (GWAS). The use of prior information to rank genomic regions and perform SNP selection could increase the power of GWAS. In this study, we propose the use of gene expression data from RNA-Seq of multiple tissues as prior information to assign weights to SNP, select SNP based on a weight threshold, and utilize weighted hypothesis testing to conduct a GWAS. RNA-Seq libraries from hypothalamus, duodenum, ileum, and jejunum tissue of 30 pigs with divergent feed efficiency phenotypes were sequenced, and a three-way gene x individual x tissue clustering analysis was performed, using constrained tensor decomposition, to obtain a total of 10 gene expression modules. Loading values from each gene module were used to assign weights to 49,691 commercial SNP markers, and SNP were selected using a weight threshold, resulting in 10 SNP sets ranging in size from 101 to 955 markers. Weighted GWAS for feed intake in 4,200 pigs was performed separately for each of the 10 SNP sets. A total of 36 unique significant SNP associations were identified across the ten gene modules (SNP sets). For comparison, a standard unweighted GWAS using all 49,691 SNP was performed, and only 2 SNP were significant. None of the SNP from the unweighted analysis resided in known QTL related to swine feed efficiency (feed intake, average daily gain, and feed conversion ratio) compared to 29 (80.6%) in the weighted analyses, with 9 SNP residing in feed intake QTL. These results suggest that the heritability of feed intake is driven by many SNP that individually do not attain genome-wide significance in GWAS. Hence, the proposed procedure for prioritizing SNP based on gene expression data across multiple tissues provides a promising approach for improving the power of GWAS.
format Online
Article
Text
id pubmed-6985563
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-69855632020-02-07 Using SNP Weights Derived From Gene Expression Modules to Improve GWAS Power for Feed Efficiency in Pigs Keel, Brittney N. Snelling, Warren M. Lindholm-Perry, Amanda K. Oliver, William T. Kuehn, Larry A. Rohrer, Gary A. Front Genet Genetics The “large p small n” problem has posed a significant challenge in the analysis and interpretation of genome-wide association studies (GWAS). The use of prior information to rank genomic regions and perform SNP selection could increase the power of GWAS. In this study, we propose the use of gene expression data from RNA-Seq of multiple tissues as prior information to assign weights to SNP, select SNP based on a weight threshold, and utilize weighted hypothesis testing to conduct a GWAS. RNA-Seq libraries from hypothalamus, duodenum, ileum, and jejunum tissue of 30 pigs with divergent feed efficiency phenotypes were sequenced, and a three-way gene x individual x tissue clustering analysis was performed, using constrained tensor decomposition, to obtain a total of 10 gene expression modules. Loading values from each gene module were used to assign weights to 49,691 commercial SNP markers, and SNP were selected using a weight threshold, resulting in 10 SNP sets ranging in size from 101 to 955 markers. Weighted GWAS for feed intake in 4,200 pigs was performed separately for each of the 10 SNP sets. A total of 36 unique significant SNP associations were identified across the ten gene modules (SNP sets). For comparison, a standard unweighted GWAS using all 49,691 SNP was performed, and only 2 SNP were significant. None of the SNP from the unweighted analysis resided in known QTL related to swine feed efficiency (feed intake, average daily gain, and feed conversion ratio) compared to 29 (80.6%) in the weighted analyses, with 9 SNP residing in feed intake QTL. These results suggest that the heritability of feed intake is driven by many SNP that individually do not attain genome-wide significance in GWAS. Hence, the proposed procedure for prioritizing SNP based on gene expression data across multiple tissues provides a promising approach for improving the power of GWAS. Frontiers Media S.A. 2020-01-21 /pmc/articles/PMC6985563/ /pubmed/32038708 http://dx.doi.org/10.3389/fgene.2019.01339 Text en Copyright © 2020 Keel, Snelling, Lindholm-Perry, Oliver, Kuehn and Rohrer http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Keel, Brittney N.
Snelling, Warren M.
Lindholm-Perry, Amanda K.
Oliver, William T.
Kuehn, Larry A.
Rohrer, Gary A.
Using SNP Weights Derived From Gene Expression Modules to Improve GWAS Power for Feed Efficiency in Pigs
title Using SNP Weights Derived From Gene Expression Modules to Improve GWAS Power for Feed Efficiency in Pigs
title_full Using SNP Weights Derived From Gene Expression Modules to Improve GWAS Power for Feed Efficiency in Pigs
title_fullStr Using SNP Weights Derived From Gene Expression Modules to Improve GWAS Power for Feed Efficiency in Pigs
title_full_unstemmed Using SNP Weights Derived From Gene Expression Modules to Improve GWAS Power for Feed Efficiency in Pigs
title_short Using SNP Weights Derived From Gene Expression Modules to Improve GWAS Power for Feed Efficiency in Pigs
title_sort using snp weights derived from gene expression modules to improve gwas power for feed efficiency in pigs
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6985563/
https://www.ncbi.nlm.nih.gov/pubmed/32038708
http://dx.doi.org/10.3389/fgene.2019.01339
work_keys_str_mv AT keelbrittneyn usingsnpweightsderivedfromgeneexpressionmodulestoimprovegwaspowerforfeedefficiencyinpigs
AT snellingwarrenm usingsnpweightsderivedfromgeneexpressionmodulestoimprovegwaspowerforfeedefficiencyinpigs
AT lindholmperryamandak usingsnpweightsderivedfromgeneexpressionmodulestoimprovegwaspowerforfeedefficiencyinpigs
AT oliverwilliamt usingsnpweightsderivedfromgeneexpressionmodulestoimprovegwaspowerforfeedefficiencyinpigs
AT kuehnlarrya usingsnpweightsderivedfromgeneexpressionmodulestoimprovegwaspowerforfeedefficiencyinpigs
AT rohrergarya usingsnpweightsderivedfromgeneexpressionmodulestoimprovegwaspowerforfeedefficiencyinpigs