Cargando…

Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs

BACKGROUND: In animal breeding, genetic variance for complex traits is often estimated using linear mixed models that incorporate information from single nucleotide polymorphism (SNP) markers using a realized genomic relationship matrix. In such models, individual genetic markers are weighted equall...

Descripción completa

Detalles Bibliográficos
Autores principales: Sarup, Pernille, Jensen, Just, Ostersen, Tage, Henryon, Mark, Sørensen, Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4700613/
https://www.ncbi.nlm.nih.gov/pubmed/26728402
http://dx.doi.org/10.1186/s12863-015-0322-9
_version_ 1782408348151316480
author Sarup, Pernille
Jensen, Just
Ostersen, Tage
Henryon, Mark
Sørensen, Peter
author_facet Sarup, Pernille
Jensen, Just
Ostersen, Tage
Henryon, Mark
Sørensen, Peter
author_sort Sarup, Pernille
collection PubMed
description BACKGROUND: In animal breeding, genetic variance for complex traits is often estimated using linear mixed models that incorporate information from single nucleotide polymorphism (SNP) markers using a realized genomic relationship matrix. In such models, individual genetic markers are weighted equally and genomic variation is treated as a “black box.” This approach is useful for selecting animals with high genetic potential, but it does not generate or utilise knowledge of the biological mechanisms underlying trait variation. Here we propose a linear mixed-model approach that can evaluate the collective effects of sets of SNPs and thereby open the “black box.” The described genomic feature best linear unbiased prediction (GFBLUP) model has two components that are defined by genomic features. RESULTS: We analysed data on average daily gain, feed efficiency, and lean meat percentage from 3,085 Duroc boars, along with genotypes from a 60 K SNP chip. In addition information on known quantitative trait loci (QTL) from the animal QTL database was integrated in the GFBLUP as a genomic feature. Our results showed that the most significant QTL categories were indeed biologically meaningful. Additionally, for high heritability traits, prediction accuracy was improved by the incorporation of biological knowledge in prediction models. A simulation study using the real genotypes and simulated phenotypes demonstrated challenges regarding detection of causal variants in low to medium heritability traits. CONCLUSIONS: The GFBLUP model showed increased predictive ability when enough causal variants were included in the genomic feature to explain over 10 % of the genomic variance, and when dilution by non-causal markers was minimal. In the observed data set, predictive ability was increased by the inclusion of prior QTL information obtained outside the training data set, but only for the trait with highest heritability. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-015-0322-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4700613
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47006132016-01-06 Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs Sarup, Pernille Jensen, Just Ostersen, Tage Henryon, Mark Sørensen, Peter BMC Genet Research Article BACKGROUND: In animal breeding, genetic variance for complex traits is often estimated using linear mixed models that incorporate information from single nucleotide polymorphism (SNP) markers using a realized genomic relationship matrix. In such models, individual genetic markers are weighted equally and genomic variation is treated as a “black box.” This approach is useful for selecting animals with high genetic potential, but it does not generate or utilise knowledge of the biological mechanisms underlying trait variation. Here we propose a linear mixed-model approach that can evaluate the collective effects of sets of SNPs and thereby open the “black box.” The described genomic feature best linear unbiased prediction (GFBLUP) model has two components that are defined by genomic features. RESULTS: We analysed data on average daily gain, feed efficiency, and lean meat percentage from 3,085 Duroc boars, along with genotypes from a 60 K SNP chip. In addition information on known quantitative trait loci (QTL) from the animal QTL database was integrated in the GFBLUP as a genomic feature. Our results showed that the most significant QTL categories were indeed biologically meaningful. Additionally, for high heritability traits, prediction accuracy was improved by the incorporation of biological knowledge in prediction models. A simulation study using the real genotypes and simulated phenotypes demonstrated challenges regarding detection of causal variants in low to medium heritability traits. CONCLUSIONS: The GFBLUP model showed increased predictive ability when enough causal variants were included in the genomic feature to explain over 10 % of the genomic variance, and when dilution by non-causal markers was minimal. In the observed data set, predictive ability was increased by the inclusion of prior QTL information obtained outside the training data set, but only for the trait with highest heritability. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-015-0322-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-01-05 /pmc/articles/PMC4700613/ /pubmed/26728402 http://dx.doi.org/10.1186/s12863-015-0322-9 Text en © Sarup et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Sarup, Pernille
Jensen, Just
Ostersen, Tage
Henryon, Mark
Sørensen, Peter
Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs
title Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs
title_full Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs
title_fullStr Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs
title_full_unstemmed Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs
title_short Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs
title_sort increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred danish duroc pigs
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4700613/
https://www.ncbi.nlm.nih.gov/pubmed/26728402
http://dx.doi.org/10.1186/s12863-015-0322-9
work_keys_str_mv AT saruppernille increasedpredictionaccuracyusingagenomicfeaturemodelincludingpriorinformationonquantitativetraitlocusregionsinpurebreddanishdurocpigs
AT jensenjust increasedpredictionaccuracyusingagenomicfeaturemodelincludingpriorinformationonquantitativetraitlocusregionsinpurebreddanishdurocpigs
AT ostersentage increasedpredictionaccuracyusingagenomicfeaturemodelincludingpriorinformationonquantitativetraitlocusregionsinpurebreddanishdurocpigs
AT henryonmark increasedpredictionaccuracyusingagenomicfeaturemodelincludingpriorinformationonquantitativetraitlocusregionsinpurebreddanishdurocpigs
AT sørensenpeter increasedpredictionaccuracyusingagenomicfeaturemodelincludingpriorinformationonquantitativetraitlocusregionsinpurebreddanishdurocpigs