Cargando…
Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs
BACKGROUND: In animal breeding, genetic variance for complex traits is often estimated using linear mixed models that incorporate information from single nucleotide polymorphism (SNP) markers using a realized genomic relationship matrix. In such models, individual genetic markers are weighted equall...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4700613/ https://www.ncbi.nlm.nih.gov/pubmed/26728402 http://dx.doi.org/10.1186/s12863-015-0322-9 |
_version_ | 1782408348151316480 |
---|---|
author | Sarup, Pernille Jensen, Just Ostersen, Tage Henryon, Mark Sørensen, Peter |
author_facet | Sarup, Pernille Jensen, Just Ostersen, Tage Henryon, Mark Sørensen, Peter |
author_sort | Sarup, Pernille |
collection | PubMed |
description | BACKGROUND: In animal breeding, genetic variance for complex traits is often estimated using linear mixed models that incorporate information from single nucleotide polymorphism (SNP) markers using a realized genomic relationship matrix. In such models, individual genetic markers are weighted equally and genomic variation is treated as a “black box.” This approach is useful for selecting animals with high genetic potential, but it does not generate or utilise knowledge of the biological mechanisms underlying trait variation. Here we propose a linear mixed-model approach that can evaluate the collective effects of sets of SNPs and thereby open the “black box.” The described genomic feature best linear unbiased prediction (GFBLUP) model has two components that are defined by genomic features. RESULTS: We analysed data on average daily gain, feed efficiency, and lean meat percentage from 3,085 Duroc boars, along with genotypes from a 60 K SNP chip. In addition information on known quantitative trait loci (QTL) from the animal QTL database was integrated in the GFBLUP as a genomic feature. Our results showed that the most significant QTL categories were indeed biologically meaningful. Additionally, for high heritability traits, prediction accuracy was improved by the incorporation of biological knowledge in prediction models. A simulation study using the real genotypes and simulated phenotypes demonstrated challenges regarding detection of causal variants in low to medium heritability traits. CONCLUSIONS: The GFBLUP model showed increased predictive ability when enough causal variants were included in the genomic feature to explain over 10 % of the genomic variance, and when dilution by non-causal markers was minimal. In the observed data set, predictive ability was increased by the inclusion of prior QTL information obtained outside the training data set, but only for the trait with highest heritability. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-015-0322-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4700613 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-47006132016-01-06 Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs Sarup, Pernille Jensen, Just Ostersen, Tage Henryon, Mark Sørensen, Peter BMC Genet Research Article BACKGROUND: In animal breeding, genetic variance for complex traits is often estimated using linear mixed models that incorporate information from single nucleotide polymorphism (SNP) markers using a realized genomic relationship matrix. In such models, individual genetic markers are weighted equally and genomic variation is treated as a “black box.” This approach is useful for selecting animals with high genetic potential, but it does not generate or utilise knowledge of the biological mechanisms underlying trait variation. Here we propose a linear mixed-model approach that can evaluate the collective effects of sets of SNPs and thereby open the “black box.” The described genomic feature best linear unbiased prediction (GFBLUP) model has two components that are defined by genomic features. RESULTS: We analysed data on average daily gain, feed efficiency, and lean meat percentage from 3,085 Duroc boars, along with genotypes from a 60 K SNP chip. In addition information on known quantitative trait loci (QTL) from the animal QTL database was integrated in the GFBLUP as a genomic feature. Our results showed that the most significant QTL categories were indeed biologically meaningful. Additionally, for high heritability traits, prediction accuracy was improved by the incorporation of biological knowledge in prediction models. A simulation study using the real genotypes and simulated phenotypes demonstrated challenges regarding detection of causal variants in low to medium heritability traits. CONCLUSIONS: The GFBLUP model showed increased predictive ability when enough causal variants were included in the genomic feature to explain over 10 % of the genomic variance, and when dilution by non-causal markers was minimal. In the observed data set, predictive ability was increased by the inclusion of prior QTL information obtained outside the training data set, but only for the trait with highest heritability. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12863-015-0322-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-01-05 /pmc/articles/PMC4700613/ /pubmed/26728402 http://dx.doi.org/10.1186/s12863-015-0322-9 Text en © Sarup et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Sarup, Pernille Jensen, Just Ostersen, Tage Henryon, Mark Sørensen, Peter Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs |
title | Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs |
title_full | Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs |
title_fullStr | Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs |
title_full_unstemmed | Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs |
title_short | Increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred Danish Duroc pigs |
title_sort | increased prediction accuracy using a genomic feature model including prior information on quantitative trait locus regions in purebred danish duroc pigs |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4700613/ https://www.ncbi.nlm.nih.gov/pubmed/26728402 http://dx.doi.org/10.1186/s12863-015-0322-9 |
work_keys_str_mv | AT saruppernille increasedpredictionaccuracyusingagenomicfeaturemodelincludingpriorinformationonquantitativetraitlocusregionsinpurebreddanishdurocpigs AT jensenjust increasedpredictionaccuracyusingagenomicfeaturemodelincludingpriorinformationonquantitativetraitlocusregionsinpurebreddanishdurocpigs AT ostersentage increasedpredictionaccuracyusingagenomicfeaturemodelincludingpriorinformationonquantitativetraitlocusregionsinpurebreddanishdurocpigs AT henryonmark increasedpredictionaccuracyusingagenomicfeaturemodelincludingpriorinformationonquantitativetraitlocusregionsinpurebreddanishdurocpigs AT sørensenpeter increasedpredictionaccuracyusingagenomicfeaturemodelincludingpriorinformationonquantitativetraitlocusregionsinpurebreddanishdurocpigs |