Cargando…

Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits

BACKGROUND: Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitative trait...

Descripción completa

Detalles Bibliográficos
Autores principales: Gebreyesus, Grum, Lund, Mogens S., Buitenhuis, Bart, Bovenhuis, Henk, Poulsen, Nina A., Janss, Luc G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5718071/
https://www.ncbi.nlm.nih.gov/pubmed/29207947
http://dx.doi.org/10.1186/s12711-017-0364-8
_version_ 1783284273890983936
author Gebreyesus, Grum
Lund, Mogens S.
Buitenhuis, Bart
Bovenhuis, Henk
Poulsen, Nina A.
Janss, Luc G.
author_facet Gebreyesus, Grum
Lund, Mogens S.
Buitenhuis, Bart
Bovenhuis, Henk
Poulsen, Nina A.
Janss, Luc G.
author_sort Gebreyesus, Grum
collection PubMed
description BACKGROUND: Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitative trait loci of large effect. The amount of variation explained may vary between regions leading to heterogeneous (co)variance patterns across the genome. Genomic prediction models that can efficiently take such heterogeneity of (co)variances into account can result in improved prediction reliability. In this study, we developed and implemented novel univariate and bivariate Bayesian prediction models, based on estimates of heterogeneous (co)variances for genome segments (BayesAS). Available data consisted of milk protein composition traits measured on cows and de-regressed proofs of total protein yield derived for bulls. Single-nucleotide polymorphisms (SNPs), from 50K SNP arrays, were grouped into non-overlapping genome segments. A segment was defined as one SNP, or a group of 50, 100, or 200 adjacent SNPs, or one chromosome, or the whole genome. Traditional univariate and bivariate genomic best linear unbiased prediction (GBLUP) models were also run for comparison. Reliabilities were calculated through a resampling strategy and using deterministic formula. RESULTS: BayesAS models improved prediction reliability for most of the traits compared to GBLUP models and this gain depended on segment size and genetic architecture of the traits. The gain in prediction reliability was especially marked for the protein composition traits β-CN, κ-CN and β-LG, for which prediction reliabilities were improved by 49 percentage points on average using the MT-BayesAS model with a 100-SNP segment size compared to the bivariate GBLUP. Prediction reliabilities were highest with the BayesAS model that uses a 100-SNP segment size. The bivariate versions of our BayesAS models resulted in extra gains of up to 6% in prediction reliability compared to the univariate versions. CONCLUSIONS: Substantial improvement in prediction reliability was possible for most of the traits related to milk protein composition using our novel BayesAS models. Grouping adjacent SNPs into segments provided enhanced information to estimate parameters and allowing the segments to have different (co)variances helped disentangle heterogeneous (co)variances across the genome.
format Online
Article
Text
id pubmed-5718071
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57180712017-12-08 Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits Gebreyesus, Grum Lund, Mogens S. Buitenhuis, Bart Bovenhuis, Henk Poulsen, Nina A. Janss, Luc G. Genet Sel Evol Research Article BACKGROUND: Accurate genomic prediction requires a large reference population, which is problematic for traits that are expensive to measure. Traits related to milk protein composition are not routinely recorded due to costly procedures and are considered to be controlled by a few quantitative trait loci of large effect. The amount of variation explained may vary between regions leading to heterogeneous (co)variance patterns across the genome. Genomic prediction models that can efficiently take such heterogeneity of (co)variances into account can result in improved prediction reliability. In this study, we developed and implemented novel univariate and bivariate Bayesian prediction models, based on estimates of heterogeneous (co)variances for genome segments (BayesAS). Available data consisted of milk protein composition traits measured on cows and de-regressed proofs of total protein yield derived for bulls. Single-nucleotide polymorphisms (SNPs), from 50K SNP arrays, were grouped into non-overlapping genome segments. A segment was defined as one SNP, or a group of 50, 100, or 200 adjacent SNPs, or one chromosome, or the whole genome. Traditional univariate and bivariate genomic best linear unbiased prediction (GBLUP) models were also run for comparison. Reliabilities were calculated through a resampling strategy and using deterministic formula. RESULTS: BayesAS models improved prediction reliability for most of the traits compared to GBLUP models and this gain depended on segment size and genetic architecture of the traits. The gain in prediction reliability was especially marked for the protein composition traits β-CN, κ-CN and β-LG, for which prediction reliabilities were improved by 49 percentage points on average using the MT-BayesAS model with a 100-SNP segment size compared to the bivariate GBLUP. Prediction reliabilities were highest with the BayesAS model that uses a 100-SNP segment size. The bivariate versions of our BayesAS models resulted in extra gains of up to 6% in prediction reliability compared to the univariate versions. CONCLUSIONS: Substantial improvement in prediction reliability was possible for most of the traits related to milk protein composition using our novel BayesAS models. Grouping adjacent SNPs into segments provided enhanced information to estimate parameters and allowing the segments to have different (co)variances helped disentangle heterogeneous (co)variances across the genome. BioMed Central 2017-12-05 /pmc/articles/PMC5718071/ /pubmed/29207947 http://dx.doi.org/10.1186/s12711-017-0364-8 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Gebreyesus, Grum
Lund, Mogens S.
Buitenhuis, Bart
Bovenhuis, Henk
Poulsen, Nina A.
Janss, Luc G.
Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title_full Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title_fullStr Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title_full_unstemmed Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title_short Modeling heterogeneous (co)variances from adjacent-SNP groups improves genomic prediction for milk protein composition traits
title_sort modeling heterogeneous (co)variances from adjacent-snp groups improves genomic prediction for milk protein composition traits
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5718071/
https://www.ncbi.nlm.nih.gov/pubmed/29207947
http://dx.doi.org/10.1186/s12711-017-0364-8
work_keys_str_mv AT gebreyesusgrum modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits
AT lundmogenss modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits
AT buitenhuisbart modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits
AT bovenhuishenk modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits
AT poulsenninaa modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits
AT jansslucg modelingheterogeneouscovariancesfromadjacentsnpgroupsimprovesgenomicpredictionformilkproteincompositiontraits