Cargando…

Use and optimization of different sources of information for genomic prediction

BACKGROUND: Molecular data is now commonly used to predict breeding values (BV). Various methods to calculate genomic relationship matrices (GRM) have been developed, with some studies proposing regression of coefficients back to the reference matrix of pedigree-based relationship coefficients (A)....

Descripción completa

Detalles Bibliográficos
Autores principales: Ilska, Joanna J., Meuwissen, Theo H. E., Kranis, Andreas, Woolliams, John A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5725675/
https://www.ncbi.nlm.nih.gov/pubmed/29228899
http://dx.doi.org/10.1186/s12711-017-0365-7
_version_ 1783285580365299712
author Ilska, Joanna J.
Meuwissen, Theo H. E.
Kranis, Andreas
Woolliams, John A.
author_facet Ilska, Joanna J.
Meuwissen, Theo H. E.
Kranis, Andreas
Woolliams, John A.
author_sort Ilska, Joanna J.
collection PubMed
description BACKGROUND: Molecular data is now commonly used to predict breeding values (BV). Various methods to calculate genomic relationship matrices (GRM) have been developed, with some studies proposing regression of coefficients back to the reference matrix of pedigree-based relationship coefficients (A). The objective was to compare the utility of two GRM: a matrix based on linkage analysis (LA) and anchored to the pedigree, i.e. [Formula: see text] and a matrix based on linkage disequilibrium (LD), i.e. [Formula: see text] , using genomic and phenotypic data collected on 5416 broiler chickens. Furthermore, the effects of regressing the coefficients of [Formula: see text] back to A (LDA) and to [Formula: see text] (LDLA) were evaluated, using a range of weighting factors. The performance of the matrices and their composite products was assessed by the fit of the models to the data, and the empirical accuracy and bias of the BV that they predicted. The sensitivity to marker choice was examined by using two chips of equal density but including different single nucleotide polymorphisms (SNPs). RESULTS: The likelihood of models using GRM and composite matrices exceeded the likelihood of models based on pedigree alone and was highest with intermediate weighting factors for both the LDA and LDLA approaches. For these data, empirical accuracies were not strongly affected by the weighting factors, although they were highest when different sources of information were combined. The optimum weighting factors depended on the type of matrices used, as well as on the choice of SNPs from which the GRM were constructed. Prediction bias was strongly affected by the chip used and less by the form of the GRM. CONCLUSIONS: Our findings provide an empirical comparison of the efficacy of pedigree and genomic predictions in broiler chickens and examine the effects of fitting GRM with coefficients regressed back to a reference anchored to the pedigree, either A or [Formula: see text] . For the analysed dataset, the best results were obtained when [Formula: see text] was combined with relationships in A or [Formula: see text] , with optimum weighting factors that depended on the choice of SNPs used. The optimum weighting factor for broiler body weight differed from weighting factors that were based on the density of SNPs and theoretically derived using generalised assumptions.
format Online
Article
Text
id pubmed-5725675
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57256752017-12-13 Use and optimization of different sources of information for genomic prediction Ilska, Joanna J. Meuwissen, Theo H. E. Kranis, Andreas Woolliams, John A. Genet Sel Evol Research Article BACKGROUND: Molecular data is now commonly used to predict breeding values (BV). Various methods to calculate genomic relationship matrices (GRM) have been developed, with some studies proposing regression of coefficients back to the reference matrix of pedigree-based relationship coefficients (A). The objective was to compare the utility of two GRM: a matrix based on linkage analysis (LA) and anchored to the pedigree, i.e. [Formula: see text] and a matrix based on linkage disequilibrium (LD), i.e. [Formula: see text] , using genomic and phenotypic data collected on 5416 broiler chickens. Furthermore, the effects of regressing the coefficients of [Formula: see text] back to A (LDA) and to [Formula: see text] (LDLA) were evaluated, using a range of weighting factors. The performance of the matrices and their composite products was assessed by the fit of the models to the data, and the empirical accuracy and bias of the BV that they predicted. The sensitivity to marker choice was examined by using two chips of equal density but including different single nucleotide polymorphisms (SNPs). RESULTS: The likelihood of models using GRM and composite matrices exceeded the likelihood of models based on pedigree alone and was highest with intermediate weighting factors for both the LDA and LDLA approaches. For these data, empirical accuracies were not strongly affected by the weighting factors, although they were highest when different sources of information were combined. The optimum weighting factors depended on the type of matrices used, as well as on the choice of SNPs from which the GRM were constructed. Prediction bias was strongly affected by the chip used and less by the form of the GRM. CONCLUSIONS: Our findings provide an empirical comparison of the efficacy of pedigree and genomic predictions in broiler chickens and examine the effects of fitting GRM with coefficients regressed back to a reference anchored to the pedigree, either A or [Formula: see text] . For the analysed dataset, the best results were obtained when [Formula: see text] was combined with relationships in A or [Formula: see text] , with optimum weighting factors that depended on the choice of SNPs used. The optimum weighting factor for broiler body weight differed from weighting factors that were based on the density of SNPs and theoretically derived using generalised assumptions. BioMed Central 2017-12-11 /pmc/articles/PMC5725675/ /pubmed/29228899 http://dx.doi.org/10.1186/s12711-017-0365-7 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ilska, Joanna J.
Meuwissen, Theo H. E.
Kranis, Andreas
Woolliams, John A.
Use and optimization of different sources of information for genomic prediction
title Use and optimization of different sources of information for genomic prediction
title_full Use and optimization of different sources of information for genomic prediction
title_fullStr Use and optimization of different sources of information for genomic prediction
title_full_unstemmed Use and optimization of different sources of information for genomic prediction
title_short Use and optimization of different sources of information for genomic prediction
title_sort use and optimization of different sources of information for genomic prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5725675/
https://www.ncbi.nlm.nih.gov/pubmed/29228899
http://dx.doi.org/10.1186/s12711-017-0365-7
work_keys_str_mv AT ilskajoannaj useandoptimizationofdifferentsourcesofinformationforgenomicprediction
AT meuwissentheohe useandoptimizationofdifferentsourcesofinformationforgenomicprediction
AT kranisandreas useandoptimizationofdifferentsourcesofinformationforgenomicprediction
AT woolliamsjohna useandoptimizationofdifferentsourcesofinformationforgenomicprediction