Cargando…

Use and optimization of different sources of information for genomic prediction

BACKGROUND: Molecular data is now commonly used to predict breeding values (BV). Various methods to calculate genomic relationship matrices (GRM) have been developed, with some studies proposing regression of coefficients back to the reference matrix of pedigree-based relationship coefficients (A)....

Descripción completa

Detalles Bibliográficos
Autores principales:	Ilska, Joanna J., Meuwissen, Theo H. E., Kranis, Andreas, Woolliams, John A.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2017
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5725675/ https://www.ncbi.nlm.nih.gov/pubmed/29228899 http://dx.doi.org/10.1186/s12711-017-0365-7

_version_	1783285580365299712
author	Ilska, Joanna J. Meuwissen, Theo H. E. Kranis, Andreas Woolliams, John A.
author_facet	Ilska, Joanna J. Meuwissen, Theo H. E. Kranis, Andreas Woolliams, John A.
author_sort	Ilska, Joanna J.
collection	PubMed
description	BACKGROUND: Molecular data is now commonly used to predict breeding values (BV). Various methods to calculate genomic relationship matrices (GRM) have been developed, with some studies proposing regression of coefficients back to the reference matrix of pedigree-based relationship coefficients (A). The objective was to compare the utility of two GRM: a matrix based on linkage analysis (LA) and anchored to the pedigree, i.e. [Formula: see text] and a matrix based on linkage disequilibrium (LD), i.e. [Formula: see text] , using genomic and phenotypic data collected on 5416 broiler chickens. Furthermore, the effects of regressing the coefficients of [Formula: see text] back to A (LDA) and to [Formula: see text] (LDLA) were evaluated, using a range of weighting factors. The performance of the matrices and their composite products was assessed by the fit of the models to the data, and the empirical accuracy and bias of the BV that they predicted. The sensitivity to marker choice was examined by using two chips of equal density but including different single nucleotide polymorphisms (SNPs). RESULTS: The likelihood of models using GRM and composite matrices exceeded the likelihood of models based on pedigree alone and was highest with intermediate weighting factors for both the LDA and LDLA approaches. For these data, empirical accuracies were not strongly affected by the weighting factors, although they were highest when different sources of information were combined. The optimum weighting factors depended on the type of matrices used, as well as on the choice of SNPs from which the GRM were constructed. Prediction bias was strongly affected by the chip used and less by the form of the GRM. CONCLUSIONS: Our findings provide an empirical comparison of the efficacy of pedigree and genomic predictions in broiler chickens and examine the effects of fitting GRM with coefficients regressed back to a reference anchored to the pedigree, either A or [Formula: see text] . For the analysed dataset, the best results were obtained when [Formula: see text] was combined with relationships in A or [Formula: see text] , with optimum weighting factors that depended on the choice of SNPs used. The optimum weighting factor for broiler body weight differed from weighting factors that were based on the density of SNPs and theoretically derived using generalised assumptions.
format	Online Article Text
id	pubmed-5725675
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-57256752017-12-13 Use and optimization of different sources of information for genomic prediction Ilska, Joanna J. Meuwissen, Theo H. E. Kranis, Andreas Woolliams, John A. Genet Sel Evol Research Article BACKGROUND: Molecular data is now commonly used to predict breeding values (BV). Various methods to calculate genomic relationship matrices (GRM) have been developed, with some studies proposing regression of coefficients back to the reference matrix of pedigree-based relationship coefficients (A). The objective was to compare the utility of two GRM: a matrix based on linkage analysis (LA) and anchored to the pedigree, i.e. [Formula: see text] and a matrix based on linkage disequilibrium (LD), i.e. [Formula: see text] , using genomic and phenotypic data collected on 5416 broiler chickens. Furthermore, the effects of regressing the coefficients of [Formula: see text] back to A (LDA) and to [Formula: see text] (LDLA) were evaluated, using a range of weighting factors. The performance of the matrices and their composite products was assessed by the fit of the models to the data, and the empirical accuracy and bias of the BV that they predicted. The sensitivity to marker choice was examined by using two chips of equal density but including different single nucleotide polymorphisms (SNPs). RESULTS: The likelihood of models using GRM and composite matrices exceeded the likelihood of models based on pedigree alone and was highest with intermediate weighting factors for both the LDA and LDLA approaches. For these data, empirical accuracies were not strongly affected by the weighting factors, although they were highest when different sources of information were combined. The optimum weighting factors depended on the type of matrices used, as well as on the choice of SNPs from which the GRM were constructed. Prediction bias was strongly affected by the chip used and less by the form of the GRM. CONCLUSIONS: Our findings provide an empirical comparison of the efficacy of pedigree and genomic predictions in broiler chickens and examine the effects of fitting GRM with coefficients regressed back to a reference anchored to the pedigree, either A or [Formula: see text] . For the analysed dataset, the best results were obtained when [Formula: see text] was combined with relationships in A or [Formula: see text] , with optimum weighting factors that depended on the choice of SNPs used. The optimum weighting factor for broiler body weight differed from weighting factors that were based on the density of SNPs and theoretically derived using generalised assumptions. BioMed Central 2017-12-11 /pmc/articles/PMC5725675/ /pubmed/29228899 http://dx.doi.org/10.1186/s12711-017-0365-7 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Ilska, Joanna J. Meuwissen, Theo H. E. Kranis, Andreas Woolliams, John A. Use and optimization of different sources of information for genomic prediction
title	Use and optimization of different sources of information for genomic prediction
title_full	Use and optimization of different sources of information for genomic prediction
title_fullStr	Use and optimization of different sources of information for genomic prediction
title_full_unstemmed	Use and optimization of different sources of information for genomic prediction
title_short	Use and optimization of different sources of information for genomic prediction
title_sort	use and optimization of different sources of information for genomic prediction
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5725675/ https://www.ncbi.nlm.nih.gov/pubmed/29228899 http://dx.doi.org/10.1186/s12711-017-0365-7
work_keys_str_mv	AT ilskajoannaj useandoptimizationofdifferentsourcesofinformationforgenomicprediction AT meuwissentheohe useandoptimizationofdifferentsourcesofinformationforgenomicprediction AT kranisandreas useandoptimizationofdifferentsourcesofinformationforgenomicprediction AT woolliamsjohna useandoptimizationofdifferentsourcesofinformationforgenomicprediction

Use and optimization of different sources of information for genomic prediction

Ejemplares similares