Cargando…

Genomic Prediction Within and Across Biparental Families: Means and Variances of Prediction Accuracy and Usefulness of Deterministic Equations

A major application of genomic prediction (GP) in plant breeding is the identification of superior inbred lines within families derived from biparental crosses. When models for various traits were trained within related or unrelated biparental families (BPFs), experimental studies found substantial...

Descripción completa

Detalles Bibliográficos
Autores principales:	Schopp, Pascal, Müller, Dominik, Wientjes, Yvonne C. J., Melchinger, Albrecht E.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Genetics Society of America 2017
Materias:	Genomic Selection
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5677162/ https://www.ncbi.nlm.nih.gov/pubmed/28916649 http://dx.doi.org/10.1534/g3.117.300076

_version_	1783277187191799808
author	Schopp, Pascal Müller, Dominik Wientjes, Yvonne C. J. Melchinger, Albrecht E.
author_facet	Schopp, Pascal Müller, Dominik Wientjes, Yvonne C. J. Melchinger, Albrecht E.
author_sort	Schopp, Pascal
collection	PubMed
description	A major application of genomic prediction (GP) in plant breeding is the identification of superior inbred lines within families derived from biparental crosses. When models for various traits were trained within related or unrelated biparental families (BPFs), experimental studies found substantial variation in prediction accuracy (PA), but little is known about the underlying factors. We used SNP marker genotypes of inbred lines from either elite germplasm or landraces of maize (Zea mays L.) as parents to generate in silico 300 BPFs of doubled-haploid lines. We analyzed PA within each BPF for 50 simulated polygenic traits, using genomic best linear unbiased prediction (GBLUP) models trained with individuals from either full-sib (FSF), half-sib (HSF), or unrelated families (URF) for various sizes ([Formula: see text]) of the training set and different heritabilities ([Formula: see text] In addition, we modified two deterministic equations for forecasting PA to account for inbreeding and genetic variance unexplained by the training set. Averaged across traits, PA was high within FSF (0.41–0.97) with large variation only for [Formula: see text] and [Formula: see text] [Formula: see text] For HSF and URF, PA was on average ∼40–60% lower and varied substantially among different combinations of BPFs used for model training and prediction as well as different traits. As exemplified by HSF results, PA of across-family GP can be very low if causal variants not segregating in the training set account for a sizeable proportion of the genetic variance among predicted individuals. Deterministic equations accurately forecast the PA expected over many traits, yet cannot capture trait-specific deviations. We conclude that model training within BPFs generally yields stable PA, whereas a high level of uncertainty is encountered in across-family GP. Our study shows the extent of variation in PA that must be at least reckoned with in practice and offers a starting point for the design of training sets composed of multiple BPFs.
format	Online Article Text
id	pubmed-5677162
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Genetics Society of America
record_format	MEDLINE/PubMed
spelling	pubmed-56771622017-11-09 Genomic Prediction Within and Across Biparental Families: Means and Variances of Prediction Accuracy and Usefulness of Deterministic Equations Schopp, Pascal Müller, Dominik Wientjes, Yvonne C. J. Melchinger, Albrecht E. G3 (Bethesda) Genomic Selection A major application of genomic prediction (GP) in plant breeding is the identification of superior inbred lines within families derived from biparental crosses. When models for various traits were trained within related or unrelated biparental families (BPFs), experimental studies found substantial variation in prediction accuracy (PA), but little is known about the underlying factors. We used SNP marker genotypes of inbred lines from either elite germplasm or landraces of maize (Zea mays L.) as parents to generate in silico 300 BPFs of doubled-haploid lines. We analyzed PA within each BPF for 50 simulated polygenic traits, using genomic best linear unbiased prediction (GBLUP) models trained with individuals from either full-sib (FSF), half-sib (HSF), or unrelated families (URF) for various sizes ([Formula: see text]) of the training set and different heritabilities ([Formula: see text] In addition, we modified two deterministic equations for forecasting PA to account for inbreeding and genetic variance unexplained by the training set. Averaged across traits, PA was high within FSF (0.41–0.97) with large variation only for [Formula: see text] and [Formula: see text] [Formula: see text] For HSF and URF, PA was on average ∼40–60% lower and varied substantially among different combinations of BPFs used for model training and prediction as well as different traits. As exemplified by HSF results, PA of across-family GP can be very low if causal variants not segregating in the training set account for a sizeable proportion of the genetic variance among predicted individuals. Deterministic equations accurately forecast the PA expected over many traits, yet cannot capture trait-specific deviations. We conclude that model training within BPFs generally yields stable PA, whereas a high level of uncertainty is encountered in across-family GP. Our study shows the extent of variation in PA that must be at least reckoned with in practice and offers a starting point for the design of training sets composed of multiple BPFs. Genetics Society of America 2017-09-15 /pmc/articles/PMC5677162/ /pubmed/28916649 http://dx.doi.org/10.1534/g3.117.300076 Text en Copyright © 2017 Schopp et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Genomic Selection Schopp, Pascal Müller, Dominik Wientjes, Yvonne C. J. Melchinger, Albrecht E. Genomic Prediction Within and Across Biparental Families: Means and Variances of Prediction Accuracy and Usefulness of Deterministic Equations
title	Genomic Prediction Within and Across Biparental Families: Means and Variances of Prediction Accuracy and Usefulness of Deterministic Equations
title_full	Genomic Prediction Within and Across Biparental Families: Means and Variances of Prediction Accuracy and Usefulness of Deterministic Equations
title_fullStr	Genomic Prediction Within and Across Biparental Families: Means and Variances of Prediction Accuracy and Usefulness of Deterministic Equations
title_full_unstemmed	Genomic Prediction Within and Across Biparental Families: Means and Variances of Prediction Accuracy and Usefulness of Deterministic Equations
title_short	Genomic Prediction Within and Across Biparental Families: Means and Variances of Prediction Accuracy and Usefulness of Deterministic Equations
title_sort	genomic prediction within and across biparental families: means and variances of prediction accuracy and usefulness of deterministic equations
topic	Genomic Selection
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5677162/ https://www.ncbi.nlm.nih.gov/pubmed/28916649 http://dx.doi.org/10.1534/g3.117.300076
work_keys_str_mv	AT schopppascal genomicpredictionwithinandacrossbiparentalfamiliesmeansandvariancesofpredictionaccuracyandusefulnessofdeterministicequations AT mullerdominik genomicpredictionwithinandacrossbiparentalfamiliesmeansandvariancesofpredictionaccuracyandusefulnessofdeterministicequations AT wientjesyvonnecj genomicpredictionwithinandacrossbiparentalfamiliesmeansandvariancesofpredictionaccuracyandusefulnessofdeterministicequations AT melchingeralbrechte genomicpredictionwithinandacrossbiparentalfamiliesmeansandvariancesofpredictionaccuracyandusefulnessofdeterministicequations

Genomic Prediction Within and Across Biparental Families: Means and Variances of Prediction Accuracy and Usefulness of Deterministic Equations

Ejemplares similares