Cargando…

Evaluation of approaches for estimating the accuracy of genomic prediction in plant breeding

BACKGROUND: In genomic prediction, an important measure of accuracy is the correlation between the predicted and the true breeding values. Direct computation of this quantity for real datasets is not possible, because the true breeding value is unknown. Instead, the correlation between the predicted...

Descripción completa

Detalles Bibliográficos
Autores principales: Ould Estaghvirou, Sidi Boubacar, Ogutu, Joseph O, Schulz-Streeck, Torben, Knaak, Carsten, Ouzunova, Milena, Gordillo, Andres, Piepho, Hans-Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3879103/
https://www.ncbi.nlm.nih.gov/pubmed/24314298
http://dx.doi.org/10.1186/1471-2164-14-860
_version_ 1782297920563838976
author Ould Estaghvirou, Sidi Boubacar
Ogutu, Joseph O
Schulz-Streeck, Torben
Knaak, Carsten
Ouzunova, Milena
Gordillo, Andres
Piepho, Hans-Peter
author_facet Ould Estaghvirou, Sidi Boubacar
Ogutu, Joseph O
Schulz-Streeck, Torben
Knaak, Carsten
Ouzunova, Milena
Gordillo, Andres
Piepho, Hans-Peter
author_sort Ould Estaghvirou, Sidi Boubacar
collection PubMed
description BACKGROUND: In genomic prediction, an important measure of accuracy is the correlation between the predicted and the true breeding values. Direct computation of this quantity for real datasets is not possible, because the true breeding value is unknown. Instead, the correlation between the predicted breeding values and the observed phenotypic values, called predictive ability, is often computed. In order to indirectly estimate predictive accuracy, this latter correlation is usually divided by an estimate of the square root of heritability. In this study we use simulation to evaluate estimates of predictive accuracy for seven methods, four (1 to 4) of which use an estimate of heritability to divide predictive ability computed by cross-validation. Between them the seven methods cover balanced and unbalanced datasets as well as correlated and uncorrelated genotypes. We propose one new indirect method (4) and two direct methods (5 and 6) for estimating predictive accuracy and compare their performances and those of four other existing approaches (three indirect (1 to 3) and one direct (7)) with simulated true predictive accuracy as the benchmark and with each other. RESULTS: The size of the estimated genetic variance and hence heritability exerted the strongest influence on the variation in the estimated predictive accuracy. Increasing the number of genotypes considerably increases the time required to compute predictive accuracy by all the seven methods, most notably for the five methods that require cross-validation (Methods 1, 2, 3, 4 and 6). A new method that we propose (Method 5) and an existing method (Method 7) used in animal breeding programs were the fastest and gave the least biased, most precise and stable estimates of predictive accuracy. Of the methods that use cross-validation Methods 4 and 6 were often the best. CONCLUSIONS: The estimated genetic variance and the number of genotypes had the greatest influence on predictive accuracy. Methods 5 and 7 were the fastest and produced the least biased, the most precise, robust and stable estimates of predictive accuracy. These properties argue for routinely using Methods 5 and 7 to assess predictive accuracy in genomic selection studies.
format Online
Article
Text
id pubmed-3879103
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38791032014-01-08 Evaluation of approaches for estimating the accuracy of genomic prediction in plant breeding Ould Estaghvirou, Sidi Boubacar Ogutu, Joseph O Schulz-Streeck, Torben Knaak, Carsten Ouzunova, Milena Gordillo, Andres Piepho, Hans-Peter BMC Genomics Research Article BACKGROUND: In genomic prediction, an important measure of accuracy is the correlation between the predicted and the true breeding values. Direct computation of this quantity for real datasets is not possible, because the true breeding value is unknown. Instead, the correlation between the predicted breeding values and the observed phenotypic values, called predictive ability, is often computed. In order to indirectly estimate predictive accuracy, this latter correlation is usually divided by an estimate of the square root of heritability. In this study we use simulation to evaluate estimates of predictive accuracy for seven methods, four (1 to 4) of which use an estimate of heritability to divide predictive ability computed by cross-validation. Between them the seven methods cover balanced and unbalanced datasets as well as correlated and uncorrelated genotypes. We propose one new indirect method (4) and two direct methods (5 and 6) for estimating predictive accuracy and compare their performances and those of four other existing approaches (three indirect (1 to 3) and one direct (7)) with simulated true predictive accuracy as the benchmark and with each other. RESULTS: The size of the estimated genetic variance and hence heritability exerted the strongest influence on the variation in the estimated predictive accuracy. Increasing the number of genotypes considerably increases the time required to compute predictive accuracy by all the seven methods, most notably for the five methods that require cross-validation (Methods 1, 2, 3, 4 and 6). A new method that we propose (Method 5) and an existing method (Method 7) used in animal breeding programs were the fastest and gave the least biased, most precise and stable estimates of predictive accuracy. Of the methods that use cross-validation Methods 4 and 6 were often the best. CONCLUSIONS: The estimated genetic variance and the number of genotypes had the greatest influence on predictive accuracy. Methods 5 and 7 were the fastest and produced the least biased, the most precise, robust and stable estimates of predictive accuracy. These properties argue for routinely using Methods 5 and 7 to assess predictive accuracy in genomic selection studies. BioMed Central 2013-12-06 /pmc/articles/PMC3879103/ /pubmed/24314298 http://dx.doi.org/10.1186/1471-2164-14-860 Text en Copyright © 2013 Ould Estaghvirou et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ould Estaghvirou, Sidi Boubacar
Ogutu, Joseph O
Schulz-Streeck, Torben
Knaak, Carsten
Ouzunova, Milena
Gordillo, Andres
Piepho, Hans-Peter
Evaluation of approaches for estimating the accuracy of genomic prediction in plant breeding
title Evaluation of approaches for estimating the accuracy of genomic prediction in plant breeding
title_full Evaluation of approaches for estimating the accuracy of genomic prediction in plant breeding
title_fullStr Evaluation of approaches for estimating the accuracy of genomic prediction in plant breeding
title_full_unstemmed Evaluation of approaches for estimating the accuracy of genomic prediction in plant breeding
title_short Evaluation of approaches for estimating the accuracy of genomic prediction in plant breeding
title_sort evaluation of approaches for estimating the accuracy of genomic prediction in plant breeding
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3879103/
https://www.ncbi.nlm.nih.gov/pubmed/24314298
http://dx.doi.org/10.1186/1471-2164-14-860
work_keys_str_mv AT ouldestaghvirousidiboubacar evaluationofapproachesforestimatingtheaccuracyofgenomicpredictioninplantbreeding
AT ogutujosepho evaluationofapproachesforestimatingtheaccuracyofgenomicpredictioninplantbreeding
AT schulzstreecktorben evaluationofapproachesforestimatingtheaccuracyofgenomicpredictioninplantbreeding
AT knaakcarsten evaluationofapproachesforestimatingtheaccuracyofgenomicpredictioninplantbreeding
AT ouzunovamilena evaluationofapproachesforestimatingtheaccuracyofgenomicpredictioninplantbreeding
AT gordilloandres evaluationofapproachesforestimatingtheaccuracyofgenomicpredictioninplantbreeding
AT piephohanspeter evaluationofapproachesforestimatingtheaccuracyofgenomicpredictioninplantbreeding