Cargando…

Robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data

BACKGROUND: Genomic prediction (GP) is used in animal and plant breeding to help identify the best genotypes for selection. One of the most important measures of the effectiveness and reliability of GP in plant breeding is predictive accuracy. An accurate estimate of this measure is thus central to...

Descripción completa

Detalles Bibliográficos
Autores principales: Lourenço, Vanda Milheiro, Ogutu, Joseph Ochieng, Piepho, Hans-Peter
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6958597/
https://www.ncbi.nlm.nih.gov/pubmed/31937245
http://dx.doi.org/10.1186/s12864-019-6429-z
_version_ 1783487447022174208
author Lourenço, Vanda Milheiro
Ogutu, Joseph Ochieng
Piepho, Hans-Peter
author_facet Lourenço, Vanda Milheiro
Ogutu, Joseph Ochieng
Piepho, Hans-Peter
author_sort Lourenço, Vanda Milheiro
collection PubMed
description BACKGROUND: Genomic prediction (GP) is used in animal and plant breeding to help identify the best genotypes for selection. One of the most important measures of the effectiveness and reliability of GP in plant breeding is predictive accuracy. An accurate estimate of this measure is thus central to GP. Moreover, regression models are the models of choice for analyzing field trial data in plant breeding. However, models that use the classical likelihood typically perform poorly, often resulting in biased parameter estimates, when their underlying assumptions are violated. This typically happens when data are contaminated with outliers. These biases often translate into inaccurate estimates of heritability and predictive accuracy, compromising the performance of GP. Since phenotypic data are susceptible to contamination, improving the methods for estimating heritability and predictive accuracy can enhance the performance of GP. Robust statistical methods provide an intuitively appealing and a theoretically well justified framework for overcoming some of the drawbacks of classical regression, most notably the departure from the normality assumption. We compare the performance of robust and classical approaches to two recently published methods for estimating heritability and predictive accuracy of GP using simulation of several plausible scenarios of random and block data contamination with outliers and commercial maize and rye breeding datasets. RESULTS: The robust approach generally performed as good as or better than the classical approach in phenotypic data analysis and in estimating the predictive accuracy of heritability and genomic prediction under both the random and block contamination scenarios. Notably, it consistently outperformed the classical approach under the random contamination scenario. Analyses of the empirical maize and rye datasets further reinforce the stability and reliability of the robust approach in the presence of outliers or missing data. CONCLUSIONS: The proposed robust approach enhances the predictive accuracy of heritability and genomic prediction by minimizing the deleterious effects of outliers for a broad range of simulation scenarios and empirical breeding datasets. Accordingly, plant breeders should seriously consider regularly using the robust alongside the classical approach and increasing the number of replicates to three or more, to further enhance the accuracy of the robust approach.
format Online
Article
Text
id pubmed-6958597
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69585972020-01-17 Robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data Lourenço, Vanda Milheiro Ogutu, Joseph Ochieng Piepho, Hans-Peter BMC Genomics Methodology Article BACKGROUND: Genomic prediction (GP) is used in animal and plant breeding to help identify the best genotypes for selection. One of the most important measures of the effectiveness and reliability of GP in plant breeding is predictive accuracy. An accurate estimate of this measure is thus central to GP. Moreover, regression models are the models of choice for analyzing field trial data in plant breeding. However, models that use the classical likelihood typically perform poorly, often resulting in biased parameter estimates, when their underlying assumptions are violated. This typically happens when data are contaminated with outliers. These biases often translate into inaccurate estimates of heritability and predictive accuracy, compromising the performance of GP. Since phenotypic data are susceptible to contamination, improving the methods for estimating heritability and predictive accuracy can enhance the performance of GP. Robust statistical methods provide an intuitively appealing and a theoretically well justified framework for overcoming some of the drawbacks of classical regression, most notably the departure from the normality assumption. We compare the performance of robust and classical approaches to two recently published methods for estimating heritability and predictive accuracy of GP using simulation of several plausible scenarios of random and block data contamination with outliers and commercial maize and rye breeding datasets. RESULTS: The robust approach generally performed as good as or better than the classical approach in phenotypic data analysis and in estimating the predictive accuracy of heritability and genomic prediction under both the random and block contamination scenarios. Notably, it consistently outperformed the classical approach under the random contamination scenario. Analyses of the empirical maize and rye datasets further reinforce the stability and reliability of the robust approach in the presence of outliers or missing data. CONCLUSIONS: The proposed robust approach enhances the predictive accuracy of heritability and genomic prediction by minimizing the deleterious effects of outliers for a broad range of simulation scenarios and empirical breeding datasets. Accordingly, plant breeders should seriously consider regularly using the robust alongside the classical approach and increasing the number of replicates to three or more, to further enhance the accuracy of the robust approach. BioMed Central 2020-01-14 /pmc/articles/PMC6958597/ /pubmed/31937245 http://dx.doi.org/10.1186/s12864-019-6429-z Text en © Lourenço et al. 2020 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Lourenço, Vanda Milheiro
Ogutu, Joseph Ochieng
Piepho, Hans-Peter
Robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data
title Robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data
title_full Robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data
title_fullStr Robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data
title_full_unstemmed Robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data
title_short Robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data
title_sort robust estimation of heritability and predictive accuracy in plant breeding: evaluation using simulation and empirical data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6958597/
https://www.ncbi.nlm.nih.gov/pubmed/31937245
http://dx.doi.org/10.1186/s12864-019-6429-z
work_keys_str_mv AT lourencovandamilheiro robustestimationofheritabilityandpredictiveaccuracyinplantbreedingevaluationusingsimulationandempiricaldata
AT ogutujosephochieng robustestimationofheritabilityandpredictiveaccuracyinplantbreedingevaluationusingsimulationandempiricaldata
AT piephohanspeter robustestimationofheritabilityandpredictiveaccuracyinplantbreedingevaluationusingsimulationandempiricaldata