Cargando…

Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F(1) hybrids

BACKGROUND: Genomic prediction is a genomics assisted breeding methodology that can increase genetic gains by accelerating the breeding cycle and potentially improving the accuracy of breeding values. In this study, we use 41,304 informative SNPs genotyped in a Eucalyptus breeding population involvi...

Descripción completa

Detalles Bibliográficos
Autores principales: Tan, Biyue, Grattapaglia, Dario, Martins, Gustavo Salgado, Ferreira, Karina Zamprogno, Sundberg, Björn, Ingvarsson, Pär K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5492818/
https://www.ncbi.nlm.nih.gov/pubmed/28662679
http://dx.doi.org/10.1186/s12870-017-1059-6
_version_ 1783247407759228928
author Tan, Biyue
Grattapaglia, Dario
Martins, Gustavo Salgado
Ferreira, Karina Zamprogno
Sundberg, Björn
Ingvarsson, Pär K.
author_facet Tan, Biyue
Grattapaglia, Dario
Martins, Gustavo Salgado
Ferreira, Karina Zamprogno
Sundberg, Björn
Ingvarsson, Pär K.
author_sort Tan, Biyue
collection PubMed
description BACKGROUND: Genomic prediction is a genomics assisted breeding methodology that can increase genetic gains by accelerating the breeding cycle and potentially improving the accuracy of breeding values. In this study, we use 41,304 informative SNPs genotyped in a Eucalyptus breeding population involving 90 E.grandis and 78 E.urophylla parents and their 949 F(1) hybrids to develop genomic prediction models for eight phenotypic traits - basic density and pulp yield, circumference at breast height and height and tree volume scored at age three and six years. We assessed the impact of different genomic prediction methods, the composition and size of the training and validation set and the number and genomic location of SNPs on the predictive ability (PA). RESULTS: Heritabilities estimated using the realized genomic relationship matrix (GRM) were considerably higher than estimates based on the expected pedigree, mainly due to inconsistencies in the expected pedigree that were readily corrected by the GRM. Moreover, the GRM more precisely capture Mendelian sampling among related individuals, such that the genetic covariance was based on the true proportion of the genome shared between individuals. PA improved considerably when increasing the size of the training set and by enhancing relatedness to the validation set. Prediction models trained on pure species parents could not predict well in F(1) hybrids, indicating that model training has to be carried out in hybrid populations if one is to predict in hybrid selection candidates. The different genomic prediction methods provided similar results for all traits, therefore either GBLUP or rrBLUP represents better compromises between computational time and prediction efficiency. Only slight improvement was observed in PA when more than 5000 SNPs were used for all traits. Using SNPs in intergenic regions provided slightly better PA than using SNPs sampled exclusively in genic regions. CONCLUSIONS: The size and composition of the training set and number of SNPs used are the two most important factors for model prediction, compared to the statistical methods and the genomic location of SNPs. Furthermore, training the prediction model based on pure parental species only provide limited ability to predict traits in interspecific hybrids. Our results provide additional promising perspectives for the implementation of genomic prediction in Eucalyptus breeding programs by the selection of interspecific hybrids. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12870-017-1059-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5492818
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54928182017-06-30 Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F(1) hybrids Tan, Biyue Grattapaglia, Dario Martins, Gustavo Salgado Ferreira, Karina Zamprogno Sundberg, Björn Ingvarsson, Pär K. BMC Plant Biol Research Article BACKGROUND: Genomic prediction is a genomics assisted breeding methodology that can increase genetic gains by accelerating the breeding cycle and potentially improving the accuracy of breeding values. In this study, we use 41,304 informative SNPs genotyped in a Eucalyptus breeding population involving 90 E.grandis and 78 E.urophylla parents and their 949 F(1) hybrids to develop genomic prediction models for eight phenotypic traits - basic density and pulp yield, circumference at breast height and height and tree volume scored at age three and six years. We assessed the impact of different genomic prediction methods, the composition and size of the training and validation set and the number and genomic location of SNPs on the predictive ability (PA). RESULTS: Heritabilities estimated using the realized genomic relationship matrix (GRM) were considerably higher than estimates based on the expected pedigree, mainly due to inconsistencies in the expected pedigree that were readily corrected by the GRM. Moreover, the GRM more precisely capture Mendelian sampling among related individuals, such that the genetic covariance was based on the true proportion of the genome shared between individuals. PA improved considerably when increasing the size of the training set and by enhancing relatedness to the validation set. Prediction models trained on pure species parents could not predict well in F(1) hybrids, indicating that model training has to be carried out in hybrid populations if one is to predict in hybrid selection candidates. The different genomic prediction methods provided similar results for all traits, therefore either GBLUP or rrBLUP represents better compromises between computational time and prediction efficiency. Only slight improvement was observed in PA when more than 5000 SNPs were used for all traits. Using SNPs in intergenic regions provided slightly better PA than using SNPs sampled exclusively in genic regions. CONCLUSIONS: The size and composition of the training set and number of SNPs used are the two most important factors for model prediction, compared to the statistical methods and the genomic location of SNPs. Furthermore, training the prediction model based on pure parental species only provide limited ability to predict traits in interspecific hybrids. Our results provide additional promising perspectives for the implementation of genomic prediction in Eucalyptus breeding programs by the selection of interspecific hybrids. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12870-017-1059-6) contains supplementary material, which is available to authorized users. BioMed Central 2017-06-29 /pmc/articles/PMC5492818/ /pubmed/28662679 http://dx.doi.org/10.1186/s12870-017-1059-6 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Tan, Biyue
Grattapaglia, Dario
Martins, Gustavo Salgado
Ferreira, Karina Zamprogno
Sundberg, Björn
Ingvarsson, Pär K.
Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F(1) hybrids
title Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F(1) hybrids
title_full Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F(1) hybrids
title_fullStr Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F(1) hybrids
title_full_unstemmed Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F(1) hybrids
title_short Evaluating the accuracy of genomic prediction of growth and wood traits in two Eucalyptus species and their F(1) hybrids
title_sort evaluating the accuracy of genomic prediction of growth and wood traits in two eucalyptus species and their f(1) hybrids
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5492818/
https://www.ncbi.nlm.nih.gov/pubmed/28662679
http://dx.doi.org/10.1186/s12870-017-1059-6
work_keys_str_mv AT tanbiyue evaluatingtheaccuracyofgenomicpredictionofgrowthandwoodtraitsintwoeucalyptusspeciesandtheirf1hybrids
AT grattapagliadario evaluatingtheaccuracyofgenomicpredictionofgrowthandwoodtraitsintwoeucalyptusspeciesandtheirf1hybrids
AT martinsgustavosalgado evaluatingtheaccuracyofgenomicpredictionofgrowthandwoodtraitsintwoeucalyptusspeciesandtheirf1hybrids
AT ferreirakarinazamprogno evaluatingtheaccuracyofgenomicpredictionofgrowthandwoodtraitsintwoeucalyptusspeciesandtheirf1hybrids
AT sundbergbjorn evaluatingtheaccuracyofgenomicpredictionofgrowthandwoodtraitsintwoeucalyptusspeciesandtheirf1hybrids
AT ingvarssonpark evaluatingtheaccuracyofgenomicpredictionofgrowthandwoodtraitsintwoeucalyptusspeciesandtheirf1hybrids