Cargando…

Historical Datasets Support Genomic Selection Models for the Prediction of Cotton Fiber Quality Phenotypes Across Multiple Environments

Genomic selection (GS) has successfully been used in plant breeding to improve selection efficiency and reduce breeding time and cost. However, there has not been a study to evaluate GS prediction models that may be used for predicting cotton breeding lines across multiple environments. In this stud...

Descripción completa

Detalles Bibliográficos
Autores principales: Gapare, Washington, Liu, Shiming, Conaty, Warren, Zhu, Qian-Hao, Gillespie, Vanessa, Llewellyn, Danny, Stiller, Warwick, Wilson, Iain
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5940163/
https://www.ncbi.nlm.nih.gov/pubmed/29559536
http://dx.doi.org/10.1534/g3.118.200140
_version_ 1783321060209328128
author Gapare, Washington
Liu, Shiming
Conaty, Warren
Zhu, Qian-Hao
Gillespie, Vanessa
Llewellyn, Danny
Stiller, Warwick
Wilson, Iain
author_facet Gapare, Washington
Liu, Shiming
Conaty, Warren
Zhu, Qian-Hao
Gillespie, Vanessa
Llewellyn, Danny
Stiller, Warwick
Wilson, Iain
author_sort Gapare, Washington
collection PubMed
description Genomic selection (GS) has successfully been used in plant breeding to improve selection efficiency and reduce breeding time and cost. However, there has not been a study to evaluate GS prediction models that may be used for predicting cotton breeding lines across multiple environments. In this study, we evaluated the performance of Bayes Ridge Regression, BayesA, BayesB, BayesC and Reproducing Kernel Hilbert Spaces regression models. We then extended the single-site GS model to accommodate genotype × environment interaction (G×E) in order to assess the merits of multi- over single-environment models in a practical breeding and selection context in cotton, a crop for which this has not previously been evaluated. Our study was based on a population of 215 upland cotton (Gossypium hirsutum) breeding lines which were evaluated for fiber length and strength at multiple locations in Australia and genotyped with 13,330 single nucleotide polymorphic (SNP) markers. BayesB, which assumes unique variance for each marker and a proportion of markers to have large effects, while most other markers have zero effect, was the preferred model. GS accuracy for fiber length based on a single-site model varied across sites, ranging from 0.27 to 0.77 (mean = 0.38), while that of fiber strength ranged from 0.19 to 0.58 (mean = 0.35) using randomly selected sub-populations as the training population. Prediction accuracies from the M×E model were higher than those for single-site and across-site models, with an average accuracy of 0.71 and 0.59 for fiber length and strength, respectively. The use of the M×E model could therefore identify which breeding lines have effects that are stable across environments and which ones are responsible for G×E and so reduce the amount of phenotypic screening required in cotton breeding programs to identify adaptable genotypes.
format Online
Article
Text
id pubmed-5940163
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-59401632018-05-10 Historical Datasets Support Genomic Selection Models for the Prediction of Cotton Fiber Quality Phenotypes Across Multiple Environments Gapare, Washington Liu, Shiming Conaty, Warren Zhu, Qian-Hao Gillespie, Vanessa Llewellyn, Danny Stiller, Warwick Wilson, Iain G3 (Bethesda) Investigations Genomic selection (GS) has successfully been used in plant breeding to improve selection efficiency and reduce breeding time and cost. However, there has not been a study to evaluate GS prediction models that may be used for predicting cotton breeding lines across multiple environments. In this study, we evaluated the performance of Bayes Ridge Regression, BayesA, BayesB, BayesC and Reproducing Kernel Hilbert Spaces regression models. We then extended the single-site GS model to accommodate genotype × environment interaction (G×E) in order to assess the merits of multi- over single-environment models in a practical breeding and selection context in cotton, a crop for which this has not previously been evaluated. Our study was based on a population of 215 upland cotton (Gossypium hirsutum) breeding lines which were evaluated for fiber length and strength at multiple locations in Australia and genotyped with 13,330 single nucleotide polymorphic (SNP) markers. BayesB, which assumes unique variance for each marker and a proportion of markers to have large effects, while most other markers have zero effect, was the preferred model. GS accuracy for fiber length based on a single-site model varied across sites, ranging from 0.27 to 0.77 (mean = 0.38), while that of fiber strength ranged from 0.19 to 0.58 (mean = 0.35) using randomly selected sub-populations as the training population. Prediction accuracies from the M×E model were higher than those for single-site and across-site models, with an average accuracy of 0.71 and 0.59 for fiber length and strength, respectively. The use of the M×E model could therefore identify which breeding lines have effects that are stable across environments and which ones are responsible for G×E and so reduce the amount of phenotypic screening required in cotton breeding programs to identify adaptable genotypes. Genetics Society of America 2018-03-20 /pmc/articles/PMC5940163/ /pubmed/29559536 http://dx.doi.org/10.1534/g3.118.200140 Text en Copyright © 2018 Gapare et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Gapare, Washington
Liu, Shiming
Conaty, Warren
Zhu, Qian-Hao
Gillespie, Vanessa
Llewellyn, Danny
Stiller, Warwick
Wilson, Iain
Historical Datasets Support Genomic Selection Models for the Prediction of Cotton Fiber Quality Phenotypes Across Multiple Environments
title Historical Datasets Support Genomic Selection Models for the Prediction of Cotton Fiber Quality Phenotypes Across Multiple Environments
title_full Historical Datasets Support Genomic Selection Models for the Prediction of Cotton Fiber Quality Phenotypes Across Multiple Environments
title_fullStr Historical Datasets Support Genomic Selection Models for the Prediction of Cotton Fiber Quality Phenotypes Across Multiple Environments
title_full_unstemmed Historical Datasets Support Genomic Selection Models for the Prediction of Cotton Fiber Quality Phenotypes Across Multiple Environments
title_short Historical Datasets Support Genomic Selection Models for the Prediction of Cotton Fiber Quality Phenotypes Across Multiple Environments
title_sort historical datasets support genomic selection models for the prediction of cotton fiber quality phenotypes across multiple environments
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5940163/
https://www.ncbi.nlm.nih.gov/pubmed/29559536
http://dx.doi.org/10.1534/g3.118.200140
work_keys_str_mv AT gaparewashington historicaldatasetssupportgenomicselectionmodelsforthepredictionofcottonfiberqualityphenotypesacrossmultipleenvironments
AT liushiming historicaldatasetssupportgenomicselectionmodelsforthepredictionofcottonfiberqualityphenotypesacrossmultipleenvironments
AT conatywarren historicaldatasetssupportgenomicselectionmodelsforthepredictionofcottonfiberqualityphenotypesacrossmultipleenvironments
AT zhuqianhao historicaldatasetssupportgenomicselectionmodelsforthepredictionofcottonfiberqualityphenotypesacrossmultipleenvironments
AT gillespievanessa historicaldatasetssupportgenomicselectionmodelsforthepredictionofcottonfiberqualityphenotypesacrossmultipleenvironments
AT llewellyndanny historicaldatasetssupportgenomicselectionmodelsforthepredictionofcottonfiberqualityphenotypesacrossmultipleenvironments
AT stillerwarwick historicaldatasetssupportgenomicselectionmodelsforthepredictionofcottonfiberqualityphenotypesacrossmultipleenvironments
AT wilsoniain historicaldatasetssupportgenomicselectionmodelsforthepredictionofcottonfiberqualityphenotypesacrossmultipleenvironments