Cargando…

Optimization of genomic selection training populations with a genetic algorithm

In this article, we imagine a breeding scenario with a population of individuals that have been genotyped but not phenotyped. We derived a computationally efficient statistic that uses this genetic information to measure the reliability of genomic estimated breeding values (GEBV) for a given set of...

Descripción completa

Detalles Bibliográficos
Autores principales: Akdemir, Deniz, Sanchez, Julio I, Jannink, Jean-Luc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4422310/
https://www.ncbi.nlm.nih.gov/pubmed/25943105
http://dx.doi.org/10.1186/s12711-015-0116-6
_version_ 1782370037833662464
author Akdemir, Deniz
Sanchez, Julio I
Jannink, Jean-Luc
author_facet Akdemir, Deniz
Sanchez, Julio I
Jannink, Jean-Luc
author_sort Akdemir, Deniz
collection PubMed
description In this article, we imagine a breeding scenario with a population of individuals that have been genotyped but not phenotyped. We derived a computationally efficient statistic that uses this genetic information to measure the reliability of genomic estimated breeding values (GEBV) for a given set of individuals (test set) based on a training set of individuals. We used this reliability measure with a genetic algorithm scheme to find an optimized training set from a larger set of candidate individuals. This subset was phenotyped to create the training set that was used in a genomic selection model to estimate GEBV in the test set. Our results show that, compared to a random sample of the same size, the use of a set of individuals selected by our method improved accuracies. We implemented the proposed training selection methodology on four sets of data on Arabidopsis, wheat, rice and maize. This dynamic model building process that takes genotypes of the individuals in the test sample into account while selecting the training individuals improves the performance of genomic selection models. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-015-0116-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4422310
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44223102015-05-07 Optimization of genomic selection training populations with a genetic algorithm Akdemir, Deniz Sanchez, Julio I Jannink, Jean-Luc Genet Sel Evol Research In this article, we imagine a breeding scenario with a population of individuals that have been genotyped but not phenotyped. We derived a computationally efficient statistic that uses this genetic information to measure the reliability of genomic estimated breeding values (GEBV) for a given set of individuals (test set) based on a training set of individuals. We used this reliability measure with a genetic algorithm scheme to find an optimized training set from a larger set of candidate individuals. This subset was phenotyped to create the training set that was used in a genomic selection model to estimate GEBV in the test set. Our results show that, compared to a random sample of the same size, the use of a set of individuals selected by our method improved accuracies. We implemented the proposed training selection methodology on four sets of data on Arabidopsis, wheat, rice and maize. This dynamic model building process that takes genotypes of the individuals in the test sample into account while selecting the training individuals improves the performance of genomic selection models. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12711-015-0116-6) contains supplementary material, which is available to authorized users. BioMed Central 2015-05-06 /pmc/articles/PMC4422310/ /pubmed/25943105 http://dx.doi.org/10.1186/s12711-015-0116-6 Text en © Akdemir et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Akdemir, Deniz
Sanchez, Julio I
Jannink, Jean-Luc
Optimization of genomic selection training populations with a genetic algorithm
title Optimization of genomic selection training populations with a genetic algorithm
title_full Optimization of genomic selection training populations with a genetic algorithm
title_fullStr Optimization of genomic selection training populations with a genetic algorithm
title_full_unstemmed Optimization of genomic selection training populations with a genetic algorithm
title_short Optimization of genomic selection training populations with a genetic algorithm
title_sort optimization of genomic selection training populations with a genetic algorithm
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4422310/
https://www.ncbi.nlm.nih.gov/pubmed/25943105
http://dx.doi.org/10.1186/s12711-015-0116-6
work_keys_str_mv AT akdemirdeniz optimizationofgenomicselectiontrainingpopulationswithageneticalgorithm
AT sanchezjulioi optimizationofgenomicselectiontrainingpopulationswithageneticalgorithm
AT janninkjeanluc optimizationofgenomicselectiontrainingpopulationswithageneticalgorithm