Cargando…

Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines

Wheat breeding programs generate a large amount of variation which cannot be completely explored because of limited phenotyping throughput. Genomic prediction (GP) has been proposed as a new tool which provides breeding values estimations without the need of phenotyping all the material produced but...

Descripción completa

Detalles Bibliográficos
Autores principales: Cericola, Fabio, Jahoor, Ahmed, Orabi, Jihad, Andersen, Jeppe R., Janss, Luc L., Jensen, Just
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5231327/
https://www.ncbi.nlm.nih.gov/pubmed/28081208
http://dx.doi.org/10.1371/journal.pone.0169606
_version_ 1782494465555955712
author Cericola, Fabio
Jahoor, Ahmed
Orabi, Jihad
Andersen, Jeppe R.
Janss, Luc L.
Jensen, Just
author_facet Cericola, Fabio
Jahoor, Ahmed
Orabi, Jihad
Andersen, Jeppe R.
Janss, Luc L.
Jensen, Just
author_sort Cericola, Fabio
collection PubMed
description Wheat breeding programs generate a large amount of variation which cannot be completely explored because of limited phenotyping throughput. Genomic prediction (GP) has been proposed as a new tool which provides breeding values estimations without the need of phenotyping all the material produced but only a subset of it named training population (TP). However, genotyping of all the accessions under analysis is needed and, therefore, optimizing TP dimension and genotyping strategy is pivotal to implement GP in commercial breeding schemes. Here, we explored the optimum TP size and we integrated pedigree records and genome wide association studies (GWAS) results to optimize the genotyping strategy. A total of 988 advanced wheat breeding lines were genotyped with the Illumina 15K SNPs wheat chip and phenotyped across several years and locations for yield, lodging, and starch content. Cross-validation using the largest possible TP size and all the SNPs available after editing (~11k), yielded predictive abilities (r(GP)) ranging between 0.5–0.6. In order to explore the Training population size, r(GP) were computed using progressively smaller TP. These exercises showed that TP of around 700 lines were enough to yield the highest observed r(GP). Moreover, r(GP) were calculated by randomly reducing the SNPs number. This showed that around 1K markers were enough to reach the highest observed r(GP). GWAS was used to identify markers associated with the traits analyzed. A GWAS-based selection of SNPs resulted in increased r(GP) when compared with random selection and few hundreds SNPs were sufficient to obtain the highest observed r(GP). For each of these scenarios, advantages of adding the pedigree information were shown. Our results indicate that moderate TP sizes were enough to yield high r(GP) and that pedigree information and GWAS results can be used to greatly optimize the genotyping strategy.
format Online
Article
Text
id pubmed-5231327
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-52313272017-01-31 Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines Cericola, Fabio Jahoor, Ahmed Orabi, Jihad Andersen, Jeppe R. Janss, Luc L. Jensen, Just PLoS One Research Article Wheat breeding programs generate a large amount of variation which cannot be completely explored because of limited phenotyping throughput. Genomic prediction (GP) has been proposed as a new tool which provides breeding values estimations without the need of phenotyping all the material produced but only a subset of it named training population (TP). However, genotyping of all the accessions under analysis is needed and, therefore, optimizing TP dimension and genotyping strategy is pivotal to implement GP in commercial breeding schemes. Here, we explored the optimum TP size and we integrated pedigree records and genome wide association studies (GWAS) results to optimize the genotyping strategy. A total of 988 advanced wheat breeding lines were genotyped with the Illumina 15K SNPs wheat chip and phenotyped across several years and locations for yield, lodging, and starch content. Cross-validation using the largest possible TP size and all the SNPs available after editing (~11k), yielded predictive abilities (r(GP)) ranging between 0.5–0.6. In order to explore the Training population size, r(GP) were computed using progressively smaller TP. These exercises showed that TP of around 700 lines were enough to yield the highest observed r(GP). Moreover, r(GP) were calculated by randomly reducing the SNPs number. This showed that around 1K markers were enough to reach the highest observed r(GP). GWAS was used to identify markers associated with the traits analyzed. A GWAS-based selection of SNPs resulted in increased r(GP) when compared with random selection and few hundreds SNPs were sufficient to obtain the highest observed r(GP). For each of these scenarios, advantages of adding the pedigree information were shown. Our results indicate that moderate TP sizes were enough to yield high r(GP) and that pedigree information and GWAS results can be used to greatly optimize the genotyping strategy. Public Library of Science 2017-01-12 /pmc/articles/PMC5231327/ /pubmed/28081208 http://dx.doi.org/10.1371/journal.pone.0169606 Text en © 2017 Cericola et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Cericola, Fabio
Jahoor, Ahmed
Orabi, Jihad
Andersen, Jeppe R.
Janss, Luc L.
Jensen, Just
Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines
title Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines
title_full Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines
title_fullStr Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines
title_full_unstemmed Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines
title_short Optimizing Training Population Size and Genotyping Strategy for Genomic Prediction Using Association Study Results and Pedigree Information. A Case of Study in Advanced Wheat Breeding Lines
title_sort optimizing training population size and genotyping strategy for genomic prediction using association study results and pedigree information. a case of study in advanced wheat breeding lines
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5231327/
https://www.ncbi.nlm.nih.gov/pubmed/28081208
http://dx.doi.org/10.1371/journal.pone.0169606
work_keys_str_mv AT cericolafabio optimizingtrainingpopulationsizeandgenotypingstrategyforgenomicpredictionusingassociationstudyresultsandpedigreeinformationacaseofstudyinadvancedwheatbreedinglines
AT jahoorahmed optimizingtrainingpopulationsizeandgenotypingstrategyforgenomicpredictionusingassociationstudyresultsandpedigreeinformationacaseofstudyinadvancedwheatbreedinglines
AT orabijihad optimizingtrainingpopulationsizeandgenotypingstrategyforgenomicpredictionusingassociationstudyresultsandpedigreeinformationacaseofstudyinadvancedwheatbreedinglines
AT andersenjepper optimizingtrainingpopulationsizeandgenotypingstrategyforgenomicpredictionusingassociationstudyresultsandpedigreeinformationacaseofstudyinadvancedwheatbreedinglines
AT jansslucl optimizingtrainingpopulationsizeandgenotypingstrategyforgenomicpredictionusingassociationstudyresultsandpedigreeinformationacaseofstudyinadvancedwheatbreedinglines
AT jensenjust optimizingtrainingpopulationsizeandgenotypingstrategyforgenomicpredictionusingassociationstudyresultsandpedigreeinformationacaseofstudyinadvancedwheatbreedinglines