Cargando…

Optimized Use of Low-Depth Genotyping-by-Sequencing for Genomic Prediction Among Multi-Parental Family Pools and Single Plants in Perennial Ryegrass (Lolium perenne L.)

Ryegrass single plants, bi-parental family pools, and multi-parental family pools are often genotyped, based on allele-frequencies using genotyping-by-sequencing (GBS) assays. GBS assays can be performed at low-coverage depth to reduce costs. However, reducing the coverage depth leads to a higher pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Cericola, Fabio, Lenk, Ingo, Fè, Dario, Byrne, Stephen, Jensen, Christian S., Pedersen, Morten G., Asp, Torben, Jensen, Just, Janss, Luc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5871745/
https://www.ncbi.nlm.nih.gov/pubmed/29619038
http://dx.doi.org/10.3389/fpls.2018.00369
_version_ 1783309687628759040
author Cericola, Fabio
Lenk, Ingo
Fè, Dario
Byrne, Stephen
Jensen, Christian S.
Pedersen, Morten G.
Asp, Torben
Jensen, Just
Janss, Luc
author_facet Cericola, Fabio
Lenk, Ingo
Fè, Dario
Byrne, Stephen
Jensen, Christian S.
Pedersen, Morten G.
Asp, Torben
Jensen, Just
Janss, Luc
author_sort Cericola, Fabio
collection PubMed
description Ryegrass single plants, bi-parental family pools, and multi-parental family pools are often genotyped, based on allele-frequencies using genotyping-by-sequencing (GBS) assays. GBS assays can be performed at low-coverage depth to reduce costs. However, reducing the coverage depth leads to a higher proportion of missing data, and leads to a reduction in accuracy when identifying the allele-frequency at each locus. As a consequence of the latter, genomic relationship matrices (GRMs) will be biased. This bias in GRMs affects variance estimates and the accuracy of GBLUP for genomic prediction (GBLUP-GP). We derived equations that describe the bias from low-coverage sequencing as an effect of binomial sampling of sequence reads, and allowed for any ploidy level of the sample considered. This allowed us to combine individual and pool genotypes in one GRM, treating pool-genotypes as a polyploid genotype, equal to the total ploidy-level of the parents of the pool. Using simulated data, we verified the magnitude of the GRM bias at different coverage depths for three different kinds of ryegrass breeding material: individual genotypes from single plants, pool-genotypes from F(2) families, and pool-genotypes from synthetic varieties. To better handle missing data, we also tested imputation procedures, which are suited for analyzing allele-frequency genomic data. The relative advantages of the bias-correction and the imputation of missing data were evaluated using real data. We examined a large dataset, including single plants, F(2) families, and synthetic varieties genotyped in three GBS assays, each with a different coverage depth, and evaluated them for heading date, crown rust resistance, and seed yield. Cross validations were used to test the accuracy using GBLUP approaches, demonstrating the feasibility of predicting among different breeding material. Bias-corrected GRMs proved to increase predictive accuracies when compared with standard approaches to construct GRMs. Among the imputation methods we tested, the random forest method yielded the highest predictive accuracy. The combinations of these two methods resulted in a meaningful increase of predictive ability (up to 0.09). The possibility of predicting across individuals and pools provides new opportunities for improving ryegrass breeding schemes.
format Online
Article
Text
id pubmed-5871745
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-58717452018-04-04 Optimized Use of Low-Depth Genotyping-by-Sequencing for Genomic Prediction Among Multi-Parental Family Pools and Single Plants in Perennial Ryegrass (Lolium perenne L.) Cericola, Fabio Lenk, Ingo Fè, Dario Byrne, Stephen Jensen, Christian S. Pedersen, Morten G. Asp, Torben Jensen, Just Janss, Luc Front Plant Sci Plant Science Ryegrass single plants, bi-parental family pools, and multi-parental family pools are often genotyped, based on allele-frequencies using genotyping-by-sequencing (GBS) assays. GBS assays can be performed at low-coverage depth to reduce costs. However, reducing the coverage depth leads to a higher proportion of missing data, and leads to a reduction in accuracy when identifying the allele-frequency at each locus. As a consequence of the latter, genomic relationship matrices (GRMs) will be biased. This bias in GRMs affects variance estimates and the accuracy of GBLUP for genomic prediction (GBLUP-GP). We derived equations that describe the bias from low-coverage sequencing as an effect of binomial sampling of sequence reads, and allowed for any ploidy level of the sample considered. This allowed us to combine individual and pool genotypes in one GRM, treating pool-genotypes as a polyploid genotype, equal to the total ploidy-level of the parents of the pool. Using simulated data, we verified the magnitude of the GRM bias at different coverage depths for three different kinds of ryegrass breeding material: individual genotypes from single plants, pool-genotypes from F(2) families, and pool-genotypes from synthetic varieties. To better handle missing data, we also tested imputation procedures, which are suited for analyzing allele-frequency genomic data. The relative advantages of the bias-correction and the imputation of missing data were evaluated using real data. We examined a large dataset, including single plants, F(2) families, and synthetic varieties genotyped in three GBS assays, each with a different coverage depth, and evaluated them for heading date, crown rust resistance, and seed yield. Cross validations were used to test the accuracy using GBLUP approaches, demonstrating the feasibility of predicting among different breeding material. Bias-corrected GRMs proved to increase predictive accuracies when compared with standard approaches to construct GRMs. Among the imputation methods we tested, the random forest method yielded the highest predictive accuracy. The combinations of these two methods resulted in a meaningful increase of predictive ability (up to 0.09). The possibility of predicting across individuals and pools provides new opportunities for improving ryegrass breeding schemes. Frontiers Media S.A. 2018-03-21 /pmc/articles/PMC5871745/ /pubmed/29619038 http://dx.doi.org/10.3389/fpls.2018.00369 Text en Copyright © 2018 Cericola, Lenk, Fè, Byrne, Jensen, Pedersen, Asp, Jensen and Janss. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Cericola, Fabio
Lenk, Ingo
Fè, Dario
Byrne, Stephen
Jensen, Christian S.
Pedersen, Morten G.
Asp, Torben
Jensen, Just
Janss, Luc
Optimized Use of Low-Depth Genotyping-by-Sequencing for Genomic Prediction Among Multi-Parental Family Pools and Single Plants in Perennial Ryegrass (Lolium perenne L.)
title Optimized Use of Low-Depth Genotyping-by-Sequencing for Genomic Prediction Among Multi-Parental Family Pools and Single Plants in Perennial Ryegrass (Lolium perenne L.)
title_full Optimized Use of Low-Depth Genotyping-by-Sequencing for Genomic Prediction Among Multi-Parental Family Pools and Single Plants in Perennial Ryegrass (Lolium perenne L.)
title_fullStr Optimized Use of Low-Depth Genotyping-by-Sequencing for Genomic Prediction Among Multi-Parental Family Pools and Single Plants in Perennial Ryegrass (Lolium perenne L.)
title_full_unstemmed Optimized Use of Low-Depth Genotyping-by-Sequencing for Genomic Prediction Among Multi-Parental Family Pools and Single Plants in Perennial Ryegrass (Lolium perenne L.)
title_short Optimized Use of Low-Depth Genotyping-by-Sequencing for Genomic Prediction Among Multi-Parental Family Pools and Single Plants in Perennial Ryegrass (Lolium perenne L.)
title_sort optimized use of low-depth genotyping-by-sequencing for genomic prediction among multi-parental family pools and single plants in perennial ryegrass (lolium perenne l.)
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5871745/
https://www.ncbi.nlm.nih.gov/pubmed/29619038
http://dx.doi.org/10.3389/fpls.2018.00369
work_keys_str_mv AT cericolafabio optimizeduseoflowdepthgenotypingbysequencingforgenomicpredictionamongmultiparentalfamilypoolsandsingleplantsinperennialryegrassloliumperennel
AT lenkingo optimizeduseoflowdepthgenotypingbysequencingforgenomicpredictionamongmultiparentalfamilypoolsandsingleplantsinperennialryegrassloliumperennel
AT fedario optimizeduseoflowdepthgenotypingbysequencingforgenomicpredictionamongmultiparentalfamilypoolsandsingleplantsinperennialryegrassloliumperennel
AT byrnestephen optimizeduseoflowdepthgenotypingbysequencingforgenomicpredictionamongmultiparentalfamilypoolsandsingleplantsinperennialryegrassloliumperennel
AT jensenchristians optimizeduseoflowdepthgenotypingbysequencingforgenomicpredictionamongmultiparentalfamilypoolsandsingleplantsinperennialryegrassloliumperennel
AT pedersenmorteng optimizeduseoflowdepthgenotypingbysequencingforgenomicpredictionamongmultiparentalfamilypoolsandsingleplantsinperennialryegrassloliumperennel
AT asptorben optimizeduseoflowdepthgenotypingbysequencingforgenomicpredictionamongmultiparentalfamilypoolsandsingleplantsinperennialryegrassloliumperennel
AT jensenjust optimizeduseoflowdepthgenotypingbysequencingforgenomicpredictionamongmultiparentalfamilypoolsandsingleplantsinperennialryegrassloliumperennel
AT janssluc optimizeduseoflowdepthgenotypingbysequencingforgenomicpredictionamongmultiparentalfamilypoolsandsingleplantsinperennialryegrassloliumperennel