Cargando…

Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding

The increased usage of whole-genome selection (WGS) and other molecular evaluation methods in plant breeding relies on the ability to genotype a very large number of untested individuals in each breeding cycle. Many plant breeding programs evaluate large biparental populations of homozygous individu...

Descripción completa

Detalles Bibliográficos
Autores principales: Technow, Frank, Gerke, Justin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5741258/
https://www.ncbi.nlm.nih.gov/pubmed/29272307
http://dx.doi.org/10.1371/journal.pone.0190271
_version_ 1783288170070147072
author Technow, Frank
Gerke, Justin
author_facet Technow, Frank
Gerke, Justin
author_sort Technow, Frank
collection PubMed
description The increased usage of whole-genome selection (WGS) and other molecular evaluation methods in plant breeding relies on the ability to genotype a very large number of untested individuals in each breeding cycle. Many plant breeding programs evaluate large biparental populations of homozygous individuals derived from homozygous parent inbred lines. This structure lends itself to parent-progeny imputation, which transfers the genotype scores of the parents to progeny individuals that are genotyped for a much smaller number of loci. Here we introduce a parent-progeny imputation method that infers individual genotypes from non-barcoded pooled samples of DNA of multiple individuals using a Hidden Markov Model (HMM). We demonstrate the method for pools of simulated maize double haploids (DH) from biparental populations, genotyped using a genotyping by sequencing (GBS) approach for 3,000 loci at 0.125x to 4x coverage. We observed high concordance between true and imputed marker scores and the HMM produced well-calibrated genotype probabilities that correctly reflected the uncertainty of the imputed scores. Genomic estimated breeding values (GEBV) calculated from the imputed scores closely matched GEBV calculated from the true marker scores. The within-population correlation between these sets of GEBV approached 0.95 at 1x and 4x coverage when pooling two or four individuals, respectively. Our approach can reduce the genotyping cost per individual by a factor up to the number of pooled individuals in GBS applications without the need for extra sequencing coverage, thereby enabling cost-effective large scale genotyping for applications such as WGS in plant breeding.
format Online
Article
Text
id pubmed-5741258
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-57412582018-01-09 Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding Technow, Frank Gerke, Justin PLoS One Research Article The increased usage of whole-genome selection (WGS) and other molecular evaluation methods in plant breeding relies on the ability to genotype a very large number of untested individuals in each breeding cycle. Many plant breeding programs evaluate large biparental populations of homozygous individuals derived from homozygous parent inbred lines. This structure lends itself to parent-progeny imputation, which transfers the genotype scores of the parents to progeny individuals that are genotyped for a much smaller number of loci. Here we introduce a parent-progeny imputation method that infers individual genotypes from non-barcoded pooled samples of DNA of multiple individuals using a Hidden Markov Model (HMM). We demonstrate the method for pools of simulated maize double haploids (DH) from biparental populations, genotyped using a genotyping by sequencing (GBS) approach for 3,000 loci at 0.125x to 4x coverage. We observed high concordance between true and imputed marker scores and the HMM produced well-calibrated genotype probabilities that correctly reflected the uncertainty of the imputed scores. Genomic estimated breeding values (GEBV) calculated from the imputed scores closely matched GEBV calculated from the true marker scores. The within-population correlation between these sets of GEBV approached 0.95 at 1x and 4x coverage when pooling two or four individuals, respectively. Our approach can reduce the genotyping cost per individual by a factor up to the number of pooled individuals in GBS applications without the need for extra sequencing coverage, thereby enabling cost-effective large scale genotyping for applications such as WGS in plant breeding. Public Library of Science 2017-12-22 /pmc/articles/PMC5741258/ /pubmed/29272307 http://dx.doi.org/10.1371/journal.pone.0190271 Text en © 2017 Technow, Gerke http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Technow, Frank
Gerke, Justin
Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding
title Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding
title_full Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding
title_fullStr Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding
title_full_unstemmed Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding
title_short Parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding
title_sort parent-progeny imputation from pooled samples for cost-efficient genotyping in plant breeding
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5741258/
https://www.ncbi.nlm.nih.gov/pubmed/29272307
http://dx.doi.org/10.1371/journal.pone.0190271
work_keys_str_mv AT technowfrank parentprogenyimputationfrompooledsamplesforcostefficientgenotypinginplantbreeding
AT gerkejustin parentprogenyimputationfrompooledsamplesforcostefficientgenotypinginplantbreeding