Cargando…

Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations

This paper presents an extension to a heuristic method for phasing and imputation of genotypes of descendants in biparental populations so that it can phase and impute genotypes of parents that are ungenotyped or partially genotyped. The imputed genotypes of the parent are used to impute low‐density...

Descripción completa

Detalles Bibliográficos
Autores principales: Gonen, Serap, Wimmer, Valentin, Gaynor, R. Chris, Byrne, Ed, Gorjanc, Gregor, Hickey, John M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8362159/
https://www.ncbi.nlm.nih.gov/pubmed/34413534
http://dx.doi.org/10.1002/csc2.20409
_version_ 1783738102821421056
author Gonen, Serap
Wimmer, Valentin
Gaynor, R. Chris
Byrne, Ed
Gorjanc, Gregor
Hickey, John M.
author_facet Gonen, Serap
Wimmer, Valentin
Gaynor, R. Chris
Byrne, Ed
Gorjanc, Gregor
Hickey, John M.
author_sort Gonen, Serap
collection PubMed
description This paper presents an extension to a heuristic method for phasing and imputation of genotypes of descendants in biparental populations so that it can phase and impute genotypes of parents that are ungenotyped or partially genotyped. The imputed genotypes of the parent are used to impute low‐density (Ld) genotyped descendants to high density (Hd). The extension was implemented as part of the AlphaPlantImpute software and works in three steps. First, it identifies whether a parent has no or Ld genotypes and identifies its relatives that have Hd genotypes. Second, using the Hd genotypes of relatives, it determines whether the parent is homozygous or heterozygous for a given locus. Third, it phases heterozygous positions of the parent by matching haplotypes to its relatives. We measured the accuracy (correlation between true and imputed genotypes) of imputing parent genotypes in simulated biparental populations from different scenarios. We tested the imputation accuracy of the missing parent's descendants using the true genotype of the parent and compared this with using the imputed genotypes of the parent. Across all scenarios, the imputation accuracy of a parent was >0.98 and did not drop below ∼0.96. The imputation accuracy of a parent was always higher when it was inbred than outbred. Including ancestors of the parent at Hd, increasing the number of crosses and the number of Hd descendants increased the imputation accuracy. The high imputation accuracy achieved for the parent translated to little or no impact on the imputation accuracy of its descendants.
format Online
Article
Text
id pubmed-8362159
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-83621592021-08-17 Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations Gonen, Serap Wimmer, Valentin Gaynor, R. Chris Byrne, Ed Gorjanc, Gregor Hickey, John M. Crop Sci ORIGINAL RESEARCH ARTICLES This paper presents an extension to a heuristic method for phasing and imputation of genotypes of descendants in biparental populations so that it can phase and impute genotypes of parents that are ungenotyped or partially genotyped. The imputed genotypes of the parent are used to impute low‐density (Ld) genotyped descendants to high density (Hd). The extension was implemented as part of the AlphaPlantImpute software and works in three steps. First, it identifies whether a parent has no or Ld genotypes and identifies its relatives that have Hd genotypes. Second, using the Hd genotypes of relatives, it determines whether the parent is homozygous or heterozygous for a given locus. Third, it phases heterozygous positions of the parent by matching haplotypes to its relatives. We measured the accuracy (correlation between true and imputed genotypes) of imputing parent genotypes in simulated biparental populations from different scenarios. We tested the imputation accuracy of the missing parent's descendants using the true genotype of the parent and compared this with using the imputed genotypes of the parent. Across all scenarios, the imputation accuracy of a parent was >0.98 and did not drop below ∼0.96. The imputation accuracy of a parent was always higher when it was inbred than outbred. Including ancestors of the parent at Hd, increasing the number of crosses and the number of Hd descendants increased the imputation accuracy. The high imputation accuracy achieved for the parent translated to little or no impact on the imputation accuracy of its descendants. John Wiley and Sons Inc. 2021-06-14 2021 /pmc/articles/PMC8362159/ /pubmed/34413534 http://dx.doi.org/10.1002/csc2.20409 Text en © 2020 The Authors. Crop Science published by Wiley Periodicals LLC on behalf of Crop Science Society of America https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle ORIGINAL RESEARCH ARTICLES
Gonen, Serap
Wimmer, Valentin
Gaynor, R. Chris
Byrne, Ed
Gorjanc, Gregor
Hickey, John M.
Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations
title Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations
title_full Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations
title_fullStr Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations
title_full_unstemmed Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations
title_short Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations
title_sort phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations
topic ORIGINAL RESEARCH ARTICLES
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8362159/
https://www.ncbi.nlm.nih.gov/pubmed/34413534
http://dx.doi.org/10.1002/csc2.20409
work_keys_str_mv AT gonenserap phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations
AT wimmervalentin phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations
AT gaynorrchris phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations
AT byrneed phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations
AT gorjancgregor phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations
AT hickeyjohnm phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations