Cargando…
Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations
This paper presents an extension to a heuristic method for phasing and imputation of genotypes of descendants in biparental populations so that it can phase and impute genotypes of parents that are ungenotyped or partially genotyped. The imputed genotypes of the parent are used to impute low‐density...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8362159/ https://www.ncbi.nlm.nih.gov/pubmed/34413534 http://dx.doi.org/10.1002/csc2.20409 |
_version_ | 1783738102821421056 |
---|---|
author | Gonen, Serap Wimmer, Valentin Gaynor, R. Chris Byrne, Ed Gorjanc, Gregor Hickey, John M. |
author_facet | Gonen, Serap Wimmer, Valentin Gaynor, R. Chris Byrne, Ed Gorjanc, Gregor Hickey, John M. |
author_sort | Gonen, Serap |
collection | PubMed |
description | This paper presents an extension to a heuristic method for phasing and imputation of genotypes of descendants in biparental populations so that it can phase and impute genotypes of parents that are ungenotyped or partially genotyped. The imputed genotypes of the parent are used to impute low‐density (Ld) genotyped descendants to high density (Hd). The extension was implemented as part of the AlphaPlantImpute software and works in three steps. First, it identifies whether a parent has no or Ld genotypes and identifies its relatives that have Hd genotypes. Second, using the Hd genotypes of relatives, it determines whether the parent is homozygous or heterozygous for a given locus. Third, it phases heterozygous positions of the parent by matching haplotypes to its relatives. We measured the accuracy (correlation between true and imputed genotypes) of imputing parent genotypes in simulated biparental populations from different scenarios. We tested the imputation accuracy of the missing parent's descendants using the true genotype of the parent and compared this with using the imputed genotypes of the parent. Across all scenarios, the imputation accuracy of a parent was >0.98 and did not drop below ∼0.96. The imputation accuracy of a parent was always higher when it was inbred than outbred. Including ancestors of the parent at Hd, increasing the number of crosses and the number of Hd descendants increased the imputation accuracy. The high imputation accuracy achieved for the parent translated to little or no impact on the imputation accuracy of its descendants. |
format | Online Article Text |
id | pubmed-8362159 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-83621592021-08-17 Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations Gonen, Serap Wimmer, Valentin Gaynor, R. Chris Byrne, Ed Gorjanc, Gregor Hickey, John M. Crop Sci ORIGINAL RESEARCH ARTICLES This paper presents an extension to a heuristic method for phasing and imputation of genotypes of descendants in biparental populations so that it can phase and impute genotypes of parents that are ungenotyped or partially genotyped. The imputed genotypes of the parent are used to impute low‐density (Ld) genotyped descendants to high density (Hd). The extension was implemented as part of the AlphaPlantImpute software and works in three steps. First, it identifies whether a parent has no or Ld genotypes and identifies its relatives that have Hd genotypes. Second, using the Hd genotypes of relatives, it determines whether the parent is homozygous or heterozygous for a given locus. Third, it phases heterozygous positions of the parent by matching haplotypes to its relatives. We measured the accuracy (correlation between true and imputed genotypes) of imputing parent genotypes in simulated biparental populations from different scenarios. We tested the imputation accuracy of the missing parent's descendants using the true genotype of the parent and compared this with using the imputed genotypes of the parent. Across all scenarios, the imputation accuracy of a parent was >0.98 and did not drop below ∼0.96. The imputation accuracy of a parent was always higher when it was inbred than outbred. Including ancestors of the parent at Hd, increasing the number of crosses and the number of Hd descendants increased the imputation accuracy. The high imputation accuracy achieved for the parent translated to little or no impact on the imputation accuracy of its descendants. John Wiley and Sons Inc. 2021-06-14 2021 /pmc/articles/PMC8362159/ /pubmed/34413534 http://dx.doi.org/10.1002/csc2.20409 Text en © 2020 The Authors. Crop Science published by Wiley Periodicals LLC on behalf of Crop Science Society of America https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | ORIGINAL RESEARCH ARTICLES Gonen, Serap Wimmer, Valentin Gaynor, R. Chris Byrne, Ed Gorjanc, Gregor Hickey, John M. Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations |
title | Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations |
title_full | Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations |
title_fullStr | Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations |
title_full_unstemmed | Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations |
title_short | Phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations |
title_sort | phasing and imputation of single nucleotide polymorphism data of missing parents of biparental plant populations |
topic | ORIGINAL RESEARCH ARTICLES |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8362159/ https://www.ncbi.nlm.nih.gov/pubmed/34413534 http://dx.doi.org/10.1002/csc2.20409 |
work_keys_str_mv | AT gonenserap phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations AT wimmervalentin phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations AT gaynorrchris phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations AT byrneed phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations AT gorjancgregor phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations AT hickeyjohnm phasingandimputationofsinglenucleotidepolymorphismdataofmissingparentsofbiparentalplantpopulations |