Cargando…
Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava
Genomic applications such as genomic selection and genome-wide association have become increasingly common since the advent of genome sequencing. The cost of sequencing has decreased in the past two decades; however, genotyping costs are still prohibitive to gathering large datasets for these genomi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728015/ https://www.ncbi.nlm.nih.gov/pubmed/34751380 http://dx.doi.org/10.1093/g3journal/jkab383 |
_version_ | 1784626638553612288 |
---|---|
author | Long, Evan M Bradbury, Peter J Romay, M Cinta Buckler, Edward S Robbins, Kelly R |
author_facet | Long, Evan M Bradbury, Peter J Romay, M Cinta Buckler, Edward S Robbins, Kelly R |
author_sort | Long, Evan M |
collection | PubMed |
description | Genomic applications such as genomic selection and genome-wide association have become increasingly common since the advent of genome sequencing. The cost of sequencing has decreased in the past two decades; however, genotyping costs are still prohibitive to gathering large datasets for these genomic applications, especially in nonmodel species where resources are less abundant. Genotype imputation makes it possible to infer whole-genome information from limited input data, making large sampling for genomic applications more feasible. Imputation becomes increasingly difficult in heterozygous species where haplotypes must be phased. The practical haplotype graph (PHG) is a recently developed tool that can accurately impute genotypes, using a reference panel of haplotypes. We showcase the ability of the PHG to impute genomic information in the highly heterozygous crop cassava (Manihot esculenta). Accurately phased haplotypes were sampled from runs of homozygosity across a diverse panel of individuals to populate PHG, which proved more accurate than relying on computational phasing methods. The PHG achieved high imputation accuracy, using sparse skim-sequencing input, which translated to substantial genomic prediction accuracy in cross-validation testing. The PHG showed improved imputation accuracy, compared to a standard imputation tool Beagle, especially in predicting rare alleles. |
format | Online Article Text |
id | pubmed-8728015 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-87280152022-01-05 Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava Long, Evan M Bradbury, Peter J Romay, M Cinta Buckler, Edward S Robbins, Kelly R G3 (Bethesda) Software and Data Resources Genomic applications such as genomic selection and genome-wide association have become increasingly common since the advent of genome sequencing. The cost of sequencing has decreased in the past two decades; however, genotyping costs are still prohibitive to gathering large datasets for these genomic applications, especially in nonmodel species where resources are less abundant. Genotype imputation makes it possible to infer whole-genome information from limited input data, making large sampling for genomic applications more feasible. Imputation becomes increasingly difficult in heterozygous species where haplotypes must be phased. The practical haplotype graph (PHG) is a recently developed tool that can accurately impute genotypes, using a reference panel of haplotypes. We showcase the ability of the PHG to impute genomic information in the highly heterozygous crop cassava (Manihot esculenta). Accurately phased haplotypes were sampled from runs of homozygosity across a diverse panel of individuals to populate PHG, which proved more accurate than relying on computational phasing methods. The PHG achieved high imputation accuracy, using sparse skim-sequencing input, which translated to substantial genomic prediction accuracy in cross-validation testing. The PHG showed improved imputation accuracy, compared to a standard imputation tool Beagle, especially in predicting rare alleles. Oxford University Press 2021-11-09 /pmc/articles/PMC8728015/ /pubmed/34751380 http://dx.doi.org/10.1093/g3journal/jkab383 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software and Data Resources Long, Evan M Bradbury, Peter J Romay, M Cinta Buckler, Edward S Robbins, Kelly R Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava |
title | Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava |
title_full | Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava |
title_fullStr | Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava |
title_full_unstemmed | Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava |
title_short | Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava |
title_sort | genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava |
topic | Software and Data Resources |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728015/ https://www.ncbi.nlm.nih.gov/pubmed/34751380 http://dx.doi.org/10.1093/g3journal/jkab383 |
work_keys_str_mv | AT longevanm genomewideimputationusingthepracticalhaplotypegraphintheheterozygouscropcassava AT bradburypeterj genomewideimputationusingthepracticalhaplotypegraphintheheterozygouscropcassava AT romaymcinta genomewideimputationusingthepracticalhaplotypegraphintheheterozygouscropcassava AT buckleredwards genomewideimputationusingthepracticalhaplotypegraphintheheterozygouscropcassava AT robbinskellyr genomewideimputationusingthepracticalhaplotypegraphintheheterozygouscropcassava |