Cargando…

Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava

Genomic applications such as genomic selection and genome-wide association have become increasingly common since the advent of genome sequencing. The cost of sequencing has decreased in the past two decades; however, genotyping costs are still prohibitive to gathering large datasets for these genomi...

Descripción completa

Detalles Bibliográficos
Autores principales: Long, Evan M, Bradbury, Peter J, Romay, M Cinta, Buckler, Edward S, Robbins, Kelly R
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728015/
https://www.ncbi.nlm.nih.gov/pubmed/34751380
http://dx.doi.org/10.1093/g3journal/jkab383
_version_ 1784626638553612288
author Long, Evan M
Bradbury, Peter J
Romay, M Cinta
Buckler, Edward S
Robbins, Kelly R
author_facet Long, Evan M
Bradbury, Peter J
Romay, M Cinta
Buckler, Edward S
Robbins, Kelly R
author_sort Long, Evan M
collection PubMed
description Genomic applications such as genomic selection and genome-wide association have become increasingly common since the advent of genome sequencing. The cost of sequencing has decreased in the past two decades; however, genotyping costs are still prohibitive to gathering large datasets for these genomic applications, especially in nonmodel species where resources are less abundant. Genotype imputation makes it possible to infer whole-genome information from limited input data, making large sampling for genomic applications more feasible. Imputation becomes increasingly difficult in heterozygous species where haplotypes must be phased. The practical haplotype graph (PHG) is a recently developed tool that can accurately impute genotypes, using a reference panel of haplotypes. We showcase the ability of the PHG to impute genomic information in the highly heterozygous crop cassava (Manihot esculenta). Accurately phased haplotypes were sampled from runs of homozygosity across a diverse panel of individuals to populate PHG, which proved more accurate than relying on computational phasing methods. The PHG achieved high imputation accuracy, using sparse skim-sequencing input, which translated to substantial genomic prediction accuracy in cross-validation testing. The PHG showed improved imputation accuracy, compared to a standard imputation tool Beagle, especially in predicting rare alleles.
format Online
Article
Text
id pubmed-8728015
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-87280152022-01-05 Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava Long, Evan M Bradbury, Peter J Romay, M Cinta Buckler, Edward S Robbins, Kelly R G3 (Bethesda) Software and Data Resources Genomic applications such as genomic selection and genome-wide association have become increasingly common since the advent of genome sequencing. The cost of sequencing has decreased in the past two decades; however, genotyping costs are still prohibitive to gathering large datasets for these genomic applications, especially in nonmodel species where resources are less abundant. Genotype imputation makes it possible to infer whole-genome information from limited input data, making large sampling for genomic applications more feasible. Imputation becomes increasingly difficult in heterozygous species where haplotypes must be phased. The practical haplotype graph (PHG) is a recently developed tool that can accurately impute genotypes, using a reference panel of haplotypes. We showcase the ability of the PHG to impute genomic information in the highly heterozygous crop cassava (Manihot esculenta). Accurately phased haplotypes were sampled from runs of homozygosity across a diverse panel of individuals to populate PHG, which proved more accurate than relying on computational phasing methods. The PHG achieved high imputation accuracy, using sparse skim-sequencing input, which translated to substantial genomic prediction accuracy in cross-validation testing. The PHG showed improved imputation accuracy, compared to a standard imputation tool Beagle, especially in predicting rare alleles. Oxford University Press 2021-11-09 /pmc/articles/PMC8728015/ /pubmed/34751380 http://dx.doi.org/10.1093/g3journal/jkab383 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software and Data Resources
Long, Evan M
Bradbury, Peter J
Romay, M Cinta
Buckler, Edward S
Robbins, Kelly R
Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava
title Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava
title_full Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava
title_fullStr Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava
title_full_unstemmed Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava
title_short Genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava
title_sort genome-wide imputation using the practical haplotype graph in the heterozygous crop cassava
topic Software and Data Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8728015/
https://www.ncbi.nlm.nih.gov/pubmed/34751380
http://dx.doi.org/10.1093/g3journal/jkab383
work_keys_str_mv AT longevanm genomewideimputationusingthepracticalhaplotypegraphintheheterozygouscropcassava
AT bradburypeterj genomewideimputationusingthepracticalhaplotypegraphintheheterozygouscropcassava
AT romaymcinta genomewideimputationusingthepracticalhaplotypegraphintheheterozygouscropcassava
AT buckleredwards genomewideimputationusingthepracticalhaplotypegraphintheheterozygouscropcassava
AT robbinskellyr genomewideimputationusingthepracticalhaplotypegraphintheheterozygouscropcassava