Cargando…

AlphaFamImpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data

SUMMARY: AlphaFamImpute is an imputation package for calling, phasing and imputing genome-wide genotypes in outbred full-sib families from single nucleotide polymorphism (SNP) array and genotype-by-sequencing (GBS) data. GBS data are increasingly being used to genotype individuals, especially when S...

Descripción completa

Detalles Bibliográficos
Autores principales: Whalen, Andrew, Gorjanc, Gregor, Hickey, John M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7520044/
https://www.ncbi.nlm.nih.gov/pubmed/32467963
http://dx.doi.org/10.1093/bioinformatics/btaa499
_version_ 1783587698884214784
author Whalen, Andrew
Gorjanc, Gregor
Hickey, John M
author_facet Whalen, Andrew
Gorjanc, Gregor
Hickey, John M
author_sort Whalen, Andrew
collection PubMed
description SUMMARY: AlphaFamImpute is an imputation package for calling, phasing and imputing genome-wide genotypes in outbred full-sib families from single nucleotide polymorphism (SNP) array and genotype-by-sequencing (GBS) data. GBS data are increasingly being used to genotype individuals, especially when SNP arrays do not exist for a population of interest. Low-coverage GBS produces data with a large number of missing or incorrect naïve genotype calls, which can be improved by identifying shared haplotype segments between full-sib individuals. Here, we present AlphaFamImpute, an algorithm specifically designed to exploit the genetic structure of full-sib families. It performs imputation using a two-step approach. In the first step, it phases and imputes parental genotypes based on the segregation states of their offspring (i.e. which pair of parental haplotypes the offspring inherited). In the second step, it phases and imputes the offspring genotypes by detecting which haplotype segments the offspring inherited from their parents. With a series of simulations, we find that AlphaFamImpute obtains high-accuracy genotypes, even when the parents are not genotyped and individuals are sequenced at <1x coverage. AVAILABILITY AND IMPLEMENTATION: AlphaFamImpute is available as a Python package from the AlphaGenes website http://www.AlphaGenes.roslin.ed.ac.uk/AlphaFamImpute. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-7520044
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-75200442020-09-30 AlphaFamImpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data Whalen, Andrew Gorjanc, Gregor Hickey, John M Bioinformatics Applications Notes SUMMARY: AlphaFamImpute is an imputation package for calling, phasing and imputing genome-wide genotypes in outbred full-sib families from single nucleotide polymorphism (SNP) array and genotype-by-sequencing (GBS) data. GBS data are increasingly being used to genotype individuals, especially when SNP arrays do not exist for a population of interest. Low-coverage GBS produces data with a large number of missing or incorrect naïve genotype calls, which can be improved by identifying shared haplotype segments between full-sib individuals. Here, we present AlphaFamImpute, an algorithm specifically designed to exploit the genetic structure of full-sib families. It performs imputation using a two-step approach. In the first step, it phases and imputes parental genotypes based on the segregation states of their offspring (i.e. which pair of parental haplotypes the offspring inherited). In the second step, it phases and imputes the offspring genotypes by detecting which haplotype segments the offspring inherited from their parents. With a series of simulations, we find that AlphaFamImpute obtains high-accuracy genotypes, even when the parents are not genotyped and individuals are sequenced at <1x coverage. AVAILABILITY AND IMPLEMENTATION: AlphaFamImpute is available as a Python package from the AlphaGenes website http://www.AlphaGenes.roslin.ed.ac.uk/AlphaFamImpute. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2020-05-28 /pmc/articles/PMC7520044/ /pubmed/32467963 http://dx.doi.org/10.1093/bioinformatics/btaa499 Text en © The Author(s) 2020. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Applications Notes
Whalen, Andrew
Gorjanc, Gregor
Hickey, John M
AlphaFamImpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data
title AlphaFamImpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data
title_full AlphaFamImpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data
title_fullStr AlphaFamImpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data
title_full_unstemmed AlphaFamImpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data
title_short AlphaFamImpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data
title_sort alphafamimpute: high-accuracy imputation in full-sib families from genotype-by-sequencing data
topic Applications Notes
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7520044/
https://www.ncbi.nlm.nih.gov/pubmed/32467963
http://dx.doi.org/10.1093/bioinformatics/btaa499
work_keys_str_mv AT whalenandrew alphafamimputehighaccuracyimputationinfullsibfamiliesfromgenotypebysequencingdata
AT gorjancgregor alphafamimputehighaccuracyimputationinfullsibfamiliesfromgenotypebysequencingdata
AT hickeyjohnm alphafamimputehighaccuracyimputationinfullsibfamiliesfromgenotypebysequencingdata