Cargando…

Achieving improved accuracy for imputation of ancient DNA

MOTIVATION: Genotype imputation has the potential to increase the amount of information that can be gained from the often limited biological material available in ancient samples. As many widely used tools have been developed with modern data in mind, their design is not necessarily reflective of th...

Descripción completa

Detalles Bibliográficos
Autores principales: Ausmees, Kristiina, Nettelblad, Carl
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9805568/
https://www.ncbi.nlm.nih.gov/pubmed/36377787
http://dx.doi.org/10.1093/bioinformatics/btac738
_version_ 1784862355226624000
author Ausmees, Kristiina
Nettelblad, Carl
author_facet Ausmees, Kristiina
Nettelblad, Carl
author_sort Ausmees, Kristiina
collection PubMed
description MOTIVATION: Genotype imputation has the potential to increase the amount of information that can be gained from the often limited biological material available in ancient samples. As many widely used tools have been developed with modern data in mind, their design is not necessarily reflective of the requirements in studies of ancient DNA. Here, we investigate if an imputation method based on the full probabilistic Li and Stephens model of haplotype frequencies might be beneficial for the particular challenges posed by ancient data. RESULTS: We present an implementation called prophaser and compare imputation performance to two alternative pipelines that have been used in the ancient DNA community based on the Beagle software. Considering empirical ancient data downsampled to lower coverages as well as present-day samples with artificially thinned genotypes, we show that the proposed method is advantageous at lower coverages, where it yields improved accuracy and ability to capture rare variation. The software prophaser is optimized for running in a massively parallel manner and achieved reasonable runtimes on the experiments performed when executed on a GPU. AVAILABILITY AND IMPLEMENTATION: The C++ code for prophaser is available in the GitHub repository https://github.com/scicompuu/prophaser. SUPPLEMENTARY INFORMATION: Supplementary information is available at Bioinformatics online.
format Online
Article
Text
id pubmed-9805568
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98055682023-01-03 Achieving improved accuracy for imputation of ancient DNA Ausmees, Kristiina Nettelblad, Carl Bioinformatics Original Paper MOTIVATION: Genotype imputation has the potential to increase the amount of information that can be gained from the often limited biological material available in ancient samples. As many widely used tools have been developed with modern data in mind, their design is not necessarily reflective of the requirements in studies of ancient DNA. Here, we investigate if an imputation method based on the full probabilistic Li and Stephens model of haplotype frequencies might be beneficial for the particular challenges posed by ancient data. RESULTS: We present an implementation called prophaser and compare imputation performance to two alternative pipelines that have been used in the ancient DNA community based on the Beagle software. Considering empirical ancient data downsampled to lower coverages as well as present-day samples with artificially thinned genotypes, we show that the proposed method is advantageous at lower coverages, where it yields improved accuracy and ability to capture rare variation. The software prophaser is optimized for running in a massively parallel manner and achieved reasonable runtimes on the experiments performed when executed on a GPU. AVAILABILITY AND IMPLEMENTATION: The C++ code for prophaser is available in the GitHub repository https://github.com/scicompuu/prophaser. SUPPLEMENTARY INFORMATION: Supplementary information is available at Bioinformatics online. Oxford University Press 2022-11-15 /pmc/articles/PMC9805568/ /pubmed/36377787 http://dx.doi.org/10.1093/bioinformatics/btac738 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Ausmees, Kristiina
Nettelblad, Carl
Achieving improved accuracy for imputation of ancient DNA
title Achieving improved accuracy for imputation of ancient DNA
title_full Achieving improved accuracy for imputation of ancient DNA
title_fullStr Achieving improved accuracy for imputation of ancient DNA
title_full_unstemmed Achieving improved accuracy for imputation of ancient DNA
title_short Achieving improved accuracy for imputation of ancient DNA
title_sort achieving improved accuracy for imputation of ancient dna
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9805568/
https://www.ncbi.nlm.nih.gov/pubmed/36377787
http://dx.doi.org/10.1093/bioinformatics/btac738
work_keys_str_mv AT ausmeeskristiina achievingimprovedaccuracyforimputationofancientdna
AT nettelbladcarl achievingimprovedaccuracyforimputationofancientdna