Cargando…
Achieving improved accuracy for imputation of ancient DNA
MOTIVATION: Genotype imputation has the potential to increase the amount of information that can be gained from the often limited biological material available in ancient samples. As many widely used tools have been developed with modern data in mind, their design is not necessarily reflective of th...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9805568/ https://www.ncbi.nlm.nih.gov/pubmed/36377787 http://dx.doi.org/10.1093/bioinformatics/btac738 |
_version_ | 1784862355226624000 |
---|---|
author | Ausmees, Kristiina Nettelblad, Carl |
author_facet | Ausmees, Kristiina Nettelblad, Carl |
author_sort | Ausmees, Kristiina |
collection | PubMed |
description | MOTIVATION: Genotype imputation has the potential to increase the amount of information that can be gained from the often limited biological material available in ancient samples. As many widely used tools have been developed with modern data in mind, their design is not necessarily reflective of the requirements in studies of ancient DNA. Here, we investigate if an imputation method based on the full probabilistic Li and Stephens model of haplotype frequencies might be beneficial for the particular challenges posed by ancient data. RESULTS: We present an implementation called prophaser and compare imputation performance to two alternative pipelines that have been used in the ancient DNA community based on the Beagle software. Considering empirical ancient data downsampled to lower coverages as well as present-day samples with artificially thinned genotypes, we show that the proposed method is advantageous at lower coverages, where it yields improved accuracy and ability to capture rare variation. The software prophaser is optimized for running in a massively parallel manner and achieved reasonable runtimes on the experiments performed when executed on a GPU. AVAILABILITY AND IMPLEMENTATION: The C++ code for prophaser is available in the GitHub repository https://github.com/scicompuu/prophaser. SUPPLEMENTARY INFORMATION: Supplementary information is available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-9805568 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-98055682023-01-03 Achieving improved accuracy for imputation of ancient DNA Ausmees, Kristiina Nettelblad, Carl Bioinformatics Original Paper MOTIVATION: Genotype imputation has the potential to increase the amount of information that can be gained from the often limited biological material available in ancient samples. As many widely used tools have been developed with modern data in mind, their design is not necessarily reflective of the requirements in studies of ancient DNA. Here, we investigate if an imputation method based on the full probabilistic Li and Stephens model of haplotype frequencies might be beneficial for the particular challenges posed by ancient data. RESULTS: We present an implementation called prophaser and compare imputation performance to two alternative pipelines that have been used in the ancient DNA community based on the Beagle software. Considering empirical ancient data downsampled to lower coverages as well as present-day samples with artificially thinned genotypes, we show that the proposed method is advantageous at lower coverages, where it yields improved accuracy and ability to capture rare variation. The software prophaser is optimized for running in a massively parallel manner and achieved reasonable runtimes on the experiments performed when executed on a GPU. AVAILABILITY AND IMPLEMENTATION: The C++ code for prophaser is available in the GitHub repository https://github.com/scicompuu/prophaser. SUPPLEMENTARY INFORMATION: Supplementary information is available at Bioinformatics online. Oxford University Press 2022-11-15 /pmc/articles/PMC9805568/ /pubmed/36377787 http://dx.doi.org/10.1093/bioinformatics/btac738 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Paper Ausmees, Kristiina Nettelblad, Carl Achieving improved accuracy for imputation of ancient DNA |
title | Achieving improved accuracy for imputation of ancient DNA |
title_full | Achieving improved accuracy for imputation of ancient DNA |
title_fullStr | Achieving improved accuracy for imputation of ancient DNA |
title_full_unstemmed | Achieving improved accuracy for imputation of ancient DNA |
title_short | Achieving improved accuracy for imputation of ancient DNA |
title_sort | achieving improved accuracy for imputation of ancient dna |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9805568/ https://www.ncbi.nlm.nih.gov/pubmed/36377787 http://dx.doi.org/10.1093/bioinformatics/btac738 |
work_keys_str_mv | AT ausmeeskristiina achievingimprovedaccuracyforimputationofancientdna AT nettelbladcarl achievingimprovedaccuracyforimputationofancientdna |