Cargando…

Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans

When sequencing an ancient DNA sample from a hominin fossil, DNA from present-day humans involved in excavation and extraction will be sequenced along with the endogenous material. This type of contamination is problematic for downstream analyses as it will introduce a bias towards the population of...

Descripción completa

Detalles Bibliográficos
Autores principales: Racimo, Fernando, Renaud, Gabriel, Slatkin, Montgomery
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4822957/
https://www.ncbi.nlm.nih.gov/pubmed/27049965
http://dx.doi.org/10.1371/journal.pgen.1005972
_version_ 1782425834603151360
author Racimo, Fernando
Renaud, Gabriel
Slatkin, Montgomery
author_facet Racimo, Fernando
Renaud, Gabriel
Slatkin, Montgomery
author_sort Racimo, Fernando
collection PubMed
description When sequencing an ancient DNA sample from a hominin fossil, DNA from present-day humans involved in excavation and extraction will be sequenced along with the endogenous material. This type of contamination is problematic for downstream analyses as it will introduce a bias towards the population of the contaminating individual(s). Quantifying the extent of contamination is a crucial step as it allows researchers to account for possible biases that may arise in downstream genetic analyses. Here, we present an MCMC algorithm to co-estimate the contamination rate, sequencing error rate and demographic parameters—including drift times and admixture rates—for an ancient nuclear genome obtained from human remains, when the putative contaminating DNA comes from present-day humans. We assume we have a large panel representing the putative contaminant population (e.g. European, East Asian or African). The method is implemented in a C++ program called ‘Demographic Inference with Contamination and Error’ (DICE). We applied it to simulations and genome data from ancient Neanderthals and modern humans. With reasonable levels of genome sequence coverage (>3X), we find we can recover accurate estimates of all these parameters, even when the contamination rate is as high as 50%.
format Online
Article
Text
id pubmed-4822957
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-48229572016-04-22 Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans Racimo, Fernando Renaud, Gabriel Slatkin, Montgomery PLoS Genet Research Article When sequencing an ancient DNA sample from a hominin fossil, DNA from present-day humans involved in excavation and extraction will be sequenced along with the endogenous material. This type of contamination is problematic for downstream analyses as it will introduce a bias towards the population of the contaminating individual(s). Quantifying the extent of contamination is a crucial step as it allows researchers to account for possible biases that may arise in downstream genetic analyses. Here, we present an MCMC algorithm to co-estimate the contamination rate, sequencing error rate and demographic parameters—including drift times and admixture rates—for an ancient nuclear genome obtained from human remains, when the putative contaminating DNA comes from present-day humans. We assume we have a large panel representing the putative contaminant population (e.g. European, East Asian or African). The method is implemented in a C++ program called ‘Demographic Inference with Contamination and Error’ (DICE). We applied it to simulations and genome data from ancient Neanderthals and modern humans. With reasonable levels of genome sequence coverage (>3X), we find we can recover accurate estimates of all these parameters, even when the contamination rate is as high as 50%. Public Library of Science 2016-04-06 /pmc/articles/PMC4822957/ /pubmed/27049965 http://dx.doi.org/10.1371/journal.pgen.1005972 Text en © 2016 Racimo et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Racimo, Fernando
Renaud, Gabriel
Slatkin, Montgomery
Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans
title Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans
title_full Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans
title_fullStr Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans
title_full_unstemmed Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans
title_short Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans
title_sort joint estimation of contamination, error and demography for nuclear dna from ancient humans
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4822957/
https://www.ncbi.nlm.nih.gov/pubmed/27049965
http://dx.doi.org/10.1371/journal.pgen.1005972
work_keys_str_mv AT racimofernando jointestimationofcontaminationerroranddemographyfornucleardnafromancienthumans
AT renaudgabriel jointestimationofcontaminationerroranddemographyfornucleardnafromancienthumans
AT slatkinmontgomery jointestimationofcontaminationerroranddemographyfornucleardnafromancienthumans