Cargando…
Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans
When sequencing an ancient DNA sample from a hominin fossil, DNA from present-day humans involved in excavation and extraction will be sequenced along with the endogenous material. This type of contamination is problematic for downstream analyses as it will introduce a bias towards the population of...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4822957/ https://www.ncbi.nlm.nih.gov/pubmed/27049965 http://dx.doi.org/10.1371/journal.pgen.1005972 |
_version_ | 1782425834603151360 |
---|---|
author | Racimo, Fernando Renaud, Gabriel Slatkin, Montgomery |
author_facet | Racimo, Fernando Renaud, Gabriel Slatkin, Montgomery |
author_sort | Racimo, Fernando |
collection | PubMed |
description | When sequencing an ancient DNA sample from a hominin fossil, DNA from present-day humans involved in excavation and extraction will be sequenced along with the endogenous material. This type of contamination is problematic for downstream analyses as it will introduce a bias towards the population of the contaminating individual(s). Quantifying the extent of contamination is a crucial step as it allows researchers to account for possible biases that may arise in downstream genetic analyses. Here, we present an MCMC algorithm to co-estimate the contamination rate, sequencing error rate and demographic parameters—including drift times and admixture rates—for an ancient nuclear genome obtained from human remains, when the putative contaminating DNA comes from present-day humans. We assume we have a large panel representing the putative contaminant population (e.g. European, East Asian or African). The method is implemented in a C++ program called ‘Demographic Inference with Contamination and Error’ (DICE). We applied it to simulations and genome data from ancient Neanderthals and modern humans. With reasonable levels of genome sequence coverage (>3X), we find we can recover accurate estimates of all these parameters, even when the contamination rate is as high as 50%. |
format | Online Article Text |
id | pubmed-4822957 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-48229572016-04-22 Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans Racimo, Fernando Renaud, Gabriel Slatkin, Montgomery PLoS Genet Research Article When sequencing an ancient DNA sample from a hominin fossil, DNA from present-day humans involved in excavation and extraction will be sequenced along with the endogenous material. This type of contamination is problematic for downstream analyses as it will introduce a bias towards the population of the contaminating individual(s). Quantifying the extent of contamination is a crucial step as it allows researchers to account for possible biases that may arise in downstream genetic analyses. Here, we present an MCMC algorithm to co-estimate the contamination rate, sequencing error rate and demographic parameters—including drift times and admixture rates—for an ancient nuclear genome obtained from human remains, when the putative contaminating DNA comes from present-day humans. We assume we have a large panel representing the putative contaminant population (e.g. European, East Asian or African). The method is implemented in a C++ program called ‘Demographic Inference with Contamination and Error’ (DICE). We applied it to simulations and genome data from ancient Neanderthals and modern humans. With reasonable levels of genome sequence coverage (>3X), we find we can recover accurate estimates of all these parameters, even when the contamination rate is as high as 50%. Public Library of Science 2016-04-06 /pmc/articles/PMC4822957/ /pubmed/27049965 http://dx.doi.org/10.1371/journal.pgen.1005972 Text en © 2016 Racimo et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Racimo, Fernando Renaud, Gabriel Slatkin, Montgomery Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans |
title | Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans |
title_full | Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans |
title_fullStr | Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans |
title_full_unstemmed | Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans |
title_short | Joint Estimation of Contamination, Error and Demography for Nuclear DNA from Ancient Humans |
title_sort | joint estimation of contamination, error and demography for nuclear dna from ancient humans |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4822957/ https://www.ncbi.nlm.nih.gov/pubmed/27049965 http://dx.doi.org/10.1371/journal.pgen.1005972 |
work_keys_str_mv | AT racimofernando jointestimationofcontaminationerroranddemographyfornucleardnafromancienthumans AT renaudgabriel jointestimationofcontaminationerroranddemographyfornucleardnafromancienthumans AT slatkinmontgomery jointestimationofcontaminationerroranddemographyfornucleardnafromancienthumans |