TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data

Estimation of genetically related individuals is playing an increasingly important role in the ancient DNA field. In recent years, the numbers of sequenced individuals from single sites have been increasing, reflecting a growing interest in understanding the familial and social organisation of ancie...

Bibliographic Details
Main authors: Fernandes, Daniel M., Cheronet, Olivia, Gelabert, Pere, Pinhasi, Ron
Format: Online Article Text
Language: English
Published: Nature Publishing Group UK 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553948/
https://www.ncbi.nlm.nih.gov/pubmed/34711884
http://dx.doi.org/10.1038/s41598-021-00581-3
_version_ 1784591685360025600
author Fernandes, Daniel M.
Cheronet, Olivia
Gelabert, Pere
Pinhasi, Ron
collection PubMed
description Estimation of genetically related individuals is playing an increasingly important role in the ancient DNA field. In recent years, the numbers of sequenced individuals from single sites have been increasing, reflecting a growing interest in understanding the familial and social organisation of ancient populations. Although a few different methods have been specifically developed for ancient DNA, namely to tackle issues such as low-coverage homozygous data, they require a 0.1–1× minimum average genomic coverage per analysed pair of individuals. Here we present an updated version of a method that enables estimates of 1st and 2nd-degrees of relatedness with as little as 0.026× average coverage, or around 18,000 SNPs from 1.3 million aligned reads per sample with average length of 62 bp—four times less data than 0.1× coverage at similar read lengths. By using simulated data to estimate false positive error rates, we further show that a threshold even as low as 0.012×, or around 4000 SNPs from 600,000 reads, will always show 1st-degree relationships as related. Lastly, by applying this method to published data, we are able to identify previously undocumented relationships using individuals that had been excluded from prior kinship analysis due to their very low coverage. This methodological improvement has the potential to enable relatedness estimation on ancient whole genome shotgun data during routine low-coverage screening, and therefore improve project management when decisions need to be made on which individuals are to be further sequenced.
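The coverage figures quoted in the description can be checked with a short calculation: average coverage is the number of aligned reads multiplied by the mean read length, divided by the genome length. The sketch below is only an illustration and rests on two assumptions that the record does not state: a human genome length of roughly 3.1 Gbp, and the same 62 bp mean read length for the 600,000-read threshold. The function name `average_coverage` is hypothetical and is not part of TKGWV2.

```python
# Assumption (not stated in the record): haploid human genome length of ~3.1 Gbp.
GENOME_LENGTH_BP = 3.1e9


def average_coverage(aligned_reads, mean_read_length_bp,
                     genome_length_bp=GENOME_LENGTH_BP):
    """Return average genomic coverage as total aligned bases / genome length."""
    return aligned_reads * mean_read_length_bp / genome_length_bp


# 1.3 million aligned reads of 62 bp, as quoted in the description.
recommended = average_coverage(1_300_000, 62)

# 600,000 reads; the 62 bp read length is assumed to carry over to this case.
minimum = average_coverage(600_000, 62)

print(f"{recommended:.3f}x")  # 0.026x
print(f"{minimum:.3f}x")      # 0.012x

# Ratio behind the "four times less data than 0.1x" statement (about 3.8).
print(f"{0.1 / recommended:.1f}")
```

Under these assumptions the two thresholds come out at 0.026× and 0.012×, matching the values given in the description.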
format Online
Article
Text
id pubmed-8553948
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-8553948 2021-11-01 TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data Fernandes, Daniel M. Cheronet, Olivia Gelabert, Pere Pinhasi, Ron Sci Rep Article
Nature Publishing Group UK 2021-10-28 /pmc/articles/PMC8553948/ /pubmed/34711884 http://dx.doi.org/10.1038/s41598-021-00581-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/ Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
title TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data
topic Article