TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data
Estimation of genetically related individuals is playing an increasingly important role in the ancient DNA field. In recent years, the numbers of sequenced individuals from single sites have been increasing, reflecting a growing interest in understanding the familial and social organisation of ancie...
Main authors: | Fernandes, Daniel M.; Cheronet, Olivia; Gelabert, Pere; Pinhasi, Ron |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Nature Publishing Group UK, 2021 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553948/ https://www.ncbi.nlm.nih.gov/pubmed/34711884 http://dx.doi.org/10.1038/s41598-021-00581-3 |
_version_ | 1784591685360025600 |
---|---|
author | Fernandes, Daniel M.; Cheronet, Olivia; Gelabert, Pere; Pinhasi, Ron |
author_facet | Fernandes, Daniel M.; Cheronet, Olivia; Gelabert, Pere; Pinhasi, Ron |
author_sort | Fernandes, Daniel M. |
collection | PubMed |
description | Estimation of genetically related individuals is playing an increasingly important role in the ancient DNA field. In recent years, the numbers of sequenced individuals from single sites have been increasing, reflecting a growing interest in understanding the familial and social organisation of ancient populations. Although a few different methods have been specifically developed for ancient DNA, namely to tackle issues such as low-coverage homozygous data, they require a 0.1–1× minimum average genomic coverage per analysed pair of individuals. Here we present an updated version of a method that enables estimates of 1st and 2nd-degrees of relatedness with as little as 0.026× average coverage, or around 18,000 SNPs from 1.3 million aligned reads per sample with average length of 62 bp—four times less data than 0.1× coverage at similar read lengths. By using simulated data to estimate false positive error rates, we further show that a threshold even as low as 0.012×, or around 4000 SNPs from 600,000 reads, will always show 1st-degree relationships as related. Lastly, by applying this method to published data, we are able to identify previously undocumented relationships using individuals that had been excluded from prior kinship analysis due to their very low coverage. This methodological improvement has the potential to enable relatedness estimation on ancient whole genome shotgun data during routine low-coverage screening, and therefore improve project management when decisions need to be made on which individuals are to be further sequenced. |
format | Online Article Text |
id | pubmed-8553948 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-8553948 2021-11-01 TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data. Fernandes, Daniel M.; Cheronet, Olivia; Gelabert, Pere; Pinhasi, Ron. Sci Rep. Article. Nature Publishing Group UK 2021-10-28 /pmc/articles/PMC8553948/ /pubmed/34711884 http://dx.doi.org/10.1038/s41598-021-00581-3 Text en © The Author(s) 2021. Open Access: This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Fernandes, Daniel M. Cheronet, Olivia Gelabert, Pere Pinhasi, Ron TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data |
title | TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data |
title_full | TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data |
title_fullStr | TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data |
title_full_unstemmed | TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data |
title_short | TKGWV2: an ancient DNA relatedness pipeline for ultra-low coverage whole genome shotgun data |
title_sort | tkgwv2: an ancient dna relatedness pipeline for ultra-low coverage whole genome shotgun data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8553948/ https://www.ncbi.nlm.nih.gov/pubmed/34711884 http://dx.doi.org/10.1038/s41598-021-00581-3 |
work_keys_str_mv | AT fernandesdanielm tkgwv2anancientdnarelatednesspipelineforultralowcoveragewholegenomeshotgundata AT cheronetolivia tkgwv2anancientdnarelatednesspipelineforultralowcoveragewholegenomeshotgundata AT gelabertpere tkgwv2anancientdnarelatednesspipelineforultralowcoveragewholegenomeshotgundata AT pinhasiron tkgwv2anancientdnarelatednesspipelineforultralowcoveragewholegenomeshotgundata |
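
The coverage figures quoted in the description field can be checked with simple arithmetic: average coverage is roughly the number of aligned reads times the mean read length, divided by the genome length. The sketch below is illustrative only and is not part of TKGWV2. The genome length of 3.1 Gbp and the function names `mean_coverage` and `reads_needed` are assumptions introduced here; the record itself does not state which genome length the authors used.

```python
# Assumption (not stated in the record): a human reference genome of ~3.1 Gbp.
ASSUMED_GENOME_BP = 3.1e9


def mean_coverage(aligned_reads, mean_read_length_bp, genome_bp=ASSUMED_GENOME_BP):
    """Average depth of coverage: total aligned bases divided by genome length."""
    return aligned_reads * mean_read_length_bp / genome_bp


def reads_needed(target_coverage, mean_read_length_bp, genome_bp=ASSUMED_GENOME_BP):
    """Number of aligned reads needed to reach a target average coverage."""
    return target_coverage * genome_bp / mean_read_length_bp


if __name__ == "__main__":
    # 1.3 million aligned reads of 62 bp, as quoted in the description.
    print(round(mean_coverage(1_300_000, 62), 3))
    # 600,000 aligned reads of 62 bp, the lower threshold quoted in the description.
    print(round(mean_coverage(600_000, 62), 3))
    # How many times more reads 0.1x coverage needs than the 1.3 million read case.
    print(round(reads_needed(0.1, 62) / 1_300_000, 2))
```

Under the assumed genome length, the two read counts come out at about 0.026× and 0.012×, and 0.1× coverage needs a little under four times as many 62 bp reads as the 1.3 million read case, which is consistent with the "four times less data" statement in the description.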