Cargando…
Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study
BACKGROUND: Massively parallel sequencing platforms, featuring high throughput and relatively short read lengths, are well suited to ancient DNA (aDNA) studies. Variant identification from short-read alignment could be hindered, however, by low DNA concentrations common to historic samples, which co...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4312461/ https://www.ncbi.nlm.nih.gov/pubmed/25613391 http://dx.doi.org/10.1186/s12864-015-1219-8 |
_version_ | 1782355124655489024 |
---|---|
author | Parks, Matthew Lambert, David |
author_facet | Parks, Matthew Lambert, David |
author_sort | Parks, Matthew |
collection | PubMed |
description | BACKGROUND: Massively parallel sequencing platforms, featuring high throughput and relatively short read lengths, are well suited to ancient DNA (aDNA) studies. Variant identification from short-read alignment could be hindered, however, by low DNA concentrations common to historic samples, which constrain sequencing depths, and post-mortem DNA damage patterns. RESULTS: We simulated pairs of sequences to act as reference and sample genomes at varied GC contents and divergence levels. Short-read sequence pools were generated from sample sequences, and subjected to varying levels of “post-mortem” damage by adjusting levels of fragmentation and fragmentation biases, transition rates at sequence ends, and sequencing depths. Mapping of sample read pools to reference sequences revealed several trends, including decreased alignment success with increased read length and decreased variant recovery with increased divergence. Variants were generally called with high accuracy, however identification of SNPs (single-nucleotide polymorphisms) was less accurate for high damage/low divergence samples. Modest increases in sequencing depth resulted in rapid gains in total variant recovery, and limited improvements to recovery of heterozygous variants. CONCLUSIONS: This in silico study suggests aDNA-associated damage patterns minimally impact variant call accuracy and recovery from short-read alignment, while modest increases in sequencing depth can greatly improve variant recovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1219-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4312461 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43124612015-02-01 Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study Parks, Matthew Lambert, David BMC Genomics Research Article BACKGROUND: Massively parallel sequencing platforms, featuring high throughput and relatively short read lengths, are well suited to ancient DNA (aDNA) studies. Variant identification from short-read alignment could be hindered, however, by low DNA concentrations common to historic samples, which constrain sequencing depths, and post-mortem DNA damage patterns. RESULTS: We simulated pairs of sequences to act as reference and sample genomes at varied GC contents and divergence levels. Short-read sequence pools were generated from sample sequences, and subjected to varying levels of “post-mortem” damage by adjusting levels of fragmentation and fragmentation biases, transition rates at sequence ends, and sequencing depths. Mapping of sample read pools to reference sequences revealed several trends, including decreased alignment success with increased read length and decreased variant recovery with increased divergence. Variants were generally called with high accuracy, however identification of SNPs (single-nucleotide polymorphisms) was less accurate for high damage/low divergence samples. Modest increases in sequencing depth resulted in rapid gains in total variant recovery, and limited improvements to recovery of heterozygous variants. CONCLUSIONS: This in silico study suggests aDNA-associated damage patterns minimally impact variant call accuracy and recovery from short-read alignment, while modest increases in sequencing depth can greatly improve variant recovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1219-8) contains supplementary material, which is available to authorized users. BioMed Central 2015-01-23 /pmc/articles/PMC4312461/ /pubmed/25613391 http://dx.doi.org/10.1186/s12864-015-1219-8 Text en © Parks and Lambert; licensee Biomed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Parks, Matthew Lambert, David Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study |
title | Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study |
title_full | Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study |
title_fullStr | Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study |
title_full_unstemmed | Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study |
title_short | Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study |
title_sort | impacts of low coverage depths and post-mortem dna damage on variant calling: a simulation study |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4312461/ https://www.ncbi.nlm.nih.gov/pubmed/25613391 http://dx.doi.org/10.1186/s12864-015-1219-8 |
work_keys_str_mv | AT parksmatthew impactsoflowcoveragedepthsandpostmortemdnadamageonvariantcallingasimulationstudy AT lambertdavid impactsoflowcoveragedepthsandpostmortemdnadamageonvariantcallingasimulationstudy |