Cargando…

Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study

BACKGROUND: Massively parallel sequencing platforms, featuring high throughput and relatively short read lengths, are well suited to ancient DNA (aDNA) studies. Variant identification from short-read alignment could be hindered, however, by low DNA concentrations common to historic samples, which co...

Descripción completa

Detalles Bibliográficos
Autores principales: Parks, Matthew, Lambert, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4312461/
https://www.ncbi.nlm.nih.gov/pubmed/25613391
http://dx.doi.org/10.1186/s12864-015-1219-8
_version_ 1782355124655489024
author Parks, Matthew
Lambert, David
author_facet Parks, Matthew
Lambert, David
author_sort Parks, Matthew
collection PubMed
description BACKGROUND: Massively parallel sequencing platforms, featuring high throughput and relatively short read lengths, are well suited to ancient DNA (aDNA) studies. Variant identification from short-read alignment could be hindered, however, by low DNA concentrations common to historic samples, which constrain sequencing depths, and post-mortem DNA damage patterns. RESULTS: We simulated pairs of sequences to act as reference and sample genomes at varied GC contents and divergence levels. Short-read sequence pools were generated from sample sequences, and subjected to varying levels of “post-mortem” damage by adjusting levels of fragmentation and fragmentation biases, transition rates at sequence ends, and sequencing depths. Mapping of sample read pools to reference sequences revealed several trends, including decreased alignment success with increased read length and decreased variant recovery with increased divergence. Variants were generally called with high accuracy, however identification of SNPs (single-nucleotide polymorphisms) was less accurate for high damage/low divergence samples. Modest increases in sequencing depth resulted in rapid gains in total variant recovery, and limited improvements to recovery of heterozygous variants. CONCLUSIONS: This in silico study suggests aDNA-associated damage patterns minimally impact variant call accuracy and recovery from short-read alignment, while modest increases in sequencing depth can greatly improve variant recovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1219-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4312461
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-43124612015-02-01 Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study Parks, Matthew Lambert, David BMC Genomics Research Article BACKGROUND: Massively parallel sequencing platforms, featuring high throughput and relatively short read lengths, are well suited to ancient DNA (aDNA) studies. Variant identification from short-read alignment could be hindered, however, by low DNA concentrations common to historic samples, which constrain sequencing depths, and post-mortem DNA damage patterns. RESULTS: We simulated pairs of sequences to act as reference and sample genomes at varied GC contents and divergence levels. Short-read sequence pools were generated from sample sequences, and subjected to varying levels of “post-mortem” damage by adjusting levels of fragmentation and fragmentation biases, transition rates at sequence ends, and sequencing depths. Mapping of sample read pools to reference sequences revealed several trends, including decreased alignment success with increased read length and decreased variant recovery with increased divergence. Variants were generally called with high accuracy, however identification of SNPs (single-nucleotide polymorphisms) was less accurate for high damage/low divergence samples. Modest increases in sequencing depth resulted in rapid gains in total variant recovery, and limited improvements to recovery of heterozygous variants. CONCLUSIONS: This in silico study suggests aDNA-associated damage patterns minimally impact variant call accuracy and recovery from short-read alignment, while modest increases in sequencing depth can greatly improve variant recovery. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-1219-8) contains supplementary material, which is available to authorized users. BioMed Central 2015-01-23 /pmc/articles/PMC4312461/ /pubmed/25613391 http://dx.doi.org/10.1186/s12864-015-1219-8 Text en © Parks and Lambert; licensee Biomed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Parks, Matthew
Lambert, David
Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study
title Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study
title_full Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study
title_fullStr Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study
title_full_unstemmed Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study
title_short Impacts of low coverage depths and post-mortem DNA damage on variant calling: a simulation study
title_sort impacts of low coverage depths and post-mortem dna damage on variant calling: a simulation study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4312461/
https://www.ncbi.nlm.nih.gov/pubmed/25613391
http://dx.doi.org/10.1186/s12864-015-1219-8
work_keys_str_mv AT parksmatthew impactsoflowcoveragedepthsandpostmortemdnadamageonvariantcallingasimulationstudy
AT lambertdavid impactsoflowcoveragedepthsandpostmortemdnadamageonvariantcallingasimulationstudy