Cargando…

Patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria

BACKGROUND: Genome and transcriptome sequencing applications that rely on variation in sequence depth can be negatively affected if there are systematic biases in coverage. We have investigated patterns of local variation in sequencing coverage by utilising ultra-deep sequencing (>100,000X) of mt...

Descripción completa

Detalles Bibliográficos
Autores principales: Ekblom, Robert, Smeds, Linnéa, Ellegren, Hans
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4070552/
https://www.ncbi.nlm.nih.gov/pubmed/24923674
http://dx.doi.org/10.1186/1471-2164-15-467
_version_ 1782322706916573184
author Ekblom, Robert
Smeds, Linnéa
Ellegren, Hans
author_facet Ekblom, Robert
Smeds, Linnéa
Ellegren, Hans
author_sort Ekblom, Robert
collection PubMed
description BACKGROUND: Genome and transcriptome sequencing applications that rely on variation in sequence depth can be negatively affected if there are systematic biases in coverage. We have investigated patterns of local variation in sequencing coverage by utilising ultra-deep sequencing (>100,000X) of mtDNA obtained during sequencing of two vertebrate genomes, wolverine (Gulo gulo) and collared flycatcher (Ficedula albicollis). With such extreme depth, stochastic variation in coverage should be negligible, which allows us to provide a very detailed, fine-scale picture of sequence dependent coverage variation and sequencing error rates. RESULTS: Sequencing coverage showed up to six-fold variation across the complete mtDNA and this variation was highly repeatable in sequencing of multiple individuals of the same species. Moreover, coverage in orthologous regions was correlated between the two species and was negatively correlated with GC content. We also found a negative correlation between the site-specific sequencing error rate and coverage, with certain sequence motifs “CCNGCC” being particularly prone to high rates of error and low coverage. CONCLUSIONS: Our results demonstrate that inherent sequence characteristics govern variation in coverage and suggest that some of this variation, like GC content, should be controlled for in, for example, RNA-Seq and detection of copy number variation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi: 10.1186/1471-2164-15-467) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4070552
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40705522014-06-27 Patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria Ekblom, Robert Smeds, Linnéa Ellegren, Hans BMC Genomics Research Article BACKGROUND: Genome and transcriptome sequencing applications that rely on variation in sequence depth can be negatively affected if there are systematic biases in coverage. We have investigated patterns of local variation in sequencing coverage by utilising ultra-deep sequencing (>100,000X) of mtDNA obtained during sequencing of two vertebrate genomes, wolverine (Gulo gulo) and collared flycatcher (Ficedula albicollis). With such extreme depth, stochastic variation in coverage should be negligible, which allows us to provide a very detailed, fine-scale picture of sequence dependent coverage variation and sequencing error rates. RESULTS: Sequencing coverage showed up to six-fold variation across the complete mtDNA and this variation was highly repeatable in sequencing of multiple individuals of the same species. Moreover, coverage in orthologous regions was correlated between the two species and was negatively correlated with GC content. We also found a negative correlation between the site-specific sequencing error rate and coverage, with certain sequence motifs “CCNGCC” being particularly prone to high rates of error and low coverage. CONCLUSIONS: Our results demonstrate that inherent sequence characteristics govern variation in coverage and suggest that some of this variation, like GC content, should be controlled for in, for example, RNA-Seq and detection of copy number variation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi: 10.1186/1471-2164-15-467) contains supplementary material, which is available to authorized users. BioMed Central 2014-06-12 /pmc/articles/PMC4070552/ /pubmed/24923674 http://dx.doi.org/10.1186/1471-2164-15-467 Text en © Ekblom et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ekblom, Robert
Smeds, Linnéa
Ellegren, Hans
Patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria
title Patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria
title_full Patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria
title_fullStr Patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria
title_full_unstemmed Patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria
title_short Patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria
title_sort patterns of sequencing coverage bias revealed by ultra-deep sequencing of vertebrate mitochondria
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4070552/
https://www.ncbi.nlm.nih.gov/pubmed/24923674
http://dx.doi.org/10.1186/1471-2164-15-467
work_keys_str_mv AT ekblomrobert patternsofsequencingcoveragebiasrevealedbyultradeepsequencingofvertebratemitochondria
AT smedslinnea patternsofsequencingcoveragebiasrevealedbyultradeepsequencingofvertebratemitochondria
AT ellegrenhans patternsofsequencingcoveragebiasrevealedbyultradeepsequencingofvertebratemitochondria