Cargando…

Contaminating DNA in human saliva alters the detection of variants from whole genome sequencing

Cells obtained from human saliva are commonly used as an alternative DNA source when blood is difficult or less convenient to collect. Although DNA extracted from saliva is considered to be of comparable quality to that derived from blood, recent studies have shown that non-human contaminating DNA d...

Descripción completa

Detalles Bibliográficos
Autores principales: Samson, C. A., Whitford, W., Snell, R. G., Jacobsen, J. C., Lehnert, K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7648094/
https://www.ncbi.nlm.nih.gov/pubmed/33159102
http://dx.doi.org/10.1038/s41598-020-76022-4
_version_ 1783607045154406400
author Samson, C. A.
Whitford, W.
Snell, R. G.
Jacobsen, J. C.
Lehnert, K.
author_facet Samson, C. A.
Whitford, W.
Snell, R. G.
Jacobsen, J. C.
Lehnert, K.
author_sort Samson, C. A.
collection PubMed
description Cells obtained from human saliva are commonly used as an alternative DNA source when blood is difficult or less convenient to collect. Although DNA extracted from saliva is considered to be of comparable quality to that derived from blood, recent studies have shown that non-human contaminating DNA derived from saliva can confound whole genome sequencing results. The most concerning complication is that non-human reads align to the human reference genome using standard methodology, which can critically affect the resulting variant genotypes identified in a genome. We identified clusters of anomalous variants in saliva DNA derived reads which aligned in an atypical manner. These reads had only short regions of identity to the human reference sequence, flanked by soft clipped sequence. Sequence comparisons of atypically aligning reads from eight human saliva-derived samples to RefSeq genomes revealed the majority to be of bacterial origin (63.46%). To partition the non-human reads during the alignment step, a decoy of the most prevalent bacterial genome sequences was designed and utilised. This reduced the number of atypically aligning reads when trialled on the eight saliva-derived samples by 44% and most importantly prevented the associated anomalous genotype calls. Saliva derived DNA is often contaminated by DNA from other species. This can lead to non-human reads aligning to the human reference genome using current alignment best-practices, impacting variant identification. This problem can be diminished by using a bacterial decoy in the alignment process.
format Online
Article
Text
id pubmed-7648094
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-76480942020-11-12 Contaminating DNA in human saliva alters the detection of variants from whole genome sequencing Samson, C. A. Whitford, W. Snell, R. G. Jacobsen, J. C. Lehnert, K. Sci Rep Article Cells obtained from human saliva are commonly used as an alternative DNA source when blood is difficult or less convenient to collect. Although DNA extracted from saliva is considered to be of comparable quality to that derived from blood, recent studies have shown that non-human contaminating DNA derived from saliva can confound whole genome sequencing results. The most concerning complication is that non-human reads align to the human reference genome using standard methodology, which can critically affect the resulting variant genotypes identified in a genome. We identified clusters of anomalous variants in saliva DNA derived reads which aligned in an atypical manner. These reads had only short regions of identity to the human reference sequence, flanked by soft clipped sequence. Sequence comparisons of atypically aligning reads from eight human saliva-derived samples to RefSeq genomes revealed the majority to be of bacterial origin (63.46%). To partition the non-human reads during the alignment step, a decoy of the most prevalent bacterial genome sequences was designed and utilised. This reduced the number of atypically aligning reads when trialled on the eight saliva-derived samples by 44% and most importantly prevented the associated anomalous genotype calls. Saliva derived DNA is often contaminated by DNA from other species. This can lead to non-human reads aligning to the human reference genome using current alignment best-practices, impacting variant identification. This problem can be diminished by using a bacterial decoy in the alignment process. Nature Publishing Group UK 2020-11-06 /pmc/articles/PMC7648094/ /pubmed/33159102 http://dx.doi.org/10.1038/s41598-020-76022-4 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Samson, C. A.
Whitford, W.
Snell, R. G.
Jacobsen, J. C.
Lehnert, K.
Contaminating DNA in human saliva alters the detection of variants from whole genome sequencing
title Contaminating DNA in human saliva alters the detection of variants from whole genome sequencing
title_full Contaminating DNA in human saliva alters the detection of variants from whole genome sequencing
title_fullStr Contaminating DNA in human saliva alters the detection of variants from whole genome sequencing
title_full_unstemmed Contaminating DNA in human saliva alters the detection of variants from whole genome sequencing
title_short Contaminating DNA in human saliva alters the detection of variants from whole genome sequencing
title_sort contaminating dna in human saliva alters the detection of variants from whole genome sequencing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7648094/
https://www.ncbi.nlm.nih.gov/pubmed/33159102
http://dx.doi.org/10.1038/s41598-020-76022-4
work_keys_str_mv AT samsonca contaminatingdnainhumansalivaaltersthedetectionofvariantsfromwholegenomesequencing
AT whitfordw contaminatingdnainhumansalivaaltersthedetectionofvariantsfromwholegenomesequencing
AT snellrg contaminatingdnainhumansalivaaltersthedetectionofvariantsfromwholegenomesequencing
AT jacobsenjc contaminatingdnainhumansalivaaltersthedetectionofvariantsfromwholegenomesequencing
AT lehnertk contaminatingdnainhumansalivaaltersthedetectionofvariantsfromwholegenomesequencing