Cargando…

aKmerBroom: Ancient oral DNA decontamination using Bloom filters on k-mer sets

Dental calculus samples are modeled as a mixture of DNA coming from dental plaque and contaminants. Current computational decontamination methods such as Recentrifuge and DeconSeq require either a reference database or sequenced negative controls, and therefore have limited use cases. We present a r...

Descripción completa

Detalles Bibliográficos
Autores principales: Duitama González, Camila, Rangavittal, Samarth, Vicedomini, Riccardo, Chikhi, Rayan, Richard, Hugues
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10590965/
https://www.ncbi.nlm.nih.gov/pubmed/37876815
http://dx.doi.org/10.1016/j.isci.2023.108057
Descripción
Sumario:Dental calculus samples are modeled as a mixture of DNA coming from dental plaque and contaminants. Current computational decontamination methods such as Recentrifuge and DeconSeq require either a reference database or sequenced negative controls, and therefore have limited use cases. We present a reference-free decontamination tool tailored for the removal of contaminant DNA of ancient oral sample called aKmerBroom. Our tool builds a Bloom filter of known ancient and modern oral k-mers, then scans an input set of ancient metagenomic reads using multiple passes to iteratively retain reads likely to be of oral origin. On synthetic data, aKmerBroom achieves over [Formula: see text] sensitivity and [Formula: see text] specificity. On real datasets, aKmerBroom shows higher read retainment ([Formula: see text] on average) than other methods. We anticipate aKmerBroom will be a valuable tool for the processing of ancient oral samples as it will prevent contaminated datasets from being completely discarded in downstream analyses.