Cargando…

INSIDER: alignment-free detection of foreign DNA sequences

External DNA sequences can be inserted into an organism’s genome either through natural processes such as gene transfer, or through targeted genome engineering strategies. Being able to robustly identify such foreign DNA is a crucial capability for health and biosecurity applications, such as anti-m...

Descripción completa

Detalles Bibliográficos
Autores principales: Tay, Aidan P., Hosking, Brendan, Hosking, Cameron, Bauer, Denis C., Wilson, Laurence O.W.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273350/
https://www.ncbi.nlm.nih.gov/pubmed/34285780
http://dx.doi.org/10.1016/j.csbj.2021.06.045
_version_ 1783721351490568192
author Tay, Aidan P.
Hosking, Brendan
Hosking, Cameron
Bauer, Denis C.
Wilson, Laurence O.W.
author_facet Tay, Aidan P.
Hosking, Brendan
Hosking, Cameron
Bauer, Denis C.
Wilson, Laurence O.W.
author_sort Tay, Aidan P.
collection PubMed
description External DNA sequences can be inserted into an organism’s genome either through natural processes such as gene transfer, or through targeted genome engineering strategies. Being able to robustly identify such foreign DNA is a crucial capability for health and biosecurity applications, such as anti-microbial resistance (AMR) detection or monitoring gene drives. This capability does not exist for poorly characterised host genomes or with limited information about the integrated sequence. To address this, we developed the INserted Sequence Information DEtectoR (INSIDER). INSIDER analyses whole genome sequencing data and identifies segments of potentially foreign origin by their significant shift in k-mer signatures. We demonstrate the power of INSIDER to separate integrated DNA sequences from normal genomic sequences on a synthetic dataset simulating the insertion of a CRISPR-Cas gene drive into wild-type yeast. As a proof-of-concept, we use INSIDER to detect the exact AMR plasmid in whole genome sequencing data from a Citrobacter freundii patient isolate. INSIDER streamlines the process of identifying integrated DNA in poorly characterised wild species or when the insert is of unknown origin, thus enhancing the monitoring of emerging biosecurity threats.
format Online
Article
Text
id pubmed-8273350
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-82733502021-07-19 INSIDER: alignment-free detection of foreign DNA sequences Tay, Aidan P. Hosking, Brendan Hosking, Cameron Bauer, Denis C. Wilson, Laurence O.W. Comput Struct Biotechnol J Research Article External DNA sequences can be inserted into an organism’s genome either through natural processes such as gene transfer, or through targeted genome engineering strategies. Being able to robustly identify such foreign DNA is a crucial capability for health and biosecurity applications, such as anti-microbial resistance (AMR) detection or monitoring gene drives. This capability does not exist for poorly characterised host genomes or with limited information about the integrated sequence. To address this, we developed the INserted Sequence Information DEtectoR (INSIDER). INSIDER analyses whole genome sequencing data and identifies segments of potentially foreign origin by their significant shift in k-mer signatures. We demonstrate the power of INSIDER to separate integrated DNA sequences from normal genomic sequences on a synthetic dataset simulating the insertion of a CRISPR-Cas gene drive into wild-type yeast. As a proof-of-concept, we use INSIDER to detect the exact AMR plasmid in whole genome sequencing data from a Citrobacter freundii patient isolate. INSIDER streamlines the process of identifying integrated DNA in poorly characterised wild species or when the insert is of unknown origin, thus enhancing the monitoring of emerging biosecurity threats. Research Network of Computational and Structural Biotechnology 2021-06-29 /pmc/articles/PMC8273350/ /pubmed/34285780 http://dx.doi.org/10.1016/j.csbj.2021.06.045 Text en Crown Copyright © 2021 Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Tay, Aidan P.
Hosking, Brendan
Hosking, Cameron
Bauer, Denis C.
Wilson, Laurence O.W.
INSIDER: alignment-free detection of foreign DNA sequences
title INSIDER: alignment-free detection of foreign DNA sequences
title_full INSIDER: alignment-free detection of foreign DNA sequences
title_fullStr INSIDER: alignment-free detection of foreign DNA sequences
title_full_unstemmed INSIDER: alignment-free detection of foreign DNA sequences
title_short INSIDER: alignment-free detection of foreign DNA sequences
title_sort insider: alignment-free detection of foreign dna sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273350/
https://www.ncbi.nlm.nih.gov/pubmed/34285780
http://dx.doi.org/10.1016/j.csbj.2021.06.045
work_keys_str_mv AT tayaidanp insideralignmentfreedetectionofforeigndnasequences
AT hoskingbrendan insideralignmentfreedetectionofforeigndnasequences
AT hoskingcameron insideralignmentfreedetectionofforeigndnasequences
AT bauerdenisc insideralignmentfreedetectionofforeigndnasequences
AT wilsonlaurenceow insideralignmentfreedetectionofforeigndnasequences