Cargando…
INSIDER: alignment-free detection of foreign DNA sequences
External DNA sequences can be inserted into an organism’s genome either through natural processes such as gene transfer, or through targeted genome engineering strategies. Being able to robustly identify such foreign DNA is a crucial capability for health and biosecurity applications, such as anti-m...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273350/ https://www.ncbi.nlm.nih.gov/pubmed/34285780 http://dx.doi.org/10.1016/j.csbj.2021.06.045 |
_version_ | 1783721351490568192 |
---|---|
author | Tay, Aidan P. Hosking, Brendan Hosking, Cameron Bauer, Denis C. Wilson, Laurence O.W. |
author_facet | Tay, Aidan P. Hosking, Brendan Hosking, Cameron Bauer, Denis C. Wilson, Laurence O.W. |
author_sort | Tay, Aidan P. |
collection | PubMed |
description | External DNA sequences can be inserted into an organism’s genome either through natural processes such as gene transfer, or through targeted genome engineering strategies. Being able to robustly identify such foreign DNA is a crucial capability for health and biosecurity applications, such as anti-microbial resistance (AMR) detection or monitoring gene drives. This capability does not exist for poorly characterised host genomes or with limited information about the integrated sequence. To address this, we developed the INserted Sequence Information DEtectoR (INSIDER). INSIDER analyses whole genome sequencing data and identifies segments of potentially foreign origin by their significant shift in k-mer signatures. We demonstrate the power of INSIDER to separate integrated DNA sequences from normal genomic sequences on a synthetic dataset simulating the insertion of a CRISPR-Cas gene drive into wild-type yeast. As a proof-of-concept, we use INSIDER to detect the exact AMR plasmid in whole genome sequencing data from a Citrobacter freundii patient isolate. INSIDER streamlines the process of identifying integrated DNA in poorly characterised wild species or when the insert is of unknown origin, thus enhancing the monitoring of emerging biosecurity threats. |
format | Online Article Text |
id | pubmed-8273350 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-82733502021-07-19 INSIDER: alignment-free detection of foreign DNA sequences Tay, Aidan P. Hosking, Brendan Hosking, Cameron Bauer, Denis C. Wilson, Laurence O.W. Comput Struct Biotechnol J Research Article External DNA sequences can be inserted into an organism’s genome either through natural processes such as gene transfer, or through targeted genome engineering strategies. Being able to robustly identify such foreign DNA is a crucial capability for health and biosecurity applications, such as anti-microbial resistance (AMR) detection or monitoring gene drives. This capability does not exist for poorly characterised host genomes or with limited information about the integrated sequence. To address this, we developed the INserted Sequence Information DEtectoR (INSIDER). INSIDER analyses whole genome sequencing data and identifies segments of potentially foreign origin by their significant shift in k-mer signatures. We demonstrate the power of INSIDER to separate integrated DNA sequences from normal genomic sequences on a synthetic dataset simulating the insertion of a CRISPR-Cas gene drive into wild-type yeast. As a proof-of-concept, we use INSIDER to detect the exact AMR plasmid in whole genome sequencing data from a Citrobacter freundii patient isolate. INSIDER streamlines the process of identifying integrated DNA in poorly characterised wild species or when the insert is of unknown origin, thus enhancing the monitoring of emerging biosecurity threats. Research Network of Computational and Structural Biotechnology 2021-06-29 /pmc/articles/PMC8273350/ /pubmed/34285780 http://dx.doi.org/10.1016/j.csbj.2021.06.045 Text en Crown Copyright © 2021 Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Research Article Tay, Aidan P. Hosking, Brendan Hosking, Cameron Bauer, Denis C. Wilson, Laurence O.W. INSIDER: alignment-free detection of foreign DNA sequences |
title | INSIDER: alignment-free detection of foreign DNA sequences |
title_full | INSIDER: alignment-free detection of foreign DNA sequences |
title_fullStr | INSIDER: alignment-free detection of foreign DNA sequences |
title_full_unstemmed | INSIDER: alignment-free detection of foreign DNA sequences |
title_short | INSIDER: alignment-free detection of foreign DNA sequences |
title_sort | insider: alignment-free detection of foreign dna sequences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8273350/ https://www.ncbi.nlm.nih.gov/pubmed/34285780 http://dx.doi.org/10.1016/j.csbj.2021.06.045 |
work_keys_str_mv | AT tayaidanp insideralignmentfreedetectionofforeigndnasequences AT hoskingbrendan insideralignmentfreedetectionofforeigndnasequences AT hoskingcameron insideralignmentfreedetectionofforeigndnasequences AT bauerdenisc insideralignmentfreedetectionofforeigndnasequences AT wilsonlaurenceow insideralignmentfreedetectionofforeigndnasequences |