Cargando…

Studying pathogens degrades BLAST-based pathogen identification

As synthetic biology becomes increasingly capable and accessible, it is likewise increasingly critical to be able to make accurate biosecurity determinations regarding the pathogenicity or toxicity of particular nucleic acid or amino acid sequences. At present, this is typically done using the BLAST...

Descripción completa

Detalles Bibliográficos
Autores principales: Beal, Jacob, Clore, Adam, Manthey, Jeff
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10068195/
https://www.ncbi.nlm.nih.gov/pubmed/37012314
http://dx.doi.org/10.1038/s41598-023-32481-z
_version_ 1785018630058016768
author Beal, Jacob
Clore, Adam
Manthey, Jeff
author_facet Beal, Jacob
Clore, Adam
Manthey, Jeff
author_sort Beal, Jacob
collection PubMed
description As synthetic biology becomes increasingly capable and accessible, it is likewise increasingly critical to be able to make accurate biosecurity determinations regarding the pathogenicity or toxicity of particular nucleic acid or amino acid sequences. At present, this is typically done using the BLAST algorithm to determine the best match with sequences in the NCBI nucleic acid and protein databases. Neither BLAST nor any of the NCBI databases, however, are actually designed for biosafety determination. Critically, taxonomic errors or ambiguities in the NCBI nucleic acid and protein databases can also cause errors in BLAST-based taxonomic categorization. With heavily studied taxa and frequently used biotechnology tools, even low frequency taxonomic categorization issues can lead to high rates of errors in biosecurity decision-making. Here we focus on the implications for false positives, finding that BLAST against NCBI’s protein database will now incorrectly categorize a number of commonly used biotechnology tool sequences as the pathogens or toxins with which they have been used. Paradoxically, this implies that problems are expected to be most acute for the pathogens and toxins of highest interest and for the most widely used biotechnology tools. We thus conclude that biosecurity tools should shift away from BLAST against general purpose databases and towards new methods that are specifically tailored for biosafety purposes.
format Online
Article
Text
id pubmed-10068195
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-100681952023-04-03 Studying pathogens degrades BLAST-based pathogen identification Beal, Jacob Clore, Adam Manthey, Jeff Sci Rep Article As synthetic biology becomes increasingly capable and accessible, it is likewise increasingly critical to be able to make accurate biosecurity determinations regarding the pathogenicity or toxicity of particular nucleic acid or amino acid sequences. At present, this is typically done using the BLAST algorithm to determine the best match with sequences in the NCBI nucleic acid and protein databases. Neither BLAST nor any of the NCBI databases, however, are actually designed for biosafety determination. Critically, taxonomic errors or ambiguities in the NCBI nucleic acid and protein databases can also cause errors in BLAST-based taxonomic categorization. With heavily studied taxa and frequently used biotechnology tools, even low frequency taxonomic categorization issues can lead to high rates of errors in biosecurity decision-making. Here we focus on the implications for false positives, finding that BLAST against NCBI’s protein database will now incorrectly categorize a number of commonly used biotechnology tool sequences as the pathogens or toxins with which they have been used. Paradoxically, this implies that problems are expected to be most acute for the pathogens and toxins of highest interest and for the most widely used biotechnology tools. We thus conclude that biosecurity tools should shift away from BLAST against general purpose databases and towards new methods that are specifically tailored for biosafety purposes. Nature Publishing Group UK 2023-04-03 /pmc/articles/PMC10068195/ /pubmed/37012314 http://dx.doi.org/10.1038/s41598-023-32481-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Beal, Jacob
Clore, Adam
Manthey, Jeff
Studying pathogens degrades BLAST-based pathogen identification
title Studying pathogens degrades BLAST-based pathogen identification
title_full Studying pathogens degrades BLAST-based pathogen identification
title_fullStr Studying pathogens degrades BLAST-based pathogen identification
title_full_unstemmed Studying pathogens degrades BLAST-based pathogen identification
title_short Studying pathogens degrades BLAST-based pathogen identification
title_sort studying pathogens degrades blast-based pathogen identification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10068195/
https://www.ncbi.nlm.nih.gov/pubmed/37012314
http://dx.doi.org/10.1038/s41598-023-32481-z
work_keys_str_mv AT bealjacob studyingpathogensdegradesblastbasedpathogenidentification
AT cloreadam studyingpathogensdegradesblastbasedpathogenidentification
AT mantheyjeff studyingpathogensdegradesblastbasedpathogenidentification