Cargando…
Studying pathogens degrades BLAST-based pathogen identification
As synthetic biology becomes increasingly capable and accessible, it is likewise increasingly critical to be able to make accurate biosecurity determinations regarding the pathogenicity or toxicity of particular nucleic acid or amino acid sequences. At present, this is typically done using the BLAST...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10068195/ https://www.ncbi.nlm.nih.gov/pubmed/37012314 http://dx.doi.org/10.1038/s41598-023-32481-z |
_version_ | 1785018630058016768 |
---|---|
author | Beal, Jacob Clore, Adam Manthey, Jeff |
author_facet | Beal, Jacob Clore, Adam Manthey, Jeff |
author_sort | Beal, Jacob |
collection | PubMed |
description | As synthetic biology becomes increasingly capable and accessible, it is likewise increasingly critical to be able to make accurate biosecurity determinations regarding the pathogenicity or toxicity of particular nucleic acid or amino acid sequences. At present, this is typically done using the BLAST algorithm to determine the best match with sequences in the NCBI nucleic acid and protein databases. Neither BLAST nor any of the NCBI databases, however, are actually designed for biosafety determination. Critically, taxonomic errors or ambiguities in the NCBI nucleic acid and protein databases can also cause errors in BLAST-based taxonomic categorization. With heavily studied taxa and frequently used biotechnology tools, even low frequency taxonomic categorization issues can lead to high rates of errors in biosecurity decision-making. Here we focus on the implications for false positives, finding that BLAST against NCBI’s protein database will now incorrectly categorize a number of commonly used biotechnology tool sequences as the pathogens or toxins with which they have been used. Paradoxically, this implies that problems are expected to be most acute for the pathogens and toxins of highest interest and for the most widely used biotechnology tools. We thus conclude that biosecurity tools should shift away from BLAST against general purpose databases and towards new methods that are specifically tailored for biosafety purposes. |
format | Online Article Text |
id | pubmed-10068195 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-100681952023-04-03 Studying pathogens degrades BLAST-based pathogen identification Beal, Jacob Clore, Adam Manthey, Jeff Sci Rep Article As synthetic biology becomes increasingly capable and accessible, it is likewise increasingly critical to be able to make accurate biosecurity determinations regarding the pathogenicity or toxicity of particular nucleic acid or amino acid sequences. At present, this is typically done using the BLAST algorithm to determine the best match with sequences in the NCBI nucleic acid and protein databases. Neither BLAST nor any of the NCBI databases, however, are actually designed for biosafety determination. Critically, taxonomic errors or ambiguities in the NCBI nucleic acid and protein databases can also cause errors in BLAST-based taxonomic categorization. With heavily studied taxa and frequently used biotechnology tools, even low frequency taxonomic categorization issues can lead to high rates of errors in biosecurity decision-making. Here we focus on the implications for false positives, finding that BLAST against NCBI’s protein database will now incorrectly categorize a number of commonly used biotechnology tool sequences as the pathogens or toxins with which they have been used. Paradoxically, this implies that problems are expected to be most acute for the pathogens and toxins of highest interest and for the most widely used biotechnology tools. We thus conclude that biosecurity tools should shift away from BLAST against general purpose databases and towards new methods that are specifically tailored for biosafety purposes. Nature Publishing Group UK 2023-04-03 /pmc/articles/PMC10068195/ /pubmed/37012314 http://dx.doi.org/10.1038/s41598-023-32481-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Beal, Jacob Clore, Adam Manthey, Jeff Studying pathogens degrades BLAST-based pathogen identification |
title | Studying pathogens degrades BLAST-based pathogen identification |
title_full | Studying pathogens degrades BLAST-based pathogen identification |
title_fullStr | Studying pathogens degrades BLAST-based pathogen identification |
title_full_unstemmed | Studying pathogens degrades BLAST-based pathogen identification |
title_short | Studying pathogens degrades BLAST-based pathogen identification |
title_sort | studying pathogens degrades blast-based pathogen identification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10068195/ https://www.ncbi.nlm.nih.gov/pubmed/37012314 http://dx.doi.org/10.1038/s41598-023-32481-z |
work_keys_str_mv | AT bealjacob studyingpathogensdegradesblastbasedpathogenidentification AT cloreadam studyingpathogensdegradesblastbasedpathogenidentification AT mantheyjeff studyingpathogensdegradesblastbasedpathogenidentification |