Cargando…

The Establishment and Application of a Kraken Classifier for Salmonella Plasmid Sequence Prediction

INTRODUCTION: Salmonella is a key intestinal pathogen of foodborne disease, and the plasmids in Salmonella are related to many biological characteristics, including virulence and drug resistance. A large number of plasmid contigs have been sequenced in bacterial draft genomes, however, these are oft...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zhenpeng, Pang, Bo, Lu, Xin, Kan, Biao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Editorial Office of CCDCW, Chinese Center for Disease Control and Prevention 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889229/
https://www.ncbi.nlm.nih.gov/pubmed/36751662
http://dx.doi.org/10.46234/ccdcw2022.225
_version_ 1784880685834567680
author Li, Zhenpeng
Pang, Bo
Lu, Xin
Kan, Biao
author_facet Li, Zhenpeng
Pang, Bo
Lu, Xin
Kan, Biao
author_sort Li, Zhenpeng
collection PubMed
description INTRODUCTION: Salmonella is a key intestinal pathogen of foodborne disease, and the plasmids in Salmonella are related to many biological characteristics, including virulence and drug resistance. A large number of plasmid contigs have been sequenced in bacterial draft genomes, however, these are often difficult to distinguish from chromosomal contigs. METHODS: In this study, three different customized Kraken databases were used to build three different Kraken classifiers. Complete genome benchmark datasets and simulated draft genome benchmark datasets were constructed. Five-fold cross-validation was used to evaluate the performance of the three different Kraken classifiers by two benchmark datasets. RESULTS: The predictive performance of the classifier based on all National Center for Biotechnology Information plasmids and Salmonella complete genomes was optimal. This optimal Kraken classifier was performed with Salmonella isolated in China. The plasmid carrying rate of Salmonella in China is 91.01%, and it was found that the Kraken classifier could find more plasmid contigs and antibiotic resistance genes (ARGs) than results derived from a plasmid replicon-based method (PlasmidFinder). Moreover, it was found that in the strains carrying ARGs, plasmids carried more ARGs [three, 95% confidence interval (CI): 1–14] than chromosomes (one, 95% CI: 1–7). DISCUSSION: We found building a high-quality customized database as a Kraken classifier to be ideal for the prediction of Salmonella plasmid sequences from bacterial draft genomes. In the future, the Kraken classifier established in this study will play a significant role in ARG monitoring.
format Online
Article
Text
id pubmed-9889229
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Editorial Office of CCDCW, Chinese Center for Disease Control and Prevention
record_format MEDLINE/PubMed
spelling pubmed-98892292023-02-06 The Establishment and Application of a Kraken Classifier for Salmonella Plasmid Sequence Prediction Li, Zhenpeng Pang, Bo Lu, Xin Kan, Biao China CDC Wkly Methods and Applications INTRODUCTION: Salmonella is a key intestinal pathogen of foodborne disease, and the plasmids in Salmonella are related to many biological characteristics, including virulence and drug resistance. A large number of plasmid contigs have been sequenced in bacterial draft genomes, however, these are often difficult to distinguish from chromosomal contigs. METHODS: In this study, three different customized Kraken databases were used to build three different Kraken classifiers. Complete genome benchmark datasets and simulated draft genome benchmark datasets were constructed. Five-fold cross-validation was used to evaluate the performance of the three different Kraken classifiers by two benchmark datasets. RESULTS: The predictive performance of the classifier based on all National Center for Biotechnology Information plasmids and Salmonella complete genomes was optimal. This optimal Kraken classifier was performed with Salmonella isolated in China. The plasmid carrying rate of Salmonella in China is 91.01%, and it was found that the Kraken classifier could find more plasmid contigs and antibiotic resistance genes (ARGs) than results derived from a plasmid replicon-based method (PlasmidFinder). Moreover, it was found that in the strains carrying ARGs, plasmids carried more ARGs [three, 95% confidence interval (CI): 1–14] than chromosomes (one, 95% CI: 1–7). DISCUSSION: We found building a high-quality customized database as a Kraken classifier to be ideal for the prediction of Salmonella plasmid sequences from bacterial draft genomes. In the future, the Kraken classifier established in this study will play a significant role in ARG monitoring. Editorial Office of CCDCW, Chinese Center for Disease Control and Prevention 2022-12-09 /pmc/articles/PMC9889229/ /pubmed/36751662 http://dx.doi.org/10.46234/ccdcw2022.225 Text en Copyright and License information: Editorial Office of CCDCW, Chinese Center for Disease Control and Prevention 2022 https://creativecommons.org/licenses/by-nc-sa/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 4.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/ (https://creativecommons.org/licenses/by-nc-sa/4.0/)
spellingShingle Methods and Applications
Li, Zhenpeng
Pang, Bo
Lu, Xin
Kan, Biao
The Establishment and Application of a Kraken Classifier for Salmonella Plasmid Sequence Prediction
title The Establishment and Application of a Kraken Classifier for Salmonella Plasmid Sequence Prediction
title_full The Establishment and Application of a Kraken Classifier for Salmonella Plasmid Sequence Prediction
title_fullStr The Establishment and Application of a Kraken Classifier for Salmonella Plasmid Sequence Prediction
title_full_unstemmed The Establishment and Application of a Kraken Classifier for Salmonella Plasmid Sequence Prediction
title_short The Establishment and Application of a Kraken Classifier for Salmonella Plasmid Sequence Prediction
title_sort establishment and application of a kraken classifier for salmonella plasmid sequence prediction
topic Methods and Applications
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9889229/
https://www.ncbi.nlm.nih.gov/pubmed/36751662
http://dx.doi.org/10.46234/ccdcw2022.225
work_keys_str_mv AT lizhenpeng theestablishmentandapplicationofakrakenclassifierforsalmonellaplasmidsequenceprediction
AT pangbo theestablishmentandapplicationofakrakenclassifierforsalmonellaplasmidsequenceprediction
AT luxin theestablishmentandapplicationofakrakenclassifierforsalmonellaplasmidsequenceprediction
AT kanbiao theestablishmentandapplicationofakrakenclassifierforsalmonellaplasmidsequenceprediction
AT lizhenpeng establishmentandapplicationofakrakenclassifierforsalmonellaplasmidsequenceprediction
AT pangbo establishmentandapplicationofakrakenclassifierforsalmonellaplasmidsequenceprediction
AT luxin establishmentandapplicationofakrakenclassifierforsalmonellaplasmidsequenceprediction
AT kanbiao establishmentandapplicationofakrakenclassifierforsalmonellaplasmidsequenceprediction