Cargando…
HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes
Infectious diseases emerge unprecedentedly, posing serious challenges to public health and the global economy. Virulence factors (VFs) enable pathogens to adhere, reproduce and cause damage to host cells, and antibiotic resistance genes (ARGs) allow pathogens to evade otherwise curable treatments. S...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9918863/ https://www.ncbi.nlm.nih.gov/pubmed/36789031 http://dx.doi.org/10.1093/nargab/lqad012 |
_version_ | 1784886681209405440 |
---|---|
author | Ji, Boya Pi, Wending Liu, Wenjuan Liu, Yannan Cui, Yujun Zhang, Xianglilan Peng, Shaoliang |
author_facet | Ji, Boya Pi, Wending Liu, Wenjuan Liu, Yannan Cui, Yujun Zhang, Xianglilan Peng, Shaoliang |
author_sort | Ji, Boya |
collection | PubMed |
description | Infectious diseases emerge unprecedentedly, posing serious challenges to public health and the global economy. Virulence factors (VFs) enable pathogens to adhere, reproduce and cause damage to host cells, and antibiotic resistance genes (ARGs) allow pathogens to evade otherwise curable treatments. Simultaneous identification of VFs and ARGs can save pathogen surveillance time, especially in situ epidemic pathogen detection. However, most tools can only predict either VFs or ARGs. Few tools that predict VFs and ARGs simultaneously usually have high false-negative rates, are sensitive to the cutoff thresholds and can only identify conserved genes. For better simultaneous prediction of VFs and ARGs, we propose a hybrid deep ensemble learning approach called HyperVR. By considering both best hit scores and statistical gene sequence patterns, HyperVR combines classical machine learning and deep learning to simultaneously and accurately predict VFs, ARGs and negative genes (neither VFs nor ARGs). For the prediction of individual VFs and ARGs, in silico spike-in experiment (the VFs and ARGs in real metagenomic data), and pseudo-VFs and -ARGs (gene fragments), HyperVR outperforms the current state-of-the-art prediction tools. HyperVR uses only gene sequence information without strict cutoff thresholds, hence making prediction straightforward and reliable. |
format | Online Article Text |
id | pubmed-9918863 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-99188632023-02-13 HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes Ji, Boya Pi, Wending Liu, Wenjuan Liu, Yannan Cui, Yujun Zhang, Xianglilan Peng, Shaoliang NAR Genom Bioinform Standard Article Infectious diseases emerge unprecedentedly, posing serious challenges to public health and the global economy. Virulence factors (VFs) enable pathogens to adhere, reproduce and cause damage to host cells, and antibiotic resistance genes (ARGs) allow pathogens to evade otherwise curable treatments. Simultaneous identification of VFs and ARGs can save pathogen surveillance time, especially in situ epidemic pathogen detection. However, most tools can only predict either VFs or ARGs. Few tools that predict VFs and ARGs simultaneously usually have high false-negative rates, are sensitive to the cutoff thresholds and can only identify conserved genes. For better simultaneous prediction of VFs and ARGs, we propose a hybrid deep ensemble learning approach called HyperVR. By considering both best hit scores and statistical gene sequence patterns, HyperVR combines classical machine learning and deep learning to simultaneously and accurately predict VFs, ARGs and negative genes (neither VFs nor ARGs). For the prediction of individual VFs and ARGs, in silico spike-in experiment (the VFs and ARGs in real metagenomic data), and pseudo-VFs and -ARGs (gene fragments), HyperVR outperforms the current state-of-the-art prediction tools. HyperVR uses only gene sequence information without strict cutoff thresholds, hence making prediction straightforward and reliable. Oxford University Press 2023-02-11 /pmc/articles/PMC9918863/ /pubmed/36789031 http://dx.doi.org/10.1093/nargab/lqad012 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Standard Article Ji, Boya Pi, Wending Liu, Wenjuan Liu, Yannan Cui, Yujun Zhang, Xianglilan Peng, Shaoliang HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes |
title | HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes |
title_full | HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes |
title_fullStr | HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes |
title_full_unstemmed | HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes |
title_short | HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes |
title_sort | hypervr: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9918863/ https://www.ncbi.nlm.nih.gov/pubmed/36789031 http://dx.doi.org/10.1093/nargab/lqad012 |
work_keys_str_mv | AT jiboya hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes AT piwending hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes AT liuwenjuan hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes AT liuyannan hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes AT cuiyujun hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes AT zhangxianglilan hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes AT pengshaoliang hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes |