Cargando…

HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes

Infectious diseases emerge unprecedentedly, posing serious challenges to public health and the global economy. Virulence factors (VFs) enable pathogens to adhere, reproduce and cause damage to host cells, and antibiotic resistance genes (ARGs) allow pathogens to evade otherwise curable treatments. S...

Descripción completa

Detalles Bibliográficos
Autores principales: Ji, Boya, Pi, Wending, Liu, Wenjuan, Liu, Yannan, Cui, Yujun, Zhang, Xianglilan, Peng, Shaoliang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9918863/
https://www.ncbi.nlm.nih.gov/pubmed/36789031
http://dx.doi.org/10.1093/nargab/lqad012
_version_ 1784886681209405440
author Ji, Boya
Pi, Wending
Liu, Wenjuan
Liu, Yannan
Cui, Yujun
Zhang, Xianglilan
Peng, Shaoliang
author_facet Ji, Boya
Pi, Wending
Liu, Wenjuan
Liu, Yannan
Cui, Yujun
Zhang, Xianglilan
Peng, Shaoliang
author_sort Ji, Boya
collection PubMed
description Infectious diseases emerge unprecedentedly, posing serious challenges to public health and the global economy. Virulence factors (VFs) enable pathogens to adhere, reproduce and cause damage to host cells, and antibiotic resistance genes (ARGs) allow pathogens to evade otherwise curable treatments. Simultaneous identification of VFs and ARGs can save pathogen surveillance time, especially in situ epidemic pathogen detection. However, most tools can only predict either VFs or ARGs. Few tools that predict VFs and ARGs simultaneously usually have high false-negative rates, are sensitive to the cutoff thresholds and can only identify conserved genes. For better simultaneous prediction of VFs and ARGs, we propose a hybrid deep ensemble learning approach called HyperVR. By considering both best hit scores and statistical gene sequence patterns, HyperVR combines classical machine learning and deep learning to simultaneously and accurately predict VFs, ARGs and negative genes (neither VFs nor ARGs). For the prediction of individual VFs and ARGs, in silico spike-in experiment (the VFs and ARGs in real metagenomic data), and pseudo-VFs and -ARGs (gene fragments), HyperVR outperforms the current state-of-the-art prediction tools. HyperVR uses only gene sequence information without strict cutoff thresholds, hence making prediction straightforward and reliable.
format Online
Article
Text
id pubmed-9918863
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-99188632023-02-13 HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes Ji, Boya Pi, Wending Liu, Wenjuan Liu, Yannan Cui, Yujun Zhang, Xianglilan Peng, Shaoliang NAR Genom Bioinform Standard Article Infectious diseases emerge unprecedentedly, posing serious challenges to public health and the global economy. Virulence factors (VFs) enable pathogens to adhere, reproduce and cause damage to host cells, and antibiotic resistance genes (ARGs) allow pathogens to evade otherwise curable treatments. Simultaneous identification of VFs and ARGs can save pathogen surveillance time, especially in situ epidemic pathogen detection. However, most tools can only predict either VFs or ARGs. Few tools that predict VFs and ARGs simultaneously usually have high false-negative rates, are sensitive to the cutoff thresholds and can only identify conserved genes. For better simultaneous prediction of VFs and ARGs, we propose a hybrid deep ensemble learning approach called HyperVR. By considering both best hit scores and statistical gene sequence patterns, HyperVR combines classical machine learning and deep learning to simultaneously and accurately predict VFs, ARGs and negative genes (neither VFs nor ARGs). For the prediction of individual VFs and ARGs, in silico spike-in experiment (the VFs and ARGs in real metagenomic data), and pseudo-VFs and -ARGs (gene fragments), HyperVR outperforms the current state-of-the-art prediction tools. HyperVR uses only gene sequence information without strict cutoff thresholds, hence making prediction straightforward and reliable. Oxford University Press 2023-02-11 /pmc/articles/PMC9918863/ /pubmed/36789031 http://dx.doi.org/10.1093/nargab/lqad012 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Standard Article
Ji, Boya
Pi, Wending
Liu, Wenjuan
Liu, Yannan
Cui, Yujun
Zhang, Xianglilan
Peng, Shaoliang
HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes
title HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes
title_full HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes
title_fullStr HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes
title_full_unstemmed HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes
title_short HyperVR: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes
title_sort hypervr: a hybrid deep ensemble learning approach for simultaneously predicting virulence factors and antibiotic resistance genes
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9918863/
https://www.ncbi.nlm.nih.gov/pubmed/36789031
http://dx.doi.org/10.1093/nargab/lqad012
work_keys_str_mv AT jiboya hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes
AT piwending hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes
AT liuwenjuan hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes
AT liuyannan hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes
AT cuiyujun hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes
AT zhangxianglilan hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes
AT pengshaoliang hypervrahybriddeepensemblelearningapproachforsimultaneouslypredictingvirulencefactorsandantibioticresistancegenes