Cargando…

HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes

BACKGROUND: The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate t...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yu, Xu, Zeling, Han, Wenkai, Cao, Huiluo, Umarov, Ramzan, Yan, Aixin, Fan, Ming, Chen, Huan, Duarte, Carlos M., Li, Lihua, Ho, Pak-Leung, Gao, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7871585/
https://www.ncbi.nlm.nih.gov/pubmed/33557954
http://dx.doi.org/10.1186/s40168-021-01002-3
_version_ 1783649036399542272
author Li, Yu
Xu, Zeling
Han, Wenkai
Cao, Huiluo
Umarov, Ramzan
Yan, Aixin
Fan, Ming
Chen, Huan
Duarte, Carlos M.
Li, Lihua
Ho, Pak-Leung
Gao, Xin
author_facet Li, Yu
Xu, Zeling
Han, Wenkai
Cao, Huiluo
Umarov, Ramzan
Yan, Aixin
Fan, Ming
Chen, Huan
Duarte, Carlos M.
Li, Lihua
Ho, Pak-Leung
Gao, Xin
author_sort Li, Yu
collection PubMed
description BACKGROUND: The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate the efficacy of antibiotics. Accurately identifying ARGs is thus an indispensable step to understanding the ecology, and transmission of ARGs between environmental and human-associated reservoirs. Unfortunately, the previous computational methods for identifying ARGs are mostly based on sequence alignment, which cannot identify novel ARGs, and their applications are limited by currently incomplete knowledge about ARGs. RESULTS: Here, we propose an end-to-end Hierarchical Multi-task Deep learning framework for ARG annotation (HMD-ARG). Taking raw sequence encoding as input, HMD-ARG can identify, without querying against existing sequence databases, multiple ARG properties simultaneously, including if the input protein sequence is an ARG, and if so, what antibiotic family it is resistant to, what resistant mechanism the ARG takes, and if the ARG is an intrinsic one or acquired one. In addition, if the predicted antibiotic family is beta-lactamase, HMD-ARG further predicts the subclass of beta-lactamase that the ARG is resistant to. Comprehensive experiments, including cross-fold validation, third-party dataset validation in human gut microbiota, wet-experimental functional validation, and structural investigation of predicted conserved sites, demonstrate not only the superior performance of our method over the state-of-art methods, but also the effectiveness and robustness of the proposed method. CONCLUSIONS: We propose a hierarchical multi-task method, HMD-ARG, which is based on deep learning and can provide detailed annotations of ARGs from three important aspects: resistant antibiotic class, resistant mechanism, and gene mobility. We believe that HMD-ARG can serve as a powerful tool to identify antibiotic resistance genes and, therefore mitigate their global threat. Our method and the constructed database are available at http://www.cbrc.kaust.edu.sa/HMDARG/. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01002-3.
format Online
Article
Text
id pubmed-7871585
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-78715852021-02-09 HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes Li, Yu Xu, Zeling Han, Wenkai Cao, Huiluo Umarov, Ramzan Yan, Aixin Fan, Ming Chen, Huan Duarte, Carlos M. Li, Lihua Ho, Pak-Leung Gao, Xin Microbiome Methodology BACKGROUND: The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate the efficacy of antibiotics. Accurately identifying ARGs is thus an indispensable step to understanding the ecology, and transmission of ARGs between environmental and human-associated reservoirs. Unfortunately, the previous computational methods for identifying ARGs are mostly based on sequence alignment, which cannot identify novel ARGs, and their applications are limited by currently incomplete knowledge about ARGs. RESULTS: Here, we propose an end-to-end Hierarchical Multi-task Deep learning framework for ARG annotation (HMD-ARG). Taking raw sequence encoding as input, HMD-ARG can identify, without querying against existing sequence databases, multiple ARG properties simultaneously, including if the input protein sequence is an ARG, and if so, what antibiotic family it is resistant to, what resistant mechanism the ARG takes, and if the ARG is an intrinsic one or acquired one. In addition, if the predicted antibiotic family is beta-lactamase, HMD-ARG further predicts the subclass of beta-lactamase that the ARG is resistant to. Comprehensive experiments, including cross-fold validation, third-party dataset validation in human gut microbiota, wet-experimental functional validation, and structural investigation of predicted conserved sites, demonstrate not only the superior performance of our method over the state-of-art methods, but also the effectiveness and robustness of the proposed method. CONCLUSIONS: We propose a hierarchical multi-task method, HMD-ARG, which is based on deep learning and can provide detailed annotations of ARGs from three important aspects: resistant antibiotic class, resistant mechanism, and gene mobility. We believe that HMD-ARG can serve as a powerful tool to identify antibiotic resistance genes and, therefore mitigate their global threat. Our method and the constructed database are available at http://www.cbrc.kaust.edu.sa/HMDARG/. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01002-3. BioMed Central 2021-02-08 /pmc/articles/PMC7871585/ /pubmed/33557954 http://dx.doi.org/10.1186/s40168-021-01002-3 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Li, Yu
Xu, Zeling
Han, Wenkai
Cao, Huiluo
Umarov, Ramzan
Yan, Aixin
Fan, Ming
Chen, Huan
Duarte, Carlos M.
Li, Lihua
Ho, Pak-Leung
Gao, Xin
HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title_full HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title_fullStr HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title_full_unstemmed HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title_short HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
title_sort hmd-arg: hierarchical multi-task deep learning for annotating antibiotic resistance genes
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7871585/
https://www.ncbi.nlm.nih.gov/pubmed/33557954
http://dx.doi.org/10.1186/s40168-021-01002-3
work_keys_str_mv AT liyu hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT xuzeling hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT hanwenkai hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT caohuiluo hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT umarovramzan hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT yanaixin hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT fanming hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT chenhuan hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT duartecarlosm hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT lilihua hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT hopakleung hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes
AT gaoxin hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes