Cargando…
HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes
BACKGROUND: The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate t...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7871585/ https://www.ncbi.nlm.nih.gov/pubmed/33557954 http://dx.doi.org/10.1186/s40168-021-01002-3 |
_version_ | 1783649036399542272 |
---|---|
author | Li, Yu Xu, Zeling Han, Wenkai Cao, Huiluo Umarov, Ramzan Yan, Aixin Fan, Ming Chen, Huan Duarte, Carlos M. Li, Lihua Ho, Pak-Leung Gao, Xin |
author_facet | Li, Yu Xu, Zeling Han, Wenkai Cao, Huiluo Umarov, Ramzan Yan, Aixin Fan, Ming Chen, Huan Duarte, Carlos M. Li, Lihua Ho, Pak-Leung Gao, Xin |
author_sort | Li, Yu |
collection | PubMed |
description | BACKGROUND: The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate the efficacy of antibiotics. Accurately identifying ARGs is thus an indispensable step to understanding the ecology, and transmission of ARGs between environmental and human-associated reservoirs. Unfortunately, the previous computational methods for identifying ARGs are mostly based on sequence alignment, which cannot identify novel ARGs, and their applications are limited by currently incomplete knowledge about ARGs. RESULTS: Here, we propose an end-to-end Hierarchical Multi-task Deep learning framework for ARG annotation (HMD-ARG). Taking raw sequence encoding as input, HMD-ARG can identify, without querying against existing sequence databases, multiple ARG properties simultaneously, including if the input protein sequence is an ARG, and if so, what antibiotic family it is resistant to, what resistant mechanism the ARG takes, and if the ARG is an intrinsic one or acquired one. In addition, if the predicted antibiotic family is beta-lactamase, HMD-ARG further predicts the subclass of beta-lactamase that the ARG is resistant to. Comprehensive experiments, including cross-fold validation, third-party dataset validation in human gut microbiota, wet-experimental functional validation, and structural investigation of predicted conserved sites, demonstrate not only the superior performance of our method over the state-of-art methods, but also the effectiveness and robustness of the proposed method. CONCLUSIONS: We propose a hierarchical multi-task method, HMD-ARG, which is based on deep learning and can provide detailed annotations of ARGs from three important aspects: resistant antibiotic class, resistant mechanism, and gene mobility. We believe that HMD-ARG can serve as a powerful tool to identify antibiotic resistance genes and, therefore mitigate their global threat. Our method and the constructed database are available at http://www.cbrc.kaust.edu.sa/HMDARG/. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01002-3. |
format | Online Article Text |
id | pubmed-7871585 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-78715852021-02-09 HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes Li, Yu Xu, Zeling Han, Wenkai Cao, Huiluo Umarov, Ramzan Yan, Aixin Fan, Ming Chen, Huan Duarte, Carlos M. Li, Lihua Ho, Pak-Leung Gao, Xin Microbiome Methodology BACKGROUND: The spread of antibiotic resistance has become one of the most urgent threats to global health, which is estimated to cause 700,000 deaths each year globally. Its surrogates, antibiotic resistance genes (ARGs), are highly transmittable between food, water, animal, and human to mitigate the efficacy of antibiotics. Accurately identifying ARGs is thus an indispensable step to understanding the ecology, and transmission of ARGs between environmental and human-associated reservoirs. Unfortunately, the previous computational methods for identifying ARGs are mostly based on sequence alignment, which cannot identify novel ARGs, and their applications are limited by currently incomplete knowledge about ARGs. RESULTS: Here, we propose an end-to-end Hierarchical Multi-task Deep learning framework for ARG annotation (HMD-ARG). Taking raw sequence encoding as input, HMD-ARG can identify, without querying against existing sequence databases, multiple ARG properties simultaneously, including if the input protein sequence is an ARG, and if so, what antibiotic family it is resistant to, what resistant mechanism the ARG takes, and if the ARG is an intrinsic one or acquired one. In addition, if the predicted antibiotic family is beta-lactamase, HMD-ARG further predicts the subclass of beta-lactamase that the ARG is resistant to. Comprehensive experiments, including cross-fold validation, third-party dataset validation in human gut microbiota, wet-experimental functional validation, and structural investigation of predicted conserved sites, demonstrate not only the superior performance of our method over the state-of-art methods, but also the effectiveness and robustness of the proposed method. CONCLUSIONS: We propose a hierarchical multi-task method, HMD-ARG, which is based on deep learning and can provide detailed annotations of ARGs from three important aspects: resistant antibiotic class, resistant mechanism, and gene mobility. We believe that HMD-ARG can serve as a powerful tool to identify antibiotic resistance genes and, therefore mitigate their global threat. Our method and the constructed database are available at http://www.cbrc.kaust.edu.sa/HMDARG/. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01002-3. BioMed Central 2021-02-08 /pmc/articles/PMC7871585/ /pubmed/33557954 http://dx.doi.org/10.1186/s40168-021-01002-3 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Methodology Li, Yu Xu, Zeling Han, Wenkai Cao, Huiluo Umarov, Ramzan Yan, Aixin Fan, Ming Chen, Huan Duarte, Carlos M. Li, Lihua Ho, Pak-Leung Gao, Xin HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title | HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title_full | HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title_fullStr | HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title_full_unstemmed | HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title_short | HMD-ARG: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
title_sort | hmd-arg: hierarchical multi-task deep learning for annotating antibiotic resistance genes |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7871585/ https://www.ncbi.nlm.nih.gov/pubmed/33557954 http://dx.doi.org/10.1186/s40168-021-01002-3 |
work_keys_str_mv | AT liyu hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT xuzeling hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT hanwenkai hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT caohuiluo hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT umarovramzan hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT yanaixin hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT fanming hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT chenhuan hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT duartecarlosm hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT lilihua hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT hopakleung hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes AT gaoxin hmdarghierarchicalmultitaskdeeplearningforannotatingantibioticresistancegenes |