Cargando…

Integrating transformer and imbalanced multi-label learning to identify antimicrobial peptides and their functional activities

MOTIVATION: Antimicrobial peptides (AMPs) have the potential to inhibit multiple types of pathogens and to heal infections. Computational strategies can assist in characterizing novel AMPs from proteome or collections of synthetic sequences and discovering their functional abilities toward different...

Descripción completa

Detalles Bibliográficos
Autores principales: Pang, Yuxuan, Yao, Lantian, Xu, Jingyi, Wang, Zhuo, Lee, Tzong-Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9750108/
https://www.ncbi.nlm.nih.gov/pubmed/36326438
http://dx.doi.org/10.1093/bioinformatics/btac711
_version_ 1784850180186570752
author Pang, Yuxuan
Yao, Lantian
Xu, Jingyi
Wang, Zhuo
Lee, Tzong-Yi
author_facet Pang, Yuxuan
Yao, Lantian
Xu, Jingyi
Wang, Zhuo
Lee, Tzong-Yi
author_sort Pang, Yuxuan
collection PubMed
description MOTIVATION: Antimicrobial peptides (AMPs) have the potential to inhibit multiple types of pathogens and to heal infections. Computational strategies can assist in characterizing novel AMPs from proteome or collections of synthetic sequences and discovering their functional abilities toward different microbial targets without intensive labor. RESULTS: Here, we present a deep learning-based method for computer-aided novel AMP discovery that utilizes the transformer neural network architecture with knowledge from natural language processing to extract peptide sequence information. We implemented the method for two AMP-related tasks: the first is to discriminate AMPs from other peptides, and the second task is identifying AMPs functional activities related to seven different targets (gram-negative bacteria, gram-positive bacteria, fungi, viruses, cancer cells, parasites and mammalian cell inhibition), which is a multi-label problem. In addition, asymmetric loss was adopted to resolve the intrinsic imbalance of dataset, particularly for the multi-label scenarios. The evaluation showed that our proposed scheme achieves the best performance for the first task (96.85% balanced accuracy) and has a more unbiased prediction for the second task (79.83% balanced accuracy averaged across all functional activities) when compared with that of strategies without imbalanced learning or deep learning. AVAILABILITY AND IMPLEMENTATION: The source code and data of this study are available at https://github.com/BiOmicsLab/TransImbAMP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-9750108
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-97501082022-12-15 Integrating transformer and imbalanced multi-label learning to identify antimicrobial peptides and their functional activities Pang, Yuxuan Yao, Lantian Xu, Jingyi Wang, Zhuo Lee, Tzong-Yi Bioinformatics Original Paper MOTIVATION: Antimicrobial peptides (AMPs) have the potential to inhibit multiple types of pathogens and to heal infections. Computational strategies can assist in characterizing novel AMPs from proteome or collections of synthetic sequences and discovering their functional abilities toward different microbial targets without intensive labor. RESULTS: Here, we present a deep learning-based method for computer-aided novel AMP discovery that utilizes the transformer neural network architecture with knowledge from natural language processing to extract peptide sequence information. We implemented the method for two AMP-related tasks: the first is to discriminate AMPs from other peptides, and the second task is identifying AMPs functional activities related to seven different targets (gram-negative bacteria, gram-positive bacteria, fungi, viruses, cancer cells, parasites and mammalian cell inhibition), which is a multi-label problem. In addition, asymmetric loss was adopted to resolve the intrinsic imbalance of dataset, particularly for the multi-label scenarios. The evaluation showed that our proposed scheme achieves the best performance for the first task (96.85% balanced accuracy) and has a more unbiased prediction for the second task (79.83% balanced accuracy averaged across all functional activities) when compared with that of strategies without imbalanced learning or deep learning. AVAILABILITY AND IMPLEMENTATION: The source code and data of this study are available at https://github.com/BiOmicsLab/TransImbAMP. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2022-11-03 /pmc/articles/PMC9750108/ /pubmed/36326438 http://dx.doi.org/10.1093/bioinformatics/btac711 Text en © The Author(s) 2022. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
Pang, Yuxuan
Yao, Lantian
Xu, Jingyi
Wang, Zhuo
Lee, Tzong-Yi
Integrating transformer and imbalanced multi-label learning to identify antimicrobial peptides and their functional activities
title Integrating transformer and imbalanced multi-label learning to identify antimicrobial peptides and their functional activities
title_full Integrating transformer and imbalanced multi-label learning to identify antimicrobial peptides and their functional activities
title_fullStr Integrating transformer and imbalanced multi-label learning to identify antimicrobial peptides and their functional activities
title_full_unstemmed Integrating transformer and imbalanced multi-label learning to identify antimicrobial peptides and their functional activities
title_short Integrating transformer and imbalanced multi-label learning to identify antimicrobial peptides and their functional activities
title_sort integrating transformer and imbalanced multi-label learning to identify antimicrobial peptides and their functional activities
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9750108/
https://www.ncbi.nlm.nih.gov/pubmed/36326438
http://dx.doi.org/10.1093/bioinformatics/btac711
work_keys_str_mv AT pangyuxuan integratingtransformerandimbalancedmultilabellearningtoidentifyantimicrobialpeptidesandtheirfunctionalactivities
AT yaolantian integratingtransformerandimbalancedmultilabellearningtoidentifyantimicrobialpeptidesandtheirfunctionalactivities
AT xujingyi integratingtransformerandimbalancedmultilabellearningtoidentifyantimicrobialpeptidesandtheirfunctionalactivities
AT wangzhuo integratingtransformerandimbalancedmultilabellearningtoidentifyantimicrobialpeptidesandtheirfunctionalactivities
AT leetzongyi integratingtransformerandimbalancedmultilabellearningtoidentifyantimicrobialpeptidesandtheirfunctionalactivities