Cargando…

A novel hybrid framework for metabolic pathways prediction based on the graph attention network

BACKGROUND: Making clear what kinds of metabolic pathways a drug compound involves in can help researchers understand how the drug is absorbed, distributed, metabolized, and excreted. The characteristics of a compound such as structure, composition and so on directly determine the metabolic pathways...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Zhihui, Liu, Juan, Shah, Hayat Ali, Feng, Jing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520805/
https://www.ncbi.nlm.nih.gov/pubmed/36171550
http://dx.doi.org/10.1186/s12859-022-04856-y
_version_ 1784799708098592768
author Yang, Zhihui
Liu, Juan
Shah, Hayat Ali
Feng, Jing
author_facet Yang, Zhihui
Liu, Juan
Shah, Hayat Ali
Feng, Jing
author_sort Yang, Zhihui
collection PubMed
description BACKGROUND: Making clear what kinds of metabolic pathways a drug compound involves in can help researchers understand how the drug is absorbed, distributed, metabolized, and excreted. The characteristics of a compound such as structure, composition and so on directly determine the metabolic pathways it participates in. METHODS: We developed a novel hybrid framework based on the graph attention network (GAT) to predict the metabolic pathway classes that a compound involves in, named HFGAT, by making use of its global and local characteristics. The framework mainly consists of a two-branch feature extracting layer and a fully connected (FC) layer. In the two-branch feature extracting layer, one branch is responsible to extract global features of the compound; and the other branch introduces a GAT consisting of two graph attention layers to extract local structural features of the compound. Both the global and the local features of the compound are then integrated into the FC layer which outputs the predicted result of metabolic pathway categories that the compound belongs to. RESULTS: We compared the multi-class classification performance of HFGAT with six other representative methods, including five classic machine learning methods and one graph convolutional network (GCN) based deep learning method, on the benchmark dataset containing 6999 compounds belonging to 11 pathway categories. The results showed that the deep learning-based methods (HFGAT, GCN-based method) outperformed the traditional machine learning methods in the prediction of metabolic pathways and our proposed HFGAT method performed better than the GCN-based method. Moreover, HFGAT achieved higher [Formula: see text] scores on 8 of 11 classes than the GCN-based method. CONCLUSIONS: Our proposed HFGAT makes use of both the global and local information of the compounds to predict their metabolic pathway categories and has achieved a significant performance. Compared with the GCN model, the introduction of the GAT can help our model pay more attention to substructures of the compound that are useful for the prediction task. The study provided a potential method for drug discovery with all types of metabolic reactions that may be involved in the decomposition and synthesis of pharmaceutical compounds in the organism.
format Online
Article
Text
id pubmed-9520805
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-95208052022-09-30 A novel hybrid framework for metabolic pathways prediction based on the graph attention network Yang, Zhihui Liu, Juan Shah, Hayat Ali Feng, Jing BMC Bioinformatics Methodology BACKGROUND: Making clear what kinds of metabolic pathways a drug compound involves in can help researchers understand how the drug is absorbed, distributed, metabolized, and excreted. The characteristics of a compound such as structure, composition and so on directly determine the metabolic pathways it participates in. METHODS: We developed a novel hybrid framework based on the graph attention network (GAT) to predict the metabolic pathway classes that a compound involves in, named HFGAT, by making use of its global and local characteristics. The framework mainly consists of a two-branch feature extracting layer and a fully connected (FC) layer. In the two-branch feature extracting layer, one branch is responsible to extract global features of the compound; and the other branch introduces a GAT consisting of two graph attention layers to extract local structural features of the compound. Both the global and the local features of the compound are then integrated into the FC layer which outputs the predicted result of metabolic pathway categories that the compound belongs to. RESULTS: We compared the multi-class classification performance of HFGAT with six other representative methods, including five classic machine learning methods and one graph convolutional network (GCN) based deep learning method, on the benchmark dataset containing 6999 compounds belonging to 11 pathway categories. The results showed that the deep learning-based methods (HFGAT, GCN-based method) outperformed the traditional machine learning methods in the prediction of metabolic pathways and our proposed HFGAT method performed better than the GCN-based method. Moreover, HFGAT achieved higher [Formula: see text] scores on 8 of 11 classes than the GCN-based method. CONCLUSIONS: Our proposed HFGAT makes use of both the global and local information of the compounds to predict their metabolic pathway categories and has achieved a significant performance. Compared with the GCN model, the introduction of the GAT can help our model pay more attention to substructures of the compound that are useful for the prediction task. The study provided a potential method for drug discovery with all types of metabolic reactions that may be involved in the decomposition and synthesis of pharmaceutical compounds in the organism. BioMed Central 2022-09-28 /pmc/articles/PMC9520805/ /pubmed/36171550 http://dx.doi.org/10.1186/s12859-022-04856-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Yang, Zhihui
Liu, Juan
Shah, Hayat Ali
Feng, Jing
A novel hybrid framework for metabolic pathways prediction based on the graph attention network
title A novel hybrid framework for metabolic pathways prediction based on the graph attention network
title_full A novel hybrid framework for metabolic pathways prediction based on the graph attention network
title_fullStr A novel hybrid framework for metabolic pathways prediction based on the graph attention network
title_full_unstemmed A novel hybrid framework for metabolic pathways prediction based on the graph attention network
title_short A novel hybrid framework for metabolic pathways prediction based on the graph attention network
title_sort novel hybrid framework for metabolic pathways prediction based on the graph attention network
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520805/
https://www.ncbi.nlm.nih.gov/pubmed/36171550
http://dx.doi.org/10.1186/s12859-022-04856-y
work_keys_str_mv AT yangzhihui anovelhybridframeworkformetabolicpathwayspredictionbasedonthegraphattentionnetwork
AT liujuan anovelhybridframeworkformetabolicpathwayspredictionbasedonthegraphattentionnetwork
AT shahhayatali anovelhybridframeworkformetabolicpathwayspredictionbasedonthegraphattentionnetwork
AT fengjing anovelhybridframeworkformetabolicpathwayspredictionbasedonthegraphattentionnetwork
AT yangzhihui novelhybridframeworkformetabolicpathwayspredictionbasedonthegraphattentionnetwork
AT liujuan novelhybridframeworkformetabolicpathwayspredictionbasedonthegraphattentionnetwork
AT shahhayatali novelhybridframeworkformetabolicpathwayspredictionbasedonthegraphattentionnetwork
AT fengjing novelhybridframeworkformetabolicpathwayspredictionbasedonthegraphattentionnetwork