DeepBindPoc: a deep learning method to rank ligand binding pockets using molecular vector representation

Accurate identification of ligand-binding pockets in a protein is important for structure-based drug design. In recent years, several deep learning models were developed to learn important physical–chemical and spatial information to predict ligand-binding pockets in a protein. However, ranking the...

Bibliographic Details
Main Authors: Zhang, Haiping, Saravanan, Konda Mani, Lin, Jinzhi, Liao, Linbu, Ng, Justin Tze-Yang, Zhou, Jiaxiu, Wei, Yanjie
Format: Online Article Text
Language: English
Published: PeerJ Inc. 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7144620/
https://www.ncbi.nlm.nih.gov/pubmed/32292649
http://dx.doi.org/10.7717/peerj.8864
_version_ 1783519871084003328
author Zhang, Haiping
Saravanan, Konda Mani
Lin, Jinzhi
Liao, Linbu
Ng, Justin Tze-Yang
Zhou, Jiaxiu
Wei, Yanjie
author_sort Zhang, Haiping
collection PubMed
description Accurate identification of ligand-binding pockets in a protein is important for structure-based drug design. In recent years, several deep learning models have been developed to learn physicochemical and spatial information for predicting ligand-binding pockets in a protein. However, ranking the native ligand-binding pockets from a pool of predicted pockets is still a hard task for computational molecular biologists using a single web-based tool. Hence, we believe that an enhanced model for identifying accurate pockets can be obtained by training on a data set closer to real applications and by providing ligand information. In this article, we propose a new deep learning method called DeepBindPoc for identifying and ranking ligand-binding pockets in proteins. The model is built using information about the binding pocket and the associated ligand. We take advantage of the mol2vec tool to represent both the given ligand and the pocket as vectors, which are used to construct a model of densely (fully) connected layers. During training, important features for pocket-ligand binding are automatically extracted and high-level information is preserved appropriately. DeepBindPoc demonstrated a strong complementary advantage for the detection of native-like pockets when combined with popular traditional methods, such as fpocket and P2Rank. The proposed method is extensively tested and validated with standard procedures on multiple datasets, including a dataset of G-protein-coupled receptors. The systematic testing and validation of our method suggest that DeepBindPoc is a valuable tool for ranking near-native pockets in theoretically modeled proteins whose active site is experimentally unknown but whose ligand is known. The DeepBindPoc model described in this article is available at GitHub (https://github.com/haiping1010/DeepBindPoc) and the webserver is available at http://cbblab.siat.ac.cn/DeepBindPoc/index.php.
format Online
Article
Text
id pubmed-7144620
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-71446202020-04-14 DeepBindPoc: a deep learning method to rank ligand binding pockets using molecular vector representation Zhang, Haiping Saravanan, Konda Mani Lin, Jinzhi Liao, Linbu Ng, Justin Tze-Yang Zhou, Jiaxiu Wei, Yanjie PeerJ Bioinformatics Accurate identification of ligand-binding pockets in a protein is important for structure-based drug design. In recent years, several deep learning models were developed to learn important physical–chemical and spatial information to predict ligand-binding pockets in a protein. However, ranking the native ligand binding pockets from a pool of predicted pockets is still a hard task for computational molecular biologists using a single web-based tool. Hence, we believe, by using closer to real application data set as training and by providing ligand information, an enhanced model to identify accurate pockets can be obtained. In this article, we propose a new deep learning method called DeepBindPoc for identifying and ranking ligand-binding pockets in proteins. The model is built by using information about the binding pocket and associated ligand. We take advantage of the mol2vec tool to represent both the given ligand and pocket as vectors to construct a densely fully connected layer model. During the training, important features for pocket-ligand binding are automatically extracted and high-level information is preserved appropriately. DeepBindPoc demonstrated a strong complementary advantage for the detection of native-like pockets when combined with traditional popular methods, such as fpocket and P2Rank. The proposed method is extensively tested and validated with standard procedures on multiple datasets, including a dataset with G-protein Coupled receptors. The systematic testing and validation of our method suggest that DeepBindPoc is a valuable tool to rank near-native pockets for theoretically modeled protein with unknown experimental active site but have known ligand. 
The DeepBindPoc model described in this article is available at GitHub (https://github.com/haiping1010/DeepBindPoc) and the webserver is available at (http://cbblab.siat.ac.cn/DeepBindPoc/index.php). PeerJ Inc. 2020-04-06 /pmc/articles/PMC7144620/ /pubmed/32292649 http://dx.doi.org/10.7717/peerj.8864 Text en © 2020 Zhang et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
title DeepBindPoc: a deep learning method to rank ligand binding pockets using molecular vector representation
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7144620/
https://www.ncbi.nlm.nih.gov/pubmed/32292649
http://dx.doi.org/10.7717/peerj.8864
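
The abstract describes representing both the ligand and each candidate pocket as vectors, feeding them to a fully connected network, and using the output to rank pockets. The following is a minimal sketch of that idea only, not the authors' implementation. Everything in it is an assumption made for illustration: the vectors are random placeholders rather than mol2vec embeddings, the weights are untrained, and the vector size, layer sizes, activations, and function names (score_pocket, rank_pockets) are hypothetical.

```python
import math
import random


def relu(v):
    return max(0.0, v)


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def dense(x, weights, bias, activation):
    """One fully connected layer: activation(W @ x + b)."""
    return [
        activation(sum(w * xi for w, xi in zip(row, x)) + b)
        for row, b in zip(weights, bias)
    ]


def init_layer(rng, n_in, n_out):
    """Random, UNTRAINED weights; a real model would learn these."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)]
               for _ in range(n_out)]
    return weights, [0.0] * n_out


def score_pocket(ligand_vec, pocket_vec, layers):
    """Concatenate ligand and pocket vectors and pass them through the net."""
    x = list(ligand_vec) + list(pocket_vec)
    for weights, bias in layers[:-1]:
        x = dense(x, weights, bias, relu)
    weights, bias = layers[-1]
    # Single sigmoid output, read here as a "native-likeness" score in (0, 1).
    return dense(x, weights, bias, sigmoid)[0]


def rank_pockets(ligand_vec, pockets, layers):
    """Return (pocket name, score) pairs sorted from best to worst score."""
    scored = [(name, score_pocket(ligand_vec, vec, layers))
              for name, vec in pockets.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


rng = random.Random(0)
DIM = 8  # hypothetical vector size, chosen small for readability

# Hypothetical two-layer network: (ligand + pocket) -> 16 hidden units -> 1 score.
layers = [init_layer(rng, 2 * DIM, 16), init_layer(rng, 16, 1)]

# Placeholder vectors standing in for embeddings of one ligand and
# four candidate pockets (for example, pockets proposed by another tool).
ligand = [rng.gauss(0.0, 1.0) for _ in range(DIM)]
pockets = {f"pocket_{i}": [rng.gauss(0.0, 1.0) for _ in range(DIM)]
           for i in range(1, 5)}

ranking = rank_pockets(ligand, pockets, layers)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Because the weights are random, the printed order carries no meaning. The sketch only shows the data flow: one ligand vector is paired with each candidate pocket vector, each pair receives a score, and the candidates are sorted by that score.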