Cargando…

CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions

Compound-protein interaction (CPI) prediction is a foundational task for drug discovery, which process is time-consuming and costly. The effectiveness of CPI prediction can be greatly improved using deep learning methods to accelerate drug development. Large number of recent research results in the...

Descripción completa

Detalles Bibliográficos
Autores principales: Qian, Ying, Wu, Jian, Zhang, Qian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520300/
https://www.ncbi.nlm.nih.gov/pubmed/36188230
http://dx.doi.org/10.3389/fmolb.2022.963912
_version_ 1784799592697561088
author Qian, Ying
Wu, Jian
Zhang, Qian
author_facet Qian, Ying
Wu, Jian
Zhang, Qian
author_sort Qian, Ying
collection PubMed
description Compound-protein interaction (CPI) prediction is a foundational task for drug discovery, which process is time-consuming and costly. The effectiveness of CPI prediction can be greatly improved using deep learning methods to accelerate drug development. Large number of recent research results in the field of computer vision, especially in deep learning, have proved that the position, geometry, spatial structure and other features of objects in an image can be well characterized. We propose a novel molecular image-based model named CAT-CPI (combining CNN and transformer to predict CPI) for CPI task. We use Convolution Neural Network (CNN) to learn local features of molecular images and then use transformer encoder to capture the semantic relationships of these features. To extract protein sequence feature, we propose to use a k-gram based method and obtain the semantic relationships of sub-sequences by transformer encoder. In addition, we build a Feature Relearning (FR) module to learn interaction features of compounds and proteins. We evaluated CAT-CPI on three benchmark datasets—Human, Celegans, and Davis—and the experimental results demonstrate that CAT-CPI presents competitive performance against state-of-the-art predictors. In addition, we carry out Drug-Drug Interaction (DDI) experiments to verify the strong potential of the methods based on molecular images and FR module.
format Online
Article
Text
id pubmed-9520300
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95203002022-09-30 CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions Qian, Ying Wu, Jian Zhang, Qian Front Mol Biosci Molecular Biosciences Compound-protein interaction (CPI) prediction is a foundational task for drug discovery, which process is time-consuming and costly. The effectiveness of CPI prediction can be greatly improved using deep learning methods to accelerate drug development. Large number of recent research results in the field of computer vision, especially in deep learning, have proved that the position, geometry, spatial structure and other features of objects in an image can be well characterized. We propose a novel molecular image-based model named CAT-CPI (combining CNN and transformer to predict CPI) for CPI task. We use Convolution Neural Network (CNN) to learn local features of molecular images and then use transformer encoder to capture the semantic relationships of these features. To extract protein sequence feature, we propose to use a k-gram based method and obtain the semantic relationships of sub-sequences by transformer encoder. In addition, we build a Feature Relearning (FR) module to learn interaction features of compounds and proteins. We evaluated CAT-CPI on three benchmark datasets—Human, Celegans, and Davis—and the experimental results demonstrate that CAT-CPI presents competitive performance against state-of-the-art predictors. In addition, we carry out Drug-Drug Interaction (DDI) experiments to verify the strong potential of the methods based on molecular images and FR module. Frontiers Media S.A. 2022-09-15 /pmc/articles/PMC9520300/ /pubmed/36188230 http://dx.doi.org/10.3389/fmolb.2022.963912 Text en Copyright © 2022 Qian, Wu and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Molecular Biosciences
Qian, Ying
Wu, Jian
Zhang, Qian
CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions
title CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions
title_full CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions
title_fullStr CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions
title_full_unstemmed CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions
title_short CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions
title_sort cat-cpi: combining cnn and transformer to learn compound image features for predicting compound-protein interactions
topic Molecular Biosciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520300/
https://www.ncbi.nlm.nih.gov/pubmed/36188230
http://dx.doi.org/10.3389/fmolb.2022.963912
work_keys_str_mv AT qianying catcpicombiningcnnandtransformertolearncompoundimagefeaturesforpredictingcompoundproteininteractions
AT wujian catcpicombiningcnnandtransformertolearncompoundimagefeaturesforpredictingcompoundproteininteractions
AT zhangqian catcpicombiningcnnandtransformertolearncompoundimagefeaturesforpredictingcompoundproteininteractions