Cargando…
CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions
Compound-protein interaction (CPI) prediction is a foundational task for drug discovery, which process is time-consuming and costly. The effectiveness of CPI prediction can be greatly improved using deep learning methods to accelerate drug development. Large number of recent research results in the...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520300/ https://www.ncbi.nlm.nih.gov/pubmed/36188230 http://dx.doi.org/10.3389/fmolb.2022.963912 |
_version_ | 1784799592697561088 |
---|---|
author | Qian, Ying Wu, Jian Zhang, Qian |
author_facet | Qian, Ying Wu, Jian Zhang, Qian |
author_sort | Qian, Ying |
collection | PubMed |
description | Compound-protein interaction (CPI) prediction is a foundational task for drug discovery, which process is time-consuming and costly. The effectiveness of CPI prediction can be greatly improved using deep learning methods to accelerate drug development. Large number of recent research results in the field of computer vision, especially in deep learning, have proved that the position, geometry, spatial structure and other features of objects in an image can be well characterized. We propose a novel molecular image-based model named CAT-CPI (combining CNN and transformer to predict CPI) for CPI task. We use Convolution Neural Network (CNN) to learn local features of molecular images and then use transformer encoder to capture the semantic relationships of these features. To extract protein sequence feature, we propose to use a k-gram based method and obtain the semantic relationships of sub-sequences by transformer encoder. In addition, we build a Feature Relearning (FR) module to learn interaction features of compounds and proteins. We evaluated CAT-CPI on three benchmark datasets—Human, Celegans, and Davis—and the experimental results demonstrate that CAT-CPI presents competitive performance against state-of-the-art predictors. In addition, we carry out Drug-Drug Interaction (DDI) experiments to verify the strong potential of the methods based on molecular images and FR module. |
format | Online Article Text |
id | pubmed-9520300 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95203002022-09-30 CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions Qian, Ying Wu, Jian Zhang, Qian Front Mol Biosci Molecular Biosciences Compound-protein interaction (CPI) prediction is a foundational task for drug discovery, which process is time-consuming and costly. The effectiveness of CPI prediction can be greatly improved using deep learning methods to accelerate drug development. Large number of recent research results in the field of computer vision, especially in deep learning, have proved that the position, geometry, spatial structure and other features of objects in an image can be well characterized. We propose a novel molecular image-based model named CAT-CPI (combining CNN and transformer to predict CPI) for CPI task. We use Convolution Neural Network (CNN) to learn local features of molecular images and then use transformer encoder to capture the semantic relationships of these features. To extract protein sequence feature, we propose to use a k-gram based method and obtain the semantic relationships of sub-sequences by transformer encoder. In addition, we build a Feature Relearning (FR) module to learn interaction features of compounds and proteins. We evaluated CAT-CPI on three benchmark datasets—Human, Celegans, and Davis—and the experimental results demonstrate that CAT-CPI presents competitive performance against state-of-the-art predictors. In addition, we carry out Drug-Drug Interaction (DDI) experiments to verify the strong potential of the methods based on molecular images and FR module. Frontiers Media S.A. 2022-09-15 /pmc/articles/PMC9520300/ /pubmed/36188230 http://dx.doi.org/10.3389/fmolb.2022.963912 Text en Copyright © 2022 Qian, Wu and Zhang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Molecular Biosciences Qian, Ying Wu, Jian Zhang, Qian CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions |
title | CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions |
title_full | CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions |
title_fullStr | CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions |
title_full_unstemmed | CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions |
title_short | CAT-CPI: Combining CNN and transformer to learn compound image features for predicting compound-protein interactions |
title_sort | cat-cpi: combining cnn and transformer to learn compound image features for predicting compound-protein interactions |
topic | Molecular Biosciences |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9520300/ https://www.ncbi.nlm.nih.gov/pubmed/36188230 http://dx.doi.org/10.3389/fmolb.2022.963912 |
work_keys_str_mv | AT qianying catcpicombiningcnnandtransformertolearncompoundimagefeaturesforpredictingcompoundproteininteractions AT wujian catcpicombiningcnnandtransformertolearncompoundimagefeaturesforpredictingcompoundproteininteractions AT zhangqian catcpicombiningcnnandtransformertolearncompoundimagefeaturesforpredictingcompoundproteininteractions |