Cargando…
Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction
MOTIVATION: Characterizing drug–protein interactions (DPIs) is crucial to the high-throughput screening for drug discovery. The deep learning-based approaches have attracted attention because they can predict DPIs without human trial and error. However, because data labeling requires significant res...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8545317/ https://www.ncbi.nlm.nih.gov/pubmed/33978713 http://dx.doi.org/10.1093/bioinformatics/btab346 |
_version_ | 1784589990924124160 |
---|---|
author | Kim, QHwan Ko, Joon-Hyuk Kim, Sunghoon Park, Nojun Jhe, Wonho |
author_facet | Kim, QHwan Ko, Joon-Hyuk Kim, Sunghoon Park, Nojun Jhe, Wonho |
author_sort | Kim, QHwan |
collection | PubMed |
description | MOTIVATION: Characterizing drug–protein interactions (DPIs) is crucial to the high-throughput screening for drug discovery. The deep learning-based approaches have attracted attention because they can predict DPIs without human trial and error. However, because data labeling requires significant resources, the available protein data size is relatively small, which consequently decreases model performance. Here, we propose two methods to construct a deep learning framework that exhibits superior performance with a small labeled dataset. RESULTS: At first, we use transfer learning in encoding protein sequences with a pretrained model, which trains general sequence representations in an unsupervised manner. Second, we use a Bayesian neural network to make a robust model by estimating the data uncertainty. Our resulting model performs better than the previous baselines at predicting interactions between molecules and proteins. We also show that the quantified uncertainty from the Bayesian inference is related to confidence and can be used for screening DPI data points. AVAILABILITY AND IMPLEMENTATION: The code is available at https://github.com/QHwan/PretrainDPI. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-8545317 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-85453172021-10-26 Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction Kim, QHwan Ko, Joon-Hyuk Kim, Sunghoon Park, Nojun Jhe, Wonho Bioinformatics Original Papers MOTIVATION: Characterizing drug–protein interactions (DPIs) is crucial to the high-throughput screening for drug discovery. The deep learning-based approaches have attracted attention because they can predict DPIs without human trial and error. However, because data labeling requires significant resources, the available protein data size is relatively small, which consequently decreases model performance. Here, we propose two methods to construct a deep learning framework that exhibits superior performance with a small labeled dataset. RESULTS: At first, we use transfer learning in encoding protein sequences with a pretrained model, which trains general sequence representations in an unsupervised manner. Second, we use a Bayesian neural network to make a robust model by estimating the data uncertainty. Our resulting model performs better than the previous baselines at predicting interactions between molecules and proteins. We also show that the quantified uncertainty from the Bayesian inference is related to confidence and can be used for screening DPI data points. AVAILABILITY AND IMPLEMENTATION: The code is available at https://github.com/QHwan/PretrainDPI. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-05-12 /pmc/articles/PMC8545317/ /pubmed/33978713 http://dx.doi.org/10.1093/bioinformatics/btab346 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Original Papers Kim, QHwan Ko, Joon-Hyuk Kim, Sunghoon Park, Nojun Jhe, Wonho Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction |
title | Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction |
title_full | Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction |
title_fullStr | Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction |
title_full_unstemmed | Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction |
title_short | Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction |
title_sort | bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8545317/ https://www.ncbi.nlm.nih.gov/pubmed/33978713 http://dx.doi.org/10.1093/bioinformatics/btab346 |
work_keys_str_mv | AT kimqhwan bayesianneuralnetworkwithpretrainedproteinembeddingenhancespredictionaccuracyofdrugproteininteraction AT kojoonhyuk bayesianneuralnetworkwithpretrainedproteinembeddingenhancespredictionaccuracyofdrugproteininteraction AT kimsunghoon bayesianneuralnetworkwithpretrainedproteinembeddingenhancespredictionaccuracyofdrugproteininteraction AT parknojun bayesianneuralnetworkwithpretrainedproteinembeddingenhancespredictionaccuracyofdrugproteininteraction AT jhewonho bayesianneuralnetworkwithpretrainedproteinembeddingenhancespredictionaccuracyofdrugproteininteraction |