Cargando…

Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction

MOTIVATION: Characterizing drug–protein interactions (DPIs) is crucial to the high-throughput screening for drug discovery. The deep learning-based approaches have attracted attention because they can predict DPIs without human trial and error. However, because data labeling requires significant res...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, QHwan, Ko, Joon-Hyuk, Kim, Sunghoon, Park, Nojun, Jhe, Wonho
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8545317/
https://www.ncbi.nlm.nih.gov/pubmed/33978713
http://dx.doi.org/10.1093/bioinformatics/btab346
_version_ 1784589990924124160
author Kim, QHwan
Ko, Joon-Hyuk
Kim, Sunghoon
Park, Nojun
Jhe, Wonho
author_facet Kim, QHwan
Ko, Joon-Hyuk
Kim, Sunghoon
Park, Nojun
Jhe, Wonho
author_sort Kim, QHwan
collection PubMed
description MOTIVATION: Characterizing drug–protein interactions (DPIs) is crucial to the high-throughput screening for drug discovery. The deep learning-based approaches have attracted attention because they can predict DPIs without human trial and error. However, because data labeling requires significant resources, the available protein data size is relatively small, which consequently decreases model performance. Here, we propose two methods to construct a deep learning framework that exhibits superior performance with a small labeled dataset. RESULTS: At first, we use transfer learning in encoding protein sequences with a pretrained model, which trains general sequence representations in an unsupervised manner. Second, we use a Bayesian neural network to make a robust model by estimating the data uncertainty. Our resulting model performs better than the previous baselines at predicting interactions between molecules and proteins. We also show that the quantified uncertainty from the Bayesian inference is related to confidence and can be used for screening DPI data points. AVAILABILITY AND IMPLEMENTATION: The code is available at https://github.com/QHwan/PretrainDPI. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-8545317
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-85453172021-10-26 Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction Kim, QHwan Ko, Joon-Hyuk Kim, Sunghoon Park, Nojun Jhe, Wonho Bioinformatics Original Papers MOTIVATION: Characterizing drug–protein interactions (DPIs) is crucial to the high-throughput screening for drug discovery. The deep learning-based approaches have attracted attention because they can predict DPIs without human trial and error. However, because data labeling requires significant resources, the available protein data size is relatively small, which consequently decreases model performance. Here, we propose two methods to construct a deep learning framework that exhibits superior performance with a small labeled dataset. RESULTS: At first, we use transfer learning in encoding protein sequences with a pretrained model, which trains general sequence representations in an unsupervised manner. Second, we use a Bayesian neural network to make a robust model by estimating the data uncertainty. Our resulting model performs better than the previous baselines at predicting interactions between molecules and proteins. We also show that the quantified uncertainty from the Bayesian inference is related to confidence and can be used for screening DPI data points. AVAILABILITY AND IMPLEMENTATION: The code is available at https://github.com/QHwan/PretrainDPI. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online. Oxford University Press 2021-05-12 /pmc/articles/PMC8545317/ /pubmed/33978713 http://dx.doi.org/10.1093/bioinformatics/btab346 Text en © The Author(s) 2021. Published by Oxford University Press. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Original Papers
Kim, QHwan
Ko, Joon-Hyuk
Kim, Sunghoon
Park, Nojun
Jhe, Wonho
Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction
title Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction
title_full Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction
title_fullStr Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction
title_full_unstemmed Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction
title_short Bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction
title_sort bayesian neural network with pretrained protein embedding enhances prediction accuracy of drug-protein interaction
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8545317/
https://www.ncbi.nlm.nih.gov/pubmed/33978713
http://dx.doi.org/10.1093/bioinformatics/btab346
work_keys_str_mv AT kimqhwan bayesianneuralnetworkwithpretrainedproteinembeddingenhancespredictionaccuracyofdrugproteininteraction
AT kojoonhyuk bayesianneuralnetworkwithpretrainedproteinembeddingenhancespredictionaccuracyofdrugproteininteraction
AT kimsunghoon bayesianneuralnetworkwithpretrainedproteinembeddingenhancespredictionaccuracyofdrugproteininteraction
AT parknojun bayesianneuralnetworkwithpretrainedproteinembeddingenhancespredictionaccuracyofdrugproteininteraction
AT jhewonho bayesianneuralnetworkwithpretrainedproteinembeddingenhancespredictionaccuracyofdrugproteininteraction