Cargando…

A review on compound-protein interaction prediction methods: Data, format, representation and model

There has recently been a rapid progress in computational methods for determining protein targets of small molecule drugs, which will be termed as compound protein interaction (CPI). In this review, we comprehensively review topics related to computational prediction of CPI. Data for CPI has been ac...

Descripción completa

Detalles Bibliográficos
Autores principales: Lim, Sangsoo, Lu, Yijingxiu, Cho, Chang Yun, Sung, Inyoung, Kim, Jungwoo, Kim, Youngkuk, Park, Sungjoon, Kim, Sun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008185/
https://www.ncbi.nlm.nih.gov/pubmed/33841755
http://dx.doi.org/10.1016/j.csbj.2021.03.004
_version_ 1783672647913046016
author Lim, Sangsoo
Lu, Yijingxiu
Cho, Chang Yun
Sung, Inyoung
Kim, Jungwoo
Kim, Youngkuk
Park, Sungjoon
Kim, Sun
author_facet Lim, Sangsoo
Lu, Yijingxiu
Cho, Chang Yun
Sung, Inyoung
Kim, Jungwoo
Kim, Youngkuk
Park, Sungjoon
Kim, Sun
author_sort Lim, Sangsoo
collection PubMed
description There has recently been a rapid progress in computational methods for determining protein targets of small molecule drugs, which will be termed as compound protein interaction (CPI). In this review, we comprehensively review topics related to computational prediction of CPI. Data for CPI has been accumulated and curated significantly both in quantity and quality. Computational methods have become powerful ever to analyze such complex the data. Thus, recent successes in the improved quality of CPI prediction are due to use of both sophisticated computational techniques and higher quality information in the databases. The goal of this article is to provide reviews of topics related to CPI, such as data, format, representation, to computational models, so that researchers can take full advantages of these resources to develop novel prediction methods. Chemical compounds and protein data from various resources were discussed in terms of data formats and encoding schemes. For the CPI methods, we grouped prediction methods into five categories from traditional machine learning techniques to state-of-the-art deep learning techniques. In closing, we discussed emerging machine learning topics to help both experimental and computational scientists leverage the current knowledge and strategies to develop more powerful and accurate CPI prediction methods.
format Online
Article
Text
id pubmed-8008185
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-80081852021-04-08 A review on compound-protein interaction prediction methods: Data, format, representation and model Lim, Sangsoo Lu, Yijingxiu Cho, Chang Yun Sung, Inyoung Kim, Jungwoo Kim, Youngkuk Park, Sungjoon Kim, Sun Comput Struct Biotechnol J Review Article There has recently been a rapid progress in computational methods for determining protein targets of small molecule drugs, which will be termed as compound protein interaction (CPI). In this review, we comprehensively review topics related to computational prediction of CPI. Data for CPI has been accumulated and curated significantly both in quantity and quality. Computational methods have become powerful ever to analyze such complex the data. Thus, recent successes in the improved quality of CPI prediction are due to use of both sophisticated computational techniques and higher quality information in the databases. The goal of this article is to provide reviews of topics related to CPI, such as data, format, representation, to computational models, so that researchers can take full advantages of these resources to develop novel prediction methods. Chemical compounds and protein data from various resources were discussed in terms of data formats and encoding schemes. For the CPI methods, we grouped prediction methods into five categories from traditional machine learning techniques to state-of-the-art deep learning techniques. In closing, we discussed emerging machine learning topics to help both experimental and computational scientists leverage the current knowledge and strategies to develop more powerful and accurate CPI prediction methods. Research Network of Computational and Structural Biotechnology 2021-03-10 /pmc/articles/PMC8008185/ /pubmed/33841755 http://dx.doi.org/10.1016/j.csbj.2021.03.004 Text en © 2021 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Review Article
Lim, Sangsoo
Lu, Yijingxiu
Cho, Chang Yun
Sung, Inyoung
Kim, Jungwoo
Kim, Youngkuk
Park, Sungjoon
Kim, Sun
A review on compound-protein interaction prediction methods: Data, format, representation and model
title A review on compound-protein interaction prediction methods: Data, format, representation and model
title_full A review on compound-protein interaction prediction methods: Data, format, representation and model
title_fullStr A review on compound-protein interaction prediction methods: Data, format, representation and model
title_full_unstemmed A review on compound-protein interaction prediction methods: Data, format, representation and model
title_short A review on compound-protein interaction prediction methods: Data, format, representation and model
title_sort review on compound-protein interaction prediction methods: data, format, representation and model
topic Review Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008185/
https://www.ncbi.nlm.nih.gov/pubmed/33841755
http://dx.doi.org/10.1016/j.csbj.2021.03.004
work_keys_str_mv AT limsangsoo areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT luyijingxiu areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT chochangyun areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT sunginyoung areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT kimjungwoo areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT kimyoungkuk areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT parksungjoon areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT kimsun areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT limsangsoo reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT luyijingxiu reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT chochangyun reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT sunginyoung reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT kimjungwoo reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT kimyoungkuk reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT parksungjoon reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel
AT kimsun reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel