Cargando…
A review on compound-protein interaction prediction methods: Data, format, representation and model
There has recently been a rapid progress in computational methods for determining protein targets of small molecule drugs, which will be termed as compound protein interaction (CPI). In this review, we comprehensively review topics related to computational prediction of CPI. Data for CPI has been ac...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008185/ https://www.ncbi.nlm.nih.gov/pubmed/33841755 http://dx.doi.org/10.1016/j.csbj.2021.03.004 |
_version_ | 1783672647913046016 |
---|---|
author | Lim, Sangsoo Lu, Yijingxiu Cho, Chang Yun Sung, Inyoung Kim, Jungwoo Kim, Youngkuk Park, Sungjoon Kim, Sun |
author_facet | Lim, Sangsoo Lu, Yijingxiu Cho, Chang Yun Sung, Inyoung Kim, Jungwoo Kim, Youngkuk Park, Sungjoon Kim, Sun |
author_sort | Lim, Sangsoo |
collection | PubMed |
description | There has recently been a rapid progress in computational methods for determining protein targets of small molecule drugs, which will be termed as compound protein interaction (CPI). In this review, we comprehensively review topics related to computational prediction of CPI. Data for CPI has been accumulated and curated significantly both in quantity and quality. Computational methods have become powerful ever to analyze such complex the data. Thus, recent successes in the improved quality of CPI prediction are due to use of both sophisticated computational techniques and higher quality information in the databases. The goal of this article is to provide reviews of topics related to CPI, such as data, format, representation, to computational models, so that researchers can take full advantages of these resources to develop novel prediction methods. Chemical compounds and protein data from various resources were discussed in terms of data formats and encoding schemes. For the CPI methods, we grouped prediction methods into five categories from traditional machine learning techniques to state-of-the-art deep learning techniques. In closing, we discussed emerging machine learning topics to help both experimental and computational scientists leverage the current knowledge and strategies to develop more powerful and accurate CPI prediction methods. |
format | Online Article Text |
id | pubmed-8008185 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-80081852021-04-08 A review on compound-protein interaction prediction methods: Data, format, representation and model Lim, Sangsoo Lu, Yijingxiu Cho, Chang Yun Sung, Inyoung Kim, Jungwoo Kim, Youngkuk Park, Sungjoon Kim, Sun Comput Struct Biotechnol J Review Article There has recently been a rapid progress in computational methods for determining protein targets of small molecule drugs, which will be termed as compound protein interaction (CPI). In this review, we comprehensively review topics related to computational prediction of CPI. Data for CPI has been accumulated and curated significantly both in quantity and quality. Computational methods have become powerful ever to analyze such complex the data. Thus, recent successes in the improved quality of CPI prediction are due to use of both sophisticated computational techniques and higher quality information in the databases. The goal of this article is to provide reviews of topics related to CPI, such as data, format, representation, to computational models, so that researchers can take full advantages of these resources to develop novel prediction methods. Chemical compounds and protein data from various resources were discussed in terms of data formats and encoding schemes. For the CPI methods, we grouped prediction methods into five categories from traditional machine learning techniques to state-of-the-art deep learning techniques. In closing, we discussed emerging machine learning topics to help both experimental and computational scientists leverage the current knowledge and strategies to develop more powerful and accurate CPI prediction methods. Research Network of Computational and Structural Biotechnology 2021-03-10 /pmc/articles/PMC8008185/ /pubmed/33841755 http://dx.doi.org/10.1016/j.csbj.2021.03.004 Text en © 2021 The Authors http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Review Article Lim, Sangsoo Lu, Yijingxiu Cho, Chang Yun Sung, Inyoung Kim, Jungwoo Kim, Youngkuk Park, Sungjoon Kim, Sun A review on compound-protein interaction prediction methods: Data, format, representation and model |
title | A review on compound-protein interaction prediction methods: Data, format, representation and model |
title_full | A review on compound-protein interaction prediction methods: Data, format, representation and model |
title_fullStr | A review on compound-protein interaction prediction methods: Data, format, representation and model |
title_full_unstemmed | A review on compound-protein interaction prediction methods: Data, format, representation and model |
title_short | A review on compound-protein interaction prediction methods: Data, format, representation and model |
title_sort | review on compound-protein interaction prediction methods: data, format, representation and model |
topic | Review Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8008185/ https://www.ncbi.nlm.nih.gov/pubmed/33841755 http://dx.doi.org/10.1016/j.csbj.2021.03.004 |
work_keys_str_mv | AT limsangsoo areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT luyijingxiu areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT chochangyun areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT sunginyoung areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT kimjungwoo areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT kimyoungkuk areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT parksungjoon areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT kimsun areviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT limsangsoo reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT luyijingxiu reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT chochangyun reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT sunginyoung reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT kimjungwoo reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT kimyoungkuk reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT parksungjoon reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel AT kimsun reviewoncompoundproteininteractionpredictionmethodsdataformatrepresentationandmodel |