Cargando…
Scalable prediction of compound-protein interactions using minwise hashing
The identification of compound-protein interactions plays key roles in the drug development toward discovery of new drug leads and new therapeutic protein targets. There is therefore a strong incentive to develop new efficient methods for predicting compound-protein interactions on a genome-wide sca...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4029277/ https://www.ncbi.nlm.nih.gov/pubmed/24564870 http://dx.doi.org/10.1186/1752-0509-7-S6-S3 |
_version_ | 1782317184458948608 |
---|---|
author | Tabei, Yasuo Yamanishi, Yoshihiro |
author_facet | Tabei, Yasuo Yamanishi, Yoshihiro |
author_sort | Tabei, Yasuo |
collection | PubMed |
description | The identification of compound-protein interactions plays key roles in the drug development toward discovery of new drug leads and new therapeutic protein targets. There is therefore a strong incentive to develop new efficient methods for predicting compound-protein interactions on a genome-wide scale. In this paper we develop a novel chemogenomic method to make a scalable prediction of compound-protein interactions from heterogeneous biological data using minwise hashing. The proposed method mainly consists of two steps: 1) construction of new compact fingerprints for compound-protein pairs by an improved minwise hashing algorithm, and 2) application of a sparsity-induced classifier to the compact fingerprints. We test the proposed method on its ability to make a large-scale prediction of compound-protein interactions from compound substructure fingerprints and protein domain fingerprints, and show superior performance of the proposed method compared with the previous chemogenomic methods in terms of prediction accuracy, computational efficiency, and interpretability of the predictive model. All the previously developed methods are not computationally feasible for the full dataset consisting of about 200 millions of compound-protein pairs. The proposed method is expected to be useful for virtual screening of a huge number of compounds against many protein targets. |
format | Online Article Text |
id | pubmed-4029277 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40292772014-06-04 Scalable prediction of compound-protein interactions using minwise hashing Tabei, Yasuo Yamanishi, Yoshihiro BMC Syst Biol Research The identification of compound-protein interactions plays key roles in the drug development toward discovery of new drug leads and new therapeutic protein targets. There is therefore a strong incentive to develop new efficient methods for predicting compound-protein interactions on a genome-wide scale. In this paper we develop a novel chemogenomic method to make a scalable prediction of compound-protein interactions from heterogeneous biological data using minwise hashing. The proposed method mainly consists of two steps: 1) construction of new compact fingerprints for compound-protein pairs by an improved minwise hashing algorithm, and 2) application of a sparsity-induced classifier to the compact fingerprints. We test the proposed method on its ability to make a large-scale prediction of compound-protein interactions from compound substructure fingerprints and protein domain fingerprints, and show superior performance of the proposed method compared with the previous chemogenomic methods in terms of prediction accuracy, computational efficiency, and interpretability of the predictive model. All the previously developed methods are not computationally feasible for the full dataset consisting of about 200 millions of compound-protein pairs. The proposed method is expected to be useful for virtual screening of a huge number of compounds against many protein targets. BioMed Central 2013-12-13 /pmc/articles/PMC4029277/ /pubmed/24564870 http://dx.doi.org/10.1186/1752-0509-7-S6-S3 Text en Copyright © 2013 Tabei and Yamanishi; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Tabei, Yasuo Yamanishi, Yoshihiro Scalable prediction of compound-protein interactions using minwise hashing |
title | Scalable prediction of compound-protein interactions using minwise hashing |
title_full | Scalable prediction of compound-protein interactions using minwise hashing |
title_fullStr | Scalable prediction of compound-protein interactions using minwise hashing |
title_full_unstemmed | Scalable prediction of compound-protein interactions using minwise hashing |
title_short | Scalable prediction of compound-protein interactions using minwise hashing |
title_sort | scalable prediction of compound-protein interactions using minwise hashing |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4029277/ https://www.ncbi.nlm.nih.gov/pubmed/24564870 http://dx.doi.org/10.1186/1752-0509-7-S6-S3 |
work_keys_str_mv | AT tabeiyasuo scalablepredictionofcompoundproteininteractionsusingminwisehashing AT yamanishiyoshihiro scalablepredictionofcompoundproteininteractionsusingminwisehashing |