Cargando…

Scalable prediction of compound-protein interactions using minwise hashing

The identification of compound-protein interactions plays key roles in the drug development toward discovery of new drug leads and new therapeutic protein targets. There is therefore a strong incentive to develop new efficient methods for predicting compound-protein interactions on a genome-wide sca...

Descripción completa

Detalles Bibliográficos
Autores principales: Tabei, Yasuo, Yamanishi, Yoshihiro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4029277/
https://www.ncbi.nlm.nih.gov/pubmed/24564870
http://dx.doi.org/10.1186/1752-0509-7-S6-S3
_version_ 1782317184458948608
author Tabei, Yasuo
Yamanishi, Yoshihiro
author_facet Tabei, Yasuo
Yamanishi, Yoshihiro
author_sort Tabei, Yasuo
collection PubMed
description The identification of compound-protein interactions plays key roles in the drug development toward discovery of new drug leads and new therapeutic protein targets. There is therefore a strong incentive to develop new efficient methods for predicting compound-protein interactions on a genome-wide scale. In this paper we develop a novel chemogenomic method to make a scalable prediction of compound-protein interactions from heterogeneous biological data using minwise hashing. The proposed method mainly consists of two steps: 1) construction of new compact fingerprints for compound-protein pairs by an improved minwise hashing algorithm, and 2) application of a sparsity-induced classifier to the compact fingerprints. We test the proposed method on its ability to make a large-scale prediction of compound-protein interactions from compound substructure fingerprints and protein domain fingerprints, and show superior performance of the proposed method compared with the previous chemogenomic methods in terms of prediction accuracy, computational efficiency, and interpretability of the predictive model. All the previously developed methods are not computationally feasible for the full dataset consisting of about 200 millions of compound-protein pairs. The proposed method is expected to be useful for virtual screening of a huge number of compounds against many protein targets.
format Online
Article
Text
id pubmed-4029277
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40292772014-06-04 Scalable prediction of compound-protein interactions using minwise hashing Tabei, Yasuo Yamanishi, Yoshihiro BMC Syst Biol Research The identification of compound-protein interactions plays key roles in the drug development toward discovery of new drug leads and new therapeutic protein targets. There is therefore a strong incentive to develop new efficient methods for predicting compound-protein interactions on a genome-wide scale. In this paper we develop a novel chemogenomic method to make a scalable prediction of compound-protein interactions from heterogeneous biological data using minwise hashing. The proposed method mainly consists of two steps: 1) construction of new compact fingerprints for compound-protein pairs by an improved minwise hashing algorithm, and 2) application of a sparsity-induced classifier to the compact fingerprints. We test the proposed method on its ability to make a large-scale prediction of compound-protein interactions from compound substructure fingerprints and protein domain fingerprints, and show superior performance of the proposed method compared with the previous chemogenomic methods in terms of prediction accuracy, computational efficiency, and interpretability of the predictive model. All the previously developed methods are not computationally feasible for the full dataset consisting of about 200 millions of compound-protein pairs. The proposed method is expected to be useful for virtual screening of a huge number of compounds against many protein targets. BioMed Central 2013-12-13 /pmc/articles/PMC4029277/ /pubmed/24564870 http://dx.doi.org/10.1186/1752-0509-7-S6-S3 Text en Copyright © 2013 Tabei and Yamanishi; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Tabei, Yasuo
Yamanishi, Yoshihiro
Scalable prediction of compound-protein interactions using minwise hashing
title Scalable prediction of compound-protein interactions using minwise hashing
title_full Scalable prediction of compound-protein interactions using minwise hashing
title_fullStr Scalable prediction of compound-protein interactions using minwise hashing
title_full_unstemmed Scalable prediction of compound-protein interactions using minwise hashing
title_short Scalable prediction of compound-protein interactions using minwise hashing
title_sort scalable prediction of compound-protein interactions using minwise hashing
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4029277/
https://www.ncbi.nlm.nih.gov/pubmed/24564870
http://dx.doi.org/10.1186/1752-0509-7-S6-S3
work_keys_str_mv AT tabeiyasuo scalablepredictionofcompoundproteininteractionsusingminwisehashing
AT yamanishiyoshihiro scalablepredictionofcompoundproteininteractionsusingminwisehashing