Cargando…

Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening

Predictions of interactions between target proteins and potential leads are of great benefit in the drug discovery process. We present a comprehensively applicable statistical prediction method for interactions between any proteins and chemical compounds, which requires only protein sequence data an...

Descripción completa

Detalles Bibliográficos
Autores principales:	Nagamine, Nobuyoshi, Shirakawa, Takayuki, Minato, Yusuke, Torii, Kentaro, Kobayashi, Hiroki, Imoto, Masaya, Sakakibara, Yasubumi
Formato:	Texto
Lenguaje:	English
Publicado:	Public Library of Science 2009
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2685987/ https://www.ncbi.nlm.nih.gov/pubmed/19503826 http://dx.doi.org/10.1371/journal.pcbi.1000397

_version_	1782167367709622272
author	Nagamine, Nobuyoshi Shirakawa, Takayuki Minato, Yusuke Torii, Kentaro Kobayashi, Hiroki Imoto, Masaya Sakakibara, Yasubumi
author_facet	Nagamine, Nobuyoshi Shirakawa, Takayuki Minato, Yusuke Torii, Kentaro Kobayashi, Hiroki Imoto, Masaya Sakakibara, Yasubumi
author_sort	Nagamine, Nobuyoshi
collection	PubMed
description	Predictions of interactions between target proteins and potential leads are of great benefit in the drug discovery process. We present a comprehensively applicable statistical prediction method for interactions between any proteins and chemical compounds, which requires only protein sequence data and chemical structure data and utilizes the statistical learning method of support vector machines. In order to realize reasonable comprehensive predictions which can involve many false positives, we propose two approaches for reduction of false positives: (i) efficient use of multiple statistical prediction models in the framework of two-layer SVM and (ii) reasonable design of the negative data to construct statistical prediction models. In two-layer SVM, outputs produced by the first-layer SVM models, which are constructed with different negative samples and reflect different aspects of classifications, are utilized as inputs to the second-layer SVM. In order to design negative data which produce fewer false positive predictions, we iteratively construct SVM models or classification boundaries from positive and tentative negative samples and select additional negative sample candidates according to pre-determined rules. Moreover, in order to fully utilize the advantages of statistical learning methods, we propose a strategy to effectively feedback experimental results to computational predictions with consideration of biological effects of interest. We show the usefulness of our approach in predicting potential ligands binding to human androgen receptors from more than 19 million chemical compounds and verifying these predictions by in vitro binding. Moreover, we utilize this experimental validation as feedback to enhance subsequent computational predictions, and experimentally validate these predictions again. This efficient procedure of the iteration of the in silico prediction and in vitro or in vivo experimental verifications with the sufficient feedback enabled us to identify novel ligand candidates which were distant from known ligands in the chemical space.
format	Text
id	pubmed-2685987
institution	National Center for Biotechnology Information
language	English
publishDate	2009
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-26859872009-06-05 Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening Nagamine, Nobuyoshi Shirakawa, Takayuki Minato, Yusuke Torii, Kentaro Kobayashi, Hiroki Imoto, Masaya Sakakibara, Yasubumi PLoS Comput Biol Research Article Predictions of interactions between target proteins and potential leads are of great benefit in the drug discovery process. We present a comprehensively applicable statistical prediction method for interactions between any proteins and chemical compounds, which requires only protein sequence data and chemical structure data and utilizes the statistical learning method of support vector machines. In order to realize reasonable comprehensive predictions which can involve many false positives, we propose two approaches for reduction of false positives: (i) efficient use of multiple statistical prediction models in the framework of two-layer SVM and (ii) reasonable design of the negative data to construct statistical prediction models. In two-layer SVM, outputs produced by the first-layer SVM models, which are constructed with different negative samples and reflect different aspects of classifications, are utilized as inputs to the second-layer SVM. In order to design negative data which produce fewer false positive predictions, we iteratively construct SVM models or classification boundaries from positive and tentative negative samples and select additional negative sample candidates according to pre-determined rules. Moreover, in order to fully utilize the advantages of statistical learning methods, we propose a strategy to effectively feedback experimental results to computational predictions with consideration of biological effects of interest. We show the usefulness of our approach in predicting potential ligands binding to human androgen receptors from more than 19 million chemical compounds and verifying these predictions by in vitro binding. Moreover, we utilize this experimental validation as feedback to enhance subsequent computational predictions, and experimentally validate these predictions again. This efficient procedure of the iteration of the in silico prediction and in vitro or in vivo experimental verifications with the sufficient feedback enabled us to identify novel ligand candidates which were distant from known ligands in the chemical space. Public Library of Science 2009-06-05 /pmc/articles/PMC2685987/ /pubmed/19503826 http://dx.doi.org/10.1371/journal.pcbi.1000397 Text en Nagamine et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Nagamine, Nobuyoshi Shirakawa, Takayuki Minato, Yusuke Torii, Kentaro Kobayashi, Hiroki Imoto, Masaya Sakakibara, Yasubumi Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening
title	Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening
title_full	Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening
title_fullStr	Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening
title_full_unstemmed	Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening
title_short	Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening
title_sort	integrating statistical predictions and experimental verifications for enhancing protein-chemical interaction predictions in virtual screening
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2685987/ https://www.ncbi.nlm.nih.gov/pubmed/19503826 http://dx.doi.org/10.1371/journal.pcbi.1000397
work_keys_str_mv	AT nagaminenobuyoshi integratingstatisticalpredictionsandexperimentalverificationsforenhancingproteinchemicalinteractionpredictionsinvirtualscreening AT shirakawatakayuki integratingstatisticalpredictionsandexperimentalverificationsforenhancingproteinchemicalinteractionpredictionsinvirtualscreening AT minatoyusuke integratingstatisticalpredictionsandexperimentalverificationsforenhancingproteinchemicalinteractionpredictionsinvirtualscreening AT toriikentaro integratingstatisticalpredictionsandexperimentalverificationsforenhancingproteinchemicalinteractionpredictionsinvirtualscreening AT kobayashihiroki integratingstatisticalpredictionsandexperimentalverificationsforenhancingproteinchemicalinteractionpredictionsinvirtualscreening AT imotomasaya integratingstatisticalpredictionsandexperimentalverificationsforenhancingproteinchemicalinteractionpredictionsinvirtualscreening AT sakakibarayasubumi integratingstatisticalpredictionsandexperimentalverificationsforenhancingproteinchemicalinteractionpredictionsinvirtualscreening

Integrating Statistical Predictions and Experimental Verifications for Enhancing Protein-Chemical Interaction Predictions in Virtual Screening

Ejemplares similares