
Implementation of multiple-instance learning in drug activity prediction

BACKGROUND: In the context of drug discovery and development, much effort has been exerted to determine which conformers of a given molecule are responsible for the observed biological activity. In this work we aimed to predict bioactive conformers using a variant of supervised learning, named multi...


Bibliographic Details
Main authors:	Fu, Gang; Nan, Xiaofei; Liu, Haining; Patel, Ronak Y; Daga, Pankaj R; Chen, Yixin; Wilkins, Dawn E; Doerksen, Robert J
Format:	Online Article Text
Language:	English
Published:	BioMed Central 2012
Subjects:	Proceedings
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3439725/ https://www.ncbi.nlm.nih.gov/pubmed/23046442 http://dx.doi.org/10.1186/1471-2105-13-S15-S3

_version_	1782243055027355648
author	Fu, Gang; Nan, Xiaofei; Liu, Haining; Patel, Ronak Y; Daga, Pankaj R; Chen, Yixin; Wilkins, Dawn E; Doerksen, Robert J
author_sort	Fu, Gang
collection	PubMed
description	BACKGROUND: In the context of drug discovery and development, much effort has been exerted to determine which conformers of a given molecule are responsible for the observed biological activity. In this work we aimed to predict bioactive conformers using a variant of supervised learning, named multiple-instance learning. A single molecule, treated as a bag of conformers, is biologically active if and only if at least one of its conformers, treated as an instance, is responsible for the observed bioactivity; and a molecule is inactive if none of its conformers is responsible for the observed bioactivity. The implementation requires instance-based embedding, and joint feature selection and classification. The goal of the present project is to implement multiple-instance learning in drug activity prediction, and subsequently to identify the bioactive conformers for each molecule. METHODS: We encoded the 3-dimensional structures using pharmacophore fingerprints which are binary strings, and accomplished instance-based embedding using calculated dissimilarity distances. Four dissimilarity measures were employed and their performances were compared. 1-norm SVM was used for joint feature selection and classification. The approach was applied to four data sets, and the best proposed model for each data set was determined by using the dissimilarity measure yielding the smallest number of selected features. RESULTS: The predictive abilities of the proposed approach were compared with three classical predictive models without instance-based embedding. The proposed approach produced the best predictive models for one data set and second best predictive models for the rest of the data sets, based on the external validations. To validate the ability of the proposed approach to find bioactive conformers, 12 small molecules with co-crystallized structures were seeded in one data set. 10 out of 12 co-crystallized structures were indeed identified as significant conformers using the proposed approach. CONCLUSIONS: The proposed approach was proven not to suffer from overfitting and to be highly competitive with classical predictive models, so it is very powerful for drug activity prediction. The approach was also validated as a useful method for pursuit of bioactive conformers.
format	Online Article Text
id	pubmed-3439725
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-3439725 2012-09-17 Implementation of multiple-instance learning in drug activity prediction Fu, Gang; Nan, Xiaofei; Liu, Haining; Patel, Ronak Y; Daga, Pankaj R; Chen, Yixin; Wilkins, Dawn E; Doerksen, Robert J BMC Bioinformatics Proceedings BioMed Central 2012-09-11 /pmc/articles/PMC3439725/ /pubmed/23046442 http://dx.doi.org/10.1186/1471-2105-13-S15-S3 Text en Copyright ©2012 Fu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title	Implementation of multiple-instance learning in drug activity prediction
topic	Proceedings
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3439725/ https://www.ncbi.nlm.nih.gov/pubmed/23046442 http://dx.doi.org/10.1186/1471-2105-13-S15-S3
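
The description above treats a molecule as a bag of conformers, labels the bag active if and only if at least one conformer is responsible for the activity, and embeds each bag using dissimilarity distances between binary pharmacophore fingerprints. A minimal Python sketch of those two ideas follows. It is an illustration under stated assumptions, not the authors' implementation: the record does not name the four dissimilarity measures, so the Tanimoto dissimilarity is an assumed stand-in, and taking the minimum dissimilarity from a bag to each reference conformer is likewise an assumed embedding rule. The names `tanimoto_dissimilarity`, `bag_label`, and `embed_bag` are hypothetical.

```python
from typing import List, Sequence

# A binary pharmacophore fingerprint, e.g. [1, 0, 1, 1] (toy length for illustration).
Fingerprint = Sequence[int]


def tanimoto_dissimilarity(a: Fingerprint, b: Fingerprint) -> float:
    """Return 1 - |a AND b| / |a OR b| for two equal-length binary strings.

    Assumption: Tanimoto is used here only as an example measure; the record
    does not say which four measures the study compared.
    """
    if len(a) != len(b):
        raise ValueError("fingerprints must have the same length")
    both = sum(1 for x, y in zip(a, b) if x and y)
    either = sum(1 for x, y in zip(a, b) if x or y)
    # Two all-zero fingerprints are treated as identical.
    return 0.0 if either == 0 else 1.0 - both / either


def bag_label(instance_labels: Sequence[bool]) -> bool:
    """Multiple-instance rule: a bag is active iff at least one instance is."""
    return any(instance_labels)


def embed_bag(bag: Sequence[Fingerprint],
              references: Sequence[Fingerprint]) -> List[float]:
    """Map a bag of conformers to one feature per reference conformer.

    Assumption: each feature is the smallest dissimilarity between the
    reference and any conformer in the bag.
    """
    if not bag:
        raise ValueError("a bag must contain at least one conformer")
    return [min(tanimoto_dissimilarity(conf, ref) for conf in bag)
            for ref in references]


# Toy data: one molecule with two conformers, embedded against two references.
molecule = [[1, 1, 0, 0], [0, 1, 1, 0]]
references = [[1, 1, 0, 0], [0, 0, 1, 1]]
features = embed_bag(molecule, references)
```

Each molecule then becomes a fixed-length vector regardless of how many conformers it has, which is what allows a sparse classifier such as the 1-norm SVM mentioned in the description to select a small set of reference conformers as features.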
