Cargando…

Flexible Analog Search with Kernel PCA Embedded Molecule Vectors

Studying analog series to find structural transformations that enhance the activity and ADME properties of lead compounds is an important part of drug development. Matched molecular pair (MMP) search is a powerful tool for analog analysis that imitates researchers' ability to select pairs of co...

Descripción completa

Detalles Bibliográficos
Autores principales: Rensi, Stefano, Altman, Russ B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5396859/
https://www.ncbi.nlm.nih.gov/pubmed/28458783
http://dx.doi.org/10.1016/j.csbj.2017.03.003
_version_ 1783230153336291328
author Rensi, Stefano
Altman, Russ B.
author_facet Rensi, Stefano
Altman, Russ B.
author_sort Rensi, Stefano
collection PubMed
description Studying analog series to find structural transformations that enhance the activity and ADME properties of lead compounds is an important part of drug development. Matched molecular pair (MMP) search is a powerful tool for analog analysis that imitates researchers' ability to select pairs of compounds that differ only by small well-defined transformations. Abstraction is a challenge for existing MMP search algorithms, which can result in the omission of relevant, inexact MMPs, and inclusion of irrelevant, contextually dissimilar MMPs. In this work, we present a new method for MMP search that returns approximate results and enables flexible control over abstraction of contextual information. We illustrate the concepts and mechanics of our method with a series of exemplar MMP queries, and then benchmark search accuracy using MMPs found by fragment indexing. We show that we can search for MMPs in a context dependent manner, and accurately approximate context independent fragment index based MMP search over a range of fingerprint and dataset conditions. Our method can be used to search for pairwise correspondences among analog sets and bolster MMP datasets where data is missing or incomplete.
format Online
Article
Text
id pubmed-5396859
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-53968592017-04-28 Flexible Analog Search with Kernel PCA Embedded Molecule Vectors Rensi, Stefano Altman, Russ B. Comput Struct Biotechnol J Research Article Studying analog series to find structural transformations that enhance the activity and ADME properties of lead compounds is an important part of drug development. Matched molecular pair (MMP) search is a powerful tool for analog analysis that imitates researchers' ability to select pairs of compounds that differ only by small well-defined transformations. Abstraction is a challenge for existing MMP search algorithms, which can result in the omission of relevant, inexact MMPs, and inclusion of irrelevant, contextually dissimilar MMPs. In this work, we present a new method for MMP search that returns approximate results and enables flexible control over abstraction of contextual information. We illustrate the concepts and mechanics of our method with a series of exemplar MMP queries, and then benchmark search accuracy using MMPs found by fragment indexing. We show that we can search for MMPs in a context dependent manner, and accurately approximate context independent fragment index based MMP search over a range of fingerprint and dataset conditions. Our method can be used to search for pairwise correspondences among analog sets and bolster MMP datasets where data is missing or incomplete. Research Network of Computational and Structural Biotechnology 2017-03-24 /pmc/articles/PMC5396859/ /pubmed/28458783 http://dx.doi.org/10.1016/j.csbj.2017.03.003 Text en © 2017 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Research Article
Rensi, Stefano
Altman, Russ B.
Flexible Analog Search with Kernel PCA Embedded Molecule Vectors
title Flexible Analog Search with Kernel PCA Embedded Molecule Vectors
title_full Flexible Analog Search with Kernel PCA Embedded Molecule Vectors
title_fullStr Flexible Analog Search with Kernel PCA Embedded Molecule Vectors
title_full_unstemmed Flexible Analog Search with Kernel PCA Embedded Molecule Vectors
title_short Flexible Analog Search with Kernel PCA Embedded Molecule Vectors
title_sort flexible analog search with kernel pca embedded molecule vectors
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5396859/
https://www.ncbi.nlm.nih.gov/pubmed/28458783
http://dx.doi.org/10.1016/j.csbj.2017.03.003
work_keys_str_mv AT rensistefano flexibleanalogsearchwithkernelpcaembeddedmoleculevectors
AT altmanrussb flexibleanalogsearchwithkernelpcaembeddedmoleculevectors