Cargando…

Ligand-Based Virtual Screening Based on the Graph Edit Distance

Chemical compounds can be represented as attributed graphs. An attributed graph is a mathematical model of an object composed of two types of representations: nodes and edges. Nodes are individual components, and edges are relations between these components. In this case, pharmacophore-type node des...

Descripción completa

Detalles Bibliográficos
Autores principales: Rica, Elena, Álvarez, Susana, Serratosa, Francesc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8658044/
https://www.ncbi.nlm.nih.gov/pubmed/34884555
http://dx.doi.org/10.3390/ijms222312751
_version_ 1784612640506511360
author Rica, Elena
Álvarez, Susana
Serratosa, Francesc
author_facet Rica, Elena
Álvarez, Susana
Serratosa, Francesc
author_sort Rica, Elena
collection PubMed
description Chemical compounds can be represented as attributed graphs. An attributed graph is a mathematical model of an object composed of two types of representations: nodes and edges. Nodes are individual components, and edges are relations between these components. In this case, pharmacophore-type node descriptions are represented by nodes and chemical bounds by edges. If we want to obtain the bioactivity dissimilarity between two chemical compounds, a distance between attributed graphs can be used. The Graph Edit Distance allows computing this distance, and it is defined as the cost of transforming one graph into another. Nevertheless, to define this dissimilarity, the transformation cost must be properly tuned. The aim of this paper is to analyse the structural-based screening methods to verify the quality of the Harper transformation costs proposal and to present an algorithm to learn these transformation costs such that the bioactivity dissimilarity is properly defined in a ligand-based virtual screening application. The goodness of the dissimilarity is represented by the classification accuracy. Six publicly available datasets—CAPST, DUD-E, GLL&GDD, NRLiSt-BDB, MUV and ULS-UDS—have been used to validate our methodology and show that with our learned costs, we obtain the highest ratios in identifying the bioactivity similarity in a structurally diverse group of molecules.
format Online
Article
Text
id pubmed-8658044
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86580442021-12-10 Ligand-Based Virtual Screening Based on the Graph Edit Distance Rica, Elena Álvarez, Susana Serratosa, Francesc Int J Mol Sci Article Chemical compounds can be represented as attributed graphs. An attributed graph is a mathematical model of an object composed of two types of representations: nodes and edges. Nodes are individual components, and edges are relations between these components. In this case, pharmacophore-type node descriptions are represented by nodes and chemical bounds by edges. If we want to obtain the bioactivity dissimilarity between two chemical compounds, a distance between attributed graphs can be used. The Graph Edit Distance allows computing this distance, and it is defined as the cost of transforming one graph into another. Nevertheless, to define this dissimilarity, the transformation cost must be properly tuned. The aim of this paper is to analyse the structural-based screening methods to verify the quality of the Harper transformation costs proposal and to present an algorithm to learn these transformation costs such that the bioactivity dissimilarity is properly defined in a ligand-based virtual screening application. The goodness of the dissimilarity is represented by the classification accuracy. Six publicly available datasets—CAPST, DUD-E, GLL&GDD, NRLiSt-BDB, MUV and ULS-UDS—have been used to validate our methodology and show that with our learned costs, we obtain the highest ratios in identifying the bioactivity similarity in a structurally diverse group of molecules. MDPI 2021-11-25 /pmc/articles/PMC8658044/ /pubmed/34884555 http://dx.doi.org/10.3390/ijms222312751 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Rica, Elena
Álvarez, Susana
Serratosa, Francesc
Ligand-Based Virtual Screening Based on the Graph Edit Distance
title Ligand-Based Virtual Screening Based on the Graph Edit Distance
title_full Ligand-Based Virtual Screening Based on the Graph Edit Distance
title_fullStr Ligand-Based Virtual Screening Based on the Graph Edit Distance
title_full_unstemmed Ligand-Based Virtual Screening Based on the Graph Edit Distance
title_short Ligand-Based Virtual Screening Based on the Graph Edit Distance
title_sort ligand-based virtual screening based on the graph edit distance
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8658044/
https://www.ncbi.nlm.nih.gov/pubmed/34884555
http://dx.doi.org/10.3390/ijms222312751
work_keys_str_mv AT ricaelena ligandbasedvirtualscreeningbasedonthegrapheditdistance
AT alvarezsusana ligandbasedvirtualscreeningbasedonthegrapheditdistance
AT serratosafrancesc ligandbasedvirtualscreeningbasedonthegrapheditdistance