Cargando…

Exploring chemical space for lead identification by propagating on chemical similarity network

MOTIVATION: Lead identification is a fundamental step to prioritize candidate compounds for downstream drug discovery process. Machine learning (ML) and deep learning (DL) approaches are widely used to identify lead compounds using both chemical property and experimental information. However, ML or...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yi, Jungseob, Lee, Sangseon, Lim, Sangsoo, Cho, Changyun, Piao, Yinhua, Yeo, Marie, Kim, Dongkyu, Kim, Sun, Lee, Sunho
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Research Network of Computational and Structural Biotechnology 2023
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10480321/ https://www.ncbi.nlm.nih.gov/pubmed/37680266 http://dx.doi.org/10.1016/j.csbj.2023.08.016

_version_	1785101769732259840
author	Yi, Jungseob Lee, Sangseon Lim, Sangsoo Cho, Changyun Piao, Yinhua Yeo, Marie Kim, Dongkyu Kim, Sun Lee, Sunho
author_facet	Yi, Jungseob Lee, Sangseon Lim, Sangsoo Cho, Changyun Piao, Yinhua Yeo, Marie Kim, Dongkyu Kim, Sun Lee, Sunho
author_sort	Yi, Jungseob
collection	PubMed
description	MOTIVATION: Lead identification is a fundamental step to prioritize candidate compounds for downstream drug discovery process. Machine learning (ML) and deep learning (DL) approaches are widely used to identify lead compounds using both chemical property and experimental information. However, ML or DL methods rarely consider compound similarity information directly since ML and DL models use abstract representation of molecules for model construction. Alternatively, data mining approaches are also used to explore chemical space with drug candidates by screening undesirable compounds. A major challenge for data mining approaches is to develop efficient data mining methods that search large chemical space for desirable lead compounds with low false positive rate. RESULTS: In this work, we developed a network propagation (NP) based data mining method for lead identification that performs search on an ensemble of chemical similarity networks. We compiled 14 fingerprint-based similarity networks. Given a target protein of interest, we use a deep learning-based drug target interaction model to narrow down compound candidates and then we use network propagation to prioritize drug candidates that are highly correlated with drug activity score such as IC(50). In an extensive experiment with BindingDB, we showed that our approach successfully discovered intentionally unlabeled compounds for given targets. To further demonstrate the prediction power of our approach, we identified 24 candidate leads for CLK1. Two out of five synthesizable candidates were experimentally validated in binding assays. In conclusion, our framework can be very useful for lead identification from very large compound databases such as ZINC.
format	Online Article Text
id	pubmed-10480321
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	Research Network of Computational and Structural Biotechnology
record_format	MEDLINE/PubMed
spelling	pubmed-104803212023-09-07 Exploring chemical space for lead identification by propagating on chemical similarity network Yi, Jungseob Lee, Sangseon Lim, Sangsoo Cho, Changyun Piao, Yinhua Yeo, Marie Kim, Dongkyu Kim, Sun Lee, Sunho Comput Struct Biotechnol J Research Article MOTIVATION: Lead identification is a fundamental step to prioritize candidate compounds for downstream drug discovery process. Machine learning (ML) and deep learning (DL) approaches are widely used to identify lead compounds using both chemical property and experimental information. However, ML or DL methods rarely consider compound similarity information directly since ML and DL models use abstract representation of molecules for model construction. Alternatively, data mining approaches are also used to explore chemical space with drug candidates by screening undesirable compounds. A major challenge for data mining approaches is to develop efficient data mining methods that search large chemical space for desirable lead compounds with low false positive rate. RESULTS: In this work, we developed a network propagation (NP) based data mining method for lead identification that performs search on an ensemble of chemical similarity networks. We compiled 14 fingerprint-based similarity networks. Given a target protein of interest, we use a deep learning-based drug target interaction model to narrow down compound candidates and then we use network propagation to prioritize drug candidates that are highly correlated with drug activity score such as IC(50). In an extensive experiment with BindingDB, we showed that our approach successfully discovered intentionally unlabeled compounds for given targets. To further demonstrate the prediction power of our approach, we identified 24 candidate leads for CLK1. Two out of five synthesizable candidates were experimentally validated in binding assays. In conclusion, our framework can be very useful for lead identification from very large compound databases such as ZINC. Research Network of Computational and Structural Biotechnology 2023-08-25 /pmc/articles/PMC10480321/ /pubmed/37680266 http://dx.doi.org/10.1016/j.csbj.2023.08.016 Text en © 2023 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle	Research Article Yi, Jungseob Lee, Sangseon Lim, Sangsoo Cho, Changyun Piao, Yinhua Yeo, Marie Kim, Dongkyu Kim, Sun Lee, Sunho Exploring chemical space for lead identification by propagating on chemical similarity network
title	Exploring chemical space for lead identification by propagating on chemical similarity network
title_full	Exploring chemical space for lead identification by propagating on chemical similarity network
title_fullStr	Exploring chemical space for lead identification by propagating on chemical similarity network
title_full_unstemmed	Exploring chemical space for lead identification by propagating on chemical similarity network
title_short	Exploring chemical space for lead identification by propagating on chemical similarity network
title_sort	exploring chemical space for lead identification by propagating on chemical similarity network
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10480321/ https://www.ncbi.nlm.nih.gov/pubmed/37680266 http://dx.doi.org/10.1016/j.csbj.2023.08.016
work_keys_str_mv	AT yijungseob exploringchemicalspaceforleadidentificationbypropagatingonchemicalsimilaritynetwork AT leesangseon exploringchemicalspaceforleadidentificationbypropagatingonchemicalsimilaritynetwork AT limsangsoo exploringchemicalspaceforleadidentificationbypropagatingonchemicalsimilaritynetwork AT chochangyun exploringchemicalspaceforleadidentificationbypropagatingonchemicalsimilaritynetwork AT piaoyinhua exploringchemicalspaceforleadidentificationbypropagatingonchemicalsimilaritynetwork AT yeomarie exploringchemicalspaceforleadidentificationbypropagatingonchemicalsimilaritynetwork AT kimdongkyu exploringchemicalspaceforleadidentificationbypropagatingonchemicalsimilaritynetwork AT kimsun exploringchemicalspaceforleadidentificationbypropagatingonchemicalsimilaritynetwork AT leesunho exploringchemicalspaceforleadidentificationbypropagatingonchemicalsimilaritynetwork

Exploring chemical space for lead identification by propagating on chemical similarity network

Ejemplares similares