Cargando…

An Unbiased Method To Build Benchmarking Sets for Ligand-Based Virtual Screening and its Application To GPCRs

[Image: see text] Benchmarking data sets have become common in recent years for the purpose of virtual screening, though the main focus had been placed on the structure-based virtual screening (SBVS) approaches. Due to the lack of crystal structures, there is great need for unbiased benchmarking set...

Descripción completa

Detalles Bibliográficos
Autores principales:	Xia, Jie, Jin, Hongwei, Liu, Zhenming, Zhang, Liangren, Wang, Xiang Simon
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	American Chemical Society 2014
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4038372/ https://www.ncbi.nlm.nih.gov/pubmed/24749745 http://dx.doi.org/10.1021/ci500062f

_version_	1782318338432565248
author	Xia, Jie Jin, Hongwei Liu, Zhenming Zhang, Liangren Wang, Xiang Simon
author_facet	Xia, Jie Jin, Hongwei Liu, Zhenming Zhang, Liangren Wang, Xiang Simon
author_sort	Xia, Jie
collection	PubMed
description	[Image: see text] Benchmarking data sets have become common in recent years for the purpose of virtual screening, though the main focus had been placed on the structure-based virtual screening (SBVS) approaches. Due to the lack of crystal structures, there is great need for unbiased benchmarking sets to evaluate various ligand-based virtual screening (LBVS) methods for important drug targets such as G protein-coupled receptors (GPCRs). To date these ready-to-apply data sets for LBVS are fairly limited, and the direct usage of benchmarking sets designed for SBVS could bring the biases to the evaluation of LBVS. Herein, we propose an unbiased method to build benchmarking sets for LBVS and validate it on a multitude of GPCRs targets. To be more specific, our methods can (1) ensure chemical diversity of ligands, (2) maintain the physicochemical similarity between ligands and decoys, (3) make the decoys dissimilar in chemical topology to all ligands to avoid false negatives, and (4) maximize spatial random distribution of ligands and decoys. We evaluated the quality of our Unbiased Ligand Set (ULS) and Unbiased Decoy Set (UDS) using three common LBVS approaches, with Leave-One-Out (LOO) Cross-Validation (CV) and a metric of average AUC of the ROC curves. Our method has greatly reduced the “artificial enrichment” and “analogue bias” of a published GPCRs benchmarking set, i.e., GPCR Ligand Library (GLL)/GPCR Decoy Database (GDD). In addition, we addressed an important issue about the ratio of decoys per ligand and found that for a range of 30 to 100 it does not affect the quality of the benchmarking set, so we kept the original ratio of 39 from the GLL/GDD.
format	Online Article Text
id	pubmed-4038372
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	American Chemical Society
record_format	MEDLINE/PubMed
spelling	pubmed-40383722015-04-21 An Unbiased Method To Build Benchmarking Sets for Ligand-Based Virtual Screening and its Application To GPCRs Xia, Jie Jin, Hongwei Liu, Zhenming Zhang, Liangren Wang, Xiang Simon J Chem Inf Model [Image: see text] Benchmarking data sets have become common in recent years for the purpose of virtual screening, though the main focus had been placed on the structure-based virtual screening (SBVS) approaches. Due to the lack of crystal structures, there is great need for unbiased benchmarking sets to evaluate various ligand-based virtual screening (LBVS) methods for important drug targets such as G protein-coupled receptors (GPCRs). To date these ready-to-apply data sets for LBVS are fairly limited, and the direct usage of benchmarking sets designed for SBVS could bring the biases to the evaluation of LBVS. Herein, we propose an unbiased method to build benchmarking sets for LBVS and validate it on a multitude of GPCRs targets. To be more specific, our methods can (1) ensure chemical diversity of ligands, (2) maintain the physicochemical similarity between ligands and decoys, (3) make the decoys dissimilar in chemical topology to all ligands to avoid false negatives, and (4) maximize spatial random distribution of ligands and decoys. We evaluated the quality of our Unbiased Ligand Set (ULS) and Unbiased Decoy Set (UDS) using three common LBVS approaches, with Leave-One-Out (LOO) Cross-Validation (CV) and a metric of average AUC of the ROC curves. Our method has greatly reduced the “artificial enrichment” and “analogue bias” of a published GPCRs benchmarking set, i.e., GPCR Ligand Library (GLL)/GPCR Decoy Database (GDD). In addition, we addressed an important issue about the ratio of decoys per ligand and found that for a range of 30 to 100 it does not affect the quality of the benchmarking set, so we kept the original ratio of 39 from the GLL/GDD. American Chemical Society 2014-04-21 2014-05-27 /pmc/articles/PMC4038372/ /pubmed/24749745 http://dx.doi.org/10.1021/ci500062f Text en Copyright © 2014 American Chemical Society
spellingShingle	Xia, Jie Jin, Hongwei Liu, Zhenming Zhang, Liangren Wang, Xiang Simon An Unbiased Method To Build Benchmarking Sets for Ligand-Based Virtual Screening and its Application To GPCRs
title	An Unbiased Method To Build Benchmarking Sets for Ligand-Based Virtual Screening and its Application To GPCRs
title_full	An Unbiased Method To Build Benchmarking Sets for Ligand-Based Virtual Screening and its Application To GPCRs
title_fullStr	An Unbiased Method To Build Benchmarking Sets for Ligand-Based Virtual Screening and its Application To GPCRs
title_full_unstemmed	An Unbiased Method To Build Benchmarking Sets for Ligand-Based Virtual Screening and its Application To GPCRs
title_short	An Unbiased Method To Build Benchmarking Sets for Ligand-Based Virtual Screening and its Application To GPCRs
title_sort	unbiased method to build benchmarking sets for ligand-based virtual screening and its application to gpcrs
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4038372/ https://www.ncbi.nlm.nih.gov/pubmed/24749745 http://dx.doi.org/10.1021/ci500062f
work_keys_str_mv	AT xiajie anunbiasedmethodtobuildbenchmarkingsetsforligandbasedvirtualscreeninganditsapplicationtogpcrs AT jinhongwei anunbiasedmethodtobuildbenchmarkingsetsforligandbasedvirtualscreeninganditsapplicationtogpcrs AT liuzhenming anunbiasedmethodtobuildbenchmarkingsetsforligandbasedvirtualscreeninganditsapplicationtogpcrs AT zhangliangren anunbiasedmethodtobuildbenchmarkingsetsforligandbasedvirtualscreeninganditsapplicationtogpcrs AT wangxiangsimon anunbiasedmethodtobuildbenchmarkingsetsforligandbasedvirtualscreeninganditsapplicationtogpcrs AT xiajie unbiasedmethodtobuildbenchmarkingsetsforligandbasedvirtualscreeninganditsapplicationtogpcrs AT jinhongwei unbiasedmethodtobuildbenchmarkingsetsforligandbasedvirtualscreeninganditsapplicationtogpcrs AT liuzhenming unbiasedmethodtobuildbenchmarkingsetsforligandbasedvirtualscreeninganditsapplicationtogpcrs AT zhangliangren unbiasedmethodtobuildbenchmarkingsetsforligandbasedvirtualscreeninganditsapplicationtogpcrs AT wangxiangsimon unbiasedmethodtobuildbenchmarkingsetsforligandbasedvirtualscreeninganditsapplicationtogpcrs

An Unbiased Method To Build Benchmarking Sets for Ligand-Based Virtual Screening and its Application To GPCRs

Ejemplares similares