Cargando…

Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database

With the rapidly increasing availability of High-Throughput Screening (HTS) data in the public domain, such as the PubChem database, methods for ligand-based computer-aided drug discovery (LB-CADD) have the potential to accelerate and reduce the cost of probe development and drug discovery efforts i...

Descripción completa

Detalles Bibliográficos
Autores principales:	Butkiewicz, Mariusz, Lowe, Edward W., Mueller, Ralf, Mendenhall, Jeffrey L., Teixeira, Pedro L., Weaver, C. David, Meiler, Jens
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2013
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3759399/ https://www.ncbi.nlm.nih.gov/pubmed/23299552 http://dx.doi.org/10.3390/molecules18010735

_version_	1782477252252925952
author	Butkiewicz, Mariusz Lowe, Edward W. Mueller, Ralf Mendenhall, Jeffrey L. Teixeira, Pedro L. Weaver, C. David Meiler, Jens
author_facet	Butkiewicz, Mariusz Lowe, Edward W. Mueller, Ralf Mendenhall, Jeffrey L. Teixeira, Pedro L. Weaver, C. David Meiler, Jens
author_sort	Butkiewicz, Mariusz
collection	PubMed
description	With the rapidly increasing availability of High-Throughput Screening (HTS) data in the public domain, such as the PubChem database, methods for ligand-based computer-aided drug discovery (LB-CADD) have the potential to accelerate and reduce the cost of probe development and drug discovery efforts in academia. We assemble nine data sets from realistic HTS campaigns representing major families of drug target proteins for benchmarking LB-CADD methods. Each data set is public domain through PubChem and carefully collated through confirmation screens validating active compounds. These data sets provide the foundation for benchmarking a new cheminformatics framework BCL::ChemInfo, which is freely available for non-commercial use. Quantitative structure activity relationship (QSAR) models are built using Artificial Neural Networks (ANNs), Support Vector Machines (SVMs), Decision Trees (DTs), and Kohonen networks (KNs). Problem-specific descriptor optimization protocols are assessed including Sequential Feature Forward Selection (SFFS) and various information content measures. Measures of predictive power and confidence are evaluated through cross-validation, and a consensus prediction scheme is tested that combines orthogonal machine learning algorithms into a single predictor. Enrichments ranging from 15 to 101 for a TPR cutoff of 25% are observed.
format	Online Article Text
id	pubmed-3759399
institution	National Center for Biotechnology Information
language	English
publishDate	2013
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-37593992013-09-02 Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database Butkiewicz, Mariusz Lowe, Edward W. Mueller, Ralf Mendenhall, Jeffrey L. Teixeira, Pedro L. Weaver, C. David Meiler, Jens Molecules Article With the rapidly increasing availability of High-Throughput Screening (HTS) data in the public domain, such as the PubChem database, methods for ligand-based computer-aided drug discovery (LB-CADD) have the potential to accelerate and reduce the cost of probe development and drug discovery efforts in academia. We assemble nine data sets from realistic HTS campaigns representing major families of drug target proteins for benchmarking LB-CADD methods. Each data set is public domain through PubChem and carefully collated through confirmation screens validating active compounds. These data sets provide the foundation for benchmarking a new cheminformatics framework BCL::ChemInfo, which is freely available for non-commercial use. Quantitative structure activity relationship (QSAR) models are built using Artificial Neural Networks (ANNs), Support Vector Machines (SVMs), Decision Trees (DTs), and Kohonen networks (KNs). Problem-specific descriptor optimization protocols are assessed including Sequential Feature Forward Selection (SFFS) and various information content measures. Measures of predictive power and confidence are evaluated through cross-validation, and a consensus prediction scheme is tested that combines orthogonal machine learning algorithms into a single predictor. Enrichments ranging from 15 to 101 for a TPR cutoff of 25% are observed. MDPI 2013-01-08 /pmc/articles/PMC3759399/ /pubmed/23299552 http://dx.doi.org/10.3390/molecules18010735 Text en © 2013 by the authors; licensee MDPI, Basel, Switzerland. http://creativecommons.org/licenses/by/3.0/ This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle	Article Butkiewicz, Mariusz Lowe, Edward W. Mueller, Ralf Mendenhall, Jeffrey L. Teixeira, Pedro L. Weaver, C. David Meiler, Jens Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database
title	Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database
title_full	Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database
title_fullStr	Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database
title_full_unstemmed	Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database
title_short	Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database
title_sort	benchmarking ligand-based virtual high-throughput screening with the pubchem database
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3759399/ https://www.ncbi.nlm.nih.gov/pubmed/23299552 http://dx.doi.org/10.3390/molecules18010735
work_keys_str_mv	AT butkiewiczmariusz benchmarkingligandbasedvirtualhighthroughputscreeningwiththepubchemdatabase AT loweedwardw benchmarkingligandbasedvirtualhighthroughputscreeningwiththepubchemdatabase AT muellerralf benchmarkingligandbasedvirtualhighthroughputscreeningwiththepubchemdatabase AT mendenhalljeffreyl benchmarkingligandbasedvirtualhighthroughputscreeningwiththepubchemdatabase AT teixeirapedrol benchmarkingligandbasedvirtualhighthroughputscreeningwiththepubchemdatabase AT weavercdavid benchmarkingligandbasedvirtualhighthroughputscreeningwiththepubchemdatabase AT meilerjens benchmarkingligandbasedvirtualhighthroughputscreeningwiththepubchemdatabase

Benchmarking Ligand-Based Virtual High-Throughput Screening with the PubChem Database

Ejemplares similares