Cargando…

eTOXlab, an open source modeling framework for implementing predictive models in production environments

BACKGROUND: Computational models based in Quantitative-Structure Activity Relationship (QSAR) methodologies are widely used tools for predicting the biological properties of new compounds. In many instances, such models are used as a routine in the industry (e.g. food, cosmetic or pharmaceutical ind...

Descripción completa

Detalles Bibliográficos
Autores principales: Carrió, Pau, López, Oriol, Sanz, Ferran, Pastor, Manuel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4358905/
https://www.ncbi.nlm.nih.gov/pubmed/25774224
http://dx.doi.org/10.1186/s13321-015-0058-6
_version_ 1782361306160955392
author Carrió, Pau
López, Oriol
Sanz, Ferran
Pastor, Manuel
author_facet Carrió, Pau
López, Oriol
Sanz, Ferran
Pastor, Manuel
author_sort Carrió, Pau
collection PubMed
description BACKGROUND: Computational models based in Quantitative-Structure Activity Relationship (QSAR) methodologies are widely used tools for predicting the biological properties of new compounds. In many instances, such models are used as a routine in the industry (e.g. food, cosmetic or pharmaceutical industry) for the early assessment of the biological properties of new compounds. However, most of the tools currently available for developing QSAR models are not well suited for supporting the whole QSAR model life cycle in production environments. RESULTS: We have developed eTOXlab; an open source modeling framework designed to be used at the core of a self-contained virtual machine that can be easily deployed in production environments, providing predictions as web services. eTOXlab consists on a collection of object-oriented Python modules with methods mapping common tasks of standard modeling workflows. This framework allows building and validating QSAR models as well as predicting the properties of new compounds using either a command line interface or a graphic user interface (GUI). Simple models can be easily generated by setting a few parameters, while more complex models can be implemented by overriding pieces of the original source code. eTOXlab benefits from the object-oriented capabilities of Python for providing high flexibility: any model implemented using eTOXlab inherits the features implemented in the parent model, like common tools and services or the automatic exposure of the models as prediction web services. The particular eTOXlab architecture as a self-contained, portable prediction engine allows building models with confidential information within corporate facilities, which can be safely exported and used for prediction without disclosing the structures of the training series. CONCLUSIONS: The software presented here provides full support to the specific needs of users that want to develop, use and maintain predictive models in corporate environments. The technologies used by eTOXlab (web services, VM, object-oriented programming) provide an elegant solution to common practical issues; the system can be installed easily in heterogeneous environments and integrates well with other software. Moreover, the system provides a simple and safe solution for building models with confidential structures that can be shared without disclosing sensitive information. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-015-0058-6) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4358905
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-43589052015-03-14 eTOXlab, an open source modeling framework for implementing predictive models in production environments Carrió, Pau López, Oriol Sanz, Ferran Pastor, Manuel J Cheminform Software BACKGROUND: Computational models based in Quantitative-Structure Activity Relationship (QSAR) methodologies are widely used tools for predicting the biological properties of new compounds. In many instances, such models are used as a routine in the industry (e.g. food, cosmetic or pharmaceutical industry) for the early assessment of the biological properties of new compounds. However, most of the tools currently available for developing QSAR models are not well suited for supporting the whole QSAR model life cycle in production environments. RESULTS: We have developed eTOXlab; an open source modeling framework designed to be used at the core of a self-contained virtual machine that can be easily deployed in production environments, providing predictions as web services. eTOXlab consists on a collection of object-oriented Python modules with methods mapping common tasks of standard modeling workflows. This framework allows building and validating QSAR models as well as predicting the properties of new compounds using either a command line interface or a graphic user interface (GUI). Simple models can be easily generated by setting a few parameters, while more complex models can be implemented by overriding pieces of the original source code. eTOXlab benefits from the object-oriented capabilities of Python for providing high flexibility: any model implemented using eTOXlab inherits the features implemented in the parent model, like common tools and services or the automatic exposure of the models as prediction web services. The particular eTOXlab architecture as a self-contained, portable prediction engine allows building models with confidential information within corporate facilities, which can be safely exported and used for prediction without disclosing the structures of the training series. CONCLUSIONS: The software presented here provides full support to the specific needs of users that want to develop, use and maintain predictive models in corporate environments. The technologies used by eTOXlab (web services, VM, object-oriented programming) provide an elegant solution to common practical issues; the system can be installed easily in heterogeneous environments and integrates well with other software. Moreover, the system provides a simple and safe solution for building models with confidential structures that can be shared without disclosing sensitive information. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13321-015-0058-6) contains supplementary material, which is available to authorized users. Springer International Publishing 2015-03-11 /pmc/articles/PMC4358905/ /pubmed/25774224 http://dx.doi.org/10.1186/s13321-015-0058-6 Text en © Carrió et al.; licensee Springer. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Software
Carrió, Pau
López, Oriol
Sanz, Ferran
Pastor, Manuel
eTOXlab, an open source modeling framework for implementing predictive models in production environments
title eTOXlab, an open source modeling framework for implementing predictive models in production environments
title_full eTOXlab, an open source modeling framework for implementing predictive models in production environments
title_fullStr eTOXlab, an open source modeling framework for implementing predictive models in production environments
title_full_unstemmed eTOXlab, an open source modeling framework for implementing predictive models in production environments
title_short eTOXlab, an open source modeling framework for implementing predictive models in production environments
title_sort etoxlab, an open source modeling framework for implementing predictive models in production environments
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4358905/
https://www.ncbi.nlm.nih.gov/pubmed/25774224
http://dx.doi.org/10.1186/s13321-015-0058-6
work_keys_str_mv AT carriopau etoxlabanopensourcemodelingframeworkforimplementingpredictivemodelsinproductionenvironments
AT lopezoriol etoxlabanopensourcemodelingframeworkforimplementingpredictivemodelsinproductionenvironments
AT sanzferran etoxlabanopensourcemodelingframeworkforimplementingpredictivemodelsinproductionenvironments
AT pastormanuel etoxlabanopensourcemodelingframeworkforimplementingpredictivemodelsinproductionenvironments