Cargando…

QSAR DataBank - an approach for the digital organization and archiving of QSAR model information

BACKGROUND: Research efforts in the field of descriptive and predictive Quantitative Structure-Activity Relationships or Quantitative Structure–Property Relationships produce around one thousand scientific publications annually. All the materials and results are mainly communicated using printed med...

Descripción completa

Detalles Bibliográficos
Autores principales: Ruusmann, Villu, Sild, Sulev, Maran, Uko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4047268/
https://www.ncbi.nlm.nih.gov/pubmed/24910716
http://dx.doi.org/10.1186/1758-2946-6-25
_version_ 1782480386090074112
author Ruusmann, Villu
Sild, Sulev
Maran, Uko
author_facet Ruusmann, Villu
Sild, Sulev
Maran, Uko
author_sort Ruusmann, Villu
collection PubMed
description BACKGROUND: Research efforts in the field of descriptive and predictive Quantitative Structure-Activity Relationships or Quantitative Structure–Property Relationships produce around one thousand scientific publications annually. All the materials and results are mainly communicated using printed media. The printed media in its present form have obvious limitations when they come to effectively representing mathematical models, including complex and non-linear, and large bodies of associated numerical chemical data. It is not supportive of secondary information extraction or reuse efforts while in silico studies poses additional requirements for accessibility, transparency and reproducibility of the research. This gap can and should be bridged by introducing domain-specific digital data exchange standards and tools. The current publication presents a formal specification of the quantitative structure-activity relationship data organization and archival format called the QSAR DataBank (QsarDB for shorter, or QDB for shortest). RESULTS: The article describes QsarDB data schema, which formalizes QSAR concepts (objects and relationships between them) and QsarDB data format, which formalizes their presentation for computer systems. The utility and benefits of QsarDB have been thoroughly tested by solving everyday QSAR and predictive modeling problems, with examples in the field of predictive toxicology, and can be applied for a wide variety of other endpoints. The work is accompanied with open source reference implementation and tools. CONCLUSIONS: The proposed open data, open source, and open standards design is open to public and proprietary extensions on many levels. Selected use cases exemplify the benefits of the proposed QsarDB data format. General ideas for future development are discussed.
format Online
Article
Text
id pubmed-4047268
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40472682014-06-07 QSAR DataBank - an approach for the digital organization and archiving of QSAR model information Ruusmann, Villu Sild, Sulev Maran, Uko J Cheminform Methodology BACKGROUND: Research efforts in the field of descriptive and predictive Quantitative Structure-Activity Relationships or Quantitative Structure–Property Relationships produce around one thousand scientific publications annually. All the materials and results are mainly communicated using printed media. The printed media in its present form have obvious limitations when they come to effectively representing mathematical models, including complex and non-linear, and large bodies of associated numerical chemical data. It is not supportive of secondary information extraction or reuse efforts while in silico studies poses additional requirements for accessibility, transparency and reproducibility of the research. This gap can and should be bridged by introducing domain-specific digital data exchange standards and tools. The current publication presents a formal specification of the quantitative structure-activity relationship data organization and archival format called the QSAR DataBank (QsarDB for shorter, or QDB for shortest). RESULTS: The article describes QsarDB data schema, which formalizes QSAR concepts (objects and relationships between them) and QsarDB data format, which formalizes their presentation for computer systems. The utility and benefits of QsarDB have been thoroughly tested by solving everyday QSAR and predictive modeling problems, with examples in the field of predictive toxicology, and can be applied for a wide variety of other endpoints. The work is accompanied with open source reference implementation and tools. CONCLUSIONS: The proposed open data, open source, and open standards design is open to public and proprietary extensions on many levels. Selected use cases exemplify the benefits of the proposed QsarDB data format. General ideas for future development are discussed. BioMed Central 2014-05-14 /pmc/articles/PMC4047268/ /pubmed/24910716 http://dx.doi.org/10.1186/1758-2946-6-25 Text en Copyright © 2014 Ruusmann et al.; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Ruusmann, Villu
Sild, Sulev
Maran, Uko
QSAR DataBank - an approach for the digital organization and archiving of QSAR model information
title QSAR DataBank - an approach for the digital organization and archiving of QSAR model information
title_full QSAR DataBank - an approach for the digital organization and archiving of QSAR model information
title_fullStr QSAR DataBank - an approach for the digital organization and archiving of QSAR model information
title_full_unstemmed QSAR DataBank - an approach for the digital organization and archiving of QSAR model information
title_short QSAR DataBank - an approach for the digital organization and archiving of QSAR model information
title_sort qsar databank - an approach for the digital organization and archiving of qsar model information
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4047268/
https://www.ncbi.nlm.nih.gov/pubmed/24910716
http://dx.doi.org/10.1186/1758-2946-6-25
work_keys_str_mv AT ruusmannvillu qsardatabankanapproachforthedigitalorganizationandarchivingofqsarmodelinformation
AT sildsulev qsardatabankanapproachforthedigitalorganizationandarchivingofqsarmodelinformation
AT maranuko qsardatabankanapproachforthedigitalorganizationandarchivingofqsarmodelinformation