Cargando…

Data governance in predictive toxicology: A review

BACKGROUND: Due to recent advances in data storage and sharing for further data processing in predictive toxicology, there is an increasing need for flexible data representations, secure and consistent data curation and automated data quality checking. Toxicity prediction involves multidisciplinary...

Descripción completa

Detalles Bibliográficos
Autores principales: Fu, Xin, Wojak, Anna, Neagu, Daniel, Ridley, Mick, Travis, Kim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3584675/
https://www.ncbi.nlm.nih.gov/pubmed/21752279
http://dx.doi.org/10.1186/1758-2946-3-24
_version_ 1782261055958810624
author Fu, Xin
Wojak, Anna
Neagu, Daniel
Ridley, Mick
Travis, Kim
author_facet Fu, Xin
Wojak, Anna
Neagu, Daniel
Ridley, Mick
Travis, Kim
author_sort Fu, Xin
collection PubMed
description BACKGROUND: Due to recent advances in data storage and sharing for further data processing in predictive toxicology, there is an increasing need for flexible data representations, secure and consistent data curation and automated data quality checking. Toxicity prediction involves multidisciplinary data. There are hundreds of collections of chemical, biological and toxicological data that are widely dispersed, mostly in the open literature, professional research bodies and commercial companies. In order to better manage and make full use of such large amount of toxicity data, there is a trend to develop functionalities aiming towards data governance in predictive toxicology to formalise a set of processes to guarantee high data quality and better data management. In this paper, data quality mainly refers in a data storage sense (e.g. accuracy, completeness and integrity) and not in a toxicological sense (e.g. the quality of experimental results). RESULTS: This paper reviews seven widely used predictive toxicology data sources and applications, with a particular focus on their data governance aspects, including: data accuracy, data completeness, data integrity, metadata and its management, data availability and data authorisation. This review reveals the current problems (e.g. lack of systematic and standard measures of data quality) and desirable needs (e.g. better management and further use of captured metadata and the development of flexible multi-level user access authorisation schemas) of predictive toxicology data sources development. The analytical results will help to address a significant gap in toxicology data quality assessment and lead to the development of novel frameworks for predictive toxicology data and model governance. CONCLUSIONS: While the discussed public data sources are well developed, there nevertheless remain some gaps in the development of a data governance framework to support predictive toxicology. In this paper, data governance is identified as the new challenge in predictive toxicology, and a good use of it may provide a promising framework for developing high quality and easy accessible toxicity data repositories. This paper also identifies important research directions that require further investigation in this area.
format Online
Article
Text
id pubmed-3584675
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35846752013-03-01 Data governance in predictive toxicology: A review Fu, Xin Wojak, Anna Neagu, Daniel Ridley, Mick Travis, Kim J Cheminform Review BACKGROUND: Due to recent advances in data storage and sharing for further data processing in predictive toxicology, there is an increasing need for flexible data representations, secure and consistent data curation and automated data quality checking. Toxicity prediction involves multidisciplinary data. There are hundreds of collections of chemical, biological and toxicological data that are widely dispersed, mostly in the open literature, professional research bodies and commercial companies. In order to better manage and make full use of such large amount of toxicity data, there is a trend to develop functionalities aiming towards data governance in predictive toxicology to formalise a set of processes to guarantee high data quality and better data management. In this paper, data quality mainly refers in a data storage sense (e.g. accuracy, completeness and integrity) and not in a toxicological sense (e.g. the quality of experimental results). RESULTS: This paper reviews seven widely used predictive toxicology data sources and applications, with a particular focus on their data governance aspects, including: data accuracy, data completeness, data integrity, metadata and its management, data availability and data authorisation. This review reveals the current problems (e.g. lack of systematic and standard measures of data quality) and desirable needs (e.g. better management and further use of captured metadata and the development of flexible multi-level user access authorisation schemas) of predictive toxicology data sources development. The analytical results will help to address a significant gap in toxicology data quality assessment and lead to the development of novel frameworks for predictive toxicology data and model governance. CONCLUSIONS: While the discussed public data sources are well developed, there nevertheless remain some gaps in the development of a data governance framework to support predictive toxicology. In this paper, data governance is identified as the new challenge in predictive toxicology, and a good use of it may provide a promising framework for developing high quality and easy accessible toxicity data repositories. This paper also identifies important research directions that require further investigation in this area. BioMed Central 2011-07-13 /pmc/articles/PMC3584675/ /pubmed/21752279 http://dx.doi.org/10.1186/1758-2946-3-24 Text en Copyright ©2011 Fu et al; licensee Chemistry Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Review
Fu, Xin
Wojak, Anna
Neagu, Daniel
Ridley, Mick
Travis, Kim
Data governance in predictive toxicology: A review
title Data governance in predictive toxicology: A review
title_full Data governance in predictive toxicology: A review
title_fullStr Data governance in predictive toxicology: A review
title_full_unstemmed Data governance in predictive toxicology: A review
title_short Data governance in predictive toxicology: A review
title_sort data governance in predictive toxicology: a review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3584675/
https://www.ncbi.nlm.nih.gov/pubmed/21752279
http://dx.doi.org/10.1186/1758-2946-3-24
work_keys_str_mv AT fuxin datagovernanceinpredictivetoxicologyareview
AT wojakanna datagovernanceinpredictivetoxicologyareview
AT neagudaniel datagovernanceinpredictivetoxicologyareview
AT ridleymick datagovernanceinpredictivetoxicologyareview
AT traviskim datagovernanceinpredictivetoxicologyareview