Cargando…

Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance

A key consideration at the screening stages of drug discovery is in vitro metabolic stability, often measured in human liver microsomes. Computational prediction models can be built using a large quantity of experimental data available from public databases, but these databases typically contain dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Esaki, Tsuyoshi, Watanabe, Reiko, Kawashima, Hitoshi, Ohashi, Rikiya, Natsume‐Kitatani, Yayoi, Nagao, Chioko, Mizuguchi, Kenji
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6586005/
https://www.ncbi.nlm.nih.gov/pubmed/30247811
http://dx.doi.org/10.1002/minf.201800086
_version_ 1783428818705317888
author Esaki, Tsuyoshi
Watanabe, Reiko
Kawashima, Hitoshi
Ohashi, Rikiya
Natsume‐Kitatani, Yayoi
Nagao, Chioko
Mizuguchi, Kenji
author_facet Esaki, Tsuyoshi
Watanabe, Reiko
Kawashima, Hitoshi
Ohashi, Rikiya
Natsume‐Kitatani, Yayoi
Nagao, Chioko
Mizuguchi, Kenji
author_sort Esaki, Tsuyoshi
collection PubMed
description A key consideration at the screening stages of drug discovery is in vitro metabolic stability, often measured in human liver microsomes. Computational prediction models can be built using a large quantity of experimental data available from public databases, but these databases typically contain data measured using various protocols in different laboratories, raising the issue of data quality. In this study, we retrieved the intrinsic clearance (CL (int)) measurements from an open database and performed extensive manual curation. Then, chemical descriptors were calculated using freely available software, and prediction models were built using machine learning algorithms. The models trained on the curated data showed better performance than those trained on the non‐curated data and achieved performance comparable to previously published models, showing the importance of manual curation in data preparation. The curated data were made available, to make our models fully reproducible.
format Online
Article
Text
id pubmed-6586005
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-65860052019-06-27 Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance Esaki, Tsuyoshi Watanabe, Reiko Kawashima, Hitoshi Ohashi, Rikiya Natsume‐Kitatani, Yayoi Nagao, Chioko Mizuguchi, Kenji Mol Inform Full Papers A key consideration at the screening stages of drug discovery is in vitro metabolic stability, often measured in human liver microsomes. Computational prediction models can be built using a large quantity of experimental data available from public databases, but these databases typically contain data measured using various protocols in different laboratories, raising the issue of data quality. In this study, we retrieved the intrinsic clearance (CL (int)) measurements from an open database and performed extensive manual curation. Then, chemical descriptors were calculated using freely available software, and prediction models were built using machine learning algorithms. The models trained on the curated data showed better performance than those trained on the non‐curated data and achieved performance comparable to previously published models, showing the importance of manual curation in data preparation. The curated data were made available, to make our models fully reproducible. John Wiley and Sons Inc. 2018-09-24 2019-01 /pmc/articles/PMC6586005/ /pubmed/30247811 http://dx.doi.org/10.1002/minf.201800086 Text en © 2018 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Full Papers
Esaki, Tsuyoshi
Watanabe, Reiko
Kawashima, Hitoshi
Ohashi, Rikiya
Natsume‐Kitatani, Yayoi
Nagao, Chioko
Mizuguchi, Kenji
Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance
title Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance
title_full Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance
title_fullStr Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance
title_full_unstemmed Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance
title_short Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance
title_sort data curation can improve the prediction accuracy of metabolic intrinsic clearance
topic Full Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6586005/
https://www.ncbi.nlm.nih.gov/pubmed/30247811
http://dx.doi.org/10.1002/minf.201800086
work_keys_str_mv AT esakitsuyoshi datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance
AT watanabereiko datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance
AT kawashimahitoshi datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance
AT ohashirikiya datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance
AT natsumekitataniyayoi datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance
AT nagaochioko datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance
AT mizuguchikenji datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance