Cargando…
Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance
A key consideration at the screening stages of drug discovery is in vitro metabolic stability, often measured in human liver microsomes. Computational prediction models can be built using a large quantity of experimental data available from public databases, but these databases typically contain dat...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6586005/ https://www.ncbi.nlm.nih.gov/pubmed/30247811 http://dx.doi.org/10.1002/minf.201800086 |
_version_ | 1783428818705317888 |
---|---|
author | Esaki, Tsuyoshi Watanabe, Reiko Kawashima, Hitoshi Ohashi, Rikiya Natsume‐Kitatani, Yayoi Nagao, Chioko Mizuguchi, Kenji |
author_facet | Esaki, Tsuyoshi Watanabe, Reiko Kawashima, Hitoshi Ohashi, Rikiya Natsume‐Kitatani, Yayoi Nagao, Chioko Mizuguchi, Kenji |
author_sort | Esaki, Tsuyoshi |
collection | PubMed |
description | A key consideration at the screening stages of drug discovery is in vitro metabolic stability, often measured in human liver microsomes. Computational prediction models can be built using a large quantity of experimental data available from public databases, but these databases typically contain data measured using various protocols in different laboratories, raising the issue of data quality. In this study, we retrieved the intrinsic clearance (CL (int)) measurements from an open database and performed extensive manual curation. Then, chemical descriptors were calculated using freely available software, and prediction models were built using machine learning algorithms. The models trained on the curated data showed better performance than those trained on the non‐curated data and achieved performance comparable to previously published models, showing the importance of manual curation in data preparation. The curated data were made available, to make our models fully reproducible. |
format | Online Article Text |
id | pubmed-6586005 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-65860052019-06-27 Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance Esaki, Tsuyoshi Watanabe, Reiko Kawashima, Hitoshi Ohashi, Rikiya Natsume‐Kitatani, Yayoi Nagao, Chioko Mizuguchi, Kenji Mol Inform Full Papers A key consideration at the screening stages of drug discovery is in vitro metabolic stability, often measured in human liver microsomes. Computational prediction models can be built using a large quantity of experimental data available from public databases, but these databases typically contain data measured using various protocols in different laboratories, raising the issue of data quality. In this study, we retrieved the intrinsic clearance (CL (int)) measurements from an open database and performed extensive manual curation. Then, chemical descriptors were calculated using freely available software, and prediction models were built using machine learning algorithms. The models trained on the curated data showed better performance than those trained on the non‐curated data and achieved performance comparable to previously published models, showing the importance of manual curation in data preparation. The curated data were made available, to make our models fully reproducible. John Wiley and Sons Inc. 2018-09-24 2019-01 /pmc/articles/PMC6586005/ /pubmed/30247811 http://dx.doi.org/10.1002/minf.201800086 Text en © 2018 The Authors. Published by Wiley-VCH Verlag GmbH & Co. KGaA. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Full Papers Esaki, Tsuyoshi Watanabe, Reiko Kawashima, Hitoshi Ohashi, Rikiya Natsume‐Kitatani, Yayoi Nagao, Chioko Mizuguchi, Kenji Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance |
title | Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance |
title_full | Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance |
title_fullStr | Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance |
title_full_unstemmed | Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance |
title_short | Data Curation can Improve the Prediction Accuracy of Metabolic Intrinsic Clearance |
title_sort | data curation can improve the prediction accuracy of metabolic intrinsic clearance |
topic | Full Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6586005/ https://www.ncbi.nlm.nih.gov/pubmed/30247811 http://dx.doi.org/10.1002/minf.201800086 |
work_keys_str_mv | AT esakitsuyoshi datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance AT watanabereiko datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance AT kawashimahitoshi datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance AT ohashirikiya datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance AT natsumekitataniyayoi datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance AT nagaochioko datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance AT mizuguchikenji datacurationcanimprovethepredictionaccuracyofmetabolicintrinsicclearance |