Cargando…

An automated approach to identify scientific publications reporting pharmacokinetic parameters

Pharmacokinetic (PK) predictions of new chemical entities are aided by prior knowledge from other compounds. The development of robust algorithms that improve preclinical and clinical phases of drug development remains constrained by the need to search, curate and standardise PK information across t...

Descripción completa

Detalles Bibliográficos
Autores principales: Gonzalez Hernandez, Ferran, Carter, Simon J, Iso-Sipilä, Juha, Goldsmith, Paul, Almousa, Ahmed A., Gastine, Silke, Lilaonitkul, Watjana, Kloprogge, Frank, Standing, Joseph F
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8343403/
https://www.ncbi.nlm.nih.gov/pubmed/34381873
http://dx.doi.org/10.12688/wellcomeopenres.16718.1
_version_ 1783734282707009536
author Gonzalez Hernandez, Ferran
Carter, Simon J
Iso-Sipilä, Juha
Goldsmith, Paul
Almousa, Ahmed A.
Gastine, Silke
Lilaonitkul, Watjana
Kloprogge, Frank
Standing, Joseph F
author_facet Gonzalez Hernandez, Ferran
Carter, Simon J
Iso-Sipilä, Juha
Goldsmith, Paul
Almousa, Ahmed A.
Gastine, Silke
Lilaonitkul, Watjana
Kloprogge, Frank
Standing, Joseph F
author_sort Gonzalez Hernandez, Ferran
collection PubMed
description Pharmacokinetic (PK) predictions of new chemical entities are aided by prior knowledge from other compounds. The development of robust algorithms that improve preclinical and clinical phases of drug development remains constrained by the need to search, curate and standardise PK information across the constantly-growing scientific literature. The lack of centralised, up-to-date and comprehensive repositories of PK data represents a significant limitation in the drug development pipeline.In this work, we propose a machine learning approach to automatically identify and characterise scientific publications reporting PK parameters from in vivo data, providing a centralised repository of PK literature. A dataset of 4,792 PubMed publications was labelled by field experts depending on whether in vivo PK parameters were estimated in the study. Different classification pipelines were compared using a bootstrap approach and the best-performing architecture was used to develop a comprehensive and automatically-updated repository of PK publications. The best-performing architecture encoded documents using unigram features and mean pooling of BioBERT embeddings obtaining an F1 score of 83.8% on the test set. The pipeline retrieved over 121K PubMed publications in which in vivo PK parameters were estimated and it was scheduled to perform weekly updates on newly published articles. All the relevant documents were released through a publicly available web interface (https://app.pkpdai.com) and characterised by the drugs, species and conditions mentioned in the abstract, to facilitate the subsequent search of relevant PK data. This automated, open-access repository can be used to accelerate the search and comparison of PK results, curate ADME datasets, and facilitate subsequent text mining tasks in the PK domain.
format Online
Article
Text
id pubmed-8343403
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-83434032021-08-10 An automated approach to identify scientific publications reporting pharmacokinetic parameters Gonzalez Hernandez, Ferran Carter, Simon J Iso-Sipilä, Juha Goldsmith, Paul Almousa, Ahmed A. Gastine, Silke Lilaonitkul, Watjana Kloprogge, Frank Standing, Joseph F Wellcome Open Res Method Article Pharmacokinetic (PK) predictions of new chemical entities are aided by prior knowledge from other compounds. The development of robust algorithms that improve preclinical and clinical phases of drug development remains constrained by the need to search, curate and standardise PK information across the constantly-growing scientific literature. The lack of centralised, up-to-date and comprehensive repositories of PK data represents a significant limitation in the drug development pipeline.In this work, we propose a machine learning approach to automatically identify and characterise scientific publications reporting PK parameters from in vivo data, providing a centralised repository of PK literature. A dataset of 4,792 PubMed publications was labelled by field experts depending on whether in vivo PK parameters were estimated in the study. Different classification pipelines were compared using a bootstrap approach and the best-performing architecture was used to develop a comprehensive and automatically-updated repository of PK publications. The best-performing architecture encoded documents using unigram features and mean pooling of BioBERT embeddings obtaining an F1 score of 83.8% on the test set. The pipeline retrieved over 121K PubMed publications in which in vivo PK parameters were estimated and it was scheduled to perform weekly updates on newly published articles. All the relevant documents were released through a publicly available web interface (https://app.pkpdai.com) and characterised by the drugs, species and conditions mentioned in the abstract, to facilitate the subsequent search of relevant PK data. This automated, open-access repository can be used to accelerate the search and comparison of PK results, curate ADME datasets, and facilitate subsequent text mining tasks in the PK domain. F1000 Research Limited 2021-04-21 /pmc/articles/PMC8343403/ /pubmed/34381873 http://dx.doi.org/10.12688/wellcomeopenres.16718.1 Text en Copyright: © 2021 Gonzalez Hernandez F et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Gonzalez Hernandez, Ferran
Carter, Simon J
Iso-Sipilä, Juha
Goldsmith, Paul
Almousa, Ahmed A.
Gastine, Silke
Lilaonitkul, Watjana
Kloprogge, Frank
Standing, Joseph F
An automated approach to identify scientific publications reporting pharmacokinetic parameters
title An automated approach to identify scientific publications reporting pharmacokinetic parameters
title_full An automated approach to identify scientific publications reporting pharmacokinetic parameters
title_fullStr An automated approach to identify scientific publications reporting pharmacokinetic parameters
title_full_unstemmed An automated approach to identify scientific publications reporting pharmacokinetic parameters
title_short An automated approach to identify scientific publications reporting pharmacokinetic parameters
title_sort automated approach to identify scientific publications reporting pharmacokinetic parameters
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8343403/
https://www.ncbi.nlm.nih.gov/pubmed/34381873
http://dx.doi.org/10.12688/wellcomeopenres.16718.1
work_keys_str_mv AT gonzalezhernandezferran anautomatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT cartersimonj anautomatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT isosipilajuha anautomatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT goldsmithpaul anautomatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT almousaahmeda anautomatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT gastinesilke anautomatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT lilaonitkulwatjana anautomatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT kloproggefrank anautomatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT standingjosephf anautomatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT gonzalezhernandezferran automatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT cartersimonj automatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT isosipilajuha automatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT goldsmithpaul automatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT almousaahmeda automatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT gastinesilke automatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT lilaonitkulwatjana automatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT kloproggefrank automatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters
AT standingjosephf automatedapproachtoidentifyscientificpublicationsreportingpharmacokineticparameters