Cargando…

How can natural language processing help model informed drug development?: a review

OBJECTIVE: To summarize applications of natural language processing (NLP) in model informed drug development (MIDD) and identify potential areas of improvement. MATERIALS AND METHODS: Publications found on PubMed and Google Scholar, websites and GitHub repositories for NLP libraries and models. Publ...

Descripción completa

Detalles Bibliográficos
Autores principales: Bhatnagar, Roopal, Sardar, Sakshi, Beheshti, Maedeh, Podichetty, Jagdeep T
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9188322/
https://www.ncbi.nlm.nih.gov/pubmed/35702625
http://dx.doi.org/10.1093/jamiaopen/ooac043
_version_ 1784725349258493952
author Bhatnagar, Roopal
Sardar, Sakshi
Beheshti, Maedeh
Podichetty, Jagdeep T
author_facet Bhatnagar, Roopal
Sardar, Sakshi
Beheshti, Maedeh
Podichetty, Jagdeep T
author_sort Bhatnagar, Roopal
collection PubMed
description OBJECTIVE: To summarize applications of natural language processing (NLP) in model informed drug development (MIDD) and identify potential areas of improvement. MATERIALS AND METHODS: Publications found on PubMed and Google Scholar, websites and GitHub repositories for NLP libraries and models. Publications describing applications of NLP in MIDD were reviewed. The applications were stratified into 3 stages: drug discovery, clinical trials, and pharmacovigilance. Key NLP functionalities used for these applications were assessed. Programming libraries and open-source resources for the implementation of NLP functionalities in MIDD were identified. RESULTS: NLP has been utilized to aid various processes in drug development lifecycle such as gene-disease mapping, biomarker discovery, patient-trial matching, adverse drug events detection, etc. These applications commonly use NLP functionalities of named entity recognition, word embeddings, entity resolution, assertion status detection, relation extraction, and topic modeling. The current state-of-the-art for implementing these functionalities in MIDD applications are transformer models that utilize transfer learning for enhanced performance. Various libraries in python, R, and Java like huggingface, sparkNLP, and KoRpus as well as open-source platforms such as DisGeNet, DeepEnroll, and Transmol have enabled convenient implementation of NLP models to MIDD applications. DISCUSSION: Challenges such as reproducibility, explainability, fairness, limited data, limited language-support, and security need to be overcome to ensure wider adoption of NLP in MIDD landscape. There are opportunities to improve the performance of existing models and expand the use of NLP in newer areas of MIDD. CONCLUSIONS: This review provides an overview of the potential and pitfalls of current NLP approaches in MIDD.
format Online
Article
Text
id pubmed-9188322
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-91883222022-06-13 How can natural language processing help model informed drug development?: a review Bhatnagar, Roopal Sardar, Sakshi Beheshti, Maedeh Podichetty, Jagdeep T JAMIA Open Review OBJECTIVE: To summarize applications of natural language processing (NLP) in model informed drug development (MIDD) and identify potential areas of improvement. MATERIALS AND METHODS: Publications found on PubMed and Google Scholar, websites and GitHub repositories for NLP libraries and models. Publications describing applications of NLP in MIDD were reviewed. The applications were stratified into 3 stages: drug discovery, clinical trials, and pharmacovigilance. Key NLP functionalities used for these applications were assessed. Programming libraries and open-source resources for the implementation of NLP functionalities in MIDD were identified. RESULTS: NLP has been utilized to aid various processes in drug development lifecycle such as gene-disease mapping, biomarker discovery, patient-trial matching, adverse drug events detection, etc. These applications commonly use NLP functionalities of named entity recognition, word embeddings, entity resolution, assertion status detection, relation extraction, and topic modeling. The current state-of-the-art for implementing these functionalities in MIDD applications are transformer models that utilize transfer learning for enhanced performance. Various libraries in python, R, and Java like huggingface, sparkNLP, and KoRpus as well as open-source platforms such as DisGeNet, DeepEnroll, and Transmol have enabled convenient implementation of NLP models to MIDD applications. DISCUSSION: Challenges such as reproducibility, explainability, fairness, limited data, limited language-support, and security need to be overcome to ensure wider adoption of NLP in MIDD landscape. There are opportunities to improve the performance of existing models and expand the use of NLP in newer areas of MIDD. CONCLUSIONS: This review provides an overview of the potential and pitfalls of current NLP approaches in MIDD. Oxford University Press 2022-06-11 /pmc/articles/PMC9188322/ /pubmed/35702625 http://dx.doi.org/10.1093/jamiaopen/ooac043 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of the American Medical Informatics Association. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Review
Bhatnagar, Roopal
Sardar, Sakshi
Beheshti, Maedeh
Podichetty, Jagdeep T
How can natural language processing help model informed drug development?: a review
title How can natural language processing help model informed drug development?: a review
title_full How can natural language processing help model informed drug development?: a review
title_fullStr How can natural language processing help model informed drug development?: a review
title_full_unstemmed How can natural language processing help model informed drug development?: a review
title_short How can natural language processing help model informed drug development?: a review
title_sort how can natural language processing help model informed drug development?: a review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9188322/
https://www.ncbi.nlm.nih.gov/pubmed/35702625
http://dx.doi.org/10.1093/jamiaopen/ooac043
work_keys_str_mv AT bhatnagarroopal howcannaturallanguageprocessinghelpmodelinformeddrugdevelopmentareview
AT sardarsakshi howcannaturallanguageprocessinghelpmodelinformeddrugdevelopmentareview
AT beheshtimaedeh howcannaturallanguageprocessinghelpmodelinformeddrugdevelopmentareview
AT podichettyjagdeept howcannaturallanguageprocessinghelpmodelinformeddrugdevelopmentareview