Cargando…
Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records
We sought to craft a drug safety signalling pipeline associating latent information in clinical free text with exposures to single drugs and drug pairs. Data arose from 12 secondary and tertiary public hospitals in two Danish regions, comprising approximately half the Danish population. Notes were o...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9541191/ https://www.ncbi.nlm.nih.gov/pubmed/35834334 http://dx.doi.org/10.1111/bcpt.13773 |
_version_ | 1784803870666391552 |
---|---|
author | Kaas‐Hansen, Benjamin Skov Placido, Davide Rodríguez, Cristina Leal Thorsen‐Meyer, Hans‐Christian Gentile, Simona Nielsen, Anna Pors Brunak, Søren Jürgens, Gesche Andersen, Stig Ejdrup |
author_facet | Kaas‐Hansen, Benjamin Skov Placido, Davide Rodríguez, Cristina Leal Thorsen‐Meyer, Hans‐Christian Gentile, Simona Nielsen, Anna Pors Brunak, Søren Jürgens, Gesche Andersen, Stig Ejdrup |
author_sort | Kaas‐Hansen, Benjamin Skov |
collection | PubMed |
description | We sought to craft a drug safety signalling pipeline associating latent information in clinical free text with exposures to single drugs and drug pairs. Data arose from 12 secondary and tertiary public hospitals in two Danish regions, comprising approximately half the Danish population. Notes were operationalised with a fastText embedding, based on which we trained 10 270 neural‐network models (one for each distinct single‐drug/drug‐pair exposure) predicting the risk of exposure given an embedding vector. We included 2 905 251 admissions between May 2008 and June 2016, with 13 740 564 distinct drug prescriptions; the median number of prescriptions was 5 (IQR: 3–9) and in 1 184 340 (41%) admissions patients used ≥5 drugs concomitantly. A total of 10 788 259 clinical notes were included, with 179 441 739 tokens retained after pruning. Of 345 single‐drug signals reviewed, 28 (8.1%) represented possibly undescribed relationships; 186 (54%) signals were clinically meaningful. Sixteen (14%) of the 115 drug‐pair signals were possible interactions, and two (1.7%) were known. In conclusion, we built a language‐agnostic pipeline for mining associations between free‐text information and medication exposure without manual curation, predicting not the likely outcome of a range of exposures but also the likely exposures for outcomes of interest. Our approach may help overcome limitations of text mining methods relying on curated data in English and can help leverage non‐English free text for pharmacovigilance. |
format | Online Article Text |
id | pubmed-9541191 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95411912022-10-14 Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records Kaas‐Hansen, Benjamin Skov Placido, Davide Rodríguez, Cristina Leal Thorsen‐Meyer, Hans‐Christian Gentile, Simona Nielsen, Anna Pors Brunak, Søren Jürgens, Gesche Andersen, Stig Ejdrup Basic Clin Pharmacol Toxicol ORIGINAL ARTICLES We sought to craft a drug safety signalling pipeline associating latent information in clinical free text with exposures to single drugs and drug pairs. Data arose from 12 secondary and tertiary public hospitals in two Danish regions, comprising approximately half the Danish population. Notes were operationalised with a fastText embedding, based on which we trained 10 270 neural‐network models (one for each distinct single‐drug/drug‐pair exposure) predicting the risk of exposure given an embedding vector. We included 2 905 251 admissions between May 2008 and June 2016, with 13 740 564 distinct drug prescriptions; the median number of prescriptions was 5 (IQR: 3–9) and in 1 184 340 (41%) admissions patients used ≥5 drugs concomitantly. A total of 10 788 259 clinical notes were included, with 179 441 739 tokens retained after pruning. Of 345 single‐drug signals reviewed, 28 (8.1%) represented possibly undescribed relationships; 186 (54%) signals were clinically meaningful. Sixteen (14%) of the 115 drug‐pair signals were possible interactions, and two (1.7%) were known. In conclusion, we built a language‐agnostic pipeline for mining associations between free‐text information and medication exposure without manual curation, predicting not the likely outcome of a range of exposures but also the likely exposures for outcomes of interest. Our approach may help overcome limitations of text mining methods relying on curated data in English and can help leverage non‐English free text for pharmacovigilance. John Wiley and Sons Inc. 2022-07-26 2022-10 /pmc/articles/PMC9541191/ /pubmed/35834334 http://dx.doi.org/10.1111/bcpt.13773 Text en © 2022 The Authors. Basic & Clinical Pharmacology & Toxicology published by John Wiley & Sons Ltd on behalf of Nordic Association for the Publication of BCPT (former Nordic Pharmacological Society). https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. |
spellingShingle | ORIGINAL ARTICLES Kaas‐Hansen, Benjamin Skov Placido, Davide Rodríguez, Cristina Leal Thorsen‐Meyer, Hans‐Christian Gentile, Simona Nielsen, Anna Pors Brunak, Søren Jürgens, Gesche Andersen, Stig Ejdrup Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records |
title | Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records |
title_full | Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records |
title_fullStr | Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records |
title_full_unstemmed | Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records |
title_short | Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records |
title_sort | language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records |
topic | ORIGINAL ARTICLES |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9541191/ https://www.ncbi.nlm.nih.gov/pubmed/35834334 http://dx.doi.org/10.1111/bcpt.13773 |
work_keys_str_mv | AT kaashansenbenjaminskov languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords AT placidodavide languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords AT rodriguezcristinaleal languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords AT thorsenmeyerhanschristian languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords AT gentilesimona languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords AT nielsenannapors languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords AT brunaksøren languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords AT jurgensgesche languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords AT andersenstigejdrup languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords |