Cargando…

Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records

We sought to craft a drug safety signalling pipeline associating latent information in clinical free text with exposures to single drugs and drug pairs. Data arose from 12 secondary and tertiary public hospitals in two Danish regions, comprising approximately half the Danish population. Notes were o...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaas‐Hansen, Benjamin Skov, Placido, Davide, Rodríguez, Cristina Leal, Thorsen‐Meyer, Hans‐Christian, Gentile, Simona, Nielsen, Anna Pors, Brunak, Søren, Jürgens, Gesche, Andersen, Stig Ejdrup
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9541191/
https://www.ncbi.nlm.nih.gov/pubmed/35834334
http://dx.doi.org/10.1111/bcpt.13773
_version_ 1784803870666391552
author Kaas‐Hansen, Benjamin Skov
Placido, Davide
Rodríguez, Cristina Leal
Thorsen‐Meyer, Hans‐Christian
Gentile, Simona
Nielsen, Anna Pors
Brunak, Søren
Jürgens, Gesche
Andersen, Stig Ejdrup
author_facet Kaas‐Hansen, Benjamin Skov
Placido, Davide
Rodríguez, Cristina Leal
Thorsen‐Meyer, Hans‐Christian
Gentile, Simona
Nielsen, Anna Pors
Brunak, Søren
Jürgens, Gesche
Andersen, Stig Ejdrup
author_sort Kaas‐Hansen, Benjamin Skov
collection PubMed
description We sought to craft a drug safety signalling pipeline associating latent information in clinical free text with exposures to single drugs and drug pairs. Data arose from 12 secondary and tertiary public hospitals in two Danish regions, comprising approximately half the Danish population. Notes were operationalised with a fastText embedding, based on which we trained 10 270 neural‐network models (one for each distinct single‐drug/drug‐pair exposure) predicting the risk of exposure given an embedding vector. We included 2 905 251 admissions between May 2008 and June 2016, with 13 740 564 distinct drug prescriptions; the median number of prescriptions was 5 (IQR: 3–9) and in 1 184 340 (41%) admissions patients used ≥5 drugs concomitantly. A total of 10 788 259 clinical notes were included, with 179 441 739 tokens retained after pruning. Of 345 single‐drug signals reviewed, 28 (8.1%) represented possibly undescribed relationships; 186 (54%) signals were clinically meaningful. Sixteen (14%) of the 115 drug‐pair signals were possible interactions, and two (1.7%) were known. In conclusion, we built a language‐agnostic pipeline for mining associations between free‐text information and medication exposure without manual curation, predicting not the likely outcome of a range of exposures but also the likely exposures for outcomes of interest. Our approach may help overcome limitations of text mining methods relying on curated data in English and can help leverage non‐English free text for pharmacovigilance.
format Online
Article
Text
id pubmed-9541191
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-95411912022-10-14 Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records Kaas‐Hansen, Benjamin Skov Placido, Davide Rodríguez, Cristina Leal Thorsen‐Meyer, Hans‐Christian Gentile, Simona Nielsen, Anna Pors Brunak, Søren Jürgens, Gesche Andersen, Stig Ejdrup Basic Clin Pharmacol Toxicol ORIGINAL ARTICLES We sought to craft a drug safety signalling pipeline associating latent information in clinical free text with exposures to single drugs and drug pairs. Data arose from 12 secondary and tertiary public hospitals in two Danish regions, comprising approximately half the Danish population. Notes were operationalised with a fastText embedding, based on which we trained 10 270 neural‐network models (one for each distinct single‐drug/drug‐pair exposure) predicting the risk of exposure given an embedding vector. We included 2 905 251 admissions between May 2008 and June 2016, with 13 740 564 distinct drug prescriptions; the median number of prescriptions was 5 (IQR: 3–9) and in 1 184 340 (41%) admissions patients used ≥5 drugs concomitantly. A total of 10 788 259 clinical notes were included, with 179 441 739 tokens retained after pruning. Of 345 single‐drug signals reviewed, 28 (8.1%) represented possibly undescribed relationships; 186 (54%) signals were clinically meaningful. Sixteen (14%) of the 115 drug‐pair signals were possible interactions, and two (1.7%) were known. In conclusion, we built a language‐agnostic pipeline for mining associations between free‐text information and medication exposure without manual curation, predicting not the likely outcome of a range of exposures but also the likely exposures for outcomes of interest. Our approach may help overcome limitations of text mining methods relying on curated data in English and can help leverage non‐English free text for pharmacovigilance. John Wiley and Sons Inc. 2022-07-26 2022-10 /pmc/articles/PMC9541191/ /pubmed/35834334 http://dx.doi.org/10.1111/bcpt.13773 Text en © 2022 The Authors. Basic & Clinical Pharmacology & Toxicology published by John Wiley & Sons Ltd on behalf of Nordic Association for the Publication of BCPT (former Nordic Pharmacological Society). https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle ORIGINAL ARTICLES
Kaas‐Hansen, Benjamin Skov
Placido, Davide
Rodríguez, Cristina Leal
Thorsen‐Meyer, Hans‐Christian
Gentile, Simona
Nielsen, Anna Pors
Brunak, Søren
Jürgens, Gesche
Andersen, Stig Ejdrup
Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records
title Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records
title_full Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records
title_fullStr Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records
title_full_unstemmed Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records
title_short Language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records
title_sort language‐agnostic pharmacovigilant text mining to elicit side effects from clinical notes and hospital medication records
topic ORIGINAL ARTICLES
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9541191/
https://www.ncbi.nlm.nih.gov/pubmed/35834334
http://dx.doi.org/10.1111/bcpt.13773
work_keys_str_mv AT kaashansenbenjaminskov languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords
AT placidodavide languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords
AT rodriguezcristinaleal languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords
AT thorsenmeyerhanschristian languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords
AT gentilesimona languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords
AT nielsenannapors languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords
AT brunaksøren languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords
AT jurgensgesche languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords
AT andersenstigejdrup languageagnosticpharmacovigilanttextminingtoelicitsideeffectsfromclinicalnotesandhospitalmedicationrecords