Cargando…

Artificial Intelligence for Unstructured Healthcare Data: Application to Coding of Patient Reporting of Adverse Drug Reactions

Adverse drug reaction (ADR) reporting is a major component of drug safety monitoring; its input will, however, only be optimized if systems can manage to deal with its tremendous flow of information, based primarily on unstructured text fields. The aim of this study was to develop an automated syste...

Descripción completa

Detalles Bibliográficos
Autores principales: Létinier, Louis, Jouganous, Julien, Benkebil, Mehdi, Bel‐Létoile, Alicia, Goehrs, Clément, Singier, Allison, Rouby, Franck, Lacroix, Clémence, Miremont, Ghada, Micallef, Joëlle, Salvo, Francesco, Pariente, Antoine
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8359992/
https://www.ncbi.nlm.nih.gov/pubmed/33866552
http://dx.doi.org/10.1002/cpt.2266
_version_ 1783737651627556864
author Létinier, Louis
Jouganous, Julien
Benkebil, Mehdi
Bel‐Létoile, Alicia
Goehrs, Clément
Singier, Allison
Rouby, Franck
Lacroix, Clémence
Miremont, Ghada
Micallef, Joëlle
Salvo, Francesco
Pariente, Antoine
author_facet Létinier, Louis
Jouganous, Julien
Benkebil, Mehdi
Bel‐Létoile, Alicia
Goehrs, Clément
Singier, Allison
Rouby, Franck
Lacroix, Clémence
Miremont, Ghada
Micallef, Joëlle
Salvo, Francesco
Pariente, Antoine
author_sort Létinier, Louis
collection PubMed
description Adverse drug reaction (ADR) reporting is a major component of drug safety monitoring; its input will, however, only be optimized if systems can manage to deal with its tremendous flow of information, based primarily on unstructured text fields. The aim of this study was to develop an automated system allowing to code ADRs from patient reports. Our system was based on a knowledge base about drugs, enriched by supervised machine learning (ML) models trained on patients reporting data. To train our models, we selected all cases of ADRs reported by patients to a French Pharmacovigilance Centre through a national web‐portal between March 2017 and March 2019 (n = 2,058 reports). We tested both conventional ML models and deep‐learning models. We performed an external validation using a dataset constituted of a random sample of ADRs reported to the Marseille Pharmacovigilance Centre over the same period (n = 187). Here, we show that regarding area under the curve (AUC) and F‐measure, the best model to identify ADRs was gradient boosting trees (LGBM), with an AUC of 0.93 (0.92–0.94) and F‐measure of 0.72 (0.68–0.75). This model was run for external validation showing an AUC of 0.91 and a F‐measure of 0.58. We evaluated an artificial intelligence pipeline that was found able to learn how to identify correctly ADRs from unstructured data. This result allowed us to start a new study using more data to further improve our performance and offer a tool that is useful in practice to efficiently manage drug safety information.
format Online
Article
Text
id pubmed-8359992
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-83599922021-08-17 Artificial Intelligence for Unstructured Healthcare Data: Application to Coding of Patient Reporting of Adverse Drug Reactions Létinier, Louis Jouganous, Julien Benkebil, Mehdi Bel‐Létoile, Alicia Goehrs, Clément Singier, Allison Rouby, Franck Lacroix, Clémence Miremont, Ghada Micallef, Joëlle Salvo, Francesco Pariente, Antoine Clin Pharmacol Ther Research Adverse drug reaction (ADR) reporting is a major component of drug safety monitoring; its input will, however, only be optimized if systems can manage to deal with its tremendous flow of information, based primarily on unstructured text fields. The aim of this study was to develop an automated system allowing to code ADRs from patient reports. Our system was based on a knowledge base about drugs, enriched by supervised machine learning (ML) models trained on patients reporting data. To train our models, we selected all cases of ADRs reported by patients to a French Pharmacovigilance Centre through a national web‐portal between March 2017 and March 2019 (n = 2,058 reports). We tested both conventional ML models and deep‐learning models. We performed an external validation using a dataset constituted of a random sample of ADRs reported to the Marseille Pharmacovigilance Centre over the same period (n = 187). Here, we show that regarding area under the curve (AUC) and F‐measure, the best model to identify ADRs was gradient boosting trees (LGBM), with an AUC of 0.93 (0.92–0.94) and F‐measure of 0.72 (0.68–0.75). This model was run for external validation showing an AUC of 0.91 and a F‐measure of 0.58. We evaluated an artificial intelligence pipeline that was found able to learn how to identify correctly ADRs from unstructured data. This result allowed us to start a new study using more data to further improve our performance and offer a tool that is useful in practice to efficiently manage drug safety information. John Wiley and Sons Inc. 2021-05-08 2021-08 /pmc/articles/PMC8359992/ /pubmed/33866552 http://dx.doi.org/10.1002/cpt.2266 Text en © 2021 The Authors. Clinical Pharmacology & Therapeutics published by Wiley Periodicals LLC on behalf of American Society for Clinical Pharmacology and Therapeutics https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Research
Létinier, Louis
Jouganous, Julien
Benkebil, Mehdi
Bel‐Létoile, Alicia
Goehrs, Clément
Singier, Allison
Rouby, Franck
Lacroix, Clémence
Miremont, Ghada
Micallef, Joëlle
Salvo, Francesco
Pariente, Antoine
Artificial Intelligence for Unstructured Healthcare Data: Application to Coding of Patient Reporting of Adverse Drug Reactions
title Artificial Intelligence for Unstructured Healthcare Data: Application to Coding of Patient Reporting of Adverse Drug Reactions
title_full Artificial Intelligence for Unstructured Healthcare Data: Application to Coding of Patient Reporting of Adverse Drug Reactions
title_fullStr Artificial Intelligence for Unstructured Healthcare Data: Application to Coding of Patient Reporting of Adverse Drug Reactions
title_full_unstemmed Artificial Intelligence for Unstructured Healthcare Data: Application to Coding of Patient Reporting of Adverse Drug Reactions
title_short Artificial Intelligence for Unstructured Healthcare Data: Application to Coding of Patient Reporting of Adverse Drug Reactions
title_sort artificial intelligence for unstructured healthcare data: application to coding of patient reporting of adverse drug reactions
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8359992/
https://www.ncbi.nlm.nih.gov/pubmed/33866552
http://dx.doi.org/10.1002/cpt.2266
work_keys_str_mv AT letinierlouis artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT jouganousjulien artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT benkebilmehdi artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT belletoilealicia artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT goehrsclement artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT singierallison artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT roubyfranck artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT lacroixclemence artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT miremontghada artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT micallefjoelle artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT salvofrancesco artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions
AT parienteantoine artificialintelligenceforunstructuredhealthcaredataapplicationtocodingofpatientreportingofadversedrugreactions