Cargando…

Annotation and detection of drug effects in text for pharmacovigilance

Pharmacovigilance (PV) databases record the benefits and risks of different drugs, as a means to ensure their safe and effective use. Creating and maintaining such resources can be complex, since a particular medication may have divergent effects in different individuals, due to specific patient cha...

Descripción completa

Detalles Bibliográficos
Autores principales: Thompson, Paul, Daikou, Sophia, Ueno, Kenju, Batista-Navarro, Riza, Tsujii, Jun’ichi, Ananiadou, Sophia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6089860/
https://www.ncbi.nlm.nih.gov/pubmed/30105604
http://dx.doi.org/10.1186/s13321-018-0290-y
_version_ 1783347088555245568
author Thompson, Paul
Daikou, Sophia
Ueno, Kenju
Batista-Navarro, Riza
Tsujii, Jun’ichi
Ananiadou, Sophia
author_facet Thompson, Paul
Daikou, Sophia
Ueno, Kenju
Batista-Navarro, Riza
Tsujii, Jun’ichi
Ananiadou, Sophia
author_sort Thompson, Paul
collection PubMed
description Pharmacovigilance (PV) databases record the benefits and risks of different drugs, as a means to ensure their safe and effective use. Creating and maintaining such resources can be complex, since a particular medication may have divergent effects in different individuals, due to specific patient characteristics and/or interactions with other drugs being administered. Textual information from various sources can provide important evidence to curators of PV databases about the usage and effects of drug targets in different medical subjects. However, the efficient identification of relevant evidence can be challenging, due to the increasing volume of textual data. Text mining (TM) techniques can support curators by automatically detecting complex information, such as interactions between drugs, diseases and adverse effects. This semantic information supports the quick identification of documents containing information of interest (e.g., the different types of patients in which a given adverse drug reaction has been observed to occur). TM tools are typically adapted to different domains by applying machine learning methods to corpora that are manually labelled by domain experts using annotation guidelines to ensure consistency. We present a semantically annotated corpus of 597 MEDLINE abstracts, PHAEDRA, encoding rich information on drug effects and their interactions, whose quality is assured through the use of detailed annotation guidelines and the demonstration of high levels of inter-annotator agreement (e.g., 92.6% F-Score for identifying named entities and 78.4% F-Score for identifying complex events, when relaxed matching criteria are applied). To our knowledge, the corpus is unique in the domain of PV, according to the level of detail of its annotations. To illustrate the utility of the corpus, we have trained TM tools based on its rich labels to recognise drug effects in text automatically. The corpus and annotation guidelines are available at: http://www.nactem.ac.uk/PHAEDRA/.
format Online
Article
Text
id pubmed-6089860
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-60898602018-09-11 Annotation and detection of drug effects in text for pharmacovigilance Thompson, Paul Daikou, Sophia Ueno, Kenju Batista-Navarro, Riza Tsujii, Jun’ichi Ananiadou, Sophia J Cheminform Research Article Pharmacovigilance (PV) databases record the benefits and risks of different drugs, as a means to ensure their safe and effective use. Creating and maintaining such resources can be complex, since a particular medication may have divergent effects in different individuals, due to specific patient characteristics and/or interactions with other drugs being administered. Textual information from various sources can provide important evidence to curators of PV databases about the usage and effects of drug targets in different medical subjects. However, the efficient identification of relevant evidence can be challenging, due to the increasing volume of textual data. Text mining (TM) techniques can support curators by automatically detecting complex information, such as interactions between drugs, diseases and adverse effects. This semantic information supports the quick identification of documents containing information of interest (e.g., the different types of patients in which a given adverse drug reaction has been observed to occur). TM tools are typically adapted to different domains by applying machine learning methods to corpora that are manually labelled by domain experts using annotation guidelines to ensure consistency. We present a semantically annotated corpus of 597 MEDLINE abstracts, PHAEDRA, encoding rich information on drug effects and their interactions, whose quality is assured through the use of detailed annotation guidelines and the demonstration of high levels of inter-annotator agreement (e.g., 92.6% F-Score for identifying named entities and 78.4% F-Score for identifying complex events, when relaxed matching criteria are applied). To our knowledge, the corpus is unique in the domain of PV, according to the level of detail of its annotations. To illustrate the utility of the corpus, we have trained TM tools based on its rich labels to recognise drug effects in text automatically. The corpus and annotation guidelines are available at: http://www.nactem.ac.uk/PHAEDRA/. Springer International Publishing 2018-08-13 /pmc/articles/PMC6089860/ /pubmed/30105604 http://dx.doi.org/10.1186/s13321-018-0290-y Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Thompson, Paul
Daikou, Sophia
Ueno, Kenju
Batista-Navarro, Riza
Tsujii, Jun’ichi
Ananiadou, Sophia
Annotation and detection of drug effects in text for pharmacovigilance
title Annotation and detection of drug effects in text for pharmacovigilance
title_full Annotation and detection of drug effects in text for pharmacovigilance
title_fullStr Annotation and detection of drug effects in text for pharmacovigilance
title_full_unstemmed Annotation and detection of drug effects in text for pharmacovigilance
title_short Annotation and detection of drug effects in text for pharmacovigilance
title_sort annotation and detection of drug effects in text for pharmacovigilance
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6089860/
https://www.ncbi.nlm.nih.gov/pubmed/30105604
http://dx.doi.org/10.1186/s13321-018-0290-y
work_keys_str_mv AT thompsonpaul annotationanddetectionofdrugeffectsintextforpharmacovigilance
AT daikousophia annotationanddetectionofdrugeffectsintextforpharmacovigilance
AT uenokenju annotationanddetectionofdrugeffectsintextforpharmacovigilance
AT batistanavarroriza annotationanddetectionofdrugeffectsintextforpharmacovigilance
AT tsujiijunichi annotationanddetectionofdrugeffectsintextforpharmacovigilance
AT ananiadousophia annotationanddetectionofdrugeffectsintextforpharmacovigilance