Cargando…

An ensemble of neural models for nested adverse drug events and medication extraction with subwords

OBJECTIVE: This article describes an ensembling system to automatically extract adverse drug events and drug related entities from clinical narratives, which was developed for the 2018 n2c2 Shared Task Track 2. MATERIALS AND METHODS: We designed a neural model to tackle both nested (entities embedde...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ju, Meizhi, Nguyen, Nhung T H, Miwa, Makoto, Ananiadou, Sophia
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Oxford University Press 2019
Materias:	Research and Applications
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6913208/ https://www.ncbi.nlm.nih.gov/pubmed/31197355 http://dx.doi.org/10.1093/jamia/ocz075

_version_	1783479619175841792
author	Ju, Meizhi Nguyen, Nhung T H Miwa, Makoto Ananiadou, Sophia
author_facet	Ju, Meizhi Nguyen, Nhung T H Miwa, Makoto Ananiadou, Sophia
author_sort	Ju, Meizhi
collection	PubMed
description	OBJECTIVE: This article describes an ensembling system to automatically extract adverse drug events and drug related entities from clinical narratives, which was developed for the 2018 n2c2 Shared Task Track 2. MATERIALS AND METHODS: We designed a neural model to tackle both nested (entities embedded in other entities) and polysemous entities (entities annotated with multiple semantic types) based on MIMIC III discharge summaries. To better represent rare and unknown words in entities, we further tokenized the MIMIC III data set by splitting the words into finer-grained subwords. We finally combined all the models to boost the performance. Additionally, we implemented a featured-based conditional random field model and created an ensemble to combine its predictions with those of the neural model. RESULTS: Our method achieved 92.78% lenient micro F1-score, with 95.99% lenient precision, and 89.79% lenient recall, respectively. Experimental results showed that combining the predictions of either multiple models, or of a single model with different settings can improve performance. DISCUSSION: Analysis of the development set showed that our neural models can detect more informative text regions than feature-based conditional random field models. Furthermore, most entity types significantly benefit from subword representation, which also allows us to extract sparse entities, especially nested entities. CONCLUSION: The overall results have demonstrated that the ensemble method can accurately recognize entities, including nested and polysemous entities. Additionally, our method can recognize sparse entities by reconsidering the clinical narratives at a finer-grained subword level, rather than at the word level.
format	Online Article Text
id	pubmed-6913208
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-69132082019-12-19 An ensemble of neural models for nested adverse drug events and medication extraction with subwords Ju, Meizhi Nguyen, Nhung T H Miwa, Makoto Ananiadou, Sophia J Am Med Inform Assoc Research and Applications OBJECTIVE: This article describes an ensembling system to automatically extract adverse drug events and drug related entities from clinical narratives, which was developed for the 2018 n2c2 Shared Task Track 2. MATERIALS AND METHODS: We designed a neural model to tackle both nested (entities embedded in other entities) and polysemous entities (entities annotated with multiple semantic types) based on MIMIC III discharge summaries. To better represent rare and unknown words in entities, we further tokenized the MIMIC III data set by splitting the words into finer-grained subwords. We finally combined all the models to boost the performance. Additionally, we implemented a featured-based conditional random field model and created an ensemble to combine its predictions with those of the neural model. RESULTS: Our method achieved 92.78% lenient micro F1-score, with 95.99% lenient precision, and 89.79% lenient recall, respectively. Experimental results showed that combining the predictions of either multiple models, or of a single model with different settings can improve performance. DISCUSSION: Analysis of the development set showed that our neural models can detect more informative text regions than feature-based conditional random field models. Furthermore, most entity types significantly benefit from subword representation, which also allows us to extract sparse entities, especially nested entities. CONCLUSION: The overall results have demonstrated that the ensemble method can accurately recognize entities, including nested and polysemous entities. Additionally, our method can recognize sparse entities by reconsidering the clinical narratives at a finer-grained subword level, rather than at the word level. Oxford University Press 2019-06-14 /pmc/articles/PMC6913208/ /pubmed/31197355 http://dx.doi.org/10.1093/jamia/ocz075 Text en © The Author(s) 2019. Published by Oxford University Press on behalf of the American Medical Informatics Association. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research and Applications Ju, Meizhi Nguyen, Nhung T H Miwa, Makoto Ananiadou, Sophia An ensemble of neural models for nested adverse drug events and medication extraction with subwords
title	An ensemble of neural models for nested adverse drug events and medication extraction with subwords
title_full	An ensemble of neural models for nested adverse drug events and medication extraction with subwords
title_fullStr	An ensemble of neural models for nested adverse drug events and medication extraction with subwords
title_full_unstemmed	An ensemble of neural models for nested adverse drug events and medication extraction with subwords
title_short	An ensemble of neural models for nested adverse drug events and medication extraction with subwords
title_sort	ensemble of neural models for nested adverse drug events and medication extraction with subwords
topic	Research and Applications
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6913208/ https://www.ncbi.nlm.nih.gov/pubmed/31197355 http://dx.doi.org/10.1093/jamia/ocz075
work_keys_str_mv	AT jumeizhi anensembleofneuralmodelsfornestedadversedrugeventsandmedicationextractionwithsubwords AT nguyennhungth anensembleofneuralmodelsfornestedadversedrugeventsandmedicationextractionwithsubwords AT miwamakoto anensembleofneuralmodelsfornestedadversedrugeventsandmedicationextractionwithsubwords AT ananiadousophia anensembleofneuralmodelsfornestedadversedrugeventsandmedicationextractionwithsubwords AT jumeizhi ensembleofneuralmodelsfornestedadversedrugeventsandmedicationextractionwithsubwords AT nguyennhungth ensembleofneuralmodelsfornestedadversedrugeventsandmedicationextractionwithsubwords AT miwamakoto ensembleofneuralmodelsfornestedadversedrugeventsandmedicationextractionwithsubwords AT ananiadousophia ensembleofneuralmodelsfornestedadversedrugeventsandmedicationextractionwithsubwords

An ensemble of neural models for nested adverse drug events and medication extraction with subwords

Ejemplares similares