Cargando…

Informative Causality Extraction from Medical Literature via Dependency-Tree–Based Patterns

Extracting cause-effect entities from medical literature is an important task in medical information retrieval. A solution for solving this task can be used for compilation of various causality relations, such as causality between disease and symptoms, between medications and side effects, and betwe...

Descripción completa

Detalles Bibliográficos
Autores principales: Kabir, M. Ahsanul, Almulhim, AlJohara, Luo, Xiao, Al Hasan, Mohammad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9131716/
https://www.ncbi.nlm.nih.gov/pubmed/35637864
http://dx.doi.org/10.1007/s41666-022-00116-z
_version_ 1784713231659433984
author Kabir, M. Ahsanul
Almulhim, AlJohara
Luo, Xiao
Al Hasan, Mohammad
author_facet Kabir, M. Ahsanul
Almulhim, AlJohara
Luo, Xiao
Al Hasan, Mohammad
author_sort Kabir, M. Ahsanul
collection PubMed
description Extracting cause-effect entities from medical literature is an important task in medical information retrieval. A solution for solving this task can be used for compilation of various causality relations, such as causality between disease and symptoms, between medications and side effects, and between genes and diseases. Existing solutions for extracting cause-effect entities work well for sentences where the cause and the effect phrases are name entities, single-word nouns, or noun phrases consisting of two to three words. Unfortunately, in medical literature, cause and effect phrases in a sentence are not simply nouns or noun phrases, rather they are complex phrases consisting of several words, and existing methods fail to correctly extract the cause and effect entities in such sentences. Partial extraction of cause and effect entities conveys poor quality, non-informative, and often, contradictory facts, comparing to the one intended in the given sentence. In this work, we solve this problem by designing an unsupervised method for cause and effect phrase extraction, PatternCausality, which is specifically suitable for the medical literature. Our proposed approach first uses a collection of cause-effect dependency patterns as template to extract head words of cause and effect phrases and then it uses a novel phrase extraction method to obtain complete and meaningful cause and effect phrases from a sentence. Experiments on a cause-effect dataset built from sentences from PubMed articles show that for extracting cause and effect entities, PatternCausality is substantially better than the existing methods—with an order of magnitude improvement in the F-score metric over the best of the existing methods. We also build different variants of PatternCausality, which use different phrase extraction methods; all variants are better than the existing methods. PatternCausality and its variants also show modest performance improvement over the existing methods for extracting cause and effect entities in a domain-neutral benchmark dataset, in which cause and effect entities are nouns or noun phrases consisting of one to two words.
format Online
Article
Text
id pubmed-9131716
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-91317162022-05-26 Informative Causality Extraction from Medical Literature via Dependency-Tree–Based Patterns Kabir, M. Ahsanul Almulhim, AlJohara Luo, Xiao Al Hasan, Mohammad J Healthc Inform Res Research Article Extracting cause-effect entities from medical literature is an important task in medical information retrieval. A solution for solving this task can be used for compilation of various causality relations, such as causality between disease and symptoms, between medications and side effects, and between genes and diseases. Existing solutions for extracting cause-effect entities work well for sentences where the cause and the effect phrases are name entities, single-word nouns, or noun phrases consisting of two to three words. Unfortunately, in medical literature, cause and effect phrases in a sentence are not simply nouns or noun phrases, rather they are complex phrases consisting of several words, and existing methods fail to correctly extract the cause and effect entities in such sentences. Partial extraction of cause and effect entities conveys poor quality, non-informative, and often, contradictory facts, comparing to the one intended in the given sentence. In this work, we solve this problem by designing an unsupervised method for cause and effect phrase extraction, PatternCausality, which is specifically suitable for the medical literature. Our proposed approach first uses a collection of cause-effect dependency patterns as template to extract head words of cause and effect phrases and then it uses a novel phrase extraction method to obtain complete and meaningful cause and effect phrases from a sentence. Experiments on a cause-effect dataset built from sentences from PubMed articles show that for extracting cause and effect entities, PatternCausality is substantially better than the existing methods—with an order of magnitude improvement in the F-score metric over the best of the existing methods. We also build different variants of PatternCausality, which use different phrase extraction methods; all variants are better than the existing methods. PatternCausality and its variants also show modest performance improvement over the existing methods for extracting cause and effect entities in a domain-neutral benchmark dataset, in which cause and effect entities are nouns or noun phrases consisting of one to two words. Springer International Publishing 2022-05-25 /pmc/articles/PMC9131716/ /pubmed/35637864 http://dx.doi.org/10.1007/s41666-022-00116-z Text en © The Author(s), under exclusive licence to Springer Nature Switzerland AG 2022
spellingShingle Research Article
Kabir, M. Ahsanul
Almulhim, AlJohara
Luo, Xiao
Al Hasan, Mohammad
Informative Causality Extraction from Medical Literature via Dependency-Tree–Based Patterns
title Informative Causality Extraction from Medical Literature via Dependency-Tree–Based Patterns
title_full Informative Causality Extraction from Medical Literature via Dependency-Tree–Based Patterns
title_fullStr Informative Causality Extraction from Medical Literature via Dependency-Tree–Based Patterns
title_full_unstemmed Informative Causality Extraction from Medical Literature via Dependency-Tree–Based Patterns
title_short Informative Causality Extraction from Medical Literature via Dependency-Tree–Based Patterns
title_sort informative causality extraction from medical literature via dependency-tree–based patterns
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9131716/
https://www.ncbi.nlm.nih.gov/pubmed/35637864
http://dx.doi.org/10.1007/s41666-022-00116-z
work_keys_str_mv AT kabirmahsanul informativecausalityextractionfrommedicalliteratureviadependencytreebasedpatterns
AT almulhimaljohara informativecausalityextractionfrommedicalliteratureviadependencytreebasedpatterns
AT luoxiao informativecausalityextractionfrommedicalliteratureviadependencytreebasedpatterns
AT alhasanmohammad informativecausalityextractionfrommedicalliteratureviadependencytreebasedpatterns