Cargando…

Hedge Scope Detection in Biomedical Texts: An Effective Dependency-Based Method

Hedge detection is used to distinguish uncertain information from facts, which is of essential importance in biomedical information extraction. The task of hedge detection is often divided into two subtasks: detecting uncertain cues and their linguistic scope. Hedge scope is a sequence of tokens inc...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Huiwei, Deng, Huijie, Huang, Degen, Zhu, Minling
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4517914/
https://www.ncbi.nlm.nih.gov/pubmed/26218847
http://dx.doi.org/10.1371/journal.pone.0133715
_version_ 1782383262763581440
author Zhou, Huiwei
Deng, Huijie
Huang, Degen
Zhu, Minling
author_facet Zhou, Huiwei
Deng, Huijie
Huang, Degen
Zhu, Minling
author_sort Zhou, Huiwei
collection PubMed
description Hedge detection is used to distinguish uncertain information from facts, which is of essential importance in biomedical information extraction. The task of hedge detection is often divided into two subtasks: detecting uncertain cues and their linguistic scope. Hedge scope is a sequence of tokens including the hedge cue in a sentence. Previous hedge scope detection methods usually take all tokens in a sentence as candidate boundaries, which inevitably generate a large number of negatives for classifiers. The imbalanced instances seriously mislead classifiers and result in lower performance. This paper proposes a dependency-based candidate boundary selection method (DCBS), which selects the most likely tokens as candidate boundaries and removes the exceptional tokens which have less potential to improve the performance based on dependency tree. In addition, we employ the composite kernel to integrate lexical and syntactic information and demonstrate the effectiveness of structured syntactic features for hedge scope detection. Experiments on the CoNLL-2010 Shared Task corpus show that our method achieves 71.92% F1-score on the golden standard cues, which is 4.11% higher than the system without using DCBS. Although the candidate boundary selection method is only evaluated on hedge scope detection here, it can be popularized to other kinds of scope learning tasks.
format Online
Article
Text
id pubmed-4517914
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45179142015-07-31 Hedge Scope Detection in Biomedical Texts: An Effective Dependency-Based Method Zhou, Huiwei Deng, Huijie Huang, Degen Zhu, Minling PLoS One Research Article Hedge detection is used to distinguish uncertain information from facts, which is of essential importance in biomedical information extraction. The task of hedge detection is often divided into two subtasks: detecting uncertain cues and their linguistic scope. Hedge scope is a sequence of tokens including the hedge cue in a sentence. Previous hedge scope detection methods usually take all tokens in a sentence as candidate boundaries, which inevitably generate a large number of negatives for classifiers. The imbalanced instances seriously mislead classifiers and result in lower performance. This paper proposes a dependency-based candidate boundary selection method (DCBS), which selects the most likely tokens as candidate boundaries and removes the exceptional tokens which have less potential to improve the performance based on dependency tree. In addition, we employ the composite kernel to integrate lexical and syntactic information and demonstrate the effectiveness of structured syntactic features for hedge scope detection. Experiments on the CoNLL-2010 Shared Task corpus show that our method achieves 71.92% F1-score on the golden standard cues, which is 4.11% higher than the system without using DCBS. Although the candidate boundary selection method is only evaluated on hedge scope detection here, it can be popularized to other kinds of scope learning tasks. Public Library of Science 2015-07-28 /pmc/articles/PMC4517914/ /pubmed/26218847 http://dx.doi.org/10.1371/journal.pone.0133715 Text en © 2015 Zhou et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhou, Huiwei
Deng, Huijie
Huang, Degen
Zhu, Minling
Hedge Scope Detection in Biomedical Texts: An Effective Dependency-Based Method
title Hedge Scope Detection in Biomedical Texts: An Effective Dependency-Based Method
title_full Hedge Scope Detection in Biomedical Texts: An Effective Dependency-Based Method
title_fullStr Hedge Scope Detection in Biomedical Texts: An Effective Dependency-Based Method
title_full_unstemmed Hedge Scope Detection in Biomedical Texts: An Effective Dependency-Based Method
title_short Hedge Scope Detection in Biomedical Texts: An Effective Dependency-Based Method
title_sort hedge scope detection in biomedical texts: an effective dependency-based method
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4517914/
https://www.ncbi.nlm.nih.gov/pubmed/26218847
http://dx.doi.org/10.1371/journal.pone.0133715
work_keys_str_mv AT zhouhuiwei hedgescopedetectioninbiomedicaltextsaneffectivedependencybasedmethod
AT denghuijie hedgescopedetectioninbiomedicaltextsaneffectivedependencybasedmethod
AT huangdegen hedgescopedetectioninbiomedicaltextsaneffectivedependencybasedmethod
AT zhuminling hedgescopedetectioninbiomedicaltextsaneffectivedependencybasedmethod