Cargando…

Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature

BACKGROUND: This paper presents a novel approach to the problem of hedge detection, which involves identifying so-called hedge cues for labeling sentences as certain or uncertain. This is the classification problem for Task 1 of the CoNLL-2010 Shared Task, which focuses on hedging in the biomedical...

Descripción completa

Detalles Bibliográficos
Autor principal:	Velldal, Erik
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2011
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239307/ https://www.ncbi.nlm.nih.gov/pubmed/22166306 http://dx.doi.org/10.1186/2041-1480-2-S5-S7

_version_	1782219164409135104
author	Velldal, Erik
author_facet	Velldal, Erik
author_sort	Velldal, Erik
collection	PubMed
description	BACKGROUND: This paper presents a novel approach to the problem of hedge detection, which involves identifying so-called hedge cues for labeling sentences as certain or uncertain. This is the classification problem for Task 1 of the CoNLL-2010 Shared Task, which focuses on hedging in the biomedical domain. We here propose to view hedge detection as a simple disambiguation problem, restricted to words that have previously been observed as hedge cues. As the feature space for the classifier is still very large, we also perform experiments with dimensionality reduction using the method of random indexing. RESULTS: The SVM-based classifiers developed in this paper achieves the best published results so far for sentence-level uncertainty prediction on the CoNLL-2010 Shared Task test data. We also show that the technique of random indexing can be successfully applied for reducing the dimensionality of the original feature space by several orders of magnitude, without sacrificing classifier performance. CONCLUSIONS: This paper introduces a simplified approach to detecting speculation or uncertainty in text, focusing on the biomedical domain. Evaluated at the sentence-level, our SVM-based classifiers achieve the best published results so far. We also show that the feature space can be aggressively compressed using random indexing while still maintaining comparable classifier performance.
format	Online Article Text
id	pubmed-3239307
institution	National Center for Biotechnology Information
language	English
publishDate	2011
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-32393072011-12-16 Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature Velldal, Erik J Biomed Semantics Research BACKGROUND: This paper presents a novel approach to the problem of hedge detection, which involves identifying so-called hedge cues for labeling sentences as certain or uncertain. This is the classification problem for Task 1 of the CoNLL-2010 Shared Task, which focuses on hedging in the biomedical domain. We here propose to view hedge detection as a simple disambiguation problem, restricted to words that have previously been observed as hedge cues. As the feature space for the classifier is still very large, we also perform experiments with dimensionality reduction using the method of random indexing. RESULTS: The SVM-based classifiers developed in this paper achieves the best published results so far for sentence-level uncertainty prediction on the CoNLL-2010 Shared Task test data. We also show that the technique of random indexing can be successfully applied for reducing the dimensionality of the original feature space by several orders of magnitude, without sacrificing classifier performance. CONCLUSIONS: This paper introduces a simplified approach to detecting speculation or uncertainty in text, focusing on the biomedical domain. Evaluated at the sentence-level, our SVM-based classifiers achieve the best published results so far. We also show that the feature space can be aggressively compressed using random indexing while still maintaining comparable classifier performance. BioMed Central 2011-10-06 /pmc/articles/PMC3239307/ /pubmed/22166306 http://dx.doi.org/10.1186/2041-1480-2-S5-S7 Text en Copyright ©2011 Velldal; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Velldal, Erik Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title	Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title_full	Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title_fullStr	Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title_full_unstemmed	Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title_short	Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
title_sort	predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3239307/ https://www.ncbi.nlm.nih.gov/pubmed/22166306 http://dx.doi.org/10.1186/2041-1480-2-S5-S7
work_keys_str_mv	AT velldalerik predictingspeculationasimpledisambiguationapproachtohedgedetectioninbiomedicalliterature

Predicting speculation: a simple disambiguation approach to hedge detection in biomedical literature

Ejemplares similares