Cargando…

Analysis of syntactic and semantic features for fine-grained event-spatial understanding in outbreak news reports

BACKGROUND: Previous studies have suggested that epidemiological reasoning needs a fine-grained modelling of events, especially their spatial and temporal attributes. While the temporal analysis of events has been intensively studied, far less attention has been paid to their spatial analysis. This...

Descripción completa

Detalles Bibliográficos
Autores principales: Chanlekha, Hutchatai, Collier, Nigel
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2895733/
https://www.ncbi.nlm.nih.gov/pubmed/20618984
http://dx.doi.org/10.1186/2041-1480-1-3
_version_ 1782183285810528256
author Chanlekha, Hutchatai
Collier, Nigel
author_facet Chanlekha, Hutchatai
Collier, Nigel
author_sort Chanlekha, Hutchatai
collection PubMed
description BACKGROUND: Previous studies have suggested that epidemiological reasoning needs a fine-grained modelling of events, especially their spatial and temporal attributes. While the temporal analysis of events has been intensively studied, far less attention has been paid to their spatial analysis. This article aims at filling the gap concerning automatic event-spatial attribute analysis in order to support health surveillance and epidemiological reasoning. RESULTS: In this work, we propose a methodology that provides a detailed analysis on each event reported in news articles to recover the most specific locations where it occurs. Various features for recognizing spatial attributes of the events were studied and incorporated into the models which were trained by several machine learning techniques. The best performance for spatial attribute recognition is very promising; 85.9% F-score (86.75% precision/85.1% recall). CONCLUSIONS: We extended our work on event-spatial attribute recognition by focusing on machine learning techniques, which are CRF, SVM, and Decision tree. Our approach avoided the costly development of an external knowledge base by employing the feature sources that can be acquired locally from the analyzed document. The results showed that the CRF model performed the best. Our study indicated that the nearest location and previous event location are the most important features for the CRF and SVM model, while the location extracted from the verb's subject is the most important to the Decision tree model.
format Text
id pubmed-2895733
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28957332010-07-06 Analysis of syntactic and semantic features for fine-grained event-spatial understanding in outbreak news reports Chanlekha, Hutchatai Collier, Nigel J Biomed Semantics Research BACKGROUND: Previous studies have suggested that epidemiological reasoning needs a fine-grained modelling of events, especially their spatial and temporal attributes. While the temporal analysis of events has been intensively studied, far less attention has been paid to their spatial analysis. This article aims at filling the gap concerning automatic event-spatial attribute analysis in order to support health surveillance and epidemiological reasoning. RESULTS: In this work, we propose a methodology that provides a detailed analysis on each event reported in news articles to recover the most specific locations where it occurs. Various features for recognizing spatial attributes of the events were studied and incorporated into the models which were trained by several machine learning techniques. The best performance for spatial attribute recognition is very promising; 85.9% F-score (86.75% precision/85.1% recall). CONCLUSIONS: We extended our work on event-spatial attribute recognition by focusing on machine learning techniques, which are CRF, SVM, and Decision tree. Our approach avoided the costly development of an external knowledge base by employing the feature sources that can be acquired locally from the analyzed document. The results showed that the CRF model performed the best. Our study indicated that the nearest location and previous event location are the most important features for the CRF and SVM model, while the location extracted from the verb's subject is the most important to the Decision tree model. BioMed Central 2010-03-31 /pmc/articles/PMC2895733/ /pubmed/20618984 http://dx.doi.org/10.1186/2041-1480-1-3 Text en Copyright ©2010 Chanlekha and Collier; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Chanlekha, Hutchatai
Collier, Nigel
Analysis of syntactic and semantic features for fine-grained event-spatial understanding in outbreak news reports
title Analysis of syntactic and semantic features for fine-grained event-spatial understanding in outbreak news reports
title_full Analysis of syntactic and semantic features for fine-grained event-spatial understanding in outbreak news reports
title_fullStr Analysis of syntactic and semantic features for fine-grained event-spatial understanding in outbreak news reports
title_full_unstemmed Analysis of syntactic and semantic features for fine-grained event-spatial understanding in outbreak news reports
title_short Analysis of syntactic and semantic features for fine-grained event-spatial understanding in outbreak news reports
title_sort analysis of syntactic and semantic features for fine-grained event-spatial understanding in outbreak news reports
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2895733/
https://www.ncbi.nlm.nih.gov/pubmed/20618984
http://dx.doi.org/10.1186/2041-1480-1-3
work_keys_str_mv AT chanlekhahutchatai analysisofsyntacticandsemanticfeaturesforfinegrainedeventspatialunderstandinginoutbreaknewsreports
AT colliernigel analysisofsyntacticandsemanticfeaturesforfinegrainedeventspatialunderstandinginoutbreaknewsreports