Cargando…

What’s missing in geographical parsing?

Geographical data can be obtained by converting place names from free-format text into geographical coordinates. The ability to geo-locate events in textual reports represents a valuable source of information in many real-world applications such as emergency responses, real-time social media geograp...

Descripción completa

Detalles Bibliográficos
Autores principales: Gritta, Milan, Pilehvar, Mohammad Taher, Limsopatham, Nut, Collier, Nigel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Netherlands 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6560650/
https://www.ncbi.nlm.nih.gov/pubmed/31258456
http://dx.doi.org/10.1007/s10579-017-9385-8
_version_ 1783425989114593280
author Gritta, Milan
Pilehvar, Mohammad Taher
Limsopatham, Nut
Collier, Nigel
author_facet Gritta, Milan
Pilehvar, Mohammad Taher
Limsopatham, Nut
Collier, Nigel
author_sort Gritta, Milan
collection PubMed
description Geographical data can be obtained by converting place names from free-format text into geographical coordinates. The ability to geo-locate events in textual reports represents a valuable source of information in many real-world applications such as emergency responses, real-time social media geographical event analysis, understanding location instructions in auto-response systems and more. However, geoparsing is still widely regarded as a challenge because of domain language diversity, place name ambiguity, metonymic language and limited leveraging of context as we show in our analysis. Results to date, whilst promising, are on laboratory data and unlike in wider NLP are often not cross-compared. In this study, we evaluate and analyse the performance of a number of leading geoparsers on a number of corpora and highlight the challenges in detail. We also publish an automatically geotagged Wikipedia corpus to alleviate the dearth of (open source) corpora in this domain.
format Online
Article
Text
id pubmed-6560650
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-65606502019-06-26 What’s missing in geographical parsing? Gritta, Milan Pilehvar, Mohammad Taher Limsopatham, Nut Collier, Nigel Lang Resour Eval Survey Geographical data can be obtained by converting place names from free-format text into geographical coordinates. The ability to geo-locate events in textual reports represents a valuable source of information in many real-world applications such as emergency responses, real-time social media geographical event analysis, understanding location instructions in auto-response systems and more. However, geoparsing is still widely regarded as a challenge because of domain language diversity, place name ambiguity, metonymic language and limited leveraging of context as we show in our analysis. Results to date, whilst promising, are on laboratory data and unlike in wider NLP are often not cross-compared. In this study, we evaluate and analyse the performance of a number of leading geoparsers on a number of corpora and highlight the challenges in detail. We also publish an automatically geotagged Wikipedia corpus to alleviate the dearth of (open source) corpora in this domain. Springer Netherlands 2017-03-07 2018 /pmc/articles/PMC6560650/ /pubmed/31258456 http://dx.doi.org/10.1007/s10579-017-9385-8 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Survey
Gritta, Milan
Pilehvar, Mohammad Taher
Limsopatham, Nut
Collier, Nigel
What’s missing in geographical parsing?
title What’s missing in geographical parsing?
title_full What’s missing in geographical parsing?
title_fullStr What’s missing in geographical parsing?
title_full_unstemmed What’s missing in geographical parsing?
title_short What’s missing in geographical parsing?
title_sort what’s missing in geographical parsing?
topic Survey
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6560650/
https://www.ncbi.nlm.nih.gov/pubmed/31258456
http://dx.doi.org/10.1007/s10579-017-9385-8
work_keys_str_mv AT grittamilan whatsmissingingeographicalparsing
AT pilehvarmohammadtaher whatsmissingingeographicalparsing
AT limsopathamnut whatsmissingingeographicalparsing
AT colliernigel whatsmissingingeographicalparsing