Cargando…
What’s missing in geographical parsing?
Geographical data can be obtained by converting place names from free-format text into geographical coordinates. The ability to geo-locate events in textual reports represents a valuable source of information in many real-world applications such as emergency responses, real-time social media geograp...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Netherlands
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6560650/ https://www.ncbi.nlm.nih.gov/pubmed/31258456 http://dx.doi.org/10.1007/s10579-017-9385-8 |
_version_ | 1783425989114593280 |
---|---|
author | Gritta, Milan Pilehvar, Mohammad Taher Limsopatham, Nut Collier, Nigel |
author_facet | Gritta, Milan Pilehvar, Mohammad Taher Limsopatham, Nut Collier, Nigel |
author_sort | Gritta, Milan |
collection | PubMed |
description | Geographical data can be obtained by converting place names from free-format text into geographical coordinates. The ability to geo-locate events in textual reports represents a valuable source of information in many real-world applications such as emergency responses, real-time social media geographical event analysis, understanding location instructions in auto-response systems and more. However, geoparsing is still widely regarded as a challenge because of domain language diversity, place name ambiguity, metonymic language and limited leveraging of context as we show in our analysis. Results to date, whilst promising, are on laboratory data and unlike in wider NLP are often not cross-compared. In this study, we evaluate and analyse the performance of a number of leading geoparsers on a number of corpora and highlight the challenges in detail. We also publish an automatically geotagged Wikipedia corpus to alleviate the dearth of (open source) corpora in this domain. |
format | Online Article Text |
id | pubmed-6560650 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-65606502019-06-26 What’s missing in geographical parsing? Gritta, Milan Pilehvar, Mohammad Taher Limsopatham, Nut Collier, Nigel Lang Resour Eval Survey Geographical data can be obtained by converting place names from free-format text into geographical coordinates. The ability to geo-locate events in textual reports represents a valuable source of information in many real-world applications such as emergency responses, real-time social media geographical event analysis, understanding location instructions in auto-response systems and more. However, geoparsing is still widely regarded as a challenge because of domain language diversity, place name ambiguity, metonymic language and limited leveraging of context as we show in our analysis. Results to date, whilst promising, are on laboratory data and unlike in wider NLP are often not cross-compared. In this study, we evaluate and analyse the performance of a number of leading geoparsers on a number of corpora and highlight the challenges in detail. We also publish an automatically geotagged Wikipedia corpus to alleviate the dearth of (open source) corpora in this domain. Springer Netherlands 2017-03-07 2018 /pmc/articles/PMC6560650/ /pubmed/31258456 http://dx.doi.org/10.1007/s10579-017-9385-8 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Survey Gritta, Milan Pilehvar, Mohammad Taher Limsopatham, Nut Collier, Nigel What’s missing in geographical parsing? |
title | What’s missing in geographical parsing? |
title_full | What’s missing in geographical parsing? |
title_fullStr | What’s missing in geographical parsing? |
title_full_unstemmed | What’s missing in geographical parsing? |
title_short | What’s missing in geographical parsing? |
title_sort | what’s missing in geographical parsing? |
topic | Survey |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6560650/ https://www.ncbi.nlm.nih.gov/pubmed/31258456 http://dx.doi.org/10.1007/s10579-017-9385-8 |
work_keys_str_mv | AT grittamilan whatsmissingingeographicalparsing AT pilehvarmohammadtaher whatsmissingingeographicalparsing AT limsopathamnut whatsmissingingeographicalparsing AT colliernigel whatsmissingingeographicalparsing |