Cargando…
An illustrated approach to Soft Textual Cartography
We propose and illustrate an approach of Soft Textual Cartography consisting in the clustering of regions by taking into account both their spatial relationships and their textual description within a corpus. We reduce large geo-referenced textual content into topics and merge them with their spatia...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer International Publishing
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6214316/ https://www.ncbi.nlm.nih.gov/pubmed/30839805 http://dx.doi.org/10.1007/s41109-018-0087-y |
_version_ | 1783367964623372288 |
---|---|
author | Ceré, Raphaël Egloff, Mattia |
author_facet | Ceré, Raphaël Egloff, Mattia |
author_sort | Ceré, Raphaël |
collection | PubMed |
description | We propose and illustrate an approach of Soft Textual Cartography consisting in the clustering of regions by taking into account both their spatial relationships and their textual description within a corpus. We reduce large geo-referenced textual content into topics and merge them with their spatial configuration to reveal spatial patterns. The strategy consists in constructing a complex weighted network, reflecting the geographical layout, and whose nodes are further characterised by their thematic dissimilarity, extracted form topic modelling. A soft k-means procedure, taking into account both aspects through expectation maximisation on Gaussian mixture models and label propagation, converges towards a soft membership, to be further compared with expert knowledge on regions. Application on the Wikipedia pages of Swiss municipalities demonstrate the potential of the approach, revealing textual autocorrelation and associations with official classifications. The synergy of the spatial and textual aspects appears promising in topic interpretation and geographical information retrieval, and able to incorporate expert knowledge through the choice of the initial membership. |
format | Online Article Text |
id | pubmed-6214316 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Springer International Publishing |
record_format | MEDLINE/PubMed |
spelling | pubmed-62143162018-11-13 An illustrated approach to Soft Textual Cartography Ceré, Raphaël Egloff, Mattia Appl Netw Sci Research We propose and illustrate an approach of Soft Textual Cartography consisting in the clustering of regions by taking into account both their spatial relationships and their textual description within a corpus. We reduce large geo-referenced textual content into topics and merge them with their spatial configuration to reveal spatial patterns. The strategy consists in constructing a complex weighted network, reflecting the geographical layout, and whose nodes are further characterised by their thematic dissimilarity, extracted form topic modelling. A soft k-means procedure, taking into account both aspects through expectation maximisation on Gaussian mixture models and label propagation, converges towards a soft membership, to be further compared with expert knowledge on regions. Application on the Wikipedia pages of Swiss municipalities demonstrate the potential of the approach, revealing textual autocorrelation and associations with official classifications. The synergy of the spatial and textual aspects appears promising in topic interpretation and geographical information retrieval, and able to incorporate expert knowledge through the choice of the initial membership. Springer International Publishing 2018-08-13 2018 /pmc/articles/PMC6214316/ /pubmed/30839805 http://dx.doi.org/10.1007/s41109-018-0087-y Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Research Ceré, Raphaël Egloff, Mattia An illustrated approach to Soft Textual Cartography |
title | An illustrated approach to Soft Textual Cartography |
title_full | An illustrated approach to Soft Textual Cartography |
title_fullStr | An illustrated approach to Soft Textual Cartography |
title_full_unstemmed | An illustrated approach to Soft Textual Cartography |
title_short | An illustrated approach to Soft Textual Cartography |
title_sort | illustrated approach to soft textual cartography |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6214316/ https://www.ncbi.nlm.nih.gov/pubmed/30839805 http://dx.doi.org/10.1007/s41109-018-0087-y |
work_keys_str_mv | AT cereraphael anillustratedapproachtosofttextualcartography AT egloffmattia anillustratedapproachtosofttextualcartography AT cereraphael illustratedapproachtosofttextualcartography AT egloffmattia illustratedapproachtosofttextualcartography |