Cargando…

An illustrated approach to Soft Textual Cartography

We propose and illustrate an approach of Soft Textual Cartography consisting in the clustering of regions by taking into account both their spatial relationships and their textual description within a corpus. We reduce large geo-referenced textual content into topics and merge them with their spatia...

Descripción completa

Detalles Bibliográficos
Autores principales: Ceré, Raphaël, Egloff, Mattia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6214316/
https://www.ncbi.nlm.nih.gov/pubmed/30839805
http://dx.doi.org/10.1007/s41109-018-0087-y
_version_ 1783367964623372288
author Ceré, Raphaël
Egloff, Mattia
author_facet Ceré, Raphaël
Egloff, Mattia
author_sort Ceré, Raphaël
collection PubMed
description We propose and illustrate an approach of Soft Textual Cartography consisting in the clustering of regions by taking into account both their spatial relationships and their textual description within a corpus. We reduce large geo-referenced textual content into topics and merge them with their spatial configuration to reveal spatial patterns. The strategy consists in constructing a complex weighted network, reflecting the geographical layout, and whose nodes are further characterised by their thematic dissimilarity, extracted form topic modelling. A soft k-means procedure, taking into account both aspects through expectation maximisation on Gaussian mixture models and label propagation, converges towards a soft membership, to be further compared with expert knowledge on regions. Application on the Wikipedia pages of Swiss municipalities demonstrate the potential of the approach, revealing textual autocorrelation and associations with official classifications. The synergy of the spatial and textual aspects appears promising in topic interpretation and geographical information retrieval, and able to incorporate expert knowledge through the choice of the initial membership.
format Online
Article
Text
id pubmed-6214316
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-62143162018-11-13 An illustrated approach to Soft Textual Cartography Ceré, Raphaël Egloff, Mattia Appl Netw Sci Research We propose and illustrate an approach of Soft Textual Cartography consisting in the clustering of regions by taking into account both their spatial relationships and their textual description within a corpus. We reduce large geo-referenced textual content into topics and merge them with their spatial configuration to reveal spatial patterns. The strategy consists in constructing a complex weighted network, reflecting the geographical layout, and whose nodes are further characterised by their thematic dissimilarity, extracted form topic modelling. A soft k-means procedure, taking into account both aspects through expectation maximisation on Gaussian mixture models and label propagation, converges towards a soft membership, to be further compared with expert knowledge on regions. Application on the Wikipedia pages of Swiss municipalities demonstrate the potential of the approach, revealing textual autocorrelation and associations with official classifications. The synergy of the spatial and textual aspects appears promising in topic interpretation and geographical information retrieval, and able to incorporate expert knowledge through the choice of the initial membership. Springer International Publishing 2018-08-13 2018 /pmc/articles/PMC6214316/ /pubmed/30839805 http://dx.doi.org/10.1007/s41109-018-0087-y Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research
Ceré, Raphaël
Egloff, Mattia
An illustrated approach to Soft Textual Cartography
title An illustrated approach to Soft Textual Cartography
title_full An illustrated approach to Soft Textual Cartography
title_fullStr An illustrated approach to Soft Textual Cartography
title_full_unstemmed An illustrated approach to Soft Textual Cartography
title_short An illustrated approach to Soft Textual Cartography
title_sort illustrated approach to soft textual cartography
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6214316/
https://www.ncbi.nlm.nih.gov/pubmed/30839805
http://dx.doi.org/10.1007/s41109-018-0087-y
work_keys_str_mv AT cereraphael anillustratedapproachtosofttextualcartography
AT egloffmattia anillustratedapproachtosofttextualcartography
AT cereraphael illustratedapproachtosofttextualcartography
AT egloffmattia illustratedapproachtosofttextualcartography