Cargando…

A manually annotated corpus in French for the study of urbanization and the natural risk prevention

Land artificialization is a serious problem of civilization. Urban planning and natural risk management are aimed to improve it. In France, these practices operate the Local Land Plans (PLU – Plan Local d’Urbanisme) and the Natural risk prevention plans (PPRn – Plan de Prévention des Risques naturel...

Descripción completa

Detalles Bibliográficos
Autores principales: Koptelov, Maksim, Holveck, Margaux, Cremilleux, Bruno, Reynaud, Justine, Roche, Mathieu, Teisseire, Maguelonne
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665325/
https://www.ncbi.nlm.nih.gov/pubmed/37993460
http://dx.doi.org/10.1038/s41597-023-02705-y
_version_ 1785148844778979328
author Koptelov, Maksim
Holveck, Margaux
Cremilleux, Bruno
Reynaud, Justine
Roche, Mathieu
Teisseire, Maguelonne
author_facet Koptelov, Maksim
Holveck, Margaux
Cremilleux, Bruno
Reynaud, Justine
Roche, Mathieu
Teisseire, Maguelonne
author_sort Koptelov, Maksim
collection PubMed
description Land artificialization is a serious problem of civilization. Urban planning and natural risk management are aimed to improve it. In France, these practices operate the Local Land Plans (PLU – Plan Local d’Urbanisme) and the Natural risk prevention plans (PPRn – Plan de Prévention des Risques naturels) containing land use rules. To facilitate automatic extraction of the rules, we manually annotated a number of those documents concerning Montpellier, a rapidly evolving agglomeration exposed to natural risks. We defined a format for labeled examples in which each entry includes title and subtitle. In addition, we proposed a hierarchical representation of class labels to generalize the use of our corpus. Our corpus, consisting of 1934 textual segments, each of which labeled by one of the 4 classes (Verifiable, Non-verifiable, Informative and Not pertinent) is the first corpus in the French language in the fields of urban planning and natural risk management. Along with presenting the corpus, we tested a state-of-the-art approach for text classification to demonstrate its usability for automatic rule extraction.
format Online
Article
Text
id pubmed-10665325
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-106653252023-11-22 A manually annotated corpus in French for the study of urbanization and the natural risk prevention Koptelov, Maksim Holveck, Margaux Cremilleux, Bruno Reynaud, Justine Roche, Mathieu Teisseire, Maguelonne Sci Data Data Descriptor Land artificialization is a serious problem of civilization. Urban planning and natural risk management are aimed to improve it. In France, these practices operate the Local Land Plans (PLU – Plan Local d’Urbanisme) and the Natural risk prevention plans (PPRn – Plan de Prévention des Risques naturels) containing land use rules. To facilitate automatic extraction of the rules, we manually annotated a number of those documents concerning Montpellier, a rapidly evolving agglomeration exposed to natural risks. We defined a format for labeled examples in which each entry includes title and subtitle. In addition, we proposed a hierarchical representation of class labels to generalize the use of our corpus. Our corpus, consisting of 1934 textual segments, each of which labeled by one of the 4 classes (Verifiable, Non-verifiable, Informative and Not pertinent) is the first corpus in the French language in the fields of urban planning and natural risk management. Along with presenting the corpus, we tested a state-of-the-art approach for text classification to demonstrate its usability for automatic rule extraction. Nature Publishing Group UK 2023-11-22 /pmc/articles/PMC10665325/ /pubmed/37993460 http://dx.doi.org/10.1038/s41597-023-02705-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Data Descriptor
Koptelov, Maksim
Holveck, Margaux
Cremilleux, Bruno
Reynaud, Justine
Roche, Mathieu
Teisseire, Maguelonne
A manually annotated corpus in French for the study of urbanization and the natural risk prevention
title A manually annotated corpus in French for the study of urbanization and the natural risk prevention
title_full A manually annotated corpus in French for the study of urbanization and the natural risk prevention
title_fullStr A manually annotated corpus in French for the study of urbanization and the natural risk prevention
title_full_unstemmed A manually annotated corpus in French for the study of urbanization and the natural risk prevention
title_short A manually annotated corpus in French for the study of urbanization and the natural risk prevention
title_sort manually annotated corpus in french for the study of urbanization and the natural risk prevention
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665325/
https://www.ncbi.nlm.nih.gov/pubmed/37993460
http://dx.doi.org/10.1038/s41597-023-02705-y
work_keys_str_mv AT koptelovmaksim amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT holveckmargaux amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT cremilleuxbruno amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT reynaudjustine amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT rochemathieu amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT teisseiremaguelonne amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT koptelovmaksim manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT holveckmargaux manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT cremilleuxbruno manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT reynaudjustine manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT rochemathieu manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention
AT teisseiremaguelonne manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention