Cargando…
A manually annotated corpus in French for the study of urbanization and the natural risk prevention
Land artificialization is a serious problem of civilization. Urban planning and natural risk management are aimed to improve it. In France, these practices operate the Local Land Plans (PLU – Plan Local d’Urbanisme) and the Natural risk prevention plans (PPRn – Plan de Prévention des Risques naturel...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665325/ https://www.ncbi.nlm.nih.gov/pubmed/37993460 http://dx.doi.org/10.1038/s41597-023-02705-y |
_version_ | 1785148844778979328 |
---|---|
author | Koptelov, Maksim Holveck, Margaux Cremilleux, Bruno Reynaud, Justine Roche, Mathieu Teisseire, Maguelonne |
author_facet | Koptelov, Maksim Holveck, Margaux Cremilleux, Bruno Reynaud, Justine Roche, Mathieu Teisseire, Maguelonne |
author_sort | Koptelov, Maksim |
collection | PubMed |
description | Land artificialization is a serious problem of civilization. Urban planning and natural risk management are aimed to improve it. In France, these practices operate the Local Land Plans (PLU – Plan Local d’Urbanisme) and the Natural risk prevention plans (PPRn – Plan de Prévention des Risques naturels) containing land use rules. To facilitate automatic extraction of the rules, we manually annotated a number of those documents concerning Montpellier, a rapidly evolving agglomeration exposed to natural risks. We defined a format for labeled examples in which each entry includes title and subtitle. In addition, we proposed a hierarchical representation of class labels to generalize the use of our corpus. Our corpus, consisting of 1934 textual segments, each of which labeled by one of the 4 classes (Verifiable, Non-verifiable, Informative and Not pertinent) is the first corpus in the French language in the fields of urban planning and natural risk management. Along with presenting the corpus, we tested a state-of-the-art approach for text classification to demonstrate its usability for automatic rule extraction. |
format | Online Article Text |
id | pubmed-10665325 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-106653252023-11-22 A manually annotated corpus in French for the study of urbanization and the natural risk prevention Koptelov, Maksim Holveck, Margaux Cremilleux, Bruno Reynaud, Justine Roche, Mathieu Teisseire, Maguelonne Sci Data Data Descriptor Land artificialization is a serious problem of civilization. Urban planning and natural risk management are aimed to improve it. In France, these practices operate the Local Land Plans (PLU – Plan Local d’Urbanisme) and the Natural risk prevention plans (PPRn – Plan de Prévention des Risques naturels) containing land use rules. To facilitate automatic extraction of the rules, we manually annotated a number of those documents concerning Montpellier, a rapidly evolving agglomeration exposed to natural risks. We defined a format for labeled examples in which each entry includes title and subtitle. In addition, we proposed a hierarchical representation of class labels to generalize the use of our corpus. Our corpus, consisting of 1934 textual segments, each of which labeled by one of the 4 classes (Verifiable, Non-verifiable, Informative and Not pertinent) is the first corpus in the French language in the fields of urban planning and natural risk management. Along with presenting the corpus, we tested a state-of-the-art approach for text classification to demonstrate its usability for automatic rule extraction. Nature Publishing Group UK 2023-11-22 /pmc/articles/PMC10665325/ /pubmed/37993460 http://dx.doi.org/10.1038/s41597-023-02705-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Data Descriptor Koptelov, Maksim Holveck, Margaux Cremilleux, Bruno Reynaud, Justine Roche, Mathieu Teisseire, Maguelonne A manually annotated corpus in French for the study of urbanization and the natural risk prevention |
title | A manually annotated corpus in French for the study of urbanization and the natural risk prevention |
title_full | A manually annotated corpus in French for the study of urbanization and the natural risk prevention |
title_fullStr | A manually annotated corpus in French for the study of urbanization and the natural risk prevention |
title_full_unstemmed | A manually annotated corpus in French for the study of urbanization and the natural risk prevention |
title_short | A manually annotated corpus in French for the study of urbanization and the natural risk prevention |
title_sort | manually annotated corpus in french for the study of urbanization and the natural risk prevention |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10665325/ https://www.ncbi.nlm.nih.gov/pubmed/37993460 http://dx.doi.org/10.1038/s41597-023-02705-y |
work_keys_str_mv | AT koptelovmaksim amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT holveckmargaux amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT cremilleuxbruno amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT reynaudjustine amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT rochemathieu amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT teisseiremaguelonne amanuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT koptelovmaksim manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT holveckmargaux manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT cremilleuxbruno manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT reynaudjustine manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT rochemathieu manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention AT teisseiremaguelonne manuallyannotatedcorpusinfrenchforthestudyofurbanizationandthenaturalriskprevention |