Cargando…

Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers

Since its introduction some twenty years ago, named entity (NE) processing has become an essential component of virtually any text mining application and has undergone major changes. Recently, two main trends characterise its developments: the adoption of deep learning architectures and the consider...

Descripción completa

Detalles Bibliográficos
Autores principales: Ehrmann, Maud, Romanello, Matteo, Bircher, Stefan, Clematide, Simon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148064/
http://dx.doi.org/10.1007/978-3-030-45442-5_68
_version_ 1783520523433541632
author Ehrmann, Maud
Romanello, Matteo
Bircher, Stefan
Clematide, Simon
author_facet Ehrmann, Maud
Romanello, Matteo
Bircher, Stefan
Clematide, Simon
author_sort Ehrmann, Maud
collection PubMed
description Since its introduction some twenty years ago, named entity (NE) processing has become an essential component of virtually any text mining application and has undergone major changes. Recently, two main trends characterise its developments: the adoption of deep learning architectures and the consideration of textual material originating from historical and cultural heritage collections. While the former opens up new opportunities, the latter introduces new challenges with heterogeneous, historical and noisy inputs. If NE processing tools are increasingly being used in the context of historical documents, performance values are below the ones on contemporary data and are hardly comparable. In this context, this paper introduces the CLEF 2020 Evaluation Lab HIPE (Identifying Historical People, Places and other Entities) on named entity recognition and linking on diachronic historical newspaper material in French, German and English. Our objective is threefold: strengthening the robustness of existing approaches on non-standard inputs, enabling performance comparison of NE processing on historical texts, and, in the long run, fostering efficient semantic indexing of historical documents in order to support scholarship on digital cultural heritage collections.
format Online
Article
Text
id pubmed-7148064
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-71480642020-04-13 Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers Ehrmann, Maud Romanello, Matteo Bircher, Stefan Clematide, Simon Advances in Information Retrieval Article Since its introduction some twenty years ago, named entity (NE) processing has become an essential component of virtually any text mining application and has undergone major changes. Recently, two main trends characterise its developments: the adoption of deep learning architectures and the consideration of textual material originating from historical and cultural heritage collections. While the former opens up new opportunities, the latter introduces new challenges with heterogeneous, historical and noisy inputs. If NE processing tools are increasingly being used in the context of historical documents, performance values are below the ones on contemporary data and are hardly comparable. In this context, this paper introduces the CLEF 2020 Evaluation Lab HIPE (Identifying Historical People, Places and other Entities) on named entity recognition and linking on diachronic historical newspaper material in French, German and English. Our objective is threefold: strengthening the robustness of existing approaches on non-standard inputs, enabling performance comparison of NE processing on historical texts, and, in the long run, fostering efficient semantic indexing of historical documents in order to support scholarship on digital cultural heritage collections. 2020-03-24 /pmc/articles/PMC7148064/ http://dx.doi.org/10.1007/978-3-030-45442-5_68 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Ehrmann, Maud
Romanello, Matteo
Bircher, Stefan
Clematide, Simon
Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers
title Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers
title_full Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers
title_fullStr Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers
title_full_unstemmed Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers
title_short Introducing the CLEF 2020 HIPE Shared Task: Named Entity Recognition and Linking on Historical Newspapers
title_sort introducing the clef 2020 hipe shared task: named entity recognition and linking on historical newspapers
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148064/
http://dx.doi.org/10.1007/978-3-030-45442-5_68
work_keys_str_mv AT ehrmannmaud introducingtheclef2020hipesharedtasknamedentityrecognitionandlinkingonhistoricalnewspapers
AT romanellomatteo introducingtheclef2020hipesharedtasknamedentityrecognitionandlinkingonhistoricalnewspapers
AT bircherstefan introducingtheclef2020hipesharedtasknamedentityrecognitionandlinkingonhistoricalnewspapers
AT clematidesimon introducingtheclef2020hipesharedtasknamedentityrecognitionandlinkingonhistoricalnewspapers