Cargando…

Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization

Contextualized embeddings use unsupervised language model pretraining to compute word representations depending on their context. This is intuitively useful for generalization, especially in Named-Entity Recognition where it is crucial to detect mentions never seen during training. However, standard...

Descripción completa

Detalles Bibliográficos
Autores principales:	Taillé, Bruno, Guigue, Vincent, Gallinari, Patrick
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	2020
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148073/ http://dx.doi.org/10.1007/978-3-030-45442-5_48

_version_	1783520525511819264
author	Taillé, Bruno Guigue, Vincent Gallinari, Patrick
author_facet	Taillé, Bruno Guigue, Vincent Gallinari, Patrick
author_sort	Taillé, Bruno
collection	PubMed
description	Contextualized embeddings use unsupervised language model pretraining to compute word representations depending on their context. This is intuitively useful for generalization, especially in Named-Entity Recognition where it is crucial to detect mentions never seen during training. However, standard English benchmarks overestimate the importance of lexical over contextual features because of an unrealistic lexical overlap between train and test mentions. In this paper, we perform an empirical analysis of the generalization capabilities of state-of-the-art contextualized embeddings by separating mentions by novelty and with out-of-domain evaluation. We show that they are particularly beneficial for unseen mentions detection, especially out-of-domain. For models trained on CoNLL03, language model contextualization leads to a +1.2% maximal relative micro-F1 score increase in-domain against +13% out-of-domain on the WNUT dataset (The code is available at https://github.com/btaille/contener).
format	Online Article Text
id	pubmed-7148073
institution	National Center for Biotechnology Information
language	English
publishDate	2020
record_format	MEDLINE/PubMed
spelling	pubmed-71480732020-04-13 Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization Taillé, Bruno Guigue, Vincent Gallinari, Patrick Advances in Information Retrieval Article Contextualized embeddings use unsupervised language model pretraining to compute word representations depending on their context. This is intuitively useful for generalization, especially in Named-Entity Recognition where it is crucial to detect mentions never seen during training. However, standard English benchmarks overestimate the importance of lexical over contextual features because of an unrealistic lexical overlap between train and test mentions. In this paper, we perform an empirical analysis of the generalization capabilities of state-of-the-art contextualized embeddings by separating mentions by novelty and with out-of-domain evaluation. We show that they are particularly beneficial for unseen mentions detection, especially out-of-domain. For models trained on CoNLL03, language model contextualization leads to a +1.2% maximal relative micro-F1 score increase in-domain against +13% out-of-domain on the WNUT dataset (The code is available at https://github.com/btaille/contener). 2020-03-24 /pmc/articles/PMC7148073/ http://dx.doi.org/10.1007/978-3-030-45442-5_48 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Article Taillé, Bruno Guigue, Vincent Gallinari, Patrick Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization
title	Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization
title_full	Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization
title_fullStr	Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization
title_full_unstemmed	Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization
title_short	Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization
title_sort	contextualized embeddings in named-entity recognition: an empirical study on generalization
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7148073/ http://dx.doi.org/10.1007/978-3-030-45442-5_48
work_keys_str_mv	AT taillebruno contextualizedembeddingsinnamedentityrecognitionanempiricalstudyongeneralization AT guiguevincent contextualizedembeddingsinnamedentityrecognitionanempiricalstudyongeneralization AT gallinaripatrick contextualizedembeddingsinnamedentityrecognitionanempiricalstudyongeneralization

Contextualized Embeddings in Named-Entity Recognition: An Empirical Study on Generalization

Ejemplares similares