Cargando…

Investigating the cross-lingual translatability of VerbNet-style classification

VerbNet—the most extensive online verb lexicon currently available for English—has proved useful in supporting a variety of NLP tasks. However, its exploitation in multilingual NLP has been limited by the fact that such classifications are available for few languages only. Since manual development o...

Descripción completa

Detalles Bibliográficos
Autores principales: Majewska, Olga, Vulić, Ivan, McCarthy, Diana, Huang, Yan, Murakami, Akira, Laippala, Veronika, Korhonen, Anna
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Netherlands 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6428229/
https://www.ncbi.nlm.nih.gov/pubmed/30956632
http://dx.doi.org/10.1007/s10579-017-9403-x
_version_ 1783405368174444544
author Majewska, Olga
Vulić, Ivan
McCarthy, Diana
Huang, Yan
Murakami, Akira
Laippala, Veronika
Korhonen, Anna
author_facet Majewska, Olga
Vulić, Ivan
McCarthy, Diana
Huang, Yan
Murakami, Akira
Laippala, Veronika
Korhonen, Anna
author_sort Majewska, Olga
collection PubMed
description VerbNet—the most extensive online verb lexicon currently available for English—has proved useful in supporting a variety of NLP tasks. However, its exploitation in multilingual NLP has been limited by the fact that such classifications are available for few languages only. Since manual development of VerbNet is a major undertaking, researchers have recently translated VerbNet classes from English to other languages. However, no systematic investigation has been conducted into the applicability and accuracy of such a translation approach across different, typologically diverse languages. Our study is aimed at filling this gap. We develop a systematic method for translation of VerbNet classes from English to other languages which we first apply to Polish and subsequently to Croatian, Mandarin, Japanese, Italian, and Finnish. Our results on Polish demonstrate high translatability with all the classes (96% of English member verbs successfully translated into Polish) and strong inter-annotator agreement, revealing a promising degree of overlap in the resultant classifications. The results on other languages are equally promising. This demonstrates that VerbNet classes have strong cross-lingual potential and the proposed method could be applied to obtain gold standards for automatic verb classification in different languages. We make our annotation guidelines and the six language-specific verb classifications available with this paper. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10579-017-9403-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6428229
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-64282292019-04-05 Investigating the cross-lingual translatability of VerbNet-style classification Majewska, Olga Vulić, Ivan McCarthy, Diana Huang, Yan Murakami, Akira Laippala, Veronika Korhonen, Anna Lang Resour Eval Original Paper VerbNet—the most extensive online verb lexicon currently available for English—has proved useful in supporting a variety of NLP tasks. However, its exploitation in multilingual NLP has been limited by the fact that such classifications are available for few languages only. Since manual development of VerbNet is a major undertaking, researchers have recently translated VerbNet classes from English to other languages. However, no systematic investigation has been conducted into the applicability and accuracy of such a translation approach across different, typologically diverse languages. Our study is aimed at filling this gap. We develop a systematic method for translation of VerbNet classes from English to other languages which we first apply to Polish and subsequently to Croatian, Mandarin, Japanese, Italian, and Finnish. Our results on Polish demonstrate high translatability with all the classes (96% of English member verbs successfully translated into Polish) and strong inter-annotator agreement, revealing a promising degree of overlap in the resultant classifications. The results on other languages are equally promising. This demonstrates that VerbNet classes have strong cross-lingual potential and the proposed method could be applied to obtain gold standards for automatic verb classification in different languages. We make our annotation guidelines and the six language-specific verb classifications available with this paper. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10579-017-9403-x) contains supplementary material, which is available to authorized users. Springer Netherlands 2017-10-20 2018 /pmc/articles/PMC6428229/ /pubmed/30956632 http://dx.doi.org/10.1007/s10579-017-9403-x Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Paper
Majewska, Olga
Vulić, Ivan
McCarthy, Diana
Huang, Yan
Murakami, Akira
Laippala, Veronika
Korhonen, Anna
Investigating the cross-lingual translatability of VerbNet-style classification
title Investigating the cross-lingual translatability of VerbNet-style classification
title_full Investigating the cross-lingual translatability of VerbNet-style classification
title_fullStr Investigating the cross-lingual translatability of VerbNet-style classification
title_full_unstemmed Investigating the cross-lingual translatability of VerbNet-style classification
title_short Investigating the cross-lingual translatability of VerbNet-style classification
title_sort investigating the cross-lingual translatability of verbnet-style classification
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6428229/
https://www.ncbi.nlm.nih.gov/pubmed/30956632
http://dx.doi.org/10.1007/s10579-017-9403-x
work_keys_str_mv AT majewskaolga investigatingthecrosslingualtranslatabilityofverbnetstyleclassification
AT vulicivan investigatingthecrosslingualtranslatabilityofverbnetstyleclassification
AT mccarthydiana investigatingthecrosslingualtranslatabilityofverbnetstyleclassification
AT huangyan investigatingthecrosslingualtranslatabilityofverbnetstyleclassification
AT murakamiakira investigatingthecrosslingualtranslatabilityofverbnetstyleclassification
AT laippalaveronika investigatingthecrosslingualtranslatabilityofverbnetstyleclassification
AT korhonenanna investigatingthecrosslingualtranslatabilityofverbnetstyleclassification