Cargando…
Investigating the cross-lingual translatability of VerbNet-style classification
VerbNet—the most extensive online verb lexicon currently available for English—has proved useful in supporting a variety of NLP tasks. However, its exploitation in multilingual NLP has been limited by the fact that such classifications are available for few languages only. Since manual development o...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Netherlands
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6428229/ https://www.ncbi.nlm.nih.gov/pubmed/30956632 http://dx.doi.org/10.1007/s10579-017-9403-x |
_version_ | 1783405368174444544 |
---|---|
author | Majewska, Olga Vulić, Ivan McCarthy, Diana Huang, Yan Murakami, Akira Laippala, Veronika Korhonen, Anna |
author_facet | Majewska, Olga Vulić, Ivan McCarthy, Diana Huang, Yan Murakami, Akira Laippala, Veronika Korhonen, Anna |
author_sort | Majewska, Olga |
collection | PubMed |
description | VerbNet—the most extensive online verb lexicon currently available for English—has proved useful in supporting a variety of NLP tasks. However, its exploitation in multilingual NLP has been limited by the fact that such classifications are available for few languages only. Since manual development of VerbNet is a major undertaking, researchers have recently translated VerbNet classes from English to other languages. However, no systematic investigation has been conducted into the applicability and accuracy of such a translation approach across different, typologically diverse languages. Our study is aimed at filling this gap. We develop a systematic method for translation of VerbNet classes from English to other languages which we first apply to Polish and subsequently to Croatian, Mandarin, Japanese, Italian, and Finnish. Our results on Polish demonstrate high translatability with all the classes (96% of English member verbs successfully translated into Polish) and strong inter-annotator agreement, revealing a promising degree of overlap in the resultant classifications. The results on other languages are equally promising. This demonstrates that VerbNet classes have strong cross-lingual potential and the proposed method could be applied to obtain gold standards for automatic verb classification in different languages. We make our annotation guidelines and the six language-specific verb classifications available with this paper. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10579-017-9403-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6428229 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-64282292019-04-05 Investigating the cross-lingual translatability of VerbNet-style classification Majewska, Olga Vulić, Ivan McCarthy, Diana Huang, Yan Murakami, Akira Laippala, Veronika Korhonen, Anna Lang Resour Eval Original Paper VerbNet—the most extensive online verb lexicon currently available for English—has proved useful in supporting a variety of NLP tasks. However, its exploitation in multilingual NLP has been limited by the fact that such classifications are available for few languages only. Since manual development of VerbNet is a major undertaking, researchers have recently translated VerbNet classes from English to other languages. However, no systematic investigation has been conducted into the applicability and accuracy of such a translation approach across different, typologically diverse languages. Our study is aimed at filling this gap. We develop a systematic method for translation of VerbNet classes from English to other languages which we first apply to Polish and subsequently to Croatian, Mandarin, Japanese, Italian, and Finnish. Our results on Polish demonstrate high translatability with all the classes (96% of English member verbs successfully translated into Polish) and strong inter-annotator agreement, revealing a promising degree of overlap in the resultant classifications. The results on other languages are equally promising. This demonstrates that VerbNet classes have strong cross-lingual potential and the proposed method could be applied to obtain gold standards for automatic verb classification in different languages. We make our annotation guidelines and the six language-specific verb classifications available with this paper. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1007/s10579-017-9403-x) contains supplementary material, which is available to authorized users. Springer Netherlands 2017-10-20 2018 /pmc/articles/PMC6428229/ /pubmed/30956632 http://dx.doi.org/10.1007/s10579-017-9403-x Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
spellingShingle | Original Paper Majewska, Olga Vulić, Ivan McCarthy, Diana Huang, Yan Murakami, Akira Laippala, Veronika Korhonen, Anna Investigating the cross-lingual translatability of VerbNet-style classification |
title | Investigating the cross-lingual translatability of VerbNet-style classification |
title_full | Investigating the cross-lingual translatability of VerbNet-style classification |
title_fullStr | Investigating the cross-lingual translatability of VerbNet-style classification |
title_full_unstemmed | Investigating the cross-lingual translatability of VerbNet-style classification |
title_short | Investigating the cross-lingual translatability of VerbNet-style classification |
title_sort | investigating the cross-lingual translatability of verbnet-style classification |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6428229/ https://www.ncbi.nlm.nih.gov/pubmed/30956632 http://dx.doi.org/10.1007/s10579-017-9403-x |
work_keys_str_mv | AT majewskaolga investigatingthecrosslingualtranslatabilityofverbnetstyleclassification AT vulicivan investigatingthecrosslingualtranslatabilityofverbnetstyleclassification AT mccarthydiana investigatingthecrosslingualtranslatabilityofverbnetstyleclassification AT huangyan investigatingthecrosslingualtranslatabilityofverbnetstyleclassification AT murakamiakira investigatingthecrosslingualtranslatabilityofverbnetstyleclassification AT laippalaveronika investigatingthecrosslingualtranslatabilityofverbnetstyleclassification AT korhonenanna investigatingthecrosslingualtranslatabilityofverbnetstyleclassification |