Hindi CCGbank: A CCG treebank from the Hindi dependency treebank
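In this paper, we present an approach for automatically creating a combinatory categorial grammar (CCG) treebank from a dependency treebank for the subject–object–verb language Hindi. Rather than a direct conversion from dependency trees to CCG trees, we propose a two stage approach: a language independent generic algorithm first extracts a CCG lexicon from the dependency treebank. An exhaustive CCG parser then creates a treebank of CCG derivations. We also discuss special cases of this generic algorithm to handle linguistic phenomena specific to Hindi. In doing so we extract different constructions with long-range dependencies like coordinate constructions and non-projective dependencies resulting from constructions like relative clauses, noun elaboration and verbal modifiers.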
Main authors: | Ambati, Bharat Ram; Deoskar, Tejaswini; Steedman, Mark |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Springer Netherlands, 2017 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954025/ https://www.ncbi.nlm.nih.gov/pubmed/31983909 http://dx.doi.org/10.1007/s10579-017-9379-6 |
_version_ | 1783486720929431552 |
---|---|
author | Ambati, Bharat Ram; Deoskar, Tejaswini; Steedman, Mark |
collection | PubMed |
description | In this paper, we present an approach for automatically creating a combinatory categorial grammar (CCG) treebank from a dependency treebank for the subject–object–verb language Hindi. Rather than a direct conversion from dependency trees to CCG trees, we propose a two stage approach: a language independent generic algorithm first extracts a CCG lexicon from the dependency treebank. An exhaustive CCG parser then creates a treebank of CCG derivations. We also discuss special cases of this generic algorithm to handle linguistic phenomena specific to Hindi. In doing so we extract different constructions with long-range dependencies like coordinate constructions and non-projective dependencies resulting from constructions like relative clauses, noun elaboration and verbal modifiers. |
format | Online Article Text |
id | pubmed-6954025 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Springer Netherlands |
record_format | MEDLINE/PubMed |
spelling | pubmed-6954025; 2020-01-23; Hindi CCGbank: A CCG treebank from the Hindi dependency treebank; Ambati, Bharat Ram; Deoskar, Tejaswini; Steedman, Mark; Lang Resour Eval; Original Paper; In this paper, we present an approach for automatically creating a combinatory categorial grammar (CCG) treebank from a dependency treebank for the subject–object–verb language Hindi. Rather than a direct conversion from dependency trees to CCG trees, we propose a two stage approach: a language independent generic algorithm first extracts a CCG lexicon from the dependency treebank. An exhaustive CCG parser then creates a treebank of CCG derivations. We also discuss special cases of this generic algorithm to handle linguistic phenomena specific to Hindi. In doing so we extract different constructions with long-range dependencies like coordinate constructions and non-projective dependencies resulting from constructions like relative clauses, noun elaboration and verbal modifiers.; Springer Netherlands; 2017-01-25; 2018; /pmc/articles/PMC6954025/; /pubmed/31983909; http://dx.doi.org/10.1007/s10579-017-9379-6; Text; en; © The Author(s) 2017. Open Access: This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. |
title | Hindi CCGbank: A CCG treebank from the Hindi dependency treebank |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954025/ https://www.ncbi.nlm.nih.gov/pubmed/31983909 http://dx.doi.org/10.1007/s10579-017-9379-6 |