Hindi CCGbank: A CCG treebank from the Hindi dependency treebank

In this paper, we present an approach for automatically creating a combinatory categorial grammar (CCG) treebank from a dependency treebank for the subject–object–verb language Hindi. Rather than a direct conversion from dependency trees to CCG trees, we propose a two stage approach: a language inde...

Bibliographic Details
Main authors: Ambati, Bharat Ram, Deoskar, Tejaswini, Steedman, Mark
Format: Online Article Text
Language: English
Published: Springer Netherlands 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954025/
https://www.ncbi.nlm.nih.gov/pubmed/31983909
http://dx.doi.org/10.1007/s10579-017-9379-6
_version_ 1783486720929431552
author Ambati, Bharat Ram
Deoskar, Tejaswini
Steedman, Mark
author_facet Ambati, Bharat Ram
Deoskar, Tejaswini
Steedman, Mark
author_sort Ambati, Bharat Ram
collection PubMed
description In this paper, we present an approach for automatically creating a combinatory categorial grammar (CCG) treebank from a dependency treebank for the subject–object–verb language Hindi. Rather than a direct conversion from dependency trees to CCG trees, we propose a two stage approach: a language independent generic algorithm first extracts a CCG lexicon from the dependency treebank. An exhaustive CCG parser then creates a treebank of CCG derivations. We also discuss special cases of this generic algorithm to handle linguistic phenomena specific to Hindi. In doing so we extract different constructions with long-range dependencies like coordinate constructions and non-projective dependencies resulting from constructions like relative clauses, noun elaboration and verbal modifiers.
format Online
Article
Text
id pubmed-6954025
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Springer Netherlands
record_format MEDLINE/PubMed
spelling pubmed-6954025 2020-01-23 Hindi CCGbank: A CCG treebank from the Hindi dependency treebank Ambati, Bharat Ram Deoskar, Tejaswini Steedman, Mark Lang Resour Eval Original Paper In this paper, we present an approach for automatically creating a combinatory categorial grammar (CCG) treebank from a dependency treebank for the subject–object–verb language Hindi. Rather than a direct conversion from dependency trees to CCG trees, we propose a two stage approach: a language independent generic algorithm first extracts a CCG lexicon from the dependency treebank. An exhaustive CCG parser then creates a treebank of CCG derivations. We also discuss special cases of this generic algorithm to handle linguistic phenomena specific to Hindi. In doing so we extract different constructions with long-range dependencies like coordinate constructions and non-projective dependencies resulting from constructions like relative clauses, noun elaboration and verbal modifiers. Springer Netherlands 2017-01-25 2018 /pmc/articles/PMC6954025/ /pubmed/31983909 http://dx.doi.org/10.1007/s10579-017-9379-6 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Original Paper
Ambati, Bharat Ram
Deoskar, Tejaswini
Steedman, Mark
Hindi CCGbank: A CCG treebank from the Hindi dependency treebank
title Hindi CCGbank: A CCG treebank from the Hindi dependency treebank
title_full Hindi CCGbank: A CCG treebank from the Hindi dependency treebank
title_fullStr Hindi CCGbank: A CCG treebank from the Hindi dependency treebank
title_full_unstemmed Hindi CCGbank: A CCG treebank from the Hindi dependency treebank
title_short Hindi CCGbank: A CCG treebank from the Hindi dependency treebank
title_sort hindi ccgbank: a ccg treebank from the hindi dependency treebank
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6954025/
https://www.ncbi.nlm.nih.gov/pubmed/31983909
http://dx.doi.org/10.1007/s10579-017-9379-6
work_keys_str_mv AT ambatibharatram hindiccgbankaccgtreebankfromthehindidependencytreebank
AT deoskartejaswini hindiccgbankaccgtreebankfromthehindidependencytreebank
AT steedmanmark hindiccgbankaccgtreebankfromthehindidependencytreebank