Cargando…

Semi-Supervised Morphosyntactic Classification of Old Icelandic

We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an...

Descripción completa

Detalles Bibliográficos
Autores principales: Urban, Kryztof, Tangherlini, Timothy R., Vijūnas, Aurelijus, Broadwell, Peter M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4100772/
https://www.ncbi.nlm.nih.gov/pubmed/25029462
http://dx.doi.org/10.1371/journal.pone.0102366
Descripción
Sumario:We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantage of the analyzer is its ability to achieve competitive classification accuracy with minimum training data.