Cargando…

Semi-Supervised Morphosyntactic Classification of Old Icelandic

We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an...

Descripción completa

Detalles Bibliográficos
Autores principales: Urban, Kryztof, Tangherlini, Timothy R., Vijūnas, Aurelijus, Broadwell, Peter M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4100772/
https://www.ncbi.nlm.nih.gov/pubmed/25029462
http://dx.doi.org/10.1371/journal.pone.0102366
_version_ 1782326709007155200
author Urban, Kryztof
Tangherlini, Timothy R.
Vijūnas, Aurelijus
Broadwell, Peter M.
author_facet Urban, Kryztof
Tangherlini, Timothy R.
Vijūnas, Aurelijus
Broadwell, Peter M.
author_sort Urban, Kryztof
collection PubMed
description We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantage of the analyzer is its ability to achieve competitive classification accuracy with minimum training data.
format Online
Article
Text
id pubmed-4100772
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41007722014-07-18 Semi-Supervised Morphosyntactic Classification of Old Icelandic Urban, Kryztof Tangherlini, Timothy R. Vijūnas, Aurelijus Broadwell, Peter M. PLoS One Research Article We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantage of the analyzer is its ability to achieve competitive classification accuracy with minimum training data. Public Library of Science 2014-07-16 /pmc/articles/PMC4100772/ /pubmed/25029462 http://dx.doi.org/10.1371/journal.pone.0102366 Text en © 2014 Urban et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Urban, Kryztof
Tangherlini, Timothy R.
Vijūnas, Aurelijus
Broadwell, Peter M.
Semi-Supervised Morphosyntactic Classification of Old Icelandic
title Semi-Supervised Morphosyntactic Classification of Old Icelandic
title_full Semi-Supervised Morphosyntactic Classification of Old Icelandic
title_fullStr Semi-Supervised Morphosyntactic Classification of Old Icelandic
title_full_unstemmed Semi-Supervised Morphosyntactic Classification of Old Icelandic
title_short Semi-Supervised Morphosyntactic Classification of Old Icelandic
title_sort semi-supervised morphosyntactic classification of old icelandic
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4100772/
https://www.ncbi.nlm.nih.gov/pubmed/25029462
http://dx.doi.org/10.1371/journal.pone.0102366
work_keys_str_mv AT urbankryztof semisupervisedmorphosyntacticclassificationofoldicelandic
AT tangherlinitimothyr semisupervisedmorphosyntacticclassificationofoldicelandic
AT vijunasaurelijus semisupervisedmorphosyntacticclassificationofoldicelandic
AT broadwellpeterm semisupervisedmorphosyntacticclassificationofoldicelandic