Semi-Supervised Morphosyntactic Classification of Old Icelandic
We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an...
Main authors: | Urban, Kryztof; Tangherlini, Timothy R.; Vijūnas, Aurelijus; Broadwell, Peter M. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Public Library of Science, 2014 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4100772/ https://www.ncbi.nlm.nih.gov/pubmed/25029462 http://dx.doi.org/10.1371/journal.pone.0102366 |
_version_ | 1782326709007155200 |
---|---|
author | Urban, Kryztof Tangherlini, Timothy R. Vijūnas, Aurelijus Broadwell, Peter M. |
author_facet | Urban, Kryztof Tangherlini, Timothy R. Vijūnas, Aurelijus Broadwell, Peter M. |
author_sort | Urban, Kryztof |
collection | PubMed |
description | We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantage of the analyzer is its ability to achieve competitive classification accuracy with minimum training data. |
format | Online Article Text |
id | pubmed-4100772 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-41007722014-07-18 Semi-Supervised Morphosyntactic Classification of Old Icelandic Urban, Kryztof Tangherlini, Timothy R. Vijūnas, Aurelijus Broadwell, Peter M. PLoS One Research Article We present IceMorph, a semi-supervised morphosyntactic analyzer of Old Icelandic. In addition to machine-read corpora and dictionaries, it applies a small set of declension prototypes to map corpus words to dictionary entries. A web-based GUI allows expert users to modify and augment data through an online process. A machine learning module incorporates prototype data, edit-distance metrics, and expert feedback to continuously update part-of-speech and morphosyntactic classification. An advantage of the analyzer is its ability to achieve competitive classification accuracy with minimum training data. Public Library of Science 2014-07-16 /pmc/articles/PMC4100772/ /pubmed/25029462 http://dx.doi.org/10.1371/journal.pone.0102366 Text en © 2014 Urban et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Urban, Kryztof Tangherlini, Timothy R. Vijūnas, Aurelijus Broadwell, Peter M. Semi-Supervised Morphosyntactic Classification of Old Icelandic |
title | Semi-Supervised Morphosyntactic Classification of Old Icelandic |
title_full | Semi-Supervised Morphosyntactic Classification of Old Icelandic |
title_fullStr | Semi-Supervised Morphosyntactic Classification of Old Icelandic |
title_full_unstemmed | Semi-Supervised Morphosyntactic Classification of Old Icelandic |
title_short | Semi-Supervised Morphosyntactic Classification of Old Icelandic |
title_sort | semi-supervised morphosyntactic classification of old icelandic |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4100772/ https://www.ncbi.nlm.nih.gov/pubmed/25029462 http://dx.doi.org/10.1371/journal.pone.0102366 |