Cargando…
Improving part-of-speech tagging in Amharic language using deep neural network
To date, several POS taggers have been introduced to facilitate the success of semantic analysis for different languages. However, the task of POS tagging becomes a bit intricate in morphologically complex languages, like Amharic. In this paper, we evaluated different models such as bidirectional lo...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394909/ https://www.ncbi.nlm.nih.gov/pubmed/37539248 http://dx.doi.org/10.1016/j.heliyon.2023.e17175 |
_version_ | 1785083476274315264 |
---|---|
author | Hirpassa, Sintayehu Lehal, G.S. |
author_facet | Hirpassa, Sintayehu Lehal, G.S. |
author_sort | Hirpassa, Sintayehu |
collection | PubMed |
description | To date, several POS taggers have been introduced to facilitate the success of semantic analysis for different languages. However, the task of POS tagging becomes a bit intricate in morphologically complex languages, like Amharic. In this paper, we evaluated different models such as bidirectional long short term memory, convolutional neural network in combination with bidirectional long short term memory, and conditional random field for Amharic POS tagging. Various features, both language-dependent and -independent, have been explored in a conditional random field model. Besides, word-level and character-level features are analyzed in deep neural network models. A convolutional neural network is utilized for encoding features at the word and character level. Each model's performance has evaluated on the dataset that contained 321 K tokens and manually tagged with 31 POS tags. Lastly, the best performance obtained by an end-to-end deep neural network model, convolutional neural network in combination with bidirectional long term short memory and conditional random field, is 97.23% accuracy. This is the highest accuracy for Amharic POS tagging task and is competent with contemporary taggers currently existing in different languages. |
format | Online Article Text |
id | pubmed-10394909 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-103949092023-08-03 Improving part-of-speech tagging in Amharic language using deep neural network Hirpassa, Sintayehu Lehal, G.S. Heliyon Research Article To date, several POS taggers have been introduced to facilitate the success of semantic analysis for different languages. However, the task of POS tagging becomes a bit intricate in morphologically complex languages, like Amharic. In this paper, we evaluated different models such as bidirectional long short term memory, convolutional neural network in combination with bidirectional long short term memory, and conditional random field for Amharic POS tagging. Various features, both language-dependent and -independent, have been explored in a conditional random field model. Besides, word-level and character-level features are analyzed in deep neural network models. A convolutional neural network is utilized for encoding features at the word and character level. Each model's performance has evaluated on the dataset that contained 321 K tokens and manually tagged with 31 POS tags. Lastly, the best performance obtained by an end-to-end deep neural network model, convolutional neural network in combination with bidirectional long term short memory and conditional random field, is 97.23% accuracy. This is the highest accuracy for Amharic POS tagging task and is competent with contemporary taggers currently existing in different languages. Elsevier 2023-06-21 /pmc/articles/PMC10394909/ /pubmed/37539248 http://dx.doi.org/10.1016/j.heliyon.2023.e17175 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Hirpassa, Sintayehu Lehal, G.S. Improving part-of-speech tagging in Amharic language using deep neural network |
title | Improving part-of-speech tagging in Amharic language using deep neural network |
title_full | Improving part-of-speech tagging in Amharic language using deep neural network |
title_fullStr | Improving part-of-speech tagging in Amharic language using deep neural network |
title_full_unstemmed | Improving part-of-speech tagging in Amharic language using deep neural network |
title_short | Improving part-of-speech tagging in Amharic language using deep neural network |
title_sort | improving part-of-speech tagging in amharic language using deep neural network |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394909/ https://www.ncbi.nlm.nih.gov/pubmed/37539248 http://dx.doi.org/10.1016/j.heliyon.2023.e17175 |
work_keys_str_mv | AT hirpassasintayehu improvingpartofspeechtagginginamhariclanguageusingdeepneuralnetwork AT lehalgs improvingpartofspeechtagginginamhariclanguageusingdeepneuralnetwork |