Cargando…

Improving part-of-speech tagging in Amharic language using deep neural network

To date, several POS taggers have been introduced to facilitate the success of semantic analysis for different languages. However, the task of POS tagging becomes a bit intricate in morphologically complex languages, like Amharic. In this paper, we evaluated different models such as bidirectional lo...

Descripción completa

Detalles Bibliográficos
Autores principales: Hirpassa, Sintayehu, Lehal, G.S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394909/
https://www.ncbi.nlm.nih.gov/pubmed/37539248
http://dx.doi.org/10.1016/j.heliyon.2023.e17175
_version_ 1785083476274315264
author Hirpassa, Sintayehu
Lehal, G.S.
author_facet Hirpassa, Sintayehu
Lehal, G.S.
author_sort Hirpassa, Sintayehu
collection PubMed
description To date, several POS taggers have been introduced to facilitate the success of semantic analysis for different languages. However, the task of POS tagging becomes a bit intricate in morphologically complex languages, like Amharic. In this paper, we evaluated different models such as bidirectional long short term memory, convolutional neural network in combination with bidirectional long short term memory, and conditional random field for Amharic POS tagging. Various features, both language-dependent and -independent, have been explored in a conditional random field model. Besides, word-level and character-level features are analyzed in deep neural network models. A convolutional neural network is utilized for encoding features at the word and character level. Each model's performance has evaluated on the dataset that contained 321 K tokens and manually tagged with 31 POS tags. Lastly, the best performance obtained by an end-to-end deep neural network model, convolutional neural network in combination with bidirectional long term short memory and conditional random field, is 97.23% accuracy. This is the highest accuracy for Amharic POS tagging task and is competent with contemporary taggers currently existing in different languages.
format Online
Article
Text
id pubmed-10394909
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-103949092023-08-03 Improving part-of-speech tagging in Amharic language using deep neural network Hirpassa, Sintayehu Lehal, G.S. Heliyon Research Article To date, several POS taggers have been introduced to facilitate the success of semantic analysis for different languages. However, the task of POS tagging becomes a bit intricate in morphologically complex languages, like Amharic. In this paper, we evaluated different models such as bidirectional long short term memory, convolutional neural network in combination with bidirectional long short term memory, and conditional random field for Amharic POS tagging. Various features, both language-dependent and -independent, have been explored in a conditional random field model. Besides, word-level and character-level features are analyzed in deep neural network models. A convolutional neural network is utilized for encoding features at the word and character level. Each model's performance has evaluated on the dataset that contained 321 K tokens and manually tagged with 31 POS tags. Lastly, the best performance obtained by an end-to-end deep neural network model, convolutional neural network in combination with bidirectional long term short memory and conditional random field, is 97.23% accuracy. This is the highest accuracy for Amharic POS tagging task and is competent with contemporary taggers currently existing in different languages. Elsevier 2023-06-21 /pmc/articles/PMC10394909/ /pubmed/37539248 http://dx.doi.org/10.1016/j.heliyon.2023.e17175 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Hirpassa, Sintayehu
Lehal, G.S.
Improving part-of-speech tagging in Amharic language using deep neural network
title Improving part-of-speech tagging in Amharic language using deep neural network
title_full Improving part-of-speech tagging in Amharic language using deep neural network
title_fullStr Improving part-of-speech tagging in Amharic language using deep neural network
title_full_unstemmed Improving part-of-speech tagging in Amharic language using deep neural network
title_short Improving part-of-speech tagging in Amharic language using deep neural network
title_sort improving part-of-speech tagging in amharic language using deep neural network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10394909/
https://www.ncbi.nlm.nih.gov/pubmed/37539248
http://dx.doi.org/10.1016/j.heliyon.2023.e17175
work_keys_str_mv AT hirpassasintayehu improvingpartofspeechtagginginamhariclanguageusingdeepneuralnetwork
AT lehalgs improvingpartofspeechtagginginamhariclanguageusingdeepneuralnetwork