Cargando…
Automated Amharic News Categorization Using Deep Learning Models
For decades, machine learning techniques have been used to process Amharic texts. The potential application of deep learning on Amharic document classification has not been exploited due to a lack of language resources. In this paper, we present a deep learning model for Amharic news document classi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8331297/ https://www.ncbi.nlm.nih.gov/pubmed/34354742 http://dx.doi.org/10.1155/2021/3774607 |
_version_ | 1783732891292794880 |
---|---|
author | Endalie, Demeke Haile, Getamesay |
author_facet | Endalie, Demeke Haile, Getamesay |
author_sort | Endalie, Demeke |
collection | PubMed |
description | For decades, machine learning techniques have been used to process Amharic texts. The potential application of deep learning on Amharic document classification has not been exploited due to a lack of language resources. In this paper, we present a deep learning model for Amharic news document classification. The proposed model uses fastText to generate text vectors to represent semantic meaning of texts and solve the problem of traditional methods. The text vectors matrix is then fed into the embedding layer of a convolutional neural network (CNN), which automatically extracts features. We conduct experiments on a data set with six news categories, and our approach produced a classification accuracy of 93.79%. We compared our method to well-known machine learning algorithms such as support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), XGBoost (XGB), and random forest (RF) and achieved good results. |
format | Online Article Text |
id | pubmed-8331297 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-83312972021-08-04 Automated Amharic News Categorization Using Deep Learning Models Endalie, Demeke Haile, Getamesay Comput Intell Neurosci Research Article For decades, machine learning techniques have been used to process Amharic texts. The potential application of deep learning on Amharic document classification has not been exploited due to a lack of language resources. In this paper, we present a deep learning model for Amharic news document classification. The proposed model uses fastText to generate text vectors to represent semantic meaning of texts and solve the problem of traditional methods. The text vectors matrix is then fed into the embedding layer of a convolutional neural network (CNN), which automatically extracts features. We conduct experiments on a data set with six news categories, and our approach produced a classification accuracy of 93.79%. We compared our method to well-known machine learning algorithms such as support vector machine (SVM), multilayer perceptron (MLP), decision tree (DT), XGBoost (XGB), and random forest (RF) and achieved good results. Hindawi 2021-07-27 /pmc/articles/PMC8331297/ /pubmed/34354742 http://dx.doi.org/10.1155/2021/3774607 Text en Copyright © 2021 Demeke Endalie and Getamesay Haile. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Endalie, Demeke Haile, Getamesay Automated Amharic News Categorization Using Deep Learning Models |
title | Automated Amharic News Categorization Using Deep Learning Models |
title_full | Automated Amharic News Categorization Using Deep Learning Models |
title_fullStr | Automated Amharic News Categorization Using Deep Learning Models |
title_full_unstemmed | Automated Amharic News Categorization Using Deep Learning Models |
title_short | Automated Amharic News Categorization Using Deep Learning Models |
title_sort | automated amharic news categorization using deep learning models |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8331297/ https://www.ncbi.nlm.nih.gov/pubmed/34354742 http://dx.doi.org/10.1155/2021/3774607 |
work_keys_str_mv | AT endaliedemeke automatedamharicnewscategorizationusingdeeplearningmodels AT hailegetamesay automatedamharicnewscategorizationusingdeeplearningmodels |