Cargando…

Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification

Text classification is one domain in which the naive Bayesian (NB) learning algorithm performs remarkably well. However, making further improvement in performance using ensemble-building techniques proved to be a challenge because NB is a stable algorithm. This work shows that, while an ensemble of...

Descripción completa

Detalles Bibliográficos
Autores principales: El Hindi, Khalil, AlSalman, Hussien, Qasem, Safwan, Al Ahmadi, Saad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7512419/
https://www.ncbi.nlm.nih.gov/pubmed/33266581
http://dx.doi.org/10.3390/e20110857
_version_ 1783586153608249344
author El Hindi, Khalil
AlSalman, Hussien
Qasem, Safwan
Al Ahmadi, Saad
author_facet El Hindi, Khalil
AlSalman, Hussien
Qasem, Safwan
Al Ahmadi, Saad
author_sort El Hindi, Khalil
collection PubMed
description Text classification is one domain in which the naive Bayesian (NB) learning algorithm performs remarkably well. However, making further improvement in performance using ensemble-building techniques proved to be a challenge because NB is a stable algorithm. This work shows that, while an ensemble of NB classifiers achieves little or no improvement in terms of classification accuracy, an ensemble of fine-tuned NB classifiers can achieve a remarkable improvement in accuracy. We propose a fine-tuning algorithm for text classification that is both more accurate and less stable than the NB algorithm and the fine-tuning NB (FTNB) algorithm. This improvement makes it more suitable than the FTNB algorithm for building ensembles of classifiers using bagging. Our empirical experiments, using 16-benchmark text-classification data sets, show significant improvement for most data sets.
format Online
Article
Text
id pubmed-7512419
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-75124192020-11-09 Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification El Hindi, Khalil AlSalman, Hussien Qasem, Safwan Al Ahmadi, Saad Entropy (Basel) Article Text classification is one domain in which the naive Bayesian (NB) learning algorithm performs remarkably well. However, making further improvement in performance using ensemble-building techniques proved to be a challenge because NB is a stable algorithm. This work shows that, while an ensemble of NB classifiers achieves little or no improvement in terms of classification accuracy, an ensemble of fine-tuned NB classifiers can achieve a remarkable improvement in accuracy. We propose a fine-tuning algorithm for text classification that is both more accurate and less stable than the NB algorithm and the fine-tuning NB (FTNB) algorithm. This improvement makes it more suitable than the FTNB algorithm for building ensembles of classifiers using bagging. Our empirical experiments, using 16-benchmark text-classification data sets, show significant improvement for most data sets. MDPI 2018-11-07 /pmc/articles/PMC7512419/ /pubmed/33266581 http://dx.doi.org/10.3390/e20110857 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
El Hindi, Khalil
AlSalman, Hussien
Qasem, Safwan
Al Ahmadi, Saad
Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification
title Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification
title_full Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification
title_fullStr Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification
title_full_unstemmed Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification
title_short Building an Ensemble of Fine-Tuned Naive Bayesian Classifiers for Text Classification
title_sort building an ensemble of fine-tuned naive bayesian classifiers for text classification
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7512419/
https://www.ncbi.nlm.nih.gov/pubmed/33266581
http://dx.doi.org/10.3390/e20110857
work_keys_str_mv AT elhindikhalil buildinganensembleoffinetunednaivebayesianclassifiersfortextclassification
AT alsalmanhussien buildinganensembleoffinetunednaivebayesianclassifiersfortextclassification
AT qasemsafwan buildinganensembleoffinetunednaivebayesianclassifiersfortextclassification
AT alahmadisaad buildinganensembleoffinetunednaivebayesianclassifiersfortextclassification