Cargando…

An efficient approach for textual data classification using deep learning

Text categorization is an effective activity that can be accomplished using a variety of classification algorithms. In machine learning, the classifier is built by learning the features of categories from a set of preset training data. Similarly, deep learning offers enormous benefits for text class...

Descripción completa

Detalles Bibliográficos
Autores principales: Alqahtani, Abdullah, Ullah Khan, Habib, Alsubai, Shtwai, Sha, Mohemmed, Almadhor, Ahmad, Iqbal, Tayyab, Abbas, Sidra
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9521674/
https://www.ncbi.nlm.nih.gov/pubmed/36185709
http://dx.doi.org/10.3389/fncom.2022.992296
Descripción
Sumario:Text categorization is an effective activity that can be accomplished using a variety of classification algorithms. In machine learning, the classifier is built by learning the features of categories from a set of preset training data. Similarly, deep learning offers enormous benefits for text classification since they execute highly accurately with lower-level engineering and processing. This paper employs machine and deep learning techniques to classify textual data. Textual data contains much useless information that must be pre-processed. We clean the data, impute missing values, and eliminate the repeated columns. Next, we employ machine learning algorithms: logistic regression, random forest, K-nearest neighbors (KNN), and deep learning algorithms: long short-term memory (LSTM), artificial neural network (ANN), and gated recurrent unit (GRU) for classification. Results reveal that LSTM achieves 92% accuracy outperforming all other model and baseline studies.