Cargando…

A comparative analysis on question classification task based on deep learning approaches

Question classification is one of the essential tasks for automatic question answering implementation in natural language processing (NLP). Recently, there have been several text-mining issues such as text classification, document categorization, web mining, sentiment analysis, and spam filtering th...

Descripción completa

Detalles Bibliográficos
Autores principales: Zulqarnain, Muhammad, Khalaf Zager Alsaedi, Ahmed, Ghazali, Rozaida, Ghouse, Muhammad Ghulam, Sharif, Wareesa, Aida Husaini, Noor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356656/
https://www.ncbi.nlm.nih.gov/pubmed/34435091
http://dx.doi.org/10.7717/peerj-cs.570
_version_ 1783736987840151552
author Zulqarnain, Muhammad
Khalaf Zager Alsaedi, Ahmed
Ghazali, Rozaida
Ghouse, Muhammad Ghulam
Sharif, Wareesa
Aida Husaini, Noor
author_facet Zulqarnain, Muhammad
Khalaf Zager Alsaedi, Ahmed
Ghazali, Rozaida
Ghouse, Muhammad Ghulam
Sharif, Wareesa
Aida Husaini, Noor
author_sort Zulqarnain, Muhammad
collection PubMed
description Question classification is one of the essential tasks for automatic question answering implementation in natural language processing (NLP). Recently, there have been several text-mining issues such as text classification, document categorization, web mining, sentiment analysis, and spam filtering that have been successfully achieved by deep learning approaches. In this study, we illustrated and investigated our work on certain deep learning approaches for question classification tasks in an extremely inflected Turkish language. In this study, we trained and tested the deep learning architectures on the questions dataset in Turkish. In addition to this, we used three main deep learning approaches (Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNN)) and we also applied two different deep learning combinations of CNN-GRU and CNN-LSTM architectures. Furthermore, we applied the Word2vec technique with both skip-gram and CBOW methods for word embedding with various vector sizes on a large corpus composed of user questions. By comparing analysis, we conducted an experiment on deep learning architectures based on test and 10-cross fold validation accuracy. Experiment results were obtained to illustrate the effectiveness of various Word2vec techniques that have a considerable impact on the accuracy rate using different deep learning approaches. We attained an accuracy of 93.7% by using these techniques on the question dataset.
format Online
Article
Text
id pubmed-8356656
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-83566562021-08-24 A comparative analysis on question classification task based on deep learning approaches Zulqarnain, Muhammad Khalaf Zager Alsaedi, Ahmed Ghazali, Rozaida Ghouse, Muhammad Ghulam Sharif, Wareesa Aida Husaini, Noor PeerJ Comput Sci Data Mining and Machine Learning Question classification is one of the essential tasks for automatic question answering implementation in natural language processing (NLP). Recently, there have been several text-mining issues such as text classification, document categorization, web mining, sentiment analysis, and spam filtering that have been successfully achieved by deep learning approaches. In this study, we illustrated and investigated our work on certain deep learning approaches for question classification tasks in an extremely inflected Turkish language. In this study, we trained and tested the deep learning architectures on the questions dataset in Turkish. In addition to this, we used three main deep learning approaches (Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNN)) and we also applied two different deep learning combinations of CNN-GRU and CNN-LSTM architectures. Furthermore, we applied the Word2vec technique with both skip-gram and CBOW methods for word embedding with various vector sizes on a large corpus composed of user questions. By comparing analysis, we conducted an experiment on deep learning architectures based on test and 10-cross fold validation accuracy. Experiment results were obtained to illustrate the effectiveness of various Word2vec techniques that have a considerable impact on the accuracy rate using different deep learning approaches. We attained an accuracy of 93.7% by using these techniques on the question dataset. PeerJ Inc. 2021-08-03 /pmc/articles/PMC8356656/ /pubmed/34435091 http://dx.doi.org/10.7717/peerj-cs.570 Text en © 2021 Zulqarnain et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Data Mining and Machine Learning
Zulqarnain, Muhammad
Khalaf Zager Alsaedi, Ahmed
Ghazali, Rozaida
Ghouse, Muhammad Ghulam
Sharif, Wareesa
Aida Husaini, Noor
A comparative analysis on question classification task based on deep learning approaches
title A comparative analysis on question classification task based on deep learning approaches
title_full A comparative analysis on question classification task based on deep learning approaches
title_fullStr A comparative analysis on question classification task based on deep learning approaches
title_full_unstemmed A comparative analysis on question classification task based on deep learning approaches
title_short A comparative analysis on question classification task based on deep learning approaches
title_sort comparative analysis on question classification task based on deep learning approaches
topic Data Mining and Machine Learning
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356656/
https://www.ncbi.nlm.nih.gov/pubmed/34435091
http://dx.doi.org/10.7717/peerj-cs.570
work_keys_str_mv AT zulqarnainmuhammad acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT khalafzageralsaediahmed acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT ghazalirozaida acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT ghousemuhammadghulam acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT sharifwareesa acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT aidahusaininoor acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT zulqarnainmuhammad comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT khalafzageralsaediahmed comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT ghazalirozaida comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT ghousemuhammadghulam comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT sharifwareesa comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches
AT aidahusaininoor comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches