Cargando…
A comparative analysis on question classification task based on deep learning approaches
Question classification is one of the essential tasks for automatic question answering implementation in natural language processing (NLP). Recently, there have been several text-mining issues such as text classification, document categorization, web mining, sentiment analysis, and spam filtering th...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356656/ https://www.ncbi.nlm.nih.gov/pubmed/34435091 http://dx.doi.org/10.7717/peerj-cs.570 |
_version_ | 1783736987840151552 |
---|---|
author | Zulqarnain, Muhammad Khalaf Zager Alsaedi, Ahmed Ghazali, Rozaida Ghouse, Muhammad Ghulam Sharif, Wareesa Aida Husaini, Noor |
author_facet | Zulqarnain, Muhammad Khalaf Zager Alsaedi, Ahmed Ghazali, Rozaida Ghouse, Muhammad Ghulam Sharif, Wareesa Aida Husaini, Noor |
author_sort | Zulqarnain, Muhammad |
collection | PubMed |
description | Question classification is one of the essential tasks for automatic question answering implementation in natural language processing (NLP). Recently, there have been several text-mining issues such as text classification, document categorization, web mining, sentiment analysis, and spam filtering that have been successfully achieved by deep learning approaches. In this study, we illustrated and investigated our work on certain deep learning approaches for question classification tasks in an extremely inflected Turkish language. In this study, we trained and tested the deep learning architectures on the questions dataset in Turkish. In addition to this, we used three main deep learning approaches (Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNN)) and we also applied two different deep learning combinations of CNN-GRU and CNN-LSTM architectures. Furthermore, we applied the Word2vec technique with both skip-gram and CBOW methods for word embedding with various vector sizes on a large corpus composed of user questions. By comparing analysis, we conducted an experiment on deep learning architectures based on test and 10-cross fold validation accuracy. Experiment results were obtained to illustrate the effectiveness of various Word2vec techniques that have a considerable impact on the accuracy rate using different deep learning approaches. We attained an accuracy of 93.7% by using these techniques on the question dataset. |
format | Online Article Text |
id | pubmed-8356656 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-83566562021-08-24 A comparative analysis on question classification task based on deep learning approaches Zulqarnain, Muhammad Khalaf Zager Alsaedi, Ahmed Ghazali, Rozaida Ghouse, Muhammad Ghulam Sharif, Wareesa Aida Husaini, Noor PeerJ Comput Sci Data Mining and Machine Learning Question classification is one of the essential tasks for automatic question answering implementation in natural language processing (NLP). Recently, there have been several text-mining issues such as text classification, document categorization, web mining, sentiment analysis, and spam filtering that have been successfully achieved by deep learning approaches. In this study, we illustrated and investigated our work on certain deep learning approaches for question classification tasks in an extremely inflected Turkish language. In this study, we trained and tested the deep learning architectures on the questions dataset in Turkish. In addition to this, we used three main deep learning approaches (Gated Recurrent Unit (GRU), Long Short-Term Memory (LSTM), Convolutional Neural Networks (CNN)) and we also applied two different deep learning combinations of CNN-GRU and CNN-LSTM architectures. Furthermore, we applied the Word2vec technique with both skip-gram and CBOW methods for word embedding with various vector sizes on a large corpus composed of user questions. By comparing analysis, we conducted an experiment on deep learning architectures based on test and 10-cross fold validation accuracy. Experiment results were obtained to illustrate the effectiveness of various Word2vec techniques that have a considerable impact on the accuracy rate using different deep learning approaches. We attained an accuracy of 93.7% by using these techniques on the question dataset. PeerJ Inc. 2021-08-03 /pmc/articles/PMC8356656/ /pubmed/34435091 http://dx.doi.org/10.7717/peerj-cs.570 Text en © 2021 Zulqarnain et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Data Mining and Machine Learning Zulqarnain, Muhammad Khalaf Zager Alsaedi, Ahmed Ghazali, Rozaida Ghouse, Muhammad Ghulam Sharif, Wareesa Aida Husaini, Noor A comparative analysis on question classification task based on deep learning approaches |
title | A comparative analysis on question classification task based on deep learning approaches |
title_full | A comparative analysis on question classification task based on deep learning approaches |
title_fullStr | A comparative analysis on question classification task based on deep learning approaches |
title_full_unstemmed | A comparative analysis on question classification task based on deep learning approaches |
title_short | A comparative analysis on question classification task based on deep learning approaches |
title_sort | comparative analysis on question classification task based on deep learning approaches |
topic | Data Mining and Machine Learning |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8356656/ https://www.ncbi.nlm.nih.gov/pubmed/34435091 http://dx.doi.org/10.7717/peerj-cs.570 |
work_keys_str_mv | AT zulqarnainmuhammad acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT khalafzageralsaediahmed acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT ghazalirozaida acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT ghousemuhammadghulam acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT sharifwareesa acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT aidahusaininoor acomparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT zulqarnainmuhammad comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT khalafzageralsaediahmed comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT ghazalirozaida comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT ghousemuhammadghulam comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT sharifwareesa comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches AT aidahusaininoor comparativeanalysisonquestionclassificationtaskbasedondeeplearningapproaches |