Cargando…
Question classification based on Bloom’s taxonomy cognitive domain using modified TF-IDF and word2vec
The assessment of examination questions is crucial in educational institutes since examination is one of the most common methods to evaluate students’ achievement in specific course. Therefore, there is a crucial need to construct a balanced and high-quality exam, which satisfies different cognitive...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7081997/ https://www.ncbi.nlm.nih.gov/pubmed/32191738 http://dx.doi.org/10.1371/journal.pone.0230442 |
_version_ | 1783508272128458752 |
---|---|
author | Mohammed, Manal Omar, Nazlia |
author_facet | Mohammed, Manal Omar, Nazlia |
author_sort | Mohammed, Manal |
collection | PubMed |
description | The assessment of examination questions is crucial in educational institutes since examination is one of the most common methods to evaluate students’ achievement in specific course. Therefore, there is a crucial need to construct a balanced and high-quality exam, which satisfies different cognitive levels. Thus, many lecturers rely on Bloom’s taxonomy cognitive domain, which is a popular framework developed for the purpose of assessing students’ intellectual abilities and skills. Several works have been proposed to automatically handle the classification of questions in accordance with Bloom’s taxonomy. Most of these works classify questions according to specific domain. As a result, there is a lack of technique of classifying questions that belong to the multi-domain areas. The aim of this paper is to present a classification model to classify exam questions based on Bloom’s taxonomy that belong to several areas. This study proposes a method for classifying questions automatically, by extracting two features, TFPOS-IDF and word2vec. The purpose of the first feature was to calculate the term frequency-inverse document frequency based on part of speech, in order to assign a suitable weight for essential words in the question. The second feature, pre-trained word2vec, was used to boost the classification process. Then, the combination of these features was fed into three different classifiers; K-Nearest Neighbour, Logistic Regression, and Support Vector Machine, in order to classify the questions. The experiments used two datasets. The first dataset contained 141 questions, while the other dataset contained 600 questions. The classification result for the first dataset achieved an average of 71.1%, 82.3% and 83.7% weighted F1-measure respectively. The classification result for the second dataset achieved an average of 85.4%, 89.4% and 89.7% weighted F1-measure respectively. The finding from this study showed that the proposed method is significant in classifying questions from multiple domains based on Bloom’s taxonomy. |
format | Online Article Text |
id | pubmed-7081997 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-70819972020-03-24 Question classification based on Bloom’s taxonomy cognitive domain using modified TF-IDF and word2vec Mohammed, Manal Omar, Nazlia PLoS One Research Article The assessment of examination questions is crucial in educational institutes since examination is one of the most common methods to evaluate students’ achievement in specific course. Therefore, there is a crucial need to construct a balanced and high-quality exam, which satisfies different cognitive levels. Thus, many lecturers rely on Bloom’s taxonomy cognitive domain, which is a popular framework developed for the purpose of assessing students’ intellectual abilities and skills. Several works have been proposed to automatically handle the classification of questions in accordance with Bloom’s taxonomy. Most of these works classify questions according to specific domain. As a result, there is a lack of technique of classifying questions that belong to the multi-domain areas. The aim of this paper is to present a classification model to classify exam questions based on Bloom’s taxonomy that belong to several areas. This study proposes a method for classifying questions automatically, by extracting two features, TFPOS-IDF and word2vec. The purpose of the first feature was to calculate the term frequency-inverse document frequency based on part of speech, in order to assign a suitable weight for essential words in the question. The second feature, pre-trained word2vec, was used to boost the classification process. Then, the combination of these features was fed into three different classifiers; K-Nearest Neighbour, Logistic Regression, and Support Vector Machine, in order to classify the questions. The experiments used two datasets. The first dataset contained 141 questions, while the other dataset contained 600 questions. The classification result for the first dataset achieved an average of 71.1%, 82.3% and 83.7% weighted F1-measure respectively. The classification result for the second dataset achieved an average of 85.4%, 89.4% and 89.7% weighted F1-measure respectively. The finding from this study showed that the proposed method is significant in classifying questions from multiple domains based on Bloom’s taxonomy. Public Library of Science 2020-03-19 /pmc/articles/PMC7081997/ /pubmed/32191738 http://dx.doi.org/10.1371/journal.pone.0230442 Text en © 2020 Mohammed, Omar http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Mohammed, Manal Omar, Nazlia Question classification based on Bloom’s taxonomy cognitive domain using modified TF-IDF and word2vec |
title | Question classification based on Bloom’s taxonomy cognitive domain using modified TF-IDF and word2vec |
title_full | Question classification based on Bloom’s taxonomy cognitive domain using modified TF-IDF and word2vec |
title_fullStr | Question classification based on Bloom’s taxonomy cognitive domain using modified TF-IDF and word2vec |
title_full_unstemmed | Question classification based on Bloom’s taxonomy cognitive domain using modified TF-IDF and word2vec |
title_short | Question classification based on Bloom’s taxonomy cognitive domain using modified TF-IDF and word2vec |
title_sort | question classification based on bloom’s taxonomy cognitive domain using modified tf-idf and word2vec |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7081997/ https://www.ncbi.nlm.nih.gov/pubmed/32191738 http://dx.doi.org/10.1371/journal.pone.0230442 |
work_keys_str_mv | AT mohammedmanal questionclassificationbasedonbloomstaxonomycognitivedomainusingmodifiedtfidfandword2vec AT omarnazlia questionclassificationbasedonbloomstaxonomycognitivedomainusingmodifiedtfidfandword2vec |