Cargando…

EdNet: A Large-Scale Hierarchical Dataset in Education

Advances in Artificial Intelligence in Education (AIEd) and the ever-growing scale of Interactive Educational Systems (IESs) have led to the rise of data-driven approaches for knowledge tracing and learning path recommendation. Unfortunately, collecting student interaction data is challenging and co...

Descripción completa

Detalles Bibliográficos
Autores principales: Choi, Youngduck, Lee, Youngnam, Shin, Dongmin, Cho, Junghyun, Park, Seoyon, Lee, Seewoo, Baek, Jineon, Bae, Chan, Kim, Byungsoo, Heo, Jaewe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334672/
http://dx.doi.org/10.1007/978-3-030-52240-7_13
_version_ 1783553976386453504
author Choi, Youngduck
Lee, Youngnam
Shin, Dongmin
Cho, Junghyun
Park, Seoyon
Lee, Seewoo
Baek, Jineon
Bae, Chan
Kim, Byungsoo
Heo, Jaewe
author_facet Choi, Youngduck
Lee, Youngnam
Shin, Dongmin
Cho, Junghyun
Park, Seoyon
Lee, Seewoo
Baek, Jineon
Bae, Chan
Kim, Byungsoo
Heo, Jaewe
author_sort Choi, Youngduck
collection PubMed
description Advances in Artificial Intelligence in Education (AIEd) and the ever-growing scale of Interactive Educational Systems (IESs) have led to the rise of data-driven approaches for knowledge tracing and learning path recommendation. Unfortunately, collecting student interaction data is challenging and costly. As a result, there is no public large-scale benchmark dataset reflecting the wide variety of student behaviors observed in modern IESs. Although several datasets, such as ASSISTments, Junyi Academy, Synthetic and STATICS are publicly available and widely used, they are not large enough to leverage the full potential of state-of-the-art data-driven models. Furthermore, the recorded behavior is limited to question-solving activities. To this end, we introduce EdNet, a large-scale hierarchical dataset of diverse student activities collected by Santa, a multi-platform self-study solution equipped with an artificial intelligence tutoring system. EdNet contains 131,417,236 interactions from 784,309 students collected over more than 2 years, making it the largest public IES dataset released to date. Unlike existing datasets, EdNet records a wide variety of student actions ranging from question-solving to lecture consumption to item purchasing. Also, EdNet has a hierarchical structure which divides the student actions into 4 different levels of abstractions. The features of EdNet are domain-agnostic, allowing EdNet to be easily extended to different domains. The dataset is publicly released for research purposes. We plan to host challenges in multiple AIEd tasks with EdNet to provide a common ground for the fair comparison between different state-of-the-art models and to encourage the development of practical and effective methods.
format Online
Article
Text
id pubmed-7334672
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73346722020-07-06 EdNet: A Large-Scale Hierarchical Dataset in Education Choi, Youngduck Lee, Youngnam Shin, Dongmin Cho, Junghyun Park, Seoyon Lee, Seewoo Baek, Jineon Bae, Chan Kim, Byungsoo Heo, Jaewe Artificial Intelligence in Education Article Advances in Artificial Intelligence in Education (AIEd) and the ever-growing scale of Interactive Educational Systems (IESs) have led to the rise of data-driven approaches for knowledge tracing and learning path recommendation. Unfortunately, collecting student interaction data is challenging and costly. As a result, there is no public large-scale benchmark dataset reflecting the wide variety of student behaviors observed in modern IESs. Although several datasets, such as ASSISTments, Junyi Academy, Synthetic and STATICS are publicly available and widely used, they are not large enough to leverage the full potential of state-of-the-art data-driven models. Furthermore, the recorded behavior is limited to question-solving activities. To this end, we introduce EdNet, a large-scale hierarchical dataset of diverse student activities collected by Santa, a multi-platform self-study solution equipped with an artificial intelligence tutoring system. EdNet contains 131,417,236 interactions from 784,309 students collected over more than 2 years, making it the largest public IES dataset released to date. Unlike existing datasets, EdNet records a wide variety of student actions ranging from question-solving to lecture consumption to item purchasing. Also, EdNet has a hierarchical structure which divides the student actions into 4 different levels of abstractions. The features of EdNet are domain-agnostic, allowing EdNet to be easily extended to different domains. The dataset is publicly released for research purposes. We plan to host challenges in multiple AIEd tasks with EdNet to provide a common ground for the fair comparison between different state-of-the-art models and to encourage the development of practical and effective methods. 2020-06-10 /pmc/articles/PMC7334672/ http://dx.doi.org/10.1007/978-3-030-52240-7_13 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Choi, Youngduck
Lee, Youngnam
Shin, Dongmin
Cho, Junghyun
Park, Seoyon
Lee, Seewoo
Baek, Jineon
Bae, Chan
Kim, Byungsoo
Heo, Jaewe
EdNet: A Large-Scale Hierarchical Dataset in Education
title EdNet: A Large-Scale Hierarchical Dataset in Education
title_full EdNet: A Large-Scale Hierarchical Dataset in Education
title_fullStr EdNet: A Large-Scale Hierarchical Dataset in Education
title_full_unstemmed EdNet: A Large-Scale Hierarchical Dataset in Education
title_short EdNet: A Large-Scale Hierarchical Dataset in Education
title_sort ednet: a large-scale hierarchical dataset in education
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7334672/
http://dx.doi.org/10.1007/978-3-030-52240-7_13
work_keys_str_mv AT choiyoungduck ednetalargescalehierarchicaldatasetineducation
AT leeyoungnam ednetalargescalehierarchicaldatasetineducation
AT shindongmin ednetalargescalehierarchicaldatasetineducation
AT chojunghyun ednetalargescalehierarchicaldatasetineducation
AT parkseoyon ednetalargescalehierarchicaldatasetineducation
AT leeseewoo ednetalargescalehierarchicaldatasetineducation
AT baekjineon ednetalargescalehierarchicaldatasetineducation
AT baechan ednetalargescalehierarchicaldatasetineducation
AT kimbyungsoo ednetalargescalehierarchicaldatasetineducation
AT heojaewe ednetalargescalehierarchicaldatasetineducation