Cargando…

An FAQ dataset for E-learning system used on a Japanese University

In this data article, we present an FAQ dataset written in Japanese and its translation to English in order to train chatbot models for e-learning systems. We first collected raw Q&A data reported as the difficulties from April 2015 to July 2018 by users of the e-learning system introduced at To...

Descripción completa

Detalles Bibliográficos
Autores principales: Sumikawa, Yasunobu, Fujiyoshi, Masaaki, Hatakeyama, Hisashi, Nagai, Masahiro
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6595272/
https://www.ncbi.nlm.nih.gov/pubmed/31294048
http://dx.doi.org/10.1016/j.dib.2019.104001
_version_ 1783430371342286848
author Sumikawa, Yasunobu
Fujiyoshi, Masaaki
Hatakeyama, Hisashi
Nagai, Masahiro
author_facet Sumikawa, Yasunobu
Fujiyoshi, Masaaki
Hatakeyama, Hisashi
Nagai, Masahiro
author_sort Sumikawa, Yasunobu
collection PubMed
description In this data article, we present an FAQ dataset written in Japanese and its translation to English in order to train chatbot models for e-learning systems. We first collected raw Q&A data reported as the difficulties from April 2015 to July 2018 by users of the e-learning system introduced at Tokyo Metropolitan University. We then divided them into 11 categories according to features provided by the e-learning system. Finally, we integrated questions with the same answers in order to create the FAQ form. The dataset contains 427 questions and 79 answers that were examined by experts with experience in using the e-learning system for more than three years. Using this dataset, we performed statistical analyses to evaluate the qualities of the FAQ dataset. The proposed applications of the dataset include not only academic research but also activities; for example, translating from Japanese to another one like Chinese, adapting/modifying our dataset for another e-learning system, and developing language models to obtain highly accurate responses from chatbots.
format Online
Article
Text
id pubmed-6595272
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-65952722019-07-10 An FAQ dataset for E-learning system used on a Japanese University Sumikawa, Yasunobu Fujiyoshi, Masaaki Hatakeyama, Hisashi Nagai, Masahiro Data Brief Computer Science In this data article, we present an FAQ dataset written in Japanese and its translation to English in order to train chatbot models for e-learning systems. We first collected raw Q&A data reported as the difficulties from April 2015 to July 2018 by users of the e-learning system introduced at Tokyo Metropolitan University. We then divided them into 11 categories according to features provided by the e-learning system. Finally, we integrated questions with the same answers in order to create the FAQ form. The dataset contains 427 questions and 79 answers that were examined by experts with experience in using the e-learning system for more than three years. Using this dataset, we performed statistical analyses to evaluate the qualities of the FAQ dataset. The proposed applications of the dataset include not only academic research but also activities; for example, translating from Japanese to another one like Chinese, adapting/modifying our dataset for another e-learning system, and developing language models to obtain highly accurate responses from chatbots. Elsevier 2019-05-24 /pmc/articles/PMC6595272/ /pubmed/31294048 http://dx.doi.org/10.1016/j.dib.2019.104001 Text en © 2019 The Authors http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Computer Science
Sumikawa, Yasunobu
Fujiyoshi, Masaaki
Hatakeyama, Hisashi
Nagai, Masahiro
An FAQ dataset for E-learning system used on a Japanese University
title An FAQ dataset for E-learning system used on a Japanese University
title_full An FAQ dataset for E-learning system used on a Japanese University
title_fullStr An FAQ dataset for E-learning system used on a Japanese University
title_full_unstemmed An FAQ dataset for E-learning system used on a Japanese University
title_short An FAQ dataset for E-learning system used on a Japanese University
title_sort faq dataset for e-learning system used on a japanese university
topic Computer Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6595272/
https://www.ncbi.nlm.nih.gov/pubmed/31294048
http://dx.doi.org/10.1016/j.dib.2019.104001
work_keys_str_mv AT sumikawayasunobu anfaqdatasetforelearningsystemusedonajapaneseuniversity
AT fujiyoshimasaaki anfaqdatasetforelearningsystemusedonajapaneseuniversity
AT hatakeyamahisashi anfaqdatasetforelearningsystemusedonajapaneseuniversity
AT nagaimasahiro anfaqdatasetforelearningsystemusedonajapaneseuniversity
AT sumikawayasunobu faqdatasetforelearningsystemusedonajapaneseuniversity
AT fujiyoshimasaaki faqdatasetforelearningsystemusedonajapaneseuniversity
AT hatakeyamahisashi faqdatasetforelearningsystemusedonajapaneseuniversity
AT nagaimasahiro faqdatasetforelearningsystemusedonajapaneseuniversity