Cargando…

A dataset for connecting similar past and present causalities

In this data article, we present a dataset that includes past causalities and categories to connect similar past and present causalities. First, we collect past causalities by referencing certain well-known Japanese high-school textbooks. Subsequently, we select 138 causalities that are useful for a...

Descripción completa

Detalles Bibliográficos
Autores principales: Ikejiri, Ryohei, Sumikawa, Yasunobu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7013155/
https://www.ncbi.nlm.nih.gov/pubmed/32071969
http://dx.doi.org/10.1016/j.dib.2020.105185
_version_ 1783496355854942208
author Ikejiri, Ryohei
Sumikawa, Yasunobu
author_facet Ikejiri, Ryohei
Sumikawa, Yasunobu
author_sort Ikejiri, Ryohei
collection PubMed
description In this data article, we present a dataset that includes past causalities and categories to connect similar past and present causalities. First, we collect past causalities by referencing certain well-known Japanese high-school textbooks. Subsequently, we select 138 causalities that are useful for analogizing from the causalities to considering solutions for confront present social issues. To enhance the analogy, we describe each causality in three contexts: background including problems, solution methods, and their results. We define 13 categories based on the selected causalities and Encyclopedia of Historiography. The past causalities belong to more than one category. In addition, to train machine learning models including classifier, we collect 900 past events from Wikipedia, and assign one or more categories to the past event data. We perform statistical analyses to understand the quality of the dataset. The proposed applications of the dataset include training machine learning models such as classifiers for past causalities and information retrieval for ranking present social issues according to the similarities between the present and past causalities.
format Online
Article
Text
id pubmed-7013155
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-70131552020-02-18 A dataset for connecting similar past and present causalities Ikejiri, Ryohei Sumikawa, Yasunobu Data Brief Computer Science In this data article, we present a dataset that includes past causalities and categories to connect similar past and present causalities. First, we collect past causalities by referencing certain well-known Japanese high-school textbooks. Subsequently, we select 138 causalities that are useful for analogizing from the causalities to considering solutions for confront present social issues. To enhance the analogy, we describe each causality in three contexts: background including problems, solution methods, and their results. We define 13 categories based on the selected causalities and Encyclopedia of Historiography. The past causalities belong to more than one category. In addition, to train machine learning models including classifier, we collect 900 past events from Wikipedia, and assign one or more categories to the past event data. We perform statistical analyses to understand the quality of the dataset. The proposed applications of the dataset include training machine learning models such as classifiers for past causalities and information retrieval for ranking present social issues according to the similarities between the present and past causalities. Elsevier 2020-01-27 /pmc/articles/PMC7013155/ /pubmed/32071969 http://dx.doi.org/10.1016/j.dib.2020.105185 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Computer Science
Ikejiri, Ryohei
Sumikawa, Yasunobu
A dataset for connecting similar past and present causalities
title A dataset for connecting similar past and present causalities
title_full A dataset for connecting similar past and present causalities
title_fullStr A dataset for connecting similar past and present causalities
title_full_unstemmed A dataset for connecting similar past and present causalities
title_short A dataset for connecting similar past and present causalities
title_sort dataset for connecting similar past and present causalities
topic Computer Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7013155/
https://www.ncbi.nlm.nih.gov/pubmed/32071969
http://dx.doi.org/10.1016/j.dib.2020.105185
work_keys_str_mv AT ikejiriryohei adatasetforconnectingsimilarpastandpresentcausalities
AT sumikawayasunobu adatasetforconnectingsimilarpastandpresentcausalities
AT ikejiriryohei datasetforconnectingsimilarpastandpresentcausalities
AT sumikawayasunobu datasetforconnectingsimilarpastandpresentcausalities