Cargando…
A dataset for connecting similar past and present causalities
In this data article, we present a dataset that includes past causalities and categories to connect similar past and present causalities. First, we collect past causalities by referencing certain well-known Japanese high-school textbooks. Subsequently, we select 138 causalities that are useful for a...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7013155/ https://www.ncbi.nlm.nih.gov/pubmed/32071969 http://dx.doi.org/10.1016/j.dib.2020.105185 |
_version_ | 1783496355854942208 |
---|---|
author | Ikejiri, Ryohei Sumikawa, Yasunobu |
author_facet | Ikejiri, Ryohei Sumikawa, Yasunobu |
author_sort | Ikejiri, Ryohei |
collection | PubMed |
description | In this data article, we present a dataset that includes past causalities and categories to connect similar past and present causalities. First, we collect past causalities by referencing certain well-known Japanese high-school textbooks. Subsequently, we select 138 causalities that are useful for analogizing from the causalities to considering solutions for confront present social issues. To enhance the analogy, we describe each causality in three contexts: background including problems, solution methods, and their results. We define 13 categories based on the selected causalities and Encyclopedia of Historiography. The past causalities belong to more than one category. In addition, to train machine learning models including classifier, we collect 900 past events from Wikipedia, and assign one or more categories to the past event data. We perform statistical analyses to understand the quality of the dataset. The proposed applications of the dataset include training machine learning models such as classifiers for past causalities and information retrieval for ranking present social issues according to the similarities between the present and past causalities. |
format | Online Article Text |
id | pubmed-7013155 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-70131552020-02-18 A dataset for connecting similar past and present causalities Ikejiri, Ryohei Sumikawa, Yasunobu Data Brief Computer Science In this data article, we present a dataset that includes past causalities and categories to connect similar past and present causalities. First, we collect past causalities by referencing certain well-known Japanese high-school textbooks. Subsequently, we select 138 causalities that are useful for analogizing from the causalities to considering solutions for confront present social issues. To enhance the analogy, we describe each causality in three contexts: background including problems, solution methods, and their results. We define 13 categories based on the selected causalities and Encyclopedia of Historiography. The past causalities belong to more than one category. In addition, to train machine learning models including classifier, we collect 900 past events from Wikipedia, and assign one or more categories to the past event data. We perform statistical analyses to understand the quality of the dataset. The proposed applications of the dataset include training machine learning models such as classifiers for past causalities and information retrieval for ranking present social issues according to the similarities between the present and past causalities. Elsevier 2020-01-27 /pmc/articles/PMC7013155/ /pubmed/32071969 http://dx.doi.org/10.1016/j.dib.2020.105185 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Computer Science Ikejiri, Ryohei Sumikawa, Yasunobu A dataset for connecting similar past and present causalities |
title | A dataset for connecting similar past and present causalities |
title_full | A dataset for connecting similar past and present causalities |
title_fullStr | A dataset for connecting similar past and present causalities |
title_full_unstemmed | A dataset for connecting similar past and present causalities |
title_short | A dataset for connecting similar past and present causalities |
title_sort | dataset for connecting similar past and present causalities |
topic | Computer Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7013155/ https://www.ncbi.nlm.nih.gov/pubmed/32071969 http://dx.doi.org/10.1016/j.dib.2020.105185 |
work_keys_str_mv | AT ikejiriryohei adatasetforconnectingsimilarpastandpresentcausalities AT sumikawayasunobu adatasetforconnectingsimilarpastandpresentcausalities AT ikejiriryohei datasetforconnectingsimilarpastandpresentcausalities AT sumikawayasunobu datasetforconnectingsimilarpastandpresentcausalities |