Cargando…
Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives
Non-semantic word graphs obtained from oral reports are useful to describe cognitive decline in psychiatric conditions such as Schizophrenia, as well as education-related gains in discourse structure during typical development. Here we provide non-semantic word graph attributes of texts spanning app...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8379624/ https://www.ncbi.nlm.nih.gov/pubmed/34458523 http://dx.doi.org/10.1016/j.dib.2021.107296 |
_version_ | 1783741045480095744 |
---|---|
author | Mota, Natália Bezerra Pinheiro, Sylvia Guerreiro, Antonio Copelli, Mauro Ribeiro, Sidarta |
author_facet | Mota, Natália Bezerra Pinheiro, Sylvia Guerreiro, Antonio Copelli, Mauro Ribeiro, Sidarta |
author_sort | Mota, Natália Bezerra |
collection | PubMed |
description | Non-semantic word graphs obtained from oral reports are useful to describe cognitive decline in psychiatric conditions such as Schizophrenia, as well as education-related gains in discourse structure during typical development. Here we provide non-semantic word graph attributes of texts spanning approximately 4500 years of history, and pre-literate Amerindian oral narratives. The dataset assessed comprises 707 literary texts representative of 9 different Afro-Eurasian traditions (Syro-Mesopotamian, Egyptian, Hinduist, Persian, Judeo-Christian, Greek-Roman, Medieval, Modern and Contemporary), and Amerindian narratives (N = 39) obtained from a single ethnic group from South America (Kalapalo, N = 18), or from a mixed ethnic group from South, Central and North America (non-Kalapalo, N = 21). The present article provides detailed information about each text or narrative, including measurements of four graph attributes of interest: number of nodes (lexical diversity), repeated edges (short-range recurrence), largest strongly connected component (long-range recurrence), and average shortest path (graph length). |
format | Online Article Text |
id | pubmed-8379624 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-83796242021-08-27 Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives Mota, Natália Bezerra Pinheiro, Sylvia Guerreiro, Antonio Copelli, Mauro Ribeiro, Sidarta Data Brief Data Article Non-semantic word graphs obtained from oral reports are useful to describe cognitive decline in psychiatric conditions such as Schizophrenia, as well as education-related gains in discourse structure during typical development. Here we provide non-semantic word graph attributes of texts spanning approximately 4500 years of history, and pre-literate Amerindian oral narratives. The dataset assessed comprises 707 literary texts representative of 9 different Afro-Eurasian traditions (Syro-Mesopotamian, Egyptian, Hinduist, Persian, Judeo-Christian, Greek-Roman, Medieval, Modern and Contemporary), and Amerindian narratives (N = 39) obtained from a single ethnic group from South America (Kalapalo, N = 18), or from a mixed ethnic group from South, Central and North America (non-Kalapalo, N = 21). The present article provides detailed information about each text or narrative, including measurements of four graph attributes of interest: number of nodes (lexical diversity), repeated edges (short-range recurrence), largest strongly connected component (long-range recurrence), and average shortest path (graph length). Elsevier 2021-08-14 /pmc/articles/PMC8379624/ /pubmed/34458523 http://dx.doi.org/10.1016/j.dib.2021.107296 Text en © 2021 The Authors. Published by Elsevier Inc. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Data Article Mota, Natália Bezerra Pinheiro, Sylvia Guerreiro, Antonio Copelli, Mauro Ribeiro, Sidarta Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title | Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title_full | Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title_fullStr | Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title_full_unstemmed | Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title_short | Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives |
title_sort | nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate amerindian oral narratives |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8379624/ https://www.ncbi.nlm.nih.gov/pubmed/34458523 http://dx.doi.org/10.1016/j.dib.2021.107296 |
work_keys_str_mv | AT motanataliabezerra nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives AT pinheirosylvia nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives AT guerreiroantonio nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives AT copellimauro nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives AT ribeirosidarta nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives |