Cargando…

Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives

Non-semantic word graphs obtained from oral reports are useful to describe cognitive decline in psychiatric conditions such as Schizophrenia, as well as education-related gains in discourse structure during typical development. Here we provide non-semantic word graph attributes of texts spanning app...

Descripción completa

Detalles Bibliográficos
Autores principales: Mota, Natália Bezerra, Pinheiro, Sylvia, Guerreiro, Antonio, Copelli, Mauro, Ribeiro, Sidarta
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8379624/
https://www.ncbi.nlm.nih.gov/pubmed/34458523
http://dx.doi.org/10.1016/j.dib.2021.107296
_version_ 1783741045480095744
author Mota, Natália Bezerra
Pinheiro, Sylvia
Guerreiro, Antonio
Copelli, Mauro
Ribeiro, Sidarta
author_facet Mota, Natália Bezerra
Pinheiro, Sylvia
Guerreiro, Antonio
Copelli, Mauro
Ribeiro, Sidarta
author_sort Mota, Natália Bezerra
collection PubMed
description Non-semantic word graphs obtained from oral reports are useful to describe cognitive decline in psychiatric conditions such as Schizophrenia, as well as education-related gains in discourse structure during typical development. Here we provide non-semantic word graph attributes of texts spanning approximately 4500 years of history, and pre-literate Amerindian oral narratives. The dataset assessed comprises 707 literary texts representative of 9 different Afro-Eurasian traditions (Syro-Mesopotamian, Egyptian, Hinduist, Persian, Judeo-Christian, Greek-Roman, Medieval, Modern and Contemporary), and Amerindian narratives (N = 39) obtained from a single ethnic group from South America (Kalapalo, N = 18), or from a mixed ethnic group from South, Central and North America (non-Kalapalo, N = 21). The present article provides detailed information about each text or narrative, including measurements of four graph attributes of interest: number of nodes (lexical diversity), repeated edges (short-range recurrence), largest strongly connected component (long-range recurrence), and average shortest path (graph length).
format Online
Article
Text
id pubmed-8379624
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-83796242021-08-27 Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives Mota, Natália Bezerra Pinheiro, Sylvia Guerreiro, Antonio Copelli, Mauro Ribeiro, Sidarta Data Brief Data Article Non-semantic word graphs obtained from oral reports are useful to describe cognitive decline in psychiatric conditions such as Schizophrenia, as well as education-related gains in discourse structure during typical development. Here we provide non-semantic word graph attributes of texts spanning approximately 4500 years of history, and pre-literate Amerindian oral narratives. The dataset assessed comprises 707 literary texts representative of 9 different Afro-Eurasian traditions (Syro-Mesopotamian, Egyptian, Hinduist, Persian, Judeo-Christian, Greek-Roman, Medieval, Modern and Contemporary), and Amerindian narratives (N = 39) obtained from a single ethnic group from South America (Kalapalo, N = 18), or from a mixed ethnic group from South, Central and North America (non-Kalapalo, N = 21). The present article provides detailed information about each text or narrative, including measurements of four graph attributes of interest: number of nodes (lexical diversity), repeated edges (short-range recurrence), largest strongly connected component (long-range recurrence), and average shortest path (graph length). Elsevier 2021-08-14 /pmc/articles/PMC8379624/ /pubmed/34458523 http://dx.doi.org/10.1016/j.dib.2021.107296 Text en © 2021 The Authors. Published by Elsevier Inc. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Data Article
Mota, Natália Bezerra
Pinheiro, Sylvia
Guerreiro, Antonio
Copelli, Mauro
Ribeiro, Sidarta
Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives
title Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives
title_full Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives
title_fullStr Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives
title_full_unstemmed Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives
title_short Nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate Amerindian oral narratives
title_sort nonsemantic word graphs of texts spanning ∼ 4500 years, including pre-literate amerindian oral narratives
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8379624/
https://www.ncbi.nlm.nih.gov/pubmed/34458523
http://dx.doi.org/10.1016/j.dib.2021.107296
work_keys_str_mv AT motanataliabezerra nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives
AT pinheirosylvia nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives
AT guerreiroantonio nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives
AT copellimauro nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives
AT ribeirosidarta nonsemanticwordgraphsoftextsspanning4500yearsincludingpreliterateamerindianoralnarratives