Cargando…

Topological properties and organizing principles of semantic networks

Interpreting natural language is an increasingly important task in computer algorithms due to the growing availability of unstructured textual data. Natural Language Processing (NLP) applications rely on semantic networks for structured knowledge representation. The fundamental properties of semanti...

Descripción completa

Detalles Bibliográficos
Autores principales: Budel, Gabriel, Jin, Ying, Van Mieghem, Piet, Kitsak, Maksim
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10359341/
https://www.ncbi.nlm.nih.gov/pubmed/37474614
http://dx.doi.org/10.1038/s41598-023-37294-8
_version_ 1785075859809370112
author Budel, Gabriel
Jin, Ying
Van Mieghem, Piet
Kitsak, Maksim
author_facet Budel, Gabriel
Jin, Ying
Van Mieghem, Piet
Kitsak, Maksim
author_sort Budel, Gabriel
collection PubMed
description Interpreting natural language is an increasingly important task in computer algorithms due to the growing availability of unstructured textual data. Natural Language Processing (NLP) applications rely on semantic networks for structured knowledge representation. The fundamental properties of semantic networks must be taken into account when designing NLP algorithms, yet they remain to be structurally investigated. We study the properties of semantic networks from ConceptNet, defined by 7 semantic relations from 11 different languages. We find that semantic networks have universal basic properties: they are sparse, highly clustered, and many exhibit power-law degree distributions. Our findings show that the majority of the considered networks are scale-free. Some networks exhibit language-specific properties determined by grammatical rules, for example networks from highly inflected languages, such as e.g. Latin, German, French and Spanish, show peaks in the degree distribution that deviate from a power law. We find that depending on the semantic relation type and the language, the link formation in semantic networks is guided by different principles. In some networks the connections are similarity-based, while in others the connections are more complementarity-based. Finally, we demonstrate how knowledge of similarity and complementarity in semantic networks can improve NLP algorithms in missing link inference.
format Online
Article
Text
id pubmed-10359341
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103593412023-07-22 Topological properties and organizing principles of semantic networks Budel, Gabriel Jin, Ying Van Mieghem, Piet Kitsak, Maksim Sci Rep Article Interpreting natural language is an increasingly important task in computer algorithms due to the growing availability of unstructured textual data. Natural Language Processing (NLP) applications rely on semantic networks for structured knowledge representation. The fundamental properties of semantic networks must be taken into account when designing NLP algorithms, yet they remain to be structurally investigated. We study the properties of semantic networks from ConceptNet, defined by 7 semantic relations from 11 different languages. We find that semantic networks have universal basic properties: they are sparse, highly clustered, and many exhibit power-law degree distributions. Our findings show that the majority of the considered networks are scale-free. Some networks exhibit language-specific properties determined by grammatical rules, for example networks from highly inflected languages, such as e.g. Latin, German, French and Spanish, show peaks in the degree distribution that deviate from a power law. We find that depending on the semantic relation type and the language, the link formation in semantic networks is guided by different principles. In some networks the connections are similarity-based, while in others the connections are more complementarity-based. Finally, we demonstrate how knowledge of similarity and complementarity in semantic networks can improve NLP algorithms in missing link inference. Nature Publishing Group UK 2023-07-20 /pmc/articles/PMC10359341/ /pubmed/37474614 http://dx.doi.org/10.1038/s41598-023-37294-8 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Budel, Gabriel
Jin, Ying
Van Mieghem, Piet
Kitsak, Maksim
Topological properties and organizing principles of semantic networks
title Topological properties and organizing principles of semantic networks
title_full Topological properties and organizing principles of semantic networks
title_fullStr Topological properties and organizing principles of semantic networks
title_full_unstemmed Topological properties and organizing principles of semantic networks
title_short Topological properties and organizing principles of semantic networks
title_sort topological properties and organizing principles of semantic networks
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10359341/
https://www.ncbi.nlm.nih.gov/pubmed/37474614
http://dx.doi.org/10.1038/s41598-023-37294-8
work_keys_str_mv AT budelgabriel topologicalpropertiesandorganizingprinciplesofsemanticnetworks
AT jinying topologicalpropertiesandorganizingprinciplesofsemanticnetworks
AT vanmieghempiet topologicalpropertiesandorganizingprinciplesofsemanticnetworks
AT kitsakmaksim topologicalpropertiesandorganizingprinciplesofsemanticnetworks