Cargando…

The impossibility of low-rank representations for triangle-rich complex networks

The study of complex networks is a significant development in modern science, and has enriched the social sciences, biology, physics, and computer science. Models and algorithms for such networks are pervasive in our society, and impact human behavior via social networks, search engines, and recomme...

Descripción completa

Detalles Bibliográficos
Autores principales: Seshadhri, C., Sharma, Aneesh, Stolman, Andrew, Goel, Ashish
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7084093/
https://www.ncbi.nlm.nih.gov/pubmed/32123073
http://dx.doi.org/10.1073/pnas.1911030117
_version_ 1783508654988722176
author Seshadhri, C.
Sharma, Aneesh
Stolman, Andrew
Goel, Ashish
author_facet Seshadhri, C.
Sharma, Aneesh
Stolman, Andrew
Goel, Ashish
author_sort Seshadhri, C.
collection PubMed
description The study of complex networks is a significant development in modern science, and has enriched the social sciences, biology, physics, and computer science. Models and algorithms for such networks are pervasive in our society, and impact human behavior via social networks, search engines, and recommender systems, to name a few. A widely used algorithmic technique for modeling such complex networks is to construct a low-dimensional Euclidean embedding of the vertices of the network, where proximity of vertices is interpreted as the likelihood of an edge. Contrary to the common view, we argue that such graph embeddings do not capture salient properties of complex networks. The two properties we focus on are low degree and large clustering coefficients, which have been widely established to be empirically true for real-world networks. We mathematically prove that any embedding (that uses dot products to measure similarity) that can successfully create these two properties must have a rank that is nearly linear in the number of vertices. Among other implications, this establishes that popular embedding techniques such as singular value decomposition and node2vec fail to capture significant structural aspects of real-world complex networks. Furthermore, we empirically study a number of different embedding techniques based on dot product, and show that they all fail to capture the triangle structure.
format Online
Article
Text
id pubmed-7084093
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-70840932020-03-24 The impossibility of low-rank representations for triangle-rich complex networks Seshadhri, C. Sharma, Aneesh Stolman, Andrew Goel, Ashish Proc Natl Acad Sci U S A Physical Sciences The study of complex networks is a significant development in modern science, and has enriched the social sciences, biology, physics, and computer science. Models and algorithms for such networks are pervasive in our society, and impact human behavior via social networks, search engines, and recommender systems, to name a few. A widely used algorithmic technique for modeling such complex networks is to construct a low-dimensional Euclidean embedding of the vertices of the network, where proximity of vertices is interpreted as the likelihood of an edge. Contrary to the common view, we argue that such graph embeddings do not capture salient properties of complex networks. The two properties we focus on are low degree and large clustering coefficients, which have been widely established to be empirically true for real-world networks. We mathematically prove that any embedding (that uses dot products to measure similarity) that can successfully create these two properties must have a rank that is nearly linear in the number of vertices. Among other implications, this establishes that popular embedding techniques such as singular value decomposition and node2vec fail to capture significant structural aspects of real-world complex networks. Furthermore, we empirically study a number of different embedding techniques based on dot product, and show that they all fail to capture the triangle structure. National Academy of Sciences 2020-03-17 2020-03-02 /pmc/articles/PMC7084093/ /pubmed/32123073 http://dx.doi.org/10.1073/pnas.1911030117 Text en Copyright © 2020 the Author(s). Published by PNAS. http://creativecommons.org/licenses/by/4.0/ https://creativecommons.org/licenses/by/4.0/This open access article is distributed under Creative Commons Attribution License 4.0 (CC BY) (http://creativecommons.org/licenses/by/4.0/) .
spellingShingle Physical Sciences
Seshadhri, C.
Sharma, Aneesh
Stolman, Andrew
Goel, Ashish
The impossibility of low-rank representations for triangle-rich complex networks
title The impossibility of low-rank representations for triangle-rich complex networks
title_full The impossibility of low-rank representations for triangle-rich complex networks
title_fullStr The impossibility of low-rank representations for triangle-rich complex networks
title_full_unstemmed The impossibility of low-rank representations for triangle-rich complex networks
title_short The impossibility of low-rank representations for triangle-rich complex networks
title_sort impossibility of low-rank representations for triangle-rich complex networks
topic Physical Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7084093/
https://www.ncbi.nlm.nih.gov/pubmed/32123073
http://dx.doi.org/10.1073/pnas.1911030117
work_keys_str_mv AT seshadhric theimpossibilityoflowrankrepresentationsfortrianglerichcomplexnetworks
AT sharmaaneesh theimpossibilityoflowrankrepresentationsfortrianglerichcomplexnetworks
AT stolmanandrew theimpossibilityoflowrankrepresentationsfortrianglerichcomplexnetworks
AT goelashish theimpossibilityoflowrankrepresentationsfortrianglerichcomplexnetworks
AT seshadhric impossibilityoflowrankrepresentationsfortrianglerichcomplexnetworks
AT sharmaaneesh impossibilityoflowrankrepresentationsfortrianglerichcomplexnetworks
AT stolmanandrew impossibilityoflowrankrepresentationsfortrianglerichcomplexnetworks
AT goelashish impossibilityoflowrankrepresentationsfortrianglerichcomplexnetworks