Cargando…

Representing higher-order dependencies in networks

To ensure the correctness of network analysis methods, the network (as the input) has to be a sufficiently accurate representation of the underlying data. However, when representing sequential data from complex systems, such as global shipping traffic or Web clickstream traffic as networks, conventi...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Jian, Wickramarathne, Thanuka L., Chawla, Nitesh V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for the Advancement of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4928957/
https://www.ncbi.nlm.nih.gov/pubmed/27386539
http://dx.doi.org/10.1126/sciadv.1600028
_version_ 1782440530178736128
author Xu, Jian
Wickramarathne, Thanuka L.
Chawla, Nitesh V.
author_facet Xu, Jian
Wickramarathne, Thanuka L.
Chawla, Nitesh V.
author_sort Xu, Jian
collection PubMed
description To ensure the correctness of network analysis methods, the network (as the input) has to be a sufficiently accurate representation of the underlying data. However, when representing sequential data from complex systems, such as global shipping traffic or Web clickstream traffic as networks, conventional network representations that implicitly assume the Markov property (first-order dependency) can quickly become limiting. This assumption holds that, when movements are simulated on the network, the next movement depends only on the current node, discounting the fact that the movement may depend on several previous steps. However, we show that data derived from many complex systems can show up to fifth-order dependencies. In these cases, the oversimplifying assumption of the first-order network representation can lead to inaccurate network analysis results. To address this problem, we propose the higher-order network (HON) representation that can discover and embed variable orders of dependencies in a network representation. Through a comprehensive empirical evaluation and analysis, we establish several desirable characteristics of HON, including accuracy, scalability, and direct compatibility with the existing suite of network analysis methods. We illustrate how HON can be applied to a broad variety of tasks, such as random walking, clustering, and ranking, and we demonstrate that, by using it as input, HON yields more accurate results without any modification to these tasks.
format Online
Article
Text
id pubmed-4928957
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher American Association for the Advancement of Science
record_format MEDLINE/PubMed
spelling pubmed-49289572016-07-06 Representing higher-order dependencies in networks Xu, Jian Wickramarathne, Thanuka L. Chawla, Nitesh V. Sci Adv Research Articles To ensure the correctness of network analysis methods, the network (as the input) has to be a sufficiently accurate representation of the underlying data. However, when representing sequential data from complex systems, such as global shipping traffic or Web clickstream traffic as networks, conventional network representations that implicitly assume the Markov property (first-order dependency) can quickly become limiting. This assumption holds that, when movements are simulated on the network, the next movement depends only on the current node, discounting the fact that the movement may depend on several previous steps. However, we show that data derived from many complex systems can show up to fifth-order dependencies. In these cases, the oversimplifying assumption of the first-order network representation can lead to inaccurate network analysis results. To address this problem, we propose the higher-order network (HON) representation that can discover and embed variable orders of dependencies in a network representation. Through a comprehensive empirical evaluation and analysis, we establish several desirable characteristics of HON, including accuracy, scalability, and direct compatibility with the existing suite of network analysis methods. We illustrate how HON can be applied to a broad variety of tasks, such as random walking, clustering, and ranking, and we demonstrate that, by using it as input, HON yields more accurate results without any modification to these tasks. American Association for the Advancement of Science 2016-05-20 /pmc/articles/PMC4928957/ /pubmed/27386539 http://dx.doi.org/10.1126/sciadv.1600028 Text en Copyright © 2016, The Authors http://creativecommons.org/licenses/by-nc/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license (http://creativecommons.org/licenses/by-nc/4.0/) , which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited.
spellingShingle Research Articles
Xu, Jian
Wickramarathne, Thanuka L.
Chawla, Nitesh V.
Representing higher-order dependencies in networks
title Representing higher-order dependencies in networks
title_full Representing higher-order dependencies in networks
title_fullStr Representing higher-order dependencies in networks
title_full_unstemmed Representing higher-order dependencies in networks
title_short Representing higher-order dependencies in networks
title_sort representing higher-order dependencies in networks
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4928957/
https://www.ncbi.nlm.nih.gov/pubmed/27386539
http://dx.doi.org/10.1126/sciadv.1600028
work_keys_str_mv AT xujian representinghigherorderdependenciesinnetworks
AT wickramarathnethanukal representinghigherorderdependenciesinnetworks
AT chawlaniteshv representinghigherorderdependenciesinnetworks