Cargando…

Classification in Networked Data with Heterophily

In the real world, a large amount of data can be described by networks using relations between data. The data described by networks can be called networked data. Classification is one of the main tasks in analyzing networked data. Most of the previous methods find the class of the unlabeled node usi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zhenwen, Yin, Fengjing, Tan, Wentang, Xiao, Weidong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3659646/
https://www.ncbi.nlm.nih.gov/pubmed/23737709
http://dx.doi.org/10.1155/2013/236769
_version_ 1782270481898930176
author Wang, Zhenwen
Yin, Fengjing
Tan, Wentang
Xiao, Weidong
author_facet Wang, Zhenwen
Yin, Fengjing
Tan, Wentang
Xiao, Weidong
author_sort Wang, Zhenwen
collection PubMed
description In the real world, a large amount of data can be described by networks using relations between data. The data described by networks can be called networked data. Classification is one of the main tasks in analyzing networked data. Most of the previous methods find the class of the unlabeled node using the classes of its neighbor nodes. However, in the networks with heterophily, most of connected nodes belong to different classes. It is hard to get the correct class using the classes of neighbor nodes, so the previous methods have a low level of performance in the networks with heterophily. In this paper, a probabilistic method is proposed to address this problem. Firstly, the class propagating distribution of the node is proposed to describe the probabilities that its neighbor nodes belong to each class. After that, the class propagating distributions of neighbor nodes are used to calculate the class of the unlabeled node. At last, a classification algorithm based on class propagating distribution is presented in the form of matrix operations. In empirical study, we apply the proposed algorithm to the real-world datasets, compared with some other algorithms. The experimental results show that the proposed algorithm performs better when the networks are of heterophily.
format Online
Article
Text
id pubmed-3659646
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-36596462013-06-04 Classification in Networked Data with Heterophily Wang, Zhenwen Yin, Fengjing Tan, Wentang Xiao, Weidong ScientificWorldJournal Research Article In the real world, a large amount of data can be described by networks using relations between data. The data described by networks can be called networked data. Classification is one of the main tasks in analyzing networked data. Most of the previous methods find the class of the unlabeled node using the classes of its neighbor nodes. However, in the networks with heterophily, most of connected nodes belong to different classes. It is hard to get the correct class using the classes of neighbor nodes, so the previous methods have a low level of performance in the networks with heterophily. In this paper, a probabilistic method is proposed to address this problem. Firstly, the class propagating distribution of the node is proposed to describe the probabilities that its neighbor nodes belong to each class. After that, the class propagating distributions of neighbor nodes are used to calculate the class of the unlabeled node. At last, a classification algorithm based on class propagating distribution is presented in the form of matrix operations. In empirical study, we apply the proposed algorithm to the real-world datasets, compared with some other algorithms. The experimental results show that the proposed algorithm performs better when the networks are of heterophily. Hindawi Publishing Corporation 2013-04-30 /pmc/articles/PMC3659646/ /pubmed/23737709 http://dx.doi.org/10.1155/2013/236769 Text en Copyright © 2013 Zhenwen Wang et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Wang, Zhenwen
Yin, Fengjing
Tan, Wentang
Xiao, Weidong
Classification in Networked Data with Heterophily
title Classification in Networked Data with Heterophily
title_full Classification in Networked Data with Heterophily
title_fullStr Classification in Networked Data with Heterophily
title_full_unstemmed Classification in Networked Data with Heterophily
title_short Classification in Networked Data with Heterophily
title_sort classification in networked data with heterophily
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3659646/
https://www.ncbi.nlm.nih.gov/pubmed/23737709
http://dx.doi.org/10.1155/2013/236769
work_keys_str_mv AT wangzhenwen classificationinnetworkeddatawithheterophily
AT yinfengjing classificationinnetworkeddatawithheterophily
AT tanwentang classificationinnetworkeddatawithheterophily
AT xiaoweidong classificationinnetworkeddatawithheterophily