Cargando…
Classification in Networked Data with Heterophily
In the real world, a large amount of data can be described by networks using relations between data. The data described by networks can be called networked data. Classification is one of the main tasks in analyzing networked data. Most of the previous methods find the class of the unlabeled node usi...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3659646/ https://www.ncbi.nlm.nih.gov/pubmed/23737709 http://dx.doi.org/10.1155/2013/236769 |
_version_ | 1782270481898930176 |
---|---|
author | Wang, Zhenwen Yin, Fengjing Tan, Wentang Xiao, Weidong |
author_facet | Wang, Zhenwen Yin, Fengjing Tan, Wentang Xiao, Weidong |
author_sort | Wang, Zhenwen |
collection | PubMed |
description | In the real world, a large amount of data can be described by networks using relations between data. The data described by networks can be called networked data. Classification is one of the main tasks in analyzing networked data. Most of the previous methods find the class of the unlabeled node using the classes of its neighbor nodes. However, in the networks with heterophily, most of connected nodes belong to different classes. It is hard to get the correct class using the classes of neighbor nodes, so the previous methods have a low level of performance in the networks with heterophily. In this paper, a probabilistic method is proposed to address this problem. Firstly, the class propagating distribution of the node is proposed to describe the probabilities that its neighbor nodes belong to each class. After that, the class propagating distributions of neighbor nodes are used to calculate the class of the unlabeled node. At last, a classification algorithm based on class propagating distribution is presented in the form of matrix operations. In empirical study, we apply the proposed algorithm to the real-world datasets, compared with some other algorithms. The experimental results show that the proposed algorithm performs better when the networks are of heterophily. |
format | Online Article Text |
id | pubmed-3659646 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-36596462013-06-04 Classification in Networked Data with Heterophily Wang, Zhenwen Yin, Fengjing Tan, Wentang Xiao, Weidong ScientificWorldJournal Research Article In the real world, a large amount of data can be described by networks using relations between data. The data described by networks can be called networked data. Classification is one of the main tasks in analyzing networked data. Most of the previous methods find the class of the unlabeled node using the classes of its neighbor nodes. However, in the networks with heterophily, most of connected nodes belong to different classes. It is hard to get the correct class using the classes of neighbor nodes, so the previous methods have a low level of performance in the networks with heterophily. In this paper, a probabilistic method is proposed to address this problem. Firstly, the class propagating distribution of the node is proposed to describe the probabilities that its neighbor nodes belong to each class. After that, the class propagating distributions of neighbor nodes are used to calculate the class of the unlabeled node. At last, a classification algorithm based on class propagating distribution is presented in the form of matrix operations. In empirical study, we apply the proposed algorithm to the real-world datasets, compared with some other algorithms. The experimental results show that the proposed algorithm performs better when the networks are of heterophily. Hindawi Publishing Corporation 2013-04-30 /pmc/articles/PMC3659646/ /pubmed/23737709 http://dx.doi.org/10.1155/2013/236769 Text en Copyright © 2013 Zhenwen Wang et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Wang, Zhenwen Yin, Fengjing Tan, Wentang Xiao, Weidong Classification in Networked Data with Heterophily |
title | Classification in Networked Data with Heterophily |
title_full | Classification in Networked Data with Heterophily |
title_fullStr | Classification in Networked Data with Heterophily |
title_full_unstemmed | Classification in Networked Data with Heterophily |
title_short | Classification in Networked Data with Heterophily |
title_sort | classification in networked data with heterophily |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3659646/ https://www.ncbi.nlm.nih.gov/pubmed/23737709 http://dx.doi.org/10.1155/2013/236769 |
work_keys_str_mv | AT wangzhenwen classificationinnetworkeddatawithheterophily AT yinfengjing classificationinnetworkeddatawithheterophily AT tanwentang classificationinnetworkeddatawithheterophily AT xiaoweidong classificationinnetworkeddatawithheterophily |