Cargando…

Learning Document Semantic Representation with Hybrid Deep Belief Network

High-level abstraction, for example, semantic representation, is vital for document classification and retrieval. However, how to learn document semantic representation is still a topic open for discussion in information retrieval and natural language processing. In this paper, we propose a new Hybr...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Yan, Yin, Xu-Cheng, Li, Sujian, Yang, Mingyuan, Hao, Hong-Wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4386712/
https://www.ncbi.nlm.nih.gov/pubmed/25878657
http://dx.doi.org/10.1155/2015/650527
_version_ 1782365201053515776
author Yan, Yan
Yin, Xu-Cheng
Li, Sujian
Yang, Mingyuan
Hao, Hong-Wei
author_facet Yan, Yan
Yin, Xu-Cheng
Li, Sujian
Yang, Mingyuan
Hao, Hong-Wei
author_sort Yan, Yan
collection PubMed
description High-level abstraction, for example, semantic representation, is vital for document classification and retrieval. However, how to learn document semantic representation is still a topic open for discussion in information retrieval and natural language processing. In this paper, we propose a new Hybrid Deep Belief Network (HDBN) which uses Deep Boltzmann Machine (DBM) on the lower layers together with Deep Belief Network (DBN) on the upper layers. The advantage of DBM is that it employs undirected connection when training weight parameters which can be used to sample the states of nodes on each layer more successfully and it is also an effective way to remove noise from the different document representation type; the DBN can enhance extract abstract of the document in depth, making the model learn sufficient semantic representation. At the same time, we explore different input strategies for semantic distributed representation. Experimental results show that our model using the word embedding instead of single word has better performance.
format Online
Article
Text
id pubmed-4386712
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-43867122015-04-15 Learning Document Semantic Representation with Hybrid Deep Belief Network Yan, Yan Yin, Xu-Cheng Li, Sujian Yang, Mingyuan Hao, Hong-Wei Comput Intell Neurosci Research Article High-level abstraction, for example, semantic representation, is vital for document classification and retrieval. However, how to learn document semantic representation is still a topic open for discussion in information retrieval and natural language processing. In this paper, we propose a new Hybrid Deep Belief Network (HDBN) which uses Deep Boltzmann Machine (DBM) on the lower layers together with Deep Belief Network (DBN) on the upper layers. The advantage of DBM is that it employs undirected connection when training weight parameters which can be used to sample the states of nodes on each layer more successfully and it is also an effective way to remove noise from the different document representation type; the DBN can enhance extract abstract of the document in depth, making the model learn sufficient semantic representation. At the same time, we explore different input strategies for semantic distributed representation. Experimental results show that our model using the word embedding instead of single word has better performance. Hindawi Publishing Corporation 2015 2015-03-23 /pmc/articles/PMC4386712/ /pubmed/25878657 http://dx.doi.org/10.1155/2015/650527 Text en Copyright © 2015 Yan Yan et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yan, Yan
Yin, Xu-Cheng
Li, Sujian
Yang, Mingyuan
Hao, Hong-Wei
Learning Document Semantic Representation with Hybrid Deep Belief Network
title Learning Document Semantic Representation with Hybrid Deep Belief Network
title_full Learning Document Semantic Representation with Hybrid Deep Belief Network
title_fullStr Learning Document Semantic Representation with Hybrid Deep Belief Network
title_full_unstemmed Learning Document Semantic Representation with Hybrid Deep Belief Network
title_short Learning Document Semantic Representation with Hybrid Deep Belief Network
title_sort learning document semantic representation with hybrid deep belief network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4386712/
https://www.ncbi.nlm.nih.gov/pubmed/25878657
http://dx.doi.org/10.1155/2015/650527
work_keys_str_mv AT yanyan learningdocumentsemanticrepresentationwithhybriddeepbeliefnetwork
AT yinxucheng learningdocumentsemanticrepresentationwithhybriddeepbeliefnetwork
AT lisujian learningdocumentsemanticrepresentationwithhybriddeepbeliefnetwork
AT yangmingyuan learningdocumentsemanticrepresentationwithhybriddeepbeliefnetwork
AT haohongwei learningdocumentsemanticrepresentationwithhybriddeepbeliefnetwork