Cargando…

Document Re-ranking by Generality in Bio-medical Information Retrieval

Document ranking is an important process in information retrieval (IR). It presents retrieved documents in an order of their estimated degrees of relevance to query. Traditional document ranking methods are mostly based on the similarity computations between documents and query. In this paper we arg...

Descripción completa

Detalles Bibliográficos
Autores principales: Yan, Xin, Li, Xue, Song, Dawei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7121049/
http://dx.doi.org/10.1007/11581062_28
_version_ 1783515113209200640
author Yan, Xin
Li, Xue
Song, Dawei
author_facet Yan, Xin
Li, Xue
Song, Dawei
author_sort Yan, Xin
collection PubMed
description Document ranking is an important process in information retrieval (IR). It presents retrieved documents in an order of their estimated degrees of relevance to query. Traditional document ranking methods are mostly based on the similarity computations between documents and query. In this paper we argue that the similarity-based document ranking is insufficient in some cases. There are two reasons. Firstly it is about the increased information variety. There are far too many different types documents available now for user to search. The second is about the users variety. In many cases user may want to retrieve documents that are not only similar but also general or broad regarding a certain topic. This is particularly the case in some domains such as bio-medical IR. In this paper we propose a novel approach to re-rank the retrieved documents by incorporating the similarity with their generality. By an ontology-based analysis on the semantic cohesion of text, document generality can be quantified. The retrieved documents are then re-ranked by their combined scores of similarity and the closeness of documents’ generality to the query’s. Our experiments have shown an encouraging performance on a large bio-medical document collection, OHSUMED, containing 348,566 medical journal references and 101 test queries.
format Online
Article
Text
id pubmed-7121049
institution National Center for Biotechnology Information
language English
publishDate 2005
record_format MEDLINE/PubMed
spelling pubmed-71210492020-04-06 Document Re-ranking by Generality in Bio-medical Information Retrieval Yan, Xin Li, Xue Song, Dawei Web Information Systems Engineering – WISE 2005 Article Document ranking is an important process in information retrieval (IR). It presents retrieved documents in an order of their estimated degrees of relevance to query. Traditional document ranking methods are mostly based on the similarity computations between documents and query. In this paper we argue that the similarity-based document ranking is insufficient in some cases. There are two reasons. Firstly it is about the increased information variety. There are far too many different types documents available now for user to search. The second is about the users variety. In many cases user may want to retrieve documents that are not only similar but also general or broad regarding a certain topic. This is particularly the case in some domains such as bio-medical IR. In this paper we propose a novel approach to re-rank the retrieved documents by incorporating the similarity with their generality. By an ontology-based analysis on the semantic cohesion of text, document generality can be quantified. The retrieved documents are then re-ranked by their combined scores of similarity and the closeness of documents’ generality to the query’s. Our experiments have shown an encouraging performance on a large bio-medical document collection, OHSUMED, containing 348,566 medical journal references and 101 test queries. 2005 /pmc/articles/PMC7121049/ http://dx.doi.org/10.1007/11581062_28 Text en © Springer-Verlag Berlin Heidelberg 2005 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Yan, Xin
Li, Xue
Song, Dawei
Document Re-ranking by Generality in Bio-medical Information Retrieval
title Document Re-ranking by Generality in Bio-medical Information Retrieval
title_full Document Re-ranking by Generality in Bio-medical Information Retrieval
title_fullStr Document Re-ranking by Generality in Bio-medical Information Retrieval
title_full_unstemmed Document Re-ranking by Generality in Bio-medical Information Retrieval
title_short Document Re-ranking by Generality in Bio-medical Information Retrieval
title_sort document re-ranking by generality in bio-medical information retrieval
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7121049/
http://dx.doi.org/10.1007/11581062_28
work_keys_str_mv AT yanxin documentrerankingbygeneralityinbiomedicalinformationretrieval
AT lixue documentrerankingbygeneralityinbiomedicalinformationretrieval
AT songdawei documentrerankingbygeneralityinbiomedicalinformationretrieval