Cargando…

Promoting ranking diversity for genomics search with relevance-novelty combined model

BACKGROUND: In the biomedical domain, the desired information of a question (query) asked by biologists usually is a list of a certain type of entities covering different aspects that are related to the question, such as genes, proteins, diseases, mutations, etc. Hence it is important for a biomedic...

Descripción completa

Detalles Bibliográficos
Autores principales: Yin, Xiaoshi, Li, Zhoujun, Huang, Jimmy Xiangji, Hu, Xiaohua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226257/
https://www.ncbi.nlm.nih.gov/pubmed/21989180
http://dx.doi.org/10.1186/1471-2105-12-S5-S8
_version_ 1782217587791233024
author Yin, Xiaoshi
Li, Zhoujun
Huang, Jimmy Xiangji
Hu, Xiaohua
author_facet Yin, Xiaoshi
Li, Zhoujun
Huang, Jimmy Xiangji
Hu, Xiaohua
author_sort Yin, Xiaoshi
collection PubMed
description BACKGROUND: In the biomedical domain, the desired information of a question (query) asked by biologists usually is a list of a certain type of entities covering different aspects that are related to the question, such as genes, proteins, diseases, mutations, etc. Hence it is important for a biomedical information retrieval system to be able to provide comprehensive and diverse answers to fulfill biologists’ information needs. However, traditional retrieval models assume that the relevance of a document is independent of the relevance of other documents. This assumption may result in high redundancy and low diversity in the retrieval ranked lists. RESULTS: In this paper, we propose a relevance-novelty combined model, named RelNov model, based on the framework of an undirected graphical model. It consists of two component models, namely the aspect-term relevance model and the aspect-term novelty model. They model the relevance of a document and the novelty of a document respectively. We show that our approach can achieve 16.4% improvement over the highest aspect level MAP reported in the TREC 2007 Genomics track, and 9.8% improvement over the highest passage level MAP reported in the TREC 2007 Genomics track. CONCLUSIONS: The proposed combination model which models aspects, terms, topic relevance and document novelty as potential functions is demonstrated to be effective in promoting ranking diversity as well as in improving relevance of ranked lists for genomics search. We also show that the use of aspect plays an important role in the model. Moreover, the proposed model can integrate various different relevance and novelty measures easily.
format Online
Article
Text
id pubmed-3226257
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32262572011-11-30 Promoting ranking diversity for genomics search with relevance-novelty combined model Yin, Xiaoshi Li, Zhoujun Huang, Jimmy Xiangji Hu, Xiaohua BMC Bioinformatics Proceedings BACKGROUND: In the biomedical domain, the desired information of a question (query) asked by biologists usually is a list of a certain type of entities covering different aspects that are related to the question, such as genes, proteins, diseases, mutations, etc. Hence it is important for a biomedical information retrieval system to be able to provide comprehensive and diverse answers to fulfill biologists’ information needs. However, traditional retrieval models assume that the relevance of a document is independent of the relevance of other documents. This assumption may result in high redundancy and low diversity in the retrieval ranked lists. RESULTS: In this paper, we propose a relevance-novelty combined model, named RelNov model, based on the framework of an undirected graphical model. It consists of two component models, namely the aspect-term relevance model and the aspect-term novelty model. They model the relevance of a document and the novelty of a document respectively. We show that our approach can achieve 16.4% improvement over the highest aspect level MAP reported in the TREC 2007 Genomics track, and 9.8% improvement over the highest passage level MAP reported in the TREC 2007 Genomics track. CONCLUSIONS: The proposed combination model which models aspects, terms, topic relevance and document novelty as potential functions is demonstrated to be effective in promoting ranking diversity as well as in improving relevance of ranked lists for genomics search. We also show that the use of aspect plays an important role in the model. Moreover, the proposed model can integrate various different relevance and novelty measures easily. BioMed Central 2011-07-27 /pmc/articles/PMC3226257/ /pubmed/21989180 http://dx.doi.org/10.1186/1471-2105-12-S5-S8 Text en Copyright ©2011 Yin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Yin, Xiaoshi
Li, Zhoujun
Huang, Jimmy Xiangji
Hu, Xiaohua
Promoting ranking diversity for genomics search with relevance-novelty combined model
title Promoting ranking diversity for genomics search with relevance-novelty combined model
title_full Promoting ranking diversity for genomics search with relevance-novelty combined model
title_fullStr Promoting ranking diversity for genomics search with relevance-novelty combined model
title_full_unstemmed Promoting ranking diversity for genomics search with relevance-novelty combined model
title_short Promoting ranking diversity for genomics search with relevance-novelty combined model
title_sort promoting ranking diversity for genomics search with relevance-novelty combined model
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226257/
https://www.ncbi.nlm.nih.gov/pubmed/21989180
http://dx.doi.org/10.1186/1471-2105-12-S5-S8
work_keys_str_mv AT yinxiaoshi promotingrankingdiversityforgenomicssearchwithrelevancenoveltycombinedmodel
AT lizhoujun promotingrankingdiversityforgenomicssearchwithrelevancenoveltycombinedmodel
AT huangjimmyxiangji promotingrankingdiversityforgenomicssearchwithrelevancenoveltycombinedmodel
AT huxiaohua promotingrankingdiversityforgenomicssearchwithrelevancenoveltycombinedmodel