Cargando…

Improved biomedical term selection in pseudo relevance feedback

Biomedical information retrieval systems are becoming popular and complex due to massive amount of ever-growing biomedical literature. Users are unable to construct a precise and accurate query that represents the intended information in a clear manner. Therefore, query is expanded with the terms or...

Descripción completa

Detalles Bibliográficos
Autores principales: Nabeel Asim, Muhammad, Wasim, Muhammad, Usman Ghani Khan, Muhammad, Mahmood, Waqar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030818/
https://www.ncbi.nlm.nih.gov/pubmed/29982558
http://dx.doi.org/10.1093/database/bay056
_version_ 1783337201089642496
author Nabeel Asim, Muhammad
Wasim, Muhammad
Usman Ghani Khan, Muhammad
Mahmood, Waqar
author_facet Nabeel Asim, Muhammad
Wasim, Muhammad
Usman Ghani Khan, Muhammad
Mahmood, Waqar
author_sort Nabeel Asim, Muhammad
collection PubMed
description Biomedical information retrieval systems are becoming popular and complex due to massive amount of ever-growing biomedical literature. Users are unable to construct a precise and accurate query that represents the intended information in a clear manner. Therefore, query is expanded with the terms or features that retrieve more relevant information. Selection of appropriate expansion terms plays key role to improve the performance of retrieval task. We propose document frequency chi-square, a newer version of chi-square in pseudo relevance feedback for term selection. The effects of pre-processing on the performance of information retrieval specifically in biomedical domain are also depicted. On average, the proposed algorithm outperformed state-of-the-art term selection algorithms by 88% at pre-defined test points. Our experiments also conclude that, stemming cause a decrease in overall performance of the pseudo relevance feedback based information retrieval system particularly in biomedical domain. Database URL: http://biodb.sdau.edu.cn/gan/
format Online
Article
Text
id pubmed-6030818
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-60308182018-07-10 Improved biomedical term selection in pseudo relevance feedback Nabeel Asim, Muhammad Wasim, Muhammad Usman Ghani Khan, Muhammad Mahmood, Waqar Database (Oxford) Original Article Biomedical information retrieval systems are becoming popular and complex due to massive amount of ever-growing biomedical literature. Users are unable to construct a precise and accurate query that represents the intended information in a clear manner. Therefore, query is expanded with the terms or features that retrieve more relevant information. Selection of appropriate expansion terms plays key role to improve the performance of retrieval task. We propose document frequency chi-square, a newer version of chi-square in pseudo relevance feedback for term selection. The effects of pre-processing on the performance of information retrieval specifically in biomedical domain are also depicted. On average, the proposed algorithm outperformed state-of-the-art term selection algorithms by 88% at pre-defined test points. Our experiments also conclude that, stemming cause a decrease in overall performance of the pseudo relevance feedback based information retrieval system particularly in biomedical domain. Database URL: http://biodb.sdau.edu.cn/gan/ Oxford University Press 2018-07-02 /pmc/articles/PMC6030818/ /pubmed/29982558 http://dx.doi.org/10.1093/database/bay056 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Nabeel Asim, Muhammad
Wasim, Muhammad
Usman Ghani Khan, Muhammad
Mahmood, Waqar
Improved biomedical term selection in pseudo relevance feedback
title Improved biomedical term selection in pseudo relevance feedback
title_full Improved biomedical term selection in pseudo relevance feedback
title_fullStr Improved biomedical term selection in pseudo relevance feedback
title_full_unstemmed Improved biomedical term selection in pseudo relevance feedback
title_short Improved biomedical term selection in pseudo relevance feedback
title_sort improved biomedical term selection in pseudo relevance feedback
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030818/
https://www.ncbi.nlm.nih.gov/pubmed/29982558
http://dx.doi.org/10.1093/database/bay056
work_keys_str_mv AT nabeelasimmuhammad improvedbiomedicaltermselectioninpseudorelevancefeedback
AT wasimmuhammad improvedbiomedicaltermselectioninpseudorelevancefeedback
AT usmanghanikhanmuhammad improvedbiomedicaltermselectioninpseudorelevancefeedback
AT mahmoodwaqar improvedbiomedicaltermselectioninpseudorelevancefeedback