Cargando…
Improved biomedical term selection in pseudo relevance feedback
Biomedical information retrieval systems are becoming popular and complex due to massive amount of ever-growing biomedical literature. Users are unable to construct a precise and accurate query that represents the intended information in a clear manner. Therefore, query is expanded with the terms or...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030818/ https://www.ncbi.nlm.nih.gov/pubmed/29982558 http://dx.doi.org/10.1093/database/bay056 |
_version_ | 1783337201089642496 |
---|---|
author | Nabeel Asim, Muhammad Wasim, Muhammad Usman Ghani Khan, Muhammad Mahmood, Waqar |
author_facet | Nabeel Asim, Muhammad Wasim, Muhammad Usman Ghani Khan, Muhammad Mahmood, Waqar |
author_sort | Nabeel Asim, Muhammad |
collection | PubMed |
description | Biomedical information retrieval systems are becoming popular and complex due to massive amount of ever-growing biomedical literature. Users are unable to construct a precise and accurate query that represents the intended information in a clear manner. Therefore, query is expanded with the terms or features that retrieve more relevant information. Selection of appropriate expansion terms plays key role to improve the performance of retrieval task. We propose document frequency chi-square, a newer version of chi-square in pseudo relevance feedback for term selection. The effects of pre-processing on the performance of information retrieval specifically in biomedical domain are also depicted. On average, the proposed algorithm outperformed state-of-the-art term selection algorithms by 88% at pre-defined test points. Our experiments also conclude that, stemming cause a decrease in overall performance of the pseudo relevance feedback based information retrieval system particularly in biomedical domain. Database URL: http://biodb.sdau.edu.cn/gan/ |
format | Online Article Text |
id | pubmed-6030818 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-60308182018-07-10 Improved biomedical term selection in pseudo relevance feedback Nabeel Asim, Muhammad Wasim, Muhammad Usman Ghani Khan, Muhammad Mahmood, Waqar Database (Oxford) Original Article Biomedical information retrieval systems are becoming popular and complex due to massive amount of ever-growing biomedical literature. Users are unable to construct a precise and accurate query that represents the intended information in a clear manner. Therefore, query is expanded with the terms or features that retrieve more relevant information. Selection of appropriate expansion terms plays key role to improve the performance of retrieval task. We propose document frequency chi-square, a newer version of chi-square in pseudo relevance feedback for term selection. The effects of pre-processing on the performance of information retrieval specifically in biomedical domain are also depicted. On average, the proposed algorithm outperformed state-of-the-art term selection algorithms by 88% at pre-defined test points. Our experiments also conclude that, stemming cause a decrease in overall performance of the pseudo relevance feedback based information retrieval system particularly in biomedical domain. Database URL: http://biodb.sdau.edu.cn/gan/ Oxford University Press 2018-07-02 /pmc/articles/PMC6030818/ /pubmed/29982558 http://dx.doi.org/10.1093/database/bay056 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Nabeel Asim, Muhammad Wasim, Muhammad Usman Ghani Khan, Muhammad Mahmood, Waqar Improved biomedical term selection in pseudo relevance feedback |
title | Improved biomedical term selection in pseudo relevance feedback |
title_full | Improved biomedical term selection in pseudo relevance feedback |
title_fullStr | Improved biomedical term selection in pseudo relevance feedback |
title_full_unstemmed | Improved biomedical term selection in pseudo relevance feedback |
title_short | Improved biomedical term selection in pseudo relevance feedback |
title_sort | improved biomedical term selection in pseudo relevance feedback |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6030818/ https://www.ncbi.nlm.nih.gov/pubmed/29982558 http://dx.doi.org/10.1093/database/bay056 |
work_keys_str_mv | AT nabeelasimmuhammad improvedbiomedicaltermselectioninpseudorelevancefeedback AT wasimmuhammad improvedbiomedicaltermselectioninpseudorelevancefeedback AT usmanghanikhanmuhammad improvedbiomedicaltermselectioninpseudorelevancefeedback AT mahmoodwaqar improvedbiomedicaltermselectioninpseudorelevancefeedback |