Cargando…
An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search
BACKGROUND: Retrieving gene and disease information from a vast collection of biomedical abstracts to provide doctors with clinical decision support is one of the important research directions of Precision Medicine. METHOD: We propose a novel article retrieval method based on expanded word and co-wo...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7927407/ https://www.ncbi.nlm.nih.gov/pubmed/33653325 http://dx.doi.org/10.1186/s12911-021-01454-5 |
_version_ | 1783659662629928960 |
---|---|
author | Zhang, Zicheng |
author_facet | Zhang, Zicheng |
author_sort | Zhang, Zicheng |
collection | PubMed |
description | BACKGROUND: Retrieving gene and disease information from a vast collection of biomedical abstracts to provide doctors with clinical decision support is one of the important research directions of Precision Medicine. METHOD: We propose a novel article retrieval method based on expanded word and co-word analyses, also conducting Cuckoo Search to optimize parameters of the retrieval function. The main goal is to retrieve the abstracts of biomedical articles that refer to treatments. The methods mentioned in this manuscript adopt the BM25 algorithm to calculate the score of abstracts. We, however, propose an improved version of BM25 that computes the scores of expanded words and co-word leading to a composite retrieval function, which is then optimized using the Cuckoo Search. The proposed method aims to find both disease and gene information in the abstract of the same biomedical article. This is to achieve higher relevance and hence score of articles. Besides, we investigate the influence of different parameters on the retrieval algorithm and summarize how they meet various retrieval needs. RESULTS: The data used in this manuscript is sourced from medical articles presented in Text Retrieval Conference (TREC): Clinical Decision Support (CDS) Tracks of 2017, 2018, and 2019 in Precision Medicine. A total of 120 topics are tested. Three indicators are employed for the comparison of utilized methods, which are selected among the ones based only on the BM25 algorithm and its improved version to conduct comparable experiments. The results showed that the proposed algorithm achieves better results. CONCLUSION: The proposed method, an improved version of the BM25 algorithm, utilizes both co-word implementation and Cuckoo Search, which has been verified achieving better results on a large number of experimental sets. Besides, a relatively simple query expansion method is implemented in this manuscript. Future research will focus on ontology and semantic networks to expand the query vocabulary. |
format | Online Article Text |
id | pubmed-7927407 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-79274072021-03-03 An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search Zhang, Zicheng BMC Med Inform Decis Mak Research Article BACKGROUND: Retrieving gene and disease information from a vast collection of biomedical abstracts to provide doctors with clinical decision support is one of the important research directions of Precision Medicine. METHOD: We propose a novel article retrieval method based on expanded word and co-word analyses, also conducting Cuckoo Search to optimize parameters of the retrieval function. The main goal is to retrieve the abstracts of biomedical articles that refer to treatments. The methods mentioned in this manuscript adopt the BM25 algorithm to calculate the score of abstracts. We, however, propose an improved version of BM25 that computes the scores of expanded words and co-word leading to a composite retrieval function, which is then optimized using the Cuckoo Search. The proposed method aims to find both disease and gene information in the abstract of the same biomedical article. This is to achieve higher relevance and hence score of articles. Besides, we investigate the influence of different parameters on the retrieval algorithm and summarize how they meet various retrieval needs. RESULTS: The data used in this manuscript is sourced from medical articles presented in Text Retrieval Conference (TREC): Clinical Decision Support (CDS) Tracks of 2017, 2018, and 2019 in Precision Medicine. A total of 120 topics are tested. Three indicators are employed for the comparison of utilized methods, which are selected among the ones based only on the BM25 algorithm and its improved version to conduct comparable experiments. The results showed that the proposed algorithm achieves better results. CONCLUSION: The proposed method, an improved version of the BM25 algorithm, utilizes both co-word implementation and Cuckoo Search, which has been verified achieving better results on a large number of experimental sets. Besides, a relatively simple query expansion method is implemented in this manuscript. Future research will focus on ontology and semantic networks to expand the query vocabulary. BioMed Central 2021-03-02 /pmc/articles/PMC7927407/ /pubmed/33653325 http://dx.doi.org/10.1186/s12911-021-01454-5 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Zhang, Zicheng An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search |
title | An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search |
title_full | An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search |
title_fullStr | An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search |
title_full_unstemmed | An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search |
title_short | An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search |
title_sort | improved bm25 algorithm for clinical decision support in precision medicine based on co-word analysis and cuckoo search |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7927407/ https://www.ncbi.nlm.nih.gov/pubmed/33653325 http://dx.doi.org/10.1186/s12911-021-01454-5 |
work_keys_str_mv | AT zhangzicheng animprovedbm25algorithmforclinicaldecisionsupportinprecisionmedicinebasedoncowordanalysisandcuckoosearch AT zhangzicheng improvedbm25algorithmforclinicaldecisionsupportinprecisionmedicinebasedoncowordanalysisandcuckoosearch |