Cargando…

An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search

BACKGROUND: Retrieving gene and disease information from a vast collection of biomedical abstracts to provide doctors with clinical decision support is one of the important research directions of Precision Medicine. METHOD: We propose a novel article retrieval method based on expanded word and co-wo...

Descripción completa

Detalles Bibliográficos
Autor principal: Zhang, Zicheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7927407/
https://www.ncbi.nlm.nih.gov/pubmed/33653325
http://dx.doi.org/10.1186/s12911-021-01454-5
_version_ 1783659662629928960
author Zhang, Zicheng
author_facet Zhang, Zicheng
author_sort Zhang, Zicheng
collection PubMed
description BACKGROUND: Retrieving gene and disease information from a vast collection of biomedical abstracts to provide doctors with clinical decision support is one of the important research directions of Precision Medicine. METHOD: We propose a novel article retrieval method based on expanded word and co-word analyses, also conducting Cuckoo Search to optimize parameters of the retrieval function. The main goal is to retrieve the abstracts of biomedical articles that refer to treatments. The methods mentioned in this manuscript adopt the BM25 algorithm to calculate the score of abstracts. We, however, propose an improved version of BM25 that computes the scores of expanded words and co-word leading to a composite retrieval function, which is then optimized using the Cuckoo Search. The proposed method aims to find both disease and gene information in the abstract of the same biomedical article. This is to achieve higher relevance and hence score of articles. Besides, we investigate the influence of different parameters on the retrieval algorithm and summarize how they meet various retrieval needs. RESULTS: The data used in this manuscript is sourced from medical articles presented in Text Retrieval Conference (TREC): Clinical Decision Support (CDS) Tracks of 2017, 2018, and 2019 in Precision Medicine. A total of 120 topics are tested. Three indicators are employed for the comparison of utilized methods, which are selected among the ones based only on the BM25 algorithm and its improved version to conduct comparable experiments. The results showed that the proposed algorithm achieves better results. CONCLUSION: The proposed method, an improved version of the BM25 algorithm, utilizes both co-word implementation and Cuckoo Search, which has been verified achieving better results on a large number of experimental sets. Besides, a relatively simple query expansion method is implemented in this manuscript. Future research will focus on ontology and semantic networks to expand the query vocabulary.
format Online
Article
Text
id pubmed-7927407
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-79274072021-03-03 An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search Zhang, Zicheng BMC Med Inform Decis Mak Research Article BACKGROUND: Retrieving gene and disease information from a vast collection of biomedical abstracts to provide doctors with clinical decision support is one of the important research directions of Precision Medicine. METHOD: We propose a novel article retrieval method based on expanded word and co-word analyses, also conducting Cuckoo Search to optimize parameters of the retrieval function. The main goal is to retrieve the abstracts of biomedical articles that refer to treatments. The methods mentioned in this manuscript adopt the BM25 algorithm to calculate the score of abstracts. We, however, propose an improved version of BM25 that computes the scores of expanded words and co-word leading to a composite retrieval function, which is then optimized using the Cuckoo Search. The proposed method aims to find both disease and gene information in the abstract of the same biomedical article. This is to achieve higher relevance and hence score of articles. Besides, we investigate the influence of different parameters on the retrieval algorithm and summarize how they meet various retrieval needs. RESULTS: The data used in this manuscript is sourced from medical articles presented in Text Retrieval Conference (TREC): Clinical Decision Support (CDS) Tracks of 2017, 2018, and 2019 in Precision Medicine. A total of 120 topics are tested. Three indicators are employed for the comparison of utilized methods, which are selected among the ones based only on the BM25 algorithm and its improved version to conduct comparable experiments. The results showed that the proposed algorithm achieves better results. CONCLUSION: The proposed method, an improved version of the BM25 algorithm, utilizes both co-word implementation and Cuckoo Search, which has been verified achieving better results on a large number of experimental sets. Besides, a relatively simple query expansion method is implemented in this manuscript. Future research will focus on ontology and semantic networks to expand the query vocabulary. BioMed Central 2021-03-02 /pmc/articles/PMC7927407/ /pubmed/33653325 http://dx.doi.org/10.1186/s12911-021-01454-5 Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Zhang, Zicheng
An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search
title An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search
title_full An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search
title_fullStr An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search
title_full_unstemmed An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search
title_short An improved BM25 algorithm for clinical decision support in Precision Medicine based on co-word analysis and Cuckoo Search
title_sort improved bm25 algorithm for clinical decision support in precision medicine based on co-word analysis and cuckoo search
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7927407/
https://www.ncbi.nlm.nih.gov/pubmed/33653325
http://dx.doi.org/10.1186/s12911-021-01454-5
work_keys_str_mv AT zhangzicheng animprovedbm25algorithmforclinicaldecisionsupportinprecisionmedicinebasedoncowordanalysisandcuckoosearch
AT zhangzicheng improvedbm25algorithmforclinicaldecisionsupportinprecisionmedicinebasedoncowordanalysisandcuckoosearch