Cargando…
Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents
Patients often search for information on the web about treatments and diseases after they are discharged from the hospital. However, searching for medical information on the web poses challenges due to related terms and synonymy for the same disease and treatment. In this paper, we present a method...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Medical Informatics Association
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4525234/ https://www.ncbi.nlm.nih.gov/pubmed/26306280 |
_version_ | 1782384297613721600 |
---|---|
author | Caballero, Karla Akella, Ram |
author_facet | Caballero, Karla Akella, Ram |
author_sort | Caballero, Karla |
collection | PubMed |
description | Patients often search for information on the web about treatments and diseases after they are discharged from the hospital. However, searching for medical information on the web poses challenges due to related terms and synonymy for the same disease and treatment. In this paper, we present a method that combines Statistical Topics Models, Language Models and Natural Language Processing to retrieve healthcare related documents. In addition, we test if the incorporation of terms extracted from the patient’s discharge summary improves the retrieval performance. We show that the proposed framework outperformed the winner of the retrieval CLEF eHealth 2013 challenge by 68% in the MAP measure (0:5226 vs 0:3108), and by 13% in NDCG (0:5202 vs 0:3637). Compared with standard language models, we obtain an improvement of 92% in MAP (0:2666) and 45% in NDCG. (0:3637) |
format | Online Article Text |
id | pubmed-4525234 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | American Medical Informatics Association |
record_format | MEDLINE/PubMed |
spelling | pubmed-45252342015-08-24 Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents Caballero, Karla Akella, Ram AMIA Jt Summits Transl Sci Proc Articles Patients often search for information on the web about treatments and diseases after they are discharged from the hospital. However, searching for medical information on the web poses challenges due to related terms and synonymy for the same disease and treatment. In this paper, we present a method that combines Statistical Topics Models, Language Models and Natural Language Processing to retrieve healthcare related documents. In addition, we test if the incorporation of terms extracted from the patient’s discharge summary improves the retrieval performance. We show that the proposed framework outperformed the winner of the retrieval CLEF eHealth 2013 challenge by 68% in the MAP measure (0:5226 vs 0:3108), and by 13% in NDCG (0:5202 vs 0:3637). Compared with standard language models, we obtain an improvement of 92% in MAP (0:2666) and 45% in NDCG. (0:3637) American Medical Informatics Association 2015-03-25 /pmc/articles/PMC4525234/ /pubmed/26306280 Text en ©2015 AMIA - All rights reserved. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose |
spellingShingle | Articles Caballero, Karla Akella, Ram Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents |
title | Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents |
title_full | Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents |
title_fullStr | Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents |
title_full_unstemmed | Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents |
title_short | Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents |
title_sort | incorporating statistical topic models in the retrieval of healthcare documents |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4525234/ https://www.ncbi.nlm.nih.gov/pubmed/26306280 |
work_keys_str_mv | AT caballerokarla incorporatingstatisticaltopicmodelsintheretrievalofhealthcaredocuments AT akellaram incorporatingstatisticaltopicmodelsintheretrievalofhealthcaredocuments |