Cargando…

Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents

Patients often search for information on the web about treatments and diseases after they are discharged from the hospital. However, searching for medical information on the web poses challenges due to related terms and synonymy for the same disease and treatment. In this paper, we present a method...

Descripción completa

Detalles Bibliográficos
Autores principales: Caballero, Karla, Akella, Ram
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Medical Informatics Association 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4525234/
https://www.ncbi.nlm.nih.gov/pubmed/26306280
_version_ 1782384297613721600
author Caballero, Karla
Akella, Ram
author_facet Caballero, Karla
Akella, Ram
author_sort Caballero, Karla
collection PubMed
description Patients often search for information on the web about treatments and diseases after they are discharged from the hospital. However, searching for medical information on the web poses challenges due to related terms and synonymy for the same disease and treatment. In this paper, we present a method that combines Statistical Topics Models, Language Models and Natural Language Processing to retrieve healthcare related documents. In addition, we test if the incorporation of terms extracted from the patient’s discharge summary improves the retrieval performance. We show that the proposed framework outperformed the winner of the retrieval CLEF eHealth 2013 challenge by 68% in the MAP measure (0:5226 vs 0:3108), and by 13% in NDCG (0:5202 vs 0:3637). Compared with standard language models, we obtain an improvement of 92% in MAP (0:2666) and 45% in NDCG. (0:3637)
format Online
Article
Text
id pubmed-4525234
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher American Medical Informatics Association
record_format MEDLINE/PubMed
spelling pubmed-45252342015-08-24 Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents Caballero, Karla Akella, Ram AMIA Jt Summits Transl Sci Proc Articles Patients often search for information on the web about treatments and diseases after they are discharged from the hospital. However, searching for medical information on the web poses challenges due to related terms and synonymy for the same disease and treatment. In this paper, we present a method that combines Statistical Topics Models, Language Models and Natural Language Processing to retrieve healthcare related documents. In addition, we test if the incorporation of terms extracted from the patient’s discharge summary improves the retrieval performance. We show that the proposed framework outperformed the winner of the retrieval CLEF eHealth 2013 challenge by 68% in the MAP measure (0:5226 vs 0:3108), and by 13% in NDCG (0:5202 vs 0:3637). Compared with standard language models, we obtain an improvement of 92% in MAP (0:2666) and 45% in NDCG. (0:3637) American Medical Informatics Association 2015-03-25 /pmc/articles/PMC4525234/ /pubmed/26306280 Text en ©2015 AMIA - All rights reserved. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose
spellingShingle Articles
Caballero, Karla
Akella, Ram
Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents
title Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents
title_full Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents
title_fullStr Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents
title_full_unstemmed Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents
title_short Incorporating Statistical Topic Models in the Retrieval of Healthcare Documents
title_sort incorporating statistical topic models in the retrieval of healthcare documents
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4525234/
https://www.ncbi.nlm.nih.gov/pubmed/26306280
work_keys_str_mv AT caballerokarla incorporatingstatisticaltopicmodelsintheretrievalofhealthcaredocuments
AT akellaram incorporatingstatisticaltopicmodelsintheretrievalofhealthcaredocuments