Cargando…
SimQ: Real-Time Retrieval of Similar Consumer Health Questions
BACKGROUND: There has been a significant increase in the popularity of Web-based question-and-answer (Q&A) services that provide health care information for consumers. Large amounts of Q&As have been archived in these online communities, which form a valuable knowledge base for consumers who...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications Inc.
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4376128/ https://www.ncbi.nlm.nih.gov/pubmed/25689608 http://dx.doi.org/10.2196/jmir.3388 |
_version_ | 1782363687219101696 |
---|---|
author | Luo, Jake Zhang, Guo-Qiang Wentz, Susan Cui, Licong Xu, Rong |
author_facet | Luo, Jake Zhang, Guo-Qiang Wentz, Susan Cui, Licong Xu, Rong |
author_sort | Luo, Jake |
collection | PubMed |
description | BACKGROUND: There has been a significant increase in the popularity of Web-based question-and-answer (Q&A) services that provide health care information for consumers. Large amounts of Q&As have been archived in these online communities, which form a valuable knowledge base for consumers who seek answers to their health care concerns. However, due to consumers’ possible lack of professional knowledge, it is still very challenging for them to find Q&As that are closely relevant to their own health problems. Consumers often repeatedly ask similar questions that have already been answered previously by other users. OBJECTIVE: In this study, we aim to develop efficient informatics methods that can retrieve similar Web-based consumer health questions using syntactic and semantic analysis. METHODS: We propose the “SimQ” to achieve this objective. SimQ is an informatics framework that compares the similarity of archived health questions and retrieves answers to satisfy consumers’ information needs. Statistical syntactic parsing was used to analyze each question’s syntactic structure. Standardized Unified Medical Language System (UMLS) was employed to annotate semantic types and extract medical concepts. Finally, the similarity between sentences was calculated using both semantic and syntactic features. RESULTS: We used 2000 randomly selected consumer questions to evaluate the system’s performance. The results show that SimQ reached the highest precision of 72.2%, recall of 78.0%, and F-score of 75.0% when using compositional feature representations. CONCLUSIONS: We demonstrated that SimQ complements the existing Q&A services of Netwellness, a not-for-profit community-based consumer health information service that consists of nearly 70,000 Q&As and serves over 3 million users each year. SimQ not only reduces response delay by instantly providing closely related questions and answers, but also helps consumers to improve the understanding of their health concerns. |
format | Online Article Text |
id | pubmed-4376128 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | JMIR Publications Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-43761282015-04-02 SimQ: Real-Time Retrieval of Similar Consumer Health Questions Luo, Jake Zhang, Guo-Qiang Wentz, Susan Cui, Licong Xu, Rong J Med Internet Res Original Paper BACKGROUND: There has been a significant increase in the popularity of Web-based question-and-answer (Q&A) services that provide health care information for consumers. Large amounts of Q&As have been archived in these online communities, which form a valuable knowledge base for consumers who seek answers to their health care concerns. However, due to consumers’ possible lack of professional knowledge, it is still very challenging for them to find Q&As that are closely relevant to their own health problems. Consumers often repeatedly ask similar questions that have already been answered previously by other users. OBJECTIVE: In this study, we aim to develop efficient informatics methods that can retrieve similar Web-based consumer health questions using syntactic and semantic analysis. METHODS: We propose the “SimQ” to achieve this objective. SimQ is an informatics framework that compares the similarity of archived health questions and retrieves answers to satisfy consumers’ information needs. Statistical syntactic parsing was used to analyze each question’s syntactic structure. Standardized Unified Medical Language System (UMLS) was employed to annotate semantic types and extract medical concepts. Finally, the similarity between sentences was calculated using both semantic and syntactic features. RESULTS: We used 2000 randomly selected consumer questions to evaluate the system’s performance. The results show that SimQ reached the highest precision of 72.2%, recall of 78.0%, and F-score of 75.0% when using compositional feature representations. CONCLUSIONS: We demonstrated that SimQ complements the existing Q&A services of Netwellness, a not-for-profit community-based consumer health information service that consists of nearly 70,000 Q&As and serves over 3 million users each year. SimQ not only reduces response delay by instantly providing closely related questions and answers, but also helps consumers to improve the understanding of their health concerns. JMIR Publications Inc. 2015-02-17 /pmc/articles/PMC4376128/ /pubmed/25689608 http://dx.doi.org/10.2196/jmir.3388 Text en ©Jake Luo, Guo-Qiang Zhang, Susan Wentz, Licong Cui, Rong Xu. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 17.02.2015. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Luo, Jake Zhang, Guo-Qiang Wentz, Susan Cui, Licong Xu, Rong SimQ: Real-Time Retrieval of Similar Consumer Health Questions |
title | SimQ: Real-Time Retrieval of Similar Consumer Health Questions |
title_full | SimQ: Real-Time Retrieval of Similar Consumer Health Questions |
title_fullStr | SimQ: Real-Time Retrieval of Similar Consumer Health Questions |
title_full_unstemmed | SimQ: Real-Time Retrieval of Similar Consumer Health Questions |
title_short | SimQ: Real-Time Retrieval of Similar Consumer Health Questions |
title_sort | simq: real-time retrieval of similar consumer health questions |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4376128/ https://www.ncbi.nlm.nih.gov/pubmed/25689608 http://dx.doi.org/10.2196/jmir.3388 |
work_keys_str_mv | AT luojake simqrealtimeretrievalofsimilarconsumerhealthquestions AT zhangguoqiang simqrealtimeretrievalofsimilarconsumerhealthquestions AT wentzsusan simqrealtimeretrievalofsimilarconsumerhealthquestions AT cuilicong simqrealtimeretrievalofsimilarconsumerhealthquestions AT xurong simqrealtimeretrievalofsimilarconsumerhealthquestions |