Cargando…

An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums

Online healthcare forums (OHFs) have become increasingly popular for patients to share their health-related experiences. The healthcare-related texts posted in OHFs could help doctors and patients better understand specific diseases and the situations of other patients. To extract the meaning of a p...

Descripción completa

Detalles Bibliográficos
Autores principales: Gao, Jun, Liu, Ninghao, Lawley, Mark, Hu, Xia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5559930/
https://www.ncbi.nlm.nih.gov/pubmed/29065580
http://dx.doi.org/10.1155/2017/2460174
_version_ 1783257603980132352
author Gao, Jun
Liu, Ninghao
Lawley, Mark
Hu, Xia
author_facet Gao, Jun
Liu, Ninghao
Lawley, Mark
Hu, Xia
author_sort Gao, Jun
collection PubMed
description Online healthcare forums (OHFs) have become increasingly popular for patients to share their health-related experiences. The healthcare-related texts posted in OHFs could help doctors and patients better understand specific diseases and the situations of other patients. To extract the meaning of a post, a commonly used way is to classify the sentences into several predefined categories of different semantics. However, the unstructured form of online posts brings challenges to existing classification algorithms. In addition, though many sophisticated classification models such as deep neural networks may have good predictive power, it is hard to interpret the models and the prediction results, which is, however, critical in healthcare applications. To tackle the challenges above, we propose an effective and interpretable OHF post classification framework. Specifically, we classify sentences into three classes: medication, symptom, and background. Each sentence is projected into an interpretable feature space consisting of labeled sequential patterns, UMLS semantic types, and other heuristic features. A forest-based model is developed for categorizing OHF posts. An interpretation method is also developed, where the decision rules can be explicitly extracted to gain an insight of useful information in texts. Experimental results on real-world OHF data demonstrate the effectiveness of our proposed computational framework.
format Online
Article
Text
id pubmed-5559930
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-55599302017-08-24 An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums Gao, Jun Liu, Ninghao Lawley, Mark Hu, Xia J Healthc Eng Research Article Online healthcare forums (OHFs) have become increasingly popular for patients to share their health-related experiences. The healthcare-related texts posted in OHFs could help doctors and patients better understand specific diseases and the situations of other patients. To extract the meaning of a post, a commonly used way is to classify the sentences into several predefined categories of different semantics. However, the unstructured form of online posts brings challenges to existing classification algorithms. In addition, though many sophisticated classification models such as deep neural networks may have good predictive power, it is hard to interpret the models and the prediction results, which is, however, critical in healthcare applications. To tackle the challenges above, we propose an effective and interpretable OHF post classification framework. Specifically, we classify sentences into three classes: medication, symptom, and background. Each sentence is projected into an interpretable feature space consisting of labeled sequential patterns, UMLS semantic types, and other heuristic features. A forest-based model is developed for categorizing OHF posts. An interpretation method is also developed, where the decision rules can be explicitly extracted to gain an insight of useful information in texts. Experimental results on real-world OHF data demonstrate the effectiveness of our proposed computational framework. Hindawi 2017 2017-08-03 /pmc/articles/PMC5559930/ /pubmed/29065580 http://dx.doi.org/10.1155/2017/2460174 Text en Copyright © 2017 Jun Gao et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Gao, Jun
Liu, Ninghao
Lawley, Mark
Hu, Xia
An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums
title An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums
title_full An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums
title_fullStr An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums
title_full_unstemmed An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums
title_short An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums
title_sort interpretable classification framework for information extraction from online healthcare forums
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5559930/
https://www.ncbi.nlm.nih.gov/pubmed/29065580
http://dx.doi.org/10.1155/2017/2460174
work_keys_str_mv AT gaojun aninterpretableclassificationframeworkforinformationextractionfromonlinehealthcareforums
AT liuninghao aninterpretableclassificationframeworkforinformationextractionfromonlinehealthcareforums
AT lawleymark aninterpretableclassificationframeworkforinformationextractionfromonlinehealthcareforums
AT huxia aninterpretableclassificationframeworkforinformationextractionfromonlinehealthcareforums
AT gaojun interpretableclassificationframeworkforinformationextractionfromonlinehealthcareforums
AT liuninghao interpretableclassificationframeworkforinformationextractionfromonlinehealthcareforums
AT lawleymark interpretableclassificationframeworkforinformationextractionfromonlinehealthcareforums
AT huxia interpretableclassificationframeworkforinformationextractionfromonlinehealthcareforums