Cargando…
Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics
In an era when most of our life activities are digitized and recorded, opportunities abound to gain insights about population health. Online product reviews present a unique data source that is currently underexplored. Health-related information, although scarce, can be systematically mined in onlin...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915789/ https://www.ncbi.nlm.nih.gov/pubmed/27375358 http://dx.doi.org/10.4137/BII.S37791 |
_version_ | 1782438739382894592 |
---|---|
author | Torii, Manabu Tilak, Sameer S. Doan, Son Zisook, Daniel S. Fan, Jung-wei |
author_facet | Torii, Manabu Tilak, Sameer S. Doan, Son Zisook, Daniel S. Fan, Jung-wei |
author_sort | Torii, Manabu |
collection | PubMed |
description | In an era when most of our life activities are digitized and recorded, opportunities abound to gain insights about population health. Online product reviews present a unique data source that is currently underexplored. Health-related information, although scarce, can be systematically mined in online product reviews. Leveraging natural language processing and machine learning tools, we were able to mine 1.3 million grocery product reviews for health-related information. The objectives of the study were as follows: (1) conduct quantitative and qualitative analysis on the types of health issues found in consumer product reviews; (2) develop a machine learning classifier to detect reviews that contain health-related issues; and (3) gain insights about the task characteristics and challenges for text analytics to guide future research. |
format | Online Article Text |
id | pubmed-4915789 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-49157892016-07-01 Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics Torii, Manabu Tilak, Sameer S. Doan, Son Zisook, Daniel S. Fan, Jung-wei Biomed Inform Insights Original Research In an era when most of our life activities are digitized and recorded, opportunities abound to gain insights about population health. Online product reviews present a unique data source that is currently underexplored. Health-related information, although scarce, can be systematically mined in online product reviews. Leveraging natural language processing and machine learning tools, we were able to mine 1.3 million grocery product reviews for health-related information. The objectives of the study were as follows: (1) conduct quantitative and qualitative analysis on the types of health issues found in consumer product reviews; (2) develop a machine learning classifier to detect reviews that contain health-related issues; and (3) gain insights about the task characteristics and challenges for text analytics to guide future research. Libertas Academica 2016-06-20 /pmc/articles/PMC4915789/ /pubmed/27375358 http://dx.doi.org/10.4137/BII.S37791 Text en © 2016 the author(s), publisher and licensee Libertas Academica Ltd. This is an open-access article distributed under the terms of the Creative Commons CC-BY-NC 3.0 License. |
spellingShingle | Original Research Torii, Manabu Tilak, Sameer S. Doan, Son Zisook, Daniel S. Fan, Jung-wei Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics |
title | Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics |
title_full | Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics |
title_fullStr | Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics |
title_full_unstemmed | Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics |
title_short | Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics |
title_sort | mining health-related issues in consumer product reviews by using scalable text analytics |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915789/ https://www.ncbi.nlm.nih.gov/pubmed/27375358 http://dx.doi.org/10.4137/BII.S37791 |
work_keys_str_mv | AT toriimanabu mininghealthrelatedissuesinconsumerproductreviewsbyusingscalabletextanalytics AT tilaksameers mininghealthrelatedissuesinconsumerproductreviewsbyusingscalabletextanalytics AT doanson mininghealthrelatedissuesinconsumerproductreviewsbyusingscalabletextanalytics AT zisookdaniels mininghealthrelatedissuesinconsumerproductreviewsbyusingscalabletextanalytics AT fanjungwei mininghealthrelatedissuesinconsumerproductreviewsbyusingscalabletextanalytics |