Cargando…

Mining Health-Related Issues in Consumer Product Reviews by Using Scalable Text Analytics

In an era when most of our life activities are digitized and recorded, opportunities abound to gain insights about population health. Online product reviews present a unique data source that is currently underexplored. Health-related information, although scarce, can be systematically mined in onlin...

Descripción completa

Detalles Bibliográficos
Autores principales: Torii, Manabu, Tilak, Sameer S., Doan, Son, Zisook, Daniel S., Fan, Jung-wei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4915789/
https://www.ncbi.nlm.nih.gov/pubmed/27375358
http://dx.doi.org/10.4137/BII.S37791
Descripción
Sumario:In an era when most of our life activities are digitized and recorded, opportunities abound to gain insights about population health. Online product reviews present a unique data source that is currently underexplored. Health-related information, although scarce, can be systematically mined in online product reviews. Leveraging natural language processing and machine learning tools, we were able to mine 1.3 million grocery product reviews for health-related information. The objectives of the study were as follows: (1) conduct quantitative and qualitative analysis on the types of health issues found in consumer product reviews; (2) develop a machine learning classifier to detect reviews that contain health-related issues; and (3) gain insights about the task characteristics and challenges for text analytics to guide future research.