Cargando…

Automatic Identification of Web-Based Risk Markers for Health Events

BACKGROUND: The escalating cost of global health care is driving the development of new technologies to identify early indicators of an individual’s risk of disease. Traditionally, epidemiologists have identified such risk factors using medical databases and lengthy clinical studies but these are of...

Descripción completa

Detalles Bibliográficos
Autores principales: Yom-Tov, Elad, Borsa, Diana, Hayward, Andrew C, McKendry, Rachel A, Cox, Ingemar J
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4327439/
https://www.ncbi.nlm.nih.gov/pubmed/25626480
http://dx.doi.org/10.2196/jmir.4082
_version_ 1782357079548231680
author Yom-Tov, Elad
Borsa, Diana
Hayward, Andrew C
McKendry, Rachel A
Cox, Ingemar J
author_facet Yom-Tov, Elad
Borsa, Diana
Hayward, Andrew C
McKendry, Rachel A
Cox, Ingemar J
author_sort Yom-Tov, Elad
collection PubMed
description BACKGROUND: The escalating cost of global health care is driving the development of new technologies to identify early indicators of an individual’s risk of disease. Traditionally, epidemiologists have identified such risk factors using medical databases and lengthy clinical studies but these are often limited in size and cost and can fail to take full account of diseases where there are social stigmas or to identify transient acute risk factors. OBJECTIVE: Here we report that Web search engine queries coupled with information on Wikipedia access patterns can be used to infer health events associated with an individual user and automatically generate Web-based risk markers for some of the common medical conditions worldwide, from cardiovascular disease to sexually transmitted infections and mental health conditions, as well as pregnancy. METHODS: Using anonymized datasets, we present methods to first distinguish individuals likely to have experienced specific health events, and classify them into distinct categories. We then use the self-controlled case series method to find the incidence of health events in risk periods directly following a user’s search for a query category, and compare to the incidence during other periods for the same individuals. RESULTS: Searches for pet stores were risk markers for allergy. We also identified some possible new risk markers; for example: searching for fast food and theme restaurants was associated with a transient increase in risk of myocardial infarction, suggesting this exposure goes beyond a long-term risk factor but may also act as an acute trigger of myocardial infarction. Dating and adult content websites were risk markers for sexually transmitted infections, such as human immunodeficiency virus (HIV). CONCLUSIONS: Web-based methods provide a powerful, low-cost approach to automatically identify risk factors, and support more timely and personalized public health efforts to bring human and economic benefits.
format Online
Article
Text
id pubmed-4327439
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher JMIR Publications Inc.
record_format MEDLINE/PubMed
spelling pubmed-43274392015-03-05 Automatic Identification of Web-Based Risk Markers for Health Events Yom-Tov, Elad Borsa, Diana Hayward, Andrew C McKendry, Rachel A Cox, Ingemar J J Med Internet Res Original Paper BACKGROUND: The escalating cost of global health care is driving the development of new technologies to identify early indicators of an individual’s risk of disease. Traditionally, epidemiologists have identified such risk factors using medical databases and lengthy clinical studies but these are often limited in size and cost and can fail to take full account of diseases where there are social stigmas or to identify transient acute risk factors. OBJECTIVE: Here we report that Web search engine queries coupled with information on Wikipedia access patterns can be used to infer health events associated with an individual user and automatically generate Web-based risk markers for some of the common medical conditions worldwide, from cardiovascular disease to sexually transmitted infections and mental health conditions, as well as pregnancy. METHODS: Using anonymized datasets, we present methods to first distinguish individuals likely to have experienced specific health events, and classify them into distinct categories. We then use the self-controlled case series method to find the incidence of health events in risk periods directly following a user’s search for a query category, and compare to the incidence during other periods for the same individuals. RESULTS: Searches for pet stores were risk markers for allergy. We also identified some possible new risk markers; for example: searching for fast food and theme restaurants was associated with a transient increase in risk of myocardial infarction, suggesting this exposure goes beyond a long-term risk factor but may also act as an acute trigger of myocardial infarction. Dating and adult content websites were risk markers for sexually transmitted infections, such as human immunodeficiency virus (HIV). CONCLUSIONS: Web-based methods provide a powerful, low-cost approach to automatically identify risk factors, and support more timely and personalized public health efforts to bring human and economic benefits. JMIR Publications Inc. 2015-01-27 /pmc/articles/PMC4327439/ /pubmed/25626480 http://dx.doi.org/10.2196/jmir.4082 Text en ©Elad Yom-Tov, Diana Borsa, Andrew C Hayward, Rachel A McKendry, Ingemar J Cox. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 27.01.2015. http://creativecommons.org/licenses/by/2.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Yom-Tov, Elad
Borsa, Diana
Hayward, Andrew C
McKendry, Rachel A
Cox, Ingemar J
Automatic Identification of Web-Based Risk Markers for Health Events
title Automatic Identification of Web-Based Risk Markers for Health Events
title_full Automatic Identification of Web-Based Risk Markers for Health Events
title_fullStr Automatic Identification of Web-Based Risk Markers for Health Events
title_full_unstemmed Automatic Identification of Web-Based Risk Markers for Health Events
title_short Automatic Identification of Web-Based Risk Markers for Health Events
title_sort automatic identification of web-based risk markers for health events
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4327439/
https://www.ncbi.nlm.nih.gov/pubmed/25626480
http://dx.doi.org/10.2196/jmir.4082
work_keys_str_mv AT yomtovelad automaticidentificationofwebbasedriskmarkersforhealthevents
AT borsadiana automaticidentificationofwebbasedriskmarkersforhealthevents
AT haywardandrewc automaticidentificationofwebbasedriskmarkersforhealthevents
AT mckendryrachela automaticidentificationofwebbasedriskmarkersforhealthevents
AT coxingemarj automaticidentificationofwebbasedriskmarkersforhealthevents