Cargando…

Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data

BACKGROUND: Assessing the efficacy of Internet interventions that are already in the market introduces both challenges and opportunities. While vast, often unprecedented amounts of data may be available (hundreds of thousands, and sometimes millions of participants with high dimensions of assessed v...

Descripción completa

Detalles Bibliográficos
Autores principales: Carpenter, Jordan, Crutchley, Patrick, Zilca, Ran D, Schwartz, H Andrew, Smith, Laura K, Cobb, Angela M, Parks, Acacia C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5023946/
https://www.ncbi.nlm.nih.gov/pubmed/27580524
http://dx.doi.org/10.2196/jmir.5725
_version_ 1782453716924760064
author Carpenter, Jordan
Crutchley, Patrick
Zilca, Ran D
Schwartz, H Andrew
Smith, Laura K
Cobb, Angela M
Parks, Acacia C
author_facet Carpenter, Jordan
Crutchley, Patrick
Zilca, Ran D
Schwartz, H Andrew
Smith, Laura K
Cobb, Angela M
Parks, Acacia C
author_sort Carpenter, Jordan
collection PubMed
description BACKGROUND: Assessing the efficacy of Internet interventions that are already in the market introduces both challenges and opportunities. While vast, often unprecedented amounts of data may be available (hundreds of thousands, and sometimes millions of participants with high dimensions of assessed variables), the data are observational in nature, are partly unstructured (eg, free text, images, sensor data), do not include a natural control group to be used for comparison, and typically exhibit high attrition rates. New approaches are therefore needed to use these existing data and derive new insights that can augment traditional smaller-group randomized controlled trials. OBJECTIVE: Our objective was to demonstrate how emerging big data approaches can help explore questions about the effectiveness and process of an Internet well-being intervention. METHODS: We drew data from the user base of a well-being website and app called Happify. To explore effectiveness, multilevel models focusing on within-person variation explored whether greater usage predicted higher well-being in a sample of 152,747 users. In addition, to explore the underlying processes that accompany improvement, we analyzed language for 10,818 users who had a sufficient volume of free-text response and timespan of platform usage. A topic model constructed from this free text provided language-based correlates of individual user improvement in outcome measures, providing insights into the beneficial underlying processes experienced by users. RESULTS: On a measure of positive emotion, the average user improved 1.38 points per week (SE 0.01, t122,455=113.60, P<.001, 95% CI 1.36–1.41), about a 27% increase over 8 weeks. Within a given individual user, more usage predicted more positive emotion and less usage predicted less positive emotion (estimate 0.09, SE 0.01, t6047=9.15, P=.001, 95% CI .07–.12). This estimate predicted that a given user would report positive emotion 1.26 points higher after a 2-week period when they used Happify daily than during a week when they didn’t use it at all. Among highly engaged users, 200 automatically clustered topics showed a significant (corrected P<.001) effect on change in well-being over time, illustrating which topics may be more beneficial than others when engaging with the interventions. In particular, topics that are related to addressing negative thoughts and feelings were correlated with improvement over time. CONCLUSIONS: Using observational analyses on naturalistic big data, we can explore the relationship between usage and well-being among people using an Internet well-being intervention and provide new insights into the underlying mechanisms that accompany it. By leveraging big data to power these new types of analyses, we can explore the workings of an intervention from new angles, and harness the insights that surface to feed back into the intervention and improve it further in the future.
format Online
Article
Text
id pubmed-5023946
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-50239462016-10-03 Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data Carpenter, Jordan Crutchley, Patrick Zilca, Ran D Schwartz, H Andrew Smith, Laura K Cobb, Angela M Parks, Acacia C J Med Internet Res Original Paper BACKGROUND: Assessing the efficacy of Internet interventions that are already in the market introduces both challenges and opportunities. While vast, often unprecedented amounts of data may be available (hundreds of thousands, and sometimes millions of participants with high dimensions of assessed variables), the data are observational in nature, are partly unstructured (eg, free text, images, sensor data), do not include a natural control group to be used for comparison, and typically exhibit high attrition rates. New approaches are therefore needed to use these existing data and derive new insights that can augment traditional smaller-group randomized controlled trials. OBJECTIVE: Our objective was to demonstrate how emerging big data approaches can help explore questions about the effectiveness and process of an Internet well-being intervention. METHODS: We drew data from the user base of a well-being website and app called Happify. To explore effectiveness, multilevel models focusing on within-person variation explored whether greater usage predicted higher well-being in a sample of 152,747 users. In addition, to explore the underlying processes that accompany improvement, we analyzed language for 10,818 users who had a sufficient volume of free-text response and timespan of platform usage. A topic model constructed from this free text provided language-based correlates of individual user improvement in outcome measures, providing insights into the beneficial underlying processes experienced by users. RESULTS: On a measure of positive emotion, the average user improved 1.38 points per week (SE 0.01, t122,455=113.60, P<.001, 95% CI 1.36–1.41), about a 27% increase over 8 weeks. Within a given individual user, more usage predicted more positive emotion and less usage predicted less positive emotion (estimate 0.09, SE 0.01, t6047=9.15, P=.001, 95% CI .07–.12). This estimate predicted that a given user would report positive emotion 1.26 points higher after a 2-week period when they used Happify daily than during a week when they didn’t use it at all. Among highly engaged users, 200 automatically clustered topics showed a significant (corrected P<.001) effect on change in well-being over time, illustrating which topics may be more beneficial than others when engaging with the interventions. In particular, topics that are related to addressing negative thoughts and feelings were correlated with improvement over time. CONCLUSIONS: Using observational analyses on naturalistic big data, we can explore the relationship between usage and well-being among people using an Internet well-being intervention and provide new insights into the underlying mechanisms that accompany it. By leveraging big data to power these new types of analyses, we can explore the workings of an intervention from new angles, and harness the insights that surface to feed back into the intervention and improve it further in the future. JMIR Publications 2016-08-31 /pmc/articles/PMC5023946/ /pubmed/27580524 http://dx.doi.org/10.2196/jmir.5725 Text en ©Jordan Carpenter, Patrick Crutchley, Ran D Zilca, H Andrew Schwartz, Laura K Smith, Angela M Cobb, Acacia C Parks. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 31.08.2016. https://creativecommons.org/licenses/by/2.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/ (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Carpenter, Jordan
Crutchley, Patrick
Zilca, Ran D
Schwartz, H Andrew
Smith, Laura K
Cobb, Angela M
Parks, Acacia C
Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data
title Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data
title_full Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data
title_fullStr Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data
title_full_unstemmed Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data
title_short Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data
title_sort seeing the “big” picture: big data methods for exploring relationships between usage, language, and outcome in internet intervention data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5023946/
https://www.ncbi.nlm.nih.gov/pubmed/27580524
http://dx.doi.org/10.2196/jmir.5725
work_keys_str_mv AT carpenterjordan seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata
AT crutchleypatrick seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata
AT zilcarand seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata
AT schwartzhandrew seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata
AT smithlaurak seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata
AT cobbangelam seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata
AT parksacaciac seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata