Cargando…
Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data
BACKGROUND: Assessing the efficacy of Internet interventions that are already in the market introduces both challenges and opportunities. While vast, often unprecedented amounts of data may be available (hundreds of thousands, and sometimes millions of participants with high dimensions of assessed v...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5023946/ https://www.ncbi.nlm.nih.gov/pubmed/27580524 http://dx.doi.org/10.2196/jmir.5725 |
_version_ | 1782453716924760064 |
---|---|
author | Carpenter, Jordan Crutchley, Patrick Zilca, Ran D Schwartz, H Andrew Smith, Laura K Cobb, Angela M Parks, Acacia C |
author_facet | Carpenter, Jordan Crutchley, Patrick Zilca, Ran D Schwartz, H Andrew Smith, Laura K Cobb, Angela M Parks, Acacia C |
author_sort | Carpenter, Jordan |
collection | PubMed |
description | BACKGROUND: Assessing the efficacy of Internet interventions that are already in the market introduces both challenges and opportunities. While vast, often unprecedented amounts of data may be available (hundreds of thousands, and sometimes millions of participants with high dimensions of assessed variables), the data are observational in nature, are partly unstructured (eg, free text, images, sensor data), do not include a natural control group to be used for comparison, and typically exhibit high attrition rates. New approaches are therefore needed to use these existing data and derive new insights that can augment traditional smaller-group randomized controlled trials. OBJECTIVE: Our objective was to demonstrate how emerging big data approaches can help explore questions about the effectiveness and process of an Internet well-being intervention. METHODS: We drew data from the user base of a well-being website and app called Happify. To explore effectiveness, multilevel models focusing on within-person variation explored whether greater usage predicted higher well-being in a sample of 152,747 users. In addition, to explore the underlying processes that accompany improvement, we analyzed language for 10,818 users who had a sufficient volume of free-text response and timespan of platform usage. A topic model constructed from this free text provided language-based correlates of individual user improvement in outcome measures, providing insights into the beneficial underlying processes experienced by users. RESULTS: On a measure of positive emotion, the average user improved 1.38 points per week (SE 0.01, t122,455=113.60, P<.001, 95% CI 1.36–1.41), about a 27% increase over 8 weeks. Within a given individual user, more usage predicted more positive emotion and less usage predicted less positive emotion (estimate 0.09, SE 0.01, t6047=9.15, P=.001, 95% CI .07–.12). This estimate predicted that a given user would report positive emotion 1.26 points higher after a 2-week period when they used Happify daily than during a week when they didn’t use it at all. Among highly engaged users, 200 automatically clustered topics showed a significant (corrected P<.001) effect on change in well-being over time, illustrating which topics may be more beneficial than others when engaging with the interventions. In particular, topics that are related to addressing negative thoughts and feelings were correlated with improvement over time. CONCLUSIONS: Using observational analyses on naturalistic big data, we can explore the relationship between usage and well-being among people using an Internet well-being intervention and provide new insights into the underlying mechanisms that accompany it. By leveraging big data to power these new types of analyses, we can explore the workings of an intervention from new angles, and harness the insights that surface to feed back into the intervention and improve it further in the future. |
format | Online Article Text |
id | pubmed-5023946 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-50239462016-10-03 Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data Carpenter, Jordan Crutchley, Patrick Zilca, Ran D Schwartz, H Andrew Smith, Laura K Cobb, Angela M Parks, Acacia C J Med Internet Res Original Paper BACKGROUND: Assessing the efficacy of Internet interventions that are already in the market introduces both challenges and opportunities. While vast, often unprecedented amounts of data may be available (hundreds of thousands, and sometimes millions of participants with high dimensions of assessed variables), the data are observational in nature, are partly unstructured (eg, free text, images, sensor data), do not include a natural control group to be used for comparison, and typically exhibit high attrition rates. New approaches are therefore needed to use these existing data and derive new insights that can augment traditional smaller-group randomized controlled trials. OBJECTIVE: Our objective was to demonstrate how emerging big data approaches can help explore questions about the effectiveness and process of an Internet well-being intervention. METHODS: We drew data from the user base of a well-being website and app called Happify. To explore effectiveness, multilevel models focusing on within-person variation explored whether greater usage predicted higher well-being in a sample of 152,747 users. In addition, to explore the underlying processes that accompany improvement, we analyzed language for 10,818 users who had a sufficient volume of free-text response and timespan of platform usage. A topic model constructed from this free text provided language-based correlates of individual user improvement in outcome measures, providing insights into the beneficial underlying processes experienced by users. RESULTS: On a measure of positive emotion, the average user improved 1.38 points per week (SE 0.01, t122,455=113.60, P<.001, 95% CI 1.36–1.41), about a 27% increase over 8 weeks. Within a given individual user, more usage predicted more positive emotion and less usage predicted less positive emotion (estimate 0.09, SE 0.01, t6047=9.15, P=.001, 95% CI .07–.12). This estimate predicted that a given user would report positive emotion 1.26 points higher after a 2-week period when they used Happify daily than during a week when they didn’t use it at all. Among highly engaged users, 200 automatically clustered topics showed a significant (corrected P<.001) effect on change in well-being over time, illustrating which topics may be more beneficial than others when engaging with the interventions. In particular, topics that are related to addressing negative thoughts and feelings were correlated with improvement over time. CONCLUSIONS: Using observational analyses on naturalistic big data, we can explore the relationship between usage and well-being among people using an Internet well-being intervention and provide new insights into the underlying mechanisms that accompany it. By leveraging big data to power these new types of analyses, we can explore the workings of an intervention from new angles, and harness the insights that surface to feed back into the intervention and improve it further in the future. JMIR Publications 2016-08-31 /pmc/articles/PMC5023946/ /pubmed/27580524 http://dx.doi.org/10.2196/jmir.5725 Text en ©Jordan Carpenter, Patrick Crutchley, Ran D Zilca, H Andrew Schwartz, Laura K Smith, Angela M Cobb, Acacia C Parks. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 31.08.2016. https://creativecommons.org/licenses/by/2.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/ (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Carpenter, Jordan Crutchley, Patrick Zilca, Ran D Schwartz, H Andrew Smith, Laura K Cobb, Angela M Parks, Acacia C Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data |
title | Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data |
title_full | Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data |
title_fullStr | Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data |
title_full_unstemmed | Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data |
title_short | Seeing the “Big” Picture: Big Data Methods for Exploring Relationships Between Usage, Language, and Outcome in Internet Intervention Data |
title_sort | seeing the “big” picture: big data methods for exploring relationships between usage, language, and outcome in internet intervention data |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5023946/ https://www.ncbi.nlm.nih.gov/pubmed/27580524 http://dx.doi.org/10.2196/jmir.5725 |
work_keys_str_mv | AT carpenterjordan seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata AT crutchleypatrick seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata AT zilcarand seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata AT schwartzhandrew seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata AT smithlaurak seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata AT cobbangelam seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata AT parksacaciac seeingthebigpicturebigdatamethodsforexploringrelationshipsbetweenusagelanguageandoutcomeininternetinterventiondata |