Cargando…

The Benefits of Crowdsourcing to Seed and Align an Algorithm in an mHealth Intervention for African American and Hispanic Adults: Survey Study

BACKGROUND: The lack of publicly available and culturally relevant data sets on African American and bilingual/Spanish-speaking Hispanic adults’ disease prevention and health promotion priorities presents a major challenge for researchers and developers who want to create and test personalized tools...

Descripción completa

Detalles Bibliográficos
Autores principales: Sehgal, Neil Jay, Huang, Shuo, Johnson, Neil Mason, Dickerson, John, Jackson, Devlon, Baur, Cynthia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9257620/
https://www.ncbi.nlm.nih.gov/pubmed/35727616
http://dx.doi.org/10.2196/30216
_version_ 1784741375160352768
author Sehgal, Neil Jay
Huang, Shuo
Johnson, Neil Mason
Dickerson, John
Jackson, Devlon
Baur, Cynthia
author_facet Sehgal, Neil Jay
Huang, Shuo
Johnson, Neil Mason
Dickerson, John
Jackson, Devlon
Baur, Cynthia
author_sort Sehgal, Neil Jay
collection PubMed
description BACKGROUND: The lack of publicly available and culturally relevant data sets on African American and bilingual/Spanish-speaking Hispanic adults’ disease prevention and health promotion priorities presents a major challenge for researchers and developers who want to create and test personalized tools built on and aligned with those priorities. Personalization depends on prediction and performance data. A recommender system (RecSys) could predict the most culturally and personally relevant preventative health information and serve it to African American and Hispanic users via a novel smartphone app. However, early in a user’s experience, a RecSys can face the “cold start problem” of serving untailored and irrelevant content before it learns user preferences. For underserved African American and Hispanic populations, who are consistently being served health content targeted toward the White majority, the cold start problem can become an example of algorithmic bias. To avoid this, a RecSys needs population-appropriate seed data aligned with the app’s purposes. Crowdsourcing provides a means to generate population-appropriate seed data. OBJECTIVE: Our objective was to identify and test a method to address the lack of culturally specific preventative personal health data and sidestep the type of algorithmic bias inherent in a RecSys not trained in the population of focus. We did this by collecting a large amount of data quickly and at low cost from members of the population of focus, thereby generating a novel data set based on prevention-focused, population-relevant health goals. We seeded our RecSys with data collected anonymously from self-identified Hispanic and self-identified non-Hispanic African American/Black adult respondents, using Amazon Mechanical Turk (MTurk). METHODS: MTurk provided the crowdsourcing platform for a web-based survey in which respondents completed a personal profile and a health information–seeking assessment, and provided data on family health history and personal health history. Respondents then selected their top 3 health goals related to preventable health conditions, and for each goal, reviewed and rated the top 3 information returns by importance, personal utility, whether the item should be added to their personal health library, and their satisfaction with the quality of the information returned. This paper reports the article ratings because our intent was to assess the benefits of crowdsourcing to seed a RecSys. The analysis of the data from health goals will be reported in future papers. RESULTS: The MTurk crowdsourcing approach generated 985 valid responses from 485 (49%) self-identified Hispanic and 500 (51%) self-identified non-Hispanic African American adults over the course of only 64 days at a cost of US $6.74 per respondent. Respondents rated 92 unique articles to inform the RecSys. CONCLUSIONS: Researchers have options such as MTurk as a quick, low-cost means to avoid the cold start problem for algorithms and to sidestep bias and low relevance for an intended population of app users. Seeding a RecSys with responses from people like the intended users allows for the development of a digital health tool that can recommend information to users based on similar demography, health goals, and health history. This approach minimizes the potential, initial gaps in algorithm performance; allows for quicker algorithm refinement in use; and may deliver a better user experience to individuals seeking preventative health information to improve health and achieve health goals.
format Online
Article
Text
id pubmed-9257620
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-92576202022-07-07 The Benefits of Crowdsourcing to Seed and Align an Algorithm in an mHealth Intervention for African American and Hispanic Adults: Survey Study Sehgal, Neil Jay Huang, Shuo Johnson, Neil Mason Dickerson, John Jackson, Devlon Baur, Cynthia J Med Internet Res Original Paper BACKGROUND: The lack of publicly available and culturally relevant data sets on African American and bilingual/Spanish-speaking Hispanic adults’ disease prevention and health promotion priorities presents a major challenge for researchers and developers who want to create and test personalized tools built on and aligned with those priorities. Personalization depends on prediction and performance data. A recommender system (RecSys) could predict the most culturally and personally relevant preventative health information and serve it to African American and Hispanic users via a novel smartphone app. However, early in a user’s experience, a RecSys can face the “cold start problem” of serving untailored and irrelevant content before it learns user preferences. For underserved African American and Hispanic populations, who are consistently being served health content targeted toward the White majority, the cold start problem can become an example of algorithmic bias. To avoid this, a RecSys needs population-appropriate seed data aligned with the app’s purposes. Crowdsourcing provides a means to generate population-appropriate seed data. OBJECTIVE: Our objective was to identify and test a method to address the lack of culturally specific preventative personal health data and sidestep the type of algorithmic bias inherent in a RecSys not trained in the population of focus. We did this by collecting a large amount of data quickly and at low cost from members of the population of focus, thereby generating a novel data set based on prevention-focused, population-relevant health goals. We seeded our RecSys with data collected anonymously from self-identified Hispanic and self-identified non-Hispanic African American/Black adult respondents, using Amazon Mechanical Turk (MTurk). METHODS: MTurk provided the crowdsourcing platform for a web-based survey in which respondents completed a personal profile and a health information–seeking assessment, and provided data on family health history and personal health history. Respondents then selected their top 3 health goals related to preventable health conditions, and for each goal, reviewed and rated the top 3 information returns by importance, personal utility, whether the item should be added to their personal health library, and their satisfaction with the quality of the information returned. This paper reports the article ratings because our intent was to assess the benefits of crowdsourcing to seed a RecSys. The analysis of the data from health goals will be reported in future papers. RESULTS: The MTurk crowdsourcing approach generated 985 valid responses from 485 (49%) self-identified Hispanic and 500 (51%) self-identified non-Hispanic African American adults over the course of only 64 days at a cost of US $6.74 per respondent. Respondents rated 92 unique articles to inform the RecSys. CONCLUSIONS: Researchers have options such as MTurk as a quick, low-cost means to avoid the cold start problem for algorithms and to sidestep bias and low relevance for an intended population of app users. Seeding a RecSys with responses from people like the intended users allows for the development of a digital health tool that can recommend information to users based on similar demography, health goals, and health history. This approach minimizes the potential, initial gaps in algorithm performance; allows for quicker algorithm refinement in use; and may deliver a better user experience to individuals seeking preventative health information to improve health and achieve health goals. JMIR Publications 2022-06-21 /pmc/articles/PMC9257620/ /pubmed/35727616 http://dx.doi.org/10.2196/30216 Text en ©Neil Jay Sehgal, Shuo Huang, Neil Mason Johnson, John Dickerson, Devlon Jackson, Cynthia Baur. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 21.06.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Sehgal, Neil Jay
Huang, Shuo
Johnson, Neil Mason
Dickerson, John
Jackson, Devlon
Baur, Cynthia
The Benefits of Crowdsourcing to Seed and Align an Algorithm in an mHealth Intervention for African American and Hispanic Adults: Survey Study
title The Benefits of Crowdsourcing to Seed and Align an Algorithm in an mHealth Intervention for African American and Hispanic Adults: Survey Study
title_full The Benefits of Crowdsourcing to Seed and Align an Algorithm in an mHealth Intervention for African American and Hispanic Adults: Survey Study
title_fullStr The Benefits of Crowdsourcing to Seed and Align an Algorithm in an mHealth Intervention for African American and Hispanic Adults: Survey Study
title_full_unstemmed The Benefits of Crowdsourcing to Seed and Align an Algorithm in an mHealth Intervention for African American and Hispanic Adults: Survey Study
title_short The Benefits of Crowdsourcing to Seed and Align an Algorithm in an mHealth Intervention for African American and Hispanic Adults: Survey Study
title_sort benefits of crowdsourcing to seed and align an algorithm in an mhealth intervention for african american and hispanic adults: survey study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9257620/
https://www.ncbi.nlm.nih.gov/pubmed/35727616
http://dx.doi.org/10.2196/30216
work_keys_str_mv AT sehgalneiljay thebenefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT huangshuo thebenefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT johnsonneilmason thebenefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT dickersonjohn thebenefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT jacksondevlon thebenefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT baurcynthia thebenefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT sehgalneiljay benefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT huangshuo benefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT johnsonneilmason benefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT dickersonjohn benefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT jacksondevlon benefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy
AT baurcynthia benefitsofcrowdsourcingtoseedandalignanalgorithminanmhealthinterventionforafricanamericanandhispanicadultssurveystudy