Cargando…

Influence of User Profile Attributes on e-Cigarette–Related Searches on YouTube: Machine Learning Clustering and Classification

BACKGROUND: The proliferation of e-cigarette content on YouTube is concerning because of its possible effect on youth use behaviors. YouTube has a personalized search and recommendation algorithm that derives attributes from a user’s profile, such as age and sex. However, little is known about wheth...

Descripción completa

Detalles Bibliográficos
Autores principales: Murthy, Dhiraj, Lee, Juhan, Dashtian, Hassan, Kong, Grace
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10139687/
https://www.ncbi.nlm.nih.gov/pubmed/37124246
http://dx.doi.org/10.2196/42218
_version_ 1785033000171339776
author Murthy, Dhiraj
Lee, Juhan
Dashtian, Hassan
Kong, Grace
author_facet Murthy, Dhiraj
Lee, Juhan
Dashtian, Hassan
Kong, Grace
author_sort Murthy, Dhiraj
collection PubMed
description BACKGROUND: The proliferation of e-cigarette content on YouTube is concerning because of its possible effect on youth use behaviors. YouTube has a personalized search and recommendation algorithm that derives attributes from a user’s profile, such as age and sex. However, little is known about whether e-cigarette content is shown differently based on user characteristics. OBJECTIVE: The aim of this study was to understand the influence of age and sex attributes of user profiles on e-cigarette–related YouTube search results. METHODS: We created 16 fictitious YouTube profiles with ages of 16 and 24 years, sex (female and male), and ethnicity/race to search for 18 e-cigarette–related search terms. We used unsupervised (k-means clustering and classification) and supervised (graph convolutional network) machine learning and network analysis to characterize the variation in the search results of each profile. We further examined whether user attributes may play a role in e-cigarette–related content exposure by using networks and degree centrality. RESULTS: We analyzed 4201 nonduplicate videos. Our k-means clustering suggested that the videos could be clustered into 3 categories. The graph convolutional network achieved high accuracy (0.72). Videos were classified based on content into 4 categories: product review (49.3%), health information (15.1%), instruction (26.9%), and other (8.5%). Underage users were exposed mostly to instructional videos (37.5%), with some indication that more female 16-year-old profiles were exposed to this content, while young adult age groups (24 years) were exposed mostly to product review videos (39.2%). CONCLUSIONS: Our results indicate that demographic attributes factor into YouTube’s algorithmic systems in the context of e-cigarette–related queries on YouTube. Specifically, differences in the age and sex attributes of user profiles do result in variance in both the videos presented in YouTube search results as well as in the types of these videos. We find that underage profiles were exposed to e-cigarette content despite YouTube’s age-restriction policy that ostensibly prohibits certain e-cigarette content. Greater enforcement of policies to restrict youth access to e-cigarette content is needed.
format Online
Article
Text
id pubmed-10139687
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-101396872023-04-28 Influence of User Profile Attributes on e-Cigarette–Related Searches on YouTube: Machine Learning Clustering and Classification Murthy, Dhiraj Lee, Juhan Dashtian, Hassan Kong, Grace JMIR Infodemiology Original Paper BACKGROUND: The proliferation of e-cigarette content on YouTube is concerning because of its possible effect on youth use behaviors. YouTube has a personalized search and recommendation algorithm that derives attributes from a user’s profile, such as age and sex. However, little is known about whether e-cigarette content is shown differently based on user characteristics. OBJECTIVE: The aim of this study was to understand the influence of age and sex attributes of user profiles on e-cigarette–related YouTube search results. METHODS: We created 16 fictitious YouTube profiles with ages of 16 and 24 years, sex (female and male), and ethnicity/race to search for 18 e-cigarette–related search terms. We used unsupervised (k-means clustering and classification) and supervised (graph convolutional network) machine learning and network analysis to characterize the variation in the search results of each profile. We further examined whether user attributes may play a role in e-cigarette–related content exposure by using networks and degree centrality. RESULTS: We analyzed 4201 nonduplicate videos. Our k-means clustering suggested that the videos could be clustered into 3 categories. The graph convolutional network achieved high accuracy (0.72). Videos were classified based on content into 4 categories: product review (49.3%), health information (15.1%), instruction (26.9%), and other (8.5%). Underage users were exposed mostly to instructional videos (37.5%), with some indication that more female 16-year-old profiles were exposed to this content, while young adult age groups (24 years) were exposed mostly to product review videos (39.2%). CONCLUSIONS: Our results indicate that demographic attributes factor into YouTube’s algorithmic systems in the context of e-cigarette–related queries on YouTube. Specifically, differences in the age and sex attributes of user profiles do result in variance in both the videos presented in YouTube search results as well as in the types of these videos. We find that underage profiles were exposed to e-cigarette content despite YouTube’s age-restriction policy that ostensibly prohibits certain e-cigarette content. Greater enforcement of policies to restrict youth access to e-cigarette content is needed. JMIR Publications 2023-04-12 /pmc/articles/PMC10139687/ /pubmed/37124246 http://dx.doi.org/10.2196/42218 Text en ©Dhiraj Murthy, Juhan Lee, Hassan Dashtian, Grace Kong. Originally published in JMIR Infodemiology (https://infodemiology.jmir.org), 12.04.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Infodemiology, is properly cited. The complete bibliographic information, a link to the original publication on https://infodemiology.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Murthy, Dhiraj
Lee, Juhan
Dashtian, Hassan
Kong, Grace
Influence of User Profile Attributes on e-Cigarette–Related Searches on YouTube: Machine Learning Clustering and Classification
title Influence of User Profile Attributes on e-Cigarette–Related Searches on YouTube: Machine Learning Clustering and Classification
title_full Influence of User Profile Attributes on e-Cigarette–Related Searches on YouTube: Machine Learning Clustering and Classification
title_fullStr Influence of User Profile Attributes on e-Cigarette–Related Searches on YouTube: Machine Learning Clustering and Classification
title_full_unstemmed Influence of User Profile Attributes on e-Cigarette–Related Searches on YouTube: Machine Learning Clustering and Classification
title_short Influence of User Profile Attributes on e-Cigarette–Related Searches on YouTube: Machine Learning Clustering and Classification
title_sort influence of user profile attributes on e-cigarette–related searches on youtube: machine learning clustering and classification
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10139687/
https://www.ncbi.nlm.nih.gov/pubmed/37124246
http://dx.doi.org/10.2196/42218
work_keys_str_mv AT murthydhiraj influenceofuserprofileattributesonecigaretterelatedsearchesonyoutubemachinelearningclusteringandclassification
AT leejuhan influenceofuserprofileattributesonecigaretterelatedsearchesonyoutubemachinelearningclusteringandclassification
AT dashtianhassan influenceofuserprofileattributesonecigaretterelatedsearchesonyoutubemachinelearningclusteringandclassification
AT konggrace influenceofuserprofileattributesonecigaretterelatedsearchesonyoutubemachinelearningclusteringandclassification