Influence of User Profile Attributes on e-Cigarette–Related Searches on YouTube: Machine Learning Clustering and Classification
BACKGROUND: The proliferation of e-cigarette content on YouTube is concerning because of its possible effect on youth use behaviors. YouTube has a personalized search and recommendation algorithm that derives attributes from a user’s profile, such as age and sex. However, little is known about wheth...
Main Authors: | Murthy, Dhiraj; Lee, Juhan; Dashtian, Hassan; Kong, Grace |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | JMIR Publications 2023 |
Subjects: | Original Paper |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10139687/ https://www.ncbi.nlm.nih.gov/pubmed/37124246 http://dx.doi.org/10.2196/42218 |
_version_ | 1785033000171339776 |
---|---|
author | Murthy, Dhiraj Lee, Juhan Dashtian, Hassan Kong, Grace |
author_sort | Murthy, Dhiraj |
collection | PubMed |
description | BACKGROUND: The proliferation of e-cigarette content on YouTube is concerning because of its possible effect on youth use behaviors. YouTube has a personalized search and recommendation algorithm that derives attributes from a user’s profile, such as age and sex. However, little is known about whether e-cigarette content is shown differently based on user characteristics. OBJECTIVE: The aim of this study was to understand the influence of age and sex attributes of user profiles on e-cigarette–related YouTube search results. METHODS: We created 16 fictitious YouTube profiles with ages of 16 and 24 years, sex (female and male), and ethnicity/race to search for 18 e-cigarette–related search terms. We used unsupervised (k-means clustering and classification) and supervised (graph convolutional network) machine learning and network analysis to characterize the variation in the search results of each profile. We further examined whether user attributes may play a role in e-cigarette–related content exposure by using networks and degree centrality. RESULTS: We analyzed 4201 nonduplicate videos. Our k-means clustering suggested that the videos could be clustered into 3 categories. The graph convolutional network achieved high accuracy (0.72). Videos were classified based on content into 4 categories: product review (49.3%), health information (15.1%), instruction (26.9%), and other (8.5%). Underage users were exposed mostly to instructional videos (37.5%), with some indication that more female 16-year-old profiles were exposed to this content, while young adult age groups (24 years) were exposed mostly to product review videos (39.2%). CONCLUSIONS: Our results indicate that demographic attributes factor into YouTube’s algorithmic systems in the context of e-cigarette–related queries on YouTube. Specifically, differences in the age and sex attributes of user profiles do result in variance in both the videos presented in YouTube search results as well as in the types of these videos. We find that underage profiles were exposed to e-cigarette content despite YouTube’s age-restriction policy that ostensibly prohibits certain e-cigarette content. Greater enforcement of policies to restrict youth access to e-cigarette content is needed. |
format | Online Article Text |
id | pubmed-10139687 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-10139687 2023-04-28 JMIR Infodemiology Original Paper JMIR Publications 2023-04-12 /pmc/articles/PMC10139687/ /pubmed/37124246 http://dx.doi.org/10.2196/42218 Text en ©Dhiraj Murthy, Juhan Lee, Hassan Dashtian, Grace Kong. Originally published in JMIR Infodemiology (https://infodemiology.jmir.org), 12.04.2023. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Infodemiology, is properly cited. The complete bibliographic information, a link to the original publication on https://infodemiology.jmir.org/, as well as this copyright and license information must be included. |
title | Influence of User Profile Attributes on e-Cigarette–Related Searches on YouTube: Machine Learning Clustering and Classification |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10139687/ https://www.ncbi.nlm.nih.gov/pubmed/37124246 http://dx.doi.org/10.2196/42218 |