Cargando…

Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study

The Internet’s popularization has increased the amount of content produced and consumed on the web. To take advantage of this new market, major content producers such as Netflix and Amazon Prime have emerged, focusing on video streaming services. However, despite the large number and diversity of vi...

Descripción completa

Detalles Bibliográficos
Autores principales: de Sá, Sidney Loyola, Rocha, Antonio A. de A., Paes, Aline
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8588537/
https://www.ncbi.nlm.nih.gov/pubmed/34770633
http://dx.doi.org/10.3390/s21217328
_version_ 1784598487106584576
author de Sá, Sidney Loyola
Rocha, Antonio A. de A.
Paes, Aline
author_facet de Sá, Sidney Loyola
Rocha, Antonio A. de A.
Paes, Aline
author_sort de Sá, Sidney Loyola
collection PubMed
description The Internet’s popularization has increased the amount of content produced and consumed on the web. To take advantage of this new market, major content producers such as Netflix and Amazon Prime have emerged, focusing on video streaming services. However, despite the large number and diversity of videos made available by these content providers, few of them attract the attention of most users. For example, in the data explored in this article, only 6% of the most popular videos account for 85% of total views. Finding out in advance which videos will be popular is not trivial, especially given many influencing variables. Nevertheless, a tool with this ability would be of great value to help dimension network infrastructure and properly recommend new content to users. In this way, this manuscript examines the machine learning-based approaches that have been proposed to solve the prediction of web content popularity. To this end, we first survey the literature and elaborate a taxonomy that classifies models according to predictive features and describes state-of-the-art features and techniques used to solve this task. While analyzing previous works, we saw an opportunity to use textual features for video prediction. Thus, additionally, we propose a case study that combines features acquired through attribute engineering and word embedding to predict the popularity of a video. The first approach is based on predictive attributes defined by resource engineering. The second takes advantage of word embeddings from video descriptions and titles. We experimented with the proposed techniques in a set of videos from GloboPlay, the largest provider of video streaming services in Latin America. A combination of engineering features and embeddings using the Random Forest algorithm achieved the best result, with an accuracy of 87%.
format Online
Article
Text
id pubmed-8588537
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-85885372021-11-13 Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study de Sá, Sidney Loyola Rocha, Antonio A. de A. Paes, Aline Sensors (Basel) Article The Internet’s popularization has increased the amount of content produced and consumed on the web. To take advantage of this new market, major content producers such as Netflix and Amazon Prime have emerged, focusing on video streaming services. However, despite the large number and diversity of videos made available by these content providers, few of them attract the attention of most users. For example, in the data explored in this article, only 6% of the most popular videos account for 85% of total views. Finding out in advance which videos will be popular is not trivial, especially given many influencing variables. Nevertheless, a tool with this ability would be of great value to help dimension network infrastructure and properly recommend new content to users. In this way, this manuscript examines the machine learning-based approaches that have been proposed to solve the prediction of web content popularity. To this end, we first survey the literature and elaborate a taxonomy that classifies models according to predictive features and describes state-of-the-art features and techniques used to solve this task. While analyzing previous works, we saw an opportunity to use textual features for video prediction. Thus, additionally, we propose a case study that combines features acquired through attribute engineering and word embedding to predict the popularity of a video. The first approach is based on predictive attributes defined by resource engineering. The second takes advantage of word embeddings from video descriptions and titles. We experimented with the proposed techniques in a set of videos from GloboPlay, the largest provider of video streaming services in Latin America. A combination of engineering features and embeddings using the Random Forest algorithm achieved the best result, with an accuracy of 87%. MDPI 2021-11-03 /pmc/articles/PMC8588537/ /pubmed/34770633 http://dx.doi.org/10.3390/s21217328 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
de Sá, Sidney Loyola
Rocha, Antonio A. de A.
Paes, Aline
Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study
title Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study
title_full Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study
title_fullStr Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study
title_full_unstemmed Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study
title_short Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study
title_sort predicting popularity of video streaming services with representation learning: a survey and a real-world case study
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8588537/
https://www.ncbi.nlm.nih.gov/pubmed/34770633
http://dx.doi.org/10.3390/s21217328
work_keys_str_mv AT desasidneyloyola predictingpopularityofvideostreamingserviceswithrepresentationlearningasurveyandarealworldcasestudy
AT rochaantonioadea predictingpopularityofvideostreamingserviceswithrepresentationlearningasurveyandarealworldcasestudy
AT paesaline predictingpopularityofvideostreamingserviceswithrepresentationlearningasurveyandarealworldcasestudy