Cargando…
Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study
The Internet’s popularization has increased the amount of content produced and consumed on the web. To take advantage of this new market, major content producers such as Netflix and Amazon Prime have emerged, focusing on video streaming services. However, despite the large number and diversity of vi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8588537/ https://www.ncbi.nlm.nih.gov/pubmed/34770633 http://dx.doi.org/10.3390/s21217328 |
_version_ | 1784598487106584576 |
---|---|
author | de Sá, Sidney Loyola Rocha, Antonio A. de A. Paes, Aline |
author_facet | de Sá, Sidney Loyola Rocha, Antonio A. de A. Paes, Aline |
author_sort | de Sá, Sidney Loyola |
collection | PubMed |
description | The Internet’s popularization has increased the amount of content produced and consumed on the web. To take advantage of this new market, major content producers such as Netflix and Amazon Prime have emerged, focusing on video streaming services. However, despite the large number and diversity of videos made available by these content providers, few of them attract the attention of most users. For example, in the data explored in this article, only 6% of the most popular videos account for 85% of total views. Finding out in advance which videos will be popular is not trivial, especially given many influencing variables. Nevertheless, a tool with this ability would be of great value to help dimension network infrastructure and properly recommend new content to users. In this way, this manuscript examines the machine learning-based approaches that have been proposed to solve the prediction of web content popularity. To this end, we first survey the literature and elaborate a taxonomy that classifies models according to predictive features and describes state-of-the-art features and techniques used to solve this task. While analyzing previous works, we saw an opportunity to use textual features for video prediction. Thus, additionally, we propose a case study that combines features acquired through attribute engineering and word embedding to predict the popularity of a video. The first approach is based on predictive attributes defined by resource engineering. The second takes advantage of word embeddings from video descriptions and titles. We experimented with the proposed techniques in a set of videos from GloboPlay, the largest provider of video streaming services in Latin America. A combination of engineering features and embeddings using the Random Forest algorithm achieved the best result, with an accuracy of 87%. |
format | Online Article Text |
id | pubmed-8588537 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-85885372021-11-13 Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study de Sá, Sidney Loyola Rocha, Antonio A. de A. Paes, Aline Sensors (Basel) Article The Internet’s popularization has increased the amount of content produced and consumed on the web. To take advantage of this new market, major content producers such as Netflix and Amazon Prime have emerged, focusing on video streaming services. However, despite the large number and diversity of videos made available by these content providers, few of them attract the attention of most users. For example, in the data explored in this article, only 6% of the most popular videos account for 85% of total views. Finding out in advance which videos will be popular is not trivial, especially given many influencing variables. Nevertheless, a tool with this ability would be of great value to help dimension network infrastructure and properly recommend new content to users. In this way, this manuscript examines the machine learning-based approaches that have been proposed to solve the prediction of web content popularity. To this end, we first survey the literature and elaborate a taxonomy that classifies models according to predictive features and describes state-of-the-art features and techniques used to solve this task. While analyzing previous works, we saw an opportunity to use textual features for video prediction. Thus, additionally, we propose a case study that combines features acquired through attribute engineering and word embedding to predict the popularity of a video. The first approach is based on predictive attributes defined by resource engineering. The second takes advantage of word embeddings from video descriptions and titles. We experimented with the proposed techniques in a set of videos from GloboPlay, the largest provider of video streaming services in Latin America. A combination of engineering features and embeddings using the Random Forest algorithm achieved the best result, with an accuracy of 87%. MDPI 2021-11-03 /pmc/articles/PMC8588537/ /pubmed/34770633 http://dx.doi.org/10.3390/s21217328 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article de Sá, Sidney Loyola Rocha, Antonio A. de A. Paes, Aline Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study |
title | Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study |
title_full | Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study |
title_fullStr | Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study |
title_full_unstemmed | Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study |
title_short | Predicting Popularity of Video Streaming Services with Representation Learning: A Survey and a Real-World Case Study |
title_sort | predicting popularity of video streaming services with representation learning: a survey and a real-world case study |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8588537/ https://www.ncbi.nlm.nih.gov/pubmed/34770633 http://dx.doi.org/10.3390/s21217328 |
work_keys_str_mv | AT desasidneyloyola predictingpopularityofvideostreamingserviceswithrepresentationlearningasurveyandarealworldcasestudy AT rochaantonioadea predictingpopularityofvideostreamingserviceswithrepresentationlearningasurveyandarealworldcasestudy AT paesaline predictingpopularityofvideostreamingserviceswithrepresentationlearningasurveyandarealworldcasestudy |