Cargando…

Comparing performance of ensemble methods in predicting movie box office revenue

While many business intelligence methods have been applied to predict movie box office revenue, the studies using an ensemble approach to predict box office revenue are almost nonexistent. In this study, we propose decision trees, k-nearest-neighbors (k-NN), and linear regression using ensemble meth...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Sangjae, KC, Bikash, Choeh, Joon Yeon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7322254/
https://www.ncbi.nlm.nih.gov/pubmed/32613125
http://dx.doi.org/10.1016/j.heliyon.2020.e04260
_version_ 1783551609524977664
author Lee, Sangjae
KC, Bikash
Choeh, Joon Yeon
author_facet Lee, Sangjae
KC, Bikash
Choeh, Joon Yeon
author_sort Lee, Sangjae
collection PubMed
description While many business intelligence methods have been applied to predict movie box office revenue, the studies using an ensemble approach to predict box office revenue are almost nonexistent. In this study, we propose decision trees, k-nearest-neighbors (k-NN), and linear regression using ensemble methods and the prediction performance of decision trees based on random forests, bagging and boosting are compared with that of k-NN and linear regression based on bagging and boosting using the sample of 1439 movies. The results indicate that ensemble methods based on decision trees (random forests, bagging, boosting) outperform ensemble methods based on k-NN (bagging, boosting) in predicting box office at week 1, 2, 3 after release. Decision trees using ensemble methods provide better prediction performance than ensemble methods based on linear regression analysis in the box office at week 1 after release. This is explained by the results that after comparing the prediction performance between ensemble methods and non-ensemble methods. For decision tree methods, unlike the other methods, the prediction performance of ensemble methods is greater than that of non-ensemble methods. This shows that decision trees using ensemble methods provide better application effectiveness of ensemble methods than k-NN and linear regression analysis.
format Online
Article
Text
id pubmed-7322254
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-73222542020-06-30 Comparing performance of ensemble methods in predicting movie box office revenue Lee, Sangjae KC, Bikash Choeh, Joon Yeon Heliyon Article While many business intelligence methods have been applied to predict movie box office revenue, the studies using an ensemble approach to predict box office revenue are almost nonexistent. In this study, we propose decision trees, k-nearest-neighbors (k-NN), and linear regression using ensemble methods and the prediction performance of decision trees based on random forests, bagging and boosting are compared with that of k-NN and linear regression based on bagging and boosting using the sample of 1439 movies. The results indicate that ensemble methods based on decision trees (random forests, bagging, boosting) outperform ensemble methods based on k-NN (bagging, boosting) in predicting box office at week 1, 2, 3 after release. Decision trees using ensemble methods provide better prediction performance than ensemble methods based on linear regression analysis in the box office at week 1 after release. This is explained by the results that after comparing the prediction performance between ensemble methods and non-ensemble methods. For decision tree methods, unlike the other methods, the prediction performance of ensemble methods is greater than that of non-ensemble methods. This shows that decision trees using ensemble methods provide better application effectiveness of ensemble methods than k-NN and linear regression analysis. Elsevier 2020-06-20 /pmc/articles/PMC7322254/ /pubmed/32613125 http://dx.doi.org/10.1016/j.heliyon.2020.e04260 Text en © 2020 Published by Elsevier Ltd. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Lee, Sangjae
KC, Bikash
Choeh, Joon Yeon
Comparing performance of ensemble methods in predicting movie box office revenue
title Comparing performance of ensemble methods in predicting movie box office revenue
title_full Comparing performance of ensemble methods in predicting movie box office revenue
title_fullStr Comparing performance of ensemble methods in predicting movie box office revenue
title_full_unstemmed Comparing performance of ensemble methods in predicting movie box office revenue
title_short Comparing performance of ensemble methods in predicting movie box office revenue
title_sort comparing performance of ensemble methods in predicting movie box office revenue
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7322254/
https://www.ncbi.nlm.nih.gov/pubmed/32613125
http://dx.doi.org/10.1016/j.heliyon.2020.e04260
work_keys_str_mv AT leesangjae comparingperformanceofensemblemethodsinpredictingmovieboxofficerevenue
AT kcbikash comparingperformanceofensemblemethodsinpredictingmovieboxofficerevenue
AT choehjoonyeon comparingperformanceofensemblemethodsinpredictingmovieboxofficerevenue