Cargando…
Comparing performance of ensemble methods in predicting movie box office revenue
While many business intelligence methods have been applied to predict movie box office revenue, the studies using an ensemble approach to predict box office revenue are almost nonexistent. In this study, we propose decision trees, k-nearest-neighbors (k-NN), and linear regression using ensemble meth...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7322254/ https://www.ncbi.nlm.nih.gov/pubmed/32613125 http://dx.doi.org/10.1016/j.heliyon.2020.e04260 |
_version_ | 1783551609524977664 |
---|---|
author | Lee, Sangjae KC, Bikash Choeh, Joon Yeon |
author_facet | Lee, Sangjae KC, Bikash Choeh, Joon Yeon |
author_sort | Lee, Sangjae |
collection | PubMed |
description | While many business intelligence methods have been applied to predict movie box office revenue, the studies using an ensemble approach to predict box office revenue are almost nonexistent. In this study, we propose decision trees, k-nearest-neighbors (k-NN), and linear regression using ensemble methods and the prediction performance of decision trees based on random forests, bagging and boosting are compared with that of k-NN and linear regression based on bagging and boosting using the sample of 1439 movies. The results indicate that ensemble methods based on decision trees (random forests, bagging, boosting) outperform ensemble methods based on k-NN (bagging, boosting) in predicting box office at week 1, 2, 3 after release. Decision trees using ensemble methods provide better prediction performance than ensemble methods based on linear regression analysis in the box office at week 1 after release. This is explained by the results that after comparing the prediction performance between ensemble methods and non-ensemble methods. For decision tree methods, unlike the other methods, the prediction performance of ensemble methods is greater than that of non-ensemble methods. This shows that decision trees using ensemble methods provide better application effectiveness of ensemble methods than k-NN and linear regression analysis. |
format | Online Article Text |
id | pubmed-7322254 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-73222542020-06-30 Comparing performance of ensemble methods in predicting movie box office revenue Lee, Sangjae KC, Bikash Choeh, Joon Yeon Heliyon Article While many business intelligence methods have been applied to predict movie box office revenue, the studies using an ensemble approach to predict box office revenue are almost nonexistent. In this study, we propose decision trees, k-nearest-neighbors (k-NN), and linear regression using ensemble methods and the prediction performance of decision trees based on random forests, bagging and boosting are compared with that of k-NN and linear regression based on bagging and boosting using the sample of 1439 movies. The results indicate that ensemble methods based on decision trees (random forests, bagging, boosting) outperform ensemble methods based on k-NN (bagging, boosting) in predicting box office at week 1, 2, 3 after release. Decision trees using ensemble methods provide better prediction performance than ensemble methods based on linear regression analysis in the box office at week 1 after release. This is explained by the results that after comparing the prediction performance between ensemble methods and non-ensemble methods. For decision tree methods, unlike the other methods, the prediction performance of ensemble methods is greater than that of non-ensemble methods. This shows that decision trees using ensemble methods provide better application effectiveness of ensemble methods than k-NN and linear regression analysis. Elsevier 2020-06-20 /pmc/articles/PMC7322254/ /pubmed/32613125 http://dx.doi.org/10.1016/j.heliyon.2020.e04260 Text en © 2020 Published by Elsevier Ltd. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Article Lee, Sangjae KC, Bikash Choeh, Joon Yeon Comparing performance of ensemble methods in predicting movie box office revenue |
title | Comparing performance of ensemble methods in predicting movie box office revenue |
title_full | Comparing performance of ensemble methods in predicting movie box office revenue |
title_fullStr | Comparing performance of ensemble methods in predicting movie box office revenue |
title_full_unstemmed | Comparing performance of ensemble methods in predicting movie box office revenue |
title_short | Comparing performance of ensemble methods in predicting movie box office revenue |
title_sort | comparing performance of ensemble methods in predicting movie box office revenue |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7322254/ https://www.ncbi.nlm.nih.gov/pubmed/32613125 http://dx.doi.org/10.1016/j.heliyon.2020.e04260 |
work_keys_str_mv | AT leesangjae comparingperformanceofensemblemethodsinpredictingmovieboxofficerevenue AT kcbikash comparingperformanceofensemblemethodsinpredictingmovieboxofficerevenue AT choehjoonyeon comparingperformanceofensemblemethodsinpredictingmovieboxofficerevenue |