Cargando…
A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football?
Data-related analysis in football increasingly benefits from Big Data approaches and machine learning methods. One relevant application of data analysis in football is forecasting, which relies on understanding and accurately modelling the process of a match. The present paper tackles two neglected...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Vienna
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8714875/ https://www.ncbi.nlm.nih.gov/pubmed/34976228 http://dx.doi.org/10.1007/s13278-021-00842-z |
_version_ | 1784624022036676608 |
---|---|
author | Wunderlich, Fabian Memmert, Daniel |
author_facet | Wunderlich, Fabian Memmert, Daniel |
author_sort | Wunderlich, Fabian |
collection | PubMed |
description | Data-related analysis in football increasingly benefits from Big Data approaches and machine learning methods. One relevant application of data analysis in football is forecasting, which relies on understanding and accurately modelling the process of a match. The present paper tackles two neglected facets of forecasting in football: Forecasts on the total number of goals and in-play forecasting (forecasts based on within-match information). Sentiment analysis techniques were used to extract the information reflected in almost two million tweets from more than 400 Premier League matches. By means of wordclouds and timely analysis of several tweet-based features, the Twitter communication over the full course of matches and shortly before and after goals was visualized and systematically analysed. Moreover, several forecasting models including a random forest model have been used to obtain in-play forecasts. Results suggest that in-play forecasting of goals is highly challenging, and in-play information does not improve forecasting accuracy. An additional analysis of goals from more than 30,000 matches from the main European football leagues supports the notion that the predictive value of in-play information is highly limited compared to pre-game information. This is a relevant result for coaches, match analysts and broadcasters who should not overestimate the value of in-play information. The present study also sheds light on how the perception and behaviour of Twitter users change over the course of a football match. A main result is that the sentiment of Twitter users decreases when the match progresses, which might be caused by an unjustified high expectation of football fans before the match. |
format | Online Article Text |
id | pubmed-8714875 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Vienna |
record_format | MEDLINE/PubMed |
spelling | pubmed-87148752021-12-29 A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football? Wunderlich, Fabian Memmert, Daniel Soc Netw Anal Min Original Article Data-related analysis in football increasingly benefits from Big Data approaches and machine learning methods. One relevant application of data analysis in football is forecasting, which relies on understanding and accurately modelling the process of a match. The present paper tackles two neglected facets of forecasting in football: Forecasts on the total number of goals and in-play forecasting (forecasts based on within-match information). Sentiment analysis techniques were used to extract the information reflected in almost two million tweets from more than 400 Premier League matches. By means of wordclouds and timely analysis of several tweet-based features, the Twitter communication over the full course of matches and shortly before and after goals was visualized and systematically analysed. Moreover, several forecasting models including a random forest model have been used to obtain in-play forecasts. Results suggest that in-play forecasting of goals is highly challenging, and in-play information does not improve forecasting accuracy. An additional analysis of goals from more than 30,000 matches from the main European football leagues supports the notion that the predictive value of in-play information is highly limited compared to pre-game information. This is a relevant result for coaches, match analysts and broadcasters who should not overestimate the value of in-play information. The present study also sheds light on how the perception and behaviour of Twitter users change over the course of a football match. A main result is that the sentiment of Twitter users decreases when the match progresses, which might be caused by an unjustified high expectation of football fans before the match. Springer Vienna 2021-12-29 2022 /pmc/articles/PMC8714875/ /pubmed/34976228 http://dx.doi.org/10.1007/s13278-021-00842-z Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Original Article Wunderlich, Fabian Memmert, Daniel A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football? |
title | A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football? |
title_full | A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football? |
title_fullStr | A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football? |
title_full_unstemmed | A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football? |
title_short | A big data analysis of Twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football? |
title_sort | big data analysis of twitter data during premier league matches: do tweets contain information valuable for in-play forecasting of goals in football? |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8714875/ https://www.ncbi.nlm.nih.gov/pubmed/34976228 http://dx.doi.org/10.1007/s13278-021-00842-z |
work_keys_str_mv | AT wunderlichfabian abigdataanalysisoftwitterdataduringpremierleaguematchesdotweetscontaininformationvaluableforinplayforecastingofgoalsinfootball AT memmertdaniel abigdataanalysisoftwitterdataduringpremierleaguematchesdotweetscontaininformationvaluableforinplayforecastingofgoalsinfootball AT wunderlichfabian bigdataanalysisoftwitterdataduringpremierleaguematchesdotweetscontaininformationvaluableforinplayforecastingofgoalsinfootball AT memmertdaniel bigdataanalysisoftwitterdataduringpremierleaguematchesdotweetscontaininformationvaluableforinplayforecastingofgoalsinfootball |