Cargando…
Weekly ILI patient ratio change prediction using news articles with support vector machine
BACKGROUND: Influenza continues to pose a serious threat to human health worldwide. For this reason, detecting influenza infection patterns is critical. However, as the epidemic spread of influenza occurs sporadically and rapidly, it is not easy to estimate the future variance of influenza virus inf...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6528302/ https://www.ncbi.nlm.nih.gov/pubmed/31109286 http://dx.doi.org/10.1186/s12859-019-2894-2 |
_version_ | 1783420187682275328 |
---|---|
author | Kim, Juhyeon Ahn, Insung |
author_facet | Kim, Juhyeon Ahn, Insung |
author_sort | Kim, Juhyeon |
collection | PubMed |
description | BACKGROUND: Influenza continues to pose a serious threat to human health worldwide. For this reason, detecting influenza infection patterns is critical. However, as the epidemic spread of influenza occurs sporadically and rapidly, it is not easy to estimate the future variance of influenza virus infection. Furthermore, accumulating influenza related data is not easy, because the type of data that is associated with influenza is very limited. For these reasons, identifying useful data and building a prediction model with these data are necessary steps toward predicting if the number of patients will increase or decrease. On the Internet, numerous press releases are published every day that reflect currently pending issues. RESULTS: In this research, we collected Internet articles related to infectious diseases from the Centre for Health Protection (CHP), which is maintained the by Hong Kong Department of Health, to see if news text data could be used to predict the spread of influenza. In total, 7769 articles related to infectious diseases published from 2004 January to 2018 January were collected. We evaluated the predictive ability of article text data from the period of 2013–2018 for each of the weekly time horizons. The support vector machine (SVM) model was used for prediction in order to examine the use of information embedded in the web articles and detect the pattern of influenza spread variance. The prediction result using news text data with SVM exhibited a mean accuracy of 86.7 % on predicting whether weekly ILI patient ratio would increase or decrease, and a root mean square error of 0.611 on estimating the weekly ILI patient ratio. CONCLUSIONS: In order to remedy the problems of conventional data, using news articles can be a suitable choice, because they can help estimate if ILI patient ratio will increase or decrease as well as how many patients will be affected, as shown in the result of research. Thus, advancements in research on using news articles for influenza prediction should continue to be pursed, as the result showed acceptable performance as compared to existing influenza prediction researches. |
format | Online Article Text |
id | pubmed-6528302 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-65283022019-05-28 Weekly ILI patient ratio change prediction using news articles with support vector machine Kim, Juhyeon Ahn, Insung BMC Bioinformatics Research Article BACKGROUND: Influenza continues to pose a serious threat to human health worldwide. For this reason, detecting influenza infection patterns is critical. However, as the epidemic spread of influenza occurs sporadically and rapidly, it is not easy to estimate the future variance of influenza virus infection. Furthermore, accumulating influenza related data is not easy, because the type of data that is associated with influenza is very limited. For these reasons, identifying useful data and building a prediction model with these data are necessary steps toward predicting if the number of patients will increase or decrease. On the Internet, numerous press releases are published every day that reflect currently pending issues. RESULTS: In this research, we collected Internet articles related to infectious diseases from the Centre for Health Protection (CHP), which is maintained the by Hong Kong Department of Health, to see if news text data could be used to predict the spread of influenza. In total, 7769 articles related to infectious diseases published from 2004 January to 2018 January were collected. We evaluated the predictive ability of article text data from the period of 2013–2018 for each of the weekly time horizons. The support vector machine (SVM) model was used for prediction in order to examine the use of information embedded in the web articles and detect the pattern of influenza spread variance. The prediction result using news text data with SVM exhibited a mean accuracy of 86.7 % on predicting whether weekly ILI patient ratio would increase or decrease, and a root mean square error of 0.611 on estimating the weekly ILI patient ratio. CONCLUSIONS: In order to remedy the problems of conventional data, using news articles can be a suitable choice, because they can help estimate if ILI patient ratio will increase or decrease as well as how many patients will be affected, as shown in the result of research. Thus, advancements in research on using news articles for influenza prediction should continue to be pursed, as the result showed acceptable performance as compared to existing influenza prediction researches. BioMed Central 2019-05-20 /pmc/articles/PMC6528302/ /pubmed/31109286 http://dx.doi.org/10.1186/s12859-019-2894-2 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Kim, Juhyeon Ahn, Insung Weekly ILI patient ratio change prediction using news articles with support vector machine |
title | Weekly ILI patient ratio change prediction using news articles with support vector machine |
title_full | Weekly ILI patient ratio change prediction using news articles with support vector machine |
title_fullStr | Weekly ILI patient ratio change prediction using news articles with support vector machine |
title_full_unstemmed | Weekly ILI patient ratio change prediction using news articles with support vector machine |
title_short | Weekly ILI patient ratio change prediction using news articles with support vector machine |
title_sort | weekly ili patient ratio change prediction using news articles with support vector machine |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6528302/ https://www.ncbi.nlm.nih.gov/pubmed/31109286 http://dx.doi.org/10.1186/s12859-019-2894-2 |
work_keys_str_mv | AT kimjuhyeon weeklyilipatientratiochangepredictionusingnewsarticleswithsupportvectormachine AT ahninsung weeklyilipatientratiochangepredictionusingnewsarticleswithsupportvectormachine |