Cargando…

Weekly ILI patient ratio change prediction using news articles with support vector machine

BACKGROUND: Influenza continues to pose a serious threat to human health worldwide. For this reason, detecting influenza infection patterns is critical. However, as the epidemic spread of influenza occurs sporadically and rapidly, it is not easy to estimate the future variance of influenza virus inf...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Juhyeon, Ahn, Insung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6528302/
https://www.ncbi.nlm.nih.gov/pubmed/31109286
http://dx.doi.org/10.1186/s12859-019-2894-2
_version_ 1783420187682275328
author Kim, Juhyeon
Ahn, Insung
author_facet Kim, Juhyeon
Ahn, Insung
author_sort Kim, Juhyeon
collection PubMed
description BACKGROUND: Influenza continues to pose a serious threat to human health worldwide. For this reason, detecting influenza infection patterns is critical. However, as the epidemic spread of influenza occurs sporadically and rapidly, it is not easy to estimate the future variance of influenza virus infection. Furthermore, accumulating influenza related data is not easy, because the type of data that is associated with influenza is very limited. For these reasons, identifying useful data and building a prediction model with these data are necessary steps toward predicting if the number of patients will increase or decrease. On the Internet, numerous press releases are published every day that reflect currently pending issues. RESULTS: In this research, we collected Internet articles related to infectious diseases from the Centre for Health Protection (CHP), which is maintained the by Hong Kong Department of Health, to see if news text data could be used to predict the spread of influenza. In total, 7769 articles related to infectious diseases published from 2004 January to 2018 January were collected. We evaluated the predictive ability of article text data from the period of 2013–2018 for each of the weekly time horizons. The support vector machine (SVM) model was used for prediction in order to examine the use of information embedded in the web articles and detect the pattern of influenza spread variance. The prediction result using news text data with SVM exhibited a mean accuracy of 86.7 % on predicting whether weekly ILI patient ratio would increase or decrease, and a root mean square error of 0.611 on estimating the weekly ILI patient ratio. CONCLUSIONS: In order to remedy the problems of conventional data, using news articles can be a suitable choice, because they can help estimate if ILI patient ratio will increase or decrease as well as how many patients will be affected, as shown in the result of research. Thus, advancements in research on using news articles for influenza prediction should continue to be pursed, as the result showed acceptable performance as compared to existing influenza prediction researches.
format Online
Article
Text
id pubmed-6528302
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65283022019-05-28 Weekly ILI patient ratio change prediction using news articles with support vector machine Kim, Juhyeon Ahn, Insung BMC Bioinformatics Research Article BACKGROUND: Influenza continues to pose a serious threat to human health worldwide. For this reason, detecting influenza infection patterns is critical. However, as the epidemic spread of influenza occurs sporadically and rapidly, it is not easy to estimate the future variance of influenza virus infection. Furthermore, accumulating influenza related data is not easy, because the type of data that is associated with influenza is very limited. For these reasons, identifying useful data and building a prediction model with these data are necessary steps toward predicting if the number of patients will increase or decrease. On the Internet, numerous press releases are published every day that reflect currently pending issues. RESULTS: In this research, we collected Internet articles related to infectious diseases from the Centre for Health Protection (CHP), which is maintained the by Hong Kong Department of Health, to see if news text data could be used to predict the spread of influenza. In total, 7769 articles related to infectious diseases published from 2004 January to 2018 January were collected. We evaluated the predictive ability of article text data from the period of 2013–2018 for each of the weekly time horizons. The support vector machine (SVM) model was used for prediction in order to examine the use of information embedded in the web articles and detect the pattern of influenza spread variance. The prediction result using news text data with SVM exhibited a mean accuracy of 86.7 % on predicting whether weekly ILI patient ratio would increase or decrease, and a root mean square error of 0.611 on estimating the weekly ILI patient ratio. CONCLUSIONS: In order to remedy the problems of conventional data, using news articles can be a suitable choice, because they can help estimate if ILI patient ratio will increase or decrease as well as how many patients will be affected, as shown in the result of research. Thus, advancements in research on using news articles for influenza prediction should continue to be pursed, as the result showed acceptable performance as compared to existing influenza prediction researches. BioMed Central 2019-05-20 /pmc/articles/PMC6528302/ /pubmed/31109286 http://dx.doi.org/10.1186/s12859-019-2894-2 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Kim, Juhyeon
Ahn, Insung
Weekly ILI patient ratio change prediction using news articles with support vector machine
title Weekly ILI patient ratio change prediction using news articles with support vector machine
title_full Weekly ILI patient ratio change prediction using news articles with support vector machine
title_fullStr Weekly ILI patient ratio change prediction using news articles with support vector machine
title_full_unstemmed Weekly ILI patient ratio change prediction using news articles with support vector machine
title_short Weekly ILI patient ratio change prediction using news articles with support vector machine
title_sort weekly ili patient ratio change prediction using news articles with support vector machine
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6528302/
https://www.ncbi.nlm.nih.gov/pubmed/31109286
http://dx.doi.org/10.1186/s12859-019-2894-2
work_keys_str_mv AT kimjuhyeon weeklyilipatientratiochangepredictionusingnewsarticleswithsupportvectormachine
AT ahninsung weeklyilipatientratiochangepredictionusingnewsarticleswithsupportvectormachine