Cargando…

A machine learning method to monitor China’s AIDS epidemics with data from Baidu trends

BACKGROUND: AIDS is a worrying public health issue in China and lacks timely and effective surveillance. With the diffusion and adoption of the Internet, the ‘big data’ aggregated from Internet search engines, which contain users’ information on the concern or reality of their health status, provide...

Descripción completa

Detalles Bibliográficos
Autores principales: Nan, Yongqing, Gao, Yanyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6040727/
https://www.ncbi.nlm.nih.gov/pubmed/29995920
http://dx.doi.org/10.1371/journal.pone.0199697
_version_ 1783338881369767936
author Nan, Yongqing
Gao, Yanyan
author_facet Nan, Yongqing
Gao, Yanyan
author_sort Nan, Yongqing
collection PubMed
description BACKGROUND: AIDS is a worrying public health issue in China and lacks timely and effective surveillance. With the diffusion and adoption of the Internet, the ‘big data’ aggregated from Internet search engines, which contain users’ information on the concern or reality of their health status, provide a new opportunity for AIDS surveillance. This paper uses search engine data to monitor and forecast AIDS in China. METHODS: A machine learning method, artificial neural networks (ANNs), is used to forecast AIDS incidences and deaths. Search trend data related to AIDS from the largest Chinese search engine, Baidu.com, are collected and selected as the input variables of ANNs, and officially reported actual AIDS incidences and deaths are used as the output variable. Three criteria, the mean absolute percentage error, the root mean squared percentage error, and the index of agreement, are used to test the forecasting performance of the ANN method. RESULTS: Based on the monthly time series data from January 2011 to June 2017, this article finds that, under the three criteria, the ANN method can lead to satisfactory forecasting of AIDS incidences and deaths, regardless of the change in the number of search queries. CONCLUSIONS: Despite the inability to self-detect HIV/AIDS through online searching, Internet-based data should be adopted as a timely, cost-effective complement to a traditional AIDS surveillance system.
format Online
Article
Text
id pubmed-6040727
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60407272018-07-19 A machine learning method to monitor China’s AIDS epidemics with data from Baidu trends Nan, Yongqing Gao, Yanyan PLoS One Research Article BACKGROUND: AIDS is a worrying public health issue in China and lacks timely and effective surveillance. With the diffusion and adoption of the Internet, the ‘big data’ aggregated from Internet search engines, which contain users’ information on the concern or reality of their health status, provide a new opportunity for AIDS surveillance. This paper uses search engine data to monitor and forecast AIDS in China. METHODS: A machine learning method, artificial neural networks (ANNs), is used to forecast AIDS incidences and deaths. Search trend data related to AIDS from the largest Chinese search engine, Baidu.com, are collected and selected as the input variables of ANNs, and officially reported actual AIDS incidences and deaths are used as the output variable. Three criteria, the mean absolute percentage error, the root mean squared percentage error, and the index of agreement, are used to test the forecasting performance of the ANN method. RESULTS: Based on the monthly time series data from January 2011 to June 2017, this article finds that, under the three criteria, the ANN method can lead to satisfactory forecasting of AIDS incidences and deaths, regardless of the change in the number of search queries. CONCLUSIONS: Despite the inability to self-detect HIV/AIDS through online searching, Internet-based data should be adopted as a timely, cost-effective complement to a traditional AIDS surveillance system. Public Library of Science 2018-07-11 /pmc/articles/PMC6040727/ /pubmed/29995920 http://dx.doi.org/10.1371/journal.pone.0199697 Text en © 2018 Nan, Gao http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Nan, Yongqing
Gao, Yanyan
A machine learning method to monitor China’s AIDS epidemics with data from Baidu trends
title A machine learning method to monitor China’s AIDS epidemics with data from Baidu trends
title_full A machine learning method to monitor China’s AIDS epidemics with data from Baidu trends
title_fullStr A machine learning method to monitor China’s AIDS epidemics with data from Baidu trends
title_full_unstemmed A machine learning method to monitor China’s AIDS epidemics with data from Baidu trends
title_short A machine learning method to monitor China’s AIDS epidemics with data from Baidu trends
title_sort machine learning method to monitor china’s aids epidemics with data from baidu trends
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6040727/
https://www.ncbi.nlm.nih.gov/pubmed/29995920
http://dx.doi.org/10.1371/journal.pone.0199697
work_keys_str_mv AT nanyongqing amachinelearningmethodtomonitorchinasaidsepidemicswithdatafrombaidutrends
AT gaoyanyan amachinelearningmethodtomonitorchinasaidsepidemicswithdatafrombaidutrends
AT nanyongqing machinelearningmethodtomonitorchinasaidsepidemicswithdatafrombaidutrends
AT gaoyanyan machinelearningmethodtomonitorchinasaidsepidemicswithdatafrombaidutrends