
A machine learning method to monitor China’s AIDS epidemics with data from Baidu Trends

Source: NIAID Data Ecosystem (indexed 2026-03-10)
Download link:
http://datadryad.org/dataset/doi%253A10.5061%252Fdryad.f45s8
Resource description:
Background: AIDS victims’ unwillingness to report their disease, due to social discrimination against them, makes it hard for disease control departments to accurately monitor the disease’s dynamics through traditional surveillance tools, such as over-the-counter drug sales and hospital or self-reported data. With the diffusion and adoption of the Internet, the ‘big data’ aggregated from Internet search engines, which contain users’ information on the concern or reality of their health status, provide a new opportunity for AIDS surveillance. This paper uses search engine data to monitor and forecast AIDS in China.

Methods: A machine learning method, artificial neural networks (ANNs), is used to forecast AIDS occurrences and deaths. Search trend data related to AIDS from the largest Chinese search engine, Baidu.com, are collected and selected as the input variables of ANNs, and officially reported actual AIDS occurrences and deaths are used for the output variable. Three criteria, the mean absolute percentage error, the root mean squared percentage error, and the index of agreement, are used to test the forecasting performance of the ANN method.

Results: Based on the monthly time-series data from January 2011 to June 2017, this article finds that, under three criteria, the ANN method can lead to satisfactory forecasting of AIDS occurrences and deaths, regardless of the change of the number of search queries.

Conclusions: Internet-based data should be adopted as a real-time, cost-effective complement to a traditional AIDS surveillance system.
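The three evaluation criteria named in the Methods can be illustrated with a minimal Python sketch. The description does not give the formulas, so the code assumes the commonly used definitions: MAPE and RMSPE expressed in percent, and Willmott's form of the index of agreement. The function names and the sample numbers are hypothetical and are not taken from the dataset.

```python
import math


def mape(actual, predicted):
    """Mean absolute percentage error, in percent (assumed definition)."""
    n = len(actual)
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / n


def rmspe(actual, predicted):
    """Root mean squared percentage error, in percent (assumed definition)."""
    n = len(actual)
    return 100.0 * math.sqrt(
        sum(((a - p) / a) ** 2 for a, p in zip(actual, predicted)) / n
    )


def index_of_agreement(actual, predicted):
    """Index of agreement in Willmott's form (assumed); 1.0 is a perfect match."""
    mean_actual = sum(actual) / len(actual)
    squared_error = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    potential_error = sum(
        (abs(p - mean_actual) + abs(a - mean_actual)) ** 2
        for a, p in zip(actual, predicted)
    )
    return 1.0 - squared_error / potential_error


# Made-up monthly counts, for illustration only (not values from the dataset).
reported = [100.0, 200.0, 400.0]
forecast = [110.0, 190.0, 400.0]

scores = {
    "MAPE (%)": mape(reported, forecast),
    "RMSPE (%)": rmspe(reported, forecast),
    "Index of agreement": index_of_agreement(reported, forecast),
}
```

Lower MAPE and RMSPE indicate a closer forecast, while an index of agreement closer to 1 indicates better agreement. Both percentage errors divide by the reported value, so months with a reported count of zero would need separate handling.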
Created:
2019-01-04