學術產出-Proceedings

Article View/Open

Publication Export

Google ScholarTM

政大圖書館

Citation Infomation

  • No doi shows Citation Infomation
題名 A study of machine learning models in epidemic surveillance: Using the query logs of search engines
作者 Fang, Z.-H.;Tzeng, J.-S.;Chen, C.C.;Chou, Tzuchuan
周子全
貢獻者 國發所
關鍵詞 Classification models; Data sets; Economic damages; Machine-learning; Query logs; Research issues; Text mining; Classification (of information); Epidemiology; Health care; Information retrieval; Information systems; Learning systems; Monitoring; Query processing; Search engines
日期 2010
上傳時間 20-May-2015 17:05:56 (UTC+8)
摘要 Epidemics inevitably result in a large number of deaths and always cause considerable social and economic damage. Epidemic surveillance has thus become an important healthcare research issue. In 2009, Ginsberg et al. observed that the query logs of search engines can be used to estimate the status of epidemics in a timely manner. In this paper, we model epidemic surveillance as a classification problem and employ query statistics from Google to classify the status of a dengue fever epidemic. The query logs of twenty-three dengue-related keywords serve as observations for machine learning and testing, and a number of machine learning models are investigated to evaluate their surveillance performance. Evaluations based on a 5-year real world dataset demonstrate that search engine query logs can be used to construct accurate epidemic status classifiers. Moreover, the learned classifiers generally outperform conventional regression approaches. We also apply various machine learning models, including generative, discriminative, sequential, and non-sequential classification models, to demonstrate their applicability to epidemic surveillance.
關聯 PACIS 2010 - 14th Pacific Asia Conference on Information Systems
資料類型 conference
dc.contributor 國發所
dc.creator (作者) Fang, Z.-H.;Tzeng, J.-S.;Chen, C.C.;Chou, Tzuchuan
dc.creator (作者) 周子全zh_TW
dc.date (日期) 2010
dc.date.accessioned 20-May-2015 17:05:56 (UTC+8)-
dc.date.available 20-May-2015 17:05:56 (UTC+8)-
dc.date.issued (上傳時間) 20-May-2015 17:05:56 (UTC+8)-
dc.identifier.uri (URI) http://nccur.lib.nccu.edu.tw/handle/140.119/75204-
dc.description.abstract (摘要) Epidemics inevitably result in a large number of deaths and always cause considerable social and economic damage. Epidemic surveillance has thus become an important healthcare research issue. In 2009, Ginsberg et al. observed that the query logs of search engines can be used to estimate the status of epidemics in a timely manner. In this paper, we model epidemic surveillance as a classification problem and employ query statistics from Google to classify the status of a dengue fever epidemic. The query logs of twenty-three dengue-related keywords serve as observations for machine learning and testing, and a number of machine learning models are investigated to evaluate their surveillance performance. Evaluations based on a 5-year real world dataset demonstrate that search engine query logs can be used to construct accurate epidemic status classifiers. Moreover, the learned classifiers generally outperform conventional regression approaches. We also apply various machine learning models, including generative, discriminative, sequential, and non-sequential classification models, to demonstrate their applicability to epidemic surveillance.
dc.format.extent 176 bytes-
dc.format.mimetype text/html-
dc.relation (關聯) PACIS 2010 - 14th Pacific Asia Conference on Information Systems
dc.subject (關鍵詞) Classification models; Data sets; Economic damages; Machine-learning; Query logs; Research issues; Text mining; Classification (of information); Epidemiology; Health care; Information retrieval; Information systems; Learning systems; Monitoring; Query processing; Search engines
dc.title (題名) A study of machine learning models in epidemic surveillance: Using the query logs of search engines
dc.type (資料類型) conferenceen