Paper Unlock

Authors: Harshavardhan Achrekar 1 ; Avinash Gandhe 2 ; Ross Lazarus 3 ; Ssu-Hsin Yu 2 and Benyuan Liu 1

Affiliations: 1 University of Massachusetts Lowell, United States ; 2 Scientific Systems Company Inc, United States ; 3 Harvard Medical School, United States

ISBN: 978-989-8425-88-1

Keyword(s): Flu trends, Online social networks, Prediction.

Related Ontology Subjects/Areas/Topics: Artificial Intelligence ; Biomedical Engineering ; Business Analytics ; Data Engineering ; Data Mining ; Databases and Information Systems Integration ; Datamining ; e-Health for Public Health ; Enterprise Information Systems ; Health Information Systems ; Pattern Recognition and Machine Learning ; Sensor Networks ; Signal Processing ; Soft Computing

Abstract: Seasonal influenza epidemics causes severe illnesses and 250,000 to 500,000 deaths worldwide each year. Other pandemics like the 1918 “Spanish Flu” may change into a devastating one. Reducing the impact of these threats is of paramount importance for health authorities, and studies have shown that effective interventions can be taken to contain the epidemics, if early detection can be made. In this paper, we introduce the Social Network Enabled Flu Trends (SNEFT), a continuous data collection framework which monitors flu related tweets and track the emergence and spread of an influenza. We show that text mining significantly enhances the correlation between the Twitter and the Influenza like Illness (ILI) rates provided by Centers for Disease Control and Prevention (CDC). For accurate prediction, we implemented an auto-regression with exogenous input (ARX) model which uses current Twitter data, and CDC ILI rates from previous weeks to predict current influenza statistics. Our results show that, while previous ILI data from CDC offer a true (but delayed) assessment of a flu epidemic, Twitter data provides a real-time assessment of the current epidemic condition and can be used to compensate for the lack of current ILI data. We observe that the Twitter data is highly correlated with the ILI rates across different regions within USA and can be used to effectively improve the accuracy of our prediction. Our age-based flu prediction analysis indicates that for most of the regions, Twitter data best fit the age groups of 5-24 and 25-49 years, correlating well with the fact that these are likely, the most active user age groups on Twitter. Therefore, Twitter data can act as supplementary indicator to gauge influenza within a population and helps discovering flu trends ahead of CDC. (More)

PDF ImageFull Text

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Achrekar H., Gandhe A., Lazarus R., Yu S. and Liu B. (2012). TWITTER IMPROVES SEASONAL INFLUENZA PREDICTION.In Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2012) ISBN 978-989-8425-88-1, pages 61-70. DOI: 10.5220/0003780600610070

author={Harshavardhan Achrekar and Avinash Gandhe and Ross Lazarus and Ssu-Hsin Yu and Benyuan Liu},
booktitle={Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2012)},


JO - Proceedings of the International Conference on Health Informatics - Volume 1: HEALTHINF, (BIOSTEC 2012)
SN - 978-989-8425-88-1
AU - Achrekar H.
AU - Gandhe A.
AU - Lazarus R.
AU - Yu S.
AU - Liu B.
PY - 2012
SP - 61
EP - 70
DO - 10.5220/0003780600610070

Login or register to post comments.

Comments on this Paper: Be the first to review this paper.