Forecasting Influenza Dynamics with Neural Networks using...
Transcript of Forecasting Influenza Dynamics with Neural Networks using...
Forecasting Influenza Dynamics with Neural
Networks using Signals from Social Media
Ellyn Ayton and Svitlana Volkova
File
Nam
e //
File
Dat
e //
PN
NL-
SA-#
####
ABOUT Pacific Northwest National Laboratory The Pacific Northwest National Laboratory, located in southeastern Washington State, is a U.S. Department of Energy Office of Science laboratory that solves complex problems in energy, national security, and the environment, and advances scientific frontiers in the chemical, biological, materials, environmental, and computational sciences. The Laboratory employs nearly 5,000 staff members, has an annual budget in excess of $1 billion, and has been managed by Ohio-based Battelle since 1965. For more information on the science you see here, please contact: Svitlana Volkova Pacific Northwest National Laboratory Richland, WA 99352 (509) 372-6585 [email protected]
Motivation
• 500, 000 deaths worldwide from influenza• Influenza monitoring data is 1-2 weeks old when released•Utilize social media signals as source to track the severityand spread of the influenza
Contributions:•Contrasted the predictive power of social media signals•Experimented with location-specific vs.location-independent models
•Compared neural network performance to thestate-of-the-art machine learning models
Data
L14 i2 L3 L27
L20
i25
L28
L11
L25
L10 L4 L29 L0 L23
L12
L30
L33
L15
L13
L31 i3
L32
L22
L21
i27
L34
L37
L16
i20
L19
i17
Locations
0.0e+00
5.0e+06
1.0e+07
1.5e+07
The number of tweets collected within a 25-mile radius for 31 geolocations
ILI-Related Clinical Visit Data
3 × 10−2
6 × 10−2
9 × 10−2
0 50 100
150
200
Week
ILI
LocationL10
L12
L13
L15
L22
L23
Weekly ILI proportion dynamics between 2011 and 2014 for six example geo-locations
ILI =# of weekly ILI visits per location# of total weekly visits per location
Predictors
• ILI: ILI proportions• Tweets: Unigrams, bigrams, trigrams, and TFIDF• Network: Hashtags and mentions as tweet ngrams• Topics: LDA topic representations• Embeddings: Pre-trained embeddings using Word2Vec• Stylistic: Proportion of emotions, ellongations,mentions, URLs, RTs, capitalization etc.
• Syntactic: Part-of-speech tags
Models
Baselines: AdaBoost with Decision Trees and SupportVector Machines with Linear and RBF kernels.LSTM: A one-layer Long Short-Term Memory (LSTM)neural network for regression and evaluated using 4-fold crossvalidation. LSTMs capture long term dependencies.
it = σ(Wixt + Uiht−1 + bi)ct = tanh(Wcxt + Ucht−1 + bc)ft = σ(Wfxf + Ufht−1 + bf)Ct = it ∗ ct + ft ∗ Ct−1ot = σ(Woxt + Uoht−1 + VoCt + bo)ht = ot ∗ tanh(Ct)
.4….1
.3….3
.3….1
.3….3
LSTM layer
Fully connected
layer
SM predictorstweets,
network, embeddings
t0
Predicted weekly ILI proportions
}}
.03
.03 .01 .02 .05 ILI predictorst1 t3 t4
Nowcasting: Predicting this week’s ILI value using 4weeks of past data.Forecasting: Predicting the ILI values for next week andin two weeks using 4 weeks of past data.
Evaluation Metrics
Pearson Correlation – the linear dependence be-tween the predicted and observed values:
r =∑ni=1(Yti − Y ′ti)(Yti − Y ′ti))
√√√√√∑ni=1(Yti − Y ′ti)
2√√√√√√∑ni=1(Yti − Y ′ti)
2
Root Mean Squared Error (RMSE) – the dif-ference between the predicted and observed values:
RMSE =√√√√√√√√√√√√1n
n∑i=1
(Yti − Yti)2
RMSPE: root mean squared percent errorMAPE: maximum absolute percent error
Experimental ResultsTrue vs. predicted ILI dynamics as a function of time
ILI nowcasting using social media signals
Summary
•LSTM learned from SM datayield the best performance
•Text embeddings, tweet andnetwork signals are predictive
•Location-specific modelsoutperform location-independent
Future Work:•Combine Twitter + ILI signals•Run experiments for 31 locations***This is a joint work with Dr.Courtney D. Corley