Influence of External Factors on the Price of a Survival Minimum … · 2020-04-19 · Influence of...

3
Influence of External Factors on the Price of a Survival Minimum Expenditure Basket in Syria Damian Durrer ETH Zurich [email protected] Dominique Heyn ETH Zurich [email protected] Heiko Kromer ETH Zurich [email protected] Jonathan Mendieta University of Geneva, ETH Zurich [email protected] 1 INTRODUCTION One approach of providing humanitarian aid is in the form of cash- based assistance. Organisations such as the European Commission actively promote this approach [3]. To enable ecient distribution of funds, IMPACT Initiatives conducts a monthly market research in Syria to compute the price of a Survival Minimum Expenditure Basket (SMEB), that "represents the minimum [...] items required to support a 6-person household for a month" [4]. These items vary by season and include food- and non-food items. In this work, the inuence of external factors on the price of the SMEB in Syria were analyzed. 2 DATA Several sources of data provided inputs for the analysis: Data on the SMEB prices were assembled by IMPACT Ini- tiatives as part of the REACH initiative [4]. The price infor- mation on SMEB items is gathered by local informants in at least three towns per sub-district. This analysis is based on data from October 2017 to August 2019 and aggregated by month and sub-district. Missing data points of up to three consecutive months in a particular sub-district were linearly interpolated. Further, to avoid over-representation of certain months in the analysis, all sub-districts with data for less than 15 months (after interpolation) were excluded. This nally left the data-set with 646 unique SMEB prices from 32 sub-districts over 22 months. To evaluate the relationship between war activity and the SMEB price, a curated data-set published by the Armed Con- ict Location & Event Data Project (ACLED) was used [7]. This data contains information of dierent types war-related events like battles, riots, strategic developments, etc. for many countries. For this analysis, war activity in a sub- district was measured as the total count of all war events per month. Statistics used as predictors to identify Syrian sub-districts at risk of a large change in the SMEB price are based on the events-table in the GDELT database [5]. This data-set is updated every 15 minutes and thus allows for classication in near real-time. Every instance of this data-set represents an event in Syria between 2017 and 2019. The features of an event consist of, among others, the Goldstein Scale of an event, i.e. the potential of the event to increase or de- crease the stability of the country, the number of times the event was mentioned in news articles, and the average tone of the coverage. To aggregate the statistics by month and sub-district, these features were analyzed by mean, median, minimum, maximum, 10th and 90th percentile. Furthermore, the statistics were analyzed for the product of some features, such as the goldstein scale weighted by the number of men- tions, and statistics such as the total number of events were added. The nal selection of statistics was made by itera- tively discarding statistics that have a vif factor above 10 (resp. 100), i.e. statistics that are correlated with a subset of the remaining statistics. To analyze the correlation between the price of the SMEB and the number of displaced people in a sub-district, data from REACH Resource Centre was used [2]. This data contains an estimate of the number of displaced people per month and town. Here, the towns and months only partially agree with the towns and months of the data-set on the SMEB prices. 3 DATA EXPLORATION The SMEB price is correlated to the local exchange rate between the US Dollar (USD) and the Syrian Pound (SYP). While the ocial exchange rate has been pegged at 515 SYP per USD since July 2017 [1], the price of the USD on informal local markets moves with the SMEB price. Visual analysis suggests that periods of elevated levels of war activity, measured by the total number of reported war events [7], coincide with changes in long-term trends in the SMEB price and might be depending on the trend of the USD/SYP exchange rate (Figures 1, 2 and 3). Attempts to preform a multiple linear regression to predict the change in the SMEB price based on previous changes in the SMEB price, previous changes in the USD/SYP exchange rate and current and past levels of war activity were not successful. Some stepwise attribute selection algorithms suggest that changes in the (normal- ized) number of war events and their interaction with previous SMEB price changes improve the t of the baseline model, which only includes lagged values of the SMEB price change and the local USD/SYP currency rate change (evaluated by the Bayesian Informa- tion Criterion). Yet, assessing the models via k-fold cross validation, where each fold corresponds to one month, no model outperformed

Transcript of Influence of External Factors on the Price of a Survival Minimum … · 2020-04-19 · Influence of...

Page 1: Influence of External Factors on the Price of a Survival Minimum … · 2020-04-19 · Influence of External Factors on the Price of a Survival Minimum Expenditure Basket in Syria

Influence of External Factors on the Price of a SurvivalMinimum Expenditure Basket in Syria

Damian DurrerETH Zurich

[email protected]

Dominique HeynETH Zurich

[email protected]

Heiko KromerETH Zurich

[email protected]

Jonathan MendietaUniversity of Geneva, ETH [email protected]

1 INTRODUCTIONOne approach of providing humanitarian aid is in the form of cash-based assistance. Organisations such as the European Commissionactively promote this approach [3]. To enable e�cient distributionof funds, IMPACT Initiatives conducts a monthly market researchin Syria to compute the price of a Survival Minimum ExpenditureBasket (SMEB), that "represents the minimum [...] items requiredto support a 6-person household for a month" [4]. These items varyby season and include food- and non-food items.

In this work, the in�uence of external factors on the price of theSMEB in Syria were analyzed.

2 DATASeveral sources of data provided inputs for the analysis:

• Data on the SMEB prices were assembled by IMPACT Ini-tiatives as part of the REACH initiative [4]. The price infor-mation on SMEB items is gathered by local informants in atleast three towns per sub-district. This analysis is based ondata from October 2017 to August 2019 and aggregated bymonth and sub-district. Missing data points of up to threeconsecutive months in a particular sub-district were linearlyinterpolated. Further, to avoid over-representation of certainmonths in the analysis, all sub-districts with data for lessthan 15 months (after interpolation) were excluded. This�nally left the data-set with 646 unique SMEB prices from32 sub-districts over 22 months.

• To evaluate the relationship between war activity and theSMEB price, a curated data-set published by the Armed Con-�ict Location & Event Data Project (ACLED) was used [7].This data contains information of di�erent types war-relatedevents like battles, riots, strategic developments, etc. formany countries. For this analysis, war activity in a sub-district was measured as the total count of all war events permonth.

• Statistics used as predictors to identify Syrian sub-districtsat risk of a large change in the SMEB price are based onthe events-table in the GDELT database [5]. This data-set isupdated every 15 minutes and thus allows for classi�cationin near real-time. Every instance of this data-set representsan event in Syria between 2017 and 2019. The features ofan event consist of, among others, the Goldstein Scale of

an event, i.e. the potential of the event to increase or de-crease the stability of the country, the number of times theevent was mentioned in news articles, and the average toneof the coverage. To aggregate the statistics by month andsub-district, these features were analyzed by mean, median,minimum, maximum, 10th and 90th percentile. Furthermore,the statistics were analyzed for the product of some features,such as the goldstein scale weighted by the number of men-tions, and statistics such as the total number of events wereadded. The �nal selection of statistics was made by itera-tively discarding statistics that have a vif factor above 10(resp. 100), i.e. statistics that are correlated with a subset ofthe remaining statistics.

• To analyze the correlation between the price of the SMEB andthe number of displaced people in a sub-district, data fromREACH Resource Centre was used [2]. This data contains anestimate of the number of displaced people per month andtown. Here, the towns and months only partially agree withthe towns and months of the data-set on the SMEB prices.

3 DATA EXPLORATIONThe SMEB price is correlated to the local exchange rate betweenthe US Dollar (USD) and the Syrian Pound (SYP). While the o�cialexchange rate has been pegged at 515 SYP per USD since July 2017[1], the price of the USD on informal local markets moves with theSMEB price.

Visual analysis suggests that periods of elevated levels of waractivity, measured by the total number of reported war events [7],coincide with changes in long-term trends in the SMEB price andmight be depending on the trend of the USD/SYP exchange rate(Figures 1, 2 and 3).

Attempts to preform a multiple linear regression to predict thechange in the SMEB price based on previous changes in the SMEBprice, previous changes in the USD/SYP exchange rate and currentand past levels of war activity were not successful. Some stepwiseattribute selection algorithms suggest that changes in the (normal-ized) number of war events and their interaction with previousSMEB price changes improve the �t of the baseline model, whichonly includes lagged values of the SMEB price change and the localUSD/SYP currency rate change (evaluated by the Bayesian Informa-tion Criterion). Yet, assessing the models via k-fold cross validation,where each fold corresponds to one month, no model outperformed

Page 2: Influence of External Factors on the Price of a Survival Minimum … · 2020-04-19 · Influence of External Factors on the Price of a Survival Minimum Expenditure Basket in Syria

Damian Durrer, Dominique Heyn, Heiko Kromer, and Jonathan Mendieta

Figure 1: Normalized number of war events per Syrian sub-district between November 2017 and August 2019. Solid line: Mean(smoothed), Dashed line: 80% quantile (smoothed), Red background:Periods with increased levels of war activity

Figure 2: SMEB price (excl. water) in SYP per Syrian sub-districtbetween November 2017 and August 2019. Red background: Periodswith increased levels of war activity

the baseline of always predicting a change of zero (evaluated bythe mean squared error).

4 SPATIAL MAPPINGGiven the fact that IMPACT [4] and GDELT [5] data-sets shared noexplicit common feature for a posterior data-sets merging, an addi-tional pre-processing step on the GDELT data-set was implemented,in order to generate a feature that could satisfy the merging con-dition. Moreover, implicit features such as geo-coordinates whichare associated on each event, could be adopted in an intermediatestep to ful�ll the conditioning.

A spatial-mapping procedure on the geo-coordinates featureswithin the GDELT data-set was applied. The procedure consisted inallocating the events into the corresponding sub-district of Syria byconsidering the information contained within the geo-coordinates.

Figure 3: Price for buying USD with SYP in local markets perSyrian sub-district between November 2017 and August 2019. Redbackground: Periods with increased levels of war activity

Figure 4: Average sentiment score of news for di�erent sub-districts of Syria for one month

If one point was overlayed by a polygon representing a Syria’ssub-district, then the event associated with that point was labeledwith the name of the sub-district. If a particluar point could not beclassi�ed because it was not overlayed by any sub-district polygon,a nearest neighbor approach was used as an alternative.

The spatial information (e.g. polygons) was obtained from aSyria’s "Shape�le" [6]. The result of this pre-processing step wasa GDELT data-set where each news event is mapped to a uniqueSyrian sub-district. Figure 4 depicts an instance of the news’ impacton a sub-district level. This allows tomerge the GDELT and IMPACTdata-sets on the common features of the sub-district name and thedate �elds.

5 MODELA multi-class classi�er for the IMPACT-GDELT data-set was pro-posed, in order To identify sub-districts at risk of a signi�cantchange in SMEB price. Particularly, it was tested whether themodel �t of a baseline model relying exclusively on IMPACT datais improved by adding external factors from GDELT. A "signi�cant

Page 3: Influence of External Factors on the Price of a Survival Minimum … · 2020-04-19 · Influence of External Factors on the Price of a Survival Minimum Expenditure Basket in Syria

Influence of External Factors on the Price of a Survival Minimum Expenditure Basket in Syria

change" is de�ned as a change in the SMEB price of at least 10%(three classes: increase, decrease, stable)

5.1 Random Forest Classi�erAfter various attempts with di�erent classi�ers (e.g. SVM, Logistic-regression), a Random Forest classi�er with parameters of numberof trees of 500 and class balanced weights returned the best solutionin terms of balanced accuracy score and interpretability. The latteris commonly materialized within the feature importance plot thatcomes for free on Random Forests.

A baseline Random Forest model was built using only the IM-PACT data-set. Speci�cally, the local USD/SYP exchange rate anda seasonality factor were selected. The balanced accuracy fromthe baseline model was of 0.4322. Adding statistics extracted fromthe GDELT data-set as predictors the accuracy improved to 0.4549.While the impact on the predictive power is relatively small, theVariable Importance Plot (Figure 5 shows that the external factorslessen the importance of the exchange rate between the USD andthe SYP. This implies that some information on the exchange ratemight be contained in the external factors.

(a) Baseline model (b) External factors model

Figure 5: Variable Importance Plot from Random Forest Classi�er

6 OTHER ATTEMPTS6.1 Displacement DataThe correlation analysis of the displacement data [2] lead to incon-clusive results. While the relative change in the upper estimate ofdisplaced people strongly correlates with the change of the SMEBprice for some districts, this result can not be observed on the gov-ernorate level (Figure 6 in the Appendix). This is a point for furtherresearch, as the months and sub-districts, for which the displace-ment data and the data on the SMEB price are provided, agree onlypartially.

7 CONCLUSIONIn this work, it was shown that the local exchange rate between theUSD and the SYP might play a role in the relationship between thenumber of war events and the price of the SMEB. Furthermore, abaseline model that classi�es signi�cant changes in the SMEB pricewas enhanced by including external factors. Finally, depending onthe granularity of the approach, a correlation between the numberof displaced people and the change of SMEB price was found forsome districts.

All of these �ndings are to be taken with caution. While sometrends and correlations were observed, they mostly remain statisti-cally insigni�cant and can therefore only serve as a guidance forfurther inquiry, not as a robust statistical model.

In further research, an imbalance of the number of events of theGDELT data-set in di�erent sub-districts can be taken into account.Further, more datasets on the number of displaced people can beused to analyze the relationship between the number of displacedpeople and the price of the SMEB. Finally, it is to be analyzed howthe informal exchange rate in local markets between the USD andthe SYP is in�uenced by external factors.

REFERENCES[1] 2019. (2019). https://www.xe.com/currencycharts/?from=USD&to=SYP&view=2Y[2] REACH Resource Centre. 2019. (2019). https://www.reachresourcecentre.

info/country/syria/cycle/727/p/2/?toip-group=data&toip=dataset-database#cycle-727

[3] European Commission. 2019. (2019). https://ec.europa.eu/echo/what/humanitarian-aid/cash-transfers-and-vouchers_en

[4] Impact Initiatives. 2019. Reach Resource Center. (2019). https://www.reachresourcecentre.info/

[5] Philip A. Schrodt Kalev Leetaru. 2013. GDELT: Global data on events, location,and tone. (2013). https://www.gdeltproject.org/

[6] United Nations. 2019. Syrian Arab Republic: Administrative Bound-aries, Populated Places. (2019). https://data.humdata.org/dataset/syrian-arab-republic-administrative-boundaries-populated-places

[7] Andrew Linke Håvard Hegre Raleigh, Clionadh and Joakim Karlsen. 2010. Intro-ducing ACLED-Armed Con�ict Location and Event Data. Journal of Peace Research47(5) (2010). https://www.acleddata.com/data/

A APPENDIX

(a) District level

(b) Governorate level

Figure 6: Correlation between relative change in upper estimateof displaced people and change in SMEB