Quantifying Unobserved Attributes in Expert Elicitation of Terrorist Preferences Vicki Bier, Chen...
-
date post
21-Dec-2015 -
Category
Documents
-
view
223 -
download
0
Transcript of Quantifying Unobserved Attributes in Expert Elicitation of Terrorist Preferences Vicki Bier, Chen...
Quantifying Unobserved Attributes in Expert Elicitation
of Terrorist PreferencesVicki Bier, Chen Wang
University of Wisconsin-Madison
Research Goals• To construct a reasonable defender prior
distribution over possible terrorist preferences: – By explicitly modeling the defender’s uncertainty about
unobserved attributes– I.e., attributes that may be important to the terrorist, but are
un-quantified or unobserved by the defender
• To simplify the task of quantifying threat probabilities for subject-matter experts:– By using ordinal rather than cardinal estimates– To increase the acceptance of quantitative approaches in
the intelligence community
Indirect Expert Elicitation
• Allows experts to express their knowledge as rank orderings rather than numerical values:– Simplifies the process of bringing expert knowledge to bear
• Well suited to elicitation challenges commonly encountered in the intelligence community:– High uncertainty and sparse data– Reluctance on the part of experts to express their
knowledge in probabilistic form
Mathematical Approach• Experts provide rank orderings of selected targets or
attack strategies:– Reflecting their knowledge of adversary preferences
• Adversary preferences are assumed to follow an additive multi-attribute utility function:– With an “error term” representing the effect of any attributes
that have not been identified or observed by the defender
– Assumed to be independent and identically distributed!
• Probabilistic inversion is used to estimate the values of both attribute weights and unobserved attributes: – To yield the best fit to the stated rank orderings
– Taking into account expert consensus or disagreement
Urban AreaProperty Loss($ million),
X1
Fatalities,X2
Population,X3
Population Density
(per sq mile),X4
New York, NY 413 304 9,314,235 8,159 Chicago 115 54 8,272,768 1,634
San Francisco 57 24 1,731,183 1,705 Washington, DC-MD-VA-WV 36 29 4,923,153 756
Los Angeles-Long Beach 34 17 9,519,338 2,344 Philadelphia, PA-NJ 21 9 5,100,931 1,323
Boston, MA-NH 18 12 3,406,829 1,685 Houston 11 9 4,177,646 706
Newark, NJ 7.3 4 2,032,989 1,289 Seattle-Bellevue-Everett 6.7 4 2,414,616 546
Jersey City 4.4 2 608,975 13,044 Detroit 4.2 1.9 4,441,551 1,140
Las Vegas, NV-AZ 4.1 1 1,563,282 40 Oakland, CA 4 1 2,392,557 1,642
Orange County, CA 3.7 2 2,846,289 3,606 Cleveland-Lorain-Elyria 3 0.5 2,250,871 832
San Diego 2.8 1 2,813,833 670 Miami, FL 2.7 0.5 2,253,362 1,158
Minneapolis-St. Paul, MN-WI 2.7 0.4 2,968,806 490 Denver 2.5 1.1 2,109,282 561
Attribute Values for the Case Study
3 Hypothetical Expert Groups• Each group has 10 experts:
– Each of whom ranks the top 10 targets
• Group 1: – All find X1 (property loss) and X2 (fatalities) important,
but not X4 (population density)– Little or no weight on unobserved attributes
• Group 2: – All think X4 (population density) is important– Opinions reflect an unobserved attribute,
corresponding to presence of entertainment industry
• Group 3 (expert disagreement):– Five experts from Group 1, and five from Group 2
Modeling Unobserved Attributes• Attempt to fit the attribute weights to the stated rank
orderings• Use trial and error to find the weight for the unobserved
attribute that yields the lowest infeasibility• Resulting weights:
– Group 1: 0.02– Group 2: 0.09 (largest)– Group 3: 0.08
Results of Probabilistic Inversion
• Group 1 yields high weights on X1 and X2:– Low weight on X4
• Group 2 yields the highest weight on X4:– With Group 3 intermediate between Groups 1 and 2
• As expected, Group 2 has the largest infeasibility:– Since the experts in Group 2 take unobserved attributes into account,
even the best fit performs worse than the other two groups
E[X1](Property
Loss)E[X2]
(Fatalities)E[X3]
(Population)
E[X4] (Population
Density)
X5Unobserved Attributes
Relative Information (Infeasibility)
Group 1 0.367 0.552 0.023 0.038 0.02 0.272Group 2 0.210 0.265 0.090 0.345 0.09 1.067Group 3 0.325 0.366 0.113 0.117 0.08 0.016
Distributions of Attribute Weights
Group 1
Group 2
Group 3
Uniform prior
Probabilistic inversion
X1 and X2 increase
X4 increases
Inconsistent judgments –
higher variance
x_1
0.0 0.2 0.4 0.6 0.8 1.0
010
0020
0030
0040
00
x_2
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
050
015
0025
0035
00 x_3
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
010
0030
0050
00
x_4
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
050
0015
000
x_1
0.0 0.2 0.4 0.6 0.8 1.0
010
0030
0050
00
x_2
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
010
0030
0050
00
x_3
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
040
0080
0012
000
x_4
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
020
0040
0060
0080
00
x_1
0.0 0.2 0.4 0.6 0.8 1.0
010
0030
0050
00
x_2
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
010
0030
00
x_3
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
020
0060
00
x_4
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
020
0060
0010
000
Unobserved Attributes• Can look at the posterior distributions for the unobserved attribute:
– To identify candidate unobserved attribute(s)
Group 2
• Posterior correlations for unobserved attributes:
– Positive correlation between Los Angeles and Las Vegas suggests that some experts consider presence of an entertainment industry important
LA, Jersey City, and Las
Vegas increase
epsilon_NYC0
2000
4000
6000
epsilon_LA
Fre
quen
cy
010
0030
0050
00
epsilon_Jersey
Fre
quen
cy
010
0030
0050
00 epsilon_LV
Fre
quen
cy
010
0030
0050
00
Uniform prior
Probabilistic inversion
NYC DC LA Las VegasNYC 1 -0.201 0.034 0.045DC 1 -0.011 0.014LA 1 0.161
Las Vegas 1
Predicted Rankings With Unobserved Attributes
Rank Group 1 Group 2 Group 31 NYC NYC NYC2 Chicago Jersey City Chicago3 San Francisco Chicago LA4 DC LA DC5 LA Orange County San Francisco6 Boston San Francisco Jersey City7 Philadelphia DC Boston8 Houston Boston Philadelphia9 Jersey City Philadelphia Houston
10 Newark Detroit Orange County11 Seattle Houston Newark12 Orange County Oakland Oakland13 Detroit Newark Detroit14 Oakland Miami Miami15 San Diego Las Vegas Seattle16 Miami Seattle Minneapolis17 Denver Cleveland San Diego18 Minneapolis Minneapolis Las Vegas19 Cleveland San Diego Cleveland20 Las Vegas Denver Denver
Red –Increased
Blue –decreased
Changes by more than 1 place are colored.
Compared to case without unobserved attributes
Assessment of Results• Predicted rankings are more consistent with expert
judgments when unobserved attributes are included, especially for Group 2: – Las Vegas gets high rankings due to unobserved attribute
(presence of entertainment industry)– The model without unobserved attributes does not have the
flexibility to adequately reflect expert judgments
• Can also be used as a basis for inference about what unobserved attributes
• For example, if LA and Las Vegas are rated higher than their known attribute values would suggest: – That might indicate the need to include presence of a large
entertainment industry as a terrorist attribute
Assessment of Results• Results may be better than direct weight elicitation• For example, some experts may put high weight on
population density:– Without realizing this implies a high ranking for Jersey City
• Can deal with conflicting and/or inconsistent expert opinions:– By (possibly multi-modal) distributions of attribute weights
with high variance
Pre-Posterior Analysis
• How do the models perform with Bayesian updating:– Especially after an unexpected attack?
• Problem:– Some targets have zero probability of being attacked in the
model– Model would break in the event of an attack on such a target– Cannot condition on a set of measure zero!
• Model without unobserved attributes is especially poor in this respect
• May need to consider non-uniform (e.g., U-shaped) prior distributions for the unobserved attributes
Bayesian Updating• Consider a target with a positive probability of being
attacked:– Assume an (unexpected) attack on Jersey City
• Probability that the next attack is also on Jersey City becomes quite high (maybe unrealistically high)
• What happens to the attribute weights?
Group 1
Group 2
Group 3
Prior mean
Posterior mean
X1 and X2 decrease;
X4 increases
An Attack on Jersey City
x_1
0.0 0.2 0.4 0.6 0.8 1.0
020
060
010
00
x_2
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
020
060
010
00
x_3
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
020
040
060
0
x_4
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
020
040
060
0
x_1
0.0 0.2 0.4 0.6 0.8 1.0
050
100
150
200
x_2
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
050
100
150
200
x_3
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
050
100
150
200
x_4
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
050
100
150
x_1
0.0 0.2 0.4 0.6 0.8 1.0
050
010
0020
00
x_2
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
050
010
0020
00
x_3
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
020
060
010
00
x_4
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
020
060
010
00
epsilon_NYC
0.0 0.2 0.4 0.6 0.8 1.0
01
23
45
epsilon_Jersey
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
01
23
45
67
epsilon_Las Vegas
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
02
46
8
Posterior of Unobserved Attribute
Group 1
Group 2
Group 3
Jersey City has higher values on the unobserved
attributeepsilon_NYC
0.0 0.2 0.4 0.6 0.8 1.0
010
020
030
040
050
0 epsilon_Jersey
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
010
020
030
040
050
0 epsilon_Las Vegas
Fre
quen
cy
0.0 0.2 0.4 0.6 0.8 1.0
010
020
030
040
050
0
epsilon_NYC
0.0 0.2 0.4 0.6 0.8 1.0
0100
200
300
400
epsilon_Jersey
Fre
quenc
y
0.0 0.2 0.4 0.6 0.8 1.0
0100
200
300
400
epsilon_Las Vegas
Fre
quenc
y
0.0 0.2 0.4 0.6 0.8 1.0
0100
200
300
400
Prior mean
Posterior mean
Predicted Rankings after an Attack on Jersey City
Rank Group 1 Group 2 Group 31 NYC Jersey City NYC2 Jersey City NYC Jersey City3 Chicago Orange Orange4 LA LA LA5 San Francisco Chicago Chicago6 Orange Boston Boston7 DC San Francisco San Francisco8 Boston Philadelphia Philadelphia9 Philadelphia Oakland Oakland
10 Oakland Detroit Detroit11 Detroit DC DC12 Newark Newark Newark13 Houston Miami Miami14 Miami Houston Houston15 Cleveland Cleveland Cleveland16 San Diego San Diego San Diego17 Seattle Seattle Seattle18 Minneapollis Minneapollis Minneapollis19 Denver Denver Denver20 LV LV LV
Red –increased
Blue –decreased
Jersey City ranks higher, but not the
highest!
This seems reasonable
Groups 1 and 3 still consider
NYC more attractive
Changes by more than 2 places are colored.
Group 1
Group 2
Group 3
x_1
0.0 0.4 0.8
0500
1000
1500
x_2
Fre
quency
0.0 0.4 0.8
0500
1000
1500
2000 x_3
Fre
quency
0.0 0.4 0.8
0500
1500
2500
x_4
Fre
quency
0.0 0.4 0.8
01000
2000
3000
4000
x_1
0.0 0.4 0.8
0200
400
600
800
1000 x_2
Fre
quency
0.0 0.4 0.8
0200
600
1000
1400 x_3
Fre
quency
0.0 0.4 0.8
0200
600
1000
x_4
Fre
quency
0.0 0.4 0.8
02000
4000
6000
x_1
0.0 0.4 0.8
0500
1000
1500
x_2
Fre
quency
0.0 0.4 0.8
0500
1000
1500
x_3
Fre
quency
0.0 0.4 0.8
01000
2000
3000
x_4
Fre
quency
0.0 0.4 0.8
0500
1000
2000
Prior mean
Posterior mean No significant changes
An Attack on New York City
Future Directions
• An alternative approach for fitting expert opinions:– Bayesian density estimation
• Sensitivity analysis on the performance of the two approaches: – Computational behavior (convergence properties, run times)– Reasonableness of predicted rankings– Performance with Bayesian updating after expected and
unexpected attacks
• Obtain stakeholder feedback on applicability of methodology and realism of results