Source: University of California, Los Angeles (IPAM), helper.ipam.ucla.edu/publications/caws3/caws3_13063.pdf


Matching Methods for High-Dimensional Data with Applications to Text

Molly Roberts, Brandon Stewart and Rich Nielsen

UCSD, Princeton and MIT

May 11, 2016

Roberts (UCSD) Text Matching May 11, 2016 1 / 32

Introduction

How do people react to online repression?

- Lots of governments try to control online information
- Censoring the whole internet is hard (# of bloggers ≫ # of censors)
- With limited external enforcement, governments scare people into self-policing
- Governments might jail some bloggers to scare people
- Then encourage self-censorship by signaling off-limits topics

Roberts (UCSD) Text Matching 28 April 2016 2 / 32


Introduction

How do people react to online repression?

- Lots of governments try to control online information
- Censoring the whole internet is hard (# of bloggers ≫ # of censors)
- With limited external enforcement, governments scare people into self-policing
- Governments might jail some bloggers to scare people
- Then encourage self-censorship by signaling off-limits topics
- But this could turn out one of two ways:
  - Bloggers might take cues to self-censor, OR
  - Bloggers might hate censorship and rebel
- How censorship works is important for understanding self-censorship
- BUT self-censorship is very hard to measure

Roberts (UCSD) Text Matching 28 April 2016 3 / 32


Introduction

The perfect experiment

1. Be the Chinese government
2. Randomly assign censorship
3. See what bloggers write after censorship

Problem 1: unethical
Problem 2: we aren't the Chinese government

Roberts (UCSD) Text Matching 28 April 2016 4 / 32


Introduction

How Can We Measure Deterrence?

The best approximation: find two bloggers with

✓ similar users,
✓ similar censorship histories,
✓ similar numbers of posts,
✓ similar previous post sensitivity,
...

who wrote very similar posts on the same day, where only one post was censored: a censorship 'mistake'.

Does the censored blogger's behavior change? Does the censored blogger stay away from the topic, or pursue it?

Roberts (UCSD) Text Matching 28 April 2016 5 / 32
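The matched-pair design above can be sketched with off-the-shelf text similarity: represent posts as TF-IDF vectors and, for each censored post, pick the most similar uncensored post. The toy corpus and variable names below are invented for illustration; this is not the authors' actual matching procedure.

```python
# Toy sketch of pairing a censored post with its closest uncensored twin.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

posts = [
    "protest in the city square last night",          # censored
    "protest gathered in the city square yesterday",  # uncensored, similar
    "my favorite dumpling recipe",                    # uncensored, dissimilar
]
censored = [True, False, False]

# TF-IDF vectors and pairwise cosine similarities between all posts.
X = TfidfVectorizer().fit_transform(posts)
sim = cosine_similarity(X)

# For each censored post, match the most similar uncensored post.
matches = {}
for i, is_c in enumerate(censored):
    if is_c:
        candidates = [j for j, c in enumerate(censored) if not c]
        matches[i] = max(candidates, key=lambda j: sim[i, j])
print(matches)  # → {0: 1}
```

In the real design one would additionally restrict candidates to posts written on the same day by bloggers with similar histories, as the slide's checklist requires.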


Introduction

Text Matching

- Text as a pre-treatment confounder: a surprisingly frequent problem
- Applications:
  - Does censorship change a blogger's behavior?
  - Do targeted killings of Islamic extremists create interest in their work?
  - In International Relations, are women cited less frequently than men?
  - Control for letters of recommendation, trade treaties, Congressional bills, etc.
- BUT existing matching methods are impossible to apply to high-dimensional data
  - You can't possibly match on every word! (and you wouldn't want to)
  - We care about controlling for covariates predictive of treatment
  - But with text, we don't know what predicts treatment

Very little work on this.

Roberts (UCSD) Text Matching 28 April 2016 6 / 32


Introduction

Our Approach to Text Matching

1. Construct analogs to current methods:
   - Propensity score matching → Multinomial Inverse Regression
   - Coarsened exact matching → Topically Coarsened Matching
2. Identify benefits and drawbacks of each
3. Create a new method, Topical Inverse Regression Matching (TIRM), by combining the two

Roberts (UCSD) Text Matching 28 April 2016 7 / 32
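As a rough illustration of what "topically coarsened" matching might look like (a sketch under my own assumptions, not the authors' implementation): estimate each document's topic proportions, coarsen them into a few bins, and then require exact agreement on the binned topic profile, keeping only strata that contain both treated and control documents. The topic proportions below are simulated stand-ins.

```python
# Hypothetical coarsened-exact-matching sketch on simulated topic shares.
import numpy as np

rng = np.random.default_rng(0)
theta = rng.dirichlet(np.ones(3), size=6)     # 6 docs x 3 topic proportions
treated = np.array([1, 0, 1, 0, 0, 1])        # simulated treatment labels

def coarsen(row, cuts=(0.33, 0.66)):
    # Map each topic proportion to a coarse bin: 0=low, 1=mid, 2=high.
    return tuple(np.digitize(row, cuts))

# Group documents into strata by their coarsened topic profile.
strata = {}
for i, row in enumerate(theta):
    strata.setdefault(coarsen(row), []).append(i)

# Exact matching keeps only strata with both treated and control docs.
matched = {k: v for k, v in strata.items()
           if {treated[i] for i in v} == {0, 1}}
```

Coarsening trades off bias (coarser bins admit worse matches) against the number of units retained, just as in ordinary coarsened exact matching.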


Introduction

Outline of Talk

- A Quick Review of Matching
- Text Analogs to Current Matching Methods
- Topical Inverse Regression Matching
- Applications

Roberts (UCSD) Text Matching 28 April 2016 8 / 32


Matching Methods for Text

Previous Approaches to Matching

- Goal: t_i ⊥⊥ y_i(1), y_i(0) | x_i (treatment independent of the potential outcomes, conditional on the covariate vector x_i)
- Many approaches: propensity score matching, coarsened exact matching, genetic matching, covariate-balanced propensity scores, entropy balancing, synthetic matching, Mahalanobis matching, exact matching, subclassification matching, nearest neighbor matching, full matching, ...
- Today, two of these strategies:
  1. model p(t_i | x_i) → propensity scores
  2. match on all x_i → coarsened exact matching
- Both strategies scale poorly with high-dimensional covariates.

Roberts (UCSD) Text Matching 28 April 2016 9 / 32
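Strategy 1 can be sketched in a few lines: fit a model of p(t_i | x_i), then nearest-neighbor match each treated unit to the control unit with the closest estimated score. The data here are simulated, and this is generic propensity-score matching, not the TIRM method the talk builds toward.

```python
# Minimal propensity-score matching sketch on simulated data.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))            # covariates x_i
p_true = 1 / (1 + np.exp(-X[:, 0]))      # treatment depends on first covariate
t = rng.binomial(1, p_true)              # treatment indicator t_i

# Step 1: model p(t_i | x_i) to get each unit's propensity score.
score = LogisticRegression().fit(X, t).predict_proba(X)[:, 1]

# Step 2: match each treated unit to the control with the closest score.
controls = np.where(t == 0)[0]
pairs = {i: controls[np.argmin(np.abs(score[controls] - score[i]))]
         for i in np.where(t == 1)[0]}
```

With text, the dimensionality problem on this slide is exactly that X would have one column per vocabulary word, so the propensity model is badly underdetermined without dimension reduction.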


Matching Methods for Text

Previous Approaches to Matching

I Goal: ti ⊥⊥ yi (1), yi (0)|~xiI Many approaches: propensity score matching, coarsened exact matching, genetic

matching, covariate-balanced propensity scores, entropy balancing, synthetic matching,

mahalanobis matching, exact matching, subclassification matching, nearest neighbor

matching, full matching . . .

I Today two of these strategies:

1. model p(ti |~xi ) propensity scores

2. match on all ~xi coarsened exact matching

I Both strategies scale poorly with high-dimensional covariates.

Roberts (UCSD) Text Matching 28 April 2016 9 / 32

Matching Methods for Text

Previous Approaches to Matching

I Goal: ti ⊥⊥ yi (1), yi (0)|~xiI Many approaches: propensity score matching, coarsened exact matching, genetic

matching, covariate-balanced propensity scores, entropy balancing, synthetic matching,

mahalanobis matching, exact matching, subclassification matching, nearest neighbor

matching, full matching . . .

I Today two of these strategies:

1. model p(ti |~xi ) propensity scores2. match on all ~xi coarsened exact matching

I Both strategies scale poorly with high-dimensional covariates.

Roberts (UCSD) Text Matching 28 April 2016 9 / 32

Matching Methods for Text

Previous Approaches to Matching

I Goal: ti ⊥⊥ yi (1), yi (0)|~xiI Many approaches: propensity score matching, coarsened exact matching, genetic

matching, covariate-balanced propensity scores, entropy balancing, synthetic matching,

mahalanobis matching, exact matching, subclassification matching, nearest neighbor

matching, full matching . . .

I Today two of these strategies:

1. model p(ti |~xi ) propensity scores2. match on all ~xi coarsened exact matching

I Both strategies scale poorly with high-dimensional covariates.

Roberts (UCSD) Text Matching 28 April 2016 9 / 32

Matching Methods for Text

Propensity Scores: An Analog for Text

I Classical approach
  I fit logistic regression π̂i = p(ti |~xi )
  I match units with similar probability of treatment
  I pros: units matched by scalar (π̂i ) instead of long vector (~xi )
  I cons: only produces balance in expectation

I Problem: high-dimensional confounders
  I X is N × V (# of documents by # of words in vocab)
  I can only estimate π̂i well when N ≫ V , which isn’t the case!
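The classical approach above can be sketched in Python; everything here is illustrative (toy covariates, a hand-rolled gradient-ascent logistic fit standing in for a proper GLM):

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def fit_logit(X, t, iters=500, lr=0.5):
    """Fit p(t_i | x_i) by gradient ascent on the logistic log-likelihood."""
    w = [0.0] * (len(X[0]) + 1)          # intercept + one weight per covariate
    for _ in range(iters):
        grad = [0.0] * len(w)
        for xi, ti in zip(X, t):
            err = ti - sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi)))
            grad[0] += err
            for j, xj in enumerate(xi):
                grad[j + 1] += err * xj
        w = [wj + lr * g / len(X) for wj, g in zip(w, grad)]
    return w

random.seed(0)
# toy data: treatment more likely when the first covariate is large
X = [[random.gauss(0, 1), random.gauss(0, 1)] for _ in range(200)]
t = [1 if random.random() < sigmoid(2 * x[0]) else 0 for x in X]

w = fit_logit(X, t)
scores = [sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi))) for xi in X]

# match each treated unit to the control with the closest scalar score
treated = [i for i, ti in enumerate(t) if ti == 1]
control = [i for i, ti in enumerate(t) if ti == 0]
pairs = [(i, min(control, key=lambda j: abs(scores[i] - scores[j]))) for i in treated]
```

Matching stays cheap because only the scalar π̂i is compared, which is exactly what breaks when π̂i itself cannot be estimated (N ≪ V).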

Roberts (UCSD) Text Matching 28 April 2016 10 / 32


Matching Methods for Text

Propensity Scores: An Analog for Text

I Solution: Multinomial Inverse Regression (Cook 2007, Taddy 2013)

  I assume xi ∼ Multinomial(~qi , mi = Σv xi ,v )
  I where qi ,v ∝ exp(αv + tiφv )
  I φv measures relationship between treatment and word
  I projection zi = Φ′(~xi/mi ) is a sufficient reduction: X ⊥⊥ T |Z → estimate π̂i with the projection

I Match on zi or π̂i
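A toy sketch of the projection step. The φ below is a crude smoothed log-ratio stand-in, not an actual MNIR fit (a real fit would maximize the multinomial likelihood with qi,v ∝ exp(αv + tiφv)); the vocabulary and counts are invented:

```python
import math

# toy corpus: each row is (word-count vector x_i over a 4-word vocab, treatment t_i)
docs = [
    ([5, 1, 0, 2], 1),
    ([4, 2, 1, 1], 1),
    ([0, 3, 5, 2], 0),
    ([1, 2, 4, 3], 0),
    ([3, 1, 1, 1], 1),
    ([0, 4, 3, 3], 0),
]
V = 4

# crude stand-in for the MNIR loadings phi_v: smoothed log-ratio of each
# word's rate under treatment vs. control
rate = {1: [1.0] * V, 0: [1.0] * V}   # add-one smoothing
for x, t in docs:
    for v in range(V):
        rate[t][v] += x[v]
tot = {t: sum(rate[t]) for t in (0, 1)}
phi = [math.log((rate[1][v] / tot[1]) / (rate[0][v] / tot[0])) for v in range(V)]

def project(x):
    """Sufficient-reduction projection z_i = phi' (x_i / m_i)."""
    m = sum(x)
    return sum(p * xv / m for p, xv in zip(phi, x))

z = [project(x) for x, _ in docs]
```

Each document collapses to a single scalar zi, on which units can then be matched.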

Roberts (UCSD) Text Matching 28 April 2016 11 / 32


Matching Methods for Text

Problems with MNIR Matching

Posts equally likely to be treated are not always semantically similar:

I this wouldn’t be a problem in expectation, BUT

I balance is hard to assess in the text case

I estimates could be more efficient if matches were more similar

Roberts (UCSD) Text Matching 28 April 2016 12 / 32


Matching Methods for Text

Coarsened Exact Matching: An Analog for Text

I Classical approach

  I coarsen each variable into natural categories
    e.g. years of education → {elementary school, high school, college}
  I exactly match on the coarsened variables
  I pros: bounds imbalance on each variable

I Problem: high-dimensional confounder
  I thousands of variables; even if we coarsen, no exact match
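The classical CEM recipe can be sketched as follows (the units, cutpoints, and covariates are hypothetical):

```python
from collections import defaultdict

def coarsen(years):
    """Coarsen years of education into natural categories."""
    if years <= 8:
        return "elementary"
    elif years <= 12:
        return "high school"
    return "college"

units = [
    # (id, treatment, years of education, age)
    (0, 1, 11, 34), (1, 0, 12, 37), (2, 1, 16, 52),
    (3, 0, 15, 49), (4, 1, 7, 61),  (5, 0, 18, 44),
]

# exact-match on the signature of coarsened covariates
strata = defaultdict(lambda: {"treated": [], "control": []})
for uid, t, edu, age in units:
    key = (coarsen(edu), age // 10)        # coarsen both covariates
    strata[key]["treated" if t else "control"].append(uid)

# keep only strata containing both treated and control units
matched = {k: v for k, v in strata.items() if v["treated"] and v["control"]}
```

The coarsening bounds within-stratum imbalance on each variable, but with thousands of variables the signature space explodes and most strata end up empty on one side.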

Roberts (UCSD) Text Matching 28 April 2016 13 / 32


Matching Methods for Text

Coarsened Exact Matching: An Analog for Text

I Solution: topically coarsened matching

  I innovation: coarsen across variables
    simple example: “tax”, “income”, “tariff” → “economics”
  I topics must be equivalent across documents instead of words
  I bounds imbalance across groups of stochastically equivalent words

I Estimate a topic model

I Match on the topic density rather than raw word counts
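A toy version of coarsening across variables. The word→topic map and documents here are invented; a real analysis would estimate the topics with a topic model rather than hand-assign them:

```python
from collections import defaultdict

# hypothetical coarsening across variables: "tax", "income", "tariff" -> "economics"
topic_of = {"tax": "economics", "income": "economics", "tariff": "economics",
            "protest": "politics", "election": "politics", "censor": "politics"}

def topic_profile(words, bins=4):
    """Coarsened topic proportions: each topic's share, rounded to 1/bins."""
    counts = defaultdict(int)
    for w in words:
        counts[topic_of.get(w, "other")] += 1
    n = len(words)
    return tuple(sorted((k, round(bins * v / n) / bins) for k, v in counts.items()))

docs = [
    (["tax", "income", "protest", "tariff"], 1),
    (["tariff", "tax", "election", "income"], 0),
    (["protest", "election", "censor", "censor"], 1),
]

# exact match on the coarsened topic profile instead of raw word counts
strata = defaultdict(lambda: {1: [], 0: []})
for i, (words, t) in enumerate(docs):
    strata[topic_profile(words)][t].append(i)

matched = {k: v for k, v in strata.items() if v[1] and v[0]}
```

Matching on the low-dimensional topic profile makes exact matches feasible where matching on the full vocabulary would find none.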

Roberts (UCSD) Text Matching 28 April 2016 14 / 32


Matching Methods for Text

Problems with Topical CEM

Topics aren’t always the most important predictor of treatment:

Roberts (UCSD) Text Matching 28 April 2016 15 / 32


Matching Methods for Text

Topical Inverse Regression Matching (TIRM)

I We need something that:

1. Bounds imbalance between documents
2. Doesn’t leave out important words

I TIRM: jointly estimate the probability of treatment and the topic density

I Match on topic proportions & topic-specific probability of treatment
  I topical bounding properties
  I estimates which words are associated with treatment

I Ingredients:
  I Structural Topic Model (Roberts, Stewart, Tingley et al. 2014)
  I with treatment as a content covariate
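Given the fitted outputs, the matching step might look like this; the θ and proj values below are hypothetical placeholders for what an STM fit would produce:

```python
# hypothetical fitted quantities per document: topic proportions theta_i
# (two topics) and the treatment projection proj_i
docs = [
    {"t": 1, "theta": (0.8, 0.2),  "proj": 0.9},
    {"t": 1, "theta": (0.3, 0.7),  "proj": 0.6},
    {"t": 0, "theta": (0.75, 0.25), "proj": 0.85},
    {"t": 0, "theta": (0.2, 0.8),  "proj": 0.1},
    {"t": 0, "theta": (0.35, 0.65), "proj": 0.55},
]

def dist(a, b):
    """Euclidean distance over the stacked matching variables (theta, proj)."""
    va = a["theta"] + (a["proj"],)
    vb = b["theta"] + (b["proj"],)
    return sum((x - y) ** 2 for x, y in zip(va, vb)) ** 0.5

treated = [i for i, d in enumerate(docs) if d["t"] == 1]
control = [i for i, d in enumerate(docs) if d["t"] == 0]
pairs = {i: min(control, key=lambda j: dist(docs[i], docs[j])) for i in treated}
```

Matching on (θ, proj) jointly finds controls that are both topically similar and equally "treatment-like" in their word choice.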

Roberts (UCSD) Text Matching 28 April 2016 16 / 32


Matching Methods for Text

Structural Topic Model

I STM adds “structure” to Latent Dirichlet Allocation (Blei, Ng and Jordan 2003) via a prior

I Replace the topic prevalence prior → (heuristically) a GLM with arbitrary covariates (Blei and Lafferty 2006; Mimno and McCallum 2008)

I Replace the distribution over words → multinomial logit (Eisenstein, Ahmed and Xing 2011)

I Documents have different expected topic proportions based on observed covariates.

I Topics are now deviations from a baseline distribution.

P(word | topic, doc) ∝ exp(κ(m) + topic · κ(k) + covariate_doc · κ(c) + topic · covariate_doc · κ(int))

κ(c) and κ(int) capture how words are related to treatment.
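The word distribution above just normalizes exponentiated sums of κ terms. A small numeric sketch, with all κ values invented and a single topic indicator standing in for K topics:

```python
import math

V = 3  # toy vocabulary size
kappa_m   = [0.0, 0.5, -0.5]   # baseline log-frequencies
kappa_k   = [1.0, 0.0, -1.0]   # topic deviation for one topic
kappa_c   = [0.5, -0.5, 0.0]   # treatment (content covariate) deviation
kappa_int = [0.2, 0.0, -0.2]   # topic-treatment interaction

def word_dist(topic_on, treated):
    """P(word | topic, doc) via the multinomial-logit form, softmax-normalized."""
    logits = [kappa_m[v]
              + (kappa_k[v] if topic_on else 0.0)
              + (kappa_c[v] if treated else 0.0)
              + (kappa_int[v] if topic_on and treated else 0.0)
              for v in range(V)]
    zmax = max(logits)                      # subtract max for numerical stability
    exps = [math.exp(l - zmax) for l in logits]
    s = sum(exps)
    return [e / s for e in exps]

p_treated = word_dist(topic_on=True, treated=True)
p_control = word_dist(topic_on=True, treated=False)
```

With κ(c) and κ(int) positive for word 0, the treated version of the topic puts more mass on that word, which is how the model learns which words mark treatment.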

Roberts (UCSD) Text Matching 28 April 2016 17 / 32


Matching Methods for Text

TIRM

Match on:

1. θ: Estimated topic proportions (K covariates)

2. proj:

I Let x_i/m_i be the fraction of document i accounted for by each word.

I (κ(c))′(x_i/m_i): the covariate-only projection

I (κ(c))′(x_i/m_i) + (1/m_i) Σ_v x_{i,v} ((κ(int)_v)′ θ_i): the topic-covariate projection

3. Any other covariates you think are important

We generally use CEM to match, but other methods could be used.

Limitations of TIRM

I New: relies on a parametric method to reduce dimensions

I Old: requires SUTVA, relevant covariates

Roberts (UCSD) Text Matching 28 April 2016 18 / 32
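The two projections above can be sketched numerically under assumed shapes; every variable name here is a hypothetical stand-in for the quantities on this slide.

```python
import numpy as np

# Sketch of the two TIRM projections for a single document i.
rng = np.random.default_rng(1)
V, K = 40, 4                         # vocabulary size, number of topics

x_i = rng.integers(0, 5, size=V).astype(float)  # word counts for document i
m_i = x_i.sum()                      # document length
theta_i = rng.dirichlet(np.ones(K))  # estimated topic proportions
kappa_c = rng.normal(size=V)         # covariate (treatment) coefficients
kappa_int = rng.normal(size=(V, K))  # topic-covariate interaction coefficients

# covariate-only projection: (kappa_c)' (x_i / m_i)
proj_cov = kappa_c @ (x_i / m_i)

# topic-covariate projection adds (1/m_i) * sum_v x_{i,v} (kappa_int_v' theta_i)
proj_full = proj_cov + (x_i @ (kappa_int @ theta_i)) / m_i

print(proj_cov, proj_full)
```

Each projection reduces a V-dimensional document to a single scalar, which is what makes it feasible to match on alongside the K topic proportions.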


Matching Methods for Text

Simulations

Set up:

1. Simulate 200 outcomes and treatments with confounding topics and words

2. Estimate STM

3. Condition on topics and projection

[Figure: density of the estimated effect for the Naive Estimator, Topics Only, and Topics and Projection, compared against the true effect]

Roberts (UCSD) Text Matching 28 April 2016 19 / 32
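A toy analogue of this setup: a single confounder stands in for the confounding topics and words, and regression adjustment stands in for matching. This illustrates the bias the simulation targets; it is not the authors' simulation code.

```python
import numpy as np

# One confounder z drives both treatment and outcome, so the naive
# difference in means is biased; adjusting for z recovers the truth.
rng = np.random.default_rng(2)
n, true_effect = 5000, -1.0

z = rng.normal(size=n)                              # confounder
t = (z + rng.normal(size=n) > 0).astype(float)      # treatment depends on z
y = true_effect * t + 2.0 * z + rng.normal(size=n)  # outcome depends on both

naive = y[t == 1].mean() - y[t == 0].mean()         # biased: ignores z

X = np.column_stack([np.ones(n), t, z])             # adjust for z via OLS
beta = np.linalg.lstsq(X, y, rcond=None)[0]

print(f"naive: {naive:.2f}, adjusted: {beta[1]:.2f}, truth: {true_effect}")
```

The naive estimate here even has the wrong sign, which mirrors how badly the unadjusted estimator can perform when treatment is driven by document content.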


Estimating Censorship Effects

Example 1: How do bloggers react to censorship?

I Data: 593 bloggers over 6 months spanning 2011–2012

I 150,000 posts

I Return to blogs to measure censorship

I Find censors’ mistakes: two similar blogs, different censorship

I Also match on date, previous censorship, previous sensitivity.

I How do ’treated’ bloggers react to censorship?

I Outcome: Bloggers’ writings after censorship:

I censorship rate after

I sensitivity of blog text after (estimated by TIRM)

I topical content of blogs after

Roberts (UCSD) Text Matching 28 April 2016 20 / 32
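The matching step on covariates such as date, previous censorship, and previous sensitivity uses CEM, which can be sketched as follows. This is an illustrative reimplementation with made-up data, not the cem package.

```python
import numpy as np
from collections import defaultdict

# Minimal coarsened-exact-matching sketch: bin each covariate, then keep
# only strata that contain both a treated and a control unit.
rng = np.random.default_rng(3)
n = 200
treated = rng.integers(0, 2, size=n)    # censorship "treatment" indicator
covs = rng.normal(size=(n, 2))          # e.g. prior censorship, sensitivity

bins = np.floor(covs / 0.5).astype(int) # coarsen each covariate into bins
strata = defaultdict(list)
for i, key in enumerate(map(tuple, bins)):
    strata[key].append(i)

matched = [i for members in strata.values()
           if {int(treated[m]) for m in members} == {0, 1}
           for i in members]
print(f"{len(matched)} of {n} units matched")
```

Units outside any mixed stratum are pruned, which is how CEM trades sample size for balance on the coarsened covariates.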


Estimating Censorship Effects

TIRM Finds Almost Identical Posts

[Figure: histograms of string kernel similarity (0.0–1.0) between matched posts, one panel each for the Original Data, TIRM, Topic Match, and MNIR Match]

Roberts (UCSD) Text Matching 28 April 2016 21 / 32
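One common way to score similarity between matched posts is a normalized k-spectrum string kernel over character n-grams. The slides do not specify which string kernel was used, so this is one plausible sketch rather than the authors' implementation.

```python
from collections import Counter
import math

def spectrum(s, k=3):
    """Count the character k-grams of s."""
    return Counter(s[i:i + k] for i in range(len(s) - k + 1))

def string_kernel(a, b, k=3):
    """Cosine-normalized k-spectrum similarity in [0, 1]."""
    ca, cb = spectrum(a, k), spectrum(b, k)
    dot = sum(ca[g] * cb[g] for g in ca)
    norm = math.sqrt(sum(v * v for v in ca.values()) *
                     sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0

# identical posts score exactly 1.0; unrelated posts score near 0
print(string_kernel("the censors missed this post",
                    "the censors missed this blog post"))
```

A histogram of this score over matched pairs, as in the figure above, shows how close each matching method gets to near-duplicate text.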

Estimating Censorship Effects

Results

I We find 46 matched blogs (censors’ mistakes)

I Nearly perfect matches

I Most matched posts are about Bo Xilai incident, Maoist protests

I 5 posts before treatment:

I No statistical difference in actual censorship

I No statistical difference in TIRM-predicted censorship

I (Not surprising, since we are matching on these!)

I 5 posts after treatment:

I Treated group: 20% censorship; Control group: 7% censorship

I TIRM estimates treated text significantly more sensitive than control

I Treated group talks significantly more about the Bo Xilai incident after censorship than control

I Treated group talks significantly more about CCP History/Mao after censorship than control

Roberts (UCSD) Text Matching 28 April 2016 22 / 32
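A back-of-envelope check that the 20% vs. 7% gap is distinguishable from noise, using a pooled two-proportion z-test. The sample sizes are assumptions built from the figures on this slide (46 matched blogs, 5 posts each per group); the authors' actual inference may differ.

```python
import math

def two_prop_z(x1, n1, x2, n2):
    """z statistic for H0: p1 == p2, using the pooled proportion."""
    p1, p2 = x1 / n1, x2 / n2
    p = (x1 + x2) / (n1 + n2)
    se = math.sqrt(p * (1 - p) * (1 / n1 + 1 / n2))
    return (p1 - p2) / se

n = 46 * 5                      # posts per group (assumption)
z = two_prop_z(round(0.20 * n), n, round(0.07 * n), n)
print(f"z = {z:.2f}")           # well above 1.96, i.e. significant at 5%
```

Under these assumed counts the gap is several standard errors wide, consistent with the significant differences reported above.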


Estimating Effects of Gender on Citations in IR

Example 2: Does gender affect citations in Political Science?

I Maliniak, Powers, Walter (2013): women get cited less than men in IR

I Problem: women write about different topics than men

I Maliniak et al solution: Code articles into (many) categories

I Our solution: Text matching!

I Data: 3,201 journal articles from top 12 IR journals, 1980-2006.

I Code lots of variables, including gender, article age, tenure, etc.

I Treatment: all-female; Control: co-ed/all-male

I Our motive: Find similar articles, see how they are cited differently.

Roberts (UCSD) Text Matching 28 April 2016 23 / 32


Estimating Effects of Gender Citations in IR

Words men and women use differently in IR

Original Data:

Roberts (UCSD) Text Matching 28 April 2016 24 / 32

Estimating Effects of Gender Citations in IR

Words men and women use differently in IR

Topic Matching:

Roberts (UCSD) Text Matching 28 April 2016 24 / 32

Estimating Effects of Gender Citations in IR

Words men and women use differently in IR

TIRM:

Roberts (UCSD) Text Matching 28 April 2016 24 / 32

Estimating Effects of Gender Citations in IR

TIRM Reduces Topical Differences

[Figure: mean topic difference (Women − Men), x-axis from −0.10 to 0.10, comparing Full Data Set (Unmatched), TIRM, MNIR, Topic Matching, and Human Coding Matched across 15 topics:
Topic 2: model, variabl, data, effect, measur
Topic 9: nuclear, weapon, arm, forc, defens
Topic 6: game, will, cooper, can, strategi
Topic 14: war, conflict, state, disput, democraci
Topic 1: state, power, intern, system, polit
Topic 7: polici, foreign, public, polit, decis
Topic 12: soviet, militari, war, forc, defens
Topic 13: trade, econom, polici, bank, intern
Topic 8: polit, parti, polici, govern, vote
Topic 15: war, israel, peac, conflict, arab
Topic 3: polit, conflict, group, ethnic, state
Topic 4: econom, develop, industri, countri, world
Topic 10: state, china, unit, foreign, polici
Topic 11: intern, state, organ, institut, law
Topic 5: polit, social, one, theori, world]

Roberts (UCSD) Text Matching 28 April 2016 25 / 32

Estimating Effects of Gender Citations in IR

Results

I Maliniak et al: Women receive 80% the citations of men

I In our data: women receive fewer citations; the result is robust across matches

I Final match: Women receive 40-60% the citations of men

I Still looking into why we are getting more extreme results

I The difference may be driven by very high citation counts

Roberts (UCSD) Text Matching 28 April 2016 26 / 32

Estimating Effects of Gender Citations in IR

Ex. 3: Did killing Bin Laden make his ideas less popular?

“His death will serve as a global clarion call for another generation of jihadists.” – Ed Husain (CFR)

“al-Qaida may emerge even more radical, and more closely united under the banner of an iconic martyr.” – Abdel Bari Atwan (The Guardian)

Roberts (UCSD) Text Matching 28 April 2016 27 / 32

Estimating Effects of Gender Citations in IR

Ex. 3: Did killing Bin Laden make his ideas less popular?

“The idea that Obama made a strategic misstep by killing a man responsible for the death of thousands of U.S. citizens and committed to killing thousands more is absurd. Rather than making him a martyr, Bin Laden’s killing demonstrated that he was, like the rest of us, mortal.” – Robert Simcox (LA Times)

Roberts (UCSD) Text Matching 28 April 2016 28 / 32

Estimating Effects of Gender Citations in IR

Ex. 3: Did killing Bin Laden make his ideas less popular?

We don’t really know.

Usama Bin Laden (5/2/2011), Anwar al-Awlaki (9/30/2011), Abu Yahya al-Libi (6/5/2012)

Roberts (UCSD) Text Matching 28 April 2016 29 / 32

Estimating Effects of Gender Citations in IR

Empirical Strategy

I View-count data from a Jihadist website, scraped over time

I Does targeted killing of Bin Laden increase views of his work?

I TIRM matching + match on pre-treatment page views.

I QOI is the ATT: nearest-neighbor matching instead of CEM

I Validation: Matches accord with sub-pages on website

Roberts (UCSD) Text Matching 28 April 2016 30 / 32
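A minimal sketch of the nearest-neighbor step described above, under assumed covariates (topic proportions plus scaled pre-treatment page views); the function name and toy numbers are illustrative, not the authors' implementation. Each treated document is paired with its closest control document, matching with replacement, which targets the ATT.

```python
import numpy as np

def nn_match_att(X, treated):
    """Pair each treated unit with the nearest control unit in covariate
    space (Euclidean distance), matching with replacement for the ATT."""
    X = np.asarray(X, dtype=float)
    is_t = np.asarray(treated, dtype=bool)
    t_idx, c_idx = np.flatnonzero(is_t), np.flatnonzero(~is_t)
    pairs = []
    for i in t_idx:
        d = np.linalg.norm(X[c_idx] - X[i], axis=1)
        pairs.append((int(i), int(c_idx[np.argmin(d)])))
    return pairs

# toy covariates: [topic proportion, scaled pre-treatment views]
X = np.array([[0.10, 0.20], [0.90, 0.80], [0.12, 0.22], [0.88, 0.78]])
treated = np.array([1, 1, 0, 0])
views_post = np.array([120.0, 300.0, 40.0, 90.0])  # post-treatment views

pairs = nn_match_att(X, treated)
# ATT estimate: mean treated-minus-control difference over matched pairs
att = float(np.mean([views_post[i] - views_post[j] for i, j in pairs]))
```

In practice the covariates should be standardized (or a Mahalanobis distance used) so that page views do not dominate the topic proportions.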

Estimating Effects of Gender Citations in IR

Martyr Effect: Clear short-term increase in page views

[Figure: estimated increase in page views per document (y-axis, −3000 to 3000) by date (x-axis, 2011-05-02 through 2011-11-02, with an inset extending into 2012–2014); matching on topics and page views.]

Figure: Estimated effects of Usama Bin Laden’s death (on May 2, 2011) on subsequent page views of his documents on a large jihadist web-library.

Roberts (UCSD) Text Matching 28 April 2016 31 / 32

Estimating Effects of Gender Citations in IR

Conclusion

I Lots of applications measure pre-treatment confounders with text

I No methods developed yet to do this

I We develop a new method, Topical Inverse Regression Matching (TIRM)

I Matching on topical density estimate bounds differences between topics

I Matching on probability of treatment balances on words related to treatment

I Future work:

I Develop theoretical properties of TIRM
I Extend to high-dimensional cases other than text
I Create an R package

Roberts (UCSD) Text Matching 28 April 2016 32 / 32
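To make the two matching quantities concrete, here is a schematic, simplified version of the TIRM idea: combine per-document topic proportions with an estimated probability of treatment given the words, then match on the stacked covariates. The plain logistic regression below is a stand-in for the multinomial inverse regression step, and all names and data are toy values; this is a sketch of the idea, not the forthcoming R package.

```python
import numpy as np

def fit_logistic(X, y, lr=0.1, steps=2000):
    """Bare-bones logistic regression by gradient descent; a stand-in for
    the inverse-regression projection from word counts to treatment."""
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
        g = p - y
        w -= lr * (X.T @ g) / len(y)
        b -= lr * g.mean()
    return w, b

def tirm_covariates(topics, word_counts, treated):
    """Stack the two TIRM matching quantities: topic proportions (the
    topical density estimate) and P(treatment | words)."""
    w, b = fit_logistic(word_counts, treated)
    p_treat = 1.0 / (1.0 + np.exp(-(word_counts @ w + b)))
    return np.column_stack([topics, p_treat])

# toy corpus: 4 documents, 2 topics, 2 words; word 0 signals treatment
topics = np.array([[0.7, 0.3], [0.6, 0.4], [0.65, 0.35], [0.55, 0.45]])
word_counts = np.array([[5.0, 0.0], [4.0, 1.0], [0.0, 5.0], [1.0, 4.0]])
treated = np.array([1.0, 1.0, 0.0, 0.0])

covs = tirm_covariates(topics, word_counts, treated)
# covs can now be fed to any matcher (CEM, nearest neighbor, ...)
```

Matching on the last column prunes documents whose vocabulary strongly predicts treatment, which is how words related to treatment get balanced.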
