Workshop on Statistical Mediation and Moderation: Statistical Mediation Paul Jose Victoria...

Workshop on Statistical Mediation and Moderation:

Statistical Mediation

Paul JoseVictoria University of Wellington

27 March, 2008SASP Conference

What do you want to know?

Let’s briefly have each person state what he or she would like to learn this morning.

Also, what is your level of statistical knowledge/experience?

Okay, let me tell you what I’m planning to cover.

What am I doing today?

I want to define mediation and moderation

How are they similar or different? Basic mediation and moderation Advanced mediation and moderation Questions and answers

Where does one start? I began to be interested in mediation and

moderation because I found that I was increasingly using these approaches in understanding “process” among variables.

I found that there was little about these techniques in traditional statistics textbooks—I mostly obtained information through word-of-mouth.

. . . and I was confused. I don’t like being confused, so I did something about it. I educated myself on these techniques. And now I can pass on what I’ve learned. Let me list what I consider to be the main sources of confusion.

Five major sources of confusion

1. Moderation and mediation sound alike. It makes it seem that they are very similar, and or they derive from the same origin. They are somewhat similar (cousins), but they don’t come from the same place.

2. Second, statistics textbooks typically do not do a very good job of explaining these two approaches. Exception: Howell (2006).

3. Third, reports of moderation and mediation in the research literature are not always clear or accurately performed.

More confusion

4. Both are special cases of two separate broad statistical approaches: mediation is a special case of semi-partial correlations (path modeling) and moderation is a special case of statistical interactions (from ANOVA). Both are included under GLM, but this is not usually appreciated.

5. It’s not entirely clear what distinguishes a moderating variable from a mediating variable. Can one a priori define mediating and moderating variables?

One last stumbling block

Problem: there are no easily used statistics programmes that compute mediation and moderation. Can do analyses in SPSS and other programmes that do regression, but there is no graphing capability dedicated to either mediation or moderation (except ModGraph and MedGraph).

What we have here is a case of the users “getting ahead” of the statisticians in the sense that researchers frequently use mediation and moderation but many statisticians aren’t even familiar with the terms.

Background and history Most people’s awareness of this area comes from

this article:

Baron, Reuben M. & Kenny, David A.(1986). The moderator-mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations.Journal of Personality and Social Psychology. Vol 51(6), pp. 1173-1182.

Cited about 6,500 times by PsychInfo’s count. And that’s just in Psychology.

Most people are unclear about what they said and how to perform the techniques.

Let’s get started:Similarities and differences

Similarities: They both involve three variables; You can use regression to compute both; You wish to see how a third variable affects a basic relationship

(IV to DV).Differences: You create a product term in moderation; not in mediation; You don’t have to centre anything in mediation; Moderation can be used on concurrent or longitudinal data, but

mediation is best used on longitudinal data. Graphing is critical for moderation; helpful for mediation.

How do you know if you have a moderator or a mediator?

What’s the diff? Moderators tend to be variables that are relatively

immune to change over time (personality trait, gender, ethnic group, etc.).

Mediators tend to be variables that change in relation to other variables (anxiety, helpfulness, honesty, mood).

However, there is a class of variables (e.g., coping efforts/strategies) that might be examined in both ways. These two categories are not mutually exclusive.

So let’s focus on mediation first

Definition: “A mediating variable is one which specifies how (or the mechanism by which) a given effect occurs between an independent variable (IV) and a dependent variable (DV).” (Holmbeck, 1997, p. 599).

The question you wish to answer is whether the effect of the IV on the DV is at least partially mediated by a third variable (MV).

You can answer this question with two regressions (and a correlation matrix).

Let’s consider a specific example.

An example from my research

Stressorintensity

Depression

Rumination

The theories

Susan Nolen-Hoeksema believes that an individual who ruminates more ends up more depressed. X => Y. Notice that it’s a causal statement.

I don’t disagree with her, but I think that this simple effect should be embedded within the stress and coping context.

We know that stress leads to depression. The question I want to ask is whether at least part of the effect of stress on depression occurs because certain individuals ruminate about stressful events, and this rumination leads to depression.

The basic relationship

Stressorintensity

Depression

One must have a significant correlation between the IV and DV (in fact among all 3 variables). The essential question is whether by adding a third variable, one can at least partially explain the basic relationship. Let’s look at some real data.

The two steps

Stressorintensity

Depression

Stressorintensity

Depression

Rumination

Step 1

Step 2

.45***

.45***

.51*** .46***

(.29**)

(.32**)

Baron & Kenny’s 4 criteria

1. IV to MV must be significant2. IV to DV must be significant3. MV to DV must be significant (when

entered with the IV)4. “The effect of the IV on the DV must be

less in the third equation than the second. Perfect mediation holds if the IV has no effect when the mediator is controlled.”

“must be less” is measured with the Sobel formula (see following pages)

“Perfect mediation” occurs when the original relationship goes to zero. This never happens in psychology. I have a proposal for how to deal with this issue, presented below.

What changed? Note that the beta weight from IV to DV

changed: from .45 to .29. What does that tell us? According to Baron and Kenny (1986), if one

obtains a significant drop in beta for this relationship, then one has obtained significant mediation.

How can one test whether this is significant or not? (It is not simply whether it goes from significant to non-significant.) One needs to compute the Sobel’s test:

z-value = a*b/SQRT(b2*sa2 + a2*sb

2)

Who ‘ya gonna call?

Many people have been using a web-site by Preacher and Leonardelli, and it’s quite useful for computing the Sobel’s statistic: http://www.psych.ku.edu/preacher/sobel/sobel.htm

Let me show you how to use the site. It is generally very helpful.

I have invented my own programme to do what P & L’s site does, and MORE. Let’s check it out too.

Preparatory work

Before we run off to use these, please know that you have to obtain some statistical information first:

1. Compute a correlation matrix of the 3 variables;

2. Perform a multiple regression of the IV on the mediating variable; and

3. Perform a multiple regression of the IV and mediator on the DV (simultaneous inclusion).

Correlation matrix

Correlations

1 .508** .449**

.000 .000

186 186 186

.508** 1 .464**

.000 .000

186 186 186

.449** .464** 1

.000 .000

186 186 186

Pearson Correlation

Sig. (2-tailed)

N

Pearson Correlation

Sig. (2-tailed)

N

Pearson Correlation

Sig. (2-tailed)

N

Stress total

Rumination total

depression total

Stress totalRumination

totaldepression

total

Correlation is significant at the 0.01 level (2-tailed).**.

Results from the two regressions

1st regression (Stress on Rumination):B 7.501 (unstand regression coefficient)

se .938 (standard error)

2nd regression (Stress, Rumination on Depression):You select the B and se for the mediating variable

here:

B .069se .016new beta for Stress .288new beta for Rumination .317

(P & L web-site needs the first four values.)

Okay, go to the programmes

It is necessary to have written down the pertinent statistical output, or to have printed off the relevant sections.

Can do both programmes on the internet.

If you’re away from the internet you can download the Excel macro of MedGraph and run it whenever you want.

MedGraph outputType of Mediation Partial

Sobel z-value 3.795737 significance .000147

Standardized coefficient of Stress on Depression

Direct: .288

Indirect: .161

Independent Variable:Stress

.449***

Outcome Variable:Depression

(.288***)

.508***

.464***

(.317***)

Mediating Variable:

Rumination

Comparison of web-sites

Preacher’s site has been around longer, it allows variations on the Sobel formula, and gives you an alternate way to compute the Sobel’s t.

My site results in a graphical presentation of results, I think it’s harder to make mistakes with my programme, and it has/will have information about the type of mediation.

My criteria for type of mediation

At present my programme stipulates: None: non-significant Sobel’s z-value Partial: significant Sobel’s and significant

basic relationship in the 2nd regression (IV to DV)

Full: significant Sobel’s and non-significant basic relationship in the 2nd regression (IV to DV)

Dave Kenny argues against this (see his web-site), and I tend to agree with him now. My new approach is on the following page.

What kind of mediation? None: non-significant Sobel’s z-value Partial: significant Sobel’s and ratio < .80.

(ratio is indirect/total; in this case it’s .161/.449)

Full: significant Sobel’s and ratio > .80------------------------------------------------------ In the present case we have a significant

Sobel’s and ratio = .36. Thus, we have partial mediation. Notice that I don’t use the term “perfect mediation”. There is no consensus on the partial/full mediation issue.

Causal finding? Many researchers would be keen to argue

from this result that the experience of stress leads to rumination, which in turn partially leads to depressive symptoms, i.e., a causal argument. Is this merited?

Cole and Maxwell (2003) argue strenuously that concurrent mediation CANNOT support a causal statement. They argue that few concurrent mediation results actually turn out to hold up in longitudinal data. What do they mean?

Shared and unique variance

Basic relationship is just a correlation between two variables.

Stress Depressivesymptoms

Three variables: mediation

Stress Depressivesymptoms

Rumination

The green area indicates the degree of shared variance amongthe three variables: that’s the size of the “indirect effect”. It is hard to argue that these relationships are causal with these data: they are the size of shared and unique variance.

Direct effect

Indirect effect

Warnings! One must have all three correlations be significant before

launching this. K now suggests that 1st one may be optional. Be sure that you do the regressions correctly, and that you are

taking the proper statistical information from the print-outs (B vs. ).

Some people make causal arguments from these results. They are shaky at best.

Types of specification error: 1) ordering of variables, 2) variables with/without error, and 3) “third variable problem”

Longitudinal data are best. Bootstrapping is best with small N samples. Path models involving more than three variables is the general

case—don’t do a bunch of three-variable mediation analyses when you can do one path model.

Specification error

Major boogeyman in path model analytic work: have you correctly specified your model?

Several issues here: Temporal order of variables Variables measured with error Missing variable?

Why is your proposed model the best?

Rumination Stress intensity

Depressivesymptoms

There are exactly 6 combinations of any three variables—why is yourproposed model the best? Why not test all of them? I have, and inthe present case I find six instances of partial mediation. Which is correct? They all tell us something useful about shared and unique variance.

Variables measured with error

One can obtain biased estimates of the indirect effect if the MV is measured with significant error. (Same is true of the IV and DV too, by the way.)

Answer? Do mediation in a latent variable path model in SEM. Possible but not easy. Also, a lot of the times one doesn’t have a sufficient N or multiple indicators of the variables (3 indicators per variable). Would look like this:

Latent variable path model

Rumination

Stress intensity Depression.30***

(.20***)

.40*** .24***

Indirect effect = .10; direct effect = .20; ratio = .33 (.36 in MR)

Missing variable? This is the old “third variable problem”, but

in this case we might wish to call it the “fourth variable problem”.

My student, Kirsty Weir, suggests that anxiety/worry might “explain” the relationship between rumination and depression. Graph is on the following page.

One can never completely resolve this question: include the likely candidates and try to reject them.

The road from stress to depression

Note that the Rum to Dep path was removed because it was non-significant when we added the 4th variable (control). Is the 3-variablemediation pattern wrong then?

Bootstrapping

David MacKinnon and others have argued that typical multiple regression analysis is unbiased only for large samples. (present case N = 575)

They suggest: Large sample: use MR Small sample: use bootstrapping

What is bootstrapping?

Wave of the future

Bootstrapping is a compilation of regression results from many subsets of the original dataset.

The programme selects a subset of the data (e.g., 50 from 100 participants), runs the regression analysis, stores the result, does it again and again up to a predetermined number of times, and then compiles the results of the repeated analyses.

Baron & Kenny didn’t mention this—wasn’t used in 1986 very much at all. It is performed now, but infrequently. It is the wave of the future.

So how does one do this?

If you toddle off to SPSS to do this, you will be disappointed. Although it can perform bootstrapping, it is not set up to do mediation bootstrapping.

Preacher and Hayes (see the Preacher web-site on mediation) offers two different macros: SAS and SPSS. Download it and use it within SPSS. (not easy)

Let’s look at the results of the SPSS macro.

Macro output

It’s telling us that the indirect effect was significant—agrees with the multiple regression result, but this is an unbiased estimate. (z = 3.80 before)

Run MATRIX procedure:

DIRECT AND TOTAL EFFECTS Coeff s.e. t Sig(two)b(YX) .3934 .0288 13.6685 .0000b(MX) 1.0412 .0691 15.0779 .0000b(YM.X) .1369 .0165 8.3200 .0000b(YX.M) .2508 .0322 7.8002 .0000

INDIRECT EFFECT AND SIGNIFICANCE USING NORMAL DISTRIBUTION Value s.e. LL 95 CI UL 95 CI Z Sig(two) Sobel .1426 .0196 .1042 .1810 7.2723 .0000

BOOTSTRAP RESULTS FOR INDIRECT EFFECT Mean s.e. LL 95 CI UL 95 CI LL 99 CI UL 99 CI Effect .1434 .0239 .1001 .1939 .0879 .2113

SAMPLE SIZE 575

NUMBER OF BOOTSTRAP RESAMPLES 2000

Mediation with longitudinal data

. . . is very complicated but is very illuminating.

Much of structural equation modelling (SEM) is devoted to trying to understand mediational models.

Path modelling with longitudinal data is hard to do but will generate very interesting and interpretable results.

One should obtain the same variables at different times of measurement to allow residualisation.

Hierarchical multiple regression

Dep

Rum

Dep

Time 1 Time 2

This is N-H’s hypothesis: Rum1 should explain unique variance in Dep2 after Dep1 is entered, i.e., explaining new variance in the residual.

1st step

2nd step

Back to Venn diagrams,but with a difference

Dep1

Dep2

Stability coefficient: typically medium to large. The purple area is the residual variance. It represents the change in depression over this time period. The overlapping area refers to the stability of depression over this time period.

Does Rum1 predict any of the residual?

Dep1 Dep2

Rum1

The red area is the amount of variance in Dep2 explained by Rum1, i.e., the degree to which Rum1 explains change in depression over time.

So what’s the answer? Perform a hierarchical regression:

IV DV1) Dep1 Dep22) Rum1

I found that N-H’s hypothesis was not supported: Rum1 did not explain any of the residual of Dep2 after Dep1 was entered.

.72***

.05ns

This is what it looks like

Dep1 Dep2

Rum1

Although Dep1 and Rum1 are significantly correlated, Rum1 doesn’t explain much new variance in Dep2 above and beyondwhat Dep1 can do.

The other direction

IV DV1) Rum1 Rum22) Dep1This result suggests that depression may

contribute to rumination over a 3-month period of time, but not the other way around.

It is recommended that you perform a path analysis in SEM for this type of analysis: allows for concurrent correlation (see next page).

.64***

.08*

Two time points

Rum

Dep

Rum

Dep

Time 1 Time 2

SEM computes all of these relationships simultaneously, allowing one to identify the unique relationships. Enact in LISREL, EQS, AMOS, etc. What did I find?

Same basic results

Rum

Dep

Rum

Dep

Time 1 Time 2

.63***

.74***

.08*.47** .43**

But you get model fit indices, modification indices, and so forth . . . I deleted the Rum1 to Dep2 path because it was non-significant.

Three time points and three variables

Stress Stress Stress

Rum. Rum. Rum.

Dep. Dep. Dep.

Time 1 Time 2 Time 3

N-H

myhypoth

MR

?

SEM yielded this result

Stress Stress Stress

Rum. Rum. Rum.

Dep. Dep. Dep.

.11*

.08*

.74*** .59***

.61*** .59***

.72*** .51***

Powerful but hard to do

Need to have three times of measurement reasonably spread out so that stability coefficients are not too high.

Need to have good measures (small measurement error) or do latent variable longitudinal path modelling.

This type of test of mediation is very stringent because it occurs over time and must be strong enough to exist against the backdrop of the stability coefficients, i.e., these residualised effects explain change in other variables.

Back to types of mediation

Why do I think in terms of null, partial, and full mediation?

Because SEM-based path models yield those three possible patterns.

Sociological point: basic mediation (e.g., B&K) is rooted in multiple regression where issues of model specification are not salient. On the other hand, if you learn SEM, then you will think in terms like I’ve enunciated above. Confusions occur because of the anachronisms in the field of mediation (harkening back to MR rather than embracing path modelling).

Let’s bring mediation to a close

I’ve covered many powerful techniques that derive from the basic mediation paradigm.

Remaining issues:1. Logistic mediation

2. Mediation in other contexts: HLM

Still much to learn and master, but this is a good start.

Workshop on Statistical Mediation and Moderation: Statistical Mediation Paul Jose Victoria...

Documents

Transcript of Workshop on Statistical Mediation and Moderation: Statistical Mediation Paul Jose Victoria...