The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding,...

45
1 The design of animal experiments Michael FW Festing c/o Understanding Animal Research, 25 Shaftsbury Av. London, UK. [email protected]

Transcript of The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding,...

Page 1: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

1

The design of animal

experiments

Michael FW Festing c/o Understanding Animal Research, 25 Shaftsbury

Av. London, UK.

[email protected]

Page 2: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

2

Replacement

e.g. in-vitro methods, less sentient animals Refinement

e.g. anaesthesia and analgesia, environmental

enrichment Reduction

Research strategy

Controlling variability

Experimental design and statistics

Principles of Humane Experimental

Technique

(Russell and Burch 1959)

Page 3: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

3

A well designed experiment

Absence of bias Experimental unit, randomisation, blinding

High power Low noise (uniform material, blocking, covariance)

High signal (sensitive subjects, high dose)

Large sample size

Wide range of applicability Replicate over other factors (e.g. sex, strain): factorial

designs

Simplicity

Amenable to a statistical analysis

Page 4: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

4

The animal as the experimental unit

Animals individually treated. May be individually housed or grouped

N=8 n=4

Page 5: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

5

A cage as the Experimental

Unit.

Treatment in water or diet.

N=4 n=2

Treated Treated Control Control

Page 6: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

6

An animal for a period of time: repeated

measures or crossover design

Animal 1 2 3

Treatment 1 Treatment 2

N 4 4 4

N=12 n= 6

Page 7: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

7

Teratology: mother treated,

young measured

Mother is the experimental unit.

N=2 n=1

Page 8: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

8

Failure to identify the experimental unit

correctly in a 2(strains) x 3(treatments) x

6(times) factorial design

ELD group ELD group

Single cage of 8 mice killed at each time point (288 mice in total)

Page 9: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

9

Experimental units must be

randomised to treatments

Physical: numbers on cards. Shuffle and take

one

Tables of random numbers in most text

books

Use computer. e.g. EXCEL or a statistical

package such as MINITAB

Page 10: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

10

Randomisation

Original Randomised 1 2 1 3 1 3 1 1 2 2 2 1 2 2 2 1 3 3 3 2 3 3 3 1

NB Randomisation should include housing and order in which observations are made

Page 11: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

11

Failure to randomise and/or blind

leads to more “positive” results

Blind/not blind odds ratio 3.4 (95% CI 1.7-6.9) Random/not random odds ratio 3.2 (95% CI 1.3-7.7) Blind Random/ odds ratio 5.2 (95% CI 2.0-13.5) not blind random 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003 Acad. emerg. med. 10:684-687

Page 12: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

12

Some factors (e.g. strain, sex) can not be

randomised so special care is needed to ensure

comparability

Outbred TO (8-12 weeks

commercial)

Inbred CBA (12-16

weeks Home bred)

Six cages of 7-9 mice of each strain: error bars are SEMs

"CBA mice showed greater

variability in body weights than

TO mice..."

Page 13: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

13

A well designed experiment

Absence of bias Experimental unit, randomisation, blinding

High power Low noise (uniform material, blocking, covariance)

High signal (sensitive subjects, high dose)

Large sample size

Wide range of applicability Replicate over other factors (e.g. sex, strain): factorial

designs

Simplicity

Amenable to a statistical analysis

Page 14: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

14

High power: (good chance of detecting the effect

of a treatment, if there is one)

High Signal/Noise ratio

= High Standardized effect size = High d=|m1-m2|/s

= High (Difference between means)/SD

Student’s t =( X1-X2)/Sqrt (2S2/n)

Page 15: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

15

Power Analysis for sample size and

effects of variation

A mathematical relationship between six variables

Needs subjective estimate of effect size to be detected (signal)

Has to be done separately for each character

Not easy to apply to complex designs

Essential for expensive, simple, large experiments (clinical trials)

Useful for exploring effect of variability

A second method “The Resource Equation” is described later

Page 16: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

16

Power analysis: the variables

Sample size

Signal a) Effect size of scientific interest

or b) actual response

Chance of a false positive result. Significance level

(0.05)

Sidedness of statistical test (usually 2-sided)

Power of the Experiment (80-90%?)

Noise Variability of the

experimental material

Page 17: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

17

Group size and Signal/noise

ratio

0

20

40

60

80

100

120

140

0 0.5 1 1.5 2 2.5 3

Effect size (Std. Devs.)

Gro

up

siz

e

90%

80%

Assuming 2-sample, 2 sided t-test and 5% significance level

Signal/noise ratio

Power

Neutral

Bad

Good

Page 18: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

18

Comparison of two anaesthetics for dogs

under clinical conditions (Vet. Anaesthes. Analges.)

Unsexed healthy clinic dogs, • Weight 3.8 to 42.6 kg. • Systolic BP 141 (SD 36) mm Hg

Assume: • a 20 mmHg difference between anaesthetics is of clinical importance, • a significance level of a=0.05 • a power=90% • a 2-sided t-test

Signal/Noise ratio 20/36 = 0.56 Required sample size 68/group

Page 19: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

19

Power and sample size

calculations using nQuery Advisor

Page 20: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

20

A second paper described:

• Male Beagles weight 17-23 kg • mean BP 108 (SD 9) mm Hg. • Want to detect 20mm difference between groups (as before)

With the same assumptions as previous slide:

Signal/noise ratio = 20/9 = 2.22 Required sample size 6/group

Page 21: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

21

Summary for two sources of dogs: aim is to

be able to detect a 20mmHg change in blood

pressure

Type of dog SDev Signal/noise Sample %Power (n=8) size/gp(1) (2)

Random dogs 36 0.56 68 18 Male beagles 9 2.22 6 98 (1) Sample size: 90% power (2) Power, Sample size 8/group

Assumes a=5%, 2-sided t-test and effect size 20mmHg

The scientific dilemma: With small sample sizes we can not detect an important effect in genetically heterogeneous animals. We can detect the effect in genetically homogeneous animals, but are they representative?

Page 22: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

22

Variation in kidney weight in

58 groups of rats

0

10

20

30

40

50

60

70

80

90

1 5 9 13 17 21 25 29 33 37 41 45 49 53 57

Sample number

Va

ria

bil

ity Mycoplasma

Outbred

F1

F2

Gartner,K. (1990), Laboratory Animals, 24:71-77.

Page 23: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

23

Required sample sizes

Factor Type Std.Dev Signal/

noise*

Sample

size

Power**

Genetics F1 hybrid 13.5 0.74 30 80

F2 hybrid 18.4 0.54 55 53

Outbred 20.1 0.49 67 46

Disease Mycoplasma

free

18.6 0.54 55 53

With

Mycoplasma

43.3 0.23 298 14

*signal is 10 units, two sided t-test, a=0.05, power = 80% ** Assuming fixed sample size of 30/group

Page 24: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

24

The randomised block design: another

method of controlling noise

B C A

A C B

B A C

A C B

B C A B1 B2 B3 B4 B5

Treaments A, B & C

• Randomisation is within-block • Can be multiple differences

between blocks • Heterogeneous age/weight • Different shelves/rooms • Natural structure (litters) • Split experiment in time

Page 25: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

25

A randomised block

experiment

0

50

100

150

200

250

300

350

400

450

500

1 2 3

Week

Ap

op

tosis

sco

re

Control

CGP

STAU

365 398 421 423 432 459 308 320 329

Treatment effect p=0.023 (2-way ANOVA)

Page 26: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

26

Analysis of apoptosis data

Analysis of Variance for Score

Source DF SS MS F P

Block 2 21764.2 10882.1 114.82 0.000

Treatmen 2 2129.6 1064.8 11.23 0.023

Error 4 379.1 94.8

Total 8 24272.9

Page 27: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

27

-10.0 -7.5 -5.0 -2.5 0.0 2.5 5.0 7.5

0

1

2

3

Residual

Fre

quency

Histogram of Residuals

0 1 2 3 4 5 6 7 8 9

-20

-10

0

10

20

Observation Number

Resid

ual

I Chart of Residuals

Mean=3.16E-14

UCL=20.17

LCL=-20.17

300 350 400 450

-10

0

10

Fit

Resid

ual

Residuals vs. Fits

-1.5 -1.0 -0.5 0.0 0.5 1.0 1.5

-10

0

10

Normal Plot of Residuals

Normal Score

Re

sid

ual

Residual Model Diagnostics

Page 28: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

28

Another method of determining sample size:

The Resource Equation

Depends on the law of diminishing returns

Simple. No subjective parameters

Useful for complex designs and/or multiple outcomes

(characters)

Does not require estimate of Standard Deviation

Crude compared with Power Analysis

E= (Total number of animals)-(number of groups) 10<E<20 (but give some tolerance)

Page 29: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

29

0 5 10 15 20 25 30 35

2.0

4.5

7.0

9.5

12.0

Degrees of freedom

Stu

den

t's t

, 5

% c

ritica

l valu

e

E= (total numbers)-(number of groups)

10<E<20

The Resource Equation & Sample Size

But if experimental subjects are cheap (e.g. multi-well plates, E can be much higher

Page 30: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

30

A well designed experiment

Absence of bias Experimental unit, randomisation, blinding

High power Low noise (uniform material, blocking, covariance)

High signal (sensitive subjects, high dose)

Large sample size

Wide range of applicability Replicate over other factors to (e.g. sex, strain) to increase

generality: factorial designs

Simplicity

Amenable to a statistical analysis

Page 31: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

31

Factorial designs

Single factor design

Treated Control

E=16-2 = 14

One variable at a time (OVAT)

Treated Control Treated Control

E=16-2 = 14 E=16-2 = 14

Factorial design

Treated Control

E=16-4 = 12

Page 32: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

32

Factorial designs

(By using a factorial design)”.... an experimental investigation, at the same time as it is made more comprehensive, may also be made more efficient if by more efficient we mean that more knowledge and a higher degree of precision are obtainable by the same number of observations.” R.A. Fisher, 1960

Page 33: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

33

A 4x2 factorial design

Analysed with Student’s t-test: This is not appropriate because: 1. Each test is based on too few animals (n=3-4), so lacks power 2. It does not indicate whether there are strain differences in protein thiol status 3. It does not indicate whether dose/response differs between strains 4. A two-way design should be analysed using a 2-way ANOVA

Page 34: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

34

Incorrect statistical analysis leading to

excessive numbers of animals

8 mice per group 8 groups = 64 mice. E= 64-8 =56

Alternative 3 mice per group: 8 groups E=24-8 = 16 Saving:40 mice Formal test of interaction

One experiment or 4 separate experiments?

Page 35: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

35

2 (strains) x 4 (Animal units)

factorial

Page 36: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

36

Effect of chloramphenicol

(2000mg/kg) on RBC count

Strain Control Treated C3H 7.85 7.81 8.77 7.21 8.48 6.96 8.22 7.10 CD-1 9.01 9.18 7.76 8.31 8.42 8.47 8.83 8.67

Tests: Use a two-way ANOVA with interaction

1. Do the treatment means averaged across strains differ?

2. Do the strains differ, averaged across treatments

3. Do the two strains respond to the same extent?

Should not be analysed using two t-tests 1. Each test lacks power due to small sample size 2. Will not give a test of whether strains differ in response

Page 37: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

37

A 2x2 factorial design with

interaction

Source DF SS MS F P

strain 1 2.4414 2.4414 13.13 0.003

Treatment 1 0.8236 0.8236 4.43 0.057

strain*treat. 1 1.4702 1.4702 7.91 0.016

Error 12 2.2308 0.1859

Total 15 6.9659

6.5

7

7.5

8

8.5

9

Control Treated Control Treated

Strain and treatment

Red

blo

od

cell

co

un

t

C3H CD-1

Pooled variance

Page 38: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

38

Use of several inbred strains to reduce

noise, increase signal and explore

generality

500 1000 1500 2000 2500

CD-1 8 8 8 8 8 8

CBA 2 2 2 2 2

C3H 2 2 2 2 2

BALB/c 2 2 2 2 2

C57BL 2 2 2 2 2

2

2

2

2

Inbred

0

Outbred

Dose of chloramphenicol (mg/kg)

Festing et al (2001) Fd. Chem.Tox. 39:375

Effect of chloramphenicol on mouse haematology

Page 39: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

39

WBC Strain Control Treated CBA 1.90 0.40 CBA 2.60 0.20 C3H 2.10 0.40 C3H 2.20 0.40 BALB/c 1.60 1.30 BALB/c 0.50 1.40 C57BL 2.30 0.80 C57BL 2.20 1.10 CD-1 3.00 1.90 CD-1 1.70 1.90 CD-1 1.50 3.50 CD-1 2.00 1.20 CD-1 3.80 2.30 CD-1 0.90 1.00 CD-1 2.60 1.30 CD-1 2.30 1.60

Example of a factorial compared with a single factor design

Four inbred strains One outbred stock

Page 40: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

40

Signal Noise

Strain N 0 2500 (Difference) (SD) Signal/noise p

CBA 4 2.25 0.30 1.95 0.34 5.73

C3H 4 2.15 0.40 1.85 0.34 5.44

BALB/c 4 1.05 1.35 (-0.30) 0.34 (-0.88)

C57BL 4 2.25 0.95 1.30 0.34 3.82

Mean 16 1.93 1.20 0.73 0.34 2.15 <0.001

Dose * strain <0.001

WBC counts following chloramphenicol at

2500mg/kg

Signal Noise

Strain N 0 2500 (Difference) (SD) Signal/noise p

CD-1 16 2.23 1.83 0.40 0.86 0.47 0.38

White blood cell counts

Page 41: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

41

Genetics is important: Twenty two Nobel Prizes since 1960

for work depending on inbred strains

Cancer

mmTV

Transmissable

encephalopathacies/prions

Pruisner

Retroviruses, Oncogenes & growth factors

Cohen, Levi-montalcini, Varmus, Bishop, Baltimore, Temin

Humoral immunity/antibodies

T-cell receptor

Tonegawa, Jerne

Cell mediated immunity

Immunological tolerance

H2 restriction, immune responses

Medawar, Burnet, Doherty, Zinkanagel

Benacerraf (G.pigs)

Genetics

Snell C.C. Little, DBA, 1909

Inbred Strains and derivatives

Jackson Laboratory

monoclonal antibodies

BALB/c mice

Kohler and Millstein

Smell Axel & Buck

ES cells & “knockouts” Evans, Capecchi, Smithies

Page 42: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

42

18th Annual Short Course on Experimental

Models of Human Cancer

August 21-30, 2009

Bar Harbor, ME

courses.jax.org

Page 43: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

43

Conclusions

Five requirements for a good design Unbiased (randomisation, blinding)

Powerful (signal/noise ratio: control variability)

Wide range of applicability (factorial designs, common but frequently analysed incorrectly)

Simple

Amenable to statistical analysis

Mistakes in design and analysis are common

Better training in experimental design would improve the quality of research, save money, time and animals

Page 44: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

44

Page 45: The design of animal experiments · 2021. 1. 29. · 290 animal studies scored for blinding, randomisation and positive/negative outcome, as defined by authors Babasta et al 2003

45