Lecture 6 - Statistical tests 6 - Statistical tests.pdf · Lecture 6 – Statistical Tests...

Lecture 6 – Statistical Tests Confidence intervals Student test ANOVA test Fisher test for variances Nonparametric tests

Gauss Curve

Average=100, Standard deviation=15

Sample from a normal population

In the following 9 images you will see: In the first one – the distribution of

values for a Gauss population In the second - the distribution of

averages computed for groups of twoindividuals

In the next images, the distribution ofaverages computed for groups of 3, 4, 9,16, 25, 36 and 100 individuals

Conclusion: Averages calculated on a sample of

size n drawn from a Gaussiandistributed population have: A Gaussian distribution An average equal to the average of the

population of origin Standard deviation equal to the deviation

of the population divided by square rootof n= STANDARD ERROR!

Err = σ / √n

Confidence interval- definition We call confidence interval for the

mean an interval of real numberswhere we are almost certain that thereal mean is located.

The level of confidence can bechosen; usually it is 95% or 99%

95,0

n

stmn

stmP cc

Example: CI95%

Values: Latencies measured over the optic nerve Average: 112,2 Standard deviation: 12,5 Number of cases: 156 Standard error: Coefficient used for 95% confidence: The limits of the interval are:

=> CI95%=[110.24; 114,16]

What is a statistical test ?

It is a decision method that helps us tovalidate or invalidate a statisticalhypothesis with a certain degree ofconfidence.

Statistical tests

Statistical tests verify the truthfulnessof statistical hypotheses => a processcalled statistical inference hypothesis H0 (or null hypothesis) data have

no connections between them/ comparedvalues do not differ

hypothesis H1 (or alternate hypothesis) datahave connections between them/ comparedvalues do differ

Statistical tests

The result of the test is denoted as “p”. “p” is a number between 0 and 1 (or 0% and 100%),

and it represents the probability to make an error if wereject H0, the null hypothesis.

Interpreting p (the same for all statistical tests): p > 0.05, the statistical relationship is not significant (NS). p < 0.05, the statistical relationship is significant (S, 95%

confidence). p < 0.01, the statistical relationship is significant (S, 99%

confidence). p < 0.001, the statistical relationship is highly significant

(HS, 99.9% confidence).

Student “t” test Test for comparing two means when the

standard deviations are equal It can be used if:

1. measurements from the two samples areindependent

2. the samples come from populations that arenormally distributed (which must be verified beforeapplying the test)

3. the populations from which the samples are drawnhave equal dispersions (or standard deviations)

Student “t” testWe call H0 the null hypothesis - the assumption

that the averages of the populations (from whichthe samples are drawn) are equal -----H0 : m1 = m2

We call H1 the alternate hypothesis - theassumption that the averages of the populations aredifferent ----- H1 : m1 m2

If the test doesn’t reject H0, we say that data donot support the hypothesis that the populations’means are different

If the test rejects H0, we say that data support thehypothesis that the populations’ means are different

Example

Pacient No. Localised infection Sepsis1 25 552 20 883 110 534 45 305 50 726 50 527 72 918 53 709 30 11010 50 12311 27 5612 35 3113 85 10014 22 7015 78 4416 65 7017 85 9018 85 12319 55 8520 25 7221 50 7522 85 5023 75 8524 40 10725 50 110

Media 54.68 76.48Deviaţia standard 24.42 26.55

Dispersia 596.56 704.84C.V. (%) 44.67% 34.81%

We analyzed 25 patients with localised infectionand 25 with sepsis, among other measurementswe recorded ESR value.

p<0,05 – we reject H0with a confidence of 95%

54.68

76.48

0

20

40

60

80

100

120

Localised infection Sepsis

Mea

n ±

stan

dar

d d

evia

tio

n

p test Student = 0.004 - S

ESR

t‐Test: Two‐Sample Assuming Equal Variances

Localised infection Sepsis EXPLANATIONSMean 54.68 76.48 Samples’ averagesVariance 596.56 704.84 Samples’ varianceObservations 25 25 Number of patientsPooled Variance 650.70Hypothesized Mean Difference 0df 48 Degrees of freedom: 25+25‐2t Stat ‐3.02149 Computed t valueP(T<=t) one‐tail 0.00201t Critical one‐tail 1.67722P(T<=t) two‐tail 0.00403 p – test resultt Critical two‐tail 2.01063 Theoretical t value)

Confidence interval analysis

We notice that the upper limit of the 95% confidence interval forthe lower average is below the lower limit of the 95% for thehigher average.In conclusion, we can say that the two averages are different witha confidence level of 95% - which we did using Student’s t test.

Parameter Localised infection SepsisNo. of patients 25 25Mean value 54.68 76.48Standard deviation 24.42 26.55Standard error 4.88 5.31t95% for df=48 2.064 2.064Error level for 95% conf. 10.08 10.96Lower limit 44.60 65.52Upper limit 64.76 87.44

Student’s t test - variants

t test for samples with equalvariances

t test for samples with unequalvariances

t test for samples with pairedmeasurements

These variants differ in how the coefficient t is calculated, and thus the

p-value.

ANOVA test Compares the averages of several samples simultaneously. H0: m1 = m2 = m3 = m4 (for 4 samples) H1: at least two averages are significantly different The result is a number p that is interpreted as follows:

If p> 0.05 we do not reject H0, we say that the difference is not significant with 95% confidence

If p <0.05 H0 is rejected with a confidence level of 95% -At least two averrages are significantly different

If p <0.001 H0 is rejected with a confidence level of 99.9%. The difference is highly significant

Exemplu

In three cities from Dolj county data were taken on dietaryhabits and their relationship to obesity and diabetes. Amongother data we collected the weight of individuals as well asdata on smoking

Individuals, regardless of gender or age group, were dividedinto four categories: non-smokers, former smokers, lightsmokers (less than 10 cigarettes per day) and smokers (morethan 10 cigarettes per day)

An interesting question is whether there was a link betweensmoking habit and body weight in these individuals.

H0: Whether or not smoking, body weight is the same H1: At least two of the four categories had different body

weights

ANOVA test - results

Fischer’s test is used to verify the equalityof variances for two normally distributedindependent variables. Null hypothesis is H0: σ1

2=σ22

Bartlett's test is used to verify theequality of variances for several normallydistributed independent variables Null hypothesis is H0: σ1

2=σ22=...=σk

2

THEY ARE PARAMETRIC TESTS

Variance comparison tests

Nonparametric tests These are tests that do not make assumptions

about data distribution.

They can be applied in all circumstances, if themeasurements are independent

Usually they are used when we are not allowedto apply a parametric test

These tests have less accurate results thanparametric tests, so we apply them only when itis not possible to apply a parametric test

Nonparametric tests

Comparing means Random data Paired data

2 samples Mann-Whitney Wilcoxon

>2 samples Kruskal-Wallis Friedman

Comparingvariances Levene

Lecture 6 - Statistical tests 6 - Statistical tests.pdf · Lecture 6 – Statistical Tests...

Documents

Transcript of Lecture 6 - Statistical tests 6 - Statistical tests.pdf · Lecture 6 – Statistical Tests...