hypothesis testing with special focus on simulation

Hypothesis Testing for Simulation 1

hypothesis testing with special focus on

simulation

Hypothesis Test answers yes/no question with some statistical certainty

H0 = default hypothesisis a statement

Ha = alternate hypothesisis the precise opposite

X = test statistic (RANDOM!)sufficient (uses all avail. data)often Z, T, N are used as notation

FX = its probability distribution

a = P[reject H0 | H0 true]

ca = critical region for a

a = P[X in ca | H0]

a is our (controllable) risk

TWISTED LOGIC

We WANT to reject H0 and conclude Ha, so...We make a very small, so...If we can reject, we have strong

evidence that Ha is true

This construct often leads to inconclusive results“There is no significant statistical

evidence that...”

IMPORTANT

Inability to reject <> H0 true

POWER OF THE TEST

b = P[X not in ca | Ha]

1-b = P[correctly rejecting]

VENACULAR

a is type I errorProbability of incorrectly rejecting

b is type II errorProbability of incorrectly missing the opportunity to reject

UNOFFICIAL VENACULAR

type III error – answered the wrong question

type IV error – perfect answer delivered too late

EXAMPLE!

Dial-up ISP has long experience & knows...

STHdownloadE

We get DSL, observe 12 samples

9.11ˆ

IS DSL FASTER?

H0: mDSL = 50

Ha: mDSL < 50

test with P[type I] = 0.01

PROBABILITY THEORY

Z ~ tn-1Must know the probability distribution of the

test statistic IOT construct critical region

for n = 12, a = 0.01, ca = -2.718

99% of the probabilityabove -2.718

our test statistic-2.33

129.11

)5042(

0.021 called the p-value

Given H0, we expect to see a test statistic as extreme as Z roughly 2% of the time.

-2.718(0.01)

-1.796(0.05)-2.33

(0.021)

CONFIDENCE INTERVALS

For a given aP[la <= m <= ua] = 1-a

mBased on the sampleSo they are RANDOM!

GOODNESS-OF-FIT TEST

Discrete, categorized dataRolls of diceMiss distances in 5-ft. increments

H0 assumes a fully-specified probability modelHa: the glove does not fit!

TEST STATISTIC

ectedobsX

“chi-squared distribution with gnu degrees of freedom”

n = observations - estimated param

Did you know... if Zi~N(0, 1), then

Z12+ Z2

2+...+ Zn2 ~ cn

H0 always results in a set of category cells with expected frequencies

EXAMPLECoin is tossed 100 timesH0: Coin Fair

CELLS AND EXPECTED FREQUENCIES

EXPECT

EXAMPLE

Cannon places rounds around a targetH0: miss distance ~ expon(0.1m)

Record data in 5m intervals(0-5), (5-10), ...(25+)

EXPONENTIALS

0 10 20 30 40 50 60

E(X)=1/l

RESULTS

RIGHT OBS 1-exp(-0.1x) PROB EXPECT (OBS-EXPECT)^2

5.00 30 0.39 0.39 39.35 2.22

10.00 17 0.63 0.24 23.87 1.97

15.00 21 0.78 0.14 14.47 2.94

20.00 11 0.86 0.09 8.78 0.56

25.00 11 0.92 0.05 5.33 6.05

30+ 10 1.00 0.08 8.21 0.39

100.00 14.14

0 10 20 30 40

observed

expected

TEST RESULTS

Degrees of Freedom6 cells0 parameters estimatedn = 6

For the c62 distribution, the p-

value for 14.14 is about p=0.025

REJECT at any a > 0.025

DIFFERENT H0

H0: the miss distances are exponentially distributed

Ha: the exponential shape is incorrect

We estimate the parameter, we lose one degree of freedom

RESULTS 2

LEFT RIGHT OBS 1-exp(-0.0738x) PROBEXPE

CT (OBS-EXPECT)^2

0.00 5.00 30 0.31 0.31 30.86 0.02

5.00 10.00 17 0.52 0.21 21.34 0.88

10.00 15.00 21 0.67 0.15 14.75 2.65

15.00 20.00 11 0.77 0.10 10.20 0.06

20.00 25.00 11 0.84 0.07 7.05 2.21

25.00 30+ 10 1.00 0.16 15.80 2.13

0 10 20 30 40

observed

expected

p-value for 7.83 is larger than 0.05

CANNOT REJECT

CONCLUSION?

SIMULATION vs. STATISTICS

StatisticsSample is fixed and givenConclusion is unknownSignificance is powerful

SimulationSample is arbitrarily largeConclusion is knownWe need another thought about what is

meaningful

SAMPLE SIZE EFFECT

m = 100s = 10

HOW LARGE IS A DIFFERENCE BEFORE IT IS MEANINGFUL?

mu lower upper sigma

10 101.0468 98.16152 103.9321 5.547101100 101.3426 99.8384 102.8468 9.144828500 101.0861 100.3455 101.8266 10.06773

1000 100.8007 100.2762 101.3253 10.0847

0 200 400 600 800 1000

SUMMARY

You probably knew the mechanics of HT

You might have a new perspective

hypothesis testing with special focus on simulation

Documents

Transcript of hypothesis testing with special focus on simulation

Noun-Phrase Anaphora and Focus: The Informational Load ...Noun-Phrase Anaphora and Focus: The Informational Load Hypothesis Amit Almor Brown University The processing of noun-phrase

Hypothesis Testing for Simulation 1 hypothesis testing with special focus on simulation.

Hypothesis Testing Judicial Analogy Hypothesis Testing Hypothesis testing Null hypothesis Purpose Test the viability Null hypothesis Population.

Testing Hypothesis About Proportions Chapter 20. Objectives Hypothesis Null hypothesis Alternative hypothesis Two-sided alternative One-sided alternative.

The hypothesis that most people already think is true. Ex. Eating a good breakfast before a test will help you focus Notation NULL HYPOTHESIS HoHo.

Simulation of turbocharged SI-engines - with focus on the turbine

Market Simulation Presentation...Winter 2017 Release Market Simulation Focus Page 5 Week Dates (Tuesday thru Friday, Monday is Maintenance) Focus 1 12/5-12/8 EIM Idaho Scenarios RSI

Hypothesis Testing by Simulation: An Environmental Example · HYPOTHESIS GENERATION AND TESTING Designing alternative models Hypothesis No. 1 : 16 two compartments in a simple physical

A MONTE CARLO SIMULATION STUDY OF THE PERFORMANCE OF HYPOTHESIS

Null hypothesis AND ALTERNAT HYPOTHESIS

The so-called Extraterrestrial Hypothesis · Extraterrestrial hypothesis 1 Extraterrestrial hypothesis The extraterrestrial hypothesis (ETH) is the hypothesis that some unidentified

Hypothesis Testing by Simulation: An Environmental Example · HYPOTHESIS TESTING BY SIMULATION : AN ENVIRONMENTAL EXAMPLE Kurt Fedra INTRODUCTION: Hypothesis Testing and Simulation

On Testing the Simulation Theory...wave-particle duality experiments (illustrated in Figures5,6and7) aimed at testing the simulation theory by testing the hypothesis that reality is

DEVELOPING HYPOTHESIS AND RESEARCH QUESTION HYPOTHESIS AND RESEA… · Research Question A research focus should be narrow, not ... For example, “What can be done to prevent substance

Fast Lithography Simulation under Focus Variations for OPC ... lithography... · Fast Lithography Simulation under Focus Variations for OPC and Layout Optimizations Peng Yu a,DavidZ.Pana

Testing of Hypothesis Fundamentals of Hypothesis.

Knowledge gap ecpr · The knowledge gap hypothesis In the most influential formulation of the knowledge gap hypothesis, the focus is on the link between socioeconomic status and changing

SCHOOL OF GRADUATE AND POSTDOCTORAL STUDES our focus … · SCHOOL OF GRADUATE AND POSTDOCTORAL STUDES our focus on the 2019 Hypothesis Experiment Analysis Conclusion. ... Ashmita

Simulation of IC Engines with Special Focus on Spray Models of CI Engines

By Nico Arguelles. .. Sabotage hypothesis Static spark hypothesis Lightning hypothesis Engine failure hypothesis Fuel leak And others.