Outline - math.ntnu.edu.twmath.ntnu.edu.tw/~rtsai/104/bayes/slides/lecture1.pdf · Slide 13— PhD...

8
Faculty of Life Sciences Frequentist and Bayesian statistics Claus Ekstrøm E-mail: [email protected] Outline 1 Frequentists and Bayesians What is a probability? Interpretation of results / inference 2 Comparisons 3 Markov chain Monte Carlo Slide 2— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics What is a probability? Two schools in statistics: frequentists and Bayesians. Slide 3— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Transcript of Outline - math.ntnu.edu.twmath.ntnu.edu.tw/~rtsai/104/bayes/slides/lecture1.pdf · Slide 13— PhD...

Page 1: Outline - math.ntnu.edu.twmath.ntnu.edu.tw/~rtsai/104/bayes/slides/lecture1.pdf · Slide 13— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics In summary • Afrequentistisapersonwhoselong-runambitionisto

Faculty of Life Sciences

Frequentist and Bayesian statistics

Claus EkstrømE-mail: [email protected]

Outline

1 Frequentists and Bayesians• What is a probability?• Interpretation of results / inference

2 Comparisons

3 Markov chain Monte Carlo

Slide 2— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

What is a probability?Two schools in statistics: frequentists and Bayesians.

Slide 3— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Page 2: Outline - math.ntnu.edu.twmath.ntnu.edu.tw/~rtsai/104/bayes/slides/lecture1.pdf · Slide 13— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics In summary • Afrequentistisapersonwhoselong-runambitionisto

Frequentist school

School of Jerzy Neyman, Egon Pearson and Ronald Fischer.

Slide 4— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Bayesian school

“School” of Thomas Bayes

P(H|D) =P(D|H) ·P(H)!P(D|H) ·P(H)dH

Slide 5— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

FrequentistsFrequentists talk about probabilities in relation toexperiments with a random component.Relative frequency of an event, A, is defined as

P(A) =number of outcomes consistent with A

number of experiments

The probability of event A is the limiting relative frequency.

0 20 40 60 80 100

0.0

0.2

0.4

0.6

0.8

1.0

n

Rel

ativ

e fre

quen

cy

Slide 6— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Page 3: Outline - math.ntnu.edu.twmath.ntnu.edu.tw/~rtsai/104/bayes/slides/lecture1.pdf · Slide 13— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics In summary • Afrequentistisapersonwhoselong-runambitionisto

Frequentists — 2The definition restricts the things we can add probabilities to:

What is the probability of there being life on Mars100 billion years ago?

We assume that there is an unknown but fixed underlyingparameter, θ , for a population (i.e., the mean height onDanish men).Random variation (environmental factors, measurementerrors, ...) means that each observation does not result in thetrue value.

Slide 7— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

The meta-experiment ideaFrequentists think of meta-experiments and consider thecurrent dataset as a single realization from all possibledatasets.

Slide 8— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

The meta-experiment ideaFrequentists think of meta-experiments and consider thecurrent dataset as a single realization from all possibledatasets.

167.2 cm

Slide 8— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Page 4: Outline - math.ntnu.edu.twmath.ntnu.edu.tw/~rtsai/104/bayes/slides/lecture1.pdf · Slide 13— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics In summary • Afrequentistisapersonwhoselong-runambitionisto

The meta-experiment ideaFrequentists think of meta-experiments and consider thecurrent dataset as a single realization from all possibledatasets.

167.2 cm175.5 cm

Slide 8— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

The meta-experiment ideaFrequentists think of meta-experiments and consider thecurrent dataset as a single realization from all possibledatasets.

167.2 cm175.5 cm187.7 cm

Slide 8— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

The meta-experiment ideaFrequentists think of meta-experiments and consider thecurrent dataset as a single realization from all possibledatasets.

167.2 cm175.5 cm187.7 cm182.0 cm

Slide 8— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Page 5: Outline - math.ntnu.edu.twmath.ntnu.edu.tw/~rtsai/104/bayes/slides/lecture1.pdf · Slide 13— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics In summary • Afrequentistisapersonwhoselong-runambitionisto

Confidence intervals

Thus a frequentist believes that a population mean is real,but unknown, and unknowable, and can only be estimatedfrom the data.Knowing the distribution for the sample mean, he constructsa confidence interval, centered at the sample mean.

• Either the true mean is in the interval or it is not. Can’tsay there’s a 95% probability (long-run fraction havingthis characteristic) that the true mean is in this interval,because it’s either already in, or it’s not.

• Reason: true mean is fixed value, which doesn’t have adistribution.

• The sample mean does have a distribution! Thus mustuse statements like “95% of similar intervals wouldcontain the true mean, if each interval were constructedfrom a different random sample like this one.”

Slide 9— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Maximum likelihood

How will the frequentist estimate the parameter?

Slide 10— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Maximum likelihood

How will the frequentist estimate the parameter?Answer: maximum likelihood.

Slide 10— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Page 6: Outline - math.ntnu.edu.twmath.ntnu.edu.tw/~rtsai/104/bayes/slides/lecture1.pdf · Slide 13— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics In summary • Afrequentistisapersonwhoselong-runambitionisto

Maximum likelihood

How will the frequentist estimate the parameter?Answer: maximum likelihood.

Basic idea

Our best estimate of the parameter(s) are the one(s) thatmake our observed data most likely. We know what we haveobserved so far (our data). Our best “guess” would thereforebe to select parameters that make our observations mostlikely.

Binomial distribution:

P(Y = y) =

"n

y

#py (1−p)n−y

Slide 10— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

BayesiansEach investigator is entitled to his/hers personal belief ... theprior information. No fixed values for parameters but adistribution.

All distributions are subjective.Yours is as good as mine.

Can still talk about the mean— but it is the mean of mydistribution.

In many cases trying tocircumvent by using vaguepriors.

Thumb tack pin pointingdown:

0.0 0.2 0.4 0.6 0.8 1.0

0.0

0.5

1.0

1.5

2.0

2.5

3.0

Theta

Prio

r dis

tribu

tion

Slide 11— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Credibility intervals

Bayesians have an altogether different world-view.

They say that only the data are real. The population mean isan abstraction, and as such some values are more believablethan others based on the data and their prior beliefs.

Slide 12— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Page 7: Outline - math.ntnu.edu.twmath.ntnu.edu.tw/~rtsai/104/bayes/slides/lecture1.pdf · Slide 13— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics In summary • Afrequentistisapersonwhoselong-runambitionisto

Credibility intervals

Bayesians have an altogether different world-view.

They say that only the data are real. The population mean isan abstraction, and as such some values are more believablethan others based on the data and their prior beliefs.

The Bayesian constructs a credibility interval, centered nearthe sample mean, but tempered by “prior” beliefs concerningthe mean.

Now the Bayesian can say what the frequentist cannot:“There is a 95% probability (degree of believability) that thisinterval contains the mean.”

Slide 12— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Comparison

Advantages Disadvantages

Frequentist Objective Confidence intervals(not quite the desi-red)

Calculations

Bayesian Credibility intervals(usually the desired)

Subjective

Complex models Calculations

Slide 13— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

In summary

• A frequentist is a person whose long-run ambition is tobe wrong 5% of the time.

• A Bayesian is one who, vaguely expecting a horse, andcatching a glimpse of a donkey, strongly believes he hasseen a mule.

Slide 14— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Page 8: Outline - math.ntnu.edu.twmath.ntnu.edu.tw/~rtsai/104/bayes/slides/lecture1.pdf · Slide 13— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics In summary • Afrequentistisapersonwhoselong-runambitionisto

In summary

• A frequentist is a person whose long-run ambition is tobe wrong 5% of the time.

• A Bayesian is one who, vaguely expecting a horse, andcatching a glimpse of a donkey, strongly believes he hasseen a mule.

A frequentist uses impeccable logic to answer thewrong question, while a Bayesean answers the rightquestion by making assumptions that nobody canfully believe in.

P. G. Hamer

Slide 14— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Jury duty

Slide 15— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics

Example: speed of light

What is the speed of light in vacuum “really”?Results (m/s)

299792459.2

299792460.0

299792456.3

299792458.1

299792459.5

Slide 16— PhD (Aug 23rd 2011) — Frequentist and Bayesian statistics