
Multidimensional Gaussian distribution and classification with Gaussians

Guido Sanguinetti

Informatics 2B: Learning and Data, Lecture 9, 6 March 2012

Informatics 2B: Learning and Data, Lecture 9: Multidimensional Gaussian distribution and classification with Gaussians

Overview

Today’s lecture

Gaussians

The multidimensional Gaussian distribution

Bayes theorem and probability density functions

The Gaussian classifier


(One-dimensional) Gaussian distribution

One-dimensional Gaussian with zero mean and unit variance (µ = 0, σ² = 1):

[Figure: pdf of the Gaussian distribution with mean 0 and variance 1, p(x | µ, σ) plotted for −4 ≤ x ≤ 4; the peak value is about 0.4.]

p(x | µ, σ²) = N(x; µ, σ²) = (1 / √(2πσ²)) exp( −(x − µ)² / (2σ²) )

The multidimensional Gaussian distribution

The d-dimensional vector x is multivariate Gaussian if it has a probability density function of the following form:

p(x | µ, Σ) = (1 / ((2π)^(d/2) |Σ|^(1/2))) exp( −½ (x − µ)ᵀ Σ⁻¹ (x − µ) )

The pdf is parameterized by the mean vector µ and the covariance matrix Σ.

The 1-dimensional Gaussian is a special case of this pdf.

The argument of the exponential involves the quadratic form (x − µ)ᵀ Σ⁻¹ (x − µ).
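The density formula above can be written out directly in code. Below is a minimal Python sketch (not part of the lecture, which uses Matlab later), restricted to d = 2 so that the determinant and matrix inverse can be written by hand; the function name `gauss2d_pdf` is my own:

```python
import math

def gauss2d_pdf(x, mu, Sigma):
    """Density of a 2-d Gaussian N(mu, Sigma) at point x.

    x, mu: length-2 sequences; Sigma: 2x2 nested list.
    Implements p(x | mu, Sigma) = (2*pi)^(-d/2) |Sigma|^(-1/2)
                                  * exp(-0.5 (x-mu)^T Sigma^{-1} (x-mu)).
    """
    d = 2
    det = Sigma[0][0] * Sigma[1][1] - Sigma[0][1] * Sigma[1][0]
    # Closed-form inverse of a 2x2 matrix.
    inv = [[ Sigma[1][1] / det, -Sigma[0][1] / det],
           [-Sigma[1][0] / det,  Sigma[0][0] / det]]
    v = [x[0] - mu[0], x[1] - mu[1]]          # x - mu
    # Quadratic form (x-mu)^T Sigma^{-1} (x-mu).
    q = (v[0] * (inv[0][0] * v[0] + inv[0][1] * v[1])
         + v[1] * (inv[1][0] * v[0] + inv[1][1] * v[1]))
    norm = 1.0 / ((2 * math.pi) ** (d / 2) * math.sqrt(det))
    return norm * math.exp(-0.5 * q)

# At the mean of a spherical Gaussian (Sigma = I), the density is 1/(2*pi).
print(gauss2d_pdf([0, 0], [0, 0], [[1, 0], [0, 1]]))
```

With Σ = I the value at the mean is 1/(2π) ≈ 0.159, matching the peak of the spherical-Gaussian surface plot later in the lecture.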


Covariance matrix

The mean vector µ is the expectation of x:

µ = E[x]

The covariance matrix Σ is the expectation of the outer product of the deviation of x from the mean:

Σ = E[(x − µ)(x − µ)ᵀ]

Σ is a d × d symmetric matrix:

Σij = E[(xi − µi)(xj − µj)] = E[(xj − µj)(xi − µi)] = Σji

The sign of the covariance helps to determine the relationship between two components:

If xj is large when xi is large, then (xj − µj)(xi − µi) will tend to be positive.
If xj is small when xi is large, then (xj − µj)(xi − µi) will tend to be negative.


Correlation matrix

The covariance matrix is not scale-independent. Define the correlation coefficient:

ρ(xj, xk) = ρjk = Sjk / √(Sjj Skk)

This is scale-independent (i.e. independent of the measurement units) and location-independent, i.e. (for a, s > 0):

ρ(xj, xk) = ρ(a xj + b, s xk + t)

The correlation coefficient satisfies −1 ≤ ρ ≤ 1, and

ρ(x, y) = +1 if y = ax + b with a > 0
ρ(x, y) = −1 if y = ax + b with a < 0
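The definition of ρ can be checked numerically. A small Python sketch (the function name `correlation` is my own) that reproduces the ρ = +1 case for an exactly linear relation with positive slope:

```python
import math

def correlation(xs, ys):
    """Sample correlation coefficient rho = S_xy / sqrt(S_xx * S_yy)."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / n
    sxx = sum((x - mx) ** 2 for x in xs) / n
    syy = sum((y - my) ** 2 for y in ys) / n
    return sxy / math.sqrt(sxx * syy)

xs = [1.0, 2.0, 3.0, 4.0]
# y = 2x + 1: exactly linear with a > 0, so rho should be +1.
print(correlation(xs, [2 * x + 1 for x in xs]))
```

Rescaling or shifting either variable (with positive scale factors) leaves the result unchanged, which is the scale- and location-independence claimed above.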


Spherical Gaussian

[Figure: surface and contour plots of p(x1, x2) for a spherical Gaussian; the contours are circles centred on the origin.]

µ = (0, 0)ᵀ   Σ = [1 0; 0 1]   ρ12 = 0

Diagonal Covariance Gaussian

[Figure: surface and contour plots of p(x1, x2) for a diagonal-covariance Gaussian; the contours are axis-aligned ellipses.]

µ = (0, 0)ᵀ   Σ = [1 0; 0 4]   ρ12 = 0

Full covariance Gaussian

[Figure: surface and contour plots of p(x1, x2) for a full-covariance Gaussian; the contours are ellipses whose axes are not aligned with the coordinate axes.]

µ = (0, 0)ᵀ   Σ = [1 −1; −1 4]   ρ12 = −0.5

Parameter estimation

It is possible to show that the mean vector µ̂ and covariance matrix Σ̂ that maximize the likelihood of the training data are given by:

µ̂ = (1/N) ∑ₙ xₙ

Σ̂ = (1/N) ∑ₙ (xₙ − µ̂)(xₙ − µ̂)ᵀ

where the sums run over n = 1, …, N.

The mean of the distribution is estimated by the sample mean and the covariance by the sample covariance.
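These two estimators are straightforward to implement. A Python sketch for 2-d data (the function name `fit_gaussian` is my own; note the 1/N normalizer of the maximum-likelihood estimate, rather than the unbiased 1/(N−1) variant):

```python
def fit_gaussian(data):
    """Maximum-likelihood mean and covariance of a list of 2-d points.

    mu_hat    = (1/N) sum_n x_n
    Sigma_hat = (1/N) sum_n (x_n - mu_hat)(x_n - mu_hat)^T
    """
    n = len(data)
    mu = [sum(x[0] for x in data) / n, sum(x[1] for x in data) / n]
    sigma = [[0.0, 0.0], [0.0, 0.0]]
    for x in data:
        d = [x[0] - mu[0], x[1] - mu[1]]   # deviation from the mean
        for i in range(2):
            for j in range(2):
                sigma[i][j] += d[i] * d[j] / n
    return mu, sigma

# Four points at the corners of a square centred on (1, 1):
mu, sigma = fit_gaussian([[0, 0], [2, 0], [0, 2], [2, 2]])
print(mu, sigma)
```

For this symmetric toy data the fit is µ̂ = (1, 1)ᵀ with unit diagonal covariance and zero off-diagonal terms.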

Example data

[Figure: scatter plot of the example data, X1 against X2.]

Maximum likelihood fit to a Gaussian

[Figure: the example data, X1 against X2, with the maximum-likelihood Gaussian fit.]

Bayes theorem and probability densities

Rules for probability densities are similar to those for probabilities:

p(x, y) = p(x | y) p(y)

p(x) = ∫ p(x, y) dy

We may mix probabilities of discrete variables and probability densities of continuous variables:

p(x, Z) = p(x | Z) P(Z)

Bayes' theorem for continuous data x and class C:

P(C | x) = p(x | C) P(C) / p(x)

P(C | x) ∝ p(x | C) P(C)


Bayes theorem and univariate Gaussians

If p(x | C) is Gaussian with mean µc and variance σ²c:

P(C | x) ∝ p(x | C) P(C)
         ∝ N(x; µc, σ²c) P(C)
         ∝ (1 / √(2πσ²c)) exp( −(x − µc)² / (2σ²c) ) P(C)

Taking logs, we have the log likelihood LL(x | C):

LL(x | C) = ln p(x | µc, σ²c)
          = ½ ( −ln(2π) − ln σ²c − (x − µc)²/σ²c )

The log posterior probability LP(C | x) is:

LP(C | x) ∝ LL(x | C) + ln P(C)
          ∝ ½ ( −ln(2π) − ln σ²c − (x − µc)²/σ²c ) + ln P(C)


Example: 1-dimensional Gaussian classifier

Two classes, S and T, with some observations:

Class S: 10 8 10 10 11 11
Class T: 12 9 15 10 13 13

Assume that each class may be modelled by a Gaussian. The mean and variance of each pdf are estimated by the sample mean and sample variance:

µ(S) = 10   σ²(S) = 1
µ(T) = 12   σ²(T) = 4
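The quoted estimates can be verified directly from the observations. A small Python check (the helper name `ml_mean_var` is my own; it uses the 1/N maximum-likelihood variance, as the lecture does):

```python
def ml_mean_var(xs):
    """Sample mean and maximum-likelihood (1/N) variance of a 1-d sample."""
    n = len(xs)
    mu = sum(xs) / n
    var = sum((x - mu) ** 2 for x in xs) / n
    return mu, var

print(ml_mean_var([10, 8, 10, 10, 11, 11]))   # class S
print(ml_mean_var([12, 9, 15, 10, 13, 13]))   # class T
```

This recovers µ(S) = 10, σ²(S) = 1 and µ(T) = 12, σ²(T) = 4 exactly.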


Gaussian pdfs for S and T

[Figure: the two class pdfs p(x | S) and p(x | T) plotted for 0 ≤ x ≤ 25.]

Example: 1-dimensional Gaussian classifier

The following unlabelled data points are available:

x1 = 10   x2 = 11   x3 = 6

To which class should each of the data points be assigned? Assume the two classes have equal prior probabilities.

Log odds

Take the log odds (posterior probability ratio):

ln [P(S | X = x) / P(T | X = x)]
  = −½ ( (x − µS)²/σ²S − (x − µT)²/σ²T + ln σ²S − ln σ²T ) + ln P(S) − ln P(T)

In the example the priors are equal, so:

ln [P(S | X = x) / P(T | X = x)]
  = −½ ( (x − µS)²/σ²S − (x − µT)²/σ²T + ln σ²S − ln σ²T )
  = −½ ( (x − 10)² − (x − 12)²/4 − ln 4 )

If the log odds are less than 0, assign to T; otherwise assign to S.
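Applying this rule to the three unlabelled points is a one-liner per point. A Python sketch of the equal-prior decision rule (the function name `log_odds` is my own; the class parameters are those estimated above):

```python
import math

def log_odds(x, mu_s=10.0, var_s=1.0, mu_t=12.0, var_t=4.0):
    """Equal-prior log odds ln P(S|x)/P(T|x):
    -0.5 * ((x-mu_s)^2/var_s - (x-mu_t)^2/var_t + ln var_s - ln var_t)."""
    return -0.5 * ((x - mu_s) ** 2 / var_s - (x - mu_t) ** 2 / var_t
                   + math.log(var_s) - math.log(var_t))

for x in (10, 11, 6):
    label = "S" if log_odds(x) >= 0 else "T"
    print(x, round(log_odds(x), 3), label)
```

The log odds are positive at x1 = 10 and x2 = 11 (assign to S) and negative at x3 = 6 (assign to T): although 6 is closer to µ(S), class T has the larger variance.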


Log odds

[Figure: log odds ln P(S | x)/P(T | x) plotted against x for 5 ≤ x ≤ 15.]

Example: unequal priors

Now assume P(S) = 0.3 and P(T) = 0.7. Including this prior information, to which class should each of the above test data points (x1, x2, x3) be assigned?

Again compute the log odds:

ln [P(S | X = x) / P(T | X = x)]
  = −½ ( (x − µS)²/σ²S − (x − µT)²/σ²T + ln σ²S − ln σ²T ) + ln P(S) − ln P(T)
  = −½ ( (x − 10)² − (x − 12)²/4 − ln 4 ) + ln P(S) − ln P(T)
  = −½ ( (x − 10)² − (x − 12)²/4 − ln 4 ) + ln(3/7)
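The prior term simply shifts the equal-prior log odds by ln(3/7) ≈ −0.847. A Python sketch of the unequal-prior rule (the name `log_odds_prior` is my own; parameters as estimated above):

```python
import math

def log_odds_prior(x, p_s=0.3, p_t=0.7):
    """ln P(S|x)/P(T|x) with priors P(S) and P(T) included."""
    mu_s, var_s, mu_t, var_t = 10.0, 1.0, 12.0, 4.0
    ll = -0.5 * ((x - mu_s) ** 2 / var_s - (x - mu_t) ** 2 / var_t
                 + math.log(var_s) - math.log(var_t))
    return ll + math.log(p_s) - math.log(p_t)

for x in (10, 11, 6):
    print(x, "S" if log_odds_prior(x) >= 0 else "T")
```

With these priors only x1 = 10 is still assigned to S; x2 = 11, which was (just) on the S side with equal priors, is now assigned to T.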


Log odds

[Figure: log odds against x for P(S) = 0.5 and P(S) = 0.3; the P(S) = 0.3 curve is shifted down by |ln(3/7)| ≈ 0.85.]

Multivariate Gaussian classifier

Multivariate Gaussian (in d dimensions):

p(x | µ, Σ) = (1 / ((2π)^(d/2) |Σ|^(1/2))) exp( −½ (x − µ)ᵀ Σ⁻¹ (x − µ) )

Log likelihood:

LL(x | µ, Σ) = −(d/2) ln(2π) − ½ ln |Σ| − ½ (x − µ)ᵀ Σ⁻¹ (x − µ)

If p(x | C) ∼ p(x | µ, Σ), the log posterior probability is:

ln P(C | x) ∝ −½ (x − µ)ᵀ Σ⁻¹ (x − µ) − ½ ln |Σ| + ln P(C)
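The classifier picks the class with the largest log posterior. A minimal 2-d Python sketch (the function name `log_posterior` and the two class parameter sets are my own illustrative choices, not the lecture's data):

```python
import math

def log_posterior(x, mu, Sigma, prior):
    """Unnormalised log posterior for a 2-d Gaussian class:
    -0.5 (x-mu)^T Sigma^{-1} (x-mu) - 0.5 ln|Sigma| + ln P(C)."""
    det = Sigma[0][0] * Sigma[1][1] - Sigma[0][1] * Sigma[1][0]
    inv = [[ Sigma[1][1] / det, -Sigma[0][1] / det],
           [-Sigma[1][0] / det,  Sigma[0][0] / det]]
    v = [x[0] - mu[0], x[1] - mu[1]]
    q = (v[0] * (inv[0][0] * v[0] + inv[0][1] * v[1])
         + v[1] * (inv[1][0] * v[0] + inv[1][1] * v[1]))
    return -0.5 * q - 0.5 * math.log(det) + math.log(prior)

# Two made-up classes with equal priors, for illustration only.
classes = {
    "A": ([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]], 0.5),
    "B": ([3.0, 3.0], [[1.0, 0.0], [0.0, 1.0]], 0.5),
}
x = [2.5, 2.8]
best = max(classes, key=lambda c: log_posterior(x, *classes[c]))
print(best)
```

With equal priors and equal covariances the decision reduces to distance from the class means, so the test point near (3, 3) is assigned to class B.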


Example

2-dimensional data from three classes (A, B, C).

The classes have equal prior probabilities.

200 points in each class.

Load into Matlab (n × 2 matrices, each row is a data point) and display using a scatter plot:

xa = load('trainA.dat');
xb = load('trainB.dat');
xc = load('trainC.dat');
hold on;
scatter(xa(:,1), xa(:,2), 'r', 'o');
scatter(xb(:,1), xb(:,2), 'b', 'x');
scatter(xc(:,1), xc(:,2), 'c', '*');


Training data

[Figure: scatter plot of the training data for the three classes.]

Gaussians estimated from training data

[Figure: the training data with the contours of the Gaussian estimated for each class.]

Testing data


Testing data — with estimated class distributions

[Figure: the test data with the estimated class distributions.]

Testing data — with true classes indicated

[Figure: the test data with the true classes indicated.]

Classifying test data from class A


Classifying test data from class B

[Figure: classification of the test data from class B.]

Classifying test data from class C


Results

Analyze results by percent correct, and in more detail with a confusion matrix:

Rows of a confusion matrix correspond to the predicted classes (classifier outputs).
Columns correspond to the true class labels.
Element (r, c) is the number of patterns from true class c that were classified as class r.
The total number of correctly classified patterns is obtained by summing the numbers on the leading diagonal.

Confusion matrix in this case:

                       True class
  Predicted class     A    B    C
        A            77    5    9
        B            15   88    2
        C             8    7   89

The overall proportion of test patterns correctly classified is (77 + 88 + 89)/300 = 254/300 ≈ 0.85.
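The leading-diagonal computation is easy to mechanise. A Python sketch (the function name `accuracy` is my own) applied to the confusion matrix above:

```python
def accuracy(confusion):
    """Proportion correct: sum of the leading diagonal over the grand total.
    Rows are predicted classes, columns are true classes (as on the slide)."""
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    total = sum(sum(row) for row in confusion)
    return correct / total

conf = [[77, 5, 9],    # predicted A
        [15, 88, 2],   # predicted B
        [8, 7, 89]]    # predicted C
print(round(accuracy(conf), 4))
```

This reproduces 254/300 ≈ 0.8467, the roughly 85% quoted above.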


Decision Regions

[Figure: decision regions for the 3-class example in the (x1, x2) plane.]

Example: Classifying spoken vowels

10 spoken vowels in American English.

Vowels can be characterised by formant frequencies (resonances of the vocal tract):
there are usually three or four identifiable formants;
the first two formants are written as F1 and F2.

Peterson-Barney data: recordings of spoken vowels by American men, women, and children:
two examples of each vowel per person;
for this example, the data are split into training and test sets;
children's data are not used in this example;
different speakers in the training and test sets.

(See http://en.wikipedia.org/wiki/Vowel for more.)

Classify the data using a Gaussian classifier.

Assume equal priors.

The data

Ten steady-state vowels, frequencies of F1 and F2 at their centre:

IY — “bee”

IH — “big”

EH — “red”

AE — “at”

AH — “honey”

AA — “heart”

AO — “frost”

UH — “could”

UW — “you”

ER — “bird”


Vowel data — 10 classes

[Figure: Peterson-Barney F1-F2 vowel training data, F1 (Hz) against F2 (Hz), with the ten vowel classes labelled.]

Gaussian for class 1 (IY)

[Figure: the vowel training data with the fitted Gaussian for class 1 (IY).]

Gaussian for class 2 (IH)

[Figure: the vowel training data with the fitted Gaussian for class 2 (IH).]

Gaussian for class 3 (EH)

[Figure: the vowel training data with the fitted Gaussian for class 3 (EH).]

Gaussian for class 4 (AE)

[Figure: the vowel training data with the fitted Gaussian for class 4 (AE).]

Gaussian for class 5 (AH)

[Figure: the vowel training data with the fitted Gaussian for class 5 (AH).]

Gaussian for class 6 (AA)

[Figure: the vowel training data with the fitted Gaussian for class 6 (AA).]

Gaussian for class 7 (AO)

[Figure: Peterson-Barney F1-F2 Vowel Training Data (F2 / Hz against F1 / Hz), with the fitted Gaussian for AO overlaid]

Gaussian for class 8 (UH)

[Figure: Peterson-Barney F1-F2 Vowel Training Data (F2 / Hz against F1 / Hz), with the fitted Gaussian for UH overlaid]

Gaussian for class 9 (UW)

[Figure: Peterson-Barney F1-F2 Vowel Training Data (F2 / Hz against F1 / Hz), with the fitted Gaussian for UW overlaid]

Gaussian for class 10 (ER)

[Figure: Peterson-Barney F1-F2 Vowel Training Data (F2 / Hz against F1 / Hz), with the fitted Gaussian for ER overlaid]

Gaussians for each class

[Figure: Peterson-Barney F1-F2 vowel data (F2 / Hz against F1 / Hz) with a fitted Gaussian for each of the ten vowel classes]
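Each class Gaussian is fitted by maximum likelihood: the class mean is the sample mean and the covariance is the sample covariance of that class's training points. A minimal NumPy sketch of this (using synthetic 2-D data, not the actual Peterson-Barney measurements; the function names are illustrative):

```python
import numpy as np

def fit_gaussian(X):
    """Maximum-likelihood mean vector and covariance matrix
    for the rows of X (shape n x d)."""
    mu = X.mean(axis=0)
    Sigma = np.cov(X, rowvar=False, bias=True)  # bias=True -> divide by n (ML)
    return mu, Sigma

def log_gaussian_pdf(x, mu, Sigma):
    """Log of the multivariate Gaussian density N(x; mu, Sigma)."""
    d = len(mu)
    diff = x - mu
    _, logdet = np.linalg.slogdet(Sigma)
    quad = diff @ np.linalg.solve(Sigma, diff)  # (x-mu)^T Sigma^{-1} (x-mu)
    return -0.5 * (d * np.log(2.0 * np.pi) + logdet + quad)

# Fit one Gaussian per class from labelled 2-D data (stand-in vowel clusters).
rng = np.random.default_rng(0)
X1 = rng.normal([300.0, 2300.0], 50.0, size=(100, 2))
X2 = rng.normal([600.0, 1200.0], 50.0, size=(100, 2))
params = [fit_gaussian(X) for X in (X1, X2)]
```

Classification then assigns a point to whichever class gives the largest log p(x | class) + log P(class).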

Test data for class 1 (IY)

[Figure: Peterson-Barney F1-F2 Vowel Test Data (F2 / Hz against F1 / Hz), test items for IY]

Confusion matrix

Predicted    True class
                 IY
IY               20
IH                0
EH                0
AE                0
AH                0
AA                0
AO                0
UH                0
UW                0
ER                0
% corr.         100

Test data for class 2 (IH)

[Figure: Peterson-Barney F1-F2 Vowel Test Data (F2 / Hz against F1 / Hz), test items for IH]

Confusion matrix

Predicted    True class
                 IY    IH
IY               20     0
IH                0    20
EH                0     0
AE                0     0
AH                0     0
AA                0     0
AO                0     0
UH                0     0
UW                0     0
ER                0     0
% corr.         100   100

Test data for class 3 (EH)

[Figure: Peterson-Barney F1-F2 Vowel Test Data (F2 / Hz against F1 / Hz), test items for EH]

Confusion matrix

Predicted    True class
                 IY    IH    EH
IY               20     0     0
IH                0    20     0
EH                0     0    15
AE                0     0     1
AH                0     0     0
AA                0     0     0
AO                0     0     0
UH                0     0     0
UW                0     0     0
ER                0     0     4
% corr.         100   100    75

Test data for class 4 (AE)

[Figure: Peterson-Barney F1-F2 Vowel Test Data (F2 / Hz against F1 / Hz), test items for AE]

Confusion matrix

Predicted    True class
                 IY    IH    EH    AE
IY               20     0     0     0
IH                0    20     0     0
EH                0     0    15     3
AE                0     0     1    16
AH                0     0     0     1
AA                0     0     0     0
AO                0     0     0     0
UH                0     0     0     0
UW                0     0     0     0
ER                0     0     4     0
% corr.         100   100    75    80

Test data for class 5 (AH)

[Figure: Peterson-Barney F1-F2 Vowel Test Data (F2 / Hz against F1 / Hz), test items for AH]

Confusion matrix

Predicted    True class
                 IY    IH    EH    AE    AH
IY               20     0     0     0     0
IH                0    20     0     0     0
EH                0     0    15     3     0
AE                0     0     1    16     0
AH                0     0     0     1    18
AA                0     0     0     0     2
AO                0     0     0     0     0
UH                0     0     0     0     0
UW                0     0     0     0     0
ER                0     0     4     0     0
% corr.         100   100    75    80    90

Test data for class 6 (AA)

[Figure: Peterson-Barney F1-F2 Vowel Test Data (F2 / Hz against F1 / Hz), test items for AA]

Confusion matrix

Predicted    True class
                 IY    IH    EH    AE    AH    AA
IY               20     0     0     0     0     0
IH                0    20     0     0     0     0
EH                0     0    15     3     0     0
AE                0     0     1    16     0     0
AH                0     0     0     1    18     2
AA                0     0     0     0     2    17
AO                0     0     0     0     0     1
UH                0     0     0     0     0     0
UW                0     0     0     0     0     0
ER                0     0     4     0     0     0
% corr.         100   100    75    80    90    85

Test data for class 7 (AO)

[Figure: Peterson-Barney F1-F2 Vowel Test Data (F2 / Hz against F1 / Hz), test items for AO]

Confusion matrix

Predicted    True class
                 IY    IH    EH    AE    AH    AA    AO
IY               20     0     0     0     0     0     0
IH                0    20     0     0     0     0     0
EH                0     0    15     3     0     0     0
AE                0     0     1    16     0     0     0
AH                0     0     0     1    18     2     0
AA                0     0     0     0     2    17     4
AO                0     0     0     0     0     1    16
UH                0     0     0     0     0     0     0
UW                0     0     0     0     0     0     0
ER                0     0     4     0     0     0     0
% corr.         100   100    75    80    90    85    80

Test data for class 8 (UH)

[Figure: Peterson-Barney F1-F2 Vowel Test Data (F2 / Hz against F1 / Hz), test items for UH]

Confusion matrix

Predicted    True class
                 IY    IH    EH    AE    AH    AA    AO    UH
IY               20     0     0     0     0     0     0     0
IH                0    20     0     0     0     0     0     0
EH                0     0    15     3     0     0     0     0
AE                0     0     1    16     0     0     0     0
AH                0     0     0     1    18     2     0     2
AA                0     0     0     0     2    17     4     0
AO                0     0     0     0     0     1    16     0
UH                0     0     0     0     0     0     0    18
UW                0     0     0     0     0     0     0     0
ER                0     0     4     0     0     0     0     0
% corr.         100   100    75    80    90    85    80    90

Test data for class 9 (UW)

[Figure: Peterson-Barney F1-F2 Vowel Test Data (F2 / Hz against F1 / Hz), test items for UW]

Confusion matrix

Predicted    True class
                 IY    IH    EH    AE    AH    AA    AO    UH    UW
IY               20     0     0     0     0     0     0     0     0
IH                0    20     0     0     0     0     0     0     0
EH                0     0    15     3     0     0     0     0     0
AE                0     0     1    16     0     0     0     0     0
AH                0     0     0     1    18     2     0     2     0
AA                0     0     0     0     2    17     4     0     0
AO                0     0     0     0     0     1    16     0     0
UH                0     0     0     0     0     0     0    18     5
UW                0     0     0     0     0     0     0     0    15
ER                0     0     4     0     0     0     0     0     0
% corr.         100   100    75    80    90    85    80    90    75

Test data for class 10 (ER)

[Figure: Peterson-Barney F1-F2 Vowel Test Data (F2 / Hz against F1 / Hz), test items for ER]

Final confusion matrix

Predicted    True class
                 IY    IH    EH    AE    AH    AA    AO    UH    UW    ER
IY               20     0     0     0     0     0     0     0     0     0
IH                0    20     0     0     0     0     0     0     0     0
EH                0     0    15     3     0     0     0     0     0     0
AE                0     0     1    16     0     0     0     0     0     0
AH                0     0     0     1    18     2     0     2     0     0
AA                0     0     0     0     2    17     4     0     0     0
AO                0     0     0     0     0     1    16     0     0     0
UH                0     0     0     0     0     0     0    18     5     2
UW                0     0     0     0     0     0     0     0    15     0
ER                0     0     4     0     0     0     0     0     0    18
% corr.         100   100    75    80    90    85    80    90    75    90

Total: 86.5% correct
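A confusion matrix like the one above can be accumulated in a few lines. In this sketch (illustrative code, not the lecture's actual script) rows index the predicted class and columns the true class, matching the slides' layout; the overall accuracy is the trace divided by the total count:

```python
import numpy as np

def confusion_matrix(y_true, y_pred, n_classes):
    """C[p, t] counts items of true class t that were predicted as class p."""
    C = np.zeros((n_classes, n_classes), dtype=int)
    for t, p in zip(y_true, y_pred):
        C[p, t] += 1
    return C

# Toy example with three classes.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 0, 1, 2, 2, 2]
C = confusion_matrix(y_true, y_pred, 3)

per_class = 100.0 * C.diagonal() / C.sum(axis=0)  # % correct per true class
total = 100.0 * C.trace() / C.sum()               # overall % correct
```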

Classifying the training set

[Figure: Peterson-Barney F1-F2 Vowel Training Data (F2 / Hz against F1 / Hz)]

Training set confusion matrix

Predicted    True class
                 IY    IH    EH    AE    AH    AA    AO    UH    UW    ER
IY               99     8     0     0     0     0     0     0     0     0
IH                3    85    15     0     0     0     0     0     0     3
EH                0     7    69    11     0     0     0     0     0    11
AE                0     0     5    86     4     0     0     0     0     4
AH                0     0     0     3    87     8     3     2     0     1
AA                0     0     0     0     4    82    10     0     0     0
AO                0     0     0     0     5    12    86     2     0     0
UH                0     0     0     0     0     0     2    73    19    10
UW                0     0     0     0     0     0     1    15    79     1
ER                0     2    13     2     2     0     0    10     4    72
% corr.        97.1  83.3  67.6  84.3  85.3  80.4  84.3  71.6  77.5  70.6

Total: 80.2% correct

Decision Regions

[Figure: Peterson-Barney F1-F2 Gaussian Decision Regions in the (F1 / Hz, F2 / Hz) plane]
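Decision regions like these can be rendered by evaluating the classifier at every point of a grid and recording the winning class. A self-contained sketch with two hand-picked Gaussians and equal priors (illustrative parameters, not the fitted vowel models):

```python
import numpy as np

# Two illustrative class models with equal priors.
mus = [np.array([0.0, 0.0]), np.array([4.0, 4.0])]
Sigmas = [np.eye(2), 2.0 * np.eye(2)]

def classify(x):
    """Index of the class with the highest log density at x
    (constant terms shared by all classes are dropped)."""
    scores = []
    for mu, Sigma in zip(mus, Sigmas):
        diff = x - mu
        _, logdet = np.linalg.slogdet(Sigma)
        scores.append(-0.5 * (logdet + diff @ np.linalg.solve(Sigma, diff)))
    return int(np.argmax(scores))

# Label every grid point; contiguous runs of equal labels are the
# decision regions (plot with e.g. matplotlib's pcolormesh).
xs = np.linspace(-4.0, 8.0, 60)
ys = np.linspace(-4.0, 8.0, 60)
regions = np.array([[classify(np.array([x, y])) for x in xs] for y in ys])
```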

Summary

Using Bayes’ theorem with pdfs

The Gaussian classifier: 1-dimensional and multi-dimensional

Vowel classification example
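The first bullet in one formula: with densities, Bayes' theorem reads P(C | x) = p(x | C) P(C) / p(x), where the evidence p(x) = Σ_c p(x | c) P(c) merely normalises the posteriors. A small sketch in log space (illustrative values and function name):

```python
import numpy as np

def posteriors(log_likelihoods, priors):
    """Bayes' theorem with pdfs: P(c | x) = p(x | c) P(c) / p(x),
    computed in log space; the evidence p(x) is the normaliser."""
    log_post = np.array(log_likelihoods) + np.log(np.array(priors))
    log_post -= log_post.max()      # guard against underflow before exp
    post = np.exp(log_post)
    return post / post.sum()

# Two classes with equal priors; class 0 has twice the density at x.
p = posteriors([np.log(0.4), np.log(0.2)], [0.5, 0.5])  # -> [2/3, 1/3]
```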
