Learning objectives

- What is a Bayesian network?
  - factorization
  - conditional independencies: how to read them from the graph
- Equivalence classes of Bayesian networks
- How are these related?
Representing distributions

Given a large number of random variables X_1, …, X_n, how do we represent P(X_1, …, X_n)?

- the number of parameters is exponential in n (curse of dimensionality)
- we need to leverage some structure in P
Independence & representation

For discrete domains with Val(X_i) = {1, …, D} ∀ i:

- the tabular representation P(X_1 = x_1, …, X_n = x_n) = θ_{x_1,…,x_n} is exponential in n: O(D^n) parameters
- assuming independence X_i ⊥ X_j ∀ i, j:
  P(X_1 = x_1, …, X_n = x_n) = ∏_i P(X_i = x_i) = ∏_i θ_{i,x_i}
  where each θ_{i,x_i} is the probability of a particular assignment x_i in the discrete domain
- this gives a linear-sized representation
- however, the independence assumption is too restrictive
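To make the gap concrete, here is a minimal sketch (not from the slides) that counts free parameters under both representations:

```python
# Free-parameter counts for n discrete variables, each with D values.

def full_joint_params(n: int, D: int) -> int:
    # one probability per joint assignment, minus 1 for normalization
    return D ** n - 1

def independent_params(n: int, D: int) -> int:
    # D - 1 free parameters per variable under full independence
    return n * (D - 1)

print(full_joint_params(10, 4))   # 1048575
print(independent_params(10, 4))  # 30
```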
Independence and representation

For a Gaussian distribution, independence takes us from a quadratic to a linear-sized representation:

P(X_1 = x_1, …, X_n = x_n) = (1 / √|2πΣ|) exp( −(1/2) (x − μ)^T Σ^{−1} (x − μ) ), with Σ an n × n matrix

P(X_1 = x_1, …, X_n = x_n) = ∏_i (1 / √(2πσ_i²)) exp( −(x_i − μ_i)² / (2σ_i²) ) assuming independence
Using the chain rule

Pick an ordering of the variables:

P(X) = P(X_1) P(X_2 ∣ X_1) ⋯ P(X_n ∣ X_1, …, X_{n−1})

Parameterize each term P(X_1), P(X_2 ∣ X_1), …, P(X_n ∣ X_1, …, X_{n−1}). Does this compress the representation?

- original #params: D^n − 1
- new #params: (D − 1) + (D² − D) + … + (D^n − D^{n−1}) = D^n − 1

The sum telescopes, so there is no compression yet. But simplifying the conditionals gives a flexible compression of P: a Bayesian network!
Chain rule; simplification

P(X) = P(X_1) P(X_2 ∣ X_1) P(X_3 ∣ X_1, X_2) ⋯ P(X_n ∣ X_1, …, X_{n−1})

An extreme form of simplification: condition every variable on X_1 alone:

P(X) = P(X_1) P(X_2 ∣ X_1) P(X_3 ∣ X_1) ⋯ P(X_n ∣ X_1)

#params: (D − 1) + (n − 1)(D² − D), i.e. O(nD²) instead of O(D^n)
Or naive Bayes

P(class, X) = P(class) P(X_1 ∣ class) P(X_2 ∣ class) ⋯ P(X_n ∣ class)

- independence assumption: X_i ⊥ X_{−i} ∣ class
- for classification (use Bayes rule): P(class ∣ X) ∝ P(class) P(X_1 ∣ class) P(X_2 ∣ class) ⋯ P(X_n ∣ class)
- Example: medical diagnosis (what if two symptoms are correlated?)
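As a minimal sketch of this classification rule (the class prior and per-feature CPTs below are made up for illustration):

```python
import numpy as np

# Categorical naive Bayes: P(class | x) ∝ P(class) ∏_i P(x_i | class).
P_class = np.array([0.4, 0.6])      # P(class), two classes
P_x_given_class = [                 # one table per feature:
    np.array([[0.7, 0.3],           #   rows = class, cols = feature value
              [0.2, 0.8]]),
    np.array([[0.5, 0.5],
              [0.9, 0.1]]),
]

def posterior(x):
    """Return the normalized posterior P(class | x)."""
    p = P_class.copy()
    for i, xi in enumerate(x):
        p = p * P_x_given_class[i][:, xi]  # multiply in P(x_i | class)
    return p / p.sum()

print(posterior([0, 1]))  # posterior over the two classes
```

The per-feature product is exactly the independence assumption above; correlated features (e.g., two correlated symptoms) violate it.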
Simplifying the chain rule; general case

P(X) = P(X_1) P(X_2 ∣ X_1) ⋯ P(X_n ∣ X_1, …, X_{n−1})

Simplify the full conditionals:

P(X) = ∏_i P(X_i ∣ Pa_{X_i})

and represent it using a Directed Acyclic Graph (DAG): a Bayesian network. The ordering used in the chain rule is a topological ordering of this DAG.
DAG; identification

Identifying a DAG:
- does it have a topological ordering?
- is there no directed path from a node to itself?

Example: is this a DAG [figure not shown]? A topological ordering: G, A, B, D, C, E, F. How about this one: A, B, C, G, D, E, F?
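Checking DAG-ness and finding a topological ordering is one and the same computation. A minimal sketch using Kahn's algorithm; the edge list is hypothetical, since the slide's figure is not reproduced here:

```python
from collections import deque

def topological_order(nodes, edges):
    """Return a topological ordering of the nodes, or None if the
    directed graph contains a cycle (i.e., it is not a DAG)."""
    indeg = {v: 0 for v in nodes}
    adj = {v: [] for v in nodes}
    for u, v in edges:
        adj[u].append(v)
        indeg[v] += 1
    queue = deque(v for v in nodes if indeg[v] == 0)
    order = []
    while queue:
        u = queue.popleft()
        order.append(u)
        for w in adj[u]:
            indeg[w] -= 1
            if indeg[w] == 0:
                queue.append(w)
    return order if len(order) == len(nodes) else None

edges = [("G", "D"), ("A", "B"), ("B", "C"), ("B", "D"),
         ("D", "E"), ("C", "E"), ("E", "F")]
print(topological_order("ABCDEFG", edges))  # ['A', 'G', 'B', 'C', 'D', 'E', 'F']
```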
Bayesian network (BN); example

[Figure: the "student" network. Difficulty (D, more difficult), Intelligence (I, more intelligent), Grade (G ∈ {A, B, C}), SAT score (S, better SAT score), Letter (L, better); edges D → G, I → G, I → S, G → L.]

P(I, D, G, S, L) = P(I) P(D) P(G ∣ I, D) P(S ∣ I) P(L ∣ G)

Each factor is given as a Conditional Probability Table (CPT). For one full assignment:

P(i¹, d⁰, g², s¹, l⁰) = P(i¹) P(d⁰) P(g² ∣ i¹, d⁰) P(s¹ ∣ i¹) P(l⁰ ∣ g²) = .7 × .6 × .08 × .05 × .4 ≈ .0007
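Evaluating the factorized joint is just a product of CPT lookups. A minimal sketch, filling in only the CPT entries quoted above:

```python
# CPT entries for the assignment above; all other entries are omitted.
P_I = {1: 0.7}            # P(i^1)
P_D = {0: 0.6}            # P(d^0)
P_G = {(2, 1, 0): 0.08}   # P(g^2 | i^1, d^0)
P_S = {(1, 1): 0.05}      # P(s^1 | i^1)
P_L = {(0, 2): 0.4}       # P(l^0 | g^2)

def joint(i, d, g, s, l):
    """P(i, d, g, s, l) = P(i) P(d) P(g|i,d) P(s|i) P(l|g)."""
    return P_I[i] * P_D[d] * P_G[(g, i, d)] * P_S[(s, i)] * P_L[(l, g)]

print(joint(1, 0, 2, 1, 0))  # 0.000672
```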
Intuition for reasoning in a BN

Answering probabilistic queries P(Y = y ∣ E = e), where E = e is the evidence, is an inference problem. How to calculate it? ... later. For example:

P(L = l¹ ∣ S = s¹) = P(L = l¹, S = s¹) / P(S = s¹), with P(S = s¹) = ∑_{d,i,g,l} P(d, i, g, s¹, l)

Causal reasoning (top-down):
- marginal (prior) probability of getting a good letter: P(l¹) ≈ .50
- posterior given low intelligence: P(l¹ ∣ i⁰) ≈ .389
- ... and an easy exam: P(l¹ ∣ i⁰, d⁰) ≈ .52
Intuition for reasoning in a BN

Evidential reasoning (bottom-up):
- (marginal) prior of high intelligence: P(i¹) ≈ .30
- posterior given a bad letter: P(i¹ ∣ l⁰) ≈ .14
- ... and a bad grade: P(i¹ ∣ l⁰, g³) ≈ .08

Explaining away (v-structure): a difficult exam explains away the grade:

P(i¹ ∣ l⁰, g³, d¹) ≈ .11
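All of these queries can be answered by brute-force enumeration: sum the factorized joint over assignments consistent with the evidence. A sketch for the student network follows; the CPTs are illustrative stand-ins, since the slides' full tables are not reproduced here:

```python
from itertools import product

# Illustrative CPTs for P(I)P(D)P(G|I,D)P(S|I)P(L|G).
P_I = [0.7, 0.3]
P_D = [0.6, 0.4]
P_G = {(0, 0): [0.30, 0.40, 0.30], (0, 1): [0.05, 0.25, 0.70],
       (1, 0): [0.90, 0.08, 0.02], (1, 1): [0.50, 0.30, 0.20]}  # P(G | I, D)
P_S = [[0.95, 0.05], [0.20, 0.80]]                              # P(S | I)
P_L = {1: [0.10, 0.90], 2: [0.40, 0.60], 3: [0.99, 0.01]}       # P(L | G)

vals = {"I": [0, 1], "D": [0, 1], "G": [1, 2, 3], "S": [0, 1], "L": [0, 1]}

def joint(a):
    return (P_I[a["I"]] * P_D[a["D"]] * P_G[a["I"], a["D"]][a["G"] - 1]
            * P_S[a["I"]][a["S"]] * P_L[a["G"]][a["L"]])

def prob(query, evidence):
    """P(query | evidence) by enumeration; both are dicts like {"L": 0}."""
    num = den = 0.0
    for assignment in product(*vals.values()):
        a = dict(zip(vals, assignment))
        if any(a[k] != v for k, v in evidence.items()):
            continue
        p = joint(a)
        den += p
        if all(a[k] == v for k, v in query.items()):
            num += p
    return num / den

print(prob({"I": 1}, {"L": 0}))                   # ≈ .14
print(prob({"I": 1}, {"L": 0, "G": 3}))           # ≈ .08
print(prob({"I": 1}, {"L": 0, "G": 3, "D": 1}))   # ≈ .11 (explaining away)
```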
DAG; semantics

Associating P with a DAG gives us:
- a factorization of the joint probability: P(X) = ∏_i P(X_i ∣ Pa_{X_i})
- conditional independencies in P, read from the DAG
Bayesian networks; factorization

[Figure: the student network.]

P(I, D, G, S, L) = P(I) P(D) P(G ∣ I, D) P(S ∣ I) P(L ∣ G)

In general: P(X) = ∏_i P(X_i ∣ Pa_{X_i})
Bayesian networks; conditional independencies

L ⊥ D, I, S ∣ G: the quality of the letter (L) only depends on the grade (G).

How about the following assertions? Why? Can we read them from the graph?
- D ⊥ S ?
- D ⊥ S ∣ I ?
- D ⊥ S ∣ L ?
Conditional independencies (CIs); notation

1. I(P): the set of all CIs of the distribution P
2. I_ℓ(G): the set of local CIs from the graph (DAG)
3. I(G): the set of all (global) CIs from the graph
Local conditional independencies (CIs)

For any node X_i: X_i ⊥ NonDescendants_{X_i} ∣ Parents_{X_i}

For the graph G [figure not shown]:

I_ℓ(G) = { A ⊥ G ∣ ∅,  B ⊥ G ∣ A,  C ⊥ G, D, A ∣ B,  G ⊥ A, B, C,  D ⊥ A, C ∣ B, G,  E ⊥ A, G ∣ B, C, D,  F ⊥ A, B, C, D, G ∣ E }
Local CIs

For any node X_i: X_i ⊥ NonDescendants_{X_i} ∣ Parents_{X_i}

For the student network G:

I_ℓ(G) = { D ⊥ I, S,  I ⊥ D,  G ⊥ S ∣ I, D,  S ⊥ G, L, D ∣ I,  L ⊥ D, I, S ∣ G }
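Reading off the local CIs is mechanical. A minimal sketch for the student network:

```python
# Enumerate local CIs: X ⊥ NonDescendants_X | Parents_X for every node X.
edges = [("D", "G"), ("I", "G"), ("I", "S"), ("G", "L")]
nodes = ["D", "I", "G", "S", "L"]

def descendants(x):
    out, stack = set(), [x]
    while stack:
        u = stack.pop()
        for a, b in edges:
            if a == u and b not in out:
                out.add(b)
                stack.append(b)
    return out

for x in nodes:
    parents = {a for a, b in edges if b == x}
    nondesc = set(nodes) - descendants(x) - {x} - parents
    if nondesc:
        print(f"{x} ⊥ {sorted(nondesc)} | {sorted(parents)}")
```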
Local CIs from factorization

Given P(X) = ∏_i P(X_i ∣ Pa_{X_i}), we want to show that X_i ⊥ NonDesc_{X_i} ∣ Pa_{X_i} for every X_i, which means

P(X_i, NonDesc_{X_i} ∣ Pa_{X_i}) = P(X_i ∣ Pa_{X_i}) P(NonDesc_{X_i} ∣ Pa_{X_i})

To show this, use the factorized form.
Local CIs from factorization; example

Given P(D, I, G, S, L) = P(D) P(I) P(G ∣ D, I) P(S ∣ I) P(L ∣ G), show S ⊥ G ∣ I:

P(G, S ∣ I) = ∑_{d,l} P(D, I, G, S, L) / ∑_{d,g,s,l} P(D, I, G, S, L)
            = ∑_{d,l} P(D) P(I) P(G ∣ D, I) P(S ∣ I) P(L ∣ G) / ∑_{d,g,s,l} P(D) P(I) P(G ∣ D, I) P(S ∣ I) P(L ∣ G)
            = P(I) P(S ∣ I) ∑_{d,l} P(D) P(G ∣ D, I) P(L ∣ G) / ( P(I) ∑_{d,g,s,l} P(D) P(G ∣ D, I) P(S ∣ I) P(L ∣ G) )
            = P(S ∣ I) ∑_{d,l} P(D) P(G ∣ D, I) P(L ∣ G)
            = P(S ∣ I) P(G ∣ I)

since the denominator's sum equals 1, ∑_l P(L ∣ G) = 1, and ∑_d P(D) P(G ∣ D, I) = P(G ∣ I).
Factorization from local CIs

From the local CIs I_ℓ(G) = {X_i ⊥ NonDesc_{X_i} ∣ Pa_{X_i} ∣ ∀i}:

1. find a topological ordering X_{i_1}, …, X_{i_n} (parents before children)
2. use the chain rule: P(X) = P(X_{i_1}) ∏_{j=2}^{n} P(X_{i_j} ∣ X_{i_1}, …, X_{i_{j−1}})
3. simplify using the local CIs: P(X) = P(X_{i_1}) ∏_{j=2}^{n} P(X_{i_j} ∣ Pa_{X_{i_j}})
Factorization from local CIs; example

Local CIs: I_ℓ(G) = {(D ⊥ I, S), (I ⊥ D), (G ⊥ S ∣ I, D), (S ⊥ G, L, D ∣ I), (L ⊥ D, I, S ∣ G)}

A topological ordering: D, I, G, L, S. Use the chain rule:

P(D, I, G, S, L) = P(D) P(I ∣ D) P(G ∣ D, I) P(L ∣ D, I, G) P(S ∣ D, I, G, L)

then simplify using I_ℓ(G):

P(D, I, G, S, L) = P(D) P(I) P(G ∣ D, I) P(L ∣ G) P(S ∣ I)
Factorization ⇔ local CIs

P factorizes according to G:  P(X) = ∏_i P(X_i ∣ Pa_{X_i})
⇔ I_ℓ(G) holds in P:  I_ℓ(G) ⊆ I(P)
⇔ G is an I-map for P: it does not mislead us about independencies in P
Independence map (I-map); example

G:  C → A, C → B, with P(A, B, C) = P(C) P(A ∣ C) P(B ∣ C)
G′: C → A, with B disconnected, and P(A, B, C) = P(C) P(A ∣ C) P(B)

If P factorizes over G′, it is easy to check that I_ℓ(G′) ⊆ I(P); since I_ℓ(G) ⊆ I_ℓ(G′), the factorization of P over G holds as well ⇒ both G and G′ are I-maps for P.

(The term I-map is used for both graphs and distributions.)
Summary so far

- simplification of the chain rule: P(X) = ∏_i P(X_i ∣ Pa_{X_i})
- Bayes net represented using a DAG (e.g., naive Bayes)
- local conditional independencies I_ℓ = {X_i ⊥ NonDesc_{X_i} ∣ Pa_{X_i} ∣ ∀i} hold in a Bayes net, and imply a Bayes net
Global CIs from the graph

For any subsets of variables X, Y, and Z we can ask: X ⊥ Y ∣ Z? The global CIs I(G) are the set of all such CIs implied by the graph.

factorized form of P ⇒ global CIs: I_ℓ(G) ⊆ I(G) ⊆ I(P)

Algorithm: directed separation (d-separation). Example: C ⊥ D ∣ B, F ?
Three canonical settings

For three random variables:

1. causal / evidential trail: X → Y → Z

P(X, Y, Z) = P(X) P(Y ∣ X) P(Z ∣ Y)

conditional independence: P(Z ∣ X, Y) = P(X, Y, Z) / P(X, Y) = P(X) P(Y ∣ X) P(Z ∣ Y) / ( P(X) P(Y ∣ X) ) = P(Z ∣ Y)

no marginal independence: in general P(X, Z) ≠ P(X) P(Z)
Three canonical settings

2. common cause: X ← Y → Z

P(X, Y, Z) = P(Y) P(X ∣ Y) P(Z ∣ Y)

conditional independence: P(X, Z ∣ Y) = P(X, Y, Z) / P(Y) = P(X ∣ Y) P(Z ∣ Y)

no marginal independence: in general P(X, Z) ≠ P(X) P(Z)
Three canonical settings

3. common effect: X → Y ← Z (a.k.a. collider, v-structure)

P(X, Y, Z) = P(X) P(Z) P(Y ∣ X, Z)

no conditional independence: P(X, Z ∣ Y) = P(X, Y, Z) / P(Y) ≠ P(X ∣ Y) P(Z ∣ Y)

marginal independence: P(X, Z) = ∑_Y P(X, Y, Z) = P(X) P(Z) ∑_Y P(Y ∣ X, Z) = P(X) P(Z)
Three canonical settings

3. common effect, continued: for a descendant W of Y,

no conditional independence given W: P(X, Z ∣ W) ≠ P(X ∣ W) P(Z ∣ W)

even observing a descendant of Y makes X and Z dependent
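A quick numeric check of the v-structure behavior, using a made-up noisy-XOR CPT for P(Y ∣ X, Z):

```python
from itertools import product

# X -> Y <- Z with X, Z fair coins and a noisy-XOR effect Y.
pY1 = {(x, z): 0.9 if x ^ z else 0.1 for x in (0, 1) for z in (0, 1)}

joint = {}
for x, z, y in product((0, 1), repeat=3):
    py = pY1[x, z] if y == 1 else 1 - pY1[x, z]
    joint[x, z, y] = 0.5 * 0.5 * py  # P(x) P(z) P(y | x, z)

# Marginally, X and Z are independent:
print(sum(joint[1, 1, y] for y in (0, 1)), 0.5 * 0.5)  # 0.25 vs 0.25

# Conditioned on Y = 1, they become dependent:
pY = sum(v for (x, z, y), v in joint.items() if y == 1)
p_joint = joint[1, 1, 1] / pY
p_x = sum(joint[1, z, 1] for z in (0, 1)) / pY
p_z = sum(joint[x, 1, 1] for x in (0, 1)) / pY
print(p_joint, p_x * p_z)  # 0.05 vs 0.25: not independent given Y
```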
Putting the three cases together

Consider all paths between variables in X and Y. Example [figure not shown]: X_1, X_2 ⊥ Y_1 ∣ Z_1, Z_2 ?

Checking the paths one at a time: so far, X ⊥ Y ∣ Z holds. Had we not observed Z_1: (X_1, X_2 ⊥ Y_1 ∣ Z_2) ∈ I(G).

D-separation (a.k.a. the Bayes-Ball algorithm)

X ⊥ Y ∣ Z ? See whether at least one ball starting from X reaches Y; Z is shaded.

(image from: https://www.cs.ubc.ca/~murphyk/Bayes/bnintro.html)
D-separation; algorithm

Input: graph G and sets X, Y, Z. Output: X ⊥ Y ∣ Z ?

- mark the variables in Z and all of their ancestors in G
- run breadth-first search starting from X
- stop any trail that reaches a blocked node:
  - the middle node of a collider (v-structure), if it is unmarked
  - a node in Z, otherwise
- X ⊥ Y ∣ Z holds iff no node in Y is reached

Linear time complexity.
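A sketch of this procedure in its standard two-phase "reachable" form; the direction tag records whether a trail entered a node from a child ("up") or from a parent ("down"), which is what distinguishes colliders from chains and forks:

```python
from collections import deque

def d_separated(nodes, edges, X, Y, Z):
    """Return True iff X ⊥ Y | Z holds in the DAG (X, Y, Z are sets)."""
    parents = {v: set() for v in nodes}
    children = {v: set() for v in nodes}
    for u, v in edges:
        parents[v].add(u)
        children[u].add(v)

    # Phase 1: mark Z and all of its ancestors (a collider is open iff marked).
    marked, stack = set(), list(Z)
    while stack:
        v = stack.pop()
        if v not in marked:
            marked.add(v)
            stack.extend(parents[v])

    # Phase 2: BFS over (node, direction) pairs starting from X.
    visited = set()
    queue = deque((x, "up") for x in X)
    while queue:
        v, direction = queue.popleft()
        if (v, direction) in visited:
            continue
        visited.add((v, direction))
        if v in Y and v not in Z:
            return False  # an active trail from X reached Y
        if direction == "up" and v not in Z:
            # non-collider at v: continue to parents (chain) and children (fork)
            queue.extend((p, "up") for p in parents[v])
            queue.extend((c, "down") for c in children[v])
        elif direction == "down":
            if v not in Z:
                queue.extend((c, "down") for c in children[v])  # chain
            if v in marked:
                queue.extend((p, "up") for p in parents[v])     # open collider
    return True

# Student network: D ⊥ S holds, but conditioning on G opens the collider.
edges = [("D", "G"), ("I", "G"), ("I", "S"), ("G", "L")]
nodes = ["D", "I", "G", "S", "L"]
print(d_separated(nodes, edges, {"D"}, {"S"}, set()))   # True
print(d_separated(nodes, edges, {"D"}, {"S"}, {"G"}))   # False (explaining away)
```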
I-map quiz

Is G an I-map for the following distribution?

P(D, I, G, S, L) = P(L) P(S) P(G ∣ D, I) P(D) P(I)

Yes! Under this P we have P(L ∣ G) = P(L) and P(S ∣ I) = P(S), so P also factorizes according to G.
Summary so far

Graph and distribution are connected by:
- factorization of the distribution according to the graph: P(X) = ∏_i P(X_i ∣ Pa_{X_i})
- conditional independencies of the distribution inferred from the graph:
  - local CIs: X_i ⊥ NonDescendants_{X_i} ∣ Parents_{X_i}
  - global CIs: d-separation

Factorization, local CIs, and global CIs identify the same family of distributions.
Equivalence class of DAGs

Two DAGs G and G′ are I-equivalent if I(G) = I(G′); then P factorizes on both of these graphs.

From the d-separation algorithm, it is sufficient to have the same undirected skeleton and the same v-structures. But this is not necessary: there are graphs with different v-structures, yet I(G) = I(G′) = ∅. There, the v-structures are irrelevant for I-equivalence because the parents are connected (moral parents!).

The exact characterization: I(G) = I(G′) ⇔ same undirected skeleton and same immoralities (an immorality is a v-structure whose parents are not directly connected).

[Figure: X ⊥ Y ∣ Z ? illustrated on graphs over X, Y, Z with the same skeleton but different edge directions.]
I-Equivalence quiz

Do these DAGs have the same set of CIs? [Figure: two DAGs over X, Y, Z, W with the same skeleton.] No! For example, X ⊥ Z ∣ W holds in only one of them.
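The skeleton-plus-immoralities criterion is straightforward to implement. A minimal sketch:

```python
from itertools import combinations

def skeleton(edges):
    return {frozenset(e) for e in edges}

def immoralities(edges):
    """V-structures x -> z <- y whose parents x, y are not connected."""
    parents = {}
    for u, v in edges:
        parents.setdefault(v, set()).add(u)
    skel = skeleton(edges)
    out = set()
    for z, ps in parents.items():
        for x, y in combinations(sorted(ps), 2):
            if frozenset((x, y)) not in skel:
                out.add((x, z, y))
    return out

def i_equivalent(e1, e2):
    return skeleton(e1) == skeleton(e2) and immoralities(e1) == immoralities(e2)

# X -> Y -> Z and X <- Y <- Z are I-equivalent; X -> Y <- Z is not.
print(i_equivalent([("X", "Y"), ("Y", "Z")], [("Y", "X"), ("Z", "Y")]))  # True
print(i_equivalent([("X", "Y"), ("Y", "Z")], [("X", "Y"), ("Z", "Y")]))  # False
```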
Minimal I-map

Which graph G should we use for P?

- G is an I-map for P: I(G) ⊆ I(P)
- G is a minimal I-map for P: removing any edge destroys the I-map property

Example: P(X, Y, Z, W) = P(X ∣ Y, Z) P(W) P(Y ∣ Z) P(Z)

[Figure: four candidate graphs over X, Y, Z, W, labeled I-map, min. I-map, min. I-map, and NOT an I-map.]
Minimal I-map from CIs

Which graph G should we use for P? Input: I(P) (or an oracle) and an ordering X_1, …, X_n. Output: a minimal I-map G.

for i = 1 … n:
    find a minimal U ⊆ {X_1, …, X_{i−1}} s.t. (X_i ⊥ {X_1, …, X_{i−1}} − U ∣ U)
    set Pa_{X_i} ← U    (so that X_i ⊥ NonDesc_{X_i} ∣ Pa_{X_i})

Different orderings give different graphs. Example:
- D, I, S, G, L (a topological ordering)
- L, S, G, I, D
- L, D, S, I, G
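A minimal sketch of this procedure, with a toy CI oracle standing in for I(P) (here, the independencies of a chain X1 → X2 → X3):

```python
from itertools import combinations

def minimal_imap(order, ci):
    """Build parent sets along `order`; ci(x, rest, u) answers whether
    (x ⊥ rest | u) holds in P, i.e., it is an oracle for I(P)."""
    parents = {}
    for i, x in enumerate(order):
        prev = order[:i]
        # try candidate parent sets U of increasing size: the first hit is minimal
        for size in range(len(prev) + 1):
            u = next((set(c) for c in combinations(prev, size)
                      if ci(x, set(prev) - set(c), set(c))), None)
            if u is not None:
                parents[x] = u
                break
    return parents

# Toy oracle for the chain X1 -> X2 -> X3.
def chain_ci(x, rest, u):
    return not rest or (x == "X3" and rest == {"X1"} and u == {"X2"})

print(minimal_imap(["X1", "X2", "X3"], chain_ci))
# {'X1': set(), 'X2': {'X1'}, 'X3': {'X2'}}
```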
Perfect map (P-map); which graph G to use for P?

The orderings D, I, S, G, L / L, S, G, I, D / L, D, S, I, G above all give minimal I-maps: I(G) ⊆ I(P).

Perfect map: I(G) = I(P). But P may not have a P-map in the form of a BN.

Example: P(x, y, z) = 1/12 if x ⊕ y ⊕ z = 0, and 1/6 if x ⊕ y ⊕ z = 1. Then

(X ⊥ Y), (Y ⊥ Z), (X ⊥ Z) ∈ I(P), yet (X ⊥ Y ∣ Z), (Y ⊥ Z ∣ X), (X ⊥ Z ∣ Y) ∉ I(P)
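A quick numeric verification of this example:

```python
from itertools import product

# XOR distribution: pairwise marginal independence without conditional independence.
P = {(x, y, z): (1 / 12 if x ^ y ^ z == 0 else 1 / 6)
     for x, y, z in product((0, 1), repeat=3)}

def marg(**fixed):
    """Marginal probability of the fixed values, e.g. marg(x=0, z=1)."""
    return sum(p for a, p in P.items()
               if all(a["xyz".index(k)] == v for k, v in fixed.items()))

# X ⊥ Y: P(x, y) == P(x) P(y) for all values
print(all(abs(marg(x=x, y=y) - marg(x=x) * marg(y=y)) < 1e-12
          for x in (0, 1) for y in (0, 1)))                          # True
# X ⊥ Y | Z fails: P(x, y | z) != P(x | z) P(y | z)
print(marg(x=0, y=0, z=0) / marg(z=0),
      (marg(x=0, z=0) / marg(z=0)) * (marg(y=0, z=0) / marg(z=0)))   # 1/6 vs 1/4
```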
Perfect map (P-map); which graph G to use for P?

Perfect map: I(G) = I(P). P may not have a P-map in the form of a BN. And if P has a P-map, is it unique?

Example: I(P) = {(X ⊥ Y, Z ∣ ∅), (X ⊥ Y ∣ Z), (X ⊥ Z ∣ Y)}
[Figure: two I-equivalent P-maps over X, Y, Z.]

A P-map is unique up to I-equivalence.

How to find P-maps? Discussed in learning BNs.