1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de...

56
1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe

Transcript of 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de...

Page 1: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

1

Using Bayesian Network for combining classifiers

Leonardo Nogueira MatosDepartamento de Computação

Universidade Federal de Sergipe

Page 2: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

2

Agenda

Why combining classifiers?

Bayesian network principles

Bayesian network as an ensemble of classifiers

Experimental results

Future works and conclusions

Page 3: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

3

Why combining classifiers?

Classifiers can colabore with each other

Minimizes computational effort for training

Maximizes global recognition rate

Page 4: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

4

Why not to do so?

Because combining individual preditions can be so difficult as divising a robust single classifier

Page 5: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

5

Why not to do so?

Because combining individual preditions can be so difficult as divising a robust single classifier

Decision

Classifiers

Combiner

Page 6: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

6

Approaches for combining classifiers

L1. Data Level L3. Decision Level L2. Feature Level

Fixed rules Trainable rules

Page 7: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

7

Approaches for combining classifiers

L1. Data Level L3. Decision Level L2. Feature Level

Fixed rules Trainable rules

Page 8: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

8

Why not to do so?

Because combining individual preditions can be so difficult as divising a robust single classifier

Decision

Classifiers

Combiner

p(w|x)

p(w|x)

p(w|x)

Page 9: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

9

Approaches for combining classifiers

L1. Data Level L3. Decision Level L2. Feature Level

Fixed rules Trainable rules

Page 10: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

10

Existent scenarios

Pattern space

Pattern21

classifiers

classifiers

Page 11: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

11

Our scenery

Pattern space

classifiers

Page 12: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

12

A closed look

Page 13: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

13

A closed look – discriminant function

Page 14: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

14

A closed look – using multiple classifiers

Page 15: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

15

A closed look – using multiple classifiers

The challegers:

How can we combine classifier's output?How can we identify regions in pattern space?

Page 16: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

16

Agenda

Why combining classifiers?

Bayesian network principles

Bayesian network as an ensemble of classifiers

Experimental results

Future works and conclusions

Page 17: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

17

Bayesian network principles

A

B C

Those circles represent binary random variables

Page 18: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

18

Bayesian network principles

A

B C

Those circles represent binary random variables

a0a1

b0b1

c0c1

Page 19: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

19

Bayesian network principles

A

B C

Those circles represent binary random variables

a0a1

b0b1

c0c1

a0 b0 c0

a1 b1 c1

⋮ ⋮ ⋮aN bN cN

dataset

Page 20: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

20

Bayesian network principles

A

B C

Those circles represent binary random variables

a0a1

b0b1

c0c1

a0 b0 c0

a1 b1 c1

⋮ ⋮ ⋮aN bN cNinstance

Page 21: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

21

Bayesian network principles

A

B C

Jointly probability inference is a combinatorial problem

P abc = P a P b∣a P c∣ab

2 possibilities

4 possibilities

Page 22: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

22

Bayesian network principles

A

B C

Jointly probability inference is a combinatorial problem

P abc = P a P b∣a P c∣ab

P abc = P a P b∣a P c

Independence makes computation alittle more simple

Page 23: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

23

Bayesian network principles

A

B C

Arest – indicates statistical dependence between variables

Page 24: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

24

Bayesian network principles

A

B C

Arc – represents causality

Page 25: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

25

Bayesian network principles

A

B C

A Bayesian network is a DAG (DirectAciclic Graph) where nodes representrandom variables and arcs representcausality relatioship

Page 26: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

26

Bayesian network principles

A

B C

There are polinomial time algorithmsto compute inference in BN

Page 27: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

27

Bayesian network principles

A

B C

There are polinomial time algorithmsto compute inference in BN

Evidence

Page 28: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

28

Bayesian network principles

A

B C

There are polinomial time algorithmsto compute inference in BN

Evidence messages

Page 29: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

29

Bayesian network principles

A

B C

There are polinomial time algorithmsto compute inference in BN

Evidence

[P a0∣bP a1∣b]

[P c0∣b P c1∣b ]

Page 30: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

30

Agenda

Why combining classifiers?

Bayesian network principles

Bayesian network as an ensemble of classifiers

Experimental results

Future works and conclusions

Page 31: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

31

A Fundamental Goal

Page 32: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

32

Another insight

From a statistical point-of-view a Bayesian network is also a graphicalmodel to represents a complex and factored probability distribution function

Page 33: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

33

Another insight

From a statistical point-of-view a Bayesian network is also a graphicalmodel to represents a complex and factored probability distribution function

Page 34: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

34

Another insight

From a statistical point-of-view a Bayesian network is also a graphicalmodel to represents a complex and factored probability distribution function

The challegers:

How can we combine classifier's output?How can we identify regions in pattern space?

Page 35: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

35

How can we combine classifier's output?

We use a BN as a graphical model of the pdf P(w|x)

We assume that classifier participate in computing that function

Each classifier must be a statistical classifier

Page 36: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

36

How can we identify regions in pattern space?

Page 37: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

37

Splitting pattern space

Page 38: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

38

Defining a region

Page 39: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

39

Patterns in a region

Page 40: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

40

Algorithm

Page 41: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

41

Bayesian Network Structure

Page 42: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

42

Bayesian networks for combining classifiers

Page 43: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

43

Agenda

Why combining classifiers?

Bayesian network principles

Bayesian network as an ensemble of classifiers

Experimental results

Future works and conclusions

Page 44: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

44

Results with UCI databases

Page 45: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

45

Results with NIST database

Page 46: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

46

System I classifiers

Page 47: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

47

Preliminaries

Page 48: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

48

Results with the complete dataset

Page 49: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

49

Agenda

Why combining classifiers?

Bayesian network principles

Bayesian network as an ensemble of classifiers

Experimental results

Future works and conclusions

Page 50: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

50

Future works

Page 51: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

51

Future works

Page 52: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

52

Future works

Page 53: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

53

Future works

Page 54: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

54

Future works

Pattern space

Pattern21

classifiers

classifiers

Page 55: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

55

ConclusionsWe have developed a method for combining classifiers using a Bayesian network

A BN act as trainable ensemble of statistical classifiers

The method is not suitable for small size dataset

Experimental results reveal a good performance with a large dataset

As a future work we intend to use a similar approach for splitting the feature vector and combine classifiers specialized on each piece of it.

Page 56: 1 Using Bayesian Network for combining classifiers Leonardo Nogueira Matos Departamento de Computação Universidade Federal de Sergipe.

56

Thank you!

[email protected]