TreeConnect: A Sparse Alternative to Fully Connected Layers

Oliver Richter, Roger Wattenhofer
Distributed Computing Group (DisCo), ETH Zurich, Switzerland


Page 2

“cat”

Page 3

“dog”

Page 4
Page 5

Convolutional Neural Network (CNN)

Fully Connected Layers (FC)

Page 6

Convolutional Neural Network (CNN)

Fully Connected Layers (FC)

Page 7
Page 8
Page 9

Channel

Page 10
Page 11
Page 12

Fully connected: $n$ inputs, $m = r\,n$ outputs, $\Theta(n^2)$ weights.

Page 13

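[Figure: layer sizes $n$, $h$, $m$ with $c$ channels; labels 1, 2, 3]

$m = r\,n$, $h = (n\,m)^{0.5}$, $c = n^{0.5}$

$c \cdot \frac{n}{c} \cdot \frac{h}{c} + c \cdot m = n \cdot \frac{r^{0.5}\,n}{n^{0.5}} + n^{0.5} \cdot r\,n = \Theta(n^{1.5})$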
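The weight count on this slide can be checked with a small sketch. It assumes one reading of the slide: the $n$ inputs are split into $c = n^{0.5}$ groups, each group feeds its own small dense block producing $h/c$ hidden values with $h = (n\,m)^{0.5}$, the hidden values are regrouped so that each new group takes one value from every old group, and a second set of blocks produces the $m$ outputs. The function names, the ReLU between the two layers, and the omission of biases are illustrative assumptions, not taken from the talk or from the authors' repository.

```python
import math
import random


def layer_sizes(n, m):
    """Return (c, h) with c = n^0.5 and h = (n*m)^0.5, as on the slide."""
    c = math.isqrt(n)
    h = math.isqrt(n * m)
    # Assumption of this sketch: only sizes that divide evenly are handled.
    if c * c != n or h * h != n * m or h % c or (m * c) % h:
        raise ValueError("this sketch needs sizes that divide evenly")
    return c, h


def random_blocks(groups, fan_in, fan_out, rng):
    """One independent fan_in x fan_out weight block per group."""
    return [[[rng.gauss(0, 1) for _ in range(fan_out)]
             for _ in range(fan_in)]
            for _ in range(groups)]


def count(weights):
    """Total number of scalar weights in a list of blocks."""
    return sum(len(row) for block in weights for row in block)


def grouped_layer(x, weights):
    """Split x into len(weights) equal chunks and apply one block per chunk."""
    size = len(x) // len(weights)
    out = []
    for g, block in enumerate(weights):
        chunk = x[g * size:(g + 1) * size]
        out.append([sum(chunk[i] * block[i][j] for i in range(size))
                    for j in range(len(block[0]))])
    return out


def forward(x, w1, w2):
    hidden = grouped_layer(x, w1)                            # c groups of h/c values
    hidden = [[max(v, 0.0) for v in grp] for grp in hidden]  # ReLU (assumption)
    # Regroup: new group j collects the j-th value of every old group.
    mixed = [grp[j] for j in range(len(hidden[0])) for grp in hidden]
    out = grouped_layer(mixed, w2)                           # h/c groups of m*c/h values
    return [v for grp in out for v in grp]


rng = random.Random(0)
n, m = 16, 64                                            # r = 4
c, h = layer_sizes(n, m)                                 # c = 4, h = 32
w1 = random_blocks(c, n // c, h // c, rng)               # c * (n/c) * (h/c) weights
w2 = random_blocks(h // c, c, m * c // h, rng)           # c * m weights
y = forward([rng.gauss(0, 1) for _ in range(n)], w1, w2)
print(len(y), count(w1) + count(w2), n * m)              # prints: 64 384 1024
```

For $n = 16$ and $r = 4$ the two sparse layers hold $128 + 256 = 384$ weights, against $n \cdot m = 1024$ for a single fully connected layer of the same input and output size.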

Page 14
Page 15
Page 16
Page 17

$L$: 1, 2, 3

$\Theta(n^{(2L-1)/L})$. $\Theta(n^2)$
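As a quick arithmetic check, the exponent $(2L-1)/L$ from this slide can be tabulated for a few values of $L$. The transcript does not say exactly what $L$ denotes, so the sketch below only evaluates the expression as written; the function name is hypothetical.

```python
from fractions import Fraction


def exponent(num_layers):
    """Exponent of n in Theta(n^((2L-1)/L)), taken literally from the slide."""
    return Fraction(2 * num_layers - 1, num_layers)


for num_layers in range(2, 6):
    e = exponent(num_layers)
    print(num_layers, e, round(float(e), 3))
```

For $L = 2$ this gives $3/2$, matching the $\Theta(n^{1.5})$ count of the two-layer case, and the exponent stays below 2 for every finite $L$.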

Page 18

IMDB sentiment classification

Page 19

IMDB sentiment classification

Page 20

Reuters News Categorization

Page 21

Reuters News Categorization

47,150

25,806

Page 22

Reuters News Categorization

Page 23

CIFAR-10 Image Classification

Page 24

CIFAR-10 Image Classification

Page 25

CIFAR-10 Image Classification

Page 26

CIFAR-10 Image Classification

Page 27

Related Work

Page 28

Questions & Answers

Code? https://github.com/OliverRichter/TreeConnect

Further Questions? [email protected]