Tensor methods applied in Artificial Intelligence and data analysis. · 2018. 11. 23. · Tensor...
Transcript of Tensor methods applied in Artificial Intelligence and data analysis. · 2018. 11. 23. · Tensor...
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor methods applied in Artificial Intelligenceand data analysis.
Kim Batselier
1 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
2 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
3 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
The nature of data
Data comes in many forms:
wordsnumbersimagessound....
Organized in multi-dimensional arrays = tensors
My research is about developing data-driven technology withtensors
4 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
The nature of data
Data comes in many forms:
wordsnumbersimagessound....
Organized in multi-dimensional arrays = tensors
My research is about developing data-driven technology withtensors
4 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
The nature of data
Data comes in many forms:
wordsnumbersimagessound....
Organized in multi-dimensional arrays = tensors
My research is about developing data-driven technology withtensors
4 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
321× 481 image, 95% missing.
5 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
What is an image?
An array I of numbers
Each pixel is uniquely determined by its location and its color.
The location of each pixel is determined by 2 numbers(height, width)
The color of each pixel is determined by 3 numbers (redintensity, green intensity, blue intensity)
6 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
What is an image?
An array I of numbers
Each pixel is uniquely determined by its location and its color.
The location of each pixel is determined by 2 numbers(height, width)
The color of each pixel is determined by 3 numbers (redintensity, green intensity, blue intensity)
6 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
What is an image?
An array I of numbers
Each pixel is uniquely determined by its location and its color.
The location of each pixel is determined by 2 numbers(height, width)
The color of each pixel is determined by 3 numbers (redintensity, green intensity, blue intensity)
7 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
An image as a tensor
All location and color information of a whole image can bestored into a 3-dimensional array
Image I of height H and width W ⇒ I ∈ RH×W×3
Small example
For an image I of size 321× 481, we have
I(:, :, 1) ∈ R321×481 = Red intensities of all pixels.
I(:, :, 2) ∈ R321×481 = Green intensities of all pixels.
I(:, :, 3) ∈ R321×481 = Blue intensities of all pixels.
8 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
Problem statement
Given an image I and a partially observed instance X = (I)Ω.Can we use the observed X to fill-in the missing parts?
Solution strategy
Formulate as a optimization problem. Find the tensor A thatminimizes ||(A− I)Ω||
Problem: We don’t know I! So instead we find a tensor Athat minimizes ||(A−X )Ω||This is an ill-posed problem: the number of possible solutionsis infinite
We want to add the additional constraint that neighbouringpixels of A should be interrelated
⇒ How to introduce thisconstraint? Answer: Tensor Networks
9 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
Problem statement
Given an image I and a partially observed instance X = (I)Ω.Can we use the observed X to fill-in the missing parts?
Solution strategy
Formulate as a optimization problem. Find the tensor A thatminimizes ||(A− I)Ω||
Problem: We don’t know I! So instead we find a tensor Athat minimizes ||(A−X )Ω||This is an ill-posed problem: the number of possible solutionsis infinite
We want to add the additional constraint that neighbouringpixels of A should be interrelated
⇒ How to introduce thisconstraint? Answer: Tensor Networks
9 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
Problem statement
Given an image I and a partially observed instance X = (I)Ω.Can we use the observed X to fill-in the missing parts?
Solution strategy
Formulate as a optimization problem. Find the tensor A thatminimizes ||(A− I)Ω||Problem: We don’t know I! So instead we find a tensor Athat minimizes ||(A−X )Ω||
This is an ill-posed problem: the number of possible solutionsis infinite
We want to add the additional constraint that neighbouringpixels of A should be interrelated
⇒ How to introduce thisconstraint? Answer: Tensor Networks
9 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
Problem statement
Given an image I and a partially observed instance X = (I)Ω.Can we use the observed X to fill-in the missing parts?
Solution strategy
Formulate as a optimization problem. Find the tensor A thatminimizes ||(A− I)Ω||Problem: We don’t know I! So instead we find a tensor Athat minimizes ||(A−X )Ω||This is an ill-posed problem: the number of possible solutionsis infinite
We want to add the additional constraint that neighbouringpixels of A should be interrelated
⇒ How to introduce thisconstraint? Answer: Tensor Networks
9 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
Problem statement
Given an image I and a partially observed instance X = (I)Ω.Can we use the observed X to fill-in the missing parts?
Solution strategy
Formulate as a optimization problem. Find the tensor A thatminimizes ||(A− I)Ω||Problem: We don’t know I! So instead we find a tensor Athat minimizes ||(A−X )Ω||This is an ill-posed problem: the number of possible solutionsis infinite
We want to add the additional constraint that neighbouringpixels of A should be interrelated
⇒ How to introduce thisconstraint? Answer: Tensor Networks
9 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
Problem statement
Given an image I and a partially observed instance X = (I)Ω.Can we use the observed X to fill-in the missing parts?
Solution strategy
Formulate as a optimization problem. Find the tensor A thatminimizes ||(A− I)Ω||Problem: We don’t know I! So instead we find a tensor Athat minimizes ||(A−X )Ω||This is an ill-posed problem: the number of possible solutionsis infinite
We want to add the additional constraint that neighbouringpixels of A should be interrelated ⇒ How to introduce thisconstraint?
Answer: Tensor Networks
9 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Introduction
Data-driven technology
Problem statement
Given an image I and a partially observed instance X = (I)Ω.Can we use the observed X to fill-in the missing parts?
Solution strategy
Formulate as a optimization problem. Find the tensor A thatminimizes ||(A− I)Ω||Problem: We don’t know I! So instead we find a tensor Athat minimizes ||(A−X )Ω||This is an ill-posed problem: the number of possible solutionsis infinite
We want to add the additional constraint that neighbouringpixels of A should be interrelated ⇒ How to introduce thisconstraint? Answer: Tensor Networks
9 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensors
Multi-dimensional arrays are also called tensorsA ∈ RI1×I2×I3×···×Id
d = the order of the tensor/ the number of modes.
I1, I2, . . . , Id = the dimensions of the tensor.
Some familiar examples
d = 1⇒
a vector, e.g. a ∈ R5
d = 2⇒
a matrix, e.g. A ∈ R10×3
d = 0⇒
a scalar/a number, e.g. a ∈ R1
10 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensors
Multi-dimensional arrays are also called tensorsA ∈ RI1×I2×I3×···×Id
d = the order of the tensor/ the number of modes.
I1, I2, . . . , Id = the dimensions of the tensor.
Some familiar examples
d = 1⇒
a vector, e.g. a ∈ R5
d = 2⇒
a matrix, e.g. A ∈ R10×3
d = 0⇒
a scalar/a number, e.g. a ∈ R1
10 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensors
Multi-dimensional arrays are also called tensorsA ∈ RI1×I2×I3×···×Id
d = the order of the tensor/ the number of modes.
I1, I2, . . . , Id = the dimensions of the tensor.
Some familiar examples
d = 1⇒
a vector, e.g. a ∈ R5
d = 2⇒
a matrix, e.g. A ∈ R10×3
d = 0⇒
a scalar/a number, e.g. a ∈ R1
10 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensors
Multi-dimensional arrays are also called tensorsA ∈ RI1×I2×I3×···×Id
d = the order of the tensor/ the number of modes.
I1, I2, . . . , Id = the dimensions of the tensor.
Some familiar examples
d = 1⇒
a vector, e.g. a ∈ R5
d = 2⇒
a matrix, e.g. A ∈ R10×3
d = 0⇒
a scalar/a number, e.g. a ∈ R1
10 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensors
Multi-dimensional arrays are also called tensorsA ∈ RI1×I2×I3×···×Id
d = the order of the tensor/ the number of modes.
I1, I2, . . . , Id = the dimensions of the tensor.
Some familiar examples
d = 1⇒
a vector, e.g. a ∈ R5
d = 2⇒
a matrix, e.g. A ∈ R10×3
d = 0⇒
a scalar/a number, e.g. a ∈ R1
10 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensors
Multi-dimensional arrays are also called tensorsA ∈ RI1×I2×I3×···×Id
d = the order of the tensor/ the number of modes.
I1, I2, . . . , Id = the dimensions of the tensor.
Some familiar examples
d = 1⇒ a vector, e.g. a ∈ R5
d = 2⇒
a matrix, e.g. A ∈ R10×3
d = 0⇒
a scalar/a number, e.g. a ∈ R1
10 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensors
Multi-dimensional arrays are also called tensorsA ∈ RI1×I2×I3×···×Id
d = the order of the tensor/ the number of modes.
I1, I2, . . . , Id = the dimensions of the tensor.
Some familiar examples
d = 1⇒ a vector, e.g. a ∈ R5
d = 2⇒
a matrix, e.g. A ∈ R10×3
d = 0⇒
a scalar/a number, e.g. a ∈ R1
10 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensors
Multi-dimensional arrays are also called tensorsA ∈ RI1×I2×I3×···×Id
d = the order of the tensor/ the number of modes.
I1, I2, . . . , Id = the dimensions of the tensor.
Some familiar examples
d = 1⇒ a vector, e.g. a ∈ R5
d = 2⇒ a matrix, e.g. A ∈ R10×3
d = 0⇒
a scalar/a number, e.g. a ∈ R1
10 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensors
Multi-dimensional arrays are also called tensorsA ∈ RI1×I2×I3×···×Id
d = the order of the tensor/ the number of modes.
I1, I2, . . . , Id = the dimensions of the tensor.
Some familiar examples
d = 1⇒ a vector, e.g. a ∈ R5
d = 2⇒ a matrix, e.g. A ∈ R10×3
d = 0⇒
a scalar/a number, e.g. a ∈ R1
10 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensors
Multi-dimensional arrays are also called tensorsA ∈ RI1×I2×I3×···×Id
d = the order of the tensor/ the number of modes.
I1, I2, . . . , Id = the dimensions of the tensor.
Some familiar examples
d = 1⇒ a vector, e.g. a ∈ R5
d = 2⇒ a matrix, e.g. A ∈ R10×3
d = 0⇒ a scalar/a number, e.g. a ∈ R1
10 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensor network building blocks
Aa a A
Tensor networks
C(i, j) =∑
k A(i, k) B(k, j)
is visually represented by
A Bi k j
11 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensor network building blocks
Aa a A
Tensor networks
C(i, j) =∑
k A(i, k) B(k, j)
is visually represented by
A Bi k j
11 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensor network building blocks
Aa a A
Tensor networks
C(i, j) =∑
k A(i, k) B(k, j) is visually represented by
A Bi k j
11 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensor networks
a bk
c =∑
k a(k) b(k) = aT b
Tensor networks
AB c
D(i, j) =∑
k,l A(k, j, l) B(i, k) c(l)
12 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensor networks
a bk
c =∑
k a(k) b(k) = aT b
Tensor networks
AB c
D(i, j) =∑
k,l A(k, j, l) B(i, k) c(l)
12 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensor networks
a bk
c =∑
k a(k) b(k) = aT b
Tensor networks
AB c
D(i, j) =∑
k,l A(k, j, l) B(i, k) c(l)
12 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Tensor networks
a bk
c =∑
k a(k) b(k) = aT b
Tensor networks
AB c
D(i, j) =∑
k,l A(k, j, l) B(i, k) c(l)
12 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
The total number of elements in a tensor of order d anddimensions I1 = I2 = · · · = Id = I is Id.
⇒ This is known as the curse of dimensionality
If the tensor entries are interrelated, then the number of degrees offreedom can be drastically reduced via tensor decompositions.
13 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
The total number of elements in a tensor of order d anddimensions I1 = I2 = · · · = Id = I is Id.
⇒ This is known as the curse of dimensionality
If the tensor entries are interrelated, then the number of degrees offreedom can be drastically reduced via tensor decompositions.
13 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
The total number of elements in a tensor of order d anddimensions I1 = I2 = · · · = Id = I is Id.
⇒ This is known as the curse of dimensionality
If the tensor entries are interrelated, then the number of degrees offreedom can be drastically reduced via tensor decompositions.
13 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Small 4× 3 example
How many degrees of freedom does this matrix have?54 −30 −18−45 25 1581 −45 −279 −5 −3
=
6−591
(9 −5 −3)
14 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Small 4× 3 example
How many degrees of freedom does this matrix have?54 −30 −18−45 25 1581 −45 −279 −5 −3
=
6−591
(9 −5 −3)
14 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Small 4× 3 example
C =
6−591
(9 −5 −3)= a bT
C(i, j) =1∑
k=1
a(i, k) b(k, j)
a b1
Take-away message
Tensor networks with small interconnection dimensions haveinterrelated entries.
15 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Small 4× 3 example
C =
6−591
(9 −5 −3)= a bT
C(i, j) =
1∑k=1
a(i, k) b(k, j)
a b1
Take-away message
Tensor networks with small interconnection dimensions haveinterrelated entries.
15 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Small 4× 3 example
C =
6−591
(9 −5 −3)= a bT
C(i, j) =
1∑k=1
a(i, k) b(k, j)
a b1
Take-away message
Tensor networks with small interconnection dimensions haveinterrelated entries.
15 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Tensor networks: a gentle introduction
Small 4× 3 example
C =
6−591
(9 −5 −3)= a bT
C(i, j) =
1∑k=1
a(i, k) b(k, j)
a b1
Take-away message
Tensor networks with small interconnection dimensions haveinterrelated entries.
15 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Conclusion
Problem statement
Given an image I and a partially observed instance X = (I)Ω.Can we use the observed X to fill-in the missing parts?
Solution strategy
Formulate as a optimization problem. Find the tensor A thatminimizes ||(A− I)Ω||Problem: We don’t know I! So instead we find a tensor Athat minimizes ||(A−X )Ω||This is an ill-posed problem: the number of possible solutionsis infinite
We want to add the additional constraint that neighbouringpixels of A should be interrelated ⇒ How to introduce thisconstraint? Answer: Tensor Networks
16 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Conclusion
For our 321× 481 image:
R2
R1
R3
321 481 3321
481
3
17 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Conclusion
18 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Conclusion
19 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Conclusion
20 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Conclusion
21 / 22
Tensor methods applied in Artificial Intelligence and data analysis.
Final words :)
My research
Compressing data Learning nonlinearclassifiers from data
Estimation ofmodels from data
Converting data intotensor networks
web: https://sites.google.com/view/kim-batselier/home
e-mail: k dφτ batselier ατ tudelft dφτ nl
22 / 22