Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University...
-
Upload
gregory-ross -
Category
Documents
-
view
217 -
download
0
Transcript of Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University...
![Page 1: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/1.jpg)
Karthik GurumoorthyAjit Rajwade
Arunava BanerjeeAnand Rangarajan
Department of CISEUniversity of Florida
1
![Page 2: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/2.jpg)
A new approach to lossy image compression based on machine learning.
Key idea: Learning of Matrix Ortho-normal Bases from training data to efficiently code images.
Applied to compression of well-known face databases like ORL, Yale.
Competitive with JPEG.2
![Page 3: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/3.jpg)
Vector
Conventional learning methods in vision like PCA,
ICA, etc.
21 MM
Image 211 MM
3
![Page 4: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/4.jpg)
Our approach following Rangarajan [EMMCVPR-2001] &
Ye [JMLR-2004]
Treated as a21 MM
Image 21 MM
Matrix
4
![Page 5: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/5.jpg)
Image of size divided into N patches of size
each treated as a Matrix.
21 MM
Image
21 qq
5
21 MM
2211 , MqMq
![Page 6: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/6.jpg)
6
=P U S V
TUSVP
U and V:Ortho-normal matrices
S: Diagonal Matrix of singular values
![Page 7: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/7.jpg)
7
useful for compression (e.g.: SSVD [Ranade et al-IVC 2007]).
![Page 8: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/8.jpg)
8
Consider a set of N image patches:
SVD of each patch gives:
Costly in terms of storage as we need to store N ortho-normal basis pairs.
Tiiii VSUP
![Page 9: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/9.jpg)
Produce ortho-normal basis-pairs, common for all N patches.
Since storing the basis pairs is not expensive.
9
NK
NK
![Page 10: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/10.jpg)
10
iP aU iaS aV
Taiaai VSUP
iaS•Non-diagonal•Non-sparse
aiTaia VPUS
![Page 11: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/11.jpg)
What sparse matrix will optimally reconstruct from ?
Optimally = least error:
Sparse = matrix has at most some non-zero elements.
2|||| FTaiaa VSUP
11
),( aa VUiPiaS
iaS T
![Page 12: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/12.jpg)
We have a simple, provably optimal greedy method to compute such a
1. Compute the matrix . 2. In matrix , nullify all except the largest
elements to produce .
aiTaia VPUW
12
iaS
iaW T
iaS
![Page 13: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/13.jpg)
A set of N image patches .
Learning K << N ortho-normal basis
pairs )},{( aa VU
13
)1(, NiPi
2
1 1
||||}),),,({( FTaiaai
N
i
K
aiaiaiaaa VSUPMMSVUE
aIVVUU aTaa
Ta , aiTSia ,,|||| 0
MembershipsProjection Matrices
![Page 14: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/14.jpg)
Input: N image patches of size .
Output: K pairs of ortho-normal bases
called as dictionary.
14
21 qq
)},{( ii VU
![Page 15: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/15.jpg)
Divide each test image into patches of size
Fix per-pixel average error (say e), similar to the “quality” user-parameter in JPEG.
21 qq
15
![Page 16: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/16.jpg)
16
.
.
.
.
.
.
1U 1V
2U 2V
KU KV
iP
.
.
.
111 VPUS iT
i
222 VPUS iT
i
KiTKiK VPUS
eqq
VSUP FTaiaai
21
2||||
222 VPUS iT
i
![Page 17: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/17.jpg)
)(log10 10 ePSNR
RPP = number of bits per pixel
17
![Page 18: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/18.jpg)
18
0.5 bits 0.92 bits 1.36 bits
1.78 bits 3.023 bits
![Page 19: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/19.jpg)
Size of original database is 3.46 MB.Size of dictionary of 50 ortho-normal
basis pairs is 56 KB=0.05MB.Size of database after compression
and coding with our method with e = 0.0001 is 1.3 MB.
Total compression rate achieved is 61%.
19
![Page 20: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/20.jpg)
)(log10 10 ePSNR
RPP = number of bits per pixel
20
![Page 21: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/21.jpg)
New lossy image compression method using machine learning.
Key idea 1: matrix based image representation.
Key idea2: Learning small set of matrix ortho-normal basis pairs tuned to a database.
Results competitive with JPEG standard.
Future extensions: video compression.21
![Page 22: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/22.jpg)
A. Rangarajan, Learning matrix space image representations, Energy Minimizing Methods in Computer Vision and Pattern Recognition, 2001.
J. Ye, Generalized low rank approximation of matrices, Journal of Machine Learning Research ,2004.
M. Aharon, M. Elad and A. Bruckstein, The K-SVD: An algorithm for designing of overcomplete dictionaries for sparse representation. IEEE Transactions on Signal Processing, 2006.
A. Ranade, S. Mahabalarao and S. Kale. A variation on SVD based image compression. Image and Vision Computing, 2007.
22
![Page 23: Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1.](https://reader036.fdocuments.in/reader036/viewer/2022062422/56649eb75503460f94bc12cf/html5/thumbnails/23.jpg)
23