Multimedia Databases
SVD II
SVD - Detailed outline
- Motivation
- Definition - properties
- Interpretation
- Complexity
- Case studies
- SVD properties
- More case studies
- Conclusions

SVD - detailed outline
- ...
- Case studies
- SVD properties
- More case studies
  - google/Kleinberg algorithms
- Conclusions
Optimality of SVD
Def: The Frobenius norm of an n x m matrix M is
||M||_F = sqrt( sum over i,j of M[i,j]^2 )

(Reminder) The rank of a matrix M is the number of independent rows (or columns) of M.

Let A = U Σ V^T and A_k = U_k Σ_k V_k^T (the rank-k SVD approximation of A).

Theorem [Eckart and Young]: Among all n x m matrices C of rank at most k, we have that:
||A - A_k||_F <= ||A - C||_F
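A minimal numpy sketch (an addition, not part of the original slides) of the Eckart-Young property: truncating the SVD to rank k gives the best rank-k approximation in Frobenius norm. The random matrix A and the rank-k competitor C are illustrative assumptions only.

```python
import numpy as np

# random "document-term" matrix, for illustration only
rng = np.random.default_rng(0)
A = rng.standard_normal((8, 5))

U, s, Vt = np.linalg.svd(A, full_matrices=False)

k = 2
# rank-k SVD approximation: A_k = U_k Sigma_k V_k^T
A_k = U[:, :k] @ np.diag(s[:k]) @ Vt[:k, :]

# Frobenius error of the truncation ...
err_svd = np.linalg.norm(A - A_k, 'fro')
# ... equals sqrt(sigma_{k+1}^2 + ... + sigma_r^2)
print(err_svd, np.sqrt(np.sum(s[k:] ** 2)))

# any other rank-k matrix C does at least as badly, e.g. a random one
C = rng.standard_normal((8, k)) @ rng.standard_normal((k, 5))
print(np.linalg.norm(A - C, 'fro') >= err_svd)   # True
```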
SVD - Other properties - summary
- can produce an orthogonal basis
- can solve over- and under-determined linear problems (see property C(1))
- can compute ‘fixed points’ (= ‘steady-state probabilities’ in Markov chains) (see property C(4))
SVD - outline of properties
- (A): obvious
- (B): less obvious
- (C): least obvious (and most powerful!)
Properties - by definition:
A(0): A [n x m] = U [n x r] Σ [r x r] V^T [r x m]
A(1): U^T [r x n] U [n x r] = I [r x r] (identity matrix)
A(2): V^T [r x m] V [m x r] = I [r x r]
A(3): Σ^k = diag( σ1^k, σ2^k, ..., σr^k ) (k: ANY real number)
A(4): A^T = V Σ U^T
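A short numpy sketch (added here, not from the slides) that checks the definitional properties A(0)-A(2) and A(4) on a small random matrix; numpy's reduced SVD plays the role of the rank-r decomposition.

```python
import numpy as np

rng = np.random.default_rng(1)
A = rng.standard_normal((6, 4))          # an n x m matrix

U, s, Vt = np.linalg.svd(A, full_matrices=False)
S = np.diag(s)

# A(0): A = U Sigma V^T
print(np.allclose(A, U @ S @ Vt))
# A(1), A(2): the columns of U and V are orthonormal
print(np.allclose(U.T @ U, np.eye(len(s))))
print(np.allclose(Vt @ Vt.T, np.eye(len(s))))
# A(4): A^T = V Sigma U^T
print(np.allclose(A.T, Vt.T @ S @ U.T))
```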
Less obvious properties
A(0): A [n x m] = U [n x r] Σ [r x r] V^T [r x m]
B(1): A [n x m] (A^T) [m x n] = U Σ^2 U^T (symmetric)
Intuition? The ‘document-to-document’ similarity matrix.
B(2): symmetrically, for ‘V’: (A^T) [m x n] A [n x m] = V Σ^2 V^T
Intuition? The ‘term-to-term’ similarity matrix.
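A small numpy sketch (illustrative addition) verifying B(1) and B(2): A A^T and A^T A are recovered from U Σ² U^T and V Σ² V^T; the "rows = documents, columns = terms" reading in the comments is an assumption carried over from the LSI interpretation.

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((6, 4))          # rows = documents, columns = terms (assumption)

U, s, Vt = np.linalg.svd(A, full_matrices=False)
S2 = np.diag(s ** 2)

# B(1): document-to-document similarity matrix
print(np.allclose(A @ A.T, U @ S2 @ U.T))
# B(2): term-to-term similarity matrix
print(np.allclose(A.T @ A, Vt.T @ S2 @ Vt))
```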
Less obvious properties
B(3): ( (A^T) [m x n] A [n x m] )^k = V Σ^(2k) V^T
and
B(4): (A^T A)^k ~ v1 σ1^(2k) v1^T for k >> 1
where v1: [m x 1] the first column (eigenvector) of V, and σ1: the strongest singular value (σ1^2: the strongest eigenvalue of A^T A)
B(5): (A^T A)^k v’ ~ (constant) v1
i.e., for (almost) any v’, it converges to a vector parallel to v1.
Thus, useful to compute the first eigenvector/eigenvalue (as well as the next ones...); see the power-iteration sketch below.
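Below is a minimal power-iteration sketch of property B(5), assuming a random test matrix: repeatedly multiplying an arbitrary start vector by A^T A (with re-normalization) converges to v1, and the Rayleigh quotient recovers σ1². The matrix and the iteration count are arbitrary choices, not from the slides.

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((6, 4))

# power iteration on A^T A: converges to v1 (property B(5))
v = rng.standard_normal(4)               # (almost) any starting vector v'
for _ in range(100):
    v = A.T @ (A @ v)
    v /= np.linalg.norm(v)               # re-normalize to avoid overflow

U, s, Vt = np.linalg.svd(A, full_matrices=False)
v1 = Vt[0]
# same direction (up to sign) as the strongest right singular vector
print(np.isclose(abs(v @ v1), 1.0))
# sigma_1^2 recovered as the Rayleigh quotient
print(np.isclose(v @ (A.T @ (A @ v)), s[0] ** 2))
```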
Less obvious properties - repeated:
A(0): A [n x m] = U [n x r] Σ [r x r] V^T [r x m]
B(1): A [n x m] (A^T) [m x n] = U Σ^2 U^T
B(2): (A^T) [m x n] A [n x m] = V Σ^2 V^T
B(3): ( (A^T) [m x n] A [n x m] )^k = V Σ^(2k) V^T
B(4): (A^T A)^k ~ v1 σ1^(2k) v1^T
B(5): (A^T A)^k v’ ~ (constant) v1
Least obvious properties
A(0): A [n x m] = U [n x r] Σ [r x r] V^T [r x m]
C(1): A [n x m] x [m x 1] = b [n x 1]
let x0 = V Σ^(-1) U^T b
- if under-specified, x0 gives the ‘shortest’ solution
- if over-specified, it gives the ‘solution’ with the smallest least-squares error
Least obvious properties
Illustration: under-specified, e.g. [1 2] [w z]^T = 4 (i.e., 1·w + 2·z = 4)
[Figure: the (w, z) plane with the line of all possible solutions; x0 is marked as the shortest-length solution.]
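A small numpy sketch of property C(1) on exactly this under-specified example; x0 = V Σ^(-1) U^T b coincides with the Moore-Penrose pseudo-inverse solution (the comparison against np.linalg.pinv is an added sanity check, not part of the slides).

```python
import numpy as np

# under-specified system from the illustration: 1*w + 2*z = 4
A = np.array([[1.0, 2.0]])
b = np.array([4.0])

U, s, Vt = np.linalg.svd(A, full_matrices=False)
# x0 = V Sigma^{-1} U^T b  (property C(1))
x0 = Vt.T @ np.diag(1.0 / s) @ U.T @ b
print(x0)                                 # [0.8, 1.6]: the shortest solution
print(np.allclose(x0, np.linalg.pinv(A) @ b))
```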
Least obvious properties - cont’d
A(0): A [n x m] = U [n x r] Σ [r x r] V^T [r x m]
C(2): A [n x m] v1 [m x 1] = σ1 u1 [n x 1]
where v1, u1 are the first (column) vectors of V, U (v1 == right-eigenvector)
C(3): symmetrically: u1^T A = σ1 v1^T (u1 == left-eigenvector)
Therefore:
Least obvious properties - cont’d
A(0): A [n x m] = U [n x r] Σ [r x r] V^T [r x m]
C(4): A^T A v1 = σ1^2 v1
(fixed point - the usual definition of an eigenvector for a symmetric matrix)
Least obvious properties - altogether
A(0): A [n x m] = U [n x r] Σ [r x r] V^T [r x m]
C(1): A [n x m] x [m x 1] = b [n x 1]; then x0 = V Σ^(-1) U^T b: shortest, actual or least-squares solution
C(2): A [n x m] v1 [m x 1] = σ1 u1 [n x 1]
C(3): u1^T A = σ1 v1^T
C(4): A^T A v1 = σ1^2 v1
Properties - conclusions
A(0): A [n x m] = U [n x r] Σ [r x r] V^T [r x m]
B(5): (A^T A)^k v’ ~ (constant) v1
C(1): A [n x m] x [m x 1] = b [n x 1]; then x0 = V Σ^(-1) U^T b: shortest, actual or least-squares solution
C(4): A^T A v1 = σ1^2 v1
SVD - detailed outline
- ...
- Case studies
- SVD properties
- More case studies
  - Kleinberg/google algorithms
- Conclusions
Kleinberg’s Algorithm
Main idea: in many cases, when you search the web using some terms, the most relevant pages may not contain those terms (or contain them only a few times).
- Harvard: www.harvard.edu
- Search engines: yahoo, google, altavista
Authorities and hubs
Kleinberg’s algorithm
Problem definition: given the web and a query, find the most ‘authoritative’ web pages for this query.
Step 0: find all pages containing the query terms (root set)
Step 1: expand by one move forward and backward (base set)
Kleinberg’s algorithm
On the resulting graph, give a high score (= ‘authority’) to nodes that many important nodes point to; give a high importance score (‘hub’) to nodes that point to good ‘authorities’.
[Figure: hubs on one side pointing to authorities on the other.]
Kleinberg’s algorithm
Observations: recursive definition! Each node (say, the i-th node) has both an authoritativeness score a_i and a hubness score h_i.
Kleinberg’s algorithm
Let E be the set of edges and A be the adjacency matrix: the (i, j) entry is 1 if the edge from i to j exists.
Let h and a be [n x 1] vectors with the ‘hubness’ and ‘authoritativeness’ scores.
Then:
Kleinberg’s algorithm
Then: a_i = h_k + h_l + h_m
that is, a_i = Sum (h_j) over all j such that the edge (j, i) exists
or: a = A^T h
[Figure: nodes k, l, m pointing to node i.]
Kleinberg’s algorithm
Symmetrically, for the ‘hubness’:
h_i = a_n + a_p + a_q
that is, h_i = Sum (a_j) over all j such that the edge (i, j) exists
or: h = A a
[Figure: node i pointing to nodes n, p, q.]
Kleinberg’s algorithm
In conclusion, we want vectors h and a such that:
h = A a
a = A^T h
Recall properties:
C(2): A [n x m] v1 [m x 1] = σ1 u1 [n x 1]
C(3): u1^T A = σ1 v1^T
Kleinberg’s algorithm
In short, the solutions to
h = A a
a = A^T h
are the left and right eigenvectors of the adjacency matrix A.
Starting from a random a’ and iterating, we’ll eventually converge.
(Q: to which of all the eigenvectors? why?)
Kleinberg’s algorithm
(Q: to which of all the eigenvectors? why?)
A: to the ones of the strongest eigenvalue, because of property B(5): (A^T A)^k v’ ~ (constant) v1
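A compact sketch of the hub/authority iteration (a = A^T h, h = A a), added for illustration; the tiny hand-made adjacency matrix and the normalization step are assumptions, not Kleinberg's data.

```python
import numpy as np

# toy directed graph, adjacency A[i, j] = 1 if edge i -> j exists (assumption)
A = np.array([[0, 1, 1, 0],
              [0, 0, 1, 0],
              [1, 0, 0, 0],
              [0, 0, 1, 0]], dtype=float)

h = np.ones(4)                       # initial hub scores
for _ in range(50):
    a = A.T @ h                      # authorities: a = A^T h
    a /= np.linalg.norm(a)           # keep the scores bounded
    h = A @ a                        # hubs: h = A a
    h /= np.linalg.norm(h)

print(a)   # node 2 (pointed to by everyone else) gets the top authority score
print(h)
```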
Kleinberg’s algorithm - results
E.g., for the query ‘java’:
0.328 www.gamelan.com
0.251 java.sun.com
0.190 www.digitalfocus.com (“the java developer”)
Kleinberg’s algorithm - discussion
- the ‘authority’ score can be used to find ‘similar pages’ to page p (how?)
- closely related to ‘citation analysis’, social networks / ‘small world’ phenomena
google/page-rank algorithm
- closely related: the Web is a directed graph of connected nodes
- imagine a particle randomly moving along the edges (*)
- compute its steady-state probabilities: they give the PageRank of each page (the importance of this page)
(*) with occasional random jumps
PageRank definition
Assume a page A and pages T1, T2, ..., Tm that point to A. Let d be a damping factor, PR(A) the PageRank of A, and C(T) the out-degree of page T. Then:
PR(A) = (1 - d) + d · ( PR(T1)/C(T1) + PR(T2)/C(T2) + ... + PR(Tm)/C(Tm) )
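A minimal sketch of the iterative PageRank computation using the formula above; the toy link structure and the damping value d = 0.85 are illustrative assumptions.

```python
import numpy as np

# toy web graph: links[i] = pages that page i points to (assumption)
links = {0: [1, 2], 1: [2], 2: [0], 3: [2]}
n, d = 4, 0.85                        # d: damping factor

pr = np.ones(n)                       # initial PageRank values
for _ in range(100):
    new = np.full(n, 1.0 - d)
    for i, outs in links.items():
        for j in outs:
            new[j] += d * pr[i] / len(outs)   # the PR(T)/C(T) contribution
    pr = new

print(pr)                             # page 2, linked from 0, 1 and 3, ranks highest
```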
google/page-rank algorithm
Compute the PR of each page: essentially the identical problem of, given a Markov chain, computing the steady-state probabilities p1 ... p5.
[Figure: example graph with 5 nodes (1, 2, 3, 4, 5).]
Computing PageRank
- iterative procedure
- also, ... navigate the web by randomly following links, or, with probability p, jump to a random page
- let A be the adjacency matrix (n x n) and d_i the out-degree of page i
Prob(A_i -> A_j) = p · n^(-1) + (1 - p) · d_i^(-1) · A[i, j]
A’[i, j] = Prob(A_i -> A_j)
google/page-rank algorithm
Let A’ be the transition matrix (= the adjacency matrix, row-normalized: each row sums to 1).
[Figure: the 5-node example graph and its row-normalized transition matrix (entries 1, 1/2, ...), applied to the steady-state vector p = [p1 p2 p3 p4 p5]^T and giving p back.]
google/page-rank algorithm
A’ p = p (more precisely, for the row-normalized A’, the steady-state vector satisfies A’^T p = p); thus, p is the eigenvector that corresponds to the highest eigenvalue (= 1, since the matrix is row-normalized).
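A short numpy sketch (illustrative only) that computes the steady-state probabilities as the eigenvector for eigenvalue 1 of the transposed row-normalized transition matrix; the 3-state chain below is a made-up example, not the slides' 5-node graph.

```python
import numpy as np

# toy row-normalized transition matrix (each row sums to 1); an assumption
T = np.array([[0.0, 0.5, 0.5],
              [1.0, 0.0, 0.0],
              [0.0, 1.0, 0.0]])

# steady state: p with T^T p = p, i.e. the eigenvector of T^T for eigenvalue 1
vals, vecs = np.linalg.eig(T.T)
p = np.real(vecs[:, np.argmax(np.real(vals))])
p /= p.sum()                          # normalize into probabilities
print(p)
print(np.allclose(T.T @ p, p))        # fixed point of the chain
```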
Kleinberg/google - conclusions
SVD helps in graph analysis:
- hub/authority scores: the strongest left and right eigenvectors of the adjacency matrix
- random walk on a graph: the steady-state probabilities are given by the strongest eigenvector of the transition matrix
Conclusions
SVD: a valuable tool
- given a document-term matrix, it finds ‘concepts’ (LSI)
- ... and can reduce dimensionality (KL)
- ... and can find rules (PCA; RatioRules)

Conclusions - cont’d
- ... and can find fixed points or steady-state probabilities (google / Kleinberg / Markov chains)
- ... and can solve optimally over- and under-constrained linear systems (least squares)
References
- Brin, S. and L. Page (1998). The Anatomy of a Large-Scale Hypertextual Web Search Engine. 7th Intl. World Wide Web Conf.
- Chen, C. M. and N. Roussopoulos (May 1994). Adaptive Selectivity Estimation Using Query Feedback. Proc. ACM SIGMOD, Minneapolis, MN.
- Kleinberg, J. (1998). Authoritative Sources in a Hyperlinked Environment. Proc. 9th ACM-SIAM Symposium on Discrete Algorithms.
- Press, W. H., S. A. Teukolsky, et al. (1992). Numerical Recipes in C. Cambridge University Press.