Data, Covariance, and Correlation Matrix
Nathaniel E. Helwig
Assistant Professor of Psychology and Statistics, University of Minnesota (Twin Cities)
Updated 16-Jan-2017
Nathaniel E. Helwig (U of Minnesota) Data, Covariance, and Correlation Matrix Updated 16-Jan-2017 : Slide 1
Copyright
Copyright © 2017 by Nathaniel E. Helwig
Outline of Notes
1) The Data Matrix: Definition, Properties, R code
2) The Covariance Matrix: Definition, Properties, R code
3) The Correlation Matrix: Definition, Properties, R code
4) Miscellaneous Topics: Crossproduct calculations, Vec and Kronecker, Visualizing data
The Data Matrix
The Data Matrix Definition
The Organization of Data
The data matrix refers to the array of numbers

X = [ x11 x12 · · · x1p
      x21 x22 · · · x2p
      x31 x32 · · · x3p
       ⋮    ⋮   ⋱    ⋮
      xn1 xn2 · · · xnp ]

where xij is the j-th variable collected from the i-th item (e.g., subject).
- items/subjects are rows
- variables are columns

X is a data matrix of order n × p (# items by # variables).
The Data Matrix Definition
Collection of Column Vectors
We can view a data matrix as a collection of column vectors:

X = [ x1  x2  · · ·  xp ]

where xj is the j-th column of X for j ∈ {1, . . . , p}.

The n × 1 vector xj gives the j-th variable's scores for the n items.
The Data Matrix Definition
Collection of Row Vectors
We can view a data matrix as a collection of row vectors:

X = [ x′1
      x′2
       ⋮
      x′n ]

where x′i is the i-th row of X for i ∈ {1, . . . , n}.

The 1 × p vector x′i gives the i-th item's scores for the p variables.
The Data Matrix Properties
Calculating Variable (Column) Means
The sample mean of the j-th variable is given by

x̄j = (1/n) ∑_{i=1}^n xij = n⁻¹ 1′n xj

where
- 1n denotes an n × 1 vector of ones
- xj denotes the j-th column of X
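The ones-vector form of the column mean is easy to verify numerically; a minimal sketch using the mtcars data that the R code later in these notes works with:

```r
X <- as.matrix(mtcars)
n <- nrow(X)
ones <- rep(1, n)                             # the vector 1_n
# column mean of the first variable as n^{-1} 1'_n x_j
xbar1 <- as.numeric(crossprod(ones, X[, 1])) / n
all.equal(xbar1, mean(X[, 1]))                # TRUE
```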
The Data Matrix Properties
Calculating Item (Row) Means
The sample mean of the i-th item is given by

x̄i = (1/p) ∑_{j=1}^p xij = p⁻¹ x′i 1p

where
- 1p denotes a p × 1 vector of ones
- x′i denotes the i-th row of X
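The analogous check for the row-mean identity, again using mtcars:

```r
X <- as.matrix(mtcars)
p <- ncol(X)
ones <- rep(1, p)                          # the vector 1_p
# row mean of the first item as p^{-1} x'_i 1_p
xbar_row1 <- as.numeric(X[1, ] %*% ones) / p
all.equal(xbar_row1, mean(X[1, ]))         # TRUE
```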
The Data Matrix R Code
Data Frame and Matrix Classes in R
> data(mtcars)
> class(mtcars)
[1] "data.frame"
> dim(mtcars)
[1] 32 11
> head(mtcars)
                   mpg cyl disp  hp drat    wt  qsec vs am gear carb
Mazda RX4         21.0   6  160 110 3.90 2.620 16.46  0  1    4    4
Mazda RX4 Wag     21.0   6  160 110 3.90 2.875 17.02  0  1    4    4
Datsun 710        22.8   4  108  93 3.85 2.320 18.61  1  1    4    1
Hornet 4 Drive    21.4   6  258 110 3.08 3.215 19.44  1  0    3    1
Hornet Sportabout 18.7   8  360 175 3.15 3.440 17.02  0  0    3    2
Valiant           18.1   6  225 105 2.76 3.460 20.22  1  0    3    1
> X <- as.matrix(mtcars)
> class(X)
[1] "matrix"
The Data Matrix R Code
Row and Column Means
> # get row means (3 ways)
> rowMeans(X)[1:3]
    Mazda RX4 Mazda RX4 Wag    Datsun 710
     29.90727      29.98136      23.59818
> c(mean(X[1,]), mean(X[2,]), mean(X[3,]))
[1] 29.90727 29.98136 23.59818
> apply(X, 1, mean)[1:3]
    Mazda RX4 Mazda RX4 Wag    Datsun 710
     29.90727      29.98136      23.59818

> # get column means (3 ways)
> colMeans(X)[1:3]
      mpg       cyl      disp
 20.09062   6.18750 230.72188
> c(mean(X[,1]), mean(X[,2]), mean(X[,3]))
[1] 20.09062 6.18750 230.72188
> apply(X, 2, mean)[1:3]
      mpg       cyl      disp
 20.09062   6.18750 230.72188
The Data Matrix R Code
Other Row and Column Functions
> # get column medians
> apply(X, 2, median)[1:3]
 mpg  cyl disp
19.2  6.0 196.3
> c(median(X[,1]), median(X[,2]), median(X[,3]))
[1] 19.2 6.0 196.3

> # get column ranges
> apply(X, 2, range)[,1:3]
      mpg cyl  disp
[1,] 10.4   4  71.1
[2,] 33.9   8 472.0
> cbind(range(X[,1]), range(X[,2]), range(X[,3]))
     [,1] [,2]  [,3]
[1,] 10.4    4  71.1
[2,] 33.9    8 472.0
The Covariance Matrix
The Covariance Matrix Definition
The Covariation of Data
The covariance matrix refers to the symmetric array of numbers
S = [ s²1  s12  s13  · · ·  s1p
      s21  s²2  s23  · · ·  s2p
      s31  s32  s²3  · · ·  s3p
       ⋮    ⋮    ⋮    ⋱     ⋮
      sp1  sp2  sp3  · · ·  s²p ]

where
- s²j = (1/n) ∑_{i=1}^n (xij − x̄j)² is the variance of the j-th variable
- sjk = (1/n) ∑_{i=1}^n (xij − x̄j)(xik − x̄k) is the covariance between the j-th and k-th variables
- x̄j = (1/n) ∑_{i=1}^n xij is the mean of the j-th variable
The Covariance Matrix Definition
Covariance Matrix from Data Matrix
We can calculate the covariance matrix from the data matrix as

S = (1/n) X′c Xc

where Xc = X − 1n x̄′ = CX with
- x̄′ = (x̄1, . . . , x̄p) denoting the vector of variable means
- C = In − n⁻¹ 1n 1′n denoting a centering matrix

Note that the centered matrix Xc has the form

Xc = [ x11 − x̄1  x12 − x̄2  · · ·  x1p − x̄p
       x21 − x̄1  x22 − x̄2  · · ·  x2p − x̄p
       x31 − x̄1  x32 − x̄2  · · ·  x3p − x̄p
        ⋮          ⋮          ⋱      ⋮
       xn1 − x̄1  xn2 − x̄2  · · ·  xnp − x̄p ]
The Covariance Matrix Properties
Variances are Nonnegative
Variances are sums of squares, which implies that s²j ≥ 0 ∀ j.
- s²j > 0 as long as there does not exist an α such that xj = α1n

This implies that
- tr(S) ≥ 0, where tr(·) denotes the matrix trace function
- ∑_{j=1}^p λj ≥ 0, where (λ1, . . . , λp) are the eigenvalues of S

If n < p, then λj = 0 for at least one j ∈ {1, . . . , p}. If n ≥ p and the p columns of X are linearly independent, then λj > 0 for all j ∈ {1, . . . , p}.
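A quick numerical spot-check of the trace/eigenvalue relationship, using the mtcars covariance matrix (here n = 32 > p = 11 and the columns are linearly independent, so all eigenvalues should be positive):

```r
X <- as.matrix(mtcars)
S <- cov(X)
lambda <- eigen(S, symmetric = TRUE)$values
# trace of S equals the sum of its eigenvalues
all.equal(sum(diag(S)), sum(lambda))  # TRUE
# all eigenvalues nonnegative (strictly positive here, up to rounding)
all(lambda > 0)
```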
The Covariance Matrix Properties
The Cauchy-Schwarz Inequality
From the Cauchy-Schwarz inequality we have that
s²jk ≤ s²j s²k

with the equality holding if and only if xj and xk are linearly dependent.

We could also write the Cauchy-Schwarz inequality as

|sjk| ≤ sj sk
where sj and sk denote the standard deviations of the variables.
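The bound can be checked for every pair of mtcars variables at once; a sketch using an outer product of the standard deviations:

```r
X <- as.matrix(mtcars)
S <- cov(X)
s <- sqrt(diag(S))                  # standard deviations s_j
# |s_jk| <= s_j s_k for every pair (equality on the diagonal)
all(abs(S) <= outer(s, s) + 1e-12)  # TRUE
```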
The Covariance Matrix R Code
Covariance Matrix by Hand (hard way)
> n <- nrow(X)
> C <- diag(n) - matrix(1/n, n, n)
> Xc <- C %*% X
> S <- t(Xc) %*% Xc / (n-1)
> S[1:3,1:6]
             mpg        cyl       disp        hp        drat         wt
mpg    36.324103  -9.172379  -633.0972 -320.7321   2.1950635  -5.116685
cyl    -9.172379   3.189516   199.6603  101.9315  -0.6683669   1.367371
disp -633.097208 199.660282 15360.7998 6721.1587 -47.0640192 107.684204

# or #

> Xc <- scale(X, center=TRUE, scale=FALSE)
> S <- t(Xc) %*% Xc / (n-1)
> S[1:3,1:6]
             mpg        cyl       disp        hp        drat         wt
mpg    36.324103  -9.172379  -633.0972 -320.7321   2.1950635  -5.116685
cyl    -9.172379   3.189516   199.6603  101.9315  -0.6683669   1.367371
disp -633.097208 199.660282 15360.7998 6721.1587 -47.0640192 107.684204
The Covariance Matrix R Code
Covariance Matrix using cov Function (easy way)
> # calculate covariance matrix
> S <- cov(X)
> dim(S)
[1] 11 11

> # check variance
> S[1,1]
[1] 36.3241
> var(X[,1])
[1] 36.3241
> sum((X[,1]-mean(X[,1]))^2) / (n-1)
[1] 36.3241

> # check covariance
> S[1:3,1:6]
             mpg        cyl       disp        hp        drat         wt
mpg    36.324103  -9.172379  -633.0972 -320.7321   2.1950635  -5.116685
cyl    -9.172379   3.189516   199.6603  101.9315  -0.6683669   1.367371
disp -633.097208 199.660282 15360.7998 6721.1587 -47.0640192 107.684204
The Correlation Matrix
The Correlation Matrix Definition
The Correlation of Data
The correlation matrix refers to the symmetric array of numbers
R =
1 r12 r13 · · · r1p
r21 1 r23 · · · r2pr31 r32 1 · · · r3p...
......
. . ....
rp1 rp2 rp3 · · · 1
where
rjk =sjk
sjsk=
∑ni=1(xij − x̄j)(xik − x̄k )√∑n
i=1(xij − x̄j)2√∑n
i=1(xik − x̄k )2
is the Pearson correlation coefficient between variables xj and xk .
The Correlation Matrix Definition
Correlation Matrix from Data Matrix
We can calculate the correlation matrix from the data matrix as

R = (1/n) X′s Xs

where Xs = CXD⁻¹ with
- C = In − n⁻¹ 1n 1′n denoting a centering matrix
- D = diag(s1, . . . , sp) denoting a diagonal scaling matrix

Note that the standardized matrix Xs has the form

Xs = [ (x11 − x̄1)/s1  (x12 − x̄2)/s2  · · ·  (x1p − x̄p)/sp
       (x21 − x̄1)/s1  (x22 − x̄2)/s2  · · ·  (x2p − x̄p)/sp
       (x31 − x̄1)/s1  (x32 − x̄2)/s2  · · ·  (x3p − x̄p)/sp
        ⋮               ⋮               ⋱      ⋮
       (xn1 − x̄1)/s1  (xn2 − x̄2)/s2  · · ·  (xnp − x̄p)/sp ]
The Correlation Matrix Properties
Correlation of a Variable with Itself is One
Assuming that s²j > 0 for all j ∈ {1, . . . , p}, we have that

Cor(xj, xk) = ∑_{i=1}^n (xij − x̄j)(xik − x̄k) / [ √(∑_{i=1}^n (xij − x̄j)²) √(∑_{i=1}^n (xik − x̄k)²) ] = { 1 if j = k;  rjk if j ≠ k }

Because rjk = 1 whenever j = k, we know that
- tr(R) = p, where tr(·) denotes the matrix trace function
- ∑_{j=1}^p λj = p, where (λ1, . . . , λp) are the eigenvalues of R

We also know that the eigenvalues satisfy
- λj = 0 for at least one j ∈ {1, . . . , p} if n < p
- λj > 0 ∀ j if the columns of X are linearly independent
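A quick check of the trace and eigenvalue-sum properties on the mtcars correlation matrix (p = 11):

```r
R <- cor(as.matrix(mtcars))
p <- ncol(R)
lambda <- eigen(R, symmetric = TRUE)$values
# the diagonal of R is all ones, so the trace is p
all.equal(sum(diag(R)), as.numeric(p))  # TRUE
# the eigenvalues of R also sum to p
all.equal(sum(lambda), as.numeric(p))   # TRUE
```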
The Correlation Matrix Properties
The Cauchy-Schwarz Inequality (revisited)
Reminder: the Cauchy-Schwarz inequality implies that
s²jk ≤ s²j s²k

with the equality holding if and only if xj and xk are linearly dependent.

Rearranging the terms, we have that

s²jk / (s²j s²k) ≤ 1  ⟷  r²jk ≤ 1

which implies that |rjk| ≤ 1, with equality holding if and only if xj = α1n + βxk for some scalars α ∈ R and β ∈ R.
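The equality case is easy to see with made-up vectors in an exact linear relation (α and β chosen arbitrarily here):

```r
set.seed(1)
xk <- rnorm(20)
xj <- 3 + 2 * xk     # exact linear relation: alpha = 3, beta = 2
cor(xj, xk)          # the bound |r_jk| <= 1 is attained (r = 1)
cor(3 - 2 * xk, xk)  # r = -1 when beta < 0
```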
The Correlation Matrix R Code
Correlation Matrix by Hand (hard way)
> n <- nrow(X)
> C <- diag(n) - matrix(1/n, n, n)
> D <- diag(apply(X, 2, sd))
> Xs <- C %*% X %*% solve(D)
> R <- t(Xs) %*% Xs / (n-1)
> R[1:3,1:6]
           [,1]       [,2]       [,3]       [,4]       [,5]       [,6]
[1,]  1.0000000 -0.8521620 -0.8475514 -0.7761684  0.6811719 -0.8676594
[2,] -0.8521620  1.0000000  0.9020329  0.8324475 -0.6999381  0.7824958
[3,] -0.8475514  0.9020329  1.0000000  0.7909486 -0.7102139  0.8879799

# or #

> Xs <- scale(X, center=TRUE, scale=TRUE)
> R <- t(Xs) %*% Xs / (n-1)
> R[1:3,1:6]
            mpg        cyl       disp         hp       drat         wt
mpg   1.0000000 -0.8521620 -0.8475514 -0.7761684  0.6811719 -0.8676594
cyl  -0.8521620  1.0000000  0.9020329  0.8324475 -0.6999381  0.7824958
disp -0.8475514  0.9020329  1.0000000  0.7909486 -0.7102139  0.8879799
The Correlation Matrix R Code
Correlation Matrix using cor Function (easy way)
> # calculate correlation matrix
> R <- cor(X)
> dim(R)
[1] 11 11

> # check correlation of mpg and cyl
> R[1,2]
[1] -0.852162
> cor(X[,1], X[,2])
[1] -0.852162
> cov(X[,1], X[,2]) / (sd(X[,1]) * sd(X[,2]))
[1] -0.852162

> # check correlations
> R[1:3,1:6]
            mpg        cyl       disp         hp       drat         wt
mpg   1.0000000 -0.8521620 -0.8475514 -0.7761684  0.6811719 -0.8676594
cyl  -0.8521620  1.0000000  0.9020329  0.8324475 -0.6999381  0.7824958
disp -0.8475514  0.9020329  1.0000000  0.7909486 -0.7102139  0.8879799
Miscellaneous Topics
Miscellaneous Topics Crossproduct Calculations
Two Types of Matrix Crossproducts
We often need to calculate one of two different types of crossproducts:
- X′Y = "regular" crossproduct of X and Y
- XY′ = "transpose" crossproduct of X and Y

The regular crossproduct is X′ post-multiplied by Y.

The transpose crossproduct is X post-multiplied by Y′.
Miscellaneous Topics Crossproduct Calculations
Simple and Efficient Crossproducts in R
> X <- matrix(rnorm(2*3), 2, 3)
> Y <- matrix(rnorm(2*3), 2, 3)
> t(X) %*% Y
          [,1]       [,2]      [,3]
[1,] 0.1342302 -1.8181837 -1.107821
[2,] 1.1014703 -0.6619466 -1.356606
[3,] 0.8760823 -1.0077151 -1.340044
> crossprod(X, Y)
          [,1]       [,2]      [,3]
[1,] 0.1342302 -1.8181837 -1.107821
[2,] 1.1014703 -0.6619466 -1.356606
[3,] 0.8760823 -1.0077151 -1.340044
> X %*% t(Y)
           [,1]      [,2]
[1,]  0.8364239  3.227566
[2,] -1.3899946 -2.704184
> tcrossprod(X, Y)
           [,1]      [,2]
[1,]  0.8364239  3.227566
[2,] -1.3899946 -2.704184
Miscellaneous Topics Vec Operator and Kronecker Product
Turning a Matrix into a Vector
The vectorization (vec) operator turns a matrix into a vector:
vec(X) = (x11, x21, . . . , xn1, x12, . . . , xn2, . . . , x1p, . . . , xnp)′

where the vectorization is done column-by-column.

In R, we just use the combine function c to vectorize a matrix:
> X <- matrix(1:6, 2, 3)
> X
     [,1] [,2] [,3]
[1,]    1    3    5
[2,]    2    4    6
> c(X)
[1] 1 2 3 4 5 6
> c(t(X))
[1] 1 3 5 2 4 6
Miscellaneous Topics Vec Operator and Kronecker Product
vec Operator Properties
Some useful properties of the vec(·) operator include:
- vec(a′) = vec(a) = a for any vector a ∈ Rm
- vec(ab′) = b ⊗ a for any vectors a ∈ Rm and b ∈ Rn
- vec(A)′vec(B) = tr(A′B) for any matrices A, B ∈ Rm×n
- vec(ABC) = (C′ ⊗ A)vec(B) if the product ABC exists
Note: ⊗ is the Kronecker product, which is defined on the next slide.
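These identities are easy to spot-check numerically; a sketch with arbitrary random matrices:

```r
set.seed(1)
A <- matrix(rnorm(6), 2, 3)
B <- matrix(rnorm(12), 3, 4)
C <- matrix(rnorm(8), 4, 2)
# vec(ABC) = (C' kron A) vec(B); c() vectorizes column-by-column
all.equal(c(A %*% B %*% C), c(kronecker(t(C), A) %*% c(B)))  # TRUE
# vec(A)' vec(B) = tr(A'B) for matrices of the same order
D <- matrix(rnorm(6), 2, 3)
all.equal(sum(c(A) * c(D)), sum(diag(t(A) %*% D)))           # TRUE
```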
Miscellaneous Topics Vec Operator and Kronecker Product
Kronecker Product of Two Matrices
Given X = {xij}n×p and Y = {yij}m×q, the Kronecker product is
X ⊗ Y = [ x11Y  x12Y  · · ·  x1pY
          x21Y  x22Y  · · ·  x2pY
           ⋮      ⋮     ⋱     ⋮
          xn1Y  xn2Y  · · ·  xnpY ]

which is a matrix of order mn × pq.

In R, the kronecker function calculates Kronecker products:
> X <- matrix(1:4, 2, 2)
> Y <- matrix(5:10, 2, 3)
> kronecker(X, Y)
     [,1] [,2] [,3] [,4] [,5] [,6]
[1,]    5    7    9   15   21   27
[2,]    6    8   10   18   24   30
[3,]   10   14   18   20   28   36
[4,]   12   16   20   24   32   40
Miscellaneous Topics Vec Operator and Kronecker Product
Kronecker Product Properties
Some useful properties of the Kronecker product include:
1. A ⊗ a = Aa = aA = a ⊗ A for any a ∈ R and A ∈ Rm×n
2. (A ⊗ B)′ = A′ ⊗ B′ for any matrices A ∈ Rm×n and B ∈ Rp×q
3. a′ ⊗ b = ba′ = b ⊗ a′ for any vectors a ∈ Rm and b ∈ Rn
4. tr(A ⊗ B) = tr(A)tr(B) for any matrices A ∈ Rm×m and B ∈ Rp×p
5. (A ⊗ B)⁻¹ = A⁻¹ ⊗ B⁻¹ for any invertible matrices A and B
6. (A ⊗ B)† = A† ⊗ B†, where (·)† is the Moore-Penrose pseudoinverse
7. |A ⊗ B| = |A|^p |B|^m for any matrices A ∈ Rm×m and B ∈ Rp×p
8. rank(A ⊗ B) = rank(A)rank(B) for any matrices A and B
9. A ⊗ (B + C) = A ⊗ B + A ⊗ C for any matrices A, B, and C
10. (A + B) ⊗ C = A ⊗ C + B ⊗ C for any matrices A, B, and C
11. (A ⊗ B) ⊗ C = A ⊗ (B ⊗ C) for any matrices A, B, and C
12. (A ⊗ B)(C ⊗ D) = (AC) ⊗ (BD) for any matrices A, B, C, and D
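A couple of these properties, checked numerically with random square matrices (m = 2, p = 3):

```r
set.seed(2)
A <- matrix(rnorm(4), 2, 2)   # m = 2
B <- matrix(rnorm(9), 3, 3)   # p = 3
# inverse property: (A kron B)^{-1} = A^{-1} kron B^{-1}
all.equal(solve(kronecker(A, B)), kronecker(solve(A), solve(B)))  # TRUE
# determinant property: |A kron B| = |A|^p |B|^m
all.equal(det(kronecker(A, B)), det(A)^3 * det(B)^2)              # TRUE
```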
Miscellaneous Topics Vec Operator and Kronecker Product
Common Application of Vec and Kronecker
Suppose the rows of X are iid samples from some multivariate distribution with mean µ = (µ1, . . . , µp)′ and covariance matrix Σ.

xi ∼iid (µ, Σ), where xi is the i-th row of X

If we let y = vec(X′), then the expectation and covariance are
- E(y) = 1n ⊗ µ is the mean vector
- V(y) = In ⊗ Σ is the covariance matrix

Note that the covariance matrix is block diagonal

In ⊗ Σ = [ Σ  0  · · ·  0
           0  Σ  · · ·  0
           ⋮  ⋮   ⋱    ⋮
           0  0  · · ·  Σ ]

given that data from different subjects are assumed to be independent.
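A minimal sketch building this block-diagonal covariance for hypothetical values of n and Σ:

```r
n <- 3
Sigma <- matrix(c(2, 1, 1, 2), 2, 2)  # hypothetical 2 x 2 covariance
V <- kronecker(diag(n), Sigma)        # covariance of y = vec(X'), (np) x (np)
# diagonal blocks are Sigma; off-diagonal blocks are zero
all.equal(V[1:2, 1:2], Sigma)  # TRUE
all(V[1:2, 3:6] == 0)          # TRUE
```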
Miscellaneous Topics Visualizing Multivariate Data
Two Versions of a Scatterplot in R
[Figure: two scatterplots of mpg (MPG) versus hp (HP) from mtcars, one drawn with base graphics and one with the car package.]

plot(mtcars$hp, mtcars$mpg, xlab="HP", ylab="MPG")
library(car)
scatterplot(mtcars$hp, mtcars$mpg, xlab="HP", ylab="MPG")
Miscellaneous Topics Visualizing Multivariate Data
Two Versions of a Scatterplot Matrix in R
[Figure: two scatterplot matrices of mpg, disp, hp, and wt from mtcars, colored/grouped by number of cylinders (cyl).]

cylint <- as.integer(factor(mtcars$cyl))
pairs(~mpg+disp+hp+wt, data=mtcars, col=cylint, pch=cylint)
library(car)
scatterplotMatrix(~mpg+disp+hp+wt|cyl, data=mtcars)
Miscellaneous Topics Visualizing Multivariate Data
Three-Dimensional Scatterplot in R
[Figure: three-dimensional scatterplot of MPG against HP and WT, with the fitted regression plane from lm(mpg ~ hp + wt).]

library(scatterplot3d)
sp3d <- scatterplot3d(mtcars$hp, mtcars$wt, mtcars$mpg,
                      type="h", color=cylint, pch=cylint,
                      xlab="HP", ylab="WT", zlab="MPG")
fitmod <- lm(mpg ~ hp + wt, data=mtcars)
sp3d$plane3d(fitmod)
Miscellaneous Topics Visualizing Multivariate Data
Color Image (Heat Map) Plots in R
[Figure: two color image (heat map) plots of fitted MPG over a grid of HP and WT values, one drawn with image and one with bigsplines' imagebar.]

fitmod <- lm(mpg ~ hp + wt, data=mtcars)
hpseq <- seq(50, 330, by=20)
wtseq <- seq(1.5, 5.4, length=15)
newdata <- expand.grid(hp=hpseq, wt=wtseq)
fit <- predict(fitmod, newdata)
fitmat <- matrix(fit, 15, 15)
image(hpseq, wtseq, fitmat, xlab="HP", ylab="WT")
library(bigsplines)
imagebar(hpseq, wtseq, fitmat, xlab="HP", ylab="WT", zlab="MPG",
         col=heat.colors(12), ncolor=12)
Miscellaneous Topics Visualizing Multivariate Data
Correlation Matrix Plot in R
[Figure: two correlation matrix plots of the mtcars variables, one using corrplot's circle method and one using corrplot.mixed with numbers below and ellipses above the diagonal.]

cmat <- cor(mtcars)
library(corrplot)
corrplot(cmat, method="circle")
corrplot.mixed(cmat, lower="number", upper="ellipse")
Miscellaneous Topics Visualizing Multivariate Data
Correlation Matrix Color Image (Heat Map) in R
[Figure: color image (heat map) of the mtcars correlation matrix, with the rounded correlations printed in the upper triangle.]

cmat <- cor(mtcars)
p <- nrow(cmat)
library(RColorBrewer)
imagebar(1:p, 1:p, cmat[,p:1], axes=FALSE, zlim=c(-1,1),
         xlab="", ylab="", col=brewer.pal(7, "RdBu"))
axis(1, 1:p, labels=rownames(cmat))
axis(2, p:1, labels=colnames(cmat))
for(k in 1:p) {
  for(j in 1:k) {
    if(j < k) text(j, p+1-k, labels=round(cmat[j,k],2), cex=0.75)
  }
}