The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy...

18
The Department of Economic and Management Sciences Prof. Andy Prof. Andy Mauromoustakos Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals in marine sediments, differences between locations “ Presented by: Tareq Altamimi econ0306

Transcript of The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy...

Page 1: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

The Department of Economic and Management Sciences

Prof. Andy MauromoustakosProf. Andy Mauromoustakos

Analysis of multivariate Data using JMP

“Heavy metals in marine sediments, differences between locations “

Presented by:

Tareq Altamimi econ0306

Page 2: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

Slides IndexSlides Index Presentation overview.Presentation overview.

Why MANOVA.Why MANOVA.

JMP Analysis.JMP Analysis.

Database Variables definition.Database Variables definition.

Database Overview & Objectives.Database Overview & Objectives.

MANOVA Tests.MANOVA Tests.

Correlation between Variables.Correlation between Variables.

MANOVA Analysis.MANOVA Analysis.

Repeated measures Response.Repeated measures Response.

Analysis Results.Analysis Results.

Conclusion.Conclusion.

References.References.

Click INDEX Click INDEX to return back to return back here or Enter here or Enter

to go to the to go to the next Slidenext Slide

Principle components.Principle components.

Page 3: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

Presentation Presentation OverviewOverview::

This presentation is a description of an important statistical subject which is “multivariate This presentation is a description of an important statistical subject which is “multivariate analysis” using JMP program. The analysis in this presentation is done for a database analysis” using JMP program. The analysis in this presentation is done for a database collected by scientists in the University of Melbourne at the department of biology. The collected by scientists in the University of Melbourne at the department of biology. The analysis is mainly about heavyanalysis is mainly about heavy metals in marine sediments metals in marine sediments,, differences between differences between lllocationslocations. . We take three different locations with four observations(experiments) each.We take three different locations with four observations(experiments) each. JMP make it easy to see the results of this changes in the heavy metals in marine sediments JMP make it easy to see the results of this changes in the heavy metals in marine sediments in an easy graphically presented data. This graphics is called charts, these chars can be in an easy graphically presented data. This graphics is called charts, these chars can be translated easily by statisticians. This presentation also will include a translation of the translated easily by statisticians. This presentation also will include a translation of the results of analysis in three different locations which this database was tested on.results of analysis in three different locations which this database was tested on. The Analysis in this project is done on multivariate analysis on variance The Analysis in this project is done on multivariate analysis on variance (MANOVA)(MANOVA) whichwhich is used to see the main and interaction effects of categorical variables on multiple is used to see the main and interaction effects of categorical variables on multiple dependent interval variables. MANOVA uses one or more categorical independents as dependent interval variables. MANOVA uses one or more categorical independents as predictorspredictors. . MANOVA tests the differences in the centred (vector) of means of the multiple MANOVA tests the differences in the centred (vector) of means of the multiple interval dependents, for various categories of the independent(s). There are multiple interval dependents, for various categories of the independent(s). There are multiple potential purposes for MANOVA. potential purposes for MANOVA. So all of the analysis here will be done by MANOVA So all of the analysis here will be done by MANOVA using JMP Program. using JMP Program.

INDEX

Page 4: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

JMP dynamically links statistics and graphics so you can easily explore data, make discoveries, and gain the knowledge you need to make better decisions. Click on a point in a graph to highlight the corresponding observation everywhere it is represented in JMP: in other graphs, in 3-D spinning plots, and in the data tables. JMP provides a comprehensive set of statistical tools as well as Design of Experiments (DOE) and advanced quality control (QC and SPC) tools for Six Sigma in a single package. Advanced modelling techniques include ANOVA and MANOVA, stepwise, log linear, ordinal logistic regression, survival/reliability, true non-linear modelling, partitioning (decision trees), neural networks, time series; multivariate, cluster, discriminant, and partial least squares (PLS). The JMP Scripting Language (JSL) lets you capture the results of your work in automatically-generated scripts, and offers all the power of a programming language, complete with matrix algebra support, so you can create custom analyses, interactive graphics, and more.

Multivariate statistics were developed to handle situations where multiple variables or measures are involved. Any analysis of more than two variables or measures can loosely be considered a multivariate statistical analysis or multivariate analysis. One of the primary goals of multivariate statistical analysis is to describe the relationships among a set of variables. The multivariate analysis is widely used in various fields, such as agriculture, food and life sciences, business and engineering and so on. Repeated measures analysis (also called longitudinal data” when repeated measurements are taken on each subject and you want to analyse effects both between subjects and within subjects across the measurements. This multivariate approach is especially important when the correlation structure across the measurements is arbitrary.

Why MANOVA, why Repeated Measures?

Why JMP?

INDEX

Page 5: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

The JMP way of doing things is best summarised in the following four pointsThe JMP way of doing things is best summarised in the following four points

• Variables are assigned to one of the three levels of measurement: nominal, ordinal or continuous. This assignment is under user control and may be changed at will. It is used to allow JMP to decide what summary statistics to provide and what techniques are suitable for analysis using the variable.

• In any analysis, each variable may be assigned one of the roles: X, Y, weight, frequency or label. • Using a combination of the above two pieces of information, JMP is able to decide on an appropriate

analysis if the user chooses one of the following activities, referred to as 'personalities': • Distribution of Y for single variable summaries and plots. • Fit Y by X for one response and one explanatory variable, the techniques employed are

ANOVA, LS regression, logistic regression or contingency table analysis depending on the levels of the X and Y variables.

• Fit Model for variable numbers of responses and explanatory variables, under this heading a range of techniques are available including: ANOVA, ANCOVA, MANOVA, LS regression and stepwise procedures, logistic and ordinal regression, log-linear models, proportional hazard models, screening models and D-optimal designs.

• Non-linear Fit for non-linear models specified by the user and fit using Gauss-Newton or Newton-Raphson and with either one of a range of built in loss functions or one specified by the user.

• Correlation of Ys for examination of the correlation or covariance structure of a set of variables, including scatter plot matrices, PCA and factor analysis.

• Cluster for cluster analysis using hierarchical and K- means approaches. • Survival for survival analysis using Kaplan-Meier, Cox regression and non-linear survival

models. • Most of the analysis 'personalities' produce graphs, many of them dynamic, as part of their standard output.

There are always a range of additional outputs both textual and graphical available from analyses.

INDEX

Page 6: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

MANOVA Tests:MANOVA Tests: MANOVA has four main tests described as following:Hotelling's T-Square is the most common, traditional test where there are two groups formed by the independent variables. Note one may see the related statistic, Hotelling's Trace (a.k.a. Lawley-Hotelling or Hotelling-Lawley Trace). To convert from the Trace coefficient to the T-Square coefficient, multiply the Trace coefficient by (N-g), where N is the sample size across all groups and g is the number of groups. The T-Square result will still have the same F value, degrees of freedom, and significance level as the Trace coefficient. Wilks' lambda, U. This is the most common, traditional test where there are more than two groups formed by the independent variables. It is a measure of the difference between groups of the centroid (vector) of means on the independent variables. The smaller the lambda, the greater the differences. The Bartlett's V transformation of lambda is then used to compute the significance of lambda. Wilks's lambda is used, in conjunction with Bartlett's V, as a multivariate significance test of mean differences in MANOVA, for the case of multiple interval dependents and multiple (>2) groups formed by the independent(s). The t-test, Hotelling's T, and the F test are special cases of Wilks's lambda. Pillai-Bartlett trace, V. Multiple discriminant analysis (MDA) is the part of MANOVA where canonical roots are calculated. Each significant root is a dimension on which the vector of group means is differentiated. The Pillai-Bartlett trace is the sum of explained variances on the discriminant variates, which are the variables which are computed based on the canonical coefficients for a given root. Olson (1976) found V to be the most robust of the four tests and is sometimes preferred for this reason. Roy's greatest characteristic root (GCR) is similar to the Pillai-Bartlett trace but is based only on the first (and hence most important) root. Specifically, let lambda be the largest eigen value, then GCR = lambda/(1 + lambda).

Why can't I just use multiple univariate ANOVA tests rather than MANOVA, one for each dependent variable in my set?

• If the dependent variables are uncorrelated with each other, it may be that a series of univariate ANOVA tests would be acceptable. Where the dependents are correlated (most of the time), MANOVA is superior. This is because ANOVA only tests differences in means, whereas MANOVA is sensitive not only to mean differences but also to the direction and size ofcorrelations among the dependents.

There are multiple potential purposes for MANOVAThere are multiple potential purposes for MANOVA.

To compare groups formed by categorical independent variables on group differences in a set of interval dependent variables.

To use lack of difference for a set of dependent variables as a criterion for reducing a set of independent variables to a smaller, more easily modeled number of variables.

To identify the independent variables which differentiate a set of dependent variables the most.

INDEX

Page 7: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

One objective for this particular study was to determine if there is any differences of the percentage of heavy metals in marine sediments on different locations.

Treatment Design:

The treatments included the quantities of cooper(CU), lead(PB), nickel(NI) and manganese(MN). In every location we get 4 different samples and we made the experiment on it. We did this in the four location which are Delray, Seaspray and Woodside.

Experiment Design:

Samples of marine sediments were randomly assigned to the four treatments in a completely randomized design. The treated marine sediments samples were placed in airtight containers and incubated under conditions conducive to microbial activity. This experiment was done to discover it there is differences in the percentage of heavy metals in marine sediments if the sample is taken from different locations?, is there any effect of the location in these percentages?

These differences can be measured by measuring the three main heavy metals in it. The heavy measures were measured in three different locations. In each location we have four different samples to make sure that we have reached to a general conclusion about this kind of amazing marine sediments. The heavy metal quantity in each marine sediment sample was recorded on an idealized experiment area. The data is already shown in the previous slides and a profile plot from the Fit Model’s MANOVA personality is shown on the right.

Heavy metals in marine sediments, differences between locationsHeavy metals in marine sediments, differences between locations :- :-

Research objectives:

The University of Melbourne is doing a research on Marine sediments. The main aim is to discuss the heavy metals in this marine sediments if the location is changed, is there any changes in the levels of these sediments?. The answer of this problem will be discuss in this database study on this presentation. This analysis will be done using JMP software.

INDEX

Page 8: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

Database Variables Definition:Database Variables Definition: * Site: the first Variable in this database is the site which is the place where the experiment had done. In this database we have three different sites , which is (Delray beach, Sea spray, Woodside). This Variable is important because it divides the data in three different group depending on the area the experiment had been done.The following variable are continuous as CU,PB,NI and MN are dependant variables, this is considered as one of the main conditions of MANOVA.

Factor Variable:

Site: It is having values of the area where the marine sediments had already taken from.

Responses:Responses:

• CU: is the concentration of Copper in marine sediment, on every site we have four different tests.

• PB: This Variable represent is the concentration of lead in Marine sediment.

• NI: It is the concentration of nickel in marine sediment.

• MN: It is the concentration of manganese in marine Sediment.

This last four Variables are considered the most important variables because it will show to us the differences or the similarity of the marine sediments in three different sites. Some other Variable used in this database is the log10 transformation of the previous mentioned variables, in addition to the log10 transformation of FE:

* LCU - log10 transformation of CU * LPB - log10 transformation of PB

* LNI - log10 transformation of NI * LMN - log10 transformation of MN

* LFE –log10 transformation of FE

This data which we have is eligible to the MANOVA conditions as we have continuous dependant variable and categorical independent variable. NEXT

Page 9: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

* SITE - sites from which data were collected (Delray Beach, Seaspray,Woodside)

* CU - concentration of copper

* PB - concentration of lead

* NI - concentration of nickel

* MN - concentration of manganese

* LCU - log10 transformation of CU

* LPB - log10 transformation of PB

* LNI - log10 transformation of NI

* LMN - log10 transformation of MN

*LFE –log10 transformation of FE

WoodsideSea spray

DelrayMMarine

arine sediments

sediments

SITE CU PB NI MN LCU LPB LFE LNI LMNDelray 7,15 12 7,4 137 0,854306 1,0791812 4,060698 0,8692317 2,136721Delray 5,85 6,35 7,05 106,5 0,7671559 0,8027737 3,989005 0,8481891 2,02735Delray 5,45 4,95 10,4 102,5 0,7363965 0,6946052 4,125481 1,0170333 2,010724Delray 8,5 7,05 6,4 125 0,9294189 0,8481891 4,023252 0,80618 2,09691Seaspray 9,45 10,5 12,15 162,5 0,9754318 1,0211893 4,1959 1,0845763 2,210853Seaspray 6,3 18,35 11,85 129,5 0,7993405 1,2636361 4,186108 1,0737184 2,11227Seaspray 7,8 18,1 10,7 224,5 0,8920946 1,2576786 4,164353 1,0293838 2,351216Seaspray 9,15 16,55 12,2 247 0,9614211 1,218798 4,181844 1,0863598 2,392697Woodside 10,8 28,9 18,3 403,5 1,0334238 1,4608978 4,205475 1,2624511 2,605844Woodside 21,2 11,6 18,2 377 1,3263359 1,064458 4,214844 1,2600714 2,576341Woodside 18,2 13,85 15,3 256 1,2600714 1,1414498 4,201397 1,1846914 2,40824Woodside 7,6 9,35 9,25 440 0,8808136 0,9708116 4,09691 0,9661417 2,643453

Database of “heavy metals in marine sediments, differences between locations.Database of “heavy metals in marine sediments, differences between locations.

NEXT

Page 10: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

The database in JMP program

This picture showing us how the data look like at the JMP program during the analysis.To know what does this mean look

at the previous page in the previous page

My models (scripts)

INDEX

Page 11: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

Correlation between variablesCorrelation between variablesThe table behind is called the Correlations table, which is a matrix of correlation coefficients that summarizes the strength of the linear relationships between the each response which are the heavy metal variables (CU,PB,NI, MN, LCU,LFE,LNI,LMN).

The scatter plot matrix on the left showing that there is a relation between the different items in the data so it shows that the data is correlated, this correlation is not a high correlation but it is correlated. If we get the LFE variable with the LNI it shows a strong relationship between both in the plot. INDEX

Page 12: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

Description of how some analysis work in JMP.Description of how some analysis work in JMP.

In MANOVA analysis you can select a response design that indicates whether you want to use the response variables individually or in some linear combination. JMP like most software supports several response designs, but it also allows you to “build your own”. Included designs:

•Rep. Measures Automatic analysis of repeated measures design.

This is the way used in this database analysis.

•Sum the sum of the responses, one value

•Identity each response, the identity matrix (no transformation M=I)

•Contrast each response (except the first) minus the first

•Polynomial orthogonal polynomials

•Helmert each response versus the ones after it, except the last

•Profile each response versus all others, except the last

•Mean each response versus the mean of the others, except the last

•Compound for response forming a compound of more than one effect

•Custom any M matrix you want to enter a d edit yourself.

INDEX

Page 13: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

Eigenvalue

Percent

Cum Percent

CU

PB

NI

MN

LCU

LPB

LFE

LNI

LMN

Eigenvectors

5.9961

66.6238

66.6238

0.30689

0.28402

0.38542

0.31499

0.32571

0.29889

0.34783

0.38117

0.34013

1.4966

16.6294

83.2533

-0.48944

0.55689

-0.03867

-0.07459

-0.42388

0.49873

0.10778

0.00331

-0.05681

0.9337

10.3744

93.6277

-0.13965

0.00850

-0.22932

0.63071

-0.08358

0.00549

-0.37664

-0.26851

0.55595

0.4129

4.5878

98.2155

0.36319

0.28818

-0.21616

-0.23685

0.44464

0.43570

-0.39609

-0.35435

-0.11058

0.1392

1.5467

99.7622

-0.03391

-0.32411

-0.52273

-0.08129

0.12391

0.26660

0.65158

-0.23751

0.21576

0.0138

0.1535

99.9157

0.45889

-0.41377

-0.00612

0.09486

-0.54944

0.48289

-0.16609

0.19372

-0.09487

0.0042

0.0463

99.9620

0.35010

0.27606

0.05367

0.42995

-0.21464

-0.19574

0.33455

-0.48911

-0.42183

0.0028

0.0315

99.9935

0.41989

0.40570

-0.38505

-0.31247

-0.32339

-0.35899

0.03491

0.20203

0.37109

0.0006

0.0065

100.0000

-0.06522

0.09499

-0.57116

0.37845

0.20410

-0.02472

-0.05811

0.53203

-0.43425

Principal Components

The Spinning Plot platform displays a three-dimensional spinnable plot.

This third column shows the cumulative percent of variation represented by the eigenvalues. The first three principal components account for 93.6277% of the variation in the sample.

INDEX

Page 14: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

Analysis with MANOVAAnalysis with MANOVA

Least Squares Means Report:

This graph give us for each pure nominal effect, the overall least squares means of all the heavy metals and their log transformations and profile plots of the means. Shows the profile plot of the metal and their transformations logs and the table of least squares means.

The second graph is showing to us every site mean so it dividing to us the over all means depending on the location where every sample is gotten from. It also include the table of least square means organized by sites (locations). From here we can see that the least square mean of the Woodside area is more than the Seapray and the Seapray results is also more than Delray. This also can be noticed from the table of least squares of every variable described under this graph.

SIT

E L

S M

eans

0

50

100

150

200

250

300

350

400

450

Seaspray

Woodside

Delray

CU PB NI MN LCU LPB LFE LNI LMN

Responses

Delray

Seaspray

Woodside

6,7375

8,175

14,45

7,5875

15,875

15,925

7,8125

11,725

15,2625

117,75

190,875

369,125

0,82181932

0,907072

1,12516117

0,8561873

1,1903255

1,1594043

4,049609

4,18205125

4,1796565

0,88515852

1,06850957

1,1683389

2,06792625

2,266759

2,5584695

SITE

CU PB NI MN LCU LPB LFE LNI LMN

Ove

rall L

S M

eans

0

50

100

150

200

250

300

350

400

450

CU PB NI MN LCU LPB LFE LNI LMN

Responses

9,7875 13,1291667 11,6 225,916667 0,95135083 1,06863903 4,13710558 1,040669 2,29771825

Overall Means

CU PB NI MN LCU LPB LFE LNI LMN

NEXT

Page 15: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

The Partial Correlation table here shows the covariance matrix and the partial correlation matrix of residuals from the initial fit. The partial correlation table shows the partial correlations of each pair of variables after adjusting for all the other variables so we can notice how its designed to make relations between every heavy metal here. Notice that the diagonal is 1 always in the partial correlation.

CU

PB

NI

MN

LCU

LPB

LFE

LNI

LMN

14.6471528

-5.0664583

5.10090278

-90.075

0.50908533

-0.0593424

0.08025832

0.16467826

-0.077684

-5.0664583

33.6957639

6.14270833

47.6875

-0.1095526

0.92665951

0.05731365

0.18238988

0.10783013

5.10090278

6.14270833

7.21347222

-42.159028

0.16650174

0.13264857

0.10885945

0.25725092

-0.0640452

-90.075

47.6875

-42.159028

3188.79167

-2.6229898

1.08255294

-0.984555

-1.7540909

5.40295939

0.50908533

-0.1095526

0.16650174

-2.6229898

0.01879656

0.00012066

0.00255351

0.00502964

-0.0012138

-0.0593424

0.92665951

0.13264857

1.08255294

0.00012066

0.02820926

0.00111961

0.00342083

0.00368664

0.08025832

0.05731365

0.10885945

-0.984555

0.00255351

0.00111961

0.00222144

0.00420309

-0.0014722

0.16467826

0.18238988

0.25725092

-1.7540909

0.00502964

0.00342083

0.00420309

0.00953408

-0.0028589

-0.077684

0.10783013

-0.0640452

5.40295939

-0.0012138

0.00368664

-0.0014722

-0.0028589

0.01031465

P.Cov

CU PB NI MN LCU LPB LFE LNI LMN

CU

PB

NI

MN

LCU

LPB

LFE

LNI

LMN

1.0000

-0.2281

0.4962

-0.4168

0.9702

-0.0923

0.4449

0.4407

-0.1999

-0.2281

1.0000

0.3940

0.1455

-0.1377

0.9505

0.2095

0.3218

0.1829

0.4962

0.3940

1.0000

-0.2780

0.4522

0.2941

0.8600

0.9809

-0.2348

-0.4168

0.1455

-0.2780

1.0000

-0.3388

0.1141

-0.3699

-0.3181

0.9421

0.9702

-0.1377

0.4522

-0.3388

1.0000

0.0052

0.3952

0.3757

-0.0872

-0.0923

0.9505

0.2941

0.1141

0.0052

1.0000

0.1414

0.2086

0.2161

0.4449

0.2095

0.8600

-0.3699

0.3952

0.1414

1.0000

0.9133

-0.3076

0.4407

0.3218

0.9809

-0.3181

0.3757

0.2086

0.9133

1.0000

-0.2883

-0.1999

0.1829

-0.2348

0.9421

-0.0872

0.2161

-0.3076

-0.2883

1.0000

P.Corr

CU PB NI MN LCU LPB LFE LNI LMN

Partial Correlation

Partial covariance and correlation tables: Partial covariance and correlation tables: CU

PB

NI

MN

LCU

LPB

LFE

LNI

LMN

131.824375

-45.598125

45.908125

-810.675

4.58176793

-0.5340813

0.72232487

1.48210433

-0.699156

-45.598125

303.261875

55.284375

429.1875

-0.9859736

8.33993563

0.51582288

1.64150892

0.97047116

45.908125

55.284375

64.92125

-379.43125

1.49851569

1.19383717

0.97973505

2.3152583

-0.5764072

-810.675

429.1875

-379.43125

28699.125

-23.606908

9.74297645

-8.8609946

-15.786818

48.6266345

4.58176793

-0.9859736

1.49851569

-23.606908

0.16916901

0.00108591

0.02298157

0.04526677

-0.0109245

-0.5340813

8.33993563

1.19383717

9.74297645

0.00108591

0.25388336

0.01007649

0.03078743

0.03317973

0.72232487

0.51582288

0.97973505

-8.8609946

0.02298157

0.01007649

0.01999297

0.03782781

-0.01325

1.48210433

1.64150892

2.3152583

-15.786818

0.04526677

0.03078743

0.03782781

0.08580676

-0.0257297

-0.699156

0.97047116

-0.5764072

48.6266345

-0.0109245

0.03317973

-0.01325

-0.0257297

0.09283183

E

CU PB NI MN LCU LPB LFE LNI LMN

CU

PB

NI

MN

LCU

LPB

LFE

LNI

LMN

134.56625

102.04

113.706875

4216.4875

5.10744575

3.49980566

1.57113336

4.09869972

7.86616037

102.04

184.265417

126.288125

3614.36667

4.32873157

7.06092801

2.90902192

5.18070623

7.66975505

113.706875

126.288125

111.09875

3719.20625

4.48658444

4.60919815

1.971417

4.24026802

7.28587499

4216.4875

3614.36667

3719.20625

133746.292

161.81474

126.857767

55.9315542

136.515453

253.129801

5.10744575

4.32873157

4.48658444

161.81474

0.19579625

0.151628

0.06695685

0.16440443

0.30582963

3.49980566

7.06092801

4.60919815

126.857767

0.151628

0.27272668

0.1116809

0.19205713

0.27487817

1.57113336

2.90902192

1.971417

55.9315542

0.06695685

0.1116809

0.04594538

0.08116168

0.11923894

4.09869972

5.18070623

4.24026802

136.515453

0.16440443

0.19205713

0.08116168

0.16503284

0.2726529

7.86616037

7.66975505

7.28587499

253.129801

0.30582963

0.27487817

0.11923894

0.2726529

0.48701621

Whole Model H

CU PB NI MN LCU LPB LFE LNI LMN

CU

PB

NI

MN

LCU

LPB

LFE

LNI

LMN

1149.54187

1542.02062

1362.42

26533.9125

111.736155

125.511654

485.903051

122.226574

269.867008

1542.02062

2068.50021

1827.58

35593.1708

149.885324

168.36408

651.800985

163.957401

362.00551

1362.42

1827.58

1614.72

31447.6

132.428036

148.754553

575.885097

144.861125

319.84238

26533.9125

35593.1708

31447.6

612460.083

2579.11211

2897.08042

11215.6932

2821.25366

6229.11418

111.736155

149.885324

132.428036

2579.11211

10.8608209

12.1998076

47.2300661

11.8804958

26.2312341

125.511654

168.36408

148.754553

2897.08042

12.1998076

13.7038726

53.0528701

13.3451942

29.4651769

485.903051

651.800985

575.885097

11215.6932

47.2300661

53.0528701

205.387711

51.6642904

114.070836

122.226574

163.957401

144.861125

2821.25366

11.8804958

13.3451942

51.6642904

12.9959036

28.6939698

269.867008

362.00551

319.84238

6229.11418

26.2312341

29.4651769

114.070836

28.6939698

63.3541099

Intercept

CU PB NI MN LCU LPB LFE LNI LMN

CU

PB

NI

MN

LCU

LPB

LFE

LNI

LMN

134.56625

102.04

113.706875

4216.4875

5.10744575

3.49980566

1.57113336

4.09869972

7.86616037

102.04

184.265417

126.288125

3614.36667

4.32873157

7.06092801

2.90902192

5.18070623

7.66975505

113.706875

126.288125

111.09875

3719.20625

4.48658444

4.60919815

1.971417

4.24026802

7.28587499

4216.4875

3614.36667

3719.20625

133746.292

161.81474

126.857767

55.9315542

136.515453

253.129801

5.10744575

4.32873157

4.48658444

161.81474

0.19579625

0.151628

0.06695685

0.16440443

0.30582963

3.49980566

7.06092801

4.60919815

126.857767

0.151628

0.27272668

0.1116809

0.19205713

0.27487817

1.57113336

2.90902192

1.971417

55.9315542

0.06695685

0.1116809

0.04594538

0.08116168

0.11923894

4.09869972

5.18070623

4.24026802

136.515453

0.16440443

0.19205713

0.08116168

0.16503284

0.2726529

7.86616037

7.66975505

7.28587499

253.129801

0.30582963

0.27487817

0.11923894

0.2726529

0.48701621

SITE

CU PB NI MN LCU LPB LFE LNI LMN

Overall E&H Matrices

The main ingredients of multivariate tests are the E and the H matrices: •The elements of the E matrix are the cross products of the residuals. “E & H meaning click here•The H matrices correspond to hypothesis sums of squares and cross products. •There is an H matrix for the whole model and for each effect in the model. Diagonal elements of the E and H matrices correspond to the hypothesis (numerator) and error (denominator) sum of squares for the univariate F tests. New E and H matrices for any given response design are formed from these initial matrices, and the multivariate test statistics are computed from them.

INDEX

Page 16: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

The MANOVA Analysis The MANOVA Analysis In this MANOVA analysis we choose to use the Repeated measures response because the data has several observations in every site we have. If we look to the F-test we can notice that its intercept test prob>f = 0.0001 which is less than 0.05(α). From the main principals of the Multivariate tests we will notice that in Roy's max root test prob>f is 0.0353 which is less than 0.05. From here we can understand that there is a difference in the level of heavy metals in marine sediments when we change the location where the sample is gotten from.

The result of this test improve that the The result of this test improve that the level of heavy metals is varied whenever we level of heavy metals is varied whenever we change the location. change the location. INDEX

Page 17: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

Conclusion:Conclusion:

• After applying a statistics analysis to After applying a statistics analysis to the data of marine sediments in three the data of marine sediments in three different locations (Woodside,Sea different locations (Woodside,Sea spray,Delray) we discover that the spray,Delray) we discover that the heavy metals levels in the marine heavy metals levels in the marine sediments depend rationally on the area sediments depend rationally on the area where these marine sediments is where these marine sediments is located.located.

INDEX

Page 18: The Department of Economic and Management Sciences Prof. Andy Mauromoustakos Prof. Andy Mauromoustakos Analysis of multivariate Data using JMP “Heavy metals.

Experimental Design and Data Analysis for Biologists

Gerry Quinn & Mick Keough

Chapter 16: Multivariate analysis of variance and discriminant analysis Published by Cambridge University Press 2002.

* The database is a study of * The database is a study of “University of Melbourne”.“University of Melbourne”.

http://www.jmp.com/ & other alternative website.

PowerPoint presentation on “Marketing Research Part B Continuous Data Applications Multivariate Analysis” and other PDF and printed material.

Dr. Andy Mauromoustakos

JMP version 5 help.

Home

INDEX