3 principal components analysis

Biology

Chemistry

Informatics

Principal Component Analysis (PCA) of metabolomic sample processing methods

Goal: Use PCA to identify the major modes of variance

(Used DATA: Pumpkin data 1.csv)

Topics:
1. Principal component number selection
2. Data pretreatment
3. PCA results visualization

Principal Components Analysis

Steps:
1. Calculate a PCA model
2. Select the optimal number of principal components (PCs) for the model
3. Overview PCA scores and loadings plots
4. Repeat steps 1-2 using data centering and scaling

Visualize:
5. Sample scores annotated by extraction and treatment
6. Leverage and DmodX (distance from model plane)
7. Variable loadings and biplots

Exercise:
8. How many PCs are needed to capture 80% of the variance for raw data and scaled data?
9. Are there any moderate or extreme outliers?
10. Which variables contribute most to the variance for raw and scaled data?

PCA Variance Explained (raw data)

• PCs can be selected to explain a minimum %variance in the data (~80%)

• PCs explaining below 1% variance can be excluded

• Q2 is the cross-validated measure of how well the PCA model predicts left-out data
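The selection rules above can be sketched in a few lines of NumPy. This is a minimal illustration, not the workflow used for the slides: the data are synthetic stand-ins (not the pumpkin dataset), and the helper names `explained_variance_ratio` and `n_components_for` are hypothetical. Cross-validated Q2 is not computed here.

```python
import numpy as np

def explained_variance_ratio(X):
    """Fraction of total variance captured by each PC of column-centered X."""
    Xc = X - X.mean(axis=0)                      # center each variable
    s = np.linalg.svd(Xc, compute_uv=False)      # singular values, descending
    var = s ** 2                                 # proportional to PC variances
    return var / var.sum()

def n_components_for(ratio, threshold=0.80):
    """Smallest number of PCs whose cumulative variance reaches `threshold`."""
    cumulative = np.cumsum(ratio)
    k = int(np.searchsorted(cumulative, threshold)) + 1
    return min(k, len(ratio))

# Synthetic stand-in data (hypothetical, NOT the pumpkin dataset):
# 20 samples x 6 variables with very different magnitudes.
rng = np.random.default_rng(0)
X = rng.normal(size=(20, 6)) * np.array([100.0, 10.0, 5.0, 1.0, 1.0, 1.0])

ratio = explained_variance_ratio(X)
k80 = n_components_for(ratio, 0.80)
minor = np.flatnonzero(ratio < 0.01) + 1   # 1-based PCs explaining < 1% variance

print("variance explained per PC:", np.round(ratio, 3))
print("PCs needed for 80% variance:", k80)
print("PCs below 1% variance:", minor)
```

Running the same two helpers on raw and on scaled data gives the numbers asked for in exercise 8.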

PCA Scores (raw data)

• Hotelling's T2 ellipse shows the 95% confidence region, assuming the scores follow a bivariate normal distribution

• Samples lying outside of the ellipse could be outliers
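A rough sketch of how the T2 statistic and a 95% limit could be computed for a two-component score plot is shown here. It assumes one common form of the limit, A(n-1)/(n-A) times the F(A, n-A) quantile; software packages differ in the exact factor, so the slides' ellipse may use another variant. The F quantile uses the closed form that exists for 2 numerator degrees of freedom, which avoids a SciPy dependency. Data and function names are hypothetical.

```python
import numpy as np

def pca_scores(X, n_components=2):
    """Scores of column-centered X on the first `n_components` PCs."""
    Xc = X - X.mean(axis=0)
    U, s, _ = np.linalg.svd(Xc, full_matrices=False)
    return U[:, :n_components] * s[:n_components]

def f_quantile_2(d2, conf=0.95):
    """Quantile of F(2, d2); closed form valid only for 2 numerator df."""
    return 0.5 * d2 * ((1.0 - conf) ** (-2.0 / d2) - 1.0)

def hotelling_t2(T):
    """T2 per sample: squared scores scaled by each PC's score variance."""
    var = T.var(axis=0, ddof=1)
    return np.sum(T ** 2 / var, axis=1)

def t2_limit_2pc(n, conf=0.95):
    """Assumed limit form for A = 2 components: A(n-1)/(n-A) * F(A, n-A)."""
    A = 2
    return A * (n - 1) / (n - A) * f_quantile_2(n - A, conf)

# Synthetic stand-in data (hypothetical): 30 samples x 5 variables.
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 5))

T = pca_scores(X, n_components=2)
t2 = hotelling_t2(T)
limit = t2_limit_2pc(n=X.shape[0])

outside = np.flatnonzero(t2 > limit)                  # candidate outliers
semi_axes = np.sqrt(T.var(axis=0, ddof=1) * limit)    # ellipse half-widths on PC1, PC2

print("T2 limit:", round(limit, 3))
print("samples outside the ellipse:", outside)
print("ellipse semi-axes (PC1, PC2):", np.round(semi_axes, 3))
```

A sample outside the ellipse is only a candidate outlier; with a 95% region, about 1 in 20 well-behaved samples is expected to fall outside by chance.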

PCA Loadings (raw data, centered)

• Loadings from unscaled data are highly correlated with variable magnitude
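The effect can be reproduced on a toy example. In this sketch, three independent synthetic variables differ only in magnitude (hypothetical standard deviations of 100, 1 and 0.1); after centering without scaling, the PC1 loading vector points almost entirely along the largest variable.

```python
import numpy as np

# Synthetic stand-in data (hypothetical): independent variables that differ
# only in magnitude, so there is no real correlation structure to find.
rng = np.random.default_rng(1)
sd = np.array([100.0, 1.0, 0.1])
X = rng.normal(size=(50, 3)) * sd

Xc = X - X.mean(axis=0)                          # centered, NOT scaled
_, _, Vt = np.linalg.svd(Xc, full_matrices=False)
pc1 = Vt[0]                                      # PC1 loadings, unit length

dominant = int(np.argmax(np.abs(pc1)))           # variable with the largest |loading|
print("PC1 loadings:", np.round(pc1, 4))
print("dominant variable index:", dominant)
```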

PCA Biplot (raw data)

• Biplots can be used to rapidly overview the correlation between sample scores and variable loadings

Sample contains high maleic acid

Sample contains high sucrose (low maleic acid)

PCA Leverage and DmodX (raw data)

• Leverage is a sample's distance from the center of all samples within the PCA plane (flags extreme outliers)

• Distance to model X (DmodX) is a sample's orthogonal distance from the PCA plane (flags moderate outliers)
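Both diagnostics can be sketched directly from the SVD. The definitions used here are assumptions, since normalizations vary between packages: leverage is taken as the sum of squared scores, each divided by that PC's total sum of squares, and DmodX as the residual standard deviation per sample with K - A degrees of freedom. The data are synthetic, with one deviation planted off the model plane.

```python
import numpy as np

def leverage_and_dmodx(X, n_components=2):
    """Leverage (in-plane distance) and DmodX (off-plane distance) per sample."""
    n, k = X.shape
    A = n_components
    Xc = X - X.mean(axis=0)
    U, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    T = U[:, :A] * s[:A]                  # scores
    P = Vt[:A].T                          # loadings
    # Leverage: squared scores, each PC scaled by its total sum of squares.
    leverage = np.sum(T ** 2 / np.sum(T ** 2, axis=0), axis=1)
    # DmodX: residual standard deviation of each sample (assumed K - A df).
    E = Xc - T @ P.T
    dmodx = np.sqrt(np.sum(E ** 2, axis=1) / (k - A))
    return leverage, dmodx

# Synthetic stand-in data (hypothetical): two latent factors drive
# variables 0-3; variables 4-5 carry only noise.
rng = np.random.default_rng(2)
latent = rng.normal(scale=5.0, size=(40, 2))
W = np.array([[1.0, 1.0, 0.0, 0.0, 0.0, 0.0],
              [0.0, 0.0, 1.0, 1.0, 0.0, 0.0]])
X = latent @ W + rng.normal(scale=0.1, size=(40, 6))
X[0, 5] += 3.0                            # planted deviation off the model plane

leverage, dmodx = leverage_and_dmodx(X, n_components=2)
print("sample with largest DmodX:", int(np.argmax(dmodx)))
print("sample with largest leverage:", int(np.argmax(leverage)))
```

The planted sample deviates in a variable the two PCs do not model, so it shows up in DmodX rather than in leverage.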

PCA Variance Explained (autoscaled)
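Autoscaling means centering each variable and dividing it by its standard deviation, so every variable enters the model with unit variance. A minimal sketch on synthetic data follows; the `autoscale` helper is a hypothetical name, and the use of the sample standard deviation (ddof=1) is an assumption.

```python
import numpy as np

def autoscale(X):
    """Mean-center each variable and divide by its sample standard deviation."""
    mean = X.mean(axis=0)
    sd = X.std(axis=0, ddof=1)     # assumes no constant (zero-variance) columns
    return (X - mean) / sd

# Synthetic stand-in data (hypothetical): variables spanning five orders of magnitude.
rng = np.random.default_rng(3)
X = rng.normal(size=(20, 4)) * np.array([1000.0, 50.0, 2.0, 0.01])
Xs = autoscale(X)

# Variance explained per PC after autoscaling.
s = np.linalg.svd(Xs, compute_uv=False)
ratio = s ** 2 / np.sum(s ** 2)
print("column SDs after autoscaling:", np.round(Xs.std(axis=0, ddof=1), 3))
print("variance explained per PC:", np.round(ratio, 3))
```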

PCA Scores (autoscaled)

• Loadings on PC1 describe differences due to extraction

• Loadings on PC2 describe differences due to treatment

PCA Leverage and DmodX (autoscaled)

• Samples with both high leverage and high DmodX are likely outliers

• Evaluate PCA results after their removal

PCA Loadings (autoscaled)

• Scaled loadings are independent of variable magnitude and show a rich variance structure of the data

Relationship between scores and loadings (autoscaled)

Extraction:

Higher in 100% MeOH

Lower in 100% MeOH

Loadings and Scores

Highest positive loading on PC1

Highest negative loading on PC1
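Finding the variables with the most extreme loadings on a component takes only an argmax and an argmin. The sketch below uses synthetic data and hypothetical variable names (`var_a`, `var_b`, `var_c`). Note that the sign of a principal component is arbitrary, so which variable is "positive" and which is "negative" can flip between runs or software; only the contrast between them is meaningful.

```python
import numpy as np

def extreme_loadings(X, names, pc=0):
    """Names of the variables with the highest and lowest loading on one PC."""
    Xs = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)   # autoscale
    _, _, Vt = np.linalg.svd(Xs, full_matrices=False)
    loadings = Vt[pc]
    top_pos = names[int(np.argmax(loadings))]
    top_neg = names[int(np.argmin(loadings))]
    return top_pos, top_neg, loadings

# Synthetic stand-in data (hypothetical): var_a and var_b move in opposite
# directions along one latent factor; var_c is unrelated noise.
rng = np.random.default_rng(4)
z = rng.normal(size=50)
X = np.column_stack([
    z + rng.normal(scale=0.1, size=50),
    -z + rng.normal(scale=0.1, size=50),
    rng.normal(size=50),
])
names = ["var_a", "var_b", "var_c"]

top_pos, top_neg, loadings = extreme_loadings(X, names, pc=0)
print("highest positive loading on PC1:", top_pos)
print("highest negative loading on PC1:", top_neg)
```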
