Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

17
Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze

Transcript of Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

Page 1: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

Microarray Preprocessing

MBP1010H, Department of Medical BiophysicsIrakli (Erik) Dzneladze

Page 2: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

Affymetrix microarray processing in R

Page 3: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

• Four separate processing steps: background correction, normalization, pm correction and summary expression value computation

• Single Affy function runs selected algorithms in sequence

Affy processing pipeline

Page 4: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

Step 1 - background correction

• Scanner image picks up background noise in every image

• Background noise may be due to unbound fluorescent dyes (e.g. Cy3 and Cy5) used to label the RNA on the chip

• This background is quantified and subtracted from probe intensity values

Page 5: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

Step 2 - normalization

• The hybridization step cannot be perfectly controlled.

• Event though RNA is quantified prior to hybridization, it is impossible to get the exact same amount of RNA to hybridize to each chip

• The result of this is chip to chip differences in overall distribution of probe intensity values

• The purpose of normalization is to minimize these systematic differences between chips so that individual chips can be compared to each other

Page 6: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

Step 3 – pm correction

• Affymetrix GeneChips contain both perfect match (mm) and mismatch (mm) probes

• Mm probes quantify non-specific and cross-hybridization

• Originally mm signal was subtracted from pm signal to correct for non-specific and cross hybridization

• Many researchers prefer to ignore the mm probes entirely and use uncorrected pm probes alone

Page 7: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

Step 4 – summary expression value computation

• Each gene is represented by one or more probes sets

• Each probe set includes 11-20 probe pairs

• Expression value for a gene is a summary of corresponding probe-level data

• i.e. probe level intensity values correlated with “gene expression”

Page 8: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

Instructions can be found in the manual

Page 9: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

Some terms used in the manual

Page 10: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.

Preprocessing ExampleAgilent Platform (Cy3 and Cy5)

Page 11: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.
Page 12: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.
Page 13: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.
Page 14: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.
Page 15: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.
Page 16: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.
Page 17: Microarray Preprocessing MBP1010H, Department of Medical Biophysics Irakli (Erik) Dzneladze.