Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene...

43
Introduction to MicroArrays and Gene Expression Profiling Haibe-Kains B 1,2 Bontempi G 2 Sotiriou C 1 1 Unit´ e Microarray, Institut Jules Bordet 2 Machine Learning Group, Universit´ e Libre de Bruxelles March 23, 2007 Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 1 / 43

Transcript of Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene...

Page 1: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Introduction to MicroArrays and Gene ExpressionProfiling

Haibe-Kains B1,2 Bontempi G2 Sotiriou C1

1Unite Microarray, Institut Jules Bordet

2Machine Learning Group, Universite Libre de Bruxelles

March 23, 2007

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 1 / 43

Page 2: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Research GroupsFunctional Genomic Unit

7 researchers (1 prof, 3 postDocs, 6 PhD students), 3 technicians).

Research topics : Genomic analyses and clinic studies.

Website : http://www.bordet.be/en/services/medical/array/practical.htm.

National scientific collaborations : ULB, Erasme, ULg, Gembloux

International scientific collaborations : Genome Institute of Singapore,John Radcliffe Hospital, Karolinska Institute and Hospital, MDAnderson Cancer Center, Netherlands Cancer Institute, Swiss Institutefor Experimental Cancer Research, NCI/NIH, Gustave-RoussyInstitute, IDDI

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 2 / 43

Page 3: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Research GroupsMachine Learning Group

7 researchers (1 prof, 6 PhD students), 4 graduate students).

Research topics : Bioinformatics, Classification, Regression, Timeseries prediction, Sensor networks.

Website : http://www.ulb.ac.be/di/mlg.

Scientific collaborations in ULB : IRIDIA (Sciences Appliquees),Physiologie Molculaire de la Cellule (IBMM), Conformation desMacromolcules Biologiques et Bioinformatique (IBMM), CENOLI(Sciences), Microarray Unit (Hopital Jules Bordet), Serviced’Anesthesie (Erasme).

Scientific collaborations outside ULB : UCL Machine Learning Group(B), Politecnico di Milano (I), Universita del Sannio (I), GeorgeMason University (US).

The MLG is part to the ”Groupe de Contact FNRS” on MachineLearning and to CINBIOS (http://babylone.ulb.ac.be/Joomla/)

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 3 / 43

Page 4: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Outline

1 MicroArray Analysis Design2 MicroArray Technologies

1 Introduction2 Differences3 Comparison4 Affymetrix

3 Preprocessing1 Visualization2 Missing Data

4 Biological Annotations1 NCBI Entrez

5 MicroArray Databases

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 4 / 43

Page 5: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Microarray Analysis Design

Experimentaldesign

Normalization

Imageanalysis

Expressionquantification

Biologicalquestion

Microarrayexperiment

Preprocessing

Class predictionClass discoveryClass

comparison

Biological verification and interpretation

Statistics

Biology

Graphs and networks

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 5 / 43

Page 6: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Part I

MicroArray Technologies

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 6 / 43

Page 7: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Introduction

Enormous advances in genomics and molecular biology during lastdecade

High-throughput techniques like microarrays carry the promise to givenew insights into functionalities of whole genomes in a systematicmanner

Moreover, the analysis of such a vast amount of data leads to newchallenges in statistics and bioinformatics

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 7 / 43

Page 8: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

MicroArrayDefinition

A microarray is composed of

DNA fragments (probes) fixed on a solid support

ordered position of probes

principle of hybridization to a specific probe of complementarysequence

molecular labeling

ß Simultaneous detection of thousands of sequences in parallel

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 8 / 43

Page 9: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Probes

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 9 / 43

Page 10: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

TechnologiesDifferences

There exist several high-throughput methods to simultaneously measurethe expression of a large number of genes :

cDNA microarray

oligonucleotide microarrayI short oligonucleotide (Affymetrix c©)I long oligonucleotide (Agilent c©, CodeLink c©)

multiplex quantitative RT-PCR

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 10 / 43

Page 11: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

TechnologiesDifferences cont’d

We could classify microarray technologies by the building process and thegene expression measurement

spotted vs synthesized : some microarrays are built using a robot thatputs a small amount of RNA (probes) on a glass. For others, theprobes are synthesized directly on the glass using a technology similarto integrated circuits manufacturing.

dual-channel vs one-channel : some microarrays uses 2 types oflabeling to detect hybridization. The first label is for the target tissueand the second one for the reference tissue. For other microarraytechnologies, only one labeling/tissue is used.

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 11 / 43

Page 12: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

MicroArray Comparison

cDNA short oligonucleotide long oligonuleotide

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 12 / 43

Page 13: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

TechnologiesAffymetrix c©

Affymetrix c© is a short oligonucleotide (synthesized, one-channel)microarray technology

We will focus on this platform in order to overview the maincharacteristics of microarray technology

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 13 / 43

Page 14: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c©

Design

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 14 / 43

Page 15: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c©

Equipment

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 15 / 43

Page 16: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c©

GeneChip

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 16 / 43

Page 17: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c©

GeneChip Structure

1 gene is represented by 1 or more probe sets

1 probe set includes 11 to 20 probe pairs

1 probe pair includes a perfect match (PM) value and a mismatchvalue (MM)

Gene

Probe

ProbesetProbeset Probeset

ProbeProbe

PM MM PM MM PM MM

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 17 / 43

Page 18: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c©

GeneChip Structure cont’d

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 18 / 43

Page 19: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c©

Hybridization

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 19 / 43

Page 20: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c©

Detection

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 20 / 43

Page 21: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c©

Good choice ?

Advantages :I Commercially available for several years (strong manufacturing)I Large number of published studies (generally accepted method)I No reference sample ß possible comparison between studies

Disadvantages :I Cost of the devices and the chips (but easy use)I Changes in probe design is hard (but new program permits to create

his own design)I Short oligos → several oligos per gene, specificity/sensitivity trade-off

(complex methods to get gene expression)

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 21 / 43

Page 22: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Part II

Preprocessing

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 22 / 43

Page 23: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c© Preprocessing

We will focus on preprocessing methods for Affymetrix c© data

image analysis : get raw probe intensities from chip image

expression quantification : get gene expressions from raw probeintensities

normalization : remove systematic bias to compare gene expressions

There exist several methods but we will focus on robust multi-arrayanalysis (RMA) methods [Irizarry et al., 2003]

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 23 / 43

Page 24: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c© Preprocessing

Main software from Affymetrix c© was MicroArray cont’d 5 (MAS5),now called GeneChip Operating Software 1.4 (GCOS)

DAT file is the image file

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 24 / 43

Page 25: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c© Preprocessing

Each probe cell is composed by 10x10 pixels

remove outer 36 pixels → 8x8 pixels

probe cell signal, PM or MM, is the 75th percentile of the 8x8 pixelvalues

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 25 / 43

Page 26: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c© Preprocessing

CEL file is the cell intensity file including probe level PM and MMvalues

The last file provided by Affymetrix c© concerns the annotations of thechip

CDF file is the chip description file describing which probes go inwhich probesets (genes, gene fragments, ESTs)

Most of microarray analyses start from CEL files

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 26 / 43

Page 27: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c© Preprocessing

For each probe set, summarization of the probe level data (11-20 PMand MM pairs) into a single expression measure

RMA procedure

use only PM and ignore MM

adjust for background on the raw intensity scale

carry out quantile normalization [Bolstad et al., 2003] of PM − BGand call the result n(PM − BG )

Take log2 of normalized background adjusted PM

Carry out a medianpolish of the quantities log2n(PM − BG ) tosummarize the probe level values in one probeset value

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 27 / 43

Page 28: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c© PreprocessingQuantile Normalization

I 1

I J

I j

I 3I 2

<I 1>

<I J>

<I j>

<I 3><I 2>

I 1

I J

I j

I 3I 2

I 1

I J

I j

I 3I 2

Chip 1 Chip 2Average

chip Chip I

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 28 / 43

Page 29: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Affymetrix c© PreprocessingQuantile Normalization

6 8 10 12 14 16

0.0

0.1

0.2

0.3

0.4

log2 PM

dens

ityafter quantile normalization

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 29 / 43

Page 30: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Visualizing MicroArray Data

In order to visualize microarray data, we draw a heatmap

it is an image where the gene expressions are represented by colorsI red for high gene expressionI green for low gene expression

Patients are in columns and genes in rows

Microarray data are usually standardized

x − center(x)

scale(x)

may improve the comparison between microarray experiments butmay also remove some interesting features in the data.

We usually apply gene centering in one-channel microarray data, iexij = xij − center(x.j) with center being the median.

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 30 / 43

Page 31: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Visualizing MicroArray DataHeatmap

patie

nt.1

patie

nt.2

patie

nt.3

patie

nt.4

patie

nt.5

patie

nt.6

1007_s_at

1053_at

117_at

121_at

1255_g_at

1294_at

1316_at

1320_at

1405_i_at

1431_at

1438_at

1487_at

1494_f_at

1598_g_at

160020_at

1729_at

1773_at

177_at

179_at

1861_at

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 31 / 43

Page 32: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Missing Data

Depending of the microarray technology and the preprocessingmethods, some gene expressions may be missing

If further analyses require all gene expressions, you canI Replace a missing value by mean or median of row/columnI Replace a missing value by a prediction using

F KNN [Troyanskaya et al., 2001]F regression [Kim et al., 2005]F bayesian estimation [Oba et al., 2004]

Such methods have an impact on the overall analysis[Scheel et al., 2005]

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 32 / 43

Page 33: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Part III

Biological Annotations

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 33 / 43

Page 34: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

NCBI Entrez

OMIM PubMed

Nucleotidesequences

TaxonomyProtein

sequencesMaps & Genomes

3D structures

Full Text Electronic Journals

Help is available at http://www.ncbi.nlm.nih.gov/EntrezHaibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 34 / 43

Page 35: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Part IV

MicroArray Databases

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 35 / 43

Page 36: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Human Cancer Microarray Datasets

Differences between datasets due to :

Different types of cancer : breast, lymphoma, lung, brain ...

Different types of microarray technology

Be careful when comparing different datasetsI mapping of the probes through annotations (eg gene ids, unigene

clusters)I meta-analysis to consider several datasets in one study as in

[Shen et al., 2004, Rhodes et al., 2004, Sotiriou et al., 2006].

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 36 / 43

Page 37: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Databases

Databases of microarray datasetsI gene expression omnibus (GEO) from NCBI :

http://www.ncbi.nlm.nih.gov/geo/I array express (AE) from EBI : http://www.ebi.ac.uk/arrayexpress/I oncomine : http://www.oncomine.org

Databases for mappingI Cleanex from SIB : http://www.cleanex.isb-sib.chI Adapt from Paterson Institute for Cancer Research :

http://bioinformatics.picr.man.ac.uk/adapt

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 37 / 43

Page 38: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Links

Master in Bioinformatics at ULB and other belgian universities :http://www.bioinfomaster.ulb.ac.be/

Personal homepage : http://www.ulb.ac.be/di/map/bhaibeka/

This presentation : http://www.ulb.ac.be/di/map/bhaibeka/papers/haibekains2007iddimarray.pdf

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 38 / 43

Page 39: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

Thank you for your attention.

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 39 / 43

Page 40: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

For Further Reading I

Bolstad, B. M., Irizarry, R. A., Astrand, M., and Speed, T. P. (2003).A comparison of normalization methods for high densityoligonucleotide array data based on variance and bias.Bioinformatics, 19(2):185–193.

Irizarry, R. A., Boldstad, B. M., Collin, F., Cope, L. M., Hobbs, B.,and Speed, T. R. (2003).Summaries of affymetrix genechip probe level data.Nucleic Acids Research, 31(4).

Kim, H., Golub, G. H., and Park, H. (2005).Missing value estimation for dna microarray gene expression data:local least squares imputation.Bioinformatics, 21(2):187–198.

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 40 / 43

Page 41: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

For Further Reading II

Oba, S., Sato, M., Takemasa, I., Monden, M., Matsubara, K., andIshii, S. (2004).A bayesian missing value estimation method for gene expression profiledata.Bioinformatics, 19(16).

Rhodes, D. R., Yu, J., Shanker, K., Desphande, N., Varambally, R.,Ghosh, D., Barrette, T., Pandey, A., and Chinnaiyan, A. M. (2004).Latge-scale meta-analysis of cancer microarray data identifies commontranscriptional profiles of neoplastic transformation and progression.PNAS, 101(25):9309–9314.

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 41 / 43

Page 42: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

For Further Reading III

Scheel, I., Aldrin, M., Glad, I. K., Serum, R., Lyng, H., and Frigessi,A. (2005).The influence of missing value imputation on detection of differentiallyexpressed genes from microarray data.Bioinformatics, 21(23):4272–4279.

Shen, R., Ghosh, D., and Chinnaiyan, A. M. (2004).Prognostic meta-signature of breast cancer developed by two-stagemixture modeling of microarray data.BMC Genomics, 5(94).

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 42 / 43

Page 43: Introduction to MicroArrays and Gene Expression Profiling · Introduction to MicroArrays and Gene Expression Pro ling Haibe-Kains B1;2 Bontempi G2 Sotiriou C1 1Unit e Microarray,

For Further Reading IV

Sotiriou, C., Wirapati, P., Loi, S. M., Desmedt, C., Durbecq, V.,Harris, A., Bergh, J., Smeds, J., Haibe-Kains, B., Larsimont, D.,Cardoso, F., Buyse, M., Delorenzi, M., and Piccart, M. (2006).Gene expression profiling in breast cancer: Understanding themolecular basis of histologic grade to improve prognosis.Journal of National Cancer Institute, 98:262–272.

Troyanskaya, O., Cantor, M., Sherlock, G., Brown, P., Hastie, T.,Tibshirani, R., Bolstein, D., and Altman, R. B. (2001).Missing value estimation methods for dna microarrays.Bioinformatics, 17(6):520–525.

Haibe-Kains B (ULB) IDDI presentation part 1 March 23, 2007 43 / 43