Application of the Shapley value to microarray data analysis

27
Application of the Shapley value to microarray data analysis. Stefano Moretti DIMA: Mathematics Department, University of Genova and IST: National Cancer Research Institute Fioravante Patrone DIMA: Mathematics Department, University of Genova VI Spanish Meeting on Game Theory and Practice Elche, 12-14 July 2004

Transcript of Application of the Shapley value to microarray data analysis

Page 1: Application of the Shapley value to microarray data analysis

Application of the Shapley valueto microarray data analysis.

Stefano Moretti

DIMA: Mathematics Department, University of Genova and

IST: National Cancer Research Institute

Fioravante Patrone

DIMA: Mathematics Department, University of Genova

VI Spanish Meeting on Game Theory and PracticeElche, 12-14 July 2004

Page 2: Application of the Shapley value to microarray data analysis

N.B.

• E’ una versione “ridotta” dellapresentazione fatta ad Elche

• E’ stata omessa la parte tecnica in cui siindividua una nuova caratterizzazione del valore Shapley, singificativa per lo specifico contesto applicativo

Page 3: Application of the Shapley value to microarray data analysis

Plan:

• What is a “microarray” and why is itinteresting?

• A game to play• The Shapley value is of any help?• Related developments and comments

Page 4: Application of the Shapley value to microarray data analysis
Page 5: Application of the Shapley value to microarray data analysis

Fundamental principle: hybridization

ATATCGGCATCAGTCGATCGATCATCGATCGAT

UAUAGCCGUAGUCAGCUAGCUAGUAGCUAGCUA

ATATCGGCATCAGTCGATCGATCATCGATCGAT

DNA

mRNA

cDNA

DNA RNA Protein

Transcription

Reverse transcription

TranslationReplication

Page 6: Application of the Shapley value to microarray data analysis

(Slide source: http://www.bsi.vt.edu/)

Page 7: Application of the Shapley value to microarray data analysis

Experiments with cDNA microarray

Page 8: Application of the Shapley value to microarray data analysis
Page 9: Application of the Shapley value to microarray data analysis
Page 10: Application of the Shapley value to microarray data analysis
Page 11: Application of the Shapley value to microarray data analysis

Matrix:

Page 12: Application of the Shapley value to microarray data analysis

Intensity rate

• M<0, the gene is more “expressed” in the control sample (marked in green)

• M=0, the gene is equally expressed• M>0, the gene is more “expressed” in the

tumor (or treated...) sample (marked in green)

Page 13: Application of the Shapley value to microarray data analysis

6.12.47g3

1.69.81.1g2

12204.2g1

s3s2s1

Microarray expression data from desease samples

0.53.55g3

2.17.84.2g2

2.76.34.1g1

s3s2s1

Microarray expression data from normal samples

50.57.82.16.32.7><

cutoffs

101g3111g2110g1s3s2s1

Discretized matrix

Page 14: Application of the Shapley value to microarray data analysis

EXAM PLE:

Microarray TU game:• Players are genes;• games with [0-1] characteristic function;• on each sample:

-If a coalition has value 1 then that coalitions activates the disease;-If a coalition has value 0 then that coalition does not activate the disease.

101g3

111g2

110g1

s3s2s1The corresponding [0,1]-game <{g1,g2,g3},v>:v({g1,g2})=v({g3,g2})=1/3v({g1,g2,g3})=1 andv(S)=0 for each other different coalition S.

The Shapley value is: (5/18,8/18,5/18).

Microarray discr. data

Page 15: Application of the Shapley value to microarray data analysis

Application 1: tumor versus normal

• Alon et al. (1999)wereinterested in identifying coregulated fam iliesof genesin tum orand norm alcolon tissues.

• Theystudied 6500 hum an genes.

• G eneswerecollected on 62 sam ples, –40 tum or colon tissues–22 norm al colon tissues

Page 16: Application of the Shapley value to microarray data analysis

Espression values of one gene in 22 normal samples

Gene labels

Expr

essi

onva

lues

Valuesconsideredunderexpressed

Valuesconsideredoverexpressed

Page 17: Application of the Shapley value to microarray data analysis

Shapley value of 2000 genes

Page 18: Application of the Shapley value to microarray data analysis

Shapley value of 100 genes

Page 19: Application of the Shapley value to microarray data analysis

MYOSIN REGULATORY LIGHT CHAIN 2, SMOOTH MUSCLE ISOFORM

H.sapiens mRNA for GCAP-II/uroguanylin precursor

Human desmin gene, complete cds.

Human vasoactive intestinal peptide (VIP) mRNA, complete cds.

It has been suggested topromote the growth and proliferation of tumor cells (Fujarewicz & Wiench, 2003).

Group of four genes with the highest Shapley values (in decreasing order from top to down).

It might provide an indication propensityfor metastasis of cells (Moler et al., 2000).

Page 20: Application of the Shapley value to microarray data analysis

Application 2: ALL versus AML

• G olub etal.(1999) wereinterested in identifyinggenesthatare differentially expressed in patientswith two typeof leukem ias, Acute Lym phoblasticLeukem ia (ALL) and Acute M yeloid Leukem ia(AM L).

• Theystudied 6817 hum an genes.

• G eneswerecollected on 38 sam ples, – 27 ALL cases– 11 AM L cases

• We discretized the expression values of genes in ALLsamples on the basis of expression values in AMLsamples

Page 21: Application of the Shapley value to microarray data analysis

Espression values of one gene in 11 AML samples

Gene labels

Expr

essi

onva

lues

Valuesconsideredoverexpressed

Page 22: Application of the Shapley value to microarray data analysis

Shapley value of 3051 genes

Page 23: Application of the Shapley value to microarray data analysis

Shapley value of 50 genes

Page 24: Application of the Shapley value to microarray data analysis

TOP2B Topoisomerase (DNA) II beta (180kD)

SPTAN1 Spectrin, alpha, non-erythrocytic 1 (alpha-fodrin)

Oncoprotein 18 (Op18) gene

Macmarcks

KIAA0181 gene, partial cds

Encode a critical protein for the cell cycle progression related to leukemia (Glub et al., 1999)

Group of five genes with the same highest Shapley value.

Page 25: Application of the Shapley value to microarray data analysis

Related references

• Kaufman, Kupiec and Ruppin: Multi-Knockout Genetic Network Analysis: The Rad6 Example, preprint

• Keinan, Kaufman, Sachs, Hilgetag and Ruppin: Fair localization of function via multi-lesion analysis, to appear on Neuroinformatics, 2004.

Page 26: Application of the Shapley value to microarray data analysis

three comments

• Be humble: many approaches, a lot of knowledge around

• Be determined: if you have an idea, follow it to see really what it can give

• Be critical: a characteristic which is not as widespread as it should be

Page 27: Application of the Shapley value to microarray data analysis

The END

Thanks for your attention