Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

85
Inferring gene regulatory networks with non-stationary dynamic Bayesian networks Dirk Husmeier Frank Dondelinger Sophie Lebre Biomathematics & Statistics Scotland

description

Dirk Husmeier Frank Dondelinger Sophie Lebre. Inferring gene regulatory networks with non-stationary dynamic Bayesian networks. Biomathematics & Statistics Scotland. Overview. Introduction Non-homogeneous dynamic Bayesian network for non-stationary processes Flexible network structure - PowerPoint PPT Presentation

Transcript of Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Page 1: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Dirk Husmeier Frank Dondelinger

Sophie Lebre

Biomathematics & Statistics Scotland

Page 2: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Overview

• Introduction

• Non-homogeneous dynamic Bayesian network for non-stationary processes

• Flexible network structure

• Open problems

Page 3: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Can we learn signalling pathways from postgenomic data?

From Sachs et al Science 2005

Page 4: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Network reconstruction from postgenomic data

Page 5: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Friedman et al. (2000), J. Comp. Biol. 7, 601-620

Marriage between

graph theory

and

probability theory

Page 6: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Bayes net

ODE model

Page 7: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

A

CB

D

E F

NODES

EDGES

Graph theory

•Directed acyclic graph (DAG) representing conditional independence relations.

Probability theory

•It is possible to score a network in light of the data: P(D|M), D:data, M: network structure.

•We can infer how well a particular network explains the observed data.

),|()|(),|()|()|()(

),,,,,(

DCFPDEPCBDPACPABPAP

FEDCBAP

Page 8: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

[A]= w1[P1] + w2[P2] + w3[P3] +

w4[P4] + noise

BGe (Linear model)

A

P1

P2

P4

P3

w1

w4

w2

w3

Page 9: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

BDe (Nonlinear discretized model)

P1

P2

P1

P2

Activator

Repressor

Activator

Repressor

Activation

Inhibition

Allow for noise: probabilities

Conditional multinomial distribution

P

P

Page 10: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Model Parameters q

Integral analytically tractable!

Page 11: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

BDe: UAI 1994

BGe: UAI 1995

Page 12: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Dynamic Bayesian network

Page 13: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Example: 2 genes 16 different network structures

Best network: maximum score

Page 14: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Identify the best network structure

Ideal scenario: Large data sets, low noise

Page 15: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Uncertainty about the best network structure

Limited number of experimental replications, high noise

Page 16: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Sample of high-scoring networks

Page 17: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Sample of high-scoring networks

Feature extraction, e.g. marginal posterior probabilities of the edges

Page 18: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Sample of high-scoring networks

Feature extraction, e.g. marginal posterior probabilities of the edges

High-confident edge

High-confident non-edge

Uncertainty about edges

Page 19: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Can we generalize this scheme to more than 2 genes?

In principle yes.

However …

Page 20: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Number of structures

Number of nodes

Page 21: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Configuration space of network structures

Find the high-scoring structures

Sampling from the posterior distribution

Taken from the MSc thesis by Ben Calderhead

Page 22: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Madigan & York (1995), Guidici & Castello (2003)

Page 23: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Configuration space of network structures

MCMC Local change

If accept

If accept with probability

Taken from the MSc thesis by Ben Calderhead

Page 24: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Overview

• Introduction

• Non-homogeneous dynamic Bayesian networks for non-stationary processes

• Flexible network structure

• Open problems

Page 25: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 26: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Dynamic Bayesian network

Page 27: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Example: 4 genes, 10 time points

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Page 28: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Standard dynamic Bayesian network: homogeneous model

Page 29: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Limitations of the homogeneity assumption

Page 30: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Our new model: heterogeneous dynamic Bayesian network. Here: 2 components

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Page 31: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Our new model: heterogeneous dynamic Bayesian network. Here: 3 components

Page 32: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Learning with MCMC

q

k

h

Number of components (here: 3)

Allocation vector

Page 33: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Non-homogeneous model

Non-linear model

Page 34: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

[A]= w1[P1] + w2[P2] + w3[P3] +

w4[P4] + noise

BGe: Linear model

A

P1

P2

P4

P3

w1

w4

w2

w3

Page 35: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

BDe: Nonlinear discretized model

P1

P2

P1

P2

Activator

Repressor

Activator

Repressor

Activation

Inhibition

Allow for noise: probabilities

Conditional multinomial distribution

P

P

Page 36: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Pros and cons of the two models

Linear Gaussian model

• Restriction to linear processes

• Original data no information loss

Multinomial model

• Nonlinear model

• Discretization information loss

Page 37: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Can we get an approximate nonlinear model without data discretization?

y

x

Page 38: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Can we get an approximate nonlinear model without data discretization?

Idea: piecewise linear model

y

x

Page 39: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Inhomogeneous dynamic Bayesian network with common changepoints

Page 40: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Inhomogenous dynamic Bayesian network with node-specific changepoints

t1 t2 t3 t4 t5 t6 t7 t8 t9 t10

X(1) X1,1 X1,2 X1,3 X1,4 X1,5 X1,6 X1,7 X1,8 X1,9 X1,10

X(2) X2,1 X2,2 X2,3 X2,4 X2,5 X2,6 X2,7 X2,8 X2,9 X2,10

X(3) X3,1 X3,2 X3,3 X3,4 X3,5 X3,6 X3,7 X3,8 X3,9 X3,10

X(4) X4,1 X4,2 X4,3 X4,4 X4,5 X4,6 X4,7 X4,8 X4,9 X4,10

Page 41: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

NIPS 2009

Page 42: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Overview

• Introduction

• Non-homogeneous dynamic Bayesian network for non-stationary processes

• Flexible network structure

• Open problems

Page 43: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Non-stationarity in the regulatory process

Page 44: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Non-stationarity in the network structure

Page 45: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

ICML 2010

Page 46: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Flexible network structure with regularization

Page 47: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Flexible network structure with regularization

Page 48: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Flexible network structure with regularization

Page 49: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Morphogenesis in Drosophila melanogaster

• Gene expression measurements over 66 time steps of 4028 genes (Arbeitman et al., Science, 2002).

• Selection of 11 genes involved in muscle development.

Zhao et al. (2006),

Bioinformatics 22

Page 50: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Transition probabilities: flexible structure with regularization

Morphogenetic transitions: Embryo larva larva pupa pupa adult

Page 51: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Comparison with:

Dondelinger, Lèbre & Husmeier Ahmed & Xing

Page 52: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 53: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 54: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 55: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 56: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 57: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Collaboration with Frank Dondelinger and Sophie Lèbre

NIPS 2010

Page 58: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 59: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Method based on homogeneous DBNs

Method based on differential equations

Page 60: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 61: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Sample of high-scoring networks

Page 62: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Sample of high-scoring networks

Feature extraction, e.g. marginal posterior probabilities of the edges

Page 63: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Method based on homogeneous DBNs

Method based on differential equations

Page 64: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Overview

• Introduction

• Non-homogeneous dynamic Bayesian network for non-stationary processes

• Flexible network structure

• Open problems

Page 65: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 66: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Exponential versus binomial prior distribution

Exploration of various information sharing options

Page 67: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

How to deal with static data?

Page 68: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Change-point process

Free allocation

Page 69: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Allocation sampler versus change-point process

• More flexibility, unrestricted mixture model.

• Not restricted to time series

• Higher computational costs

• Incorporates plausible prior knowledge for time series.

• Reduced complexity• Less universal, not

applicable to static data

Page 70: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Marco GrzegorczykUniversity of Dortmund

Germany

Frank Dondelinger Biomathematics & Statistics Scotland

United Kingdom

Sophie LèbreUniversité de Strasbourg

France

Acknowledgements

Page 71: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Further details for discussion during

question time

Page 72: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Details on exponential prior

Page 73: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Hierarchical Bayesian model

Page 74: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Hierarchical Bayesian model

Page 75: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 76: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

MCMC scheme (for symmetric proposal distributions)

Page 77: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Details on other priors

Page 78: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 79: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 80: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

where

Page 81: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 82: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks
Page 83: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Partition function

Ignoring the fan-in restriction:

Number of genes

Page 84: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Simulation study• We randomly generated 10 networks with 10

nodes each.• Number of regulators for each node drawn from

a Poisson distribution with mean=3.• 5 time series segments• Network changes: number of changes drawn

from a Poisson distribution.• For each segment: time series of length 50

generated from a linear regression model, interaction parameters drawn from N(0,1), iid Gaussian noise from N(0,1).

Page 85: Inferring gene regulatory networks with non-stationary dynamic Bayesian networks

Synthetic simulation study

No information sharing between

adjacent segments

Information sharing between adjacent

segments

Frank Dondelinger, Sophie Lèbre, Dirk Husmeier: ICML 2010