INTERPRETING COMPLEX DNA PROFILE EVIDENCE:  BAYESIAN NETWORKS TO THE RESCUE

29
INTERPRETING COMPLEX DNA PROFILE EVIDENCE: BAYESIAN NETWORKS TO THE RESCUE Philip Dawid University of Cambridge

description

INTERPRETING COMPLEX DNA PROFILE EVIDENCE:  BAYESIAN NETWORKS TO THE RESCUE. Philip Dawid University of Cambridge. TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A. Difficulties of Formalizing Reasoning. - PowerPoint PPT Presentation

Transcript of INTERPRETING COMPLEX DNA PROFILE EVIDENCE:  BAYESIAN NETWORKS TO THE RESCUE

Page 1: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

INTERPRETING COMPLEX DNA PROFILE EVIDENCE:  BAYESIAN NETWORKS

TO THE RESCUE

INTERPRETING COMPLEX DNA PROFILE EVIDENCE:  BAYESIAN NETWORKS

TO THE RESCUE

Philip DawidUniversity of Cambridge

Page 2: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Difficulties of Formalizing Reasoning

Difficulties of Formalizing Reasoning

• Classical logic does not readily handle “non-monotonic” reasoning

• Reasoning with uncertainty is especially delicate

– but specification and manipulation of probabilities appears problematic

Page 3: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Example: “Explaining Away”Example: “Explaining Away”

• Burglar alarm is ringing– Break-in?– Earthquake?

• Radio reports earthquake in vicinity– report earthquake– earthquake alarm – alarm break-in

• So report break-in ???

Page 4: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

PROBABILISTIC REASONING IN

INTELLIGENT SYSTEMSNetworks of Plausible Inference

Pearl 1988

PROBABILISTIC REASONING IN

INTELLIGENT SYSTEMSNetworks of Plausible Inference

Pearl 1988

Page 5: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Go with the (causal) flow ?Go with the (causal) flow ?

Page 6: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

BAYESIAN NETWORKS BAYESIAN NETWORKS

• Handle complex problems involving probabilistic uncertainty

• Modular structure• Intuitive graphical representation• Precise semantics

– relevance (conditional independence)

• Correct accounting for evidence• Computational algorithms

– elegant and efficient

Page 7: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

AN APPLICATIONAN APPLICATION

• Forensic Identification

• DNA Profiling

• Disputed Paternity

Page 8: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

A typical DNA profileA typical DNA profileMarker Genotype

FGA 20/24

FES 8/11

TH01 7/9

VWA 15/18

D3S1358 15

TPOX 8/10

CSF1PO 11/12

D5S818 12

D13S317 11/13

D7S820 8/9

D16S539 12/13

D2S1338 24/25

D8S1179 12

D21S11 30/33.2

D18S51 14/22

D19S433 14/14.2

Page 9: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Disputed PaternityDisputed Paternity

We have DNA data D from a disputed child cc, its mother mm and the putative father pfpf We have DNA data D from a disputed child cc, its mother mm and the putative father pfpf

If the true father tftf is not pfpf, he is a “random” alternative father afafIf the true father tftf is not pfpf, he is a “random” alternative father afaf

Straightforward to compute the evidence (LIKELIHOOD RATIO) in favor of paternity (Essen-Möller 1938)

Straightforward to compute the evidence (LIKELIHOOD RATIO) in favor of paternity (Essen-Möller 1938)

Page 10: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

MISSING DNA DATAMISSING DNA DATA

• What if we can not obtain DNA from the suspect ? (or other relevant individual?)

• Sometimes we can obtain indirect information by DNA profiling of relatives

• But analysis is complex and subtle…

Page 11: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Network RepresentationNetwork Representation

We have DNA data D from a disputed child cc, its mother mm and the putative father pfpf

We have DNA data D from a disputed child cc, its mother mm and the putative father pfpf

child

founder

founder

hypothesis

Building blocks: founder, child

query

founder

If pfpf is not the true father tftf, this is a “random” alternative father afaf

If pfpf is not the true father tftf, this is a “random” alternative father afaf

, query

Page 12: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Complex Paternity CaseComplex Paternity Case

We have DNA from a disputed child c1c1 and its mother m1m1 but not from the putative father pfpf. We do have DNA from c2c2 an undisputed child of pfpf, and from her mother m2m2 as well as from two undisputed full brothers b1b1 and b2b2 of pfpf.

We have DNA from a disputed child c1c1 and its mother m1m1 but not from the putative father pfpf. We do have DNA from c2c2 an undisputed child of pfpf, and from her mother m2m2 as well as from two undisputed full brothers b1b1 and b2b2 of pfpf.

founder

founder

founder

founder

founder

child

child

child

child child

query

hypothesis

Building blocks: founder, child, query

Page 13: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

• Each building block (founderfounder / childchild / queryquery) in a pedigree can be an INSTANCE of a generic CLASS network — which can itself have further structure

• The pedigree is built up using simple mouse clicks to insert new nodes/instances and connect them up

• Genotype data are entered and propagated using simple mouse clicks

Object-Oriented Bayesian NetworkObject-Oriented Bayesian Network

HUGIN 6HUGIN 6

Page 14: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Under the microscope…Under the microscope…

• Each CLASS is itself a Bayesian Network, with internal structure

• Recursive: can contain instances of further class networks

• Communication via input and output nodes

Page 15: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Lowest Level Building BlocksLowest Level Building BlocksDNA MARKER having associated repertory of alleles together with their frequenciesgene

mendel

MENDELIAN SEGREGATIONChild’s gene copies paternal or maternal gene, according to outcome of fair coin flip

GENOTYPE consisting of maximum and minimum of paternal and maternal genes

genotype

Page 16: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

founderfounder

FOUNDER INDIVIDUAL represented by a pair of genes pgin and mgin (instances of gene) sampled independently from population distribution, and combined in instance gt of genotype

gene gene

genotype

Page 17: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

childchildCHILD INDIVIDUALpaternal [maternal] gene selected by instances fmeiosis

[mmeiosis] of mendel from father’s [mother’s] two genes, and combined in instance cgt of genotype

mendel mendel

genotype

Page 18: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

queryquery

query

QUERY INDIVIDUALChoice of true father’s paternal gene tfpg [maternal gene mfpg] as either that of f1 or that of f2, according as tf=f1? is true or false.

QUERY INDIVIDUALChoice of true father’s paternal gene tfpg [maternal gene mfpg] as either that of f1 or that of f2, according as tf=f1? is true or false.

Page 19: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Complex Paternity CaseComplex Paternity Case

query

hypothesis

• Measurements for 12 DNA markers on all 6 individuals

• Enter data, “propagate” through system

• Overall Likelihood Ratio in favour of paternity:

1300

founder

founder

founder

founder

founder

child

child

child

child child

Page 20: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

MORE COMPLEX DNA CASES

MORE COMPLEX DNA CASES

• Mutation• Silent/missed alleles,…• Mixed crime stains

– rape– scuffle

• Multiple perpetrators and stains• Database search• Contamination, laboratory errors

– …

Page 21: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

MUTATIONMUTATION

mendelmut

+ appropriate network mut to describe mutation process

Page 22: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

COMBINATIONCOMBINATION

• Can combine any or all of above features (and others), by using all appropriate subnetworks

• Can use any desired pedigree network

– no visible difference at top level

• Simply enter data (and desired parameter-values) and propagate…

Page 23: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Paternity testingPaternity testing

Page 24: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Paternity testing with brother tooPaternity testing with brother too

Page 25: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Overall likelihood ratio is

Consider additional evidence (likelihood ratio) LRB carried by the brother’s data B

where D denotes data on triplet (pf, c, m)

overall D BLR LR LR= ´

Page 26: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

mgt = 12/15 pfgt = 14 cgt = 12Incompatible tripletIncompatible triplet

16/20 12/14 14 22

p(silent) LRD LRB LRB LRB LRB

0 0 1 0.55 1 3334

0.000015 0.5 1 0.55 1.00 1595

0.0001 2.5 1 0.55 1.00 404

0.001 7.5 1 0.55 1.00 46

B =

p22 = .0003

*Maximum LRoverall is 1027, at p(silent) = 0.0000642

*

p12 = .0003

Page 27: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

ExtensionsExtensions

• Estimation of mutation rates from paternity data

• Peak area data– mixtures– contamination– low copy number

Page 28: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

Thanks to:

Julia Mortera Paola Vicard

Steffen LauritzenRobert Cowell

and

The Leverhulme Trust

Thanks to:

Julia Mortera Paola Vicard

Steffen LauritzenRobert Cowell

and

The Leverhulme Trust

Page 29: INTERPRETING COMPLEX DNA PROFILE EVIDENCE:   BAYESIAN NETWORKS  TO THE RESCUE

and especially to

JUDEA PEARL

and especially to

JUDEA PEARL

who made it all possible