Creating and Maintaining Databases

22
Creating and Maintaining Databases Dr. Pushkin Kachroo

description

Creating and Maintaining Databases. Dr. Pushkin Kachroo. Enrollment. Collect Private Information, e.g. fingerprint Follow “enrollment policy” Policy should be: acceptable to the public Clear on how, where and when the private info will be used. Enrollment Steps. Positive Enrollment: - PowerPoint PPT Presentation

Transcript of Creating and Maintaining Databases

Page 1: Creating and Maintaining Databases

Creating and Maintaining Databases

Dr. Pushkin Kachroo

Page 2: Creating and Maintaining Databases

Enrollment

• Collect Private Information, e.g. fingerprint

• Follow “enrollment policy”

• Policy should be:– acceptable to the public– Clear on how, where and when the private

info will be used

Page 3: Creating and Maintaining Databases

Enrollment Steps

• Positive Enrollment:– Trusted Individuals

– Enrollment Policy EM

– Authentication through:• Seed Documents (Birth Cert., passport)

– Store machine representation of the enrolled in Verification Database M

Page 4: Creating and Maintaining Databases

Enrollment Steps

• Negative Enrollment:– Criminal Identification

– Enrollment Policy EN

– Store machine representation of the enrolled in Screening Database N

Page 5: Creating and Maintaining Databases

General Enrollment

• Target Population: World W

• Ground Truth: legacy databases:– Criminal or civil– Can contain Fake and Duplicate Identities

Page 6: Creating and Maintaining Databases

Fake Identity

• Created Identity– Non-existent person– Biometric screening against criminal

databases might catch the “fake”

• Stolen Identity

Page 7: Creating and Maintaining Databases

The Zoo

• Sheep: – Real world biometric distinctive and stable

• Goats:– Difficult to authenticate

• Lambs:– Enrolled that are easy to imitate (cause passive FA)

• Wolves:– Good at imitating (cause active FA)

• Chameleons:– Easy to imitate and are good at imitating

Page 8: Creating and Maintaining Databases

Sample Quality Control

• Random False Reject/Accept caused by Adverse Signal Acquisition

• Solution– Better User Interface– Better model probabilistic into feature

extraction/matching– Interactively improve input

Page 9: Creating and Maintaining Databases

Quality Control

• Define “desirable”• Quality related to process-ability• Quantify quality to decide action based on the

level of quality, e.g. present info differently, apply image enhancement etc.

• Compromise between convenience and quality– Affects FTE, and also FA and FR

• ROC can be improved by eliminating poor data

Page 10: Creating and Maintaining Databases

ROC-Quality Control

FMR (False Match Rate)

FN

MR

(F

alse

Non

-mat

ch R

ate)

Throw out bad data

Page 11: Creating and Maintaining Databases

Training

• Like Machine Learning

• Relate scores to probability that the biometric matches someone or doesn’t

Training

Testing

Page 12: Creating and Maintaining Databases

Enrollment as System Training

• Assigning IDs to Subjects

• Three possibilities

– Correct

– Someone faking enrolled (duplicate)

– Someone faking unenrolled (fake)

– PD=Prob(duplicate)

– PF=Prob(fake)

Page 13: Creating and Maintaining Databases

Database Integrity

• How well database reflects the truth data• Database duplication: Purge detected

duplicates• PD=FNMRE X PDEA

– Prob of duplicate= Match bet. 2 samples not detected; double enroll

• PF=FMRE X PIA

– Prob of fake enroll= Match bet. 2 samples falsely detected; Impersonation attack

Page 14: Creating and Maintaining Databases

PD-PF

FMR (PF…)

FN

MR

(P

D..

)

Page 15: Creating and Maintaining Databases

Probabilistic Enrollment

• Enrollment Process Goal:– Build access control for from

that are authorized– Likelihood of d_i given stored token B_i

midi ,...,1,

miBd ii ,...,1)|( Prob

WM

Page 16: Creating and Maintaining Databases

Probabilistic Enrollment

• Enrollment Process Goal:– Machine representation of the “real” biometric

• Assumption about score : likelihood that we have the same subject– True if equivalently– .

iiiiii ssFNMRBBd )1(11);|(Prob

),( iii BBss

)|()|( iiii BBTsBBmatchcorrect ProbProb

ii FNMRFMR

iksBBs iki ),(

Page 17: Creating and Maintaining Databases

Probabilistic Enrollment..

• For realistic assumptions we need to model the world

• Probabilitycan be approximated unrealistically by

• We need (given biomeric data collected during enrollment, O)

miBd ii ,...,1)|( Prob

),( iii BBss

miOdi ,...,1)|( Prob

Page 18: Creating and Maintaining Databases

Modeling the World-1

)(

)()|()|(

O

ddOOd ii

i Prob

ProbProbProb

)( idProb Prior probability that subject d_i is present

)(OProb Prior probability that this observation will occur

m

jj

im

jjj

iii

dO

dO

ddO

ddOOd

11

)|(

)|(

)()|(

)()|()|(

Prob

Prob

ProbProb

ProbProbProb

Modeling numerator on right is a matter of fitting model to data; rest impractical/impossible

Page 19: Creating and Maintaining Databases

Modeling the World-2

• Cohorts– Models of most similar subjects

• World Modeling:– Reduce cohorts to a single model

Page 20: Creating and Maintaining Databases

Modeling the World-3

)|()|(

)|()|(

i

ii dODO

dOOd

ProbProb

ProbProb

For Cohort Modeling

)|()|(

)|()|(

ii

ii dODO

dOOd

ProbProb

ProbProb

Page 21: Creating and Maintaining Databases

Updating Probabilities

)|()(

)|(

)(

)()|(

)(

)|(

),(

)()|,(),|(

OdO

dO

O

ddO

O

dO

OO

ddOOOOd

ii

iii

iii

ProbProb

Prob

Prob

ProbProb

Prob

Prob

Prob

ProbProbProb

)()|()()|()( iiii DDOddOO ProbProbProbProbProb

)(1)( ii dD ProbProb

Page 22: Creating and Maintaining Databases

Use of Probabilities

• Accuracy improvements

• Define measure of biometric integrity

• Integrity of different biometrics can be combined etc.

m

dMI

m

ii

1

)()(

Prob