Using Data Mining Techniques to Improve Efficiency in Police Intelligence

46
Using Data Mining Techniques to Improve Efficiency in Police Intelligence Dr Rick Adderley

description

This is presentation that was given by Dr Rick Adderley at the UK KDD Symposium on 29th September 2011.

Transcript of Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Page 1: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Using Data Mining Techniques to Improve Efficiency in Police

Intelligence

Dr Rick Adderley

Page 2: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Recent Riots

• London

Page 3: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Recent Riots

• Birmingham

Page 4: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Recent Riots

• Manchester

Page 5: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Recent Riots – Policing Cuts

The chair of South Wales Police Federation has warned if "savage" cuts go ahead police will not be able to respond effectively to future riots.

The Mayor of London Boris Johnson has warned the government against cutting police numbers. Mr Johnson said the case for cuts had been "substantially weakened" by the riots and that he opposed the Home Secretary's plans to reduce forces' budgets.

Page 6: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Policing Cuts

Views wanted on £134m police cuts in Greater Manchester

Home secretary defends cuts to policing budget

Crime concerns over Surrey Police cutsClaims that crime will rise in Surrey because of cuts to police funding have been rebutted by the force.

Welsh police voice fears over budget cuts

South Yorkshire Police chief warns of crime rise

Page 7: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• How can the provision of intelligence be strengthened and improved in the light of severe cuts?

Providing Operational Intelligence

Page 8: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Retired Police Inspector– FLINTS

• Business Intelligence Company– 2003– Policing and Security– EU Research Projects

• PhD – Offender Profiling and Crime Trend Analysis

Introduction

Page 9: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Validated Examples1. Interview Lists2. Early Detection of Crime Series3. Modelling Forensic Recovery4. Automatic Identification of Priority & Prolific

Offenders

Introduction

Page 10: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• IBM SPSS Modeller (Clementine)• SAS Enterprise Miner• Insightful Miner…or…

Data Mining Tools in PolicingAll of the Validated Examples can be accomplished by using:

Page 11: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Data Mining Tools in Policing

Page 12: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• To automatically interrogate one or more data sets with a view to providing information that will save time, reduce crime, deter offending and enhance dynamic business processes.

Data Mining

Page 13: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Offenders Learn from Offenders

Additional Complication

Page 14: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Problem Outline– Analysts Time Constraints– Examine Index Crime & Compare with:• Keyword Type Search• Personal Memory

– Produce a List “Matching” Index Crime• 1 to 2 Hours• 10% to 15% Accurate

– Interviewing Officers Refine List

1. Interview Lists

Page 15: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• MLP– Training Set– Testing Set

1. Interview Lists

Page 16: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

1. Interview Lists

Billy Smith

Crime

BCU

Beat

PostCode

Page 17: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Results of Modelling– MLP– Takes Into Account Whole Range of Criminality– Improved Accuracy• 75% to 85%

– Independent Validation• Intelligence Unit Sergeants• Intelligence Unit Analyst

1. Interview Lists

Page 18: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Spate of Burglaries/Robberies in an Area– Are They Linked– Who May Be Responsible

2. Early Detection of Crime Series

Page 19: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Self Organising Map

2. Early Detection of Crime Series

Page 20: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

2. Early Detection of Crime Series

Page 21: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

2. Early Detection of Crime Series

Page 22: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

2. Early Detection of Crime Series

Each Cluster willContain Crimes That Are Similar

Page 23: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Model Current Offenders• Overlay Onto Crime Map

2. Early Detection of Crime Series

Page 24: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

2. Early Detection of Crime Series SOMKey ENTRY FEATURE FEATURE_TYPE METHOD ROOMS_SEARCHED SEARCH_TYPE CR_NO LASTNAME FIRSTNAME DOBK6-3 Rear Window Fixed Climbed All Tidy ZZ/1686/05 K6-3 Side Window Fixed Smash All Tidy ZZ/2139/05 K6-3 Rear Window Fixed Smash All Tidy ZZ/2493/05 K6-3 Rear Window Fixed Forced All Tidy ZZ/4371/05 K6-3 Rear Window Fixed Forced All Tidy ZZ/1681/05 KHAN VIJAY 21/04/1977K7-9 Front Door Fixed Insecure Down Tidy ZZ/1862/05 K7-9 Front Door Roller Insecure One Tidy ZZ/1056/05 K7-9 Front Door Fixed Insecure Down Tidy ZZ/1520/05 K7-9 Front Door Fixed Insecure Down Tidy ZZ/1578/05 K7-9 Front Door Fixed Insecure One Tidy ZZ/2056/05 K7-9 Front Door Fixed Insecure Down Tidy ZZ/2256/05 K7-9 Front Door Fixed Insecure One Tidy ZZ/3937/05 K7-9 Front Door Fixed Insecure Many Tidy ZZ/694/05 BULLEN IAN 11/11/1982K7-9 Front Door Fixed Insecure One Tidy ZZ/3332/05 SINGH JARNAIL 29/08/1979K9-2 Rear Window Casement Smash All UnTidy ZZ/305/05 K9-2 Rear Window Transom Forced All UnTidy ZZ/457/05 K9-2 Rear Window Casement Smash All UnTidy ZZ/788/05 K9-2 Rear Window Casement Smash All UnTidy ZZ/1587/05 K9-2 Rear Window Casement Smash All UnTidy ZZ/1865/05 K9-2 Rear Window Casement Smash All UnTidy ZZ/1979/05 K9-2 Rear Window Transom Smash All UnTidy ZZ/1980/05 K9-2 Rear Window Transom Forced All UnTidy ZZ/3309/05 K9-2 Rear Window Casement Smash All UnTidy ZZ/1668/05 BURTON ALAN 09/06/1979

Page 25: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Northamptonshire Forensic Science Department– Dr John Bond

• Motivation:-

3. Modelling Forensic Recovery

Page 26: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

3. Modelling Forensic Recovery

Page 27: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Northamptonshire Forensic Science Department– Dr John Bond

• Motivation:-– Which Crimes Should be Attended First?– Which Crime Give Best Opportunity of Forensic

Recovery?

3. Modelling Forensic Recovery

Page 28: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• CRISP-DM– www.the-modeling-agency.com/crisp-dm.pdf

– Naïve Bayes Algorithm– Q-Prop Neural Network Algorithm

3. Modelling Forensic Recovery

Page 29: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

3. Modelling Forensic Recovery

• Algorithm Results– 10 Fold Cross Validation on 28,490 Volume Crime

Records

Basic Variables Basic + Temporal Basic + MO vars

Bayes 68.59 69.2 74.5

Q-Prop 63.59 64.01 69.5

Page 30: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

3. Modelling Forensic Recovery

• Results Using Live Data

Page 31: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Crime Ref SubDiv Offence Set Month Forensic Collected

Probability Predict Class

NN/24163/03

NN BDwell December 1 0.82 1

NC/2835/05 NC BDwell May 1 0.68 1

NN/7587/02 NN BOther June 0 0.67 1

NN/16787/02

NN TOMV October 1 0.62 1

NN/12133/04

NN TOMV July 1 0.62 1

NC/1757/01 NC TOMV May 0 0.49 0

NC/14454/02

NC TOMV November 0 0.37 0

NN/31877/03

NN BOther March 0 0.20 0

NW/14184/05

NW TFMV February 0 0.16 0

ND/6657/00 ND TFMV January 0 0.16 0

Page 32: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

3. Modelling Forensic Recovery

• Gwent Police Trial– 11,800 Volume Crime Records– 10 Fold Cross Validation• Q-Prop Accuracy 81.79%• Naïve Bayes Accuracy 88.84%

Page 33: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

3. Modelling Forensic Recovery

• How Good are the Models?

Page 34: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

3. Modelling Forensic Recovery

• Every Northamptonshire CSI– 50 Random Crimes– Assess whether a Forensic Sample would be

collected

Page 35: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

3. Modelling Forensic Recovery

• Every Northamptonshire CSI– 50 Random Crimes– Assess whether a Forensic Sample would be

collected• 41% Accuracy

Page 36: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Which Offenders are Causing most Harm• Including Cross Border Offenders – Force Priorities– Harm Matrix

4. Priority & Prolific Offenders

Page 37: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Which Offenders are Causing most Harm – Current Process:• Offender is “Nominated”• Scored Against Matrix• Placed on List

– Infrequently Reviewed• Insufficient Time• 20 Minutes to 2 Hours to Complete Scoring

4. Priority & Prolific Offenders

Page 38: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Automated Process:

4. Priority & Prolific Offenders

Page 39: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

4. Priority & Prolific OffendersOffender L2Offender

Offend Priority Nab Live Priority Nab Community Safety

Control Of Offenders Reduce Crime

Total Prism Score BEAT AREA CRIME

Eric Smith 20 10 6 0 0 62 98 Z2

Paul Jones 20 0 6 0 0 71 97 Z2

Mary Hands 20 10 6 4 0 54 94 Z1

John Fresh 20 0 0 20 20 30 90 Z2

Ali Khan 20 10 6 0 0 54 90 Z1

Ming Hu 20 10 6 4 0 50 90 Z1

Graham Zhu 20 10 6 0 0 50 86 Z1

Fred Brown 20 0 0 0 0 60 80 Z1

Sally Johns 20 12 6 0 0 40 78 Z1

David Green 20 0 0 16 10 30 76 Z2

Alison Blue 20 10 6 0 0 40 76 Z1

Tom Black 20 0 0 4 0 50 74 Z2

Vinny Smith 20 0 0 24 0 30 74 Z1

Saad Wang 20 12 6 0 0 36 74 Z1

Mendip Kaur 20 12 0 0 0 40 72 Z1

Brian Ling 20 0 0 0 0 52 72 Z1

Billy Smith 20 10 0 0 20 22 72 Z1

Ho Tu 20 10 6 0 0 36 72 Z1

Paul Wells 20 0 0 0 30 20 70 Z2

Page 40: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

4. Priority & Prolific OffendersOffender

A1

A2

A3

A4

A5

A6

A7

A8

A9

B1

B2

B3

B4

B5

B6

B7

B8

C1

C2

C3

C4

TotalCrimes

NumBCUs

OffenderLatest BCU

Eric Smith 0 0 0 0 0 3 0 0 0 5 1 3 5 0 0 0 0 0 0 0 0 17 5 X1

Paul Jones 0 0 1 0 0 0 4 0 0 0 0 1 0 0 0 0 0 0 0 0 1 7 4 V3

Mary Hands 0 0 0 0 0 0 0 0 1 4 0 0 0 1 0 0 0 1 0 0 0 7 4 V2

John Fresh 0 0 1 0 0 1 0 0 0 0 0 0 0 0 0 0 1 0 0 0 1 4 4 V2

Ali Khan 0 0 0 0 0 0 0 0 0 0 0 0 1 0 1 0 1 0 1 0 0 4 4 V2

Ming Hu 0 4 0 0 1 0 0 0 0 0 0 0 0 0 0 0 0 0 5 0 0 10 3 U2

Jill King 0 0 0 0 0 0 0 4 0 0 0 0 1 0 0 0 3 0 0 0 0 8 3 W1

Fred Brown 0 0 0 2 0 0 0 0 0 3 2 0 0 0 0 0 0 0 0 0 0 7 3 Z1

Sally Johns 0 0 2 4 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 0 0 7 3 Z1

Lin Ho Pu 0 0 1 0 0 0 1 0 0 0 0 0 0 0 0 0 0 4 0 0 0 6 3 V2

Brian Ling 3 0 0 0 0 0 0 0 0 0 2 0 0 0 0 0 1 0 0 0 0 6 3 T1

Billy Smith 0 0 0 0 0 0 3 0 0 0 1 0 0 0 0 0 1 0 0 0 0 5 3 U2

Ho Tu 0 2 0 1 0 0 0 0 0 0 0 2 0 0 0 0 0 0 0 0 0 5 3 M3

Paul Wells 0 0 0 0 0 0 0 0 3 0 1 0 0 0 0 0 1 0 0 0 0 5 3 T2

Page 41: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Motivation:– Sample Offender Test• Same Offender• 4 BSU’s Where Offender Not Known• 20 Minutes to 2 Hours• Scores From Low 100’s to High 400’s

– Place / Not Place on List

• Scores Not Related To Time Taken

4. Priority & Prolific Offenders

Page 42: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Benefits:– Every Offender is Scored– Objective Scoring• Defendable• Repeatable

– Process Can Be Frequently Run• Offenders’ Scores Updated• Current

4. Priority & Prolific Offenders

Page 43: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Offender Networks:– The Data WILL Contain Networks– Which Networks Cause the Most Harm• Prioritisation• Scoring

– Degrees of Freedom• Dependant Upon Priorities

4. Priority & Prolific Offenders

Page 44: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

• Offender Networks:

4. Priority & Prolific Offenders

Page 45: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Questions?

Page 46: Using Data Mining Techniques to Improve Efficiency in Police Intelligence

Using Data Mining Techniques to Improve Efficiency in Police

Intelligence

Dr Rick Adderleywww.a-esolutions.com