Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

34
A NEW PLATFORM FOR A NEW ERA

description

As traffic volumes in cities around the world are constantly growing we are faced with the challenge to track and control car movements in a more detailed and intelligent way to beat the traffic. Real-time information on traffic including automotive sensors and crowd-sourced data feeds are an interesting new source of data. However, to utilize this data to its full extent and turn it into valuable information, intelligent methods for analyzing and predicting traffic are needed. Pivotal’s Data Science Team has developed several innovative methods to analyze this traffic flow information harvested from real-time and in-car data sources including GPS. These methods by themselves are highly useful for predicting future traffic conditions and dissecting traffic data. We will describe how we created these algorithms and show different interesting results from their application. This example demonstrates how deeper insights into a problem can be found by combining different machine learning methods.

Transcript of Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

Page 1: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

A NEW PLATFORM FOR A NEW ERA

Page 2: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

2 © Copyright 2014 Pivotal. All rights reserved. 2 © Copyright 2013 Pivotal. All rights reserved.

Driving the Future of Smart Cities How to Beat the Traffic

Strata Santa Clara – February 13, 2013

Alexander Kagoshima, Data Scientist, @akagoshima Noelle Sio, Senior Data Scientist, @noellesio Ian Huston, Data Scientist, @ianhuston

@gopivotal

Page 3: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

3 © Copyright 2014 Pivotal. All rights reserved.

What Matters: Apps. Data. Analytics.

Apps power businesses, and those apps generate data Analytic insights from that data drive new app functionality, which in-turn drives new data The faster you can move around that cycle, the faster you learn, innovate & pull away from the competition

@noellesio, @gopivotal

Page 4: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

4 © Copyright 2014 Pivotal. All rights reserved.

Pivotal’s Opportunity

Uniquely positioned to help enterprises modernize each facet of this cycle today Comprehensive portfolio of products spanning Apps, Data & Analytics Converging these technologies into a coherent, next-gen Enterprise PaaS platform

@noellesio, @gopivotal

Page 5: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

5 © Copyright 2014 Pivotal. All rights reserved.

The Connected Car Drives Innovation

Communication

Handsfree telephony

WiFi hotspot Car2X

comms

Rich media comms

Telematics

eMobility solutions

Car2X solutions

Remote activation

Floating car data

Stolen vehicle recovery Remote

diagnosis

Driver assistance

Real-time parking info

Behaviour monitoring

Navigation

Hybrid Navigation

Vehicle Tracking

Map/PoI updates

Geo fencing

Fleet Management

Responsive PoIs

Predictive traffic info

Concierge services

Next gen Navigation

Social Media

Share my trip

Car sharing

Car search

Traffic updates

eCommerce

City toll

Payment solutions

Parking space reservation

Pay as you drive

Road tolls

Entertainment

Music streaming

Online games

Web radio

VoD Environmental

browsing

@noellesio, @gopivotal

Page 6: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

6 © Copyright 2014 Pivotal. All rights reserved.

Possible Data Science Use-Cases !  Predictive Car Maintenance

–  More accurately predict part failure –  Optimize part repair and replacement schedule

!  Leveraging Driving Behavior –  Useful to differentiate insurance pricing based on driving style –  Optimize car design

!  Improving GPS Systems –  Establish baseline for traffic congestion –  Gain a detailed view on traffic –  Create more meaningful metrics for routing

!  Predictive Power for Assistance Systems –  Optimize fuel efficiency –  Predict the future state of a car in the next 2 minutes

(starts, stops, emergency braking)

!  Traffic Light Assistance –  Signal timing of traffic lights –  Crowd sourcing of traffic signals

@noellesio, @gopivotal

Page 7: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

7 © Copyright 2014 Pivotal. All rights reserved. 7 © Copyright 2013 Pivotal. All rights reserved.

What does traffic data look like?

Page 8: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

8 © Copyright 2014 Pivotal. All rights reserved.

…like this?

@noellesio, @gopivotal

Page 9: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

9 © Copyright 2014 Pivotal. All rights reserved.

How fast are vehicles moving?

@noellesio, @gopivotal

Page 10: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

10 © Copyright 2014 Pivotal. All rights reserved.

0 50 100 150 200

0.00

00.

005

0.01

00.

015

0.02

00.

025

0.03

0

dens

ity

Link 1000064869

km/h

How fast are vehicles moving?

@noellesio, @gopivotal

Page 11: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

11 © Copyright 2014 Pivotal. All rights reserved.

When do disruptions happen?

@noellesio, @gopivotal

Page 12: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

12 © Copyright 2014 Pivotal. All rights reserved.

When will the light change? [s

econ

ds]

@noellesio, @gopivotal

Page 13: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

13 © Copyright 2014 Pivotal. All rights reserved.

Taking Lessons From Other Disciplines

Change-Point Detection can be used to uncover regimes in wind-turbine data. It can also be applied to uncover regimes in traffic light switching patterns.

@noellesio, @gopivotal

Page 14: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

14 © Copyright 2014 Pivotal. All rights reserved.

0 50 100 150 200

0.00

00.

005

0.01

00.

015

0.02

00.

025

0.03

0

dens

ity

CombinedComponent 1Component 2Component 3Component 4

Link 1000064869

Different Cell Populations

Taking Lessons From Other Disciplines

Different Driving Conditions

@noellesio, @gopivotal

Page 15: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

15 © Copyright 2014 Pivotal. All rights reserved.

Understanding Traffic Flow A dynamic, more detailed understanding of traffic is now possible. Can we answer both ‘What velocity?’ and ‘Why?’

Context !  Current GPS systems are based on

average velocity over street segments

!  Real-time traffic information (e.g. Waze) does not deliver detailed view nor prediction

@akagoshima, @gopivotal

Page 16: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

16 © Copyright 2014 Pivotal. All rights reserved.

Our Approach – Multi-step algorithm

Step 1: Answer ‘What velocity?’

First find distinct velocity groups

Step 2: Answer ‘Why?’

Find influencing effects

From our experience, real-world data often requires multi-step procedures

0 50 100 150 200

0.00

00.

005

0.01

00.

015

0.02

00.

025

0.03

0

dens

ity

CombinedComponent 1Component 2Component 3Component 4

Link 1000064869

km/h

? ?

? ? ?

?

@akagoshima, @gopivotal

Page 17: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

17 © Copyright 2014 Pivotal. All rights reserved.

Find Velocity Groups !  Velocity distributions can be fit well

with Gaussians

!  An ‘overlay’ of multiple Gaussians is called Gaussian Mixture Model

!  GMM fitting of the velocity distribution is done by Expectation-Maximization algorithm

!  Shapes and positions of Gaussians determine velocity groups

0 50 100 150 200

0.00

00.

005

0.01

00.

015

0.02

00.

025

0.03

0

dens

ity

CombinedComponent 1Component 2Component 3Component 4

Link 1000064869

km/h

@akagoshima, @gopivotal

Page 18: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

18 © Copyright 2014 Pivotal. All rights reserved.

Gaussian Mixture Model

0 50 100 150 200

0.000

0.005

0.010

0.015

0.020

0.025

0.030

dens

ity

CombinedComponent 1Component 2Component 3Component 4

Link 1000064869

km/h 0 50 100 150 200

0.00

00.

005

0.01

00.

015

0.02

00.

025

0.03

0

dens

ity

CombinedComponent 1Component 2Component 3Component 4

Link 1000064869

km/h

0 50 100 150 200

0.00

00.

005

0.01

00.

015

0.02

00.

025

0.03

0

dens

ity

CombinedComponent 1Component 2Component 3Component 4

Link 1000064869

km/h

0 50 100 150 200

0.00

00.

005

0.01

00.

015

0.02

00.

025

0.03

0

dens

ity

CombinedComponent 1Component 2Component 3Component 4

Link 1000064869

km/h

@akagoshima, @gopivotal

Page 19: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

19 © Copyright 2014 Pivotal. All rights reserved.

0 50 100 150 200 250

0.00

0.01

0.02

0.03

0.04

dens

ity

CombinedComponent 1Component 2Component 3

Predict Gaussians !  The second step seeks to explain/predict

which Gaussian a data point belongs to "  Classification task!

!  Features for classification: –  Time of day, day of week –  Weather –  Direction –  Special Events –  …

? ? ? … ?

@akagoshima, @gopivotal

Page 20: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

20 © Copyright 2014 Pivotal. All rights reserved.

White and Black Boxes

!  Generate an interpretable model description

" Explanation of behavior

!  Can capture more complex correlations

" Prediction of Gaussian assignment

!  Analyze correlations between features of a data point and its assignment to a Gaussian

!  From a Machine Learning point of view, this is classification

@akagoshima, @gopivotal

Page 21: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

21 © Copyright 2014 Pivotal. All rights reserved.

0 50 100 150 200 250

0.00

0.01

0.02

0.03

0.04

dens

ity

CombinedComponent 1Component 2Component 3

km/h

Putting it all together…

Average velocity = 45 km/h •  Weekdays between 2 – 8pm

Average velocity = 85 km/h •  Weekdays after 8pm •  Weekdays before 2pm, exiting

Average velocity = 120 km/h •  Weekends •  Weekdays before 2pm, not exiting

Two-Step algorithm: GMM + Classification •  Identified multiple velocity profiles for every road segment •  Intuitive and easily interpretable results •  Highly scalable for more features and data

@akagoshima, @gopivotal

Page 22: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

22 © Copyright 2014 Pivotal. All rights reserved.

Better Travel Time Prediction

! Traffic profiles emerged from data –  Without using metadata, we uncovered road

segment traffic patterns

!  Identified Bias Effects –  Inferring the impact of turns and day of week on velocity –  Able to predict rush hour by day and time by road segment

! Traffic Light Patterns –  Infer public transportation effects on traffic –  Automatically determine different switching patterns

@akagoshima, @gopivotal

Page 23: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

23 © Copyright 2014 Pivotal. All rights reserved.

London Road Traffic Disruptions Can we predict when unexpected incidents will end? Publicly available data:

!  Transport for London traffic feed (refreshed every 5 minutes)

!  Weather Underground reports

Photo by James Blunt Photography on Flickr (CC BY-ND 2.0)

@ianhuston, @gopivotal

Page 24: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

24 © Copyright 2014 Pivotal. All rights reserved.

@ianhuston, @gopivotal

Page 25: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

25 © Copyright 2014 Pivotal. All rights reserved.

Major storm hits UK (Photo: BBC)

@ianhuston, @gopivotal

Page 26: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

26 © Copyright 2014 Pivotal. All rights reserved.

@ianhuston, @gopivotal

Page 27: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

27 © Copyright 2014 Pivotal. All rights reserved.

@ianhuston, @gopivotal

Page 28: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

28 © Copyright 2014 Pivotal. All rights reserved.

Durations are very different for different types of incident. Mean duration for Surface Damage incidents is 107 hours!

@ianhuston, @gopivotal

Page 29: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

29 © Copyright 2014 Pivotal. All rights reserved.

Rain affects duration in a surprising way. Incidents which start when it is raining finish faster than others.

@ianhuston, @gopivotal

Page 30: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

30 © Copyright 2014 Pivotal. All rights reserved.

Linear Regression •  Disruption reports &

weather features

Random Forests •  Rounded categorical •  Regression

Category MAP •  Only use category of

incident •  Maximum Likelihood

estimate

Models

@ianhuston, @gopivotal

Page 31: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

31 © Copyright 2014 Pivotal. All rights reserved.

Live Predictions

http://ds-demo-transport.cfapps.io Using:

@ianhuston, @gopivotal

Page 32: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

32 © Copyright 2014 Pivotal. All rights reserved.

Summary

! Making use of a vibrant ecosystem of traffic data ! Innovative approaches needed to generate value

from abundant and complex sources ! Connecting predictive models to traffic in the

physical world is the future of smart cities

Page 33: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

33 © Copyright 2014 Pivotal. All rights reserved. 33 © Copyright 2013 Pivotal. All rights reserved.

Thank You!

Check out more of our Data Science use-cases at www.goPivotal.com

Page 34: Driving the Future of Smart Cities - How to Beat the Traffic (Pivotal talk at Strata 2014)

A NEW PLATFORM FOR A NEW ERA