What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are...

88
What Does The Crowd Say About You? Evaluating Aggregation-Based Location Privacy PETS 2017 Apostolos Pyrgelis 1 , Carmela Troncoso 2 and Emiliano De Cristofaro 1 1 University College London, 2 IMDEA Software Institute July 19, 2017 Minneapolis, USA A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Transcript of What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are...

Page 1: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

What Does The Crowd Say About You?Evaluating Aggregation-Based Location Privacy

PETS 2017

Apostolos Pyrgelis 1, Carmela Troncoso 2 and Emiliano De Cristofaro 1

1University College London, 2IMDEA Software Institute

July 19, 2017Minneapolis, USA

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 2: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Table of Contents

1 Introduction

2 Framework

3 Evaluation of Raw Aggregates

4 Evaluation of Defenses

5 Conclusion

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 3: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Motivation

Aggregate locations are useful for performing mobilityanalytics, e.g., predicting traffic volumes, detecting anomalies

Aggregation is often considered a privacy-friendly approach,especially when done in a privacy-preserving way

Open Problem: No sound methodology to reason aboutprivacy leakage for individuals from the aggregates

...no way to evaluate potential defense mechanisms

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 4: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Motivation

Aggregate locations are useful for performing mobilityanalytics, e.g., predicting traffic volumes, detecting anomalies

Aggregation is often considered a privacy-friendly approach,especially when done in a privacy-preserving way

Open Problem: No sound methodology to reason aboutprivacy leakage for individuals from the aggregates

...no way to evaluate potential defense mechanisms

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 5: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Motivation

Aggregate locations are useful for performing mobilityanalytics, e.g., predicting traffic volumes, detecting anomalies

Aggregation is often considered a privacy-friendly approach,especially when done in a privacy-preserving way

Open Problem: No sound methodology to reason aboutprivacy leakage for individuals from the aggregates

...no way to evaluate potential defense mechanisms

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 6: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Motivation

Aggregate locations are useful for performing mobilityanalytics, e.g., predicting traffic volumes, detecting anomalies

Aggregation is often considered a privacy-friendly approach,especially when done in a privacy-preserving way

Open Problem: No sound methodology to reason aboutprivacy leakage for individuals from the aggregates

...no way to evaluate potential defense mechanisms

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 7: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Motivation

Aggregate locations are useful for performing mobilityanalytics, e.g., predicting traffic volumes, detecting anomalies

Aggregation is often considered a privacy-friendly approach,especially when done in a privacy-preserving way

Open Problem: No sound methodology to reason aboutprivacy leakage for individuals from the aggregates

...no way to evaluate potential defense mechanisms

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 8: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

In this work...

We introduce a framework to address this gap

We use it to measure users’ privacy loss from aggregatelocation time-series

We evaluate two real-world mobility datasets (TFL, SFC)

We study the privacy protection offered by defensemechanisms based on DP (output and input perturbation)

Raw aggregates do leak information about users’ locationsand mobility profiles

DP provides strong privacy protection when the utility of theaggregates is poor

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 9: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

In this work...

We introduce a framework to address this gap

We use it to measure users’ privacy loss from aggregatelocation time-series

We evaluate two real-world mobility datasets (TFL, SFC)

We study the privacy protection offered by defensemechanisms based on DP (output and input perturbation)

Raw aggregates do leak information about users’ locationsand mobility profiles

DP provides strong privacy protection when the utility of theaggregates is poor

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 10: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

In this work...

We introduce a framework to address this gap

We use it to measure users’ privacy loss from aggregatelocation time-series

We evaluate two real-world mobility datasets (TFL, SFC)

We study the privacy protection offered by defensemechanisms based on DP (output and input perturbation)

Raw aggregates do leak information about users’ locationsand mobility profiles

DP provides strong privacy protection when the utility of theaggregates is poor

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 11: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

In this work...

We introduce a framework to address this gap

We use it to measure users’ privacy loss from aggregatelocation time-series

We evaluate two real-world mobility datasets (TFL, SFC)

We study the privacy protection offered by defensemechanisms based on DP (output and input perturbation)

Raw aggregates do leak information about users’ locationsand mobility profiles

DP provides strong privacy protection when the utility of theaggregates is poor

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 12: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

In this work...

We introduce a framework to address this gap

We use it to measure users’ privacy loss from aggregatelocation time-series

We evaluate two real-world mobility datasets (TFL, SFC)

We study the privacy protection offered by defensemechanisms based on DP (output and input perturbation)

Raw aggregates do leak information about users’ locationsand mobility profiles

DP provides strong privacy protection when the utility of theaggregates is poor

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 13: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

In this work...

We introduce a framework to address this gap

We use it to measure users’ privacy loss from aggregatelocation time-series

We evaluate two real-world mobility datasets (TFL, SFC)

We study the privacy protection offered by defensemechanisms based on DP (output and input perturbation)

Raw aggregates do leak information about users’ locationsand mobility profiles

DP provides strong privacy protection when the utility of theaggregates is poor

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 14: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

In this work...

We introduce a framework to address this gap

We use it to measure users’ privacy loss from aggregatelocation time-series

We evaluate two real-world mobility datasets (TFL, SFC)

We study the privacy protection offered by defensemechanisms based on DP (output and input perturbation)

Raw aggregates do leak information about users’ locationsand mobility profiles

DP provides strong privacy protection when the utility of theaggregates is poor

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 15: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Table of Contents

1 Introduction

2 Framework

3 Evaluation of Raw Aggregates

4 Evaluation of Defenses

5 Conclusion

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 16: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Adversarial Prior Knowledge

Prior knowledge might come from social networks, data leaks,location traces released by providers or personal knowledge,e.g. home / work pair

In this work, we explore several possible approaches

Probabilistic:

Frequency of locations (over time)Location Seasonality (day / week)

Assignment:

Most popular locationsAll prior locationsLast Season (hour / day / week)

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 17: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Adversarial Prior Knowledge

Prior knowledge might come from social networks, data leaks,location traces released by providers or personal knowledge,e.g. home / work pair

In this work, we explore several possible approaches

Probabilistic:

Frequency of locations (over time)Location Seasonality (day / week)

Assignment:

Most popular locationsAll prior locationsLast Season (hour / day / week)

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 18: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Adversarial Prior Knowledge

Prior knowledge might come from social networks, data leaks,location traces released by providers or personal knowledge,e.g. home / work pair

In this work, we explore several possible approaches

Probabilistic:

Frequency of locations (over time)Location Seasonality (day / week)

Assignment:

Most popular locationsAll prior locationsLast Season (hour / day / week)

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 19: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Adversarial Prior Knowledge

Prior knowledge might come from social networks, data leaks,location traces released by providers or personal knowledge,e.g. home / work pair

In this work, we explore several possible approaches

Probabilistic:

Frequency of locations (over time)Location Seasonality (day / week)

Assignment:

Most popular locationsAll prior locationsLast Season (hour / day / week)

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 20: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Adversarial Prior Knowledge

Prior knowledge might come from social networks, data leaks,location traces released by providers or personal knowledge,e.g. home / work pair

In this work, we explore several possible approaches

Probabilistic:

Frequency of locations (over time)Location Seasonality (day / week)

Assignment:

Most popular locationsAll prior locationsLast Season (hour / day / week)

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 21: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Inference Strategies

Bayesian Update:Posterior probability of a user being in a location at a certaintime, given the prior and the aggregates

MAX-ROI:Greedy strategy that assigns the most probable users to eachlocation, until the aggregates are consumed

MAX-USER:Greedy strategy that assigns each user to her most likelylocations, until the aggregates are consumed

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 22: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Inference Strategies

Bayesian Update:Posterior probability of a user being in a location at a certaintime, given the prior and the aggregates

MAX-ROI:Greedy strategy that assigns the most probable users to eachlocation, until the aggregates are consumed

MAX-USER:Greedy strategy that assigns each user to her most likelylocations, until the aggregates are consumed

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 23: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Inference Strategies

Bayesian Update:Posterior probability of a user being in a location at a certaintime, given the prior and the aggregates

MAX-ROI:Greedy strategy that assigns the most probable users to eachlocation, until the aggregates are consumed

MAX-USER:Greedy strategy that assigns each user to her most likelylocations, until the aggregates are consumed

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 24: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Inference Strategies

Bayesian Update:Posterior probability of a user being in a location at a certaintime, given the prior and the aggregates

MAX-ROI:Greedy strategy that assigns the most probable users to eachlocation, until the aggregates are consumed

MAX-USER:Greedy strategy that assigns each user to her most likelylocations, until the aggregates are consumed

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 25: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Timeline

Observation Period: used to build prior knowledge for eachuser

Inference period: launch the attacks

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 26: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Timeline

Observation Period: used to build prior knowledge for eachuser

Inference period: launch the attacks

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 27: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Timeline

Observation Period: used to build prior knowledge for eachuser

Inference period: launch the attacks

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 28: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Adversarial Goals

User Profiling:

Infer the probability of a user being in a location at a certaintime

Adversarial Error: JS-divergence from the ground truth

User Localization:

Predict where the user will be at a certain time

Adversarial Error: 1 - F1

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 29: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Adversarial Goals

User Profiling:

Infer the probability of a user being in a location at a certaintime

Adversarial Error: JS-divergence from the ground truth

User Localization:

Predict where the user will be at a certain time

Adversarial Error: 1 - F1

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 30: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Adversarial Goals

User Profiling:

Infer the probability of a user being in a location at a certaintime

Adversarial Error: JS-divergence from the ground truth

User Localization:

Predict where the user will be at a certain time

Adversarial Error: 1 - F1

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 31: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Privacy Loss

Privacy Loss (PL):

normalized reduction in adversarial error with vs without theaggregates

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 32: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Privacy Loss

Privacy Loss (PL):

normalized reduction in adversarial error with vs without theaggregates

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 33: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Table of Contents

1 Introduction

2 Framework

3 Evaluation of Raw Aggregates

4 Evaluation of Defenses

5 Conclusion

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 34: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Datasets

Tranport For London (TFL):

60M trips - 4M unique oyster cards - 582 stations (regions ofinterest - ROIs)

Monday, March 1 - Sunday, March 28, 2010

Sample the top 10K oyster ids per total # of trips, beingactive for 115 ± 21 out of the 672 timeslots and reporting171 ± 26 ROIs in total (sparse, regular)

San Francisco Cabs (SFC):

11M GPS coordinates - 534 cabs in SF - May 19 to June 8,2008

Grid 10 × 10 = 100 ROIs of 0.5 × 0.37 mi2

Taxis are active for 340 ± 94 out of the 504 timeslots andreport 3, 663 ± 1, 116 ROIs in total (dense, irregular)

Prior Knowledge: Split data according to timeInference Period: Last week of each dataset

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 35: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Datasets

Tranport For London (TFL):

60M trips - 4M unique oyster cards - 582 stations (regions ofinterest - ROIs)

Monday, March 1 - Sunday, March 28, 2010

Sample the top 10K oyster ids per total # of trips, beingactive for 115 ± 21 out of the 672 timeslots and reporting171 ± 26 ROIs in total (sparse, regular)

San Francisco Cabs (SFC):

11M GPS coordinates - 534 cabs in SF - May 19 to June 8,2008

Grid 10 × 10 = 100 ROIs of 0.5 × 0.37 mi2

Taxis are active for 340 ± 94 out of the 504 timeslots andreport 3, 663 ± 1, 116 ROIs in total (dense, irregular)

Prior Knowledge: Split data according to timeInference Period: Last week of each dataset

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 36: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Datasets

Tranport For London (TFL):

60M trips - 4M unique oyster cards - 582 stations (regions ofinterest - ROIs)

Monday, March 1 - Sunday, March 28, 2010

Sample the top 10K oyster ids per total # of trips, beingactive for 115 ± 21 out of the 672 timeslots and reporting171 ± 26 ROIs in total (sparse, regular)

San Francisco Cabs (SFC):

11M GPS coordinates - 534 cabs in SF - May 19 to June 8,2008

Grid 10 × 10 = 100 ROIs of 0.5 × 0.37 mi2

Taxis are active for 340 ± 94 out of the 504 timeslots andreport 3, 663 ± 1, 116 ROIs in total (dense, irregular)

Prior Knowledge: Split data according to timeInference Period: Last week of each dataset

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 37: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Datasets

Tranport For London (TFL):

60M trips - 4M unique oyster cards - 582 stations (regions ofinterest - ROIs)

Monday, March 1 - Sunday, March 28, 2010

Sample the top 10K oyster ids per total # of trips, beingactive for 115 ± 21 out of the 672 timeslots and reporting171 ± 26 ROIs in total (sparse, regular)

San Francisco Cabs (SFC):

11M GPS coordinates - 534 cabs in SF - May 19 to June 8,2008

Grid 10 × 10 = 100 ROIs of 0.5 × 0.37 mi2

Taxis are active for 340 ± 94 out of the 504 timeslots andreport 3, 663 ± 1, 116 ROIs in total (dense, irregular)

Prior Knowledge: Split data according to time

Inference Period: Last week of each dataset

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 38: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Datasets

Tranport For London (TFL):

60M trips - 4M unique oyster cards - 582 stations (regions ofinterest - ROIs)

Monday, March 1 - Sunday, March 28, 2010

Sample the top 10K oyster ids per total # of trips, beingactive for 115 ± 21 out of the 672 timeslots and reporting171 ± 26 ROIs in total (sparse, regular)

San Francisco Cabs (SFC):

11M GPS coordinates - 534 cabs in SF - May 19 to June 8,2008

Grid 10 × 10 = 100 ROIs of 0.5 × 0.37 mi2

Taxis are active for 340 ± 94 out of the 504 timeslots andreport 3, 663 ± 1, 116 ROIs in total (dense, irregular)

Prior Knowledge: Split data according to timeInference Period: Last week of each dataset

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 39: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

User Profiling

Prior Knowledge: Location frequency, over the observation period

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Oyst

ers

FREQ_ROI

Ap

BAYES

MAX_ROI

MAX_USER

TFL

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Cabs

FREQ_ROI

Ap

BAYES

MAX_ROI

MAX_USER

SFC

TFL vs. SFC: Inferring mobility profiles of commuters easier thancabs

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 40: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

User Profiling

Prior Knowledge: Location frequency, over the observation period

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Oyst

ers

FREQ_ROI

Ap

BAYES

MAX_ROI

MAX_USER

TFL

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Cabs

FREQ_ROI

Ap

BAYES

MAX_ROI

MAX_USER

SFC

TFL vs. SFC: Inferring mobility profiles of commuters easier thancabs

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 41: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

User Profiling

Prior Knowledge: Location frequency, over the observation period

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Oyst

ers

FREQ_ROI

Ap

BAYES

MAX_ROI

MAX_USER

TFL

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Cabs

FREQ_ROI

Ap

BAYES

MAX_ROI

MAX_USER

SFC

TFL vs. SFC: Inferring mobility profiles of commuters easier thancabs

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 42: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

User Profiling

Prior Knowledge: Location frequency, over the observation period

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Oyst

ers

FREQ_ROI

Ap

BAYES

MAX_ROI

MAX_USER

TFL

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Cabs

FREQ_ROI

Ap

BAYES

MAX_ROI

MAX_USER

SFC

TFL vs. SFC: Inferring mobility profiles of commuters easier thancabs

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 43: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

User Profiling

Prior Knowledge: Location frequency, over the observation period

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Oyst

ers

FREQ_ROI

Ap

BAYES

MAX_ROI

MAX_USER

TFL

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Cabs

FREQ_ROI

Ap

BAYES

MAX_ROI

MAX_USER

SFC

TFL vs. SFC: Inferring mobility profiles of commuters easier thancabs

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 44: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Implications of Regular Mobility Patterns

Prior Knowledge: Location frequency, for time instances of anyday

Mar-22 Mar-23 Mar-24 Mar-25 Mar-26 Mar-27 Mar-28Date

0.0

0.1

0.2

0.3

0.4

0.5

0.6

Advers

ari

al Err

or

ROI_DAY Ap BAYES MAX_ROI MAX_USER

TFL

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 45: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Implications of Regular Mobility Patterns

Prior Knowledge: Location frequency, for time instances of anyday

Mar-22 Mar-23 Mar-24 Mar-25 Mar-26 Mar-27 Mar-28Date

0.0

0.1

0.2

0.3

0.4

0.5

0.6

Advers

ari

al Err

or

ROI_DAY Ap BAYES MAX_ROI MAX_USER

TFL

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 46: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Implications of Regular Mobility Patterns

Prior Knowledge: Location frequency, for time instances of anyday

Mar-22 Mar-23 Mar-24 Mar-25 Mar-26 Mar-27 Mar-28Date

0.0

0.1

0.2

0.3

0.4

0.5

0.6

Advers

ari

al Err

or

ROI_DAY Ap BAYES MAX_ROI MAX_USER

TFL

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 47: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

User Localization

Prior Knowledge: Location frequency, for time instances of aweek

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Oyst

ers

ALL

BAYES_ALL

POP

BAYES_POP

MAX_ROI

MAX_USER

TFL

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Cabs

ALL

BAYES_ALL

POP

BAYES_POP

MAX_ROI

MAX_USER

SFC

TFL vs SFC: Commuters are best localized via their most popularROIs, whereas cabs via their last hour’s ROIs

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 48: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

User Localization

Prior Knowledge: Location frequency, for time instances of aweek

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Oyst

ers

ALL

BAYES_ALL

POP

BAYES_POP

MAX_ROI

MAX_USER

TFL

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Cabs

ALL

BAYES_ALL

POP

BAYES_POP

MAX_ROI

MAX_USER

SFC

TFL vs SFC: Commuters are best localized via their most popularROIs, whereas cabs via their last hour’s ROIs

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 49: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

User Localization

Prior Knowledge: Location frequency, for time instances of aweek

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Oyst

ers

ALL

BAYES_ALL

POP

BAYES_POP

MAX_ROI

MAX_USER

TFL

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Cabs

ALL

BAYES_ALL

POP

BAYES_POP

MAX_ROI

MAX_USER

SFC

TFL vs SFC: Commuters are best localized via their most popularROIs, whereas cabs via their last hour’s ROIs

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 50: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

User Localization

Prior Knowledge: Location frequency, for time instances of aweek

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Oyst

ers

ALL

BAYES_ALL

POP

BAYES_POP

MAX_ROI

MAX_USER

TFL

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Cabs

ALL

BAYES_ALL

POP

BAYES_POP

MAX_ROI

MAX_USER

SFC

TFL vs SFC: Commuters are best localized via their most popularROIs, whereas cabs via their last hour’s ROIs

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 51: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

User Localization

Prior Knowledge: Location frequency, for time instances of aweek

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Oyst

ers

ALL

BAYES_ALL

POP

BAYES_POP

MAX_ROI

MAX_USER

TFL

0.0 0.2 0.4 0.6 0.8 1.0Adversarial Error

0.0

0.2

0.4

0.6

0.8

1.0

CD

F -

Cabs

ALL

BAYES_ALL

POP

BAYES_POP

MAX_ROI

MAX_USER

SFC

TFL vs SFC: Commuters are best localized via their most popularROIs, whereas cabs via their last hour’s ROIs

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 52: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Take Aways

Aggregates do help the adversary to profile and localize users

Degree of privacy loss depends on the prior

Assignment priors yield smaller privacy leakage compared toprobabilistic ones

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 53: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Take Aways

Aggregates do help the adversary to profile and localize users

Degree of privacy loss depends on the prior

Assignment priors yield smaller privacy leakage compared toprobabilistic ones

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 54: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Take Aways

Aggregates do help the adversary to profile and localize users

Degree of privacy loss depends on the prior

Assignment priors yield smaller privacy leakage compared toprobabilistic ones

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 55: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Take Aways

Aggregates do help the adversary to profile and localize users

Degree of privacy loss depends on the prior

Assignment priors yield smaller privacy leakage compared toprobabilistic ones

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 56: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Table of Contents

1 Introduction

2 Framework

3 Evaluation of Raw Aggregates

4 Evaluation of Defenses

5 Conclusion

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 57: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Evaluation of Defenses

Location aggregates is a suitable setting for differentialprivacy (DP)

How much privacy does DP provide? (with respect to ε andutility)

Privacy Gain (PG): Normalized increase in adversarial errorgiven DP aggregates compared to that with raw aggregates

Utility: Mean Relative Error (MRE)

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 58: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Evaluation of Defenses

Location aggregates is a suitable setting for differentialprivacy (DP)

How much privacy does DP provide? (with respect to ε andutility)

Privacy Gain (PG): Normalized increase in adversarial errorgiven DP aggregates compared to that with raw aggregates

Utility: Mean Relative Error (MRE)

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 59: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Evaluation of Defenses

Location aggregates is a suitable setting for differentialprivacy (DP)

How much privacy does DP provide? (with respect to ε andutility)

Privacy Gain (PG): Normalized increase in adversarial errorgiven DP aggregates compared to that with raw aggregates

Utility: Mean Relative Error (MRE)

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 60: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Evaluation of Defenses

Location aggregates is a suitable setting for differentialprivacy (DP)

How much privacy does DP provide? (with respect to ε andutility)

Privacy Gain (PG): Normalized increase in adversarial errorgiven DP aggregates compared to that with raw aggregates

Utility: Mean Relative Error (MRE)

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 61: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Evaluation of Defenses

Location aggregates is a suitable setting for differentialprivacy (DP)

How much privacy does DP provide? (with respect to ε andutility)

Privacy Gain (PG): Normalized increase in adversarial errorgiven DP aggregates compared to that with raw aggregates

Utility: Mean Relative Error (MRE)

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 62: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Output Perturbation ( 1 / 2 )

Simple Counter Mechanism (SCM)

Provides event-level privacy, i.e., it protects whether or not auser was in a specific location at a specific time

Can be configured to achieve stronger guarantees, (e.g., ε-DP)

Fourier Perturbation Algorithm (FPA)

Improves the privacy/utility trade-off by reducing the amountof noise required

Noise addition is done on the compressed domain (DFT),ε-DP per time-series

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 63: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Output Perturbation ( 1 / 2 )

Simple Counter Mechanism (SCM)

Provides event-level privacy, i.e., it protects whether or not auser was in a specific location at a specific time

Can be configured to achieve stronger guarantees, (e.g., ε-DP)

Fourier Perturbation Algorithm (FPA)

Improves the privacy/utility trade-off by reducing the amountof noise required

Noise addition is done on the compressed domain (DFT),ε-DP per time-series

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 64: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Output Perturbation ( 1 / 2 )

Simple Counter Mechanism (SCM)

Provides event-level privacy, i.e., it protects whether or not auser was in a specific location at a specific time

Can be configured to achieve stronger guarantees, (e.g., ε-DP)

Fourier Perturbation Algorithm (FPA)

Improves the privacy/utility trade-off by reducing the amountof noise required

Noise addition is done on the compressed domain (DFT),ε-DP per time-series

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 65: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Output Perturbation ( 1 / 2 )

Simple Counter Mechanism (SCM)

Provides event-level privacy, i.e., it protects whether or not auser was in a specific location at a specific time

Can be configured to achieve stronger guarantees, (e.g., ε-DP)

Fourier Perturbation Algorithm (FPA)

Improves the privacy/utility trade-off by reducing the amountof noise required

Noise addition is done on the compressed domain (DFT),ε-DP per time-series

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 66: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Output Perturbation ( 1 / 2 )

Simple Counter Mechanism (SCM)

Provides event-level privacy, i.e., it protects whether or not auser was in a specific location at a specific time

Can be configured to achieve stronger guarantees, (e.g., ε-DP)

Fourier Perturbation Algorithm (FPA)

Improves the privacy/utility trade-off by reducing the amountof noise required

Noise addition is done on the compressed domain (DFT),ε-DP per time-series

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 67: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Output Perturbation ( 1 / 2 )

Simple Counter Mechanism (SCM)

Provides event-level privacy, i.e., it protects whether or not auser was in a specific location at a specific time

Can be configured to achieve stronger guarantees, (e.g., ε-DP)

Fourier Perturbation Algorithm (FPA)

Improves the privacy/utility trade-off by reducing the amountof noise required

Noise addition is done on the compressed domain (DFT),ε-DP per time-series

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 68: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Output Perturbation ( 1 / 2 )

Simple Counter Mechanism (SCM)

Provides event-level privacy, i.e., it protects whether or not auser was in a specific location at a specific time

Can be configured to achieve stronger guarantees, (e.g., ε-DP)

Fourier Perturbation Algorithm (FPA)

Improves the privacy/utility trade-off by reducing the amountof noise required

Noise addition is done on the compressed domain (DFT),ε-DP per time-series

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 69: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Output Perturbation ( 2 / 2 )

Privacy:

0.001 0.01 0.1 1.0ε

0.0

0.2

0.4

0.6

0.8

1.0

Pri

vacy

Gain

Prior: FREQ_ROI, Inference: MAX_ROI

SCM - Lap( ∆ / ε) SCM - Lap( | T'| / ε) FPA SCM - Lap(1 / ε)

Utility:

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 70: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Output Perturbation ( 2 / 2 )

Privacy:

0.001 0.01 0.1 1.0ε

0.0

0.2

0.4

0.6

0.8

1.0

Pri

vacy

Gain

Prior: FREQ_ROI, Inference: MAX_ROI

SCM - Lap( ∆ / ε) SCM - Lap( | T'| / ε) FPA SCM - Lap(1 / ε)

Utility:

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 71: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Output Perturbation ( 2 / 2 )

Privacy:

0.001 0.01 0.1 1.0ε

0.0

0.2

0.4

0.6

0.8

1.0

Pri

vacy

Gain

Prior: FREQ_ROI, Inference: MAX_ROI

SCM - Lap( ∆ / ε) SCM - Lap( | T'| / ε) FPA SCM - Lap(1 / ε)

Utility:

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 72: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Input Perturbation ( 1 / 2 )

Randomized Response

SpotMe mechanism is focused on aggregate locationtime-series

Users report to be in a location with some probability p, orreport the truth with probability 1 − p

Aggregator collects user perturbed inputs and estimates theaggregates

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 73: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Input Perturbation ( 1 / 2 )

Randomized Response

SpotMe mechanism is focused on aggregate locationtime-series

Users report to be in a location with some probability p, orreport the truth with probability 1 − p

Aggregator collects user perturbed inputs and estimates theaggregates

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 74: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Input Perturbation ( 1 / 2 )

Randomized Response

SpotMe mechanism is focused on aggregate locationtime-series

Users report to be in a location with some probability p, orreport the truth with probability 1 − p

Aggregator collects user perturbed inputs and estimates theaggregates

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 75: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Input Perturbation ( 1 / 2 )

Randomized Response

SpotMe mechanism is focused on aggregate locationtime-series

Users report to be in a location with some probability p, orreport the truth with probability 1 − p

Aggregator collects user perturbed inputs and estimates theaggregates

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 76: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Input Perturbation ( 1 / 2 )

Randomized Response

SpotMe mechanism is focused on aggregate locationtime-series

Users report to be in a location with some probability p, orreport the truth with probability 1 − p

Aggregator collects user perturbed inputs and estimates theaggregates

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 77: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Input Perturbation ( 2 / 2 )

Privacy:

0.1 0.3 0.5 0.7 0.9p

0.0

0.2

0.4

0.6

0.8

1.0

Pri

vacy

Gain

Prior: FREQ_ROI

TFL - MAX_ROI SFC - MAX_USER

Utility:

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 78: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Input Perturbation ( 2 / 2 )

Privacy:

0.1 0.3 0.5 0.7 0.9p

0.0

0.2

0.4

0.6

0.8

1.0

Pri

vacy

Gain

Prior: FREQ_ROI

TFL - MAX_ROI SFC - MAX_USER

Utility:

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 79: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Input Perturbation ( 2 / 2 )

Privacy:

0.1 0.3 0.5 0.7 0.9p

0.0

0.2

0.4

0.6

0.8

1.0

Pri

vacy

Gain

Prior: FREQ_ROI

TFL - MAX_ROI SFC - MAX_USER

Utility:

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 80: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Table of Contents

1 Introduction

2 Framework

3 Evaluation of Raw Aggregates

4 Evaluation of Defenses

5 Conclusion

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 81: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Conclusion

We presented a framework that allows us to reason aboutindividuals’ privacy loss from aggregate location time-seriesrelease

Location aggregates enable an adversary with some priorknowledge to profile and localize users

DP mechanisms improve privacy when the utility of thetime-series is poor

Need for novel defense mechanisms for privacy-friendlymobility analytics

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 82: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Conclusion

We presented a framework that allows us to reason aboutindividuals’ privacy loss from aggregate location time-seriesrelease

Location aggregates enable an adversary with some priorknowledge to profile and localize users

DP mechanisms improve privacy when the utility of thetime-series is poor

Need for novel defense mechanisms for privacy-friendlymobility analytics

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 83: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Conclusion

We presented a framework that allows us to reason aboutindividuals’ privacy loss from aggregate location time-seriesrelease

Location aggregates enable an adversary with some priorknowledge to profile and localize users

DP mechanisms improve privacy when the utility of thetime-series is poor

Need for novel defense mechanisms for privacy-friendlymobility analytics

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 84: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Conclusion

We presented a framework that allows us to reason aboutindividuals’ privacy loss from aggregate location time-seriesrelease

Location aggregates enable an adversary with some priorknowledge to profile and localize users

DP mechanisms improve privacy when the utility of thetime-series is poor

Need for novel defense mechanisms for privacy-friendlymobility analytics

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 85: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

Conclusion

We presented a framework that allows us to reason aboutindividuals’ privacy loss from aggregate location time-seriesrelease

Location aggregates enable an adversary with some priorknowledge to profile and localize users

DP mechanisms improve privacy when the utility of thetime-series is poor

Need for novel defense mechanisms for privacy-friendlymobility analytics

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 86: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

The end...

Thanks for your attention! Any questions?

Contact Details: [email protected]

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 87: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

The end...

Thanks for your attention! Any questions?

Contact Details: [email protected]

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy

Page 88: What Does The Crowd Say About You? Evaluating Aggregation ... · Motivation Aggregate locations are useful for performing mobility analytics, e.g., predicting tra c volumes, detecting

The end...

Thanks for your attention! Any questions?

Contact Details: [email protected]

A. Pyrgelis, C. Troncoso, E. De Cristofaro Evaluating Aggregation-Based Location Privacy