COMP 381. Exercise: Data Collection 1.Who are the “fact collectors”? (make a list—be specific)...

PRIVACY: DATA MINING 9 FEBRUARY COMP 381
Date posted: 19-Dec-2015


PRIVACY: DATA MINING

9 FEBRUARY

COMP 381

Exercise: Data Collection

1. Who are the “fact collectors”? (make a list—be specific)

2. What KINDS of ‘facts’ are stored about you/us? (make a list—be specific)

Fact Collectors (class list)

government

census

taxes

work

cars, houses, real estate

voting record and party affiliation

bank accounts

medical records

criminal history; civil suits

guns

birth/death

insurance

schools

grades

finances

address

family background

SSN

medical/mental health

salary

degrees/education

schedule

on-campus purchases

access to buildings/dorms

computer usage/printing

activities

criminal records

employers

conflict of interest

investments

political activities

knowledge

websites

phone calls

purchases

SSN

companies that I buy from

buy

sites

credit ratings

address & phone

email

other companies

purchased info from other companies

info based on IP address

public info (see above)

Google

family and friends

Data & informational privacy

One should ask:

• Who has ACCESS?

• Who should have ACCESS?

• Need to know?

• Why?

• How long is the data kept?

Question: Is there a problem with a search engine monopoly (or oligopoly)?

Mowshowitz and Kumar, And Then There Were Three

Search Engine US Market Shares (December 2010)

Search engine    Share (%)
Google           66.6
Yahoo            16.0
Microsoft        12.0
Ask               3.5
AOL               1.9

Source: http://searchenginewatch.com/3641746

Global

Grocery Store Receipts

eggs, milk

bread, cheese, plates

napkins, trash bags

eggs, cheese

sour cream, bread, chips, soup

milk, eggs, bread, butter

cheese, gum, soda

bread, eggs, milk

cheese, diapers

juice

Diapers and beer? Super Bowl Sunday?

http://portal.acm.org/citation.cfm?doid=358108.358114
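The diapers-and-beer question is usually posed as an association-rule problem: how often do two items show up on the same receipt? The following is a minimal sketch under stated assumptions. The baskets are hypothetical (they are not the receipts listed above, which contain no beer), and the function names `support` and `confidence` are illustrative, following the standard textbook definitions rather than any particular tool.

```python
from collections import Counter
from itertools import combinations

# Hypothetical baskets for illustration only; NOT the receipts from the slide.
baskets = [
    {"eggs", "milk", "bread"},
    {"diapers", "beer", "chips"},
    {"diapers", "beer", "milk"},
    {"bread", "cheese", "milk"},
    {"diapers", "juice"},
]

def support(itemset, baskets):
    """Fraction of baskets that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= basket for basket in baskets) / len(baskets)

def confidence(antecedent, consequent, baskets):
    """Estimated P(consequent in basket | antecedent in basket)."""
    both = set(antecedent) | set(consequent)
    return support(both, baskets) / support(antecedent, baskets)

def pair_counts(baskets):
    """Count how many baskets contain each unordered pair of items."""
    counts = Counter()
    for basket in baskets:
        for pair in combinations(sorted(basket), 2):
            counts[pair] += 1
    return counts

if __name__ == "__main__":
    print("support(diapers, beer):", support({"diapers", "beer"}, baskets))
    print("confidence(diapers -> beer):", confidence({"diapers"}, {"beer"}, baskets))
    print("pairs seen more than once:",
          [pair for pair, n in pair_counts(baskets).items() if n > 1])
```

With these made-up baskets, two of the three diaper purchases also include beer. A retailer running the same count over millions of real receipts learns that kind of pattern about identifiable shoppers, which is where the privacy question comes in.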

Clique = a tightly-knit group of people

Clique = a set of vertices in a graph that are all connected to each other by edges of the graph

Maximal Clique = a clique that is not a subset of a larger clique

Friend Grouper: Definitions

[Figure: graph with nodes Alice, Bob, Eva, David, Carol]
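The graph definitions of clique and maximal clique can be made concrete with a short program. This is a minimal sketch using the basic Bron-Kerbosch algorithm, a standard way to enumerate maximal cliques; the slides do not say which algorithm Friend Grouper itself uses. The edges below are assumptions chosen for illustration, since the figure's edges did not survive in the transcript.

```python
def maximal_cliques(graph):
    """Yield each maximal clique of an undirected graph as a frozenset.

    `graph` maps every vertex to the set of its neighbours.
    Basic Bron-Kerbosch without pivoting.
    """
    def expand(r, p, x):
        # r: current clique, p: candidates that extend r, x: already explored
        if not p and not x:
            yield frozenset(r)  # nothing can extend r, so r is maximal
            return
        for v in list(p):
            yield from expand(r | {v}, p & graph[v], x & graph[v])
            p = p - {v}
            x = x | {v}

    yield from expand(set(), set(graph), set())

# Hypothetical friendships; the edges in the original figure are unknown.
edges = [
    ("Alice", "Bob"), ("Alice", "Carol"), ("Bob", "Carol"),
    ("Carol", "David"), ("David", "Eva"),
]
graph = {}
for a, b in edges:
    graph.setdefault(a, set()).add(b)
    graph.setdefault(b, set()).add(a)

cliques = set(maximal_cliques(graph))

if __name__ == "__main__":
    for clique in sorted(cliques, key=lambda c: sorted(c)):
        print(sorted(clique))
```

Under these assumed edges, {Alice, Bob} is a clique but not a maximal one, because it is a subset of the larger clique {Alice, Bob, Carol}.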

An online social network is rarely a perfect representation of the real social network

• new member

• “acquaintances” or “friends”

Friend Grouper: Problem Addressed

[Figure: two graphs over Alice, Bob, Eva, David, and Carol, labeled Real Life and Online]

Triangle Closure

Nodes = People

Edges = Social Relationships

Individuals with common friends are more likely to become friends

Leads to “people you may know”
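Triangle closure suggests a simple ranking rule: for a given person, count how many friends they share with each non-friend, and suggest the non-friends with the most shared friends. The sketch below assumes a hypothetical friendship graph and a hypothetical function name `people_you_may_know`. It illustrates the idea only and is not a description of how any real social network computes its suggestions.

```python
from collections import Counter

def people_you_may_know(graph, person):
    """Rank non-friends of `person` by number of shared friends.

    `graph` maps each person to the set of their friends.
    Returns (candidate, shared_friend_count) pairs, most shared first,
    with ties broken alphabetically.
    """
    friends = graph[person]
    shared = Counter()
    for friend in friends:
        for candidate in graph[friend]:
            # Skip the person themselves and anyone already a friend.
            if candidate != person and candidate not in friends:
                shared[candidate] += 1
    return sorted(shared.items(), key=lambda kv: (-kv[1], kv[0]))

# Hypothetical friendship graph for illustration only.
graph = {
    "Alice": {"Bob", "Carol"},
    "Bob": {"Alice", "Carol", "David"},
    "Carol": {"Alice", "Bob", "David"},
    "David": {"Bob", "Carol", "Eva"},
    "Eva": {"David"},
}

if __name__ == "__main__":
    print(people_you_may_know(graph, "Alice"))
    print(people_you_may_know(graph, "Eva"))
```

In this made-up graph, Alice and David share two friends (Bob and Carol), so David is suggested to Alice: adding that edge would close two open triangles. The rule knows nothing about why two people are not connected.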

Recommending Movies

Predicting Ratings

Data-Mining Contest:

• Release rating data, sans “identifying information”

• Prize awarded

• Additional contests: canceled

○ http://bits.blogs.nytimes.com/2010/03/12/netflix-cancels-contest-plans-and-settles-suit/

Borderline: Netflix

“people you may know”

• Ex-boyfriend/girlfriend you don’t want to talk to… EVER

• Person who used to bully you

“reconnect”

• Person in coma in Australia

Borderline: Facebook

Types of Invasions

Individual

• Cameras

Governmental

• Snuggly bear on warrantless wiretapping

Institutional

• Privacy and social networks