Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet...

10

Transcript of Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet...

Page 1: Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet Large Scale Visual Recognition Competition, or LSVRC (currently the word's top annual
Page 2: Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet Large Scale Visual Recognition Competition, or LSVRC (currently the word's top annual

Analysis 𝑡

𝑡

𝑡 𝑋

Page 3: Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet Large Scale Visual Recognition Competition, or LSVRC (currently the word's top annual

𝑡

𝑡 𝑋

Analysis 𝑡# ← 𝑡 𝑋

Page 4: Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet Large Scale Visual Recognition Competition, or LSVRC (currently the word's top annual

𝑡 𝑋

𝑡

Analysis 𝑡# ← 𝑡 𝑋𝑡# 𝑋#

A lot of existing theory assumes tests are selected independently of the data.

𝑡#

Page 5: Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet Large Scale Visual Recognition Competition, or LSVRC (currently the word's top annual

Ideal World

How can we provide statistically valid answers to adaptively chosen analyses?

Real World

Page 6: Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet Large Scale Visual Recognition Competition, or LSVRC (currently the word's top annual

Ideal World

How can we provide statistically valid answers to adaptively chosen analyses?

Real World

I

n

t

r

o

d

u

c

t

i

o

n

M

o

d

e

l

R

e

s

u

l

t

s

K

e

y

I

d

e

a

s

P

r

o

o

f

S

k

e

t

c

h

Adaptivity causes r

eal problems

I

n

t

r

o

d

u

c

t

i

o

n

M

o

d

e

l

R

e

s

u

l

t

s

K

e

y

I

d

e

a

s

P

r

o

o

f

S

k

e

t

c

h

Adaptivity causes real problems

Introduction Model Results Key Ideas Proof Sketch

Adaptivity causes real problems

Page 7: Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet Large Scale Visual Recognition Competition, or LSVRC (currently the word's top annual

𝑋~𝑃(

How can we provide statistically valid answers to adaptively chosen analyses?

𝑡𝑡(𝑋)

𝑡′ ← 𝑡 𝑋

𝑡′𝑡′(𝑋)

Page 8: Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet Large Scale Visual Recognition Competition, or LSVRC (currently the word's top annual

𝑋~𝑃(

𝑡𝑡(𝑋)

𝑡′ ← 𝑎

𝑡′𝑡′(𝑋)

Answer: Limit the info learned about the dataset with each analysis [Dwork,Feldman,Hardt,Pitassi,Reingold,Roth’15].

𝑎

Page 9: Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet Large Scale Visual Recognition Competition, or LSVRC (currently the word's top annual

[Dwork,McSherry,Nissim,Smith’06]

𝐴:𝐷( → 𝑌 𝜀, 𝛿𝑥, 𝑥′ ∈ 𝐷( 𝑆 ⊆ 𝑌

𝑃 𝐴 𝑥 ∈ 𝑆 ≤ 𝑒;𝑃 𝐴 𝑥# ∈ 𝑆 + 𝛿

Page 10: Analysis - University of Pennsylvaniaryrogers/Leverage_Talk.pdf · At the end of May, the ImageNet Large Scale Visual Recognition Competition, or LSVRC (currently the word's top annual

[R, Roth, Smith, Thakkar’16].

[Gaboardi, Lim, R, Vadhan’16], [Kifer,R’16].

[R,Roth,Ullman,Vadhan’16].