Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation...

42
ADVERSARIAL EXAMPLES (In 15 minutes or less) Neill Patterson, MscAC

Transcript of Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation...

Page 1: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

ADVERSARIAL EXAMPLES(In 15 minutes or less)

Neill Patterson, MscAC

Page 2: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

PART I - BASIC CONCEPTS

Page 3: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

WE TRAIN MODELS BY TAKING GRADIENTS W.R.T. WEIGHTS

w w � ⌘rJw

Page 4: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

“Panda”Change weights via

gradient descent

Page 5: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

WE’RE GOING TO TAKE GRADIENTS W.R.T. PIXELS INSTEAD

x x± ⌘rJx

Page 6: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

WE ARE GOING TO TAKE GRADIENTS W.R.T. PIXELS INSTEAD

x x± ⌘rJx

Page 7: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

“Panda”

“Vulture”

Change pixels via

gradient descent

Page 8: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

KEY IDEA: ADD SMALL, WORST-CASE PIXEL DISTORTION TO CAUSE

MISCLASSIFICATIONS

Page 9: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

+ =

“Panda” “Gibbon”

99% confidence58% confidence

Page 10: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

THINK OF ADVERSARIAL EXAMPLES AS WORST-CASE DOPPLEGÄNGERS

Page 11: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

DEMO

Page 12: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

Sanja Fidler Fiddler Crab

Page 13: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

PART II - HARNESSING ADVERSARIAL EXAMPLES

Page 14: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

KEY IDEA: MAKE TRAINING MORE DIFFICULT TO GET STRONGER MODELS

(DROPOUT, RANDOM NOISE, ETC)

Page 15: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

TRAIN WITH ADVERSARIAL EXAMPLES FOR BETTER

GENERALIZATION

Page 16: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

THE FAST GRADIENT SIGN METHOD OF IAN GOODFELLOW

Page 17: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

QUICKLY GENERATING ADVERSARIAL EXAMPLES

Page 18: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

WHAT DIRECTION SHOULD YOU MOVE TOWARDS?

Page 19: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

INSTEAD OF MOVING TOWARDS A SPECIFIC TYPE OF ERROR, MOVE

AWAY FROM THE CORRECT LABEL

Page 20: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

“Panda”

“Vulture”

“House”

“Truck”

Page 21: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

HOW BIG A STEP SHOULD YOU TAKE IF YOU WANT IMPERCEPTIBLE

DISTORTION?

Page 22: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

PIXELS ARE STORED AS SIGNED 8-BIT INTEGERS. ADD JUST LESS THAN1-BIT OF DISTORTION TO EACH PIXEL

0.07 <1

27⇡ 0.08

Page 23: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

WE WANT PRECISELY THIS AMOUNT OF DISTORTION, SO NO MATTER HOW

SMALL (OR BIG) THE GRADIENT, JUST TAKE THE SIGN OF IT AND MULTIPLY BY 0.07

x+ 0.07⇥ sign(rJ

x

)

Page 24: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

INCORPORATING ADVERSARIAL EXAMPLES INTO YOUR COST

FUNCTION

Page 25: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

GENERATE ADVERSARIAL EXAMPLES AT EACH ITERATION OF TRAINING, BUT

DON’T WANT TO KEEP THEM AROUND IN MEMORY FOREVER

Page 26: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

INSTEAD, MODIFY THE COST FUNCTION TO BE A COMBINATION OF ORIGINAL AND ADVERSARIAL INPUTS

Page 27: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

Parameters

New cost function

inputs

labels

eJ(✓,x, y) =

Page 28: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

Old cost functioneJ(✓,x, y) = J(✓,x, y) +

Page 29: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

Adversarial example

eJ(✓,x, y) = J(✓,x+ ✏signrx

J| {z }, y)J(✓,x, y) +

Old cost function

Page 30: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

eJ(✓,x, y) = J(✓,x, y) + J(✓,x+ ✏signrx

J, y)↵ (1� ↵)

mixing components

Page 31: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

eJ(✓,x, y) = J(✓,x, y) + J(✓,x+ ✏signrx

J, y)↵ (1� ↵)

“Train with a mix of original and adversarial examples”

Page 32: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

NOW DO S.G.D. ON THIS NEW COST FUNCTION, BY TAKING GRADIENTS W.R.T. WEIGHTS

w w � ⌘r eJw

Page 33: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

PART III - MISCELLANEOUS TIPS FOR TRAINING

Page 34: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

YOU NEED MORE MODEL CAPACITY(ADVERSARIAL EXAMPLES DO NOT LIE ON THE MANIFOLD OF REALISTIC IMAGES)

Page 35: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

FOR EARLY STOPPING, BASE YOUR DECISION ON THE VALIDATION ERROR

OF ADVERSARIAL EXAMPLES ONLY

Page 36: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

RESULTS

Page 37: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

BETTER GENERALIZATION ABOVE AND BEYOND DROPOUT

0.94% error(MNIST)

0.84% error

Page 38: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

BETTER GENERALIZATION ABOVE AND BEYOND DROPOUT

0.94% error(MNIST)

0.84% error

Page 39: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

RESISTANCE TO ADVERSARIAL EXAMPLES

89.4% error(97.6% confidence)

17.9% error

Page 40: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

MATHEMATICAL PROPERTIES OF ADVERSARIAL EXAMPLES

Page 41: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

MATHEMATICAL PROPERTIES OF ADVERSARIAL EXAMPLES

(Ain’t nobody got time for that)

Page 42: Class Presentationfidler/teaching/2015/slides/CSC2523/adverseri… · Title: Class Presentation Created Date: 3/22/2016 5:04:31 PM

THANK YOU FOR YOUR TIME!