Strata San Jose 2016 - Reduce False Positives in Security

Transcript of Strata San Jose 2016 - Reduce False Positives in Security

Powerball Predictor

Photo Credit: Sean McGrath

A crystal ball tells me with 99% accuracy whether a Powerball prediction is a winner.

https://www.flickr.com/photos/mcgraths/3248483447/in/photolist-5X4jXi-6yxZ1a-CGf4U-66Tr3c-6ee5Rq-4hvaHA-iM1FMn-2jXobW-b6Wy1i-fs2WHp-nFuJt-54rhWJ-9vmRZL-7Ut5P3-54wa3g-4TYKY5-8V2KxT-da3A28-gMhZLV-CThDXh-Dn3ayt-6R8RM1-derkYj-73QdMF-8MojHP-daiKW9-d3ZCYA-dern3A-daxCQ3-dcDDna-oP14hJ-bnyK58-da3Asi-da3Abr-h1hb3G-bnyH3K-daxAXa-daxCNC-d3ZKDf-daxFoY-exQXYU-adoDkQ-qXoNsv-7v5B4X-7vB19q-85bu8F-7ki5n9-54mYde-aWPLh6-4yqu7S

Powerball Predictor

● ~300 million samples.
● ~3 million false positives.
● 1 true positive.
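The three numbers on this slide can be reproduced with a few lines of arithmetic. This is a minimal sketch, assuming "99% accuracy" means a 1% false positive rate on losing tickets and that the single winning ticket is flagged correctly; the function and parameter names are illustrative and not from the talk.

```python
def base_rate_summary(samples, true_winners, false_positive_rate):
    """Return (false_positives, precision) for a predictor that flags every
    true winner and wrongly flags a fixed fraction of the losers."""
    losers = samples - true_winners
    false_positives = round(losers * false_positive_rate)
    flagged = true_winners + false_positives
    precision = true_winners / flagged
    return false_positives, precision


false_positives, precision = base_rate_summary(
    samples=300_000_000, true_winners=1, false_positive_rate=0.01
)
print(f"false positives: {false_positives:,}")
print(f"chance a flagged ticket is a winner: {precision:.2e}")
```

Under these assumptions, roughly one flagged ticket in three million is a real winner, even though the predictor is "99% accurate".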

Powerball Predictor

The overwhelming majority of tickets are not winners.

Failing to recognize this is falling victim to the base rate fallacy.

Security Crystal Ball

The overwhelming majority of log entries and data points do not represent fraud and intrusions.

Failing to recognize this is falling victim to the base rate fallacy.

Fraud / Intrusion Detection System

Source: MXLabs

http://blog.mxlab.eu/2014/12/29/phishing-email-your-netflix-account-has-been-suspeded/

Base Rate Fallacy

Why False Positives?

Case Study: Outlier Detection

Using an outlier detection system to identify fraudsters within the environment.

For a set of generating mechanisms, find the unusual ones.
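One simple reading of "find the unusual ones" is to summarize each generating mechanism with a single number and flag those far from the group median. The sketch below uses a robust z-score based on the median absolute deviation (MAD). This is a generic technique swapped in for illustration, not the method used in the case study, and the account names, counts, and threshold are hypothetical.

```python
from statistics import median


def find_outliers(values_by_entity, threshold=3.5):
    """Flag entities whose value has a robust (median/MAD) z-score above threshold."""
    values = list(values_by_entity.values())
    center = median(values)
    mad = median(abs(v - center) for v in values)
    if mad == 0:
        return []  # no spread to measure against
    # 0.6745 rescales the MAD so the score is comparable to a standard z-score.
    return sorted(
        name
        for name, v in values_by_entity.items()
        if 0.6745 * abs(v - center) / mad > threshold
    )


# Hypothetical daily transaction counts per account.
daily_transactions = {"alice": 12, "bob": 15, "carol": 11, "dave": 14, "mallory": 240}
print(find_outliers(daily_transactions))
```

Note that an outlier found this way is only statistically unusual; whether it is fraud is a separate question, which is where the false positives come from.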

Example Time Series

Photo credit SuperCar-RoadTrip.fr under Creative Commons Attribution 2.0

Concept Drift

The data changes over time in unforeseen ways.

https://www.flickr.com/photos/xavier33300/15052804730/in/photolist-oWayq3-oW3LQq-e4YqAw-oV975K-66wm4g-pcovST-66B8md-5uPbDV-6KP3D9-5VhQtC-92osSH-mDdQG-or2WMB-fsicY7-bXNoqA-4xEjhm-4xDRCT-495Vvz-c2Cg3S-oVbx1m-kSsu4z-7cykK1-q5SXqg-C2bEC-oW7p6e-bPjkni-oW5DRv-oVyHA6-oYz2jo-pdiszz-bYvBeh-66wQRP-e4SLwX-r58ZaU-e4SPm8-5nnjfS-8XXrpQ-eNsVxi-mo1x3z-kLKuy5-aM6MW4-oV9nEw-495Vkt-5RJF4s-82XhV5-r2zZ5S-pcbxb3-pcz7Gg-cZEutW-oW6F25

https://creativecommons.org/licenses/by/2.0/legalcode

Solution: Feedback Loop

Explicit Feedback Loop

Photo credit Alan Levine under Creative Commons Attribution 2.0

https://www.flickr.com/photos/cogdog/14279306964/in/photolist-nKPbtE-fzviyT-3UaCt1-6KaJw3-61foih-61bb2t-5ZMwNj-7aQ4JT-6CmbTU-61bb4V-uepYJ-76DVSc-rTiuCw-d7KKVA-9E4LD8-7X3Pgt-7xUnZs-pFFHx-b1krRX-7xUnTC-4TGezD-7xUnWs-pX71kj-cojunw-4THtui-bxCqMR-8EqL2E-b9txnz-bzr6BP-64Qk5s-5ZHa5i-8tAUF4-5ZMnbW-5ZMngC-rpFkJ-5ZMnR7-4TFMCg-5xayDj-jibVc8-5ZHabP-5ZMnAY-5xazh7-5ZHaoc-5x6b9t-nQ5aQ3-mYboTU-5x6bmK-5ZH9FP-5ZHajM-4KRRtx

Implicit Feedback Loop

Fraud: Takeaways

- Concept Drift is a shift in behavior.
- Feedback combats concept drift.
- Implicit Feedback > Explicit Feedback
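To make "feedback combats concept drift" concrete, here is a minimal sketch of a threshold detector whose baseline is refreshed only by values that feedback has confirmed as benign. The class, its parameters, and the sample numbers are hypothetical; the talk does not describe an implementation, and the source of the verdict (explicit or implicit) is deliberately left open.

```python
from collections import deque
from statistics import mean, pstdev


class DriftAwareDetector:
    """Threshold detector whose baseline is refreshed by feedback."""

    def __init__(self, baseline, window=50, n_sigmas=3.0):
        self.baseline = deque(baseline, maxlen=window)
        self.n_sigmas = n_sigmas

    def is_alert(self, value):
        mu, sigma = mean(self.baseline), pstdev(self.baseline)
        return abs(value - mu) > self.n_sigmas * max(sigma, 1e-9)

    def feedback(self, value, was_benign):
        # Only behavior confirmed as benign may move the baseline,
        # so confirmed fraud never becomes the new normal.
        if was_benign:
            self.baseline.append(value)


detector = DriftAwareDetector([10, 11, 9, 10, 12, 10, 11, 9])
alert_before = detector.is_alert(20)   # behavior has drifted upward
for value in [19, 20, 21, 20, 19, 21]:
    detector.feedback(value, was_benign=True)
alert_after = detector.is_alert(20)
print(alert_before, alert_after)
```

Without the feedback calls, the detector would keep alerting on the new normal indefinitely, which is exactly the false positive pattern that drift produces.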

IDS: Anatomy of Successful Detection

Context: Security Analyst

Red team Kill Chain

Blue team Kill Chain

False positives: Lose Ability to Triage

Fact: You cannot salvage a false positive with Contextual Info or Visualization

What is a Successful detection?

Properties + Frameworks

A successful detection captures adversary TTPs from sensor data while ignoring expected activity

Source: @MSwannMSFT

Properties of a Successful Detection

Adaptable

Credible

Interpretable

Actionable

[Figure: chart with axes Sophistication of Algorithms (Basic to Advanced), Usefulness of Alerts (Less Useful to More Useful), and Security Domain Knowledge.]

Framework for a Successful detection

[Figure, built up over several slides: chart running from Basic to Advanced and from Less Useful to More Useful (Usefulness of Alerts), with a third dimension labeled Security Domain Knowledge. Labels on the chart: Outlier, Anomaly, Security Interesting Alerts. Arrows: Increase Complexity, Increase Domain Knowledge.]

Successful Detections incorporate Domain Knowledge

How to encode Domain Knowledge: Embrace Rules

• Business Heuristics to filter out the “Security interesting anomalies”

• Rules can take many forms:
  • TI feeds
  • IOCs, IOAs
  • TTPs

• Rules are awesome
  • Credible, Interpretable, Adaptable (to some extent), Actionable!
  • Highest Precision
  • Highest Recall

Three ways to combine Rules and ML

1. Above Machine Learning Systems
   a. Business heuristics to filter alerts
      i. “For account _foo_, only raise sev 2 alerts until March 28th, 2016”
   Work by Dan Mace et al., Microsoft

2. Below Machine Learning Systems
   a. Featurizations - “If IP address present in list of malicious IP dataset, flag 1”
   b. Utilizes Threat Intel feeds (Cymru, VirusTotal, FireEye)

3. Combining Rules and Machine Learning together using Markov Logic Networks
   Initial ideas given by Vinod Nair, MSR
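The first two options can be sketched together: one rule feeds the model as a feature (below), and one rule filters what the model emits (above). Everything here is a hypothetical illustration. The IP set stands in for a threat intel feed, `model_severity` is a placeholder for a trained model, and "only raise sev 2 alerts" is read as "let only severity 2 through", which is an assumption about the quoted heuristic.

```python
from datetime import date

# Hypothetical stand-in for a threat intel feed of known-bad IP addresses.
MALICIOUS_IPS = {"203.0.113.7", "198.51.100.23"}


def featurize(event):
    """Rule BELOW the model: turn a threat intel lookup into a 0/1 feature."""
    return {
        "ip_in_threat_intel": 1 if event["ip"] in MALICIOUS_IPS else 0,
        "failed_logins": event["failed_logins"],
    }


def model_severity(features):
    """Placeholder for a trained model: returns an alert severity
    (lower is more severe) or None for no alert."""
    if features["ip_in_threat_intel"]:
        return 2
    if features["failed_logins"] >= 5:
        return 3
    return None


# Rule ABOVE the model, after the example quoted on the slide.
SUPPRESSIONS = {"foo": {"only_severity": 2, "until": date(2016, 3, 28)}}


def should_raise(account, severity, today):
    """Apply business heuristics to the model's output."""
    if severity is None:
        return False
    rule = SUPPRESSIONS.get(account)
    if rule and today <= rule["until"]:
        return severity == rule["only_severity"]
    return True


def process(event, today):
    severity = model_severity(featurize(event))
    return should_raise(event["account"], severity, today)


noisy = {"account": "foo", "ip": "192.0.2.10", "failed_logins": 9}
bad_ip = {"account": "foo", "ip": "203.0.113.7", "failed_logins": 0}
print(process(noisy, date(2016, 3, 1)))   # sev 3 while the heuristic is active
print(process(bad_ip, date(2016, 3, 1)))  # sev 2 passes the heuristic
print(process(noisy, date(2016, 4, 1)))   # heuristic has expired
```

Keeping the heuristic outside the model means it can be added or expired without retraining, while the featurization rule lets the model weigh threat intel against other signals.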

Intuition

• Rules alone place a set of hard constraints on the set of possible worlds
• Let’s make them soft constraints: when a world violates a formula, it becomes less probable, not impossible
• Give each formula a weight (higher weight ⇒ stronger constraint)

Source: Lectures by Pedro Domingos

Example: Service Accounts

Domain Knowledge

• Interactive logons from service accounts cause attacks
• Similar service accounts tend to have similar logon behavior

Encode as First Order Logic

Associate Each Rule With the Learned Weight: 1.5, 1.1

Consider two service accounts: A, B

[Figure, built up over several slides: ground Markov network over the atoms Attack(A), Attack(B), InteractiveLogon(A), InteractiveLogon(B), Similar(A,A), Similar(A,B), Similar(B,A), Similar(B,B), annotated with the rule weights 1.5 and 1.1.]
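The service account example is small enough to evaluate by brute force. In a Markov Logic Network, the probability of a world is proportional to exp(Σ wᵢ·nᵢ), where nᵢ counts the true groundings of formula i in that world. The sketch below rests on assumptions: the first-order formulas are not preserved in this transcript, so the two rules are encoded as InteractiveLogon(x) ⇒ Attack(x) and Similar(x,y) ⇒ (InteractiveLogon(x) ⇔ InteractiveLogon(y)), and the weights 1.5 and 1.1 are assigned to them in the order shown. Enumerating worlds is for illustration only and is not how a tool such as Alchemy performs inference.

```python
from itertools import product
from math import exp

ACCOUNTS = ["A", "B"]
W_LOGON_IMPLIES_ATTACK = 1.5  # assumed to belong to the first rule
W_SIMILAR_BEHAVIOR = 1.1      # assumed to belong to the second rule


def world_weight(logon, attack, similar):
    """Unnormalized weight exp(sum_i w_i * n_i), where n_i counts the
    true groundings of formula i in this world."""
    n1 = sum((not logon[x]) or attack[x] for x in ACCOUNTS)
    n2 = sum(
        (not similar[(x, y)]) or (logon[x] == logon[y])
        for x in ACCOUNTS
        for y in ACCOUNTS
    )
    return exp(W_LOGON_IMPLIES_ATTACK * n1 + W_SIMILAR_BEHAVIOR * n2)


def prob_attack(target, evidence_logon, similar):
    """P(Attack(target) | evidence) by enumerating all remaining worlds."""
    free = [x for x in ACCOUNTS if x not in evidence_logon]
    total = hit = 0.0
    for logon_bits in product([False, True], repeat=len(free)):
        logon = {**evidence_logon, **dict(zip(free, logon_bits))}
        for attack_bits in product([False, True], repeat=len(ACCOUNTS)):
            attack = dict(zip(ACCOUNTS, attack_bits))
            w = world_weight(logon, attack, similar)
            total += w
            if attack[target]:
                hit += w
    return hit / total


# Evidence: A and B are similar, and A was seen logging on interactively.
similar = {(x, y): True for x in ACCOUNTS for y in ACCOUNTS}
p_a = prob_attack("A", {"A": True}, similar)
p_b = prob_attack("B", {"A": True}, similar)
print(f"P(Attack(A)) = {p_a:.3f}, P(Attack(B)) = {p_b:.3f}")
```

Because the constraints are soft, a world in which A logs on interactively without an attack is less probable rather than impossible, and the similarity rule lets evidence about A raise suspicion of B.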

• How to learn the structure?
  • Begin with hand-coded rules
  • Use Inductive Logic Programming, but need to infer arbitrary clauses

• How to learn the weights?
  • For generative learning, depend on pseudolikelihood

• Check out Alchemy -- http://alchemy.cs.washington.edu/

Call for Action - After the conference

• One Week
  • Review
    • @CodyRioux - IPython Notebook
    • @Ram_ssk - Follow Up material
  • Think comprehensively about Rules

• One Month
  • Ask your data scientists to review the Literature section
  • Implement the rules on TOP of ML systems

• One Quarter
  • Implement a feedback system to capture training data
  • Implement all TI feeds within an ML System
  • Play with Alchemy

Literature

● The Base-Rate Fallacy and its Implications for the Difficulty of Intrusion Detection (Axelsson, 1999)

● Enhancing Performance Prediction Robustness by Combining Analytical Modeling and Machine Learning (Didona et al., 2015)

● Markov Logic Networks (Richardson and Domingos, 2006). Machine Learning 62(1-2): 107-136.
