ML based detection of users anomaly activities (20th OWASP Night Tokyo, English)

ML based detection of users anomaly activities

Yury Leonychev
ESG, Rakuten Inc.
OWASP Night, 9/3/2016

Agenda

• Case study presentation
• Workshop format

What | Where
IDE | Continuum Analytics Anaconda, https://www.continuum.io/downloads
Python3 + NumPy + SciPy + scikit-learn | https://www.python.org/downloads/, http://www.scipy.org/install.html
Model application | https://github.com/tracer0tong/buzzboard

Abstract problem definition

1. Browser based activity
   a. Normal user interacts with browser
   b. Web application generated activity

2. HTTP request activity
   a. Normal UA
   b. Headless browser or script/bot

3. Frontend/Backend data exchange

Methodology (CRISP-DM)
https://en.wikipedia.org/wiki/Cross_Industry_Standard_Process_for_Data_Mining

CRISP-DM process diagram: https://en.wikipedia.org/wiki/Cross_Industry_Standard_Process_for_Data_Mining#/media/File:CRISP-DM_Process_Diagram.png
By Kenneth Jensen. License: CC BY-SA 3.0

Model description

1. Business understanding – we want to classify users as “bad” or “good”, where “bad” users failed the CAPTCHA and “good” users passed it.

2. Data understanding – HTTP requests and the results of CAPTCHA checks.

3. Data preparation – collect requests and verify that the set is complete. Gather data from users and store it in a database.

4. Create the model. Define and tune settings for the Decision Tree.

5. Calculate errors, validate the model.

6. Deploy the model to production.
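Steps 4 and 5 can be sketched with scikit-learn as follows. This is a minimal illustration, not the buzzboard implementation: the three feature columns, the synthetic data generator `synthetic_request`, and the chosen values for `max_depth` and `min_samples_leaf` are all assumptions made for the example. The label stands in for the CAPTCHA result.

```python
import random

from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

random.seed(0)


def synthetic_request(is_bot):
    """Return a made-up feature row: [request size, URI length, header count]."""
    if is_bot:
        return [random.randint(100, 400), random.randint(5, 40), random.randint(2, 6)]
    return [random.randint(300, 1500), random.randint(10, 120), random.randint(6, 14)]


# Label 1 = "bad" (failed CAPTCHA), 0 = "good" (passed CAPTCHA).
labels = [i % 2 for i in range(400)]
rows = [synthetic_request(label == 1) for label in labels]

# Step 5 needs data the tree never saw, so hold out a quarter of the rows.
X_train, X_test, y_train, y_test = train_test_split(
    rows, labels, test_size=0.25, random_state=0, stratify=labels
)

# Step 4: max_depth and min_samples_leaf are examples of settings to tune.
model = DecisionTreeClassifier(max_depth=3, min_samples_leaf=5, random_state=0)
model.fit(X_train, y_train)

# Step 5: count mistakes on the held-out rows only.
predictions = model.predict(X_test)
test_accuracy = accuracy_score(y_test, predictions)
matrix = confusion_matrix(y_test, predictions, labels=[0, 1])
print("held-out accuracy:", test_accuracy)
print("confusion matrix (rows = true, columns = predicted):")
print(matrix)
```

The confusion matrix separates the two kinds of mistake: "good" users flagged as "bad", and "bad" users let through. Which of the two is more costly is a business decision rather than a modelling one.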

Feature extraction

Direct | Indirect
Size of HTTP request | IP address reputation
Length of URI address | User reputation
User Agent | History based features
Number of HTTP headers | Time based features
Response code/Response time | Business logic based features
… | …
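A few of the "direct" features can be computed from a single request, as in the sketch below. The `HttpRequest` record and the `direct_features` function are hypothetical names introduced for illustration, and the request size is an approximation based on character counts, not the number of bytes on the wire.

```python
from dataclasses import dataclass, field


@dataclass
class HttpRequest:
    """Hypothetical request record; not the structure used by buzzboard."""
    method: str
    uri: str
    headers: dict = field(default_factory=dict)
    body: bytes = b""


def direct_features(request):
    """Compute a few "direct" features from one request, with no history needed."""
    header_chars = sum(len(name) + len(value) for name, value in request.headers.items())
    user_agent = request.headers.get("User-Agent", "")
    return {
        # Approximate size: method + URI + header names/values + body.
        "request_size": len(request.method) + len(request.uri) + header_chars + len(request.body),
        "uri_length": len(request.uri),
        "header_count": len(request.headers),
        "has_user_agent": int(bool(user_agent)),
        "user_agent_length": len(user_agent),
    }


example = HttpRequest(
    method="GET",
    uri="/board?page=2",
    headers={"Host": "example.test", "User-Agent": "curl/7.50.1"},
)
features = direct_features(example)
print(features)
```

The "indirect" column cannot be computed this way, because reputation, history, and time based features need state kept across requests.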

Application workflow

Application workflow (Learning Mode)

Application workflow (Strict Mode)

Decomposition

Offline computations

• Offline with Hadoop, Spark (MLlib), Elasticsearch
• Realtime with Spark (Streams and MLlib), Kafka
• Same technologies available in AWS and Azure
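The split between offline training and request-time scoring can be illustrated at small scale with scikit-learn and `pickle`. This is a stand-in for the idea only, not the Hadoop/Spark stack listed above; the training rows, the two feature columns, and the `score` helper are assumptions made for the example.

```python
import pickle

from sklearn.tree import DecisionTreeClassifier

# --- Offline job (batch; would run on the cluster side in a real setup) ---
# Made-up rows: [request size, header count]; label 1 = "bad", 0 = "good".
training_rows = [[120, 3], [150, 2], [180, 4], [900, 10], [1100, 12], [700, 9]]
training_labels = [1, 1, 1, 0, 0, 0]

offline_model = DecisionTreeClassifier(max_depth=2, random_state=0)
offline_model.fit(training_rows, training_labels)
model_blob = pickle.dumps(offline_model)  # in practice: written to shared storage

# --- Request path (only loads the trained artifact and scores) ---
online_model = pickle.loads(model_blob)


def score(feature_row):
    """Return 1 for "bad", 0 for "good"; no training happens here."""
    return int(online_model.predict([feature_row])[0])


print(score([130, 3]), score([1000, 11]))
```

Keeping the expensive fitting step out of the request path means a request costs only one tree traversal. Note that `pickle` should only be used to load artifacts from a trusted source.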

Continuous experiment

Knowledge matters!

• You should understand what you are doing!
– Is it normal to have 1.0 accuracy?
– Could we measure Mean Squared Error for our model application?
– Have we already chosen the correct algorithm and parameters?
– Is this a correct feature?

METHODS = ['GET', 'POST', 'PUT', 'DELETE', 'OPTIONS', 'HEAD']

def MethodFeature(request):
    return METHODS.index(request.method)
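One possible reading of the question about `MethodFeature` (an interpretation, not something the slides state) is that a list index turns an unordered category into a number with an arbitrary order. A decision tree splits on thresholds, so a rule such as "index ≤ 1.5" groups GET with POST only because of their position in the list. In addition, `METHODS.index` raises `ValueError` for any method that is not listed. The sketch below shows a one-hot alternative; `method_one_hot` is a hypothetical name, and `SimpleNamespace` stands in for whatever request object the application uses.

```python
from types import SimpleNamespace

METHODS = ['GET', 'POST', 'PUT', 'DELETE', 'OPTIONS', 'HEAD']


def method_one_hot(request):
    """Encode request.method as one 0/1 column per known method, plus "other"."""
    vector = [int(request.method == known) for known in METHODS]
    # Unknown or malformed verbs (e.g. PATCH, lowercase "get") land in a
    # separate column and do not raise an exception.
    vector.append(int(request.method not in METHODS))
    return vector


print(method_one_hot(SimpleNamespace(method='POST')))   # [0, 1, 0, 0, 0, 0, 0]
print(method_one_hot(SimpleNamespace(method='PATCH')))  # [0, 0, 0, 0, 0, 0, 1]
```

The cost is seven columns in place of one; whether that trade is worthwhile depends on the classifier and on how much the method matters for the decision.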

Conclusion

• Use decomposition (different levels of classification)
• Use flexible feature collection
• Prefer offline computations
• Leave yourself room for experiments
• Don’t forget that ML integration is a continuous process
• Build up your knowledge of ML

QUESTIONS?

Yury Leonychev
ESG, Rakuten Inc.
OWASP Night, 9/3/2016
Yury.Leonychev@Rakuten.com
