aleph - Malware analysis pipelining for the masses

56
aleph Malware analysis pipe-lining for the masses

Transcript of aleph - Malware analysis pipelining for the masses

Page 1: aleph - Malware analysis pipelining for the masses

alephMalware analysis pipe-lining for the

masses

Page 2: aleph - Malware analysis pipelining for the masses

Who we are?

Jan Seidl @jseidl

Aleph Project Lead Developer

*NIX/BSD freak

Digital tools blacksmith / python & C lover

Lousy guitar player

Coffee dependent

Hates printers, doesn't likes social networks anything

Selectively-social

Page 3: aleph - Malware analysis pipelining for the masses

Who we are?

Actually we are a bunch of people...

Page 4: aleph - Malware analysis pipelining for the masses

Malware

Page 5: aleph - Malware analysis pipelining for the masses

Definition

'Malware' is an umbrella term used to refer to a variety of

forms of hostile or intrusive software, including computer

viruses, worms, trojan horses, ransomware,

spyware, adware, scareware, and other malicious

programs.

It can take the form of executable code, scripts, active

content, and other software

Wikipedia

Page 6: aleph - Malware analysis pipelining for the masses

Malware growth

Page 7: aleph - Malware analysis pipelining for the masses

Detecting malware

Page 8: aleph - Malware analysis pipelining for the masses

Detecting malware● Signature-based

● Sample must be previously-known and flagged as malicious

● Heuristics-based● Can trigger loads of false-positives

● Behavior-based● Can trigger loads of false-positives

Page 9: aleph - Malware analysis pipelining for the masses

Detection is not enoughYou need to understand the malware

Page 10: aleph - Malware analysis pipelining for the masses

Understanding malware

● Features extraction

● Which characteristics this file has?

● Feature correlation

● Make sense of features combinations / disposition

● Sample correlation & Family classification

● Identify common features between different samples

Page 11: aleph - Malware analysis pipelining for the masses

Understanding malware

● Enables you to identify families

● Enables you to identify acting groups

● Enables you to identify techniques

● Enables you to identify trends

Page 12: aleph - Malware analysis pipelining for the masses

Manual approachEveryday workhorse method

Page 13: aleph - Malware analysis pipelining for the masses

Manual approach

● Use lots of separate tools to extract data from sample (each in its own format)

● Correlate output from the tools using spreadsheets, word files, napkins, tears

Page 14: aleph - Malware analysis pipelining for the masses
Page 15: aleph - Malware analysis pipelining for the masses

Manual approach

● Find out new samples embedded into original sample

● Rinse, repeat, get more whiskey/coffee

Page 16: aleph - Malware analysis pipelining for the masses
Page 17: aleph - Malware analysis pipelining for the masses

Automated approachPipeline method

Page 18: aleph - Malware analysis pipelining for the masses

Automated approach

● Insert sample into one end● Wait until processing is done● Get report on the other end● Get emotional about hours of work saved● Focus on most important evidences

Page 19: aleph - Malware analysis pipelining for the masses
Page 20: aleph - Malware analysis pipelining for the masses

Commercial playersGiving you some free analysis while you

feed their database for free

Page 21: aleph - Malware analysis pipelining for the masses

● Akana (Android files)● Anubis● BitBlaze Malware Analysis Service● Comodo Automated Analysis System

and Valkyrie● EUREKA Malware Analysis Internet

Service● Joe Sandbox Document Analyzer (PDF,

RTF and MS Office files)● Malwr● MASTIFF● VxStream Sandbox (Hybrid Analysis)● VirusTotal*● ThreatExpert● ThreatTrack● ViCheck● VisualThreat (Android files)● XecScan (PDF and MS Office files from

targeted attacks)

Page 22: aleph - Malware analysis pipelining for the masses

Bringing it to the massesGiving you the ability to spin up your

own commercial-grade malware research infrastructure

Page 23: aleph - Malware analysis pipelining for the masses

Components

Page 24: aleph - Malware analysis pipelining for the masses

Aleph ProcessFrom fetching samples to report building

Page 25: aleph - Malware analysis pipelining for the masses

Main Features

● Cross-platform (tested on: Windows, Linux, OS X)Almost all modules are pure-python

● Scalable + Easily Extensible

● Web Interface for browsing reports

Page 26: aleph - Malware analysis pipelining for the masses

From fetching samples to report building

Page 27: aleph - Malware analysis pipelining for the masses

Aleph Process

Sample

Sample data

Aleph Process

Sample Sample data

Page 28: aleph - Malware analysis pipelining for the masses

Aleph Process

Collector Sample Manager

Sample Queue

Sample Plugins

Data

Page 29: aleph - Malware analysis pipelining for the masses

Aleph Process: Collection

●Detect new file on medium (filesystem, email account etc)

●Check if meets predefined criteria (min/max size)

Page 30: aleph - Malware analysis pipelining for the masses

Aleph Process: Triage

●Detect file type (mimetype)

●Calculate hashes

●Add sample to process queue

Page 31: aleph - Malware analysis pipelining for the masses

Aleph Process: Processing

●Enumerate plugins suitable for sample mimetype

●Run plugins and extract features

●Save features as structured data into database

Page 32: aleph - Malware analysis pipelining for the masses

Aleph Process: Reporting

●Fetch sample information from database

●Generate report based on retrieved data

Page 33: aleph - Malware analysis pipelining for the masses

Currently supported files●Windows Portable Executable (PE) (exe,

cpl & dll)

Coming up support for:● Android APK

● PDF Documents

● Linux ELF

● iOS Apps

● URLs & Emails

● Apple Mach-O

● MS Office Documents

● SWF & Much more!

Page 34: aleph - Malware analysis pipelining for the masses

Aleph: Web Interface

Page 35: aleph - Malware analysis pipelining for the masses

Aleph: Web Interface

Page 36: aleph - Malware analysis pipelining for the masses

Aleph: Web Interface

Page 37: aleph - Malware analysis pipelining for the masses

Aleph: Web Interface

Page 38: aleph - Malware analysis pipelining for the masses

Aleph: Web Interface

Page 39: aleph - Malware analysis pipelining for the masses

Shifting the paradigmNew exciting features coming up

Page 40: aleph - Malware analysis pipelining for the masses

DecouplingDividing the functionality into

standalone components

Page 41: aleph - Malware analysis pipelining for the masses

Decoupling

Collector

Sample

WebInterface

AlephProcessor

RDBMSSample

DatastoreStorage

StorageConnector

Web UI Info

HTTP API, SSH...

TransportConnector

Transport

Executable, Email, URL,PDF, APK etc

Local File, FTP, Email etc Redis, RabbitMQ,Memcached

SQLite, MySQL,PostgreSQL, SQL Server

Filesystem, VXVault, Amazon S3, Azure etc

ElasticSearch

Sample + Initial Meta

Sample + Initial Meta

Send & Retrieve

Download

Sample + Initial Meta

Page 42: aleph - Malware analysis pipelining for the masses

Adapters & ConnectorsAbstracting functionality so you use

what you have, or what you're familiar with...

Page 43: aleph - Malware analysis pipelining for the masses

Adapters & Connectors

CollectorManager

Sample File Collector

E-Mail Collector

FTP Collector

Sample

Sample

Transport

TransportConnector

Redis, RabbitMQ,Memcached

Page 44: aleph - Malware analysis pipelining for the masses

Scoring SystemThe path to the Evil Index

Page 45: aleph - Malware analysis pipelining for the masses

Data Correlation PluginsAutomatic linking and enrichment of

gathered data

Page 46: aleph - Malware analysis pipelining for the masses

Data Visualization PluginsBecause everyone loves graphs and scatterplots

Page 47: aleph - Malware analysis pipelining for the masses

Online DisassemblyDisassembled code & memory dumps will be saved as a new sample/artifact

Page 48: aleph - Malware analysis pipelining for the masses

Tons of more cool plugins

●VxCage●Viper● Java Decompiler●Maltrieve

Page 49: aleph - Malware analysis pipelining for the masses

CollaborationWhat got us here in the first place

Page 50: aleph - Malware analysis pipelining for the masses

Collaboration

The power of many

Page 51: aleph - Malware analysis pipelining for the masses

Deployment TypesFrom small to huge scaling

Page 52: aleph - Malware analysis pipelining for the masses

Deployment Types

Deployed in a single host containing all the required services.

3rd Party Software Aleph Components

Redis Local Filesystem Elasticsearch SQLite

Collector Processor Web Interface

Page 53: aleph - Malware analysis pipelining for the masses

Deployment Types

Deployed across multiple hosts in order to achieve HP and HA.

Datastore Host GroupElasticsearch Cluster

Nodes

Transport Host GroupRabbitMQ Cluster Nodes

Processing Host GroupAleph Cluster Nodes

Web Interface Host Group

NGinx Cluster Nodes

Collection Host GroupAleph Cluster Nodes

Storage Host GroupDFS Cluster Nodes

Page 54: aleph - Malware analysis pipelining for the masses

Use, fork and contribute!

https://github.com/trendmicro/Aleph

Page 55: aleph - Malware analysis pipelining for the masses

Questions?We can try to answer some….

Page 56: aleph - Malware analysis pipelining for the masses

Thank you!

Jan Seidl @jseidl <[email protected]>

Slides: http://slideshare.net/jseidl Codes: https://github.com/jseidl