Simseer.com - Malware Similarity and Clustering Made Easy

24
Simseer.com Malware Similarity and Clustering Made Easy Silvio Cesare <[email protected]>

description

 

Transcript of Simseer.com - Malware Similarity and Clustering Made Easy

Page 1: Simseer.com - Malware Similarity and Clustering Made Easy

Simseer.comMalware Similarity and Clustering Made Easy

Silvio Cesare <[email protected]>

Page 2: Simseer.com - Malware Similarity and Clustering Made Easy

Introduction• Simseer.com is a set of web services to analyse

malware using program structure as a signature.. Why?

• AV String signatures not very robust.

• Can’t detect ‘approximate’ matches.

• Hard to generate signature for an entire family.

• Program structure improves signature-based methods.

Page 3: Simseer.com - Malware Similarity and Clustering Made Easy

Who am I?

•Ph.D. Student at Deakin University.

•Presented at Ruxcon, Black Hat, AusCERT, etc.

•Published in academia.

•Book author

•Recently relocated to Canberra.

Page 4: Simseer.com - Malware Similarity and Clustering Made Easy

Outline

1. Introduction

2. Simseer.com’s Malware Services

3. Supporting Infrastructure

4. Other Services

5. Conclusion

Page 5: Simseer.com - Malware Similarity and Clustering Made Easy

Signatures

•In my other presentations.•Signature is based on ‘set of control flow

graphs’

Page 6: Simseer.com - Malware Similarity and Clustering Made Easy

Signature Extraction

•Transform ‘set of control flow graphs’ into a ‘feature vector’

•Decompilation + N-Grams

L_0

L_3

L_6

L_7L_1

L_2 L_4

L_5

true

true

true

true

true

W|IEH}Rproc(){L_0: while (v1 || v2) {L_1: if (v3) {L_2: } else {L_4: }L_5: }L_7: return;}

W|IEH}R

W|IE|IEHIEH}EH}R

Page 7: Simseer.com - Malware Similarity and Clustering Made Easy

Simseer

•Begin start of demo...

•A revamp of my existing http://www.FooCodeChu.com service.

•Submit an archive of malware samples.

•Results▫A similarity matrix comparing samples.▫An evolutionary tree showing relationships.

Page 8: Simseer.com - Malware Similarity and Clustering Made Easy

Submission Page

Page 9: Simseer.com - Malware Similarity and Clustering Made Easy

Results

Page 10: Simseer.com - Malware Similarity and Clustering Made Easy

Simseer

•Demo complete...

•Use ‘distance between vectors’ to show similarity.

•Visualize using phylogenetics software.

Page 11: Simseer.com - Malware Similarity and Clustering Made Easy

SimseerCluster• Begin demo...

• A new service.

• Submit an archive of malware samples.

• Define the number of clusters.

• Results▫ Samples grouped into clusters.▫ Cross checking samples with AV.▫ Identification of families.

Page 12: Simseer.com - Malware Similarity and Clustering Made Easy

Submission Page

Page 13: Simseer.com - Malware Similarity and Clustering Made Easy

Results

Page 14: Simseer.com - Malware Similarity and Clustering Made Easy

SimseerCluster

•Demo complete...

•Use ‘similarity matrix’ and ‘cosine similarity’.

•Pass to ‘cluster analysis software’ – The Weka Machine Learning Toolkit.

•Use Hierarchical clustering.

Page 15: Simseer.com - Malware Similarity and Clustering Made Easy

SimseerSearch• Begin demo...

• A new service.

• Submit a malware sample.

• Specify threshold of similarity.

• Results▫ All samples in database similar to query.▫ An AV report.▫ Heuristics to detect obfuscations (packing).

Page 16: Simseer.com - Malware Similarity and Clustering Made Easy

Submission Page

Page 17: Simseer.com - Malware Similarity and Clustering Made Easy

Results

Page 18: Simseer.com - Malware Similarity and Clustering Made Easy

SimseerSearch

•Demo complete...

•Use ‘nearest neighbour similarity search’ based on ‘Euclidean distance’.

•Packer detection based on entropy analysis.

q

Query Malicious

Query Benign

d(p,q)

p

r

Malware

Query

Page 19: Simseer.com - Malware Similarity and Clustering Made Easy

Supporting Infrastructure

Page 20: Simseer.com - Malware Similarity and Clustering Made Easy

Other Services

•Other services on the same infrastructure▫Clonewise▫Bugwise

Page 21: Simseer.com - Malware Similarity and Clustering Made Easy

Clonewise – Detecting embedded libraries.

Page 22: Simseer.com - Malware Similarity and Clustering Made Easy

Bugwise on real Debian Linux binaries

Page 23: Simseer.com - Malware Similarity and Clustering Made Easy

Future Work

•Integrate Cuckoo sandbox▫Unpacking with Volatility.▫Non EXE formats (PDF, DOC, etc).▫API Call classification (non signature-

based).

Page 24: Simseer.com - Malware Similarity and Clustering Made Easy

Conclusion

•Free services.

•Control flow better than traditional string signatures.

•Try it!

•http://www.simseer.com