Free simulation sandipchaudhari_2006

38
19-21 October 2006 Smashing Heap by Free Simulation Sandip Chaudhari [email protected] Acknowledgements Thanks to everyone in my Security Team for their support and encouragement, especially to Jonathan Leonard, Jeremy Jethro and Nick Seidenman.

description

The overflowed memory is given a value such that a previous call to free() is simulated, causing next malloc() call to misinterpret that the memory was free'd before. This technique is what we would call as - Free Simulation.

Transcript of Free simulation sandipchaudhari_2006

Page 1: Free simulation sandipchaudhari_2006

19-21 October 2006

Smashing Heap by Free Simulation

Sandip [email protected]

AcknowledgementsThanks to everyone in my Security Team for their support and encouragement, especially to Jonathan Leonard, Jeremy Jethro and Nick Seidenman.

Page 2: Free simulation sandipchaudhari_2006

Slide 2 / 38

Abstract

The exploit can be achieved without the need of any call to the free() function.

The overflowed memory is given a value such that a previous call to free() is simulated, causing next malloc() call to misinterpret that the memory was free'd before. We call this technique - Free Simulation.

Though the Free Simulation technique demonstrated in this paper, has been tried successfully on AIX, Solaris and Windows XP SP2 it may be applicable on all systems having in-band heap memory management.

Page 3: Free simulation sandipchaudhari_2006

Slide 3 / 38

Introduction

Almost all the papers referenced in the References section [1] through [7], discuss about heap overflows, that seem to talk or provide sample code snippet where free() is being called.

What if free() is never called and the process takes in user input that can lead to heap overflow? Is it still possible to exploit such a process that never calls free()? Answer to this is yes, and that's what this presentation is all about.

Page 4: Free simulation sandipchaudhari_2006

Slide 4 / 38

Core Ideas

Heap Overflow technique when the free() is being called, is usually referred to as 4-byte overwrite.

The core idea is “to attack the memory management algorithm, first (publicly?) demonstrated by Solar Designer for a heap overflow found in the Netscape browser” [3], [1]. This attack on memory management algorithm always necessarily involves pointer assignment instructions.

Page 5: Free simulation sandipchaudhari_2006

Slide 5 / 38

Logical Constructs

Lets refer to the primitive logical constructs involving these pointer assignments, that get executed on a call to free() [4 – Section: Anatomy of a Heap Overflow Exploit] & [5].

unlink() frontlink()

#define unlink(P, BK, FD) { [1] FD = P->fd; [2] BK = P->bk; [3] FD->bk = BK; [4] BK->fd = FD; }

#define frontlink(A, P, S, IDX, BK, FD) { [1] FD = start_of_bin(IDX); [2] while ( FD != BK && S < chunksize (FD) ) { [3] FD = FD->fd; } [4] BK = FD->bk; [5] FD->bk = BK->fd = P;}

Note: Both the above macros are a set of logical statements that explain pointer assignments. Either or both of these maybe executed on call to free(). “P” is the pointer that has been passed to be free'd.

Page 6: Free simulation sandipchaudhari_2006

Slide 6 / 38

What exactly is theFree Simulation?

Free Simulation is the allocation of address space on simulated free region in the memory with our choice of length, and in certain cases may be located anywhere we choose in the process address space. Free Simulation can be differentiated broadly into 2 cases:

Arbitrary buffer allocation – The heap datastructure pointers are manipulated such that the simulated free buffer space, when allocated can exist arbitrarily anywhere in process memory address space (Free Simulation on AIX, Free Simulation on Solaris (<40 bytes buffer))

Arbitrary address over-write (4-byte i.e word-size overwrite) – The heap datastructure pointers are manipulated such that the pointer assignments causes an address to be overwritten arbitrarily anywhere in the process memory address space (Free Simulation on Solaris (>=40 bytes buffer), Windows XP SP-2)

Page 7: Free simulation sandipchaudhari_2006

Slide 7 / 38

What exactly is theFree Simulation? (contd.)

An example of the usual state of heap with a few allocated chunks.

Heap

allocated unallocatedallocatedallocated

Page 8: Free simulation sandipchaudhari_2006

Slide 8 / 38

What exactly is theFree Simulation? (contd.)

Lets try to represent heap state at the moment of time when a number of chunks have been malloc'ed and a few have been free'd.headers

Heap in-band header with pointers to previously free'd chunks

allocated unallocatedfree'd allocated

Pointer to previously free'd chunk

Heap

Page 9: Free simulation sandipchaudhari_2006

Slide 9 / 38

What exactly is theFree Simulation? (contd.)

After the overflow and the Free Simulation

Target

Heap

Stack

Simulatedfree chunk

Pointer to previously free'd chunk

allocated unallocatedallocated[overflow]

allocated

Page 10: Free simulation sandipchaudhari_2006

Slide 10 / 38

Conditional Triggers

The conditional triggers are instructions in the malloc call that check if there is some previous or last free'd memory available (of appropriate size) that can be used to allocate the new chunk.

'if ( previous free'd chunk available && requested size <= available free'd size ) then ...‘

This logical free conditional primitive lies at the heart of successful Free Simulation. It triggers the execution of further pointer assignment instructions.

Page 11: Free simulation sandipchaudhari_2006

Slide 11 / 38

Free Simulation on Aix

Heap – Malloc'ed

chunks [MC]

Heap Data Structure [HDS]

<__heaps+1056>:

total heap space allocated

<__heaps+1040>: next free pointer

[NFP]

<__heaps+1064>:

total number of heapchunks malloc'ed

<__heaps+1068>:

pointer to current chunk [CP]

<__heaps+1076>:

pointer to the address of pointer of

previous free chunk[PPPF]

Actual malloc'ed memory

0x00000000 else pointer to previously

free'd chunk [PPF] User specified size of chunkelse real size of previously

free'd chunk

Unallocated memory

Unallocated memory else size of previously

free'd memory

0x00000000 else pointer to previously

free'd chunk [PPF]

Page 12: Free simulation sandipchaudhari_2006

Slide 12 / 38

Free Simulation on Aix (contd.)

Usually, the PPF (Pointer to Previously Free’d chunk) is NULL and NFP (Next Free Pointer) points to the last PPF (NULL), indicating availability of free memory

We will try to understand what happens if PPF is not NULL, which is very important for the success of exploitation by Free Simulation.

The PPF which is usually NULL, is assigned the value of the address of previously free'd chunk! The core idea is to overflow the malloc() allocated chunk by 2 words having the first word as an address so that it will be now interpreted as PPF and second word as some arbitrary size.

Page 13: Free simulation sandipchaudhari_2006

Slide 13 / 38

Free Simulation on Aix (contd.)

How about pointing PPF to the stack? Possible? Yes! In a way, we are smashing the heap, simulating free() and then smashing the stack!

Thus the simulated free space can be located anywhere in the process writable memory address space.

The reason this is possible is because the malloc() function trusts the address retrieved, as a valid heap address for memory allocation.

Page 14: Free simulation sandipchaudhari_2006

Slide 14 / 38

Free Simulation conditional trigger for Aix

if ( Pointer to Previously Free'd chunk [PPF] != NULL && requested_size <= chunk_size ) ... [A]

{consider the value of PPF as an address of previously free'd chunk and try to allocate memory on this free'd chunk

}

The above [A] is mere a logical summary of conditional instructions or statements. The “if” condition may have been actually implemented as “while”. The conditional statements make it very clear that triggering Free Simulation on Aix is quite easy.

Page 15: Free simulation sandipchaudhari_2006

Slide 15 / 38

Free Simulation on Solaris – I [size < 40 bytes]

Heap – Malloc'ed

chunks [MC]

Heap Data Structure [HDS]

Data: if allocated else, Next Free Pointer

[NFP]: if unallocatedelse, Pointer to Previous Free'd

chunk [PPF]: if free'dBased on bit0 and bit1 of size

0x0 or junk[alignment word]

Actual malloc memory

Available chunk size orallocated chunksize |OR| 'ed with

flags. [bit0 and bit1]

Available chunk size orallocated chunksize |OR| 'ed with

flags. [bit0 and bit1]

0x0 or junk[alignment word]

Unallocated Memory

Next Free Pointer [NFP]: if unallocated

else, Pointer to Previous Free'd chunk [PPF]: if free'd

Based on bit0 and bit1 of size

<List+4>:Next Free chunk Pointer

orPrevious Free'd chunk Pointer

<freeidx>:index or count of free'd chunk

in a bin's list

<flist – flist+124>:List of free'd pointers

<Lfree>:List of bins / nodes of flists

Page 16: Free simulation sandipchaudhari_2006

Slide 16 / 38

Free Simulation on Solaris – I [size < 40 bytes] (contd.)

On Solaris, 2 types of data-structures are involved in the heap management, based on chunk size asked for. If the requested chunk size is less than 40 bytes then the linked-lists are involved and different section of code gets executed, while for sizes greater than 40 bytes, binary search tree structure is maintained.

The decision to allocate or consider the previously free'd chunk of memory for allocation is based primarily on the bit0 and the bit1 of the size word on the Solaris. The size is specified in bytes and we get the last 2 bits free. bit0: 1 if chunk is allocated else 0. bit1: 1 if a previous block has been free'd

in local list of the bin else 0.

Page 17: Free simulation sandipchaudhari_2006

Slide 17 / 38

Free Simulation on Solaris – I [size < 40 bytes] (contd.)

The freelists are structured like lists in bins of various chunk sizes.

In case malloc finds that bit1 state is “1” it considers that there is a previously free'd memory chunk. The word next to the immediate next word is considered as the address pointing to the previously free’d chunk.

In case of Solaris, we overflow the allocated chunks' boundary such that the bit1 of the size in the header of next chunk is set to “1” and the word next to immediate-next word maybe given the address where we would like to overwrite, on the next memory write operation.

Page 18: Free simulation sandipchaudhari_2006

Slide 18 / 38

Free Simulation on Solaris – I [size < 40 bytes] (contd.)

As before in AIX, again, the simulated free space can be located anywhere in the process writable memory address space.

Again, the reason this is possible is because the malloc() function trusts the address retrieved, as a valid heap address for memory allocation.

Page 19: Free simulation sandipchaudhari_2006

Slide 19 / 38

Free Simulation conditional trigger for Solaris - I

If ( size.bit1 equals 1 ) .... [B]{

After size checks, consider address next to immediate-next word as previously free'd chunk and assign it to the Next Pointer of Heap Data Structure. Again, after size checks this simulated free space will be used to allocate memory on any call to malloc() in the future.

}

As stated before in case of AIX, the above [B] is mere logical summary of conditional instructions for Solaris. The conditional “if” is logical and it may be a conditional “while” in actual implementation.

Page 20: Free simulation sandipchaudhari_2006

Slide 20 / 38

Free Simulation on Solaris – II [size >= 40 bytes]

“Once upon a free()” paper [8] published in Phrack magazine demonstrates heap-overflow exploit by calls only to malloc() that further calls realfree().

The focus is on creation of the fake-chunk that leads to 4-byte overwrite when the heap-management data is manipulated.

The paper also clearly mentions that -- “Overflowed chunk must not be the last chunk”.

Again before, we will simulate free() by overflowing the last malloc'ed chunk. This is achieved using the 4-byte overwrite technique.

We take advantage of delayed free calls and achieve 4-byte overwrite in the realfree()'s coalesce operation. This is similar to exploit mentioned in [8] but differing by overflowing last malloc'ed chunk.

Page 21: Free simulation sandipchaudhari_2006

Slide 21 / 38

We will refer the opensolaris site [9] for source code to better understand the exploit.Source: mallint.h80 /* the proto-word; size must be ALIGN bytes */ 81 typedef union _w_ { 82 size_t w_i; /* an unsigned int */ 83 struct _t_ *w_p[2]; /* two pointers */ 84 } WORD; 86 /* structure of a node in the free tree */ 87 typedef struct _t_ { 88 WORD t_s; /* size of this element */ 89 WORD t_p; /* parent node */ 90 WORD t_l; /* left child */ 91 WORD t_r; /* right child */ 92 WORD t_n; /* next in link list */ 93 WORD t_d; /* dummy to reserve space for self-pointer */ 94 } TREE;

Few important macros.Source: mallint.h98 #define RSIZE(b) (((b)->t_s).w_i & ~BITS01)112 /* set/test indicator if a block is in the tree or in a list */ 113 #define SETNOTREE(b) (LEFT(b) = (TREE *)(-1)) 114 #define ISNOTREE(b) (LEFT(b) == (TREE *)(-1)) 121 #define NEXT(b) ((TREE *)(((char *)(b)) + RSIZE(b) + WORDSIZE))

Free simulation on Solaris – II [size >= 40 bytes] (contd.)

Page 22: Free simulation sandipchaudhari_2006

Slide 22 / 38

Free simulation on Solaris – II [size >= 40 bytes] (contd.)

Sections of functions relevant to our exploitSource: malloc.c – realfree()511 /* see if coalescing with next block is warranted */ 512 np = NEXT(tp); 513 if (!ISBIT0(SIZE(np))) { 514 if (np != Bottom) 515 t_delete(np); 516 SIZE(tp) += SIZE(np) + WORDSIZE; 517 }

Source: malloc.c – t_delete()756 /* if this is a non-tree node */757 if (ISNOTREE(op)) { 758 tp = LINKBAK(op); 759 if ((sp = LINKFOR(op)) != NULL) 760 LINKBAK(sp) = tp; 761 LINKFOR(tp) = sp; 762 return; 763 }

Note, the above highlighted assignments in orange (760 and 761) are the two word assignments, where user controlled data (8-byte overwrite in this case, but we will still refer it as 4-byte) can be injected. We can summarize above operation in instructions as:

0xff2c7808 <t_delete+52>: st %o0, [ %o1 + 8 ]0xff2c780c <t_delete+56>: st %o1, [ %o0 + 0x20 ]

Page 23: Free simulation sandipchaudhari_2006

Slide 23 / 38

Free simulation on Solaris – II [size >= 40 bytes] (contd.)

Malloc'ed heap chunk and overflow

We have 2 structures involved: t1.t_* and t2.t_*t_s : Size. We assign this to - 2 so that np = NEXT(p) will return np pointing to t1.t_j and bit0 is '0' for both t1.t_s and t2.t_s.t_j : As every pointer in this structure occupies 2 words owing to alignment logic, we can consider all t_j as junk.t_p : Pointer to previous node, can be junk for t1.t_p, and t2.t_p can be the address with which the return address on the stack is to be replaced. t_l : can be junk for t1.t_l but must be “-1” for t2.t_l, thus guarantee that malloc() would not interpret the node as a tree node but would interpret it as a list node.t_r : can be completely ignored and hence can be junk.t_n : t1.t_n can be junk but t2.t_n will be the address we would like to overwrite – 8.t_d : Maybe ignored and can be junk for both t1.t_d and t2.t_d.

t1.t_s [ > - 4 < 0 ]

Allocated memory

Unallocated memory

t2.t_d

t1

t2

t2.t_st1.t_j

t1.t_p t2.t_j

t2.t_pt1.t_j

t1.t_l t2.t_j

t2.t_lt1.t_j

t1.t_r t2.t_j

t2.t_rt1.t_j

t1.t_n t2.t_j

t2.t_nt1.t_j

t1.t_d t2.t_j

Page 24: Free simulation sandipchaudhari_2006

Slide 24 / 38

Free Simulation conditional trigger for Solaris - II

1. if ( size.bit0 equals 0 ) ....[C]

{ consider this as a free chunk, check if next chunk is also free and if

coalesce is possible. }2. if ( next chunk size.bit0 equals 0 ) {

Next chunk in contiguous memory block is free proceed with coalesce. }3. size should be such that NEXT(p) calculation will return our fake-chunk as next

chunk.4. The returned fake chunk should bypass is-bottom check [np != Bottom]. Would

be automatically taken care of.5. The value of left-node pointer t_l of fake chunk must be '-1' for interpretation as

list node rather than tree node.6. If ( value of left-node equals -1) ....[D] {

interpret it as list-node and proceed further with coalesce operation involving pointer assignments.

} As stated before for AIX, the above [C] and [D] are mere logical summaries of conditional

instructions for Solaris. Step [C] indicates Free Simulation. The remaining steps including

[D] indicate the trigger to coalesce, the fake-chunk creation, and the coalesce operation that involves pointer assignments for the linked-list.

Page 25: Free simulation sandipchaudhari_2006

Slide 25 / 38

Free Simulation - Windows XP SP2

4-byte overwrite or arbitrary 4*n bytes overwrite still possible on older windows = (windows < XP-SP2)

Since SP-2 MS introduced Heap Protection

Is Free Simulation still possible on XP SP2?

Page 26: Free simulation sandipchaudhari_2006

Slide 26 / 38

Windows Heap Overflow Exploit Research (Time Progression)

Halvar Flake - "Third Generation Exploitation"http://www.blackhat.com/presentations/win-usa-02/halvarflake-winsec02.ppt

David Litchfield - "Windows Heap Overflows"http://www.blackhat.com/presentations/win-usa-04/bh-win-04-litchfield/bh-win-04-litchfield.ppt

Matt Conover, Oded Horowitz - "Reliable Windows Exploits"http://cansecwest.com/csw04/csw04-Oded+Connover.ppt

Alexander Anisimov - "Defeating Windows XP SP2 Heap protection and DEP bypass"http://www.maxpatrol.com/defeating-xpsp2-heap-protection.pdf

Nicolas Falliere - A new way to bypass Windows heap protectionshttp://www.securityfocus.com/infocus/1846

Brett Moore - Exploiting Freelist[0] on Windows XP Service Pack 2 http://www.securiteam.com/securityreviews/5MP020UHFI.html

Page 27: Free simulation sandipchaudhari_2006

Slide 27 / 38

Free Simulation – Windows XP SP2

Presenters’ research did lead to possibility of heap overflow exploitation on SP2 using Free Simulation. It turned out though, to be very similar to something that has been discussed in Brett Moore’s paper, with few minor differences.

Similar to the examples shown in previous slides, we will be overwriting 4-byte word on stack address having a function return address, with an address now pointing to heap.

The value overwritten is partially controlled, pointing back to address containing the [stack’s address – 4].

Page 28: Free simulation sandipchaudhari_2006

Slide 28 / 38

Free Simulation – Windows XP SP2 Reaching Freelist[0]

The malloc() calls try to allocate a chunk of requested size in certain order shown below for chunks < 512k:1. Lookaside (for size <1k)2. Freelist [indices > 0] (for size <1k)3. Freelist[0] (for size >1k or if none found in 1, 2 for <1k)4. When not pointing to any free’d chunk, Freelist[0] points to

the free-region beyond last chuck. If such a case or when no free’d chunk in Freelist[0] with size big enough of the requested size, allocation takes place in the free-region, beyond last chunk

Our focus would be to make malloc() reach Freelist[0] and re-apply the concepts of Free-Simulation for successful exploitation.

Page 29: Free simulation sandipchaudhari_2006

Slide 29 / 38

Free Simulation – Windows XP SP2Library function calls

Many library functions use malloc() internally. These functions usually need varying chunk sizes.

Such functions form excellent candidates for this exploit technique, as they have greater chance of hitting Freelist[0].

Page 30: Free simulation sandipchaudhari_2006

Slide 30 / 38

Free Simulation – Windows XP SP2Library function calls

In our example we exploit the malloc() called by printf() function.

We will focus on exploitation and change of control flow using only one overflow.

Brett Moore’s paper aptly hints, that such technique if used, needs the address to constitute a valid instruction

We will see, one of the address of low level function

on stack called by malloc() itself does form valid instruction that gets executed, in our example. Our shell-code starts right from the next word!

Page 31: Free simulation sandipchaudhari_2006

Slide 31 / 38

Free Simulation – Windows XP SP2

HDS

Freelist[0]

Freelist[1]

HDS Header

Lookaside[0]

Lookaside[1]

_heap_alloc_dbg

HeapStack

Chunk Header

Simulated Free Chunk

Chunk Data

printf

Shell Codemalloc

Page 32: Free simulation sandipchaudhari_2006

Slide 32 / 38

Free Simulation – Windows XP SP2 Conditional Trigger

1. The allocation code must somehow reach Freelist[0]

2. Freelist[0] must point to the header of our simulated free chunk

3. The simulated free chunk’s size must be greater than the size of the requested chunk+8. This would trigger the re-link and our 4-byte overwrite.

4. The stack address at the function return pointer is overwritten with address pointing back to heap, should be interpretable as valid instruction.

Page 33: Free simulation sandipchaudhari_2006

Slide 33 / 38

Free Simulation – Windows XP SP2Demo!

Though exploiting heap overflow using Free Simulation on SP2 is still a possibility, Heap Protection definitely puts forth many limitations.

Page 34: Free simulation sandipchaudhari_2006

Slide 34 / 38

Advantages of the Free Simulation

Relatively easy to exploit.

Provides a consistent and generic model to pursue the heap overflow-based exploits.

For processes / applications where free() is never called, Free Simulation maybe the best technique to exploit.

Usually data-write follows after a chunk from malloc has been obtained, favoring Free Simulation exploitation.

Some heap algorithms do not actually free the memory at the free() call. This delayed/lazy free() is feasible due to certain supportive free-structures like free-list / flist (Solaris). Whenever a malloc() is called it internally calls free() or rather the realfree() (especially on Solaris) that actually free's the memory. Hence focus on malloc() calls might provide easier approach and save time.

Usually, malloc() and realloc() calls are called more frequently compared to free().

Exploitation can be triggered at a considerably earlier stage in a process's life cycle because of the fact that the malloc() (memory allocation) always precedes free().

Enables arbitrary overwrites anywhere in the process memory regions including stack, heap, function pointers, Procedure Linkage Table.

Page 35: Free simulation sandipchaudhari_2006

Slide 35 / 38

Limitations of Free Simulation

Usually works well and easily when the overflow occurs in last malloc'ed chunk. For overflows in in-between malloc'ed chunks, depends on implementation of the memory allocation algorithm.

On Windows XP SP2, can be triggered only for allocation of chunks in free-space pointed by the Freelist[0].

Page 36: Free simulation sandipchaudhari_2006

Slide 36 / 38

Preventive Measures

Best preventive measure is at the code-implementation level itself by altogether avoiding or by careful usage of function calls that may potentially lead to the memory overflows.

Implementation of heap algorithm with total removal of in-band memory management information between data can completely protect against any manipulation. Many such implementations are already available for *nix platforms and can be linked with the systems’ library. Some have also integrated such protection schemes into default distros (OpenBsd).

At system level NX [Non Executable pages], non-executable data region (that includes heap with stack, on AIX – sedmgr), cookies, write protected guard bands between heap data segments and heap management structures, can make heap overflow exploitation almost impossible.

Implementation and integration of such preventive measures by various operating systems is already pushing (4*1) or (4*n) memory over-writes in history.

Page 37: Free simulation sandipchaudhari_2006

Slide 37 / 38

References

1. http://md.hudora.de/presentations/summerschool/2005-09-21/vansprundel-ctt-heapoverflows.pdf - Generic Heap Overflow Exploitations.

2. http://www.blackhat.com/presentations/win-usa-02/halvarflake-winsec02.ppt – Third Generation Exploitation.

3. http://www.openwall.com/advisories/OW-002-netscape-jpeg/ - Solar Designer

4. https://www.usenix.org/publications/library/proceedings/lisa03/tech/full_papers/robertson/robertson_html/ - Run-time Detection of Heap-based Overflows (Anatomy of a Heap Overflow Exploit, Logical Constructs).

5. http://doc.bughunter.net/buffer-overflow/heap-corruption.html

6. http://www.w00w00.org/files/articles/heaptut.txt

7. http://cansecwest.com/csw04/csw04-Oded+Connover.ppt

8. http://www.phrack.org/phrack/57/p57-0x09 – Once Upon a free()

9. http://cvs.opensolaris.org/source/ - Solaris source code on OpenSolaris website

10. http://www.blackhat.com/presentations/win-usa-04/bh-win-04-litchfield/bh-win-04-litchfield.ppt David Litchfield - "Windows Heap Overflows"

11. http://www.maxpatrol.com/defeating-xpsp2-heap-protection.pdf Alexander Anisimov - “Defeating Windows XP SP2 Heap protection and DEP Bypass”

12. http://www.securityfocus.com/infocus/1846 Nicolas Falliere - A new way to bypass Windows heap protections

13. http://www.securiteam.com/securityreviews/5MP020UHFI.html Brett Moore - Exploiting Freelist[0] on Windows XP Service Pack

Page 38: Free simulation sandipchaudhari_2006

Slide 38 / 38

Questions

?