Spring 2014 Program Analysis and Verification Lecture 9: Abstract Interpretation I


Roman Manevich, Ben-Gurion University

Syllabus

• Semantics
  – Natural Semantics
  – Structural semantics
  – Axiomatic Verification
• Static Analysis
  – Automating Hoare Logic
  – Control Flow Graphs
  – Equation Systems
  – Collecting Semantics
• Abstract Interpretation fundamentals
  – Lattices
  – Galois Connections
  – Fixed-Points
  – Widening/Narrowing
  – Domain constructors
  – Interprocedural Analysis
• Analysis Techniques
  – Numerical Domains
  – Alias analysis
  – Shape Analysis
• Crafting your own
  – From proofs to abstractions
  – Systematically developing transformers

Previously

• Another static analysis example – constant propagation

• Basic concepts in static analysis
  – Control flow graphs
  – Equation systems
  – Collecting semantics
  – (Trace semantics)

Annotating programs

Annotate(P, S) =
  case S is x:=aexpr
    return {P} x:=aexpr {F*[x:=aexpr] P}
  case S is S1; S2
    let Annotate(P, S1) be {P} A1 {Q1}
    let Annotate(Q1, S2) be {Q1} A2 {Q2}
    return {P} A1; {Q1} A2 {Q2}
  case S is if bexpr then S1 else S2
    let Pt = F[assume bexpr] P
    let Pf = F[assume ¬bexpr] P
    let Annotate(Pt, S1) be {Pt} A1 {Q1}
    let Annotate(Pf, S2) be {Pf} A2 {Q2}
    return {P} if bexpr then {Pt} A1 {Q1} else {Pf} A2 {Q2} {Q1 ⊔ Q2}
  case S is while bexpr do S
    N := Nc := P // Initialize
    repeat
      let Pt = F[assume bexpr] Nc
      let Annotate(Pt, S) be {Nc} Abody {N}
      Nc := Nc ⊔ N
    until N = Nc
    return {P} INV= {N} while bexpr do {Pt} Abody {F[assume ¬bexpr](N)}

Collecting semantics example: input 1

  label0: if x <= 0 goto label1
          x := x - 1
          goto label0
  label1:

[Figure: control-flow graph with nodes "if x > 0" and "x := x - 1", animated over several slides. Sets of states such as [x↦1], [x↦2], [x↦3], … are propagated along the edges, ad infinitum, until a fixed point is reached.]

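Predicates at fixed point

  label0: if x <= 0 goto label1
          x := x - 1
          goto label0
  label1:

[Figure: the same control-flow graph annotated with predicates: {true} at the loop head, {x>0} after the test x > 0, {x≥0} after x := x - 1, and {x≤0} at label1.]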

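Equational definition example

• A vector of variables R[0, 1, 2, 3, 4]
• R[0] = {x∈Z} // established input
  R[1] = R[0] ∪ R[4]
  R[2] = R[1] ∩ {s | s(x) > 0}
  R[3] = R[1] ∩ {s | s(x) ≤ 0}
  R[4] = ⟦x:=x-1⟧ R[2]
• A (recursive) system of equations
• Intersecting with {s | s(x) > 0} is the semantic function for assume x>0
• ⟦x:=x-1⟧ is the semantic function for x:=x-1 lifted to sets of states

[Figure: control-flow graph with nodes "if x > 0" and "x := x-1"; R[2] and R[4] label the edges into and out of the assignment.]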
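The equation system above can be solved by repeated substitution. The following is a minimal sketch, assuming a state is just the integer value of x and the established input is a small finite set (the slide uses all of Z, which cannot be enumerated); the function name `solve_collecting` is made up for illustration.

```python
def solve_collecting(inputs):
    """Iterate the equations for R[0..4] until nothing changes."""
    r = [set() for _ in range(5)]            # start with every R[i] empty
    while True:
        new = [
            set(inputs),                     # R[0] = established input
            r[0] | r[4],                     # R[1] = R[0] ∪ R[4]
            {x for x in r[1] if x > 0},      # R[2] = R[1] ∩ {s | s(x) > 0}
            {x for x in r[1] if x <= 0},     # R[3] = R[1] ∩ {s | s(x) ≤ 0}
            {x - 1 for x in r[2]},           # R[4] = ⟦x:=x-1⟧ R[2]
        ]
        if new == r:                         # fixed point reached
            return r
        r = new


result = solve_collecting({1, 2, 3})
for i, states in enumerate(result):
    print(f"R[{i}] = {sorted(states)}")
```

Because every right-hand side is monotone and the reachable values are bounded by the finite input, the loop terminates; with an unbounded input set it would not.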

General definition

• A vector of variables R[0, …, k], one per input/output of a node
  – R[0] is for entry
• For node n with multiple predecessors add equation
  R[n] = ∪{R[k] | k is a predecessor of n}
• For an atomic operation node R[m] S R[n] add equation
  R[n] = ⟦S⟧ R[m]
• Transform if b then S1 else S2
  to (assume b; S1) or (assume ¬b; S2)

[Figure: control-flow graph with nodes "if x > 0" and "x := x-1"; R[2] and R[4] label the edges into and out of the assignment.]

Current lecture

• Semantic domains
  – Preorders
  – Partial orders (posets)
  – Pointed posets
  – Ascending/descending chains
  – The height of a poset
  – Join and Meet operators
  – Complete lattices
  – Constructing new lattices from old

Appendix A.


Abstract interpretation theory [1977]

Abstract Interpretation [CC77]

• A very general mathematical framework for approximating semantics
  – Generalizes Hoare Logic
  – Generalizes weakest precondition calculus
• Allows designing sound static analysis algorithms
  – Usually compute by iterating to a fixed-point
  – Not specific to any programming language style
• Results of an abstract interpretation are (loop) invariants
  – Can be interpreted as axiomatic verification assertions and used for verification

Annotating programs (revisited)

The Annotate(P, S) procedure from the previous lecture approximates in two places:

• F*[x:=aexpr] approximates the concrete semantics: sp(x:=aexpr, P) ⇒ F*[x:=aexpr] P
• The join used in {Q1 ⊔ Q2} and in Nc := Nc ⊔ N approximates disjunction

The consequence rule permits this kind of weakening:

  { P’ } S { Q’ }
  ───────────────  [consp] if P ⇒ P’ and Q’ ⇒ Q
  { P } S { Q }

The big picture

• Use semantic domains to define both concrete semantics and abstract semantics
• Relate semantics in a sound way
• Interpret program over abstract semantics

[Figure: two parallel rows. Top row: set of states → (collecting semantics of statement S) → set of states. Bottom row: abstract representation of sets of states → (abstract semantics of statement S) → abstract representation of sets of states. Vertical arrows labelled "abstraction" and "meaning" connect the two rows on each side.]

A theory of semantic domains

1. Approximating elements
2. Approximating sets of elements

Overall idea

• A semantic domain can be used to define properties (representations of predicates)
  – Also called abstract states
• Common representations
  – Logical formulas
  – Automata
  – Specialized graphs

A taxonomy of semantic domain types

• Complete Lattice (D, ⊑, ⊔, ⊓, ⊥, ⊤)
• Lattice (D, ⊑, ⊔, ⊓, ⊥, ⊤)
• Join semilattice (D, ⊑, ⊔, ⊤)
• Meet semilattice (D, ⊑, ⊓, ⊥)
• Complete partial order (CPO) (D, ⊑, ⊥)
• Partial order (poset) (D, ⊑)
• Preorder (D, ⊑)

preorders

Preorder

• Let D be a set of elements
• We say that a binary order relation ⊑ over D is a preorder if the following conditions hold for every d, d’, d’’ ∈ D
  – Reflexive: d ⊑ d
  – Transitive: d ⊑ d’ and d’ ⊑ d’’ implies d ⊑ d’’
• There may exist d, d’ such that d ⊑ d’ and d’ ⊑ d yet d ≠ d’
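For a finite set, the two preorder conditions can be checked by brute force. The sketch below is only an illustration: the helper names and the example relation (integers compared by absolute value) are not from the lecture, and were chosen because they give a preorder in which distinct elements are related in both directions.

```python
from itertools import product


def is_preorder(elems, leq):
    """Check reflexivity and transitivity of leq over a finite set."""
    reflexive = all(leq(d, d) for d in elems)
    transitive = all(
        leq(a, c)
        for a, b, c in product(elems, repeat=3)
        if leq(a, b) and leq(b, c)
    )
    return reflexive and transitive


def equivalent_pairs(elems, leq):
    """Distinct elements related in both directions (anti-symmetry fails)."""
    return [
        (a, b)
        for a, b in product(elems, repeat=2)
        if a != b and leq(a, b) and leq(b, a)
    ]


# Hypothetical example: compare integers by absolute value.
elems = [-2, -1, 0, 1, 2]
by_abs = lambda a, b: abs(a) <= abs(b)

print(is_preorder(elems, by_abs))
print(equivalent_pairs(elems, by_abs))
```

Here -1 and 1 are each below the other yet different, which is exactly the situation the last bullet describes.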

Preorder examples

• SAV-predicates
  – SAV-factoids Φ = { x = y | x, y ∈ Var } ∪ { x = y + z | x, y, z ∈ Var }
  – SAV-predicates = 2^Φ
  – Order relation 1: P1 ⊑set P2 iff P1 ⊇ P2
  – Order relation 2: P1 ⊑imp P2 iff P1 ⇒ P2
  – Which order relation is stronger (contains more pairs)?
  – Which order relation is easier to check?
  – What if both P1 and P2 are in the image of explicate?

SAV preorder 1: P1 ⊑set P2 iff P1 ⊇ P2

[Figure: Hasse diagram for Var = {x, y}, listed by level from top to bottom]
  {x=y} {y=x} {x=x+x} {y=y+y} {y=x+y} {y=y+x} {x=x+y} {x=y+x}
  {x=y, y=x} {x=y, x=x+x} {x=x+y, x=y+x} …
  {x=y, x=x+x, x=x+y} …
  {x=y, y=x, x=x+x, y=y+y, y=x+y, y=y+x, x=x+y, x=y+x}

SAV preorder 2: P1 ⊑imp P2 iff P1 ⇒ P2

[Figure: Hasse diagram for Var = {x, y}, listed by level from top to bottom]
  {x=y} {y=x} {x=x+x} {y=y+y} {y=x+y} {y=y+x} {x=x+y} {x=y+x}
  {x=y, y=x} {x=y, x=x+x} {x=x+y, x=y+x} …
  {x=y, x=x+x, x=x+y} …
  {x=y, y=x, x=x+x, y=y+y, y=x+y, y=y+x, x=x+y, x=y+x}

Preorder examples

• CP-predicates
  – CP-factoids Φ = { x = c | x ∈ Var, c ∈ Z }
  – CP-predicates = 2^Φ
  – Order relation 1: P1 ⊑set P2 iff P1 ⊇ P2
  – Order relation 2: P1 ⊑imp P2 iff P1 ⇒ P2
  – Is there a difference?
    • {x=5, x=7, x=9} ⊑set {x=5, x=7}
    • {x=5, x=7, x=9} ⊑imp {x=5, x=7}
    • {x=5, x=7} ⊑imp {x=5, x=7, x=9}

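CP preorder example

[Figure: Hasse diagram for Var = {x}: a single row of singletons]
  … {x=-3} {x=-2} {x=-1} {x=0} {x=1} {x=2} {x=3} …

CP preorder example

[Figure: Hasse diagram for Var = {x, y}, listed by level from top to bottom]
  … {x=-3} {x=0} {x=3} {y=-5} {y=0} {y=36} …
  {x=-3, y=-5} {x=0, y=0} {x=3, y=36}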

The problem with preorders

• Equivalent elements have different representations
  – {x=y, x=a+b} S {Q}
  – {x=y, y=a+b} S {Q’}
• Leads to unpredictability
• Which result should our static analysis give?
• Example with S = assume y≠a+b
  – {x=y, x=a+b} assume y≠a+b {x=y, x=a+b}
  – {x=y, y=a+b} assume y≠a+b {false}
• Example with S = assume x≠a+b
  – {x=y, x=a+b} assume x≠a+b {false}
  – {x=y, y=a+b} assume x≠a+b {x=y, y=a+b}

In practice many static analyses still use preorders
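The unpredictability can be reproduced with a purely syntactic transformer. The sketch below assumes predicates are sets of factoid strings and that F[assume lhs≠rhs] reports a contradiction only when the factoid lhs=rhs literally occurs in the set; both the representation and the helper name `assume_neq` are illustrative assumptions, not the lecture's definition.

```python
FALSE = frozenset({"false"})


def assume_neq(pred, lhs, rhs):
    """Syntactic F[assume lhs != rhs]: a contradiction is detected only
    if the factoid 'lhs=rhs' appears verbatim in the predicate."""
    return FALSE if f"{lhs}={rhs}" in pred else pred


# Two logically equivalent predicates with different representations.
p1 = frozenset({"x=y", "x=a+b"})
p2 = frozenset({"x=y", "y=a+b"})

print(assume_neq(p1, "y", "a+b") == p1)     # contradiction missed
print(assume_neq(p2, "y", "a+b") == FALSE)  # contradiction found
```

Both inputs imply y=a+b, so a semantic analysis would return false in both cases; the syntactic one depends on which representative it happened to receive.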

Partial orders

Partially ordered sets (partial orders)

• A partially ordered set (poset for short) is a pair (D, ⊑)
• D is a set of elements – a semantic domain
• ⊑ is a partial order between pairs of elements from D. That is, ⊑ ⊆ D × D with the following properties, for all d, d’, d’’ in D
  – Reflexive: d ⊑ d
  – Transitive: d ⊑ d’ and d’ ⊑ d’’ implies d ⊑ d’’
  – Anti-symmetric: d ⊑ d’ and d’ ⊑ d implies d = d’
• If d ⊑ d’ and d ≠ d’ we write d ⊏ d’

Makes it easier to choose the best element


SAV partial order

• SAV-predicates
  – SAV-factoids Φ = { x = y | x, y ∈ Var } ∪ { x = y + z | x, y, z ∈ Var }
  – SAV-predicates = 2^Φ
• Order relation 1: P1 ⊑set P2 iff P1 ⊇ P2
  Is this a partial order?
• Order relation 2: P1 ⊑imp P2 iff P1 ⇒ P2, that is, models(P1) ⊆ models(P2)
  Is this a partial order?
• Order relation 3: P1 ⊑set* P2 iff Explicate(P1) ⊑set Explicate(P2)
  Is this a partial order?

CP partial order

• CP-predicates
  – CP-factoids Φ = { x = c | x ∈ Var, c ∈ Z }
  – CP-predicates = 2^Φ
• Order relation 1: P1 ⊑set P2 iff P1 ⊇ P2
  Is it a partial order?
• Order relation 2: P1 ⊑imp P2 iff P1 ⇒ P2
  Is it a partial order?

Can we define a more precise partial order?

CP partial order

• CP-predicates
  – CP-factoids Φ = { x = c | x ∈ Var, c ∈ Z }
  – CP-predicates = 2^Φ ∪ {false}
  – Define reduce : 2^Φ → 2^Φ ∪ {false}
    reduce(P) = if exists {x=c1, x=c2} ⊆ P with c1 ≠ c2 then {false} else P
  – CPfalse = { P ∈ 2^Φ | P = reduce(P) } ∪ {false}
• Order relation: P1 ⊑ P2 if P1 ⊇ P2 or P1 = {false}
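A possible encoding of reduce and the order relation is sketched below. It assumes a factoid x=c is a (variable, constant) pair and a predicate is a frozenset of such pairs, with a distinguished value standing for {false}; these representation choices are illustrative and not prescribed by the slides.

```python
FALSE = frozenset({"false"})   # stands for the special element {false}


def reduce(p):
    """Collapse any predicate containing x=c1 and x=c2 (c1 != c2) to {false}."""
    if p == FALSE:
        return FALSE
    seen = {}
    for var, const in p:
        # setdefault returns the constant already recorded for var, if any
        if seen.setdefault(var, const) != const:
            return FALSE
    return p


def leq(p1, p2):
    """P1 ⊑ P2 if P1 ⊇ P2 or P1 = {false}; both arguments are assumed reduced."""
    return p1 == FALSE or p1 >= p2


a = frozenset({("x", 5), ("y", 7)})
b = frozenset({("x", 5)})
c = frozenset({("x", 5), ("x", 7)})    # contradictory

print(reduce(c) == FALSE)
print(leq(a, b), leq(b, a))
print(leq(reduce(c), a))
```

After reduction every contradictory predicate has the single representative {false}, so two elements that are below each other are equal, which is what makes the relation a partial order rather than a preorder.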

Pointed poset

• A poset (D, ⊑) with a least element ⊥ is called a pointed poset
  – For all d∈D we have that ⊥ ⊑ d
• The pointed poset is denoted by (D, ⊑, ⊥)
• We can always transform a poset (D, ⊑) into a pointed poset by adding a special bottom element
  (D ∪ {⊥}, ⊑ ∪ {⊥ ⊑ d | d∈D}, ⊥)
• Example: CPfalse = { P ∈ 2^Φ | P = reduce(P) } ∪ {false}

chains

Chains

• If d ⊑ d’ and d ≠ d’ we write d ⊏ d’
• Similarly define d ⊐ d’
• Let (D, ⊑) be a poset
• An ascending chain is a sequence
  x1 ⊏ x2 ⊏ … ⊏ xk ⊏ …
• A descending chain is a sequence
  x1 ⊐ x2 ⊐ … ⊐ xk ⊐ …
• The height of a poset is the length of the maximal ascending chain
  – What is the height of the SAV poset?
  – What is the height of the CP poset?

Ascending chain example

[Figure: an ascending chain in the signs lattice for variable x]


Joining elements

Bounds

• Let (D, ⊑) be a poset
• Let X ⊆ D be a set of elements from D
• An element d∈D is an upper bound (ub) of X iff for every x∈X we have that x ⊑ d
• An element d∈D is a lower bound (lb) of X iff for every x∈X we have that d ⊑ x
• An element d∈D is the least upper bound (lub) of X iff d is the minimal of all upper bounds of X
• An element d∈D is the greatest lower bound (glb) of X iff d is the maximal of all lower bounds of X

Bounds example

[Figure: the signs lattice (for variable x), with the elements x<0 and x>0 highlighted]

• x≠0 and true are upper bounds of {x<0, x>0}
• x≠0 is the least upper bound
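The bounds in this example can be computed mechanically. The following sketch assumes each element of the signs lattice is modelled as the set of signs it allows (so x≠0 is {-, +}) and that the order is set inclusion; the encoding and helper names are illustrative assumptions.

```python
from itertools import combinations

SIGNS = ("-", "0", "+")
# All eight elements of the signs lattice, as sets of allowed signs.
D = [frozenset(c) for n in range(4) for c in combinations(SIGNS, n)]
NAMES = {
    frozenset(): "false",
    frozenset("-"): "x<0", frozenset("0"): "x=0", frozenset("+"): "x>0",
    frozenset("-0"): "x<=0", frozenset("-+"): "x!=0", frozenset("0+"): "x>=0",
    frozenset("-0+"): "true",
}


def upper_bounds(xs, domain, leq):
    """All d in domain with x ⊑ d for every x in xs."""
    return [d for d in domain if all(leq(x, d) for x in xs)]


def lub(xs, domain, leq):
    """The upper bound that is below every other upper bound, or None."""
    ubs = upper_bounds(xs, domain, leq)
    least = [d for d in ubs if all(leq(d, u) for u in ubs)]
    return least[0] if least else None


leq = lambda a, b: a <= b            # subset inclusion
xs = [frozenset("-"), frozenset("+")]  # {x<0, x>0}

print(sorted(NAMES[u] for u in upper_bounds(xs, D, leq)))
print(NAMES[lub(xs, D, leq)])
```

The same two helpers work for any finite poset given as a list of elements and an order predicate; in a poset that is not a lattice `lub` may return None.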

Join (confluence) operator

• Assume a poset (D, ⊑)
• Let X ⊆ D be a subset of D (finite/infinite)
• The join of X is defined as
  – ⊔X = the least upper bound (LUB) of all elements in X if it exists
  – ⊔X = min{ b | forall x∈X we have that x ⊑ b }
  – The supremum of the elements in X
  – A kind of abstract union (disjunction) operator
• Properties of a join operator
  – Commutative: x ⊔ y = y ⊔ x
  – Associative: (x ⊔ y) ⊔ z = x ⊔ (y ⊔ z)
  – Idempotent: x ⊔ x = x
• x ⊔ y = y iff x ⊑ y

Properties of join

• Can be used to define partial order: x ⊔ y = y iff x ⊑ y
• Monotone: if y ⊑ z then (x ⊔ y) ⊑ (x ⊔ z)
• x ⊔ ⊥ = x
• x ⊔ ⊤ = ⊤

Meet operator

• Assume a poset (D, ⊑)
• Let X ⊆ D be a subset of D (finite/infinite)
• The meet of X is defined as
  – ⊓X = the greatest lower bound (GLB) of all elements in X if it exists
  – ⊓X = max{ b | forall x∈X we have that b ⊑ x }
  – The infimum of the elements in X
  – A kind of abstract intersection (conjunction) operator
• Properties of a meet operator
  – Commutative: x ⊓ y = y ⊓ x
  – Associative: (x ⊓ y) ⊓ z = x ⊓ (y ⊓ z)
  – Idempotent: x ⊓ x = x

Complete partial orders

Complete partial order (CPO)

• A CPO is a partial order where each ascending chain has a supremum

lattices

Complete lattice

• A complete lattice (D, ⊑, ⊔, ⊓, ⊥, ⊤) is
  – A set of elements D
  – A partial order x ⊑ y
  – A join operator ⊔
  – A meet operator ⊓

Join semilattice

• A join semilattice (D, ⊑, ⊔, ⊤) is
  – A set of elements D
  – A partial order x ⊑ y
  – A join operator ⊔

Meet semilattice

• A meet semilattice (D, ⊑, ⊓, ⊥) is
  – A set of elements D
  – A partial order x ⊑ y
  – A meet operator ⊓

Powerset lattices

• For a set of elements X we define the powerset lattice for X as
  (2^X, ⊆, ∪, ∩, ∅, X)
  – Notice it is a complete lattice
• For a set of program states State, we define the collecting lattice
  (2^State, ⊆, ∪, ∩, ∅, State)

Composing lattices

One lattice per variable

[Figure: two copies of the signs lattice, one for variable x and one for variable y]

How can we compose them?

Cartesian product of complete lattices

• For two complete lattices
  L1 = (D1, ⊑1, ⊔1, ⊓1, ⊥1, ⊤1)
  L2 = (D2, ⊑2, ⊔2, ⊓2, ⊥2, ⊤2)
• Define the poset Lcart = (D1×D2, ⊑cart, ⊔cart, ⊓cart, ⊥cart, ⊤cart) as follows:
  – (x1, x2) ⊑cart (y1, y2) iff x1 ⊑1 y1 and x2 ⊑2 y2
  – ⊔cart = ? ⊓cart = ? ⊥cart = ? ⊤cart = ?
• Lemma: Lcart is a complete lattice
• Define the Cartesian constructor Lcart = Cart(L1, L2)
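One standard way to fill in the question marks is to define every operation componentwise. The sketch below assumes a lattice is packaged as a record of functions and constants; the `Lattice` class, the `cart` function and the set-of-signs encoding are illustrative choices, and only binary join and meet are modelled, whereas a complete lattice needs them for arbitrary subsets.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass(frozen=True)
class Lattice:
    leq: Callable[[Any, Any], bool]
    join: Callable[[Any, Any], Any]
    meet: Callable[[Any, Any], Any]
    bottom: Any
    top: Any


def cart(l1, l2):
    """Cartesian product: every operation is applied componentwise."""
    return Lattice(
        leq=lambda a, b: l1.leq(a[0], b[0]) and l2.leq(a[1], b[1]),
        join=lambda a, b: (l1.join(a[0], b[0]), l2.join(a[1], b[1])),
        meet=lambda a, b: (l1.meet(a[0], b[0]), l2.meet(a[1], b[1])),
        bottom=(l1.bottom, l2.bottom),
        top=(l1.top, l2.top),
    )


# Signs lattice, with each element modelled as the set of signs it allows.
signs = Lattice(
    leq=frozenset.issubset,
    join=frozenset.union,
    meet=frozenset.intersection,
    bottom=frozenset(),
    top=frozenset("-0+"),
)
xy = cart(signs, signs)

neg_neg = (frozenset("-"), frozenset("-"))   # x<0, y<0
pos_pos = (frozenset("+"), frozenset("+"))   # x>0, y>0
joined = xy.join(neg_neg, pos_pos)
print(joined)                                # (x!=0, y!=0)
```

The joined element also covers states with x<0 and y>0, so the Cartesian product cannot express (x<0 ∧ y<0) ∨ (x>0 ∧ y>0) exactly; it loses the correlation between the two variables.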

Cartesian product example

[Figure: Hasse diagram of the Cartesian product of the signs lattices for x and y. It ranges from (false, false) at the bottom, through the nine atoms x<0,y<0 … x>0,y>0 and their componentwise joins, up to (true, true).]

How does it represent (x<0 ∧ y<0) ∨ (x>0 ∧ y>0)?

Disjunctive completion

• For a complete lattice
  L = (D, ⊑, ⊔, ⊓, ⊥, ⊤)
• Define the powerset lattice
  L∨ = (2^D, ⊑∨, ⊔∨, ⊓∨, ⊥∨, ⊤∨)
  ⊑∨ = ? ⊔∨ = ? ⊓∨ = ? ⊥∨ = ? ⊤∨ = ?
• Lemma: L∨ is a complete lattice
• L∨ contains all subsets of D, which can be thought of as disjunctions of the corresponding predicates
• Define the disjunctive completion constructor
  L∨ = Disj(L)

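The base lattice CPfalse

[Figure: Hasse diagram with the singletons … {x=-2} {x=-1} {x=1} {x=2} … in a single row]

The disjunctive completion of CPfalse

[Figure: Hasse diagram, listed by level]
  … {x=-2} {x=-1} {x=1} {x=2} …
  {x=-2 ∨ x=-1} {x=-2 ∨ x=0} {x=-2 ∨ x=1} {x=1 ∨ x=2} …
  {x=0 ∨ x=1 ∨ x=2} {x=-1 ∨ x=1 ∨ x=-2} …

What is the height of this lattice?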

Relational product of lattices

• L1 = (D1, ⊑1, ⊔1, ⊓1, ⊥1, ⊤1)
  L2 = (D2, ⊑2, ⊔2, ⊓2, ⊥2, ⊤2)
• Lrel = (2^(D1×D2), ⊑rel, ⊔rel, ⊓rel, ⊥rel, ⊤rel) as follows:
  – Lrel = Disj(Cart(L1, L2))
• Lemma: Lrel is a complete lattice
• What does it buy us?

Cartesian product example

[Figure: the Hasse diagram of the Cartesian product of the signs lattices for x and y, repeated for comparison with the relational product]

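Relational product example

[Figure: Hasse diagram of the relational product of the signs lattices for x and y. Besides the elements of the Cartesian product it contains disjunctions such as (x<0 ∧ y<0) ∨ (x>0 ∧ y>0) and (x<0 ∧ y<0) ∨ (x>0 ∧ y=0) …]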

Collecting semantics

  label0: if x <= 0 goto label1
          x := x - 1
          goto label0
  label1:

[Figure: control-flow graph with nodes "if x > 0" and "x := x - 1", annotated with the collected sets of states, e.g. [x↦-2] …]

Defining the collecting semantics

• How should we represent the set of states at a given control-flow node by a lattice?

• How should we represent the sets of states at all control-flow nodes by a lattice?

Finite maps

• For a complete lattice
  L = (D, ⊑, ⊔, ⊓, ⊥, ⊤)
  and finite set V
• Define the poset LV→L = (V→D, ⊑V→L, ⊔V→L, ⊓V→L, ⊥V→L, ⊤V→L) as follows:
  – f1 ⊑V→L f2 iff for all v∈V
    f1(v) ⊑ f2(v)
  – ⊔V→L = ? ⊓V→L = ? ⊥V→L = ? ⊤V→L = ?
• Lemma: LV→L is a complete lattice
• Define the map constructor LV→L = Map(V, L)
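As with the Cartesian product, a natural reading of the question marks is to lift each operation pointwise. The sketch below assumes a map is a Python dict from control-flow nodes to lattice elements and that both maps share the same keys; the helper names and node labels are hypothetical.

```python
def map_leq(f1, f2, leq):
    """f1 ⊑ f2 iff f1(v) ⊑ f2(v) for every v in V."""
    return all(leq(f1[v], f2[v]) for v in f1)


def map_join(f1, f2, join):
    """Pointwise join of two maps over the same finite set V."""
    return {v: join(f1[v], f2[v]) for v in f1}


def map_bottom(nodes, bottom):
    """The least map: every node is sent to the bottom element of L."""
    return {v: bottom for v in nodes}


# Example over the collecting lattice: elements are sets of states,
# ordered by inclusion, with union as join.
subset = lambda a, b: a <= b
union = lambda a, b: a | b

nodes = ["entry", "exit"]
f1 = {"entry": frozenset({1}), "exit": frozenset()}
f2 = {"entry": frozenset({1, 2}), "exit": frozenset({0})}

print(map_leq(f1, f2, subset))
print(map_join(f1, f2, union) == f2)
print(map_leq(map_bottom(nodes, frozenset()), f1, subset))
```

Meet and top lift in the same pointwise way.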

The collecting lattice

• Lattice for a given control-flow node v: Lv = (2^State, ⊆, ∪, ∩, ∅, State)
• Lattice for entire control-flow graph with nodes V: LCFG = Map(V, Lv)
• We will use this lattice as a baseline for static analysis and define abstractions of its elements

Equational definition of the semantics

• Define variables of type set of states for each control-flow node

• Define constraints between them

[Figure: control-flow graph of the loop, with nodes "if x > 0" and "x := x - 1" and the variables R[entry], R[3], R[exit] on its edges]

Equational definition of the semantics

• R[2] = R[entry] ∪ ⟦x:=x-1⟧ R[3]
• R[3] = R[2] ∩ {s | s(x) > 0}
• R[exit] = R[2] ∩ {s | s(x) ≤ 0}
• A system of recursive equations
• How can we approximate it using what we have learned so far?

An abstract semantics

• R[2] = R[entry] ⊔ ⟦x:=x-1⟧# R[3]
• R[3] = R[2] ⊓ {s | s(x) > 0}#
• R[exit] = R[2] ⊓ {s | s(x) ≤ 0}#
• A system of recursive equations

Here ⟦x:=x-1⟧# is the abstract transformer for x:=x-1, and each {s | …}# is the abstract representation of the corresponding set of states.
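To make the abstract equations concrete, here is a minimal sketch that solves them over the signs domain. The choice of domain is an assumption (the slide leaves it open): an abstract element is modelled as the set of signs x may have, join is union, meet is intersection, and the transformer `dec_sharp` is one sound abstraction of x:=x-1. All names are illustrative.

```python
BOTTOM = frozenset()
TOP = frozenset("-0+")
POS = frozenset("+")       # abstract representation of {s | s(x) > 0}
NONPOS = frozenset("-0")   # abstract representation of {s | s(x) ≤ 0}


def dec_sharp(a):
    """Abstract transformer for x := x-1 on sets of signs."""
    # negative stays negative, zero becomes negative,
    # positive becomes zero or stays positive
    effect = {"-": "-", "0": "-", "+": "0+"}
    return frozenset(s for sign in a for s in effect[sign])


def solve_abstract(entry):
    """Iterate the three abstract equations from bottom to a fixed point."""
    r = {"entry": entry, 2: BOTTOM, 3: BOTTOM, "exit": BOTTOM}
    while True:
        new = {
            "entry": entry,
            2: r["entry"] | dec_sharp(r[3]),   # R[2] = R[entry] ⊔ ⟦x:=x-1⟧# R[3]
            3: r[2] & POS,                     # R[3] = R[2] ⊓ {s | s(x) > 0}#
            "exit": r[2] & NONPOS,             # R[exit] = R[2] ⊓ {s | s(x) ≤ 0}#
        }
        if new == r:
            return r
        r = new


result = solve_abstract(POS)    # start from x > 0
print(sorted(result["exit"]))   # signs possible at the loop exit
```

The domain has only eight elements and every equation is monotone, so the iteration stabilizes after a few rounds; starting from x>0 the analysis concludes that x=0 holds at the exit.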

Next lecture: abstract interpretation II
