Loops or Lather, Rinse, Repeat… CS153: Compilers Greg Morrisett.

Loopsor

Lather, Rinse, Repeat…

CS153: Compilers

Greg Morrisett

Loop Optimizations:The vast majority of time is spent in loops.

So we want techniques for improving them:• Loop invariant removal• Induction variable elimination• Loop unrolling & loop peeling• Loop fusion• Loop interchange• Loop parallelization

Example: invariant removal

L0: t := 0

L1: i := i + 1

t := a + b

*i := t

if i<N goto L1 else L2

L2: x := t

Example: invariant removal

L0: t := 0

t := a + b

L1: i := i + 1

*i := t

L2: x := t

Example: induction variableL0: i := 0 /* s=0; */

s := 0 /* for(i=0;i<100;i++)*/

jump L2 /* s += a[i]; */

L1: t1 := i*4 Note: t1 == i*4

t2 := a+t1 at each point in loop

t3 := *t2

s := s + t3

i := i+1

L2: if i < 100 goto L1 else goto L3

L3: ...

Example: induction variableL0: i := 0

s := 0

t1 := 0

jump L2

L1: t2 := a+t1

t3 := *t2

s := s + t3

i := i+1

t1 := t1+4

L3: ...

Example: induction variableL0: i := 0 s := 0 t1 := 1 t2 := a jump L2 L1: t2 := t2+4 ; t2 == a+t1 == a+i*4 t3 := *t2 s := s + t3

i := i+1 t1 := t1+4L2: if i < 100 goto L1 else goto L3L3: ...

Notice t1 no longer used!

s := 0

t2 := a

jump L2

L1: i := i+1

t2 := t2+4

t3 := *t2

s := s + t3

L3: ...

s := 0

t2 := a

t5 := t2+400

jump L2

L1: i := i+1

t2 := t2+4

t3 := *t2

s := s + t3

L2: if t2 < t5 goto L1 else goto L3

L3: ...

s := 0

t2 := a

t5 := t2+400

jump L2

L1: i := i+1

t2 := t2+4

t3 := *t2

s := s + t3

L3: ...

Example: induction variableL0: s := 0

t2 := a

t5 := t2+400

jump L2

L1: t2 := t2+4

t3 := *t2

s := s + t3

L3: ...

Gotta find loops first:What is a loop?

– can't just "look" at graphs– we're going to assume some additional structure

Defn: a loop is a subset S of nodes where:• there is a distinguished header node h• you can get from h to any node in S• you can get from any node in S to h• there's no edge from a node outside S to any

other node than h.

Examples:

Consider:

Does it have a "loop"?

This graph is called irreducible

According to our definition, no loop.But obviously, there's a cycle…

• a can't be header: no edge from c or b to it.

• b can't be header: has outside edge from a.

• c can't be header for same reason.

Reducible Flow GraphsSo why did we define loops this way?• header gives us a "handle" for the loop.

– e.g., a good spot for hoisting invariant statements

• structured control-flow only produces reducible graphs.– a graph where all cycles are loops according to

our definition.– Java: only reducible graphs– C/C++: goto can produce irreducible graph

• many analyses & loop optimizations depend upon having reducible graphs.

Finding LoopsDefn: node d dominates node n if every path from

the start node to n must go through d.

Defn: an edge from n to a dominator d is called a back-edge.

Defn: a natural loop of a back edge nd is the set of nodes x such that d dominates x and there is a path from x to n not including d.

So that's how we find loops!

Example:a

a dominates a,b,c,d,e,f,g,hb dominates b,c,d,e,f,g,hc dominates c,ed dominates de dominates ef dominates f,g,hg dominates g,hh dominates h

back-edges? f->b, g->a

loops?

Calculating Dominators:D[n] : the set of nodes that dominate n.

D[n] = {n} (D[p1] D[p2] … D[pm])

where pred[n] = {p1,p2,…,pn}

It's pretty easy to solve this equation.– start off assuming D[n] is all nodes.

• except for the start node.• (only the start node dominates itself.)

– iteratively update D[n] based on predecessors until you reach a fixed point.

Representing Dominators

• We don't actually need to keep around the set of all dominators for each node.

• Instead, we construct a dominator tree.– if both d and e dominate n, then either d

dominates e or vice versa.– that tells us there is a "closest" or immediate

dominator.

Example:a

Immediate Dominator Tree

Nested Loops

• A header can participate in multiple loops.

• If loops A and B share header h and all of the nodes in B are a subset of A, then we say B is nested within A.

• We usually concentrate our attention on nested loops first (since we spend more time in them.)

Loop InvariantsA definition x:=… reaches a control-flow point if

there is a path from the assignment to that point that contains no other assignment to x.

An assignment x := v1 v2 is invariant for a loop if for both operands v1 & v2 either– they are constant, or– all of their definitions that reach the assignment are

outside the loop, or– only one definition reaches the assignment and it is

loop invariant.

Example:

L0: t := 0

L1: i := i + 1

t := a + b

*i := t

L2: x := t

Calculating Reaching Defn's:Assign a unique id to each definition.Define defs(x) to be the set of all definitions of the

temp x. Gen Kill

d : x := v1 v2 {d} defs(x) - {d}d : x := v {d} defs(x) - {d}<everything else> { } { }

DefIn[n] = DefOut[p1] … DefOut[pn] where Pred[n] = {p1,…,pn}

DefOut[n] = Gen[n] (DefIn[n] - Kill[n])

Hoisting

We would like to hoist invariant computations out of the loop.

But this is trickier than it sounds:

• We need to potentially introduce an extra node in the CFG, right before the header to place the hoisted statements (the pre-header.)

• Even then, we can run into trouble…

Valid Hoisting:

L0: t := 0

L1: i := i + 1

t := a + b

*i := t

L2: x := t

Valid Hoisting:

L0: t := 0

t := a + b

L1: i := i + 1

*i := t

L2: x := t

Invalid Hoisting:

L0: t := 0

L1: i := i + 1

*i := t

t := a + b

L2: x := t

t's definition isloop invariant buthoisting it conflictswith this use of the old t.

Conditions for Safe Hoisting:

An invariant assignment d:x:= v1v2 is safe to hoist if:– d dominates all loop exits at which x is live-

out, and– there is only one definition of x in the loop,

and– x is not live-out at the entry point for the loop

(the pre-header.)

Ugh! – life would be so much more pleasant in a

functional world…

Induction Variabless := 0

i := 0

L1: if i >= n goto L2

j := i*4

k := j+a

x := *k

s := s+x

i := i+1

L2: ...

We can express j & k as linear functions of i:

j = 4*i + 0k = 4*i + a

where the coefficientsare either constantsor loop-invariant.

i := 0

j := i*4

k := j+a

x := *k

s := s+x

i := i+1

L2: ...

So let's represent them as triples of the form (t, e0, e1):

j = (i, 0, 4) k = (i, a, 4)i = (i, 0, 1)

i := 0

j := i*4

k := j+a

x := *k

s := s+x

i := i+1

L2: ...

Note that i only changes bythe same amount each iteration of the loop.

We say that i is a linear induction variable.

So it's easy to express thechange in j & k.

i := 0

j := i*4

k := j+a

x := *k

s := s+x

i := i+1

L2: ...

If i changes by c, then since:

j = 4*i + 0k = 4*i + a

we know that j & k change by 4*c.

Detecting Induction VariablesDefn: x is a basic induction variable in a loop L if

the only definitions of x within L are of the form x:=x+c or x:=x-c where c is loop invariant.

Defn: k is a derived induction variable in loop L if:1. There is only one def of k within L of the form

k:=j*c or k:=j+c where j is an induction variable and c is loop invariant, and:

2. If j is an induction variable in the family of i, then:

• the only def of j that reaches k is the one in the loop• and there is no def of i on any path between the def

of j and the def of k.

Strength ReductionFor each derived induction variable j of the form

(i, e0, e1) make a fresh temp j'.

At the loop pre-header, initialize j' to e0.

After each i:=i+c, define j':=j'+(e1*c).– note that e1*c can be computed in the loop header

(i.e., it's loop invariant.)

Replace the unique assignment of j in the loop with j := j'.

Examples := 0

i := 0

j' := 0

k' := a

j := i*4

k := j+a

x := *k

s := s+x

i := i+1

L2: ...

Examples := 0i := 0j' := 0k' := a

L1: if i >= n goto L2j := i*4k := j+ax := *ks := s+xi := i+1j' := j'+4k' := k'+4

L2: ...

Examples := 0i := 0j' := 0k' := a

L1: if i >= n goto L2j := j'k := k'x := *ks := s+xi := i+1j' := j'+4k' := k'+4

L2: ...

Copy-propagation orcoalescing will eliminatethe distinction between j/j' and k/k'.

Useless Variabless := 0

i := 0

j' := 0

k' := a

x := *k'

s := s+x

i := i+1

j' := j'+4

k' := k'+4

L2: ...

A variable is useless for Lif it is not live out at all exitsfrom L and its only use isin a definition of itself.

For example, j' is useless.

We can delete uselessvariables from loops.

Useless Variabless := 0

i := 0

j' := 0

k' := a

x := *k'

s := s+x

i := i+1

k' := k'+4

L2: ...

DCE will pick up thedead initialization in thepre-header…

Almost Useless Variabless := 0

i := 0

k' := a

x := *k'

s := s+x

i := i+1

k' := k'+4

L2: ...

The variable i is almostuseless. It would be if itweren't used in thecomparison…

See Appel for how todetermine when/how it'ssafe to rewrite this testin terms of other induction variables inthe family of i.

Loops or Lather, Rinse, Repeat… CS153: Compilers Greg Morrisett.

Documents

Transcript of Loops or Lather, Rinse, Repeat… CS153: Compilers Greg Morrisett.

CS711: Reference Monitors Part 1: OS & SFI Greg Morrisett Cornell University.

The Third Annual ICFP Programming Contest John Reppy Greg Morrisett Bell Labs Cornell University.

CS153: Compilers Lecture 6: Intermediate Representation ...

CS153: Compilers Lecture 11: LR Parsing

From System F to TypedAssemblyLanguage by Greg Morrisett ...hendren/621/2012/olivier-TAL.pdf · From System F to TypedAssemblyLanguage by Greg Morrisett et al. Olivier S. McGill University,

CS153: Compilers Lecture 7: Structured Data in LLVM IR

LATHER ON THE SUN SCREEN. HISTORY OF CLASSROOM TECH.

Drywall/Lather Apprentice Sherrean Carr Gavilan - Gavilan College

Soap Making with Beeswax - Eclectic Bee FarmContributes to: fluffy lather, stable lather, conditioning, moisturizing, quicker trace, softer soap. Often used to superfat soaps. Castor

PROVINCIAL PLAN OF TRAINING FOR THE LATHER (Interior ... · Lather (Interior Systems Mechanic) 4800 Hour Programs Requirements for Progression Progress To First Year Apprentice Completion

Language-Based Security Reference Monitors Greg Morrisett Cornell University.

Tutorial on Language-Based Security PLDI 2003 San Diego Greg Morrisett Cornell University jgm@cs.cornell.edu.

Graph-Coloring Register Allocation CS153: Compilers Greg Morrisett.

Language-Based Security Proof-Carrying Code Greg Morrisett Cornell University Thanks to G.Necula & P.Lee.

CS153: Lecture 3 - Processes - nju.edu.cn

CS153: Compilers Lecture 2: Assembly

STAL David Walker (joint work with Karl Crary, Neal Glew and Greg Morrisett) .

Lather (Interior Systems Mechanic) (Wall and Ceiling ... · Foreword This Lather (Interior Systems Mechanic) (Wall and Ceiling Installer) Program Outline is intended as a guide for

Lather Up to Avoid Getting Sick - Kettering Health Network · That’s what more people are saying when the drink menu comes around, according to a ... Lather Up to Avoid Getting

CS153: Compilers Lecture 3: Lexical Analysis