EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2:...

81
EI209 Chapter 2.1 Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (http://tdt.sjtu.edu.cn/~hjzhu/ ) Course Website: http://nsec.sjtu.edu.cn/teaching/ei209.ht ml [Adapted from Computer Organization and Design, 4 th Edition, Patterson & Hennessy, © 2012, MK]

description

EI209 Chapter 2.3Haojin Zhu, SJTU, 2015 The Language a Computer Understands  Word a computer understands: instruction  Vocabulary of all words a computer understands: instruction set (aka instruction set architecture or ISA)  Different computers may have different vocabularies (i.e., different ISAs) l iPhone (ARM) not same as Macbook (x86)  Or the same vocabulary (i.e., same ISA) l iPhone and iPad computers have same instruction set (ARM)

Transcript of EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2:...

Page 1: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.1 Haojin Zhu, SJTU, 2015

EI 209 Computer Organization

Fall 2015

Chapter 2: Instructions: Language of the Computer

Haojin Zhu (http://tdt.sjtu.edu.cn/~hjzhu/ )

Course Website: http://nsec.sjtu.edu.cn/teaching/ei209.html

[Adapted from Computer Organization and Design, 4th Edition, Patterson & Hennessy, © 2012, MK]

Page 2: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.2 Haojin Zhu, SJTU, 2015

Assignment I Assignment I

Chapter I: 1.16

Coding Assignment I:

Design a program to check if your computer is big endian or little endian?

Please submit the source code and the execution results.

Submission Due: Oct.8

Page 3: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.3 Haojin Zhu, SJTU, 2015

The Language a Computer Understands

Word a computer understands: instruction Vocabulary of all words a computer understands:

instruction set (aka instruction set architecture or ISA) Different computers may have different vocabularies (i.e.,

different ISAs) iPhone (ARM) not same as Macbook (x86)

Or the same vocabulary (i.e., same ISA) iPhone and iPad computers have same instruction set (ARM)

Page 4: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.4 Haojin Zhu, SJTU, 2015

The Language a Computer Understands

Why not all the same? Why not all different? What might be pros and cons?

Single ISA (to rule them all):- Leverage common compilers, operating systems, etc.- BUT fairly easy to retarget these for different ISAs (e.g., Linux, gcc)

Multiple ISAs:- Specialized instructions for specialized applications- Different tradeoffs in resources used (e.g., functionality, memory

demands, complexity, power consumption, etc.)- Competition and innovation is good, especially in emerging

environments (e.g., mobile devices)

Page 5: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.5 Haojin Zhu, SJTU, 2015

MIPS: Instruction Set for EI209

MIPS is a real-world ISA (see www.mips.com) Standard instruction set for networking equipment Was also used in original Nintendo-64!

Elegant example of a Reduced Instruction Set Computer (RISC) instruction set

Invented by John Hennessy @ Stanford Why not Berkeley/Sun RISC invented by Dave Patterson? Ask him!

Page 6: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.6 Haojin Zhu, SJTU, 2015

RISC Design Principles

Basic RISC principle: “A simpler CPU (the hardware that interprets machine language) is a faster CPU” (CPU Core)

Focus of the RISC design is reduction of the number and complexity of instructions in the ISA

A number of the more common strategies include: Fixed instruction length, generally a single word;

Simplifies process of fetching instructions from memory Simplified addressing modes;

Simplifies process of fetching operands from memory Fewer and simpler instructions in the instruction set;

Simplifies process of executing instructions Only load and store instructions access memory;

E.g., no add memory to register, add memory to memory, etc. Let the compiler do it. Use a good compiler to break complex high-level

language statements into a number of simple assembly language statements

Page 7: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.7 Haojin Zhu, SJTU, 2015

Mainstream ISAs

ARM (Advanced RISC Machine) is most popular RISC In every smart phone-like device (e.g., iPhone, iPad, iPod, …)

Intel 80x86 is another popular ISA and is used in Macbook and PCs (Core i3, Core i5, Core i7, …)

x86 is a Complex Instruction Set Computer (CISC) 20x ARM sold vs. 80x86 (i.e., 5 billion vs. 0.3 billion)

Page 8: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.8 Haojin Zhu, SJTU, 2015

MIPSGreen Card

Fall 2012 -- Lecture #6 8

Page 9: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.9 Haojin Zhu, SJTU, 2015

MIPSGreen Card

Fall 2012 -- Lecture #6 9

Page 10: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.10 Haojin Zhu, SJTU, 2015

Two Key Principles of Machine Design1. Instructions are represented as numbers and, as

such, are indistinguishable from data

2. Programs are stored in alterable memory (that can be read or written to)

just like data

Stored-program concept Programs can be shipped as files

of binary numbers – binary compatibility

Computers can inherit ready-made software provided they are compatible with an existing ISA – leads industry to align around a small number of ISAs

Accounting prg (machine code)

C compiler (machine code)

Payroll data

Source code in C for Acct prg

Memory

Page 11: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.11 Haojin Zhu, SJTU, 2015

MIPS-32 ISA Instruction Categories

Computational Load/Store Jump and Branch Floating Point

- coprocessor Memory Management Special

R0 - R31

PCHILO

Registers

op

op

op

rs rt rd sa funct

rs rt immediate

jump target

3 Instruction Formats: all 32 bits wide

R format

I format

J format

Page 12: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.12 Haojin Zhu, SJTU, 2015

MIPS (RISC) Design Principles Simplicity favors regularity

fixed size instructions small number of instruction formats opcode always the first 6 bits

Smaller is faster limited instruction set limited number of registers in register file limited number of addressing modes

Make the common case fast arithmetic operands from the register file (load-store machine) allow instructions to contain immediate operands

Good design demands good compromises three instruction formats

Page 13: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.14 Haojin Zhu, SJTU, 2015

MIPS Arithmetic Instructions MIPS assembly language arithmetic statement

add $t0, $s1, $s2

sub $t0, $s1, $s2

Each arithmetic instruction performs one operation

Each specifies exactly three operands that are all contained in the datapath’s register file ($t0,$s1,$s2)

destination source1 op source2

Instruction Format (R format)

0 17 18 8 0 0x22

Page 14: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.15 Haojin Zhu, SJTU, 2015

MIPS fields are given names to make them easier to refer to

MIPS Instruction Fields

op rs rt rd shamt funct

op 6-bits opcode that specifies the operationrs 5-bits register file address of the first source operandrt 5-bits register file address of the second source

operandrd 5-bits register file address of the result’s destinationshamt 5-bits shift amount (for shift instructions)funct 6-bits function code augmenting the opcode

Page 15: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.16 Haojin Zhu, SJTU, 2015

Computer Hardware Operands High-Level Programming languages:

could have millions of variables Instruction sets have fixed, small number Called registers

“Bricks” of computer hardware Fastest way to store data in computer hardware Visible to (the “assembly language”) programmer

MIPS Instruction Set has 32 integer registers

Page 16: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.17 Haojin Zhu, SJTU, 2015

Why Just 32 Registers? RISC Design Principle: Smaller is faster

But you can be too small … Hardware would likely be slower with 64, 128, or 256

registers 32 is enough for compiler to translate typical C programs,

and not run out of registers very often ARM instruction set has only 16 registers May be faster, but compiler may run out of registers

too often (aka “spilling registers to memory”)

Page 17: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.18 Haojin Zhu, SJTU, 2015

Size of Registers Bit is the atom of Computer Hardware:

contains either 0 or 1 True “alphabet” of computer hardware is 0, 1 Will eventually express MIPS instructions as

combinations of 0s and 1s (in Machine Language) MIPS registers are 32 bits wide MIPS calls this quantity a word

Some computers use 16-bit or 64-bit wide words E.g., Intel 8086 (16-bit), MIPS64 (64-bit)

Page 18: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.19 Haojin Zhu, SJTU, 2015

Some Examples of 64-bit processors

What is the first 64-bit Apple phone?

What is the first 64-bit android phone?

Iphone 5s HTC Desire 820

Page 19: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.20 Haojin Zhu, SJTU, 2015

MIPS Register FileRegister File

src1 addr

src2 addr

dst addr

write data

32 bits

src1data

src2data

32locations

325

32

5

5

32

Holds thirty-two 32-bit registers Two read ports and One write port

Registers are Faster than main memory

- But register files with more locations are slower (e.g., a 64 word file could be as much as 50% slower than a 32 word file)

- Read/write port increase impacts speed quadratically Easier for a compiler to use

- e.g., (A*B) – (C*D) – (E*F) can do multiplies in any order vs. stack

Can hold variables so that- code density improves (since register are named with fewer bits

than a memory location)

write control

Page 20: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.21 Haojin Zhu, SJTU, 2015

Aside: MIPS Register Convention

Name Register Number

Usage Preserve on call?

$zero 0 constant 0 (hardware) n.a.$at 1 reserved for assembler n.a.$v0 - $v1 2-3 returned values no$a0 - $a3 4-7 arguments yes$t0 - $t7 8-15 temporaries no$s0 - $s7 16-23 saved values yes$t8 - $t9 24-25 temporaries no$gp 28 global pointer yes$sp 29 stack pointer yes$fp 30 frame pointer yes$ra 31 return addr (hardware) yes

Page 21: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.22 Haojin Zhu, SJTU, 2015

MIPS Memory Access Instructions MIPS has two basic data transfer instructions for

accessing memory

lw $t0, 4($s3) #load word from memory

sw $t0, 8($s3) #store word to memory The data is loaded into (lw) or stored from (sw) a register

in the register file – a 5 bit address

The memory address – a 32 bit address – is formed by adding the contents of the base address register to the offset value

A 16-bit field meaning access is limited to memory locations within a region of 213 or 8,192 words (215 or 32,768 bytes) of the address in the base register

Page 22: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.23 Haojin Zhu, SJTU, 2015

Load/Store Instruction Format (I format):

lw $t0, 24($s3)

Machine Language - Load Instruction

35 19 8 2410

Memory

data word address (hex)0x000000000x000000040x000000080x0000000c

0xf f f f f f f f

$s3 0x12004094

2410 + $s3 =

. . . 0001 1000+ . . . 1001 0100 . . . 1010 1100 = 0x120040ac

0x120040ac $t0

Page 23: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.24 Haojin Zhu, SJTU, 2015

Byte Addresses Since 8-bit bytes are so useful, most architectures

address individual bytes in memory Alignment restriction - the memory address of a word must be

on natural word boundaries (a multiple of 4 in MIPS-32)

Big Endian: leftmost byte is word address IBM 360/370, Motorola 68k, MIPS, Sparc, HP PA

Little Endian: rightmost byte is word addressIntel 80x86, DEC Vax, DEC Alpha (Windows NT)

msb lsb3 2 1 0

little endian byte 0

0 1 2 3big endian byte 0

Page 24: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.25 Haojin Zhu, SJTU, 2015

Aside: Loading and Storing Bytes MIPS provides special instructions to move bytes

lb $t0, 1($s3) #load byte from memory

sb $t0, 6($s3) #store byte to memory

0x28 19 8 16 bit offset

What 8 bits get loaded and stored? load byte places the byte from memory in the rightmost 8 bits of

the destination register- what happens to the other bits in the register?

store byte takes the byte from the rightmost 8 bits of a register and writes it to a byte in memory

- what happens to the other bits in the memory word?

Page 25: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.26 Haojin Zhu, SJTU, 2015

Example of Loading and Storing Bytes Given following code sequence and memory state what is

the state of the memory after executing the code?add $s3, $zero, $zerolb $t0, 1($s3)sb $t0, 6($s3)

Memory

0x 0 0 9 0 1 2 A 0Data Word

Address (Decimal)

0

4

8

12

16

20

24

0x F F F F F F F F

0x 0 1 0 0 0 4 0 2

0x 1 0 0 0 0 0 1 0

0x 0 0 0 0 0 0 0 0

0x 0 0 0 0 0 0 0 0

0x 0 0 0 0 0 0 0 0

What value is left in $t0?

What if the machine was little Endian?

What word is changed in Memory and to what?

Page 26: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.27 Haojin Zhu, SJTU, 2015

Example of Loading and Storing Bytes Given following code sequence and memory state what is

the state of the memory after executing the code?add $s3, $zero, $zerolb $t0, 1($s3)sb $t0, 6($s3)

Memory

0x 0 0 9 0 1 2 A 0Data Word

Address (Decimal)

0

4

8

12

16

20

24

0x F F F F F F F F

0x 0 1 0 0 0 4 0 2

0x 1 0 0 0 0 0 1 0

0x 0 0 0 0 0 0 0 0

0x 0 0 0 0 0 0 0 0

0x 0 0 0 0 0 0 0 0$t0 = 0x00000090

mem(4) = 0xFFFF90FF

mem(4) = 0xFF12FFFF

What value is left in $t0?

What if the machine was little Endian?

What word is changed in Memory and to what?

$t0 = 0x00000012

Page 27: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.28 Haojin Zhu, SJTU, 2015

Speed of Registers vs. Memory Given that

Registers: 32 words (128 Bytes) Memory: Billions of bytes (2 GB to 8 GB on laptop)

and the RISC principle is… Smaller is faster

How much faster are registers than memory?? About 100-500 times faster!

in terms of latency of one access

Page 28: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.29 Haojin Zhu, SJTU, 2015

addi $sp, $sp, 4 #$sp = $sp + 4

slti $t0, $s2, 15 #$t0 = 1 if $s2<15

Machine format (I format):

MIPS Immediate Instructions

0x0A 18 8 0x0F

Small constants are used often in typical code Possible approaches?

put “typical constants” in memory and load them create hard-wired registers (like $zero) for constants like 1 have special instructions that contain constants !

The constant is kept inside the instruction itself! Immediate format limits values to the range +215–1 to -215

what about upper 16 bits?

Page 29: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.30 Haojin Zhu, SJTU, 2015

We'd also like to be able to load a 32 bit constant into a register, for this we must use two instructions

a new "load upper immediate" instruction

lui $t0, 1010101010101010

Then must get the lower order bits right, use ori $t0, $t0, 1010101010101010

Aside: How About Larger Constants?

16 0 8 10101010101010102

1010101010101010

0000000000000000 1010101010101010

0000000000000000

1010101010101010 1010101010101010why can’t addi be used as the second instruction for this 32 bit constant?

Page 30: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.31 Haojin Zhu, SJTU, 2015

Review: Unsigned Binary RepresentationHex Binary Decimal

0x00000000 0…0000 00x00000001 0…0001 10x00000002 0…0010 20x00000003 0…0011 30x00000004 0…0100 40x00000005 0…0101 50x00000006 0…0110 60x00000007 0…0111 70x00000008 0…1000 80x00000009 0…1001 9

…0xFFFFFFFC 1…11000xFFFFFFFD 1…11010xFFFFFFFE 1…11100xFFFFFFFF 1…1111 232 - 1

232 - 2

232 - 3232 - 4

232 - 1

1 1 1 . . . 1 1 1 1 bit

31 30 29 . . . 3 2 1 0 bit position

231 230 229 . . . 23 22 21 20 bit weight

1 0 0 0 . . . 0 0 0 0 - 1

Page 31: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.32 Haojin Zhu, SJTU, 2015

Possible Representations of ComputersSign Mag. Two’s Comp. One’s Comp.

1000 = -81111 = -7 1001= -7 1000 = -71110 = -6 1010 = -6 1001 = -61101 = -5 1011 = -5 1010 = -51100 = -4 1100 = -4 1011 = -41011 = -3 1101 = -3 1100 = -31010 = -2 1110 = -2 1101 = -21001 = -1 1111 = -1 1110 = -11000 = -0 1111 = -00000 = +0 0000 = 0 0000 = +00001 = +1 0001 = +1 0001 = +10010 = +2 0010 = +2 0010 = +20011 = +3 0011 = +3 0011 = +30100 = +4 0100 = +4 0100 = +40101 = +5 0101 = +5 0101 = +50110 = +6 0110 = +6 0110 = +60111 = +7 0111 = +7 0111 = +7

Which one is best? Why?

Page 32: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.33 Haojin Zhu, SJTU, 2015

Review: Signed Binary Representation2’sc binary decimal

1000 -81001 -71010 -61011 -51100 -41101 -31110 -21111 -10000 00001 10010 20011 30100 40101 50110 60111 7

23 - 1 =

-(23 - 1) =

-23 =

1010

complement all the bits

1011

and add a 1

complement all the bits

0101

and add a 1

0110

why it holds?

Page 33: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.34 Haojin Zhu, SJTU, 2015

MIPS Shift Operations Need operations to pack and unpack 8-bit characters into

32-bit words Shifts move all the bits in a word left or right

sll $t2, $s0, 8 #$t2 = $s0 << 8 bits

srl $t2, $s0, 8 #$t2 = $s0 >> 8 bits Instruction Format (R format)

Such shifts are called logical because they fill with zeros

Notice that a 5-bit shamt field is enough to shift a 32-bit value 25 – 1 or 31 bit positions

0 16 10 8 0x00

Page 34: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.35 Haojin Zhu, SJTU, 2015

MIPS Logical Operations There are a number of bit-wise logical operations in the

MIPS ISA

and $t0, $t1, $t2 #$t0 = $t1 & $t2

or $t0, $t1, $t2 #$t0 = $t1 | $t2nor $t0, $t1, $t2 #$t0 = not($t1 | $t2)

Instruction Format (R format)

andi $t0, $t1, 0xFF00 #$t0 = $t1 & ff00

ori $t0, $t1, 0xFF00 #$t0 = $t1 | ff00 Instruction Format (I format)

0 9 10 8 0 0x24

0x0D 9 8 0xFF00

Page 35: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.36 Haojin Zhu, SJTU, 2015

MIPS conditional branch instructions:

bne $s0, $s1, Lbl #go to Lbl if $s0$s1 beq $s0, $s1, Lbl #go to Lbl if $s0=$s1

Ex: if (i==j) h = i + j;

bne $s0, $s1, Lbl1add $s3, $s0, $s1

Lbl1: ...

MIPS Control Flow Instructions

Instruction Format (I format):

0x05 16 17 16 bit offset

How is the branch destination address specified?

Page 36: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.37 Haojin Zhu, SJTU, 2015

Specifying Branch Destinations Use a register (like in lw and sw) added to the 16-bit offset

which register? Instruction Address Register (the PC)- its use is automatically implied by instruction- PC gets updated (PC+4) during the fetch cycle so that it holds the

address of the next instruction limits the branch distance to -215 to +215-1 (word) instructions from

the (instruction after the) branch instruction, but most branches are local anyway

PCAdd

32

32 3232

32

offset

16

32

00

sign-extend

from the low order 16 bits of the branch instruction

branch dstaddress

?Add

4 32

Page 37: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.38 Haojin Zhu, SJTU, 2015

We have beq, bne, but what about other kinds of branches (e.g., branch-if-less-than)? For this, we need yet another instruction, slt

Set on less than instruction: slt $t0, $s0, $s1 # if $s0 < $s1 then

# $t0 = 1 else # $t0 = 0

Instruction format (R format):

Alternate versions of slt slti $t0, $s0, 25# if $s0 < 25 then $t0=1 ...

sltu $t0, $s0, $s1 # if $s0 < $s1 then $t0=1 ...

sltiu $t0, $s0, 25 # if $s0 < 25 then $t0=1 ...2

In Support of Branch Instructions

0 16 17 8 0x24

Page 38: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.39 Haojin Zhu, SJTU, 2015

Aside: More Branch Instructions Can use slt, beq, bne, and the fixed value of 0 in

register $zero to create other conditions less than blt $s1, $s2, Label

less than or equal to ble $s1, $s2, Label greater than bgt $s1, $s2, Label great than or equal to bge $s1, $s2, Label

slt $at, $s1, $s2 #$at set to 1 ifbne $at, $zero, Label #$s1 < $s2

Such branches are included in the instruction set as pseudo instructions - recognized (and expanded) by the assembler

Its why the assembler needs a reserved register ($at)

Page 39: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.40 Haojin Zhu, SJTU, 2015

MIPS Instructions

Consider a comparison instruction: slt $t0, $t1, $zeroand $t1 contains the 32-bit number 1111 01…01

What gets stored in $t0?

Page 40: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.41 Haojin Zhu, SJTU, 2015

MIPS Instructions

Consider a comparison instruction: slt $t0, $t1, $zeroand $t1 contains the 32-bit number 1111 01…01

What gets stored in $t0?The result depends on whether $t1 is a signed or unsigned number – the compiler/programmer must track this and accordingly use either slt or sltu

slt $t0, $t1, $zero stores 1 in $t0 sltu $t0, $t1, $zero stores 0 in $t0

Page 41: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.42 Haojin Zhu, SJTU, 2015

Bounds Check Shortcut Treating signed numbers as if they were unsigned gives

a low cost way of checking if 0 ≤ x < y (index out of bounds for arrays)

sltu $t0, $s1, $t2 # $t0 = 0 if # $s1 > $t2 (max)# or $s1 < 0 (min)

beq $t0, $zero, IOOB # go to IOOB if# $t0 = 0

The key is that negative integers in two’s complement look like large numbers in unsigned notation. Thus, an unsigned comparison of x < y also checks if x is negative as well as if x is less than y.

Page 42: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.43 Haojin Zhu, SJTU, 2015

MIPS also has an unconditional branch instruction or jump instruction:

j label #go to label

Other Control Flow Instructions

Instruction Format (J Format):

0x02 26-bit address

PC4

32

26

32

00

from the low order 26 bits of the jump instruction

Why shift left by two bits?

Page 43: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.44 Haojin Zhu, SJTU, 2015

Aside: Branching Far Away What if the branch destination is further away than can

be captured in 16 bits?

The assembler comes to the rescue – it inserts an unconditional jump to the branch target and inverts the condition

beq $s0, $s1, L1

becomes

bne $s0, $s1, L2j L1

L2:

Page 44: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.45 Haojin Zhu, SJTU, 2015

Compiling Another While Loop Compile the assembly code for the C while loop where

i is in $s3, k is in $s5, and the base address of the array save is in $s6

while (save[i] == k) i += 1;

Page 45: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.46 Haojin Zhu, SJTU, 2015

Compiling Another While Loop Compile the assembly code for the C while loop where

i is in $s3, k is in $s5, and the base address of the array save is in $s6

while (save[i] == k) i += 1;

Loop: sll $t1, $s3, 2add $t1, $t1, $s6lw $t0, 0($t1)bne $t0, $s5, Exitaddi $s3, $s3, 1j Loop

Exit: . . .

Page 46: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

Can make a “for” loop with only j

Can make a “for” loop with only beq

Code for “else” part can come before code for “then” in translation of “if”

Can make an unconditional branch from aconditional branch instruction

Which of the following is FALSE?

Page 47: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.48 Haojin Zhu, SJTU, 2015

Review: MIPS Addressing Modes Illustrated1. Register addressing

op rs rt rd funct Registerword operand

op rs rt offset2. Base (displacement) addressing

base register

Memoryword or byte operand

3. Immediate addressingop rs rt operand

4. PC-relative addressingop rs rt offset

Program Counter (PC)

Memorybranch destination instruction

5. Pseudo-direct addressingop jump address

Program Counter (PC)

Memoryjump destination instruction||

Page 48: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.49 Haojin Zhu, SJTU, 2015

End of Lecture #1 The next topic is procedure

Page 49: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.50 Haojin Zhu, SJTU, 2015

MIPS procedure call instruction:

jal ProcedureAddress #jump and link

Saves PC+4 in register $ra to have a link to the next instruction for the procedure return

Machine format (J format):

Then can do procedure return with a

jr $ra #return Instruction format (R format):

Instructions for Accessing Procedures

0x03 26 bit address

0 31 0x08

Page 50: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.51 Haojin Zhu, SJTU, 2015

Six Steps in Execution of a Procedure1. Main routine (caller) places parameters in a place

where the procedure (callee) can access them $a0 - $a3: four argument registers

2. Caller transfers control to the callee

3. Callee acquires the storage resources needed

4. Callee performs the desired task

5. Callee places the result value in a place where the caller can access it $v0 - $v1: two value registers for result values

6. Callee returns control to the caller $ra: one return address register to return to the point of origin

Page 51: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.52 Haojin Zhu, SJTU, 2015

MIPS Function Call Conventions Registers faster than memory, so use them $a0–$a3: four argument registers to pass parameters $v0–$v1: two value registers to return values $ra: one return address register to return to the point of origin (7 + $zero +$at of 32, 23 left!)

$t0-$t9: 10 x temporaries (intermediates) $s0-$s7: 8 x “saved” temporaries (program variables) 18 registers 32 – (18 + 9) = 5 left

Page 52: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.53 Haojin Zhu, SJTU, 2015

Notes on Functions Calling program (caller) puts parameters into registers $a0-$a3 and uses jal X to invoke X (callee)

Must have register in computer with address of currently executing instruction

Instead of Instruction Address Register (better name), historically called Program Counter (PC)

It’s a program’s counter; it doesn’t count programs! jr $ra puts address inside $ra into PC What value does jal X place into $ra? ????

Student Roulette?

Page 53: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.55 Haojin Zhu, SJTU, 2015

Aside: Spilling Registers What if the callee needs to use more registers than

allocated to argument and return values? callee uses a stack – a last-in-first-out queue

low addr

high addr

$sp

One of the general registers, $sp ($29), is used to address the stack (which “grows” from high address to low address)

add data onto the stack – push $sp = $sp – 4

data on stack at new $sp

remove data from the stack – pop data from stack at $sp $sp

= $sp + 4

top of stack

(28 out of 32, 4 left!)

Page 54: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.57 Haojin Zhu, SJTU, 2015

Compiling a C Leaf Procedure Leaf procedures are ones that do not call other

procedures. Give the MIPS assembler code forint leaf_ex (int g, int h, int i, int j)

{ int f;f = (g+h) – (i+j);return f; }

where g, h, i, and j are in $a0, $a1, $a2, $a3

Page 55: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.58 Haojin Zhu, SJTU, 2015

Leaf Procedure Example C code:

int leaf_example (int g, h, i, j){ int f; f = (g + h) - (i + j); return f;}

Arguments g, …, j are passed in $a0, …, $a3 f in $s0 (we need to save $s0 on stack – we will see why later) Results are returned in $v0, $v1

$a0$a1$a2$a3

$v0$v1

argument registers

result registers

procedure

Page 56: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.59 Haojin Zhu, SJTU, 2015

Procedure Call Instructions Procedure call: jump and link

jal ProcedureLabel Address of following instruction put in $ra Jumps to target address

Procedure return: jump register

jr $ra Copies $ra to program counter Can also be used for computed jumps

- e.g., for case/switch statements

Example:

Page 57: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.60 Haojin Zhu, SJTU, 2015

Leaf Procedure ExampleMIPS code:

leaf_example: addi $sp, $sp, -4 sw $s0, 0($sp) add $t0, $a0, $a1 add $t1, $a2, $a3 sub $s0, $t0, $t1 add $v0, $s0, $zero lw $s0, 0($sp) addi $sp, $sp, 4 jr $ra

Save $s0 on stack

Procedure body

Restore $s0

Result

Return

Page 58: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.61 Haojin Zhu, SJTU, 2015

Procedure Call Mechanics

Old Stack Frame

arg registers

return address

Saved registers

local variables

New Stack Frame

$fp

$fp

$sp

$sp

Low Address

High Address

compiler

ISAHW

compiler

addressing

$gp

PC

$spstack

dynamic data

static data

text

reserved

System Wide Memory Map

Page 59: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.62 Haojin Zhu, SJTU, 2015

Example of the Stack Frame

Call Sequence1. place excess arguments2. save caller save registers ($a0-$a3, $t0-$t9)3. jal4. allocate stack frame5. save callee save registers ($s0-$s9, $fp, $ra)6 set frame pointer

Return1. place function argument in $v02. restore callee save registers3. restore $fp4. pop frame5. jr $31

arg 1arg 2

..calleesaved

registers

callersaved

registerslocal

variables..

$fp$ra

$s0-$s9

$a0-$a3$t0-$t9

$fp

$sp

Page 60: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.63 Haojin Zhu, SJTU, 2015

Nested Procedures What happens to return addresses with nested

procedures?int rt_1 (int i) {

if (i == 0) return 0;else return rt_2(i-1); }

caller: jal rt_1next: . . .

rt_1: bne $a0, $zero, to_2add $v0, $zero, $zerojr $ra

to_2: addi $a0, $a0, -1jal rt_2jr $ra

rt_2: . . .

Page 61: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.64 Haojin Zhu, SJTU, 2015

Nested Procedures Outcomecaller: jal rt_1next: . . .rt_1: bne $a0, $zero, to_2

add $v0, $zero, $zerojr $ra

to_2: addi $a0, $a0, -1jal rt_2jr $ra

rt_2: . . .

On the call to rt_1, the return address (next in the caller routine) gets stored in $ra. What happens to the value in $ra (when i != 0) when rt_1 makes a call to rt_2?

Page 62: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.66 Haojin Zhu, SJTU, 2015

Saving the Return Address, Part 1 Nested procedures (i passed in $a0, return value in $v0)

rt_1: bne $a0, $zero, to_2

add $v0, $zero, $zerojr $ra

to_2: addi $sp, $sp, -8sw $ra, 4($sp)sw $a0, 0($sp)addi $a0, $a0, -1jal rt_2

bk_2: lw $a0, 0($sp)lw $ra, 4($sp)addi $sp, $sp, 8jr $ra

Save the return address (and arguments) on the stack

caller rt addr

high addr

$sp

$sp

low addr

old $a0

$racaller rt addrbk_2

old TOS

Page 63: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.68 Haojin Zhu, SJTU, 2015

Saving the Return Address, Part 2 Nested procedures (i passed in $a0, return value in $v0)

rt_1: bne $a0, $zero, to_2

add $v0, $zero, $zerojr $ra

to_2: addi $sp, $sp, -8sw $ra, 4($sp)sw $a0, 0($sp)addi $a0, $a0, -1jal rt_2

bk_2: lw $a0, 0($sp)lw $ra, 4($sp)addi $sp, $sp, 8jr $ra

Save the return address (and arguments) on the stack

caller rt addr

high addr

$sp

$sp

low addr

old $a0

$rabk_2caller rt addr

old TOS

Page 64: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.69 Haojin Zhu, SJTU, 2015

MIPS Register ConventionName Register

NumberUsage Preserve

on call?$zero 0 the constant 0 n.a.$v0 - $v1 2-3 returned values no$a0 - $a3 4-7 arguments yes$t0 - $t7 8-15 temporaries no$s0 - $s7 16-23 saved values yes$t8 - $t9 24-25 temporaries no$gp 28 global pointer yes$sp 29 stack pointer yes$fp 30 frame pointer yes$ra 31 return address yes

Page 65: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.70 Haojin Zhu, SJTU, 2015

Allocating Space on Stack

C has two storage classes: automatic and static Automatic variables are local to function and discarded when

function exits Static variables exist across exits from and entries to procedures

Use stack for automatic (local) variables that don’t fit in registers

Procedure frame or activation record: segment of stack with saved registers and local variables

Some MIPS compilers use a frame pointer ($fp) to point to first word of frame

(29 of 32, 3 left!)

Page 66: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.71 Haojin Zhu, SJTU, 2015

Stack Before, During, After Call

Page 67: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.72 Haojin Zhu, SJTU, 2015

Compiling a Recursive Procedure A procedure for calculating factorial

int fact (int n) {if (n < 1) return 1;

else return (n * fact (n-1)); }

A recursive procedure (one that calls itself!)fact (0) = 1fact (1) = 1 * 1 = 1fact (2) = 2 * 1 * 1 = 2fact (3) = 3 * 2 * 1 * 1 = 6fact (4) = 4 * 3 * 2 * 1 * 1 = 24. . .

Assume n is passed in $a0; result returned in $v0

Page 68: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.73 Haojin Zhu, SJTU, 2015

Compiling a Recursive Procedurefact: addi $sp, $sp, -8 #adjust stack pointer

sw $ra, 4($sp) #save return addresssw $a0, 0($sp) #save argument nslti $t0, $a0, 1 #test for n < 1beq $t0, $zero, L1 #if n >=1, go to L1addi $v0, $zero, 1 #else return 1 in $v0addi $sp, $sp, 8 #adjust stack pointerjr $ra #return to caller (1st)

L1: addi $a0, $a0, -1 #n >=1, so decrement njal fact #call fact with (n-1)#this is where fact returns

bk_f: lw $a0, 0($sp) #restore argument nlw $ra, 4($sp) #restore return addressaddi $sp, $sp, 8 #adjust stack pointermul $v0, $a0, $v0 #$v0 = n * fact(n-1)jr $ra #return to caller (2nd)

Using stacks to save existing data in $ra and $a0

Make the branch

Call procedureFact(n-1)

Return Fact(1)

Return after procedureFact(n-1)

Page 69: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.74 Haojin Zhu, SJTU, 2015

A Look at the Stack for $a0 = 2, Part 1

$sp

$ra

$a0

$v0

old TOS Stack state after

execution of the first encounter with jal (second call to fact routine with $a0 now holding 1) saved return address to

caller routine (i.e., location in the main routine where first call to fact is made) on the stack

saved original value of $a0 on the stack

Page 70: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.75 Haojin Zhu, SJTU, 2015

A Look at the Stack for $a0 = 2, Part 1

$sp

$ra

$a0

$v0

$sp

caller rt addr

caller rt addr$a0 = 2

21

bk_f

old TOS Stack state after

execution of the first encounter with jal (second call to fact routine with $a0 now holding 1) saved return address to

caller routine (i.e., location in the main routine where first call to fact is made) on the stack

saved original value of $a0 on the stack

Page 71: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.77 Haojin Zhu, SJTU, 2015

A Look at the Stack for $a0 = 2, Part 2

$ra

$a0

$v0

$spcaller rt addr

$a0 = 2

10

bk_f

old TOS

$sp$a0 = 1bk_f

bk_f

Stack state after execution of the second encounter with jal (third call to fact routine with $a0 now holding 0) saved return address of

instruction in caller routine (instruction after jal) on the stack

saved previous value of $a0 on the stack

Page 72: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.79 Haojin Zhu, SJTU, 2015

A Look at the Stack for $a0 = 2, Part 3

$ra

$a0

$v0

$spbk_f

$a0 = 1

0

old TOS

$sp$a0 = 0bk_f

bk_f

$a0 = 2caller rt addr

1

$sp

Stack state after execution of the first encounter with the first jr ($v0 initialized to 1) stack pointer updated to

point to third call to fact

Page 73: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.80 Haojin Zhu, SJTU, 2015

A Look at the Stack for $a0 = 2, Part 4

$sp

$ra

$a0

$v0

old TOS Stack state after execution

of the first encounter with the second jr (return from fact routine after updating $v0 to 1 * 1) return address to caller

routine (bk_f in fact routine) restored to $ra from the stack

previous value of $a0 restored from the stack

stack pointer updated to point to second call to fact

Page 74: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.81 Haojin Zhu, SJTU, 2015

A Look at the Stack for $a0 = 2, Part 4

$ra

$a0

$v0

$spbk_f

$a0 = 1

0

old TOS

$a0 = 0bk_f

$a0 = 2caller rt addr

1

$sp

1

bk_f

1 * 1

Stack state after execution of the first encounter with the second jr (return from fact routine after updating $v0 to 1 * 1) return address to caller

routine (bk_f in fact routine) restored to $ra from the stack

previous value of $a0 restored from the stack

stack pointer updated to point to second call to fact

Page 75: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.82 Haojin Zhu, SJTU, 2015

A Look at the Stack for $a0 = 2, Part 5

$sp

$ra

$a0

$v0

old TOS Stack state after

execution of the second encounter with the second jr (return from fact routine after updating $v0 to 2 * 1 * 1) return address to caller

routine (main routine) restored to $ra from the stack

original value of $a0 restored from the stack

stack pointer updated to point to first call to fact

Page 76: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.83 Haojin Zhu, SJTU, 2015

A Look at the Stack for $a0 = 2, Part 5

$ra

$a0

$v0

$spbk_f

$a0 = 1

1

old TOS

$a0 = 0bk_f

$a0 = 2caller rt addr

1 * 1

$sp

2

bk_f

2 * 1 * 1

Stack state after execution of the second encounter with the second jr (return from fact routine after updating $v0 to 2 * 1 * 1) return address to caller

routine (main routine) restored to $ra from the stack

original value of $a0 restored from the stack

stack pointer updated to point to first call to fact

caller rt addr

Page 77: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.84 Haojin Zhu, SJTU, 2015

Optimized Function Convention

To reduce expensive loads and stores from spilling and restoring registers, MIPS divides registers into two categories:

1. Preserved across function call Caller can rely on values being unchanged $ra, $sp, $gp, $fp, “saved registers” $s0- $s7

2. Not preserved across function call Caller cannot rely on values being unchanged Return value registers $v0,$v1, Argument registers

$a0-$a3, “temporary registers” $t0-$t9

Page 78: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.85 Haojin Zhu, SJTU, 2015

Where is the Stack in Memory? MIPS convention Stack starts in high memory and grows down

Hexadecimal (base 16) : 7fff fffchex

MIPS programs (text segment) in low end 0040 0000hex

static data segment (constants and other static variables) above text for static variables

MIPS convention global pointer ($gp) points to static (30 of 32, 2 left! – will see when talk about OS)

Heap above static for data structures that grow and shrink ; grows up to high addresses

Page 79: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.86 Haojin Zhu, SJTU, 2015

MIPS Memory Allocation

05/03/2023 Fall 2012 -- Lecture #7 86

Page 80: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

jal saves PC+1 in %ra

The callee can use temporary registers (%ti) without saving and restoring them

The caller can rely on save registers (%si) without fear of callee changing them

MIPS uses jal to invoke a function andjr to return from a function

Which statement is FALSE?

Page 81: EI209 Chapter 2.1Haojin Zhu, SJTU, 2015 EI 209 Computer Organization Fall 2015 Chapter 2: Instructions: Language of the Computer Haojin Zhu (hjzhu

EI209 Chapter 2.88 Haojin Zhu, SJTU, 2015

Assignment II

Chapter II: 2.4, 2.6, 2.14, 2.20

Submission Deadline: Oct 30