Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 1 of 56 – Prof. Dr. Uwe...

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 1 of 56 – Prof. Dr. Uwe Brinkschulte, Prof. Dr. Klaus Waldschmidt

Part 7

Instruction Set Architecture(ISA)

Computer Architecture

Slide Sets

WS 2011/2012

Prof. Dr. Uwe BrinkschulteProf. Dr. Klaus Waldschmidt

Programming model

The Instruction Set Architecture (ISA) is the programming model

which is needed for programming a processor.

All details concerning the implementation of the processor are out of

focus in the ISA.

Therefore the ISA can be regarded as an abstract interface between

the compiler and the microarchitecture of the processor.

Programming model

The following key questions lead us to the specification of this

interface:

• How data is represented?

• Where data is stored?

• How data is accessed?

• How instructions are coded?

• Which instructions are available to process data?

Programming model

Therefore, the ISA defines:

• machine data types

• address space organisation

• register model

• addressing modes

• machine instruction set

Programming model

Since the programming model abstracts from implementation details it is

realized either in hardware (real processors) or in software (virtual

processors).

For instance, if the instruction set includes an instruction for multiplication,

the CPU of the processor needs a digital combinatorial circuit for

multiplication.

In this sense, a relation between the abstract ISA and the

microarchitecture exists.

Machine data types

A data type is a tuple of values and operations which can be

performed on these values.

The operations are implemented by the machine instructions.

Machine data types (like data types in high level languages)

are classified into structured and unstructured data types.

An additional class are the primitive data types.

Primitive machine data types

Bit: value set: 0,1operations: AND, OR, XOR, negation, compare

Byte: value set: bit pattern (8 bit)normally smallest addressable unit operations: same as for bit, additionally ADD, SUB, MUL, DIV, SHIFT, ROTATE, …

Word: value set: normally a multiple of byteslargest addressable unit (in a single operation)operations: same as for byte

(sometimes the following convention is used: Half-Word = 16 BitWord = 32 BitDouble Word = 64 Bit)

Examples for more data types

- vector (bit)

- BCD number (binary coded decimal)

- Binary number unsigned

- two complement number

- floating point number

- string

n-1 i 0

n = 8,16,32

0 3 2 1 0

7 6 5 4 3 2 1 0

8, 16 Bit

32 Bit31 0

7 0 15 0

n-1 0n = 8, 16, 32

MSB LSB

MSB=sign bit LSB

n = 8, 16, 32

biased.expon.s fraction

... n = 8, 16, 32n-1 0 n-1 0 n-1 0

31 23 22 0

(taken from MC680x0)

Address space organisation

Physical organisation:

depends on the processor

8 bit processor

15 8 7 0

16 bit processor

. . . . . .

31 24 23 16 15 8 7 0

32 bit processor

. . . . . . . . .

n: physical address, n = 2 address bus width

Locical organisation:

byte oriented access for most processor types

physical word on a 8 bit processorphysical word on a 16 bit processorphysical word on a 32 bit processor

m: logical address, m = n * bit width / 8

Physical to logical mapping:

31 24 23 16 15 8 7 0

. . . . . . . . .

456789101112131415

m-3m-2m-1m

physical address

locical address

Aligned access: the accessed word is aligned according to its length in the physical address space

(logical adress mod length) = 0

31 24 23 16 15 8 7 0

. . . . . . . . .

bytebytebytebyte bytes to byte boundarieshalf-wordhalf-word half-words to half-word boundaries

word words to word boundaries

31 24 23 16 15 8 7 0

. . . . . . . . .

half-word

Unaligned (misaligned) access: the accessed word is not aligned according to its length in the physical address space

(logical adress mod length) 0

Some processors do not support unaligned access (e.g. SPARC)

Byte order in words

8 Bit - byte

16 Bit - word

32 Bit - word

8 Bit - byte

16 Bit - word

32 Bit - word

N + 1N N + 2 N + 3

N + 1N

N + 1 NN + 2N + 3

N + 1 N

big endian byte ordering

little endian byte ordering

31 24 23 16 15 8 7 0

Two different formats:

N: least significant byte, N + 3: most significant byte

Word address is the address of the most significant byte (used e.g. in MC680x0 or SPARC)

Word address is the address of the least significant byte (used e.g. in Pentium family)

Byte order in words

N + 2N + 3big endian byte ordering

Locical (byte oriented) memory organization of a 32 bit word

byte address

N + 1Nlittle endian byte ordering b

byte address

Register model

The number of registers being part of a processor varies between 20

and 200. The advantage of data storage in registers against DRAM

or SRAM-memories are:

• faster access time

• register addresses could be shorter with respect to the instruction

format.

An ISA is called Load-Store-ISA if all machine instructions except

register load and store instructions operate on the register file only.

Registers are classified into hidden registers and programmer visible

registers.

The visible registers are the workplace of the programmer and are

often organized as register files.

Hidden registers are supply registers needed for the internal

functionality of the processing unit (CPU).

Both visible and hidden registers are designed for various purpose

and functionality.

Register model

A register model defines which processor registers are visible

(addressable) to the programmer.

Usually these are the working registers and the state register.

The state register monitors the state of the processor through

conditional flags.

It shows for example whether the processor operates in system or user

The state register is mostly read-only

Commonly existing hidden registers are the instruction register and the

memory interface registers.

Register model

Register implementation

........

32 bit registerwith D-Latches

Asynchronouscounter withD-Latches

clk D Q

Q0 Q2Q1 Q3

Symbol

• program counter (PC) - contains the next instruction address

• state register (SR) - monitors the state of the processor

• stackpointer (SP) - stores the top of the stack

• accumulator (ACCU) – stores computation results (in older or simple

processors)

• data registers (DXi) - storing operands for computations

• address registers (AXi) - storing operand addresses

• general purpose registers (GPi) - storing either operands or operand

addresses

Common visible registers

• instruction register (IR) – contains the currently processed instruction

• instruction queue (IQ) - contains the next instructions to be processed

• memory address register (MAR) - buffers the address of a

memory

access (e.g. to save or load

a general purpose register)

• memory data register (MDR) - buffers the content of a memory access

(e.g. to save or load a general

purpose register)

Common hidden registers

Program counter register

Pointer to the next instruction to be executed

Normally incremented

Set by a jump, jump subroutine, interrupt, return or return from interrupt instruction

Program counter

……

Stackpointer register

Addresses a location in the memory which is organized as a stack (LIFO).

Elements can be pushed (write) and popped (read) only from the top of the stack.

Consequence: Data are stored in a subsequent order

Used e.g. for jump subroutine/return operation on PC

StackpointerN - 4

N + 4…

Push X

Some processors distinguish between user stackpointer (e.g. for jump subroutine/return) and supervisor stackpointer (e.g. for interrupt/return from interrupt)

Sampe CISC register set

Intel Pentium

Sampe RISC register set

Power PC (extract)

The register file of RISC processors has to be much

bigger compared to CISC processors.

A RISC needs more registers, because the register

file is source and destination of all arithmetic or logic

instructions.

Multiple register sets

Register set n

Program counter content

State register content

Data-/address-register 1

Data-/address-register 2

Data-/address-register m

Register set 3

Register set 2

Register set 1

register set structure

Processors with multiple register sets: a step towards multithreaded processors

Processor with multiple register sets:

Each register set can store the program counter (PC) and the state register (SR)

PC and SR exist only once

=> several contexts can be stored, fast context switching

Multithreaded processor:

multiple PCs and SRs exist

instructions from several threads can be executed at the same time in the pipeline

=> several contexts can be processed

Multiple register sets

The registers of a register file are grouped into blocks called windows.

These overlapping windows are used by the subroutines of a program.

MORS (multiple overlapping register set)

Multiple overlapping register sets,register windows

Overall register set

Register window 1

Register window 2

Register window 3

Register window n

jump subroutine

return

Simplifies parameter passing on jumping to subroutines

Each subroutine has its own working space within the register file

Parameters can be directly passed with no need to copy registers or

pass parameters by memory

=> mainly used in RISC processors

Two possible approaches:

• Fixed size register window

• Variable size register window

Multiple overlapping register sets,register windows

precedingwindow

continued

global registers

current window

succeedingwindow

restore

alternative register naming:

r31 = i7

r24 = i0r23 = I7

r16 = I0r15 = o7

r8 = o0r7 = g7

r0 = g0

based on SPARC architecture

r24r23

r16r15

In i+1

Local i+1

Out i+1 In i

Local i

Out i-1

Local i-1

In i-1

Fixed size window

local register

In case of the SPARC architecture, a window consists of 32 registers of which the first 8 also belong to the preceding window and the last 8 also belong to the succeeding window.

The registers are addressed relative to the current window pointer (CWP).

A subroutine call is performed by incrementing the CWP and saving the PC.

The parameters are passed through the overlapping registers of the two windows.

The content of the program counter is saved (return address) into one of these registers.

A time consuming save and reload of registers is omitted.

In case of an overflow of the MORS the window contents have to be saved to a stack.

Fixed size window

preceding

currentwindow

globalregisters

localregisters

0 previous RSP

current RSP

Variable size window

gr0gr1

register stack pointer (RSP)

r64 based on AMD 29000 architecture

Register size of processors with 3-address architecture

processor/architecture (vendor)

# of general purpose registers bit width

overalldirectlyaccessible

register width

registeraddress

immediateoperands

instr.

Alpha 21364 (Compaq) 32 32 64 Bit 5 Bit 8 Bit 32 BitAm29000 (AMD) 192 192 32 Bit 8 Bit 8 Bit 32 BitARM7TDMI (ARM) 16 16 32 Bit 4 Bit 8 Bit 32 BitCrusoe TM5800 (Transmeta) 64 64 32 Bit 6 Bit - -pa-8700 (HP) 32 32 64 Bit 5 Bit 11 Bit 32 BitItanium 2 (Intel, HP) 128 128 64 Bit 7 Bit 8 Bit 41 BitMC88100 (Motorola) 32 32 32 Bit 5 Bit 16 Bit 32 BitMIPS65 20Kc (MIPS) 32 32 64 Bit 5 Bit 16 Bit 32 BitNemesis C (TU Berlin) 96 16 32 Bit 4 Bit 1 Bit 16 BitPowerPC 970 (IBM) 32 32 64 Bit 5 Bit 16 Bit 32 BitUltraSPARC III Cu (SUN) 160 32 64 Bit 5 Bit 13 Bit 32 Bit

Register size of processors with 2-address architecture

processor (vendor)

# of general purpose registers bit width

overalldirectly accessible

register width

register address

immediate operands

smallest instr.

Athlon (AMD X86-64) 16 16 64 Bit 4 Bit 8 - 32 Bit 8 Bit

ColdFire MFC5206 (Motorola) 8 + 8 8 + 8 32 Bit 3 Bit 8 - 32 Bit 16 Bit

MC680xx (Motorola) 8 + 8 8 + 8 32 Bit 3 Bit 8 - 32 Bit 16 Bit

Pentium X (Intel X86) 8 8 32 Bit 3 Bit 8 - 32 Bit 8 Bit

Addressing modes

Machine instructions normally hold information about the operand

addresses.

This can either be a physical address, e.g. a register number or the

address of a memory location, or it can be an address specification.

An address specification defines how to calculate the address.

Thus, the address information determines the location of the

operand(s) belonging to the instruction using one of many addressing

modes.

Addressing modes

Instruction format

e.g. arithmetic instruction

opcode

target source source

operands needed for the execution defined by the opcode

operand register memory address specificationitself number location (dynamic address calculation)

The result of the dynamic address calculation is called effective address

• immediate:

The operand is part of the instruction.

• memory direct and register direct:

The instruction contains the operand address.

• register indirect:

The instruction contains a register number pointing to a register

holding the address of the operand.

In assembler code this addressing mode is typically denoted by

register name

Addressing modes

• memory indirect:

A register addressed in the instruction contains the address of a

memory cell which holds the operand address.

• register offset:

The instruction contains a register number and an offset.

The operand address is the sum of the register’s content and the offset.

• implicit:

The instruction implicitly targets a single register (like the ACCU)

Addressing modes

Reasons for using dynamic address calculation:

• Addresses of data structure elements are composed of the

address of the data structure and the offset of the element to the

beginning. Often this offset is unknown at compile time, therefore

the effective address has to be calculated at runtime.

• Repeated execution of the same instruction, e.g. in a loop,

often accesses successive memory addresses which have to be

calculated at runtime.

Effective address

The address is calculated from several parts found in the instruction and in registers or memory cells at runtime (dynamic address calculation).

The calculated address is defined as effective address.

• An operand address often is unknown at compile time,

because it is calculated during program execution.

• The partitioning of addresses into a base address stored

register and an offset simplifies the handling of shift able

variables and shift able program code.

Effective address (cont.)

o p e r a n d

immediate

instruction

register

memory

operand

eff. address

eff. address register direct

memory direct

Addressing modes 1

e.g. LOAD 8, r1

e.g. LOAD (2000), r1

e.g. LOAD r2, r1

o p e r a n d

register indirectinstruction

register

instruction

registeraddress

e f f e c t i v e a d d r e s s

registeraddress

memory

m e m o r y a d d r e s s

decrement

memory

- eff. address

register

register indirectwith predecrement

Addressing modes 2

e.g. LOAD (r2), r1

e.g. LOAD -(r2), r1

register indirect with displacement(indexed)

o p e r a n d

instruction

register

registeraddress

memory

+ i n d e xregister

displacement

scaling 1, 2 or 4

eff. address

Addressing modes 3

e.g. LOAD.B 126(r3)(r2), r1

memory indirect

o p e r a n d

instruction

register

registeraddress

memory

displacement1

indirect memory address

displacement2eff. address

memory

Addressing modes 4

e.g. LOAD 28(126(r2)), r1

memory indirect(post indexed)

o p e r a n d

instruction

register

registeraddress

memory

displacement1

displacement2

eff. address

memory

i n d e x

scaling 1, 2 or 4

+register

Addressing modes 5

e.g. LOAD.B 28(r3)(126(r2)), r1

memory indirect(preindexed)

o p e r a n d

instruction

register

registeraddress

memory

displacement1

displacement2eff. address

memory

i n d e x

scaling 1, 2 or 4

+register

Addressing modes 6

e.g. LOAD.B 28(126(r3)(r2)), r1

branch target table accessthrough program counter relative addressing

JMP disp (PC)(rn)

memory

target 0

target 2

target 1

• • •

i n d e x

displacement

Access to branch target table by PC relative addressing

Machine instruction set

The machine instruction set of a computer normally includes

instructions of different formats, e.g. 0-address instructions, 1-address

instructions, 2-address instructions and 3-address instructions.

An instruction is divided into so called fields.

The more address fields an instruction contains the smaller the

number of addressable memory cells and/or the number of operations

encoded in the opcode field becomes

(if we assume a constant instruction length).

Variable length vs. constant length instruction format

Variable length: (e.g. 16 - 256 Bit) mostly used in CISC architectures

+ flexible instruction format

+ high code density

+ long immediate and displacement values

Constant length: (e.g. 32 Bit) mostly used in RISC architectures

+ simple and fast fetch

+ simple and fast decode

+ simplified pipelining

Scheme of basic operations of common processors

basic operations

unconditional operations conditional operations

combinatorial operations control flow operations

transportoperations

arithmetic logicoperations

simple branches system branches

load operations

store operations

semaphore operations

arithmetic operations

logic and shift operations

state and control operations

subroutine branches

call call

return return

Instruction sets are divided into groups combining instructions with similar functionality:

Typical instruction groups:

• transport instructions

• arithmetic instructions

• logic instructions

• shift and rotate instructions

• bitwise instructions

• string and array instructions

• branch instructions

• system instructions

• synchronization instructions

Instruction classes

Load store architecture

All instructions - except load and store instructions - address registers only.

Load and store instructions are needed to transfer data to and from main memory.

Mainly used in RISC ISA, combined with pipelining it allows to complete most instructions in one cycle

Furthermore, the address fields of instructions becomes shorter as they only have to address a register instead of a memory address.

A load store ISA accelerates a machine if there are only small caches or if the caches are completely missing and a big register file is available.

Example: An arithmetic instruction

SUBc r3, r7, r21binary code 11010 10101 00111 00011

1 0000000000hexcode D54E3800

31 26 21 16 11 0

instruction format:

OP: opcodeTR: target registerSRn: source registerc/x: set/do not set condition code

Example: A store instruction

STORE r24, 126(r5)binary code 00111 11000 00101

00000000001111110hexcode 3E0A007E

31 26 21 16 0

instruction format:

OP: opcodeSR: source registerBR: base registerDP: displacement (signed)

Two examples for an instruction format

OP TR SR1 SR2 OP SR BRcx DP

State register of a RISC processor (based on SPARC-architecture)

N Z V C

31 16 15 0

PS S CWP SR

interruptmask

interrupt enable

previous S-bit

supervisor/user

current windowpointer

overflow

negative

conditional bits

Conditional codes dependent on conditional bits Z (zero),N (negative), C (carry) und V (overflow). Mnemonics according toMotorolas ColdFire MFC5206 processor.conditional value mnemonic operation expression operand typeequal

not equal

Z independent

higher than

higher than or same

lower than

lower than or same

unsigned

greater than

greater than or equal

less than

less than or equal

(N = V) Z

(N = V)

(N ≠ V)

(N ≠ V) Z

signed

arithmetic overflow

arithmetic shortfall

negative

positive

signed

Multimedia instructions

Typical SIMD instructions to process a single operation on a set of

data (e.g. changing the brightness of image pixels)

Operations can be on packed integers (e.g. MMX on Pentium) or

packed floats (e.g. SSE2 on Pentium)

Typical operations: arithmetic (saturated or overflow), logic, compare,

pack, unpack

Example:

Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 1 of 56 – Prof. Dr. Uwe...

Documents

Transcript of Hier wird Wissen Wirklichkeit Computer Architecture – Part 7 – page 1 of 56 – Prof. Dr. Uwe...

Hier wird Wissen Wirklichkeit 1 Black holes and "remnants" at the LHC Benjamin Koch FIGSS/University of Frankfurt.

Cloud Computing Services Seminar Cloud Computing – Hype oder Wirklichkeit? im WS 2011/2012 Steffi Haag 24. Januar 2012.

1. INTERNATIONAL COMPUTER GAME COMPUTERSPIELE … · 1. INTERNATIONAL COMPUTER GAME CONFERENCE COLOGNE 22.–24.3.2006 COMPUTERSPIELE UND SOZIALE WIRKLICHKEIT COMPUTER GAMES AND SOCIAL

Elektromagnetische Kammermusik - grrrr.org · PDF fileVioline, des Ludwig-Drumkits ... unter anderem Paul Hindemith, ... werden Teile der klanglichen Wirklichkeit „fotorealistisch“

Recent results in Mathematics related to Data Transmission Michel Waldschmidt Université P. et M. Curie - Paris VI miw/ Lahore,

Wie wirklich ist die Wirklichkeit? - christl-meyer-science.netchristl-meyer-science.net/images/PDFs/AIDS_Wie wirklich.pdf · Christl Meyer 7 Ein kurzer historischer Abriß der Geschichte

The World according to GNARP - COnnecting REpositoriesDr. Volker Michel (UB Frankfurt/M.) Hier wird Wissen Wirklichkeit 6th Frankfurt Symposium 2 Virtuelle Fachbibliotheken … sind

Kathmandu , TU, September 21, 2010 Michel Waldschmidt, Professor ,

Brad Waldschmidt, P.E., PTOE · Brad Waldschmidt Kimley-Horn 615.564.2701 Brad.Waldschmidt@kimley-horn.com Brad Freeze TDOT Traffic Operations 615.741.5017 Phillip.B.Freeze@tn.gov

Was ist Informationsdesign? - bilder.buecher.de · »Wenn das Cartesianische ›cogito ergo sum‹ die Wirklichkeit unserer Selbst erfahrung revolutioniert hat, so Tschernobyl die

Ali Nassiri and Geoff Waldschmidt Accelerator System Division Advanced Photon Source

Computer Architecture - uni-frankfurt.de€¦ · Computer Architecture – Part 3 – page 3 of 55 – Prof. Dr. Uwe Brinkschulte, M.Sc. Benjamin Betting Hier wird Wissen Wirklichkeit

Waldschmidt, Anne kulturelles Modell von ... - ssoar.info · Entdeckung von Körper, Subjekt und Identität als historische und kulturell geformte Phänomene, die Problematisierung

Geoff Waldschmidt RF Engineer ASD / RF DOE Lehman CD-2 Review of APS-Upgrade 4-6 Decem ber 2012

An elementary introduction to error correcting codes michel.waldschmidt// Michel Waldschmidt Université P. et M. Curie - Paris.

Hier wird Wissen Wirklichkeit Computer Architecture – Part 6 – page 1 of 22 – Prof. Dr. Uwe Brinkschulte, M.Sc. Benjamin Betting Part 6 Fundamentals in.

Computer Architecture - es.cs.uni-frankfurt.de · Computer Architecture – Part 1 – page 3 of 27 – Prof. Dr. Uwe Brinkschulte, M.Sc. Benjamin Betting Hier wird Wissen Wirklichkeit

Data transmission, cryptography and arithmetic miw/ October 7, 2008 Michel Waldschmidt Université P. et M. Curie - Paris VI.

INDE ET ASIE DE L'OUEST Michel WALDSCHMIDT , Université de Paris 6

Uwe Brinkschulte, Jürgen Becker University of Karlsruhe, Germany Theo Ungerer