Instruction Set Architecture simplified DLX – A RISC architecture with only two instruction...
-
date post
20-Dec-2015 -
Category
Documents
-
view
229 -
download
0
Transcript of Instruction Set Architecture simplified DLX – A RISC architecture with only two instruction...
Instruction Set Architecture
• simplified DLX – A RISC architecture with only two instruction formats.
• 32 general purpose registers, each 32 bits wide: R0-R31.
• Register R0 always stores the value of 0.
• Load and Store operations move data between the general purpose registers and
the main memory.
• All instructions are represented by a word = 4 bytes = 32 bits.
• The DLX architecture also has a few special purpose registers mainly used for
handling interrupts. Special move instructions transfer data between general and
special purpose registers.
The RegistersRegisters are clock-enabled flip-flops. Their goal is to store binary data. The registers are divided into two major groups:
General PurposeRegisters
Special Registers
In the SpecialRegisters
environment
In other environments
In the General Purpose
Registers (GPR)environment
The General Purpose RegistersThe General Purpose Registers R0-R31:
•The GPR environment contains 32 registers; each 32 bits long.
• Store data (arguments and results of instructions).
• GPR environment functionality: in every clock cycle:
• may read the values of 2 registers (values appear in A & B).
• may write the value of 1 register (value given via C).
• implemented via a dual-port RAM.
• registers A,B,C are not part of ISA (they are part of datapath).
GPR Env.
A B
C
Responsible for storing information that is essential for running DLX programs.
List of special registers:
1. Program Counter (PC) - stores the address of the current instruction.
2. Instruction Register (IR) - stores the current instruction.
3. More Special Purpose Registers – needed to support interrupt handling.
Datapath registers (not part of ISA):
1. Memory Data Register (MDR) - stores data that is written to the memory or read from the memory. The MDR holds data for store/load instructions.
2. Memory Address Register (MAR) -stores the memory address that is accessed in load/store instructions.
Special Registers
The Instruction Formats
Two formats of instructions
I-Type)Immediate(
R-Type)Register(
Opcode, two registers & a 16-bit
constant
Opcode, three registers & an additional 6-bit
opcode (function)
Opcode RS1 RD Immediate
6 5 5 16
Opcode RS1 RS2 RD Function
6 5 5 5 5 6
The Instruction Formats (Cont.)Let’s see some examples :
1 (001011 00110 01001 0000000000111111
This is an I-type instruction because IR[31:26] is not 000000.
001011 is the opcode of the addi instruction.
00110 is the binary representation of 6.
01001 is the binary representation of 9.
0000000000111111 – 2’s complement representation of 63.
The instruction is:addi R6 R9 63
and in other words:[ R9 ] [R6] + 63
representation: 2’s complement representation is used for the immediate field and for the general purpose registers.
Opcode RS1 RD Immediate
Assembly
Semantics
The Instruction Formats (Cont.)
2 (011010 00011 11001 0000000000000011
Again, This is an I-type instruction because IR[31:26] is not 000000.
011010 is the opcode of the seqi instruction.
00011 is the binary representation of 3.
11001 is the binary representation of 25.
0000000000000011 – 2’s complement representation of 3.
The instruction is:seqi R3 R25 3
and in other words:
Opcode RS1 RD Immediate
Assembly
Semantics [R25] ([R3] = 3) ? 1 : 0
The Instruction Formats (Cont.)
3 (000000 00101 00111 01101 100011
This is a R-type instruction because IR[31:26] is 000000.
000000 is the opcode of the R-Type instructions.
00101 is the binary representation of 5.
100011 is the representation of the add instruction.
The instruction is:add R5 R7 R13
and in other words: [R13] [R5] + [R7]
Assembly
Semantics
Opcode RS1 RS2 RD Function
From the programmer’s point of view (ISA):
DLX is a universal machine that executes programs stored in the main memory written in the DLX instructions set.
What is the DLX?
The DLX Implementation
The general idea:
Fetch
Decode
Execute
Memory Access
Write-Back
IR M[PC]
decode the instruction stored in the IR.prepare the operands from the GPR (if necessary)
A calculation (e.g. add, compare)
Memory access in load/store instructions
Store the result of the operation in thedestination register in the GPR Env.
Incr
emen
t P
C
The Datapath & the Control
Datapath - A collection of functional units, registers and multiplexers connected by buses .
Control – A state machine that manages: operation of functional units, drivers that write to buses, clock-enable signals.
The Control produces signals whose name is: “Control Signals.”
DatapathControl
The Datapath & the Control SignalsAn example of data-flow: C op(A,B)
- control is in state ALU
- ALU inputs are fed from registers A & B. (corresponding drivers are enabled)
- write result in register C. (driver of ALU output is enabled)
- timing : within one clock cycle.
A B
ALU Env.
CCCE Sampling C
CLK
stableX,Y-ALU’s inputs
not stable stableALU’s output
stableCCE
The DLX Commands ImplementationGeneral Remarks:1) Execution of an instruction requires more than one clock cycle.2) Instructions do not have equal running time.
The DLX
The Datapath
• General Purpose Registers • Functional Units (ALU)• Main Memory (interface)• buses
Notes:• Very few Control states.• Even in modern processors: #states < 200.
The Control
Controls the data flow in the datapath.
A reminder: What is the RAM?
RAM = Random Access Memory
The term “Random” means that one may access any word (as oppose to tape in which only next / previous word may be accessed).
The memory is modeled as an array of words. The index of each word
is called the address of the word. M[i] is the word stored in the address of i.
RAM
Write
Adr.
Data in Data out
Input / Output ports:
1A bit indicating if reading or
writing
The functionality is:
Write = 0 (Reading operation): Data out M[<Adr.>]
Write = 1 (Writing operation): M[<Adr.>] Data in
Data out Logical
A reminder: What is the RAM (Cont.)?
In the following slides we will zoom into each of the five types of Control states, describing what happens in each type.
Let’s get deeper
What happens during Fetch state?
DinAdr
Dout
W
Busy
MemoryEnv.
Control
IR Env.
IR
PC
MR Was the mission
completed?
Sample!1) The PC register contains the address.
2) The desired functionality: IR M[PC]
FetchThe Control usually stays in fetch state
for more than one clock cycle
3) The memory is very slow. It announces: “I’m busy” after getting a read / write request. When busy, no new requests are allowed. As Dout stablizes, the memory announces: “I’m done”. Only then the control asks the IR Env. to sample Dout’s value and update the IR register.
The string dealt with is the one located in the IR register and may be of two types:
1) I-Type: Opcode RS1 RD Immediate
6 5 5 16
Determine:• I-Type?• Next state.
General PurposeRegisters (GPR)
Env.A B
A operand)Copy of RS1(
Adr. 1
Read!
Sign-Extension
The constant )sign-extended(
What happens during Decode state?
2) R-Type: Opcode RS1 RS2 RD Function
6 5 5 5 5 6
Not used
Determine:1) R-Type?2) Next state.
General PurposeRegisters (GPR)
Env.A B
A operand)Copy of RS1(
Adr. 1
Adr. 2
B operand)Copy of RS2(
Read!
What happens during Decode state (Cont.)?
A continuation
of the Opcode field.
Notes:
1) IR[25:21] represent RS1.
IR[20:16] represent RS2.
2) The GPR Env. Is able to perform two read operations at the same clock period (Dual Port RAM).
3) The PC register is advanced in the decode stage: PC PC+1. This is done in order to avoid the use of another Control state and because this increment can be done in the decode stage (buses and ALU are free).
What happens during Decode state (Cont.)?
Opcode RS1 RS2 RD Function
Some kinds of instructions:
1) Instructions which use the ALU (Arithmetic instructions, Logic instructions and Test&Set instructions):
A reminder:
RD RS1 + Sext(Imm) RD RS1 + RS2
RD (RS1 > Sext(Imm) ? 1 : 0) RD (RS1 > RS2 ? 1 : 0)
R-TypeI-Type
2 operands: The first is always RS1 and the second is either RS2 or Sext(Imm).
What happens during Execute state?
GPR Env.
A B
C
Cce IR Env.
0 1 ITYPE
ALU Env. ALU Control signals
Result
A. Arithmetic/Logic instructions:
What happens during Execute state (Cont.)?
Sext (Imm.)
B. Test&Set instructions:
Sgri: RD (RS1 > Sext(Imm) ? 1 : 0)
GPR Env.
A B
C
Cce IR Env.
0 1 ITYPE
ALU Env. TEST
Result
Sext (Imm.)
Two stages:1) executeC (RS1 > RS2 ? 1:0)
2 (writebackRD CIn this way the period
time is shorter .
What happens during Execute state (Cont.)?
Always 0 / 1
2) Shift instructions:
These instructions are always in R-Type structure.
Slli: RD RS1 << 1
Srli: RD RS1 >> 1
The word to be shifted is
stored in RS1
The shift amount is
1
GPR Env.
A B
C
Cce
Shifter RIGHT
Result
1From The PC Env.
IR Env.
Accordingto F
unc.
What happens during Execute state (Cont.)?
Generally, the “1” constant may be any
other constant and the shift may be of more
than one place
3) Jump instructions:
A. Unconditional jump: Jump Reg (jr): PC A
What happens during Execute state (Cont.)?
No direct path from A to PC. Instead:
GPR Env.
AIR Env.
0
ALU Env. ADD
PC Env.
PC Cce
B. Branch: “Jump only if a condition is satisfied”
beqz: PC PC+1+(RS1=0 ? Sext (Imm.) : 0)
bnez: PC PC+1+(RS1=0 ? 0 : Sext (Imm.))
We’ll demonstrate the beqz instruction. Two states are needed:
State I - Branch: Check the condition – RS1=0 ? Sext (Imm.) : 0
GPR Env.
AIR Env.
0
ALU Env. TEST
Comparison result
To the Control
What happens during Execute state (Cont.)?
The next state
BTakenFetch
State II - BTaken: Calculating the jump – PC PC+1+Sext(Imm)
IR Env.
ALU Env.
PC Env.
PC PCce
PCSext (Imm.)
ADD
What happens during Execute state (Cont.)?
JumpNo
Jum
p
C. Calling a routine: ”Jump, remember your address so it will be possible to get back to this address”
jalr: R31 PC+1
PC RS1
We use two Control states in order to execute this instruction because we want to avoid a collision in the buses (to be elaborated).
What happens during Execute state (Cont.)?
State I: Copying the PC (2 clock cycles)
GPR Env.
C
PC Env.
PC Cce
Clock cycle 1:
GPR Env.
C
R31
Clock cycle 2 (like in the Write-Back state):
Write!
The address=31
What happens during Execute state (Cont.)?
State II: Calculating the jump address
PC Env.
PC
ALU Env.
IR Env.0
GPR Env.
A
ADD
Cce
What happens during Execute state (Cont.)?
Reading From Memory - Load
Load Word (lw): RD M(Sext(imm.) + RS1)
Four states are needed for finishing the load instruction:
State I: Effective Address Computation – MAR A+C0
GPR Env.
A
IR Env.
ALU Env.
MAR
C0
ADD
Sext(imm) + RS1
Sext(imm)
State II: Memory Access (Load). This state lasts till the value is returned – MDR M(MAR)
Reading From Memory – Load (Cont.)
Adr
Dout W
MemoryEnv.
MAR
Read!
MDR
Notes:
1) Busy signal informs the Control when the operation is over (Dout is stable).
2) MDR samples Dout every clock cycle, no need to compute CE (Simplifies Control).
3) Dout must be logical even if its value is incorrect!
Busy
Tells the Control when readung is over
State III: Writing MDR’s value to C in the GPR – C MDR
Reading From Memory – Load (Cont.)
MDR
GPR Env.
C
Cce
State IV: Write-Back: RD C
Writing To Memory - Store
Store Word (sw): M(Sext(imm.) + RS1) RD
Three states are needed for finishing the store instruction:
State I: Effective Address Computation – MAR A+C0
GPR Env.
A
IR Env.
ALU Env.
MAR
C0
ADD
Sext(imm.) + RS1
Sext(imm.)
GPR Env.
B
Writing To Memory – Store (Cont.)
MDR
State II: Copying the B register’s (this is RD) value to the MDR – MDR B
State III: Memory Access (Store) – M(MAR) MDR
Adr
W
MemoryEnv. MAR
Write!
MDRDin
Busy
Tells the Control when writing is over
Write-BackThe Write-Back stage occur in the following instruction types:
Arithmetic Logic Shift
Test & Set Load
In this stage, the following happens: RD C
GPR Env.
C
R0-31
Write!
RD
RD’s value is determined according to a I-Type/R-Type structure. This is the reason for
having two different Write-Back states (WBI & WBR).