Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit...
Transcript of Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit...
![Page 1: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/1.jpg)
Design and Allocation of Loosely Coupled Multi-bit Flip-flops
for Power Reduction in Post-Placement Optimization
Hyoungseok Moon & Taewhan Kim
Seoul National University
![Page 2: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/2.jpg)
Outline
2
Introduction
Multi-bit flip-flop and related works
New Style of Multi-bit Flip-flops
Allocation Algorithm
Minimizing power consumption
Awareness of clock network
Experimental Results
Conclusion
![Page 3: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/3.jpg)
So Many Flip-flops in SoC
3
ac_97
![Page 4: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/4.jpg)
Conventional Structure of Multi-bit Flip-flops
4
(a) Two 1-bit flip-flops (b) 2-bit flip-flop
14% lower power4% less area
Reduced # of clock sinks
![Page 5: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/5.jpg)
Related Works and their Limitations
5
Placement
Routing
Clock Tree Synthesis
RTL Synthesis
Kretchmer, SNUG’99Kretchmer, SNUG’99
Hsu, ICCAD’13Hsu, ICCAD’13
Yan, ICGCS’10
Chang, ICCAD’10
Jiang, TCAD’12
Wang, TCAD’12
Chen, Integration’13
Shyu, TVLSI’13
Yan, ICGCS’10
Chang, ICCAD’10
Jiang, TCAD’12
Wang, TCAD’12
Chen, Integration’13
Shyu, TVLSI’13
Hard to estimate timing/placement
1. Merging FFs to allocate MBFFs2. Placing MBFFs
Constraints1. Timing2. Space3. Routability
Pre-placement
In-placement
Post-placement
![Page 6: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/6.jpg)
Fixed Placement of Example Circuit
6
![Page 7: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/7.jpg)
Conventional MBFF Allocation
7
Timing
Area
![Page 8: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/8.jpg)
New MBFF Allocation
8
![Page 9: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/9.jpg)
Structure
Flip-flops are merged via “Sharing nets”
Loosely Coupled Multi-bit Flip-flop
9
(a) 2-bit LC-MBFF (b) 3-bit LC-MBFF
![Page 10: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/10.jpg)
Implementation of LC-MBFF
10
The shorter, the better
(a) (b) (c)
The Shortest!
![Page 11: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/11.jpg)
Feasibility Analyses of LC-MBFF
11
2-bit & 3-bit LC-MBFF libraries implemented
Wires for clock sharing net are modeled with PTM interconnect structure and 45nm Open Cell Library
HSPICE simulations with 500MHz operating clock
Dimension Value
Width 0.08um
Space 0.08um
Thickness 0.20um
Height 0.20um
Length (δ) 4um ~ 30um
![Page 12: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/12.jpg)
Time Delay of LC-MBFF
12
Clock skew is very negligible
![Page 13: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/13.jpg)
Power Consumption
13
More power saving with closer flip-flops
(a) 2-bit LC-MBFF (b) 3-bit LC-MBFF
![Page 14: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/14.jpg)
LC-MBFF Allocation Rules
14
Flip-flops in a close distance
Flip-flops in the same level of clock tree
Simpler resulting clock tree
Routability of sharing nets
![Page 15: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/15.jpg)
LC-MBFF Allocation Flow
15
![Page 16: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/16.jpg)
Generate Merging Graph
16
![Page 17: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/17.jpg)
Select & Merge ‘Best’ Flip-flops
17
![Page 18: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/18.jpg)
Iterate if Any Edge Left
18
![Page 19: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/19.jpg)
Example after the Final Update
19
![Page 20: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/20.jpg)
Experimental Setup
20
Algorithm implementation
C++ & GCC on Intel 64-bit 2.6GHz machine
Physical design environment
45nm Open Cell Library & PTM interconnect structure
Synopsys Design Compiler & IC Compiler
11 ISCAS89 & IWLS2005 benchmark circuits
500MHz operating clock frequency
set as 30um and as 20um
![Page 21: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/21.jpg)
Experimental Results
21
3.13% more power saving on average
With less than 1% interconnect area
[7] Z.-W. Chen and J.-T. Yan, “Routability-constrained multi-bit flip-flop construction for clock power reduction,” Integration, the VLSI Journal, Jun. 2013.
![Page 22: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/22.jpg)
Distribution of LC-MBFFs after Allocation
More high-bit flip-flops
Little 1-bit flip-flops
34% less # of flip-flops Reduced # of clock sinks
![Page 23: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/23.jpg)
Circuit
Total
Wire
Length
Added
Wire
Length
Added WL
/Total WL
Total
Wire
Area
Total
Circuit
Area
Impacted
Area
s1423 5238 386 7.4% 200.12 1064 1.4%
s15850 8761 502 5.7% 237.01 1549 0.9%
s5378 12844 752 5.9% 415.13 2168 1.1%
s13207 17052 1288 7.6% 529.46 3648 1.1%
s38584 84200 5254 6.2% 3075.19 16266 1.2%
s38417 101362 6820 6.7% 3605.81 19710 1.2%
s35932 86014 7756 9.0% 3508.73 22049 1.4%
AES 32882 1636 5.0% 540.15 8893 0.3%
AC97 28321 1334 4.7% 878.29 6368 0.6%
ETHNET 2325506 38688 1.7% 21920.98 121539 0.3%
DES3 2351961 42130 1.8% 34495.96 160824 0.4%
5.60% 0.91%
Impact of Additional Wire Length
23
With simplified clock tree, the actual area impact is much less
![Page 24: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/24.jpg)
Conclusion
24
Loosely-coupled Multi-bit Flip-flop
New structure of multi-bit flip-flop
No timing/area constraints
LC-MBFF Allocation Algorithm
Considering power, clock tree & routability
More clock power saving
3.13% more clock power saving than the existing work
![Page 25: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/25.jpg)
Thank you for your attention!
25
![Page 26: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/26.jpg)
![Page 27: Design and Allocation of Loosely Coupled Multi-bit Flip ...Feasibility Analyses of LC-MBFF 11 2-bit & 3-bit LC-MBFF libraries implemented Wires for clock sharing net are modeled with](https://reader036.fdocuments.in/reader036/viewer/2022071417/6114a6f9496440678a29243c/html5/thumbnails/27.jpg)
A1. Merging Distance Limit
27
set as 30um and as 20um
D2-bit
D3-bit
D2-bit
D3-bit