Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu...
-
Upload
emil-singleton -
Category
Documents
-
view
214 -
download
0
Transcript of Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu...
![Page 1: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/1.jpg)
Express Cube Topologies for On-chip
InterconnectsBoris Grot
J. Hestness, S. W. Keckler, O. Mutlu†
The University of Texas at Austin†Carnegie Mellon University
‡Part of this work was performed at Microsoft Research
Feb 17, 2009HPCA ‘09
![Page 2: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/2.jpg)
The Era of Many-core
UTCS 2HPCA ‘09
Intel Larrabee• 16+ cores• Bidirectional
ring interconnect
UT TRIPS• 2x16 exec tiles• 16 NUCA tiles• Multiple networks
Intel Polaris• 80 tiles• 8x10
mesh
Tilera Tile• 64 cores• 5 mesh networks
![Page 3: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/3.jpg)
Networks on a Chip (NOCs) On-chip advantages
No pin constraints Rich wiring resources
On-chip limitations 2D substrates limit implementable topologies Logic area constrains use of wiring resources Energy/power budget caps
Focus Topologies for tomorrow’s many-core CMPs
HPCA ‘09 3UTCS
![Page 4: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/4.jpg)
Outline Introduction Existing topologies Multidrop Express Channels (MECS) Evaluation Generalized Express Cubes Summary
UTCS 4HPCA '09
![Page 5: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/5.jpg)
UTCS 5HPCA '09
2-D Mesh
![Page 6: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/6.jpg)
Pros Low design & layout
complexity Simple, fast routers
Cons Large diameter Energy & latency
impact
UTCS 6HPCA '09
2-D Mesh
![Page 7: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/7.jpg)
Pros Multiple terminals
attached to a router node Fast nearest-neighbor
communication via the crossbar
Hop count reduction proportional to concentration degree
Cons Benefits limited by
crossbar complexity
UTCS 7HPCA '09
Concentration (Balfour & Dally, ICS ‘06)
![Page 8: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/8.jpg)
UTCS 8HPCA '09
Concentration
Side-effects Fewer channels Greater channel width
![Page 9: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/9.jpg)
UTCS 9HPCA ‘09
Replication
CMesh-X2
Benefits Restores bisection
channel count Restores channel width Reduced crossbar
complexity
![Page 10: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/10.jpg)
UTCS 10HPCA '09
Flattened Butterfly (Kim et al., Micro ‘07)
Objectives: Improve connectivity Exploit the wire budget
![Page 11: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/11.jpg)
UTCS 11HPCA '09
Flattened Butterfly (Kim et al., Micro ‘07)
![Page 12: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/12.jpg)
UTCS 12HPCA '09
Flattened Butterfly (Kim et al., Micro ‘07)
![Page 13: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/13.jpg)
UTCS 13HPCA '09
Flattened Butterfly (Kim et al., Micro ‘07)
![Page 14: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/14.jpg)
UTCS 14HPCA '09
Flattened Butterfly (Kim et al., Micro ‘07)
![Page 15: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/15.jpg)
Pros Excellent connectivity Low diameter: 2 hops
Cons High channel count:
k2/2 per row/column Low channel utilization Increased control
(arbitration) complexity
UTCS 15HPCA '09
Flattened Butterfly (Kim et al., Micro ‘07)
![Page 16: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/16.jpg)
UTCS 16HPCA '09
Multidrop Express Channels (MECS)
Objectives: Connectivity More scalable channel
count Better channel
utilization
![Page 17: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/17.jpg)
UTCS 17HPCA '09
Multidrop Express Channels (MECS)
![Page 18: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/18.jpg)
UTCS 18HPCA '09
Multidrop Express Channels (MECS)
![Page 19: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/19.jpg)
UTCS 19HPCA '09
Multidrop Express Channels (MECS)
![Page 20: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/20.jpg)
UTCS 20HPCA '09
Multidrop Express Channels (MECS)
![Page 21: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/21.jpg)
UTCS 21HPCA ‘09
Multidrop Express Channels (MECS)
![Page 22: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/22.jpg)
Pros One-to-many topology Low diameter: 2 hops k channels row/column Asymmetric
Cons Asymmetric Increased control
(arbitration) complexity
UTCS 22HPCA ‘09
Multidrop Express Channels (MECS)
![Page 23: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/23.jpg)
Analytical Comparison
UTCS 23HPCA '09
CMesh FBfly MECS
Network Size 64 256 64 256 64 256
Radix (conctr’d) 4 8 4 8 4 8
Diameter 6 14 2 2 2 2
Channel count 2 2 8 32 4 8
Channel width 576 1152 144 72 288 288
Router inputs 4 4 6 14 6 14
Router outputs 4 4 6 14 4 4
![Page 24: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/24.jpg)
Experimental Methodology
Topologies Mesh, CMesh, CMesh-X2, FBFly, MECS, MECS-X2
Network sizes 64 & 256 terminals
Routing DOR, adaptive
Messages 64 & 576 bits
Synthetic traffic Uniform random, bit complement, transpose, self-similar
PARSECbenchmarks
Blackscholes, Bodytrack, Canneal, Ferret, Fluidanimate, Freqmine, Vip, x264
Full-system config M5 simulator, Alpha ISA, 64 OOO cores
Energy evaluation Orion + CACTI 6
UTCS 24HPCA '09
![Page 25: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/25.jpg)
UTCS 25HPCA '09
64 nodes: Uniform Random
0
10
20
30
40
1 4 7 10 13 16 19 22 25 28 31 34 37 40
Late
ncy
(cyc
les)
injection rate (%)
mesh cmesh cmesh-x2 fbfly mecs mecs-x2
![Page 26: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/26.jpg)
UTCS 26HPCA '09
256 nodes: Uniform Random
0
10
20
30
40
50
60
70
1 4 7 10 13 16 19 22 25
Late
ncy
(cyc
les)
Injection rate (%)
mesh cmesh-x2 fbfly mecs mecs-x2
![Page 27: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/27.jpg)
UTCS 27HPCA '09
Energy (100K pkts, Uniform Random)
0.0
0.2
0.4
0.6
0.8
1.0
1.2
1.4
1.6
1.8
2.0
Ave
rage
pac
ket e
ne
rgy
(nJ) Link Energy Router Energy
64 nodes 256 nodes
![Page 28: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/28.jpg)
UTCS 28HPCA '09
64 Nodes: PARSEC
0
2
4
6
8
10
12
14
16
18
20
0.00
0.05
0.10
0.15
0.20
0.25
0.30
0.35
0.40
Router Energy Link Energy latency
Blackscholes Canneal Vip
Tota
l ne
two
rk E
ne
rgy
(J)
Avg
pac
ket
late
ncy
(cyc
les)
x264
![Page 29: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/29.jpg)
Generalized Express Cubes Low-dimensional k-ary n-cube
n = {1,2} Good fit for planar silicon
Express channels Improve connectivity MECS for better wire utilization
Multiple networks Improve throughput Reduce crossbar area & energy overhead
Hierarchical scaling
UTCS 29HPCA '09
![Page 30: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/30.jpg)
Partitioning: a GEC Example
UTCS 30HPCA '09
MECS
MECS-X2
FlattenedButterfly
PartitionedMECS
![Page 31: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/31.jpg)
Summary MECS
A novel one-to-many topology Good fit for planar substrates Excellent connectivity Effective wire utilization
Generalized Express Cubes Framework & taxonomy for NOC topologies Extension of the k-ary n-cube model Useful for understanding and exploring
on-chip interconnect options Future: expand & formalize
UTCS 31HPCA '09
![Page 32: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/32.jpg)
Summary MECS
A novel one-to-many topology Good fit for planar substrates Excellent connectivity Effective wire utilization
Generalized Express Cubes Framework & taxonomy for NOC topologies Extension of the k-ary n-cube model Useful for understanding and exploring
on-chip interconnect options Future: expand & formalize
UTCS 32HPCA '09
![Page 33: Express Cube Topologies for On-chip Interconnects Boris Grot J. Hestness, S. W. Keckler, O. Mutlu † The University of Texas at Austin † Carnegie Mellon.](https://reader035.fdocuments.in/reader035/viewer/2022062720/56649efc5503460f94c0eca8/html5/thumbnails/33.jpg)
UTCS 33HPCA '09