Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title:...

19
Collaborate. Differentiate. Win. Carlos Macián, Ph.D. Counterintuitive Intelligence: Doing More to Cost Less A TCO Story SR DIRECTOR AI STRATEGY & PRODUCTS SEPTEMBER 17, 2019

Transcript of Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title:...

Page 1: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

Collaborate. Differentiate. Win.

Carlos Macián, Ph.D.

Counterintuitive Intelligence:

Doing More to Cost Less

A TCO Story

SR DIRECTOR AI STRATEGY & PRODUCTS

SEPTEMBER 17, 2019

Page 2: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

2Copyright 2019 eSilicon

Agenda

• Of architects and masons

• AI challenges in the data center

• Maturing enabling technologies

• 3D IC applied to memory overlays

• Conclusions

Counterintuitive Intelligence: A TCO Story

Page 3: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

3Copyright 2019 eSilicon

Of Architects and Masons

Page 4: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

4Copyright 2019 eSilicon

Of Architects and Masons

Page 5: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

5Copyright 2019 eSilicon

AI challenges are networking/HPC’s déjà vu all over again…

• Real estate and yield

• Memory size, latency and

bandwidth

• Chip-to-chip interface density

and latency

• Power and energy efficiency

• … times hyperscale!

AI in the Data Center

C

D

Page 6: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

6Copyright 2019 eSilicon

AI in the Data Center

AI challenges are networking/HPC’s déjà vu all over again…

Page 7: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

7Copyright 2019 eSilicon

AI in the Data Center

AI challenges are networking/HPC’s déjà vu all over again…

Page 8: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

8Copyright 2019 eSilicon

Challenges vs. Enabling Technologies

Challenges Enabling technologies

Real estate Multi-die, chiplets, 3D stacks

Memory size, latency and bandwidth3D memory overlays

New memory technologies

Chip-to-chip (C2C) and die-to-die (D2D)

interface density and latency

XSR on substrate

AIB/HBI+ (high-bandwidth interface) on

EMIB/HDMI

Power and energy efficiency In- and near-memory computing

Page 9: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

9Copyright 2019 eSilicon

ASIC, Enabling Tech and Hyperscale: A TCO story

Targeted performance with energy efficiency

Push the Pareto curve outwards

Avalanche effect on TCO

Page 10: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

10Copyright 2019 eSilicon

How is this going to cost less?

• Higher yield

• Heterogenous processes

• Cheaper packaging, assembly and manufacturing

• Minimize overhead

• Higher energy efficiency at all levels of the system

A TCO Story

Build highly efficient, cost-effective ASICs to optimize

your AI architecture

Page 11: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

11Copyright 2019 eSilicon

A Better Way of Building AI chips

HBM

2e

HBM

2e

C2CPCIE

HBM

2e

HBM

2eC2C

20x20 AI

cores

HBM

2e

HBM

2e

C2CPCIE

HBM

2e

HBM

2eC2C

20x20 AI

cores

C2C between 2 packages

49.8mm

29

.1m

m

vs. D2D

D2D

D2D

D2D

D2D

D2D28

.8m

m44.59mm

xRAM

SRAM + IO

Compute

Single package

InFO_MSL

1.5x1x

1x

Page 12: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

12Copyright 2019 eSilicon

A Better Way of Building AI chips

HBM2e

HBM2e

C2CPCIE

HBM2e

HBM2e

C2C

20x20 AI cores

49.8mm

29

.1m

m

vs.

48.80mm3

0.1

mm

SRAM + IO

Compute

D2D D2D

D2D D2D

HBM2e

HBM2e

HBM2e

HBM2e

Page 13: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

13Copyright 2019 eSilicon

Metric per die 2.5D 3D + 2.5D

Size @ N7 789mm2 197mm2 x4 x2

SRAM >4Gbit >4Gbit

AIB+/HBI+ bandwidth N/A 12Tbps

LR SERDES @ 56G 32 lanes 32 lanes

Yield 15.7% 68.6%

Total KGD + interposer cost (*) $1,294.6 $673.4

A Better, Cheaper Way of Building AI Chips

(*) Excluding DRAM

Page 14: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

14Copyright 2019 eSilicon

What are the (new) challenges?

From Feasibility to Maturity

Challenge Response

Die bonding technology

(WoW, DoW)

• Xperi’s DBI

• TSMC’s Hybrid Bond

Vertical signal density 2-5um DBI via pitch

Thermal density

Worst case with cold plate:

• Tmax= 92.2ºC

• Thetajc=0.032ºC/W

Combined yieldEstablished DoW KGD assembly flow for

mass production

Die clean & prep on

tape

Diced

wafer

on

frame

P&P from tape

Batch

anneal

Production Ready Flow

Die-on-

wafer

clean

Page 15: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

15Copyright 2019 eSilicon

What are the (new) challenges? (II)

Challenge Response

Wafer probing Temporary probe pads

Power delivery Independent power meshes + TSV

TSV-aware placement & routing

• Place PDN TSV matrix first

• TSV pitch 55um – 388um for 0.5% IR drop

• Bulk of memories on top die

From Feasibility to Maturity

Array pitch (um) x (um) y (um) area of TSV (um2) Open window (um)

1x1 55.56 3 3 9 52.56

2x2 166.67 6 6 36 160.67

3x3 277.78 9 9 81 268.78

4x4 388.89 12 12 144 376.89

Temporary Probe

Pad

DBI

Page 16: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

16Copyright 2019 eSilicon

What are the (new) challenges? (III)

Challenge Response

CTS DLL-based CTS for block-level trees

Multi-die signoff Same as single die, if DLL-based CTS used

From Feasibility to Maturity

Local treesDIE 1

DLL

DLL

Local treesDIE 2

DLL

DLL

Global tree

Die 1

Die 1 Die 2

Page 17: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

17Copyright 2019 eSilicon

What are the (new) challenges? (IV)

Challenge Response

C2C interconnect medium• Interposer: 52x52mm in 2020

• Interposer-less: Xperi’s EOI, TSMC’s InFO-MSL

Warpage• Stiffener ring-based design; advanced materials;

dummy dies

Supply chain readiness

• Supply chain enabled

• In mass production today for image sensors

• First SoC commercial products hitting the market now

From Feasibility to Maturity

Page 18: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

18Copyright 2019 eSilicon

Words from a techno-mason

• Hyperscale dominates the TCO discussion

• AI is all about efficiency

• Efficiency is all about ASIC

• Powerful new enabling technologies are coming of age

• 3D IC

• New memory technologies

• Memory overlays

• AI-specific IP

• Key to a reduced TCO

Conclusions

Page 19: Counterintuitive Intelligence: Doing More to Cost Less A TCO … · 2019-09-23 · Title: eSilicon-template-new-logo-16x9-format-20150902 Author: Sally Slemons Created Date: 9/23/2019

19

Collaborate. Differentiate. Win.