AI-Enabled Agile IC Design and Manufacturing

AI-Enabled Agile IC Design

and Manufacturing

David Z. Pan

The University of Texas at Austin

http://www.ece.utexas.edu/~dpan

EDPS, 10/04/2019

Nvidia Xaiver

9B transistors

Divide a chip into small partitions e.g., 1~2M cells per partition

Turn-around time for 1 iteration of backend flow: 3~6 days for one partition

IC Design/Manufacturing Complexity

What you see (at design) is not (necessarily) what you get (at fab)

System Specification

Architectural Design

Functional/Logic Design

Logic Synthesis

Physical Design

Mask Synthesis& Verification

Fabrication

Packaging and Testing

Floorplanning

Placement

Clock Tree Synthesis

Signal Routing

Design Closure

DFM Closure

IC Design/Manufacturing Flow

Test Patterns

Lithography Experiment

Lithography Modeling

Layout/Mask

Mask Optimization Recipe

SRAF & OPC

Lithography Rule Check

Process Modeling

Mask Optimization

Nanometer Design/Manufacturing Challenges

⧫Performance/Power/Area PPA

⧫Manufacturability/Yield

⧫Reliability

⧫Security

⧫Turn-around-time

⧫……

⧫2018 DARPA ERI ($1.5B)

IDEA/POSH “Hardware

Compiler” ($100M)

A. Olofsson, DARPA, ISPD-2018 Keynote

DARPA Grand Challenge: No-human-in-the-loop, 24-hour turn around time!

How AI (ML/DL) Can Help?

⧫ Lots of work for various stages of physical design and DFM

⧫ For example on lithography hotspot detection

› Our work [Ding+, ICICDT 2009 BPA] among the first

to use ML (SVM) for litho-hotspot detection

➢ Very active research in last 10 years, ICCAD 2012 CAD Contest

➢ Meta-classification combining ML and PM [Ding+, ASPDAC’12 BPA]

➢ Deep neural network [Yang+, DAC’17]

➢ Big data vs. small data; transfer/active/semi-supervised learning [Lin+,

ISPD’18], [Chen+, ASPDAC’19], Litho-GPA [Ye+, DATE 2019]…

⧫ML and PD Tutorial in July 2019 (ACM/IEEE Seasonal School)

› http://yibolin.com/publications/tutorials/PDSeasonableSchool_ML4PD.pdf

⧫ My talk today will cover some recent ideas/results for discussion5

DREAMPlace: Deep Learning Toolkit-Enabled

GPU Acceleration for Modern VLSI Placement

[Lin+, DAC’19, Best Paper Award]

Source code release: https://github.com/limbo018/DREAMPlace

Typical Nonlinear Placement Algorithm

Objective of nonlinear placement

Wirelength Density

• Many papers on how to

model WL, density,

parameter tuning, etc.

• Huge development

effort on a high-quality

placement engine (e.g.,

> 1 year for RePlAce)

• CPU>3h to get good

quality placement of

10M-cell design

• Clustering/acceleration

limited ➔ quality

degradation

What is your Dream Placement Engine?

✓ Best quality: wirelength ➔

congestion, timing, power, …

✓ Ultrafast: placement is at the

center of entire design flow ➔

faster design turn-around-time

✓ Low development overhead: ➔

from 1 year to a month or two?

✓ Extensible: easy to try new

algorithms and acceleration

techniques

10M-cell design

finishes within

Advances in Deep Learning Hardware/Software

[Courtesy Nvidia]

Over 60x speedup in neural

network training since 2013Deep learning toolkits

DREAMPlace Strategies

⧫We propose a novel analogy by casting the nonlinear

placement optimization into a neural network training problem

⧫Greatly leverage deep learning hardware (GPU) and software

toolkit (e.g., PyTorch)

⧫Enable ultra-high parallelism and acceleration while getting

state-of-the-art results

Train a neural network Solve a placement

Analogy Between NN Training and Placement

DREAMPlace Architecture & Overall Flow

Leverage mature/highly optimized deep learning toolkit

Nonlinear

Optimizer

Automatic

Gradient

C++/CUDA

Python

Placement

Nonlinear

Optimizer

Automatic

Gradient

Density

C++/CUDA

Python

DREAMPlace architecture

Random Initial Placement

WL Grad Density Grad

Optimization & Update

Converge?

Legalization

NTUplace DP

DREAMPlace flow

Global Placement Result Comparison

RePlAce [Cheng+, TCAD’18]• CPU: 24-core 3GHz Intel Xeon

• 64GB memory allocated

• Current state-of-the-art

DREAMPlace [Lin+, DAC’19]• CPU: Intel E5-2698 v4 @2.20GHz

• GPU: 1 NVIDIA Tesla V100

• Single CPU thread was used

34× speedup by DREAMPlace 43× speedup by DREAMPlace

ISPD 2005 Benchmarks

200K~2M cells

Industrial Benchmarks

1M~10M cells

10M-cell design

finishes within 5min,

instead of 3+ hrs

Same placement

quality of results!

DREAMPlace Open-Sourced

⧫ Leverage AI hardware and software development recently

⧫ Decouple core algorithm innovations with implementation

› Algorithm innovation written in high-level language, e.g. Python

› Highly extensible: new solver options, new design objectives, …

› Implementations just focus on certain low-level kernel OPs as needed

⧫ Development effort: 1 year ➔ 2 months

⧫ The paradigm can be extended to other DA areas

› Significantly enhance IC design productivity and quality

MAGICAL: Machine Generated Analog IC Layout

Open source MAGICAL 0.2

https://github.com/magical-eda/MAGICAL

Analog IC Layout

⧫ DREAMPlace focus on digital IC

⧫ Analog IC to interface with outside world

⧫ Analog IC layout design still mostly manual

› Very tedious and error-prone

› Prior DA not successful as that in digital IC

• Our mission is to develop a full-automated analog layout

system, leveraging recent AI advancement

• Project started in 08/2018

• [ISPD’19; DAC’19; ICCAD’19; ASPDAC’20]

MAGICAL Layout System Framework

⧫ Input: unannotated netlist

⧫ Output: GDSII Layout

⧫ Key Components:

› Constraint Extraction

› Device Generation

› Placement

› Routing

⧫ Fully-automated (no-human-in-the-loop)

⧫ Guided by analytical, heuristic, and machine learning algorithms (not

everything is machine learning or deep learning!)

MAGICAL

INPUTS

Circuit Netlist

Design Rules

DEVICE

GENERATOR

Parametric Instances

PLACER

Analytical Placement

ROUTER

Multi-pin A* SearchPost-Placement

Optimization

Deep Learning

VALIDATION

Calibre®

DRC/LVS/PEX

MAGICAL

OUTPUT

GDSII Layout

MAGICAL

LAYOUT CONSTRAINT EXTRACTOR

Pattern Matching + Small Signal Analysis EVALUATION

Cadence® Virtuoso®

MAGICAL Preliminary Results

⧫ A comparator design in 40nm TSMC

MAGICAL Layout

Manual Layout

INP INN

CLK CLK

OUTP OUTN

Manual MAGICAL

Power (uW) 16.8 18.7

Output Delay (ps) 150 152

Input-referred Noise (uVrms) 380 334

Input-referred Offset (mV) 0.15 0.50

Manual Layout (taped out)

Post extraction simulation results

Days Seconds

MAGICAL Preliminary Results

⧫ A 2-stage miller-compensated OTA design in 40nm TSMC

MAGICAL LayoutManual Layout

VBNVBN VBNCMFB

VIP VIN

VOP VON

VCMVOPVON

VBP VBP

Gain PathBias Common-mode Feedback

Manual MAGICAL

DC Gain (dB) 37.7 38.0

Unity-gain Bandwidth (MHz) 110 107.5

Phase Margin (degree) 67.8 62.3

Input-referred Noise (uVrms) 219 221.5

CMRR (dB) 103 92.5

Input-referred Offset (mV) 0.2 0.48

Post extraction simulation results

LithoGAN: End-to-End Lithography Modeling

with Generative Adversarial Networks

[Ye+, DAC’19 BPA Candidate]

Harder question cf. lithography hotspot detection: Without going

through litho-simulations, can we directly get printed images?

GAN and CGAN

⧫ Generative Adversarial Network (GAN) [Goodfellow et al, 2014]

› Two neural networks contest (Generator and Discriminator)

› Produce images similar to those in the training data set

⧫ Conditional GAN (CGAN) can take a picture in one domain and

translate it to another one [Isola et al, CVPR’17]

Image Translation for Litho Modeling

⧫ Resist pattern zoomed in for

high-resolution/accuracy 22

Encode into

channels

Expensive Litho Simulation

This is now a

(modified) image

translation task

[Ye+, DAC’19]

⧫ Different elements encoded

on different image channels

CGAN for Lithography Modeling

⧫ CGAN architecture

› Generator G generates a fake resist pattern G(x) with input mask pattern x

› Discriminator D needs to classify the image pair (x, G(x)) as fake and to

predict the image pair (x, y) as real

Sample pairs for training CGAN

SRAF Contact after OPC

[Ye+, DAC’19]

Overall LithoGAN Algorithm

⧫ Dual learning framework

› CGAN to predict the shape of the resist pattern

› CNN to predict the resist shape center

GeneratedImage

PredictedCenter

(ch , cv )

Pre-adjustment

Post-adjustment (Final)

CGAN Output

Prediction

Ground truth

[Ye+, DAC’19]

LithoGAN Results

Input Epoch 1 Epoch 3 Epoch 5 Epoch 7 Epoch 15 Epoch 27 Epoch 50 Epoch 80

Input LithoGAN output Input LithoGAN output

Model advancement progress

LithoGAN is 1800x faster

than rigorous simulations,

with acceptable error (in

consultation with industry)

[Ye+, DAC’19]

⧫ Sub-Resolution Assist Feature Generation using Conditional

Generative Adversarial Networks

⧫ Directly generate sub-resolution assist feature (SRAF)

⧫ 144x faster than model based approach with similar QoR

GAN-SRAF [Alawieh+, DAC’19]

⧫ To bridge design and manufacturing, we

propose Lithography Aware Physical

Design (LAPD) ➔

› Litho Hotspot Detection

› Litho Hotspot Correction

⧫ My group has made many key

contributions in LAPD ⬆️

⧫ LithoGAN opens new directions with

tremendous potential

Another LAPD

Correction

Detection

Design/Manufacturing for Hardware Security

⧫ Global IC supply chain of design, manufacture, test, package...

Image source: https://depositphotos.com/2801291/stock-illustration-gray-detailed-world-map.html

Design

Assembly

Customer

Design/Manufacturing for Hardware Security

⧫ Arm race between attacking and protection

⧫ Hardware IP reverse engineering using learning techniques

⧫ Intelligent IC camouflaging [Li+, ICCAD’16, TCAD’17, HOST’17 BPA]

⧫ Former PhD Meng Li won ACM SRC Grand Finals First Place in 2018

Fabrication Level Cell Level Netlist Level

P-type Substrate

Camouflaging Cells

System Specification

Architectural Design

Functional/Logic Design

Logic Synthesis

Physical Design

Mask Synthesis& Verification

Fabrication

Packaging and Testing

Floorplanning

Placement

Clock Tree Synthesis

Signal Routing

Timing Closure

DFM Closure

To Recap

Test Patterns

Lithography Experiment

Lithography Modeling

Layout/Mask

Mask Optimization Recipe

SRAF & OPC

Lithography Rule Check

Process Modeling

Mask Optimization

40x faster 1800x faster

No Human

in the Loop

MAGICAL140x faster

Conclusion

⧫ Some recent results in AI-enabled agile IC design & DFM

› DREAMPlace

› MAGICAL: GeniusRoute…

› LithoGAN

⧫ Tremendous potentials to leverage both AI hardware and

software advancements

⧫ BIG data, small data, or no (training) data at all (by recasting

problems e.g., placement into DREAMPlace)

⧫ Synergistic AI-IC co-design

Acknowledgment

⧫ Funding support / collaborations from NSF, DARPA, Intel,

Nvidia, Toshiba Memory, Xilinx, Synopsys

⧫ Many students/post-docs who do the real work

⧫ Many collaborators in academia and industry

› Dr. Ren and Dr. Khailany from NVIDIA for DREAMPlace

› Prof. Nan Sun at UT for MAGICAL

› Dr. Nojima et al. from Toshiba Memory on DFM

Thanks!

Q & A?

AI-Enabled Agile IC Design and Manufacturing

Documents

Transcript of AI-Enabled Agile IC Design and Manufacturing

FAST Government Flat, Agile, Streamlined, Tech-enabled Gregory G. Curtin, Ph.D., JD World Economic Forum (WEF) Global Agenda Council on the Future of Government.

0 0,5 1 Tr i p o l i C i t y M a p TRIPOLI PORT MITIGA ......!h IC IC IC IC IC IC IC IC IC IC IC IC ^ 0 0 0 0 IC 0 IC 0 ¿ ^^ 0 í^ ^ ^ ^ 0; 0 0 0; IC ÷ 0 0 ^ ÷÷ ÷ í^ 0 ÷ í

iDSS Enabled Agile Data Migration - Infosys · AGILE DATA MIGRATION Abstract For enterprises, data is a strategic asset and data migration a critical activity. However, the challenges

Modernizing DoD Software Production€¦ · support Agile DevOps software development. Our systems need to be hardware-enabled and software-defined. Software development processes

SDN-Enabled Dynamic Feedback Control and Sensing in Agile ...€¦ · enhanced by the sensing results. Real-Time Control Algorithms: investigated various dynamic resource allocation

Agile Enabled Product Innovation

AGILE USER-CENTERED DESIGN OF AN IOT-ENABLED …AGILE USER-CENTERED DESIGN OF AN IOT-ENABLED GAMIFIED INTERVENTION FOR ENERGY CONSERVATION AT THE WORKPLACE 3 its consequences (Lo et

1 Integrating ChemAxon and Linguamatics to provide Agile, Chemistry-enabled Text Mining Dr Jeffrey L. Nauss Application Specialist, Linguamatics ChemAxon.

Instructional design enabled Agile Method using ADDIE ...

Better Decisions With Big Data Enabled Applicationsweb.idg.no › app › web › online › event › cioforum › 2012 › bd › emc.pdf · The Journey To Big Data Agile Analytics

GenerAtinG SUPPLY CHAin iMPACt€¦ · An agile and practical approach to analytics-driven transformation rests on four tenets: 1. Reimagine analytically enabled business processes

Agile Estimation – What are We Really Looking For? · 2019-11-04 · Agile Estimation – ... Introduction • Agile development practices have enabled organizations to deliver

Agile Integration for Today’s Cloud-Enabled Enterprise · IDC Infobrief | Agile Integration for Today’s Cloud-Enabled Enterprise Sponsored by Red Hat | Page 8 Where Organizations

Cloud-enabled telco opportunitiespreview.thenewsmarket.com/.../282072_v2.pdf · by an agile and service-oriented operating model, could carve out a differentiated and attractive offering

The Role Of The Engineering Clinic In Promoting An Agile ... · The Role of the Engineering Clin ic in Promoting an Agile ECE Learning Environment Abstract ... structure of an ECE

Evaluation Board: Synchronous Boost DC/DC Converter · the IC is enabled, while moving the switch to the upper position (EN→GND) disables the IC. 3. Connect the power supply’s

HOW AI IS TRANSFORMING BUSINESS SERVICES · 2018-06-12 · TRANSFORMING BUSINESS SERVICES June 6, 2018. ... Business outcomes Controls Enabled business Agile platforms Analytics Pricing

Agile Everywhere - Crisp · Agile Product Development! Agile Procurement! Agile Contracting! Agile HR! Agile Leadership! Agile Scaling! Agile Portfolio Management! Agile Hardware!

IC-A25N IC-A25NE IC-A24 IC-A24E IC-A14 IC-A15 IC-A25C IC ...IC-A25N IC-A25C Simple VHF VOR/COM Transceiver 700 mW Loud Audio with a Compact Body IC-A14 IC-A14S Full keypad for EUR/AUS

Photonic IC Enabled Coherent Optical Systems - …warner/Bufs/NeoPhotonics_White_Paper.pdf · NeoPhotonics White Paper 1 Photonic IC Enabled Coherent Optical Systems White Paper