GPU Computing Advances in 3D Electromagnetic Simulation

CST – COMPUTER SIMULATION TECHNOLOGY | www.cst.com | May-12 Session S0069: GPU Computing Advances in 3D Electromagnetic Simulation Andreas Buhr, Alexander Langwost, Fabrizio Zanella CST (Computer Simulation Technology)


Session S0069:

GPU Computing Advances in 3D

Electromagnetic Simulation

Andreas Buhr, Alexander Langwost, Fabrizio Zanella

CST (Computer Simulation Technology)


Abstract

Computer Simulation Technology (CST) has been providing GPU acceleration for its 3D Full Wave electromagnetic field simulation tools for several years. The latest version of CST Studio Suite supports the full range of Tesla products on both Windows and Linux operating systems.

Using GPU, multi-GPU and MPI-GPU Computing drastically reduces the simulation times for CST customers. We will provide a status of current and future GPU developments at CST and share detailed simulation results.


CST Milestones

1992: Foundation of CST; Commercialization of MAFIA (FIT)

1998: CST MICROWAVE STUDIO®, PERFECT BOUNDARY APPROXIMATION (PBA)®

2005: Complete Technology for 3D EM

2011: System Assembly and Modeling (SAM)

CST Market (excluding CST)


CST Worldwide


CST Customers


CST GPU Computing Update

• In 2007, we began supporting NVIDIA GPU acceleration in our main product, the MWS Time Domain Solver (FIT).
• Between 2008 and 2010, we added multi-GPU and MPI+GPU support for the FIT solver.
• In 2012, we added GPU acceleration support for three other solvers:
    • Particle in Cell Solver
    • Integral Equation Direct Solver
    • TLM Solver


Particle in Cell Solver: Introduction

• The PIC solver calculates the electromagnetic field by integrating Maxwell’s equations.
• At the same time, it calculates trajectories of particles through the calculated field.
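To make the particle half of this concrete, here is a minimal sketch of one trajectory update. The slides do not say which integrator the CST PIC solver uses; the Boris scheme below is a common choice in PIC codes and is used here only as a stand-in. The function name `boris_push`, the uniform fields, and the normalized units are all assumptions for illustration. In a real solver the field values would come from the Maxwell update on the mesh.

```python
import math


def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def boris_push(x, v, e_field, b_field, q_over_m, dt):
    """Advance one non-relativistic particle by one time step.

    Hypothetical helper: e_field and b_field are the fields sampled at the
    particle position (assumed uniform here for simplicity).
    """
    half = 0.5 * q_over_m * dt
    # First half of the electric acceleration.
    v_minus = tuple(vi + half * ei for vi, ei in zip(v, e_field))
    # Rotation around the magnetic field (preserves |v|).
    t = tuple(half * bi for bi in b_field)
    t_sq = sum(ti * ti for ti in t)
    s = tuple(2.0 * ti / (1.0 + t_sq) for ti in t)
    v_prime = tuple(vm + c for vm, c in zip(v_minus, cross(v_minus, t)))
    v_plus = tuple(vm + c for vm, c in zip(v_minus, cross(v_prime, s)))
    # Second half of the electric acceleration.
    v_new = tuple(vp + half * ei for vp, ei in zip(v_plus, e_field))
    # Position update with the new velocity (leapfrog).
    x_new = tuple(xi + dt * vi for xi, vi in zip(x, v_new))
    return x_new, v_new


if __name__ == "__main__":
    x, v = (0.0, 0.0, 0.0), (1.0, 0.0, 0.0)
    for _ in range(100):
        x, v = boris_push(x, v, (0.0, 0.0, 0.0), (0.0, 0.0, 1.0), 1.0, 0.1)
    # In a pure magnetic field the speed should stay constant.
    print(math.sqrt(sum(c * c for c in v)))
```

Each particle is updated independently of the others, which is one reason this kind of loop is a natural candidate for GPU execution.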


Magnetron Benchmark

Number of Meshcells 1,610,280

Av. Particle Number 1.39e4

Time CPU 36h 40m 07s

Time GPU 9h 05m 48s

Total Speed Up 4.03
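The quoted speedup can be checked directly from the two wall-clock times on this slide. The helper name `hms_to_seconds` is made up for this sketch; only the two time strings come from the slide.

```python
import re


def hms_to_seconds(text):
    """Convert a string like '36h 40m 07s' to seconds."""
    match = re.fullmatch(r"(\d+)h (\d+)m (\d+)s", text)
    if match is None:
        raise ValueError(f"unexpected time format: {text!r}")
    hours, minutes, seconds = (int(part) for part in match.groups())
    return 3600 * hours + 60 * minutes + seconds


cpu_seconds = hms_to_seconds("36h 40m 07s")  # Time CPU from the slide
gpu_seconds = hms_to_seconds("9h 05m 48s")   # Time GPU from the slide
speedup = cpu_seconds / gpu_seconds

print(f"{speedup:.2f}")  # prints 4.03
```

The result, 132007 s / 32748 s, agrees with the total speedup of 4.03 stated above.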


Particle in Cell: Core Benchmarks

[Figure: two plots of speedup versus number of mesh cells (0 to 40 million). Panels: PIC Solver Loop Speedup (speedup axis 0 to 10) and PIC Total Speedup (speedup axis 0 to 7).]

Comparison: nVidia Tesla C2050 vs. 2x Intel XEON E5620 @ 2.4 GHz (8 cores total)
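The total speedup being lower than the solver loop speedup is what one would expect if only the solver loop is accelerated while the rest of the run stays on the CPU. A rough way to reason about this is Amdahl's law. The slides do not give the fraction of run time spent in the loop, so the values below are illustrative assumptions, not numbers read from the plots, and `total_speedup` is a name made up for this sketch.

```python
def total_speedup(loop_fraction, loop_speedup):
    """Amdahl's law: overall speedup when only part of the run is accelerated.

    loop_fraction: share of the original run time spent in the solver loop (0..1).
    loop_speedup:  speedup of the solver loop alone.
    """
    if not 0.0 <= loop_fraction <= 1.0:
        raise ValueError("loop_fraction must be between 0 and 1")
    if loop_speedup <= 0.0:
        raise ValueError("loop_speedup must be positive")
    return 1.0 / ((1.0 - loop_fraction) + loop_fraction / loop_speedup)


# Assumed example: 90% of the run is solver loop, and the loop runs 9x faster.
print(total_speedup(0.9, 9.0))
```

Under these assumed numbers the overall gain is roughly 5x even though the loop itself is 9x faster, because the remaining 10% of the run is not accelerated.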


Integral Equation Solver: Introduction

• Boundary Element Solver
    -> works on surface mesh
• Frequency Domain
• Targeted at electrically large & mid-size structures (≥ 10 λ)
• Generates a complex dense matrix to be factorized
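Because the matrix is dense, its storage grows with the square of the number of unknowns, which quickly becomes the limiting factor on a GPU. The sketch below estimates that footprint. It assumes double-precision complex entries (16 bytes each) and full storage of the matrix; the slides state neither the precision nor the storage scheme, and `dense_matrix_gib` is a made-up helper name.

```python
def dense_matrix_gib(num_unknowns, bytes_per_entry=16):
    """Memory in GiB for a full num_unknowns x num_unknowns matrix.

    bytes_per_entry=16 assumes complex numbers made of two 8-byte floats.
    """
    if num_unknowns < 0:
        raise ValueError("num_unknowns must be non-negative")
    return num_unknowns ** 2 * bytes_per_entry / 2 ** 30


for n in (10_000, 20_000, 40_000):
    # Doubling the number of unknowns quadruples the memory.
    print(n, round(dense_matrix_gib(n), 2), "GiB")
```

This quadratic growth is a plausible reason for distributing the factorization over several GPUs as problem size increases.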


Dielectric Lens in Ku/K-Band

14 - 28 GHz, Lens diameter=60mm

Lens: eps=3, thickness=12.2mm

Surface cells: 16k

SAsw 500

Simulation time with Tesla C2075: 69 min

Memory used: 11 GB


Integral Equation Solver: Core Benchmarks

[Figure: two plots of speedup versus number of unknowns (0 to 70 thousand). Panels: I-Solver Core Speedup (speedup axis 0 to 16) and I-Solver Total Speedup (speedup axis 0 to 4.5). Legend: 1x C2050, 2x C2050, 4x C2050.]

Comparison: nVidia Tesla C2050 vs. 2x Intel XEON E5620 @ 2.4 GHz (8 cores total)


TLM Solver: Introduction

• Broadband Transient Solver
• Special aperture and wire modeling (EMC/EMI)
• Octree localized mesh
• Compact models: vents, seams, slots, shielded cables

[Figure labels: Octree mesh; SCN]


TLM GPU Speedup

• Rectangular waveguide horn antenna with parabolic reflector
• 2 m diameter dish @ 3.5 GHz
• Approximately 23 wavelengths

           2012     2012 GPU
Time       20:09    4:35
Speedup             4.4


Satellite system benchmark

GPU Tesla 2050 solve time: 45 min

8-core Westmere-EX CPU: 180 min (4x speedup)

[Figure labels: shielded TWP cable; overlapping joint compact seam with 24 segments; cable connects to 3D structure]


Time Domain (FIT) Solver

TST

PBA

• Transient, low memory, broadband solution

• Conformal meshing (PBA, TST)

• Support for multi-GPU, MPI+GPU


PCB with USB connector/cable

20M mesh cells

12 Westmere-EX cores: Solver Loop = 7840 s

2x M2070: Solver Loop = 1596 s

Solver Loop Speedup ≈ 5x (7840 s / 1596 s ≈ 4.9)


GPU Computing – Typical Performance

[Figure: speedup of the solver loop versus model size (number of mesh cells), shown against CPU performance. Labeled regions: Small Model, Swapping, Model too large.]

The features which need the largest amount of memory on the GPU are:

• dispersive materials
• lossy metal
• open boundaries


GPU Computing – Typical Performance

[Figure: bar chart of mean solver loop speedup (axis 0 to 25) by configuration: 1 GPU, 2 GPU, 4 GPU and 8 GPU (Tesla 10); 1 GPU and 4 GPU (Tesla 20).]

Mean Speedup of Solver Loop (compared to dual Intel Xeon X5550, fastest memory configuration)


Time Domain Core Benchmarks

Comparison: nVidia Tesla C2050 vs. 2x Intel XEON E5620 @ 2.4 GHz (8 cores total)

[Figure: two plots of speedup versus number of mesh cells (0 to 100 million). Panels: T Solver Core Speedup (speedup axis 0 to 20) and T Solver Total Speedup (speedup axis 0 to 16). Legend: 1x C2050, 2x C2050, 4x C2050.]


Complex Package Benchmark (SI)

50 million mesh cells

Very high complexity

                        CPU (2x Quad Core Intel Xeon E5530, 2.4 GHz)    4x Tesla 10 GPUs    4x Tesla 20 GPUs
Solver Loop Time (s)    39467                                           1662                990
Solver Loop Speedup     1                                               23.7                39.8


Passenger Aircraft Benchmark: MPI+GPU

146M mesh cells, (4) MPI+GPU nodes, (2) Tesla C1060 per node

Total simulation time = 8 hours
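To get a feel for the per-device load in this configuration, the sketch below splits the mesh evenly across all GPUs. The slide does not describe how the mesh is actually partitioned between nodes and GPUs, so the even split and the helper name `cells_per_gpu` are assumptions for illustration only.

```python
def cells_per_gpu(total_cells, nodes, gpus_per_node):
    """Split total_cells as evenly as possible over all GPUs in the cluster.

    Returns one cell count per GPU; any remainder goes to the first GPUs.
    """
    gpu_count = nodes * gpus_per_node
    if gpu_count <= 0:
        raise ValueError("need at least one GPU")
    base, extra = divmod(total_cells, gpu_count)
    return [base + (1 if index < extra else 0) for index in range(gpu_count)]


# Figures from the slide: 146M mesh cells, 4 nodes, 2 Tesla C1060 per node.
shares = cells_per_gpu(146_000_000, nodes=4, gpus_per_node=2)
print(len(shares), shares[0])  # prints 8 18250000
```

Under an even split, each of the 8 GPUs would handle about 18 million mesh cells.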


Conclusion

• Significant performance improvement using GPU Computing
• CST GPU solvers (PIC, TLM, IE, TD) available for various applications
• Support of up to 8 GPUs/host for TD, IE Solvers; Cluster MPI+GPU for TD Solver