Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2...

32
Opteron and AMD64 A Commodity 64 bit x86 SOC Fred Weber Vice President and CTO Computation Products Group Advanced Micro Devices

Transcript of Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2...

Page 1: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

Opteron and AMD64A Commodity 64 bit x86 SOC

Fred Weber

Vice President and CTO

Computation Products Group Advanced Micro Devices

Page 2: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 2

Opteron/AMD64 Launch – Today!

•Official Launch of AMD64 architecture and Production Server/Workstation CPUs–Series 200 (2P) available today–Series 800 (4P+) available later in Q2

•Oracle, IBM-DB2, Microsoft, RedHat, SuSe software support–And many others

•Dozens of server system vendors–System builder availability this quarter–IBM systems available 3Q03

•Lots of public benchmarks

Page 3: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 3

Before AMD64: Computing & infrastructure islands on either side of the wall

Yesterday’s environment isolates Yesterday’s environment isolates 3232--bit and 64bit and 64--bit computing into bit computing into incompatible islands.incompatible islands.

Requires new infrastructure – cooling, power, enclosures, etc.

Requires new software, since x86 applications are incompatible or only run in “emulation mode”

Steep learning curve for end user and support staff – lowering ROI, increasing TCO

Wastes significant people-hours of work and billions of dollars in research and development

32-Bit NativeOnly

System

64-BitNative

OnlySystem

32-bit Application

32-bit O/S

32-bit Drivers

64-bit Application

64-bit O/S

64-bit Drivers

Platform A Platform B

Page 4: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 4

AMD’s Industry Vision:Compatible systems that bridge from 32- to 64-bit

AMD: Single PlatformAMD: Single Platform

32-BitApplication

(3 GB limit)

32-BitO/S

32-BitSoftwareDrivers

32-BitApplication

(4GB limit)

64-BitApplication

64-BitO/S

64-BitSoftwareDrivers

64-BitSoftwareDrivers

64-BitO/S

Leverages existing infrastructureLeverages existing infrastructure– thermal, enclosures, power, and

BIOSRuns existing Runs existing 3232--bit applications bit applications nativelynatively with unsurpassed with unsurpassed performanceperformance– >20% increase clock-for-clock

compared to AMP Athlon™ processor

– No tools or O/S work neededRuns existing 32Runs existing 32--bit applications on bit applications on 6464--bit O/Sbit O/S– Take full advantage of 4GB local

memoryAllows customers to migrate to 64Allows customers to migrate to 64--bit performance according to their bit performance according to their schedulescheduleLow learning curve for users and Low learning curve for users and support staffsupport staff

Page 5: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 5

AMD64 Programmer’s Model

RAX

63

GGPPRR

xx8877

079

31

AHEAX AL

0715In x86

XMM0SSSSEE

127 0

XMM7

EAX

EIP

Added by AMD64

EDI

XMM8XMM8

XMM15

R8

R15

Page 6: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 6

Opteron SOC Architecture Overview

•First AMD64 based processor

•Aggressive out-of-order, 9-issue superscalar processor

•Integrated DDR memory controller

•Leading performance in integer, floating point and multimedia–AMD64, x87, MMX™, 3DNow!™, SSE,

SSE2

•Glueless multiprocessing through HyperTransport

•Expandable IO through HyperTransport

L2Cache

L1Instruct.Cache

L1Data

Cache

OpteronProcessor

Core

HyperTransport™technology

DDR MemoryController

Opteron Architecture

Page 7: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 7

AMD Opteron™ ProcessorTechnology Overview

• Processor Core Overview– Support for AMD’s 64-bit technology– 12-stage int, 17-stage fp pipelines– Enhanced TLB structures– TLB flush filter– Enhanced branch prediction– Large L2 cache (up to 1MB)– ECC protection

• Memory Controller Overview– Dual-channel DDR memory– PC2700, PC2100, or PC1600 DDR memory support– Registered or Unbuffered DIMMs– ECC and Chip Kill– High bandwidth (up to 6.4GB/s)

• HyperTransport™ Technology Overview– One, two, or three links– 2, 4, 8, 16, or 32-bits full duplex– Up to 6.4 GB/s bandwidth per link– 19.2 GB/s aggregate external bandwidth

HT = HyperTransport™ technology

XBAR

HT

HT

HT

MCTCPU

SRQ

DRAM

Page 8: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 8

AMD Opteron™ processor-based 2P Server

"Glueless Multiprocessing"

• No chipset logic needed to connect processors

• HyperTransport™ technology links with ~6.4GBytes/sec bandwidth

• Memory BW and capacity and I/O capacity designed to grow with # CPUs

8x8 HyperTransport @ 400MT/s

PCI

AMD Opteron940 mPGA

AMD Opteron940 mPGA

200-333MHz144-Bit

Reg DDR

16x16 coherent HyperTransport

@ 1600 MT/s

AMD-8111™Southbridge

AMD-8111™Southbridge

SIO

FLASHLPC

32bits @33Mhz

USBUDMA100

10/100 Phy 100 BaseT (RJ45)

CODEC Audio, ModemAC’97

16x16 HyperTransport™ @ 1600 MT/s

PCI-XAMD-8131™

HyperTransport-PCI-X

AMD-8131™HyperTransport-

PCI-X64bits @100Mhz

200-333MHz144-Bit

Reg DDRAMD Opteron

940 mPGA

AMD Opteron940 mPGA

Page 9: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 9

AMD Opteron™ processor-based 4P Server

200-333MHz144-Bit Reg DDR

200-333MHz144-Bit Reg DDR

200-333MHz144-Bit Reg DDR

AMD Opteron940 mPGA

AMD Opteron940 mPGA

AMD Opteron940 mPGA

AMD Opteron940 mPGA

AMD Opteron940 mPGA

AMD Opteron940 mPGA

AMD Opteron940 mPGA

AMD Opteron940 mPGA

AMD-8111™Southbridge

AMD-8111™Southbridge FLASH

LPC

USBAC97UDMA10010/100 Ethernet

Legacy PCI

100 BaseTManagement LAN

Zircon BMC SIO

PCIGraphics

VGA

AMD-8131™PCI-X

AMD-8131™PCI-X

64-bits @133Mhz

64-bits @133MhzPCI-XHot Plug

PCI-XHot Plug

AMD-8131PCI-X

AMD-8131PCI-X

64-bits @66Mhz

64-bits @66Mhz Gbit Ethernet

1000 BaseT

Gbit Ethernet1000 BaseT

PCI-X PCI-X U320SCSI

200-333MHz144-Bit Reg DDR

16x16 HyperTransport@ 1600MT/s

8x8 HyperTransport @ 1600MT/s

16x16 coherent HyperTransportTM @ 1600MT/s

8x8 HyperTransport @ 400MT/s

Page 10: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 10

4P, 32GB AMD Opteron™ Processor System

Page 11: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 11

4U, 4P AMD Opteron™ processor System

Page 12: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 12

AMD64 Code Quality

•GCC port alpha quality since Feb ‘01–Compiler generating alpha quality code in 50 man-months–Linux kernel ported in 60 man-months–Tool chain was straightforward port

•SpecInt2000 code quality, 64bits vs. 32 bits (using GCC 3.1.1)–average instruction length increased to 3.8 from 3.4 bytes –dynamic instruction count decreased by 10%–dynamic load count decreased by 26%

• number of loads forwarded from recent stores substantially reduced

–dynamic store count decreased by 36%–back to back register dependencies decreased by 10%

Page 13: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 13

GCC SPECint

80.0%

90.0%

100.0%

110.0%

120.0%

130.0%

140.0%

150.0%

160.0%16

4.gz

ip

175.

vpr

176.

gcc

181.

mcf

186.

craf

ty

197.

pars

er

252.

eon

253.

perlb

mk

254.

gap

255.

vorte

x

256.

bzip

2

300.

twol

f

SPEC

int2

000

64/32

64/64

Page 14: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 14

FORTRAN Compiler Support

–AMD and STMicroelectronics are working together to bring The Portland Group Compiler Technology to AMD64• Support will include

– F90 & F77• Some F95 extensions also included• SPECcpu2000 explicitly supported

– Optimized 32-bit and 64-bit code generation– Linux and Windows– OpenMP support– Full debugging support

• STMicro will also be developing C and C++ compilers based on same code generation technology

• Beta now, Production quality in 1H03

Page 15: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 15

The Rewards of Good Plumbing

•High Bandwidth– 2P system is designed to achieve 7 GB/s aggregate memory Read

bandwidth– 4P system is designed to achieve 10 GB/s aggregate memory Read

bandwidth• With data spread uniformly across the nodes

•Low Latency–Average 2P unloaded latency (page hit) is designed to be < 120 ns–Average 4P unloaded latency (page hit) is designed to be < 140 ns–Latency under load increases slowly due to excess Interconnect

Bandwidth–Latency shrinks quickly with increasing CPU clock speed and

HyperTransport link speed

Page 16: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 16

Integrated Memory ControllerLatency (Local Memory Access, Registered Memory, CAS2)

Read Latency Accessing Local Memory, PC2100

0

20

40

60

80

100

120

140

160

180

200

800 1000 1200 1400 1600 1800 2000 2200 2400 2600 2800 3000 3200 3400 3600 3800 4000

Frequency (MHz)

Late

ncy

(ns) PageHit 0 Hop

PageMiss 0 HopPrb Miss 1 Hop (2 node case)Prb Miss 2 Hop (4 node case)

Page 17: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 17

Memory Bandwidth

2000

3000

4000

5000

6000

7000

8000

9000

10000

Opt eron 844 8xPC2700CL2.5

Opt eron 244 4xPC2700CL2.5

Opt eron 144 2xPC2700CL2.5

Int el 865 2xPC3200 CL2DDR

Int el 850E 2xPC1066 CL2RDRAM

Int el 7205 2xPC2100 CL2DDR

Int el 845PE PC2700 CL2DDR

RAM Bandwidt h Int Buf f iSSE2 (MB/ s) RAM Bandwidt h Fp Buf f iSSE2 (MB/ s)

Page 18: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 18

SPECint®_rate2000 Performance(Peak, 2P)

15.5

19.6

21.2

24

26.8

Itanium 2 900MHz

Xeon 2.8GHz

AMD Opteron 240

AMD Opteron 242

AMD Opteron™ 244

SPECint®_peak2000 Performance(Uniprocessor)

719

1130

1170

Itanium 21.0GHz

Xeon3.06GHz

AMDOpteron™

144

© 2003 Advanced Micro Devices. AMD, the AMD Arrow Logo, AMD Opteron and any combinations thereof are trademarks of Advanced Micro Devices. Microsoft and Windows are trademarks of Microsoft Corporation. SPEC and the benchmark named SPECint are registered trademarks of the Standard Performance Evaluation Corporation. Competitive benchmark results stated above reflect results published on www.spec.org as of Apr 14, 2003. SPEC benchmark scores for AMD Opteron processor-based systems are under submission to the SPEC organization. For complete configuration information visit www.spec.org.

www.amd.com/opteronperformance

Integer Performance

104%

164%

100%

100%

79%

108%

122%

137%

Page 19: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 19

SPECfp®_rate2000 Performance(Peak, 2P)

14.7

22.7

25.1

26.7

30.7

Xeon 2.8GHz

AMD Opteron 240

AMD Opteron 242

AMD Opteron™ 244

Itanium 2 1.0GHz

SPECfp®_peak2000 Performance(Uniprocessor)

1103

1219

1431

Xeon3.06GHz

AMDOpteron™

144

Itanium 21.0GHz

Floating-Point Performance

111%

130%

100%100%

209%

150%

171%

182%

www.amd.com/opteronperformance

© 2003 Advanced Micro Devices. AMD, the AMD Arrow Logo, AMD Opteron and any combinations thereof are trademarks of Advanced Micro Devices. Microsoft and Windows are trademarks of Microsoft Corporation. SPEC and the benchmark named SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. Competitive benchmark results stated above reflect results published on www.spec.org as of Apr 14, 2003. SPEC benchmark scores for AMD Opteron processor-based systems are under submission to the SPEC organization. For complete configuration information visit www.spec.org.

Page 20: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 20

SPECfp®_rate2000 Performance and Scalability (Peak, 2-4P scaling)

13.8

20.2

26.7

49.2

30.7

49.3

Xeon MP2.0GHz (2P)

Xeon MP2.0GHz (4P)

AMD Opteron244 (2P)

AMD Opteron™844 (4P)

Itanium 21.0GHz (2P)

Itanium 21.0GHz (4P)

SPECfp®_rate2000 Performance(Peak, 4P)

20.2

40.7

45

49.2

49.3

Xeon 2.0GHz

AMD Opteron 840

AMD Opteron 842

AMD Opteron™ 844

Itanium 2 1.0GHz

Floating-Point Performance

201%

244%

100%

184%

100%

www.amd.com/opteronperformance

© 2003 Advanced Micro Devices. AMD, the AMD Arrow Logo, AMD Opteron and any combinations thereof are trademarks of Advanced Micro Devices. Microsoft and Windows are trademarks of Microsoft Corporation. SPEC and the benchmark named SPECfp are registered trademarks of the Standard Performance Evaluation Corporation. Competitive benchmark results stated above reflect results published on www.spec.org as of Apr 14, 2003. SPEC benchmark scores for AMD Opteron processor-based systems are under submission to the SPEC organization. For complete configuration information visit www.spec.org.

244%

223%

100%

146%

161%

100%

Page 21: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 21

SPECweb®99 Performance(4P Servers, Red Hat CA2)

6700

8800

10135

9396

Itanium 2

Xeon MP2.0 GHz

AMD Opteron 840

AMD Opteron 842

AMD Opteron™ 844

SPECweb®99 Performance(2P Servers, Red Hat CA2)

5181

5373

5800

6250

Itanium 2

AMD Opteron 240

Xeon3.06 GHz

AMD Opteron 242

AMD Opteron™ 244

Web Server Performance

96% 100%

131%

140%

151%

100%

108%

116%

www.amd.com/opteronperformance

© 2003 Advanced Micro Devices. AMD, the AMD Arrow Logo, AMD Opteron and any combinations thereof are trademarks of Advanced Micro Devices. SPEC and the benchmark named SPECweb are registered trademarks of the Standard Performance Evaluation Corporation. Competitive benchmark results stated above reflect results published on www.spec.org as of Apr 14, 2003. SPEC benchmark scores for AMD Opteron processor-based systems are under submission to the SPEC organization. For complete configuration information visit www.spec.org.

N/A N/A

Page 22: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 22

SPECweb®99_ssl Performance(2P Servers)

1149

1750

1760

1783

Xeon 2.8GHz(32-bit app/32-bit OS)

Itanium 2 1.5GHz(64-bit app/64-bit OS)

AMD Opteron 244(32-bit app/32-bit OS)

AMD Opteron™ 244(64-bit app/64-bit OS)

SPECweb®99_ssl Performance(4P Servers)

1643

3270

3344

3498

Xeon MP 2.0 GHz(32-bit app/32-bit OS)

AMD Opteron 844(32-bit app/32-bit OS)

Itanium 2 1.5 GHz(64-bit app/64-bit OS)

AMD Opteron™ 844(64-bit app/64-bit OS)

Secure Web Server Performance

100%

153%

155%

100%

199%

213%

204%

152%

www.amd.com/opteronperformance

© 2003 Advanced Micro Devices. AMD, the AMD Arrow Logo, AMD Opteron and any combinations thereof are trademarks of Advanced Micro Devices. SPEC and the benchmark named SPECweb are registered trademarks of the Standard Performance Evaluation Corporation. Competitive benchmark results stated above reflect results published on www.spec.org as of Apr 14, 2003. SPEC benchmark scores for AMD Opteron processor-based systems are under submission to the SPEC organization. For complete configuration information visit www.spec.org.

Page 23: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 23

MMB2 Performance(4P Servers, Windows®)

13200

15520

Itanium 2

Xeon MP2.0GHz

AMDOpteron™

844

MMB2 Performance(2P Servers, Windows®)

9800

11000

Itanium 2

Xeon2.8GHz

AMDOpteron™

244

Email Server Performance

100%

118%

100%

112%

www.amd.com/opteronperformance

© 2003 Advanced Micro Devices. AMD, the AMD Arrow Logo, AMD Opteron and any combinations thereof are trademarks of Advanced Micro Devices. Microsoft and Windows are trademarks of Microsoft Corporation. For full MMB2 results visit http://www.microsoft.com/exchange/techinfo/planning/2000/PerfScal.asp.

N/A N/A

Page 24: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 24

TPC-C Price/Performance(4P Servers, $/tpmC, Windows®)

$5.32

$5.10

$5.03

$4.31

$2.76

Xeon MP 2.0GHz(HP DL580-G2)

Xeon MP 2.0GHz(Dell PE6650)

Itanium 2 1.0GHz(HP rx5670)

Xeon MP 2.0GHz(IBM xSeries 360)

AMD Opteron™ 844(RackSaver QuatreX-64)

TPC-C Performance(4P Servers, tpmC, Windows®)

52587.46

71586.49

77905

82226.46

87741

Xeon MP 2.0GHz(IBM xSeries 360)

Xeon MP 2.0GHz(Dell PE6650)

Xeon MP 2.0GHz(HP DL580-G2)

AMD Opteron™ 844(RackSaver QuatreX-64)

Itanium 2 1.0GHz(HP rx5670)

Database Server Performance

106%

100%

www.amd.com/opteronperformance

52%

100%

113%

95%

96%

81%

68%

92%

© 2003 Advanced Micro Devices. AMD, the AMD Arrow Logo, AMD Opteron and any combinations thereof are trademarks of Advanced Micro Devices. Microsoft and Windows are trademarks of Microsoft Corporation. TPC-C data is current as of 4/14/03 and includes previously published TPC results. TPC-C data obtained from publicly available information and is subject to change without notice. For more information visit www.tpc.org.

Page 25: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 25

Linpack – Hot off the press

AMD Opteron™ system # P

Rmax(GFlops

)Nmax

(order)N1/2

(order)Rpeak

(GFlops)

GFLOPs/Pr

oc

Rmax Gflops / Cycle

RPEAK/# Procs

Peak Gflops / Cycle

Rmax / Peak

4P Melody Opteron 1.8GHz 2GB/proc PC2700 8GB Total 4 11.99 28000 14.4 3.00 1.665 3.60 2.00 83.3%2P Melody Opteron 1.8GHz 2GB/proc PC2700 4GB Total 2 6.009 19320 616 7.2 3.00 1.669 3.60 2.00 83.5%1P Melody Opteron 1.8GHz 2GB PC2700 1 3.042 14000 3.6 3.04 1.690 3.60 2.00 84.5%

Page 26: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 26

CPU Design Clusters – From RISC to AMD64

• K6 was built entirely on Sparc, PA-RISC and Power machines

•K7/Athlon was built 50% on K6 running Linux• Few apps. Mostly only ran in house logic simulators

–K8/Opteron was built 80% on K7 running Linux• Many apps available. Only 64 bit apps conspicuously missing

•Hardware• Over 3000 Athlon CPUs doing back-end CAD work in California and Austin• Over 1500 Athlon CPUs doing front-end design world-wide• Non-AMD machines are used only for applications which require more

memory than x86 is capable of addressing

–Software• Predominantly Linux based• Transitioning away from non-x86 based Unix (Solaris, HP-UX, etc.)• 64-bit software is run on non-AMD machines

Page 27: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 27

K9 Tapeout Plan

•K9 will be taped out using only AMD Opteron Processors

• Hardware• Create a homogenous compute environment • Anticipate over 8000 AMD Opteron/Athlon CPUs doing

back-end CAD work in Sunnyvale and Austin• Anticipate over 2000 AMD Opteron/Athlon CPUs doing

front-end design world-wide• AMD will not use any non-AMD 32-bit or 64-bit hardware

–Software• 100% Linux/LSF based throughput cluster• 32-bit and 64-bit applications running side by side• Large memory applications will scale well on Opteron – 4P = 16-32 GB of

RAM

Page 28: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 28

The AMD64/Opteron Story

•The right instruction set–Excellent compatability–Excellent performance future

•The right system architecture–Great memory and IO capacity and bandwidth–Great memory latency–Simple “lego” system configuration

•A strong ecosystem of commodity HW and SW–Support chips, Software tools, motherboards

•Millions of 64 bit CPUs in 03

•10s of millions of 64 bit CPUs in 04

Page 29: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 29

Opteron Implications

•Allow more balanced scale-up/scale-out future–Remove 2P/4P cost barrier–And eventualy 8P, 16P

•Re-create the workstation–Constrained by 32 bit x86 on one side and slow RISC processors

w/o desktop software on the other–2P, 16GB, 64 bit Workstation that runs Outlook, Powerpoint and

Unreal Tournament• 64 bit portables in 04

•X86 forever (sorry ☺)

Page 30: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 30

Futures

•Moore’s law continues through the decade (and beyond)–90nm, 65nm, 45nm, 30nm–1 Billion transistors, 4 Billion transistors–Vertical integration

• It will come, first for memory• Gigabyte on a die goes a long way to help memory wall

•Power is the biggest issue–Cache, Evaporators ☺–Metal gate, FinFet, Adiabatic clocks, etc

•CMP is good (and obvious)

•Threading is a mixed bag–Latency tolerance vs. Ahmdal’s law and synchronization overhead

• Long history–Certainly not for execution unit utilization

Page 31: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 31

Futures

•Communication barrier–More fundamental than memory barrier–Even the speed of light doesn’t help (much)–3D helps a lot

•Single Chip Performance (a guess)–2003 5 +/- 1 Gflop Opteron, P4, iTanium2–2005 12 Gflop 2 * 6GHz–2006 24 Gflop 2P * 2 * 6GHz–2007 36-72 Gflop 4 * 9GHz–2008 144 Gflop 4P * 4 * 9GHz

Page 32: Opteron and AMD64 A Commodity 64 bit x86 SOC · 22 April 2003 AMD - Salishan HPC 2003 2 Opteron/AMD64 Launch – Today! •Official Launch of AMD64 architecture and Production Server/Workstation

22 April 2003 AMD - Salishan HPC 2003 32

What Can You Do To Help

•Killer Apps that drive what you want–Games–Video compression/decompression–Face recognition as a ubiquitous app

•Keep the faith on COTS