KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland...

23
KickStart 2018 David Swanson, PhD Director, Holland Computing Center

Transcript of KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland...

Page 1: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

KickStart 2018David Swanson, PhD

Director, Holland Computing Center

Page 2: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

–HCC 5-year Review, 2016

“HCC operates as a core facility and provides services that are critical to the research productivity

of faculty and students across the University of Nebraska campuses.”

Page 3: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

–David Swanson

“HCC exists to reduce the time to science.”

Page 4: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

HCC is good at …

• Big Data: We manage over 10,000 TeraBytes (10PB)

• Scaled Out Computing: 35,000 cores, OSG, advanced networking, high bandwidth transfers, Globus Online

• Scaled Up Computing: Parallel processing, XSEDE, GPUs

Page 5: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

Recent Study of Data Scientists*

• “Available computing power” was the second most common problem for big data analysis.

• Distributed or parallel processing was the least common solution to their big data needs.

• This could be attributed to the difficulty of processing data on distributed resources.

*K Rexer. Data miner survey-2013 survey summary report. Rexer Analytics, Winchester, 2013.

Page 6: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

HCC has great people• Most are computational scientists

• 5 Full time research personnel

• 7 Sys Admins

• 5 Application Specialists

• Favorite collaborative tools: table with a whiteboard

• Collaboration, not outsourcing outfit

Page 7: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

HCC has great people

• 118 Schorr Center

• 152 Peter Kiewit Institute

• Buffet Cancer Center 5.12.397

[email protected]

• http://hcc.unl.edu

Page 8: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

HCC is free*

• Shared usage of resources: no charge

• Best effort support: no charge

• Dedicated computing, storage or support: up-front fees

*to many users! Paid for by NRI, NSF, and NU researchers

Page 9: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

Storage Options

• Currently 4 types of storage

• /home — backed up, per machine, 25 GB/user, free

• /work — purged, per machine, 50 TB / group, free

• /common — not purged, mounted on all HCC machines

• Attic — near-line archive, not mounted, $25/TB/year

Page 10: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

HCC On Ramp

1. Nebraska Faculty Connection (HCC Group)

2. Log in (HCC account and Duo set up)

3. Moving and editing your data (Globus)

4. Launch jobs! (module, SLURM) or (Jupyter Notebooks)

Page 11: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

Enough about HCC …

• What is your area of study?

• What question do you wish you could answer?

• Do you currently use HCC resources?

• What type of computing (storage/networking/visualization) do you do now?

• Are you experiencing any bottlenecks?

Page 12: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

Parallelism Rules

Page 13: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

248x64 2.3 GHz44x32

2.0 GHz

August2011

Equipment Timeline

August2005

4442.2

GHz(64bit)

Red

January 2002

June2004

2561.4

GHz(32bit)

PrairieFire2562.2

GHz(64bit)

January2012

98x64 2.1 GHz

256 GB RAM

Tusker

August2007

Merritt64

1.3 GHz512 GB RAM

May2009

1151x4 2.6 GHz

Firefly

October2014

October2013

452*16 2.6 GHz

64 GB RAM

Crane

July2013

Firefly

PrairieFireMay2013

Merritt

1 PBNearLineStorage

Attic

76x202.3 GHz

256 GB RAM

Anvil

August2015

June2014

Page 14: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

248x64 2.3 GHz44x32

2.0 GHz

August2011

Equipment Timeline

December2015

(ongoing)

5888cores4 PB

Red

January2012

98x64 2.1 GHz

256 GB RAM

Tusker

Sandhills

76x202.3 GHz

256 GB RAM

Anvil

August2015

October2014

1 PBNearLineStorage

Attic1 PB

SharedStorage

/common

October2013

452*16 2.6 GHz

64 GB RAM

Crane

June2014

January2019

Tusker

December2020

Crane

October2016

May2016

40

44*362.3 GHz

512 GB RAM

CraneOPA

Fall2017

?

Page 15: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

More to come…• HCC is buying incrementally, annually

• MRI or other grant funding could enhance this

• What should we target?

• Establishing an annual process

• Gather input in fall

• purchase in spring

• add incrementally until next spring

Page 16: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

0"

5,000"

10,000"

15,000"

20,000"

25,000"

1999" 2002" 2005" 2009" 2012" 2014"

16" 256" 656"

6956"

14492"

24923"Cores&1,500X&

Cores"

0"

1,000"

2,000"

3,000"

4,000"

5,000"

6,000"

7,000"

1999" 2002" 2005" 2009" 2012" 2014"

0.108" 1.2" 31.2"

1200"

3250"

6800"

TeraBy

tes)

Storage)63,000X)

Storage"

It takes ca. 6 days

to transfer 6,800 TB

at 100 Gbps

Page 17: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

Science is a Team Sport

Page 18: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

LHC and CMS

• In Run 1 the LHC created 15 PB of data/year

• In Run 2 expected output is 30 PB/year (1 GB/sec)

Page 19: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides
Page 20: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

Omaha Core

Holland Computing Center

Internet2 via GPN

East/WestCampus Networks(firewalls + IDS)

Lincoln Core Router

2x 10 Gigabit

DYNESEquipment

UNL Science DMZ

Campus NetworkResearchers

WDM

Composit Traffic100 Gigabit

100 Gigabit CapableWest CampusBorder Router

10x CMS DataTransfer Nodes

OmahaHPC

Clusters

100 Gigabit CapableEast CampusBorder Router

perfSONAR+ BRO IDSadditions

10 Gigabit4x 10 Gigabit100 GigabitperfSONARBro IDS

Future RedundantI2 Path (2015+)

Lincoln Core Switch(CMS and HPC clusters) Center for

Brain Imagingand Behavior

10x 10 Gigabit

Internet2 via CIC

Composit Traffic100 Gigabit

Collaboration Infrastructure

• Network Nebraska

• 100 Gbps between HCC and Internet2 via GPN

• DMZ to/from HCC

• OSG access to HCC resources

• DTNs, Globus Connect

Page 21: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

0"

2"

4"

6"

8"

10"

12"

14"

1999" 2002" 2005" 2009" 2012" 2014"

2"3"

5"

9"

13"14"Personnel(7X(

Personnel"

0"

20"

40"

60"

80"

100"

120"

1999" 2002" 2005" 2009" 2012" 2014"

0.155" 0.155" 0.622"10"

30"

110"

Gigabits(per(se

cond

(

Bandwidth(700X(

Bandwidth"

Page 22: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

Local speed record to date 42 (+10!) Gbps

• Software Defined Networks • PerfSONAR • Performance Testing

Page 23: KickStart 2018 - HCC · 2019-05-29 · KickStart 2018 David Swanson, PhD Director, Holland Computing Center –HCC 5-year Review, 2016 “HCC operates as a core facility and provides

Getting Started