NSF’s Office of Cyberinfrastructure

33
1 O C I NSF’s Office of Cyberinfrastructure Kevin Thompson Program Director National Science Foundation Office of Cyberinfrastructure [email protected] (many of the slides Are courtesy of Dr. Atkins)

description

NSF’s Office of Cyberinfrastructure. Kevin Thompson Program Director National Science Foundation Office of Cyberinfrastructure [email protected]. (many of the slides Are courtesy of Dr. Atkins). NSF Blue Ribbon Advisory Panel on Cyberinfrastructure. - PowerPoint PPT Presentation

Transcript of NSF’s Office of Cyberinfrastructure

Page 1: NSF’s Office of Cyberinfrastructure

1

O C

I

NSF’s Office of Cyberinfrastructure

Kevin Thompson

Program Director National Science Foundation Office of Cyberinfrastructure

[email protected](many of the slides

Are courtesy ofDr. Atkins)

Page 2: NSF’s Office of Cyberinfrastructure

2

O C

I

“a new age has dawned in scientific and engineering research, pushed by continuing progress in computing, information, and communication technology, and pulled by the expanding complexity, scope, and scale of today’s challenges. The capacity of this technology has crossed thresholds that now make possible a comprehensive “cyberinfrastructure” on which to build new types of scientific and engineering knowledge environments and organizations and to pursue research in new ways and with increased efficacy.”

http://www.nsf.gov/od/oci/reports/toc.jsphttp://www.nsf.gov/od/oci/reports/toc.jsp

NSF Blue Ribbon Advisory Panel on Cyberinfrastructure

Daniel E. Atkins, ChairUniversity of Michigan

Kelvin K. Droegemeier University of Oklahoma

Stuart I. FeldmanIBM

Hector Garcia-MolinaStanford University

Michael L. KleinUniversity of Pennsylvania

David G. MesserschmittUniversity of California at Berkeley

Paul MessinaCalifornia Institute of Technology

Jeremiah P. OstrikerPrinceton University

Margaret H. WrightNew York University

Page 3: NSF’s Office of Cyberinfrastructure

3

O C

ISome Science Drivers

Inherent complexity and multi-scale nature of todays frontier science challenges.

Requirement for multi-disciplinary, multi-investigator, multi-institutional approach (often international).

High data intensity from simulations, digital instruments, sensor nets, observatories.

Increased value of data and demand for data curation & preservation of access.

Exploiting infrastructure sharing to achieve better stewardship of research funding.

Strategic need for engaging more students in high quality, authentic science and engineering education.

Page 4: NSF’s Office of Cyberinfrastructure

4

O C

ICyberInfrastructure is about Connectedness

betweenIdeas People

Systems Orgs

Page 5: NSF’s Office of Cyberinfrastructure

5

O C

I

Investing Within the Framework of the NSF CI Vision

Complex, multi-scale, Complex, multi-scale, multidisciplinary S&E multidisciplinary S&E research challengesresearch challenges

Complex, multi-scale, Complex, multi-scale, multidisciplinary S&E multidisciplinary S&E research challengesresearch challenges

Advances in Advances in components of CI-components of CI-

systems for S&E R&Esystems for S&E R&E

Advances in Advances in components of CI-components of CI-

systems for S&E R&Esystems for S&E R&E

Blue Ribbon Panel Blue Ribbon Panel reports plus 30+ reports plus 30+

disciplinary or disciplinary or interdisciplinary interdisciplinary

community workshops community workshops on CI on CI

Blue Ribbon Panel Blue Ribbon Panel reports plus 30+ reports plus 30+

disciplinary or disciplinary or interdisciplinary interdisciplinary

community workshops community workshops on CI on CI

NSB and NSF internal NSB and NSF internal working groupsworking groups

NSB and NSF internal NSB and NSF internal working groupsworking groups

Framework for Framework for ActionAction

Framework for Framework for ActionAction

Call for ActionCall for ActionCall for ActionCall for Action

Page 6: NSF’s Office of Cyberinfrastructure

6

O C

I

Page 7: NSF’s Office of Cyberinfrastructure

7

O C

I

CI Vision for 21st Century Discovery

High High PerformancPerformanc

e e ComputingComputing

Page 8: NSF’s Office of Cyberinfrastructure

8

O C

IDrivers for HPC Strategy

Modeling, simulation, prediction– complex systems– width: multi-disciplinary, more systemic– depth: multi-scale– community codes (complex collaborations)

Extraction of knowledge/discovery from massive collections of heterogeneous data

More powerful visualization and interaction capabilities

Page 9: NSF’s Office of Cyberinfrastructure

11

O C

IHPC Multi Track Strategy

Track 2: Four solicitations over FY06-09: $30M/yr

acquisition + additional O&M

cost.

Track 1: One solicitation funded

over 4 years: $200M acquisition + additional O&M

cost.

Page 10: NSF’s Office of Cyberinfrastructure

14

O C

I

SDSC

TACC

UC/ANL

NCSA

ORNL

PU

IU

PSC

NCAR

Caltech

USC/ISI

UNC/RENCI

UW

Resource Provider (RP)

Software Integration Partner

Grid Infrastructure Group (UChicago)

TeraGrid: 11 Resource Providers, One Facility

Tennessee

LSU

Network Hub

Page 11: NSF’s Office of Cyberinfrastructure

15

O C

IThe TeraGrid Offers

A portfolio of high performance computing and visualization architectures

Common user environments Pooled community support expertise Targeted consulting services (ASTA) Science gateways to simplify access, support

collaboration, and integration of research and education.

Knowledge management & collaboration infrastructure

Page 12: NSF’s Office of Cyberinfrastructure

O C

I

Science Gateways(Virtual Organizations)

Built to serve communities of practice by bringing together a variety of resources in a customized portal

Examples include:• NanoHub (models, analysis tools, education tools for nano-science and

nano-engineering)

• NEES (A distributed earthquake engineering collaboratory)

• LEAD (A gateway to support on-demand modeling and analysis of tornados and other strong storms)

• SCEC Earthworks Project (Earthquake hazard assessment from first principles)

• NVO (National Virtual astronomy Observatory - increasingly used vehicle for astronomical study)

http://www.teragrid.org/programs/sci_gateways/

Page 13: NSF’s Office of Cyberinfrastructure

17

O C

I

Projected TeraGrid Performance Growth

(withTrack 1)• 14 PF/s• >1000 TB largest memory

• 14 PF/s• >1000 TB largest memory

Page 14: NSF’s Office of Cyberinfrastructure

18

O C

ICompute power - 504 Teraflops aggregate peakCompute power - 504 Teraflops aggregate peak– 3,936 Sun four-socket, quad-core nodes3,936 Sun four-socket, quad-core nodes– 15,744 AMD Opteron “Barcelona” processors15,744 AMD Opteron “Barcelona” processors

Quad-core, four flops/cycle (dual pipelines)Quad-core, four flops/cycle (dual pipelines)MemoryMemory

– 2 GB/core, 32 GB/node, 125 TB total2 GB/core, 32 GB/node, 125 TB total– 132 GB/s aggregate bandwidth132 GB/s aggregate bandwidth

Disk subsystemDisk subsystem– 72 Sun x4500 “Thumper” I/O servers, 24TB each72 Sun x4500 “Thumper” I/O servers, 24TB each– 1.7 Petabyte total storage1.7 Petabyte total storage

System power: 3.0 MW totalSystem power: 3.0 MW totalSystem: 2.4 MWSystem: 2.4 MW

– ~90 racks, in 6 row arrangement~90 racks, in 6 row arrangement– ~100 in-row cooling units~100 in-row cooling units– ~4000 sq.ft. total footprint~4000 sq.ft. total footprint

Infiniband interconnectInfiniband interconnect– Full non-blocking 7-stage Clos fabricFull non-blocking 7-stage Clos fabric– Low latency (~2 msec), high-bandwidth (~950 MB/s)Low latency (~2 msec), high-bandwidth (~950 MB/s)

TACC Track-2 a

Page 15: NSF’s Office of Cyberinfrastructure

O C

I

TeraGrid HPC Usage by Discipline4/01/06 – 3/31/07

BIO23%

CHE19%

AST15%

PHY15%

ENG10%

DMR9%

GEO6%

CISE3%

DMS0% Staff

0%

Courtesy of John Towns

SBE0.2%

Page 16: NSF’s Office of Cyberinfrastructure

20

O C

I

CI Vision for 21st Century Discovery

High High PerformancPerformanc

e e ComputingComputing

Data & Data & VisualizatiVisualization/Interacton/Interact

ionion

Page 17: NSF’s Office of Cyberinfrastructure

21

O C

I

Drivers for Data & Interaction Strategy

Increasing Scale and Heterogeneity of Data Integration, federation, interoperability IP issues, sharing, openness Curation, quality control Long-term stewardship/preservation

(including appraisal) Operational sustainability Appropriate workflow, visualization

environments

Page 18: NSF’s Office of Cyberinfrastructure

23

O C

I

DataNet Partners: Three Primary Goals

Building information integration capability on the foundation of a reliable data preservation network.

Achieving long-term preservation/access capability in an environment of rapid technology advances.

Achieving economic and technological sustainability.

Page 19: NSF’s Office of Cyberinfrastructure

24

O C

I

New Types of Organizations Envisioned

Integrate library and archival sciences, cyberinfrastructure, computer and information sciences, and domain science expertise.

Provide reliable digital preservation, access, integration, and analysis capabilities for science and/or engineering data over decades-long timeline.

Continuously anticipate and adapt to changes in technologies and in user needs and expectations

Engage at the frontiers of computer and information science and cyberinfrastructure with research and development to drive the leading edge forward; and

Serve as component elements of an interoperable data preservation and access network.

Page 20: NSF’s Office of Cyberinfrastructure

O C

IEarth Observing Systems

Data and Computation

Interoperability

ORION

WATERS

NEON

CUAHSIOOI GEON

DATANET Foundation

Tools

Page 21: NSF’s Office of Cyberinfrastructure

26

O C

I

CI Vision for 21st Century Discovery

High High PerformancPerformanc

e e ComputingComputing

Data & Data & VisualizatioVisualization/Interactin/Interacti

onon

Virtual Organizations Virtual Organizations for Distributed for Distributed CommunitiesCommunities

NanoHub

NEES

ATLAS

NVO

LEAD

iVDgL

CMS

Page 22: NSF’s Office of Cyberinfrastructure

27

O C

IDrivers for Virtual Organization Strategy

Research community driven demand for distributed multi-disciplinary, multi-institutional, multi-observational, multi-facility science projects.

Need for new approaches to broadening participation in both research/discovery and passion-for-science building, inquiry-based, learning/education.

Page 23: NSF’s Office of Cyberinfrastructure

28

O C

I

Possibilities for “better than being there” organizational forms:

– decreased time to discovery;

– decreased time from discovery;

– increased intellectual cross-section and transformational results;

– enhanced stewardship and RoI for research infrastructure investments;

– multi-use: discovery, learning, rapid-response, ....

A key to transforming the What, How, and Who participates.

A key to economic leadership in a global knowledge-based flat world.

Benefits of Virtual Organizations

Page 24: NSF’s Office of Cyberinfrastructure

29

O C

I

P: people, I: information, F: facilities, instruments

ST-SPST-SPPP: Physical : Physical mtgsmtgsII: Print-on-: Print-on-paper books, paper books, journalsjournalsFF: Physical : Physical labs, studios, labs, studios, shopsshops

DT-SPDT-SPPP: Shared : Shared notebooknotebookII: Library : Library reservesreservesFF: Time-shared : Time-shared physical labs, ...physical labs, ...

DT-SPDT-SPPP: Shared : Shared notebooknotebookII: Library : Library reservesreservesFF: Time-shared : Time-shared physical labs, ...physical labs, ...

ST-DPST-DPPP: AV : AV conferenceconferenceII: Web search: Web searchFF: Online : Online instrumentsinstruments

ST-DPST-DPPP: AV : AV conferenceconferenceII: Web search: Web searchFF: Online : Online instrumentsinstruments

DT-DPDT-DPPP: Email: EmailII: Knowbots: KnowbotsFF: Autonomous : Autonomous observatoriesobservatories

DT-DPDT-DPPP: Email: EmailII: Knowbots: KnowbotsFF: Autonomous : Autonomous observatoriesobservatories

TimeSame(synchronous)

Different(asynchronous)

Geog

rap

hi

c P

lace

Sam

eD

iffere

nt

Virtual Organizations offer additional modes of interaction between People, Information,

and Facilities

Page 25: NSF’s Office of Cyberinfrastructure

30

O C

I

VO-Global: International R&E Networking

Page 26: NSF’s Office of Cyberinfrastructure

31

O C

I

NSF’s International Research Connections (IRNC) Program

Goal - enable international science and engineering research collaborations and activities involving U.S. scientists, engineers, and educators

5 year program (2004-2009) and 5 main awards for US $25M total:US-China and other partners (GLORIAD)US-Japan and TEIN2 partners (TransPAC2)US-Europe (Translight/Starlight)US-Latin America (WHREN)US-Australia (TransLight/PaacificWave)

Page 27: NSF’s Office of Cyberinfrastructure

32

O C

IBut What About the Social

Architecture?

Many “collaboratories” have been less effective than hoped or even outright failures. Reasons are often more social than technical.

Need more principled understanding of the analysis and design of virtual organizations from a combined socio-technical perspective is critical to achieving the flexibility and agility to respond to new and emerging challenges in an increasingly competitive knowledge-based economy.

Page 28: NSF’s Office of Cyberinfrastructure

33

O C

I

Need interdisciplinary experimental research involving practicing science and engineering communities to create more systematic knowledge about the intertwined social and technical issues of effective virtual organizations and how they can act as a collective force to change not only how we practice research but what we produce from it.

Leaders of science & engineering virtual organizations need help in understanding social architecture design principles.

Need better understanding the forms and attributes of new organizational forms made possible by CI.

VO Challenges

Page 29: NSF’s Office of Cyberinfrastructure

34

O C

I

Learning & Work Force Learning & Work Force Needs & OpportunitiesNeeds & Opportunities

CI Vision for 21st Century Discovery

High High PerformancPerformanc

e e ComputingComputing

Data & Data & VisualizatioVisualization/Interaction/Interactio

nn

Virtual Organizations Virtual Organizations for Distributed for Distributed CommunitiesCommunities

Page 30: NSF’s Office of Cyberinfrastructure

35

O C

IDrivers for LWD Strategy

Education and workforce development to create and use CI for S&E research and education.

More effective learning through the application of cyberinfrastructure.

Exploiting the new opportunities that cyberinfrastructure brings for broadening participation by people who, because of physical capabilities, location, or history, have been excluded from the frontiers of scientific and engineering research and education.

Explore CI support for integrated research and education.

Page 31: NSF’s Office of Cyberinfrastructure

36

O C

I

Develop authentic cyberlearning opportunities through the creation of virtual environments that provide laboratory-based research experiences to enhance formal and informal education.

Enhance teacher training for and with cyberinfrastructure tools and resources to prepare a 21st century teaching force for a 21st century workforce.

Use cyber tools and resources to collect and analyze first-time data pertaining to individual and organizational learning to improve our understanding of human cognition and meta-cognition, action and interaction.

Investigate the use of handheld mobile devices as platforms for cyberlearning delivery and discovery.

Possible Next Frontiers for CI-enabled Learning and Workforce

Development

Page 32: NSF’s Office of Cyberinfrastructure

37

O C

I

OCI Funding Opportunities

High Performance Computing– TeraGrid III – informational meeting June 25 at NSF– Track II 2009 – stay tuned– PetaApps – stay tuned

Data– DataNet – Sustainable Digital Data Preservation and Access

Network Partners, 07-601, next prelim proposal due date October 6, 2008

Virtual Organizations– Virtual Organizations as Sociotechnical Systems (VOSS), NSF

08-550, deadline was June 2, 2008 Other programs

– Strategic Technologies for Cyberinfrastructure (STCI) next target date August 14, 2008

– International Reserach Network Connections (IRNC) program solicitation expected early 2009

Page 33: NSF’s Office of Cyberinfrastructure

38

O C

I

Thank You