Plugin GC Lec1

download Plugin GC Lec1

of 22

Transcript of Plugin GC Lec1

  • 7/29/2019 Plugin GC Lec1

    1/22

    Grid Computing

    Lectured by: Dr. Pham Tran Vu

    Email: [email protected]

  • 7/29/2019 Plugin GC Lec1

    2/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    2

    Course Objectives

    Grid computing concepts

    Roles of Grid computing

    Grid architecture

    Grid related technologies

    Grid programming models

    Developing Grid applications

  • 7/29/2019 Plugin GC Lec1

    3/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    3

    Assessment

    Exercises: 30%

    Seminar: 20%

    Final exam: 50%

  • 7/29/2019 Plugin GC Lec1

    4/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    4

    References

    Books I. Foster and C. Kesselman, The Grid: Blueprint for a New Computing

    Infrastructure. Morgab Kaufmann Publishers, 1999. Maozhen Li, Mark Baker, The Grid Core Technologies, Wiley, 2005.

    Fran Berman, Anthony J. G. Hey and Geoffrey C. Fox, Grid computing:Making the Global Infrastructure a Reality. John Wiley & Sons Ltd, 2003.

    Luis Ferrelra et al. Grid Services Programming and Application

    Enablement. IBM Redbooks, 2004. Mark Endrei et al. Patterns: Service Oriented Architecture and Web

    Services. IBM Redbooks, 2004.

    Web links: Globus project: http://www.globus.org/alliance/

    Global Grid Forum: http://www.ggf.org

    Course resources: http://www.cse.hcmut.edu.vn/~ptvu/gc

  • 7/29/2019 Plugin GC Lec1

    5/22

    Introduction to Grid Computing

  • 7/29/2019 Plugin GC Lec1

    6/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    6

    Lets Imagine

    CERN (founded 1954) =Conseil Europen pourla Recherche Nuclaire

    European Organisation forNuclear Reseach

    27 km circumference tunnel

    Large

    HadronCollider

    Concorde

    (15 Km)

    Balloon(30 Km)

    CD stack with1 year LHC data!(~ 20 Km)

    Mt. Blanc(4.8 Km)

    50 CD-ROM

    = 35 GB

    6

    cm

  • 7/29/2019 Plugin GC Lec1

    7/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    7

    A Bit of History

    I.Foster and C.Kesselman come from the parallel

    computing field Parallelism 1970-80 : The pioneers

    Vectorial computers (Cray 1, Cray 2,..)

    1980-90 : The exuberance

    Massively Parallel Computers (CrayT3D/T3E,Connection Machine,)

    1990-2000 : The profit

    Cluster (Beowulf, ASCII Red,..)

    2000 ->? The Globalization

    GRID

    Global Computing

  • 7/29/2019 Plugin GC Lec1

    8/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    8

    Definitions

    What is a Grid?

    Analogous to grid in electrical power grids

    (Foster & Kesselman 1999, Chetty & Buyya 2002)

    Hardware and software infrastructures that provides

    dependable consistent, pervasive and inexpensive

    access to high-end computational capabilities

    (Foster & Kesselman 1999)

    Flexible, secure, coordinated resource sharing andproblem solving in dynamic, multi-institutional virtual

    organisations (Foster et al. 2001)

  • 7/29/2019 Plugin GC Lec1

    9/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    9

    and Definitions

    Grid is a type of parallel and distributed system that enables

    the sharing, selection, and aggregation of geographicallydistributed autonomous resources dynamically at runtimedepending on their availability, capability, performance, cost,and users quality-of-service requirements

    (Buyya 2002)

    The three point checklist:

    (i) Coordinates resources that are not subject to a centralisedcontrol

    (Ii) Uses standard, open, general purpose protocol and interfaces (iii) Delivers nontrivial quality of service

    (Foster 2002)

  • 7/29/2019 Plugin GC Lec1

    10/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    10

    My Keywords

    Distributed

    Coordinated sharing

    Resources

    Large scale

    Non-trivial

    Big problem Internet

  • 7/29/2019 Plugin GC Lec1

    11/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    11

    Grid Generations

    1st generation - sharing high performance computing

    resources in distributed environment

    2nd generation - middleware to address issues of

    scalability, heterogeneity and adaptability in

    distributed environments with the focus on largescale computational power and huge volumes of data

    3rd generation - addresses the requirements for

    distributed collaboration in virtual environments;service oriented; automation; knowledge technology

    De Roure, Baker, Jennings & Shadbolt (2003)

  • 7/29/2019 Plugin GC Lec1

    12/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    12

    Types of Grid

    Usually one distinguishes three types of GRID

    Knowledge GRIDTo share knowledge

    Idea: Virtual distributed laboratory

    Data GRIDTo share the data

    Idea: Large scale data storage

    Computing GRIDTo share computing power

    Idea: Alternative to supercomputers

  • 7/29/2019 Plugin GC Lec1

    13/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    13

    The GRID today -1

    Knowledge GRID

    Virtual distributed Institutes/Laboratories allowing remotepeople to collaborate as they would be at the same location

    Purpose: to share knowledge, to collaborate for researches orspecific activities

    Few specific tries

    Remote surgery

    .

  • 7/29/2019 Plugin GC Lec1

    14/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    14

    The GRID today - 2

    Data GRID

    Also referenced as peer-to-peer technology Versus to the Client/Server technology who dominated the market

    of distributed systems during last decades, with this approach everyparticipant is both client and server

    Nobody knows exactly where the data are

    The system figures to find the nearest requested data out

    The requested data could be distributed on several location and samedata could be located at several places (data coherence problem)

    The pursuit of the data is transparent to the user Some examples:

    DATAGRID (CERN), The British Atmospheric Data Centre

  • 7/29/2019 Plugin GC Lec1

    15/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    15

    The GRID today - 3

    Computational GRID

    A supercomputer on you the table of your kitchen

    Some facts Supercomputers are very expensive and age very rapidly

    Except for some very specific persons, we do not need asupercomputer every day

    but when we need one it is very frustrating not to have one !

    The total time personal and other computer waste time not doinganything is huge

    I am (almost) permanently connected to a lot of computers.

    Potentially I am permanently connected to a supercomputer

    First try to exploit this fact: Project SETI@home

  • 7/29/2019 Plugin GC Lec1

    16/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    16

    Grid vs. cluster computing

    Partial view vs. Global view of the

    environment

    Large scale, geographically distributed

    Non-structured

    Heterogeneous

    Dynamic, unstable

    Multiple administrative domains

  • 7/29/2019 Plugin GC Lec1

    17/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    17

    Supercomputer

    PC

    Supercomputer

    My laptopMy laptop

    Workstation

    SMPCluster

    Computing Grid: our goal

    ?

  • 7/29/2019 Plugin GC Lec1

    18/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    18

    The Dream

    To make computers user friendly: To switch

    from a program centric view Execute this program on this (these) machine(s)

    to a service centric view Provide me this service with this QoS level

  • 7/29/2019 Plugin GC Lec1

    19/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    19

    The Grid We are Building

  • 7/29/2019 Plugin GC Lec1

    20/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    20

    Grid Characteristics Large-scale

    Need for dynamic selection

    Partial view of the environment

    Heterogeneity

    Hardware, OS, network, software environments (languages, libraries,

    tools...) Complex

    unpredictable structure

    Dynamic

    unpredictable behaviour

    Multiple administrative domain

    no centralized control

  • 7/29/2019 Plugin GC Lec1

    21/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    21

    Challenges (1)

    Efficient exploitation of Grid performance

    New programming models Service-centric vision

    Resource connectivity Middleware services Applications

    Interoperability vs. performance Good interoperability -> slow performance

    Fast -> not so interoperable

  • 7/29/2019 Plugin GC Lec1

    22/22

    Faculty of Computer Science and EngineeringHCMC University of Technology

    22

    Challenges (2)

    Grid scalability

    Resource management mechanisms/models Large number of heterogeneous resources

    Grid infrastructure and Grid application evaluations How to benchmark Grid systems and applications

    Trust and security How to trust the environment

    How to protect sensitive data