
Dynamic Monitoring and Tuning in Multicluster Environment

Genaro Costa, Anna Morajko, Paola Caymes Scutari, Tomàs Margalef and Emilio Luque

Universitat Autònoma de Barcelona (UAB)

Paradyn Week 2006, March 2006

Slide 2: Outline

Introduction
Multicluster Systems
Applications on Wide Systems
MATE
New Requirements
Design
Conclusions

Slide 3: Introduction - System Performance

New problems require more computing power, so performance is a key issue.

New wide systems are built from the available resources, and the user does not have total control over where the application will run.

It becomes more difficult to reach high performance and efficiency on these wide systems.

Slide 4: Introduction (II)

To reach performance goals, users need to find and solve bottlenecks.

Dynamic monitoring and tuning is a promising approach.

Because system properties change dynamically, efficient resource use is hard to achieve even for expert users.

Slide 5: Multicluster Systems

New systems are built from existing resources; examples are Networks of Workstations (NOW) and Heterogeneous NOW (HNOW) linked with multistage network interconnections.

Intra-cluster communications have different latencies than inter-cluster communications.

Generally, multiclusters are built of clusters (homogeneous or heterogeneous) interconnected by a WAN.

Slide 6: Multicluster Systems (II)

Each cluster can have its own scheduler (e.g. Condor, LSF, PBS) and can be exposed either through a head node or through all of its nodes.

[Diagram: clusters A, B and C; a job is submitted through a head node to a cluster managed by Condor/LSF/PBS.]

Slide 7: Applications on Wide Systems

Hierarchical Master/Worker applications raise the possibility of performance bottlenecks:

Load imbalance problems
Inefficient resource use
Non-deterministic inter-cluster bandwidth

[Diagram: a Master in Cluster A serves its own workers and a SubMaster in Cluster B, which serves that cluster's workers. The sub-master exploits data locality: common data are transmitted once.]

Slide 8: Applications on Wide Systems (II)

In hierarchical Master/Worker applications, the sub-master is seen by the master as a single high-capacity processing node.

Work distribution from the master to a sub-master should be based on:

Available bandwidth
Computing power

These characteristics may have dynamic behavior.
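As a rough illustration of the point above, the share of work sent to each sub-master could be made proportional to its effective (bottleneck) throughput, combining bandwidth and computing power. The record layout and the min-based weighting below are assumptions for illustration, not the distribution policy the slides describe:

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <vector>

// Hypothetical work-distribution sketch: each sub-master is described by
// its aggregate computing power and the capacity of the link to it, both
// expressed in task-sized units per second. The share of work it receives
// is proportional to the smaller of the two, i.e. to whichever resource
// bottlenecks it.
struct SubMaster {
    double power;      // tasks/s the cluster behind it can process
    double bandwidth;  // tasks/s the inter-cluster link can deliver
};

std::vector<double> workShares(const std::vector<SubMaster>& subs) {
    std::vector<double> eff;
    double total = 0.0;
    for (const auto& s : subs) {
        double e = std::min(s.power, s.bandwidth);  // bottleneck throughput
        eff.push_back(e);
        total += e;
    }
    for (double& e : eff) e /= total;  // normalize to fractions summing to 1
    return eff;
}
```

Because both characteristics change dynamically, such shares would have to be recomputed as new measurements arrive, which is exactly where dynamic tuning fits.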

Slide 9: MATE - Monitoring, Analysis and Tuning Environment

MATE performs dynamic automatic tuning of parallel/distributed applications.

[Diagram: the user develops the application source; during execution the tool monitors events through DynInst instrumentation, performs performance analysis on the collected performance data, identifies problems and their solutions, and applies tuning modifications to the running application.]

Slide 10: MATE (II)

MATE consists of three components:

Application Controller (AC)
Dynamic Monitoring Library (DMLib)
Analyzer

[Diagram: three machines, each running tasks under an AC; each task's DMLib sends events to the Analyzer, which requests instrumentation and applies modifications back through the ACs.]

Slide 11: MATE (III)

Each tuning technique is implemented in MATE as a "tunlet", a C/C++ library dynamically loaded into the Analyzer process. A tunlet defines:

measure points: what events are needed
performance model: how to determine bottlenecks and their solutions
tuning actions/points/synchronization: what to change, where, and when

[Diagram: tunlets plug into the Analyzer through the DTAPI; each tunlet encapsulates its measure points, performance model, and tuning point/action/synchronization.]
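The three parts of a tunlet can be sketched as a small C++ interface. The names below (Tunlet, MeasurePoint, TuningAction, and the idle-ratio heuristic) are illustrative assumptions, not MATE's actual DTAPI:

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Hypothetical sketch of a tunlet's three parts.
struct MeasurePoint { std::string event; };               // what events are needed
struct TuningAction { std::string param; double value; }; // what to change, and to what

class Tunlet {
public:
    virtual ~Tunlet() = default;
    // measure points: events the Analyzer must collect for this tunlet
    virtual std::vector<MeasurePoint> measurePoints() const = 0;
    // performance model: map collected metrics to tuning decisions
    virtual std::vector<TuningAction>
    analyze(const std::map<std::string, double>& metrics) = 0;
};

// Toy example: if workers are idle more than 20% of the time,
// double the work-batch size the master sends out.
class BatchSizeTunlet : public Tunlet {
public:
    std::vector<MeasurePoint> measurePoints() const override {
        return {{"worker_idle_ratio"}};
    }
    std::vector<TuningAction>
    analyze(const std::map<std::string, double>& metrics) override {
        auto it = metrics.find("worker_idle_ratio");
        if (it != metrics.end() && it->second > 0.2)
            return {{"batch_size", 2.0}};  // scale factor applied at the tuning point
        return {};                         // no bottleneck detected, no action
    }
};
```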

Slide 12: New Requirements

Transparent process tracking: the AC should follow application processes to any cluster.

Lower inter-cluster instrumentation communication overhead: inter-cluster links generally have higher latency and lower bandwidth.

Slide 13: Design - Transparent Process Tracking (System Service)

A machine or cluster can have MATE enabled as a daemon that detects the startup of new processes.

[Diagram: on each MATE-enabled machine, a resident AC detects the startup of a new task, attaches to it with the DMLib, takes control of it, and subscribes to the Analyzer; the Analyzer receives information about the new 'Task'.]
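The startup-detection step could work by diffing successive snapshots of running process IDs; anything new is a candidate for attach. This is a toy sketch under that assumption (in MATE the AC would then attach with DynInst and load the DMLib into the process):

```cpp
#include <cassert>
#include <set>
#include <vector>

// Hypothetical startup-detection sketch: the daemon periodically snapshots
// the set of running PIDs and reports every PID that appeared since the
// previous snapshot. Attaching and Analyzer subscription are out of scope
// here and would follow for each reported PID.
std::vector<int> detectNewProcesses(const std::set<int>& previous,
                                    const std::set<int>& current) {
    std::vector<int> fresh;
    for (int pid : current)
        if (previous.count(pid) == 0)
            fresh.push_back(pid);  // candidate for attach + subscription
    return fresh;
}
```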

Slide 14: Design (II) - Transparent Process Tracking (Application Plug-in)

The AC can be packaged together with the application binary.

[Diagram: at job submission, an AC is created on each remote machine alongside the task; the AC detects the new 'Task', uses DynInst to create and control it with the DMLib, and subscribes to the Analyzer.]

Slide 15: Design (III) - Lower Communication Overhead

Smart event collection: a full application trace may generate too much overhead.

Event aggregation: remote trace events should be aggregated into trace-event abstractions, saving bandwidth.

Inter-cluster trace event routing.
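One plausible form of event aggregation is to collapse raw timing events into a fixed-size summary before they cross the inter-cluster link. The record layout below is an assumption for illustration, not the abstraction format the slides define:

```cpp
#include <algorithm>
#include <cassert>
#include <limits>
#include <vector>

// Hypothetical event-aggregation sketch: instead of forwarding every raw
// timing event across the slow inter-cluster link, a local collector
// reduces a batch of them to one summary record, trading per-event detail
// for bandwidth.
struct EventSummary {
    long count = 0;
    double sum = 0.0;  // mean is sum / count on the receiving side
    double min = std::numeric_limits<double>::max();
    double max = std::numeric_limits<double>::lowest();
};

EventSummary aggregate(const std::vector<double>& durations) {
    EventSummary s;
    for (double d : durations) {
        ++s.count;
        s.sum += d;
        s.min = std::min(s.min, d);
        s.max = std::max(s.max, d);
    }
    return s;
}
```

Whatever the exact format, the semantics of such aggregation must be agreed with the tunlets' performance models, which is listed as future work below.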

Slide 16: Analyzer Approaches

Centralized: requires modifying tunlets to distinguish the instrumentation data of local application processes.

Hierarchical: requires splitting tunlets into local tunlets and global tunlets.

Distributed: requires that tunlet instances located on different Analyzer instances cooperate to tune an application.

Slide 17: Design (IV) - Lower Communication Overhead (II): Centralized Analyzer Approach

[Diagram: a single Analyzer collects events from the ACs and tasks of the machines in both clusters; an Event Router forwards the events across the inter-cluster link between Cluster A and Cluster B.]

Slide 18: Design (V) - Hierarchical Analyzer Approach

[Diagram: each cluster runs a Local Analyzer that performs local performance-model analysis over its ACs and tasks, and forwards abstract events to a Global Analyzer.]

Slide 19: Design (VI) - Distributed Analyzer Approach (Distributed Monitoring, Analysis and Tuning Environment)

[Diagram: each cluster runs its own Analyzer over its ACs and tasks; the tunlet instances in the two Analyzers cooperate across the inter-cluster link.]

Slide 20: Conclusions and Future Work

Conclusions:

The interference of instrumentation traffic with inter-cluster communication should be minimal.
Process tracking enables MATE for multicluster systems.
The centralized Analyzer approach benefits the tunlet developer but does not scale.
The distributed Analyzer approach scales but requires a different model-based analysis.

Slide 21: Conclusions and Future Work (II)

Future Work:

Development of new tunlets for the distributed and hierarchical Analyzer approaches.
Tuning based only on local instrumentation data.
Semantics of aggregation for instrumentation events.
Patterns of distributed-tunlet cooperation.
Scenarios of distributed Analyzer cooperation in multiclusters.

Slide 22: Thank you…