Computer Systems Lab The University of Wisconsin - Madison Department of Computer Sciences Linux...
-
Upload
stephanie-jacobs -
Category
Documents
-
view
213 -
download
1
Transcript of Computer Systems Lab The University of Wisconsin - Madison Department of Computer Sciences Linux...
Computer Systems LabThe University of Wisconsin - Madison Department of Computer Sciences
Linux Clusters
David Thompson
http://www.cs.wisc.edu/ \ ~thomas/madlug
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Overview
• The Computer Systems Lab (CSL)
• Clusters
• The condor/db cluster
• Scalable Linux Administration
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Overview
• The Computer Systems Lab (CSL)
• Clusters
• The condor/db cluster
• Scalable Linux Administration
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Computer Systems Lab
• Purpose
• Staff
• Resources
Computer Systems LabThe University of Wisconsin - Madison Department of Computer Sciences
Purpose
“To support the research and teaching missions of the Department of
Computer Sciences”
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Staff
• 8 Full Time
• 12 - 20 Part Time
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Responsibilities
• Networks– Gigabit, 100BaseT, ATM, FDDI– Cisco, Foundry routers– 3com, HP, Cisco switches
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Responsibilities (cont.)
• Operating Systems– Solaris, Linux, Digital Unix, AIX, IRIX, NT
• Applications– compilers, dbs, simulators, email, image
processing....
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Responsibilities (cont.)
• 641 software packages installed– 69 Gbytes– multiple version– each package installed for several architectures– several thousand builds
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Responsibilities - (cont.)
• Workstations– 600 PCs (including cluster)– 200 Sparcs– 15 Alphas– others
• 5600 User home directories– 69 Gbytes
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Responsibilities (more)
• AFS– 1 Tbyte of ubiquitous file space– 14 File Servers, 3 db Servers– 95% client cache hit rates
• Backups– 2 week epoch cycle (1 Tb)– Daily incs
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Overview
• The Computer Systems Lab (CSL)
• Clusters
• The condor/db cluster
• Scalable Linux Administration
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Clusters
• Definitions
• Architectures
• Example
• Applications
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Definitions
• NOW - Network of workstations
• COW - Cluster of workstations– “Some degree of network isolation”– “Dedicated function”
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Architectures
• N-dimensional arrays– “previous & next” neighbor– hypercube
• Simple Network
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Architectures
• Distributed – MPI– PVM– condor
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Examples
• The Hive
• http://newton.gsfc.nasa.gov/thehive/
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Examples - The Hive
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Examples - The Hive (cont.)
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Redundant Networks
http://einstein.drexel.edu/beowulf/Beowulf.html
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
http://www.cs.nmsu.edu/pcl/
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Cluster Applications
• Image Analysis– http://newton.gsfc.nasa.gov/thehive/
thehive_dir/tilton.html
• Parallel Virtual File System (PVFS)– http://ece.clemson.edu/parl/pvfs/
• Speech Recognition– http://noel.feld.cvut.cz/magi/
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Cluster Applications (cont.)
• Physics– Viscoelasticity
• http://www.meca.ucl.ac.be/memawww/deepflow/
– Seismology• http://weland.esd.mun.ca/index.html
– Big Bang• http://www.phy.duke.edu/~muller/BRAHMA/
index.html
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Cluster Applications (cont.)
• Physics (cont.)– Laser Interferometer Gravitational-Wave
Observatory (LIGO)• http://www.ligo.caltech.edu/
– NA49 (??)• http://na49info.cern.ch/
– Large Acceptance Hadron Detector for an Investigation of Pb-induced Reactions at the CERN SPS
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Overview
• The Computer Systems Lab (CSL)
• Clusters
• The condor/db cluster
• Scalable Linux Administration
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Computer Science Cluster
• Two connected clusters– Dual Xeon 550mHz, 512k cache, 1 Gig RAM,
Ultra 2 SCSI 9 Gig boot disk, tulip network– 64 node compute cluster– 36 node db cluster with 4 extra 9 Gig disks and
GNIC-II Gigabit ethernet– Red Hat Linux 6.1, kernel 2.2.12
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Cluster Architecture
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Cluster Picture
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Overview
• The Computer Systems Lab (CSL)
• Clusters
• The condor/db cluster
• Scalable Linux Administration
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Scalable Linux Administration
• What
• Why
• Installation
• Maintenance
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Scalable Admin - What
• Leverage
• Control systems
• Remote monitoring
• Operating system upgrades
• Centralized Services– kerberos, afs, logging
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Scalable Admin - Why
• Consistent user view– Available applications– Stability
• Predictable Admin Environment
• Security
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Scalable Admin - Installation
• Red Hat Kickstart– Configuration file
• network config, nfs locations, disk layout, RPMs to install
– Boot disk, nfs, or bootp/dhcp– Post-install script– redhat-6.1/i386/doc/HOWTO/KickStart-HOWTO
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Sample Kickstart Script
# $Id: ks.cfg,v 1.3 1999/10/07 18:57:24 thomas Exp $lang en_USnetwork --bootproto bootpnfs --server pinstall.cs.wisc.edu --dir
/install/redhat-6.0/i386keyboard uszerombr yesclearpart --allpart / --size 100#part /tmp --size 300part /var --size 75part /usr --size 570part swap --size 127part /var/vice/cache --size 120part /local --size 2 --grow --maxsize 4000
Computer Systems LabThe University of Wisconsin Madison Department of Computer Sciences
Scalable Admin - Maintenance
• Update RPMS– Create list of RPMs, versions, and files to install– Each computer updates based on list
• Special files– package (afs)– cfengine (gnu)– config files (filedist)
Computer Systems LabThe University of Wisconsin - Madison Department of Computer Sciences
Linux Clusters
David Thompson
http://www.cs.wisc.edu/~thomas/madlug