Lecture 1: Course Introduction and...

8
Page 1 CS194-3/CS16x Introduction to Systems Lecture 27 Course Wrap-up and Various Topics December 5, 2007 Prof. Anthony D. Joseph http://www.cs.berkeley.edu/~adj/cs16x Lec 27.2 12/5/07 Joseph CS194-3/16x ©UCB Fall 2007 Goals for Today Course wrap-up Various topics Note: Some slides and/or pictures in the following are adapted from slides ©2005 Silberschatz, Galvin, and Gagne. Slides courtesy of Kubiatowicz, AJ Shankar, George Necula, Alex Aiken, Eric Brewer, Ras Bodik, Ion Stoica, Doug Tygar, and David Wagner. Lec 27.3 12/5/07 Joseph CS194-3/16x ©UCB Fall 2007 You‟ve gained a basic knowledge of systems, networking, databases, security, and software engineering problems, challenges, and solutions And, you‟ve worked with a large-scale codebase and built two real-world applications But, there‟s a lot more! Take CS 162, CS 186, CS 169, EE 122, CS 161 Some common themes: Complexity, modularity, abstractions, layering, SW flaws Dealing with large systems and groups is very hard! Methods: Programming and documentation style, E2E arguments Project team organization, IDEs, testing, and other tools Course Summary Lec 27.4 12/5/07 Joseph CS194-3/16x ©UCB Fall 2007 Database relational model, SQL ACID model, query operators Information Retrieval: Boolean search Programming parallel and distributed systems Processes and threads: “Building a Thread SystemConcurrency control in languages, systems, and DBMSs: » Mutual exclusion, semaphores, condition variables, monitors Serializability, conflict serializability, 2PL and strict 2PL, 2PC, logging, recovery, deadlock Programming with MapReduceOperating Systems Kernel and address spaces Address translation, caching, TLBs, and demand paging Course Summary

Transcript of Lecture 1: Course Introduction and...

Page 1

CS194-3/CS16xIntroduction to Systems

Lecture 27

Course Wrap-up and Various Topics

December 5, 2007

Prof. Anthony D. Joseph

http://www.cs.berkeley.edu/~adj/cs16x

Lec 27.212/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Goals for Today

• Course wrap-up

• Various topics

Note: Some slides and/or pictures in the following areadapted from slides ©2005 Silberschatz, Galvin, and Gagne. Slides courtesy of Kubiatowicz, AJ Shankar, George Necula, Alex Aiken, Eric Brewer, Ras Bodik, Ion Stoica, Doug Tygar, and David Wagner.

Lec 27.312/5/07 Joseph CS194-3/16x ©UCB Fall 2007

• You‟ve gained a basic knowledge of systems, networking, databases, security, and software engineering problems, challenges, and solutions

– And, you‟ve worked with a large-scale codebase and built two real-world applications

– But, there‟s a lot more!

– Take CS 162, CS 186, CS 169, EE 122, CS 161

• Some common themes:

– Complexity, modularity, abstractions, layering, SW flaws

– Dealing with large systems and groups is very hard!

• Methods:

– Programming and documentation style, E2E arguments

– Project team organization, IDEs, testing, and other tools

Course Summary

Lec 27.412/5/07 Joseph CS194-3/16x ©UCB Fall 2007

• Database relational model, SQL

– ACID model, query operators

– Information Retrieval: Boolean search

• Programming parallel and distributed systems

– Processes and threads: “Building a Thread System”

– Concurrency control in languages, systems, and DBMSs: » Mutual exclusion, semaphores, condition variables, monitors

– Serializability, conflict serializability, 2PL and strict 2PL, 2PC, logging, recovery, deadlock

– “Programming with MapReduce”

• Operating Systems

– Kernel and address spaces

– Address translation, caching, TLBs, and demand paging

Course Summary

Page 2

Lec 27.512/5/07 Joseph CS194-3/16x ©UCB Fall 2007

• I/O Systems

– Disk management, file systems, naming, directories

• File system organization and DBMS indexing (B+ trees)

• Networking

– Architectures, layering, protocols (UDP and TCP), flow control, congestion control

– “Building a Peer-to-Peer Application”

• Computer and network security

– Cryptosystems, viruses, worms, firewalls, VPNs

• Post-graduation? Consider continuing education, extension, …

Course Summary

Lec 27.612/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Some Topics People Requested

• User Interface design• Computing for developing

nations• Sensor networks• Web programming• Dragons

– No, not really…– But, here is a Chinese dragon from Wikipedia

• And the topic is…

• Datacenter is the Computer

Lec 27.712/5/07 Joseph CS194-3/16x ©UCB Fall 2007

• Google program == Web search, Gmail,…

• Google computer ==

– Thousands of computers, networking, storage

• Warehouse-sized facilities and workloads may be unusual today but are likely to be more common in the next few years

Datacenter is the Computer

(From Luiz Barroso‟s talk at Berkeley)

Lec 27.812/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Datacenter is the Computer

• Orders of magnitude more devices (CPU‟s/rack, Number of racks/managed facility)

• 1980‟s: Bank of America datacenter ≈20 IBM 370 mainframes ≈100 CPU‟s, ≈100 disks

• 1995: Network of Workstations-1 ≈100 CPUs, 200 disks; NOW-2 ≈1000 CPUs, 2000 disks

• 2000: Inktomi ≈ 10,000 nodes in 4 DCs

• 2005: Google ≈ 450,000 Nodes in ≥ 25 DC*

• Computer Architecture == Architecture!

• What should be the basic “building” block

*John Markoff, “Google's not-so-very-secret weapon,” NY Times, 6/13/06

Page 3

Lec 27.912/5/07 Joseph CS194-3/16x ©UCB Fall 2007

9

Datacenter is the Computer

• A datacenter composed of 20 ft. containers

– Power/cooling for 200 KW of racked HW

– External taps for electricity, network, water

• 250 Servers, 7 TB DRAM, or 1.5 PB disk

– Up to 2,000 cores, providing 8,000 simultaneous processing threads

Project Blackbox10/17/06 Lec 27.1012/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Sun‟s Project BlackBox

Lec 27.1112/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Sun‟s Project BlackBox

Lec 27.1212/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Sun‟s Project BlackBox

Page 4

Lec 27.1312/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Sun‟s Project BlackBox

BREAK

Lec 27.1512/5/07 Joseph CS194-3/16x ©UCB Fall 2007

10G Ethernet

PDAs Cell PhonesFuture Devices

WirelessInfrastructure

DesktopPCs

Edge Servers

Datacenter

Datacenter is the Computer

Lec 27.1612/5/07 Joseph CS194-3/16x ©UCB Fall 200716

Datacenter is the Computer

• Re-inventing Client/Server Computing

• The Datacenter is the (Server) Computer

• The Handheld is the (Client) Computer– Dell ships more laptops than desktops in 2008?

• Intel's Otellini: “Digital Convergence, Wireless Will Help Buoy Industry” October 12, 2006 – During his keynote, Otellini and others at Intel

demonstrated a prototype "Universal Communicator“ – a combination cell phone, PC and video device that could transfer real-time video, voice and data wirelessly

Page 5

Lec 27.1712/5/07 Joseph CS194-3/16x ©UCB Fall 2007

• Key insight:

– Enable “weak” end devices use (wireless) networking to leverage powerful infrastructure resources

– Examples:

» Smartphones: Google web search, Microsoft Live Search, Apple iPhone web applications

» Google Apps versus Microsoft Office

» Others?

• Web Services as library functions

– Ruby on Rails, MapReduce (Hadoop), Chubby distributed lock manager

• Use Web Services FOR storage

– E.g., Amazon Simple Storage Service (S3) and Elastic Computing (EC2), Google FS, Bigtable

» Smugmug (≈ Flickr) saves $0.5M/yr using Amazon S3

Datacenter is the Computer

Lec 27.1812/5/07 Joseph CS194-3/16x ©UCB Fall 2007

• Limited capabilities

– 433 Mhz AMD x86 CPU

– 256 MB RAM

– 1 GB NAND Flash» No disk drive

– 802.11b/g WiFi

– 7.5” dual-mode TFT display

» Grayscale reflective 1200x900

» Color 800x600

– Color video camera

• Ultra low-cost: $200

One Laptop Per Child (OLPC)

Lec 27.1912/5/07 Joseph CS194-3/16x ©UCB Fall 2007

OLPC Uses Mesh Networking to Reach Infrastructure

Lec 27.2012/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Unmanned Airborne Vehicles (UAV) as Rapidly Deployable Airborne Radio Networks?

• Stratellite statistics:– 245 ft length, 145 ft width, 87 ft height, 1.3 million ft3

– Solar cells power electric motors

– Payload capacity: 3,000 lbs

– Maximum altitude: 70,000 ft

– LoS to a 300,000 sq mile area

– Wireless reach to 200 mile radius

– Maximum duration: 18 months

– But, not yet commercially available/feasible

Stratellite

Page 6

Lec 27.2112/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Administrivia

• Project 3 code due Thursday 12/6

• Midterm 3 Exam is Monday 12/10• In-class 9-10:45• Course survey followed by exam

Lec 27.2212/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Sensor Networking Wireless Vision

A symphony of embedded devices.

Asset Tracking

Home Automation

Military Scenarios

Security

Lec 27.2312/5/07 Joseph CS194-3/16x ©UCB Fall 2007

UCB 1st Generation Sensor „Mote‟

• 4Mhz, 8bit MCU (ATMEL)

512 bytes RAM, 8K ROM

• 900Mhz Radio (RF Monolithics)

10-100 ft. range

• Temperature Sensor

• Light Sensor

• LED outputs

• Serial Port

COTS Dust

weC Mote

Lec 27.2412/5/07 Joseph CS194-3/16x ©UCB Fall 2007

UCB 2nd Generation Sensor „Mote‟

• Two Board Sandwich

– Main CPU board with Radio Communication

– Secondary Sensor Board

• Allows for expansion and customization

Current sensors include: Acceleration, Magnetic Field, Temperature, Pressure, Humidity, Light, and RF Signal Strength.

Can control RF transmission strength & Sense Reception Strength

Page 7

Lec 27.2512/5/07 Joseph CS194-3/16x ©UCB Fall 2007

UCB Sensor Mote Design Lineage

• COTS dust prototypes (Kris Pister et al.)

• weC Mote (~30 produced)

• Rene Mote (850+ produced)

• Dot (1000 produced)

• Mica node (1800+ produced) ?

Lec 27.2612/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Conclusions

• Google OS– Not so much a product as a speculation on future direction

• Parallel OSs– Need for fine-grained synchronization

• Windows vs Linux: – Graphics vs Server?– Cathedral vs Bazaar– Controlled vs Free

• Quantum Computing– Using interesting properties of physics to compute

• Peer to Peer– A philosophy of systems design at extreme scale– Probabilistic design when it is appropriate– New techniques aimed at unreliable components– A rethinking (and recasting) of distributed algorithms

• Let‟s give a hand to the TAs!• Good Bye!

Lec 27.2712/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Multi-Hop Routing Demo

• Sensors automatically assemble and determine routing topology

– Parallel Breadth First Search

– Shortest path to all nodes remembered

• Base station broadcasts out routing information

• Individuals listen for and propagate route update

– N messages sent

• Generational scheme to prevent cycles in routing table

Base

Lec 27.2812/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Cory Energy Monitoring/Mgmt System

• 50 nodes on single floor

• 5 level ad hoc net

• 30 sec sampling

• 250K samples to database over 6 weeks

Page 8

Lec 27.2912/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Structural Performance due to Multi-Directional Ground Motions (Glaser & CalTech)

Wiring for traditional structural instrumentation+ truckload of equipment

Mote infrastructureMote Layout

Comparison of Results

Lec 27.3012/5/07 Joseph CS194-3/16x ©UCB Fall 2007

Vehicle Tracking

Lec 27.3112/5/07 Joseph CS194-3/16x ©UCB Fall 2007

UAV-based Sensor Network Deployment

Lec 27.3212/5/07 Joseph CS194-3/16x ©UCB Fall 2007

• Datacenter is the computer!

• Many benefits

– Centralized, robust resources

– Amortize costs across tens of millions of users

– Simple, low cost devices can leverage billion dollar investments

• Many issues

– Dependence on network connectivity

– One outage affects many users

– Insufficient power resources for datacenters

Summary