
CS 356 Unit 0

Class Introduction

Basic Hardware Organization

What is This Course About?

• Introduction to Computer Systems – a.k.a. Computer Organization or Architecture

• Filling in the "systems" details

– How is software generated (compilers, libraries) and executed (OS, etc.)?

– How does computer hardware work and how does it execute the software I write?

• Lays a foundation for future CS courses

– CS 350 (Operating Systems), ITP/CS 439 (Compilers), CS 353/EE 450 (Networks), EE 457 (Computer Architecture)

Today's Digital Environment

[Figure: layers of today's digital environment, as listed on the slide: Voltage / Currents; Transistors / Circuits; Digital Logic; Processor / Memory / GPU / FPGAs; Assembly / Machine; OS / Libraries; C++ / Java / Python; Algorithms; Networks; Applications. A marker labeled "Our Focus in CS 356" highlights part of this stack.]

Why is System Knowledge Important?

• Increase productivity

– Debugging

– Build/compilation

• High-level language abstractions break down at certain points

• Improve performance

– Take advantage of hardware features

– Avoid pitfalls presented by the hardware

• Basis of understanding security and exploits

What Will You Learn

• Binary representation systems

• Assembly

• Processor organization

• Memory subsystems (caching, virtual memory)

• Compiler optimization and linking

Administration + Syllabus

• Course Website: usc-cs356.github.io (Install Course VM)

• Textbook: Computer Systems: A Programmer’s Perspective, Bryant and O’Hallaron, 2015

• Grading:

– 30 points for assignments (5 assignments, equally weighted)

– 40 points for midterms (25 for best MT, 15 for worst)

– 30 points for final

• Piazza

• Expectations for getting help

– Not allowed to search online! We know some code is available online (those caught using or even referencing online code will be submitted to SJACS to be assigned an F)

– Acknowledge TA/CP help with comments in your code

– Don’t discuss solutions with other students

ABSTRACTIONS & REALITY

Abstraction vs. Reality

• Abstraction is good until reality intervenes

– Bugs can result

– It is important to understand the underlying HW implementations

– Sometimes abstractions don't provide the control or performance you need

Reality 1

• ints are not integers and floats aren't reals

• Is x² >= 0?

– Floats: Yes

– Ints: Not always (assuming 32-bit signed ints)

• 40,000*40,000 = 1,600,000,000

• 50,000*50,000 = -1,794,967,296

• Is (x+y)+z = x+(y+z)?

– Ints: Yes

– Floats: Not always

• (1e20 + -1e20) + 3.14 = 3.14

• 1e20 + (-1e20 + 3.14) = 0 (the 3.14 is lost when it is rounded against 1e20)

Reality 1: Examples
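The two claims above can be reproduced with a short sketch. This is not the original slide example. Python integers never overflow, so the hypothetical helper `mul_int32` emulates a 32-bit two's-complement multiply by keeping only the low 32 bits. Python floats are IEEE double precision, so the float sums behave as they would for a C `double`.

```python
def mul_int32(a: int, b: int) -> int:
    """Multiply a and b the way a 32-bit signed int would (assumed width).

    The full product is wrapped into the range [-2**31, 2**31 - 1],
    which is what two's-complement hardware keeps.
    """
    product = a * b
    return (product + 2**31) % 2**32 - 2**31


def float_sums() -> tuple:
    """Return both groupings of 1e20 + -1e20 + 3.14 as doubles."""
    left = (1e20 + -1e20) + 3.14   # the large values cancel first
    right = 1e20 + (-1e20 + 3.14)  # 3.14 is rounded away first
    return left, right


if __name__ == "__main__":
    print(mul_int32(40_000, 40_000))
    print(mul_int32(50_000, 50_000))
    print(float_sums())
```

The wrapped product for 50,000*50,000 is negative because the true result, 2,500,000,000, does not fit in 31 value bits.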

Reality 2

• Knowing some assembly is critical

• You'll probably never write much (any?) code in assembly as compilers are often better than even humans at optimizing code

• But knowing assembly is critical when

– Tracking down some bugs

– Taking advantage of certain HW features that a compiler may not be able to use

– Implementing system software (OS/compilers/libraries)

– Understanding security and vulnerabilities

Reality 2: Example

Reality 3

• Memory matters!

– Memory is not infinite

– Memory can impact performance more than computation for many applications

– Source of many bugs both for single-threaded and especially parallel programs

– Source of many security vulnerabilities

Reality 4

• There's more to performance than asymptotic complexity

– Constant factors matter!

– Even operation counts do not predict performance

• How long an instruction takes to execute is not deterministic…it depends on what other instructions have been executed before it

– Understanding how to optimize for the processor organization and memory can lead to up to an order of magnitude performance increase

COMPUTER ORGANIZATION AND ARCHITECTURE

Drivers and Trends

Computer Components

• Processor

– Executes the program and performs all the operations

• Main Memory

– Stores data and program (instructions)

– Different forms:

• RAM = read and write but volatile (loses values when power is off)

• ROM = read-only but non-volatile (maintains values when power is off)

– Significantly slower than the processor

• Input / Output Devices

– Generate and consume data from the system

– MUCH, MUCH slower than the processor

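[Figure: a processor (arithmetic + logic + control circuitry) connected to memory (RAM) that holds the program (instructions) and the data (operands), along with input devices, output devices, and a disk drive holding the software program. Recipe analogy: instructions are steps such as "Combine 2 c. flour" and "Mix in 3 eggs"; the processor reads instructions and operates on data.]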
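As a rough illustration of a processor that reads instructions from memory and operates on data, here is a minimal sketch of a fetch-execute loop. The four-instruction machine (`LOAD`, `ADD`, `STORE`, `HALT`), the single accumulator, and the function name `run` are all made up for this example and do not correspond to any real instruction set.

```python
def run(program, data):
    """Execute a toy program against a list of data words.

    program: list of (opcode, address) pairs, the stored instructions.
    data:    list of integers, the operands held in memory.
    """
    acc = 0  # single accumulator register inside the "processor"
    pc = 0   # program counter: index of the next instruction
    while True:
        opcode, addr = program[pc]  # fetch the next instruction
        pc += 1
        if opcode == "LOAD":        # memory -> accumulator
            acc = data[addr]
        elif opcode == "ADD":       # accumulator += memory
            acc += data[addr]
        elif opcode == "STORE":     # accumulator -> memory
            data[addr] = acc
        elif opcode == "HALT":
            return data
        else:
            raise ValueError(f"unknown opcode: {opcode}")


# data[2] = data[0] + data[1]
ADD_TWO = [("LOAD", 0), ("ADD", 1), ("STORE", 2), ("HALT", 0)]

if __name__ == "__main__":
    print(run(ADD_TWO, [2, 3, 0]))
```

Both the program and the data live in ordinary lists here, mirroring the idea that main memory stores instructions as well as operands.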

Architecture Issues

• Fundamentally, computer architecture is all about the different ways of answering the question:

“What do we do with the ever-increasing number of transistors available to us?”

• Goal of a computer architect is to take increasing transistor budgets of a chip (i.e. Moore’s Law) and produce an equivalent increase in computational ability

Moore’s Law, Computer Architecture & Real-Estate Planning

• Moore’s Law = Number of transistors able to be fabricated on a chip grows exponentially with time

• Computer architects decide, “What should we do with all of this capability?”

• Similarly real-estate developers ask, “How do we make best use of the land area given to us?”

USC University Park Development Master Plan: http://re.usc.edu/docs/University%20Park%20Development%20Project.pdf

Transistor Physics

• Cross-section of transistors on an IC

• Moore’s Law is founded on our ability to keep shrinking transistor sizes

– Gate/channel width shrinks

– Gate oxide shrinks

• Transistor feature size is referred to as the implementation “technology node”

Technology Nodes

Growth of Transistors on Chip
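Exponential growth of this kind can be sketched numerically. The slides only say the transistor count grows exponentially with time; the two-year doubling period used as the default below is a commonly quoted figure and is an assumption here, as are the function name and the starting count in the usage line.

```python
def projected_transistors(start_count, years, doubling_period_years=2.0):
    """Project a transistor count forward under exponential growth.

    doubling_period_years is an assumed parameter, not a value taken
    from the slides.
    """
    return start_count * 2 ** (years / doubling_period_years)


if __name__ == "__main__":
    # Hypothetical starting point: 1 million transistors.
    for years in (0, 10, 20):
        print(years, projected_transistors(1_000_000, years))
```

Under the assumed doubling period, ten years is five doublings, so the count grows by a factor of 32.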

Implications of Moore’s Law

• What should we do with all these transistors?

– Put additional simple cores on a chip

– Use transistors to make cores execute instructions faster

– Use transistors for more on-chip cache memory

• Cache is an on-chip memory used to store data the processor is likely to need

• Faster than main memory (RAM), which is on a separate chip and much larger (thus slower)

Memory Wall Problem

• Processor performance is increasing much faster than memory performance

[Figure: Processor-Memory Performance Gap. Processor performance improves about 55%/year while memory performance improves about 7%/year. Source: Hennessy and Patterson, Computer Architecture – A Quantitative Approach (2003)]
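To see how quickly the two growth rates in the figure diverge, the gap can be compounded year over year. This is a back-of-the-envelope sketch that assumes both rates stay constant; the function name `gap_after` is made up for the example.

```python
def gap_after(years, processor_rate=0.55, memory_rate=0.07):
    """Return how many times faster the processor has grown than memory.

    Both rates are annual improvements expressed as fractions
    (0.55 means 55%/year) and are assumed constant over the period.
    """
    return ((1 + processor_rate) / (1 + memory_rate)) ** years


if __name__ == "__main__":
    for years in (1, 5, 10):
        print(years, round(gap_after(years), 1))
```

With these rates the ratio grows by about 45% each year, which compounds to roughly a 40x gap after ten years.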

Cache Example

• Small, fast, on-chip memory to store copies of recently-used data

• When the processor attempts to access data it will check the cache first

– If the cache has the desired data, it can supply it quickly

– If the cache does not have the data, it must go to the main memory (RAM) to access it
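The check-the-cache-first behavior described above can be modeled in a few lines. This is a simplified sketch, not how real hardware is organized: the class name `TinyCache` is hypothetical, main memory is a plain dictionary, and evicting the least recently used entry is an assumed replacement policy that the slides do not specify.

```python
from collections import OrderedDict


class TinyCache:
    """A small cache of recently used values in front of a slower memory."""

    def __init__(self, memory, capacity):
        self.memory = memory        # dict standing in for main memory (RAM)
        self.capacity = capacity    # maximum number of cached entries
        self.lines = OrderedDict()  # address -> value, oldest use first
        self.hits = 0
        self.misses = 0

    def read(self, addr):
        if addr in self.lines:            # cache has the desired data
            self.hits += 1
            self.lines.move_to_end(addr)  # mark as most recently used
            return self.lines[addr]
        self.misses += 1                  # must go to main memory
        value = self.memory[addr]
        self.lines[addr] = value
        if len(self.lines) > self.capacity:
            self.lines.popitem(last=False)  # evict least recently used
        return value


if __name__ == "__main__":
    ram = {addr: addr * 10 for addr in range(8)}
    cache = TinyCache(ram, capacity=2)
    for addr in (0, 1, 0, 2, 1):
        cache.read(addr)
    print(cache.hits, cache.misses)
```

In the usage loop only the second read of address 0 hits; address 1 has already been evicted by the time it is read again.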

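[Figure: two cases. (1) The cache has the desired data and supplies it to the processor. (2) The cache does not have the desired data, so the access goes over the system bus to main memory.]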

Reality 3 & 4 Example
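The original example for this slide did not survive extraction. As a stand-in, the sketch below shows one way memory behavior can differ while the operation count stays the same: visiting every element of a 2D array in row order versus column order. It assumes a row-major array layout and a tiny direct-mapped cache; the function name, the default cache line size, and the number of lines are all hypothetical.

```python
def count_misses(rows, cols, order, line_elems=8, num_lines=4):
    """Count cache misses for one full traversal of a rows x cols array.

    The array is assumed to be stored row-major, so element (r, c) sits
    at address r * cols + c. The cache is direct-mapped: each memory
    line can live in exactly one slot.
    """
    if order == "row":
        indices = ((r, c) for r in range(rows) for c in range(cols))
    elif order == "col":
        indices = ((r, c) for c in range(cols) for r in range(rows))
    else:
        raise ValueError("order must be 'row' or 'col'")

    tags = [None] * num_lines  # which memory line each slot holds
    misses = 0
    for r, c in indices:
        line = (r * cols + c) // line_elems  # memory line of this element
        slot = line % num_lines              # the only slot it may use
        if tags[slot] != line:               # miss: fetch the line
            misses += 1
            tags[slot] = line
    return misses


if __name__ == "__main__":
    print(count_misses(16, 16, "row"))
    print(count_misses(16, 16, "col"))
```

Both traversals touch the same 256 elements, but in this model the column-order walk misses on every access because each step lands in a different memory line.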

[Figure: Pentium 4 die with the L2 cache, L1 data cache, and L1 instruction cache regions labeled]

Increase in Clock Frequency

Intel Nehalem Quad Core

Progression to Parallel Systems

• If power begins to limit clock frequency, how can we continue to achieve more and more operations per second?

– By running several processor cores in parallel at lower frequencies

– Two cores @ 2 GHz vs. 1 core @ 4 GHz yield the same theoretical maximum ops./sec.

• For various applications like graphics and computationally intensive workloads this is taken to an extreme by GPUs
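The two-core versus one-core comparison above is simple arithmetic, sketched here. The function name and the `ops_per_cycle` parameter are assumptions for illustration; the result is a theoretical peak that is only reached if the work divides perfectly across cores.

```python
def peak_ops_per_sec(cores, clock_ghz, ops_per_cycle=1):
    """Theoretical peak operations per second for identical cores.

    ops_per_cycle is an assumed per-core value, not taken from the slides.
    """
    return cores * clock_ghz * 1e9 * ops_per_cycle


if __name__ == "__main__":
    print(peak_ops_per_sec(cores=2, clock_ghz=2.0))
    print(peak_ops_per_sec(cores=1, clock_ghz=4.0))
```

Both calls give the same peak, which is the point of the comparison: the lower clock frequency is offset by the extra core.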

GPU Chip Layout

• 2560 Small Cores

• Upwards of 7.2 billion transistors

• 8.2 TFLOPS

• 320 Gbytes/sec

Photo: http://www.theregister.co.uk/2010/01/19/nvidia_gf100/

Source: NVIDIA

Intel Haswell Quad Core

8th Gen Coffee-Lake Hex-Core Intel Processor

https://www.researchgate.net/figure/Die-Map-of-a-Hexa-Core-Coffee-Lake-Processor_fig6_332543387

Source: ee.usc.edu/~redekopp/cs356/slides/CS356Unit0_Intro.pdf