CS 179: Lecture 2 Lab Review 1



The Problem

Add two arrays:

A[] + B[] -> C[]

GPU Computing: Step by Step

1. Set up inputs on the host (CPU-accessible memory)
2. Allocate memory for inputs on the GPU
3. Copy inputs from host to GPU
4. Allocate memory for outputs on the host
5. Allocate memory for outputs on the GPU
6. Start GPU kernel
7. Copy output from GPU to host

(Copying can be asynchronous. A host-side sketch of these steps follows below.)
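A minimal host-side sketch of these steps, assuming float arrays of length N and a kernel named addVectorsKernel (names are illustrative, not the course's exact code):

    float *a, *b, *c;                      // host arrays (assume allocated and filled)
    float *dev_a, *dev_b, *dev_c;          // device pointers
    unsigned int blocks, threadsPerBlock;  // launch configuration: see "Calling the Kernel"
    size_t size = N * sizeof(float);

    // Steps 2 & 5: allocate device memory for the inputs and the output
    cudaMalloc((void **) &dev_a, size);
    cudaMalloc((void **) &dev_b, size);
    cudaMalloc((void **) &dev_c, size);

    // Step 3: copy inputs from host to GPU
    cudaMemcpy(dev_a, a, size, cudaMemcpyHostToDevice);
    cudaMemcpy(dev_b, b, size, cudaMemcpyHostToDevice);

    // Step 6: start the GPU kernel
    addVectorsKernel<<<blocks, threadsPerBlock>>>(dev_a, dev_b, dev_c);

    // Step 7: copy the output from GPU back to host
    cudaMemcpy(c, dev_c, size, cudaMemcpyDeviceToHost);

    // Clean up device memory when done
    cudaFree(dev_a); cudaFree(dev_b); cudaFree(dev_c);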

The Kernel

Determine a thread index from the block ID and the thread ID within a block:
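A sketch of how that kernel might look (illustrative; the slide's original code is not quoted here):

    // Each thread computes one element of C from its global index.
    __global__ void addVectorsKernel(const float *a, const float *b, float *c) {
        // thread index = block ID * threads per block + thread ID within the block
        unsigned int idx = blockIdx.x * blockDim.x + threadIdx.x;
        c[idx] = a[idx] + b[idx];   // note: no bounds check yet (fixed below)
    }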

Calling the Kernel
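A hedged sketch of the launch, with N the array length and 512 threads per block chosen for illustration:

    const unsigned int threadsPerBlock = 512;          // a multiple of the warp size (32)
    const unsigned int blocks = N / threadsPerBlock;   // note: drops any remainder (fixed later)

    addVectorsKernel<<<blocks, threadsPerBlock>>>(dev_a, dev_b, dev_c);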

CUDA implementation (2)

Fixing the Kernel

For large arrays, our kernel doesn't work!

- Bounds-checking – be on the lookout!
- The kernel also needs a way to handle more elements than there are threads…

Fixing the Kernel – Part 1
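A minimal sketch of the Part 1 fix – guard each thread with a bounds check (illustrative, not the slide's exact code):

    // Part 1: skip threads whose index lies past the end of the arrays.
    __global__ void addVectorsKernel(const float *a, const float *b,
                                     float *c, unsigned int n) {
        unsigned int idx = blockIdx.x * blockDim.x + threadIdx.x;
        if (idx < n)                  // bounds check
            c[idx] = a[idx] + b[idx];
    }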

Fixing the Kernel – Part 2
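And a sketch of the Part 2 fix – a loop that strides by the total number of threads in the grid, so a fixed-size grid covers arrays of any length:

    // Part 2: each thread processes multiple elements, jumping ahead by one
    // full grid's worth of threads on each iteration.
    __global__ void addVectorsKernel(const float *a, const float *b,
                                     float *c, unsigned int n) {
        unsigned int idx = blockIdx.x * blockDim.x + threadIdx.x;
        while (idx < n) {
            c[idx] = a[idx] + b[idx];
            idx += gridDim.x * blockDim.x;
        }
    }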

Fixing our Call
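The matching host-side fix might look like this (the 65535 cap reflects the grid-size limit on older GPUs; values are illustrative):

    // Ceiling division, so leftover elements still get covered; cap the block
    // count, since the striding kernel can now handle the rest.
    const unsigned int threadsPerBlock = 512;
    unsigned int blocks = (N + threadsPerBlock - 1) / threadsPerBlock;
    if (blocks > 65535)
        blocks = 65535;

    addVectorsKernel<<<blocks, threadsPerBlock>>>(dev_a, dev_b, dev_c, N);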

Lab 1!

Sum of polynomials – a fun, parallelizable example!

Suppose we have a polynomial P(r) with coefficients c0, …, c(n-1), given by:

P(r) = c0 + c1*r + c2*r^2 + … + c(n-1)*r^(n-1)

We want, for inputs r0, …, r(N-1), the sum:

P(r0) + P(r1) + … + P(r(N-1))

Output condenses to one number!

Calculating P(r) once

Pseudocode (one possible method):

    Given r, coefficients[]:
        result <- 0.0
        power <- 1.0
        for all coefficient indices i from 0 to n-1:
            result += coefficients[i] * power
            power *= r
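In CUDA C, that pseudocode might become a device helper like this (a sketch; the name evalPolynomial is made up here):

    // Evaluate P(r) = c[0] + c[1]*r + ... + c[n-1]*r^(n-1), accumulating powers of r.
    __device__ float evalPolynomial(const float *coefficients, int n, float r) {
        float result = 0.0f;
        float power = 1.0f;                    // r^0
        for (int i = 0; i < n; i++) {
            result += coefficients[i] * power;
            power *= r;                        // now r^(i+1)
        }
        return result;
    }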

Accumulation

The atomicAdd() function – important for safe operations! Many threads can add to the same memory location without losing updates.

Accumulation
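One way the accumulation could look: each thread evaluates P at one point and atomically adds into a single global accumulator (a sketch, reusing the evalPolynomial helper above):

    // Every thread adds its P(r[idx]) values directly into *output.
    __global__ void polynomialSumKernel(const float *r, const float *coefficients,
                                        int n, unsigned int N, float *output) {
        unsigned int idx = blockIdx.x * blockDim.x + threadIdx.x;
        while (idx < N) {
            atomicAdd(output, evalPolynomial(coefficients, n, r[idx]));
            idx += gridDim.x * blockDim.x;
        }
    }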

Shared Memory

- Faster than global memory
- Per-block: shared by all the threads within one block (used in the sketch below)

Linear Accumulation

atomicAdd() has a choke point! What if we reduced our results in parallel?

Linear Accumulation

Linear Accumulation (2)
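A hedged sketch of what these two slides' approach could amount to: each thread writes its partial sum into shared memory, then one thread per block sums that array linearly and issues a single atomicAdd, so the global choke point sees one atomic per block instead of one per thread:

    __global__ void polynomialSumSharedKernel(const float *r, const float *coefficients,
                                              int n, unsigned int N, float *output) {
        __shared__ float partialSums[512];        // assumes blockDim.x == 512

        // Grid-stride loop: each thread accumulates its own local sum.
        unsigned int idx = blockIdx.x * blockDim.x + threadIdx.x;
        float localSum = 0.0f;
        while (idx < N) {
            localSum += evalPolynomial(coefficients, n, r[idx]);
            idx += gridDim.x * blockDim.x;
        }
        partialSums[threadIdx.x] = localSum;
        __syncthreads();                          // wait until every thread has written

        // One thread walks the shared array linearly: one global atomic per block.
        if (threadIdx.x == 0) {
            float blockSum = 0.0f;
            for (unsigned int i = 0; i < blockDim.x; i++)
                blockSum += partialSums[i];
            atomicAdd(output, blockSum);
        }
    }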

Can we do better?

Last notes

- minuteman.cms.caltech.edu – the easiest option
- CMS accounts!
- Office hours:
  - Kevin: Monday, 8-10 PM
  - Connor: Tuesday, 8-10 PM