Itmat pcbi-r-course-1

Intro to using R for Bioinformatics: Part 1: The Basics
Angel Pizarro

description

First part of 3-part course on teaching the R statistical package.

Transcript of Itmat pcbi-r-course-1

Page 1: Itmat pcbi-r-course-1

Intro to using R for Bioinformatics: Part 1: The Basics

Angel Pizarro

Page 2: Itmat pcbi-r-course-1

Injecting a bit of reality

Page 3: Itmat pcbi-r-course-1

Taking it a bit further…

Waxing floors is not fun, and may not seem relevant, but have some faith, Daniel-san.

Page 4: Itmat pcbi-r-course-1

Outline

• We will teach you some basic uses of R

– “Do & Tell” method, where you will be asked to do an exercise and, once done, we will explain what just happened.

– Will cover basics, plotting and microarray analysis

• We will not teach you statistics.

Page 5: Itmat pcbi-r-course-1

What is R?

R is a language and environment for statistical computing and graphics.

– http://www.r-project.org

You can do stuff like this

Page 6: Itmat pcbi-r-course-1

Install & Run R

• You should have already installed R, but if you had trouble please see us after class.

• Start R

– On Windows, use Tinn-R

– On Mac, use the source R application

– On Linux, use the console

Page 7: Itmat pcbi-r-course-1

Help is plentiful

Help in three ways

Too much! Get me out!

Page 8: Itmat pcbi-r-course-1

More Help

help.start()

– Start an HTML help session

help(mean)

– Looks up the mean() function's help page

– ?mean

help.search("mean")

– Displays all help pages that contain text “mean”

– ??mean

Page 9: Itmat pcbi-r-course-1

Whet your appetite…

Page 10: Itmat pcbi-r-course-1

The Basics

• Please enter each of the following lines into your R session:

Page 11: Itmat pcbi-r-course-1
Page 12: Itmat pcbi-r-course-1
Page 13: Itmat pcbi-r-course-1
Page 14: Itmat pcbi-r-course-1

Basic Algebra

You will also see this form:

Page 15: Itmat pcbi-r-course-1

Variables

• “x” and “y” are variables.

• They are pointers to some value

• They can also be pointers to some function
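The slide's screenshot is not part of this transcript, so here is a minimal sketch of the idea. The names `z` and `average` are made up for illustration; only `x` and `y` come from the slide.

```r
# Assign values to variables with the arrow operator
x <- 3
y <- 4

# The equals sign also works for assignment at the top level
z = x + y
print(z)                      # [1] 7

# A variable can refer to a function just as it refers to a value
average <- mean
print(average(c(x, y, z)))    # same result as mean(c(x, y, z))
```

Typing a variable name on its own at the prompt prints whatever it currently refers to.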

Page 16: Itmat pcbi-r-course-1

Vectors

Enter this in your session: Results

Page 17: Itmat pcbi-r-course-1

Small tangent: What is “c(1,2,3)”?

• Use the help()
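As `help(c)` will tell you, `c()` combines its arguments into a vector. A short sketch (the variable names `v` and `w` are illustrative only):

```r
# c() combines its arguments into a single vector
v <- c(1, 2, 3)
print(v)            # [1] 1 2 3
print(length(v))    # [1] 3

# Vectors passed to c() are flattened into one longer vector
w <- c(v, 10, c(20, 30))
print(w)            # [1]  1  2  3 10 20 30
```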

Page 18: Itmat pcbi-r-course-1

Accessing Vector Members

In R, vector indexes start at 1. Many other programming languages start indexing at zero.

Also, NOT WHAT YOU THINK IT IS! It is an INDEX VECTOR, meaning that you access the members of a vector with a vector.
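A small sketch of what "accessing a vector with a vector" looks like in practice. The vector `v` and its values are invented for this example.

```r
v <- c(10, 20, 30, 40, 50)

v[1]            # a single index is a vector of length one: 10
v[c(1, 3)]      # an index vector picks elements 1 and 3: 10 30
v[2:4]          # a sequence is also an index vector: 20 30 40
v[-1]           # a negative index drops that element: 20 30 40 50
v[v > 25]       # a logical vector keeps the TRUE positions: 30 40 50

first_and_third <- v[c(1, 3)]
```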

Page 19: Itmat pcbi-r-course-1

Small Tangent 2: Creating Sequences

• Create regular sequences using a colon

• Colon has high operator precedence

• Also see the seq() function
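The three points above can be sketched as follows. The variable names are illustrative; the precedence pitfall with `1:n - 1` is a common one to watch for.

```r
one_to_five <- 1:5                      # 1 2 3 4 5
countdown   <- 5:1                      # 5 4 3 2 1

# The colon binds tighter than arithmetic operators
n <- 5
a <- 1:n - 1                            # (1:n) - 1  gives 0 1 2 3 4
b <- 1:(n - 1)                          # parentheses give 1 2 3 4
d <- 2 * 1:3                            # 2 * (1:3)  gives 2 4 6

# seq() gives control over the step size or the number of points
evens  <- seq(2, 10, by = 2)            # 2 4 6 8 10
thirds <- seq(0, 1, length.out = 4)     # 0, 1/3, 2/3, 1
```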

Page 20: Itmat pcbi-r-course-1

Vectors

• Are a list of items of the same data type

“double” is short for “double precision floating point number”
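To see the "same data type" rule in action, here is a minimal sketch using `typeof()`. The example values are invented.

```r
x <- c(1, 2, 3)
typeof(x)                   # "double"

i <- 1:3
typeof(i)                   # "integer"

s <- c("a", "b")
typeof(s)                   # "character"

# Mixing types forces every element to one common type
mixed <- c(1, "a", TRUE)
typeof(mixed)               # "character"
mixed                       # "1" "a" "TRUE"
```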

Page 21: Itmat pcbi-r-course-1

Doing Stuff with Vectors

• Math operations occur on each element in sequence

• Returns a vector of the same size
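A short sketch of element-wise math, with made-up vectors `x` and `y` of equal length:

```r
x <- c(1, 2, 3)
y <- c(10, 20, 30)

doubled  <- x * 2       # each element times 2: 2 4 6
summed   <- x + y       # element by element: 11 22 33
products <- x * y       # element by element: 10 40 90
squares  <- x^2         # 1 4 9

length(summed)          # 3, the same size as the inputs
```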

Page 22: Itmat pcbi-r-course-1

Factors

• Simply a vector of items that mean something

– Disease classifications, drug dosage, US states, months, hapmap ethnic group

– Can be ordered

– Can have multiple levels

• GO Functions
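A minimal sketch of a factor, using an invented drug dosage vector. The level names are assumptions for illustration.

```r
dose <- c("low", "high", "medium", "low", "high")

# An unordered factor: levels are sorted alphabetically by default
f <- factor(dose)
levels(f)                   # "high" "low" "medium"
table(f)                    # counts per level

# An ordered factor: we state the order of the levels ourselves
ordered_dose <- factor(dose,
                       levels = c("low", "medium", "high"),
                       ordered = TRUE)
ordered_dose[1] < ordered_dose[2]   # TRUE, because low < high
as.integer(ordered_dose)            # underlying codes: 1 3 2 1 3
```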

Page 23: Itmat pcbi-r-course-1

Array and Matrix

• Multi-dimensional generalizations of vectors

– k dimensions, where k > 0

– Assigned by the dim attribute

• Can be indexed by two or more indices

– If a single index value (can be a vector) is given, then dim is ignored and underlying vector values are accessed directly

– Unless the given index value is also an array

• Matrix is a two-dimensional array
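The indexing rules above can be sketched as follows. The values are invented, and `idx` is a hypothetical name for an index array.

```r
v <- 1:6
dim(v) <- c(2, 3)       # v is now a 2 x 3 matrix, filled column by column
#      [,1] [,2] [,3]
# [1,]    1    3    5
# [2,]    2    4    6

v[2, 3]                 # two indices, row 2 and column 3: 6
v[5]                    # single index ignores dim: 5
v[c(1, 6)]              # single index vector: 1 6

# An index array: each row holds one (row, column) pair
idx <- rbind(c(1, 2), c(2, 3))
picked <- v[idx]        # v[1, 2] and v[2, 3]: 3 6

# matrix() and array() set the dim attribute for you
m   <- matrix(1:6, nrow = 2)
arr <- array(1:24, dim = c(2, 3, 4))
arr[2, 3, 4]            # last element: 24
```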

Page 24: Itmat pcbi-r-course-1

Example

An INDEX ARRAY

Page 25: Itmat pcbi-r-course-1

List

• An ordered collection of named components
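Unlike a vector, the components of a list can have different types and lengths. A minimal sketch, with an invented `sample_info` list:

```r
sample_info <- list(
  id     = "S1",                # a character value
  age    = 42,                  # a number
  scores = c(1.5, 2.5, 3.0)     # a whole vector
)

names(sample_info)      # "id" "age" "scores", in the order given
length(sample_info)     # 3 components
str(sample_info)        # compact summary of each component
```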

Page 26: Itmat pcbi-r-course-1

List Access
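The slide's screenshot is not in this transcript; the usual ways to reach into a list look roughly like this (the `patient` list is invented for illustration):

```r
patient <- list(name = "Sample A", scores = c(1.5, 2.5, 3.0))

by_dollar   <- patient$name          # by name: "Sample A"
by_name     <- patient[["scores"]]   # by name with double brackets
by_position <- patient[[2]]          # by position, the same component
sub_list    <- patient["name"]       # single brackets return a list
second      <- patient$scores[2]     # then index into the component: 2.5
```

Double brackets pull out the component itself, while single brackets give back a smaller list.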

Page 27: Itmat pcbi-r-course-1

Data Frame

• Bastard step child of List and Matrix

– Essentially a list of vectors of same length

• Closest representation to an Excel file in R

• Easiest way to make one is to read in a CSV file
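A minimal sketch of both routes to a data frame. The gene names and values are invented, and the CSV is read from an in-memory string rather than a real file so the example runs anywhere.

```r
# Build a data frame by hand: a list of equal-length vectors
df <- data.frame(
  gene = c("g1", "g2", "g3"),
  expr = c(2.5, 0.7, 1.9),
  stringsAsFactors = FALSE
)

dim(df)             # 3 rows, 2 columns
df$expr             # access a column like a list component
df[2, "gene"]       # or index like a matrix: "g2"

# Read the same table from CSV text; with a real file you would
# pass the file name as the first argument instead of text =
csv_text <- "gene,expr\ng1,2.5\ng2,0.7\ng3,1.9"
from_csv <- read.csv(text = csv_text, stringsAsFactors = FALSE)
```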

Page 28: Itmat pcbi-r-course-1
Page 29: Itmat pcbi-r-course-1
Page 30: Itmat pcbi-r-course-1

Functions

• We’ve already used them

• Functions take in arguments and perform some action using those arguments.

• Actions do not affect the input arguments
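To illustrate the last point, here is a sketch with a made-up function `add_one()`. The function changes its own copy of the argument, and the caller's variable is left alone.

```r
add_one <- function(v) {
  v <- v + 1    # changes only the copy inside the function
  v             # the last expression is the return value
}

x <- c(1, 2, 3)
y <- add_one(x)

y               # 2 3 4
x               # still 1 2 3
```

To keep the result you have to assign it, as done with `y` here.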

Page 31: Itmat pcbi-r-course-1

Example

Page 32: Itmat pcbi-r-course-1

Write to CSV file

Extra column of the row indices
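The extra column comes from `write.csv()` writing row names by default. A minimal sketch, using an invented data frame and temporary files so nothing in your working directory is touched:

```r
df <- data.frame(gene = c("g1", "g2"), expr = c(2.5, 0.7))

# Default: row names are written as an unnamed first column
with_rows <- tempfile(fileext = ".csv")
write.csv(df, with_rows)
readLines(with_rows)[1]         # header starts with an empty "" column

# row.names = FALSE leaves that column out
without_rows <- tempfile(fileext = ".csv")
write.csv(df, without_rows, row.names = FALSE)
readLines(without_rows)[1]      # header has only gene and expr
```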

Page 33: Itmat pcbi-r-course-1

Save your work!

• R keeps track of your data and functions

• You can start from where you left off if you save these to some file

Page 34: Itmat pcbi-r-course-1

Start from your save point
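Saving and restoring can be sketched together. This example saves two named objects to a temporary file; the object names and the file are assumptions for illustration. To save everything in your session at once, `save.image()` does the same job for the whole workspace.

```r
x <- c(1, 2, 3)
square <- function(v) v^2

# Save selected data and functions to a file
workspace_file <- tempfile(fileext = ".RData")
save(x, square, file = workspace_file)

# Simulate starting over by removing the objects
rm(x, square)

# Load them back and carry on where you left off
load(workspace_file)
result <- square(x)     # 1 4 9
```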