An Introduction to R

23
An Introduction to R Statistical Computing AMS 597 Stony Brook University Spring 2009 By Tianyi Zhang

description

An Introduction to R. Statistical Computing AMS 597 Stony Brook University Spring 2009 By Tianyi Zhang. Outline. What’s R? R Operating Environment Help and Manuals Download and install packages Basic Commands and Functions Import and Export Data Graphic Plot Samples. - PowerPoint PPT Presentation

Transcript of An Introduction to R

Page 1: An Introduction to R

An Introduction to R

Statistical ComputingAMS 597

Stony Brook University Spring 2009

By Tianyi Zhang

Page 2: An Introduction to R

Outline

What’s R? R Operating Environment

Help and Manuals Download and install packages

Basic Commands and Functions Import and Export Data Graphic Plot Samples

Page 3: An Introduction to R

What’s R? Programming language and software environment for

data manipulation, calculation and graphical display.

Originally created by Ross Ihaka and Robert Gentleman at University of Auckland, and now developed by the R Development Core Team.

Where to get R?

http://www.r-project.org/ Latest Release: R 2.8.1, on Dec 22, 2008

Page 4: An Introduction to R

Why use R?IT IS FREEPre-compiled binary versions are provided for Microsoft Windows, Mac OS X, and several other Linux/Unix-like operating systemsOpen source code available freely available on GNU General Public LicenseFor computationally-intensive tasks, C, C++ and Fortran code can be linked and called at run timeAn effective data handling and storage facilityA suite of operators for calculations on arrays, in particular matricesA large, coherent, integrated collection of intermediate tools for data analysisGraphical facilities for data analysis and display either directly at the computer or on hardcopy

Page 5: An Introduction to R

How to Study R

Help Manual Google!!!

Page 6: An Introduction to R

R Operating Environment

Page 7: An Introduction to R

R Operating Environment

Page 8: An Introduction to R

Start with R

>help. start()>help(function nam

e)>?function nameExample>?lm

>??object>help.search(“title”)Example>help.search(“test”)>??colsum

Page 9: An Introduction to R

Download and Install Package

All R functions and datasets are stored in packages. Only when a package is loaded are its contents available. This is down both for efficiency, and to aid package developers.

To see which packages are installed at your site, issue the command

>library(boot)

Users connected to the Internet can use install.packages() and update.packages() to install and update packages.

To see packages currently loaded, use search().

Page 10: An Introduction to R

Download and Install Package

Page 11: An Introduction to R

Vector and Assignment

Page 12: An Introduction to R

Basic Commands

Page 13: An Introduction to R

Data Frame

Page 14: An Introduction to R

Import and Export Data

Page 15: An Introduction to R

Control Statements

Page 16: An Introduction to R

Graphic Plot

Plot Types: Line Charts, Bar Charts, Histograms, Pie Charts, Dot Charts, etc.

Format:>PLOT-TYPE(PLOT-DATA, DETAILS)

PLOT-TYPE: plot, plot.xy, barplot, pie, dotchart, etc. PLOT-DATA: Data, Data$XXX, as.matrix(Data), etc. Details: axes, col, pch, lty, ylim, type, xlab, ylab, etc.

For graphics plot: http://www.harding.edu/fmccown/R/

Page 17: An Introduction to R
Page 18: An Introduction to R

Sample 1: T-test

Generate two datasets X and Y; Do the Shapiro-Wilk normality test; Do the t-test

Alternative: two sided; less; greater; Paired or not; Confidence interval.

Page 19: An Introduction to R

Sample 2: Linear Regression

A comparison of GM monthly returns & SP500 monthly returns. GM and SP500 monthly return data during the period of Jan. 2002 to Jun. 2007 are taken. Plotted in R, they will be analyzed and compared.

Data from: http://www.stanford.edu/~xing/statfinbook/data.html

Page 20: An Introduction to R

Sample 2: Linear Regression

Page 21: An Introduction to R

Sample 3: Neuron Data Study

Heat diagramHierarchical Clustering

Page 22: An Introduction to R

Sample 3: Neuron Data Study

Partial Correlation Network among genes in PKJ and BAS cells

Page 23: An Introduction to R

Question?