Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions...

23
Fitting distributions with R Anja Feldmann

Transcript of Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions...

Page 1: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...

Fitting distributions with R

Anja Feldmann

Page 2: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...

Statistics with R

❒ What is R?❍ Open-source statistical environment❍ A framework for data analysis❍ A programming language❍ Free software

❒ What can you do with R?❍ Data manipulation:

• Loading data• Selecting and modifying• Computing functions on data, e.g. statistics

Page 3: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...

Statistics with R: data

❒ Loading dataØ vector.small = read.table(“ping-10“,nrows=10)Ø vector.medium = read.table(“ping-100“,nrows=10)Ø vector.large = read.table(“ping-1000“,nrows=10)

❒ Playing with data❍ Vector Length: length(vector)❍ Element n: vector[n]❍ First k elements: vector[1:k]❍ Last k elements: vector[(length(vector)-k+1): length(vector)]❍ All elements larger than x: vector[vector>x]❍ Trimming: mean(vector,trim=1/x)

Page 4: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...

Statistics with R: basic stats

❒ Summary statistics:❍ Mean: mean(vector)❍ Median: median(vector)❍ Standard deviation: sd(vector)❍ Summary: summary(vector)

❒ Plotting densities:❍ Basic plot: plot(table(vector))❍ Histogram: hist(vector,x)❍ CDF: plot(ecdf(vector))❍ Quantile plot: boxplot(vector)

Page 5: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...

Statistics with R: distributions

❒ Normal distribution: Centered limit of mostdistributions

❒ Generating a normal distribution with same mean and standard deviation as data:

• x.normal= rnorm(n=1000,m=mean(vector),sd=sd(vector))

• Impact of n on the distribution❍ Plotting the CDF:

• hist(x.normal)• plot(ecdf(x.normal))

Page 6: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...

Statistics with R: distributions (2)

❒ Poisson distribution: Arrivals of large numbers ofindependent sources❍ x.poisson =

rpois(n=1000,lambda=mean(vector))❍hist(x.poisson)❍plot(ecdf(x.poisson))

6

Page 7: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...

Statistics with R: distributions (3)

❒ Normal and Poisson have limited variations: nothing like delay measurements

❒ Heavy-tailed distributions:❍Weibull: distribution of failures (extreme

events)• x.weibull=rweibull(n=1000,shape=1.8,scale=mea

n(vector))• hist(x.weibull)• plot(ecdf(x.weibull))

7

Page 8: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...

Statistics with R: Matching delay

❒ From what distribution do delay come?❍ Plotting sample quantiles (e.g. delay measurements)

against theoretical quantiles❍ If distributions are similar, quantiles should fall on the

diagonal❍ Are delay measurements normal or heavy-tailed?

• qqplot(vector,x.norm)• qqplot(vector,x.weibull)• qqplot(vector,x.poisson)

8

Page 9: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 10: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 11: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 12: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 13: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 14: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 15: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 16: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 17: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 18: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 19: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 20: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 21: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 22: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...
Page 23: Fitting distributions with R · Statistics with R: basic stats ... im17_06_stats_distributions Created Date: 6/26/2017 2:12:16 PM ...