LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

12
LSSG Green Belt Training Overview of Charts and Graphs: Mini Case

description

Shipping Example Data A few records from the dataset (total 319 records) are shown below: CenterOrderArrivalDaysStatusDistance Eastern3/3/2003 8:343/7/ : On time255 Eastern3/3/2003 8:353/6/ : On time196 Eastern3/3/2003 8:38**Back order299 Central3/3/2003 8:583/6/ : On time81 Central3/3/2003 9:043/8/ : On time235 Central3/3/2003 9:063/9/ : Late259 Western3/3/2003 9:443/6/ : On time291 Western3/3/2003 9:463/6/2003 9: On time271 Source: Minitab Software V14 Example file Categorical Data Continuous (Numerical) Data

Transcript of LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Page 1: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

LSSG Green Belt Training

Overview of Charts and Graphs: Mini Case

Page 2: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Overview – The Story

A retail company has 3 regional shipping centers, each with its own computer system.

Goal: To determine which of the 3 systems is most efficient, determine if it helps meet customer needs regarding shipping, and improve it further. Then, the system can be used companywide to facilitate integration and enhance efficiency.

Source: Meet Minitab (http://www.minitab.com/support/docs/rel14/MeetMinitab14.pdf)

Page 3: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Shipping Example Data

A few records from the dataset (total 319 records) are shown below:

Center Order Arrival Days Status DistanceEastern 3/3/2003 8:34 3/7/2003 15:21 4.28264 On time 255Eastern 3/3/2003 8:35 3/6/2003 17:05 3.35417 On time 196Eastern 3/3/2003 8:38 * * Back order 299Central 3/3/2003 8:58 3/6/2003 14:59 3.25069 On time 81Central 3/3/2003 9:04 3/8/2003 10:12 5.04722 On time 235Central 3/3/2003 9:06 3/9/2003 16:13 6.29653 Late 259Western 3/3/2003 9:44 3/6/2003 10:08 3.01667 On time 291Western 3/3/2003 9:46 3/6/2003 9:50 3.00278 On time 271

Source: Minitab Software V14 Example file

Categorical Data Continuous (Numerical) Data

Page 4: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

First Look at Shipping Data – Univariate Analysis

Univariate analysis simply means looking at data one variable at a time, as a precursor to multivariate analysis.

Purpose: To understand the individual variables before exploring relationships

among variables, To check for extraordinary, incorrect, or missing data.

The basic method is to graph the data to look at the distribution, and compute measures of central tendency (Mean/Median) and variation (Range, Standard Deviation).

Page 5: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Individual Value PlotsDa

ys

8

7

6

5

4

3

2

1

0

Individual Value Plot of Days

Center

Days

WesternEasternCentral

8

7

6

5

4

3

2

1

0

Individual Value Plot of Days vs Center

The graphs below show the number of days for shipping for all data together on the left, and separated by shipping center on the right. The graph on theright also shows the mean values for each center connected by a line. At first glance, it is evident that the Western center has the lowest mean number of daysfor shipping.

Page 6: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Frequency Histograms – A look at the distribution

Days

Freq

uenc

y

7654321

40

30

20

10

0

Histogram of Days

Days

Freq

uenc

y

7654321

20

15

10

5

0

3.984 1.280 994.452 1.252 1012.981 1.090 102

Mean StDev N

CentralEastern

Western

Center

Histogram of DaysNormal

Frequency histograms help us see how the data are distributed. At left we can see that the number of days for shipping across all centers is normally distributed with a mean of about 4, and ranging from about 1 to 8 days. At the right is the distribution of the 3 centers shown separately.

Page 7: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Box Plots – Comparing Distributions

Box plots are a useful way to compare distributions visually. Box plotsshow the smallest value, the first quartile, median, third quartile, and the Largest value of the variable. In the plots below, the Mean is also marked.

Center

Days

WesternEasternCentral

8

7

6

5

4

3

2

1

0

Boxplot of Days vs Center

Page 8: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Graphing Categorical Data – Bar/Pie Charts

Central Eastern

Western

Back orderLateOn time

Category

Pie Chart of Status

Panel variable: Center

The Bar chart below shows the status of orders in the sample as A percentage. About 7-8% of the orders are on back-order, whileAbout 5% are shipped late overall. The Pie charts show the percent Of back-orders and late shipments by each center.

Status

Coun

t

On timeLateBack order

300

250

200

150

100

50

0

Chart of Status

Page 9: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Descriptive Statistics

Descriptive Statistics: Days Results for Center = Central

Variable Status N N* Mean SE Mean StDevDays Back order 0 6 * * * Late 6 0 6.431 0.157 0.385 On time 93 0 3.826 0.119 1.149

Results for Center = Eastern

Variable Status N N* Mean SE Mean StDevDays Back order 0 8 * * * Late 9 0 6.678 0.180 0.541 On time 92 0 4.234 0.112 1.077

Results for Center = Western

Variable Status N N* Mean SE Mean StDevDays Back order 0 3 * * * On time 102 0 2.981 0.108 1.090

Descriptive statistics give us numerical insight into the data. Compare this information to the graphs on the previous page.

Page 10: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Testing Hypotheses - ANOVA

One-way ANOVA: Days versus Center

Source DF SS MS F PCenter 2 114.63 57.32 39.19 0.000Error 299 437.28 1.46Total 301 551.92

S = 1.209 R-Sq = 20.77% R-Sq(adj) = 20.24%

Individual 95% CIs For Mean Based on Pooled StDevLevel N Mean StDev -----+---------+---------+---------+----Central 99 3.984 1.280 (----*---)Eastern 101 4.452 1.252 (----*----)Western 102 2.981 1.090 (----*---) -----+---------+---------+---------+---- 3.00 3.50 4.00 4.50

Pooled StDev = 1.209

While it looks like the centers are different in their efficiencies, a hypothesis test can confirm that. A one-way ANOVA tests whether the mean number of days for the 3 centersare in fact significantly different from each other. The low p-value (almost 0) indicates that one can conclude with great confidence (almost 100%) that there at least one of the centers is different from the others.

Page 11: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Examining relationships - scatterplot

Distance

Days

5004003002001000

8

7

6

5

4

3

2

1

0

Scatterplot of Days vs Distance

Is the better performance of the Western center due to smaller shipping distances than the other regions? First, a scatterplot of Number of Days for shipping against the Distance across all centers seems to show no relationship between the two.

Page 12: LSSG Green Belt Training Overview of Charts and Graphs: Mini Case.

Scatterplot separated by Center

Distance

Days

5004003002001000

8

7

6

5

4

3

2

1

0

CentralEasternWestern

Center

Scatterplot of Days vs Distance

When we look at it by center, there still seems to be no relationship between distance and number of days.