Gio Sherwin (1) (1)

7/30/2019 Gio Sherwin (1) (1)

1/24

CRP phase 4-

Analyzing andInterpreting Quantitative

Data

7/30/2019 Gio Sherwin (1) (1)

2/24

Describing Quantitative Data

Quantitative data -Data that can be quantified and verified, and is amenable

to statistical manipulation.

Quantitative data defines whereas qualitative datadescribes.

7/30/2019 Gio Sherwin (1) (1)

3/24

I.MAKING SENSE OF NUMBERS:

STATISTICAL DATA ANALYSIS

Data analysis is the process of examining whatdata mean to researchers. Statistical data

analysis refers to the process of examining what

quantitative data mean to researchers. Statisticsrefers to any numerical indicator of a set of data.

7/30/2019 Gio Sherwin (1) (1)

4/24

Descriptive Statistical Data Analysis:

- method used to describe the population we

are studying.Frequency distributions, measures of central tendency

(mean, median, and mode), and graphs like pie charts and bar charts

that describe the data are all examples of descriptive statistics.

Inferential Data Analysis:

- making predictions or inferences about a

population from observations and analyses of a

sample.

7/30/2019 Gio Sherwin (1) (1)

5/24

2 PURPOSES

Estimates the characteristics of a population

from data gathered on a sample.

Tests for significant DIFFERENCES between

groups and significant RELATIONSHIPS

between variables.

7/30/2019 Gio Sherwin (1) (1)

6/24

II. DESCRIBING DATA

THROUGH SUMMARYSTATISTICS

Summary statistics provide an efficient way to

describe an entire set of quantitative data.

7/30/2019 Gio Sherwin (1) (1)

7/24

A. Measures of CENTRAL

TENDENCY

describe the central point of distribution(representative values)

The measure of central tendency describes theone score that best represents the entiredistribution, the most CHARACTERISTIC

SCORE. The most characteristic score is thatwhich describes the center point of adistribution of scores.

7/30/2019 Gio Sherwin (1) (1)

8/24

B. Measures of DISPERSION

Describe how scores differ; how scores are

SPREAD (measures of VARIABILITY)

Do not exist for nominal data because categories

are used rather than meaningful numbers.

7/30/2019 Gio Sherwin (1) (1)

9/24

RANGE (or span) - number that reports the distancebetween the highest and lowest scores in a distribution.

VARIANCE - number that represents the mathematical

index of the average distance of the scores on an intervalor ratio scale from the mean in squared units.

STANDARD DEVIATION- number that represents

a summary statistic of how much scores vary from themean - expressed in the original units of measurement.

7/30/2019 Gio Sherwin (1) (1)

10/24

C. DESCRIBING DATA IN

STANDARD SCORES Standard scores provide a common unit of measurementindicating how far away any particular score is from themean. Used to compare numbers(scores) from differentdistributions.

Z score- most popular standard score used by researchers. The z score is calculated by dividing the deviation score by the

standard deviation (X - X bar) divided by the standarddeviation.

Each z score indicates how many standard deviations that score

is from the mean of the distribution.

7/30/2019 Gio Sherwin (1) (1)

11/24

B. FREQUENCY Distributions

Counting and reportingHOW OFTEN

different categories or points on a measurement

scale occur.

A list of the frequency of responses for each

category or measurement point is called a

frequency table.

7/30/2019 Gio Sherwin (1) (1)

12/24

Show changes over time Frequency Tables - a total number of times particular values on

a measurement scale occur in a data set.

Pie Charts - illustrate the frequency counts of categories. Bar Charts - visually illustrate frequency counts for a nominal

and ordinal variables.

Line Graphs - use a single point to represent the frequency

count on a dependent variable for each of the groups. Interval and Ratio level variable frequencies

are illustrated using... Frequency Histograms - distribution of frequencies where

blocks touch Frequency Polygons - similar to line graphs except a line

connects the points representing the frequency count for eachpoint on the measurement scale rather than each category.

7/30/2019 Gio Sherwin (1) (1)

13/24

CRP Phase 4- Inferring

from Data: Estimationand Significance Testing

7/30/2019 Gio Sherwin (1) (1)

14/24

INFERENTIAL STATISTICS :

The set of statistical procedures that allows aresearcher to go beyond the group that has

been measured, and make statements about the

characteristics of a much larger group.

Inferential statistics:accomplish ESTIMATION and

SIGNIFICANCE TESTING.

7/30/2019 Gio Sherwin (1) (1)

15/24

I I. ESTIMATION (Estimation Theory)

A. ESTIMATION Estimates of Population PARAMETERS from Sample

STATISTICS are possible as long as 2 ASSUMPTIONSmet: Normal Distribution and Random Sample.

1. Assumes that the charactetristic of interest is measured on anINTERVAL or RATIO scale and that the characteristics aredistributed NORMALLY

2. Assumes that the SAMPLE has been selected RANDOMLY so

it reflects the population.

Estimation procedures are often referred to as PARAMETRICSTATISTICS

7/30/2019 Gio Sherwin (1) (1)

16/24

B.The Normal Distribution Normal Curve

(symmetrical bell shaped curve)

Distributions NOT in the shape of a normal curve can bedescribed using two characteristics:

1. Skewness- the majority of the scores are toward one end of adistribution, which makes the curve tail off in that direction. a. Positive - tail runs to right side of the curve;

b. Negative - tail runs to the left side of the curve.

2. Kurtosis- refers to how pointed or flat the shape of a distribution is

when it is not a normal curve. a. Peaked - scores are clusterd around the middle; LEPTOKURTIC

distribution

b. Flat - scores are dispersed evenly across the distribution; PLATYKURTICdistribution

7/30/2019 Gio Sherwin (1) (1)

17/24

C. Use of Random Sampling - each

population member has an equal chance of

being selected for the sample. 1. Central Limits Theorem - for samples of a

sufficiently large size, the real distribution

of means is almost always approximately normal. 2. Large samples give more accurate results than

do smaller samples.

3. Random samples allow researchers to operatesafely under the normal curve.

4. Sample must be representative of population.

KISS

7/30/2019 Gio Sherwin (1) (1)

18/24

D.Inferring from a Random Sample

to a Population1. Sampling Distribution

a. Sampling distribution meanb. Standard ERROR (SE) of the mean tells researchershow much these random sample means are likely to

differ from the mean of the sampling distribution.

2. Confidence Level - the degree of assurance that asample statistic can be used to infer a population

parameter.

3. Confidence Interval - the range of scores associatedwith this confidence level. Be cautious!

7/30/2019 Gio Sherwin (1) (1)

19/24

II. SIGNIFICANCE TESTING

The process of analyzing

quantitative data for the puprose oftesting whether a nulll hypothesis is

probably either correct or

false. Significance testing =hypothesis testing.

7/30/2019 Gio Sherwin (1) (1)

20/24

A. The Logic of Significance Testing

1. Testing a Null Hypothesis -- every research hypothesis has a correspondingnull hypothesis.

2. Rejecting a Null Hypothesis -- We do not "PROVE" a research hypothesis.

3. Deciding on the Probability Level (p value)a. The 95% confidence level is referred to as the .05 significance level.

b. A significant difference or relationship found at the .05 level ofprobability means that the researchers are 95% confident that the differenceor relationship is NOT DUE TO CHANCE OR ERROR.

c. The result is only likely to happen BY CHANCE 5 out of 100 times.

7/30/2019 Gio Sherwin (1) (1)

21/24

B. The Practice of Significance Testing

1. Step One: Posing a Research Question

or Hypothesis and a Null Hypothesis

2. Step Two: Conducting the Study

3. Step Three: Testing the NullHypothesis

7/30/2019 Gio Sherwin (1) (1)

22/24

C. Type I Error and Type II Error

1. Type I error

a. ALPHA errorb. False Positivec. Researcher rejects the null and

accepts a research hypothesisd. Much more serious

2. Type II error

a. BETA error

b. False Negativec. Researcher accepts the null andrejects a research hypothesis

d. Not as serious

7/30/2019 Gio Sherwin (1) (1)

23/24

D. Statistical Power 1. POWER Analysis

a. The ability to detect effects of a specific size, given theparticular variance and sample size of the studyb. Equal to 1 minus the probability of committing a Type II

error (failing to reject a false null hypothesis)c. Minimum statistical power is 0 and maximum is 1.0d. Statistical Power should be .80 at a minimum

2. Increasing Statistical Power

a. Employ the most sensitive (and appropriate) statisticaltest

b. Test one-tailed hypotheses and research questionsc. Increase the size of the random sample studied

3. Effect Size

a. An estimate of the degree to which a phenomenon ispresent in a population and/or

b. The extent to which the null hypothesis is false

c. Three Types of Effect Sizes1) Small n= 3932) Medium n= 643) Large n= 26

7/30/2019 Gio Sherwin (1) (1)

24/24

III. IMPORTANT CONCEPTUAL

DEFINITIONS

Parameters - a characteristic of a population or a universe.

Statistic - the measurement of a sample with respect to avariable.

Nonparametric Statistics -statistics used only to describe thecharacteristics of a sample, without being able to generalize

back to its population.

Parametric Statistics - statistics used to estimate thecharacteristics (parameters) of a population based on the

characteristics (statistics) of a sample.

Data Analysis - the methods researchers use to infermeaning from data, to determine what conclusions arejustifed.

Gio Sherwin (1) (1)

Documents

Transcript of Gio Sherwin (1) (1)