Gio Sherwin (1) (1)

download Gio Sherwin (1) (1)

of 24

Transcript of Gio Sherwin (1) (1)

  • 7/30/2019 Gio Sherwin (1) (1)

    1/24

    CRP phase 4-

    Analyzing andInterpreting Quantitative

    Data

  • 7/30/2019 Gio Sherwin (1) (1)

    2/24

    Describing Quantitative Data

    Quantitative data -Data that can be quantified and verified, and is amenable

    to statistical manipulation.

    Quantitative data defines whereas qualitative datadescribes.

  • 7/30/2019 Gio Sherwin (1) (1)

    3/24

    I.MAKING SENSE OF NUMBERS:

    STATISTICAL DATA ANALYSIS

    Data analysis is the process of examining whatdata mean to researchers. Statistical data

    analysis refers to the process of examining what

    quantitative data mean to researchers. Statisticsrefers to any numerical indicator of a set of data.

  • 7/30/2019 Gio Sherwin (1) (1)

    4/24

    Descriptive Statistical Data Analysis:

    - method used to describe the population we

    are studying.Frequency distributions, measures of central tendency

    (mean, median, and mode), and graphs like pie charts and bar charts

    that describe the data are all examples of descriptive statistics.

    Inferential Data Analysis:

    - making predictions or inferences about a

    population from observations and analyses of a

    sample.

  • 7/30/2019 Gio Sherwin (1) (1)

    5/24

    2 PURPOSES

    Estimates the characteristics of a population

    from data gathered on a sample.

    Tests for significant DIFFERENCES between

    groups and significant RELATIONSHIPS

    between variables.

  • 7/30/2019 Gio Sherwin (1) (1)

    6/24

    II. DESCRIBING DATA

    THROUGH SUMMARYSTATISTICS

    Summary statistics provide an efficient way to

    describe an entire set of quantitative data.

  • 7/30/2019 Gio Sherwin (1) (1)

    7/24

    A. Measures of CENTRAL

    TENDENCY

    describe the central point of distribution(representative values)

    The measure of central tendency describes theone score that best represents the entiredistribution, the most CHARACTERISTIC

    SCORE. The most characteristic score is thatwhich describes the center point of adistribution of scores.

  • 7/30/2019 Gio Sherwin (1) (1)

    8/24

    B. Measures of DISPERSION

    Describe how scores differ; how scores are

    SPREAD (measures of VARIABILITY)

    Do not exist for nominal data because categories

    are used rather than meaningful numbers.

  • 7/30/2019 Gio Sherwin (1) (1)

    9/24

    RANGE (or span) - number that reports the distancebetween the highest and lowest scores in a distribution.

    VARIANCE - number that represents the mathematical

    index of the average distance of the scores on an intervalor ratio scale from the mean in squared units.

    STANDARD DEVIATION- number that represents

    a summary statistic of how much scores vary from themean - expressed in the original units of measurement.

  • 7/30/2019 Gio Sherwin (1) (1)

    10/24

    C. DESCRIBING DATA IN

    STANDARD SCORES Standard scores provide a common unit of measurementindicating how far away any particular score is from themean. Used to compare numbers(scores) from differentdistributions.

    Z score- most popular standard score used by researchers. The z score is calculated by dividing the deviation score by the

    standard deviation (X - X bar) divided by the standarddeviation.

    Each z score indicates how many standard deviations that score

    is from the mean of the distribution.

  • 7/30/2019 Gio Sherwin (1) (1)

    11/24

    B. FREQUENCY Distributions

    Counting and reportingHOW OFTEN

    different categories or points on a measurement

    scale occur.

    A list of the frequency of responses for each

    category or measurement point is called a

    frequency table.

  • 7/30/2019 Gio Sherwin (1) (1)

    12/24

    Show changes over time Frequency Tables - a total number of times particular values on

    a measurement scale occur in a data set.

    Pie Charts - illustrate the frequency counts of categories. Bar Charts - visually illustrate frequency counts for a nominal

    and ordinal variables.

    Line Graphs - use a single point to represent the frequency

    count on a dependent variable for each of the groups. Interval and Ratio level variable frequencies

    are illustrated using... Frequency Histograms - distribution of frequencies where

    blocks touch Frequency Polygons - similar to line graphs except a line

    connects the points representing the frequency count for eachpoint on the measurement scale rather than each category.

  • 7/30/2019 Gio Sherwin (1) (1)

    13/24

    CRP Phase 4- Inferring

    from Data: Estimationand Significance Testing

  • 7/30/2019 Gio Sherwin (1) (1)

    14/24

    INFERENTIAL STATISTICS :

    The set of statistical procedures that allows aresearcher to go beyond the group that has

    been measured, and make statements about the

    characteristics of a much larger group.

    Inferential statistics:accomplish ESTIMATION and

    SIGNIFICANCE TESTING.

  • 7/30/2019 Gio Sherwin (1) (1)

    15/24

    I I. ESTIMATION (Estimation Theory)

    A. ESTIMATION Estimates of Population PARAMETERS from Sample

    STATISTICS are possible as long as 2 ASSUMPTIONSmet: Normal Distribution and Random Sample.

    1. Assumes that the charactetristic of interest is measured on anINTERVAL or RATIO scale and that the characteristics aredistributed NORMALLY

    2. Assumes that the SAMPLE has been selected RANDOMLY so

    it reflects the population.

    Estimation procedures are often referred to as PARAMETRICSTATISTICS

  • 7/30/2019 Gio Sherwin (1) (1)

    16/24

    B.The Normal Distribution Normal Curve

    (symmetrical bell shaped curve)

    Distributions NOT in the shape of a normal curve can bedescribed using two characteristics:

    1. Skewness- the majority of the scores are toward one end of adistribution, which makes the curve tail off in that direction. a. Positive - tail runs to right side of the curve;

    b. Negative - tail runs to the left side of the curve.

    2. Kurtosis- refers to how pointed or flat the shape of a distribution is

    when it is not a normal curve. a. Peaked - scores are clusterd around the middle; LEPTOKURTIC

    distribution

    b. Flat - scores are dispersed evenly across the distribution; PLATYKURTICdistribution

  • 7/30/2019 Gio Sherwin (1) (1)

    17/24

    C. Use of Random Sampling - each

    population member has an equal chance of

    being selected for the sample. 1. Central Limits Theorem - for samples of a

    sufficiently large size, the real distribution

    of means is almost always approximately normal. 2. Large samples give more accurate results than

    do smaller samples.

    3. Random samples allow researchers to operatesafely under the normal curve.

    4. Sample must be representative of population.

    KISS

  • 7/30/2019 Gio Sherwin (1) (1)

    18/24

    D.Inferring from a Random Sample

    to a Population1. Sampling Distribution

    a. Sampling distribution meanb. Standard ERROR (SE) of the mean tells researchershow much these random sample means are likely to

    differ from the mean of the sampling distribution.

    2. Confidence Level - the degree of assurance that asample statistic can be used to infer a population

    parameter.

    3. Confidence Interval - the range of scores associatedwith this confidence level. Be cautious!

  • 7/30/2019 Gio Sherwin (1) (1)

    19/24

    II. SIGNIFICANCE TESTING

    The process of analyzing

    quantitative data for the puprose oftesting whether a nulll hypothesis is

    probably either correct or

    false. Significance testing =hypothesis testing.

  • 7/30/2019 Gio Sherwin (1) (1)

    20/24

    A. The Logic of Significance Testing

    1. Testing a Null Hypothesis -- every research hypothesis has a correspondingnull hypothesis.

    2. Rejecting a Null Hypothesis -- We do not "PROVE" a research hypothesis.

    3. Deciding on the Probability Level (p value)a. The 95% confidence level is referred to as the .05 significance level.

    b. A significant difference or relationship found at the .05 level ofprobability means that the researchers are 95% confident that the differenceor relationship is NOT DUE TO CHANCE OR ERROR.

    c. The result is only likely to happen BY CHANCE 5 out of 100 times.

  • 7/30/2019 Gio Sherwin (1) (1)

    21/24

    B. The Practice of Significance Testing

    1. Step One: Posing a Research Question

    or Hypothesis and a Null Hypothesis

    2. Step Two: Conducting the Study

    3. Step Three: Testing the NullHypothesis

  • 7/30/2019 Gio Sherwin (1) (1)

    22/24

    C. Type I Error and Type II Error

    1. Type I error

    a. ALPHA errorb. False Positivec. Researcher rejects the null and

    accepts a research hypothesisd. Much more serious

    2. Type II error

    a. BETA error

    b. False Negativec. Researcher accepts the null andrejects a research hypothesis

    d. Not as serious

  • 7/30/2019 Gio Sherwin (1) (1)

    23/24

    D. Statistical Power 1. POWER Analysis

    a. The ability to detect effects of a specific size, given theparticular variance and sample size of the studyb. Equal to 1 minus the probability of committing a Type II

    error (failing to reject a false null hypothesis)c. Minimum statistical power is 0 and maximum is 1.0d. Statistical Power should be .80 at a minimum

    2. Increasing Statistical Power

    a. Employ the most sensitive (and appropriate) statisticaltest

    b. Test one-tailed hypotheses and research questionsc. Increase the size of the random sample studied

    3. Effect Size

    a. An estimate of the degree to which a phenomenon ispresent in a population and/or

    b. The extent to which the null hypothesis is false

    c. Three Types of Effect Sizes1) Small n= 3932) Medium n= 643) Large n= 26

  • 7/30/2019 Gio Sherwin (1) (1)

    24/24

    III. IMPORTANT CONCEPTUAL

    DEFINITIONS

    Parameters - a characteristic of a population or a universe.

    Statistic - the measurement of a sample with respect to avariable.

    Nonparametric Statistics -statistics used only to describe thecharacteristics of a sample, without being able to generalize

    back to its population.

    Parametric Statistics - statistics used to estimate thecharacteristics (parameters) of a population based on the

    characteristics (statistics) of a sample.

    Data Analysis - the methods researchers use to infermeaning from data, to determine what conclusions arejustifed.