Quantitative Skills: Data Analysis
![Page 2: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/2.jpg)
Data analysis is one of the first steps toward determining whether an
observed pattern has validity. Data analysis also helps distinguish among
multiple working hypotheses.
![Page 3: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/3.jpg)
Descriptive statistics serves to summarize the data. It helps show the variation in the data, standard errors, best-fit functions, and confidence that
sufficient data have been collected.
![Page 4: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/4.jpg)
Inferential statistics involves inferring parameters in the natural population
from a sample.
![Page 5: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/5.jpg)
Most of the data you will collect will fit into two categories: measurements or
counts.
Figure labels: Measurement data; Count data
![Page 6: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/6.jpg)
Most measurements are continuous, meaning there is an infinite number of potential measurements over a given
range.
![Page 7: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/7.jpg)
Count data are tallies of discrete items, or of individuals that fall into a qualitative category.
Examples: number of leaf stomata; number of white-eyed individuals.
![Page 8: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/8.jpg)
How much is good enough?
• How much data should a researcher collect to make a claim with confidence? How big should the size of the sample be?
• Is it possible the results were due to chance instead of the manipulation of the variable being tested?
![Page 9: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/9.jpg)
Conducting Data Analysis
![Page 10: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/10.jpg)
When an investigation involves measurement data, one of the first steps is to construct a histogram, or frequency diagram, to represent the data’s distribution.
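As a rough illustration of what a frequency diagram records, the sketch below bins a small set of measurements and prints a text histogram. The function name, the bin width, and the leaf widths are all made up for illustration; they are not from the slides.

```python
from collections import Counter

def frequency_table(values, bin_width, start):
    """Count how many values fall into each bin [low, low + bin_width)."""
    counts = Counter(int((v - start) // bin_width) for v in values)
    table = []
    for k in range(min(counts), max(counts) + 1):
        low = start + k * bin_width
        table.append((low, low + bin_width, counts.get(k, 0)))
    return table

# Hypothetical leaf widths in centimeters (invented values).
widths = [4.1, 4.8, 5.2, 5.5, 5.9, 6.0, 6.3, 6.4, 7.1, 8.2]

for low, high, count in frequency_table(widths, bin_width=1.0, start=4.0):
    # One '#' per observation in the bin.
    print(f"{low:.1f}-{high:.1f} | {'#' * count}")
```

A roughly symmetric, single-peaked shape in such a table is what the following slides call an approximate normal distribution.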
![Page 11: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/11.jpg)
If the data show an approximate normal distribution on a histogram,
then they are parametric data.
![Page 12: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/12.jpg)
If the data do not show an approximate normal distribution on a histogram, then they are nonparametric data. Different
descriptive statistics and tests need to be applied to those data.
![Page 13: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/13.jpg)
Sometimes, because of sampling error or sampling bias, data might not fit a normal distribution even when the actual population could be normally distributed. In this case, a larger sample size might be needed.
![Page 14: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/14.jpg)
For parametric data (a normal distribution), the appropriate descriptive statistics include:
• the mean (average)
• sample size
• variance
• standard deviation
• standard error
![Page 15: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/15.jpg)
The mean (x̄) of the sample is the average of its values. The mean summarizes the entire sample and might provide an estimate of the entire population’s true mean.
![Page 16: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/16.jpg)
The sample size (n) refers to how many
members of the population are
included in the study. Sample size is
important when estimating how well
the sample set represents the entire
population.
![Page 17: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/17.jpg)
Variance (s²) and standard deviation (s) measure how far a data set is spread out. A variance of zero indicates that all the values in a data set are identical.
Figure labels: variance; distance from the mean
![Page 18: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/18.jpg)
Because the differences from the mean are squared to calculate variance, the units of variance are not the same units as in the
original data set. The standard deviation is the square root of the variance. The
standard deviation is expressed in the same units as the original data set, which makes it generally more useful than the variance.
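The relationship between mean, variance, and standard deviation can be sketched in a few lines. This assumes the usual sample formula, which divides the summed squared deviations by n − 1; the slides do not spell out the divisor. The data values are invented.

```python
import math
import statistics

def describe(sample):
    """Return (n, mean, sample variance, sample standard deviation)."""
    n = len(sample)
    mean = sum(sample) / n
    # Squared distances from the mean, summed and divided by n - 1
    # (assumed sample-variance convention).
    variance = sum((x - mean) ** 2 for x in sample) / (n - 1)
    # Taking the square root returns to the units of the original data.
    sd = math.sqrt(variance)
    return n, mean, variance, sd

sample = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]  # hypothetical measurements
n, mean, variance, sd = describe(sample)
print(n, mean, round(variance, 3), round(sd, 3))

# Cross-check against the standard library.
print(statistics.variance(sample), statistics.stdev(sample))
```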
![Page 19: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/19.jpg)
A small standard deviation indicates that the data tend to be very close to the mean. A large standard deviation indicates that the
data are very spread out away from the mean.
![Page 20: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/20.jpg)
For normally distributed data, a little more than two-thirds of the data points fall within ±1 standard deviation of the sample mean, and more than 95% fall within ±2 standard deviations of the sample mean.
![Page 21: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/21.jpg)
![Page 22: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/22.jpg)
68–95–99.7 Rule
In a normal distribution, 68.27% of all values lie within one standard deviation of the mean. 95.45% of the values lie
within two standard deviations of the mean. 99.73% of the values lie within three standard deviations of the mean.
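These percentages follow from the normal curve itself: the fraction of values within k standard deviations of the mean equals erf(k/√2). A short check using only the standard library (the function name is just for illustration):

```python
import math

def fraction_within(k):
    """Fraction of a normal distribution within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))

for k in (1, 2, 3):
    print(f"within {k} SD: {fraction_within(k) * 100:.2f}%")
```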
![Page 23: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/23.jpg)
Sample standard error (SE) is a statistic used to make an inference about how well the sample mean matches up to
the true population mean.
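The slides do not give the formula, but the standard error of the mean is conventionally the sample standard deviation divided by the square root of the sample size. A minimal sketch under that assumption, with made-up numbers:

```python
import math

def standard_error(sd, n):
    """Standard error of the mean: sample standard deviation / sqrt(sample size)."""
    return sd / math.sqrt(n)

# With the same spread, quadrupling the sample size halves the standard error.
print(standard_error(2.0, 16))
print(standard_error(2.0, 64))
```

This is why a larger sample gives a tighter estimate of the population mean even when the spread of individual values stays the same.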
![Page 24: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/24.jpg)
Standard error should be represented by including error bars on graphs when
appropriate. Error bars are used on graphs to indicate the uncertainty of a reported
measurement.
![Page 25: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/25.jpg)
Different statistical tools are used for data that do not resemble a normal distribution (nonparametric data, or data that are skewed or include large outliers):
• median
• mode
• quartiles
• box-and-whisker plots
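A box-and-whisker plot is typically drawn from five numbers: the minimum, the first quartile, the median, the third quartile, and the maximum. The sketch below computes them with Python's `statistics.quantiles`; note that several quartile conventions exist and this uses the library's default method, which may differ slightly from a hand calculation. The data are invented.

```python
import statistics

def five_number_summary(values):
    """Return (minimum, Q1, median, Q3, maximum) for a list of numbers."""
    q1, q2, q3 = statistics.quantiles(values, n=4)
    return min(values), q1, q2, q3, max(values)

data = [7, 1, 3, 5, 2, 6, 4]  # hypothetical observations
print(five_number_summary(data))
```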
![Page 26: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/26.jpg)
The median is the value separating the higher half of a data sample from the
lower half. To find the median of a data set, first arrange the data in order from lowest to highest value and then select
the value in the middle.
5, 1, 3, 7, 2 → 1, 2, 3, 5, 7 (median = 3)
![Page 27: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/27.jpg)
If there are two values in the middle of an ordered data set, the median is
found by averaging those two values.
5, 1, 3, 7, 4, 2 → 1, 2, 3, 4, 5, 7 (median = (3 + 4) / 2 = 3.5)
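Both median cases, odd and even numbers of values, can be written out directly. This is a plain sketch of the procedure described above, run on the two example data sets from the slides:

```python
def median(values):
    """Sort the values, then take the middle one (or average the middle two)."""
    ordered = sorted(values)
    mid = len(ordered) // 2
    if len(ordered) % 2 == 1:
        return ordered[mid]
    return (ordered[mid - 1] + ordered[mid]) / 2

print(median([5, 1, 3, 7, 2]))     # 3
print(median([5, 1, 3, 7, 4, 2]))  # 3.5
```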
![Page 28: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/28.jpg)
The mode is the value that appears most frequently in a data set.
3, 5, 1, 3, 7, 2
3 is the mode in this example because it appears more frequently than any other number.
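A data set can have more than one mode when several values tie for most frequent. Python's `statistics.multimode` returns all of them; the second data set below is invented to show a tie.

```python
import statistics

def modes(values):
    """Return every value that appears most often, in order of first appearance."""
    return statistics.multimode(values)

print(modes([3, 5, 1, 3, 7, 2]))  # the slide's example
print(modes([1, 1, 2, 3, 3]))     # hypothetical data with two modes
```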
![Page 29: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/29.jpg)
A bimodal distribution
![Page 30: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/30.jpg)
Data Analysis Flowchart:
Type of Data
• Measurement Data (Continuous)
  · Make histogram
    · Parametric (normal distribution): mean, standard deviation, standard error
    · Nonparametric (not a normal distribution): median, mode, quartiles
• Count Data (Discrete)
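The flowchart's decisions can be read as a small lookup. The function below is only a sketch of that reading; its name and argument values are hypothetical, and it returns an empty list for count data because the flowchart shows no further branch there.

```python
def recommended_statistics(data_type, looks_normal=False):
    """Map the flowchart's branches to the descriptive statistics it lists."""
    if data_type == "measurement":
        # The histogram decides which branch applies.
        if looks_normal:
            return ["mean", "standard deviation", "standard error"]
        return ["median", "mode", "quartiles"]
    if data_type == "count":
        return []  # the flowchart gives no statistics for this branch
    raise ValueError(f"unknown data type: {data_type!r}")

print(recommended_statistics("measurement", looks_normal=True))
print(recommended_statistics("measurement", looks_normal=False))
```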
![Page 31: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/31.jpg)
Example of Data Analysis: Do shady English ivy leaves have a larger surface area than sunny English ivy leaves?
![Page 32: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/32.jpg)
Since the data collected is in centimeters, it is measurement data, not count data.
So the first step is to make a:
HISTOGRAM
![Page 33: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/33.jpg)
Does the data resemble a normal curve?
(Close enough, with possible differences due to sampling error)
![Page 34: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/34.jpg)
Next, the appropriate statistical tools are applied:
![Page 35: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/35.jpg)
A bar graph can then be produced to compare the means:
![Page 36: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/36.jpg)
Do the error bars for the shady leaf mean overlap with the error bars for
the sunny leaf mean?
(No.)
![Page 37: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/37.jpg)
A more rigorous statistical test will still need to be performed, but because the error bars do not overlap, it is likely that the two populations are indeed different from each other.
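The overlap check is simple interval arithmetic. The sketch below assumes the error bars span the mean plus or minus `k` standard errors (the slides do not say how many), and the means and standard errors used here are invented, not the ivy leaf results.

```python
def error_bars_overlap(mean_a, se_a, mean_b, se_b, k=1):
    """True if the intervals mean ± k*SE for the two groups share any values."""
    low_a, high_a = mean_a - k * se_a, mean_a + k * se_a
    low_b, high_b = mean_b - k * se_b, mean_b + k * se_b
    return low_a <= high_b and low_b <= high_a

# Hypothetical group means and standard errors.
print(error_bars_overlap(10.0, 0.5, 12.0, 0.5))  # bars are separate
print(error_bars_overlap(10.0, 1.5, 12.0, 1.0))  # bars overlap
```

This is only a quick visual screen; it does not replace the more rigorous test the slide mentions.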
![Page 38: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/38.jpg)
Example of Data Analysis: Is 98.6°F actually the average body temperature for humans?
![Page 39: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/39.jpg)
Since the data collected is in degrees Fahrenheit, it is measurement data, not count data. So the first step is to make a:
HISTOGRAM
![Page 40: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/40.jpg)
Does the data resemble a normal curve?
(Close Enough)
![Page 41: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/41.jpg)
Next, the appropriate statistical tools are applied:
*Note that, by convention, descriptive statistics are rounded to one more decimal place than the original data points have.
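One reading of this rounding convention is that a calculated statistic is reported with one more decimal place than the raw data were recorded with. A minimal sketch under that reading; the function name and the unrounded value are hypothetical.

```python
def round_for_report(value, data_decimals):
    """Round a calculated statistic to one more decimal place than the raw data."""
    return round(value, data_decimals + 1)

# Temperatures recorded to 1 decimal place (such as 98.6) would give
# statistics reported to 2 decimal places.
print(round_for_report(98.24923, data_decimals=1))  # 98.25
```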
![Page 42: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/42.jpg)
According to the 68–95–99.7 Rule, about 68% of all values in a normal distribution lie within one standard deviation of the mean. This means that around 68% of the temperatures should be between 97.51°F and 98.99°F.
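The range quoted here is the sample mean plus or minus one standard deviation. The slide text gives the mean as 98.25°F; the standard deviation of 0.74°F used below is inferred from the width of the quoted range, not stated in the text.

```python
def one_sd_range(mean, sd):
    """Range covering roughly 68% of values in a normal distribution."""
    return mean - sd, mean + sd

# 0.74 is an inferred value: half the width of the 97.51 to 98.99 range.
low, high = one_sd_range(98.25, 0.74)
print(f"{low:.2f} to {high:.2f}")  # 97.51 to 98.99
```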
![Page 43: Quantitative Skills: Data Analysis. Data analysis is one of the first steps toward determining whether an observed pattern has validity. Data analysis.](https://reader035.fdocuments.in/reader035/viewer/2022062718/56649e665503460f94b6105b/html5/thumbnails/43.jpg)
Including the standard error, we can say with about 68% confidence that the true mean human body temperature lies within 98.25 ± 0.06°F, that is, the sample mean plus or minus one standard error.
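The same arithmetic can be extended to two standard errors, which corresponds to roughly 95% confidence under a normal model. The sketch below computes both intervals from the reported mean and standard error and checks whether 98.6°F falls inside them; it is a rough check, not a formal hypothesis test.

```python
def se_interval(mean, se, k=1):
    """Interval of mean ± k standard errors."""
    return mean - k * se, mean + k * se

for k in (1, 2):
    low, high = se_interval(98.25, 0.06, k)
    inside = low <= 98.6 <= high
    print(f"±{k} SE: {low:.2f} to {high:.2f}; contains 98.6? {inside}")
```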