Probability and statistics(assign 7 and 8)

172
PROBABILITY AND STATISTICS BY ENGR. JORGE P. BAUTISTA

description

kelan nyo isubmit yung assignment no. 7 and 8 nyo nasa slides yun ng stats. isubmit nyo sa akin sa lunes during electromagnetism kasi kukulangin yung class participation nyo sa stats.

Transcript of Probability and statistics(assign 7 and 8)

Page 1: Probability and statistics(assign 7 and 8)

PROBABILITY AND STATISTICS

BYENGR. JORGE P. BAUTISTA

Page 2: Probability and statistics(assign 7 and 8)

COURSE OUTLINE

I. Introduction to StatisticsII. Tabular and Graphical representation of

DataIII. Measures of Central Tendencies, Locations

and VariationsIV. Measure of Dispersion and CorrelationV. Probability and CombinatoricsVI. Discrete and Continuous DistributionsVII.Hypothesis Testing

Page 3: Probability and statistics(assign 7 and 8)

Text and References

Statistics: a simplified approach by Punsalan and Uriarte, 1998, Rex Texbook

Probability and Statistics by Johnson, 2008, Wiley

Counterexamples in Probability and Statistics by Romano and Siegel, 1986, Chapman and Hall

Page 4: Probability and statistics(assign 7 and 8)

Introduction to Statistics

Definition1.In its plural sense, statistics is a set of

numerical data e.g. Vital statistics, monthly sales, exchange rates, etc.

2.In its singular sense, statistics is a branch of science that deals with the collection, presentation, analysis and interpretation of data.

Page 5: Probability and statistics(assign 7 and 8)

General uses of Statistics

a. Aids in decision making by providing comparison of data, explains action that has taken place, justify a claim or assertion, predicts future outcome and estimates un known quantities

b. Summarizes data for public use

Page 6: Probability and statistics(assign 7 and 8)

Examples on the role of Statistics- In Biological and medical sciences, it helps researchers

discover relationship worthy of further attention.Ex. A doctor can use statistics to determine to what

extent is an increase in blood pressure dependent upon age

- In social sciences, it guides researchers and helps them support theories and models that cannot stand on rationale alone.

Ex. Empirical studies are using statistics to obtain socio-economic profile of the middle class to form new socio-political theories.

Page 7: Probability and statistics(assign 7 and 8)

Con’t- In business, a company can use statistics to

forecast sales, design products, and produce goods more efficiently.

Ex. A pharmaceutical company can apply statistical procedures to find out if the new formula is indeed more effective than the one being used.

- In Engineering, it can be used to test properties of various materials,

- Ex. A quality controller can use statistics to estimate the average lifetime of the products produced by their current equipment.

Page 8: Probability and statistics(assign 7 and 8)

Fields of Statistics

a. Statistical Methods of Applied Statistics:1. Descriptive-comprise those methods concerned

with the collection, description, and analysis of a set of data without drawing conclusions or inferences about a larger set.

2. Inferential-comprise those methods concerned with making predictions or inferences about a larger set of data using only the information gathered from a subset of this larger set.

Page 9: Probability and statistics(assign 7 and 8)

con’t

b. Statistical theory of mathematical statistics- deals with the development and exposition of theories that serve as a basis of statistical methods

Page 10: Probability and statistics(assign 7 and 8)

Descriptive VS Inferential

DESCRIPTIVE• A bowler wants to find his

bowling average for the past 12 months

• A housewife wants to determine the average weekly amount she spent on groceries in the past 3 months

• A politician wants to know the exact number of votes he receives in the last election

INFERENTIALA bowler wants to estimate his

chance of winning a game based on his current season averages and the average of his opponents.

A housewife would like to predict based on last year’s grocery bills, the average weekly amount she will spend on groceries for this year.

A politician would like to estimate based on opinion polls, his chance for winning in the upcoming election.

Page 11: Probability and statistics(assign 7 and 8)

Population as Differrentiated from Sample

The word population refers to groups or aggregates of people, animals, objects, materials, happenings or things of any form, this means that there are populations of students, teachers, supervisors, principals, laboratory animals, trees, manufactured articles, birds and many others. If your interest is on few members of the population to represent their characteristics or traits, these members constitute a sample. The measures of the population are called parameters, while those of the sample are called estimates or statistics.

Page 12: Probability and statistics(assign 7 and 8)

The Variable

It refers to a characteristic or property whereby the members of the group or set vary or differ from one another. However, a constant refers to a property whereby the members of the group do not differ one another.

Variables can be according to functional relationship which is classified as independent and dependent. If you treat variable y as a function of variable z, then z is your independent variable and y is your dependent variable. This means that the value of y, say academic achievement depends on the value of z.

Page 13: Probability and statistics(assign 7 and 8)

Con’t

Variables according to continuity of values.1. Continuous variable – these are variables

whose levels can take continuous values. Examples are height, weight, length and width.

2. Discrete variables – these are variables whose values or levels can not take the form of a decimal. An example is the size of a particular family.

Page 14: Probability and statistics(assign 7 and 8)

Con’t

Variables according to scale of measurements:1. Nominal – this refers to a property of the

members of a group defined by an operation which allows making of statements only of equality or difference. For example, individuals can be classified according to thier sex or skin color. Color is an example of nominal variable.

Page 15: Probability and statistics(assign 7 and 8)

Con’t2. Ordinal – it is defined by an operation whereby

members of a particular group are ranked. In this operation, we can state that one member is greater or less that the others in a criterion rather than saying that he/it is only equal or different from the others such as what is meant by the nominal variable.

3. Interval – this refers to a property defined by an operation which permits making statement of equality of intervals rather than just statement of sameness of difference and greater than or less than. An interval variable does not have a “true” zero point.; althought for convenience, a zero point may be assigned.

Page 16: Probability and statistics(assign 7 and 8)

Con’t

4. Ratio – is defined by the operation which permits making statements of equality of ratios in addition to statements of sameness or difference, greater than or less than and equality or inequality of differences. This means that one level or value may be thought of or said as double, triple or five times another and so on.

Page 17: Probability and statistics(assign 7 and 8)

Assignment no. 1

I. Make a list of at least 5 mathematician or scientist that contributes in the field of statistics. State their contributions

II. With your knowledge of statistics, give a real life situation how statistics is applied. Expand your answer.

III. When can a variable be considered independent and dependent? Give an example for your answer.

Page 18: Probability and statistics(assign 7 and 8)

Con’t

IV. Enumerate some uses of statistics. Do you think that any science will develop without test of the hypothesis? Why?

Page 19: Probability and statistics(assign 7 and 8)

Examples of Scales of Measurement

1.Nominal LevelEx. Sex: M-Male F-Female Marital Status: 1-single 2- married 3-

widowed 4- separated2. Ordinal LevelEx. Teaching Ratings: 1-poor 2-fair 3- good 4-

excellent

Page 20: Probability and statistics(assign 7 and 8)

Con’t3. Interval LevelEx. IQ, temperature4. Ratio LevelEx. Age, no. of correct answers in exam

Page 21: Probability and statistics(assign 7 and 8)

Data Collection Methods

1. Survey Method – questions are asked to obtain information, either through self administered questionnaire or personal interview.

2. Observation Method – makes possible the recording of behavior but only at the time of occurrence (ex. Traffic count, reactions to a particular stimulus)

Page 22: Probability and statistics(assign 7 and 8)

Con’t3. Experimental method – a method designed for

collecting data under controlled conditions. An experiment is an operation where there is actual human interference with the conditions that can affect the variable under study.

4. Use of existing studies – that is census, health statistics, weather reports.

5. Registration method – that is car registration, student registration, hospital admission and ticket sales.

Page 23: Probability and statistics(assign 7 and 8)

Tabular Representation

Frequency Distribution is defined as the arrangement of the gathered data by categories plus their corresponding frequencies and class marks or midpoint. It has a class frequency containing the number of observations belonging to a class interval. Its class interval contain a grouping defined by the limits called the lower and the upper limit. Between these limits are called class boundaries.

Page 24: Probability and statistics(assign 7 and 8)

Frequency of a Nominal DataMale and Female College students

Major in Chemistry

SEX FREQUENCY

MALE 23

FEMALE 107

TOTAL 130

Page 25: Probability and statistics(assign 7 and 8)

Frequency of Ordinal DataEx. Frequency distribution of Employee Perception on

the Behavior of their Administrators

Perception Frequency

Strongly favorable 10

favorable 11

Slightly favorable 12

Slightly unfavorable 14

Unfavorable 22

Strongly unfavorable 31

total 100

Page 26: Probability and statistics(assign 7 and 8)

Frequency Distribution Table

Definition:1. Raw data – is the set of data in its original

form2. Array – an arrangement of observations

according to their magnitude, wither in increasing or decreasing order.

Advantages: easier to detect the smallest and largest value and easy to find the measures of position

Page 27: Probability and statistics(assign 7 and 8)

Grouped Frequency of Interval Data

Given the following raw scores in Algebra Examination,

47 56 42 28 56 41 56 55 5978 50 55 57 38 62 52 66 6579 33 34 37 47 42 68 62 5480 68 48 56 39 77 80 62 7157 52 60 70

Page 28: Probability and statistics(assign 7 and 8)

Con’t1. Compute the range: R = H – L and the number of

classes by K = 1 + 3.322log n where n = number of observations.

2. Divide the range by 10 to 15 to determine the acceptable size of the interval. Hint: most frequency distribution have odd numbers as the size of the interval. The advantage is that the midpoints of the intervals will be whole number.

3. Organize the class interval. See to it that the lowest interval begins with a number that is multiple of the interval size.

Page 29: Probability and statistics(assign 7 and 8)

Con’t4. Tally each score to the category of class interval it

belongs to.5. Count the tally columns and summarizes it under

column (f). Then add the frequency which is the total number of the cases (N).

6. Determine the class boundaries. UCB and LCB.(upper and lower class boundary)

7. Compute the midpoint for each class interval and put it in the column (M).

M = (LS + HS) / 2

Page 30: Probability and statistics(assign 7 and 8)

Con’t8. Compute the cumulative distribution for less

than and greater than and put them in column cf< and cf>. (you can now interpret the data). cf = cumulative frequency

9. Compute the relative frequency distribution. This can be obtained by

RF% = CF/TF x 100% CF = CLASS FREQUENCY TF = TOTAL FREQUENCY

Page 31: Probability and statistics(assign 7 and 8)

Graphical RepresentationThe data can be graphically

presented according to their scale or level of measurements.

1. Pie chart or circle graph. The pie chart at the right is the enrollment from elementary to master’s degree of a certain university. The total population is 4350 students

Page 32: Probability and statistics(assign 7 and 8)

Con’t2. Histogram or bar graph- this graphical

representation can be used in nominal, ordinal or interval. For nominal bar graph, the bars are far apart rather than connected since the categories are not continuous. For ordinal and interval data, the bars should be joined to emphasize the degree of differences

Page 33: Probability and statistics(assign 7 and 8)

Given the bar graph of how students rate their library.

A-strongly favorable, 90B-favorable, 48C-slightly favorable, 88D-slightly unfavorable, 48E-unfavorable, 15F-strongly unfavorable, 25

Page 34: Probability and statistics(assign 7 and 8)

The Histogram of Person’s Age with Frequency of Travel

age freq RF

19-20 20 39.2%

21-22 21 41.2%

23-24 4 7.8%

25-26 4 7.8%

27-28 2 3.9%

total 51 100%

Page 35: Probability and statistics(assign 7 and 8)

ExercisesFrom the previous grouped data on algebra scores,a. Draw its histogram using the frequency in the y axis

and midpoints in the x axis.b. Draw the line graph or frequency polygon using

frequency in the y axis and midpoints in the x axis.c. Draw the less than and greater than ogives of the

data. Ogives is a cumulation of frequencies by class intervals. Let the y axis be the CF> and x axis be LCB while y axis be CF< and x axis be UCB

Page 36: Probability and statistics(assign 7 and 8)

Con’td. Plot the relative frequency using the y axis as

the relative frequency in percent value while in the x axis the midpoints.

Page 37: Probability and statistics(assign 7 and 8)

Con’t

25 30 35 40 45 50 55 60 65 70 75 80 85 90

9

8

7

6

5

4

3

2

1

0

f

midpoint29.5 - UCB27- midpoint24.5 - LCB

midpoint

HISTOGRAMLINE GRAPH

Page 38: Probability and statistics(assign 7 and 8)

Con’t

29.5 34.5 39.5 44.5 49.5 54.5 59.5 64.5 69.5 74.5 79.5 84.5

cf less than

40

35

30

25

20

15

10

5

0

UCB

Page 39: Probability and statistics(assign 7 and 8)

Con’t

40

35

30

25

20

15

10

5

024.5 29.5 34.5 39.5 44.5 49.5 54.5 59.5 64.5 69.5 74.5 79.5

cf greater than

LCB

Page 40: Probability and statistics(assign 7 and 8)

Assignment No. 2Given the score in a statistics examinations,33 38 56 35 70 44 81 44 8047 45 72 45 50 51 51 52 6654 54 53 56 84 58 56 57 7055 56 39 56 59 72 63 89 6360 69 65 61 62 64 64 69 6065 53 66 66 67 67 68 68 6966 66 67 70 59 40 71 73 6073 73 73 73 73 73 74 73 7374 79 74 74 70 73 46 74 7475 74 75 75 76 55 77 78 7379 48 81 44 84 77 88 63 8573

Page 41: Probability and statistics(assign 7 and 8)

Con’t1. Construct the class interval, frequency table,

class midpoint(use a whole number midpoint), less than and greater than cumulative frequency, upper and lower boundary and relative frequency.

2. Plot the histogram, frequency polygon, and ogives

Page 42: Probability and statistics(assign 7 and 8)

Con’t3. Draw the pie chart and bar graph of the plans

of computer science students with respect to attending a seminar. Compute for the Relative frequency of each.

A-will not attend=45B-probably will not attend=30C-probably will attend=40D-will attend=25

Page 43: Probability and statistics(assign 7 and 8)

Measures of Centrality and Location

Mean for Ungrouped DataX’ = ΣX / N where X’ = the mean ΣX = the sum of all scores/data N = the total number of casesMean for Grouped DataX’ = ΣfM / N where X’ = the mean M = the midpoint fM = the product of the frequency and each

midpoint N = total number of cases

Page 44: Probability and statistics(assign 7 and 8)

Con’tEx. 1. Find the mean of 10, 20, 25,30, 30, 35, 40 and 50.2. Given the grades of 50 students in a statistics classClass interval f 10-14 4 15-19 3 20-24 12 25-29 10 30-34 6 35-39 6 40-44 6 45-49 3

Page 45: Probability and statistics(assign 7 and 8)

Con’tThe weighted mean. The weighted arithmetic

mean of given groups of data is the average of the means of all groups

WX’ = ΣXw / N where WX’ = the weighted mean w = the weight of X ΣXw = the sum of the weight of X’s N = Σw = the sum of the weight of

X

Page 46: Probability and statistics(assign 7 and 8)

Con’tEx.Find the weighted mean of four groups of

means below:Group, i 1 2 3 4Xi 60 50 70 75

Wi 10 20 40 50

Page 47: Probability and statistics(assign 7 and 8)

Con’tMedian for Ungrouped DataThe median of ungrouped data is the

centermost scores in a distribution. Mdn = (XN/2 + X (N + 2)/2) / 2 if N is even

Mdn = X (1+N)/2 if N is oddEx. Find the median of the following sets of

score:Score A: 12, 15, 19, 21, 6, 4, 2Score B: 18, 22, 31, 12, 3, 9, 11, 8

Page 48: Probability and statistics(assign 7 and 8)

Con’tMedian for Grouped DataProcedure:1. Compute the cumulative frequency less than.2. Find N/23. Locate the class interval in which the middle class falls, and

determine the exact limit of this interval.4. Apply the formula Mdn = L + [(N/2 – F)i]/fm where L = exact lower limit interval containing

the median class F = The sum of all frequencies preceeding L. fm = Frequency of interval containing the median

class i = class interval N = total number of cases

Page 49: Probability and statistics(assign 7 and 8)

Con’tEx. Find the median of the given frequency table.class interval f cf<25-29 3 330-34 5 835-39 10 1840-44 15 3345-49 15 4850-54 15 6355-59 21 8460-64 8 9265-69 6 9870-74 2 100

Page 50: Probability and statistics(assign 7 and 8)

Con’tMode of Ungrouped DataIt is defined as the data value or specific score

which has the highest frequency.Find the mode of the following data.Data A : 10, 11, 13, 15, 17, 20Data B: 2, 3, 4, 4, 5, 7, 8, 10Data C: 3.5, 4.8, 5.5, 6.2, 6.2, 6.2, 7.3, 7.3, 7.3,

8.8

Page 51: Probability and statistics(assign 7 and 8)

Mode of Grouped DataFor grouped data, the mode is defined as the midpoint

of the interval containing the largest number of cases.

Mdo = L + [d1/(d1 + d2)]i where L = exact lower limit interval

containing the modal class. d1 = the difference of the modal class and the

frequency of the interval preceding the modal class d2 = the difference of the modal class and the

frequency of the interval after the modal class.

Page 52: Probability and statistics(assign 7 and 8)

Ex. Find the mode of the given frequency table.class interval f cf<25-29 3 330-34 5 835-39 10 1840-44 15 3345-49 15 4850-54 15 6355-59 21 8460-64 8 9265-69 6 9870-74 2 100

Page 53: Probability and statistics(assign 7 and 8)

Exercises 1. Determine the mean, median and mode of

the age of 15 students in a certain class.15, 18, 17, 16, 19, 18, 23 , 24, 18, 16, 17, 20, 21,

192. To qualify for scholarship, a student should

have garnered an average score of 2.25. determine if the a certain student is qualified for a scholarship.

Page 54: Probability and statistics(assign 7 and 8)

Subjectno. of units grade A 1 2.0 B 2 3.0 C 3 1.5 D 3 1.25 E 5 2.0

Page 55: Probability and statistics(assign 7 and 8)

3. Find the mean, median and mode of the given grouped data.

Classes f 11-22 223-34 835-46 1147-58 1959-70 1471-82 583-94 1

Page 56: Probability and statistics(assign 7 and 8)

Quartiles refer to the values that divide the distribution into four equal parts. There are 3 quartiles represented by Q1 , Q2 and Q3. The value Q1 refers to the value in the distribution that falls on the first one fourth of the distribution arranged in magnitude. In the case of Q2 or the second quartile, this value corresponds to the median. In the case of third quartile or Q3, this value corresponds to three fourths of the distribution.

Page 57: Probability and statistics(assign 7 and 8)

LH

Q3

Q2

Q1= 1st quartile

= 2nd quartile

=3rd quartile

The position of the quartiles in a given set of data

Page 58: Probability and statistics(assign 7 and 8)

For grouped data, the computing formula of the kth quartile where k = 1,2,3,4,… is given by

Qk = L + [(kn/4 - F)/fm]IiWhere L = lower class boundary of the kth

quartile class F = cumulative frequency before the kth

quartile class fm = frequency before the kth quartile i = size of the class interval

Page 59: Probability and statistics(assign 7 and 8)

ExercisesCompute the value of the first and third quartile of the given

dataclass interval f cf<25-29 3 330-34 5 835-39 10 1840-44 15 3345-49 15 4850-54 15 6355-59 21 8460-64 8 9265-69 6 9870-74 2 100

Page 60: Probability and statistics(assign 7 and 8)

Decile:If the given data is divided into ten equal parts,

then we have nine points of division known as deciles. It is denoted by D1 , D2,

D3 , D4 …and D9

Dk = L + [(kn/10 – F)/fm] I

Where k = 1,2,3,4 …9

Page 61: Probability and statistics(assign 7 and 8)

Exercises Compute the value of the third, fifth and seventh decile of the

given dataclass interval f cf<25-29 3 330-34 5 835-39 10 1840-44 15 3345-49 15 4850-54 15 6355-59 21 8460-64 8 9265-69 6 9870-74 2 100

Page 62: Probability and statistics(assign 7 and 8)

Percentile- refer to those values that divide a distribution into one hundred equal parts. There are 99 percentiles represented by P1, P2, P3, P4, P5, …and P99. when we say 55th percentile we are referring to that value at or below 55/100 th of the data.

Pk = L + [(kn/100 – F)/fm]i

Where k = 1,2,3,4,5,…99

Page 63: Probability and statistics(assign 7 and 8)

Exercises Compute the value of the 30th, 55th, 68th and 88th percentile of

the given dataclass interval f cf<25-29 3 330-34 5 835-39 10 1840-44 15 3345-49 15 4850-54 15 6355-59 21 8460-64 8 9265-69 6 9870-74 2 100

Page 64: Probability and statistics(assign 7 and 8)

Assignment no. 3I. The rate per hour in pesos of 12 employees

of a certain company were taken and are shown below.

44.75, 44.75, 38.15, 39.25, 18.00, 15.75, 44.75, 39.25, 18.50, 65.25, 71.25, 77.50

a. Find the mean, median and mode.b. If the value 15.75 was incorrectly written as

45.75, what measure of central tendency will be affected? Support your answer.

Page 65: Probability and statistics(assign 7 and 8)

II. The final grades of a student in six subjects were tabulated below.

Subj units final gradeAlgebra 3 60Religion 2 90English 3 75Pilipino 3 86PE 1 98History 3 70a. Determine the weighted meanb. If the subjects were of equal number of units, what

would be his average?

Page 66: Probability and statistics(assign 7 and 8)

III. The ages of qualified voters in a certain barangay were taken and are shown below

Class Interval Frequency18-23 2024-29 2530-35 4036-41 5242-47 3048-53 2154-59 1260-65 666-71 472-77 1

Page 67: Probability and statistics(assign 7 and 8)

a. Find the mean, median and modeb. Find the 1st and 3rd quantilec. Find the 4th and 6th deciled. Find the 25th and 75th percentile

Page 68: Probability and statistics(assign 7 and 8)

Measure of VariationThe range is considered to be the simplest form

of measure of variation. It is the difference between the highest and the lowest value in the distribution.

R = H – LFor grouped data, the3 difference between the

highest upper class boundary and the lowest lower class boundary.

Example: find the range of the given grouped data in slide no. 59

Page 69: Probability and statistics(assign 7 and 8)

Semi-inter Quartile Range

This value is obtained by getting one half of the difference between the third and the first quartile.

Q = (Q3 – Q1)/2

Example: Find the semin-interquartile range of the

previous example in slide no. 59

Page 70: Probability and statistics(assign 7 and 8)

Average DeviationThe average deviation refers to the arithmetic

mean of the absolute deviations of the values from the mean of the distribution. This measure is sometimes known as the mean absolute deviation.

AD = Σ│x – x’│/ nWhere x = the individual values x’ = mean of the distribution

Page 71: Probability and statistics(assign 7 and 8)

Steps in solving for AD1. Arrange the values in column according to

magnitude2. Compute for the value of the mean x’3. Determine the deviations (x – x’)4. Convert the deviations in step 3 into positive

deviations. Use the absolute value sign.5. Get the sum of the absolute deviations in

step 46. Divide the sum in step 5 by n.

Page 72: Probability and statistics(assign 7 and 8)

Example:1. Consider the following values:16, 13, 9, 6, 15, 7, 11, 12Find the average deviation.

Page 73: Probability and statistics(assign 7 and 8)

For grouped data:AD = Σf│x – x’│ / nWhere f = frequency of each class x = midpoint of each class x’ = mean of the distribution n = total number of frequency

Page 74: Probability and statistics(assign 7 and 8)

Example:Find the average deviation of the given dataClasses f 11-22 223-34 835-46 1147-58 1959-70 1471-82 583-94 1

Page 75: Probability and statistics(assign 7 and 8)

VarianceFor ungrouped datas2 = Σ(x – x’)2 / nExample: Find the variance of16, 13, 9, 6, 15, 7, 11, 12

Page 76: Probability and statistics(assign 7 and 8)

For grouped datas2 = Σf(x – x’)2 / nWhere f = frequency of each class x = midpoint of each class interval x’ = mean of the distribution n = total number of frequency

Page 77: Probability and statistics(assign 7 and 8)

Example: Find the variance of the given dataClasses f 11-22 223-34 835-46 1147-58 1959-70 1471-82 583-94 1

Page 78: Probability and statistics(assign 7 and 8)

Coefficient of variationIf you wish to compare the variability between

different sets of scores or data, coefficient of variation would be very useful measure for interval scale data

CV = s/xWhere s = standard deviation x = the mean

Page 79: Probability and statistics(assign 7 and 8)

Example:In a particular university, a researcher wishes to

compare the variation in scores of the urban students with that of the scores of the rural students in their college entrance test. It is know that the urban student’s mean score is 384 with a standard deviation of 101; while among the rural students, the mean is 174, with a standard deviation of 53, which group shows more variation in scores?

Page 80: Probability and statistics(assign 7 and 8)

Standard Deviation

s = √s2

For ungrouped data s = √ Σ(x – x’)2 / nFor grouped datas = √ Σf(x – x’)2 / n

Page 81: Probability and statistics(assign 7 and 8)

Find the standard deviation of the previous examples for ungrouped and grouped data.

Find the standard deviation of the given dataClasses f 11-22 223-34 835-46 1147-58 1959-70 1471-82 583-94 1

Page 82: Probability and statistics(assign 7 and 8)

Find the standard deviation of16, 13, 9, 6, 15, 7, 11, 12

Page 83: Probability and statistics(assign 7 and 8)

Measure of variation for nominal dataVR = 1 – fm/NWhere VR = the variation ratio fm = modal class frequency N = counting of observation

Page 84: Probability and statistics(assign 7 and 8)

Example: With the data given by a clinical psychologist on the

type of therapy used, compute the variation ratios.Type of therapy no. of patients YR 1980 YR 1985Logotherapy 20 8Reality Therapy 60 105Rational Therapy 42 6Transactional analysis 39 9Family therapy 52 5Others 41 8

Page 85: Probability and statistics(assign 7 and 8)

Assignment no. 4

I. Compute for the semi-interquartile range, absolute deviation, variance and standard deviation test III of assignment no. 3.

II. Compute for the semi-interquartile range, absolute deviation, variance and standard deviation of test I of assignment no. 3.

Page 86: Probability and statistics(assign 7 and 8)

SIMPLE LINEAR REGRESSION AND MEASURES OF CORRELATION

In this topic, you will learn how to predict the value of one dependent variable from the corresponding given value of the independent variable.

Page 87: Probability and statistics(assign 7 and 8)

The scatter diagram:In solving problems that concern estimation and

forecasting, a scatter diagram can be used as a graphical approach. This technique consist of joining the points corresponding to the paired scores of dependent and independent variables which are commonly represented by X and Y on the X-Y coordinate system.

Page 88: Probability and statistics(assign 7 and 8)

Example:The working experience and income of 8 employees are given

belowEmployee years of income experience (in Thousands) X Y A 2 8 B 8 10 C 4 11 D 11 15 E 5 9 F 13 17 G 4 8 H 15 14

Page 89: Probability and statistics(assign 7 and 8)

Using the Least Squares Linear Regression Equation:

Y = a + bXWhere b = [nΣxy – ΣxΣy] / [nΣx2 – (Σx)2] a = y’ – bx’Obtain the equation of the given data and

estimate the income of an employee if the number of years experience is 20 years.

Page 90: Probability and statistics(assign 7 and 8)

Standard Error of Estimate Se = √ [ΣYi

2 – a(Yi) – b(XiYi)] / n-2

The standard error of estimate is interpreted as the standard deviation. We will find that the same value of X will always fall between the upper and lower 3Se limits.

Page 91: Probability and statistics(assign 7 and 8)

Measures of CorrelationThe degree of relationship between variables is

expressed into:1. Perfect correlation (positive or negative)2. Some degree of correlation (positive or

negative)3. No correlation

Page 92: Probability and statistics(assign 7 and 8)

For a perfect correlation, it is either positive or negative represented by +1 and -1. correlation coefficients, positive or negative, is represented by +0.01 to +0.99 and -0.01 to -0.99. The no correlation is represented by 0.

Page 93: Probability and statistics(assign 7 and 8)

0 to +0.25 very small positive correlation+0.26 to +0.50 moderately small positive correlation+0.51 to +0.75 high positive correlation+0.76 to +0.99 very high positive correlation+1.00 perfect positive correlation----------------------------------------------------------0 to -0.25 very small negative correlation-0.26 to -0.50 moderately small positive correlation-0.51 to -0.75 high negative correlation-0.76 to -0.99 very high negative correlation-1.00 perfect negative correlation

Page 94: Probability and statistics(assign 7 and 8)

Anybody who wants to interpret the results of the coefficient of correlation should be guided by the following reminders:

1. The relationship of two variables does no necessarily mean that one is the cause of the effect of the other variable. It does not imply cause-effect relationship.

2. When the computed Pearson r is high, it does not necessarily mean that one factor is strongly dependent on the other. On the other hand, when the computed Pearson r is small it does not necessarily mean that one factor has no dependence on the other.

3. If there is a reason to believe that the two variables are related and the computed Pearson r is high, these two variables are really meant as associated. On the other hand, if the variables correlated are low, other factors might be responsible for such small association.

4. Lastly, the meaning of correlation coefficient just simply informs us that when two variables change there may be a strong or weak relationship taking place.

Page 95: Probability and statistics(assign 7 and 8)

The formula for finding the Pearson r is [nΣXY – ΣXΣY] r = ------------------------------ √[nΣX2 – (ΣX)2] [nΣY2 – (ΣY)2]

Page 96: Probability and statistics(assign 7 and 8)

Example: Given two sets of scores. Find the Pearson r and interpret the result.

X Y 18 10 16 14 14 14 13 12 12 10 10 8 10 5 8 6 6 12 3 0

Page 97: Probability and statistics(assign 7 and 8)

Correlation between Ordinal DataThis is the Spearman Rank-Order Correlation

Coefficient (Spearman Rho). For cases of 30 or less, Spearman ρ is the most widely used of the rank correlation method.

6ΣD2

ρ = 1 - ----------- n(n2 – 1)Where D = (RX – RY)

Page 98: Probability and statistics(assign 7 and 8)

Example:Individual Test X Test Y 1 18 24 2 17 28 3 14 30 4 13 26 5 12 22 6 10 18 7 8 15 8 8 12

Page 99: Probability and statistics(assign 7 and 8)

Gamma Rank OrderAn alternative to the rank order correlation is

the Goodman’s and Kruskal’s Gamma (G).The value of one variable can be estimated or

predicted from the other variable when you have the knowledge of their values. The gamma can also be used when ties are found in the ranking of the data.

Page 100: Probability and statistics(assign 7 and 8)

NS - N1

G = ----------------- NS + N1

Where NS = the number of pairs ordered in the parallel direction

N1 = the number of pairs ordered in the opposite direction

Page 101: Probability and statistics(assign 7 and 8)

Given a segment of the Filipino Electorate according to religion and political party

LAKAS LP NP Total

Catholic 50 25 20

INC 34 72 21

Born Again

22 12 10

Total

Page 102: Probability and statistics(assign 7 and 8)

Correlation between Nominal Data

The Guttman’s Coefficient of predictability is the proportionate reduction in error measure which shows the index of how much an error is reduced in predicting values of one variable from the value of another.

ΣFBR - MBC λc = ------------------ N – MBCWhere FBR = the biggest cell frequencies in the ith row MBC = the biggest column totals N = total observations

Page 103: Probability and statistics(assign 7 and 8)

ΣFBC - MBR λr = ------------------- N – MBRWhere FBC = the biggest cell frequencies in the

column MBR = the biggest of the row totals N = total number of observationsCompute for the λc and λr for the segment of

Filipino electorate and political parties.

Page 104: Probability and statistics(assign 7 and 8)

Assignment no. 51. Given the average yearly cost and sales of company A for a

period of 8 years. Find the pearson r and interpret the results.

Year Cost Sales per P10,000 per P10,0001960 15 381961 30 53.31962 16 601963 39 721964 20 401965 36 47.51966 45 821967 10 21.5

Page 105: Probability and statistics(assign 7 and 8)

2. Given the grades of 10 students in statistics determine the spearman rho and interpret the result

Student Q1 Q2 A 62 57 B 90 88 C 75 90 D 60 67 E 58 60 F 89 79 G 91 78 H 90 62 I 94 86 J 50 55

Page 106: Probability and statistics(assign 7 and 8)

3. Compute for the gamma shown and interpret the result

Socio-economic status

EDUCATIONAL STATUS TOTAL

UPPER MIDDLE LOWER TOTAL

UPPER 24 19 5

MIDDLE 12 54 29

LOWER 9 26 25

TOTAL

Page 107: Probability and statistics(assign 7 and 8)

4. Compute for the λc and λr for the problem no. 3.

Page 108: Probability and statistics(assign 7 and 8)

Counting TechniquesConsider the numbers 1,2,3 and 4. suppose you want

to determine the total 2 digit numbers that can be formed if these are combined. First, let us assume that no digit is to be repeated.

12 21 31 4113 23 32 4214 24 34 43Notice that we were able to used all the possibilities. In

this example, we have 12 possible 2 digit numbers.

Page 109: Probability and statistics(assign 7 and 8)

Now, what if the digits can be repeated?11 12 13 1421 22 23 2431 23 33 3441 42 43 44Hence, we have 16 possible outcomes.In the first activity, we can do it in n1 ways and after it

has been done, the second activity can be done in n2 ways, then the total number of ways in which the two activities can be done is equal to n1 n2.

Page 110: Probability and statistics(assign 7 and 8)

Example:1. How many two digit numbers can be formed from

the numbers 1,2,3 and 4 ifa. Repetition is not allowed?b. Repetition is allowed?2. How many three digit numbers can be formed from

the digits 1,2,3,4 and 5 if any of the digits can be repeated?

3. The club members are going to elect their officers. If there are 5 candidates for president, 5 candidates for vice president and 3 for secretary, then how many ways can the officers be elected?

Page 111: Probability and statistics(assign 7 and 8)

4. An office executive plans to buy as laptop in which there are 5 brands available. Each of the brands has 3 models and each model has 5 colors to chose from. In how many ways can the executive choose?

5. Consider the numbers 2,3 5 and 7. if repetition is not allowed, how many three digit numbers can be formed such that

a. They are all odd?b. They are all even?c. They are greater that 500?

Page 112: Probability and statistics(assign 7 and 8)

6. A pizza place offers 3 choices of salad, 20 kinds of pizza and 4 different deserts. How many different 3 course meals can one order?

7. The executive of a certain company is consist of 5 males and 2 females. How many ways can the presidents and secretary be chosen if

a. The president must be female and the secretary must be male?

b. The president and the secretary are of opposite sex?

c. The president and the secretary should be male?

Page 113: Probability and statistics(assign 7 and 8)

Permutation The term permutation refers to the

arrangement of objects with reference to order.

P(n,r) = n! / (n – r)!Evaluate:1. P(10,6)2. P(5,5)3. P(4,3) + P(4,4)

Page 114: Probability and statistics(assign 7 and 8)

Examples:1. In how many ways can a president, a vice

president, a secretary and a treasurer be elected from a class with 40 students?

2. In how many ways can 7 individuals be seated in a row of 7 chairs?

3. In how many ways can 9 individuals be seated in a row of 9 chairs if two individuals wanted to be seated side by side?

Page 115: Probability and statistics(assign 7 and 8)

4. Suppose 5 different math books and 7 different physics books shall be arranged in a shelf. In how many ways can such books be arranged if the books of the same subject be placed side by side?

5. Determine the possible permutations of the word MISSISSIPPI.

6. Find the total 8 digit numbers that can be formed using all the digits in the following numerals 55777115

Page 116: Probability and statistics(assign 7 and 8)

7. In how many ways can 6 persons be seated around a table with 6 chairs if two individuals wanted to be seated side by side?

8. In a local election, there are 7 people running for 3 positions. In how many ways can this be done?

Page 117: Probability and statistics(assign 7 and 8)

Combination A combination is an arrangement of objects not

in particular order.nCr = C(n,r) = n! / r!(n-r)!Evaluate:1. 8C4

2. 5(5C4 – 5C2)

3. 7C5 / (7C6 – 7C2)

Page 118: Probability and statistics(assign 7 and 8)

1. A class is consist of 12 boys and 10 girls.a. In how many ways can the class elect the

president, vice president, secretary and a treasurer?

b. In how many ways can the class elect 4 members of a certain committee?

2. In how many ways can a student answer 6 out of ten questions?

3. In how many ways can a student answer 6 out of 10 questions if he is required to answer 2 of the first 5 questions?

Page 119: Probability and statistics(assign 7 and 8)

4. In how many ways can 3 balls be drawn from a box containing 8 red and 6 green balls?

5. A box contain 8 red and 6 green balls. In how many ways can 3 balls be drawn such that

a. They are all green?b. 2 is red and 1 is green?c. 1 is red and 2 is green?

Page 120: Probability and statistics(assign 7 and 8)

6. A shipment of 40 computers are unloaded from the van and tested. 6 of them are defective. In how many ways can we select a set of 5 computers and get at least one defective?

7. Five letters a,b,c,d,e are to be chosen. In how many ways could you choose

a. None of themb. At least two of themc. At most three of them

Page 121: Probability and statistics(assign 7 and 8)

Assignment no. 61. How many possible outcomes are there ifa. A die is rolled?b. A pair of dice is rolled?2. In how many ways can 5 math teachers be

assigned to 4 available subjects if each of the 5 teachers have equal chance of being assigned to any of the 4 subjects?

Page 122: Probability and statistics(assign 7 and 8)

3. Consider the numbers 1,2,3,5,and 6. how many 3 digit numbers can be formed from these numbers if

a. Repetition is not allowed and 0 should not be in the first digit?

b. Repetition is allowed and 0 should not be in the first digit?

4. A college has 3 entrance gates and 2 exit gates. In how many ways can a student enter then leave the building?

Page 123: Probability and statistics(assign 7 and 8)

5. In how many ways can 9 passengers be seated in a bus if there are only 5 seats available?

6. In how many ways can 4 boys and 4 girls be seated in a row of 8 chairs if

a. They can sit anywhere?b. The boys and girls are to be seated

alternately?7. In how many ways can ten participants in a

race placed first, second and third?

Page 124: Probability and statistics(assign 7 and 8)

8. Determine the number of distinct permutations of each of the following:

a. STATISTICSb. ADRENALINc. 440449994049. A class consist of 12 boys and 10 girls. In

how many ways can a committee of five be formed if

a. All members are boys?b. 2 are boys and 3 are girls?

Page 125: Probability and statistics(assign 7 and 8)

10. In how many ways can a student answer an exam if out of the 6 problem, he is required to answer only 4?

Page 126: Probability and statistics(assign 7 and 8)

ProbabilityIn the study of probability, we shall consider activities

for which the outcomes cannot be predicted with certainty. These activities, called experiment, could always result in a single outcome. Although the single outcome can not be predicted before the performance of the experiment, the set of all possible outcomes can be determined. This set of all possible outcomes is referred to as sample space. Each individual element or outcome in a sample space is known as a sample point.

Page 127: Probability and statistics(assign 7 and 8)

Definition of terms:1. Random experiment- any process of

generating a set of data or observations that can be repeated under basically the same conditions, which lead to well defined outcomes.

2. Sample space – set of all possible outcomes of an experiment, usually denoted by S.

3. Sample point- an element of the sample space or outcomes.

Page 128: Probability and statistics(assign 7 and 8)

4. event- any subset of the sample space usually denoted by capital letters.

5. Null space- a subset of the sample space that contains no elements and denoted by the symbol Ø.

6. Simple event – an event which contains only one element of the sample space.

7. Compound event – an event that can be expressed as the union of the simple events, thus containing more than one sample points.

8. Mutually exclusive events- two events A and B are mutually exclusive if A∩B have no elements in common.

Page 129: Probability and statistics(assign 7 and 8)

The probability of a event A denoted by P(A) is the sum of the probabilities of mutually exclusive outcomes that constitute the event. It must satisfy the following properties:

0 ≤ P(A) ≤ 1

Page 130: Probability and statistics(assign 7 and 8)

Example:1. Consider the activity of rolling a die. This activity has

6 possible outcomes, that is 1,2,3,4,5 and 6. thus, S = {1,2,3,4,5,6}Any numbers 1 to 6 is a sample point of S. we can say

that there are 6 sample points. If we let A be the event of getting an even number and B an event of getting a perfect square, then

A = {2,4,6} and B = {1,4}Note that the elements of A are elements of the

sample space S. the number of sample points in a sample space S, events A and B are usually written as n(S) = 6, n(A) = 3 and n(B) = 2.

Page 131: Probability and statistics(assign 7 and 8)

2. If a pair of dice is rolled, then determine the number of sample points of the following:

a. Sample spaceb. Event of getting a sum of 5.c. Event of getting a sum of at most 4.3. A box contains 6 red and 4 green balls. If three

balls are drawn from the box, then determine the number of sample points of the following:

a. The sample spaceb. The event of getting all green ballsc. The event of getting 1 red and 2 green balls.

Page 132: Probability and statistics(assign 7 and 8)

Probability is the chance that an event will happen. The probability of an event A denoted by P(A) refers to the number between 0 and 1 including the values of 0 and 1. This number can be expressed as a fraction, as a decimal or as a percent. When we assign a probability of 0 to event A, it means that it is impossible for event A to occur. When event A is assigned a probability of 1, then we say that event A will really occur.

Page 133: Probability and statistics(assign 7 and 8)

P(A) + P(A)’ = 1The probability of occurrence plus the

probability of non-occurrence is always equal to 1.

Example:A student in a statistics class was able to

compute the probability of passing the subject to be equal to 0.55. Based on this information, what is the probability that he is not going to pass the subject?

Page 134: Probability and statistics(assign 7 and 8)

Three approaches of probability:1. Subjective probability- it is determine by the use of

intuition, personal beliefs and other indirect information.

2. A posteriori or probability of relative frequency (empirical probability) – it is determined by repeating the experiment a large number of times using the following rule:

no. of times event A occurred P(A) = --------------------------------------------------- no. of times experiment was repeated

Page 135: Probability and statistics(assign 7 and 8)

Example:Records show that 120 out of 500 students who

entered in a CS/IT programs leave the school due to financial problems. What is the probability that a freshman entering this college will leave the school due to financial problem?

Page 136: Probability and statistics(assign 7 and 8)

2. Last year, the efficiency rating of the employees of a certain company were taken and presented in a frequency distribution below:

Efficiency rating no. of employees 60-65 12 66-71 10 72-77 31 78-83 29 84-89 8Based on the data, what can we say about the

proportion of employees for this year who shall have an efficiency rating from 72-77 and 84-89?

Page 137: Probability and statistics(assign 7 and 8)

3. A Priori or classical probability – it is determined even before the experiment is performed using the following rule:

n(A)P(A) = -------- n(S)Where n(A) = no. of sample points in event A n(S) = no. of sample points in sample

space S.

Page 138: Probability and statistics(assign 7 and 8)

1. If a coin is tossed , what is the probability of getting a head?

2. If two coins are tossed, what is the probability of getting both heads?

3. If a die is rolled, what is the probability of getting an odd number? An even number? A perfect square?

4. If a pair of dice is rolled, what is the probability of getting a sum of 6? A sum of 13?

Page 139: Probability and statistics(assign 7 and 8)

5. The probability that a college student without a flu shot will get the flu is 0.42.what is the probability that a college student without the flu shot will not get the flu?

6. A box contains 7 red and 6 green balls. If 2 balls are drawn from the box, what is the probability of getting both green? 1 red and 1 green?

Page 140: Probability and statistics(assign 7 and 8)

Addition Rule:In practice, the probability of two or more

events are usually considered. If we let A and B be events then these two events can be combined to form another event. The event that at least one of the events A or B will happen is denoted by AUB. The event that both events A and B will occur is denoted by A∩B. The probability of AUB denoted by P(AUB) is given by

P(AUB) = P(A) + P(B) – P(A∩B)

Page 141: Probability and statistics(assign 7 and 8)

Two events A and B are said to be mutually exclusive if they can not occur both at the same time. This implies that the occurrence of event A excludes the occurrence of event B and vice versa. Therefore, P(A∩B) has no sample point which is equal to 0. The previous equation will be

P(AUB) = P(A) + P(B)

Page 142: Probability and statistics(assign 7 and 8)

1. Consider rolling a die and the events of getting an odd number, an even number and a perfect square. Determine the probability of getting

a. An odd or an even number.b. An even number or a perfect square. (this

implies that the two events can occur both at the same time. Therefore the two events are non-mutually exclusive events)

Page 143: Probability and statistics(assign 7 and 8)

2. A card is drawn from an ordinary deck of 52 playing cards. Find the probability of getting

a. An ace or a queenb. A queen or a face cardc. A black card or a queen

Page 144: Probability and statistics(assign 7 and 8)

3. You are going to rolled a pair of dice. Find the probability of getting the sum that is even or the sum that is multiple of 3.

4. A student goes to the library and checks out that 40% are work of fiction, 30% are non fiction and 20% are either fiction or non-fiction. What is the probability that the student check out a work of fiction, non-fiction or both?

Page 145: Probability and statistics(assign 7 and 8)

5. The probability that Anita will buy machine A is 7/11 and the probability that she will buy machine B is 5/11. If the probability of buying either machine A and B is 9/11, what is the probability of buying the two machine?

Page 146: Probability and statistics(assign 7 and 8)

6. A community swim team has 150 members. Seventy-five of the members are advanced swimmers. Forty-seven of the members are intermediate swimmers. The remainder are novice swimmers. Forty of the advanced swimmers practice 4 times a week. Thirty of the intermediate swimmers practice 4 times a week. Ten of the novice swimmers practice 4 times a week. Suppose one member of the swim team is randomly chosen. Answer the questions (Verify the answers):

Page 147: Probability and statistics(assign 7 and 8)

a. What is the probability that the member is a novice swimmer?

b. What is the probability that a member practice 4 times a week?

c. What is the probability that the member is an advanced swimmer and practice 4 times a week?

d. What is the probability that a member is an advance swimmer and an intermediate swimmer? Are they mutually exclusive?

Page 148: Probability and statistics(assign 7 and 8)

SEATWORK1. A BOX CONTAINS 7 RED, 3 GREEN AND 2 YELLOW BALLS. IF ONE BALL IS DRAWN

FROM THE BOX, THEN WHAT IS THE PROBABILITY OF GETTING• A RED?• A NON-RED?• A NON-GREEN?2. SUPPOSE THAT WE ROLL A DICE, WHAT IS THE PROBABILITY OF GETTING A SUM OF

6 OR 8?3. SUPPOSE WE PICK ONE CARD FROM A DECK OF CARDS, WHAT IS THE PROBABILITY

OF GETTING• A KING OR A SPADE?• A KING OR NUMBER 8?4. KLAUS IS TRYING TO CHOOSE WHERE TO GO ON VACATION. HIS CHOICES ARE

A=BAGIUO AND B=TAGAYTAY. HE CAN ONLY AFFORD ONE VACATION. THE PROBABILITY OF CHOOSING A IS 0.36 AND THE PROBABILITY OF CHOOSING B IS 0.44. WHAT IS THE PROBABILITY THAT HE CHOOSES TO GO EITHER A OR B? WHAT IS THE PROBABILITY THAT HE WILL NOT CHOOSE ANY OF THE TWO DISTINATION?

Page 149: Probability and statistics(assign 7 and 8)

Conditional Probability and Multiplication Rule

It is the probability that a second event will occur if the first event already happened. Symbolically, conditional probability is written as P(A/B) and is read as the probability of event A given that B has occurred. The computing formula for the conditional probability of A given B is given by

P(A/B) = P(A∩B)/P(B), provided P(B) is not equal to zero.

Page 150: Probability and statistics(assign 7 and 8)

1. Let P(A) = 0.55 P(B) = 0.35 P(A∩B) = 0.20Find P(A/B) and P(B/A)2. A die is rolled. If the result is an even number,

what is the probability that it is a perfect square?

3. A card is drawn from a deck of 52 cards. Given that the card drawn is a face card, then what is the probability of getting a king? A spade? A red card?

Page 151: Probability and statistics(assign 7 and 8)

4. A vendor has 35 balloons on strings. 20 balloons are yellow, 8 are red and 7 are green. A balloon was selected at random and sold. Given that the balloon selected and sold is yellow, what is the probability that the next balloon selected and sold at random is also yellow?

5. Given that 25 microwaves are on display in a certain store but 2 of them are defective. A customer wishes to buy 2 microwaves and pick them up without replacement. Find the probability that the two are defective.

Page 152: Probability and statistics(assign 7 and 8)

6. Should women participate in combat? yes noMale 32 18Female 8 42a. Find the probability that the respondent

answered YES given that the respondent was a female.

b.Find the probability that the respondent was a male given that the respondent answered NO.

Page 153: Probability and statistics(assign 7 and 8)

7. A box contains 3 red and 8 black balls. If two balls are drawn in succession without replacement, what is the probability that

a. Both are red?b.The first ball is red and the second ball is

black?8. A box contains 3 red and 8 black balls. If 2

balls are drawn at random with replacement, what is the probability that both are red?

Page 154: Probability and statistics(assign 7 and 8)

Assignment no. 71.. A BOX CONTAINS 7 RED, 3 GREEN AND 2 YELLOW BALLS. IF ONE BALL IS DRAWN

FROM THE BOX, THEN WHAT IS THE PROBABILITY OF GETTING• A RED?• A NON-RED?• A NON-GREEN?2. SUPPOSE THAT WE ROLL A DICE, WHAT IS THE PROBABILITY OF GETTING A SUM OF

6 OR 8?3. SUPPOSE WE PICK ONE CARD FROM A DECK OF CARDS, WHAT IS THE PROBABILITY

OF GETTING• A KING OR A SPADE?• A KING OR NUMBER 8?4. KLAUS IS TRYING TO CHOOSE WHERE TO GO ON VACATION. HIS CHOICES ARE

A=BAGIUO AND B=TAGAYTAY. HE CAN ONLY AFFORD ONE VACATION. THE PROBABILITY OF CHOOSING A IS 0.36 AND THE PROBABILITY OF CHOOSING B IS 0.44. WHAT IS THE PROBABILITY THAT HE CHOOSES TO GO EITHER A OR B? WHAT IS THE PROBABILITY THAT HE WILL NOT CHOOSE ANY OF THE TWO DISTINATION?

Page 155: Probability and statistics(assign 7 and 8)

5. The probability that it is Friday and that a student is absent is 0.03. Since there are 5 school days in a week, the probability that it is Friday is 0.2. What is the probability that a student is absent given that today is Friday?

Page 156: Probability and statistics(assign 7 and 8)

Normal DistributionThe normal probability curve is one of the most

commonly used theoretical distributions in statistical inference. The mathematical equation of the normal curve was developed by De Moivre in 1773. the distribution is sometimes called the Gaussian distribution in honor of Gauss, who also derived the equation in the 19th century.

Page 157: Probability and statistics(assign 7 and 8)

Con’tA large population investigated in education and

the behavioral sciences has characteristics that follow a normal distribution. If we are to study, for instance, the scholastic mental capacity of a school population N= 1500, we may find that majority of the student population will yield average scores, a small portion will yield above and below average scores and a few students will yield extremely high and low scores.

Page 158: Probability and statistics(assign 7 and 8)

Con’tThe characteristics of the Normal Curve is1. The curve is symmetrical and bell shaped. It

has its highest point at the center. The lines at both sides fall off toward the opposite directions at exactly equal distance from the center. Therefore if the curve is folded at the middle, the two sides are perfectly of the same size and shape.

Page 159: Probability and statistics(assign 7 and 8)

Con’t2. The number of cases, N, is infinite. This is the

reason why the curve is asymptotic to the baseline which means that the curve at both sides does not touch the baseline or the axis, and that the curve may extend infinitely to both directions.

3. The three measures of central tendency, mean, median and mode coincide at one point at the center of the distribution.

Page 160: Probability and statistics(assign 7 and 8)

Con’t4. The height of the curve indicate the frequency

of cases, expressed as probability, proportion or percentage. Hence, the total area under the normal curve is 1.0 in terms of probability or proportion and 100% in terms of percentage. Thus one half of the area is 50%

5. The basic unit of measurement is expressed in sigma units (σ) or standard deviations along the baseline. It is also called Z-scores.

Page 161: Probability and statistics(assign 7 and 8)

Con’t6. Two parameters are used to describe the

curve. One is the parameter mean(μ or x’) which is equal to zero and the other is the standard deviation(σ) which is equal to 1.

7. Standard deviations or A scores departing away from the mean (μ or x’) towards the right of the curve is in positive while scores departing from the mean is in negative values.

Page 162: Probability and statistics(assign 7 and 8)

The normal probability curve

Page 163: Probability and statistics(assign 7 and 8)

From the previous curveWe can say that,1.At least 68% of the values in the given set of

data fall within plus or minus 1 standard deviation from the mean. In symbols, the interval is given by (x’ – 1σ) – (x’ + 1σ).

2.At least 95% of the value in the given set of data fall within plus or minus 2 standard deviation from the mean. In symbol, the interval is (x’ – 2σ) – (x’ + 2σ) and so on.

Page 164: Probability and statistics(assign 7 and 8)

To illustrate the significance of the empirical rule, consider the NCEE scores of students in a certain college whose mean score x’ or μ = 65 and the standard deviation σ or SD = 6

1. approximately, 68% of the students in that college have NCEE scores between 80 plus or minus 10, that is

65 – (1)(6) – 65 + (1)(6) 59 - 71

Page 165: Probability and statistics(assign 7 and 8)

The Standard ScoreThe standard score Z represents a normal

distribution with mean x’ = 0 and SD = 1. such transformation can be obtained by using the formula below.

Z = (x – x’) / SD

Page 166: Probability and statistics(assign 7 and 8)

Normal Curve AreasThe total area under the normal curve is equal

to 1. since a normally distributed set of data is symmetric, then the total area from Z = 0 to the right is equal to 0.5. the area from Z = 0 to the left is also equal to 0.5.

Example:Find the area under the curve from 1.0<Z<1.252. -1.25<Z<0

Page 167: Probability and statistics(assign 7 and 8)

Normal Probability DistributionFind the probability value of1.P(Z>1.45)2. P(Z<-0.4)3. P(-0.4<Z<1.45)4. P(1.15<Z<2.33)5. P(Z<1.28)6. P(Z>-1.04)

Page 168: Probability and statistics(assign 7 and 8)

Con’t7. The examination results of a large group of

students in statistics are normally distributed with a mean of 40 and a standard deviation of 4. If a student is chosen at random, what is the probability that his score is

a. Below 30?b.Above 55?c. Below 42?d.Between 35 to 45?e.Between 33 to 50?

Page 169: Probability and statistics(assign 7 and 8)

Con’t8. The efficiency rating of 400 faculty members of a

certain university were taken and resulted in a mean rating of 78 with a standard deviation of 6.75. assuming that the set of data are normally distributed, how many of the faculty members have an efficiency rating of

a. Greater than 78?b. Less that 78?c. Greater than 85?d. Between 75-90?

Page 170: Probability and statistics(assign 7 and 8)

Assignment no. 8I. Find the area under the following condition1. Between the -2.02 and 1.012. To the right of 1.623. To the left of 0.564. Between 0.65 and 1.185. Between -2.09 and -0.78II. In a reading ability test, with a sample of 120

cases, the mean score is 50 and the standard deviation is 5.5.

Page 171: Probability and statistics(assign 7 and 8)

Con’ta. What percentage of the cases falls between

the mean and a score of 55?b. What is the probability that a score picked at

random will lie above the score of 55?c. What is the probability that a score will lie

below 40?d. How many cases fall between 55 to 60?e. How many cases fall between 40 to 49?

Page 172: Probability and statistics(assign 7 and 8)

END OFLECTURE