Saint Ma 'S College September 4, 2021: 6 October 2021
Saint Ma 'S College September 4, 2021: 6 October 2021
September 4, 2021
6 October 2021 1
What is a
frequency
distribution?
• is a collection of
obse ations produced by
so ing them into classes
and showing their
frequency of occurrences
in each class.
6 October 2021 [email protected] 2
•The following data give the results of a
sample su ey. The letters A, B, and C
represent the three categories.
A B A A C C A C C C
C B C B B C B B B C
B C C A C C C B C A
Construct a frequency distribution table for these data.
3
Catego Tally
Frequency (f)
A IIIII – I 6
B IIIII – IIII 9
C IIIII – IIIII – IIIII 15
sum = 30
4
•When 40 people were su eyed at Davao
City, they repo ed the distance they
drove to the mall, and the results (in km)
are given below.
2 8 1 5 9 5 14 10 31 20
15 4 10 6 5 5 1 8 12 10
25 40 31 24 20 20 3 9 15 15
25 8 1 1 16 23 18 25 21 12
Construct a frequency distribution table for these data.
6 October 2021 5
Step 1 – Find the range. (R =highest – lowest)
Step 2 – Decide on the number of classes your frequency table will
have.
CI = 1 + (3.322 log n)
Step 3 – Find the class width. ( CW = R/CI)
Step 4 – Determine the lower class limits. Add the class width to the
sta ing point to get the second lower class limit. Then enter the
upper class limit.
Step 5 – Find the class boundaries by subtracting 0.5 from each lower
class limit and adding 0.5 to the upper class limit.
Step 6 – Represent each score by a tally.
Step 7 – Count the frequency for each class.
Step 8 – Determine the class midpoint or class mark. (LL + UP)/2
Step 9 – Determine the cumulative frequency.
Step 10 – Determine the relative frequency. (Each frequency/n) 6
Range :
Class Inte al (CI) = 1 + (3.322 log 40) = 6.32 = 7 (round to the next whole no.)
Class Width: R/CI = 39/7 = 5.57 = 6
Class Class Frequency Class Cumulativ Relative
Inte al Bounda (F) Marks (Xi) e Frequency
(CI) (CB) Frequency (RF)
(CF)
7
Range : 40 – 1 = 39
Class Inte al (CI) = 1 + (3.322 log 40) = 6.32 = 7 (round to the next whole no.)
Class Width: R/CI = 39/7 = 5.57 = 6
Class Class Frequency Class Cumulativ Relative
Inte al Bounda (F) Marks (Xi) e Frequency
(CI) (CB) Frequency (RF)
(CF)
1 - 6 0.5 – 6.5 12 3.5 12 0.30
7 - 12 6.5 – 12.5 10 9.5 22 0.25
13 - 18 12.5 – 6 15.5 28 0.15
18.5
19 - 24 18.5 – 6 21.5 34 0.15
24.5
25 - 30 24.5 – 3 27.5 37 0.08
30.5
31 - 36 30.5 – 2 33.5 39 0.05
36.5
37 - 42 36.5 – 1 39.5 40 0.03 8
Measures of Central Tendency
Grouped Mean
•Find the class marks. Place them in a column.
•Multiply the frequency by the class mark for each class. Place them
in another column.
•Find the sum of the resulting column in step 2.
•Divide the sum obtained in step 3 by the total number of
frequencies. That is,
mean f xm
n
Example
The ages of 80 diabetics were asked. The following distribution was
obtained. Find the mean age.
Age Frequenc y
0-6 1
7-13 4
14-20 5
21-27 8
28-34 10
35-41 10
42-48 7
49-55 12
56-62 11
63-69 7
70-76 5
Measures of Central Tendency
Median
-is the midpoint of the data array. Before nding this value, the
data must be arranged in order, from least to greatest or vice versa.
-will either be a speci c value or will fall between two values.
Measures of Central Tendency
Grouped Median
•Make a Table of cumulative frequency.
•Divide n, number of frequency by 2, to get the halfway point.
•Locate the median class in the cumulative frequency column.
•Substitute in the formula,
n
cf
median ( Md ) 2 w L md
f
2 and
x
2
x
2
N N
N - number of population - population
mean
Example
The nal exam scores of 5 students were 80, 88, 92, 90, and 85.
Determine the variance and standard deviation.
Measures of Variation
The unbiased estimator of the population variance is a statistics
whose value approximates the expected value of a population
variance.
2
x x
s
2
n 1
where
x
- sample mean; n - sample size
Measures of Variation
Sho cut formulas
s
2 n x x
2 2
s n x x
2 2
n ( n 1) n ( n 1)
= [8 (568)-(60)(60)]/56
= (4,544 – 3,600)/56
= 16.8571 s = 4.11
Example
The weights of nine basketball players are recorded as follows (in
pounds).
206 215 305 297 265 282 301 255 261
Find the variance and standard deviation.
8 12 15 4 3 5 7 6
The summation of the scores = 60
The summation of the squares of each score = 568
Measures of Variation
Variance and standard deviation of grouped data
s
2 n ( f x m ) ( f x m )
2 2
n ( n 1)
s n ( f x m ) ( f x m )
2 2
n ( n 1)
Example 9
The ages of 80 diabetics were asked. The following distribution was
obtained. Find the variance and standard deviation.
Age Frequenc
y
0-6 1
7-13 4
14-20 5
21-27 8
28-34 10
35-41 10
42-48 7
49-55 12
56-62 11
63-69 7
70-76 5
Measures of Variation
Coe cient of Variation
-statistics that allows us to compare two di erent data sets that
have di erent units of measurement.
-expresses the standard deviation as a percentage of the mean.
-the data with the larger CV is more variable.
where s
- mean, Md
x - median, s - standard deviation
Measures of Variation
Symmetric
- has two halves which mirrors each other. The value of the
skewness is zero.
Positively skewed
- most scores are below the mean and there are extremely low
scores. In this distribution, typically, the mode is lower than the
median and the median is lower than mean.
Negatively skewed
- most scores are above the mean and there are extremely high
scores. The mean is lower than the median and the median is lower
than the mode.
Example
Find the coe cient of skewness of a distribution with mean 10,
median 8 and standard deviation 3.
Measures of Variation
Measure of Ku osis
-measure to determine the peakness of a distribution.
for ungrouped data set: for grouped data set:
4 4
x x f x m x
K K
ns 4 ns 4
n - number of data in the sample s - sample standard deviation
Measures of Variation
A distribution is said to be
Mesoku ic if K = 3
Leptoku ic if K > 3
Platyku ic if K < 3
THANK YOU!
HAVE A NICE
DAY!