Lesson 14. Analysis of Variance
Analysis of Variance
Analysis of Variance (ANOVA) is a comparison test used to determine whether there is a significant
difference among the means of normal populations.
The means of three or more normally distributed populations can be compared simultaneously
in a single application of this test. The test is therefore a generalization of the z and t
tests for two normal population means. It was developed by Sir Ronald Fisher (1890 - 1962).
When all three assumptions are met (random and independent samples, normally distributed
populations, and equal population variances), the results of the analysis of variance will be valid.
$$SST = \sum X^2 - \frac{(\sum X)^2}{N}$$
Where:
SST = sum of squares total
X = individual values in each column
N = total sample size
$$SSB = \sum \frac{(\sum X_c)^2}{n} - \frac{(\sum X)^2}{N}$$
Where:
SSB = sum of squares between columns
∑Xc = sum of individual values per column
n = size of sample per column
$$SSW = SST - SSB$$
Where:
SSW = sum of squares within columns
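The shortcut formulas above can be checked numerically. The following is a minimal Python sketch, not part of the original lesson: the function names and the `toy` data are hypothetical, and the within-column sum of squares is taken as the remainder SST - SSB. It also compares the shortcut SST with the definitional form, the sum of squared deviations from the grand mean.

```python
def sums_of_squares(columns):
    """Return (SST, SSB, SSW) using the shortcut formulas.

    `columns` is a list of lists, one inner list per group (column).
    """
    values = [x for col in columns for x in col]
    N = len(values)
    correction = sum(values) ** 2 / N  # (sum of all X)^2 / N
    sst = sum(x * x for x in values) - correction
    ssb = sum(sum(col) ** 2 / len(col) for col in columns) - correction
    ssw = sst - ssb  # within-column sum of squares as the remainder
    return sst, ssb, ssw


def sst_from_deviations(columns):
    """SST computed directly as the sum of squared deviations from the grand mean."""
    values = [x for col in columns for x in col]
    mean = sum(values) / len(values)
    return sum((x - mean) ** 2 for x in values)


# Hypothetical data: three groups of unequal size.
toy = [[2.0, 4.0, 6.0], [5.0, 7.0], [1.0, 3.0, 5.0, 7.0]]
sst, ssb, ssw = sums_of_squares(toy)
print(sst, ssb, ssw)
print(sst_from_deviations(toy))
```

The two ways of computing SST should agree up to floating-point rounding, which is a quick way to catch arithmetic slips in hand computations.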
A safety engineer is testing four different types of smoke alarm systems. After installing five of each
type in a smoke chamber, he introduced smoke to a uniform level, electrically connected the alarms,
and observed the reaction time in seconds. Is there a significant difference in the reaction time of the
four types?
Alarm Type
Observations
1 2 3 4
Solution:
Step 1. Make assumptions.
- Respondents randomly selected
- Distribution is normal
$$SST = [\,5.2^2 + 6.3^2 + 4.9^2 + 3.2^2 + 6.8^2 + 7.4^2 + 8.1^2 + 5.9^2 + 6.5^2 + 4.9^2 + 3.9^2 + 6.4^2 + 7.9^2 + 9.3^2 + 4.1^2 + 12.3^2 + 9.4^2 + 7.8^2 + 10.8^2 + 8\ldots$$
SST = 106.91
SSB = 1029.3 − 973.0125
SSB = 56.29
SSW = 106.91 − 56.29
SSW = 50.62
The following are the growth measurements (cm) of a certain plant after the application of 4 different
concentrations of a certain chemical over a specified period of time.
        Concentrations
  1      2      3      4
 8.1    7.6    6.8    6.7
 8.6    8.3    5.7    7.2
 9.3    8.5    7.1    6.2
 9.1    8.0    6.7    6.8
        7.9    7.3    7.0
               6.0
Is there a significant difference in the average growth of these plants for the different concentrations
of the chemical? Use 0.05 level of significance.
Solution:
- Distribution is normal
Ho: µ1 = µ2 = µ3 = µ4
df numerator = k - 1 = 4 - 1 = 3
df denominator = N - k = 20 - 4 = 16
F tabular = 3.24
Step 4. Compute the test statistic.
              Concentrations
          1        2        3        4
         8.1      7.6      6.8      6.7
         8.6      8.3      5.7      7.2
         9.3      8.5      7.1      6.2
         9.1      8.0      6.7      6.8
                  7.9      7.3      7.0
                           6.0
Total   35.1     40.3     39.6     33.9     148.9
n       n1 = 4   n2 = 5   n3 = 6   n4 = 5   N = 20
$$SST = 8.1^2 + 8.6^2 + \dots + 6.8^2 + 7.0^2 - \frac{(148.9)^2}{20}$$
SST = 1127.91 − 1108.561
SST = 19.35
$$SSB = \frac{(35.1)^2}{4} + \frac{(40.3)^2}{5} + \frac{(39.6)^2}{6} + \frac{(33.9)^2}{5} - 1108.561$$
SSB = 1124.02 − 1108.56
SSB = 15.46
SSW = SST − SSB
SSW = 19.35 − 15.46
SSW = 3.89
Source of      Sum of     Degrees of   Mean Square              Computed F-value
Variation      Squares    Freedom
Between Col.   15.46      3            MSB = 15.46/3 = 5.15     F = 5.15/0.24 = 21.46
Within Col.    3.89       16           MSW = 3.89/16 = 0.24
Total          19.35      19
Since the computed F value of 21.46 is much greater than the tabular F value of 3.24, Ho is rejected.
There is a significant difference in average growth among the 4 different concentrations.
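The computation for this example can be reproduced with a short script. This is a minimal sketch in plain Python, not part of the original lesson; the variable names are illustrative, and the tabular F value of 3.24 is copied from the text rather than computed. Because the script does not round intermediate results, its F value comes out near 21.2 rather than 21.46, which does not change the decision.

```python
# Growth data (cm) from the table, one list per concentration.
growth = {
    1: [8.1, 8.6, 9.3, 9.1],
    2: [7.6, 8.3, 8.5, 8.0, 7.9],
    3: [6.8, 5.7, 7.1, 6.7, 7.3, 6.0],
    4: [6.7, 7.2, 6.2, 6.8, 7.0],
}
F_TABULAR = 3.24  # taken from the text for df = (3, 16) at the 0.05 level

values = [x for col in growth.values() for x in col]
N = len(values)   # total sample size
k = len(growth)   # number of columns (concentrations)

correction = sum(values) ** 2 / N                      # (sum X)^2 / N
sst = sum(x * x for x in values) - correction          # total
ssb = sum(sum(c) ** 2 / len(c) for c in growth.values()) - correction  # between
ssw = sst - ssb                                        # within

msb = ssb / (k - 1)   # mean square between columns
msw = ssw / (N - k)   # mean square within columns
f_value = msb / msw
reject_h0 = f_value > F_TABULAR

print(f"SST={sst:.2f} SSB={ssb:.2f} SSW={ssw:.2f}")
print(f"MSB={msb:.2f} MSW={msw:.2f} F={f_value:.2f} reject Ho: {reject_h0}")
```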
$$SST = \sum X^2 - \frac{(\sum X)^2}{N}$$
Where:
SST = sum of squares total
X = individual values in each column
N = total sample size
$$SSR = \sum \frac{(\sum X_r)^2}{n} - \frac{(\sum X)^2}{N}$$
Where:
SSR = sum of squares for row means
∑Xr = sum of individual values per row
n = size of sample per row
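$$SSC = \sum \frac{(\sum X_c)^2}{n} - \frac{(\sum X)^2}{N}$$
$$SSE = SST - SSR - SSC$$
Where:
SSC = sum of squares for column means
∑Xc = sum of individual values per column
n = size of sample per column
SSE = sum of squares for error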
Example:
          Machines
Day     A     B     C
1      17    22    20
2      18    20    21
3      21    24    23
4      18    23    17
Solution:
- Distribution is normal
Ho (treatments): µA = µB = µC
Ho (blocks): β1 = β2 = β3 = β4
For treatments:
df numerator = c - 1 = 3 - 1 = 2
df denominator = (r - 1)(c - 1) = (4 - 1)(3 - 1) = (3)(2) = 6
F1 df = (2, 6)
F tabular = 5.14
For blocks:
df numerator = r - 1 = 4 - 1 = 3
df denominator = (r - 1)(c - 1) = (4 - 1)(3 - 1) = (3)(2) = 6
F2 df = (3, 6)
F tabular = 4.76
            Machines
Day       A     B     C    Total
1        17    22    20     59
2        18    20    21     59
3        21    24    23     68
4        18    23    17     58
Total    74    89    81    244
$$SST = 289 + 324 + 441 + 324 + 484 + 400 + 576 + 529 + 400 + 441 + 529 + 289 - \frac{(244)^2}{12}$$
= 5026 − 4961.33
= 64.67
$$SSC = \frac{74^2 + 89^2 + 81^2}{4} - 4961.33$$
$$= \frac{5476 + 7921 + 6561}{4} - 4961.33$$
$$= \frac{19{,}958}{4} - 4961.33$$
= 4989.50 − 4961.33
= 28.17
$$SSR = \frac{59^2 + 59^2 + 68^2 + 58^2}{3} - 4961.33$$
$$= \frac{3481 + 3481 + 4624 + 3364}{3} - 4961.33$$
$$= \frac{14{,}950}{3} - 4961.33$$
= 4983.33 − 4961.33
= 22
SSE = 64.67 − 28.17 − 22 = 14.50
Step 5. Decision
For Treatments:
Since the computed F1 = (28.17/2) / (14.50/6) = 5.83 exceeds the tabular value of F1 = 5.14, Ho is rejected.
For Blocks:
Since the computed F2 = (22/3) / (14.50/6) = 3.03 does not exceed the tabular F2 = 4.76, Ho is not rejected.
In other words, we conclude that there is a significant difference in the average output among the
three machines, and no significant difference in average output among the days.
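The sums of squares and F ratios for this machines-by-days example can be recomputed with a short script. This is a minimal sketch in plain Python, not part of the original lesson; the variable names are illustrative, and the tabular F values (5.14 and 4.76) are copied from the text. With unrounded intermediates, the treatment F comes out near 5.83 and the block F near 3.03.

```python
# Output data: rows are days (blocks), columns are machines A, B, C (treatments).
output = [
    [17, 22, 20],
    [18, 20, 21],
    [21, 24, 23],
    [18, 23, 17],
]
F_TAB_TREATMENTS = 5.14  # from the text, df = (2, 6)
F_TAB_BLOCKS = 4.76      # from the text, df = (3, 6)

r = len(output)      # number of rows (days)
c = len(output[0])   # number of columns (machines)
N = r * c

grand_total = sum(sum(row) for row in output)
correction = grand_total ** 2 / N                      # (sum X)^2 / N

sst = sum(x * x for row in output for x in row) - correction
row_totals = [sum(row) for row in output]
col_totals = [sum(row[j] for row in output) for j in range(c)]
ssr = sum(t ** 2 for t in row_totals) / c - correction  # each row has c values
ssc = sum(t ** 2 for t in col_totals) / r - correction  # each column has r values
sse = sst - ssr - ssc

mse = sse / ((r - 1) * (c - 1))
f_treatments = (ssc / (c - 1)) / mse
f_blocks = (ssr / (r - 1)) / mse

print(f"SST={sst:.2f} SSC={ssc:.2f} SSR={ssr:.2f} SSE={sse:.2f}")
print(f"F treatments={f_treatments:.2f} reject Ho: {f_treatments > F_TAB_TREATMENTS}")
print(f"F blocks={f_blocks:.2f} reject Ho: {f_blocks > F_TAB_BLOCKS}")
```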
Exercise 14.
Within Col.
Total 400 30
Total
Within Col.
Total 260
Between Col 54
Total
5. Monique, owner of a large company, wanted to compare the mean daily output of a particular item
for four plants. For each plant, a random sample of 4 days gave the data listed in the following table.
Do the sample data indicate a difference in the population means for the four plants? Use a 5% level of
significance.
A B C D
28 21 23 16
17 16 14 24
17 11 12 12
18 13 10 14
A B C
1.9 2.3 2.8
2.3 2.7 2.8
2.8 3.2 2.9
2.4 2.8 3.5
2.5 2.9 3
2.5 2.9