Chapter 5 (Part Ii)
Chapter 5 (Part Ii)
Chapter 5 (Part Ii)
Statistical Inference
Prepared by: Nur Liyana Mohamed Yousop
ANOVA
ANALYSIS OF VARIANCE (ANOVA)
One Way ANOVA
ANOVA derives its name from the fact that we are analyzing variances in
the data (analysis of variances).
If we conduct
multiple samples, it
will have a
However, using t- compounded effect
test would not be on the error rate of
reliable in cases result.
When we have where there are
only two samples, more than 2
t-test and ANOVA samples
give the same
results.
ANALYSIS OF VARIANCE (ANOVA)
One Way ANOVA
The variable of interest is called a factor. In this example, the factor is the
educational level, and we have three categorical levels of this factor,
college graduate, graduate degree, and some college.
SOLUTION 1
Applying the Excel ANOVA Tool
0.025
SOLUTION 1
Applying the Excel ANOVA Tool
CHI-SQUARE
NON-PARAMETRIC METHOD
INTRODUCTION
σ2 is
unknown/known Nominal @ Categorical
scale data
(E.g. Gender, State of
Birth, Brand)
Case III
NON-PARAMETRIC
n<30 Population is not
normal METHOD
CHI-SQUARE TEST FOR INDEPENDENCE
Test for
• H0: two categorical variables independence
are independent of two
• H1: two categorical variables categorical
are dependent variables.
EXAMPLE 2
Independence and Marketing Strategy
Energy Drink Survey data. A key marketing question is whether the proportion
of males who prefer a particular brand is no different from the proportion of
females.
If gender and brand preference are indeed independent, we would expect
that about the same proportion of the sample of female students would also
prefer brand 1.
If they are not independent, then advertising should be targeted differently
to males and females, whereas if they are independent, it would not matter.
CHI-SQUARE TEST CALCULATIONS
Step 1
◦ Using a cross-tabulation of the data, compute the expected frequency if
the two variables are independent.
CHI-SQUARE TEST CALCULATIONS
Step 2
◦ Compute a test statistic, called a chi-square statistic, which is the sum
of the squares of the differences between observed frequency, fo, and
expected frequency, fe, divided by the expected frequency in each cell:
CHI-SQUARE DISTRIBUTION
Step 3
◦ Compare the chi-square statistic for the level of significance a to the
critical value from a chi-square distribution with (r – 1)(c – 1) degrees
of freedom, where r and c are the number of rows and columns in the
cross-tabulation table, respectively.
The Excel function CHISQ.INV.RT(probability, deg_ freedom) returns the
value of C2 that has a right-tail area equal to probability for a specified
degree of freedom.
Result
Test statistic = 6.49
d.f. = (2 – 1)(3 – 1) = 2
Critical value =
CHISQ.INV.RT(0.05,2) = 5.99
p-value =
CHISQ.TEST(F6:H7,F12:H13) =
0.0389
Reject H0
Test statistic
END OF CHAPTER 5