Comparison of Two Population Mean
Comparison of Two Population Mean
Comparison of Two Population Mean
Means
• Goal: To compare 2 populations/treatments
wrt a numeric outcome
• Sampling Design: Independent Samples
(Parallel Groups) vs Paired Samples
(Crossover Design)
• Data Structure: Normal vs Non-normal
• Sample Sizes: Large (n1,n2>20) vs Small
Independent Samples
• Units in the two samples are different
• Sample sizes may or may not be equal
• Large-sample inference based on Normal
Distribution (Central Limit Theorem)
• Small-sample inference depends on
distribution of individual outcomes (Normal
vs non-Normal)
Parameters/Estimates
(Independent Samples)
• Parameter:
• Estimator: Y 1 Y 2
S12 S 22
•
Estimated standard error: n1 n2
• Shape of sampling distribution:
– Normal if data are normal
– Approximately normal if n1,n2>20
– Non-normal otherwise (typically)
Large-Sample Test of
H A : 1 2 0
– 2-sided alternative
– 2-sided alternative H A : 1 2 0
• P=2P( z |zobs| ) (From the std. Normal distribution)
• Rule:
y 1
y 2 z / 2
S12 S 22
n1 n2
Large-Sample (1-100% Confidence
Interval for
• For 95% Confidence Intervals, z.025=1.96
• Confidence Intervals and 2-sided tests give
identical conclusions at same -level:
– If entire interval is above 0, conclude
– If entire interval is below 0, conclude
– If interval contains 0, do not reject ≠
Example: Vitamin C for Common Cold
• Outcome: Number of Colds During Study Period
for Each Student
• Group 1: Given Placebo
1 1 n1 n2 2
S p2
n1 n2
Small-Sample Test for
Normal Populations
• Decision Rule: (Based on t-distribution with =n1+n2-2 df)
– 1-sided alternative
• If tobs t, ==> Conclude
• If tobs < t ==> Do not reject
– 2-sided alternative
• If tobs t , ==> Conclude
• If tobs -t ==> Conclude
• If -t < tobs < t ==> Do not reject
Small-Sample Test for
Normal Populations
• Observed Significance Level (P-Value)
• Special Tables Needed, Printed by Statistical Software
Packages
– 1-sided alternative
• P=P(t tobs) (From the t distribution)
– 2-sided alternative
• P=2P( t |tobs| ) (From the t distribution)
• If P-Value then reject the null hypothesis
Small-Sample (1-100% Confidence Interval
for Normal Populations
• Confidence Coefficient (1-) refers to the proportion of
times this rule would provide an interval that contains the
true parameter value if it were applied over all
possible samples
• Rule: y 1
y 2 t / 2, 2 1 1
S
p
n1 n2
• Interpretations same as for large-sample CI’s
Small-Sample Inference for
Normal Populations
• Case 2: 12 22
S12 S 22
• Don’t pool variances: Sy y
1 2
n1 n2
n n2
* 1
S2
2
S 22
2
1
n1
n2
n 1 n2 1
1