Chapter 2-1
Chapter 2-1
Chapter 2-1
EXPERIMENTS
16 16.5 17 17.5 18
yi (kgf/cm2)
16.76 17.92
(0.316) (0.247)
2. Scatter Plots
Two dimensional plot of individual data values for
measurements on pairs of variables.
Example
Tension Bond Strength for the Modified formulation
in order of data collection
Scatter Plot for Tension Bond Strength
17.7
17.5
Modified Mortar
17.3
17.1
16.9
16.7
16.5
16.3
0 2 4 6 8 10
Run Order
3. Histograms
Alternatives to dot diagrams and show how
often the data occur in various intervals of the
measured variable.
Example
Job CPU times (in seconds), n = 25
10
yi (sec’s)
0
Interpreting a Histogram:
QL QU
yi
X X *
Inner Fences
Outer Fences
Outliers:
Errant observations (i.e., unusually large or
small relative to the other data values)
which lie outside of the range of data
values under study.
Sources of outliers:
1. Measurement is observed, recorded or
entered into a computer
incorrectly.
2. Measurement comes from a different
population.
3. Measurement is correct but represents a
rare event
Studies Based on a Single Sample :
Parameter Estimation:
Using information from a single random sample to
estimate the value of an unknown parameter:
Point Estimator:
A rule or formula used to calculate a sample
statistic which corresponds to the unknown
parameter. e.g.,
1 n
n 1
yi y ̂
Interval Estimator :
ˆ
An interval between two values and L ˆUwill enclose
that
the parameter with a specified probability (Confidence
level),
i.e,
P (ˆL ˆU ) 1
1-
Where,
ˆL is the lower confidence limit constructed such that
(/2) 100 % of the sampling distribution of lies to itsˆ left.
is the upper confidence limit constructed such that
ˆU 100 % of the sampling distribution of lies to its right.
(/2)
ˆ
is the confidence level of the
interval.
{ = 0.05 for most practical applications}
Note:
The higher the confidence level, the wider
the interval !!
Estimating the mean ():
Case I: Known
z/2 = 1.96 1-
z
The interval is given by:
-t/2 t/2
The interval is given by:
9.74 < < 10.26
Conclusion:
“At the 95% confidence level, we expect the
mean content of all such containers to lie
within 9.74 and 10.26 liters.”
Estimation of the Variance (2)
(n 1) S
2
(n 1) S2
P( 2
) 1
2
2
1 2
2
Example:
To estimate the variance of fill at a cannery,
10 cans were selected at random and their
contents are weighed. The following data
were obtained ( in ounces): 7.96, 7.90, 7.98,
8.01, 7.97, 7.96, 8.03, 8.02, 8.04, 8.02.
Construct a 90% confidence interval
for estimating the variance.
From sample data,
s = 0.0430 f( )
Using Table of Chi Square
At (1- = 0.90,
and = n-1= 9
and
Conclusion
Note:
0.0314<< 0.0707
Hypotheses Testing:
Using sample evidence to reject or accept a
claim regarding the value of a population
parameter.
Hypotheses H0
Decision True False
Accept No error Type II error
Reject Type I error No error
Definitions:
• The probability of making type I error ():
“Is the conditional probability of rejecting Ho, when
Ho is true.”
Reject Reject
Accept
L U
•A one-tailed test:
A test of hypothesis where the alternative is
one-directional:
Upper-tailed
Ho : < o Vs. H1: > o
Reject
Accept
U
Lower-tailed
Reject
Accept
L
TESTING A POPULATION MEAN
Case I: Known or n >= 30
_
y o
Zc =
n
Example
A manufacturer of sports equipment has
developed a new synthetic fishing line that
he claims to have a mean breaking strength
of 8 kg with standard deviation of 0.5. A
random sample of 20 lines is tested and
found to have a mean 7.8 kg.
• Is there sufficient evidence to reject the
manufacturer claim at = 0.01?
Reject Accept
(1-) = 0.98422
TESTING A POPULATION MEAN
Case II: unknown and n < 30
_
tc = y o
S n
Example
In the fishing line example, suppose is
unknown and sample data are:
n =20, y =7.8 kg and S=0.6 kg.
Assuming that the breaking strength is
normally distributed, test the manufacturer’s
claim at =0.01.
Ho : ≥ 8 Vs H1: < 8
lower-tailed test
Rejection region
using the t-distribution Table:
at = 0.01, and
= (n-1)=19 Reject
Accept
t0.01= -2.539 t
t0.01
From sample data:
y o 7.8 8.0
t 1.49
S n 0.6 20
-1.49
Reject
Accept
t
-2.539
Conclusion
Since t > t , we fail to reject Ho ,
and accept the manufacturer’s claim.
Testing the population variance 2
( n 1) S 2
2
o 2
Example
A manufacturer of car batteries claims that
the life of his batteries is approximately
normally distributed with standard deviation
equal to 0.9 years. A random sample of 10
such batteries has a standard deviation of
1.2 years. Is there sufficient evidence to
conclude that > 0.9 years? Use a 5% level
of significance.
Ho: H1:2=(0.9)2 Vs. 2 >(0.9)2
Upper tailed test
At =(n-1)=9, f( )
and = 0.05 Reject
Accept
2 = 16.92
from sample data:
( n 1) S 2 9(1.2) 2
2 16.0
o2 (0.9) 2
Conclusion:
Since 2 < 16.92, accept Ho and
reject the claim that > 0.9 years.