Lab Manual 2016-Ps
Lab Manual 2016-Ps
Lab Manual 2016-Ps
Department of Mathematics
NED University of Engineering & Technology
WORK BOOK
MT-330 Applied Probability & Statistics
Name
: __________________________________________
Roll No
: __________________________________________
Enrollment No
: __________________________________________
Section
: __________________________________________
Group
: __________________________________________
Batch
: __________________________________________
Certificate
Date: _____________
INDEX
Name: _____________________________________ Roll No: ___________________
GRADE /
S.NO.
DATE
REMARKS
PRACTICAL
A
01
Introduction to Minitab
02
03
04
05
06
07
08
09
10
11
12
SIGNATURE
PREFACE
PURPOSE:
The basic purpose of this practical workbook is to provide the necessary computational data
analysis approach in statistics using Minitab for the students of engineering. Since practical
covers wide variety of different and interesting statistical applications therefore it is hope that
this workbook would be of great value for the engineering graduates.
PRE-REQUISITE:
OBJECTIVE:
To provide reader with a working knowledge of statistics, This workbook is designed for
students having numerous applications in engineering. The material in this workbook consists of
sketch of functions, plotting of variables, basic statistics, regression analysis, correlation
analysis, probability distributions & hypothesis testing. This workbook is designed to be a clear,
readable and even enjoyable introduction to the statistical concept that have become an important
part of every engineering problems. Moreover statistical measures such as psychological and
educational testing and increasingly important application regarding probability and statistics are
also discussed and streamlined with engineering disciplines.
Before the widespread availability of powerful computers and prepackaged statistical software,
tedious manual computations were routine in statistics courses. Today, computers have
revolutionized data analysis, which is a fundamental task of statistics. Packages such as Minitab
allow the computer to automate calculations and graphs. Minitab can perform a wide variety of
tasks, from the construction of graphical and numerical summaries for a set of data to the more
complicated statistical procedures and tests. Minitab will free you from mathematical
calculations and allow you to concentrate more on the analysis of data and the interpretation of
the results.
MINITAB WINDOWS
Minitab has six types of windows which can all be open at the same time: Session, Data, Help,
Info, History and Graph. A window can be made active by selecting it. For example, to make the
Session window active, place the mouse pointer on the window and just click it, if it is visible on
your screen. The Session window and Data window are used most frequently.
Session Window
The Session window is used to enter Minitab commands and to display output. It has a menu bar
on top and the usual Minitab prompt MTB> on the left. To enter a command. If MTB> prompt is
not displayed in the session window, choose Editor Enable Command Language.
The Session window scrolls as more output goes into it. You can scroll up and down to see
various parts of your output.
Data Window
The Data window, sometimes called the worksheet, displays active data entered by you or
produced by the computer. You can enter, edit and view the data. The current cell is highlighted;
others can be selected by clicking the mouse on them or using the arrow keys. The scroll bars are
used to view different parts of the worksheet.
MINITAB COMMANDS
Commands tell Minitab what to do. You can issue commands in Minitab by choosing commands
from the menus or by typing session commands directly into the Session window. Session
commands are used throughout this workbook.
However, you can also type session commands in the Session window by turning on the
MTB> command prompt.
A session command consists of one main command and may have one or more
subcommands. Arguments and symbols may also be included in the command.
Subcommands, which further define how the main command should be carried out, are
usually optional. Arguments specify data characteristics.
To execute a command, type the main command followed by any arguments. If the
command has subcommands, end the command line with a semicolon. Type
subcommands at the
a period (.) after the last subcommand. Press <Enter> to execute a command.
Commands and column names are not case-sensitive; you can type them in lowercase,
uppercase, or any combination. You can abbreviate any session command or
subcommand by using the first four letters.
Arguments specify data characteristics, such as location or titles. They can be variables
(columns or constants) as well as text strings or numbers. Enclose variable names in
single quotation marks (for example, HISTOGRAM Salary). In arguments, variable
names and variable numbers can be used interchangeably. For example, DESCRIBE C1
C2 and DESCRIBE Sales C2 do the same thing if C1 is named Sales.
You can abbreviate a consecutive range of columns, stored constants, or matrices with a
dash. For example, PRINT C2-C5 is equivalent to PRINT C2 C3 C4 C5. You can use a
stored constant (such as K20) in place of any constant. You can even use stored constants
to form a range such as K20:15, which represents all integers from the value of K20 to
15.
Command Prompts
The prompts that appear in the Session window help you know what kind of input Minitab
expects. There are four different prompts:
MTB>
Command prompt; type the session commands here and press Enter.
SUBC>
Subcommand prompt; type the subcommands here or type ABORT to cancel the
entire command.
DATA>
Data prompt; enter data here. To finish entering data and return to the MTB>
prompt, type END and press Enter.
CONT>
Continuation prompt; if the command from your previous line ends with the
continuation symbol &, Minitab displays CONT> on the next line so you can
enter the rest of the command or data.
S.NO.
PRACTICAL
Introduction to Minitab
01
Ungrouped Data
Graphical Representation, Measures of Central Tendency & Dispersion
02
Graphical representation, simple bar diagram, pie chart, stem-leaf display, boxplot, A.M, G.M, H.M, median, mode, range, variance and
standard deviation.
Grouped Data
Graphical Representation, Measures of Central Tendency & Dispersion
03
Introduction to Global-Macro
04
06
07
08
PRACTICAL NO. 01
Introduction to Minitab
Note: Show all necessary session commands, results and interpretations (where required)
1. The following table gives some facts on 10 major countries.
Country
Population
Area
Pakistan
133500000
307374
China
1217600000
3691500
United States
265200000
3615278
Taiwan
21400000
13900
Netherlands
15500000
16133
Egypt
63700000
386661
Russia
147700000
6592850
Sweden
8800000
170250
Bangladesh
119800000
55598
Indonesia
949600000
1222244
i.
ii.
iii.
Country
Population
Area
Poland
38600000
120728
x1 1
2. Given
x2 2
y1 6
y2 7
x3 3
x4 4
x5 5
y3 8
y4 9
y5 10
Evaluate the following expressions using Minitab commands and store in constants
K1, K2K7 and print.
i.
i 1
i 1
xi y i
v.
i 1
x
i 1
yi
i 1
log 10 ( xi )
iv.
(5x
i 1
vi.
(x
vii.
exp ( xi )
iii.
2 xi 5 y i
ii.
yi )
i 1
yi )
5
i 1
PRACTICAL NO. 02
Ungrouped Data, Graphical Representations, Measures of
Central Tendency & Dispersion
1. The following table gives some facts on 10 major countries.
Country
Population
Area
Bangladesh
119800000
55598
China
1217600000
3691500
Egypt
63700000
386661
Indonesia
949600000
1222244
Netherlands
15500000
16133
Pakistan
133500000
307374
Russia
147700000
6592850
Sweden
8800000
170250
Taiwan
21400000
13900
United States
265200000
3615278
i.
ii.
2.
10
15
15
10
20
15
10
20
30
15
25
05
45
22
10
27
25
22
10
30
15
05
08
15
05
36
23
17
15
10
35
48
50
25
03
31
PRACTICAL NO. 03
Grouped Data, Graphical Representation, Measures of Central
Tendency & Dispersion
1.
Following data shows the length of time (in minutes) that customers had to wait before
receiving the information
i.
09
16
19
21
11
16
19
22
12
17
19
22
13
17
20
22
13
17
20
24
14
17
20
24
14
18
21
25
15
18
21
27
15
18
21
29
16
19
21
38
12
17
30
32
Construct stem and leaf plot and then make Frequency distribution by the help of
stem and leaf and then construct Relative and Cumulative Frequency distribution
ii.
PRACTICAL NO. 04
Introduction to Global-Macro
1. Write Global macro for a weekly sales ( in $) given in the table of four different brands to
calculate Arithmetic mean, Geometric mean, Harmonic mean, Range, Variance, Standard
deviation, Mean absolute deviation and Coefficient of variation for each brand and
determine which brand gives more consistent sales.
Brand A
Brand B
Brand C
Brand D
114
116
116
110
110
118
115
117
111
114
108
121
113
115
114
123
20
113
100
150
PRACTICAL NO. 05
Skewness and Kurtosis
1. From the following frequency distribution calculate first four moments about origin, first
four moments about mean , 1 ( measure of skewness ) and , 2 ( measure of kurtosis )
also comments on the shape of distribution.
Class
Interval
Frequency
07 09
10 12
13 15
16 18
19 21
Total
02
08
14
09
01
34
Hint: Formulae for above problem given in formula list at the end of workbook
PRACTICAL NO. 06
Binomial Probability Distribution
1. Obtain Binomial probability distributions for p 0.1 , p 0.5 and p 0.9 with n 10 .
Also graph these probability distributions on same graph.
n
P X x p x (1 p) n x ; x 0,1,2, , n
x
2. Suppose that 60% of all consumers favor Internet shopping. Five consumers are
randomly sampled and the number who favors internet shopping is recorded. Let X be
the no. of consumers who favor this concept.
i.
ii.
iii.
PRACTICAL NO. 07
Poisson Probability Distribution
1. Obtain and plot the Poisson probability and cumulative probability distributions for
2.5
P( X x)
e x
; x 0,1, 2...,
x!
2. A QC inspector at an automobile assembly plant has found that the number of paint
defects on a car has a Poisson distribution with a mean of five defects per car.
i.
Calculate mean and standard deviation of X, the no. of paint defects per car.
ii.
iii.
What is the probability that there are no paint defects? Five or more defects?
Less than 8 defects?
PRACTICAL NO. 08
Normal Probability Distribution
1.
i.
2.
ii.
iii.
i.
ii.
iii.
iv.
75% of all students take less than what time to complete the task?
PRACTICAL NO. 09
Linear Regression Line and Correlation Analysis
1.
The selling prices y and living areas in square feet x of 20 homes are given in the
following table. Write a macro to display the results of following tasks.
Home
Price ($)
Area
Home
Price ($)
Area
86000
870
11
118900
2052
86600
840
12
125000
1590
92000
1032
13
130600
1600
92500
1168
14
139875
2044
93500
1100
15
144400
1916
94000
1430
16
148000
2024
104000
1520
17
156000
1840
104500
1468
18
151900
1684
109900
1160
19
159500
1760
10
111900
1800
20
163000
2260
(a) What is the average square footage and selling price of the homes in the sample?
(b) Construct Scatter plot.
(c) Calculate and interpret correlation coefficient (r)?
(d) Determine least square regression line. y a bx By performing arithmetic operations.
(e) Draw regression line. y a bx And scatter plot on the same graph.
(f) Calculate SSE, SS Regression and SS Total?
(g) Calculate Coefficient of determination and comments?
PRACTICAL NO. 10
Multiple Regression Analysis and Non-Linear Models
1.
Temp
Pressure
Quality
Temp
Pressure
Quality
Temp
Pressure
Quality
80
50
50.8
90
50
63.4
100
50
46.6
80
50
50.7
90
50
61.6
100
50
49.1
80
50
49.4
90
50
63.4
100
50
46.4
80
55
93.7
90
55
93.8
100
55
69.8
80
55
90.9
90
55
92.1
100
55
72.5
80
55
90.9
90
55
97.4
100
55
73.2
80
60
74.5
90
60
70.9
100
60
38.7
80
60
73.0
90
60
68.8
100
60
42.5
80
60
71.2
90
60
71.3
100
60
41.4
2.
The following table gives the size x of the home in square feet and the number of
Kilowatt hours y of electrical usage for each of ten homes during a particular month.
1290
1350
1470
1600
1710
1840
1980
2230
2400
2930
Usage 1182
1172
1264
1493
1571
1711
1804
1840
1956
1954
Size
(a) Construct Scatter plot between size of home and electrical usage.
(b) Fit Quadratic model of the form y b0 b1 x b2 x 2
(c) Predict the monthly electrical usage of a 1600 square foot home.
PRACTICAL NO. 11
One Mean Hypothesis
1. A random sample of 64 bags of white Cheddar popcorn weighed, on average, 5.23
ounces with a standard deviation of 0.24 ounces. Test the hypothesis that = 5.5 ounces
against the alternative hypothesis, < 5.5 ounces at the 0.05 level of significance.
2. According to a dietary study, a high sodium intake may be related to ulcers, stomach
cancer, and migraine headaches. The human requirement for salt is only 220 milligrams
per day, which is surpassed in most single servings of ready-to-eat cereals. If a random
sample of 20 similar servings of certain cereal has a mean sodium content of 244
milligrams and a standard deviation of 24.5 milligrams, does this suggest at the 0.05 level
of significance that the average sodium content for a single serving of such cereal is
greater than 220 milligrams? Assume the distribution of sodium contents to be normal.
4. A machine is producing metal pieces that are cylindrical in shape. A sample of pieces is
taken & the diameter is 1.07, 0.97, 1.03, 1.04, 0.99, 0.98, 0.99, 1.01 & 1.03cms. Test the
hypothesis at 1% level of significance that populations mean of the diameter of pieces
from this machine is greater than 1 cm. assuming an approximately normally distributed?
PRACTICAL NO. 12
Two Mean Hypothesis
1. In a study conducted by the Department of Mechanical Engineering and analyzed by the
Statistics Consulting Center at Virginia Tech, steel rods supplied by two different
companies were compared. Ten sample springs were made out of the steel rods supplied
by each company, and the bounciness was studied. The data are as follows:
Company A:
9.3 8.8 6.8 8.7 8.5 6.7 8.0 6.5 9.2 7.0
Company B:
11.0 9.8 9.9 10.2 10.1 9.7 11.0 11.1 10.2 9.6
Can you conclude that there is virtually no difference in means between the steel rods
supplied by the two companies? Use a P-value to reach your conclusion.
2. Two chemical processes for manufacturing the same product are being compared under
the same conditions. Yield from Process A gives an average value of 96.2 from 35 runs,
and the estimated standard deviation of yield is 2.75. Yield from Process B gives an
average value of 93.3 from 40 runs, and the estimated standard deviation is 3.35. Yields
follow a normal distribution. Is the difference between the mean yields statistically
significant? Use 5% level of significance, and show rejection regions for the difference of
mean yields on a sketch.
Symbol Table
xi
x2
( x)2
xy
C.I
Class interval
C.B
Class boundary
Frequency
A.M
Arithmetic mean
G.M
Geometric mean
H.M
Harmonic mean
C.f
Cumulative frequency
Q1,Q2,Q3
Quartiles
D1,D2,,D9
Deciles
P1,P2,,P99
Percentiles
S.D
Standard deviation
Sample mean
s2
Sample variance
Population mean
Population variance
Estimated variance
c.o.v
Coefficient of variation
Coefficient of Skewness
Coefficient of Kurtosis
P(A)
Probability of an event A
P(A/B)
n!
n factorial
n
c
x
n
p
x
E(x)
E(x2)
Expected value of x2
V(x)
Variance of x
f(x)
Function of x
r2
Coefficient of determination
z/2
Critical value of z
2 / 2
d.f
Degree of freedom
Level of significance
H0
Null hypothesis
H1
Alternative Hypothesis
Formulae List
n
A.M x
x
i 1
n
n
A.M x
i 1
n
f ixi
fi
i 1
log xi
f i log xi
G.M Anti log i 1 n
fi
i 1
n
H .M n
1
i 1 xi
H .M
i 1
fi
fi
i 1
i
n 1
Median
th value
2
n
fi
h
i 1
Median l
C .F
f 2
f m f1
Mode l h
f m f1 f m f 2
n 1
Qi i
th value;
4
h
f
Qi l
i 1,2,3
n
fi
i i 1
4
CF ; i 1,2,3
n 1
Di i
th value;
10
i 1,2,...,9
n
fi
h
i
1
i
Di l
f 10
i 1,2,...,9
CF ;
n 1
Pi i
th value;
100
i 1,2,...,99
n
fi
h
i
1
i
Pi l
f 100
i 1,2,...,99
M .D
x
i 1
mean
n
M .D
CF ;
f
i 1
xi mean
n
f
i 1
Variance
f x
i 1
mean
f
i 1
fx
i 1
n
f
i 1
2
i
mean
x mean
Variance
i 1
x
i 1
2
i
S .D
100%
mean
c.o.v
k '
x
i 1
k
i
, k 1,2,3,4...
k
'
fx
i
i 1
, k 1,2,3,4...
f
i 1
x
i 1
mean
, k 1,2,3,4...
i 1
f i xi mean
n
f
i 1
3
2
3
2
, k 1,2,3,4...
4
22
x mean x E ( x)
S .D
V ( x)
E(x)
V (x)
E xi n
i 1
V xi n 2
i 1
mean
xf ( x
E ( x)
xf ( x)
continuous distributi on
discrete distributi on
x 2 f ( x discrete distributi on
E(x2 ) 2
x f ( x) continuous distributi on
V ( x) E ( x 2 ) E ( x)
y a bx
a y bx
n n
n xi yi xi yi
i1 i1
b i1
2
n
n
2
n xi xi
i 1
i1
n
n n
n xi yi xi yi
i 1
i1 i1
r
n 2 n 2 n 2 n 2
n xi xi n yi yi
i1 i1
i1
i1
n