DSBA SMDM Project - Anula
Problem 1: Cold Storage started its operations in Jan 2016. They are in the business of storing
Pasteurized Fresh Whole or Skimmed Milk, Sweet Cream, Flavoured Milk Drinks. To ensure
that there is no change of texture, body appearance, separation of fats the optimal temperature to
be maintained is between 2º - 4º C. In the first year of business, they outsourced the plant
maintenance work to a professional company with stiff penalty clauses. It was agreed that if it
was statistically proven that the probability of temperature going outside the 2º - 4º C during the
one-year contract was above 2.5% and less than 5%, then the penalty would be 10% of the AMC
(annual maintenance contract) fee. If it exceeded 5%, then the penalty would be 25% of the AMC
fee. The average temperature data at the date level is given in the file
“Cold_Storage_Temp_Data_.csv”
1. Find mean cold storage temperature for Summer, Winter, and Rainy Season. (7 marks)
2. Find the overall mean for the full year. (7 marks)
3. Find Standard Deviation for the full year. (7 marks)
4. Assume Normal distribution, what is the probability of temperature having fallen below 2º C?
(7 marks)
5. Assume Normal distribution, what is the probability of temperature having gone above 4º C?
(7 marks)
6. What will be the penalty for the AMC Company? (5 marks)
Dataset to be used for Problem 1: Cold_Storage_Temp_Data_.csv
Solution:
The dataset has 4 variables: Season, Month, Date and Temperature. Season and Month are categorical
columns, Date is an integer and Temperature is a float.
Descriptive Statistics for the dataset:
The Season column has 3 unique values, of which Winter occurs most often. The mean Temperature is
3.002 °C with a standard deviation of 0.465 °C.
Check for Null values
Note: The Date column has been dropped from the dataset as it was not significant for the analysis.
From the above results, it is evident that there are no null values in the dataset.
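The column-type and null checks described above can be sketched with pandas as follows. This is a minimal illustration, not the original notebook code: the four rows are placeholder values standing in for the real file, and the month labels are assumed rather than taken from the data.

```python
import pandas as pd

# Placeholder rows for illustration only; the real data would be loaded with
# pd.read_csv("Cold_Storage_Temp_Data_.csv"). Month labels are assumptions.
df = pd.DataFrame({
    "Season": ["Winter", "Winter", "Summer", "Rainy"],
    "Month": ["Jan", "Feb", "May", "Jul"],
    "Date": [1, 2, 3, 4],
    "Temperature": [2.4, 2.8, 3.5, 3.1],
})

# Drop the Date column, as noted above, before analysing the rest.
df = df.drop(columns=["Date"])

column_types = df.dtypes          # data type of each remaining column
null_counts = df.isnull().sum()   # number of missing values per column

print(column_types)
print(null_counts)
```

A null count of zero in every column corresponds to the "no null values" conclusion.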
1. Find mean cold storage temperature for Summer, Winter, and Rainy Season. (7 marks)
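One way to obtain the seasonal means is a pandas group-by on the Season column. The sketch below assumes the column names listed in the dataset description; the helper name seasonal_mean_temperature and the sample rows are hypothetical and are not the real readings.

```python
import pandas as pd

def seasonal_mean_temperature(df):
    """Return the mean Temperature for each value of Season."""
    return df.groupby("Season")["Temperature"].mean()

# Placeholder rows for illustration only, not values from the CSV file.
sample = pd.DataFrame({
    "Season": ["Summer", "Summer", "Winter", "Winter", "Rainy", "Rainy"],
    "Temperature": [3.0, 3.4, 2.6, 2.8, 3.0, 3.2],
})

means = seasonal_mean_temperature(sample)
print(means)
```

The full-year mean and standard deviation follow the same pattern with df["Temperature"].mean() and df["Temperature"].std(); note that pandas computes the sample standard deviation (ddof=1) by default.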
4. Assume Normal distribution, what is the probability of temperature having fallen below 2º C? (7
marks)
With mean 3.002 and standard deviation 0.465, z = (2 - 3.002) / 0.465 ≈ -2.15. Using the z-table, the
probability of the temperature having fallen below 2 °C is approximately 0.015.
5. Assume Normal distribution, what is the probability of temperature having gone above 4º C? (7
marks)
With mean 3.002 and standard deviation 0.465, z = (4 - 3.002) / 0.465 ≈ 2.15. Using the z-table, the
probability of the temperature having gone above 4 °C is approximately 0.015.
Total probability of the temperature going outside the range = probability of the temperature having
gone above 4 °C + probability of the temperature having fallen below 2 °C = 0.015 + 0.015 = 0.03,
i.e. about 3%.
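The z-table lookups can be cross-checked in code. The sketch below uses Python's standard-library statistics.NormalDist in place of a z-table, with the rounded mean (3.002) and standard deviation (0.465) reported in the descriptive statistics; the variable names are illustrative.

```python
from statistics import NormalDist

# Normal model of daily temperature, using the rounded mean and standard
# deviation reported in the descriptive statistics.
temperature = NormalDist(mu=3.002, sigma=0.465)

p_below_2 = temperature.cdf(2.0)        # P(T < 2)
p_above_4 = 1.0 - temperature.cdf(4.0)  # P(T > 4)
p_outside = p_below_2 + p_above_4       # P(T outside the 2-4 range)

print(round(p_below_2, 4), round(p_above_4, 4), round(p_outside, 4))
```

Both tail probabilities come out between 0.015 and 0.016, so the total lies a little above 3%, within the 2.5% to 5% band.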
Since the probability of the temperature going outside 2 °C - 4 °C during the one-year contract is above
2.5% and less than 5%, the penalty is 10% of the AMC fee.
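The penalty clause can be written as a small lookup. The function name amc_penalty_rate is hypothetical, and the treatment of exactly 2.5% and exactly 5% is an assumption, since the problem statement does not say which band those boundary values fall in.

```python
def amc_penalty_rate(p_outside):
    """Return the penalty as a fraction of the AMC fee.

    p_outside is the probability of the temperature leaving the 2-4 range.
    Assumption: exactly 2.5% attracts no penalty and exactly 5% attracts
    the 10% penalty; the contract wording leaves both cases open.
    """
    if p_outside > 0.05:
        return 0.25
    if p_outside > 0.025:
        return 0.10
    return 0.0

# Total out-of-range probability from the z-table values: 0.015 + 0.015
print(amc_penalty_rate(0.03))
```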
Problem 2: In Mar 2018, Cold Storage started getting complaints from their clients that they have been
getting complaints from end consumers of the dairy products going sour and often smelling. On getting
these complaints, the supervisor pulls out data of the last 35 days’ temperatures. As a safety measure,
the Supervisor has been vigilant to maintain the mean temperature 3.9º C or below.
Assume 3.9º C as the upper acceptable mean temperature and at alpha = 0.1 do you feel that there is a
need for some corrective action in the Cold Storage Plant or is it that the problem is from the
procurement side from where Cold Storage is getting the Dairy Products. The data of the last 35 days is
in “Cold_Storage_Mar2018_.csv”
1. Which Hypothesis test shall be performed to check if corrective action is needed at the cold storage
plant? Justify your answer. (6 marks)
2. State the Hypothesis and do the necessary calculations to accept or reject the corresponding null
hypothesis. (8 marks)
Solution:
1. Which Hypothesis test shall be performed to check if corrective action is needed at the cold storage
plant? Justify your answer. (6 marks)
A one-sample, one-tailed t-test has to be performed to check whether corrective action is needed at the
cold storage. The t-test is appropriate because the population standard deviation is not given and has to
be estimated from the sample.
2. State the Hypothesis and do the necessary calculations to accept or reject the corresponding null
hypothesis. (8 marks)
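Null Hypothesis (H0): Mean temperature <= 3.9 °C (if the null hypothesis is not rejected, no corrective
action at the cold storage is needed)
Alternative Hypothesis (H1): Mean temperature > 3.9 °C (if the null hypothesis is rejected, corrective
action at the cold storage is needed)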
We do not know the population standard deviation and n = 35. So we use the t distribution and the
tSTAT test statistic.
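scipy.stats.ttest_1samp calculates the t test for the mean of one sample given the sample observations
and the expected value in the null hypothesis. By default this function returns the t statistic and the
two-tailed p value. Since the alternative hypothesis here is one-sided, the two-tailed p value is halved
when the t statistic is positive.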
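A minimal sketch of this test with scipy is shown below. The helper name one_tailed_ttest_greater and the ten sample values are hypothetical placeholders; they are not the 35 readings from Cold_Storage_Mar2018_.csv. The helper converts the default two-tailed p value into the one-tailed p value for H1: mean > 3.9.

```python
from scipy import stats

def one_tailed_ttest_greater(sample, mu0):
    """One-sample t-test of H0: mean <= mu0 against H1: mean > mu0.

    Returns the t statistic and the one-tailed p value, derived from the
    two-tailed p value that ttest_1samp returns by default.
    """
    t_stat, p_two_tailed = stats.ttest_1samp(sample, popmean=mu0)
    if t_stat > 0:
        p_one_tailed = p_two_tailed / 2
    else:
        p_one_tailed = 1 - p_two_tailed / 2
    return t_stat, p_one_tailed

# Placeholder temperatures for illustration only.
sample = [3.8, 4.0, 4.1, 3.9, 4.2, 4.0, 3.9, 4.1, 4.0, 4.2]
alpha = 0.1

t_stat, p_value = one_tailed_ttest_greater(sample, mu0=3.9)
reject_null = p_value < alpha
print(t_stat, p_value, reject_null)
```

In SciPy 1.6 and later, passing alternative="greater" to ttest_1samp gives the one-tailed p value directly.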
We have evidence to reject the null hypothesis since the p value is less than the level of significance
(alpha = 0.1).
Since the null hypothesis is rejected, there is statistical evidence that the mean temperature exceeds
3.9 °C, so the cold storage is the more likely cause of the dairy products going sour and smelling. There
is a need for corrective action in the Cold Storage Plant.