Basuc Statshi

Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 20

Q1) Identify the Data type for the Following:

Activity Data Type


Number of beatings from Wife Discrete
Results of rolling a dice Discrete
Weight of a person Continuous
Weight of Gold Continuous
Distance between two places Continuous
Length of a leaf Continuous
Dog's weight Continuous
Blue Color Discrete
Number of kids Discrete
Number of tickets in Indian railways Discrete
Number of times married Discrete
Gender (Male or Female) Discrete

Q2) Identify the Data types, which were among the following
Nominal, Ordinal, Interval, Ratio.
Data Data Type
Gender nominal
High School Class Ranking ordinal
Celsius Temperature Ratio(considering world wide)
Weight nominal
Hair Color nominal
Socioeconomic Status Ratio (considering world wide)
Fahrenheit Temperature Ratio (considering world wide)
Height nominal
Type of living accommodation ordinal
Level of Agreement ordinal
IQ(Intelligence Scale) interval
Sales Figures Ratio (considering world wide)
Blood Group ordinal
Time Of Day Nominal
Time on a Clock with Hands Ordinal
Number of Children Nominal
Religious Preference Nominal
Barometer Pressure Ratio (considering world wide)
SAT Scores Interval
Years of Education Interval

Q3) Three Coins are tossed, find the probability that two heads and one tail are
obtained?
Ans: 3 coins tossed, the possible outcomes are : 8
H H H, H H T, H T H, H T T, T H H, T H T, T T H, T T T
Interested events: 3
Probability : 3/8

Q4) Two Dice are rolled, find the probability that sum is
Ans: possible outcomes are :36
(1,1) (1,2) (1,3) (1,4) (1,5) (1,6)
(2,1) (2,2) (2,3) (2,4) (2,5) (2,6)
(3,1) (3,2) (3,3) (3,4) (3,5) (3,6)
(4,1) (4,2) (4,3) (4,4) (4,5) (4,6)
(5,1) (5,2) (5,3) (5,4) (5,5) (5,6)
(6,1) (6,2) (6,3) (6,4) (6,5) (6,6)
a) Equal to 1------------------------->0/36
b) Less than or equal to 4---------->6/36=1/6
c) Sum is divisible by 2 and 3--->6/36=1/6(the number must be multiple of 6)

Q5) A bag contains 2 red, 3 green and 2 blue balls. Two balls are drawn at
random. What is the probability that none of the balls drawn is blue?

Ans: possible outcomes are: 7c2= 21


Interested events: 5c2=10(2c2+3c2+2c1.3c1=1+3+6)
Probability=5c2/7c2=10/21
Q6) Calculate the Expected number of candies for a randomly selected child
Below are the probabilities of count of candies for children (ignoring the nature of
the child-Generalized view)
CHILD Candies count Probability
A 1 0.015
B 4 0.20
C 3 0.65
D 5 0.005
E 6 0.01
F 2 0.120
Child A – probability of having 1 candy = 0.015.
Child B – probability of having 4 candies = 0.20
Ans: EV=sigma x*p(x)
=(1*0.015 + 4*0.20 + 3*0.65 + 5*0.005 + 6*0.01 + 2*0.120)=3.09
Q7) Calculate Mean, Median, Mode, Variance, Standard Deviation, Range &
comment about the values / draw inferences, for the given dataset
- For Points,Score,Weigh>
Find Mean, Median, Mode, Variance, Standard Deviation, and Range
and also Comment about the values/ Draw some inferences.

Ans:
Mean=sum by total
Median=middle most value of a sorted data
Mode= most frequency of data

Points Score Weight


Mean 3.596563 3.21725 17.84875
Median 3.695 3.325 17.71
Mode 3.92 3.44 17.02
Variance 0.285881 0.957379 3.193166
SD 0.534679 0.978457 1.786943
range 2.17 3.911 8.4

Q8) Calculate Expected Value for the problem below


a) The weights (X) of patients at a clinic (in pounds), are
108, 110, 123, 134, 135, 145, 167, 187, 199
Assume one of the patients is chosen at random. What is the Expected
Value of the Weight of that patient?
Ans: EV=sigma x.p(x)
Ev=x1.p(x1) + x2.p(x2) + ….+ xn.p(xn)
Here EV=1/9(108+110+123+134+135+145+167+187+199)
=1308/9=145.33
Q9) Calculate Skewness, Kurtosis & draw inferences on the following data
Cars speed and distance

SP and Weight(WT)
Ans: 9.a )
By using histograms in R I can concluded that
Skewness of car speed : slight left/negative skewed
Skewness of car distance : right/positive skewed

9.b)
By using histograms in R I can concluded that
Skewness of sp : right/positive skewed
Skewness of WT : slight normal distributed

Q10) Draw inferences about the following boxplot & histogram


Ans: histogram right/positive skewed
Boxplot right/positive skewed with outliers

Q11) Suppose we want to estimate the average weight of an adult


male in Mexico. We draw a random sample of 2,000 men from a
population of 3,000,000 men and weigh them. We find that the
average person in our sample weighs 200 pounds, and the standard
deviation of the sample is 30 pounds. Calculate 94%,98%,96%
confidence interval ?
Ans: given that
sample size(n)=2000,

sd(s)=30,

average(xbar)=200
cI=point estimation +/- margin of error or CI=POE +/- MOE

confidence interval at 94% :

confidence interval at 96% :


confidence interval at 98% :
Q12) Below are the scores obtained by a student in tests

34,36,36,38,38,39,39,40,40,41,41,41,41,42,42,45,49,56
1) Find mean, median, variance, standard deviation.
2) What can we say about the student marks?
Q13) What is the nature of skewness when mean, median of data are equal?
Ans: normally distributed
Q14) What is the nature of skewness when mean > median ?
Ans: positive/right skewness
Q15) What is the nature of skewness when median > mean?
Ans: negative/lest skewness
Q16) What does positive kurtosis value indicates for a data ?
Ans: higher peakedness and lower tail
Q17) What does negative kurtosis value indicates for a data?
Ans: Lower peakedness and higher tail
Q18) Answer the below questions using the below boxplot visualization.

What can we say about the distribution of the data?


Ans: median > mean, I.e. left skew
What is nature of skewness of the data?
Ans: left skew
What will be the IQR of the data (approximately)?
ans: (q3-q1)=18-10=8
Q19) Comment on the below Boxplot visualizations?

Draw an Inference from the distribution of data for Boxplot 1 with respect
Boxplot 2.
Ans: both the boxplots approximately normally distributed and the median of
both box plots are same point (i.e.262.5)

Q 20) Calculate probability from the given dataset for the below cases

Data _set: Cars.csv


Calculate the probability of MPG of Cars for the below cases.
MPG <- Cars$MPG
a. P(MPG>38)
b. P(MPG<40)
c. P (20<MPG<50)
ans:
Q 21) Check whether the data follows normal distribution
a) Check whether the MPG of Cars follows Normal Distribution
Dataset: Cars.csv
Ans:
b) Check Whether the Adipose Tissue (AT) and Waist Circumference(Waist)
from wc-at data set follows Normal Distribution
Dataset: wc-at.csv
Ans:
At plots

Waist plots
Q 22) Calculate the Z scores of 90% confidence
interval,94% confidence interval, 60% confidence interval

Q 23) Calculate the t scores of 95% confidence interval,


96% confidence interval, 99% confidence interval for
sample size of 25
Ans: 22 and 23 in image
Q 24) A Government company claims that an average light
bulb lasts 270 days. A researcher randomly selects 18
bulbs for testing. The sampled bulbs last an average of 260
days, with a standard deviation of 90 days. If the CEO's
claim were true, what is the probability that 18 randomly
selected bulbs would have an average life of no more than
260 days
Hint:

rcode  pt(tscore,df)
df  degrees of freedom

ans:
mu=270

n=18
xbar=260
sigma=90
z=x-mu/sigma=260-270/90=-0.11

pnorm(-0.11)=0.4562

p=45%

T=x-mu/s/sqrt(n)

=260-270/90/sqrt(18)=-0.4714

Pt-(0.4714, 17)=0.3216
P=32%

You might also like