0% found this document useful (0 votes)
33 views17 pages

STA1010 Applied Class Homework-2023-01

Download as pdf or txt
Download as pdf or txt
Download as pdf or txt
You are on page 1/ 17

School of Mathematics

Semester 1, 2023

STA1010 Statistical Methods for Science


APPLIED CLASS
HOMEWORK EXERCISES

• Most of the applied classes (8) have HOMEWORK associated with


them. This takes the form of some brief questions, contained in this
handout.
• Your answers must be submitted in the appropriate moodle submission
box by 11:55pm on Monday of the week after the associated class.

These 8 homework exercises,


along with your ACTIVE participation in applied classes,
count 5% of the overall assessment in this unit.

• You should read the week’s Applied Class exercises and relevant
lecture notes before attempting the homework so that you know the
context of the questions. It will help! You are expected to be prepared
adequately before coming to the applied class.
• Applied Class 4 and Applied Class 11 do not have specific homework
exercises but involve submissions in the PROJECT that counts a
further 6% of the overall assessment in this unit.
• The applied classes are designed to give you some practice. Remember
that learning is gained by active practice of the concepts described and
explained in the lectures. Also do ask questions in the applied classes.
There are also many questions that you could attempt for yourself in the
textbook. Some suggestions for these are in the Applied Class Manual
STA1010 Statistical Methods for Science

Homework for Applied Class 2 DUE Monday 11:55pm Week 4

Name: …………………………………………….. ID……………….


1. One thousand voters selected at random were asked whether or not they agreed with
a certain proposal by the government. The results of opinion are cross-tabulated by
age in the following “contingency table”:

Age: 18-29 30-39 40-49 50+ Total


Agree 250 100 40 90
Disagree 100 100 60 260
Total
a) Give the marginal distribution of the age groups.

Check your distribution… does it add to 1?


b) Find the conditional distribution of the age groups given each opinion?

Based on these distributions, do you think that the opinion might depend on age
group or not? Why?

c) What is the probability that a randomly chosen voter is 50+ and agrees with the
proposal? To what type of distribution does this situation belong?

d) What is the probability that a randomly chosen 50+ aged person does agree?

(more questions over page)


2.
a) When is a scatterplot an appropriate graph to use?

b) State the name and describe the principle used to fit the “line of best fit” to
statistical (x,y) data?

c) What does correlation, r, measure?

d) State the general function for a linear model of data and define (be specific) all
the coefficients.

Mark: /10
STA1010 Statistical Methods for Science

Homework for Applied Class 3 DUE Monday 11:55pm Week 5

Name: …………………………………………………….. ID……………….

1.
a) For the following data, which of the two possible lines of best fit: ŷ1 =12.5–2x, or
ŷ 2 =12–1.5x, is better, according to the Principle of Least Squares. State why.
ŷ1 =12.5–2x ŷ 2 =12–1.5x
x y ŷ1 residual ŷ 2 residual
0 13
1 9
2 10
3 6

(Show your working fully in the table).

b) For the better fit line above, plot the residual value against the x value.

c) What is this type of plot called? What does it indicate here?

2. The logarithmic transformed data for the number of bacteria N on a glass dish after t
days satisfies the linear relationship: ln( N ) = 8.5 + 1.7 t
a) What type of relationship between N and t is evident from this linear ln(y) against
x plot?

b) Express N as a function of t (show final expression expanded fully)

N (t ) =

(more questions over page)


c) By what growth factor does the number of bacteria increase over 1 day?

d) What number of bacteria will be predicted to be present in this system after 1


week?

Mark: /10
STA1010 Statistical Methods for Science

Homework for Applied Class 5 DUE Monday 11:55pm Week 7

Name: ………………………………………….. ID……………….


An urn contains 2 red marbles and 3 blue marbles.
1. One person takes two marbles at random from the urn and does not replace them.
a) State the general ways in which the person could get a red marble and a blue marble.

b) State the number of ways this can occur.

c) What is the probability the person gets a red and a blue marble?
P(R & B) =

2. A second person takes another two marbles from the urn and does not replace
them.
a) Given that the first person took one red and one blue, what is the conditional
probability that the second person also takes one red and one blue?
P(2nd R & B | 1st R & B) =

b) What is the probability the first person got a red and a blue given that the second got a
red and a blue? (Hint: consider the total probability of the second person getting a red
and a blue. You will need to think about how the first person did not get a red and a
blue for that case).
P(1st R & B | 2nd R & B) =

(continue working over page)


Mark: /10
STA1010 Statistical Methods for Science

Homework for Applied Class 6 DUE Monday 11:55pm Week 8

Name: …………………………………………………….. ID……………….


Malaria is a debilitating and potentially deadly parasitic infection. It can be cured and is
mostly preventable with community health precautions but many developing countries lack
the necessary resources. The West African Country of Guinea has a high rate of malaria,
with 70% of its population infected.
A physician from “Doctors without Borders” sees 4 patients in a given hour.
Assume the patient visits are independent.
Let X be the count of patients who are infected with malaria among the 4 patients seen in
any random one hour.

a) What is the probability of the Bernoulli event of “having malaria”? ……….

b) What is the number of trials in the situation above? ……...

c) What is the question that can be answered by a Binomial distribution?

d) State the probability distribution of X above (type and its parameters):

X~ …………….

e) State the formula for P(X=x) and define all terms used:

Evaluate the probability distribution of X:

x Pr(X=x)
0
1
2
3
4

Check your distribution… does it add to 1?


(more questions over page)
f) Draw a probability histogram for the distribution of X (remember that these are
discrete probabilities).

g) Find the probability that the doctor will see “at most” one patient with malaria in any
hour (use correct notation for Probability involved)

h) Find the probability that there are 2 or more cases of malaria in any hour (use correct
notation for Probability involved).

Mark: /10
STA1010 Statistical Methods for Science

Homework for Applied Class 7 DUE Monday 11:55pm Week 9

Name: …………………………………………………….. ID……………….

a) What is the 68-95-99.7 Rule and WHEN does it apply?

2. Based on long term data, the time to travel a particular bus route is 1 hour with a
standard deviation of 10 minutes. Sketch the density curve of the travel time distribution
assuming a normal distribution. Use the rule in a) to put the correct scale on the x-axis.

3.
a) Define the z-transformation for a Normal distribution within a population, defining all
the terms used.

b) An individual run of this bus took 65 minutes. What is the z-score of this trip? What
proportion of trips is likely to take longer than this?

(more questions over page)


4. Given independent random variables with means and standard deviations as shown, find
the mean and standard deviations of these combinations of the variables:

HINT: See Lecture 16:


Standard deviation, s, cannot be added/subtracted, but Variance can.
Remember var(x) = sx2

Mean SD Var
X 12 3
Y 20 2
Y+6
X+Y
X-Y
X1 + X2 + X3

5. Based on past experience a delivery company’s trucks will average 1.3 parking tickets
per truck per month, with a standard deviation of 0.7 tickets.
a) If they have 18 trucks, what are the mean and standard deviation of the total number
of parking tickets the company will have to pay this month?

b) What assumption did you make in answering the above?

Mark: /10
STA1010 Statistical Methods for Science

Homework for Applied Class 8 DUE Monday 11:55pm Week 10

Name: …………………………………………..…………….. ID……………….


1. What does a “sampling distribution of sample means” represent?

2. What do you know about the distribution of X when X has mean µ and standard
deviation s ?
a) If X is a random variable with a normal distribution, then the distribution of many
sample means, X , is:

X~
b) If X is any random variable (regardless of Normal or skewed) and the sample size, n
is large, then the approximate distribution of the sample means, X , is:

X~

c) What is the name of the formal statement of the same result in a) and b) above?

3. What are the formulae for a 100(1 - a)% confidence interval (and clearly define all the
symbols used) for:
a) a population mean when the population standard deviation, s, known?

b) a population mean when s unknown?

c) the difference between two independent population means with both s unknown?

(more questions over page)


4. Determine the 95% confidence interval for the population mean systolic blood pressure
of uni students if a representative sample of size 49 had a mean of 125 mmHg with a
sample standard deviation of 6.8 mmHg.

Mark: /10
STA1010 Statistical Methods for Science

Homework for Applied Class 9 DUE Monday 11:55pm Week 11

Name: ……………………………………………………………….. ID……………….


1. List all the steps involved in carrying out a hypothesis test.

2. Explain what a p-value is.

3. When do we reject the null hypothesis?

4. Medical researchers were interested in measuring the possible loss of vitamin C when
wheat soy blend (contains vitamin C) is mixed with other ingredients and cooked. Five
samples of food with wheat soy blend were prepared according to a certain recipe. The
vitamin C content of the samples was measured before and after cooking. The PAIRED
results were:

Sample 1 2 3 4 5
Before 53 69 66 76 68
After 38 42 44 51 48
∆ = B-A

Set up appropriate hypotheses and carry out a significance test based on the t-distribution.
(Note that it is not possible for cooking to increase the amount of vitamin C).
a) Summary sample data required:

b) Is this a 1-sided or 2-sided hypothesis?

c) H0:

Ha :
d) Standardized test-statistic, t =

e) p-value =

f) Statistical Decision:

g) Conclusion in the context of the question for a report:

Mark: /10
STA1010 Statistical Methods for Science

Homework for Applied Class 10 DUE Monday 11:55pm Week 12

Name: …………………………………………………….. ID……………….

1. If X ~ Bin(n, p) what is the approximate distribution of X for large sample size?


X~
AND what specific conditions are necessary for this approximation to be valid?

2. If a sample of size n is taken from a population with probability of success p, what is


the approximate distribution of the sample proportion p̂ ?

p̂ ~

3. During one of the AFL football season, the “Home Team” won 49 of the 82 games
played outside of Melbourne. Is this strong evidence of a home field advantage in
professional AFL football? That is, does the home team (outside Melbourne) win more
than 50% of its games? Use an appropriate hypothesis test to make your decision to a
level of significance of 5%. Show all steps: [See Lecture 29, page 1]
a) Sample proportion = (keep 3 sig. figures at least)

b) Hypotheses:

c) Check conditions:

d) Standardized test-statistic, z-statistic =

e) p-value =

f) Decision and Conclusion:

Mark: /10

You might also like