Further Mathematics 2019: Unit 3 & 4: Examples Answered
Further Mathematics 2019: Unit 3 & 4: Examples Answered
Further Mathematics 2019: Unit 3 & 4: Examples Answered
Variables
In statistics, we call quantities about which we record information variables.
DATA
Numerical data arises when the information recorded about some variable is a number that
comes from measuring or counting some quantity. Numerical data also comes in two types,
discrete and continuous. In discrete data we count and ask the question “How many?”. In
continuous data we measure and ask the question “How much?”
Categorical data arises when the information recorded about a variable is a word, number or
symbol arising from classifying a person or object as belonging to a particular category.
(NB: numbers which are categorical cannot have arithmetic procedures eg averages, addition,
subtraction, multiplication, division etc applied to them)
1
Further Mathematics: Univariate Data Ringwood SC 2019
Example A1
Classify each of the following as categorical or numerical data. If the data is numerical, further
classify the data as discrete or continuous. If it’s categorical, classify the data as ordinal or
nominal.
DATA TYPE
TIME SPENT IN SHOWER Numerical, Continuous
Frequency Tables
A frequency table is a listing of the values a variable takes in a data set, along with how often
(frequently) each value occurs. It can be used for both numerical and categorical data.
Frequency can be recorded as a:
Count: number of times a value occurs, or
Percent: percentage of times a value occurs.
The tables are set as follows:
Frequency
Percentage Frequency 100%
Total Number
2
Further Mathematics: Univariate Data Ringwood SC 2019
3 3 4 4 5 3 2 4 3 5 3
Display the data in the form of a frequency table. Round percentages to one decimal place.
TOTAL 11
The Mode
In a frequency table, the mode is defined to be the data value (or range of data values) that
occurs most often; that is has the greatest frequency.
Barcharts
Note: Bar charts have a gap between the vertical axis and the first column and has a gap between
the columns. Frequency is always on the vertical axis.
3
Further Mathematics: Univariate Data Ringwood SC 2019
GENDER FREQUENCY
Liberal 3
Greens 1
Labor 6
TOTAL 10
Example B3
According to the segmented bar chart below, what percentage of days was Melbourne’s climate
recorded as moderate?
About 65%
4
Further Mathematics: Univariate Data Ringwood SC 2019
Exercise 1B: 2, 4, 6
Exercise 1C: Displaying and describing the distributions of numerical
variables
Grouped Frequency Distribution
Sometimes the only way we can display data is by grouping into categories, for instance ages.
Listing all possible ages would be tedious therefore we might use the intervals: 15-19, 20-24, 25-
29, etc.
Grouping is also used for continuous data such as heights and weights. We generally try to group
our data into between 5 and 10 class intervals.
Example C1
A local gym recorded the ages of thirty people who were in a cycle class. The results are as
follows:
32 18 20 22 24 52 45 36 28 27
39 25 24 19 51 20 22 26 25 30
19 18 30 32 17 28 25 19 31 28
5
Further Mathematics: Univariate Data Ringwood SC 2019
Histograms
A histogram is a way of representing the information contained in a frequency table containing
numerical data in graphical form. Note: In a bar chart, the ‘bars’ are separated by a space and
they are used for categorical data. Histograms have ‘bars’ which touch and they are used for
numerical data. Frequency is shown on the vertical axis of both.
Note: When you are working with grouped data the first number in the class interval is written at the the
start of the bar. Alternatively, if you’re working with discrete (non-grouped) data, the number is written in
the middle of the bar.
Use the information in the frequency table to construct a histogram to display the distribution.
Describing histograms
“SOCS” = Shape, Outliers, Centre, Spread
In describing histograms, we discuss
1. Shape – see below – ignore the outlier when you are determining the shape!
2. ‘Outliers’ (values that are considerably higher or lower than the bulk of the data)
𝑛+1
3. Centre (median is the 2 th value) – need to know how to find these in histograms
4. Spread (range : highest score – lowest score)
**ALL of these features must be discussed when describing a histogram distribution
Shape
Symmetric Distribution Positively Skewed
10
10
Frequency
8
Frequency
5 6
4
0 2
FEATURES: 0
● Single peaked FEATURES:
● Tails off relatively evenly either side ● Tails off to the right
● Mean > Median
Frequency
6 6
4 4
2
2
0
0
FEATURES: FEATURES:
● Tails off to the left ● Double peaked
● Mean < Median
6
Further Mathematics: Univariate Data Ringwood SC 2019
Outliers
Outliers are values that stand out from the main body of data.
10
8
Frequency
6
4
2
0
Centre
The Centre (using the median) divides histogram into two equal areas.
10
10
8
Frequency
Frequency
6
5 4
2
0 0
n +1
Median location: if “n” is the number of data values, the median is the th value
2
Spread
Spread (using the range) indicates how tight or loose data values in a distribution are clustered.
10
15
Frequency
Frequency
10
5
5
0 0
NOTE : NOTE:
● Data loosely clustered ● Data closely clustered
7
Further Mathematics: Univariate Data Ringwood SC 2019
Step 6: Tap (Alternatively, tap “Analysis Trace”) then scroll left and right using to show
values on the histogram.
The centre of the distribution, as measured by the median, lies in the interval _________ , and the
spread of the distribution, is ______________ as measured by the range.
8
Further Mathematics: Univariate Data Ringwood SC 2019
Example C2
This histogram shows the
distribution of life expectancy
for 183 different countries.
For the distribution of life expectancy for these 183 countries, the data is negatively skewed with
no apparent outliers.
The centre of distribution, as measured by the median, is between 65 and 70 years and the spread
of the distribution, is approximately 35 as measured by the range.
Exercise 1C: 1, 2, 3, 5, 7, 8, 9
9
Further Mathematics: Univariate Data Ringwood SC 2019
a) 0.003 561 4
b) 70.036 5
c) 5.320 4
d) 5320 3
e) 450 000 2
f) 78 000.0 6
g) 78 000 2
If it is 5 or above, round the final digit up; if it is 4 or below, keep the final digit as is.
For example:
Worksheet: Rounding
For extra practice: http://studymaths.co.uk/workout.php?workoutID=62
(Redo until you get 10/10)
10
Further Mathematics: Univariate Data Ringwood SC 2019
A way to overcome this is to write the numbers in logarithmic (log) form. The log of a number is
the power of 10 which creates this number.
log10(10) = log10(101) = 1
log10(100) = log10(102) = 2
log10(1000) = log10(103) = 3
Worked example
The histogram below displays the body weights (in kg) of a number of animal species. Because
the animals represented in this dataset have weights ranging from around 1kg to 90 tonnes (a
dinosaur), most of the data are bunched up at one end of the scale and much detail is missing.
The distribution of weights is highly positively skewed, with an outlier.
However, when a log scale is used, their weights are much more evenly spread along the scale.
The distribution is now approximately symmetric, with no outliers, and the histogram is
considerably more informative. We can now see that the percentage of animals with weights
between 10 and 100kg is similar to the percentage of animals with weights between 100 and
1000 kg.
11
Further Mathematics: Univariate Data Ringwood SC 2019
Actual value to log scale value: (use ) Log scale value to actual value
12
Further Mathematics: Univariate Data Ringwood SC 2019
Example D2
The Richter scale is a log base 10 scale used to measure the size of earthquakes.
a) An earthquake is recorded with a raw value of 75,000. What is this value on the Richter scale,
correct to 3 significant figures?
Log10(75000) = 4.875061… = 4.87 (to 3 s.f.)
b) What is the raw value, correct to 4 significant figures, of an earthquake which is recorded at
6.3 on the Richter scale?
106.3 = 1995262.315 = 1 995 000 (to 4 s.f.)
c) Show that an earthquake that measures 3.0 on the Richter scale is ten times stronger than an
earthquake that measures 2.0 on the Richter scale.
102 = 100
1000
103 = 1 000 = 10 therefore 10 times stronger.
100
Example D3
a) positive skew
b) approximately symmetric
13
Further Mathematics: Univariate Data Ringwood SC 2019
EXAM QUESTIONS
Log10 (10) = 1
B is correct
Log10 (1) = 0
E is correct
14
Further Mathematics: Univariate Data Ringwood SC 2019
Example A1
The dot plot below shows the number of hours students in a class spend on homework each
week. What is the median number of homework hours per week?
Example A2
Prepare an ordered stem and leaf plot for the following set of scores.
12 45 67 45 34 54 87 86 80 40 23 48 69 71
**Hint – you can use your CAS to order the data (Stats – List 1 – Edit – Sort ascending).
Key: 1|2 = 12
Stem Leaf
1 2
2 3
3 4
4 0 5 5 8
5 4
6 7 9
7 1
8 0 6 7
Note: The histogram report template (SOCS) can also be used for stem and leaf plots
15
Further Mathematics: Univariate Data Ringwood SC 2019
Split Stems
Some stem plots are too bunched (there are too many numbers in one leaf) and it is therefore
necessary to perform a split stem. The stem is usually split into halves or fifths.
2 ( 20-24) or 2 (20-21)
2 (25-29) 2 (22-23)
2 (24-25)
2 (26-27)
2 (28-29)
Example A3
Construct a single stem, and a stem split into fifths for the following data.
1.5 0.2 1.2 1.3 0.9 1.8 1.9 1.7 0.7 1.6
1.2 1.0 1.6 1.4 1.1 1.5 1.6 1.5 1.7
Stem Leaf
0 2 7 9 Stem Leaf
1 0 1 2 2 3 4 5 5 5 6 6 6 7 7 8 9 0
0 2
0
0 7
0 9
1 0 1
1 2 2 3
1 4 5 5 5
1 6 6 6 7 7
1 8 9
Exercise 2A: 2, 3, 4, 5
16
Further Mathematics: Univariate Data Ringwood SC 2019
For example, to calculate the median of the following, write out the data set:
3 5 1 4 8
we firstly write out the data values in numerical order:
1 3 4 5 8
and then locate the midpoint of the data set
1 3 4 5 8
^
median = 4
For an odd number of data values, the median will be one of the data values as above. For an even
number of data values, the median does not coincide with an actual data value. For example, to
locate the median of the data set:
5 3 4 8
we firstly write out the data values in numerical order and then locate the midpoint of the data
set. In this case the midpoint lies halfway between 4 and 5, that is, at 4.5:
3 4 5 8
^
median = 4.5
n +1
position
2
By definition, half the data (50%) lies above the median and half below.
17
Further Mathematics: Univariate Data Ringwood SC 2019
Example B1
Find the median of the following data set:
1 8 7 6 5 4 2 2 3 6
The Range
The range is a measure of spread.
Example B2
What is the range for the data in Example B1?
Range = 8 – 1 = 7
Example B3
Calculate the median and range from the stem and leaf plot below:
18
Further Mathematics: Univariate Data Ringwood SC 2019
Q1 – (the lower quartile)– The median of the lower half of you data
Q2 – the median
Q3 – (the upper quartile) – The median of the upper half of your data
Note –the median is not included in the lower or upper halves when performing
calculations for lower and upper quartiles.
The interquartile range is a measure of spread of the distribution that describes the range of the
middle 50% of observations.
The IQR is not affected by the presence of outliers. For this reason it is often a more useful
measure of spread than the range.
Example B4
For each of the following sets of data, find:
i. the lower quartile
ii. the upper quartile
iii. the interquartile range
a) 1, 2, 3, 3, 4, 4, 5, 6, 6, 7, 8
Q1 = 3
Q3 = 6 IQR = 6 – 3 = 3
b) 2, 3, 3, 3, 4, | 5, 6, 6, 6, 7
Q1 = 3
Q3 = 6 IQR = 6 – 3 = 3
19
Further Mathematics: Univariate Data Ringwood SC 2019
Example B5
Calculate the IQR of the data displayed in the stem and leaf plot below.
Exercise 2B : 1, 3, 4, 5, 6, 8
Holiday Homework Worksheet: Univariate Data Exam Questions
Exercise 2C: The Five Number Summary and the Box plot
Outliers can be shown as a dot or a cross. If there is an outlier, the “whisker” only extends to the
lowest/highest value that is not an outlier.
20
Further Mathematics: Univariate Data Ringwood SC 2019
Example C1
a) Calculate the 5 number summary and then construct a box plot for the following data. Use and
label an appropriate scale.
36 35 34 32 37 35 38 32 35 37
ordered : 32, 32, 34, 35, 35, 35, 36, 37, 37, 38
min = 32, Q1 = 34, med = 35, Q3 = 37, max = 38
c) calculate the upper and lower fences, and hence state whether there are any outliers.
IQR = 37 – 34 = 3 lower fence = 34 – 1.5 x 3 = 29.5
upper fence = 37 + 1.5 x 3 = 41.5
Since all values are between 29.5 and 41.5, there are no outliers.
21
Further Mathematics: Univariate Data Ringwood SC 2019
Step 5: Tap (Alternatively, tap “Analysis Trace”) then scroll left and right using to show
values on the box plot
Report template
22
Further Mathematics: Univariate Data Ringwood SC 2019
Example C2
Using the box plots shown, answer
the following questions:
a) In which month was the
temperature generally higher?
May
The spread, as measured by the interquartile range, was greater in May (approximately 6.50C)
than July (approximately 30C)
23
Further Mathematics: Univariate Data Ringwood SC 2019
Example F1
Calculate the mean and median of following sets of data.
a) 2 6 8 10 14 15 15
mean = (2+6+8+10+14+15+15) ÷ 7 median = 10
= 10
b) 2 6 8 10 14 15 50
mean = (2+6+8+10+14+15+50) ÷ 7 median = 10
= 15
Example F2
Use the “mid point” method to find the mean from each of the following frequency tables:
a) Complete the table below assuming that the data variable is discrete, then calculate the mean
from the table.
b) Complete the table below assuming that the data variable is continuous, then calculate the
mean from the table.
24
Further Mathematics: Univariate Data Ringwood SC 2019
Exercise 2F-1: 1, 3, 4, 5, 7, 8
Worksheet: Mean from grouped data
( x – x )2
s =
n – 1
The variance, is the square of the standard deviation, s2, and is also a measure of spread
e.g. Variance = 9, SD = √9 = 3
If the standard deviation is small compared to the mean eg mean = 100 and s.d.= 3 then
there is a small spread of data.
If the distribution is highly skewed, or if there are outliers, then the IQR is a better
measure of spread than the standard deviation.
Finding the standard deviation on the CAS (this is the only way you need to know!)
1. Statistics Enter data into list1
2. Tap “Calc One variable OK”
Note that is the standard deviation
Example F3
Use your calculator to find the standard deviation of the following data set, correct to three
significant figures:
76, 75, 79, 69, 80, 74, 83, 66
Sx = 5.6505373… 5.65
Exercise 2F-2: 1, 2, 3, 4, 6
25
Further Mathematics: Univariate Data Ringwood SC 2019
For any distribution approx.. 95% of the data lies within 2 Standard deviations of the mean
68% of the observations lie within one standard deviation of the mean
95% of the observations lie within two standard deviations of the mean
99.7% of the observations lie within three standard deviations of the mean
26
Further Mathematics: Univariate Data Ringwood SC 2019
Example G1
The number of matches in a box is not always the same. When a sample of boxes was
studied it was found that the number of matches in a box approximated a normal (bellshaped)
distribution with a mean number of matches of 50 and a standard deviation of 2.
What percentage of boxes would be expected to have more than 48 matches?
48 is 1 s.d. below the mean
Therefore 16% of distribution is below 48, which means 84% is above 48.
Example G2
VCE study scores are normally distributed with a mean of 30 and standard deviation of 7.
c) In a class of 25 students, how many would be expected to score between 30 and 37? Answer to
the nearest whole number
34% of scores between 30 to 37 34% of 25 = 8.5 therefore 9 students.
27
Further Mathematics: Univariate Data Ringwood SC 2019
Example G3
IQ scores are normally distributed. Given that 95% of IQ scores lie between 70 and 130, find the
mean and standard deviation of IQ scores.
95% of normal distribution lies within 2 s.d. either side of the mean.
So 70 is 2 s.d. below mean, 130 is 2 s.d. above mean.
Therefore, mean = 100 (midpoint of 70 and 130) and s.d. = 15 (30 ÷ 2)
Mean = 100
Standard deviation = 15
Exercise 2G : 1, 2, 3, 5
or xx
z
s
Example H1
In an IQ test, the mean IQ is 100 and the standard deviation is 15. Dale’s test results give
an IQ of 130. Calculate this as a z-score. Interpret this information.
130−100
𝑧= =2
15
Dale’s score is 2 standard deviations above the mean (which puts him in the top 2.5% of the population)
28
Further Mathematics: Univariate Data Ringwood SC 2019
You may also be asked to use the standard score formula to determine an “actual” value.
In these cases, “Action Advanced Solve” can be used on the calculator.
Example H2
The length of ants in a colony are normally distributed with a mean of 4.8mm and standard
deviation of 1.2mm.
An ant with a standardized length z = –0.5 corresponds to what actual length?
𝑥−4.8
−0.5 = solve x = 4.2mm
1.2
Example H3
A student obtained the following marks in two exams:
Subject Mark Mean Std Dev
Psychology 75 65 10
Statistics 70 60 5
70−60
Statistics standard score: 𝑧 = =2
5
Using standardized scores, this student performed better on the Statistics test.
29
Further Mathematics: Univariate Data Ringwood SC 2019
A population, in statistics, is a group of people (or objects) to whom you can apply any
conclusions or generalisations that you reach in your investigation.
A sample, in statistics, is a smaller group of people (or objects) who have been chosen
from the population and are involved in the investigation.
A simple random sample SRS is a random selection from the population such that every
member of that population has an equal chance of being chosen in the sample and the
choice of one member does not affect the choice of another member (using your CAS).
Step 2: Tap on the letter “R” and then tap on “ randList( ”twice
For example, to select 3 random numbers between 1 and 20, type: randList(3,1,20)
NOTE: If you are given a list of data and have to find a random sample, you need to number your
data 1, 2, 3, etc. The numbers given on your calculator indicates the POSITION of the real
values of the random sample in the data, they are NOT THE REAL VALUES OF DATA.
Example I2
The following data represents the ages of 20 people in an aerobics class. Find a random sample of
8 people.
42 17 18 36 19 22 25 21 20 38 33 30 16 19 25 25 26 25 22 17
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
a) Assign a number from 1 to 20 to each data value.
b) Use your calculator to select a simple random sample of 8 people from this class. Write
down the ages of the 8 people in the sample.
Randlist(8, 1, 20)
for example: { 7, 3, 18, 15, 12, 6, 9, 10 } Ages: 25, 18, 25, 25, 30, 22, 20, 38
30