Data Analysis For Accountants Assessment 2
Data Analysis For Accountants Assessment 2
Data Analysis For Accountants Assessment 2
Assessment 2 – PORTFOLIO
Student ID number:
https://www.londondaily.news/workplace-collection-pots-grow-to-an-average-of-139-as-brits-
give-generously-to-coworkers/
Paste a screenshot of the headline and relevant parts of the report mentioning the average
below:
The average used in the report is the mean, which the sum of all workplace collection pot
funds divided by the working populations.
What was the size of the sample used to find the average?
It is not specified but an analysis was done on tens of thousands of Collection Pots in
workplaces within the UK.
What is the source of the data? How was the data collected?
The source of the data is the Collection Pot, which is the biggest workplace collecting
platform in the UK. The data was collected from internal analysis of Collection Pots in
workplaces around the UK.
Do you think the average was chosen to give valid and reliable results, or to attract the
most readers?
Yes. The average was chosen to give valid and reliable results
Find an example of what you consider to be a GOOD chart about the cost-of-living crisis, and
an example of a BAD one. Include the charts on this page, together with links to the source.
For each chart, give THREE reasons why you consider it to be good or bad.
“GOOD” CHART
“BAD” CHART
LINK
https://en.wikipedia.org/wiki/2000s_United_States_housing_bubble
Correlation is a statistical measure illustrating how two or more variables move together.
On the contrary, causation goes deeper by looking into for cause and effect between two
variables. A good example of correlation is; consider the relationship between the income in
a household and its expenditures. If a survey is conducted across homes, it is likely to
discover that richer households spend more. In such a case, we can conclude there is an
income and expenditures correlation. When it come to causation, a good example is; my cat
gets fatter because its being fed more. Such a scenario is called cause and effect. The cat
being fed more treats is the cause of it becoming more heavier, i.e., the effect.
Reference:
https://saylordotorg.github.io/text_microeconomics-theory-through-applications/s21-23-
correlation-and-causality.html
For this part of your portfolio, you are required to find some data from Statista.com (email
address and password required off campus), a leading provider of market and consumer data.
You should find some data broadly related to your degree title. (If you do not have access to
Statista, your lecturer will advise you on where to find some data).
For the correlation and regression analysis, you will need TWO columns of data so that you
can have X and Y variables. You made need to download two different files and then
combine them in Excel. You need two columns of the same length, with at least 10 pairs of
(X, Y) data.
Write down the titles of the data file(s) you have chosen to use:
Data file title: UEFA champions finance and social media Practice
Total Revenue Vs. Capacity
Use Excel to create a fully-formatted scatter chart of your data, showing the linear
regression line, R-squared value, and the equation of the regression line.
40000
30000
20000
10000
0
0 100 200 300 400 500 600 700 800
Total Revenue
What is the value of the correlation coefficient for your chosen data? What does it mean?
The correlation coefficient in this case is 0.61 which is positive correlation. That is to say the
higher the capacity the higher the revenue and vice versa.
Using the values in the regression equation, explain what your analysis shows.
The values of the regression equation are:
y = 45.482x + 36490
36490 is the intercept and 45.482 is the slope. 36490 is the capacity, when total revenue is 0.
While 45.482 is the estimate of change in capacity for a unit change in the total revenue.
Explain the concepts of interpolation, extrapolation, validity and reliability when using a
regression model for forecasting.
VALIDITY:
It is defined by how well a measure measures what it is intended to measure. More
specifically, this concept refers to how well the collected data and their analysis support the
results or findings, and whether the results or findings extend to other contexts, or
generalize
RELIABILITY:
Reliability refers to the consistency of the measure. High reliability indicates that the
measurement system produces similar results under the same conditions. If you measure
the same item or person multiple times, you want to obtain comparable values.
Using the example spreadsheet above, explain how the Excel VLOOKUP function can be
used to find items in an Excel table.
How the VLOOKUP function can be used to find Petrol price in November 2021
1. Select the cell where you want the result to be and start the VLOOKUP function by
typing: =VLOOKUP (
2. The first argument is the lookup value then you add a comma: =VLOOKUP (E1,
3. The second argument in VLOOKUP function is the table array. For this segment,
select or type the range and add a comma: =VLOOKUP (E1, A2:E5,
4. The third argument is the column_index_no. which in this case is the second
column,2, then add a comma: =VLOOKUP (E1, A2:E5,2,
5. Write the word: FALSE for an exact match and close the blanket: =VLOOKUP (E1,
A2:E5,2, FALSE)
6. Press on enter! You are done.
Explanation:
Petrol price
i. Petrol Price before VAT = , the formular takes away the 20%
1.2
VAT.
ii. Petrol Price before duty and VAT = Petrol Price before VAT - fuel duty
iii. % Change, 2018 to 2021 for all the petrol prices =
Price∈ November 2021−Price∈November 2018
x 100
Price∈ November 2020
iv. Similar calculation is done for % Change, 2020 to 2021, but for 2020 and 2021.
Price∈ November 2021−Price∈November 2020
x 100
Price∈ November 2020
3 (b) Rebasing and Descriptive Statistics
The table below shows the average price of petrol and diesel in November from 2012 to
2021. Fill in all the missing values and give an explanation of your method.
Explanation:
For the first and third column which contain petrol and diesel prices respectively. These
prices are calculated the same way as follows:
Price∈2012
¿ x Price Index of the year beingcalculated
Price Index∈2012 ¿
For the second and fourth column contain petrol and diesel prices indexes respectively.
They are calculated as follows:
Price Index∈2012
¿ x Price of the year being calculated
Price∈2012 ¿
Use the information in the table above, together with any descriptive statistics you may
calculate, to make FIVE relevant comments and/or comparisons about Petrol and Diesel
prices since 2012.
1. The petrol prices are lower than diesel prices irrespective of the year in the table above.
2. Both prices index of petrol and diesel 100 in 2012.
3. The average price of petrol is 123.9 for the period given while that of diesel between 129.04
4. In 2012 Petrol price was 134.4 and in 2021 the price was 146.9. That’s a 9.3% change.
5. Diesel prices in 2016 and 2020 were the same.
2017 Q1
2018 Q1
2019 Q1
2020 Q1
2021 Q1
Q4
Q2
Q3
Q4
Q2
Q3
Q4
Q2
Q3
Q4
Q2
Q3
Q4
Q2
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Obseration number
This time series is for spending by overseas visitors to UK between the Q3 2016 to the Q2
2019. In the analysis, I calculated several items including MA and CMA (Moving Average
and Trend) and SV (Seasonal Variations), the forecast and the projection of the spending by
overseas visitors to UK from the Q3 2016 to the Q2 2019. The seasonal variations were
calculated as shown in the table below, whereas the slope and the intercept of the chart
3 39 299 188
From the data chosen, the numbers seem to fluctuate depending on the quarter. For instance,
the first and the second quarter have low spending compare to the third and fourth. In the Q1
of 2018, the spending was 6080 while on Q4 of the same year the spending was 8401 million.
Therefore the % change was 38.17%. This could have been cause by seasonal weather
patterns. The lowest spending between 2016 and 2019 was Q2 of 2019 with highest spending
being Q4 of 2017 at 4805 and 10088 respectively. Besides, there is a characteristic drop in
spending as years pass by between 2016 and 2019. This average fall in spending could be
The forecast shows an almost similar pattern in spending as previous years for the year end
2019, the whole of 2020 and 2021. However, that is not the case when actual figures are
looked at. In the year beginning 2020, the world was hit by the corona pandemic which halted
spending by visitors to the UK because basically there were no visitor. Therefore, the
projected and forecasted figures were not achieved. For instance, the forecasted figures for
the Q3 2020 were 6786 while in actual sense they were 611.