0% found this document useful (0 votes)
23 views

Aem214 CH-3C

This document discusses various statistical measures of dispersion, including range, standard deviation, and variance. It provides examples of calculating each measure and outlines their characteristics and limitations. Specifically, it notes that range only considers the maximum and minimum values and can be skewed by outliers, while standard deviation measures how data is distributed around the mean by calculating the average deviation. The document also discusses calculating variance and standard error of the mean, and provides step-by-step examples of each calculation method.

Uploaded by

Lucky Goje
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
23 views

Aem214 CH-3C

This document discusses various statistical measures of dispersion, including range, standard deviation, and variance. It provides examples of calculating each measure and outlines their characteristics and limitations. Specifically, it notes that range only considers the maximum and minimum values and can be skewed by outliers, while standard deviation measures how data is distributed around the mean by calculating the average deviation. The document also discusses calculating variance and standard error of the mean, and provides step-by-step examples of each calculation method.

Uploaded by

Lucky Goje
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

AEM214 Fundamentals of Statistics – 2021-22

CHAPTER – 3
DISPERSION

Meaning of disperson is scatteredness. It gives us the idea about the homogeneity or


heterogeneity of the distribution. In other words, Dispersion gives us the idea about the
variation in data set from central value. It is very useful to draw conclusion about the data
set or given observations.

Characteristics of ideal measures of disperson: It should be rigidly defined. It should be


easy to calculate and understand. It should be based on all the observation. It should be
amenable to further mathematical treatment.

Measures of Dispersion

There are different tools for measures of disperson like as Range, Mean Deviation,
Absolute Deviation, Standard Deviation, Variance, etc.

Range in Statistics

In statistics, range is defined simply as the difference between the maximum and
minimum observations. It is intuitively obvious why we define range in statistics this way
– range should suggest how diversely spreads out the values are, and by computing the
difference between the maximum and minimum values, we can get an estimate of the
spread of the data.

For example, suppose an experiment involves finding out the weight of lab rats and the
values in grams are 310, 367, 423, 471 and 485. In this case, the range is simply
computed as 485-310 = 175 grams.

Some Limitations of Range

Range is quite a useful indication of how spread out the data is, but it has some serious
limitations. This is because sometimes data can have outliers that are widely off the other
data points. In these cases, the range might not give a true indication of the spread of data.

For example, in our previous case, consider a small baby rat added to the data set that
weighs only 50 grams. Now the range is computed as 485-50 = 435 grams, which looks
like a false indication of the dispersion of data.

This limitation of range is to be expected primarily because range is computed taking


only two data points into consideration. Thus it cannot give a very good estimate of how
the overall data behaves.

Standard deviation is the best way to measurement of deviations. Standard deviation


tells us about how the data is distributed about the mean value.

Singh 2021 Page 1


AEM214 Fundamentals of Statistics – 2021-22

EXAMPLES

For example, the data points 50, 51, 52, 55, 56, 57, 59 and 60 have a mean at 55 (Blue).

Another data set of 12, 32, 43, 48, 64, 71, 83 and 87. This set too has a mean of 55 (Pink).

However, it can clearly be seen that the properties of these two sets are different. The first
set is much more closely packed than the second one. Through standard deviation, we can
measure this distribution of data about the mean.

USAGE

The measurement of deviations through standard deviation is used in many experiments


of social sciences and finances.

In physical experiments, it is important to have a measurement of deviations. Standard


deviation provides a way to check the results. Very large values of standard deviation can
mean the experiment is faulty – either there is too much noise from outside or there could
be a fault in the measuring instrument.

STATISTICAL VARIANCE

Statistical variance gives a measure of how the data distributes itself about the mean or
expected value. In many cases of statistics and experimentation, it is the variance that
gives invaluable information about the data distribution.

STEP BY STEP CALCULATION

For example, suppose you want to find the variance of scores on a test. Suppose the
scores are 67, 72, 85, 93 and 98.

Singh 2021 Page 2


AEM214 Fundamentals of Statistics – 2021-22

1. Write down the formula for variance:

Variance= σ2 = ∑ (Xi-Mean)2 / N

Variance= σ2 = ∑ Fi (Xi-Mean)2 / N

2. There are five scores in total, so N = 5.

σ2 = ∑ (Xi-Mean)2 / 5

3. The mean (µ) for the five scores (67, 72, 85, 93, 98), so µ=Mean = 83.

σ2 = ∑ (Xi-83)2 / 5

4. Now, compare each score (x = 67, 72, 85, 93, 98) to the mean (µ = 83)

σ2 = [ (67-83)2+(72-83)2+(85-83)2+(93-83)2+(98-83)2 ] / 5

5. Conduct the subtraction in each parenthesis.


o 67-83 = -16
o 72-83 = -11
o 85-83 = 2
o 93-83 = 10
o 98 – 83 = 15

The formula will look like this:

σ2 = [ (-16)2+(-11)2+(2)2+(10)2+(15)2] / 5

6. Then, square each parenthesis. We get 256, 121, 4, 100 and 225.

σ2 = [ (-16)x(-16)+(-11)x(-11)+(2)x(2)+(10)x(10)+(15)x(15)] / 5

σ2 = [ 16x16 + 11x11 + 2x2 + 10x10 + 15x15] / 5

σ2 = [256 + 121 + 4 + 100 + 225] / 5

7. Then summarize the numbers inside the brackets:

σ2 = 706 / 5

8. To get the final answer, we divide the sum by 5 (Because it was five scores).
This is the variance for the dataset:

σ2 = 141.2 (This is the variance of the population of scores)

Singh 2021 Page 3


AEM214 Fundamentals of Statistics – 2021-22

Mean Deviation: 1/N ∑ │Xi - Mean│

If in terms of Frequency

Mean Deviation: 1/N ∑ Fi│Xi - Mean│

Absolute Deviation: 1/N ∑ │Xi - A│ (if A is any arbitrary constant)

If in terms of Frequency

Absolute Deviation: 1/N ∑ Fi│Xi - A│ (if A is any arbitrary constant)

STANDARD DEVIATION OF SAMPLE:

In many cases, instead of a population, we deal with scores.

In this case, we need to slightly change the formula for standard deviation as

S2 = the standard deviation of the sample.

Note that the denominator is one less than the sample size in this case.

USAGE

The concept of variance can be extended to continuous data sets too. In that case, instead
of summing up the individual differences from the mean, we need to integrate them. This
approach is also useful when the number of data points is very large, like the population
of a country.

Variance is extensively used in probability theory, wherein from a given smaller sample
set, more generalized conclusions need to be drawn. This is because variance gives us an
idea about the distribution of data around the mean, and thus from this distribution, we
can work out where we can expect an unknown data point.

STANDARD ERROR OF THE MEAN

The standard error of the mean, also called the standard deviation of the mean, is a
method used to estimate the standard deviation of a sampling distribution. To understand
this, first we need to understand why a sampling distribution is required.

Singh 2021 Page 4


AEM214 Fundamentals of Statistics – 2021-22

As an example, consider an experiment that measures the speed of sound in a material


along the three directions (along x, y and z coordinates). By taking the mean of these
values, we can get the average speed of sound in this medium.

Mathematically, the standard error of the mean formula is given by:

σM = standard error of the mean, σ = the standard deviation of the original distribution

N = the sample size and √N = Root of the sample size

CONCEPTS:

It can be seen from the formula that the standard error of the mean decreases as N
increases. This is expected because if the mean at each step is calculated using a lot of
data points, then a small deviation in one value will cause less effect on the final mean.

The standard error of the mean tells us how the mean varies with different experiments
measuring the same quantity. Thus if the effect of random changes are significant, then
the standard error of the mean will be higher. If there is no change in the data points as
experiments are repeated, then the standard error of mean is zero.

Solved Examples:

1. Find the Range and Range Coefficient of the following series which gives the height of
seven plants:
22, 35, 32, 45, 42, 48, 39

Range = Max. – Min. = 48 – 22 = 26


Range Coefficient = (48 – 22)/(48 + 22) = 26/70 = 0.37

2. Ten measurements were made with the following results. Find the standard deviation.

Length in cm. 77 73 75 70 72 76 75 72 74 76
Xi – m 3 -1 1 -4 -2 2 1 -2 0 2
2
(Xi-m) 9 1 1 16 4 4 1 4 0 4
Di 2 -2 0 -5 -3 1 0 -3 -1 1
2
Di 4 4 0 25 9 1 0 9 1 1

ΣXi = 74 Σ(Xi-m)2 = 44 ΣDi = -10 ΣDi2 = 54

Mean = 74, σ = SQRT[44/10] = 2.1 cm.


Short cut Method:
Singh 2021 Page 5
AEM214 Fundamentals of Statistics – 2021-22

S.D. = SQRT[(54/10) – ( 74 – 75 )2] = SQRT [ 44/10 ] = 2.1 cm

3. From the following figures find the mean, standard deviation and coefficient of
variation.

Group 0-10 10-20 20-30 30-40 40-50 50-60 60-70 70-80


Freq.(fi) 5 10 20 40 30 20 10 4
Mid value (Xi) 5 15 25 35 45 55 65 75
di -3 -2 -1 0 1 2 3 4
fi x di -15 -20 -20 0 30 40 30 16
2
fi x di 45 40 20 0 30 80 90 64

Σ fi = 139 Σ fi x di = 61 Σ fi x di2 = 369

Mean = [35 + {(61/139) x 10}] = 39.5

S.D. = SQRT [(369/139) – (61/139)] x 10 = 15.6

C.V. = [S.D./Mean] x 100 = (15.6/39.5) x 100 = 39.2%

Singh 2021 Page 6

You might also like