Unit 3
Unit 3
Unit 3
Descriptive Statistics
Objectives:
By the end of the lesson, you will be able to:
• Apply various measures of central tendency – including the mean,
median, and the mode – to a set of ungrouped data.
• Apply various measures of variability—including the range,
interquartile range, mean absolute deviation, variance, and standard
deviation —to a set of ungrouped data.
• Describe a data distribution statistically and graphically using
skewness, kurtosis, and box-and-whisker plots.
Topics:
• Lesson 3.1: Measures of Central Tendency: Ungrouped Data
• Lesson 3.2: Measures of Variability: Ungrouped Data
• Lesson 3.3: Measures of Shape
Measures of Central Tendency: Ungrouped
Data
• Measures of central tendency yield information about the
center, or middle part, of a group of numbers.
• Measures of central tendency do not focus on the span of
the data set or how far values are from the middle numbers.
Mode
• The mode is the most frequently occurring value in a set of
data.
• Organizing the data into an ordered array (an ordering of the
numbers from smallest to largest) helps to locate the mode.
Example:
Median
• The median is the middle value in an ordered array of
numbers.
• For an array with an odd number of terms, the median is the
middle number.
• For an array with an even number of terms, the median is
the average of the two middle numbers.
Steps for getting the median
• STEP 1. Arrange the observations in an ordered data array.
• STEP 2. For an odd number of terms, find the middle term of
the ordered array. It is the median.
• STEP 3. For an even number of terms, find the average of the
middle two terms. This average is the median.
Median
• Suppose a business researcher wants to determine the
median for the following numbers.
15 11 14 3 21 17 22 16 19 16 5 7 19 8 9 20 4
The researcher arranges the numbers in an ordered array.
3 4 5 7 8 9 11 14 15 16 16 17 19 19 20 21 22
Continue…
• The median is unaffected by the magnitude of extreme
values. This characteristic is an advantage, because large and
small values do not inordinately influence the median.
Mean
• The average of a group of numbers and is computed by
summing all numbers and dividing by the number of
numbers.
• The population mean is represented by the Greek letter mu
(μ).
• The sample mean is represented by .
Mean
• The formulas for computing the population mean and the sample
mean are given in the boxes that follow.
Continue…
• The capital Greek letter sigma (Σ) is commonly used in
mathematics to represent a summation of all the numbers in
a grouping.
• N is the number of terms in the population, and n is the
number of terms in the sample.
Example
Solution:
• Mode: 9,000
• Median: With 13 different companies in this group, N=13.
The median is located at the (13+1)/2 = 7th position. Because
the data are already ordered, the 7th term is 20,000, which is
the median.
• Mean: The total number of cars in service is 1,791,000 =
• μ=
Lesson 3.2
Measures of Variability
Measures of Variability
• Range
• Variance
• Standard Deviation
Range
• The range is the difference between the largest value of a
data set and the smallest value of a set.
• It is a crude measure of variability, describing the distance to
the outer bounds of the data set. It reflects those extreme
values because it is constructed from them.
Variance
• The variance is the average of the squared
deviations about the arithmetic mean for a set of
numbers. The population variance is denoted by .
Population Variance
Table next slide shows the original production numbers for the
computer company, the deviations from the mean, and the
squared deviations from the mean
• The sum of the squared
deviations about the mean of
a set of values—called the
sum of squares of x and
sometimes abbreviated as SSx
—is used throughout
statistics. For the computer
company, this value is 130.
Dividing it by the number of
data values (5 weeks) yields
the variance for computer
production.
• Because the variance is computed from squared deviations,
the final result is expressed in terms of squared units of
measurement. Statistics measured in squared units are
problematic to interpret. Therefore, when used as a
descriptive measure, variance can be considered as an
intermediate calculation in the process of obtaining the
standard deviation.
Standard Deviation
• The standard deviation is a popular measure of variability. It
is used both as a separate entity and as a part of other
analyses, such as computing confidence intervals and in
hypothesis testing.
Population Standard Deviation
S = standard deviation
n = total number of observations