CH 2
CH 2
CH 2
Chapter
II
15
CHAPTER
2. Introduction 2
Chapter Contents
2.1. Types of data
2.1 Types of data 2.2.Data collection methods
2.3. Frequency distribution
2.4. Pictorial presentation
Before knowing the methods used to collect data / information we shall see
2.5. Review questions
first about types of data. Statistical data may be classified in to two basic
types based on sources.
researcher. The dis advantage were low response rate, (mail) is not
suitable for illiterate community.
3. Observation : Observational methods are used mostly in scientific
inquiry where data are observed directly from controlled experiment.
It is used more in the natural sciences through laboratory works than
in social sciences. But this is very useful studying small communities
and institutions.
4. Interviewing: In this method, the person collecting the data is called
the interviewer goes to ask the person (interviewee) direct questions.
The interviewer has to go to the interviewees personally to collect
the information required verbally. This method has also different
schemes.
Observation and Interviewing are used to collect information regarding land
area, crop output, and exchange activities (purchase and sale prices) ,etc.
For Example: The process of sorting letters in a post office, the letters are
classified according to the cities and further arranged according to streets.
Activity 2.3. What is the difference b/n Classification & Tabulation? Dis-
cuss.
Frequency Distribution 19
We use this type of frequency distribution to present raw data when the
range of the data values is small. Mostly it is useful for discrete and small
set of data. Discrete data is generated by counting.
When the range of the data is large, the data must be grouped in to classes
that are more than one unit in width. Some concepts used when we present
data in g .f .d are mentioned and defined below.
1. Lower class limits (LCL): represents the smallest data value that can be included in the
class.
2. Upper class limits (UCL): represents the largest data value that can be included in the
class.
3. Class boundaries (CBs): these are numbers used to separate the class so that there are
no gaps in frequency distribution. The class limits should have the same decimal place value
as the data but the class boundaries have one additional place value and always end with the
digit 5.
4. Unit of measurement (U): is the smallest difference between any two values of the variable
being studied. It is important to change the class limits in to class boundaries.
5. The class width (W): for a class in a frequency distribution is found by subtracting
lower/upper class limit of one class minus the lower /upper class limit of the previous class.
Or it is the difference between UCB and LCB of any class.
6. Cumulative frequency (cf): are used to show how many values are accumulated up to and
including a specific class.
7. Relative frequency (rf): is the frequency of the class divided by the total frequency (i.e.
sum of all frequencies). And if multiplied by 100, gives the percentage of values falling in that
class.
8. Class mark or midpoint: is denoted by Xmi or CMi and is obtained by adding the upper and
lower Class boundaries and dividing by 2, or adding the LCL and UCLs and dividing by 2.
9. The class interval: is the range of values of each class. The class interval 10 – 20 contains all
the values between 10 and 20. Its mid point is 15. The lower class limit is 10. The upper class
limit is 20.
10. Class: is different non overlapping group of data.
11. Open-end Classes: A class has either no lower class limit or no upper class limit in a
frequency table is called an open-end class. We do not like to use open-end classes in practice,
because they create problems in calculation. For example:
Weights (Pounds) No of Persons
Below 110 6
110 - 120 12
120 - 130 20
130 - 140 10
Above 140 2
Lcf = f 1, f 1 + f 2, . . . , f 1 + f 2 + ... + f k
Frequency Distribution 22
Note 2.2. The relative frequency shows what fractional part or portion of
the total frequencies belongs to the corresponding class. The sum of all the
relative frequencies in the frequency distribution is always one.
Note 2.3. The Xmi is the numerical location of the center of the class
interval. Midpoints are necessary for graphing frequency polygon. They
are also used in computing the mean and the standard deviation.
R
W = (2.3.3)
k
Step4 Select a starting point (usually the lowest value; this becomes the lowest limit of the 1st class.
Then add the width successively to get the lower limits of the next classes.
Step5 Find U (unit of measurement).
Step6 Find the upper class limits.
Step7 Find the class boundaries.
Step8 Tally the data.
Step9 Find the frequencies.
Step10 Find the cumulative frequencies.
Frequency Distribution 23
Note 2.4. For finding the class boundaries, we take half of the difference between lower class limit
of the 2nd class and upper class limit of the 1st class (20-19)/2=0.5. This value is subtracted from
lower class limit and added in upper class limit to get the required class boundaries.
Activity 2.4. Find less than and more than comulative frequencies.
Solution:
They are helpful for memorizing facts. Because the imprecations created by diagrams
and graphs can be retained in your mind for a long period of time.
® They facilitate comparison. They help one in making quick and accurate comparisons of
data. They bring out hidden facts and relationships. The information presented can be easily
understood at a glance.
These include the histogram, the frequency polygon the cumulative fre-
quency curve / Ogive curve, the line graph, etc.
1. Histogram – is a graph that displays the data by using vertical bars
[rectangles] of various heights to represent the frequencies.
Histogram of marks
20
15
Frequency
10
5
0
10 20 30 40 50 60 70 80
marks
Frequency Distribution 25
frequency polygon
14
12
10
frequency
8
6
4
2
20 30 40 50 60 70
Marks(midpoint)
Figure 2.3.
less than ogive greater than ogive
50
50
40
40
30
30
<F
>F
20
20
10
10
20 30 40 50 60 70 80 10 20 30 40 50 60 70
Marks) Marks)
Frequency Distribution 26
Solution:
ILLUSTRATION 2.5. Draw a multiple bar chart to represent the import and
export of Canada (values in $) for the years 1991 to 1995. Years Imports
Exports
Solution:
Figure 2.5. The import and export of Canada: from 1991 - 1995.
ILLUSTRATION 2.6. The following table gives the details of monthly budget
of a family. Represent these figures by a suitable diagram.