Data and Variable - PG 9
Data and Variable - PG 9
Data and Variable - PG 9
Development Team
Prof. R.K. Kohli
Principal Investigator
& Prof. V.K. Garg &Prof.AshokDhawan
Co- Principal Investigator
Central University of Punjab, Bathinda
1
Anchor Institute Central University of Punjab
Module Id EVS/SAES-XIV/2
Objectives To give basic introduction of data and its various types with examples
Keywords Statistics, data, quantitative, qualitative, attributes, variables, discrete variables, continuous variables
2. Introduction
Data is a plural form of a word ‘datum’. Data is considered as a collection of items either in qualitative
or in quantitative form and it consists of full information regarding the objective. It consists of relevant
information about the objective and it is analyzed further to extract that information.
Data is collected from the sources in which one is interested. In different sectors, it is possible that one
does not have direct approach to the object due to time and money for example for the study of an
environment affecting factors. One requires large instruments and man power to collect values but there
are other departments like metrological sciences, remote sensing that also deal with the same objects.
Thus one can use published data by these department for further analysis. The source that one has used
to collect the data is secondary and it is called secondary source. If one has direct approach to the sources
and information then data collected is considered as from primary source. So data are collected from
two sources:
(a) Primary Source
(b) Secondary Source
Now information is collected in the form of data from one of the above sources. This information is
further used for analysis purpose. It solely depends on the characteristics of an item that it can be
observed either in quantitative form or in qualitative form. For example, height, weight and age of a
person are quantified in numbers. This is an example of quantitative form and the variables used to
quantify values are called quantitative variables.
Also some characteristics like religion of a person, designation, severity of a diseases, gender etc are
difficult to calculate in terms of number but one can assign numbers to them for recording purpose and
has no meaning in term of value. For example for noting down sex of a person on can use ‘1’ for male
and ‘2’ for female in a government survey. Similarly for considering the health status of patient suffering
from particular disease one can categorize good as ‘1’ , mild as ‘2’ and ‘severe’ as ‘3’. This is an
example of qualitative data and it is further evaluated to draw conclusion.
3
3. Types of data
In this section, an attempt has been made to give an understanding of types of data. One can get detail
understanding of types of data after reading this section.
The following chart explains the different segments of the data and it relation with one another.
Qualitative
data Quantitative
Data
Figure 1
From the above chart, one can observe that data is mainly segmented into two forms and these forms
are further divided into various segments. One of main branch of data includes qualitative data and it is
further divided into attributes, nominal and ordinal. Similarly, quantitative data includes continuous data
and discrete data.
3.1(b) Nominal
Nominal data are considered as that form of data that cannot be ordered or have more than two
categories.
For example, color of hair (black, brown, blonde etc.), marital status (married, unmarried, divorced,
separated), nature of disaster (fire, theft, accident, earthquake, etc.). In these examples, on can observe
5
Self-Check Exercise
In the following question, state the type of data with a reason
Question: Which one of the following subject you learn here?
(a) Mathematics
(b) Physics
(c) Statistics
Answer This question is an example of qualitative data and further it is categorized as nominal data. The
reason behind this is that one cannot order subjects.
In the previous question, subjects are considered as a nominal data. But it can be ordered depending on
the question.
Question: Which of the following subject you like the most?
(a) Mathematics
(b) Physics
(c) Statistics
Answer: It is an example of qualitative data and specifically ordinal data. As one can order the subjects
based on his liking.
Question: How would you rate your learning technique?
Answer: It is an example of qualitative data and specifically ordinal data as the categories are ordered
from poor to excellent.
Question: Did you study statistics in your college?
Answer Qualitative data and it is an attribute. In this case, we simply say item has this attribute or not.
Question: How would you rate your learning techniques? (1= excellent, 5=poor)
*Whole Numbers are those numbers that start from 0 and go till infinity that is 0, 1, 2,
3………………….
From the above example, one can see that in real life the data consists of different types. Hence, one
must be clear while preparing questionnaire or study about the type of items (questions). So one can
keep the following points in mind about qualitative data and quantitative data. These are
Hence from the above comparison, one can observe that both these methods are also complementary to
one other. The questionnaire has given above to prove this statement. As one can see that information
is collected from the respondent to draw conclusions. These include results/statements about all the
attributes and variables available in the data. For example, from the above questionnaire, one can
conclude on the basis of data that x amount of persons (male /female) of y age are affected with monetary
loss of z amount on average. This shows the relationship of loss amount with attributes like sex and
variables like age.
5. Variable
Variable is another commonly used word when collecting information from the observations. Before
looking at the definition let’s first understand it through an example.
Basically data are realization of the variable. For example, in a study of measuring average height of
student then height is considered as a variable. As height of each student is different and it can take any
value within a specified range. Also for other objectives whose value vary under different conditions are
measured through variable.
For example it there are 3 students and their heights are given in inches as 78, 81 and 56 respectively.
So our data have three values and it is collected by measuring heights of students.
Definition
After understanding the definition of the variable, it is easy for one to understand independent and
dependent variables as these variables are used in the most of the studies.
10
Dependent Variable
A variable is said to be dependent variable if its value changes due to change in other variable. The
variable that influences the value of this dependent variable is called independent variable (from above
definition).
From previous example, one can see that there are quantities like algal density, poor water quality and
chlorophyll-II that is used as an indicator of lake water quality. Now, chlorophyll-II value are basically
dependent on the values of algal density and also on the quality of lake water. So basically, chlorophyll-
II is dependent variable and others are independent variables.
Self -Check Exercise
Question: Which one is independent/dependent variable in the study if a scientist conducts an
experiment to test the theory that a vitamin could extend a person’s life-expectancy?
Answer: Here the independent variable is the amount of vitamin that is given to the subjects within the
experiment. Dependent variable is the variable affected by the independent variable and in this case it is
life span.
Question: If a scientist studies the impact of a drug on cancer. What will be an independent variable?
Answer: A scientist studies the impact of a drug on cancer hence it is a dependent variable. Here
independent variables are the administration of the drug like the dosage and the timing of an impact.
Question: If the scientist studies the impact of withholding affection on rats. Which one is the
independent variable?
Answer: Here the amount of affection is the independent variable and dependent variable is the reaction
of the rats.
Question: In a scientific study that how many days people can eat soup until they get sick? Write
independent and dependent variable?
Answer: Here number of days of consuming soup is an independent variable and the dependent variable
is the onset of illness.
11
Daniel, W. W. and C. L. Cross, C. L., Biostatistics: A Foundation for Analysis in the Health Sciences,
10th Edition, John Wiley & Sons, 2013.
Hogg, R. V., J. Mckean and A. Craig, Introduction to Mathematical Statistics, Macmillan Pub. Co. Inc.,
1978.
Meyer, P. L., Introductory Probability and Statistical Applications, Oxford & IBH Pub, 1975.
12