Inquiries, Investigation, and Immersion: Understanding Data and Ways To Collect Data (Population and Sampling Method)
Inquiries, Investigation, and Immersion: Understanding Data and Ways To Collect Data (Population and Sampling Method)
Republic Act 8293, Section 176 states that: No copyright shall subsist in any work of
the Government of the Philippines. However, prior approval of the government agency
or office wherein the work is created shall be necessary for exploitation of such work
for profit. Such agency or office may, among other things, impose as a condition the
payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this book are owned by their respective copyright holders.
Every effort has been exerted to locate and seek permission to use these materials
from their respective copyright owners. The publisher and authors do not represent
nor claim ownership over them.
Writer:
Juliet C. Saldo – Tapayas High School
Editors:
1. Fe F. Peralta – San Vicente National High School
2. Edna F. Aladano - GMEVHS
Reviewers:
1. Preciosa R. Dela Vega, EPS - English, SDO Camarines Sur
2. Jeanette Romblon, EPS I - English, SDO Masbate City
Illustrators and Layout Artists:
Learning Target
The lesson, specifically aims to describe the population of the study and
determine sampling procedure.
Vocabulary List
1
Warming Up
Prepare an acronym using the word SAMPLE by which each letter will represent
research characteristic in data gathering
S-
A-
M-
P-
L-
E-
Assuming that you are given a weekly allowance in the amount of P1,000.00. In a
pie graph, reflect how many percent/amount do you allot for food, clothing,
transportation, subject/school requirements, personal care products or groceries
and other needs. Can you find relationship/similarity between budgeting a weekly
allowance and determining the sample size of a study? Justify your answer.
Instruction: Answer the following questions briefly according to what you already
know about it.
Learning About It
2
Two groups of population:
What is 'Statistics'?
Statistics is the science concerned with developing and studying methods for
collecting, analyzing, interpreting and presenting empirical data. Statistics is a
highly interdisciplinary field; research in statistics finds applicability in virtually all
scientific fields and research questions in the various scientific fields motivate the
development of new statistical methods and theory. In developing methods and
studying the theory that underlies the methods statisticians draw on a variety of
mathematical and computational tools.Statistics is a form of mathematical analysis
that uses quantified models, representations and synopses for a given set of
experimental data or real-life studies. Statistics studies methodologies to gather,
review, analyze and draw conclusions from data.
https://www.investopedia.com/terms/s/statistics.asp#ixzz5LpLHBPRf
What is Parameter?
A constant or variable term in a function that determines the specific form of the
function but not its general nature; a variable entering into the mathematical form of
any distribution such that the possible values of the variable correspond to different
distributions
http://www.dictionary.com/browse/parameter
A statistic and a parameter are very similar. They are both descriptions of groups, like
“50% of dog owners prefer X Brand dog food.” The difference between a statistic and
a parameter is that statistics describe a sample. A parameter describes an
entire population.
Source:https://www.investopedia.com/terms/s/statistics.asp#ixzz5LpLHBPRf
3
B. Factors in Determining Sample Size
Theory of Sampling
The analogy of a fruit market can be used when thinking about the population,
the sample, and the sampling technique. The first step in sampling is to identify the
unit of analysis. Let's say that you are conducting research related to a fruit market.
What will be studied in the fruit market? Is it a type of fruit or the fruit sellers
themselves? Let's say you identify citrus fruit as the unit of analysis, and your
population is all citrus fruit within the Bauchi Road fruit market. There are too many
pieces of citrus fruit for you to study in that market, so you must select only a sample
of the citrus fruit.
4
A common error in sampling is that the sample and population are not identical.
For example, the sample may be too narrow. If the population is all citrus fruit within
the Bauchi Road fruit market, then the sample cannot only consist of lemons because
your sample would be missing oranges, grapefruit, and limes. Therefore, you must
find a way of selecting a representative sample of citrus fruit from your population. To
apply to an educational study, perhaps one may say that the population is all university
students, but only university students in public schools are sampled.
Another common error is to make the population too broad. Some may say that the
population is all mangoes in the Bauchi Road fruit market, but they are really only
interested in green mangoes. If only green mangoes are of interest, then the
population should be green mangoes in the Bauchi Road fruit market. In educational
research, sometimes researchers are only interested in a population with a certain
characteristic, such as student who has not chosen a career (in the case of career
counseling). Thus, the population and sample must be the same.
Select the unit of analysis. When selecting the sample, it is imperative that
the sampling technique selects cases based on this unit of analysis. In other
words, if the unit of analysis is students, then the sampling technique must
focus solely on how the students were selected. It would be an error to describe
the selection of schools as the sampling technique when the unit of analysis
is students.
5
Determine how many units need to be sampled. This step is a tricky
balancing act. On the one hand, larger samples tend to be more representative
of the target population and provide stronger statistical power. On the other
hand, larger samples can decrease the quality of the research study,
particularly for experimental and quasi-experimental designs. In experimental
designs, if many people participate in the treatment, then the quality of
treatment that each individual receives might suffer, resulting in inaccurate
conclusions. It is a truism that overpopulation in classrooms reduces the impact
of instruction; if there are too many students in the class, then the teaching will
not be as effective. Likewise, we should equally avoid the problem of
overpopulation in experiments: too many participants in a treatment group will
reduce the impact of the treatment. Therefore, smaller treatment groups are
generally preferable. In general, descriptive designs require at least 100
participants, correlational designs require at least 30 participants, and
experimental, quasi-experimental, and causal-comparative designs require at
least 15 participants per group. The size of the sample in experiments depend
on how effective the treatment is. If you have a very effective treatment, then
only a few participants are necessary. However, if the treatment is weak, then
a larger sample size is necessary to find a significant effect.
Sampling Procedures
There are many sampling procedures that have been developed to ensure that
a sample adequately represents the target population. A few of the most common are
described below.
Recall that the sampling procedure must reflect the unit of analysis. In a study where
the unit of analysis is the student, the researcher must obtain a complete list of every
student in the target population to achieve simple random sampling. This is rarely
possible, so very few, if any, educational studies use simple random sampling.
Another factor to consider is the word random. Random is a technical term in social
science research that means that selection was made without aim, reason, or patterns.
If any study uses the word random, it means that specific scientific procedures were
used to ensure that the sample was selected purely by chance. Scientists have
developed a few procedures that must be followed for a study to achieve random, such
as the hat-and-draw method or a random number table. To be random, participants
cannot be chosen because of their intelligence, gender, social class, convenience, or
any other factor besides scientifically-agreed upon random procedures. Using the
word random when the unit of analysis was not selected by the hat-and-draw method
or a random number table is either irresponsible or flat-out untruthful.
6
Stratified Random Sampling
In stratified random sampling, the researcher first divides the population into
groups based on a relevant characteristic and then selects participants within those
groups. In educational research, stratified random sampling is typically used when the
researcher wants to ensure that specific subgroups of people are adequately
represented within the sample. For example, a research study examining the effect of
computerized instruction on maths achievement needs to adequately sample both
male and female pupils. Stratified random sampling will be used to ensure adequate
representation of both males and females. Stratified random sampling requires four
steps:
Determine the strata that the population will be divided into. The strata are the
characteristics that the population is divided into, perhaps gender, age,
urban/rural, etc.
Determine the number of participants necessary for each stratum. Perhaps the
researcher wants equal representation within the strata: half male, half female;
20 children age 5, 20 children age 6, and 20 age 7; etc. Other times (e.g., large
survey research), the researcher might want to use proportionate random
sampling. This requires that the researcher first knows the proportion of the
group in the entire population and then match that proportion within the sample.
For example, a researcher might find the most recent Nigerian census to
determine that females represent 53% of the population in Nigeria, so the
sample will then include 53% females.
Split the units of analysis into the respective strata. In other words, if the target
population is students and the researcher wants to stratify based on gender,
then the researcher will need two lists of the target population: one list of the
male students and another list of the female students.
7
Randomly sample participants from within the group. Using either the hat-and-
draw method or a random number table, randomly select the requisite number
of males and do the same for the females.
Purposive Sampling
Multi-Stage Sampling
8
The steps in multi-stage sampling are as follows:
Organize the sampling process into stages where the unit of analysis is
systematically grouped.
Select a sampling technique for each stage.
Systematically apply the sampling technique to each stage until the unit of
analysis has been selected.
Source: http://korbedpsych.com/R06Sample.html
How do you make sure that the sample you use to gather your data is
representative of the population you are researching? By taking the time to choose a
sampling strategy. Choosing a sampling strategy is an essential step in the capture
phase of the data journey and will ensure that your data is reliable and reflects the
characteristics of your target group. In this blog, we'll take you step by step through
the process by outlining the ways in which primary data is collected using an example
in which a survey on characteristics (tax, education levels, etc) is collected on
residents in five towns. The towns are of different sizes and have a total of 3,200
households. These 3,200 households make up the target population for your survey.
9
Step one: Define your sample and target population
At times, your survey may require you to cover the entire target population, as
is the case in mapping or population studies. That’s usually referred to as a census
survey. However, target populations are generally large and expensive to survey. In
our example, it may not be feasible to visit all 3,200 households of the five towns.
Instead, you’d want to choose a smaller sample that would be representative of the
population and reflect its characteristics.
In order to select the appropriate sample size, you will need to determine the
degree of accuracy that you want to achieve. For this, you’ll need to establish the
confidence interval and confidence level of your sample.
The confidence interval, also called the margin of error, is a plus or minus figure.
It is the range within which the likelihood of a response occurs. The most commonly
used confidence interval is +/- 5. If you wish to increase the precision level of your
data, you would further reduce the error margin or confidence interval to a +/- 2. For
example, if your survey question is “does the household pay tax?” and 65% of your
sampled households say “yes,” then using a confidence interval of +/- 5, you can state
with confidence that if you had asked the question to all 3,200 households, between
60% (i.e. 65-5) and 70% (i.e. 65+5) would have also responded “yes.”
The confidence level tells you how sure you want to be and is expressed as a
percentage. It represents how often the responses from your selected sample reflect
the responses of the total population. Thus, a 95% confidence level means you can
be 95% certain. The lower the confidence level, the less certain you will be.
Most surveys use the 95% confidence level and a +/- 5 confidence interval.
When you put the confidence level and the confidence interval together, you can say
that you are 95% sure that, if you had surveyed all (3,200) households, between 60%
and 70% of the households of the target population would have answered “yes,” to the
question “does the household pay tax?”.
10
The size of your sample may be determined using any standard sample size
calculator such as Survey Monkey or Raosoft. Using a standard sample size calculator
(as can be seen in table one below) for our example of 3,200 households in five towns,
we can examine the difference in sample sizes based on different confidence levels
and intervals.
Option A
If you decide on a 5% confidence interval and want to achieve a 95%
confidence level, the sample size will be 345 households.
Option B
If you wish to have higher accuracy and increase the confidence level to 99%,
the recommended sample size would be 551.
Option C
For an even higher accuracy you could choose a 2% confidence interval and
99% confidence level and arrive at a sample size of 1807.
If time and resources permit, you could opt for larger samples and choose
option C, to survey 1807 households. However, the quality of your findings are likely
to only be marginally better than with option A or B, as the rate of improvement of
accuracy gradually diminishes with the increase in sample size. The size of your
sample should therefore be decided by the objectives of the study and resources
available.
11
Step three: Define your sampling technique
Once you’ve chosen the sample size for your survey, you’ll need to define which
sampling technique you’ll use to select your sample from the target population. The
sampling technique that’s right for you depends on the nature and objectives of your
project. Sampling techniques can be broadly divided into two types: random sampling
and non-random sampling.
Random sampling
As the name suggests, random sampling literally means selection of the
sample randomly from a population, without any specific conditions. This may be done
by selecting the sample from a list, such as a directory, or physically at the location of
the survey. If you want to ensure that a particular household does not get selected
more than once, you can remove it from the list. This type of sampling is called simple
random sampling without replacement. If you choose not to remove duplicate
households from the list, you would do a simple random sampling with replacement.
12
Non-random sampling
In non-random sampling, the sample selection follows a particular set of
conditions and is generally used in studies where the sample needs to be collected
based on a specific characteristic of the population. For example, you may need to
select only households which own a car, or have children under six years of age. For
this, you would consciously select only the 345 or 551 households that have those
characteristics. Also termed purposive or subjective sampling, non-random sampling
methods include convenience, judgment, quota and snowball sampling.
Step four: Minimize sampling error
It’s normal to make mistakes during sample selection. Your efforts should
therefore always be to reduce the sampling error and make your chosen sample as
representative of the population as possible. The robustness of your sample depends
on how you minimise the sampling error. The extent of errors during sampling vary
according to the technique or method you choose for sample selection.
For samples selected randomly from a target population, the results are
generally prefixed with the +/- sampling error, which is the degree to which the sample
differs from the population. If your study requires you to know the extent of sampling
error that is acceptable for the survey, you can select a random sampling technique.
In random sampling, you will be able to regulate the survey design to arrive at an
acceptable level of error. In a non-random sample selection, the sampling error
remains unknown.
Thus, when your sample survey needs to infer the proportion of a certain
characteristics of the target population, you can select a random sampling method.
But if you want to know the perceptions of residents regarding taxation laws or the
school curriculum, you would want to capture as many perceptions as possible, and
therefore select a non-random method in situations where sampling errors or sampling
for proportionality are not of concern. Non-random sampling techniques can be very
useful in situations when you need to reach a targeted sample with specified
characteristics very quickly.
If you don’t have a sampling strategy in place, you may collect data which is
biased or not representative, rendering your data invalid.
Source: https://datajourney.akvo.org/blog/how-to-choose-your-sampling-strategy-to-
guarantee-relevant-results?utm_term=research%20sample&utm_campaign=DataJourney
13
Keeping You in Practice
Practice Task 1.
Identify population of
your study
Practice Task 2.
Practice Task 3.
14
Additional Tasks
Reflect your chosen research title/problem, then, decide for the appropriate
research design.
Task 1: Figure out the target sample population of your study.
15
II. Enumeration
Instruction: Enumerate and write what are asked from you.
1-4 What are the sampling procedures?
5-8 What are the sampling strategies?
III. Essay.
Instruction: Answer the questions briefly and concisely.
1. Define sampling in your own words and describe its purpose.
2. Describe the situations where sampling can be applied
3. Define population and identify its types.
4. Define and compare target population and accessible population. Present
examples for the two types of population and let the students classify which
type of population is being referred to.
5. Define and compare subject and respondents.
6. State the differences/similarities of Statistics and Parameter.
7. Define and discuss each factor to consider in determining the sample size.
8. Discuss further how each factor applies to sampling process.
9. Present samples that provide further context to the application of each factor.
10. Discuss each approach and explain how they apply in the sampling process.
Learning Challenge
Points to Ponder
Recall that the key question in sampling is How representative is the sample
of the target population? Therefore, the researcher has the burden of
demonstrating in their report (primarily in the methods section) that their sample,
regardless of how it was chosen, represents the target population. Simple random
sampling or multi-stage sampling will typically answer this question the best.
However, as long as the researcher makes a convincing argument in their methods
section that their sample adequately represents the target population, the
researcher really can use any available sampling procedure. – Daniel J. Boudah
16
Answer
WARMING UP Key
17
References
https://www.snapsurveys.com/blog/qualitative-vs-quantitative-research/
https://www.wallstreetmojo.com/quantitative-research/
https://measuringu.com/qual-methods/
http://korbedpsych.com/R06Sample.html
https://datajourney.akvo.org/blog/how-to-choose-your-sampling-strategy-to-
guarantee-relevant-
results?utm_term=research%20sample&utm_campaign=DataJourney