Classical Conditioning
Classical Conditioning
Classical Conditioning
unconditioned) and therefore is a natural response which has not been taught. In this
respect no new behavior has been learned yet.
For example, a stomach virus (UCS) would produce a response of nausea (UCR). In
another example a perfume (UCS) could create a response of happiness or desire
(UCR).
This stage also involves another stimulus which has no effect on a person and is called
the neutral stimulus (NS). The NS could be a person, object, place, etc. The neutral
stimulus in classical conditioning does not produce a response until it is paired with
the unconditioned stimulus.
Ivan Pavlov showed that classical conditioning applied to animals. Did it also apply
to humans? In a famous (though ethically dubious) experiment, Watson and Rayner
(1920) showed that it did.
Little Albert was a 9-month-old infant who was tested on his reactions to various
stimuli. He was shown a white rat, a rabbit, a monkey and various masks. Albert
described as "on the whole stolid and unemotional" showed no fear of any of these
stimuli. However, what did startle him and cause him to be afraid was if a hammer
was struck against a steel bar behind his head. The sudden loud noise would cause
"little Albert to burst into tears.
When Little Albert was just over 11 months old the white rat was presented and
seconds later the hammer was struck against the steel bar. This was done 7 times over
the next 7 weeks and each time Little Albert burst into tears. By now little Albert only
had to see the rat and he immediately showed every sign of fear. He would cry
(whether or not the hammer was hit against the steel bar) and he would attempt to
crawl away.
In addition, the Watson and Rayner found that Albert developed phobias of objects
which shared characteristics with the rat; including the family dog, a fur coat, some
cotton wool and a Father Christmas mask! This process is know as generalization.
Watson and Rayner had shown that classical conditioning could be used to create a
phobia. A phobia is an irrational fear, i.e. a fear that is out of proportion to the danger.
Over the next few weeks and months Little Albert was observed and 10 days after
conditioning his fear of the rat was much less marked. This dying out of a learned
response is called extinction. However, even after a full month it was still evident, and
the association could be renewed by repeating the original procedure a few times.
Critical Evaluation
Classical conditioning emphasizes the importance of learning from the environment,
and supports nurture over nature. However, it is limiting to describe behavior solely in
terms of either nature or nurture, and attempts to do this underestimate the complexity
of human behavior. It is more likely that behavior is due to an interaction between
nature (biology) and nurture (environment).
A strength of classical conditioning theory is that it is scientific. This is because it's
based on empirical evidence carried out by controlled experiments. For
example, Pavlov (1902) showed how classical conditioning can be used to make a dog
salivate to the sound of a bell.
Classical conditioning is also a reductionist explanation of behavior. This is because
complex behavior is broken down into smaller stimulus - response units of behavior.
Supporters of a reductionist approach say that it is scientific. Breaking complicated
behaviors down to small parts means that they can be scientifically tested. However,
some would argue that the reductionist view lacks validity. Thus, whilst reductionism
is useful, it can lead to incomplete explanations.
A final criticism of classical conditioning theory is that it is deterministic. This means
that it does not allow for any degree of free will in the individual. According a person
has no control over the reactions they have learned from classical conditioning, such
as a phobia.
The deterministic approach also has important implications for psychology as a
science. Scientists are interested in discovering laws which can then be used to predict
events. However, by creating general laws of behavior, deterministic psychology
underestimates the uniqueness of human beings and their freedom to choose their own
destiny.
References
Pavlov, I. P. (1897/1902). The work of the digestive glands. London: Griffin.
Watson, J. B. (1913). Psychology as the behaviorist views it. Psychological Review,
20, 158177.
Watson, J. B., & Rayner, R. (1920). Conditioned emotional reactions. Journal of
Experimental Psychology, 3(1), pp. 114.
Watson, J. B. (1924). Behaviorism. New York: People's Institute Publishing Company.
Pavlov developed some rather unfriendly technical terms to describe this process. The
unconditioned stimulus (or UCS) is the object or event that originally produces the
reflexive / natural response.
The response to this is called the unconditioned response (or UCR). The neutral
stimulus (NS) is a new stimulus that does not produce a response.
Once the neutral stimulus has become associated with the unconditioned stimulus, it
becomes a conditioned stimulus (CS). The conditioned response (CR) is the response
to the conditioned stimulus.
References
Pavlov, I. P. (1897/1902). The work of the digestive glands. London: Griffin.
Pavlov, I. P. (1928). Lectures on conditioned reflexes. (Translated by W.H. Gantt)
London: Allen and Unwin.
Pavlov, I. P. (1955). Selected works. Moscow: Foreign Languages Publishing House.
Operant Conditioning deals with operants - intentional actions that have an effect on
the surrounding environment. Skinner set out to identify the processes which made
certain operant behaviours more or less likely to occur.
B.F. Skinner (1938) coined the term operant conditioning; it means roughly changing
of behavior by the use of reinforcement which is given after the desired response.
Skinner identified three types of responses or operant that can follow behavior.
Neutral operants: responses from the environment that neither increase nor
decrease the probability of a behavior being repeated.
Reinforcers: Responses from the environment that increase the probability of a
behavior being repeated. Reinforcers can be either positive or negative.
Punishers: Responses from the environment that decrease the likelihood of a
behavior being repeated. Punishment weakens behavior.
We can all think of examples of how our own behavior has been affected by
reinforcers and punishers. As a child you probably tried out a number of behaviors
and learned from their consequences.
For example, if when you were younger you tried smoking at school, and the chief
consequence was that you got in with the crowd you always wanted to hang out with,
you would have been positively reinforced (i.e. rewarded) and would be likely to
repeat the behavior.
If, however, the main consequence was that you were caught, caned, suspended from
school and your parents became involved you would most certainly have been
punished, and you would consequently be much less likely to smoke now.
Positive Reinforcement
Skinner showed how positive reinforcement worked by placing a hungry rat in his
Skinner box. The box contained a lever on the side and as the rat moved about the box
it would accidentally knock the lever. Immediately it did so a food pellet would drop
into a container next to the lever.
The rats quickly learned to go straight to the lever after a few times of being put in the
box. The consequence of receiving food if they pressed the lever ensured that they
would repeat the action again and again.
Negative Reinforcement
The removal of an unpleasant reinforcer can also strengthen behavior. This is known
as negative reinforcement because it is the removal of an adverse stimulus which is
rewarding to the animal or person. Negative reinforcement strengthens behavior
because it stops or removes an unpleasant experience.
For example, if you do not complete your homework, you give your teacher 5. You
will complete your homework to avoid paying 5, thus strengthening the behavior of
completing your homework.
Skinner showed how negative reinforcement worked by placing a rat in his Skinner
box and then subjecting it to an unpleasant electric current which caused it some
discomfort. As the rat moved about the box it would accidentally knock the lever.
Immediately it did so the electric current would be switched off. The rats quickly
learned to go straight to the lever after a few times of being put in the box. The
consequence of escaping the electric current ensured that they would repeat the action
again and again.
In fact Skinner even taught the rats to avoid the electric current by turning on a light
just before the electric current came on. The rats soon learned to press the lever when
the light came on because they knew that this would stop the electric current being
switched on.
These two learned responses are known as Escape Learning and Avoidance Learning.
Schedules of Reinforcement
Imagine a rat in a Skinner box. In operant conditioning if no food pellet is delivered
immediately after the lever is pressed then after several attempts the rat stops pressing
the lever (how long would someone continue to go to work if their employer stopped
paying them?). The behavior has been extinguished.
Behaviorists discovered that different patterns (or schedules) of reinforcement had
different effects on the speed of learning and on extinction. Ferster and Skinner (1957)
devised different ways of delivering reinforcement, and found that this had effects on
1. The Response Rate - The rate at which the rat pressed the lever (i.e. how
hard the rat worked).
2. The Extinction Rate - The rate at which lever pressing dies out (i.e. how
soon the rat gave up).
Skinner found that the type of reinforcement which produces the slowest rate of
extinction (i.e. people will go on repeating the behavior for the longest time without
reinforcement) is variable-ratio reinforcement. The type of reinforcement which has
the quickest rate of extinction is continuous reinforcement.
Behavior Shaping
A further important contribution made by Skinner (1951) is the notion of behaviour
shaping through successive approximation. Skinner argues that the principles of
operant conditioning can be used to produce extremely complex behaviour if rewards
and punishments are delivered in such a way as to encourage move an organism closer
and closer to the desired behaviour each time.
In order to do this, the conditions (or contingencies) required to receive the reward
should shift each time the organism moves a step closer to the desired behaviour.
According to Skinner, most animal and human behaviour (including language) can be
explained as a product of this type of successive approximation.
Behavior Modification
Behavior modification is a set of therapies / techniques based on operant conditioning
(Skinner, 1938, 1953). The main principle comprises changing environmental events
that are related to a person's behavior. For example, the reinforcement of desired
behaviors and ignoring or punishing undesired ones.
This is not as simple as it sounds always reinforcing desired behavior, for example,
is basically bribery.
There are different types of positive reinforcements. Primary reinforcement is when a
reward strengths a behavior by itself. Secondary reinforcement is when something
strengthens a behavior because it leads to a primary reinforcer.
Examples of behavior modification therapy include token economy and behavior
shaping
Token Economy
Token economy is a system in which targeted behaviors are reinforced with tokens
(secondary reinforcers) and later exchanged for rewards (primary reinforcers).
Tokens can be in the form of fake money, buttons, poker chips, stickers, etc. While the
rewards can range anywhere from snacks to privileges or activities.
Token economy has been found to be very effective in managing psychiatric patients.
However, the patients can become over reliant on the tokens, making it difficult for
them to adjust to society once they leave prisons, hospital etc.
Teachers also use token economy at primary school by giving young children stickers
to reward good behavior.
Educational Applications
In the conventional learning situation operant conditioning applies largely to issues of
class and student management, rather than to learning content. It is very relevant to
shaping skill performance.
A simple way to shape behavior is to provide feedback on learner performance, e.g.
compliments, approval, encouragement, and affirmation. A variable-ratio produces the
highest response rate for students learning a new task, whereby initially reinforcement
(e.g. praise) occurs at frequent intervals, and as the performance improves
reinforcement occurs less frequently, until eventually only exceptional outcomes are
reinforced.
For example, if a teacher wanted to encourage students to answer questions in class
they should praise them for every attempt (regardless of whether their answer is
correct). Gradually the teacher will only praise the students when their answer is
correct, and over time only exceptional answers will be praised.
Unwanted behaviors, such as tardiness and dominating class discussion can be
extinguished through being ignored by the teacher (rather than being reinforced by
having attention drawn to them).
So, if your layperson's idea of psychology has always been of people in laboratories
wearing white coats and watching hapless rats try to negotiate mazes in order to get to
their dinner, then you are probably thinking of behavioral psychology.
Behaviorism and its offshoots tend to be among the most scientific of
thepsychological perspectives. The emphasis of behavioral psychology is on how we
learn to behave in certain ways. We are all constantly learning new behaviors and how
to modify our existing behavior. Behavioral psychology is the psychological approach
that focuses on how this learning takes place.
Critical Evaluation
Operant conditioning can be used to explain a wide variety of behaviors, from the
process of learning, to addiction and language acquisition. It also has practical
application (such as token economy) which can be applied in classrooms, prisons and
psychiatric hospitals.
However, operant conditioning fails to take into account the role of inherited
and cognitive factors in learning, and thus is an incomplete explanation of the learning
process in humans and animals.
For example, Kohler (1924) found that primates often seem to solve problems in a
flash of insight rather than be trial and error learning. Alsosocial learning
theory (Bandura, 1977) suggests that humans can learn automatically through
observation rather than through personal experience.
The use of animal research in operant conditioning studies also raises the issue of
extrapolation. Some psychologists argue we cannot generalize from studies on
animals to humans as their anatomy and physiology is different from humans, and
they cannot think about their experiences and invoke reason, patience, memory or
self-comfort.
References
Bandura, A. (1977). Social learning theory. Englewood Cliffs, NJ: Prentice Hall.
Ferster, C. B., & Skinner, B. F. (1957). Schedules of reinforcement.
Kohler, W. (1924). The mentality of apes. London: Routledge & Kegan Paul.
Skinner, B. F. (1938). The Behavior of organisms: An experimental analysis. New
York: Appleton-Century.
Skinner, B. F. (1948). Superstition' in the pigeon. Journal of Experimental
Psychology, 38, 168-172.