Pscy2o81 Full Notes Content Lecture Labs Experiments Etc
Pscy2o81 Full Notes Content Lecture Labs Experiments Etc
Pscy2o81 Full Notes Content Lecture Labs Experiments Etc
PSYCHOLOGY
assessments
Critical analysis
o Associative learning models
o Presented 2 hypothetical experiments – answer questions identifying and
describing the learning phenomena and behaviours present
o Template to answer questions
Pavlovian conditioning
- Learn about the relationship between events in the world
- Does not require the subject to perform a response to learn association
Instrumental conditioning
- Learn about our behaviour and events in the world
- Requires the subject to perform a response to learn association
Pavlovian conditioning
US – a biologically relevant stimulus that elicits the UR such as food or danger
UR – the response that occurs naturally to the US e.g. fight or flight response to
danger
CS – previously neutral stimulus that following conditioning, predicts the US
CR – the response elicited by the CS following conditioning commonly the same
as the UR
Pavlov’s observations
Examples
examples
Pavlovian conditioning with an appetitive outcome
US= food for dog, UR= salivating, CS= footstep of the dog’s owner, CR= hearing the
footstep and salivating
Real life example: US: park, UR: excitement/happiness, CS: soccer ball in hand
CR=seeing the soccer ball and being happy
Instrumental
positive reinforcement
giving a dog a treat for doing a trick
rewarding your child when they do well in an exam
scholarships for top students
must finish your dinner for dessert
mother giving their child lollies for cleaning up toys
giving your child pocket money for doing chores
all these examples are expected to increase desired behaviour
everyday example: An example of positive reinforcement is asking your child to finish their
dinner to have dessert. Adding dessert is positive reward which will increase behaviour of
your child finishing dinner.
negative reinforcement
sound of no seatbelt put on seatbelt to remove sound
putting on sunscreen when going out in the sun to reduce the risk of skin cancer
positive punishment
slapping child if they steal reduce stealing
fine for driving through red light decrease behaviour
negative punishment
taking away Xbox if they swear
removing play time for a primary student who is disruptive in class
The aim of negative punishment is influence behaviour by removing pleasant stimulus. For
example, removing play time for a primary student who is disruptive in class. By removing
the pleasant stimulus of play time will decrease the student’s behaviour of disrupting in class.
Animal ethics
Animals are used for many purposes by humans
- Food
- Clothing
- Research/medicine
- Pets
- Working animals
It is important to consider the arguments for using animals in research and the ethics
which guide the treatment of animals
Governing principles
All of the experiments have been conducted with adherence to the ethical guidelines
and have been approved by the ethics committees
Introduction lecture
Animal learning
Focus on animal models of learning, cognition and memory
How animal models inform clinical practice
How animal models are useful for neuroscience
The neural basis of clinic disorders
Schizophrenia
Addiction
Feeding
Attachment
Choice
Psychology as a discipline
Ebbinghaus - Remarked that psychology has a long past and a short history
Lamarck’s theory
Organic and inorganic matter are fundamentally different, primarily in that each living
species possesses an innate drive to perfect itself. This drive to perfection results in
changes in its body and in its mentality
The acquired characteristics are hereditary, so that the physical and mental efforts
made across the course of an individual’s life are reflected directly in the physical and
mental organs of their offspring
Lamarckian inheritance
1. Environmental changes can produce new habits
2. New habits produce physical changes: law of use and disuse
3. Physical changes are heritable: inheritance of acquired characteristics
Charles Darwin
Studied medicine at Edinburgh university but was repelled by surgery and bored by
the lectures
Studied at Cambridge university with the aim of becoming a country priest
This occupation would allow him to indulge his passion for natural history
Charles Darwin
After his return in 1836, Darwin wrote several works on geology, edited all the work
done by the various specialist on the zoological material that he had collected, and
wrote a popular account of the voyage
He also gave a good deal of through to how species of animals came to be as they are
Malthus
Argued that in nature plants and animals produce far more offspring than can survive,
and that humans are too capable of overproducing if left unchecked
Thomas Huxley
Comparative neuroatomist
Darwin’s ‘bulldog’ relished his role of public champion of evolutionary theories,
while Darwin remained in the country, often ill and avoiding controversy
Alfred Wallace
Maintained that: natural selection explained the human body but not the human mind
Divine intervention had created life, consciousness in the higher animals and uniquely
human mental faculties (e.g., language, numbers, music, morals, metaphysics)
George Romanes
In 1874 Darwin handed over his notes on animal behaviour to the neurophysiologist.
Romanes, who expanded this corpus of anecdotal evidence, adding some
experimental results
Animal intelligence (1882) reported these observations on a huge range of species
Romanes’
hierarchies for
emotions and
thought
shift to the laboratory study of learning – 20th century
Morgan’s canon
Thorndike’s study of trial-and-error learning
Watson’s behaviourist manifesto
Pavlov’s study of the conditioned reflexes
Skinner’s study of operant conditioned responses
Heinstein’s study of choice
Rediscovery of the mind and rejection of the behaviourism (1960-1970)
Lloyd Morgan
He argued that Romanes had:
Ignored previous opportunities for an animal to learn a behaviour
Confused testable and non-testable inferences from behaviour to mental events
Used unnecessary complex terms to explain behaviour
Edward Thorndike
Aim: systematic, quantitative experiments on trial-and-error learning in animals
Thorndike’s puzzle boxes for cats and dogs
Time from placement in the box to the animal making the effective response plotted
as a function of trial number to produce the first learning curves
Cross species comparisons: no detectable difference
Learning by imitation: no evidence
Memory: no forgetting over time
Law of effect
“Of several responses made of the same situation, those which are accompanied or
closely followed by satisfaction to the animal will, other things being equal, be more
firmly connected with the situation so that, when it recurs., they will be more likely to
recur”
Learning consists in the formation of connections between situations and responses –
animals (and people) learn what to do in a situation – now termed procedural
knowledge (habits)
Satisfaction (rewards) establish these connections while dissatisfaction (punishers)
breaks them
John Watson
Studied maze learning in rats and innate behaviour of birds
In 1913, he published the behaviorist manifesto
Ivan Pavlov
Nobel prize winner in medicine for studied on physiology of digestion
Noticed “psychic-secretions” in his subjects (dogs) and switched to the study of these
secretions
His lectures, published as Conditioned Reflexes (1927), made his work known to the
English-speaking world
Conditioned reflexes
Pavlov implanted a fistula in the dog’s mouth via a small incision in its cheek and
connected the fistula to the tube and finally, to devices that collected the salivary
responses and recorded their occurrence
He was interested in the nature of salivation elicited by food in the mouth and how
this contributed to digestion
However, he noticed that dogs salivated not only when the food was in their mouths
but also when they saw the food or when they saw the attended who regularly fed
them
Why conditioned?
Pavlov called the salivary responses elicited by the food unconditioned reflexes; they
were unconditioned because their occurrence was not conditional on any prior
experience. Any dog salivated the first time that some food was placed into its mouth
Pavlov called the anticipatory responses conditioned reflexes because their
occurrences was conditional on specific history. The dog salivated when they saw the
attendant who regularly fed them but did not salivate when they saw someone else,
equally familiar who had never fed them
The “psychic reflexes” were due to the dogs having learned that the attendant
signalled or predicted the imminent arrival of food
Pavlov’s studies
Substituted stimuli for the attendant to study the development of conditioned reflexes
Every few minutes, a sound was presented for a few seconds and the food was
delivered to the dog. The dog quickly learned to expect the food when it heard the
sound and began to salivate
Critically, food was presented to the dog whether or not it salivates. The dog did not
need to salivate in order to procure the food
Rather, learning that the sound signalled the arrival of food caused the dog to salivate
when it heard the sound
Disinhibition – This refers to the finding that the salivation that has been
extinguished re-appears when the sound is combined with the light. The dog has
learned to inhibit salivation to the sound. Attending to the light removes the attention
required for inhibition of salivation to the sound.
Removing extinction of CR
External inhibition – loss of CR when the CS is combined with a novel CS. This suggests
that the animal pays attention to the novel CS, therefore fails to salivate to CS. E.g., AN
animal learns that tone predicts food and salivates (CR) in presence of tone (CS). Then we
present the tone (CS) with a novel CS. The animal does not salivate to this compound.
Spontaneous recovery. Pavlov found that extinction was not necessarily permanent:
the salivation that ceased to occur because the conditioned stimulus had been
repeatedly presented in the absence of the food re-occurred when the conditioned
stimulus was presented days or weeks later.
animals are very different to humans. findings from animal research may not be
relevant to human beings.
psychological and physical impacts for animals
some types of research are unnecessary e.g. animal testing for new cosmetics -
although cosmetics enhance confidence, they are not essential to life/do not benefit
the individual
animals cannot consent
research might not be successful
What standards need to be met for animal care
What ethical guidelines might you put in place for animal research
minimising the types of research that can be done using animals e.g. remove cosmetic
testing
reduce the number of animals being used in each experiment (just enough to reach
significance in findings)
prohibit the infliction of physical pain e.g. rats being hung by their tail to suspend
them from using it (used in the rat tail suspension paradigm to determine critical and
sensitive periods)
the 3 r’s
replacement
reduction
refinement
examples
Pavlovian conditioning with an appetitive outcome
US= food for dog, UR= salivating, CS= footstep of the dog’s owner, CR= hearing the
footstep and salivating
positive reinforcement
giving a dog a treat for doing a trick
rewarding your child when they do well in an exam
scholarships for top students
have to finish your dinner for dessert
all these examples are expected to increase desired behaviour
negative reinforcement
sound of no seatbelt put on seatbelt to remove sound
positive punishment
slapping child if they steal reduce stealing
fine for driving through red light decrease behaviour
negative punishment
taking away xbox if they swear
WEEK 2 LECTURE 1: PAVLOVIAN AND INSTRUMENTAL
CONDITIONING
BF skinner
studied feeding in rats and arranged that presses of a bar provided access to food
realised that he could study instrumental (or operant) conditioned responses
this “skinner box” allowed automatic scheduling of events and recording of responses
Pavlov’s dogs salivated when they heard the sound that signalled food. They did not
have to salivate to procure the food: the sound caused the salivation
skinner’s rats had to press the bar in order to procure the food: if they failed to press,
then they failed to obtain food
salivary responses, therefore, were involuntarily, elicited mechanistically by the signal
for food: lever presses were voluntary due the animal learning that could procure food
by lever pressing
reinforcement
defined reinforcement in terms of its effects on operant behaviour: it increased the
subsequent likelihood of that behaviour. Thus, anything that acts in this way is a
reinforcer for the contingent behaviour
he found that occasionally reinforcing a response was more effective in maintaining
that response than was reinforcing a response on each of its occurrences
he also studied relations between stimuli, responses, and reinforcement. he called
these relations schedules of reinforcement
schedules of reinforcement
Fixed ratio (number of responses E.g., getting paid for the number of items
reinforced) you make e.g., 100 envelopes
examples of schedules
ratio
fixed: e.g., piece work is where reinforcement (money) is contingent on a fixed
amount of some designated activity e.g., fruit picker filling up a number of baskets
variable: when fishing, casting the line is reinforced (fish biting hook) intermittently,
results in a constant rate of casting
interval
fixed: doing well in an exam (reinforcement) is contingent on studying. Exams occur
at regular intervals. Studying occurs shorty before exam, ceases after the exam, re-
starts before the next exam
variable: poker machines pay off something like a variable interval schedule. This
schedule maintains constant playing
matching law
the matching law and behaviour
Each day we engage in a range of different activities e.g., cook clean, work, meet
friends, study, play sport etc how do we allocate our time?
One idea is that we always try to maximise our benefits and minimise our costs
(contested)
Therefore, we make decisions about what to do by comparing the benefits and costs
of different courses
If one study has a higher value than another, we chose the former. E.g., socialising
over studying
Discount function
WEEK 2 LECTURE 2:
Pavlovian conditioning
Once identified as the salivary responses studied by
Pavlov and the emotional responses observed by Watson and dismissed as of little
general interests
It was the psychology of spits (dogs) and twitches (Watson’s little albert)
Pav conditioning produces a range of responses including approach and withdrawal
Bird video
Colour of light and food
Recognises the colour and tries to eat the colour (knows the association between the
light and the food but acts in an irrational manner)
Sign tracking persisted even when their occurrence prevents the outcome (Williams &
Williams 1969)
Sign tracking: refers to behaviour that is directed towards a stimulus as result of a
learned association between that stimulus and reward. The sign tracking response
develops even through reward delivery is not contingent upon a response
Presentations of the red dot were followed by access to a grain but only if pecks
did not occur. If pecks occurred grain was not delivered
Pecking eventually ceased because grain was not delivered. The cessation of the
pecking meant that presentations of the red dot were followed by access to grain. The
signalling relation between red dot and food reinstated sign tracking (pecking the dot)
which was again exhausted and again reinstated
The pigeons were unable to supress pecking at the CS even when pecking caused the
omission of food
Birds that have approached, courted, and copulated with the CS (stuffed toy)
signalling sexual partner continued to direct these behaviours towards the CS even
when the partner is present
attentional capture in people
pigeons were unable to inhibit pecking, people were unable to inhibit looking at the
diamond, even although looking at the colour caused the omission of the large reward
Every few minutes, a lever is inserted into an experimental chamber for a few
seconds, removed and food is presented to hungry rats
Some rats come to approach, manipulate, and gnaw the lever (sign trackers) whereas
others use the lever as a signal to enter the aperture where food is delivered (goal
trackers)
The individual differences are related to how readily the rats subsequently acquire
drug-related habits: the sign trackers are more likely to become addicted than the goal
trackers
The sign trackers show: more approach to a cocaine cue, more cue-induced and
cocaine-primed reinstatement following extinction of self-administration, greater
preference for a cocaine-associated tactile cue in a conditioned cue preference
procedure, a higher break point for cocaine in a progressive ratio schedule (Flagel,
Akil, & Robinson, 2019)
Sign tracking is held to reflect the processes involved in drug addiction where drug-
related cues acquire so-called incentive salience: the sign trackers are more likely to
imbue the lever with such salience and likewise drug-related cues than goal trackers
Incentive salience refers to motivation for rewards that is driven by both
physiological state and previously learned associations about a reward cue
People and rats are omnivores: almost anything that can be eaten is eaten
The human infant is born with a liking for a sweet taste and a dislike of bitter one but
has no innate preferences for smells
But we come to like a range of flavours (taste-smell experiences)
Rats are fitted with a fistula into their stomach through which nutrients can be infused
and presented with two bottle containing water, one orange flavoured, and the other
lemon flavoured
Drinking the orange flavour is followed by intra gastric (IG) infusions of a nutrient
(glucose, starch, protein) whereas drinking the lemon flavour is followed by infusion
of physiological saline
Rats ingest the orange flavour and show liking responses
The association between the flavour and the nutrient results in the flavour being liked
Drug conditioning of flavour preferences
People come to like the flavours associated with the effects produced by drugs,
such as the flavour of cigarettes, alcoholic beverages, and coffee
In each case, the flavour starts from neutral or even negative hedonic value but
acquires a positive hedonic value because it signals the effects produced by
nicotine, alcohol, or the coffee
Domjan (2007)
Fear
conditioning
Learning to identify cues that signal danger is an adaptation that emerged early in the
evolution of animal life
Such cues allow people and animals to anticipate the danger and respond
appropriately
These responses include arousal, fight/flight/freezing, sympathetic nervous system
arousal, decreased pain sensitivity, protective reflexes
WEEK 2 LECTURE 3
Male fish establish a territory to attract females. They defend the territory against
other males
Males are repeatedly exposed to a stimulus (a light) which signals the appearance
of a rival (located in adjacent glad water tank). They learn that the light signals the
appearance of the rival
Finally, the rival is introduced into their territory after presentation of the stimulus
The males were more successful than controls in the subsequent fights and
therefore more successful in maintaining their territory, attracting females, and
reproducing
Pavlovian conditioning had conferred fitness
Rats can learn that a sweet taste signals pain and will supress ingestion of that
taste
However, this suppression is specific to the place where the taste-pain
experiences occurred
Elsewhere, rats avidly ingest the sweet taste and show liking responses
In contrast, rats who learn that a sweet taste signals malaise refuse to ingest that
taste anywhere and show disgust responses
Flavours associated with nausea become disliked
2 groups, one that learned with the light, and one did not
Contingen
cy: a
future
event of
circumstance which is possible but cannot be predicted with certainty/ a
provision for a possible event or circumstance
Conclusions
Extinction
Learning allows an organism to adjust its behaviour to bring it into line with changes
in the world
Extinction is an example. It occurs when the positive contingency between a CS and
US is broken by exposures to the CS in the absence of the US
The responses elicited by the CS (approach or withdrawal) decline across the CS
alone exposures and eventually cease to occur
Responding to the CS is said to be extinguished
Extinction is used to model cue exposure which is a component of cognitive
behavioural therapy (CBT)
Clinical trials have shown that CBT is more effective than other therapies in the
treatment of anxiety disorders (e.g. PTSD)
The cure exposure component of CBT consists in the patient confronting trauma
related cues (the CS) in the absence of the if any overt danger (the US)
The aim of these confrontations (CS alone exposures) is to reduce the ability of the
trauma-related cues to elicit the fear memories which are destroying the patient’s life
The restoration of extinguished responses
Examples
1. Recovery of fear
You train a rat to fear a light CS by presenting it with an electric shock. Every time the light
flashes an electric shock is elicited causing the rat to show a conditioned response of fear
such as increased HR. Over the next 4 days you extinguish the CR of fear by presenting the
flashing light without an electric shock. A week later you place the rat back into the box and
present the flashing light alone and the rats performs the CR of fear. This restoration of fear
after a period and extinguishing is a demonstration of fear recovery.
2. Fear is renewed
In the first condition (context A) rats put into a green box and trained with light as the CS
which elicits shock (US). Every time the light is turned on, the rat receives a shock which
causes it to respond with the fear response increased heart rate.
Over the next few days, you train the rat in another blue box (Context B) and extinguished
the CR by presenting the light without the electric shock. The following day after training in
context B, you put the rat back into context A and present the light without the shock,
however the CR of increase heart rate returns. The return of the CR in context A with the
presentation of the light demonstrates fear renewal.
3. Fear is reinstated
You train a rat to fear a light CS by presenting it with an electric shock. Every time the light
flashes an electric shock is elicited causing the rat to show a conditioned response of fear
such as increased HR. Over the next 4 days you extinguish the CR of fear by presenting the
flashing light without an electric shock. The following day, you present the shock to the rat
without the presence of the flashing light (US). After a period of time, you train the rat with
only the presence of the flashing light and no shock, and the rat shows the CR of fear. This an
illustration of fear reinstatement.
The omission of the predicted US constitutes the error which initiates the inhibitory
learning that underlines the reduction in conditioned responses (fear responses)
The size of the error will determine the amount of the inhibitory learning, just as
it is the size of the error which determines the amount of excitatory learning (e.g., the
blocking effect).
Evidence for this comes from designs in which one CS (e.g., a noise) is strongly
conditioned, a second (e.g., a light) is moderately conditioned, and a third CS (e.g., a
tone) is weakly conditioned.
Half of the subjects are then extinguished to a compound composed of the light and
the noise, while the remainder are extinguished to a compound containing the light
and the tone.
Subsequent test of the light shows that it elicited less fear (i.e., greater extinction)
when it had been extinguished in compound with the strongly conditioned noise than
with the weakly conditioned tone
The expectation of the feared outcome elicited by the noise-light was greater and,
hence, the error when that outcome did not occur was also greater.
Instrumental condition
Can we learn about relations between any behaviour and its consequences?
In Lewis Carroll’s Through the Looking-Glass, there is an incident in which the Red
Queen and Alice are constantly running but remaining in the same spot.
"Well, in our country," said Alice, still panting a little, "you'd generally get to
somewhere else — if you run very fast for a long time, as we've been doing.”
A slow sort of country!" said the Queen. "Now, here, you see, it takes all the running
you can do, to keep in the same place. If you want to get somewhere else, you must
run at least twice as fast as that!"
Action-outcome associations
Any point above the diagonal line means that the outcome (reinforcement) is
more likely if the action has occurred than if it has not occurred: the action, then,
causes the outcome.
Any point below the diagonal means that the outcome is more likely if the action has
not occurred than if it has occurred. The action prevents the outcome
The diagonal line means that the action and outcome are unrelated.
Points on the diagonal may lead to the phenomenon of learned helplessness
where learning that outcomes are unrelated to your actions: 1) makes it more
difficult to subsequent learn that outcome are contingent on your actions; 2) makes
you less inclined to engage with the problem; and 3) mood changes (depression).
conclusions
Animals and people are sensitive to the contingency between an action and an
outcome. They take into account not only the likelihood of the outcome when the
action occurs but also the likelihood of the outcome when the action has not occurred.
Learning mechanisms are designed to detect predictive relations that exist between
events (Pavlovian conditioning) and the causal relations that exist between actions
and events (instrumental conditioning).
Mechanisms of performance
Incentive learning
One idea is that animals and people learn about the value of an outcome (e.g., food)
relative to an internal state (hunger-satiety) and that it is this value which controls
the action that produces the outcome.
Animals and people rate food as hedonically pleasant when they are hungry and
hedonically neutral or mildly unpleasant when satiated.
Effectively, animals and people learn that food tastes good when hungry but not when
satiated.
Hungry rats learn that pressing a bar produces a novel food. Then half of the rats
(Groups A and B) are provided with this food in their home cages where they can eat
as much as they want; the remainder (Groups C and D) are provided with their
familiar lab chow in the home cages.
Finally, all rats are tested for how much they press the bar.
Rats in Groups A and C are tested hungry. They show substantial pressing.
Those in Groups B and D are tested satiated. Rats in Group D show just as much
pressing as those in Groups A and C while those in Group B refuse to press the bar
Associative learning
Encode statistical regularities in the environment
Allows organisms to build up representation of environment
and to alter/adapt behaviour to exploit statistical regularities in the environment (gain
rewards, avoid punishments)
Pavlovian/classical conditioning
Learning about signalling relationships
E.g., McDonalds sign signals greasy food = salivating
Instrumental/operant conditioning
Learn about action-outcome relationships
Perform responses that lead to positive outcomes (rewards)
Avoid responses that lead to negative outcomes (punishments)
E.g., studying hard = good grade
Contingency
E1 E2?
Does whether E2 happens or not depend on whether E1 happens or not
Does E2 depend on E1?
Is E2 contingent on E1?
Summary
o Contingency: how strongly are events related?
o Learning is influenced by contingency
o But that is not the whole story
Learning is gradual even though AP is constant, “learning curves”
o Maybe learning is not the same as calculating contingency
o What happens when people learn about more than one cue at a time?
Vegemite example
After establishing a strong associative strength – what happens if we eat vegemite, and it
doesn’t make us sick?
Acquisition and extinction
Changes to alpha and beta
The values we chose don’t matter too much; the general pattern of
learning is the same in each case
An alternative view - the propositional nature of human associative learning (Chris Mitchell,
Jan De Houwer & Peter Lovibond)
propositional account
Mitchell et al (2006)
Mr X experiment
Asked to identify 2 types of questions
1. What illness followed bread during training? Options: Daryosis,
Xianethis etc
2. To What extent did the food cause this illness on a scale of 0-8?
Summary
o Its still not clear: more research is needed
o Seems likely that both types of process operate
Simple experiments encourage “thinking about it”
More complex experiments discourage thinking about it, so more
automatic link formation mechanisms dominate
Acquisition
Process of learning
The type of learning can lead to either an increase of decrease in behavior
Pavlovian conditioning
Repeated contingent presentations of the CS with the US
Cs becomes associated with the US
The subject has acquired the CS-US association
Extinction
Repeated presentations of the CS without the US
Produces a gradual decline in the conditioned response
At test the conditioned response is very small or unobservable
E.g., rat, light and electric shock
Acquisition = fear response when the light is presented alone
Extinction = no fear response when light is presented alone
The RW model
Computational model that allows us to stimulate learning phenomena
It is a trial model that calculates the change in associative strength for each trial
The total associative strength must be split between all stimuli present on any trial
The model describes negatively accelerated learning (on each trial you will learn
less)
V = associative strength
blocking
over expectation
Conditioned inhibition
Lab demonstrations of EC
Summary so far
o Evaluative conditioning is real
Not just a consequence of demand characteristics
Pairings produce changes in automatic evaluations of stimuli: “genuine”
liking/disliking
Che
motherapy can induce CTA
This will be bad if patients learn an aversion to foods that are good for
them
Is there anything we can do to prevent this?
WEEK 4 LECTURE 2:
Predictiveness principle: pay more attention to cues that are most accurate predictors
of outcomes
Latent inhibition
Rascle et al 2001
e.g., “count how many TOG and PEC appeared on the screen”
the square signalled the beep
(Marissen 2006) ^
Anderson, Faulker, Rilee, Yantis & Marvel (2013)
Individual differences in reward related attentional bias determine
susceptibility to addiction?
Attentional bias towards reward-related stimuli can be automatic
(Anderson et al 2011)
Instruction may not be enough to reduce maladaptive biases
May require re-training
Relearn to allocate attention in adaptive way
Eberl et al (2013)
Retraining procedure with alcohol dependent individuals
Reduced relapse rates one year by approx. 10%
Summary
o Attention and learning are intimately related
o Attention is influenced by learned predictiveness
Predictiveness principle: greater attention to most predictive stimuli
o Attention is influenced by learned reward value
Our attention system automatically prioritizes processing of stimuli that
have previously signalled high value reward
Learned attentional biases to reward-related stimuli may be implicated in
addiction
Aberrant salience
Dopamine neurons mediate attribution of “motivational salience”
Psychotic patients have excess dopamine
Stimuli attributed too much salience in psychosis. (Kapur 2003)
Hallucinations: internal representations have abnormal salience
Delusions: attempts to make sense of abnormally salient events
Chadwick (2001)
Schizophrenia and aberrant salience – Morris, Griffiths, Le Pelley & Weikert (2013)
Delusional schizophrenic associated with reduced ability to learn to ignore irrelevant
and inconsequential stimuli
- Aberrant salience
- May contribute to hallucinations and delusions
WEEK 5 LECTURE 1: MOTIVATION – GENERAL CONCEPTS
1. Regulation
2. Incentives
Regulation
Motivation behaviours can be understood in terms of homeostasis or
homeostatic mechanism:
Homeostasis (Bernard 1865) is the maintenance of a steady internal state,
this state is the condition of optimal functioning for the organism keeping
critical biological systems within certain some pre-set limits
These limits are setpoints – optimal operating characteristics for that
system
Behaviour is motivated by deviations and serves to return the system to
the set point e.g., eating when we are hungry
These are negative feedback systems
A critical feature of negative feedback is that it tends towards stable
equilibrium (i.e., set point)
1. Detect
2. Compare (actual v model)
3. Correct any error between actual and model via behaviour
homeostasis and negative feedback are fundamental principles of biology. By framing
our theories of motivation in the language and ideas of homeostasis, we are moving
close to a unified biological and psychological explanation of behaviour
our brain has a ‘model’ of the world in terms what a parameter should be (set point or
goal), deviations from this model initiate behaviours to return the parameter to the set
point. Learning can allow us to generate compensatory responses in advance of any
deviation, thereby minimizing impact of these disturbances and more effectively
defending homeostasis
this is the ‘thermostat’ model of motivation – It only reacts to changes
1. Detect
2. Compare (actual v model)
3. Correct any error between actual and model via behaviour
4. Learn about the antecedents in error in the parameter (i.e., what caused perturbation)
5. Predict the perturbation and respond in anticipation of this
Perturbation is a deviation of a system caused by an outside influence
Learning in the form of pavlovian conditioning can allow us to predict when
biological systems might be perturbed and allow us to mount a conditioned response
that defends homeostasis “before” any deviation occurs
Example – Siegal’s theory of compensatory conditioned response (or conditioned
opponent process)
Learning can allow us to generate compensatory responses in advance of any
deviation, thereby minimizing impact of these disturbances and more effectively
defending homeostasis
Beyond learning: social facilitation of eating (De Castro & De Castro 1989)
whereas homeostatic mechanisms push us unto action to satisfy our needs, incentives
pull us towards or repel us from them to satisfy our wants. They are motivational
magnets
incentives are properties of the world, and their significance can be ‘innate’ (primary
incentives) or learned (secondary incentives)
they can be positive or negative
primary positive incentives include food, water, attachment, sex
negative primary incentives include pain, illness
secondary positive incentives include money, drugs, social status, achievement (e.g.,
your motivation to do well in this course)
secondary negative incentives include fear, loss, risk
other things being equal, we seek positive incentives and avoid negative ones
Overview
what is addiction?
Addiction is not recreational drug use, but some proportion of recreational drugs users
will be ‘addicted’
APA, DSM-IV diagnosis of ‘substance dependence’
an individual must exhibit 3 of the following in the past 12 months:
tolerance – refers to reduction in the effectiveness of the drug
withdrawal symptoms – changes in physiology that are associated when the drug is
not taking
increasing doses
unsuccessful effort to reduce the intake of the substance
a considerable amount of time spent in obtaining the substance or using it
interference with important social, occupational, or recreational activities
continuation of use of the substance despite recognition of the fact that doing so
causes physical and psychological problems
addiction – to both licit (e.g., nicotine, alcohol) and illicit (e.g., opiates, cocaine) is a
leading cause of death and disability worldwide
Fowler et al (2001)
High levels of dopamine located around the nucleus accumbens
Cocaine increases release of dopamine in the nucleus accumbens
The brain has circuits which when activated give rise to the sensation of please (the
pleasure pathway)
This pleasure is both “liked” and “wanted”
These brain circuits, normally activated in response to naturally occurring rewards
(e.g., water, sex, money etc) are ‘hijacked’ by drugs of abuse
Incentive sensitisation (Robinson & Beridge)
Advantages
Attempts to integrate a psychological explanation with dopamine function
obvious link to neurobiology
consistent with the important role accorded craving by addicts
does not accord drug withdrawal a causal status in addiction and thus could be applied
to many drugs
limitations
lack of clinical evidence for sensitisation of drug responded among human addicts
not much evidence for dissociations between wanting and liking in human addicts or
in animal based on drugs of abuse
Initially we take drugs because they are rewarding (recreational use), eventually
we take drugs to alleviate withdrawal
Advantages
Explains many features of addiction
Would meet DSM criterion for diagnosis of substance depended
Does not accord drugs of abuse any special role
Can be reconciled with neurobiology
May have fundamental importance and relevance to understanding causes of opioid
overuse
Disadvantages
Fails to explain persistent drug taking in the absence of a withdrawal state (e.g.,
cannabis, psychostimulants)
Tolerance is not an inevitable consequence of drug exposure
The model system (morphine analgesia) has little relevant to understanding addiction
treating addiction
Psychology
If learning is important, surely, we can reduce the power of a drug associated stimuli?
E.g., via extinction such as cue exposure in cognitive behavioural therapy
Sell et al (1999)
Does cue exposure work?
Conklin & Tiffany (2001)
Contingency management
A type of behavioural therapy in which individuals are rewarded for not taking drugs
Types of contingencies
Paid in vouchers for negative drug test
Allowed to work after negative drug tests
Urine samples are collective multiple times each week (to detect brief periods of
abstinence) and abstinence is reinforced each time negative samples are submitted
Can increase reward across time
Liussier et al (2006)
One of the most effective methods
Heilig et al (2016)
Symptoms
Positive symptoms (something added to those with schizophrenia that is not present
in the general person)
Hallucinations; audio and visual
Delusions e.g., people believe that others are trying to kill them
Irrational thoughts and beliefs e.g., thinking that there are messages left for you on tv
or the news
Negative symptoms (absent in those with schizophrenia)
Reduced affect (emotion)
Catatonia (complete absence of movement or anything related to movement)
Cognitive symptoms
Deficits in working memory
Inability to perform the stroop task
Blocking in schizophrenia
2. How does the dopamine hypothesis and aberrant salience explain the positive
symptoms of schizophrenia? (Kapur 2003)
Hypothesis
People with schizophrenia will fail to discriminate between relevant and irrelevant
cues
The amount of learning about the irrelevant cues will be related to the severity of
symptoms
mother – infant
attachment: hormonal
and brain mechanisms
Vasotocin
Vasotocin has evolved into two distinct hormones: oxytocin and vasopressin
Oxytocin and vasopressin are important for mammalian reproductive and attachment
behaviours
Dam will pick up her own pups to ensure that they survive
Naïve virgin rat will not
Kozorovitskiy et al (2006)
What variables determine the formation of an emotional bond between infants and
their parents?
Historically there have been very different views about how infant – parent bonds are
formed and what the consequences of these bonds are
Harlow 1958
no evidence that nursing (feeding) is a critical variable in infant monkey attachment to
a mother
contact comfort is a far more important variable than nursing in determining infant
attachment as measured either by time spent with surrogate or by responses to a
fearful stimulus
later research showed that many other variables were just as important as contact
comfort (e.g., rocking and cradling)
led to the development of attachment theory
sensitive responding by the parent to the infants needs results in an infant who
demonstrated secure attachment, while lack of such sensitive responding results in
insecure attachment
secure infants either seek proximity or contact or else greet the parent at a distance
with a smile or wave
avoidant infants avoid the parent
resistant/ambivalent infants either passively or actively show hostility toward the
parent
increasing vasopressin
in montane/mountain
voles
adult attachment
Has a strong biological basis
The same hormones which mediate parent-infant interactions contribute to formation
of stable long-term monogamous relationships in adults
Oxytocin: mother infant attachment and monogamous relationships
Vasopressin: father infant and monogamous relationships
Different in mating strategies are due to at least in part to differences in levels of these
two key hormones and their receptors
What do these studies of infant, maternal and romantic attachment tell us about
attachment motivation?
Behaviour: approach the individual
Learn the identity of that individual
Invest in this individual while rejecting others
Biology: oxytocin and vasopressin are important for attachment behaviour in
mammals, including humans
Interactions between these hormones and reward circuit are important for attachment
behaviours (i.e., for approaching and learning about the individual)
There are shared behavioural and brain mechanisms for different kinds of attachment
and love
These mechanisms are slightly different for males and females within a species
Although there are some important differences, the behavioural and brain mechanisms
for attachment are also shared across mammalian species, including humans,
indicating that they have been conserved evolution history
The brain mechanisms for attachment and love overlap with those underlying other
motivated behaviours
Activation of one receptor leads to the inhibition of the other in the pair
The first member of the pair (blue) is excited and then fatigued
When the blue is fatigued, inhibition of the opponent (yellow) is reduced
This leads to seeing the opposite colour and produces the after image
The A process
The A process occurs in response to the administration or ingestion of the drug
itself
The physiological repones that occurs to a particular substance – this will change
depending on the type of drug e.g., heroine vs amphetamine
It can be thought of as the process which produces the feeling of euphoria or the
rush of consuming the drug
The A state never changes, it will always produce the same response to the drug.
It is the B process that changes over time
The B process
It produces the opponent physiological effect to that of the drug
The B process is slower to activate and lasts longer than the A process
As the figure displays, over time the A process remains the same, but the B process
grows in length and strength of the activation. It can be thought of in the same that a
muscle grows with repeated use
The B process is thought to explain, tolerance, withdrawal, and craving
UR is the opponent process your body produces to deal with the effects of the
drug
E.g., a decrease in HR/blood pressure from amphetamine
This process is designed to bring the body back to homeostasis
CR is the opponent process your body produces to deal with the effects of the
drug
E.g., decrease in HR/blood pressure for amphetamine
This process is designed to bring the body back to homeostasis
The CR occurs following exposure to the CS alone
So they intensify craving and explain withdrawal and tolerance
Discussion questions
1. What are the behavioural similarities and differences between drug addiction and
attachment relationships?
Similarities Differences
Overtime the B process grows Cues lead to increase behaviour of
leads to increase behaviour i.e., taking drugs
taking more drugs, spending more
time with loved ones
Participate in the behaviour to
alleviate withdrawal (i.e.,
withdrawal from drugs and distress
from not seeing loved ones)
2. What are the brain similarities and differences between drug addiction and attachment
relationships?
Similarities Differences
Mechanism is the same for addiction Physiological response never
and attachment both are changes (A process) – always
associated with a positive and produces the feeling of euphoria
negative outcome when taking drug
Increased in dopamine to reward Excitement and affective response to
your loved one is not the same
(opponent process and attachment)
Withdrawal state in drug taking is
motivation enhances incentive
value of drug
Attachment involves oxytocin and
vasopressin
Pavlovian conditioning
Why is predictive learning important?
Learning about stimuli that predict important events is critical for successful
adaptation. It allows us to use the past and predict the future and thereby behave
accordingly in the present
Pav conditioning is used to study how we learn about these predictive relationships,
specifically, it is used to demonstrate how we learn that certain stimuli in our
environment predict biologically significant events
Pavlovian conditioning is expresses as reflexive behaviours, rather than voluntary
behaviours. This means that the subject is a passive observer and does not need to
interact with its environment
A classic example is Pavlov’s dogs that began salivating to a bell when they learned it
predicted the arrival of food
Various types of
Pavlovian
conditioning
studying fear conditioning in the lab
The hippocampus
Rats received context fear conditioning: they were placed in a conditioning chamber
and received a shock a few seconds later
4 groups
G1: received control sham lesion of the hippocampus before (control group)
G2: received a lesion of the hippocampus before conditioning
G3: received control sham after conditioning
G4: received lesion of the hippocampus after conditioning
All groups displayed a similar amount of fear conditioning: this implies that lesion of
the hippocampus before conditioning (g2) did not prevent freezing response
Results are surprising impairment in G4 suggest that the hippocampus is
required for context fear conditioning. But if this was true G2 should also have
been impaired
Group 2
The hippocampus is lesioned during conditioning rats cannot form a
configural/unique representation of the conditioning chamber single elements are
processed independently in the cortex and are associated with shock associations
retrieved at test and the animals displayed fear
Group 4
The hippocampus was intact unique representation if formed and was associated
with shock
Hippocampus was lesioned after conditioning prevents the retrieval of the unique
context representation no fear displayed
Processing the US
Just like the CS, the brain regions involved in processing the US depends on the
nature of the US being used
Foot shock somatosensory thalamus and somatosensory cortex
Structure located in the medial temporal lobe and plays an important role in the
processing of emotional information
Results:
blocking NMDAr before training impaired the acquisition of fear to the tone and
the context: animals that were infused with either a high dose or a low dose of
ifenprodil showed less fear than animals infused with vehicle.
By contrast, infusions of ifenprodil before the tests,as seen from the graph on the
right, did not impair expression of fear to either the tone or context, as animals that
were infused with both doses of the drug showed as much fear as control animals.
Data indicates that NMDAr activation in the BLA is required for the acquisition but
not the expression/retrieval of fear
Summary
BLA is required for:
Acquisition of conditioned fear
Consolidation of conditioned fear
Retrieval/expression of conditioned fear
one model of Pavlovian fear conditioning is that the BLA and CeA function serially,
where the CS-US association is formed and stored in the BLA
Subsequent retrieval of the memory in the BLA then activates the CeA, which
coordinates fear responses through its projection to the brainstem.
Although this serial model of the amygdala has been widely accepted, current
evidence suggests that it is an oversimplified view.
he main issue with this model is that it implies a very restricted role for the CeA. In
fact, the role of the CeA may be more important than originally thought.
Summary
Formation of the CS-US association occurs in the BLA
Activity in the BLA is important for acquisition, consolidation, and expression of
conditioned fear
NMDAr activity in the BLA is critical for acquisition
Protein synthesis in the BLA is critical for consolidation of conditioned fear memories
Seriel model of of the amygdala suggest that the CeA is important for the expression
of conditioned fear, but this view may be oversimplified and the role of the CeA
might not be restricted to fear expression
WEEK 9 LECTURE 2:
Instrumental conditioning
Topics covered
Goal directed behaviours vs habitual behaviour
How we assess whether behaviours are goal directed or habitual in the lab
Neurobiology of goal-directed behaviour and habitual behaviour
Instrumental behaviours
Pavlovian learning is expressed as reflexive rather than voluntary behaviours
However, we also learn about predictive relationships by interacting with our
environment through volitional or voluntary behaviours
Instrumental behaviours are ones in which we perform voluntarily e.g., driving home
from university
Instrumental learning occurs as a result of performing actions and learning about
their consequences
Goal-directed actions
Goal directions are those that we and other animals perform in order to satisfy our
basic needs and desires
Goal directed actions have 2 main characteristics
1. They rely on the presence of causal relationship with their occurrence i.e.,
contingency requirement you perform an action because you know that doing so
will produce the particular outcome you have in mind e.g., press coke button on
vending machine = receiving coke
2. They depend on value attributed to their consequence i.e., goal requirement –
outcome of performing that action is valuable to you e.g., press button for coke
because of thirst drinking coke will quench thirst
Goal directed actions are said to be driven by action-outcome (A-O associations)
GDA are essential to survival, allowing use to interact with the environment and they
are flexible. However, they are cognitively demanding
Habits
Habits emerge after GDA have been overtrained
Habits are insensitive to contingency and value requirements
Driven by stimulus-response (S-R associations)
For example, if you have been working on level 5 of your work building for years and
years and one day you are moved to level 7. However, you find yourself getting to
work, going on the lift, and hitting the button for level 5. Pressing 5 on the button has
become a habit.
The antecedent stimuli such as being at work and pressing the same button for the lift,
seeing co-workers, same time of day. all the antecedent stimuli elicited the response
of pressing level 5 instead of 7
Habits occur without cognitive oversight, freeing up resources for other tasks
Important for refining and perfecting motor skills; stable and long lasting
Relatively inflexible and it can be difficult not to perform them once they’ve been
established if our circumstances change
The S-R association that leads to the performance of habits can also lead to
maladaptive behaviours such as addiction
Assessing instrumental behaviours in the lab
Outcome devaluation
Used to assess whether behaviours are GD or habitual
3 stages
1. Trained to perform 2 different actions for two different outcomes e.g., right lever =
food pellet, left lever = sucrose solution
2. Devalue the outcome (devalue = the reduction or underestimation of the worth or
importance of something)
2 main approaches
Sensory specific satiety – involves giving the animal free access to one of the foods
either the pellets or the sucrose. The animals fill themselves up with the specific food
and don’t want anymore
Once outcomes are devalued test is given immediately
Conditioned tasted aversion (CTA)
One of the outcomes is given freely to the rats for some period of time e.g.., 30
minutes
They usually fill up on it
Immediately after they are injected with lithium chloride makes them feel mildly
sick
Appeal of eating food is paired with sickness has been completely diminished
tested on the day after the last pairing of the food and the lithium chloride
3. Choice test
Given the opportunity to press the two levers
Successful devaluation is shown when the rats selected the “valued” lever – the one
that delivered the foot outcome they were not sated on or the one that was not paired
with lithium chloride (the one that delivered the food outcome that has now been
devalued)
It’s important to note that these outcomes are conducted in extinction – where no
outcomes are delivered this ensures the performance seen at test reflects the
animal’s knowledge of what was encoded during the training stage and precludes the
animal from using any immediate feedback to adjust their actions
Lab example
Rats were trained to perform two actions that led to two outcomes
They were given outcome devaluation by specific satiety and were immediately tested
in a choice test
Results
Devaluation effect this means that the selectively chose to perform the action that
led to the still valued outcome or the one they were not sated on
This devaluation effect is indicative of GD behaviours
Contingency degradation
During contingency degradation, rats might initially learn that pressing the right lever
produces food pellets and pressing the left lever produces sucrose solution.
In positive contingency – the probability of earning the outcome given performance
of the action is greater than the probability of gaining the outcome in the absence of
the action
Degrade contingency arranging it so that the outcome is freely delivered without
having to press the lever e.g., the left lever might still earn sucrose solution, but the
sucrose solution is also delivered into the magazine without having to press the lever
the probability of the outcome given the action is the same as earning the outcome
in the absence of performing the outcome
When contingencies are degraded, GD animals will cease performing the action
with the degraded contingency
Same refers to the action that earn the same outcome as the freely delivered on or the
degraded action
Different refers to the action that earned the outcome that is different to the freely
delivered one or the non-degraded one
Non-degraded actions is referred to as a contingency degradation effect indicative
of GDA
Overtraining
Varying the amount of training the rats receive
E.g., three groups of rats were trained to learn that two actions led to two different
outcomes
R1 O1
R2 O2
Groups varied in training
One group received 2 days of training
Second group received 5 days of training
Third group received 20 days of training
Training was followed by devaluing one of the outcomes by specific satiety choice
test
Results:
Rats that received 2 and 5 days of training showed devaluation effect Respond
more to the non-devalued lever
Rats trained for 20 days did not show the devaluation effect responded
equivalently on the devalued and non-devalued lever overtrained rats were
insensitive to outcome devaluation demonstrating that overtraining an action
makes it habitual
Interim summary
Instrumental behaviours are those in which we interact with our environment to
produce outcomes
Instrumental Behaviors can be GD or habitual
GD behaviours are sensitive to outcome devaluation and contingency degradation
Overtraining and interval schedules caused GDA to become habits
GD behaviours require the interactions of several brain regions – these regions are the
amygdala, the prelimbic region (PL) of the medial frontal and the dorsal striatum
The striatum is the rodent homologue to the human caudate/putamen
It receives dense projections from the cortex and processed information from the
thalamus specifically, the medial aspect of the dorsal striatum (DS) receives
projections from the prelimbic cortex as well as the BLA
The medial aspect of the DS, PL and the BLA are all important for different
aspects of GD behaviours
Prelimbic cortex (PC): contingency
PL sends dense projections to the striatum (specifically the posterior aspect of the
DMS)
The DMS has been shown to be critical in their acquisition and plays a particular role
in updating actions based on current action-outcome contingencies
Experiment:
Rats were given either sham lesions, lesions of the posterior DMS or lesions of the
anterior DMS and trained to perform two actions for 2 outcomes
One of the action-outcomes were degraded freely delivered
Rats were given a choice test
Sham lesions:
Responded more on the non-degraded lever (contingency degradation effect)
Anterior DMS:
Showed similar pattern to the sham rats
Posterior DMS:
Did not show contingency degradation effect
Responded similarly for both levers showed difficulty in updating the action-
outcome contingencies
pDMS is necessary for the acquisition of GD behaviours
lesions of the pDMS also impair choice between the actions after the outcome
devaluation
rats were given either sham lesions, anterior DMS lesions or pDMS lesions and were
then trained that R1 O1, R2O2
one of the outcomes were devalued by specific satiety choice test
sham rats:
no different in responding before the outcome devaluation
similar responding during training
clear outcome devaluation effect selected the still valued outcome
anterior DMS:
similar pattern to the sham rats
pDMS:
did not show devaluation effect providing further support for the role of the
pDMS in the acquisition of GD behaviours
BLA: contingency
lesions of the BLA have been shown to impair GDA as assessed by contingency
degradation
BLA or sham lesions were made prior to training the rats on 2 action outcome
contingencies one was degraded choice test in extinction
Sham rats showed sensitivity to contingency degradation decreased responding on
degraded lever
BLA rats showed insensitivity to contingency degradation responded similarly
to both levers
BLA has a role that is distinct from the roles played by the PL and the pDMS
The BLA processes changes to outcome value which is critical for GD behaviour
important in assigning incentive value
Rats were trained with two levers that predicted 2 distinct food outcomes one was
devalued by specific satiety
Immediately before devaluation rats were infused with either NMDAr agonist,
ifenprodil or vehicle into the BLA which impaired the expression of GD behaviour
Results:
Data from rats given pre-devaluation infusions are presented here. Vehicle treated rats
showed outcome devaluation, pressing more for the valued lever than the devalued
lever.
However, infusions of ifenprodil into the BLA impaired the expression of goal-
directed behaviours. These rats showed similar responding on the devalued and
non-devalued levers
Importantly, it is the timing of the infusion suggests that the BLA is important for
encoding outcome value as the NDMAr antagonism occurs during the specific satiety
manipulation. This suggest that the impairment caused by the BLA manipulation may
not be because of inability to retrieve instrumental contingences
In support, in another group of rats, these infusions occurred before testing rather
than before devaluation
When the infusions occurred before the choice test both vehicle and infendropil
infused rats showed a devaluation effect behaviour was GD despite the blockade of
NMDAr in the BLA this strengthens the view that the BLA is not involved in
encoding instrumental contingencies
These data suggest that the BLA is involved in encoding outcome value or assigning
incentive value retrieving this value is essential for the expression of GD
behaviour thus, the BLA is not directly involved in GDA but it provides
information that is necessary to display the behaviour
Interim summary
PL is necessary for acquisition but not the performance of GDB
pDMS is necessary for the acquisition and performance of GDB
BLA encodes outcome value which is required for the performance of GDB
undertrained rats:
sham lesions showed an outcome devaluation effect
rats with IL lesions showed a similar devaluation effect IL lesions do not impair
the acquisition of GD behaviours
PL rats responded similarly for both outcomes supports the role of the PL in the
acquisition of GDB
Overtrained rats:
Shams rats showed insensitivity to outcome devaluation similar levels of
responding for both outcomes – suggest that overtraining made this response habitual
IL rats showed GD performance responded more to the non-devalued lever –
suggest that IL is necessary for habits
PL rats had no effect on habit performance – showed insensitivity to outcome
devaluation
The DLS has been shown to be critical for the expression of habits
Rats were trained to press a lever for sucrose solution over many days until there was
a stable and high rate of responding
Rats were given an omission schedule – pressing the lever delayed the delivery of the
sucrose solution rats had to withhold their responding to access the outcome
Control rats received no contingent relationship between the action and outcome
Immediately before omission training rats were infused with either muscimol or
saline into the DLS tested in extinction
Control rats:
Control rats showed insensitivity to the imposition of the omission contingency and
failed to withhold responding during omission training in order to earn the sucrose
solution
Loss of control is indicative of habitual responding mediated by S-R associations
Rats were given lesions of the anterior and posterior CeA or sham lesions before
being overtrained on a response that earned sucrose solution
Devaluation conditioned taste aversion extinction test
Lesions in the anterior CeA restored sensitivity to outcome devaluation despite
overtraining
Restoration was not seen in the sham rats of the posterior CeA rats
Suggest that the anterior CeA is necessary for the acquisition of habits
Summary
PL is necessary for acquisition but not performance of GDB
pDMS is necessary for acquisition and performance
BLA encodes outcome value which is required for performance of GDB
The IL/DLS is necessary for the acquisition and retrieval/expression of habits
The CeA interacts with the DLS in the acquisition of habits
GDA and habits are supported by two independent and parallel systems two forms
of learning co-exist and compete for control
Although the figure suggests a decline in the A-O associations, GDA are not removed
and replaced by habits
For example, driving to work – although the behaviour is done seamlessly because
you drive every day, when you drive past a cop car and your behaviour changes it
is habitual and changes back to goal directions behaviours in an instant
On a neural level, these systems that drive GD and habitual behaviours function in the
same one parallel, independent and compete with each other disruption GD
related structures made behaviours habitual, disrupting habit related structures made
behaviour GD
Topics covered:
Influence of Pavlovian stimuli (CS) on instrumental behaviour
Pavlovian to instrumental transfer (PIT): general and specific
Effects of change in primary motivational states and outcome value
Neural circuity underlying general and specific PIT
General PIT
A stimulus that predicts food enhances performance on action that procures food
E.g., sitting at desk and notice that the time on the clock is almost 12 this time is
associated with lunch and therefore predicts food go into kitchen or buy food
E.g., smell food that your colleague has brought motivates you to buy food
These examples show that a stimulus (time of day, smell of food) prediction food
enhances performance on actions procuring food – this is general PIT effect
This technique is used in advertising/marketing
Specific PIT
The content of these stages differs between general and specific PIT
General PIT – Pavlovian stage
One stimulus predicts the food outcome and the other predicts no outcome
E.g., tone predicts food and clicker predicts nothing (S1 O1)
This training takes place over the course of a few days
Pavlovian Instrumental training
Lever press earns a different type of food e.g., sucrose solution
Action 1 leads to outcome 2 (A1 O2)
Transfer test
S1 and S2 are presented and the amount of responding on the lever is measured in the
presence of each stimulus
This is the first time that the lever and stimuli are presented in the same session
Results: more pressing on the lever when S1 is turned on – when this happens, it is
referred to as a “general transfer effect”
Specific PIT
2 outcomes are used in each stage
Pavlovian stage
Tone predicts food and noise predicts sucrose solution
S1 O1 and S2 O2
Instrumental training
2 different actions lead to two outcomes
Left lever = food
Right lever = sucrose solution
A1 O1
A2 O2
Transfer test
Each stimulus is turned on and the amount of responding is measured
Results: when S1 is on there is more responding on A1 and when S2 is on
more responding on A2 than A1
In other words, a stimulus predicting a particular outcome increases performance
on an action delivering the same outcome
Specific PIT
Comparing responding on two different levers during 3 different times
Responding to the no CS is very limited – considered baseline levels
The two columns in the middle are responding to the levers during s1 (tone=food)
respond more to the left lever because the left lever earned the food pellets during the
instrumental stage
Right column = responding on S2 (noise=sucrose) pressed right lever more
because the right lever earned the sucrose during the instrumental phase
To simplify things the data on the right collapses the two stimuli and compares
performance on the lever that earned the same outcome with the performance on the
lever that earned the different outcome
So, the black bar in this figure is the average rate of responding on the left lever when
the tone turned on AND the rate of responding on the right lever when the white noise
turned on.
The white bar is the average rate of responding on the levers that earned the different
outcome when the stimuli turn on. So, it would be the right lever during the tone, and
the left lever during the white noise.
NB: Baseline responding is sometimes absent on PIT graphs. This responding is
typically subtracted to responding during the stimuli. Usually, the term net responding
appears on the y axis
3 CS design
It is possible to study general and specific PIT in the same animal at the same time
3 CS design
During Pavlovian conditioning stage, rats learn that 3 CSs (s1, s2, s3) predict 3
distinct food outcomes
During the instrumental stage, two of these food outcomes can be earned by
performing 2 distinct actions e.g., left lever press and right lever press
During the test, the amount of responding on each lever is assessed during the
presentation of 3 stimuli and when no stimuli is on at all. The critical point here is that
S1 and S2 predicts outcomes earned by the actions, which should generate specific
PIT, whereas S3 predicts a food outcome but one that is different from the ones
earned by the instrumental actions. This should generate general PIT
What’s found is that S1 and S2 trigger specific PIT: S1 increases A1 but not A2,
whereas S2 increases A2 but not A1. In other words, responding on the “Same” lever
is greater than the “Different” lever, and greater than baseline
In addition, S3 generates general PIT. It increases responding on both A1 and A2
compared to baseline. Responding on A1 and A2 during S3 is generally collapsed to
create ‘General’ responding.
Dilemma
the 3 CS design reveals an important dilemma
S3 enhances performance on both A1 and A2 a stimulus that predicts food
enhances performance on actions procuring food
If true, why isn’t S1 (or S2) enhancing performance on A2 (or A1)? In other words,
why aren’t S1 and S2 producing general PIT.
The reason is that general and specific are two distinct phenomena and, as such, they
involve distinct mechanisms
To understand these mechanisms, we must first understand how outcomes or USs are
being processed.
Processing of outcomes/USs
3 CS design to evaluate general (S3) and specific PIT (S1 and S2)
Pavlovian and instrumental stage was conducted when the rats were hungry
During this first test (left panel), there was evidence of both specific and general PIT.
There was more responding on the same lever than the different lever, indicating a
specific PIT effect, and there was more general responding during S3 than during the
baseline, or the Pre CS period.
The second test (right panel) was conducted while the rats were sated. Note that the
overall performance was lower than when the rats were hungry. This simply reflects
that animal were less motivated to press the levers to get food because they were not
hungry. Nevertheless, specific PIT was still present: there was more responding on the
Same lever than the different lever. However, general PIT was abolished. General
responding during S3 did not differ from baseline.
This experiment therefore demonstrates that lowering the value of the outcome
predicted by a stimulus abolishes general PIT but leaves intact specific PIT.
Consistent with the latter, outcome devaluation spares specific PIT.
Specific PIT was used
Pavlovian instrumental PIT test
Before the test, the two outcomes were devalued by conditioned taste aversion O1
and O2 were paired with injections of lithium chloride to induce illness
Results:
Even though the outcomes were devalued specific PIT emerged: more responding
on the lever that earned the same outcome
despite having outcomes paired with sickness, the influence of Pavlovian cues to bias
choice was still apparent
Outcome devaluation fails to remove specific PIT
Amygdala contained 2 important subnuclei, BLA and CeA – these 2 structures work
together in a parallel and complimentary fashion
CeA encodes information about the motivational properties of the CS-US association
By contrast, the BLA computes information about the sensory specific properties of
the CS-US association
As a result, the CeA and BLA are thought to trigger distinct forms of conditioned
responses:
The CeA produces preparatory CRs based on the valence of the US. For example,
aversive CSs will elicit withdrawal, and appetitive CSs will elicit approach responses.
These responses are due to the motivational properties of the stimuli.
By contrast, the BLA produces consummatory responses, which are specific to the
outcome employed. For example, some aversive CSs elicit freezing and some
appetitive CSs elicit chewing. These are specific to their sensory specific properties
the best evidence for this dichotomy probably comes from the dissociation seen in
general and specific PIT.
DLS vs pDMS
Researchers examined the role of the DLS and the posterior DMS in specific PIT
DLS is critical for habits
posterior DMS is critical for goal-directed actions.
Experiment:
standard PIT design was used to look at specific PIT.
immediately before the test rats were infused into either the DLS or posterior DMS
with either muscimol, which inactivates the structure or saline
results for DLS rats
Saline treated rats showed specific PIT: their responding on the same lever was
greater than the different lever and greater than baseline
Rats given muscimol during test showed much less responding overall than the saline
treated rats, but specific PIT was still observed.
DLS is important for performance but not choice between actions
Results of DMS rats:
the saline control rats showed a clear specific PIT effect:
Rats given muscimol into the DMS showed as much responding as the saline controls,
but the specificity of their performance was gone (specific PIT was removed)
the pDMS is important for specific PIT. Given its role in goal-directed behaviours,
it is likely to provide information about the action-outcome associations that are
necessary for specific PIT.
Energising vs choosing
These data, along with the data from the experiment examining the shift in
motivational state demonstrate that there are two mechanisms at play: one that
energise action performance, and one that controls action selection (i.e., choice).
For instance, the core and the shell were each inactivated before the PIT test using the
3 CS design.
in one group of rats, the shell was infused with either saline or muscimol
in another group of rats, the core was infused with either saline or muscimol.
Note that the data present the net effects of the stimuli. Meaning that baseline
responding was subtracted.
Results for specific PIT
Control rats given saline infusions into the either the shell or the core displayed
specific PIT
The same was true for rats given muscimol into the core (specific PIT effect)
However, rats given infusions of muscimol into the shell did not show a specific PIT
effect
These data demonstrate that the nucleus accumbens shell, but not the core, is critical
for specific PIT.
Results for general PIT:
Unsurprisingly, control rats infused with saline into both the shell and core exhibited
general PIT
The same was true for rats that had muscimol into the shell.
However, rats that had muscimol infusions into the core did not show general PIT
Demonstrates that the nucleus accumbens core, but not the shell, is necessary for
general PIT
Summary
Pavlovian cues have a strong influence on our instrumental behaviours and this
influence is assessed using general and specific PIT
3 Cs designs of PIT allows us to examine specific and general forms of PIT
Specific PIT is mediated by sensory specific properties of outcome; general PIT is
mediated by the outcome’s motivational properties
General PIT requires activity in the CeA and the nucleus accumbens core
Specific PIT requires the BLA, pDMs and the nucleus accumbens shell
PIT exemplifies the influence environmental stimuli have on our actions: this
influence is adaptive but also problematic
Topics covered
Extinction in lab
Restoration of phenomena
Neurobiology of extinction
Clinical implication of fear extinction
Extinction refers to a procedure during which a conditioned stimulus that has been
trained to predict an unconditioned stimulus is repeatedly presented on its own.
Because of these CS-alone presentations, conditioned responses gradually decrease,
and eventually cease
Recall that during fear conditioning, a rat is placed in a chamber, and pairings of the
CS and US are presented. This results in excitatory CS-US associations
During extinction, the CS is repeatedly presented on its own, without the US. These
CS-alone presentations result in inhibitory CS-US associations, or a CS-noUS
association.
Typically, we administer fear extinction 24 hours after conditioning, although it can
be administered days or even weeks after conditioning
The extinction session may seem like the test for fear conditioning. Recall that testing
the retrieval of the fear conditioning memory consists of presenting the CS on its own
However, tests of fear conditioning are generally shorter than extinction training to
ensure conditioned responding does not cease
Spontaneous recovery
Renewal
This refers to the return of conditioned responding due to a change in the extinction
context.
For example, conditioning might take place in a particular context. We will call this
context A.
Then, extinction might take place in a different context, and we’ll call this context B.
Then, we can return rats to context A for testing
Renewal (ABA)
What we observe is that following extinction in context B, rats tested in context A,
the conditioning context, show more fear than those tested in context B, the extinction
context. In other words, the extinguished fear responses are renewed by the context
shift. This type of renewal, where conditioning takes place in context A, extinction
takes place in context B, and testing occurs back in context A is called ABA renewal.
However, renewal can be seen in other situations.
Renewal (AAB)
Renewal (ABC)
renewal is also observed when conditioning occurs in context A, extinction occurs in
context B, and testing occurs in context C.
Rats tested in context C will show more conditioned responding than those tested in
context B, their extinction context
This type of renewal is called ABC renewal. This is the most convincing type of
renewal, as the test context is completely new.
AAB and ABC renewal demonstrate that the return of conditioned responding is not
due to being returned to the conditioning context, per se, but rather being removed
from the extinction context. This is important, as it tells us that extinction is context
specific
Reinstatement
refers to the return of extinguished conditioned responses following exposure to the
US alone
In reinstatement, rats learn an association between a CS and a US, for example, a tone
and footshock. Fear to the tone is then extinguished. After extinction, the rat is
presented with the US alone, but NOT the CS-US pairings.
For example, the rat will be placed back in the chamber and footshocks will be
delivered without the tone
What is observed is more conditioned responding to the CS when it is presented at
test in rats that had been given the shock after extinction compared to rats that
weren’t.
Interestingly, conditioned responses can be reinstated after other stressful events are
experienced. For example, an acute stressor, like mild restraint can serve to reinstate
the conditioned responses
Thus, reinstatement doesn’t necessarily require reexperiencing the US, per se, but can
be caused by some other trauma.
Although the three fear restoration phenomena are very popular, the best evidence for
extinction not being erased comes from studies using specific PIT.
Experiment:
rats were submitted to a standard appetitive PIT procedure.
initially given Pavlovian training, where they learned two stimuli, S1 and S2,
predicted two food outcomes, O1 and O2, respectively
instrumental training where those two food outcomes could be earned by responding
on two distinct levers, R1 and R2
before the transfer test, they were given Pavlovian extinction, where one of the
Pavlovian stimuli was extinguished by presenting it without the outcome
results:
On the left of this figure is the responding on the same lever, the different lever, and
the baseline responding during the CS that was not extinguished specific PIT effect
Similarly, a clear PIT effect can be seen during the CS that was extinguished (data on
the right) - Even though this stimulus was extinguished before the test, it still elevated
performance on the action that earned the same outcome more than the other action,
and more than baseline. In other words, a specific PIT effect was found regardless
of whether the CS was extinguished.
So, extinction of the Pavlovian association does not remove specific PIT. Extinction is
not erasure or unlearning.
Why is PIT the best evidence? Because it does not involve any other manipulation
than extinction.
Interim summary
Neurobiology of extinction
After extinction, the rats were tested the following day, drug-free.
Results:
Rats that had received the control infusions of saline the day before showed low levels
of fear during test, indicating that extinction successfully reduced fear to the CS in
this group.
However, the rats that received muscimol infusions showed significantly more
freezing than the controls, indicating that BLA inactivation prevented the acquisition
of fear extinction.
Thus, activity in the BLA is required for the acquisition of extinction of
conditioned fear.
BLA acquisition
Many experimenters have investigated the specific mechanisms within the BLA that
are necessary for the formation of extinction memories. For example, these
experimenters examined the role of NMDArs in the BLA
a context was paired with a footshock and then fear to the context was extinguished.
Immediately before extinction, half of the rats received an infusion of a drug called
ifenprodil, which is an antagonist of the NMDAr blocks function of NMDAr,
preventing the intracellular cascade of events required for memory formation
other half were infused with a control vehicle.
Results:
Controls rats showed high levels of freezing at the beginning of the session,
indicating that fear conditioning the previous day was successful, and we also see a
gradual decline in freezing across extinction
Rats that received ifenprodil also showed high levels of fear at the beginning of the
session, however these rats showed an impairment in the rate extinction learning
These rats were slower to extinguish than the vehicle control rats, indicating an
impairment in extinction learning due to the blockade of NMDAr
This impairment persisted when the rats were tested drug free the following day
Again, we see control rats showed low levels of freezing to the context. However, rats
that had the infusions of ifenprodil during extinction the previous day showed high
levels of freezing, indicating that blocking NMDAr activity before extinction
impaired this learning.
Thus, NMDAr activity in the BLA is necessary for the acquisition of extinction of
conditioned fear.
interim summary
Acquisition of extinction requires:
Neuronal activity in the BLA
NDMAr activation
Consolidation of fear extinction requires:
Activity in the BLA
No NMDAr activation
Expression of fear extinction
Does not require BLA
The mPFC
the extinction of conditioned fear also involves the BLA the medial prefrontal cortex.
rodent medial prefrontal cortex consists of two main subnuclei: PL and IL.
The PL has been shown to be critical in the expression and potentiation of
conditioned fear
The IL has been implicated in the inhibition of fear responses during fear
extinction.
IL/PL inactivation – fear extinction and expression
A straightforward demonstration of the dissociable roles of the subregions of the
mPFC has been shown by inactivating each specific region before the extinction of
conditioned fear.
Experiment:
rats were conditioned to fear a CS, by pairing it with a footshock extinguished
Immediately before this extinction session, some rats were infused into the PL with
either muscimol or saline
In other rats, muscimol or saline was infused into the IL.
The rats were then tested for fear to the CS the following day drug-free.
Results for PL
The arrow represents the point of infusion, immediately before the extinction session
PL inactivation caused a reduction in fear responses during the session, evidenced by
the lower levels of freezing in muscimol treated rats, which are shown as the black
circles This is consistent with the view that the PL is important for expressing fear
responses
When the rats were tested the following day, PL inactivation had no effect on long-
term extinction learning Rats that had the PL inactivated before extinction froze
as much as their saline controls
Thus, the PL does not seem to be critical for the acquisition of the extinction
memory.
researchers used context fear conditioning to assess the role of the IL in extinction
Foot shocks were presented in a conditioning chamber extinguished
Immediately before extinction, they were infused into the IL with muscimol or saline.
Results:
no impairment caused by IL inactivation was observed during extinction
Rats that had the IL inactivated with muscimol, showed the same reduction in
freezing across the extinction session as rats infused with saline
Unsurprisingly, when rats were tested the following day, those that had the IL
inactivated during extinction showed impaired extinction learning: they froze more
than the saline controls when tested.
IL inactivation: acquisition?
How do we account for these discrepancies in the 2 experiments?
Recall that in one, infusions of muscimol into the IL produced an impairment in
extinction learning during the session, and in another, no impairment was seen during
the extinction session
One explanation is that the effect of IL inactivation during extinction learning may
depend on the modality of the CS – in one case discrete CSs were used, and in the
other case context conditioning was used.
What is important, however, is that in both instances, long-term extinction was
impaired when the rats were tested drug-free.
in both experiments, muscimol was used. Muscimol, has a relatively long-half life,
meaning it metabolizes slowly, and stays in the brain for a long time. This means it is
still exerting its effects after the extinction session is finished.
So, the muscimol infusions into the IL may impair extinction by disrupting
consolidation rather than acquisition, and there is substantial evidence to support the
view that the IL is critical for the consolidation of extinction
Such support for this view comes from studies manipulating neuronal activity after
extinction training
Experiment:
context was paired with shock extinguished (conducted drug free)
immediately after extinction, the IL was infused with either vehicle or muscimol to
temporarily inactivate the structure.
unsurprisingly, there were no differences in groups, as the infusions took place after
this session
Rats were then tested drug-free the following day for fear to the context What was
found at test was that muscimol infusions after extinction disrupted the long-term
reduction in fear produced by the extinction learning the day prior. Rats treated with
muscimol after extinction froze more at test than their vehicle controls,
Providing support for the role of the IL in the consolidation of extinction of
conditioned fear
IL- protein synthesis blockade & consolidation extinction
Further support for this view comes from other studies manipulating neuronal activity
after extinction training
Experiment:
Context paired with food shock extinction (drug free)
immediately after extinction, the IL was infused with either vehicle, lidocaine or
anisomycin
Lidocaine is a sodium channel blocker, that effectively functions to inactivate the
region
Anisomycin is a protein synthesis inhibitor, which we have previously seen is critical
in the consolidation of new memories into long-term memories.
Rats were then tested 24 hours later drug-free for fear to the context.
during this drug-free test was that rats that had the IL infusions of lidocaine and
anisomycin after extinction showed more fear than the controls
This gives direct support for the view that activity and protein synthesis in the IL
after extinction learning is necessary for the consolidation of the extinction
memory
Consistent with single unit recordings, muscimol-induced inactivation has shown that
the IL is involved in retrieval/expression of extinction.
in this study, context fear conditioning and extinction were administered, and then rats
were infused into the IL with muscimol or saline before testing
What was found was an impairment in the retrieval and expression of the extinction
memory caused by the IL inactivation. Muscimol infused rats showed more fear
during the test than the vehicle infused rats,
Providing further support that the IL is necessary for the retrieval and expression of
the extinction memory
lesions of the intercalated cells disrupt the retrieval and expression of the fear
extinction memory
experiment:
intercalated cells were lesioned by infusions of a drug called D-Sap, the day after
extinction training
D-Sap lesions the intercalated cells by targeting a neuropeptide that is specifically
expressed on the intercalated cells, but not the surrounding regions.
Control rats were given a control substance called U-Sap or or D-Sap into the BLA
or CeA, but not the intercalated cells.
subsequently tested one week later, those that had lesions of the intercalated cells,
seen as the red lines in this figure on the right, showed impaired extinction retrieval
These rats froze more than the controls that received U-sap, and the controls that had
D-Sap infused into the BLA or CeA
Demonstrating that the intercalated cells are necessary for the retrieval and expression
of extinction
Summary of extinction
BLA IL ITC PL
Role in Extinction Consolidatio Ret/exp None (role
extinction neurons n in fear
Acquisition Ret/exp expression)
Consolidatio
n
Projections Ext neurons: ITC CeA Fear
IL Ext neurons neurons
ITC
Fear neurons:
PL
Clinical implications
Anxiety disorders
Characterised by irrational fears and beliefs
Lifetime prevalence rate of approx. 20% in Australia
Understandably, there have been many attempts to find ways to reduce or eliminate
the incidence of fear relapse
One common area of focus is on extinction in multiple contexts, however, the data
suggesting that it prevents renewal has been inconclusive
E.g., in one study, rats were extinguished by presenting the CS 144 times in one
context or in 3 different contexts
When extinction occurred in 3 different contexts, renewal was eliminated.
However, when extinction consisted of only 36 trials, the number of extinction
contexts made no difference in subsequent renewal, indicating that extensive
extinction is required in order for renewal to be eliminated
Furthermore, other studies have not been so conclusive. For instance, one study
showed that extinction in multiple contexts produces more fear during extinction and
does not necessarily produce less renewal.
Thus, fear inhibition and reducing relapse is complex and requires much further
investigation
Facilitating extinction
To combat issues of compliance and relapse, pharmacological agents are often used
in conjunction with CBT
Benzodiazepines and selective serotonin reuptake inhibitors (SSRIs) are first-line
medications prescribed for people suffering from anxiety disorders
Such drugs alleviate many of the symptoms of anxiety disorders and reduce the
aversiveness of exposure therapy, thereby encouraging compliance.
Meta-analysis of clinical trials has shown that such drugs are more effective in
the treatment of anxiety disorders than placebo
However, comparison trials have also shown that CBT is more effective in
isolation than in combination with pharmacotherapy. But there is some evidence to
suggest that administration of some pharmacotherapies, such as benzodiazepines,
might be more effective in reducing fear once extinction have been established drug-
free
D-cycloserine (DCS)
Antibiotic that also acts as a NMDAr agonist
Facilitated extinction learning in rodents
In clinical trials it has been shown to improve symptoms of some anxiety disorders
and OCD (although may be less effective for PTSD)
Increased levels of maternal care increase the levels of the glucocorticoid receptor
Sensory specific satiety is used to assessed whether behaviours are GD or habitual
Example of blocking effect – Croissant predicts hug croissant + cheese predicts
hug cheese is not learned to predict hug
AutoShaping --> consistently perform behaviour after reinforcement, even if it is
irrelevant