NSCI 5702 PRINCIPLES OF TRAINING. 1.What is training? 2.Why is it useful to train animals? 3.How do...

36
NSCI 5702 PRINCIPLES OF TRAINING

Transcript of NSCI 5702 PRINCIPLES OF TRAINING. 1.What is training? 2.Why is it useful to train animals? 3.How do...

NSCI 5702PRINCIPLES OF

TRAINING

1. What is training?2. Why is it useful to train animals?3. How do animals learn?4. Training techniques used with animals

LECTURE OUTLINE

“The shaping of an animal so that it behaves in a way that humans desire.” UFAW (1992)

WHAT IS TRAINING?

Husbandry and health purposesSafety of handlerTreatment of problem behaviourHuman assistanceEntertainment/EducationEnrichment

WHY IS IT USEFUL TO TRAIN ANIMALS?

1. Classical Conditioning2. Desensitisation3. Counter Conditioning4. Operate Conditioning

I. ShapingII. Positive ReinforcementIII. Negative ReinforcementIV. Combinations of positive and negative reinforcement

5. Flooding6. Punishment

To understanding how to train animals we must first have an understanding of how they learn – Learning Theory

TRAINING TECHNIQUES USED WITH ANIMALS

HOW ANIMALS LEARN

There are a number of different forms of learning. We will discuss the following four:

1. Sensitisation2. Habituation3. Imprinting4. Associate Learning

Classical ConditioningOperant Conditioning

HOW ANIMALS LEARN

“The increasing of a response to a repeated stimulus” (Broom and Fraser: Domestic Animal Behaviour and Welfare, 2007)

The animal learns to respond to a stimulusAdaptive for survival.

E.g. Gazelles reacting to the sound of a twig breaking (signals approaching predator?)E.g. A rat that has just experiences an aversive stimulus, such as a bright light will immediately afterwards be extra sensitive to other cues, such as noises or lights, that it would not normally respond to.

SENSITISATION

A decrement in response that is produced by gradual exposure to a stimulus that elicits the response.

Commonly used in training.

E.g. Using a tape recording of a particular sound which a dog is fearful of. Tape is played very softly at first and only gradually increased in volumes at increments designed to elicit no response.

DESENSITISATION

The animal learns not to respond to irrelevant stimuli.This decline in response is specific to a given stimulus.Animals will not habituate to relevant stimuli e.g. those

associated with predators, food or mates. Advantageous in that it saves energy that would be wasted

on repeated response to trivial stimulus.

E.g. Zoo animal habituate to the presence of visitors.E.g. Sheep habituate to the sound of passing traffic.

HABITUATION

Phase-sensitive learning that is rapid and apparently independent of the consequences of behaviour Phase sensitive learning = learning occurring at a particular age or a

particular life stage.

E.g. Chicks hatch with an innate tendency to approach and follow their mother.

They have already imprinted on her vocalisations.After hatching (24-36hrs) they imprint on her visual appearance.Other young animals imprint on olfactory cues from their

mothers.Also has an impact upon the animals future choice of a sexual

partnerhttp://en.wikipedia.org/wiki/File:Anas_platyrhynchos_-Boston_Har

bor,_Massachusetts,_USA-_parent_and_chicks-8.ogv

IMPRINTING

IMPRINTING

There are 2 types of associative learning:

1.Classical (Pavlovian) Conditioning

2.Operant (Instrumental) Conditioning

ASSOCIATIVE LEARNING

CLASSICAL (PAVLOVIAN) CONDITIONING

When an animal learns to associate a conditioned stimulus (bell ringing) with an unconditioned stimulus (food) and eventually elicits a conditioned response (salivation)

Definitions: Primary or Unconditioned Stimulus (US) = Stimuli that animals

react to without training Secondary or Conditioned Stimulus (CS) = Stimuli that have been

associated with a primary (unconditioned) Stimulus

CLASSICAL (PAVLOVIAN) CONDITIONING

When an animal learns to associate a conditioned stimulus (bell ringing) with an unconditioned stimulus (food) and eventually elicits a conditioned response (salivation)

CLASSICAL (PAVLOVIAN) CONDITIONING

CLASSICAL (PAVLOVIAN) CONDITIONING

Examples: Knock on the door and bark – people Keys and run to you – leaving/car Tin opening and cat meowing – food Hay basket and squeaking GP’s food http://www.youtube.com/watch?v=WfZfMIHwSkU

CLASSICAL (PAVLOVIAN) CONDITIONING

Application In training to be used prior to a reinforcer E.g. a Bridge

Clicker Whistle

CLASSICAL (PAVLOVIAN) CONDITIONING

If the food is no longer presented with the bell, causing the dog to salivate less in response to the bell

CLASSICAL CONDITIONING: EXTINCTION

Thorndike (1898) put hungry cats into a ‘puzzle box’ with a lever mechanism that opened a door which lead to a food reward.

OPERANT CONDITIONING

Thorndike concluded that it was merely a process of trail and error.

The box invoked a series of trial and error voluntary actions. The cat learnt to press the lever to escape (a rewarding experience).

“a response that is followed by a reward is more likely to recur whereas one that is followed by an unpleasant experience is less likely to occur again? (laws of effect)

Learning is the result of associations forming between stimuli and responses. Associations are weakened or strengthened by the nature and frequency of stimulus-response (S-R pairings)

The animal learns to associate its own behaviour with a particular outcome. If the outcome is rewarding e.g. access to food, the animal learns to repeat the behaviour that resulted in food access previously.

OPERANT CONDITIONING

Operant conditioning is “the type of learning in which the probability of a behaviour recurring is increased or decreased by the consequences that follows.

This includes positive/negative reinforcement and positive/negative punishment.

Forms an association between a behaviour (voluntary) and a consequence.

OPERANT CONDITIONING

Definitions:Anything that increases a behaviour = ReinforcerAnything that decreased a behaviour = Punisher

Consequences:1.Something good can START or be presented = behaviour increases (Positive Reinforcement)2.Something bad can END or be taken away = behaviour increases (Negative Reinforcement)3.Something bad can START or be presented = behaviour decreases (Positive Punishment)4.Something good can end or be taken away = behaviour decreases (Negative Punishment)

REINFORCEMENT AND PUNISHMENT

REINFORCEMENT AND PUNISHMENT

Examples1.Positive Reinforcement (R+)

Adding something good to increase behaviour Food

2.Negative Reinforcement (R-) Removing something bad to increase behaviour Elephant training, horse reins

3.Positive Punishment (P+) Adding something bad to decrease a behaviour Shock collar, physical punishment

4.Negative Punishment (P-) Removing something good to decrease a behaviour Time out.

REINFORCEMENT AND PUNISHMENT

Definitions are based on their actual effect on the behaviour in question.

They must reduced or strengthen the behaviour (to be defined as a punishment or reinforcer).

Pleasures meant as rewards but that do not strengthen the behaviour are indulgences not reinforcement.

Aversives meant as a behaviour weakener but which do not weaken behaviour are abuses, not punishment.

REINFORCEMENT AND PUNISHMENT

Rewards mean difference things to different animals (e.g. food, toys, affection, other animals).

You must first establish what motivates the animal.

This could be: Food Social contact Toys Praise clicker

REINFORCERS AND MOTIVATION TO LEARN

Primary Reinforcer: (e.g. food) A stimulus or event that is

inherently rewarding to the animal

Secondary Reinforcer: (e.g. clicker) Initially meaningless stimuli or

event becomes inherently rewarding after repeated association with primary reinforcer

http://www.youtube.com/watch?v=hgDHWLyztCI&feature=related

REINFORCERS

Timing is everything! The reward must occur within 1-2 seconds of the behaviour.

The frequency of the rewards is also important. During training by rewarding for every correct behaviour then gradually switching to intermittent variable rates.

Don’t phase out rewards too quickly.

Help to make training session enjoyable and to strengthen the human animal bond.

http://www.youtube.com/watch?v=bDZCyObMfkA

REWARDS

What is punishment? “an aversive action or unpleasant sensation (not necessarily

physical) applied either during or within on e second of a particular behaviour that reduces the likelihood of that behaviour being repeated in the future.”

Differs from negative reinforcement (where the aversive stimulus is applied before the behavioural response)

E.g. Hitting a Zebra a few seconds after they have bitten you.

PUNISHMENT

1. Pain, fear, anxiety, learned helplessness and stress which are all welfare concerns.

2. Pain, fear, anxiety learned helplessness and stress also interfere with the animals ability to learn and focus.

3. It can intensify the occurrence and severity of behaviour problems.

4. Difficulty in getting the timing right causing association with the wrong things.

5. It becomes meaningless (desensitisation).6. Breakdown in the trainer-animal relationship.

WHY PUNISHMENT MIGHT NOT WORK

Specific behaviour may be “shaped” Involves teaching the desired behaviour pattern one step at a

time through operant conditioning.The animal needs to be rewarded for behaviour that

resembles the eventual behavioural goal. Initially reinforcement is given to an approximation of the

behavioural goal.Reinforcement continues as the animals behavioural

approximations develop to resemble the final behaviour more closely.

Eventually, only the more precise behaviour is rewarded.

http://www.youtube.com/watch?v=g6F0bRTurPk&feature=related

SHAPING

Animal is encouraged to engage in another behaviour that is more pleasurable and which cannot be performed

simultaneously with fear responses in the presence of the triggering stimulus.

E.g. Feeding a vet phobic Giraffe whilst earing a vets uniform.

COUNTER CONDITIONING

Prolonged exposure to a negatively perceived stimulus at a level that provokes the response so that the animal eventually gives up.

VERY STRESSFUL AND POTENTIALLY DAMAGING

E.g. confining a dog in an area and playing the tape at a louder than appropriate level until the dog no longer reacts fearfully

Used as a last resort and executed in the most humane way.

FLOODING

Trainers should use a variety of different learning methods.

Animal must be motivated to learn.

Minimal use of negative reinforcement because of stressor/& fear – not a good learning state for the animal.

Shaping behaviours – helping the animals to learn.

HOW ANIMAL TRAINERS USE LEARNING THEORY