Operant Conditioning .
-
Upload
arthur-adams -
Category
Documents
-
view
233 -
download
0
Transcript of Operant Conditioning .
Operant Conditioning
https://www.youtube.com/watch?v=teLoNYvOf90
Operant ConditioningWe learn to
associate a response and its consequence
https://www.youtube.com/watch?v=-A-7H4aOhq0
Response: Pushingvending machine button
Consequence: Receiving a candy bar
Operant Conditioning
Operant Conditioning type of learning in which behavior is strengthened if
followed by reinforcement or diminished if followed by punishment
Law of Effect Edward Thorndike’s principle, behaviors followed
by favorable consequences are more likely to be repeated and behaviors followed by unfavorable consequences are less likely to be repeated
Operant Conditioning Operant Behavior
complex or voluntary behaviorspush button, perform complex task
operates (acts) on environmentproduces consequences
Respondent Behavioroccurs as an automatic response to stimulusbehavior learned through classical
conditioning
Operant Conditioning
B.F. Skinner (1904-1990) elaborated on Thorndike’s Law of
Effect father of radical behaviorism - all
behaviors are ultimately learned and controlled by the relationships between the situation that immediately precedes the behavior and the consequences that directly follow it.
developed behavioral technology
Operant ChamberSkinner Box
soundproof chamber with a bar or key that an animal presses or pecks to release a food or water reward
contains a device to record responses
Operant Conditioning
Reinforcerany event that strengthens the behavior it follows
Shapingconditioning procedure in which reinforcers guide
behavior toward closer and closer approximations of a desired goal (think about how you would potty train a toddler or house train a puppy)
Successive Approximations rewards for behaviors that increasingly resemble
desired behavior
Principles of Reinforcement
Primary Reinforcer innate, unlearned stimulussatisfies a biological need (i.e.: water,
food)Secondary Reinforcer
conditioned reinforcerget power thru learned association
with primary reinforcer (i.e.: token economy tickets, money, good grades)
Schedules of ReinforcementContinuous Reinforcement
reinforcing the desired response each and every time it occurs
learning occurs rapidlyextinction also occurs rapidly, when
reinforcement stops, so does behavior Partial or Intermittent Reinforcement
reinforcing a response only part of the time results in slower acquisition
greater resistance to extinction
Fixed = SetVariable = ChangesRatio = NumberInterval = Time
So…. FI = Set Time; FR = Set NumberVI = Change Time; VR = Change
Number
Skinner’s Schedules of Reinforcement
Fixed Ratio (FR) reinforcing a response only after a specified
number of responsesThe faster you respond, the more rewards
you get results in a very high rate of respondingEx: free Starbucks after the purchase of 15
drinks; Frequent Flyer members get free flight on Delta after each 25K miles flown
Skinner’s Schedules of Reinforcement
Variable Ratio (VR)reinforcing a response after an
unpredictable number of responsesbased on average ratiosEx: gambling (slot machines), fishingvery hard to extinguish because of
unpredictability
Skinner’s Schedules of ReinforcementFixed Interval (FI)
reinforces the first response only after a specified time has elapsed
response occurs more frequently as the anticipated time for reward draws near
Ex: checking the mailbox more frequently as the delivery time gets closer; senior citizen discounts at Publix every Wednesday
Skinner’s Schedules of ReinforcementVariable Interval (VI)
reinforcing the first response after an unpredictable, random time interval
produces slow steady responding because you never know when the wait will be over
Ex: pop reading quiz or checking your phone because you never know when you are going to get e-mails or texts
Table 27.2 Schedules of ReinforcementDavid G. Myers: Myers’ Psychology for AP®, Second Edition
Copyright © 2014 by Worth Publishers
Schedules of Reinforcement
Variable Interval
Number of responses
1000
750
500
250
010 20 30 40 50 60 70
Time (minutes)
Fixed Ratio
Variable Ratio
Fixed Interval
Steady responding
Rapid respondingnear time forreinforcement
80
Reinforcement
Reinforcement – any event that increases or strengthens a preceding response
food, money, praise, attention
Reinforcement
Positive Reinforcement- strengthens response by presenting a typically pleasurable stimuli after it; good grades, praise, hug, candy bar, privileges
Negative Reinforcement- strengthens a response by reducing something undesirable or unpleasant; taking aspirin to get rid of a headache, pushing snooze button to turn off annoying alarm clock; fasten seatbelt to stop car from beeping
Table 27.1 Ways to Increase BehaviorDavid G. Myers: Myers’ Psychology for AP®, Second Edition
Copyright © 2014 by Worth Publishers
PunishmentPunishment
Any consequence that decreases the frequency of a preceding behavior
powerful controller to discourage unwanted behavior
Punishment
Positive Punishment – adding an aversive (bad) stimulus because you want the certain behavior to decrease, nagging spanking, speeding ticket, squirt of water in the face
Negative Punishment – taking away a desired (good) stimulus because you want the behavior to decrease – banish cell phone, car keys, computer or tv time
Table 27.3 Ways to Decrease BehaviorDavid G. Myers: Myers’ Psychology for AP®, Second Edition
Copyright © 2014 by Worth Publishers
Problems with Punishment
Punished behavior is not forgotten, it's suppressed- behavior returns when punishment is no longer eminent
Causes increased aggression- shows that aggression is a way to cope with problems- Explains why aggressive delinquents and abusive parents come from abusive homes
Problems with Punishment
Creates fear that can generalize to desirable behaviors, e.g. fear of school, learned helplessness, depression
Does not necessarily guide toward desired behavior- reinforcement tells you what to do--punishment tells you what not to do- Combination of punishment and reward can be more effective than punishment alone
Punishment teaches how to avoid it
Cognition and Operant ConditioningCognitive Map
mental representations of physical locations; humans and animals use them to find their way and to help recall important features of the environment
Ex: when a friend asks you for directions to your house, you are able to create an image in your mind of the roads, places to turn, landmarks
Latent Learning learning that occurs, but is not apparent until there is an
incentive to demonstrate it Ex: a dog is taught to sit but does not do so until offered a treat
as a reward
Cognition and Operant Conditioning
Overjustification Effect Occurs when an external incentive (such as
money or prizes) decreases a person's intrinsic motivation to perform a task. People pay more attention to the incentive, and less attention to the enjoyment and satisfaction that they receive from performing the activity. The overall effect is a shift in motivation to extrinsic factors and the undermining of pre-existing intrinsic motivation.
Operant vs. Classical Conditioning
Extinction CR decreases when CS is Responding decreases when
repeatedly presented alone. reinforcement stops.
Classical Conditioning
OperantConditioning
The Response Involuntary, automatic “Voluntary,” operates on environment
Acquisition Associating events; Associating response with aCS announces UCS. Consequence (reinforcer or
punisher).
Cognitive Subjects develop expectation Subjects develop expectation that processes that CS signals the arrival of a response will be reinforced or
UCS. Punished; they also exhibit latentlearning, without reinforcement
Biological Natural predispositions Organisms best learn behaviors predispositions contain what stimuli and similar to their natural behaviors;
responses can easily be unnatural behaviors instinctivelyassociated. drift back toward natural ones.
Operant or Classical?
http://www.youtube.com/watch?v=R8RIqJLUYSE
You decide….
And we’re done…