Learning Operant Conditioning. Operant Behavior operates (acts) on environment produces...

22
Learning Operant Conditioning

Transcript of Learning Operant Conditioning. Operant Behavior operates (acts) on environment produces...

Page 1: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Learning

Operant Conditioning

Page 2: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Operant Conditioning

Operant Behavior operates (acts) on environment produces consequences

Respondent Behavior occurs as an automatic

response to stimulus behavior learned through

classical conditioning

Page 3: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Operant Conditioning

Operant Conditioning type of learning in which behavior is

strengthened if followed by reinforcement or diminished if followed by punishment

Law of Effect Thorndike’s principle that behaviors

followed by favorable consequences become more likely, and behaviors followed by unfavorable consequences become less likely

Page 4: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Operant Conditioning

B.F. Skinner (1904-1990) elaborated

Thorndike’s Law of Effect

developed behavioral technology

Page 5: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Operant Chamber

Skinner Box chamber with a

bar or key that an animal manipulates to obtain a food or water reinforcer

contains devices to record responses

Page 6: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Operant Conditioning

Reinforcer any event that strengthens the

behavior it follows Shaping

operant conditioning procedure in which reinforcers guide behavior toward closer approximations of a desired goal

Page 7: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Operant Conditioning

Positive Strengthens behavior by

presenting a positive stimulus (something desired)

Negative Strengthens behavior by

removing an aversive (unpleasant) stimulus

Page 8: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Principles of Reinforcement

Primary Reinforcer innately reinforcing stimulus i.e., satisfies a biological need

Secondary Reinforcer stimulus that gains its reinforcing

power through its association with primary reinforcer

Learned reinforcement

Page 9: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Schedules of Reinforcement

Continuous Reinforcement reinforcing the desired response each

time it occurs Partial (Intermittent) Reinforcement

reinforcing a response only part of the time

results in slower acquisition greater resistance to extinction

Page 10: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Schedules of Reinforcement

Fixed Ratio (FR) reinforces a response only after a

specified number of responses faster you respond the more

rewards you get very high rate of responding like piecework pay

Page 11: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Schedules of Reinforcement

Variable Ratio (VR) reinforces a response after an

unpredictable number of responses

like gambling, fishing very hard to extinguish because of

unpredictability High rate of response

Page 12: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Schedules of Reinforcement

Fixed Interval (FI) reinforces a response only after

a specified time has elapsed response occurs more

frequently as the anticipated time for reward draws near

Page 13: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Schedules of Reinforcement

Variable Interval (VI) reinforces a response at

unpredictable time intervals produces slow steady responding like pop quiz, busy phone

Page 14: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Schedules of Reinforcement

Variable Interval

Number of responses

1000

750

500

250

010 20 30 40 50 60 70

Time (minutes)

Fixed Ratio

Variable Ratio

Fixed Interval

Steady responding

Rapid respondingnear time forreinforcement

80

Page 15: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Punishment

Punishment aversive event that

decreases the behavior that it follows

powerful controller of unwanted behavior

Page 16: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Drawbacks to Punishment

Punished behavior is suppressed, not forgotten. People will continue to perform the act of punishment is avoidable.

Punishment models behavior used to punish.Punishment suppresses unwanted behavior,

but does not teach desired behaviorWhen punishment is unpredictable and

inescapable, helplessness and depression may occur.

Page 17: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Reinforcement vs. Punishment

Something wanted

Something not wanted

Given to you

Positive Reinforcement

Positive Punishment

Taken from you

Negative Punishment

Negative Reinforcement

Page 18: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Cognition and Operant Conditioning

Cognitive Map mental representation of the layout of

one’s environment Example: after exploring a maze, rats

act as if they have learned a cognitive map of it

Latent Learning learning that occurs, but is not

apparent until there is an incentive to demonstrate it

Page 19: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Cognition and Operant Conditioning

Cognitive Process: Organism is an information seeker using

relations among events to form its own adaptive representation of the world.

How much does the first event predict the second

Predictability Expectancy

Page 20: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Cognition and Operant Conditioning

Overjustification Effect the effect of promising a reward

for doing what one already likes to do

the person may now see the reward, rather than intrinsic interest, as the motivation for performing the task

Page 21: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Cognition and Operant Conditioning

Intrinsic Motivation Desire to perform a behavior for

its own sake and to be effective Extrinsic Motivation

Desire to perform a behavior due to promised rewards or threats of punishments

Page 22: Learning Operant Conditioning.  Operant Behavior  operates (acts) on environment  produces consequences  Respondent Behavior  occurs as an automatic.

Operant vs Classical Conditioning