Unit 6 Learning

58
Unit 6 Learning 11/13/13

description

Unit 6 Learning. 11/13/13. How do we learn?. Brainstorm, what are some ways we learn new information? Do we learn similarly to any other animals? Does everybody learn the same way?. How we Learn. Learning: A relatively permanent change in an organism’s behavior due to experience. - PowerPoint PPT Presentation

Transcript of Unit 6 Learning

Unit 6 Learning

Unit 6Learning11/13/13How do we learn?Brainstorm, what are some ways we learn new information?Do we learn similarly to any other animals?Does everybody learn the same way?

How we LearnLearning: A relatively permanent change in an organisms behavior due to experience.Habituation: an organisms decreasing response to a stimulus with repeated exposure to it. Associative learning: Learning that certain events occur together. The events maybe two stimuli or a response and its consequences

Learning and ConditioningLearning - A relatively permanent change in behavior due to experiences

Conditioning The acquisition of specific patterns of behavior in the presence of well-defined stimuliClassical and Operant4What is experience: environmental effects filtered through the individuals perceptions. Behavior at a future time.What is behavior? Classical ConditioningStimulus = causeResponse = effect

Ivan Pavlov (18491936)VideoElements of Classical ConditioningNeutral Stimulus (NS)Unconditioned Stimulus (US)Unconditioned Response (UR)Conditioned Stimulus (CS)Conditioned Response (CR)Neutral StimulusBellDoes not normally cause a response or reflex action by itselfa bell ringinga colora furry object7Unconditioned StimulusFoodAlways cause a reflex actionfoodblast of airnoise8Unconditioned Response - SalivationA response to an unconditioned stimulusnaturally occurring reflexSalivation at smell of foodEye blinks at blast of airStartle reaction in babies9Conditioned Stimulus The learned (once neutral) stimulusThe tone of the bell causes salivation

Will eventually elicit the unconditioned response by itself 10Conditioned ResponseThe original unconditioned response becomes conditioned after it has been elicited by the neutral stimulusSalivation because of the bell tone11

12DiscPsy Figure 5.1 p. 167

Classical Conditioning in Humans

Watson and Little AlbertIdentify the:Neutral StimulusUnconditioned StimulusUnconditioned ResponseConditioned StimulusConditioned ResponseIs this Ethical?

16DiscPsy Fig5.2b p170

Identify the:Neutral Stimulus (NS)Unconditioned Stimulus (US)Unconditioned Response (UR)Conditioned Stimulus (CS)Conditioned Response (CR)

Classical Conditioning Acquisition: in classical conditioning, the stage when links a neutral stimulus and an unconditioned stimulus are linked so that the neutral stimulus begins triggering the conditioned response.Higher-order conditioning: a procedure in which the conditioned stimulus in one conditioning experience is paired with a new neutral stimulus, creating a second (often weaker) conditioned stimulus. For example, an animal that has learned that a tone predicts food might then learn that a light predicts the tone and begin responding to the light alone. (Also called second-order conditioning.)

Extinction: the diminishing of a conditioned response; occurs in classical conditioning when an unconditioned stimulus (US) does not follow a conditioned stimulus (CS); occurs in operant conditioning when a response is no longer reinforced.Spontaneous recovery: the reappearance, after a pause, of an extinguished conditioned response.

Generalization: the tendency, once a response has been conditioned, for stimuli similar to the conditioned stimulus to elicit similar responses.Discrimination: in classical conditioning, the learned ability to distinguish between a conditioned stimulus and stimuli that do not signal an unconditioned stimulus.

Cognitive ProcessesEarly behaviorists believed that learned behaviors of various animals could be reduced to mindless mechanisms.However, later behaviorists suggested that animals learn the predictability of a stimulus, meaning they learn expectancy or awareness of a stimulus (Rescorla, 1988).Learned HelplessnessNot just the conditioning, the thought influences behavior as well

Biological PredispositionsPavlov and Watson believed that laws of learning were similar for all animals. Therefore, a pigeon and a person do not differ in their learning.However, behaviorists later suggested that learning is constrained by an animals biology.Each species predispositions prepare it to learn the associations that enhance its survival.

Biological Predispositions

John GarciaGarcia showed that the duration between the CS and the US may be long (hours), but yet result in conditioning. A biologically adaptive CS (taste) led to conditioning and not to others (light or sound). Taste Aversion

Courtesy of John Garcia23Biological PredispositionsEven humans can develop classically to conditioned nausea.

24

Operant ConditioningClassical Conditioning involves respondent behavior (automatic actions)Operant conditioning reinforces desired behavior and discourages undesired behavior.Operant behavior: Behavior that operates (has an effect) on the environment to produce consequencesClassical conditioning=no controlOperant conditioning= control of behavior and consequencesOperant ConditioningLaw of effect: rewarded behavior is likely to recur (Edward Thorndike)Skinner used the Law of Effect to develop principles of behavior control.Operant chamber/Skinners box: box containing a bar or key that an animal can use to obtain a food or water reinforcer; attached devices record rate of bar or key pressing

Shaping: reinforcers guide behavior toward desired behavior with successive approximationsDiscriminative Stimulus: elicits response after association with reinforcement (remember discrimination vs. generalization)Reinforcer: in operant conditioning, any event that strengthens the behavior it follows.Positive Reinforcement: increasing behaviors by presenting positive stimuli, such as food. A positive reinforcer is any stimulus that, when presented after a response, strengthens the response.Negative Reinforcement: increases behaviors by stopping or reducing negative stimuli, such as shock. A negative reinforcer is any stimulus that, when removed after a response, strengthens the response (Note: negative reinforcement is NOT punishment).Primary reinforcer: an innately reinforcer stimulus, such as one that satisfies a biological need.

Conditioned reinforcer: a stimulus that gains its reinforcing power through its association with a primary reinforcer; also known as a secondary reinforcer.

Immediate and Delayed ReinforcementWhich one works better?In rats-immediateIn humans- both will work, sometimes delayed works better.If given the option of a small candy bar today or a big candy bar tomorrow what would you choose?Reinforcement SchedulesContinuous reinforcement: Reinforcing the desired response every time it occurs.Partial (intermittent) reinforcement: Reinforcing a response only part of the time; results in slower acquisition of a response but much greater resistance to extinctionSchedules of ReinforcementRatio Version having to do with instances of the behavior. Ex. Reinforce or reward the behavior after a set number or x many times that an action or behavior is demonstrated.Interval Version having to do with the passage of time.Ex. Reinforce the participant after a set number or x period of time that the behavior is displayed.4 Basic Schedules of ReinforcementFixed-interval scheduleVariable-interval scheduleFixed-ratio scheduleVariable-ratio schedule

Fixed-Interval ScheduleFixed-interval schedule A schedule in which a fixed amount of time must elapse between the previous and subsequent times that reinforcement will occur.No response during the interval is reinforced.The first response following the interval is reinforced.Produces an overall low rate of respondingEx. I get one pellet of food every 5 minutes when I press the leverFixed Interval Reinforcement

Variable-Interval ScheduleVariable-interval Schedule A schedule in which a variable amount of time must elapse between the previous and subsequent times that reinforcement is available.Produces an overall low consistent rate of responding.Ex. I get a pellet of food on average every 5 minutes when I press the bar.

Variable Interval ReinforcementFixed-Ratio ScheduleFixed-ratio Schedule A schedule in which reinforcement is provided after a fixed number of correct responses.These schedules usually produce rapid rates of responding with short post-reinforcement pausesThe length of the pause is directly proportional to the number of responses requiredEx. For every 5 bar presses, I get one pellet of food

An Example of Fixed Ratio ReinforcementEvery fourth instance of a smile is reinforced

Fixed Ratio Reinforcement

Variable-Ratio Schedule Variable-ratio Schedule A schedule in which reinforcement is provided after a variable number of correct responses.Produce an overall high consistent rate of responding.Ex. On average, I press the bar 5 times for one pellet of food.An Example of Variable Ratio ReinforcementRandom instances of the behavior are reinforced

Variable Ratio Reinforcement

TYPEMEANINGOUTCOMEFixed RatioReinforcement depends on a definite number of responsesActivity slows after reinforcement and then picks upVariable RatioNumber of responses needed for reinforcement variesGreatest activity of all schedulesFixed IntervalReinforcement depends on a fixed timeActivity increases as deadline nearsVariable IntervalTime between reinforcement variesSteady activity results

Comparisons of Schedules of ReinforcementFixed intervalReward on fixed time basisLeads to average and irregular performanceFast extinction of behaviorFixed ratioVariable ratioVariable intervalReward tied to specific number of responsesLeads quickly to very high and stable performanceModerately fast extinction of behaviorSCHEDULEFORM OF REWARDReward given after varying periods of timeLeads to moderately high and stable performanceSlow extinction of behaviorReward given for some behaviorsLeads to very high performanceVery slow extinction of behaviorINFLUENCE ON PERFORMANCEEFFECTS ON BEHAVIOR48FI, VI, FR, or VR?When I bake cookies, I can only put one set in at a time, so after 10 minutes my first set of cookies is done. After another ten minutes, my second set of cookies is done. I get to eat a cookie after each set is done baking.After every 10 math problems that I complete, I allow myself a 5 minute break.I look over my notes every night because I never know how much time will go by before my next pop quiz.When hunting season comes around, sometimes Ill spend all day sitting in the woods waiting to get a shot at a big buck. Its worth it though when I get a nice 10 point.Today in Psychology class we were talking about Schedules of Reinforcement and everyone was eagerly raising their hands and participating. Miranda raised her hand a couple of times and was eventually called on.

FI

FR

VI

VI

VR

PunishmentWhile reinforcement increases behavior, punishment does the oppositePunishment: An event that decreases the behavior that it follows.Positive punishment: adding an aversive stimulus. (spanking)Negative punishment: taking away desirable stimulus. (grounded)Biological Predispositions Biological constraints predispose organisms to learn associations that are naturally adaptive.You can teach a pigeon to flap its wings to avoid shock and to peck to receive food but not the other way around.

Cognition & Operant ConditioningEvidence of cognitive processes during operant learning comes from rats during a maze exploration in which they navigate the maze without an obvious reward. Rats seem to develop cognitive maps, or mental representations, of the layout of the maze (environment).Cognitive map: a mental representation of the layout of ones environment52OBJECTIVE 16| Explain how latent learning and the effect of external rewards demonstrate that cognitive processing is an important part of learningLatent LearningSuch cognitive maps are based on latent learning, which becomes apparent when an incentive is given (Tolman & Honzik, 1930).

MotivationIntrinsic Motivation: The desire to perform a behavior for its own sake.

Extrinsic Motivation: The desire to perform a behavior due to promised rewards or threats of punishments.

Skinners LegacySkinner argued that behaviors were shaped by external influences instead of inner thoughts and feelings. Critics argued that Skinner dehumanized people by neglecting their free will.

Falk/ Photo Researchers, Inc. 55OBJECTIVE 18| Describe the controversy over Skinners views of human behavior.Applications of Operant ConditioningSkinner introduced the concept of teaching machines that shape learning in small steps and provide reinforcements for correct rewards.

In SchoolLWA-JDL/ Corbis

56OBJECTIVE 19| Describe some ways to apply operant conditioning principles at school, at work and at home.FI, VI, FR, or VR?6. Madison spanks her son if she has to ask him three times to clean up his room.7. Emily has a spelling test every Friday. She usually does well and gets a star sticker.8. Steves a big gambling man. He plays the slot machines all day hoping for a big win.Snakes get hungry at certain times of the day. They might watch any number of prey go by before they decide to strike.Mr. Bertani receives a salary paycheck every 2 weeks. (Miss Suter doesnt ).Christina works at a tanning salon. For every 2 bottles of lotion she sells, she gets 1 dollar in commission. Mike is trying to study for his upcoming Psychology quiz. He reads five pages, then takes a break. He resumes reading and takes another break after he has completed 5 more pages.6. FR7. FI

8. VR

VI

FI

FR

FR

FI, VI, FR, or VR?13. Megan is fundraising to try to raise money so she can go on the annual band trip. She goes door to door in her neighborhood trying to sell popcorn tins. She eventually sells some.14. Kylie is a business girl who works in the big city. Her boss is busy, so he only checks her work periodically.15. Mark is a lawyer who owns his own practice. His customers makes payments at irregular times.16. Jessica is a dental assistant and gets a raise every year at the same time and never in between.17. Andrew works at a GM factory and is in charge of attaching 3 parts. After he gets his parts attached, he gets some free time before the next car moves down the line.18. Brittany is a telemarketer trying to sell life insurance. After so many calls, someone will eventually buy.13. VR

14. VI

15. VI16. FI

17. FR

18. VR