Operant Conditioning

Operant Conditioning 

  • The learner is not passive. Learning is based on consequences 

Classical Vs. Operant conditioning 

Classical Conditioning
Operant Conditioning
Behavior is determined by what preceded it
Behavior is determined by anticipation of what follows it
Involuntary
Voluntary
Dog salivates after a tone
Dog sits in anticipation of getting a treat
They both use acquisition, discrimination, spontaneous recovery, generation and extinction 

Operant Conditioning

  • A type of learning in which behavior is strengthened if followed by reinforcement or diminished if following by punishment 

Edward L. Thorndike

Law Of Effect- 

Thorndike's principle that responses are "stamped in" by rewards and "stamped out" by punishments 

How do we actually use operant conditioning? 

Shaping us reinforcing small steps on the way to the desired behavior 

Reinforcer

  • Any event that strengthens the behavior it follows 
  • Two types of reinforcement: positive and negative  
Positive Reinforcement
  • Strengthens a response by adding a stimulus after a response  
Negative Reinforcement
  • Removes something unpleasant that was already in the environment following a behavior, making that behavior more likely to occur again 
  • Think (avoiding/ escaping) 

Punishment 
  • Any event that decreases that behavior that it follows 
  • Meant to decrease a behavior 
Positive Punishment 
  • Addition of something unpleasant 
Negative Punishment 
  • The removal of something pleasant 
Punishment works best when it is immediately done after behavior and it is harsh. 

Types of Reinforcer 
  • An innately reinforcing stimulus 
Conditioned 
(secondary reinforcer) 
  • A stimulus that gains its reinforcing power through its association with a primary reinforcer 
Token Economy 
  • Every time a desired behavior is performed, a token is given 
  • They can trade tokens in for verity if prize (reinforcement) 
  • Used in homes, prisiones, mental institution and schools 

Continues Vs. Partial reinforcement 

Continues: 
  • Reinforce the behavior every time the behavior is exhibited 
  • Usually done when the subject is first learning to make association 
  • Acquisition comes really fast 
  • But so does extinction 
Partial: 
  • Reinforce the behavior only SOME of the times it is exhibited 
  • Acquisition comes more slowly 
  • But is more resistant to extinction 
  • Four types of partial reinforcement schedules 

Radio schedules 

Fixed Ratio: 
  • Provides a reinforcement after a SET number of responses 
Variable Ratio: 
  • Provides a reinforcement after a RANDOM number of reinforcement 
  • Very hard to get acquisition by very resistant to extinction 

Interval Schedules 

Fixed Interval 
  • Required a SET amount of time to elapses before giving the reinforcement
Variable Interval 
  • Requires a RANDOM amount of time to elapse before giving the reinforcement 
  • Very hard to get acquisition but also very resistant to extinction  
Observable Learning 
  • Albert Bandura and his BoBo Doll 
  • We learn through modeling behavior from others 
  • Observable learning + Operant conditioning= social learning theory 
Latent Learning 
  • Edward Tolman 
  • Latent means hidden 
  • Sometimes learning is not immediately evident 
Insight Learning 
  • Some animals learn through the "AH HA" experience 

Comments