Operant Conditioning

Operant Conditioning

- November 21, 2019

Operant Conditioning

The learner is not passive. Learning is based on consequences

Classical Vs. Operant conditioning

Classical Conditioning	Operant Conditioning
Behavior is determined by what preceded it	Behavior is determined by anticipation of what follows it
Involuntary	Voluntary
Dog salivates after a tone	Dog sits in anticipation of getting a treat

They both use acquisition, discrimination, spontaneous recovery, generation and extinction

Operant Conditioning

A type of learning in which behavior is strengthened if followed by reinforcement or diminished if following by punishment

Edward L. Thorndike

Law Of Effect-

Thorndike's principle that responses are "stamped in" by rewards and "stamped out" by punishments

How do we actually use operant conditioning?

Shaping us reinforcing small steps on the way to the desired behavior

Reinforcer

Any event that strengthens the behavior it follows

Two types of reinforcement: positive and negative

Positive Reinforcement

Strengthens a response by adding a stimulus after a response

Negative Reinforcement

Removes something unpleasant that was already in the environment following a behavior, making that behavior more likely to occur again

Think (avoiding/ escaping)

Punishment

Any event that decreases that behavior that it follows

Meant to decrease a behavior

Positive Punishment

Addition of something unpleasant

Negative Punishment

The removal of something pleasant

Punishment works best when it is immediately done after behavior and it is harsh.

Types of Reinforcer

An innately reinforcing stimulus

Conditioned

(secondary reinforcer)

A stimulus that gains its reinforcing power through its association with a primary reinforcer

Token Economy

Every time a desired behavior is performed, a token is given

They can trade tokens in for verity if prize (reinforcement)

Used in homes, prisiones, mental institution and schools

Continues Vs. Partial reinforcement

Continues:

Reinforce the behavior every time the behavior is exhibited
Usually done when the subject is first learning to make association
Acquisition comes really fast
But so does extinction

Partial:

Reinforce the behavior only SOME of the times it is exhibited
Acquisition comes more slowly
But is more resistant to extinction
Four types of partial reinforcement schedules

Radio schedules

Fixed Ratio:

Provides a reinforcement after a SET number of responses

Variable Ratio:

Provides a reinforcement after a RANDOM number of reinforcement
Very hard to get acquisition by very resistant to extinction

Interval Schedules

Fixed Interval

Required a SET amount of time to elapses before giving the reinforcement

Variable Interval

Requires a RANDOM amount of time to elapse before giving the reinforcement
Very hard to get acquisition but also very resistant to extinction

Observable Learning

Albert Bandura and his BoBo Doll
We learn through modeling behavior from others
Observable learning + Operant conditioning= social learning theory

Latent Learning

Edward Tolman
Latent means hidden
Sometimes learning is not immediately evident

Insight Learning

Some animals learn through the "AH HA" experience

Comments