Log In Start studying!

Select your language

Suggested languages for you:
StudySmarter - The all-in-one study app.
4.8 • +11k Ratings
More than 3 Million Downloads
Free
|
|
Operant Conditioning

What does a dolphin jumping through hoops, a dog playing dead, and a cat doing a high five have in common? It's operant conditioning! This section will examine B. F. Skinner's operant conditioning, its theory, and some examples.We will start by covering the operant conditioning definition.Next, we will explore the principles and concepts that make up the operant conditioning theory…

Content verified by subject matter experts
Free StudySmarter App with over 20 million students
Mockup Schule

Explore our app and discover over 50 million learning materials for free.

Operant Conditioning

Operant Conditioning

Save the explanation now and read when you’ve got time to spare.

Save
Illustration

Lerne mit deinen Freunden und bleibe auf dem richtigen Kurs mit deinen persönlichen Lernstatistiken

Jetzt kostenlos anmelden

Nie wieder prokastinieren mit unseren Lernerinnerungen.

Jetzt kostenlos anmelden
Illustration

What does a dolphin jumping through hoops, a dog playing dead, and a cat doing a high five have in common? It's operant conditioning! This section will examine B. F. Skinner's operant conditioning, its theory, and some examples.

  • We will start by covering the operant conditioning definition.
  • Next, we will explore the principles and concepts that make up the operant conditioning theory and the Skinner operant conditioning experiment.
  • Moving on, we will explore some operant conditioning theory examples.
  • Finally, we will compare classical and operant conditioning.

Operant Conditioning Definition

B. F. Skinner believed that it is possible to study behaviour scientifically. He also thought behaviour is voluntary and has a purpose: to affect one's environment. This behaviour, which he called operant behaviour, is the focus of operant conditioning.

Skinner describes operant behaviour as behaviour influenced by its outcomes.

In other words, a person acts on their environment for the desired results. So, then, what is operant conditioning?

Operant conditioning is a method of learning or modifying behaviours in which the consequence of a response, whether good or negative, influences the repetition of an action.

Operant conditioning, teaching a dog to roll over, VaiaFig. 1 A dog rolling over for a treat.

Suppose you give your dog a treat when it rolls over. The dog learns to associate the action with the reward through operant conditioning and will likely repeat the behaviour.

Operant conditioning states that every action we take while engaging with our environment has consequences. We are more likely to repeat behaviours with positive outcomes than actions with negative results. When we receive punishment as a consequence for a behaviour, we most likely will never repeat that behaviour.

Operant Conditioning Theory

Skinner divided behaviour into three parts for his scientific study: discriminative stimulus, operant response, and the reinforcer or punisher. These three are the three-term contingency, which illustrates a relationship between the operant response and the consequence (a reinforcer or punisher).

Let's define these three terms:

  • A discriminative stimulus serves as the antecedent of behaviour, such as events or situations in which a behaviour occurs.
  • Reinforcers are the responses increasing the likelihood of the behaviour it follows.
  • Punishers are the responses, decreasing the likelihood of the behaviour it follows.

An exam (discriminative stimulus) is coming up, and you reviewed well and gave your best effort in preparing for the exam. Results came, and you earned a high score. Your parents were proud and took you to your favourite restaurant (reinforcer). If you played video games all day and failed your exam, your parents scolded you for being irresponsible (punisher).

The three-term contingency served as the foundation of Skinner's study on operant conditioning. With his analysis, he also identified several types of operant conditioning.

Operant Conditioning: Types

Skinner developed four types of operant conditioning: positive reinforcement, negative reinforcement, positive punishment, and negative punishment.

We've mentioned that operant conditioning involves rewarding or punishing behaviours.

In positive reinforcement, a favourable outcome follows the behaviour to increase its recurrence.

When you apply positive reinforcement, you'd want to strengthen a response (e.g., in terms of frequency or likelihood) by using an operant reinforcer. In this case, a positive reinforcer.

John noticed his friend, Luke, looked sad, so he decided to crack a joke to cheer him up. Luke laughed, which positively reinforced John's behaviour. So, the next time Luke gets sad, John may likely repeat that behaviour.

Positive reinforcement strengthens a behaviour, so negative reinforcement weakens it. Right? Negative reinforcement can also enhance a behaviour. This type of reinforcement falls under operant aversive conditioning.

Negative reinforcement occurs when you remove an unpleasant event (aversive stimulus or negative reinforcer) following a behaviour.

You're driving and suddenly hear a squealing noise when you step on the brakes. Feeling worried, you brought your car to the mechanic and found that the brake pads needed replacing. The mechanic replaced them, and the squealing noise disappeared. Removal of the squealing noise negatively reinforced the behaviour of bringing the car to the mechanic.

There are two types of negative reinforcement: avoidance and escape behaviour.

In avoidance, the learner prevents the unpleasant event from occurring. If the unpleasant event has already happened, the removal of the negative reinforcer occurs through escape behaviour.

Avoidance: When you leave your dishes in the sink, you hear your mother coming home from the grocery and parking her car in the driveway. You rushed to wash the dishes before she entered the house to avoid nagging.

Escape: But what if your mother arrives and sees the dishes in the sink earlier than expected? Your mother starts to nag you, and you wash the dishes so she'd stop nagging.

Punishment is another form of operant aversive conditioning which aims to weaken behaviours. When behaviours weaken, it means that there is a decrease in frequency, duration, and intervals.

Punishment refers to negative consequences (aversive stimuli) following a behaviour.

Positive punishment occurs when an aversive stimulus (something that you don't want) follows a response.

A group of students faces detention after refusing to follow their teacher.

Adverse outcomes following misbehaviour need to be immediate and consistent so that the learner will associate the consequence of the behaviour with a higher chance of stopping it.

Negative punishment involves removing something valuable (an object or activity) following a response.

A person gets their driving licence suspended after multiple traffic violations.

Psychologists warn, however, of excessive punishment as punishment tells you what not to do; this may not lead to the desired behaviour. Punishments can make the learner aggressive because it is a coping mechanism (to deal with problems in life).

Simply put, positive punishment (+) adds a negative consequence, while negative punishment (-) is to take away something.

Operant Conditioning: Properties of Reinforcement

Earlier, we defined what reinforcers are and positive and negative reinforcements of behaviour. In operant conditioning, Skinner identified reinforcement properties, such as the different types of reinforcers and schedules of reinforcement.

Primary reinforcers, such as food, water, and sleep, are of biological importance to us. This reinforcement is universal, which means it can occur to anyone.

Secondary reinforcers, also known as acquired or conditioned reinforcers, are initially neutral but can strengthen behaviours when paired with a primary reinforcer. Examples include tokens, points, and stickers.

Reinforcement schedules describe the manner and timing of giving reinforcers to a learner.

There are two types of schedules of reinforcement: continuous and partial.

Continuous reinforcement refers to giving reinforcers every time the learner commits the targeted behaviour.

The teacher gives a gold star every time a student participates in class.

Partial reinforcement, on the other hand, involves giving reinforcers based on a target number of desirable actions (ratio schedules) or time (interval schedules).

Fixed ratio schedules require a specific number of responses before reinforcement occurs.

The sales manager gives an employee a bonus for hitting the target sales for six consecutive months.

Fixed interval schedules involve reinforcement of a desirable behaviour after a specific period. This schedule leads to an increased number of responses as reinforcement approaches.

Alice prepares for her licensure exam. She had three months to prepare for the exam, but in the first two months, she didn't spend that much time reviewing. As the exam drew near, she spent the last month of her exam preparation studying her lessons to ensure she passed (reinforcement) the exam.

Variable ratio schedules refer to a reinforcement of desirable behaviours without a specific number of responses.

The most common example of a variable ratio schedule of reinforcement is slot machines. The unpredictability of reinforcement encourages gambling behaviour.

Variable interval schedules refer to a reinforcement of desirable behaviours in unpredictable time intervals.

The unpredictability of receiving a message (reinforcement) via instant messaging may encourage the behaviour of checking your notifications at various times throughout the day.

Operant Conditioning: Principles

We've seen how reinforcement occurs and the types of reinforcers given. Now we'll look at three essential principles of operant conditioning.

The principle of immediacy highlights the timing of the delivery of the reinforcement. If the reinforcement occurs right after the behaviour, the greater its effect on the learner. The less immediate, the less effective the consequences are.

The principle of contingency refers to how consistently a consequence follows a behaviour. This principle highlights the importance of reliably relaying a response to increase the consequence's effectiveness.

The principle of satiation tells us that if the learner has no appetite for a particular stimulus (e.g., reward), the consequence will not be that effective; however, if there's a need for a specific stimulus, the effect of the consequence increases.

Skinner Operant Conditioning: Experiment

In testing his theory, B. F. Skinner conducted operant conditioning experiments on animals by observing their behaviour in the Skinner box. Skinner developed the Skinner box, or the operant conditioning chamber, which recorded the behaviour of an organism in a specific time frame.

The animal either receives a reward (food pallet) or a punishment (unpleasant electric shocks) when it exhibits certain behaviours, such as pressing the lever for rats or pecking keys for pigeons.

Operant conditioning, Skinner Box developed by B. F. Skinner, VaiaFig. 2 Skinner's experiment supports his operant conditioning theory.

As the rat moved around the box, it accidentally pressed the lever connected to a food pellet. The food pellet automatically dropped food into a food dispenser (positive reinforcement). The rat learned this rewarding behaviour quickly after being placed in the Skinner box only a few times.

Skinner tested negative reinforcement by giving the rat unpleasant electric shocks whilst inside the box. When the rat moved inside the box, it accidentally pressed the lever, and the electric shocks stopped immediately (negative reinforcement).

After being placed in the box a few times, the rat quickly learned this behaviour. The next time the rat was placed in the box, it immediately hurried to press the lever to avoid the unpleasant experience of the electric shocks.

Operant Conditioning Examples and Application

There are several examples of applying operant conditioning in everyday life. Skinner's operant conditioning contributed to developing treatment therapies such as the token economy and behaviour shaping.

Parents and teachers use token economy to reinforce desired behaviour through tokens such as stickers, coupons, money, or points a child can exchange for rewards such as food, activities, or privileges. Token economies help teach children to follow the rules at home and school.

Operant conditioning, behaviour shaping of circus animals, VaiaFig. 3 Circus animal training

Behaviour shaping involves eliciting responses by simplifying the desired behaviour into small, manageable steps, followed by a reward when the learner completes each step.

For example, trainers use behaviour shaping to teach complex tricks to circus animals.

In behavioural therapy, psychologists use operant conditioning and its principles to alter behaviour and treat psychological conditions such as depression, eating disorders, and obsessive-compulsive disorder (OCD).

Classical and Operant Conditioning

We understand that both classical and operant conditioning are forms of associative learning. But what's the difference? Let's look at this table to compare the two types of conditioning.

Operant Conditioning

Behaviours are involuntary.

Behaviours are voluntary.

Learning happens before a response occurs (presentation of an unconditioned stimulus after a conditioned stimulus).

Learning happens after a response takes place (through reinforcement or punishment).

The learner is passive.

The learner is active.

The learner associates a neutral stimulus with an unconditioned stimulus, eliciting a response.

The learner associates a response with a consequence that follows it, affecting the recurrence of a behaviour.


Operant Conditioning - Key takeaways

  • Operant conditioning is a method of learning or modifying behaviours in which the consequence of a response, whether good or negative, influences the repetition of an action.

  • Using the Skinner Box, B. F. Skinner conducted operant conditioning research on animals, which recorded behaviour over time.

  • Properties of reinforcement include primary and secondary reinforcement and reinforcement schedules based on the number of responses or time intervals.

  • Real-life examples of operant conditioning include token economy, behaviour shaping and behavioural therapy.

  • Operant conditioning differs from classical conditioning because behaviours are voluntary, and learning occurs after a response. Classical conditioning regards behaviours as reflexes, and learning happens before a reaction occurs.


References

  1. Fig. 2. Image of the Skinner rat experiment (https://commons.wikimedia.org/wiki/File:Skinner_box_scheme_01.png) by Andreas1 (https://commons.wikimedia.org/w/index.php?title=User:Andreas1&action=edit&redlink=1) Licensed by CC BY-SA 3.0 (https://creativecommons.org/licenses/by-sa/3.0/deed.en)

Frequently Asked Questions about Operant Conditioning

Operant conditioning is a method of learning or modifying behaviours in which the consequence of a response, whether good or negative, influences the repetition of an action. 

  • Workers work overtime in the campaign week since they know they will be rewarded with a productivity bonus plus overtime wage.
  • A child will finish his homework every weekend on time as he knows he will be rewarded with two hours of watching his favourite cartoon series.
  • Putting on sunscreen whenever we go out during the day to avoid sunburns.

Four types of operant conditioning are: 

  1. Positive reinforcement.
  2. Positive punishment.
  3. Negative reinforcement.
  4. Negative punishment.

The three principles of operant conditioning are the principle of immediacy, the principle of contingency and the principle of satiation.

In behavioural therapy, psychologists use operant conditioning and its principles to alter behaviour and treat psychological conditions such as depression, eating disorders, and obsessive-compulsive disorder (OCD). 

Final Operant Conditioning Quiz

Operant Conditioning Quiz - Teste dein Wissen

Question

What is operant conditioning?

Show answer

Answer

Operant conditioning is a method of learning or modifying behaviours in which the consequence of a response, whether good or negative, influences the repetition of an action.

Show question

Question

When did Skinner come up with the theory of operant conditioning?

Show answer

Answer

Skinner proposed operant conditioning in 1948.

Show question

Question

Provide one argument in support of operant conditioning.

Show answer

Answer

Operant conditioning helps explain learning processes such as language acquisition or addictions.

Show question

Question

Provide an argument against operant conditioning.

Show answer

Answer

Operant conditioning does not consider the influence of biological and mental processes, such as memory or problem-solving, on the learning process. Therefore, it cannot be regarded as a complete explanation of learning in humans or animals.

Show question

Question

Skinner considered that it is essential to consider action and its ________  to determine the causes of human behaviour.

Show answer

Answer

Consequences.

Show question

Question

Skinner’s (1948) research was initially based on the law of effect by _____?

Show answer

Answer

Thorndike (1898).

Show question

Question

What is a neutral response?

Show answer

Answer

Responses that neither increase nor diminish the chances of repeated behaviour.

Show question

Question

What are reinforcers?

Show answer

Answer

Reinforcers are responses that increase the likelihood of repeated behaviour. They can be positive or negative.

Show question

Question

What are punishers?

Show answer

Answer

Punishers are responses that can debilitate behaviour. These responses can diminish the chances of repeating a behaviour.

Show question

Question

Give an application of operant conditioning.


Show answer

Answer

Teachers can give students compliments or reward them with points for class participation.

Show question

Question

What is a token economy?

Show answer

Answer

Token economy reinforces desired behaviour through tokens considered secondary reinforcers, later replaced with rewards, known as primary reinforcers.

Show question

Question

What is behaviour shaping?

Show answer

Answer

Behaviour shaping can produce complex responses by using rewards and punishments with successive steps. Each reward or punishment should bring the individual closer to the goal.

Show question

Question

Which reinforcement is the example of taking motion sickness medication before a long car ride?

Show answer

Answer

Negative reinforcement.

Show question

Question

If you drink an energy booster during work to help you focus and finish on time, you are more likely to repeat this behaviour. Which reinforcement is this?

Show answer

Answer

Positive reinforcement.

Show question

Question

Why do psychologists argue that we cannot replicate animal studies on humans?

Show answer

Answer

They argue that humans have different anatomy than animals, and they have the power to control many behaviours using reason and self-control, for example.

Show question

Question

Fill in the blank - We are more likely to repeat actions with a _________ than with a ________.

Show answer

Answer

Positive consequence, negative consequence.

Show question

Question

What happens if we are punished for a certain behaviour?

Show answer

Answer

If we are punished for a certain behaviour, we are likely never to repeat that behaviour.

Show question

Question

What was Skinner's (1948) reasoning behind operant conditioning?

Show answer

Answer

Skinner considered classical conditioning an incomplete explanation of human behaviour. He stated that it is essential to consider action and its consequences to determine the causes of human behaviour.

Show question

Question

What is reinforcement?

Show answer

Answer

Reinforcement means that a rewarded behaviour is repeated more often than a behaviour that is not reinforced.

Show question

Question

Where did Skinner observe the behaviours of the animals in his experiments?

Show answer

Answer

The Skinner box.

Show question

Question

What did the Skinner box do?

Show answer

Answer

The Skinner box recorded the behaviour of an organism in a specific time frame.

Show question

Question

What is positive reinforcement?

Show answer

Answer

Skinner (1948) said that when positive reinforcement rewards a behaviour, it increases the likelihood of repeating it.

Show question

Question

Give a real-life example of positive reinforcement.

Show answer

Answer

If you drink an energy booster during work to help you focus and finish on time, you are more likely to repeat this behaviour.

Show question

Question

What is negative reinforcement?

Show answer

Answer

Skinner (1948) said that eliminating a negative stimulus is rewarding to the person and is most likely to reinforce the behaviour in negative reinforcement.

Show question

Question

How did Skinner test negative reinforcement?

Show answer

Answer

Skinner tested negative reinforcement by placing the rat in the Skinner box for some time each day. When the rat was placed in the box, it received unpleasant electric shocks. When the rat moved into the box, it accidentally pressed the lever, and the electric shocks stopped immediately (negative reinforcement). After being placed in the box a few times, the rat quickly learned this behaviour. The next time the rat was placed in the box, it immediately hurried to press the lever to avoid the unpleasant experience of the electric shocks.

Show question

Question

What lasting effects can punishment have on the recipient?

Show answer

Answer

Punishment can cause aggression because it is a coping mechanism (to deal with problems in life).

Show question

Question

When are punished behaviours more likely to re-occur?

Show answer

Answer

They are more likely to reoccur when the behaviour is no longer punished.

Show question

Question

Give an example of a token economy in a school. 

Show answer

Answer

A teacher may reward students with five extra points on exams if they participate in class discussions. Such an approach may likely reinforce the desired participation and engagement in class.

Show question

Question

How do companies shape behaviour?

Show answer

Answer

Companies use non-monetary benefits such as bonuses or free trips to encourage employees to achieve complex project goals.

Show question

Question

Which theory supports the principles of operant conditioning?

Show answer

Answer

The Social Learning Theory by Bandura (1977). 

Show question

Question

Why can operant conditioning not be regarded as a complete explanation of learning in humans and animals?

Show answer

Answer

This is because operant conditioning does not consider the influence of biological and mental processes such as memory or problem-solving on the learning process.

Show question

Question

What is operant behaviour?

Show answer

Answer

Operant behaviour is behaviour influenced by its outcomes.

Show question

Question

What are the three parts of behaviour according to Skinner’s three-term contingency?

Show answer

Answer

Discriminative stimulus, operant response, and reinforcers/punishers.

Show question

Question

What are the four types of operant conditioning?

Show answer

Answer

Positive reinforcement, negative reinforcement, positive punishment, and negative punishment.

Show question

Question

This type of punishment involves taking away something valuable following a response.

Show answer

Answer

Negative punishment.

Show question

Question

Reinforcement _________ behaviours and punishment __________ behaviours.

Show answer

Answer

Strengthens, weakens.

Show question

Question

Which is not an example of a secondary reinforcer?

Show answer

Answer

Food.

Show question

Question

True or false. Primary reinforcers are universal.

Show answer

Answer

True.

Show question

Question

This reinforcement schedule involves reinforcing behaviours every time the learner commits the targeted behaviour.

Show answer

Answer

Continuous reinforcement.

Show question

Question

What are the three principles of operant conditioning?

Show answer

Answer

Principles of immediacy, principle of contingency, and principle of satiation.

Show question

Question

This reinforcement schedule leads to an increased number of responses as reinforcement approaches.

Show answer

Answer

Fixed interval schedules.

Show question

Question

Gambling and lottery games are excellent examples of reinforcement based on which of the following:

Show answer

Answer

Variable ratio schedules.

Show question

Question

True or false. Psychologists can use operant conditioning to treat psychological conditions such as obsessive-compulsive disorder (OCD).

Show answer

Answer

True.

Show question

Question

One difference between classical and operant conditioning is that in classical conditioning, the learner is _________, while in operant conditioning, the learner is __________.

Show answer

Answer

Passive, active.

Show question

60%

of the users don't pass the Operant Conditioning quiz! Will you pass the quiz?

Start Quiz

How would you like to learn this content?

Creating flashcards
Studying with content from your peer
Taking a short quiz

How would you like to learn this content?

Creating flashcards
Studying with content from your peer
Taking a short quiz

Free psychology cheat sheet!

Everything you need to know on . A perfect summary so you can easily remember everything.

Access cheat sheet

Discover the right content for your subjects

No need to cheat if you have everything you need to succeed! Packed into one app!

Study Plan

Be perfectly prepared on time with an individual plan.

Quizzes

Test your knowledge with gamified quizzes.

Flashcards

Create and find flashcards in record time.

Notes

Create beautiful notes faster than ever before.

Study Sets

Have all your study materials in one place.

Documents

Upload unlimited documents and save them online.

Study Analytics

Identify your study strength and weaknesses.

Weekly Goals

Set individual study goals and earn points reaching them.

Smart Reminders

Stop procrastinating with our study reminders.

Rewards

Earn points, unlock badges and level up while studying.

Magic Marker

Create flashcards in notes completely automatically.

Smart Formatting

Create the most beautiful study materials using our templates.

Sign up to highlight and take notes. It’s 100% free.

Start learning with StudySmarter, the only learning app you need.

Sign up now for free
Illustration