PicoSearch

Resources > Training

Operant Conditioning (Under construction)

How do we get our canines to do the complex behaviors of SAR work?

Finding missing people sounds complicated. Is there a preferred breed or intense training program we put our canines through? No. All of our dogs are trained by their owners through operant conditioning utilizing positive reinforcement. Huh, what does that mean? Can I do this with my dog without getting confused? ABSOLUTELY! To learn more, keep reading but keep the following in mind: Every interaction you have with your animal has an impact on its behavior; training does not stop after you put away the clicker; with time and patience you can train anything!

Classical conditioning overview.

To better understand current positive training methods, you should be aware of classical conditioning. This is an involuntary or automatic response to a stimulus. Ivan Pavlov (1849-1946) discovered conditioned reflexes and stated that the organism has little control over the response and that the environment elicits the behavior from the organism. The relationship between behavior and environment is stimulus-response. Perhaps you are familiar with Pavlov's dog experiment? In his experiment, Pavlov rang a bell before feeding a dog. Over time, he noticed the dog salivated after hearing the bell ring and eventually he could ring the bell without feeding the dog and the dog would still salivate. Why? The dog involuntarily learned that the bell meant food and anticipated its arrival by salivating. This is also known as respondent conditioning.

Operant conditioning overview.

Modern conditioning principles are similar to classical conditioning but voluntary behavior is emitted by the organism, which is controlled by its consequences. This relationship was defined by BF Skinner in his book, The Behavior of Organisms, (1938) as a type of learning in which the probability of a behavior recurring is increased or decreased by the consequences that follow it. The animal/person being conditioned is the operator AND chooses to engage in the activity.

Believe it or not, this is an easy way to teach and an easy way to learn. Simply stated, operant conditioning is modifying behavior through consequences. Much of our learning is operantly conditioned- we make choices about our actions based on our past experiences. Our goal is to teach animals to respond consciously (not react as with classical conditioning) to each of our interactions with them. Once you understand the basic theory behind operant conditioning, the application (or practicing) of these techniques does not seem so difficult.

Ten laws of operant conditioning.

Karen Pryor is a leading animal behaviorist and author of Don't Shoot the Dog! (A must read for the serious trainer!) In her book, she defines ten important rules for operant conditioning. They are as follows:

  1. Raise criteria in small increments.
  2. Train one criterion at a time.
  3. Vary reinforcement before moving to the next approximation.
  4. Relax old criteria when introducing new criteria.
  5. Plan ahead.
  6. Don't change trainers mid stream.
  7. If a plan does not work, change the plan.
  8. Don't stop a session gratuitously.
  9. Regress when behavior deteriorates.
  10. End on a positive note.

What is a bridging stimulus?

A stimulus that pinpoints in time the precise moment of a desired behavior and bridges the gap in time between that point and when the organism may receive further reward. This is the MOST important piece in training.

A bridging stimulus, or bridge, is also a conditioned/secondary reinforcer or a marker that means, "yes", "good", "correct", "food is on the way". Think of it as a universal language, communicating to your animal that it has done something in/correctly. If s/he hears the bridge then s/he performed correctly; is s/he does not hear the bridge then s/he has not performed correctly.

The bridge helps in the timing of reinforcement and can be one of many secondary reinforcers for your animal, this one just has a more specific meaning. Each bridge must be precise and have consistent clarity. Remember, this is your communication to your canine!

There are 3 types of bridges- verbal, whistle, and clicker. The clicker is the more popular; they are sold in almost every pet store and all sound the same. Once you have one, how do you begin using it? Start by associating the bridge with a primary reinforcer, like food. You can place a piece of your canine's food on the floor and bridge each time s/he eats it. Your canine will quickly learn that the clicker equals food. You can practice your bridging timing by catching keys in your hand and bridging when they immediately touch your hand.

What is stimulus control?

A discriminative stimulus (indicated as SD but also known as a cue or command) is a stimulus that precedes a behavior, signaling that a specific response will be reinforced if emitted correctly. The result is that the stimulus will consistently elicit only that particular response. There are a few options to choose from. You can use a visual cue (use your hands to tell your dog to sit), an auditory cue (verbally tell your dog to sit),or a tactile cue (touch your dog with a specific pat on a specific body part to tell your dog to sit). Whichever type of SD you choose to use, make sure you are consistent with it. A good rule of thumb is to use one cue for each behavior. Don't put your animal in a position where it has to guess what you are asking for; remember s/he doesn't understand English!

When a trained behavior occurs consistently in response to an appropriate SD it is considered to be under stimulus control. The behavior should meet four conditions:

  1. It is immediately offered following the SD

  2. It is offered only when preceded by the correct SD

  3. It is not offered in the presence of another SD

  4. It does not occur in the absence of the SD.

Are there different types of reward systems?

Consequences are highly effective tools in the learning process. There are two types of consequences- reinforcement and punishment. Reinforcement is the preferred consequence in training. Punishment should be reserved by those who fully grasp its detrimental effects.

Reinforcement is any circumstance/event that INCREASES the frequency of that behavior occurring again. This is the basis of all conditioning. Timing is key; if reinforcement is delivered too early or late the animal might be reinforced for a different behavior or forget what behavior it was reinforced for.

Punishment is the removal of the opportunity to obtain reinforcement, which DECREASES the frequency of that behavior occurring again. Punishment can also be positive (adds something to the animal's environment) or negative (removes something from the animal's environment). This consequence should only be used by advanced trainers who fully comprehend its detrimental effects.

How is reinforcement used?

There are three types of reinforcement: Positive (adds something to the animal's environment), Negative (removes something from the animal's environment), and Differential. Positive reinforcement is the most common tool in teaching your animal s/he has done something correctly. Negative reinforcement usually results in the animal looking to escape or avoid the situation. Differential reinforcement is when one behavior is reinforced and others are not. This is most commonly used for incompatible behaviors- your dog cannot physically sit down AND jump on you at the same time. If you want your dog to sit when you walked in the door instead of jump up, you would differentially reinforce the sit behavior.

Reinforcers are anything that occurs in conjunction with a behavior that tends to increase the likelihood that the behavior will occur again. A primary reinforcer is something that the animal finds inherently reinforcing, i.e. something that satisfies a basic biological need- food. A secondary reinforcer is something that has acquired reinforcing value through its association with a primary reinforcer- a toy, tactile, praise.
There are different ways of applying reinforcers. These are known as schedules of reinforcement, of which there are two types. A continuous reinforcement schedule is given after each correct response. An intermittent reinforcement schedule is given periodically after a certain number of responses.

What do you do when your animal performs an incorrect behavior?

You should ignore the behavior and not let you emotions get it in the way. A least reinforcing stimulus (LRS) is a calm, neutral response of 3-5seconds that occurs at the time of the bridge (had there been one). This lets the animal know s/he performed an undesired behavior but that it is OK to make a mistake. Both animal and trainer should remain calm. This method can be used numerously and safely as a learning tool.

If you act on the incorrect behavior, chances are you are using negative reinforcement and/or punishment. A time out (TO) is the removal of opportunity for positive reinforcement. This should only be used once or twice to decrease an animal's angry or frustrated state of mind. Extinction is a method of eliminating a behavior by no longer reinforcing it. This tool should also be reserved for the more advanced trainers as frustration and aggression are common "side effects".

Is there an easy way to get rid of undesirable behavior?

Karen Pryor lists some methods to get rid of undesirable behavior in her book, Don't Shoot The Dog! How many have you thought about or tried?

  1. Shoot the animal

  2. Punishment

  3. Negative reinforcement

  4. Extinction

  5. Train an incompatible behavior

  6. Put the behavior on a cue

  7. Shape the absence of unwanted behaviors

  8. Change the motivation

What is superstitious behavior?

A behavior that the animal offers during the training of another behavior but is unrelated to the behavior being trained. Because the unrelated behavior is inadvertently reinforced with the desired behavior, the animal perceives it as a necessary component of the behavior being trained and therefore necessary to receive reinforcement. As the trainer, you must be constantly aware of this possibility, as it can lead to problems later in training.

Why do superstitious behaviors occur? There are several possible reasons. Poor bridging technique, such as poor timing or too long of a bridge thereby capturing more than one behavior. Poor observations. Sloppy cues which the animal could learn over the original cue.

How do I begin now that I've learned some basics?

Before you dive into this exciting world, take a few minutes to asses yourself and your animal.

  1. Take an animal history. Do you know what type of dog you have? This may seem like a silly question but for those of you with mixed breeds, this will help figure out several behavior questions. Do you know its background? Was it previously abused? Was it hand raised or parent raised?

  2. Animal trust goes both ways. Does your animal trust you? How well do you trust your animal? Spend some quality time bonding so you both learn something about each other.

  3. Learn those operant conditioning techniques. The tip of the iceberg was covered here, so go soak up more knowledge as you progress with your training.

  4. Learn the bridge. Remember, this is how you will communicate with your animal and is the most important piece in training!

  5. Learn what a great tool positive reinforcement is and become proficient in using it.

  6. Remember the behavior you train are extensions of natural behaviors, even the crazy, fun stuff, so be creative.

  7. Your SD must be consistent and different from other cues you are using.

  8. Don't start training without a plan and use successive approximations (small steps) to train. Too many big steps forward can get confusing quickly!

  9. Be creative- make sessions fun but know when to stop and ALWAYS end on a positive note.

This site was last updated on 04/23/2007

 
 

Copyright: Greater Houston Search Dogss