
A
Clicker Training Primer
Ive agreed to write a series of articles for the Commentator introducing the concepts and philosophies of clicker training. In upcoming issues, Ill debunk some common myths about clicker training, explain how to get behavior (and keep it once youve got it), discuss how to deal with unwanted or problem behavior, and delve into the intricacies of reliability, fluency, and training plans. In this first article Im going to define clicker training, introduce some common terms, and explain how it relates to operant conditioning. Clicker Training DefinedBelieve it or not, its not easy to define clicker training. There is no official definition, and so one trainer might define it quite differently from another. People have used the clicker as a marker within an otherwise-traditional program, as an aversive to mark (or punish) undesired behavior, as a cue for attention, and as a cue for a recall. Though all of those are valid uses of the tin noisemaker called a clicker, none of them are examples of clicker training. Clicker training is both a technology for training animals and a training philosophy. As a technology, clicker training relies on positive reinforcement to make it more likely the dog will repeat desired behavior in the future. Two things, however, make clicker training unique. First, its practitioners emphasize the science underlying the method. Clicker training is based on the principles of operant and classical conditioning. Traditional trainers often argue that their methods can also be explained using the terminology of operant conditioningand thats true. Clicker trainers, however, arent just using the terminology to explain results. Theyre actively applying the principles before and during training. This makes clicker training more than a method, more than a set of step-by-step recipes to get behavior. Clicker trainers who learn the underlying principles have at their disposal a powerful set of tools that enable them to analyze behaviors, modify existing methods for individual animals, and create new methods where none previously existed. Second, as the trainer you use a marker signal, the clicker, to tell the animal when he does what you want. The clicker is like a camera taking a picture of the behavior you are training for. The technology is, at its core, very simple:
For example, say you want to teach your dog to sit. When he sits, you click. Then you give him a bite of his favorite treat. The click means That behavior right there! Thats what I want! and Youve earned a reward. If you click and reinforce every time your dog sits, he will soon figure out that sitting earns a treat and offer the sit more often. You can then add a cue, sit, to tell him when you want him to do the behavior. More importantly, clicker training is more than using a clicker to train your dog. It's a different way of thinking, a way of relating to animals that creates a partnership that is mutually reinforcing and pleasurable. As a philosophy, clicker training has evolved from the works and ideas of Karen Pryor, Jean Donaldson, Bob and Marian Bailey, Turid Rugaas, Murray Sidman, and others who believe it's possible to train a dogor raise a family, or live a successful lifeusing the principle of positive reinforcement instead of coercion or force. The Link to Operant ConditioningAs mentioned above, clicker training is based on the principles of operant conditioning. In the video Patient Like the Chipmunks, Bob Bailey defines operant conditioning as both the science of explaining behavior and the powerful technology of changing it. The principles of operant conditioning describe how animals learn. When trainers use operant conditioning, they apply the principles to obtain the results they want. Operant conditioning breaks learning into three parts:
According to this theoretical framework, the consequence of a behavior determines whether it will be repeated or not in the future. If the consequence strengthens a behaviorcauses it to occur more frequentlywe say the behavior has been reinforced. Clicker trainers use positive reinforcement to teach new skills. On the other hand, behavior that leads to unpleasant consequences occurs less frequently. Punishment (as defined below) suppresses unwanted behaviors. In either case, the consequence results from something being either added (+) or taken away (-) from the environment. This leads us to the definitions of four key operant conditioning terms.
People commonly refer to the four principles of reinforcement and punishment as the four quadrants of operant conditioning. That phrase is misleading in two ways. First, it implies that all four principles are equally weighted or of equal use in a training program. In reality, punishmentparticularly positive punishmenthas several drawbacks, some extreme, which make it inappropriate for most training issues. In addition, because an aversive must be applied or threatened before negative reinforcement can occur, negative reinforcement is also a poor choice. Second, the quadrant description doesnt include a fifth principle of operant conditioning, one that clicker training makes particular use of. This is the principle of extinction. With extinction, a behavior is weakened through the absence of any kind of reinforcement. For example, if no one answers your knock at a door, you will eventually stop knocking. If a dog can't reach a dog biscuit on the other side of a fence, it will eventually stop trying. Because extinction doesnt have the drawbacks associated with punishment, clicker trainers use extinction to reduce or eliminate most unwanted behaviors. A more accurate depiction of the relationship between the principles of operant conditioning and clicker training begins with the image of a pie. In clicker training, positive reinforcement is the largest piece, taking up perhaps two-thirds of the pie. The second largest piece is extinction. The third largest is negative punishment. Positive punishment and negative reinforcement are just two tiny slivers. The most important thing to note is that a complete, reliable training program can be composed entirely of positive reinforcement, extinction, and, to a far lesser extent, negative punishment. Is it important to know these definitions? Yes, for two reasons. First, it helps us understand each other much better. In everyday usage, the words positive and negative often mean good and bad. However, in operant conditioning and clicker training, they refer to something added or something taken away. Punishment is another word that carries strong connotations in everyday language, but in the context of operant conditioning, punishment means only that which suppresses the occurrence of a behavior. Second, to clicker train without understanding the science makes clicker training nothing more than a cookbook full of recipes that may or may not work for your dog. Why? Because if you dont understand the underlying behavioral principles, you cant examine a training situation, determine why it isor, more importantly, isntworking, and adjust for your particular dog. Subset, Not SynonymousOperant conditioning is based on five main principles, and all five are legitimate methods of changing behavior. Clicker training, however, does not make use of all five principles. Karen Pryor, who coined the term clicker training, defines clicker training as a subset of the principles of operant conditioning, including only positive reinforcement, extinction, and to a much lesser extent, negative punishment. The late Marian Breland Bailey, who, with her first husband, Keller Breland, brought operant conditioning out of the laboratory and pioneered and perfected the use of event markers in training, supported this definition. Negative reinforcement and positive punishment, though sometimes effective for changing behavior, have several possible drawbacks.
This last issue - inhibiting the animal's willingness to offer behavior - makes positive punishment and negative reinforcement most incompatible with clicker training. Clicker training can produce incredibly precise behaviors, but shaping these behaviors depends upon the dog's willingness to experiment, to offer a variety of responses, some right, some wrong. A dog that is punished for mistakes isn't going to be anxious to try anything new. Trainers who
are new to clicker training often balk at the thought of giving up positive
punishment and negative reinforcement. They equate the lack of physical
aversives with the lack of consequences. Its important to realize
that no study anywhere has ever determined that positive punishment
(or reinforcement) is inherently more effective than negative punishment
(or reinforcement). By definition, all are effective, and within each
is a continuum from mild to extreme. Reliability is also not an issue.
Reliability is not related to methodit is a number, cold data,
a percentage of correct trials. In upcoming issues Ill explain
more about how to apply clicker training as defined in this article
to get precise, reliable behaviors, but first, in the next issue, Ill
debunk some common myths about clicker training. Melissa Alexander | Training Articles Contents || Site Home | List and Site Owner: Melissa Alexander, mca @ clickersolutions.com |