Ethics and Phenomenology

Phenomenology is, generally speaking, a discipline that examines questions of metaphysics and epistemology. Insofar as ethics is usually seen as a topic apart from metaphysics and epistemology, it is thus not typically addressed by philosophers in the phenomenological tradition. However, there are important areas of overlap between ethics, metaphysics and epistemology, which may be fruitful points of departure for exploring a phenomenologically-oriented notion of ethics. In particular, metaphysics and epistemology seek to consider the validity of, among other ideas, analysis and wonder. An exploration of analysis and wonder can reveal the importance of ethics in this context. Once we have seen what follows from this standpoint, further consideration of ethics in terms of engineering will show how this standpoint can inform upon the world of praxis.

Table of Contents

  1. Theoretical Concerns
    1. Ethics Underlies Wonder and Analysis
    2. War
    3. Hospitals
    4. Ethics of Integration
    5. People
  2. Tying Phenomenological Ethics to the World
    1. Overview
    2. Human Factors Engineering
    3. Phenomenology: Brentano
    4. Phenomenology: Edmund Husserl
    5. Phenomenology and Ergonomics: Parallels
    6. Phenomenology and Ergonomics: Enter Ethics
    7. Ethics as Homological
  3. References and Further Reading
    1. Theory
    2. Tying Phenomenological Ethics to the World

1. Theoretical Concerns

a. Ethics Underlies Wonder and Analysis

Ethics can be seen as the foundation of wonder and analytic thought. First, existentialists accept wonder and deemphasize analysis, though phenomenologists tend to be more open to wonder and analytic thinking. Logical positivists and linguistic analysts see wonder as reducible to logic. Existentialists and phenomenologists are comfortable with ethics associated with wonder and analysis. Positivists and analysts deny ethics as an irreducible field of study. Ethicists would look at wonder to see if people need drugs in order to achieve states of euphoria or peace. Additionally, ethicists would take the same view about computers and analytic method.

In both instances, the question of ethics enters concerning more than the validity of wonder and analysis in the traditional philosophical sense (Kazanjian, 80; Buber, p. 11). Traditionally, existentialists and phenomenologists see wonder as revealing what “is.” Analysis has almost no place in much of existentialism, and varying degrees of validity in phenomenology. Traditionally, linguistic analysts and logical positivists see nothing to be gained with wonder. Reality is language, and is to be analyzed, never something about which to wonder.

Ethics brings in a deeper issue in both instances. Even if wonder alone is valid, ought people use drugs to feel a sense of awe? Even if analysis alone can give access to reality, ought people simply resot to computers, the higher the speed the better, to understand what is?

Existential and phenomenological thinkers tell us that awe or wonder is the basis of analysis, or as pure wonder, may stand alone without cognition. Ethicists may argue that awe or wonder is a human trait and ought not require or involve drugs and surgical stimulation of the brain to induce a sense of wonder (Campbell, p. 163). Awe is basically the social, intersubjective reality of living in the world. Phenomenology calls the world the lived world instead of just the material, quantifiable world. This wonder is consciousness in the unaltered state. In this unaltered state, wonder is part of normal, lived, reality or existence.

Ethicists would say that linguistic analysis or logical positivism changes or distorts reality. Drugs and brain stimulation are not lived reality. They develop a state of artificial awe or wonder. If the unaltered mind uses logic alone to access reality, it may mean altering reality from what ought be to what artificially exists. If the person uses mind altering drugs to achieve awe or wonder, then the awe or wonder itself is altered, artificial, and unlived. We then see not the lived world, but the artificial world.

Phenomenology says we should not excarnate or take analysis out of the lived world. Phenomenological ethicists would say we should not excarnate wonder itself from the lived world. Thus, phenomenologists and existentialists would be medically, pharmaceutically, or biologically excarnating wonder or awe from the lived world, even if they refute positivism and analytics’ portrayal of awe or wonder as wrong and insist on wonder or awe as revealing reality. The ethical position becomes a sociological view (Bryant, p. 1).

Paul Ricoeur (p. 217) looks at Cartesian dualism and says that we must overcome its excarnating of objectivity from the body. Non-biochemically, Ricoeur is criticizing Cartesian dualism for ignoring the embodiment of the objective. He is insisting that objectification must be done within the context of the lived world. We analyze the lived world or reduce it to quantities within the general framework of awe or wonder. Ricoeur’s approach suggests that even logical positivism and linguistic analysis needs to look at the problem of excarnation. These movements are in the same category as Cartesian dualism when they consider analysis or reduction as without wonder, or devoid of the lived world.

The difference between logical positivism and analytic philosophy on the one hand, and Cartesian dualism on the other, rests with their views of reality. Dualism sees mind and body, or objectivity and subjectivity, as both real and valid. The problem is how to relate them. Positivism and analytic thinking argue that the lived world, subjectivity, wonder, awe, and so on do not exist as irreducible reality. They are totally reducible to the simples of fact.

Ricoeurian thinking contrasts with Cartesian dualism and with positivism and analytic philosophy by saying all three movements should see themselves as having wrongly excarnated object from the lived world. An ethics approach to wonder, however, goes deeper than Ricoeur. The ethicist would insist that we cannot justify wonder for the sake of wonder. We need to look at how we approach wonder or awe. Saying that the lived world and wonder are important is not sufficient. Arguing against positivist and analytic reductionism of wonder to facts is only partly correct. How we define the biochemical context of wonder is critical.

In ethics, we may wonder by being conscious and not taking drugs to alter the brain. People look around them, or they inquire, or meditate, and feel it crucial to feel a sense of wonder that beings “are.” We wonder by ourselves, without bio-physiological intervention. Indeed, we do not need to take an aspirin or other legal medication to feel as sense of relaxation, calm, or rest. Beyond this, ethics says we ought not feel it important or in any way justified to take any illegal drugs that might induce a “high.” The biochemical high or awe is indeed a biochemical reduction or analytic approach to wonder. This approach would assume that the state of wonder is primarily, perhaps exclusively, a chemical reaction within the brain, and has little to do with normal, non-biochemical experiences of the lived world. In other words, the biochemical approach to wonder suggests that we wonder not through existing, but primarily through changing the chemistry of the brain.

Ethics might call the biochemical approach to wonder as biochemical, pharmaceutical, or otherwise physiological positivism or analysis. Ironically and unfortunately, this becomes serious to the point where no real inquiry, not even traditional logical positivism and analytic thinking is possible. An ethicist might ask us to look at a piece of analytic literature or philosophy. We see symbols, diagrams, and virtually mathematical methods for attempting to determine resolutions to questions and problems. The analytic thinker, the positivist, would argue that they are coming near to solving issues, and that these solutions or clarifications reveal a reality devoid of wonder.

The ethics approach notes that these analytic and positivist thinkers are consciously engaging in intellectually work. They converse with each other, perhaps argumentatively with existentialists and phenomenologists, but always are participating in some kind of control over what they are doing. Their brains are functioning without medication or alteration. Now, the ethicists will point out, consider the biochemically activated phenomenologist or existentialist. In other words, we no longer just speaking of the positivist and analytic thinker inquiring without drugs. We are no longer speaking of positivists and analytic thinkers trying to totally reduce wonder to facts in terms of normal, not medicated activity. What the ethicist criticizes is the phenomenologist or existentialist who is defending wonder through druges. This person is criticizing positivism and analysis for trying to totally non-biochemically reduce wonder or awe to atomism. Yet, the phenomenologist and existentialist is defending irreducibility by feeling wonder, perhaps even attempting writing if that is possible, by consuming biochemicals which will induce the sense of oneness or awe (Eliade, p. 31).

In effect, the phenomenologist or existentialist has become a de facto positivist or analytic thinker. The phenomenologist or existentialist becomes a biochemical phenomenologist or existentialist, totally reducing the chemistry of the brain, body, and lived world to atoms and chemical reactions. If it is possible, the ethicist calls this positivistic or analytic phenomenology or existentialism. On the other hand, for clarification, the ethicist might use another term: biochemical phenomenology or existentialism.

b. War

What of the analytic thinker or positivist using computers for their approaches? Ethics would point to the efforts by analytic thinkers during World War II to crack Hitler’s Enigma Machine code. The machine worked strictly through symbols. Codes are symbols. During WWII the codes were relatively complex, but speed was crucial in breaking them. Today, and in the future, with cryptology becoming increasingly sophisticated, codes become more complex, and the speed required to break them more crucial.

Positivists and analysts would insist that their philosophy requires respect, and faster computers. Ethicists would argue that we need a better world where criminals and dictators are minimized, and their powers decapitated. Having the computer capabilities of speedier problem-solving does not “solve” the problem in its widest sense. The problem in its widest sense is that people, usually the leaders, go bad and make evil things happen in the world. When governments ignore the rise of evil, they usually invite international catastrophes such as the Second World War. As the war occurs, and as the innocent attempt to now fight and defeat the enemy, many on the side of the innocent take pride in their technical efforts.

Technical abilities helped our side win against Hitler in his efforts to communicate through codes. None of this would have had to occur if we had kept him from rising to power in the first place. His rise to power, and the unethical ways we ignored his ascension were key to the disaster of the Second World War. We ignored his actions against Jews and non-Jews. This ignorance was unethical. We sat back and did nothing.

Toward the end, we began panicking and wondered how to solve problems to end the war. One major answer was to break his coding abilities. Fortunately, we broke his code, and this helped us win the war.

Today, intelligence agencies are increasingly positivistic in their coding/decoding efforts. Computers are the foundations of coding/decoding. Speed is paramount. We spend money, lots of it, in developing ways of surreptitiously monitoring telecommunications to determine what potential terrorists are saying. Technology is advancing rapidly in our endeavors to translate foreign and English conversations to determine whether speakers are planning attacks.

Forgotten in all this rush to technologize existence, society ignores the ethical grounds of analysis and computers (Stine, 141). We forget that analysis is embodied in wonder, and that thinking and wonder involve the ethical orientation. Are we ignoring the poor, the economically and socially deprived, the underprivileged? We no doubt are ignoring the impoverished. Then, in the event that the impoverished seek ways of retaliating, we suddenly seeks technical ways of speedier discovery of the terrorists’ plots.

Even when terrorists are wealthy, we seek to look the other way instead of considering their moral deviancy and their ongoing hatred of humanity, especially of the West. We let this hatred grow, assuming that we do not initially deny it. As their hatred grows, it can mushroom into attacks against the West or even people in other cultures. Only then, in post 9/11 fashion, do we react and seek the speediest computers to analyze terrorist activities and conversations.

Ethics is derived from ethos or people. Any human activity must be seen within the social context. Thinking and wonder are among the fundamental human activities. Relegating cognition to the sum total of data becomes anti-human; similarly, relegating wonder to the realm of intravenous or other methods of drug intake is no longer a human activity. The ethos orientation of cognition means that thought, contrary to what Descartes said, is embodied and of social perspective. Cognition is never disembodied. To disembody cognition is to commit two wrongs.

One wrong is to seek cognition as devoid of awe. This makes thought sterile and dehumanizing. The second wrong is to see disembodied cognition as part of a technology where speed is the only way to resolve problems and answer questions.

Awe or wonder is the pre-cognitive requisite of the cognitive. Yet, ethics notes that we cannot stop there. Wonder cannot be an end in itself. If wonder is derived from a natural, non-drug induced sequence whereby we simply wonder that things “are,” then we are practicing true awe. Once we take drugs or otherwise stimulate the brain to induce wonder, than the wonder is unethical. It is mechanical rather than emerging from ethos.

c. Hospitals

Take the example of a hospital’s intensive care unit. Patients are put on a respirator to help them breathe. They may also be put on intravenous feeding so that the body can be “fed” nutrients mechanically instead of taking in food through the mouth. In time, however, society believes that such patients may be retained on such mechanical devices only if their physical conditions warrant such technologization. The purpose of life is for the patient to be helped toward normalcy. In this case, the patient must be helped to leave the hospital and eat and breathe, and so on, on their own.

The objective of life, of the hospital, is never to merely have the patients remain in intensive care, or even in the hospital. People need to be active in daily life, eating, breathing, and so on on their own. To eat and breathe on their own means dining and respiring as part of society, with one’s own bodily abilities. Food is irreducible to nutrients. Breathing is irreducible to oxygen intake. Food and respiration emerge from the ethos, from the ethical. As biological as eating and breathing, they are not merely physical processes.

For example, the nervous person, the seriously emotionally troubled individual, will have difficulty eating and breathing. Human activity such as eating and breathing are as much part of the ethos or ethical, as they are physical, neurochemical, and so on. Indeed, eating disorders such as those resulting in being overweight, imply reducing food to merely physical entities being “put into the mouth.” Eating does not mean simply stuffing the mouth, eating quickly, or any other physical process. Dining is a cultural, ethical process.

Similarly, we do not just respire by hyperventilating. We breathe by inhaling and exhaling normally, often unconsciously. Perons inhaling too fast may be suffering from an emotional problem, or perhaps physical difficulty. Persons inhaling and exhaling too fast are behaving unethically, anti-ethos or different from normal human activity.

We can say the same about wonder and analysis. Wonder is something we sense under normal human conditions without mechanical assistance. Drugs ought not play part of wonder.

Analysis is an activity in which we participate without the aid of computers, and hopefully within the context of wonder. To think analytically is to take apart. But to spend our time only taking apart means that we are simply assuming that words, pictures, behavior, and so on are only to be taken apart and never appreciated as products of the ethos or community. Taking apart ought mean that something was initially a whole. That wholeness cannot be violated. If we emphasize the taking apart aspect of existence, and reject or ignore the synthetic and the wonderful, we have relegated existence to a form of hospitalization, to a form of the intensive care unit.

Existence is not meant to be only analyzed, and it is not meant to be only wondered. Ethos means that analysis and wonder go hand in hand. Analysis and wonder are not mutually exclusive. We never merely analyze without some wonder, and never wonder by merely mechanical means. Both analysis and wonder reflect an ethical, social, cultural dimension.

Feminism can be helpful here. Feminists argue that nothing written is ever totally objective, and devoid of the cultural. Look at books. Their authors are not just “authorities,” but traditionally have been white males. Their subject matter, too, have typically ignored injustices toward women. Sexism means that we have looked at women simply as reducible to anatomy, and never as human beings. Ethics means that women are human beings, irreducible to physical characteristics.

Racial theory can also help. Racism has meant that authorities writing books have been white males. But the ethical thrust of the women’s’ movement and racial justice has attempted to bring about a better, ethos oriented vision. We now have books and articles written by women, and by nonwhite males. Authors are not just authors. They are a racial-gender-human continuum. No author is the sum total of racial, religious, biological, and other parts. Every author is first of all a human being.

Wonder and analysis, then, are irreducible to mechanical identification. Persons need to be able to wonder with only their mind and body, in awe of the universe or of any particular event. They need only to analyze within the context of this natural wonder, and with computers only on a limited scale.

Ethics does not demand the exclusion of computers from society. The ethos orientation requires only that computing, speed, technology, and other quantification occur within the context of a healthy environment. The idea of proactivity or prevention is important here.

Proactivity means we need to prevent rather than react to bad events. Before illness strikes, we need to monitor physical and other conditions resulting in disease. Ethics means we ought not ignore health dangers, and then react medically, physically, surgically, to “solve” unhealthy situations. Drugs, whether over the counter or prescription, do not need to take the place of a healthy lifestyle and diet. People might need to depend on drugs as they age and their body deteriorates. Even then, they must take drugs only by doctor’s orders, and never simply because the drugs are there.

The preventative, proactive approach to health includes habits of proper diet, exercise, monitoring stress, wearing clothes appropriate to the season, air conditioning during the summer and heat during the winter. These measures and lifestyles help insure that people will not get ill to the extent that they can have some reasonable control over life. Illness can and will come under many circumstances. Viruses, bacteria, many forms of sickness will emerge regardless of what we do to prevent illness.

When illness does come, we need to take a look at the best ways of curing what we have, and returning to a relative healthy state. Physicians may often examine patients and tell them than rest, proper diet, the drinking of fluids, and so on, will probably help bring the patients back to health. Not all diseases require medication. Additionally not all diseases require surgery. Even broken bones may not require cutting the patient. In time, many bones will heal correctly if their break is not in a physical position to cause deformity when healed.

Medicine, then, often seeks to prevent illness through a healthy lifestyle. When medical treatment is needed, pills are often preferable to surgery. Similar approaches are sought by ethicists for awe and analysis.

Wonder and analytic thinking are never mutually exclusive. Existence does not consist of wonder devoid of analysis, or analysis and rational-sensory approaches lacking awe. Most importantly, phenomenological ethics means that wonder and analysis are not to be merely the ends in themselves. We cannot say that because we are analyzing within the context of wonder, we are therefore being ethical, appropriately intellectual and properly in awe.

The states of awe and analysis are human states. They are irreducible to mechanical, physical, neurophysiological methods. Before we consider being in awe as a context for being analytical, we need to realize the need for being ethical, social, humane. Ethics is more than doing right and avoiding wrong in daily activity, business ventures, and the professions. Ethical behavior is basic to cognitive efforts to understand reality. The drug culture of the 1960s assumed that achieving a “high” was very important, but could not be reached until persons smoked pot or did hard drugs to alter the mind.

Similarly, people who believe in the rational approach to existence frequently misinterpret rationalism, logic, calculation, and speed. They too often assume that the logical or rational sequences are only sequences depending on speed. Their view is that speed is fundamental, and therefore the faster a sequence the better. From that view, the quicker we gather and understand greater numbers of variables or parts of the problem, the better our solution.

An unethical view of problem solving involves quick technical solutions to a given problem. A problem can be small or large. Instead of asking ourselves whether the problem is real or not, we frequently tell ourselves that speedy solutions are the answer. For example, take urban crime. We see robbers, burglars, car thieves. We hear of homicides and arsonists. Our typical approach is to assume that crime is crime, and its solution is a nonsocial, purely professional response from the police. The more police the better. The faster our calls are answered, and the quicker the police arrive at the scene, the better we feel that the problem of criminality is being solved.

This unethical view says that more crime we have, the more and faster police response we need. That view also suggests that the faster we get fingerprints and identify the wrongdoer, the more our society is progressing. Our emphasis is on speed, imprisonment or worse, technology, and other mechanical forms of reaction.

The ethical approach is fundamentally different. We would give opportunities to young people in order to attract them to productive lives outside crime. Families need strengthening, discipline must be practiced and taught, neighborhoods aware of wrongdoing, parental responsibility required. Our social institutions must be upheld. Churches, social groups, schools, governmental organizations, hospitals, and all businesses will need to work together. The police are there, but cannot be the only people combating crime. Technology ought be there, but only within the context of the social structures.

Society ought not ignore the social conditions and then go after the criminals arising as a result of deteriorating cultural situations. Culture is the not only contributor to crime. Some people simply may be born trouble makers. A weak social structure lets them do as they please until it is too late. Simply waiting for people to become criminals, then going after them, arresting, taking them to trail, and locking them up are the mechanical ways of recidivism.

The ethical approach attempt to return the criminal to society through rehabilitation when initial parenting or habilitation has failed. We cannot just let young people grow up doing as they please, and then throw the book at them when they go wrong. Society seems to like the mechanical approach to most things. In medicine and health, we increase emergency rooms. In law enforcement, we want more and faster police. Education becomes a mechanical method of learning from computers. Transportation develops into a way of speedier, aircraft, and automobiles even if we need to build bigger airports, and destroy ecology with more highways. Information becomes merely a commodity where we transmit data and receive it with greater efficiency. In more and areas, technology and rapidity of getting something or someone from here to there becomes paramount. Ethically, medicine must involve better health habits, law enforcement better homes, love, and discipline, learning a matter of student teacher interaction, travel a matter of bicycles and trains, and information an issue of understanding and social empathy.

d. Ethics of Integration

Wonder and analysis are good when they are integrated. We cannot have just wonder, or only analysis. Yet, integrated or not, wonder must come from within and never as a result of drugs and electrical stimulation of the brain. Analysis must be within the social context and never merely a computerized battle toward solutions. Phenomenological ethics shows awe to be the view that things are fundamentally one, and culturally uplifting. Basic to all is our wonder that reality is a beautiful, awesome, non-problematic existence.

Existence is more than just a problem to be solved, a difficulty to be overcome. Existing ought mean appreciating life, people, God, culture, and all plants and animals. We cannot just look at the world as an ongoing defect to be repaired. Life may have evil in it, but is not essentially evil. It is a wondrous reality. This view is available to us not just through drugs, but our very natural feeling of awe. Again, good parenting and better social structure can contribute to or take away from this feeling.

Albert Einstein displayed ethics when he told his fellow scholars at Princeton to stop by an say hello from time to time. Most scholars were shocked. They felt that Princeton was a place for intellectual discourse instead of chit chat and normal conversation. They felt even more strongly that Einstein’s work was so critical that they did not wish to interfere in his studies with what they consider small talk or any conversation irrelevant to scientific work.

We think of Einstein as a scientist. He was clearly displaying what phenomenology calls intersubjectivity and wonder as the basis of any scientific work. Einstein believed that normal human beings, even those in intensive scholarly research, needed and should engage in the wonder of interpersonal, face to face community that this the social foundations of any verbal communications. Community is the basis of communications. We cannot communicate or convey information from person to person unless we first establish of acknowledge what phenomenology calls the “given” community or lived world.

e. People

In phenomenological ethics, we are first, last, and always in the community of people, in intersubjectivity, wonder, awe, or the non-cognitive. We are one with vegetation, with nature, with spiritual powers or religious dimensions. The term “people like us” is not to be taken as meaning individuals of our race, creed, color, or gender. It is to be interpreted as meaning that all human beings in the world are like each other. People are the same, regardless of race, creed, and so on.

Wonder means that all things are essentially related with each other. We do not first sense races, creeds, religions, and genders, and then arrive, step by step, to our humanity. The first thing we sense is that all individuals are alike. Races, religions, and so on are differentiations that we tend to make in distinguishing each other. Wonder makes it clear that whatever else we have as differences, human beings are, at bottom, the same.

Only within the context of fundamental awe of the unity of all things, do we then take apart or analyze people from each other, animals, nature, and so on. Analysis, done within awe, is benign. Analysis done outside the framework of wonder becomes mere taking apart of the essentially unified. In this sense, analysis becomes mere destruction.

Wonder and analysis in the ethical perspective comprise our intersubjective, sensory, rational unity. This occurs only when awe and analytic thinking occur within the context of the ethic or ethos: culture. Human beings are meant to awe that they are in the unified world of people, animals, vegetation and nature. Nothing is or ought be totally objectified. We are meant also to differentiate or analyze carefully in order to understand and intellectually cope with the existing world. Within awe, we objectify in order to develop an intellectual stance about why things are as they are.

Intersubjectivity and objectivity go hand in hand. Intersubjectivity or wonder devoid of objectivity becomes dangerously anti-technology. Objectivity alone becomes anti-human. Ethics tells us that this integration is complete when we appreciate intersubjectivity through normal human activity and not through drugs. We need also appreciate objectivity through normal intellectual activity and never through seeing speed, technology, or quantification as an end in itself.

The awe of our being together as a basis for any technique in analyzing that intersubjectivity can be seen MIT’s OpenCourseWare. Classroom learning with face to face interaction is fundamental to any distance learning. Wonder occurs not through mechanical activity but the social interaction found in the classroom; analysis is then found not through sophisticated telecourses, but computers existing and operating in the service and context of face-to-face interaction.

Alfred North Whitehead (p. 232) says that philosophy begins in wonder, and that wonder continues after philosophers have analyzed reality. Judith Boss says ethics begins in wonder. Philosophy can say that wonder and analysis begin with ethics, and that ethics continues as the context or orientation for analysis and wonder, and all activity.

2. Tying Phenomenological Ethics to the World

a. Overview

A key model that represents the way to tie phenomenological ethics to the world is by examining ethics, philosophy, and engineering. Scholars in the field of ethics would say that their field provides basic ideas unifying engineering and philosophy. Those thinkers who are ethicists would indicate that engineering and philosophy share a common ground in ethics. Engineering and philosophy are specific manifestation of ethics. The ethicist’s position sees engineering and philosophy as fields where human beings and values orient technology, objectivity, reason, and logic.

Ethicists (Kazanjian, 1998, Chapter 2) would view ethics as unifying engineering and philosophy. Scholars in ethics would view their field as underlying the humanistic thinking in philosophy, and the scientific views of engineering. Those who study ethics would see ethical ideas as necessary in courses in virtually all disciplines and professions. These scholars see ethics as an interdisciplinary foundation to the arts and sciences. For ethicists, business ethics, legal ethics, medical and biomedical ethics, engineering ethics, are all integral parts of business, law, medicine, and the other disciplines. Those who are ethics scholars would say business ought engage in ethical instead of unethical practices. These ethicists would also see lawyers, physicians, biomedical researchers, engineers, and others as competent when their curriculum teaches them values and morals as well as technical expertise. Ethicists would say that values and morals orient technique. The ethical perspective sees the mechanics of a given field as ethically oriented. Ethics scholars would view any disciplinarian as a professional concerned with human beings instead of merely a cognitive or technical, non-ethical expert. Scholars from the field of ethics see their work as interdisciplinarity, among their tasks being the disclosure of the ethical basis of engineering and philosophy. As such, ethicists see their discipline as basic to liberal arts and sciences, and interdisciplinarity at any level.

b. Human Factors Engineering

Human factors engineering, also known as ergonomics or ergonomic engineering, is that kind of engineering which designs physical environments including machines and processes to match human limits and abilities, and train people to use those environments (Chapanis, p.534; Kantowicz and Sorkin, p. 20). These engineers work with mathematics, physics, chemistry, and often computers. Beyond these scientific and technical fields, ergonomics engineers deal with human beings. These engineers are concerned not only with how to design an environment, but how to design it to be safe for the user.

The ergonomics position sees safety as meaning that engineers ought design the environments to be user friendly and ought avoid both user unfriendly and user too friendly designs (Adams, p. 256). A design that is user unfriendly ignores the user. To be user unfriendly means is a design whereby the machine or process is dangerous or offensive for the user. The other design is user too friendly, whereby the machine or process is so safe as to be rendered unfunctional. Designing something as user friendly means that users are able to work with an environment which takes into account the users’ limits and abilities. Such limits and abilities mean people have arms, legs, eyes, ears, and torsos with certain anatomic and sensory measurements. Arms bend in certain ways and are of certain lengths. The same with legs. Ears hear best at certain sound levels. We see best at certain distances.

Ergonomics is saying that human beings see, hear, and move within certain physical parameters. People do not merely perceive, sense, move, and so on. Any machine or process ought be designed such that it allows the user to use it comfortably, without undue stress or tension. Designing user friendly machines or processes is right. Designing an environment that forces people to merely sense or move is wrong. At the other extreme, designing an environment so safe that users need not make any effort to learn or use it is also wrong. The system could become nonfunctional.

The typical human factors engineering text looks like a combination engineering, psychology, and biology book. Ergonomics engineers say that any physical environment is as much social and psychological as it is mathematical, physical, or chemical. No user friendly design is totally reducible to the sum of nuts and bolts. Human factors argues that objects and people comprise an interface: both are interrelated to each other. Machines/processes and human beings ought not be seen as mutually exclusive, but inherently human-oriented. Al Gini (p. 3) argues that work is vital to our identity, but it must be a humanizing career and never just meaningless, dehumanizing sum of tasks.

Human factors also rejects overemphasizing the user. If machines/processes are to take into account the user’s abilities and limits, they are not to simply make things so safe and user-friendly that the machine/procedure becomes unfunctional or unable to perform its technical task.

c. Phenomenology: Brentano

Phenomenology is the philosophical movement somewhere between existentialism and logical positivism. Existentialists would see human beings or any aspect of reality almost totally irreducible to numbers or rational explanation, while the logical positivist position would view people and any reality as totally reducible to number and reason. The existential position views our social and cultural embodiment or existence is almost completely irreducible to number and reason, whereas logical positivism and linguistic analysis see our existence as basically, perhaps totally, rational and numeric. Brentano is considered the founder of phenomenology. He (Stewart and Mickunas, p. 8) initiated the idea of intentionality. Intentionality means that consciousness or embodiment inherently relates to objects. Consciousness is consciousness of objects. Brentano attempted to overcome the logical positivist notion that objects and sensation are real, and consciousness is totally reducible to objectivity.

Brentano would see the thinking mind and the body mutually interrelated . He believed Cartesian dualism is wrong in stating that thinking and the body are two different entities. In speaking about the mind-body unity, Brentano set the stage for Husserl to develop phenomenology. Brentano spoke of the mind-body continuum and rejected total objectivity. Thinking is continuous or interrelated with the body. But Husserl more fully developed the continuum and rejecting two extremes: thinking alone or objectivism, and mere embodiment or subjectivism.

d. Phenomenology: Edmund Husserl

Edmund Husserl moved beyond Brentano (Stewart and Mickunas, p. 8). Husserl sees a development of the mind-body continuum. Objectivity or mind is never value-free or disembodied, according to Husserl. All objectivity is value-laden or occurs as worldly, social, cultural. This view contrasts with the logical positivist notion that objectivity is the sole reality, and value-free.

Husserl’s position would say objectivity ought be seen as reflecting or matching subjectivity or values. From the perspective of phenomenology, we must consider all phenomena as real that appear to consciousness or our thoughts. Where logical positivists and linguistic analysts, and all emotional terms such as God as poetry and not cognitively meaningful, phenomenologists believe all objectivity reflects subjectivity, culture, values, and ethics.

The phenomenological position sees the mind-body issue in the manner that people ought look at physical environments as continuous with subjectivity, and emotions and noncognitive ideas as the social milieu generating the meaning of physical environments. Phenomenologically, objects, cognition, and cultural artifacts are real: products of human or subjective intentions. Mathematics, physics, chemistry, computers, and all the arts and sciences must be seen as part of life. But these cognitive realities emerge from a social, subjective realm and are not to be divorced from human experience. Cognition is never reducible to numbers, symbols, sense perception, and other non-emotive reality. Words reflect human experiences as a whole.

The position of phenomenology is that objectivity to be value-laden and ought avoid two extremes. One extreme is value-free cognition. This is cognition whereby cognition or any object is seen as free of any emotive or cultural values or spirituality. The other extreme means extreme existentialism that rejects any reducibility. Here, science, technology and any cognitive effort is considered almost anti-human. Phenomenology sees cognition and physical environments as things that take into account our values and any other noncognitive being. People have cognitive and analytical abilities and ought use them in certain ways. Knowing is not a simple matter of sense perception and analysis. The blanket denial of the reality of noncognitive ideas such as God and values suggests too simplistic a means of getting at reality.

Husserl also rejects subjectivism or solipsism. In saying that everything appearing to consciousness is real, critics argues that he was dangerous near, if not in fact, advocating solipsism. However, Husserl reject both logical positivism’s cold objectivism, which says people are objects and values unreal, and extreme existentialism and subjectivism’s solipsism, which maintains that the self is the only reality.

Phenomenology: Alfred Schutz (p. 140) comes from the perspective of applied phenomenology. Specifically, his viewpoint is sociology. He considers sociology as the study of “lived history,” or human institutions within which we find chronological or day to day history. He points out that human beings see, hear, and move within value parameters. Social structures comprise “lived history,” and are the context within which “chronological history” makes sense. Schutz ideas are similar to those of Kenneth Boulding. Boulding, while not technically a phenomenologist, notes that perception and action occur within our images of wholes, and never as the sensing of raw data or merely mechanical anatomic movement. People do not merely perceive, sense, move, and so on.

In phenomenology, consciousness intends or is consciousness of objects, thus revealing a subject-object continuum. Objectivity, perception and movement, in turn, are colored by our values and lived world. Objectivity is continuous with subjectivity. Subjectivity is never the reality of just one person, but intersubjective or social. Thus, phenomenology rejects the existential notion of extreme individuality or the virtually solipsistic ego.

Reality, in phenomenology, is the subject-object continuum or duality. Phenomenologists say we ought avoid Cartesian dualism of the mutually exclusive mind and body. Consciousness is always of the object, and the object is always embodied. Ricoeur (p. 217) argues that phenomenology overcomes Cartesian dualism by reintroducing the excarnate mind into the carnate or body. His efforts enable phenomenology to resolve dualism, as well as the objectivism of positivism, and subjectivism of existentialism.

The mind-body continuum means that subjectivity and objectivity are both real, but comprise a systematic reality instead of parts being real in themselves. Human beings exist in a world of physical reality. We sense this as we consider the lived world of culture giving meaning to material objects and generating ideas. Subjectivity does not exist alone; it requires a object. Likewise, objectivity is not merely “out there;” it is always perceived within cultural, lived orientations.

The phenomenological view is that subjectivity is never devoid of objectivity, while the solipsistic position entails subjectivity as devoid of objectivity. We need the world, for people are part of physical reality. Interpreting objectivity as devoid of subjectivity is similarly wrong. It becomes a dehumanized objectivity disregarding human beings and consciousness. Along these lines and seemingly less serious a problem, dualism is just as wrong, according to phenomenology. Objects and subjects are irreducible to mutual distinct, inherently unrelated entities. We do not just take discreet objectivity and subjectivity and externally juxtapose them. We would be unable to bridge the subject-object gap if it were intrinsically discontinuous or unbridged.

e. Phenomenology and Ergonomics: Parallels

Human factors engineering and phenomenology appear to be mutually distinct fields. One is engineering and quantitative, the other a philosophical movement rejecting total quantification. As such, engineering and phenomenology would seem to be irreconcilable disciplines: engineering being strictly hard culture, phenomenology fundamentally soft culture. But our brief statements above show something else.

A glance at human factors engineering and phenomenology reveals parallels. Human factors believes that all physical environment interface with people. Objects ought be designed as continuous with human operators. The entire system is a machine-person interface or continuum, instead of the machine being something totally objective and non-personal. Phenomenology says that mind or objectification is continuous with the social dimension. Phenomenologists speak of the mind-body continuum. Human factors could speak of the machine-user continuum, phenomenology of the mind-body interface. Human factors would be saying machines are continuous with the user, phenomenology would be indicating that the mind interrelates with the body. In ergonomics, seeing designs or actual machines means seeing the operator or subjectivity. In phenomenology, seeing words on paper must mean seeing human values and other intangibles. For human factors engineers, machines/processes ought be acknowledged as intrinsically continuous or interfacing with people’s physical, social, and psychological limits. In phenomenology, the written word ought be recognized as inherently continuous with values and other cultural themes underlying the empirical.

Human factors says machines/processes ought be user-friendly, and ought not be user-unfriendly. Phenomenology maintains that objectivity ought be seen as value-laden, and never value-free. By user friendly, human factors means buttons, numbers, levers, lights, and other physical apparatus the operations and reasons of which the user can learn relatively easily, and the use of which will not harm the person. The human being need not be the proverbial rocket scientist to understand these operations; training would not require the typical user to earn a Ph.D., or even take one course from MIT. The user also need not be made of steel or physically qualify for Navy SEAL commando work to use the environment. The user friendly environment is designed for the typical person’s intellectual and physical abilities. By value-laden, phenomenology means any word ultimately reflects human values. No word is or can be value-free, as the philosophical movements logical positivism and .linguistic analysis tend to maintain. Positivists and analytic thinkers argue that words such as God, love, and religion do not belong in intellectual discourse because they reflect values and emotion. Words such as chair, table, atom and other words are value-free and non-emotive. However, chair reflects the English language, can imply the electric chair, can mean a department head at a college or university, and appears to be nonsexist relative to the apparently sexist term chairman. Phenomenologists would maintain that no word is value-free, that every term is a sociology of that term. Every word emerges from and reflects the social and cultural framework that produces it.

A user unfriendly environment is totally objective, ignoring human limits and abilities and forcing people to mere push, pull, and perceive. Al Gini (p. 120) notes that work offering no hope and becoming unethical is wrong, and means roughly what ergonomics means by user unfriendly work. Value-free language would mean a totally objective set of words over which there is no debate. However, every math, computer, physics and other science book or piece of literature reflects a human author and the author’s perspective, slant, or view. Feminism and civil rights thinkers have shown that such books (any book) are value-laden whether we like it or not. Each is written by a white male, black male, Latino woman, or person of a particular religious, ethnic, or sexual orientation. An author lacking ethnic, gender, and similar human qualities is impossible.

Human factors could say machines ought be user-laden, while phenomenologists might indicate that objectivity ought be seen as subject-friendly. The human factors term “user” is synonymous with phenomenology’s term “subject.” User and subject mean the human being and the cultural context from which the human being emerges.

Both ergonomics and phenomenology look at human-made environments as reflecting culture and not as just cognitive, scientific, or merely objective fields of study and work. Moreover, both ergonomics and phenomenology consider the human as part of the object. Thus, ergonomics notes that we ought avoid simply catering to the person’s every desire and want, and phenomenology rejects solipsism’s view that the individual is the sole reality.

f. Phenomenology and Ergonomics: Enter Ethics

The previous section notes the technical parallels between ergonomics and phenomenology. Readers will see the term “ought” throughout the paragraphs.

Phenomenology and human factors have fundamental parallels, as indicated in the previous section. These are intellectual or technical similarities. They indicate that both see a unity of objects and people.

In doing so, they are ethical in the general sense. Both machines and rational thought emerge from the social context. Ergonomics argues that machines reflect the social and cultural milieu, and are not totally reducible to nuts and bolts. Phenomenologists (Stewart and Mikunas, p. 10) note that God, love, anger, desire, and other intangibles are real because they appear to consciousness. Secular phenomenologists consider nonreligious themes as real. Religious phenomenologists believe that theological and spiritual notions such as God are real.

Both human factors experts and phenomenologists deny that sensation is our only way of knowing and experiencing. Ergonomics engineers would say it is wrong or unethical to design a machine or process that has operators simply “look,” “hear,” or otherwise sense a control panel or other part of a machine. Phenomenologists argue that we would be outside the ethos or culture if we considers human behavior or reality as strictly sensory phenomena.

Engineers abide by and study professional ethics, and the Occupational Safety and Health Administration monitors dangerous in the workplace according to federal law. Phenomenology, however, does not become part of a professional ethics issue except in the case of the ethics of teaching. It may seem that in phenomenology, the subject-object discontinuity or dichotomy is only an academic rather than a technically ethical matter as in engineering. To say that objects are disconnected from and do not reflect subjectivity is not an ethical matter.

Phenomenology is concerned with ethics in the broad sense of ethos or culture. Totally reducing knowledge or reality to the empirical means excarnating or taking sensation out of the social realm comprising ethos. In applied phenomenology, reducing people to computerized forms, numbers, and related paper work may be seen an unethical or socially undesirable.

In acknowledging the social and psychological as well as physical side of people, both human factors engineering and phenomenology are rooted in the sociology of human-made products. A sociology of work suggests that people do not just “do.” They do and know within social, ethos, and thereby ethical constraints.

Both human factors engineering and phenomenology share the view that the person or subject is not alone. In ergonomics, machines ought take the user into account, but this does not mean that the design reflect everything about the user. Physical environments should not be so designed as to satisfy every want, desire, and whim of the operator. Operators need to be trained, and put forth effort to realize that the environment requires change on the users’ part. Additionally, operators are continuous with their surroundings. They are not Luddites, working or existing alone, without the use of physical environments. Phenomenology says that subjectivity is not the same as subjectivism. In subjectivism, the self is considered to be alone, devoid of objectivity.

Phenomenology seems not to fall into the same ethics category of including punishments for unethical behavior as does human factors engineering. However, the culture that supports a totally dehumanized attitude toward people, such as allowing computerization to go wild and reduce everyone to numbers in every instance, is manifesting an anti phenomenological view. The positivism attitude is that we merely know and are excarnated from feelings and emotions. No ethical ruling can be made against positivism as an intellectual movement Yet positivism reflects the culture view that we can and ought ignore feeling and other intangibles.

The lived world is phenomenology’s notion that people live, work, and play in a social context where not everything is totally reducible to numbers or is effable. Paul Ricouer tells us that Cartesian dualism is the effort to see the world and the mind as two different substances. The Cartesian world-view means that cognition is excarnate, discontinuous with the body. Positivism argues that the cognitive is all there exists. Ricoeur would want us to reintroduce the cognitive into the lived world, and to see cognition as incarnate or embodied.

Broadly speaking, the embodied viewpoint is the ethos-oriented viewpoint whereby cognitive activity emerges from the parameters of culture. People ought not just think. Basically, they never just think. Thus, no individual ought take the stand that we are simply thinking substances, whether this substance is somehow related to the body in dualistic terms, or stands by itself in positivistic notions. On the other hand, the cognitive is part of life. We ought not consider the reductive or cognitive as unwarranted, as in much existentialism. We certainly ought not take the view that the cognitive does not have a reality, that the self is alone, that each of us is isolated.

The ethical view posits a holistic perspective. Objectification ought be seen as interfacing or being continuous with the subject or intersubjectivity. Neither objectification devoid of subjectivity, nor subjectivity without objectivity, is the ought.

Human factors speaks of groups of users, not just a user, as reference for designing machines. Phenomenology speaks of intersubjectivity, not just of one subject, a reference for seeing the cognitive. Both ergonomics and phenomenology look at individuals as social, and their limits and abilities are pertaining to groups rather than to one or two people.

Phenomenological thinkers take the position that could be interpreted as the philosophical version of ergonomics. In ergonomics, we do not hear of linguistic analysis, logical positivism , or existentialism. Yet, Ergonomics reveals or deals with language in the broadest sense. When ergonomics speaks of people seeing, hearing, touching, pulling, they are using language. In saying that a person is something that simply sees, hears, etc., we are being positivistic and reducing the individual to an object. If we agree that users are human beings who see, hear, and otherwise sense and move within emotive, cultural, and physical contexts, we are then thinking or using language from a phenomenological viewpoint.

Traditional linguistic analysis tends to imply that philosophers in that vein are only thinkers and not fundamentally akin the engineering. An interdisciplinary attitude with a broad vision of language sees things differently. Language analysts in philosophy work with symbolic logic and not technical mathematics. Human factors engineers work with mathematics, but are suggesting that people are indeed at least partly physical, sensory, and material. Ergonomics may be called human factors, but it can also be called subjectivity factors: we need to take the subjective and cultural into account for engineering processes.

As a corollary, the phenomenological position would be that human factors considers users as not mere objects, but that any characteristic of the person that appears to consciousness is a valid reality. Thus, engineers who are only nuts and bolts people traditionally say we are only skin, neurons, senses, and bones. This is very positivistic language. Phenomenologically, users are also values, emotions, spirituality, and ethos as a whole. Human factors and phenomenology are looking at operators as fundamentally human beings with dignity and essentially irreducible qualities.

Logical positivists might argue that their members helped win World War II by cracking Hitler’s Enigma Machine code. This is true. On the more fundamental side, Hitler would not have risen to the powerful level that we allowed to him to do so had we been phenomenological and cultural. As he was rising and accumulating power, a cultural view would have told us to stop him in his tracks. Had we done so, war would have been unnecessary, the Normandy invasion would not have had to occur, and Hitlers codes would not have had time to develop to be used against us.

Ethics tells us that human factors and phenomenology speak the same language, though ergonomics is the trained engineer designing machines, and phenomenologists are philosophers trained in inquiry and argument instead of the design of physical environments. Physical environments are but a form of language. Ergonomics and phenomenology speak the same language in terms of acknowledging that objectivity is subject- or value-oriented. Al Gini speaks of work in terms of business ethics: work must be ethical and never unethical.

They speak the same language in saying that human beings are essential social instead of standing alone. The physical, written, and motor environments are never totally reducible to objects “out there.” But as reflections of human beings, these environments mirror “our” and not “my” world. Any human value represents the share world-view of numerous individuals comprising a group. Language ought never be either completely symbolic as in the totally logical methods of linguistic analysis, nor ought it be simply one person’s language which no other person can understand.

Two people, one a human factors engineer, the other a phenomenologist, can look at a machine or consider a procedure. These two individuals can communicate with each other if they understand their shared viewpoint. Both are coming from the ethical perspective. The engineer is saying that the numbers, words, and motions, which is to say, the language, of a system, ought reflect human beings in light of culture. A phenomenologist is saying that the writings in a human factors text ought reflect the social, psychological and related value-oriented words and meanings we see in culture.

Both the ergonomics and phenomenological philosopher would agree that the human values comprise a share enterprised that reflects the objective world continuous with the cultural milieu. No person is an island, no person is reducible to flesh and bones. Somewhere between extreme individualism and mere objectivism, the subject-object continuum or machine-person interface comprises a reality including the validity of external and internal worlds.

Ethics means objects are the externalizing of human ideas and the validity of the outside world. As ethos, we are neither extra-ethos nor merely ethos. The extra-ethos or extra-ethical suggests that people are sensations and motions; the merely ethos or ethical can mean we are only a commune, only a community doing little or nothing. Pushed to the extreme, the commune leads to the individual member as possibly believing that he or she stands alone.

In both human factors and phenomenology, language as our fundamental nature is seen as an object-subjective reality. Positivism sees language as symbols and sensory activity; traditional engineering involving merely nuts and bolts sees machines, and therefore language, as the sum total of physical parts. Human factors and phenomenology, rooted in ethos, consider language as a holistic reality whereby being serves to objectify itself through beings.

g. Ethics as Homological

We typically think of ethics as a course of study, as in professional or philosophical ethics. In that way, ethics is no more fundamental than any other discipline. The above pages show that ethics is isomorphic or homological. Isomorphic is derived from iso meaning the same, and morphic meaning shape. Ethics is the same shape or principle that underlies ergonomics and phenomenology. Homological is derived from homo means same, and logic meaning word or structure. Ethics is the same structure from which ergonomics and phenomenology are derived. Learning ethics is basic to human factors engineering and phenomenology. As we consider ethics, we find it necessary to objectify within the parameters of culture, values, and perhaps spirituality. From the engineering perspective, ethics becomes a method of developing physical structures for human use. From the philosophical view, ethics can be interpreted as the intellectual inquiry into knowledge and reality.

As an isomorphic or homological root to human factors and phenomenology, ethics becomes the foundations for a liberal arts. Whatever we know and do, we are inherently facing the opportunity to know and do what is humane, and what is not. This opportunity means intellectual and engineering approaches are basically ethical. They emerge from culture or ethos. To deny values would be to reject our cultural foundations; to seek only values would be unrealistic.

Interdisciplinary research has gone in two directions. One is interdisciplinarity in the sense of team teaching and putting together courses and topics in some sort of seamless or minimally seamed fabric. Ethics plays an equity role here. It is one of the disciplines or ideas relevant to knowledge. The other direction (Kazanjian, 2002, p. 30) is more fundamental. This is the isomorphic direction. Isomorphic or homological ethics means that we must study ethics as a cultural, ethos framework within which we find the roots for all other professions.

Ethics as the homological root of numerous disciplines can thereby show us how to understand human factors engineering and phenomenology. Take ethics away from ergonomics and we essentially eliminate human factors engineering. The very term “human factor” implies that ethos is basic to engineering. Take ethics away from phenomenology and we basically have logical positivism. The subjective orientations of objectivity refer to the ethos from which emerges the objective.

Contemporary interest in professional ethics implies that ethical and non-ethical thinking and doing is something new. Indeed, our admission is new. The existence of the ethical perspective is as old as humanity. Mircea Eliade (p. 31) has taken pains to elucidate the validity of ethics in primitive cultures. In every society we have dos and don’ts. Without that view, any person in any culture will simply do or know, and the result could hurt that person or others.

Eliade’s point concerns comparative religion. Ancient societies did everything by repeating the anatomic gestures as performed in Primordial Time by the gods. Primordial Time is the time before the gods created the world and our idea of calendar time. No member of any society simply “did” something. Today, we imply that we merely do or know. Even Eliade suggests that contemporary society is totally secularized. Yet, Clifton Bryant’s research in the sociology of work (p. 1) catalogues the social and thus ethical directions of knowledge and technique.

Ethics, then, is not just another topic (Kazanjian, p. 3). It is the ongoing insight that ethos or culture provides us with the framework for survival and etiquette. Human factors and phenomenology are two specific manifestation of that insight.

3. References and Further Reading

a. Theory

  • Clifton D. Bryant ed. The Social Dimensions of Work Englewood Cliffs, NJ.: Prentice-Hall; 1972
  • Martin Buber I and Thou New York: Charles Scribner’s Sons; 1970.
  • Jeremy Campbell The Improbable Machine New York: Simon and Schuster; 1989,
  • Mircea Eliade Patterns in Comparative Religion Cleveland: The World Publishing Company; 1958.
  • Michael M. Kazanjian Learning Values Lifelong The Netherlands: Rodopi; 2002.
  • Paul Ricoeur Freedom and Nature: The Voluntary and the Involuntary Evanston, IL: NorthwesternUniversity Press; 1968.
  • David Stewart and Algis Mackunas Exploring Phenomenology Athens, OH: Ohio University Press; 1990.
  • G. Harry Stine The Hopeful Future New York: Macmillan;1983.
  • Alfred North Whitehead Modes of Thought New York: The Macmillan Company; 1958.

b. Tying Phenomenological Ethics to the World

  • Jack A. Adams Human Factors Engineering. New York: Macmillan; 1989.
  • Mircea Eliade Patterns in Comparative Religion. Cleveland, OH: World Publishing; 1963.
  • Clifton D. Bryant ed. The Sociology of Work. Englewood Cliffs, New Jersey: Prentice Hall; 1972.
  • Alphonse Chapanis, “Human Engineering,” in Operations Research and Systems Engineering ed. Charles D. Flagle, William H. Huggins, and Robert R. Roy, Baltimore: The Johns Hopkins University Press; 1960.
  • Al Gini My Job, My Self. New York: Routledge; 2001.
  • Edmund Husserl Ideas Pertaining to a Pure Phenomenology and to PhenomenologicalPhilosophy The Netherlands: Kluwer; 1967..
  • Barry H. Kantowicz and Robert D. Sorkin. Human Factors. New York: John Wiley; 1983.
  • Michael M. Kazanjian Phenomenology and Education The Netherlands: Rodopi: 1998.
    • Especially chapters one and two comparing ethics, human factors, and phenomenology.
  • Michael M. Kazanjian Learning Values Lifelong The Netherlands: Rodopi; 2002.
  • Algis Mikunas and David Stewart Exploring Phenomenology. Athens, OH: Ohio University Press; 1990.
  • Paul Ricoeur Freedom and Nature: The Voluntary and the Involuntary. Evanston: Northwestern University Press; 1966.
  • Alfred Schutz The Phenomenology of the Social World. Evanston: Northwestern University Press;1967.

Author Information

Michael M. Kazanjian
Email: mkazanji@triton.edu
U. S. A.

Epsilon Calculi

Epsilon Calculi are extended forms of the predicate calculus that incorporate epsilon terms. Epsilon terms are individual terms of the form ‘εxFx’, being defined for all predicates in the language. The epsilon term ‘εxFx’ denotes a chosen F, if there are any F’s, and has an arbitrary reference otherwise. Epsilon calculi were originally developed to study certain forms of arithmetic, and set theory; also to prove some important meta-theorems about the predicate calculus. Later formal developments have included a variety of intensional epsilon calculi, of use in the study of necessity, and more general intensional notions, like belief. An epsilon term such as ‘εxFx’ was originally read as ‘the first F’, and in arithmetical contexts as ‘the least F’. More generally it can be read as the demonstrative description ‘that F’, when arising either deictically, that is, in a pragmatic context where some F is being pointed at, or in linguistic cross-reference situations, as with, for example, ‘There is a red-haired man in the room. That red-haired man is Caucasian’. The application of epsilon terms to natural language shares some features with the use of iota terms within the theory of descriptions given by Bertrand Russell, but differs in formalising aspects of a slightly different theory of reference, first given by Keith Donnellan. More recently, epsilon terms have been used by a number of writers to formalise cross-sentential anaphora, which would arise if ‘that red-haired man’ in the linguistic case above was replaced with a pronoun such as ‘he’. There is then also the similar application in intensional cases, like ‘There is a red-haired man in the room. Celia believed he was a woman.’

Table of Contents

  1. Introduction
  2. Descriptions and Identity
  3. Rigid Epsilon Terms
  4. The Epsilon Calculus’ Problematic
  5. The Formal Semantics of Epsilon Terms
  6. Some Metatheory
  7. References and Further Reading

1. Introduction

Epsilon terms were introduced by the german mathematician David Hilbert, in Hilbert 1923, 1925, to provide explicit definitions of the existential and universal quantifiers, and resolve some problems in infinitistic mathematics. But it is not just the related formal results, and structures which are of interest. In Hilbert’s major book Grundlagen der Mathematik, which he wrote with his collaborator Paul Bernays, epsilon terms were presented as formalising certain natural language constructions, like definite descriptions. And they in fact have a considerably larger range of such applications, for instance in the symbolisation of certain cross-sentential anaphora. Hilbert and Bernays also used their epsilon calculus to prove two important meta-theorems about the predicate calculus. One theorem subsequently led, for instance, to the development of semantic tableaux: it is called the First Epsilon Theorem, and its content and proof will be given later, in section 6 below. A second theorem that Hilbert and Bernays proved, which we shall also look at then, establishes that epsilon calculi are conservative extensions of the predicate calculus, that is, that no more theorems expressible just in the quantificational language of the predicate calculus can be proved in epsilon calculi than can be proved in the predicate calculus itself. But while epsilon calculi do have these further important formal functions, we will not only be concerned to explore them, for we shall also first discuss the natural language structures upon which epsilon calculi have a considerable bearing.

The growing awareness of the larger meaning and significance of epsilon calculi has only come in stages. Hilbert and Bernays introduced epsilon terms for several meta-mathematical purposes, as above, but the extended presentation of an epsilon calculus, as a formal logic of interest in its own right, in fact only first appeared in Bourbaki’s Éléments de Mathématique (although see also Ackermann 1937-8). Bourbaki’s epsilon calculus with identity (Bourbaki, 1954, Book 1) is axiomatic, with Modus Ponens as the only primitive inference or derivation rule. Thus, in effect, we get:

(X ∨ X) → X,
X → (X ∨ Y),
(X ∨ Y) → (Y ∨ X),
(X ∨ Y) → ((Z ∨ X) → (Z ∨ Y)),
Fy → FεxFx,
x = y → (Fx ↔ Fy),
(x)(Fx ↔ Gx) → εxFx = εxGx.

This adds to a basis for the propositional calculus an epsilon axiom schema, then Leibniz’ Law, and a second epsilon axiom schema, which is a further law of identity. Bourbaki, though, used the Greek letter tau rather than epsilon to form what are now called ‘epsilon terms’; nevertheless, he defined the quantifiers in terms of his tau symbol in the manner of Hilbert and Bernays, namely:

(∃x)Fx ↔ FεxFx,
(x)Fx ↔ Fεx¬Fx;

and note that, in his system the other usual law of identity, ‘x = x’, is derivable.

The principle purpose Bourbaki found for his system of logic was in his theory of sets, although through that, in the modern manner, it thereby came to be the foundation for the rest of mathematics. Bourbaki’s theory of sets discriminates amongst predicates those which determine sets: thus some, but only some, predicates determine sets, i.e. are ‘collectivisantes’. All the main axioms of classical Set Theory are incorporated in his theory, but he does not have an Axiom of Choice as a separate axiom, since its functions are taken over by his tau symbol. The same point holds in Bernays’ epsilon version of his set theory (Bernays 1958, Ch VIII).

Epsilon calculi, during this period, were developed without any semantics, but a semantic interpretation was produced by Gunter Asser in 1957, and subsequently published in a book by A.C. Leisenring, in 1969. Even then, readings of epsilon terms in ordinary language were still uncommon. A natural language reading of epsilon terms, however, was present in Hilbert and Bernays’ work. In fact the last chapter of book 1 of the Grundlagen is a presentation of a theory of definite descriptions, and epsilon terms relate closely to this. In the more well known theory of definite descriptions by Bertrand Russell (Russell 1905) there are three clauses: with

The king of France is bald

we get, on Russell’s theory, first

there is a king of France,

second

there is only one king of France,

and third

anyone who is king of France is bald.

Russell uses the Greek letter iota to formalise the definite description, writing the whole

BιxKx,

but he recognises the iota term is not a proper individual symbol. He calls it an ‘incomplete symbol’, since, because of the three parts, the whole proposition is taken to have the quantificational analysis,

(∃x)(Kx & (y)(Ky → y = x) & (y)(Ky → By)),

which is equivalent to

(∃x)(Kx & (y)(Ky→ y = x) & Bx).

And that means that it does not have the form ‘Bx’. Russell believed that, in addition to his iota terms, there was another class of individual terms, which he called ‘logically proper names’. These would simply fit into the ‘x’ place in ‘Bx’. He believed that ‘this’ and ‘that’ were in this class, but gave no symbolic characterisation of them.

Hilbert and Bernays, by contrast, produced what is called a ‘pre-suppositional theory’ of definite descriptions. The first two clauses of Russell’s definition were not taken to be part of the meaning of ‘The King of France is bald’: they were merely conditions under which they took it to be permitted to introduce a complete individual term for ‘the King of France’, which then satisfies

Kx & (y)(Ky → y = x).

Hilbert and Bernays continued to use the Greek letter iota in their individual term, although it has a quite different grammar from Russell’s iota term, since, when Hilbert and Bernays’ term can be introduced, it is provably equivalent to the corresponding epsilon term (Kneebone 1963, p102). In fact it was later suggested by many that epsilon terms are not only complete symbols, but can be seen as playing the same role as the ‘logically proper names’ Russell discussed.

It is at the start of book 2 of the Grundlagen that we find the definition of epsilon terms. There, Hilbert and Bernays first construct a theory of indefinite descriptions in a similar manner to their theory of definite descriptions. They allow, now, an eta term to be introduced as long as just the first of Russell’s conditions is met. That is to say, given

(∃x)Fx,

one can introduce the term ‘ηxFx’, and say

FηxFx.

But the condition for the introduction of the eta term can be established logically, for certain predicates, since

(∃x)((∃y)Fy → Fx),

is a predicate calculus theorem (Copi 1973, p110). It is the eta term this theorem allows us to introduce which is otherwise called an epsilon term, and its logical basis enables entirely formal theories to be constructed, since such individual terms are invariably defined. Thus we may invariably introduce ‘ηx((∃y)Fy → Fx)’, and this is commonly written ‘εxFx’, about which we can therefore say

(∃y)Fy → FεxFx.

Since it is that F which exists if anything is F, Hilbert read the epsilon term in this case ‘the first F’. For instance, in arithmetic, ‘the first’ may be taken to be the least number operator. However, while if there are F’s then the first F is clearly some chosen one of them, if there are no F’s then ‘the first F’ must be a misnomer. And that form of speech only came to be fully understood in the theories of reference which appeared much later, when reference and denotation came to be more clearly separated from description and attribution. Donnellan (Donnellan 1966) used the example ‘the man with martini in his glass’, and pointed out that, in certain uses, this can refer to someone without martini in his glass. In the terminology Donnellan made popular, ‘the first F’, in the second case above works similarly: it cannot be attributive, and so, while it refers to something, it must refer arbitrarily, from a semantic point of view.

With reference in this way separated from attribution it becomes possible to symbolise the anaphoric cross-reference between, for instance, ‘There is one and only one king of France’ and ‘He is bald’. For, independently of whether the former is true, the ‘he’ in the latter is a pronoun for the epsilon term in the former — by a simple extension of the epsilon definition of the existential quantifier. Thus the pair of remarks may be symbolised

(∃x)(Kx & (y)(Ky → y = x)) & Bεx(Kx & (y)(Ky → y = x)).

Furthermore such cross-reference may occur in connection with intensional constructions of a kind Russell also considered, such as

George IV wondered whether the author of Waverley was Scott.

Thus we can say ‘There is an author of Waverley, and George IV wondered whether he was Scott’. But the epsilon analysis of these cases puts intensional epsilon calculi at odds with Russellian views of such constructions, as we shall see later. The Russellian approach, by not having complete symbols for individuals, tends to confuse cases in which assertions are made about individuals and cases in which assertions are made about identifying properties. As we shall see, epsilon terms enable us to make the discrimination between, for instance,

s = εx(y)(Ay ↔ y = x),

(i.e. ‘Scott is the author of Waverley’), and

(y)(Ay ↔ y = s),

(that is, ‘there is one and only one author of Waverley and he is Scott’), and so it enables us to locate more exactly the object of George IV’s thought.

2. Descriptions and Identity

When one starts to ask about the natural language meaning of epsilon terms, it is interesting that Leisenring just mentions the ‘formal superiority’ of the epsilon calculus (Leisenring 1969, p63, see also Routley 1969, Hazen 1987). Leisenring took the epsilon calculus to be a better logic than the predicate calculus, but merely because of the Second Epsilon Theorem. Its main virtue, to Leisenring, was that it could prove all that seemingly needed to be proved, but in a more elegant way. Epsilon terms were just neater at calculating which were the valid theorems of the predicate calculus.

Remembering Hilbert and Bernays’ discussion of definite and indefinite descriptions, clearly there is more to the epsilon calculus than this. And there are, in fact, two specific theorems provable within the epsilon calculus, though not the predicate calculus, which will start to indicate the epsilon calculus’ more general range of application. They concern individuals, since the epsilon calculus is distinctive in providing an appropriate, and systematic means of reference to them.

The need to have complete symbols for individuals became evident some years after Russell’s promotion of incomplete symbols for them. The first major book to allow for this was Rosser’s Logic for Mathematicians, in 1953, although there were precursors. For the classical difficulty with providing complete terms for individuals concerns what to do with ‘non-denoting’ terms, and Quine, for instance, following Frege, often gave them an arbitrary, though specific referent (Marciszewski 1981, p113). This idea is also present in Kalish and Montague (Kalish and Montague 1964, pp242-243), who gave the two rules:

(∃x)(y)(Fy ↔ y = x) ├ FιxFx,
¬(∃x)(y)(Fy ↔ y = x) ├ιxFx = ιx¬(x = x),

where ‘ιxFx’ is what otherwise might be written ‘εx(y)(Fy ↔ y = x)’. Kalish and Montague believed, however, that the second rule ‘has no intuitive counterpart, simply because ordinary language shuns improper definite descriptions’ (Kalish and Montague 1964, p244). And, at that time, what Donnellan was to publish in Donnellan 1966, about improper definite descriptions, was certainly not well known. In fact ordinary speech does not shun improper definite descriptions, although their referents are not as fixed as the above second rule requires. Indeed the very fact that the descriptions are improper means that their referents are not determined semantically: instead they are just a practical, pragmatic choice.

Stalnaker and Thomason recognised the need to be more liberal when they defined their referential terms, which also had to refer, in the contexts they were concerned with, in more than one possible world (Thomason and Stalnaker 1968, p363):

In contrast with the Russellian analysis, definite descriptions are treated as genuine singular terms; but in general they will not be substance terms [rigid designators]. An expression like ιxPx is assigned a referent which may vary from world to world. If in a given world there is a unique existing individual which has the property corresponding to P, this individual is the referent of ιxPx; otherwise, ιxPx refers to an arbitrarily chosen individual which does not exist in that world.

Stalnaker and Thomason appreciated that ‘A substance term is much like what Russell called a logically proper name’, but they said that an individual constant might or might not be a substance term, depending on whether it was more like ‘Socrates’ or ‘Miss America’ (Thomason and Stalnaker 1968, p362). A more complete investigation of identity and descriptions, in modal and general intensional contexts, was provided in Routley, Meyer and Goddard 1974, and Routley 1977, see also Hughes and Cresswell 1968, Ch 11. And with these writers we get the explicit rendering of definite descriptions in epsilon terms, as in Goddard and Routley 1973, p558, Routley 1980, p277, c.f. Hughes and Cresswell 1968, p203.

Certain specific theorems in the epsilon calculus, as was said before, support these kinds of identification. One theorem demonstrates directly the relation between Russell’s attributive, and some of Donnellan’s referential ideas. For

(∃x)(Fx & (y)(Fy → y = x) & Gx)

is logically equivalent to

(∃x)(Fx & (y)(Fy → y = x)) & Ga,

where a = εx(Fx & (y)(Fy → y = x)). This arises because the latter is equivalent to

Fa & (y)(Fy → y = a) & Ga,

which entails the former. But the former is

Fb & (y)(Fy → y = b) & Gb,

with b = εx(Fx & (y)(Fy → y = x) & Gx), and so entails

(∃x)(Fx & (y)(Fy → y = x)),

and

Fa & (y)(Fy → y = a).

But that means that, from the uniqueness clause,

a = b,

and so

Ga,

meaning the former entails the latter, and therefore the former is equivalent to the latter.

The former, of course, gives Russell’s Theory of Descriptions, in the case of ‘The F is G’; it explicitly asserts the first two clauses, to do with the existence and uniqueness of an F. A presuppositional theory, such as we saw in Hilbert and Bernays, would not explicitly assert these two clauses: on such an account they are a precondition before the term ‘the F’ can be introduced. But neither of these theories accommodate improper definite descriptions. Since Donnellan it is more common to allow that we can always use ‘the F’: if the description is improper then the referent of this term is simply found in the term’s practical use.

One detail of Donnellan’s historical account, however, must be treated with some care, at this point. Donnellan was himself concerned with definite descriptions which were improper in the sense that they did not uniquely describe what the speaker took to be their referent. So the description might still be ‘proper’ in the above sense — if there still was something to which it uniquely applied, on account of its semantic content. Thus Donnellan allowed ‘the man with martini in his glass’ to identify someone without martini in his glass irrespective of whether there was some sole man with martini in his glass. But if one talks about ‘the man with martini in his glass’ one can be correctly taken to be talking about who this describes, if it does in fact correctly describe someone — as Devitt and Bertolet pointed out in criticism of Donnellan (Devitt 1974, Bertolet 1980). It is this aspect of our language which the epsilon account matches, for an epsilon account allows definite descriptions to refer without attribution of their semantic character, but only if nothing uniquely has that semantic character. Thus it is not the whole of the first statement above , but only the third part of the second statement which makes the remark ‘The F is G’.

The difficulty with Russell’s account becomes more plain if we read the two equivalent statements using relative and personal pronouns. They then become

There is one and only one F, which is G,
There is one and only one F; it is G.

But using just the logic derived from Frege, Russell could formalise the ‘which’, but could not separate out the last clause, ‘it is G’. In that clause ‘it’ is an anaphor for ‘the (one and only) F’, and it still has this linguistic meaning if there is no such thing, since that is just a matter of grammar. But the uniqueness clause is needed for the two statements to be equivalent — without uniqueness there is no equivalence, as we shall see – so ‘which’ is not itself equivalent to ‘it’. Russell, however, because he could not separate out the ‘it’, had to take the whole of the first expression as the analysis of ‘The F is G’ — he could not formulate the needed ‘logically proper name’.

But how can something be the one and only F ‘if there is no such thing’? That is where another important theorem provable in the epsilon calculus is illuminating, namely:

(Fa & (y)(Fy → y = a)) → a = εx(Fx & (y)(Fy → y = x)).

The important thing is that there is a difference between the left hand side and the right hand side, i.e. between something being alone F, and that thing being the one and only F. For the left-right implication cannot be reversed. We get from the left to the right when we see that the left as a whole entails

(∃x)(Fx & (y)(Fy → y = x)),

and so also its epsilon equivalent

Fεx(Fx & (y)(Fy → y = x)) & (z)(Fz → z = εx(Fx & (y)(Fy → y = x))).

Given Fa, then from the second clause here we get the right hand side of our original implication. But if we substitute ‘εx(Fx & (y)(Fy → y = x))’ for ‘a’ in that implication then on the right we have something which is necessarily true. But the left hand side is then the same as

(∃x)(Fx & (y)(Fy → y = x)),

and that is in general contingent. Hence the implication cannot generally be reversed. Having the property of being alone F is here contingent, but possessing the identity of the one and only F is necessary.

The distinction is not made in Russell’s logic, since possession of the relevant property is the only thing which can be formally expressed there. In Russell’s theory of descriptions, a’s possession of the property of being alone a king of France is expressed as a quasi identity

a = ιxKx,

and that has the consequence that such identities are contingent. Indeed, in counterpart theories of objects in other possible worlds the idea is pervasive that an entity may be defined in terms of its contingent properties in a given world. Hughes and Cresswell, however, differentiated between contingent identities and necessary identities in the following way (Hughes and Cresswell 1968, p191):

Now it is contingent that the man who is in fact the man who lives next door is the man who lives next door, for he might have lived somewhere else; that is living next door is a property which belongs contingently, not necessarily, to the man to whom it does belong. And similarly, it is contingent that the man who is in fact the mayor is the mayor; for someone else might have been elected instead. But if we understand [The man who lives next door is the mayor] to mean that the object which (as a matter of contingent fact) possesses the property of being the man who lives next door is identical with the object which (as a matter of contingent fact) possesses the property of being the mayor, then we are understanding it to assert that a certain object (variously described) is identical with itself, and this we need have no qualms about regarding as a necessary truth. This would give us a way of construing identity statements which makes [(x = y) → L(x = y)] perfectly acceptable: for whenever x = y is true we can take it as expressing the necessary truth that a certain object is identical with itself.

There are more consequences of this matter, however, than Hughes and Cresswell drew out. For now that we have proper referring terms for individuals to go into such expressions as ‘x = y’, we first see better where the contingency of the properties of such individuals comes from — simply the linguistic facility of using improper definite descriptions. But we also see, because identities between such terms are necessary, that proper referring terms must be rigid, i.e. have the same reference in all possible worlds.

This is not how Stalnaker and Thomason saw the matter. Stalnaker and Thomason, it will be remembered, said that there were two kinds of individual constants: ones like ‘Socrates’ which can take the place of individual variables, and others like ‘Miss America’ which cannot. The latter, as a result, they took to be non-rigid. But it is strictly ‘Miss America in year t’ which is meant in the second case, and that is not a constant expression, even though such functions can take the place of individual variables. It was Routley, Meyer and Goddard who most seriously considered the resultant possibility that all properly individual terms are rigid. At least, they worked out many of the implications of this position, even though Routley was not entirely content with it.

Routley described several rigid intensional semantics (Routley 1977, pp185-186). One of these, for instance, just took the first epsilon axiom to hold in any interpretation, and made the value of an epsilon term itself. On such a basis Routley, Meyer and Goddard derived what may be called ‘Routley’s Formula’, i.e.

L(∃x)Fx → (∃x)LFx.

In fact, on their understanding, this formula holds for any operator and any predicate, but they had in mind principally the case of necessity illustrated here, with ‘Fx’ taken as ‘x numbers the planets’, making ‘εxFx’ ‘the number of the planets’. The formula is derived quite simply, in the following way: from

L(∃x)Fx,

we can get

LFεxFx,

by the epsilon definition of the existential quantifier, and so

(∃x)LFx,

by existential generalisation over the rigid term (Routley, Meyer and Goddard 1974, p308, see also Hughes and Cresswell 1968, pp197, 204). Routley, however, was still inclined to think that a rigid semantics was philosophically objectionable (Routley 1977, p186):

Rigid semantics tend to clutter up the semantics for enriched systems with ad hoc modelling conditions. More important, rigid semantics, whether substitutional or objectual, are philosophically objectionable. For one thing, they make Vulcan and Hephaestus everywhere indistinguishable though there are intensional claims that hold of one but not of the other. The standard escape from this sort of problem, that of taking proper names like ‘Vulcan’ as disguised descriptions we have already found wanting… Flexible semantics, which satisfactorily avoid these objections, impose a more objectual interpretation, since, even if [the domain] is construed as the domain of terms, [the value of a term in a world] has to be permitted, in some cases at least, to vary from world to world.

As a result, while Routley, Meyer and Goddard were still prepared to defend the formula, and say, for instance, that there was a number which necessarily numbers the planets, namely the number of the planets (np), they thought that this was only in fact the same as 9, so that one still could not argue correctly that as L(np numbers the planets), so L(9 numbers the planets). ‘For extensional identity does not warrant intersubstitutivity in intensional frames’ (Routley, Meyer and Goddard 1974, p309). They held, in other words that the number of the planets was only contingently 9.

This means that they denied ‘(x = y) → L(x = y)’, but, as we shall see in more detail later, there are ways to hold onto this principle, i.e. maintain the invariable necessity of identity.

3. Rigid Epsilon Terms

There is some further work which has helped us to understand how reference in modal and general intensional contexts must be rigid. But it involves some different ideas in semantics, and starts, even, outside our main area of interest, namely predicate logic, in the semantics of propositional logic.

When one thinks of ‘semantics’ one maybe thinks of the valuation of formulas. Since the 1920s a meta-study of this kind was certainly added to the previous logical interest in proof theory. Traditional proof theory is commonly associated with axiomatic procedures, but, from a modern perspective, its distinction is that it is to do with ‘object languages’. Tarski’s theory of truth relies crucially on the distinction between object languages and meta-languages, and so semantics generally seems to be necessarily a meta-discipline. In fact Tarski believed that such an elevation of our interest was forced upon us by the threat of semantic paradoxes like The Liar. If there was, by contrast, ‘semantic closure’, i.e. if truth and other semantic notions were definable at the object level, then there would be contradictions galore (c.f. Priest 1984). In this way truth may seem to be necessarily a predicate of (object-level) sentences.

But there is another way of looking at the matter which is explicitly non-Tarskian, and which others have followed (see Prior 1971, Ch 7, Sayward 1987). This involves seeing ‘it is true that’ as not a predicate, but an object-level operator, with the truth tabulations in Truth Tables, for instance, being just another form of proof procedure. Operators indeed include ‘it is provable that’, and this is distinct from Gödel’s provability predicate, as Gödel himself pointed out (Gödel 1969). Operators are intensional expressions, as in the often discussed ‘it is necessary that’ and ‘it is believed that’, and trying to see such forms of indirect discourse as metalinguistic predicates was very common in the middle of the last century. It was pervasive, for instance, in Quine’s many discussions of modality and intensionality. Wouldn’t someone be believing that the Morning Star is in the sky, but the Evening Star is not, if, respectively, they assented to the sentence ‘the Morning Star is in the sky’, and dissented from ‘the Evening Star is in the sky’? Anyone saying ‘yes’ is still following the Quinean tradition, but after Montague’s and Thomason’s work on operators (e.g. Montague 1963, Thomason 1977, 1980) many logicians are more persuaded that indirect discourse is not quotational. It is open to doubt, that is to say, whether we should see the mind in terms of the direct words which the subject would use.

The alternative involves seeing the words ‘the Morning Star is in the sky’ in such an indirect speech locution as ‘Quine believes that the Morning Star is in the sky’ as words merely used by the reporter, which need not directly reflect what the subject actually says. That is indeed central to reported speech — putting something into the reporter’s own words rather than just parroting them from another source. Thus a reporter may say

Celia believed that the man in the room was a woman,

but clearly that does not mean that Celia would use ‘the man in the room’ for who she was thinking about. So referential terms in the subordinate proposition are only certainly in the mouth of the reporter, and as a result only certainly refer to what the reporter means by them. It is a short step from this thought to seeing

There was a man in the room, but Celia believed that he was a woman,

as involving a transparent intensional locution, with the same object, as one might say, ‘inside’ the belief as ‘outside’ in the room. So it is here where rigid constant epsilon terms are needed, to symbolise the cross-sentential anaphor ‘he’, as in:

(∃x)(Mx & Rx) & BcWεx(Mx & Rx).

To understand the matter fully, however, we must make the shift from meta- to object language we saw at the propositional level above with truth. Routley, Meyer and Goddard realised that a rigid semantics required treating such expressions as ‘BcWx’ as simple predicates, and we must now see what this implies. They derived, as we saw before, ‘Routley’s Formula’

L(∃x)Fx → (∃x)LFx,

but we can now start to spell out how this is to be understood, if we hold to the necessity of identities, i.e. if we use ‘=’ so that

x = y → L(x = y).

Again a clear illustration of the validity of Routley’s Formula is provided by the number of the planets, but now we may respect the fact that some things may lack a number, and also the fact that referential, and attributive senses of terms may be distinguished. Thus if we write ‘(nx)Px’ for ‘there are n P’s’, then εn(ny)Py will be the number of P’s, and it is what numbers them (i.e. ([εn(ny)Py]x)Px) if they have a number (i.e. if (∃n)(nx)Px) — by the epsilon definition of the existential quantifier. Then, with ‘Fx’ as the proper (necessary) identity ‘x = εn(ny)Py’ Routley’s Formula holds because the number in question exists eternally, making both sides of the formula true. But if ‘Fn’ is simply the attributive ‘(ny)Py’ then this is not necessary, since it is contingent even, in the first place, that there is a number of P’s, instead of just some P, making both sides of the formula false.

Hughes and Cresswell argue against the principle saying (Hughes and Cresswell 1968, p144):

…let [Fx] be ‘x is the number of the planets’. Then the antecedent is true, for there must be some number which is the number of the planets (even if there were no planets at all there would still be such a number, namely 0): but the consequent is false, for since it is a contingent matter how many planets there are, there is no number which must be the number of the planets.

But this forgets continuous quantities, where there are no discrete items before the nomination of a unit. The number associated with some planetary material, for instance, numbers only arbitrary units of that material, and not the material itself. So the antecedent of Routley’s Formula is not necessarily true.

Quine also used the number of the planets in his central argument against quantification into modal contexts. He said (Quine 1960, pp195-197):

If for the sake of argument we accept the term ‘analytic’ as predicable of sentences (hence as attachable predicatively to quotations or other singular terms designating sentences), then ‘necessarily’ amounts to ‘is analytic’ plus an antecedent pair of quotation marks. For example, the sentence:

(1) Necessarily 9 > 4

is explained thus:

(2) ‘9 > 4’ is analytic…

So suppose (1) explained as in (2). Why, one may ask, should we preserve the operatorial form as of (1), and therewith modal logic, instead of just leaving matters as in (2)? An apparent advantage is the possibility of quantifying into modal positions; for we know we cannot quantify into quotation, and (2) uses quotation…

But is it more legitimate to quantify into modal positions than into quotation? For consider (1) even without regard to (2); surely, on any plausible interpretation, (1) is true and this is false:

(3) Necessarily the number of major planets > 4.

Since 9 = the number of major planets, we can conclude that the position of ‘9’ in (1) is not purely referential and hence that the necessity operator is opaque.

But here Quine does not separate out the referential ‘the number of the major planets is greater than 4’, i.e. ‘εn(ny)Py > 4’, from the attributive ‘There are more than 4 major planets’, i.e. ‘(∃n)((ny)Py & n > 4)’. If 9 = εn(ny)Py, then it follows that εn(ny)Py > 4, but it does not follow that (∃n)((ny)Py & n > 4). Substitution of identicals in (1), therefore, does yield (3), even though it is not necessary that there are more than 4 major planets.

We can now go into some details of how one gets the ‘x’ in such a form as ‘LFx’ to be open for quantification. For, what one finds in traditional modal semantics (see Hughes and Cresswell 1968, passim) are formulas in the meta-linguistic style, like

V(Fx, i) = 1,

which say that the valuation put on ‘Fx’ is 1, in world i. There should be quotation marks around the ‘Fx’ in such a formula, to make it meta-linguistic, but by convention they are generally omitted. To effect the change to the non-meta-linguistic point of view, we must simply read this formula as it literally is, so that the ‘Fx’ is in indirect speech rather than direct speech, and the whole becomes the operator form ‘it would be true in world i that Fx’. In this way, the term ‘x’ gets into the language of the reporter, and the meta/object distinction is not relevant. Any variable inside the subordinate proposition can now be quantified over, just like a variable outside it, which means there is ‘quantifying in’, and indeed all the normal predicate logic operations apply, since all individual terms are rigid.

A example illustrating this rigidity involves the actual top card in a pack, and the cards which might have been top card in other circumstances (see Slater 1988a). If the actual top card is the Ace of Spades, and it is supposed that the top card is the Queen of Hearts, then clearly what would have to be true for those circumstances to obtain would be for the Ace of Spades to be the Queen of Hearts. The Ace of Spades is not in fact the Queen of Hearts, but that does not mean they cannot be identical in other worlds (c.f. Hughes and Cresswell, 1968, p190). Certainly if there were several cards people variously thought were on top, those cards in the various supposed circumstances would not provide a constant c such that Fc is true in all worlds. But that is because those cards are functions of the imagined worlds — the card a believes is top (εxBaFx) need not be the card b believes is top (εxBbFx), etc. It still remains that there is a constant, c, such that Fc is true in all worlds. Moreover, that c is not an ‘intensional object’, for the given Ace of Spades is a plain and solid extensional object, the actual top card (εxFx).

Routley, Meyer and Goddard did not accept the latter point, wanting a rigid semantics in terms of ‘intensional objects’ (Goddard and Routley, 1973, p561, Routley, Meyer and Goddard, 1974, p309, see also Hughes and Cresswell 1968, p197). Stalnaker and Thomason accepted that certain referential terms could be functional, when discriminating ‘Socrates’ from ‘Miss America’ — although the functionality of ‘Miss America in year t’ is significantly different from that of ‘the top card in y’s belief’. For if this year’s Miss America is last year’s Miss America, still it is only one thing which is identical with itself, unlike with the two cards. Also, there is nothing which can force this year’s Miss America to be last year’s different Miss America, in the way that the counterfactuality of the situation with the playing cards forces two non-identical things in the actual world to be the same thing in the other possible world. Other possible worlds are thus significantly different from other times, and so, arguably, other possible worlds should not be seen from the Realist perspective appropriate for other times — or other spaces.

4. The Epsilon Calculus’ Problematic

It might be said that Realism has delayed a proper logical understanding of many of these things. If you look ‘realistically’ at picturesque remarks like that made before, namely ‘the same object is ‘inside’ the belief as ‘outside’ in the room’, then it is easy for inappropriate views about the mind to start to interfere, and make it seem that the same object cannot be in these two places at once. But if the mind were something like another space or time, then counterfactuality could get no proper purchase — no one could be ‘wrong’, since they would only be talking about elements in their ‘world’, not any objective, common world. But really, all that is going on when one says, for instance,

There was a man in the room, but Celia believed he was a woman,

is that the same term — or one term and a pronominal surrogate for it — appears at two linguistic places in some discourse, with the same reference. Hence there is no grammatical difference between the cross reference in such an intensional case and the cross reference in a non-intensional case, such as

There was a man in the room. He was hungry.

i.e.

(∃x)Mx & HεxMx.

What has been difficult has merely been getting a symbolisation of the cross-reference in this more elementary kind of case. But it just involves extending the epsilon definition of existential statements, using a reiteration of the substituted epsilon term, as we can see.

It is now widely recognised how the epsilon calculus allows us to do this (Purdy 1994, Egli and von Heusinger 1995, Meyer Viol 1995, Ch 6), the theoretical starting point being the theorem about the Russellian theory of definite descriptions proved before, which breaks up what otherwise would be a single sentence into a sequential piece of discourse, enabling the existence and uniqueness clauses to be put in one sentence while the characterising remark is in another. The relationship starts to matter when, in fact, there is no obvious way to formulate a combination of anaphoric remarks in the predicate calculus, as in, for instance,

There is a king of France. He is bald,

where there is no uniqueness clause. This difficulty became a major problem when logicians started to consider anaphoric reference in the 1960s.

Geach, for instance, in Geach 1962, even believed there could not be a syllogism of the following kind (Geach 1962, p126):

A man has just drunk a pint of sulphuric acid.
Nobody who drinks a pint of sulphuric acid lives through the day.
So, he won’t live through the day.

He said, one could only draw the conclusion:

Some man who has just drunk a pint of sulphuric acid won’t live through the day.

Certainly one can only derive

(∃x)(Mx & Dx & ¬Lx)

from

(∃x)(Mx & Dx),

and

(x)(Dx → ¬Lx),

within predicate logic. But one can still derive

¬Lεx(Mx & Dx),

within the epsilon calculus.

Geach likewise was foxed later when he produced his famous case (numbered 3 in Geach 1967):

Hob thinks a witch has blighted Bob’s mare, and Nob wonders whether she (the same witch) killed Cob’s sow,

which is, in epsilon terms

Th(∃x)(Wx & Bxb) & OnKεx(Wx & Bxb)c.

For Geach saw that this could not be (4)

(∃x)(Wx & ThBxb & OnKxc),

or (5)

(∃x)(Th(Wx & Bxb)& OnKxc).

But also a reading of the second clause as (c.f. 18)

Nob wonders whether the witch who blighted Bob’s mare killed Cob’s sow,

in which ‘the witch who blighted Bob’s mare killed Cob’s sow’ is analysed in the Russellian manner, i.e. as (20)

just one witch blighted Bob’s mare and she killed Cob’s sow,

Geach realised does not catch the specific cross-reference — amongst other things because of the uniqueness condition which is then introduced.

This difficulty with the uniqueness clause in Russellian analyses has been widely commented on, although a recent theorist, Neale, has said that Russell’s theory only needs to be modestly modified: Neale’s main idea is that, in general, definite descriptions should just be localised to the context. His resolution of Geach’s troubling cases thus involves suggesting that ‘she’, in the above, might simply be ‘the witch we have been hearing about’ (Neale 1990, p221). Neale might here have said ‘that witch who blighted Bob’s mare’, showing that an Hilbertian account of demonstrative descriptions would have a parallel effect.

A good deal of the ground breaking work on these matters, however, was done by someone again much influenced by Russell: Evans. But Evans significantly broke with Russell over uniqueness (Evans 1977, pp516-517):

One does not want to be committed, by this way of telling the story, to the existence of a day on which just one man and boy walked along a road. It was with this possibility in mind that I stated the requirement for the appropriate use of an E-type pronoun in terms of having answered, or being prepared to answer upon demand, the question ‘He? Who?’ or ‘It? Which?’ In order to effect this liberalisation we should allow the reference of the E-type pronoun to be fixed not only by predicative material explicitly in the antecedent clause, but also by material which the speaker supplies upon demand. This ruling has the effect of making the truth conditions of such remarks somewhat indeterminate; a determinate proposition will have been put forward only when the demand has been made and the material supplied.

It was Evans who gave us the title ‘E-type pronoun’ for the ‘he’ in such expressions as

A Cambridge philosopher smoked a pipe, and he drank a lot of whisky,

i.e., in epsilon terms,

(∃x)(Cx & Px) & Dεx(Cx & Px).

He also insisted (Evans 1977, p516) that what was unique about such pronouns was that this conjunction of statements was not equivalent to

A Cambridge philosopher, who smoked a pipe, drank a lot of whisky,

i.e.

(∃x)(Cx & Px & Dx).

Clearly the epsilon account is entirely in line with this, since it illustrates the point made before about cases without a uniqueness clause. Only the second expression, which contains a relative pronoun, is formalisable in the predicate calculus. To formalise the first expression, which contains a personal pronoun, one at least needs something with the expressive capabilities of the epsilon calculus.

5. The Formal Semantics of Epsilon Terms

The semantics of epsilon terms is nowadays more general, but the first interpretations of epsilon terms were restricted to arithmetical cases, and specifically took epsilon to be the least number operator. Hilbert and Bernays developed Arithmetic using the epsilon calculus, using the further epsilon axiom schema (Hilbert and Bernays 1970, Book 2, p85f, c.f. Leisenring 1969, p92) :

(εxAx = st) → ¬At,

where ‘s’ is intended to be the successor function, and ‘t’ is any numeral. This constrains the interpretation of the epsilon symbol, but the least number interpretation is not strictly forced, since the axiom only ensures that no number having the property A immediately precedes εxAx.

The new axiom, however, is sufficient to prove mathematical induction, in the form:

(A0 & (x)(Ax → Asx)) → (x)Ax.

For assume the reverse, namely

A0 & (x)(Ax → Asx) & ¬(x)Ax,

and consider what happens when the term ‘εx¬Ax’ is substituted in

t = 0 ∨ t = sn,

which is derivable from the other axioms of number theory which Hilbert and Bernays are using. If we had

εx¬Ax = 0,

then, since it is given that A0, then we would have Aεx¬Ax. But since, by the definition of the universal quantifier,

Aεx¬Ax ↔ (x)Ax,

we know, because ¬(x)Ax is also given, that ¬Aεx¬Ax, which means we cannot have εx¬Ax = 0. Hence we must have the other alternative, i.e.

εx¬Ax = sn,

for some n. But from the new axiom

(εx¬Ax = sn) → An,

hence we must have An, although we must also have

An → Asn,

because (x)(Ax → Asx). All together that requires Aεx¬Ax again, which is impossible. Hence the further epsilon axiom is sufficient to establish the given principle of induction.

The more general link between epsilon terms and choice functions was first set out by Asser, although Asser’s semantics for an elementary epsilon calculus without the second epsilon axiom makes epsilon terms denote rather complex choice functions. Wilfrid Meyer Viol, calling an epsilon calculus without the second axiom an ‘intensional’ epsilon calculus, makes the epsilon terms in such a calculus instead name Skolem functions. Skolem functions are also called Herbrand functions, although they arise in a different way, namely in Skolem’s Theorem. Skolem’s Theorem states that, if a formula in prenex normal form is provable in the predicate calculus, then a certain corresponding formula, with the existential quantifiers removed, is provable in a predicate calculus enriched with function symbols. The functions symbolised are called Skolem functions, although, in another context, they would be Herbrand functions.

Skolem’s Theorem is a meta-logical theorem, about the relation between two logical calculi, but a non-metalogical version is in fact provable in the epsilon calculus from which Skolem’s actual theorem follows, since, for example, we can get, by the epsilon definition, now of the existential quantifier

(x)(∃y)Fxy ↔ (x)FxεyFxy.

As a result, if the left hand side of such an equivalence is provable in an epsilon calculus the right hand side is provable there. But the left hand side is provable in an epsilon calculus if it is provable in the predicate calculus, by the Second Epsilon Theorem; and if the right hand side is provable in an epsilon calculus it is provable in a predicate calculus enriched with certain function symbols — epsilon terms, like ‘εyFxy’. So, by generalisation, we get Skolem’s original result.

When we add to an intensional epsilon calculus the second epsilon axiom

(x)(Fx ↔ Gx) →εxFx = εxGx,

the interpretation of epsilon terms is commonly extensional, i.e. in terms of sets, since two predicates ‘F’ and ‘G’ satisfying the antecedent of this second axiom will determine the same set — if they determine sets at all, that is. For that requires the predicates to be collectivisantes, in Bourbaki’s terms, as with explicit set membership statements, like ‘x ∈ y’. In such a case the epsilon term ‘εx(x ∈ y)’ designates a choice function, i.e. a function which selects one from a given set (c.f. Leisenring 1969, p19, Meyer Viol 1995, p42). In the case where there are no members of the set the selection is arbitrary, although for all empty sets it is invariably the same. Thus the second axiom validates, for example, Kalish and Montague’s rule for this case, which they put in the form

εxFx = εx¬(x = x).

Kalish and Montague in fact prove a version of the second epsilon axiom in their system (Kalish and Montague 1964, see T407, p256). The second axiom also holds in Hermes’ system (Hermes 1965), although there one in addition finds a third epsilon axiom,

εx¬(x = x) = εx(x = x),

for which there would seem to be no real justification.

But the second epsilon axiom itself is curious. One questionable thing about it is that both Leisenring and Meyer Viol do not state that the predicates in question must determine sets before their choice function semantics can apply. That the predicates are collectivisantes is merely presumed in their theories, since ‘εxBx’ is invariably modelled by means of a choice from the presumed set of things which in the model are B. Certainly there is a special clause dealing with the empty set; but there is no consideration of the case where some things are B although those things are not discrete, as with the things which are red, for instance. If the predicate in question is not a count noun then there is no set of things involved, since with mass terms, and continuous quantities there are no given elements to be counted (c.f. Bunt 1985, pp262-263 in particular). Of course numbers can still be associated with them, but only given an arbitrary unit. With the cows in a field, for instance, we can associate a determinate number, but with the beef there we cannot, unless we consider, say, the number of pounds of it.

The point, as we saw before, has a formalisation in epsilon terms. Thus if we write ‘(nx)Fx’, for ‘there are n F’s’, then εn(ny)Fy will be the number of F’s, and it is what numbers them if they have a number. But in the reverse case the previously mentioned arbitrariness of the epsilon term comes in. For if ¬(∃n)(nx)Fx, then ¬([εn(ny)Fy]x)Fx, and so, although an arbitrary number exists, it does not number the F’s. In that case, in other words, we do not have a number of F’s, merely some F.

In fact, even when there is a set of things, the second epsilon axiom, as stated above, does not apply in general, since there are intensional differences between properties to consider, as in, for instance ‘There is a red-haired man, and a Caucasian in the room, and they are different’. Here, if there were only red-haired Caucasians in the room, then with the above second axiom, we could not find epsilon substitutions to differentiate the two individuals involved. This may remind us that it is necessary co-extensionality, and not just contingent co-extensionality which is the normal criterion for the identity of properties (c.f. Hughes and Cresswell 1968, pp209-210). So it leads us to see the appropriateness of a modalised second axiom, which uses just an intensional version of the antecedent of the previous second epsilon axiom, in which ‘L’ means ‘it is necessary that’, namely:

L(x)(Fx ↔ Gx) →εxFx = εxGx.

For with this axiom only the co-extensionalities which are necessary will produce identities between the associated epsilon terms. We can only get, for instance,

εxPx = εx(Px ∨ Px),

and

εxFx = εyFy,

and all other identities derivable in a similar way.

However, the original second epsilon axiom is then provable, in the special case where the predicates express set membership. For if necessarily

(x)(x ∈ y ↔ x ∈ z) ↔ y = z,

while necessarily

y = z ↔ L(y = z),

(see Hughes and Cresswell, 1968, p190) then

L(x)(x ∈ y ↔ x ∈ z) ↔ (x)(x ∈ y ↔ x ∈ z),

and so, from the modalised second axiom we can get

(x)(x ∈ y ↔ x ∈ z) →εx(x ∈ y) = εx(x ∈ z).

Note, however, that if one only has contingently

(x)(Fx ↔ x ∈ z),

then one cannot get, on this basis,

εxFx = εx(x ∈ z).

But this is something which is desirable, as well. For we have seen that it is contingent that the number of the planets does number the planets — because it is not necessary that ([εn(ny)Py]x)Px. This makes ‘(9x)Px’ contingent, even though the identity ‘9 = εn(nx)Px’ remains necessary. But also it is contingent that there is the set of planets, p, which there is, since while, say,

(x)(x ∈ p ↔ Px),

where

εn(nx)(x ∈ p) = εn(nx)Px = 9,

it is still possible that, in some other possible world,

(x)(x ∈ p’ ↔ Px),

with p’ the set of planets there, and

¬(εn(nx)(x ∈ p’) = 9).

We could not have this further contingency, however, if the original second epsilon axiom held universally.

It is on this fuller basis that we can continue to hold ‘x = y → L(x = y)’, i.e. the invariable necessity of identity — one merely distinguishes ‘(9x)Px’ from ‘9 = εx(nx)Px’, and from ‘9 = εx(nx)(x ∈ p)’, as above.

Adding the original second epsilon axiom to an intensional epsilon calculus is therefore acceptable only if all the predicates are about set membership. This is not an uncommon assumption, indeed it is pervasive in the usually given semantics for predicate logic, for instance. But if, by contrast, we want to allow for the fact that not all predicates are collectivisantes then we should take just the first epsilon axiom with merely a modalised version of the second epsilon axiom. The interpretation of epsilon terms is then always in terms of Skolem functions, although if we are dealing with the membership of sets, those Skolem functions naturally are choice functions.

6. Some Metatheory

To finish we shall briefly look, as promised, at some meta-theory.

The epsilon calculi that were first described were not very convenient to use, and Hilbert and Bernays’ proofs of the First and Second Epsilon Theorems were very complex. This was because the presentation was axiomatic, however, and with the development of other means of presenting the same logics we get more readily available meta-logical results. I will indicate some of the early difficulties before showing how these theorems can be proved, nowadays, much more simply.

The problem with proving the Second Epsilon Theorem, on an axiomatic basis, is that complex, and non-constant epsilon terms may enter a proof in the epsilon calculus by means of substitutions into the axioms. What has to be proved is that an epsilon calculus proof of an epsilon-free theorem (i.e. one which can be expressed just in predicate calculus language) can be replaced by a predicate calculus proof. So some analysis of complex epsilon terms is required, to show that they can be eliminated in the relevant cases, leaving only constant epsilon terms, which are sufficiently similar to the individual symbols in standard predicate logic. Hilbert and Bernays (Hilbert and Bernays 1970, Book 2, p23f) say that one epsilon term ‘εxFx’ is subordinate to another ‘εyGy’ if and only if ‘G’ contains ‘εxFx’, and a free occurrence of the variable ‘y’ lies within ‘εxFx’. For instance ‘εxRxy’ is a complex, and non-constant epsilon term, which is subordinate to ‘εySyεxRyx’. Hilbert and Bernays then define the rank of an epsilon term to be 1 if there are no epsilon terms subordinate to it, and otherwise to be one greater than the maximal rank of the epsilon terms which are subordinate to it. Using the same general ideas, Leisenring proves two theorems (Leisenring 1969, p72f). First he proves a rank reduction theorem, which shows that epsilon proofs of epsilon-free formulas in which the second epsilon axiom is not used, but in which every term is of rank less than or equal to r, may be replaced by epsilon proofs in which every term is of rank less than or equal to r – 1. Then he proves the eliminability of the second epsilon axiom in proofs of epsilon-free formulas. Together, these two theorems show that if there is an epsilon proof of an epsilon-free formula, then there is such a proof not using the second epsilon axiom, and in which all epsilon terms have rank just 1. Even though such epsilon terms might still contain free variables, if one replaces those that do with a fixed symbol ‘a’ (starting with those of maximal length) that reduces the proof to one in what is called the ‘epsilon star’ system, in which there are only constant epsilon terms (Leisenring 1969, p66f). Leisenring shows that proofs in the epsilon star system can be turned into proofs in the predicate calculus, by replacing the epsilon terms by individual symbols.

But, as was said before, there is now available a much shorter proof of the Second Epsilon Theorem. In fact there are several, but I shall just indicate one, which arises simply by modifying the predicate calculus truth trees, as found in, for instance, Jeffrey (see Jeffrey 1967). Jeffrey uses the standard propositional truth tree rules, together with the rules of quantifier interchange, which remain unaffected, and which are not material to the present purpose. He also has, however, a rule of existential quantifier elimination,

(∃x)Fx ├ Fa,

in which ‘a’ must be new, and a rule of universal quantifier elimination

(x)Fx ├ Fb,

in which ‘b’ must be old — unless no other individual terms are available. By reducing closed formulas of the form ‘P & ¬C’ to absurdity Jeffrey can then prove ‘P → C’, and validate ‘P ├ C’ in his calculus. But clearly, upon adding epsilon terms to the language, the first of these rules must be changed to

(∃x)Fx ├ FεxFx,

while also the second rule can be replaced by the pair

(x)Fx ├ Fεx¬Fx,
Fεx¬Fx ├ Fa,

(where ‘a’ is old) to produce an appropriate proof procedure. Steen reads ‘εx¬Fx’ as ‘the most un-F-like thing’ (Steen 1972, p162), which explains why Fεx¬Fx entails Fa, since if the most un-F-like thing is in fact F, then the most plausible counter-example to the generalisation is in fact not so, making the generalisation exceptionless. But there is a more important reason why the rule of universal quantifier elimination is best broken up into two parts.

For Jeffrey’s rules only allow him ‘limited upward correctness’ (Jeffrey 1967, p167), since Jeffrey has to say, with respect to his universal quantifier elimination rule, that the range of the quantification there be limited merely to the universe of discourse of the path below. This is because, if an initial sentence is false in a valuation so also must be one of its conclusions. But the first epsilon rule which replaces Jeffrey’s rule ensures, instead, that there is ‘total upwards correctness’. For if it is false that everything is F then, without any special interpretation of the quantifier, one of the given consequences of the universal statement is false, namely the immediate one — since Fεx¬Fx is in fact equivalent to (x)Fx. A similar improvement also arises with the existential quantifier elimination rule. For Jeffrey can only get ‘limited downwards correctness’, with his existential quantifier elimination rule (Jeffrey 1967, p165), since it is not an entailment. In fact, in order to show that if an initial sentence is true in a valuation so is one of its conclusions, in this case, Jeffrey has to stretch his notion of ‘truth’ to being true either in the given valuation, or some nominal variant of it.

The epsilon rule which replaces Jeffrey’s overcomes this difficulty by not employing names, only demonstrative descriptions, and by being, as a result, totally downward correct. For if there is an F then that F is F, whatever name is used to refer to it. The epsilon calculus terminology thus precedes any naming: it gets hold of the more primitive, demonstrative way we have of referring to objects, using phrases like ‘that F’. Thus in explication of the predicate calculus rule we might well have said

suppose there is an F, well, call that F ‘a’, then Fa,

but that requires we understand ‘that F’ before we come to use ‘a’.

So how does the Second Epsilon Theorem follow? This theorem, as before, states that an epsilon calculus proof of an epsilon-free theorem may be replaced by a predicate calculus proof of the same formula. But the transformation required in the present setting is now evident: simply change to new names all epsilon terms introduced in the epsilon calculus quantifier elimination rules. This covers both the new names in Jeffrey’s first rule, but also the odd case where there are no old names in Jeffrey’s second rule. The epsilon calculus proofs invariably use constant epsilon terms, and are thus effectively in Leisenring’s epsilon star system.

Epsilon terms which are non-constant, however, crucially enter the proof of the First Epsilon Theorem. The First Epsilon Theorem states that if C is a provable predicate calculus formula, in prenex normal form, i.e. with all quantifiers at the front, then a finite disjunction of instances of C’s matrix is provable in the epsilon calculus. The crucial fact is that the epsilon calculus gives us access to Herbrand functions, which arise when universal quantifiers are eliminated from formulas using their epsilon definition. Thus

(∃y)(x)¬Fyx,

for instance, is equivalent to

(∃y)¬Fyεx¬¬Fyx,

and so

(∃y)¬FyεxFyx,

and the resulting epsilon term ‘εxFyx’ is a Herbrand function.

Using such reductions, all universal quantifiers can evidently be removed from formulas in prenex normal form, and the additional fact that, in a certain specific way, the remaining existential quantifiers are disjunctions makes all predicate calculus formulas equivalent to disjunctions. Remember that a formula is provable if its negation is reducible to absurdity, which means that its truth tree must close. But, by König’s Lemma, if there is no open path through a truth tree then there is some finite stage at which there is no open path, so, in the case above, for instance, if no valuation makes the last formula’s negation true, then the tree of the instances of that negative statement must close in a finite length. But the negative statement is the universal formula

(y)FyεxFyx,

by the rules of quantifier interchange, so a finite conjunction of instances of the matrix of this universal formula, namely Fyx, must reduce to absurdity. For the rules of universal quantifier elimination only produce consequences with the form of this matrix. By de Morgan’s Laws, that makes necessary a finite disjunction of instances of ¬Fyx. By generalisation we thus get the First Epsilon Theorem.

The epsilon calculus, however, can take us further than the First Epsilon Theorem. Indeed, one has to take care with the impression this theorem may give that existential statements are just equivalent to disjunctions. If that were the case, then existential statements would be unlike individual statements, saying not that one specified thing has a certain property, but merely that one of a certain group of things has a certain property. The group in question is normally called the ‘domain’ of the quantification, and this, it seems, has to be specified when setting out the semantics of quantifiers. But study of the epsilon calculus shows that there is no need for such ‘domains’, or indeed for such semantics. This is because the example above, for instance, is also equivalent to

¬FaεzFaz,

where a = εy¬FεxFyx. So the previous disjunction of instances of ¬Fyx is in fact only true because this specific disjunct is true. The First Epsilon Theorem, it must be remembered, does not prove that an existential statement is equivalent to a certain disjunction; it shows merely that an existential statement is provable if and only if a certain disjunction is provable. And what is also provable, in such a case, is a statement merely about one object. Indeed the existential statement is provably equivalent to it. It is this fact which supports the epsilon definition of the quantifiers; and it is what permits anaphoric reference to the same object by means of the same epsilon term. An existential statement is thus just another statement about an individual — merely a nameless one.

The reverse point goes for the universal quantifier: a universal statement is not the conjunction of its instances, even though it implies them. A generalisation is simply equivalent to one of its instances — to the one involving the prime putative exception to it, as we have seen. Not being able to specify that prime putative exception leaves Jeffrey saying that if a generalisation is false then one of its instances is false without any way of ensuring that that instance has been drawn as a conclusion below it in the truth tree except by limiting the interpretation of the generalisation just to the universe of discourse of the path. It thus seems necessary, within the predicate calculus, that there be a ‘model’ for the quantifiers which restricts them to a certain ‘domain’, which means that they do not necessarily range over everything. But in the epsilon calculus the quantifiers do, invariably, range over everything, and so there is no need to specify their range.

7. References and Further Reading

  • Ackermann, W. 1937-8, ‘Mengentheoretische Begründung der Logik’, Mathematische Annalen, 115, 1-22.
  • Asser, G. 1957, ‘Theorie der Logischen Auswahlfunktionen’, Zeitschrift für Mathematische Logik und Grundlagen der Mathematik, 3, 30-68.
  • Bernays, P. 1958, Axiomatic Set Theory, North Holland, Dordrecht.
  • Bertolet, R. 1980, ‘The Semantic Significance of Donnellan’s Distinction’, Philosophical Studies, 37, 281-288.
  • Bourbaki, N. 1954, Éléments de Mathématique, Hermann, Paris.
  • Bunt, H.C. 1985, Mass Terms and Model-Theoretic Semantics, C.U.P., Cambridge.
  • Church, A. 1940, ‘A Formulation of the Simple Theory of Types’, Journal of Symbolic Logic, 5, 56-68.
  • Copi, I. 1973, Symbolic Logic, 4th ed. Macmillan, New York.
  • Devitt, M. 1974, ‘Singular Terms’, The Journal of Philosophy, 71, 183-205.
  • Donnellan, K. 1966, ‘Reference and Definite Descriptions’, Philosophical Review, 75, 281-304.
  • Egli, U. and von Heusinger, K. 1995, ‘The Epsilon Operator and E-Type Pronouns’ in U. Egli et al. (eds.), Lexical Knowledge in the Organisation of Language, Benjamins, Amsterdam.
  • Evans, G. 1977, ‘Pronouns, Quantifiers and Relative Clauses’, Canadian Journal of Philosophy, 7, 467-536.
  • Geach, P.T. 1962, Reference and Generality, Cornell University Press, Ithaca.
  • Geach, P.T. 1967, ‘Intentional Identity’, The Journal of Philosophy, 64, 627-632.
  • Goddard, L. and Routley, R. 1973, The Logic of Significance and Context, Scottish Academic Press, Aberdeen.
  • Gödel, K. 1969, ‘An Interpretation of the Intuitionistic Sentential Calculus’, in J. Hintikka (ed.), The Philosophy of Mathematics, O.U.P. Oxford.
  • Hazen, A. 1987, ‘Natural Deduction and Hilbert’s ε-operator’, Journal of Philosophical Logic, 16, 411-421.
  • Hermes, H. 1965, Eine Termlogik mit Auswahloperator, Springer Verlag, Berlin.
  • Hilbert, D. 1923, ‘Die Logischen Grundlagen der Mathematik’, Mathematische Annalen, 88, 151-165.
  • Hilbert, D. 1925, ‘On the Infinite’ in J. van Heijenhoort (ed.), From Frege to Gödel, Harvard University Press, Cambridge MA.
  • Hilbert, D. and Bernays, P. 1970, Grundlagen der Mathematik, 2nd ed., Springer, Berlin.
  • Hughes, G.E. and Cresswell, M.J. 1968, An Introduction to Modal Logic, Methuen, London.
  • Jeffrey, R. 1967, Formal Logic: Its Scope and Limits, 1st Ed. McGraw-Hill, New York.
  • Kalish, D. and Montague, R. 1964, Logic: Techniques of Formal Reasoning, Harcourt, Brace and World, Inc, New York.
  • Kneebone, G.T. 1963, Mathematical Logic and the Foundations of Mathematics, Van Nostrand, Dordrecht.
  • Leisenring, A.C. 1969, Mathematical Logic and Hilbert’s ε-symbol, Macdonald, London.
  • Marciszewski, W. 1981, Dictionary of Logic, Martinus Nijhoff, The Hague.
  • Meyer Viol, W.P.M. 1995, Instantial Logic, ILLC Dissertation Series 1995-11, Amsterdam.
  • Montague, R. 1963, ‘Syntactical Treatments of Modality, with Corollaries on Reflection Principles and Finite Axiomatisability’, Acta Philosophica Fennica, 16, 155-167.
  • Neale, S. 1990, Descriptions, MIT Press, Cambridge MA.
  • Priest, G.G. 1984, ‘Semantic Closure’, Studia Logica, XLIII 1/2, 117-129.
  • Prior, A.N., 1971, Objects of Thought, O.U.P. Oxford.
  • Purdy, W.C. 1994, ‘A Variable-Free Logic for Anaphora’ in P. Humphreys (ed.) Patrick Suppes: Scientific Philosopher, Vol 3, Kluwer, Dordrecht, 41-70.
  • Quine, W.V.O. 1960, Word and Object, Wiley, New York.
  • Rasiowa, H. 1956, ‘On the ε-theorems’, Fundamenta Mathematicae, 43, 156-165.
  • Rosser, J. B. 1953, Logic for Mathematicians, McGraw-Hill, New York.
  • Routley, R. 1969, ‘A Simple Natural Deduction System’, Logique et Analyse, 12, 129-152.
  • Routley, R. 1977, ‘Choice and Descriptions in Enriched Intensional Languages II, and III’, in E. Morscher, J. Czermak, and P. Weingartner (eds), Problems in Logic and Ontology, Akademische Druck und Velagsanstalt, Graz.
  • Routley, R. 1980, Exploring Meinong’s Jungle, Departmental Monograph #3, Philosophy Department, R.S.S.S., A.N.U. Canberra.
  • Routley, R., Meyer, R. and Goddard, L. 1974, ‘Choice and Descriptions in Enriched Intensional Languages I’, Journal of Philosophical Logic, 3, 291-316.
  • Russell, B. 1905, ‘On Denoting’ Mind, 14, 479-493.
  • Sayward, C. 1987, ‘Prior’s Theory of Truth’ Analysis, 47, 83-87.
  • Slater, B.H. 1986(a), ‘E-type Pronouns and ε-terms’, Canadian Journal of Philosophy, 16, 27-38.
  • Slater, B.H. 1986(b), ‘Prior’s Analytic’, Analysis, 46, 76-81.
  • Slater, B.H. 1988(a), ‘Intensional Identities’, Logique et Analyse, 121-2, 93-107.
  • Slater, B.H. 1988(b), ‘Hilbertian Reference’, Noûs, 22, 283-97.
  • Slater, B.H. 1989(a), ‘Modal Semantics’, Logique et Analyse, 127-8, 195-209.
  • Slater, B.H. 1990, ‘Using Hilbert’s Calculus’, Logique et Analyse, 129-130, 45-67.
  • Slater, B.H. 1992(a), ‘Routley’s Formulation of Transparency’, History and Philosophy of Logic, 13, 215-24.
  • Slater, B.H. 1994(a), ‘The Epsilon Calculus’ Problematic’, Philosophical Papers, XXIII, 217-42.
  • Steen, S.W.P. 1972, Mathematical Logic, C.U.P. Cambridge.
  • Thomason, R. 1977, ‘Indirect Discourse is not Quotational’, Monist, 60, 340-354.
  • Thomason, R. 1980, ‘A Note on Syntactical Treatments of Modality’, Synthese, 44, 391-395.
  • Thomason, R.H. and Stalnaker, R.C. 1968, ‘Modality and Reference’, Noûs, 2, 359-372.

Author Information

Barry Hartley Slater
Email: slaterbh@cyllene.uwa.edu.au
University of Western Australia
Australia

Epicurus (341—271 B.C.E.)

epicurus2Epicurus is one of the major philosophers in the Hellenistic period, the three centuries following the death of Alexander the Great in 323 B.C.E. (and of Aristotle in 322 B.C.E.). Epicurus developed an unsparingly materialistic metaphysics, empiricist epistemology, and hedonistic ethics. Epicurus taught that the basic constituents of the world are atoms, uncuttable bits of matter, flying through empty space, and he tried to explain all natural phenomena in atomic terms. Epicurus rejected the existence of Platonic forms and an immaterial soul, and he said that the gods have no influence on our lives. Epicurus also thought skepticism was untenable, and that we could gain knowledge of the world relying upon the senses. He taught that the point of all one’s actions was to attain pleasure (conceived of as tranquility) for oneself, and that this could be done by limiting one’s desires and by banishing the fear of the gods and of death. Epicurus’ gospel of freedom from fear proved to be quite popular, and communities of Epicureans flourished for centuries after his death.

Table of Contents

  1. Life
  2. Sources
  3. Metaphysics
    1. Arguments for the Existence of Atoms and Void
    2. Properties of Atoms, Limitlessness of the Universe
    3. Differences from Democritus
      1. Weight
      2. The Swerve
      3. Sensible Qualities
    4. Mechanistic Explanations of Natural Phenomena
    5. The Gods
    6. Philosophy of Mind
    7. Perception
  4. Epistemology
    1. The Canon: Sensations, Preconceptions, and Feelings
    2. Anti-skeptical Arguments
      1. The “Lazy Argument”
      2. The Self-refutation Argument
      3. The Argument from Concept-formation
  5. Ethics
    1. Hedonism, Psychological and Ethical
    2. Types of Pleasure
    3. Types of Desire
    4. The Virtues
    5. Justice
    6. Friendship
    7. Death
      1. The No Subject of Harm Argument
      2. The Symmetry Argument
  6. References and Further Reading
    1. Collections of Primary Sources
    2. Recent Books on Particular Areas of Epicurus’ Philosophy

1. Life

Epicurus was born around 341 B.C.E., seven years after Plato’s death, and grew up in the Athenian colony of Samos, an island in the Mediterranean Sea. He was about 19 when Aristotle died, and he studied philosophy under followers of Democritus and Plato. Epicurus founded his first philosophical schools in Mytilene and Lampsacus, before moving to Athens around 306 B.C.E. There Epicurus founded the Garden, a combination of philosophical community and school. The residents of the Garden put Epicurus’ teachings into practice. Epicurus died from kidney stones around 271 or 270 B.C.E.

After Epicurus’ death, Epicureanism continued to flourish as a philosophical movement. Communities of Epicureans sprang up throughout the Hellenistic world; along with Stoicism, it was one of the major philosophical schools competing for people’s allegiances. Epicureanism went into decline with the rise of Christianity. Certain aspects of Epicurus’ thought were revived during the Renaissance and early modern periods, when reaction against scholastic neo-Aristotelianism led thinkers to turn to mechanistic explanations of natural phenomena.

2. Sources

Epicurus was a voluminous writer, but almost none of his own work survives. A likely reason for this is that Christian authorities found his ideas ungodly. Diogenes Laertius, who probably lived in the third century CE , wrote a 10-book Lives of the Philosophers, which includes three of Epicurus’ letters in its recounting of the life and teachings of Epicurus. These three letters are brief summaries of major areas of Epicurus’ philosophy: the Letter to Herodotus, which summarizes his metaphysics, the Letter to Pythocles, which gives atomic explanations for meteorological phenomena, and the Letter to Menoeceus, which summarizes his ethics. It also includes the Principal Doctrines, 40 sayings which deal mainly with ethical matters.

Because of the absence of Epicurus’ own writings, we have to rely on later writers to reconstruct Epicurus’ thought. Two of our most important sources are the Roman poet Lucretius (c. 94-55 B.C.E.) and the Roman politician Cicero (106-43 B.C.E.). Lucretius was an Epicurean who wrote De Rerum Natura (On the Nature of Things), a six-book poem expounding Epicurus’ metaphysics. Cicero was an adherent of the skeptical academy, who wrote a series of works setting forth the major philosophical systems of his day, including Epicureanism. Another major source is the essayist Plutarch (c. 50-120 CE), a Platonist. However, both Cicero and Plutarch were very hostile toward Epicureanism, so they must be used with care, since they often are less than charitable toward Epicurus, and may skew his views to serve their own purposes.

Although the major outlines of Epicurus’ thought are clear enough, the lack of sources means many of the details of his philosophy are still open to dispute.

3. Metaphysics

Epicurus believes that the basic constituents of the world are atoms (which are uncuttable, microscopic bits of matter) moving in the void (which is simply empty space). Ordinary objects are conglomerations of atoms. Furthermore, the properties of macroscopic bodies and all of the events we see occurring can be explained in terms of the collisions, reboundings, and entanglements of atoms.

a. Arguments for the Existence of Atoms and Void

Epicurus’ metaphysics starts from two simple points: (1) we see that there are bodies in motion, and (2) nothing comes into existence from what does not exist. Epicurus takes the first point to be simply a datum of experience. The second point is a commonplace of ancient Greek philosophy, derived from the Principle of Sufficient Reason (the principle that for everything which occurs there is a reason or explanation for why it occurs, and why this way rather than that).

First, because bodies move, there must be empty space for them to move in, and Epicurus calls this empty space ‘void.’ Second, the ordinary bodies that we see are compound bodies–that is, bodies which are made up of further bodies, which is shown by the fact that they can be broken down into smaller pieces. However, Epicurus thinks that this process of division cannot go on indefinitely, because otherwise bodies would dissolve away into nothing. Also, there must be basic and unchangeable building blocks of matter in order to explain the regularities in nature. These non-compound bodies are atoms–literally, ‘uncuttables.’ Only bodies and void exist per se, that is, exist without depending for their existence on something else. Other things–such as colors, time, and justice–are ultimately explicable as attributes of bodies.

b. Properties of Atoms, Limitlessness of the Universe

Because Epicurus believes that nothing comes into existence from nothing, he thinks that the universe has no beginning, but has always existed, and will always exist. Atoms, too, as the basic building blocks of all else, cannot come into existence, but have always existed. Our particular cosmos, however, is only a temporary agglomeration of atoms, and it is only one of an infinite number of such cosmoi, which come into existence and then dissolve away. Against Aristotle, Epicurus argues that the universe is unlimited in size. If the universe were limited in size, says Epicurus, you could go to the end of it, stick your fist out, and where your fist was located would be the new ‘limit’ of the universe. Of course, this process could be reiterated an endless number of times. Since the universe is unlimited in size, there must also be an unlimited number of atoms and an infinite amount of void. If the number of atoms were limited, then the ‘density’ of atoms in any region would effectively be zero, and there would be no macroscopic bodies, as there evidently are. And there must be an unlimited amount of void, since without a limitless amount of void, the infinite number of atoms would be unable to move.

c. Differences from Democritus

Up to this point, Epicurus is largely following the thought of Democritus, a pre-Socratic philosopher and one of the inventors of atomism. However, he modifies Democritus’ atomism in at least three important ways.

i. Weight

The first is that Epicurus thinks that atoms have weight. Like Democritus, Epicurus believes that atoms have the properties of size, shape, and resistance. Democritus explains all atomic motion as the result of previous atomic collisions, plus the inertia of atoms. Aristotle, however, criticizes Democritus on this point, saying that Democritus has not explained why it is that atoms move at all, rather than simply standing still. Epicurus seems to be answering this criticism when he says that atoms do have a natural motion of direction–‘downward’–even though there is no bottom to the universe. This natural motion is supposed to give an explanation for why atoms move in the first place. Also, Epicurus thinks that it is evident that bodies do tend to travel down, all else being equal, and he thinks that positing weight as an atomic property accounts for this better than thinking all atomic motion is the result of past collisions and inertia.

ii. The Swerve

The second modification of Democritus’ views is the addition of the ‘swerve.’ In addition to the regular tendency of atoms to move downward, Epicurus thinks that occasionally, and at random times, the atoms swerve to the side. One reason for this swerve is that it is needed to explain why there are atomic collisions. The natural tendency of atoms is to fall straight downward, at uniform velocity. If this were the only natural atomic motion, the atoms never would have collided with one another, forming macroscopic bodies. As Lucretius puts it, they would ‘fall downward, like drops of rain, through the deep void.’ The second reason for thinking that atoms swerve is that a random atomic motion is needed to preserve human freedom and ‘break the bonds of fate,’ as Lucretius says. If the laws of atomic motion are deterministic, then the past positions of the atoms in the universe, plus these laws, determine everything that will occur, including human action. Cicero reports that Epicurus worries that, if it has been true from eternity that, e.g., “Milo will wrestle tomorrow,” then presently deliberating about whether to make it true or false would be idle.

iii. Sensible Qualities

The third difference between Epicurus and Democritus has to do with their attitudes toward the reality of sensible properties. Democritus thinks that, in reality, only atoms and the void exist, and that sensible qualities such as sweetness, whiteness, and the like exist only ‘by convention.’ It is controversial exactly how to understand Democritus’ position, but most likely he is asserting that atoms themselves have no sensible qualities–they are simply extended bits of stuff. The sensible qualities that we think bodies have, like sweetness, are not really in the object at all, but are simply subjective states of the percipient’s awareness produced by the interaction of bodies with our sense-organs. This is shown, thinks Democritus, by the fact that the same body appears differently to different percipients depending on their bodily constitution, e.g., that a ‘white’ body appears yellow to somebody with jaundice, or that honey tastes bitter to an ill person. From this, Democritus derives skeptical conclusions. He is pessimistic about our ability to gain any knowledge about the world on the basis of our senses, since they systematically deceive us about the way the world is.

Epicurus wants to resist these pessimistic conclusions. He argues that properties like sweetness, whiteness, and such do not exist at the atomic level–individual atoms are not sweet or white–but that these properties are nonetheless real. These are properties of macroscopic bodies, but the possession of these properties by macroscopic bodies are explicable in terms of the properties of and relations amongst the individual atoms that make up bodies. Epicurus thinks that bodies have the capability to cause us to have certain types of experiences because of their atomic structure, and that such capabilities are real properties of the bodies. Similar considerations apply for properties like “being healthy,” “being deadly,” and “being enslaved.” They are real, but can only apply to groups of atoms (like people), not individual atoms. And these sorts of properties are also relational properties, not intrinsic ones. For example, cyanide is deadly–not deadly per se, but deadly for human beings (and perhaps for other types of organisms). Nonetheless, its deadliness for us is still a real property of the cyanide, albeit a relational one.

d. Mechanistic Explanations of Natural Phenomena

One important aspect of Epicurus’ philosophy is his desire to replace teleological (goal-based) explanations of natural phenomena with mechanistic ones. His main target is mythological explanations of meteorological occurrences and the like in terms of the will of the gods. Because Epicurus wishes to banish the fear of the gods, he insists that occurrences like earthquakes and lightning can be explained entirely in atomic terms and are not due to the will of the gods. Epicurus is also against the intrinsic teleology of philosophers like Aristotle. Teeth appear to be well-designed for the purpose of chewing. Aristotle thinks that this apparent purposiveness in nature cannot be eliminated, and that the functioning of the parts of organisms must be explained by appealing to how they contribute to the functioning of the organism as a whole. Other philosophers, such as the Stoics, took this apparent design as evidence for the intelligence and benevolence of God. Epicurus, however, following Empedocles, tries to explain away this apparent purposiveness in nature in a proto-Darwinian way, as the result of a process of natural selection.

e. The Gods

Because of its denial of divine providence, Epicureanism was often charged in antiquity with being a godless philosophy, although Epicurus and his followers denied the charge. The main upshot of Epicurean theology is certainly negative, however. Epicurus’ mechanistic explanations of natural phenomena are supposed to displace explanations that appeal to the will of the gods. In addition, Epicurus is one of the earliest philosophers we know of to have raised the Problem of Evil, arguing against the notion that the world is under the providential care of a loving deity by pointing out the manifold suffering in the world.

Despite this, Epicurus says that there are gods, but these gods are quite different from the popular conception of gods. We have a conception of the gods, says Epicurus, as supremely blessed and happy beings. Troubling oneself about the miseries of the world, or trying to administer the world, would be inconsistent with a life of tranquility, says Epicurus, so the gods have no concern for us. In fact, they are unaware of our existence, and live eternally in the intermundia, the space between the cosmoi. For Epicurus, the gods function mainly as ethical ideals, whose lives we can strive to emulate, but whose wrath we need not fear.

Ancient critics thought the Epicurean gods were a thin smoke-screen to hide Epicurus’ atheism, and difficulties with a literal interpretation of Epicurus’ sayings on the nature of the gods (for instance, it appears inconsistent with Epicurus’ atomic theory to hold that any compound body, even a god, could be immortal) have led some scholars to conjecture that Epicurus’ ‘gods’ are thought-constructs, and exist only in human minds as idealizations, i.e., the gods exist, but only as projections of what the most blessed life would be.

f. Philosophy of Mind

Epicurus is one of the first philosophers to put forward an Identity Theory of Mind. In modern versions of the identity theory, the mind is identified with the brain, and mental processes are identified with neural processes. Epicurus’ physiology is quite different; the mind is identified as an organ that resides in the chest, since the common Greek view was that the chest, not the head, is the seat of the emotions. However, the underlying idea is quite similar. (Note: not all commentators accept that Epicurus’ theory is actually an Identity Theory.)

The main point that Epicurus wants to establish is that the mind is something bodily. The mind must be a body, thinks Epicurus, because of its ability to interact with the body. The mind is affected by the body, as vision, drunkenness, and disease show. Likewise, the mind affects the body, as our ability to move our limbs when we want to and the physiological effects of emotional states show. Only bodies can interact with other bodies, so the mind must be a body. Epicurus says that the mind cannot be something incorporeal, as Plato thinks, since the only thing that is not a body is void, which is simply empty space and cannot act or be acted upon.

The mind, then, is an organ in the body, and mental processes are identified with atomic processes. The mind is composed of four different types of particles–fire, air, wind, and the “nameless element,” which surpasses the other particles in its fineness. Although Epicurus is reticent about the details, some features of the mind are accounted for in terms of the features of these atoms–for instance, the mind is able to be moved a great deal by the impact of an image (which is something quite flimsy), because of the smallness of the particles that make up the mind. The mind proper, which is primarily responsible for sensation and thought, is located in the chest, but Epicurus thinks that there is also a ‘spirit,’ spread throughout the rest of the body, which allows the mind to communicate with it. The mind and spirit play roles very similar to those of the central and peripheral nervous systems in modern theory.

One important result of Epicurus’ philosophy of mind is that death is annihilation. The mind is able to engage in the motions of sensation and thought only when it is housed in the body and the atoms that make it up are properly arranged. Upon death, says Epicurus, the container of the body shatters, and the atoms disperse in the air. The atoms are eternal, but the mind made up of these atoms is not, just as other compound bodies cease to exist when the atoms that make them up disperse.

g. Perception

Epicurus explains perception in terms of the interaction of atoms with the sense-organs. Objects continually throw off one-atom-thick layers, like the skin peeling off of an onion. These images, or “eidola,” fly through the air and bang into one’s eyes, from which one learns about the properties of the objects that threw off these eidola. This explains vision. Other senses are analyzed in similar terms; e.g., the soothing action of smooth atoms on the tongue causes the sensation of sweetness. As noted above, Epicurus maintains that such sensible qualities are real qualities of bodies.

4. Epistemology

a. The Canon: Sensations, Preconceptions, and Feelings

Epicurus’ epistemology is resolutely empiricist and anti-skeptical. All of our knowledge ultimately comes from the senses, thinks Epicurus, and we can trust the senses, when properly used. Epicurus’ epistemology was contained in his work the ‘Canon,’ or ‘measuring stick,’ which is lost, so many of the details of his views are unavailable to us.

Epicurus says that there are three criteria of truth: sensations, ‘preconceptions,’ and feelings. Sensations give us information about the external world, and we can test the judgments based upon sensations against further sensations; e.g., a provisional judgment that a tower is round, based upon sensation, can be tested against later sensations to be corroborated or disproved. Epicurus says that all sensations give us information about the world, but that sensation itself is never in error, since sensation is a purely passive, mechanical reception of images and the like by sense-organs, and the senses themselves do not make judgments ‘that’ the world is this way or that. Instead, error enters in when we make judgments about the world based upon the information received through the senses.

Epicurus thinks that, in order to make judgments about the world, or even to start any inquiry whatsoever, we must already be in possession of certain basic concepts, which stand in need of no further proof or definition, on pain of entering into an infinite regress. This concern is similar to the Paradox of Inquiry explored by Plato in the Meno, that one must already know about something in order to be able to inquire about it. However, instead of postulating that our immaterial souls had acquaintance with transcendent Forms in a pre-natal existence, as Plato does, Epicurus thinks that we have certain ‘preconceptions’–concepts such as ‘body,’ ‘person,’ ‘usefulness,’ and ‘truth’–which are formed in our (material) minds as the result of repeated sense-experiences of similar objects. Further ideas are formed by processes of analogy or similarity or by compounding these basic concepts. Thus, all ideas are ultimately formed on the basis of sense-experience.

Feelings of pleasure and pain form the basic criteria for what is to be sought and avoided.

b. Anti-skeptical Arguments

Epicurus is concerned to refute the skeptical tendencies of Democritus, whose metaphysics and theory of perception were similar to Epicurus’. At least three separate anti-skeptical arguments are given by Epicureans:

i. The “Lazy Argument”

Epicurus says that it is impossible to live as a skeptic. If a person really were to believe that he knows nothing, then he would have no reason to engage in one course of action instead of another. Thus, the consistent skeptic would engage in no action whatsoever, and would die.

ii. The Self-refutation Argument

If a skeptic claims that nothing can be known, then one should ask whether he knows that nothing can be known. If he says ‘yes,’ then he is contradicting himself. If he doesn’t say yes, then he isn’t making a claim, and we don’t need to listen to him.

iii. The Argument from Concept-formation

If the skeptic says that nothing can be known, or that we cannot know the truth, we can ask him where he gets his knowledge of concepts such as ‘knowledge’ and ‘truth.’ If the senses cannot be relied on, as the skeptic claims, then he is not entitled to use concepts such as ‘knowledge’ and ‘truth’ in formulating his thesis, since such concepts derive from the senses.

5. Ethics

Epicurus’ ethics is a form of egoistic hedonism; i.e., he says that the only thing that is intrinsically valuable is one’s own pleasure; anything else that has value is valuable merely as a means to securing pleasure for oneself. However, Epicurus has a sophisticated and idiosyncratic view of the nature of pleasure, which leads him to recommend a virtuous, moderately ascetic life as the best means to securing pleasure. This contrasts Epicurus strongly with the Cyrenaics, a group of ancient hedonists who better fit the stereotype of hedonists as recommending a policy of “eat, drink, and be merry.”

a. Hedonism, Psychological and Ethical

Epicurus’ ethics starts from the Aristotelian commonplace that the highest good is what is valued for its own sake, and not for the sake of anything else, and Epicurus agrees with Aristotle that happiness is the highest good. However, he disagrees with Aristotle by identifying happiness with pleasure. Epicurus gives two reasons for this. The main reason is that pleasure is the only thing that people do, as a matter of fact, value for its own sake; that is, Epicurus’ ethical hedonism is based upon his psychological hedonism. Everything we do, claims Epicurus, we do for the sake ultimately of gaining pleasure for ourselves. This is supposedly confirmed by observing the behavior of infants, who, it is claimed, instinctively pursue pleasure and shun pain. This is also true of adults, thinks Epicurus, but in adults it is more difficult to see that this is true, since adults have much more complicated beliefs about what will bring them pleasure. But the Epicureans did spend a great deal of energy trying to make plausible the contention that all activity, even apparently self-sacrificing activity or activity done solely for the sake of virtue or what is noble, is in fact directed toward obtaining pleasure for oneself.

The second proof, which fits in well with Epicurus’ empiricism, supposedly lies in one’s introspective experience. One immediately perceives that pleasure is good and that pain is bad, in the same way that one immediately perceives that fire is hot; no further argument is needed to show the goodness of pleasure or the badness of pain. (Of course, this does not establish Epicurus’ further contention that only pleasure is intrinsically valuable and only pain is intrinsically bad.)

Although all pleasures are good and all pains evil, Epicurus says that not all pleasures are choiceworthy or all pains to be avoided. Instead, one should calculate what is in one’s long-term self-interest, and forgo what will bring pleasure in the short-term if doing so will ultimately lead to greater pleasure in the long-term.

b. Types of Pleasure

For Epicurus, pleasure is tied closely to satisfying one’s desires. He distinguishes between two different types of pleasure: ‘moving’ pleasures and ‘static’ pleasures. ‘Moving’ pleasures occur when one is in the process of satisfying a desire, e.g., eating a hamburger when one is hungry. These pleasures involve an active titillation of the senses, and these feelings are what most people call ‘pleasure.’ However, Epicurus says that after one’s desires have been satisfied, (e.g., when one is full after eating), the state of satiety, of no longer being in need or want, is itself pleasurable. Epicurus calls this a ‘static’ pleasure, and says that these static pleasures are the best pleasures.

Because of this, Epicurus denies that there is any intermediate state between pleasure and pain. When one has unfulfilled desires, this is painful, and when one no longer has unfulfilled desires, this steady state is the most pleasurable of all, not merely some intermediate state between pleasure and pain.

Epicurus also distinguishes between physical and mental pleasures and pains. Physical pleasures and pains concern only the present, whereas mental pleasures and pains also encompass the past (fond memories of past pleasure or regret over past pain or mistakes) and the future (confidence or fear about what will occur). The greatest destroyer of happiness, thinks Epicurus, is anxiety about the future, especially fear of the gods and fear of death. If one can banish fear about the future, and face the future with confidence that one’s desires will be satisfied, then one will attain tranquility (ataraxia), the most exalted state. In fact, given Epicurus’ conception of pleasure, it might be less misleading to call him a ‘tranquillist’ instead of a ‘hedonist.’

c. Types of Desire

Because of the close connection of pleasure with desire-satisfaction, Epicurus devotes a considerable part of his ethics to analyzing different kinds of desires. If pleasure results from getting what you want (desire-satisfaction) and pain from not getting what you want (desire-frustration), then there are two strategies you can pursue with respect to any given desire: you can either strive to fulfill the desire, or you can try to eliminate the desire. For the most part Epicurus advocates the second strategy, that of paring your desires down to a minimum core, which are then easily satisfied.

Epicurus distinguishes between three types of desires: natural and necessary desires, natural but non-necessary desires, and “vain and empty” desires. Examples of natural and necessary desires include the desires for food, shelter, and the like. Epicurus thinks that these desires are easy to satisfy, difficult to eliminate (they are ‘hard-wired’ into human beings naturally), and bring great pleasure when satisfied. Furthermore, they are necessary for life, and they are naturally limited: that is, if one is hungry, it only takes a limited amount of food to fill the stomach, after which the desire is satisfied. Epicurus says that one should try to fulfill these desires.

Vain desires include desires for power, wealth, fame, and the like. They are difficult to satisfy, in part because they have no natural limit. If one desires wealth or power, no matter how much one gets, it is always possible to get more, and the more one gets, the more one wants. These desires are not natural to human beings, but inculcated by society and by false beliefs about what we need; e.g., believing that having power will bring us security from others. Epicurus thinks that these desires should be eliminated.

An example of a natural but non-necessary desire is the desire for luxury food. Although food is needed for survival, one does not need a particular type of food to survive. Thus, despite his hedonism, Epicurus advocates a surprisingly ascetic way of life. Although one shouldn’t spurn extravagant foods if they happen to be available, becoming dependent on such goods ultimately leads to unhappiness. As Epicurus puts it, “If you wish to make Pythocles wealthy, don’t give him more money; rather, reduce his desires.” By eliminating the pain caused by unfulfilled desires, and the anxiety that occurs because of the fear that one’s desires will not be fulfilled in the future, the wise Epicurean attains tranquility, and thus happiness.

d. The Virtues

Epicurus’ hedonism was widely denounced in the ancient world as undermining traditional morality. Epicurus, however, insists that courage, moderation, and the other virtues are needed in order to attain happiness. However, the virtues for Epicurus are all purely instrumental goods–that is, they are valuable solely for the sake of the happiness that they can bring oneself, not for their own sake. Epicurus says that all of the virtues are ultimately forms of prudence, of calculating what is in one’s own best interest. In this, Epicurus goes against the majority of Greek ethical theorists, such as the Stoics, who identify happiness with virtue, and Aristotle, who identifies happiness with a life of virtuous activity. Epicurus thinks that natural science and philosophy itself also are instrumental goods. Natural science is needed in order to give mechanistic explanations of natural phenomena and thus dispel the fear of the gods, while philosophy helps to show us the natural limits of our desires and to dispel the fear of death.

e. Justice

Epicurus is one of the first philosophers to give a well-developed contractarian theory of justice. Epicurus says that justice is an agreement “neither to harm nor be harmed,” and that we have a preconception of justice as “what is useful in mutual associations.” People enter into communities in order to gain protection from the dangers of the wild, and agreements concerning the behavior of the members of the community are needed in order for these communities to function, e.g., prohibitions of murder, regulations concerning the killing and eating of animals, and so on. Justice exists only where there are such agreements.

Like the virtues, justice is valued entirely on instrumental grounds, because of its utility for each of the members of society. Epicurus says that the main reason not to be unjust is that one will be punished if one gets caught, and that even if one does not get caught, the fear of being caught will still cause pain. However, he adds that the fear of punishment is needed mainly to keep fools in line, who otherwise would kill, steal, etc. The Epicurean wise man recognizes the usefulness of the laws, and since he does not desire great wealth, luxury goods, political power, or the like, he sees that he has no reason to engage in the conduct prohibited by the laws in any case.

Although justice only exists where there is an agreement about how to behave, that does not make justice entirely ‘conventional,’ if by ‘conventional’ we mean that any behavior dictated by the laws of a particular society is thereby just, and that the laws of a particular society are just for that society. Since the ‘justice contract’ is entered into for the purpose of securing what is useful for the members of the society, only laws that are actually useful are just. Thus, a prohibition of murder would be just, but antimiscegenation laws would not. Since what is useful can vary from place to place and time to time, what laws are just can likewise vary.

f. Friendship

Epicurus values friendship highly and praises it in quite extravagant terms. He says that friendship “dances around the world” telling us that we must “wake to blessedness.” He also says that the wise man is sometimes willing to die for a friend. Because of this, some scholars have thought that in this area, at least, Epicurus abandons his egoistic hedonism and advocates altruism toward friends. This is not clear, however. Epicurus consistently maintains that friendship is valuable because it is one of the greatest means of attaining pleasure. Friends, he says, are able to provide one another the greatest security, whereas a life without friends is solitary and beset with perils. In order for there to be friendship, Epicurus says, there must be trust between friends, and friends have to treat each other as well as they treat themselves. The communities of Epicureans can be seen as embodying these ideals, and these are ideals that ultimately promote ataraxia.

g. Death

One of the greatest fears that Epicurus tries to combat is the fear of death. Epicurus thinks that this fear is often based upon anxiety about having an unpleasant afterlife; this anxiety, he thinks, should be dispelled once one realizes that death is annihilation, because the mind is a group of atoms that disperses upon death.

i. The No Subject of Harm Argument

If death is annihilation, says Epicurus, then it is ‘nothing to us.’ Epicurus’ main argument for why death is not bad is contained in the Letter to Menoeceus and can be dubbed the ‘no subject of harm’ argument. If death is bad, for whom is it bad? Not for the living, since they’re not dead, and not for the dead, since they don’t exist. His argument can be set out as follows:

  1. Death is annihilation.
  2. The living have not yet been annihilated (otherwise they wouldn’t be alive).
  3. Death does not affect the living. (from 1 and 2)
  4. So, death is not bad for the living. (from 3)
  5. For something to be bad for somebody, that person has to exist, at least.
  6. The dead do not exist. (from 1)
  7. Therefore, death is not bad for the dead. (from 5 and 6)
  8. Therefore death is bad for neither the living nor the dead. (from 4 and 7)

Epicurus adds that if death causes you no pain when you’re dead, it’s foolish to allow the fear of it to cause you pain now.

ii. The Symmetry Argument

A second Epicurean argument against the fear of death, the so-called ‘symmetry argument,’ is recorded by the Epicurean poet Lucretius. He says that anyone who fears death should consider the time before he was born. The past infinity of pre-natal non-existence is like the future infinity of post-mortem non-existence; it is as though nature has put up a mirror to let us see what our future non-existence will be like. But we do not consider not having existed for an eternity before our births to be a terrible thing; therefore, neither should we think not existing for an eternity after our deaths to be evil.

6. References and Further Reading

This is not meant as comprehensive bibliography; rather, it’s a selection of further texts to read for those who want to learn more about Epicurus and Epicureanism. Most of the books listed below have extensive bibliographies for those looking for more specialized and scholarly publications.

a. Collections of Primary Source

  • The Epicurus Reader, translated and edited by Brad Inwood and L.P. Gerson, Hackett Publishing.
    • This inexpensive collection has most of the major extant writings of Epicurus, in addition to other ancient sources such as Cicero and Plutarch who wrote about Epicureanism. (Lucretius is not included much.) However, there is little commentary or explication of the material, and some of the primary sources are fairly opaque.
  • The Hellenistic philosophers, Volume 1: translations of the principal sources, with philosophical commentary, by A.A. Long and D.N. Sedley, Cambridge University Press.
    • This excellent book organizes the texts into sections topically, (e.g., “Atoms,” “Soul,” “Language,” “Death,”) and follows each selection of texts with commentary and explication. Vol. 2, which contains the original Greek and Latin texts, has a fine, if somewhat dated (1987) bibliography at the end.Lucretius, De Rerum Natura

There are many different editions of Lucretius’ masterpiece, an extended exposition of Epicurus’ metaphysics, philosophy of mind, and natural science. I personally like the translation by Rolfe Humphries: Lucretius: The Way Things Are. The De Rerum Natura of Titus Lucretius Carus, Indiana University Press. Humphries translates Lucretius’ poem as a poem, not as prose, yet the translation is still very clear and readable.

b. Recent Books on Particular Areas of Epicurus’ Philosophy

The books below are all well-written and influential. They deal in-depth with problems of interpreting particular areas of Epicurus’ philosophy, while still remaining, for the most part, accessible to well-educated general readers. They also have extensive bibliographies. However, do not assume that the interpretations of Epicurus in these books are always widely accepted.

  • Hellenistic Philosophy of Mind, by Julia Annas, University of California Press.
    • This book deals with Epicurean and Stoic theories of what the mind is.
  • Epicurus’ Ethical Theory : The Pleasures of Invulnerability, by Phillip Mitsis, Cornell University Press.
    • This book is concerned with all of the major areas of Epicurean ethics, from pleasure, to friendship, justice, and human freedom. Mitsis is especially good at showing how Epicurus’ conception of pleasure differs from that of the utilitarians.
  • The Morality of Happiness, by Julia Annas, Oxford University Press.
    • This book focuses deals with all major ancient theorists from Aristotle on, but is still a good source of information on Epicurean ethics, especially if one wants to put Epicurean ethics in the context of other ancient ethical theories.
  • Epicurus’ Scientific Method, by Elizabeth Asmis, Cornell University Press.
    • The best book-length treatment of Epicurus’ epistemology available. A little more technical than the books above, but still fairly accessible.

Author Information

Tim O’Keefe
Email: see web page
Georgia State University
U. S. A.

Epictetus (55–135 C.E.)

epictetusEpictetus (pronounced Epic-TEE-tus) was an exponent of Stoicism who flourished in the early second century C.E. about four hundred years after the Stoic school of Zeno of Citium was established in Athens. He lived and worked, first as a student in Rome, and then as a teacher with his own school in Nicopolis in Greece. Our knowledge of his philosophy and his method as a teacher comes to us via two works composed by his student Arrian, the Discourses and the Handbook. Although Epictetus based his teaching on the works of the early Stoics (none of which survives) which dealt with the three branches of Stoic thought, logic, physics and ethics, the Discourses and the Handbook concentrate almost exclusively on ethics. The role of the Stoic teacher was to encourage his students to live the philosophic life, whose end was eudaimonia (‘happiness’ or ‘flourishing’), to be secured by living the life of reason, which – for Stoics – meant living virtuously and living ‘according to nature’. The eudaimonia (‘happiness’) of those who attain this ideal consists of ataraxia (imperturbability), apatheia (freedom from passion), eupatheiai (‘good feelings’), and an awareness of, and capacity to attain, what counts as living as a rational being should. The key to transforming oneself into the Stoic sophos (wise person) is to learn what is ‘in one’s power’, and this is ‘the correct use of impressions’ (phantasiai), which in outline involves not judging as good or bad anything that appears to one. For the only thing that is good is acting virtuously (that is, motivated by virtue), and the only thing that is bad is the opposite, acting viciously (that is, motivated by vice). Someone who seeks to make progress as a Stoic (a prokoptôn) understands that their power of rationality is a fragment of God whose material body – a sort of rarefied fiery air – blends with the whole of creation, intelligently forming and directing undifferentiated matter to make the world as we experience it. The task of the prokoptôn, therefore, is to ‘live according to nature’, which means (a) pursuing a course through life intelligently responding to one’s own needs and duties as a sociable human being, but also (b) wholly accepting one’s fate and the fate of the world as coming directly from the divine intelligence which makes the world the best that is possible.

Table of Contents

  1. Life
  2. Writings
    1. The Discourses
    2. The Handbook
  3. Epictetus’ Stoicism
  4. Key Concepts
    1. The Promise of Philosophy
    2. What is Really Good
    3. What is in our Power
    4. Making Proper Use of Impressions
    5. The Three Topoi
      1. The Discipline of Desire
      2. The Discipline of Action
      3. The Discipline of Assent
    6. God
    7. On Living in Accord with Nature
    8. Metaphors for Life
    9. Making Progress
  5. Glossary of Terms
  6. References and Further Reading
    1. Translations of Epictetus
    2. Translations of Hellenistic Philosophers, including the Stoics
    3. Items that Address Epictetus Specifically
    4. Items Addressing Stoic Philosophy and/or Hellenistic Ethics Generally
    5. Other Items on Hellenistic Philosophy Generally

1. Life

It is possible to draw only a basic sketch of Epictetus’ life. Resources at our disposal include just a handful of references in the ancient texts, to which we can add the few allusions that Epictetus makes to his own life in the Discourses.

Epictetus was born in about 55 C.E. in Hierapolis in Phrygia (modern-day Pamukkale, in south-western Turkey). As a boy he somehow came to Rome as a slave of Epaphroditus who was a rich and powerful freedman, having himself been a slave of the Emperor Nero (he had been an administrative secretary). Whilst still a slave, Epictetus studied with the Stoic teacher Musonius Rufus.

There is a story told by the author Celsus (probably a younger contemporary of Epictetus) – quoted by the early Christian Origen (c.185–254) at Contra Celsum 7.53 – that when still a slave, Epictetus was tortured by his master who twisted his leg. Enduring the pain with complete composure, Epictetus warned Epaphroditus that his leg would break, and when it did break, he said, ‘There, did I not tell you that it would break?’ And from that time Epictetus was lame. The Suda (tenth century), however, although confirming that Epictetus was lame, attributes his affliction to rheumatism.

At some point Epictetus was manumitted, and in about 89, along with other philosophers then in Rome, was banished by the Emperor Domitian. He went to Nicopolis in Epirus (in north-western Greece) where he opened his own school which acquired a good reputation, attracting many upper-class Romans. One such student was Flavius Arrian (c.86–160) who would compose the Discourses and the Handbook, and who later served in public office under the Emperor Hadrian and made his mark as a respected historian (much of his writings survive). Origen (Contra Celsum 6.2) reports that Epictetus had been more popular in his day than had Plato in his. Aulus Gellius (c.125–c.165) reports that one of Marcus Aurelius’ teachers, Herodes Atticus (c.101–177), considered Epictetus to be ‘the greatest of Stoics’ (Attic Nights 1.2.6).

Our sources report that Epictetus did not marry, had no children, and lived to an old age. With respect to marriage and children we may note the story from Lucian (Demonax 55) about the Cynic philosopher Demonax who had been a pupil of Epictetus. On hearing Epictetus exhort his students to marry and have children (for it was a philosopher’s duty to provide a substitute ready for the time when they would die), he sarcastically asked Epictetus whether he could marry one of his daughters.

2. Writings

It appears that Epictetus wrote nothing himself. The works we have that present his philosophy were written by his student, Flavius Arrian. We may conjecture that the Discourses and the Handbook were written some time around the years 104–107, at the time when Arrian (born c.86) was most likely to have been a student.

Dobbin (1998), though, holds the view that the Discourses and the Handbook were actually written by Epictetus himself; the Suda does say, after all, that Epictetus ‘wrote a great deal’. Dobbin is not entirely convinced by Arrian’s claim in his dedicatory preface that he wrote down Epictetus’ words verbatim; firstly, stenographic techniques at this time were primitive, and anyway were the preserve of civil servants; secondly, most of the discourses are too polished, and look too much like carefully crafted prose to be the product of impromptu discussions; and thirdly, some of the discourses (notably 1.29, 3.22 and 4.1) are too long for extempore conversations.

There is no way to resolve this question with certainty. Whether the texts we have do indeed represent a serious attempt to record Epictetus at work verbatim, whether draft texts were later edited and rewritten (as seems wholly likely), possibly by Epictetus, or whether Epictetus did in fact write the texts himself, drawing on his recollections as a lecturer with only occasional attempts at strictly verbatim accuracy, we shall never know. But what we can be certain of, regardless of who actually wrote the words onto the papyrus to make the first draft of the text as we have it today, is that those words were intended to present Stoic moral philosophy in the terms and the style that Epictetus employed as a teacher intent on bringing his students to philosophic enlightenment as the Stoics had understood this enterprise.

a. Discourses

Written in Koine Greek, the everyday contemporary form of the language, Epictetus’ Discourses appear to record the exchanges between Epictetus and his students after formal teaching had concluded for the day. Internal textual evidence confirms that the works of the early Stoic philosophers (Zeno, Cleanthes and Chrysippus) were read and discussed in Epictetus’ classes, but this aspect of Epictetus’ teaching is not recorded by Arrian. What we have, then, are intimate, though earnest, discussions in which Epictetus aims to make his students consider carefully what the philosophic life – for a Stoic – consists in, and how to live it oneself. He discusses a wide range of topics, from friendship to illness, from fear to poverty, on how to acquire and maintain tranquillity, and why we should not be angry with other people.

Not all of the Discourses appear to have survived, as the ancient Byzantine scholar Photius (c.810–c.893) reports that the complete text originally comprised eight books, whereas all we have today are four books. Because the text, chapter by chapter, jumps to different topics and shows no orderly development, it is not readily apparent that anything is missing, and indeed, the reference to eight books may be mistaken (though another author, Aulus Gellius, at Attic Nights 19.1.14, refers to the fifth book of the Discourses). The range of topics is sufficiently broad for us to be reasonably confident that, even if some of the text has been lost, what we lack by and large repeats and revisits the material that we have in the book as it has come down to us.

b. The Handbook

This little book appears to be an abstract of the Discourses, focusing on key themes in Epictetus’ teaching of Stoic ethics. Some of the text is taken from the Discourses, and the fact that not all of it can be correlated with passages in the larger work supports the view that some of the Discourses has indeed been lost.

3. Epictetus’ Stoicism

The writings of the early Stoics, of Zeno (335–263 B.C.E.) the founder of the school, of Chrysippus (c.290–207 B.C.E.) the extremely influential third head of the Stoa, and of others, survive only as quoted fragments found in later works. The question arises as to what extent Epictetus preserved the original doctrines of the Stoic school, and to what extent, if any, he branched out with new emphases and innovations of his own. The nineteenth-century Epictetan scholar Adolf Bonhöffer (1998, 3) remarks: ‘[Epictetus] is completely free of the eclecticism of Seneca and Marcus Aurelius; and, compared with his teacher Musonius Rufus … his work reveals a considerably closer connection to Stoic doctrine and terminology as developed mainly by Chrysippus.’ Evidence internal to the Discourses indicates that Epictetus was indeed faithful to the early Stoics. At 1.4.28–31, Epictetus praises Chrysippus in the highest terms, saying of him, ‘How great the benefactor who shows the way! … who has discovered, and brought to light, and communicated, the truth to all, not merely of living, but of living well’ (trans. Hard). It would be inconsistent, if not wholly ridiculous, to laud Chrysippus in such terms and then proceed to depart oneself from the great man’s teaching. At 1.20.15, Epictetus quotes Zeno, and at 2.6.9–10 he quotes Chrysippus, to support his arguments. Aulus Gellius (Attic Nights 19.1.14) says that Epictetus’ Discourses ‘undoubtedly agree with the writings of Zeno and Chrysippus’.

Scholars are agreed that the ‘doctrine of the three topics (topoi)’ (fields of study) which we find in the Discourses originates with Epictetus (see Bonhöffer 1996, 32; Dobbin 1998, xvii; Hadot 1998, 83; More 1923, 107). Oldfather (1925, xxi, n. 1), in the introduction to his translation of the Discourses, remarks that ‘this triple division … is the only notable original element … found in Epictetus, and it is rather a pedagogical device for lucid presentation than an innovation in thought’. Our enthusiasm for this division being wholly original to Epictetus should be tempered with a reading of extracts from Seneca’s Moral Letters (75.8–18 and 89.14–15) where we also find a threefold division of ethics which, although not exactly similar to Epictetus’ scheme, suggests the possibility that both Seneca and Epictetus drew on work by their predecessors that, alas, has not survived. Suffice it so say, what Epictetus teaches by means of his threefold division is wholly in accord with the principles of the early Stoics, but how he does this is uniquely his own method. The programme of study and exercises that Epictetus’ students adhered to was in consequence different from the programme that was taught by his predecessors, but the end result, consisting in the special Stoic outlook on oneself and the world at large and the ability to ‘live the philosophic life’, was the same.

4. Key Concepts

a. The Promise of Philosophy

Epictetus, along with all other philosophers of the Hellenistic period, saw moral philosophy as having the practical purpose of guiding people towards leading better lives. The aim was to live well, to secure for oneself eudaimonia (‘happiness’ or ‘a flourishing life’), and the different schools and philosophers of the period offered differing solutions as to how the eudaimôn life was to be won.

No less true of us today than it was for the ancients, few people are content with life (let alone wholly content), and what contributes to any contentment that may be enjoyed is almost certainly short-lived and transient.

The task for the Stoic teacher commences with the understanding that (probably) everyone is not eudaimôn for much, or even all, of the time; that there is a reason for this being the case and, most importantly, that there are solutions that can remedy this sorry state of affairs.

Indeed, Epictetus metaphorically speaks of his school as being a hospital to which students would come seeking treatments for their ills (Discourses 3.23.30). Each of us, in consequence merely of being human and living in society, is well aware of what comprise these ills. In the course of daily life we are beset by frustrations and setbacks of every conceivable type. Our cherished enterprises are hindered and thwarted, we have to deal with hostile and offensive people, and we have to cope with the difficulties and anxieties occasioned by the setbacks and illnesses visited upon our friends and relations. Sometimes we are ill ourselves, and even those who have the good fortune to enjoy sound health have to face the fact of their own mortality. In the midst of all this, only the rare few are blessed with lasting and rewarding relationships, and even these relationships, along with everything that constitutes a human life, are wholly transient.

But what is philosophy? Does it not mean making preparation to meet the things that come upon us? (Discourses 3.10.6, trans. Oldfather)

The ills we suffer, says Epictetus, result from mistaken beliefs about what is truly good. We have invested our hope in the wrong things, or at least invested it in the wrong way. Our capacity to flourish and be happy (to attain eudaimonia) is entirely dependent upon our own characters, how we dispose ourselves to ourselves, to others, and to events generally. What qualities our characters come to have is completely up to us. Therefore, how well we flourish is also entirely up to us.

b. What is Really Good

The central claim of Stoic ethics is that only the virtues and virtuous activities are good, and that the only evil is vice and actions motivated by vice (see Discourses 2.9.15 and 2.19.13). When someone pursues pleasure or wealth, say, believing these things to be good, the Stoics hold that this person has made a mistake with respect to the nature of the things pursued and the nature of their own being, for the Stoics deny that advantages such as pleasure and health (wealth and status, and so forth) are good, because they do not benefit those who possess them in all circumstances. Virtue, on the other hand, conceived as the capacity to use such advantages wisely, being the only candidate for that which is always beneficial, is held to be the only good thing (see Plato, Euthydemus 278e–281e and Meno 87c–89a).

Thus, the Stoics identify the eudaimôn (‘happy’) life as one that is motivated by virtue. The term we translate as ‘virtue’ (from the Latin virtus) is aretê, and means ‘excellence’. To progress towards excellence as a human being, for Epictetus, means understanding the true nature of one’s being and keeping one’s prohairesis (moral character) in the right condition. Epictetus uses the term aretê only occasionally, and whereas the early Stoics spoke of striving for excellence as what was proper for a rational creature and required for eudaimonia (‘happiness’ or well-being), Epictetus speaks of striving to maintain one’s prohairesis in proper order (see Discourses 1.4.18 and 1.29.1).

Although things such as material comfort, for instance, will be pursued by the Stoic student who seeks eudaimonia, they will do this in a different way from those not living the ‘philosophic life’ – for Stoics claim that everything apart from virtue (what is good) and vice (what is bad) is indifferent, that is, ‘indifferent’ with regard to being good or bad. It is how one makes use of indifferent things that establishes how well one is making progress towards aretê (moral excellence) and a eudaimôn (‘happy’) life.

Indifferent things are either ‘preferred’ or ‘dispreferred’. Preferred are health and wealth, friends and family, and pretty much all those things that most people pursue as desirable for leading a flourishing life. Dispreferred are their opposites: sickness and poverty, social exclusion, and pretty much all those things that people seek to avoid as being detrimental for a flourishing life. Thus, the preferred indifferents have value for a Stoic, but not in terms of their being good: they have an instrumental value with respect to their capacities to contribute to a flourishing life as the objects upon which our virtuous actions are directed (see Discourses 1.29.2). The Stoic does not lament their absence, for their presence is not constitutive of eudaimonia. What is good is the virtuous use one makes of such preferred things should they be to hand, but no less good are one’s virtuous dispositions in living as well as one may, even when they are lacking.

c. What is in our Power

To maintain our prohairesis (moral character) in the proper condition – the successful accomplishment of this being necessary and sufficient for eudaimonia (‘happiness’) – we must understand what is eph’ hêmin (‘in our power’ or ‘up to us’; see Discourses 1.22.9–16). If we do not do this, our prohairesis will remain in a faulty condition, for we will remain convinced that things such as wealth and status are good when they are really indifferent, troubled by frustrations and anxieties, subject to disturbing emotions we do not want and cannot control, all of which make life unpleasant and unrewarding, sometimes overwhelmingly so. This is why Epictetus remarks: ‘This is the proper goal, to practise how to remove from one’s life sorrows and laments, and cries of “Alas” and “Poor me”, and misfortune and disappointment’ (Discourses 1.4.23, trans. Dobbin).

No one is master of another’s prohairesis [moral character], and in this alone lies good and evil. No one, therefore, can secure the good for me, or involve me in evil, but I alone have authority over myself in these matters. (Discourses 4.12.7–8, trans. Dobbin)

What is in our power, then, is the ‘authority over ourselves’ that we have regarding our capacity to judge what is good and what is evil. Outside our power are ‘external things’, which are ‘indifferent’ with respect to being good or evil. These indifferents, as we saw in the previous section, number those things that are conventionally deemed to be good and those that are conventionally deemed to be bad. Roughly, they are things that ‘just happen’, and they are not in our power in the sense that we do not have absolute control to make them occur just as we wish, or to make them have exactly the outcomes that we desire. Thus, for example, sickness is not in our power because it is not wholly up to us whether we get sick, and how often, nor whether we will recover quickly or indeed at all. Now, it makes sense to visit a doctor when we feel ill, but the competence of the doctor is not in our power, and neither is the effectiveness of any treatment that we might be offered. So generally, it makes sense to manage our affairs carefully and responsibly, but the ultimate outcome of any affair is, actually, not in our power.

What is in our power is the capacity to adapt ourselves to all that comes about, to judge anything that is ‘dispreferred’ not as bad, but as indifferent and not strong enough to overwhelm our strength of character.

The Handbook of Epictetus begins with these words:

Some things are up to us [eph’ hêmin] and some things are not up to us. Our opinions are up to us, and our impulses, desires, aversions–in short, whatever is our own doing. Our bodies are not up to us, nor are our possessions, our reputations, or our public offices, or, that is, whatever is not our own doing. (Handbook 1.1, trans. White)

That is, we have power over our own minds. The opinions we hold of things, the intentions we form, what we value and what we are averse to are all wholly up to us. Although we may take precautions, whether our possessions are carried off by a thief is not up us (but the intention to steal, that of course is in the power of the thief), and our reputations, in whatever quarter, must be decided by what other people think of us, and what they do think is up to them. Remaining calm in the face of adversity and controlling our emotions no matter what the provocation (qualities of character that to this day are referred to as ‘being stoical’), are accomplished in the full Stoic sense, for Epictetus, by making proper use of impressions.

d. Making Proper Use of Impressions

To have an impression is to be aware of something in the world. For example, I may look out of my window and have the impression of an airship floating over the houses in the distance. Whether there is really an airship there, half a mile off, or whether there is just a little helium-filled model tied to my garden gate by a bit of string, is a separate question. ‘Making proper use of impressions’ concerns how we move from the first thing, being aware of something or other, to the second thing, making a judgement that something or other is the case. The Stoic stands in sharp contrast to the non-Stoic, for when the latter faces some disaster, say (let us imagine that their briefcase has burst open and their papers are scattered by the wind all along the station platform and onto the track), they will judge this a terrible misfortune and have the appropriate emotional response to match. Epictetus would declare that this person has made the wrong use of their impression.

In the first place, do not allow yourself to be carried away by [the] intensity [of your impression]: but say, ‘Impression, wait for me a little. Let me see what you are, and what you represent. Let me test you.’ Then, afterwards, do not allow it to draw you on by picturing what may come next, for if you do, it will lead you wherever it pleases. But rather, you should introduce some fair and noble impression to replace it, and banish this base and sordid one. (Discourses 2.18.24–5, trans. Hard)

Few non-Stoics, ignorant of Epictetus’ teaching, would do other than rush around after their papers, descending deeper and deeper into a panic, imagining their boss at work giving them a dressing down for losing the papers, making them work extra hours to make good the loss, and perhaps even dismissing them from their job. The Stoic, by contrast, tests their impression to see what the best interpretation should be: losing the papers is a dispreferred indifferent, to be sure, but having an accident of this sort is bound to happen once in a while, and is nothing to be troubled about. They will quietly gather up the papers they can, and instead of panicking with respect to facing their boss, they will rehearse a little speech about having had an accident and what it means to have lost the papers. If their boss erupts in a temper, well, that is a concern for the boss.

Our attaining the eudaimôn (‘happy’) life requires that we judge things in the right way, for ‘what disturbs men’s minds is not events but their judgements on events’ (Handbook 5, trans. Matheson).

Remember that foul words or blows in themselves are no outrage, but your judgement that they are so. So when any one makes you angry, know that it is your own thought that has angered you. Wherefore make it your endeavour not to let your impressions carry you away. For if once you gain time and delay, you will find it easier to control yourself. (Handbook 20, trans. Matheson)

e. The Three Topoi

The three topoi (fields of study) establish activities in which the prokoptôn (Stoic student) applies their Stoic principles; they are practical exercises or disciplines that when successfully followed are constitutive of the eudaimôn (‘happy’) life which all rational beings are capable of attaining.

There are three areas of study, in which a person who is going to be good and noble must be trained. That concerning desires and aversions, so that he may never fail to get what he desires nor fall into what he would avoid. That concerning the impulse to act and not to act, and, in general, appropriate behaviour; so that he may act in an orderly manner and after due consideration, and not carelessly. The third is concerned with freedom from deception and hasty judgement, and, in general, whatever is connected with assent. (Discourses 3.2.1–2, trans. Hard)

Our capacity to employ these disciplines in the course of daily life is eph’ hêmin (‘in our power’ or ‘up to us’) because they depend on our opinions, judgements, intentions and desires which concern the way we regard things over which our prohairesis (moral character) has complete control.

i. The Discipline of Desire

The first discipline concerns what someone striving for excellence as a rational being should truly believe is worthy of desire, which for the Stoics is that which is truly good, virtue and action motivated by virtue.

Of these [three areas of study], the principle, and most urgent, is that which has to do with the passions; for these are produced in no other way than by the disappointment of our desires, and the incurring of our aversions. It is this that introduces disturbances, tumults, misfortunes, and calamities; and causes sorrow, lamentation and envy; and renders us envious and jealous, and thus incapable of listening to reason. (Discourses 3.2.3, trans. Hard)

Epictetus remarks: ‘When I see a man anxious, I say, What does this man want? If he did not want some thing which is not in his power, how could he be anxious?’ (Discourses 2.13.1, trans. Long). Those things that most of us, most of the time, seek after as being desirable, what we consider will make our lives go well, are things that are not in our power, and thus the hope we have for securing these things is placed in the hands of others or in the hands of fate. And when we are thwarted in our efforts to gain what we desire we become frustrated (or depressed or envious or angry, or all of these things). To be afflicted with such ‘passions’, says Epictetus, is the only real source of misery for human beings. Instead of trying to relieve ourselves of these unpleasant emotions by pressing all the harder to secure what we desire, we should rather place our hope not in ‘external’ things that are not in our power, but in our own dispositions and moral character. In short, we should limit our desire to virtue and to becoming (to the best of our capacities) examples of ‘excellence’. If we do not do this, the inevitable result is that we will continue to desire what we may fail to obtain or lose once we have it, and in consequence suffer the unhappiness of emotional disquiet (or worse). And as is the common experience of all people at some time or other, when we are in the grip of such emotions we run the risk of becoming blind to the best course of action, even when construed in terms of pursuing ‘external’ things.

The Stoic prokoptôn, in contrast, sets their hopes on excellence, recognising that this is where their power over things lies. They will still pursue those ‘preferred indifferent external’ things that are needed for fulfilling those functions and projects that they deem appropriate for them as individuals, and those they have obligations to meet. But they will not be distressed at setbacks or failure, nor at obstructive people, nor at other difficulties (illness, for instance), for none of these things is entirely up to them, and they engage in their affairs in full consciousness of this fact. It is in maintaining this consciousness of what is truly good (virtue), and awareness that the indifferent things are beyond their power, that makes this a discipline for the Stoic prokoptôn.

ii. The Discipline of Action

The second discipline concerns our ‘impulses to act and not to act’, that is, our motivations, and answers the question as to what we each should do as an individual in our own unique set of circumstances to successfully fulfil the role of a rational, sociable being who is striving for excellence.

The outcome of our actions is not wholly in our power, but our inclination to act one way rather than another, to pursue one set of objectives rather than others, this is in our power. The Stoics use the analogy of the archer shooting at a target to explain this notion. The ideal, of course, is to hit the centre of the target, though accomplishing this is not entirely in the archer’s power, for she cannot be certain how the wind will deflect the arrow from its path, nor whether her fingers will slip, nor whether (for it is within the bounds of possibility) the bow will break. The excellent archer does all within her power to shoot well, and she recognises that doing her best is the best she can do. The Stoic archer strives to shoot excellently, and will not be disappointed if she shoots well but fails to hit the centre of the target. And so it is in life generally. The non-Stoic views their success in terms of hitting the target, whereas the Stoic views their success in terms of having shot well (see Cicero, On Ends 3.22).

The [second area of study] has to do with appropriate action. For I should not be unfeeling like a statue, but should preserve my natural and acquired relations as a man who honours the gods, as a son, as a brother, as a father, as a citizen. (Discourses 3.2.4, trans. Hard)

Appropriate acts are in general measured by the relations they are concerned with. ‘He is your father.’ This means that you are called upon to take care of him, give way to him in all things, bear with him if he reviles or strikes you.
‘But he is a bad father.’
Well, have you any natural claim to a good father? No, only to a father.
‘My brother wrongs me.’
Be careful then to maintain the relation you hold to him, and do not consider what he does, but what you must do if your purpose is to keep in accord with nature. (Handbook 30, trans. Matheson)

The actions we undertake, Epictetus says, should be motivated by the specific obligations that we have in virtue of who we are, our natural relations to others, and what roles we have adopted in our dealings with the wider community (see Discourses 2.10.7–13). Put simply, our interest to live well as rational beings obliges us to act virtuously, to be patient, considerate, gentle, just, self-disciplined, even-tempered, dispassionate, unperturbed, and when necessary, courageous. This returns us to the central Stoic notion that the eudaimôn (‘happy’) life is realised by those who are motivated by virtue. The Discipline of Action points out to the prokoptôn how this should be applied in our practical affairs.

Epictetus sums up the first two disciplines:

We must have these principles ready to hand. Without them we must do nothing. We must set our mind on this object: pursue nothing that is outside us, nothing that is not our own, even as He that is mighty has ordained: pursuing what lies within our will [prohairetika], and all else [i.e., indifferent things] only so far as it is given to us. Further, we must remember who we are, and by what name we are called, and must try to direct our acts [kathêkonta] to fit each situation and its possibilities.
We must consider what is the time for singing, what the time for play, and in whose presence: what will be unsuited to the occasion; whether our companions are to despise us, or we to despise ourselves: when to jest, and whom to mock at: in a word, how one ought to maintain one’s character in society. Wherever you swerve from any of these principles, you suffer loss at once; not loss from without, but issuing from the very act itself. (Discourses 4.12.15–18, trans. Matheson)

The loss here is of course loss of eudaimonia.

Failing to ‘remember who we are’ will result in our failing to pursue those actions appropriate to our individual circumstances and commitments. Epictetus says that this happens because we forget what ‘name’ we have (son, brother, councillor, etc.), ‘for each of these names, if rightly considered, always points to the acts appropriate to it’ (Discourses 2.10.11, trans. Hard). To progress in the Discipline of Action, then, the prokoptôn must be conscious, moment by moment, of (a) which particular social role they are playing, and (b) which actions are required or appropriate for fulfilling that role to the highest standard.

iii. The Discipline of Assent

This exercise focuses on ‘assenting to impressions’, and continues the discussion already introduced in the section above on making proper use of impressions. ‘Assent’ translates the Greek sunkatathesis, which means ‘approve’, ‘agree’, or ‘go along with’. Thus, when we assent to an impression (phantasia) we are committing ourselves to it as a correct representation of how things are, and are saying, ‘Yes, this is how it is.’ The Discipline of Assent, then, is an exercise applied to our impressions in which we interpret and judge them in order to move from having the impression of something or other, to a declaration that such-and-such is the case.

The third area of study has to do with assent, and what is plausible and attractive. For, just as Socrates used to say that we are not to lead an unexamined life [see Plato, Apology 38a], so neither are we to accept an unexamined impression, but to say, ‘Stop, let me see what you are, and where you come from’, just as the night-watch say, ‘Show me your token.’ (Discourses 3.12.14–15, trans. Hard)

Make it your study then to confront every harsh impression with the words, ‘You are but an impression, and not at all what you seem to be’. Then test it by those rules that you possess; and first by this–the chief test of all–’Is it concerned with what is in our power or with what is not in our power?’ And if it is concerned with what is not in our power, be ready with the answer that it is nothing to you. (Handbook 1.5, trans. Matheson)

And we should do this with a view to avoiding falling prey to subjective (and false) evaluations so that we can be free from deception and from making rash judgements about how to proceed in the first two disciplines. For if we make faulty evaluations we will end up (with respect to the first discipline) having desires for the wrong things (namely, ‘indifferents’), and (with respect to the second discipline) acting inappropriately with regard to our duties and obligations. This is why Epictetus remarks that the third topic ‘concerns the security of the other two’ (Discourses 3.2.5, trans. Long).

Epictetus runs through a number of imaginary situations to show how we should be alert to the dangers of assenting to poorly evaluated impressions:

… We ought … to exercise ourselves daily to meet the impressions of our senses …. So-and-so’s son is dead. Answer, ‘That lies outside the sphere of the moral purpose, it is not an evil.’ His father has disinherited So-and-so; what do you think of it? ‘That lies outside the sphere of the moral purpose, it is not an evil.’ Caesar has condemned him. ‘That lies outside the sphere of the moral purpose, it is not an evil.’ He was grieved at all this. ‘That lies within the sphere of the moral purpose, it is an evil.’ He has borne up under it manfully. ‘That lies within the sphere of the moral purpose, it is a good.’ Now, if we acquire this habit, we shall make progress; for we shall never give our assent to anything but that of which we get a convincing sense-impression. His son is dead. What happened? His son is dead. Nothing else? Not a thing. His ship is lost. What happened? His ship is lost. He was carried off to prison. What happened? He was carried off to prison. But the observation: ‘He has fared ill,’ is an addition that each man makes on his own responsibility. (Discourses 3.8.1–5, trans. Oldfather)

What we must avoid, then, is adding to our impressions immediately and without proper evaluation any notion that something good or bad is at hand. For the only thing that is good is moral virtue, and the only harm that anyone can come to is to engage in affairs motivated by vice. Thus, to see the loss of a ship as a catastrophe would count as assenting to the wrong impression, for the impression that we have is that of just a ship being lost. To take the extra step of declaring that this is a misfortune and harmful would be to assent to an impression that is not in fact present, and would be a mistake. The loss of a ship, for a Stoic, is nothing more than a dispreferred indifferent, and does not constitute a harm.

f. God

For Epictetus, the terms ‘God’, ‘the gods’, and ‘Zeus’ are used interchangeably, and they appear frequently in the Discourses. In the Handbook, God is discussed as the ‘captain’ who calls us back on board ship, the subsequent voyage being a metaphor for our departure from life (see Handbook 7). God is also portrayed as ‘the Giver’ to whom we should return all those things we have enjoyed on loan when we lose close relatives or friends who die, and when we lose our possessions through misfortune (see Discourses 4.10.16 and Handbook 11).

If the Stoic making progress (the prokoptôn) understands God, the universe, and themselves in the right way, they ‘will never blame the gods, nor find fault with them’ (Handbook 31.1, trans. Oldfather):

Will you be angry and discontented with the ordinances of Zeus, which he, with the Fates who spun in his presence the thread of your destiny at the time of your birth, ordained and appointed? (Discourses 1.12.25, trans. Hard)

Indeed, they will pray to God to lead them to the fate that He has assigned them:

Lead me, Zeus, and you too, Destiny,
Wherever I am assigned by you;
I’ll follow and not hesitate,
But even if I do not wish to,
Because I’m bad, I’ll follow anyway.
(Handbook 53, trans. White = extract from Cleanthes’ Hymn to Zeus)

[For] God has stationed us to a certain place and way of life. (Discourses 1.9.24, trans. Dobbin)

Epictetus presents orthodox Stoic views on God. His justification for believing in God is expressed essentially along the lines of what we recognise as an argument from design. The order and harmony that we can perceive in the natural world (from astronomical events to the way plants grow and fruit in season) is attributed to a divine providence that orders and controls the entire cosmos intelligently and rationally (see Discourses 1.6.1–11, 1.14.1–6, 1.16.7–8 and 2.14.11/25–7). The Stoics were materialists, and God is conceived of as a type of fiery breath that blends perfectly with all other matter in the universe. In doing this, God transforms matter from undifferentiated ‘stuff’ into the varied forms that we see around us. This process is continuous, and God makes the world as it is, doing what it does, moment by moment. Just as the soul of a person is understood to bring alive and animate what would otherwise be dead and inert matter, so God is thought of as the ‘soul of the world’, and the universe is thought of as a sort of animal.

Stoics hold that the mind of each person is quite literally a fragment (apospasma) of God (see Discourses 2.8.11), and that the rationality that we each possess is in fact a fragment of God’s rationality; and this Epictetus primarily identifies as the capacity we have to make proper use of impressions (see Discourses 1.1.12). Epictetus expresses this in terms of what God has ‘given us’; He is conceived of as having constructed the universe in such a way that we have in our possession all that is within the compass of our own character or moral choice and nothing else, but this is no reason for complaint:

What has He given me for my own and subject to my authority, and what has He left for Himself? Everything within the sphere of the moral purpose He has given me, subjected them to my control, unhampered and unhindered. My body that is made of clay, how could He make that unhindered? Accordingly He has made it subject to the revolution of the universe–[along with] my property, my furniture, my house, my children, my wife. … But how should I keep them? In accordance with the terms upon which they have been given, and for as long as they can be given. But He who gave also takes away. …
And so, when you have received everything, and your very self, from Another [i.e., God], do you yet complain and blame the Giver, if He take something away from you? (Discourses 4.1.100–3, with omissions, trans. Oldfather)

The capacity that the prokoptôn has for understanding, accepting, and embracing this state of affairs, that this is indeed the nature of things, is another of the main foundation stones of Stoic ethics.

g. On Living in Accord with Nature

The outlook adopted and the activities performed by the Stoic student in pursuit of excellence, as detailed in the sections above, are frequently referred to collectively by Epictetus (following the Stoic tradition) as ‘following nature’ or ‘living in harmony with nature’. The Stoic prokoptôn maintains his ‘harmony with nature’ by being aware of why he acts as he does in terms of both (a) what his appropriate actions are, and (b) accepting what fate brings. If, for example, the prokoptôn is berated unfairly by his brother, he will not respond with angry indignation, for this would be ‘contrary to nature’, for nature has determined how brothers should rightly act towards each other (see Discourses 3.10.19–20). The task the Stoic student shoulders is to pursue actions appropriate to him as a brother, despite all and any provocation to act otherwise (see Handbook 30). This, for Epictetus, is a major component of what it means to keep one’s prohairesis (moral character) in harmony with nature (see Discourses 1.6.15, 3.1.25 and 3.16.15).

Keeping ourselves in harmony with nature requires that we focus on two things. Firstly, we must pay attention to our own actions so that we respond appropriately, and secondly we must pay attention to the world in which our actions take effect and which prompts those actions in the first place.

When you are about to undertake some action, remind yourself what sort of action it is. If you are going out for a bath, put before your mind what commonly happens at the baths: some people splashing you, some people jostling, others being abusive, and others stealing. So you will undertake this action more securely if you say to yourself, ‘I want to have a bath and also to keep my choice [prohairesis] in harmony with nature.’ And do likewise in everything you undertake. So, if anything gets in your way when you are having your bath, you will be ready to say, ‘I wanted not only to have a bath but also to keep my choice [prohairesis] in harmony with nature; and I shall not keep it so if I get angry at what happens.’ (Handbook 4, trans. Hard)

In this extract about going to the baths, Epictetus focuses more on accepting what fate brings, saying that we should anticipate the sorts of things that can happen, so that when they do we will not be surprised and will not be angry. In other situations, anticipation of trouble or misfortune is impossible, but all the same, the Stoic will accept their fate as what God has ordained for them, and this for Epictetus is the very essence of keeping in harmony with nature (see Discourses 1.4.18–21).

It is circumstances (difficulties) which show what men are. Therefore when a difficulty falls upon you, remember that God, like a trainer of wrestlers, has matched you with a rough young man. For what purpose? you may say. Why, that you may become an Olympic conqueror; but it is not accomplished without sweat. In my opinion no man has had a more profitable difficulty than you have had, if you choose to make use of it as an athlete would deal with a young antagonist. (Discourses 1.24.1–2, trans. Long)

Every problem we face in life should be understood as a new opportunity to strengthen our moral character, just as every new bout for the wrestler provides an opportunity for them to train their skill in wrestling.

To be instructed is this, to learn to wish that every thing may happen as it does. And how do things happen? As the disposer [i.e., God] has disposed them. And he has appointed summer and winter, and abundance and scarcity, and virtue and vice, and all such opposites for the harmony of the whole; and to each of us he has given a body, and parts of the body, and possessions, and companions.
Remembering then this disposition of things, we ought to go to be instructed, not that we may change the constitution of things, – for we have not the power to do it, nor is it better that we should have the power, – but in order that, as the things around us are what they are and by nature exist, we may maintain our minds in harmony with the things which happen. (Discourses 1.12.15–17, trans. Long)

The wise and good man … submits his own mind to him who administers the whole [i.e., God], as good citizens do to the law of the state. He who is receiving instruction ought to come to be instructed with this intention, How shall I follow the gods in all things, how shall I be contented with the divine administration, and how can I become free? For he is free to whom every thing happens according to his will [prohairesis], and whom no man can hinder. (Discourses 1.12.7–9, trans. Long)

In this last extract we see Epictetus refer to the ideal Stoic practice as that of ‘following the gods’. This means essentially the same as ‘following nature’, for God, who is immanent in the world (as the Stoics understand it) is identified with the way the world manifests, so if one follows nature, one must also be following God (see Discourses 1.20.15, 1.30.4, 4.7.20 and 4.10.14).

h. Metaphors for Life

Epictetus employs a number of metaphors to illustrate what the Stoic attitude to life should be.

Life as a festival

In this metaphor, Epictetus encourages us to think of life as a festival, arranged for our benefit by God, as something that we can live through joyously, able to put up with any hardships that befall us because we have our eye on the larger spectacle that is taking place. He asks his students:

Who are you, and for what purpose have you come? Was it not he [i.e., God] who brought you here? … And as what did he bring you here? Was it not as a mortal? Was it not as one who would live, with a little portion of flesh, upon this earth, and behold his governance and take part with him, for a short time, in his pageant and his festival? (Discourses 4.1.104, trans. Hard)

The whole thrust of Stoic ethics aims to persuade us that we should ourselves contribute to the festival by living as well as we may and fulfilling our duties as sociable citizens of God’s ‘great city of the universe’ (Discourses 3.22.4, trans. Hard). (See also Discourses 1.12.21, 2.14.23 and 4.4.24–7/46.)

Life as a game

At Discourses 2.5.2, in encouraging his students to appreciate that external things are indifferent (being neither good nor bad), Epictetus says that we should imitate those who play dice, for neither the dice nor the counters have any real value; what matters, and what is either good or bad, is the way we play the game. Similarly at 2.5.15–20, where Epictetus discusses the example of playing a ball game, no one considers for a moment whether the ball itself is good or bad, but only whether they can throw and catch it with the appropriate skill. What matters are the faculties of dexterity, speed and good judgement exhibited by the players, for it is in deploying these faculties effectively that any player is deemed to have played well. (See also Discourses 4.7.5/19/30–1.) Epictetus also uses the metaphor of playing games when discussing suicide, for just as someone stops playing a game when they are no longer amused by it, so it should be in life generally: if life should become unbearable, no one can force us to keep living it.

To summarize: remember that the door is open. Do not be more cowardly than children, but just as they say, when the game no longer pleases them, ‘I will play no more,’ you too, when things seem that way to you, should merely say, ‘I will play no more,’ and so depart; but if you stay, stop moaning. (Discourses 1.24.20, trans. Hard; see also 1.25.7–21 and 2.16.37)

Life as weaving

In this metaphor, the wool that the weaver uses to make cloth takes the place of the ball in the game; that is, whatever material comes our way, it is our duty to make proper use of it, and if possible make it into the best thing of its kind as we can (see Discourses 2.5.21–2).

Life as a play

We have already seen, when discussing the Discipline of Action, that Epictetus urges us to ‘remember who we are’ and what ‘name’ we have, because what role we play in life will determine which actions are appropriate for us. Obviously, the metaphor of life as a play expands on this idea, but also brings in the notion of our having to accept our fate, whatever that may be, since we do not ourselves chose the role we must play (for although we may aim for one role rather than another, we must recognise that our attaining it is not, in fact, ‘in our power’).

Remember that you are an actor in a play, which is as the author [i.e., God] wants it to be: short, if he wants it to be short; long, if he wants it to be long. If he wants you to act a poor man, a cripple, a public official, or a private person, see that you act it with skill. For it is your job to act well the part that is assigned to you; but to choose it is another’s. (Handbook 17, trans. Hard)

Life as an athletic contest

This metaphor invites us to see an analogy between one’s training in Stoic ethics as preparatory for living the philosophic life and someone’s training in athletics as preparatory for entering the contest in the arena. Epictetus addresses someone who has become distressed at not having enough leisure to study their philosophy books, saying:

For is not reading a kind of preparation for living, but living itself made up of things other than books? It is as if an athlete, when he enters the stadium, should break down and weep because he is not exercising outside. This is what you were exercising for; this is what the jumping-weights, and the sand and your young partners were all for. So are you now seeking for these, when it is the time for action? That is just as if, in the sphere of assent, when we are presented with impressions, some of which are evidently true and others not, instead of distinguishing between them, we should want to read a treatise On Direct Apprehension. (Discourses 4.4.11–13, trans. Hard)

Training to live a life that befits someone who strives for the Stoic ideal is directly compared to athletic training. Such training is difficult, demanding, and unpleasant; there is little point in showing eagerness for any endeavour if we have not properly assessed the demands that will be placed upon us, and in inevitably losing our original enthusiasm we will look foolish. This applies to philosophic training no less than to training as a wrestler in preparation for competing in the Olympic games (see Discourses 3.15.1–13 = Handbook 29). Elsewhere, Epictetus declares that delay is no longer possible, that we must meet the challenges that life throws at us:

Therefore take the decision right now that you must live as a full-grown man, as a man who is making progress; and all that appears to be best must be to you a law that cannot be transgressed. And if you are confronted with a hard task or with something pleasant, or with something held in high repute or no repute, remember that the contest is now, and that the Olympic games are now, and that it is no longer possible to delay the match, and that progress is lost and saved as a result of one defeat and even one moment of giving in. (Handbook 51.2, trans. Boter; see also Discourses 1.4.13–17, 1.18.21–3, 1.24.1–2 and 3.25.3)

Life as military service

This metaphor returns us to the Stoic idea that the universe is governed by God, and that, like it or not, we are all in service to God. The Stoic prokoptôn (student making progress) should understand that they should live life attempting to discharge this service to the highest standards. Epictetus addresses the person who is upset that they are obliged to travel abroad, causing their mother to be distressed at their absence.

Do you not know that life is a soldier’s service? One man must keep guard, another go out to reconnoitre, another take the field. It is not possible for all to stay where they are, nor is it better so. But you neglect to fulfil the orders of the general and complain, when some severe order is laid upon you; you do not understand to what a pitiful state you are bringing the army so far as in you lies; you do not see that if all follow your example there will be no one to dig a trench, or raise a palisade, no one to keep night watch or fight in the field, but every one will seem an unserviceable soldier.
… So too it is in the world; each man’s life is a campaign, and a long and varied one. It is for you to play the soldier’s part–do everything at the General’s bidding, divining his wishes, if it be possible. (Discourses 3.24.31–5, trans. Matheson; see also 1.9.24 and 1.16.20–1)

i. Making Progress

In making progress, the Stoic prokoptôn will pay a price. In standing to God, the world, society, herself and her undertakings in this new way (by accepting the Stoic notions of what is truly good, what is truly up to her, where her proper duties lie, and in considering her life to be one of service to God), the prokoptôn separates herself from the rest of society in fairly marked, if not profound, ways. For example, Epictetus wants his students to enjoy and participate in the ‘festival of life’, yet at the public games (for instance) they must not support any one individual, but must wish the winner to be he who actually wins; they must refrain entirely from shouting or laughing, and must not get carried away by the spectacle of the contest (Handbook 33.10). So whilst the prokoptôn’s friends immerse themselves fully in the games, cheering on their man and jeering at his opponent, the Stoic stands aloof and detached. Deliberately separating herself from the crowd is the price she pays for well-being (eudaimonia), dispassion (apatheia), tranquillity and imperturbability (ataraxia), along with the conviction that she is living as God intends.

But having declared her hand, the prokoptôn will pay in other ways also, for those around her will rebuke and ridicule her (Handbook 22), for in abandoning the values and practices common to the wider community, she will provoke hostility and suspicion. Yet there remains the hope that some at least will see the prokoptôn as someone whose wisdom has value for the community at large, as someone who serves as an example of how one may get along in the world without being overwhelmed by it, as someone with specific skills to offer, such as mediating family disputes and suchlike (see Discourses 1.15.5).

Epictetus characterises the differences between the non-philosopher and someone making progress in these terms:

This is the position and character of a layman: He never looks for either help or harm from himself, but only from externals. This is the position and character of the philosopher: He looks for all his help or harm from himself.
Signs of one who is making progress are: He censures no one, praises no one, blames no one, finds fault with no one, says nothing about himself as though he were somebody or knew something. When he is hampered or prevented, he blames himself. And if anyone compliments him, he smiles to himself at the person complimenting; while if anyone censures him, he makes no defence. He goes about like an invalid, being careful not to disturb, before it has grown firm, any part which is getting well. He has put away from himself every desire, and has transferred his aversion to those things only, of what is under our control [eph’ hêmin], which are contrary to nature. He exercises no pronounced choice in regard to anything. If he gives the appearance of being foolish or ignorant he does not care. In a word, he keeps guard against himself as though he were his own enemy lying in wait. (Handbook 48.1–3, trans. Oldfather)

Epictetus’ life as a Stoic teacher can perhaps be regarded as a personal quest to awaken to true philosophic enlightenment that person who will stand up proudly when his teacher pleads:

Pray, let somebody show me a person who is in such a good way that he can say, ‘I concern myself only with what is my own, with what is free from hindrance, and is by nature free. That is what is truly good, and this I have. But let all else be as god may grant; it makes no difference to me.’ (Discourses 4.13.24, trans. Hard)

For having attained such enlightenment himself (for surely this we must suppose), Epictetus devoted his life to raising up others from the crowd of humanity who could stand beside him and share in a perception of the universe and a way of life that any rational being is obliged to adopt in virtue of the nature of things.

5. Glossary of Terms

adiaphora

‘indifferent’; any of those things that are neither good or bad, everything, in fact, that does not fall under the headings ‘virtue’ or ‘vice’. The indifferents are what those lacking Stoic wisdom frequently take to have value (either positive or negative), and hence take to be desirable or undesirable. Pursuing them, or trying to avoid them, can lead to disturbing emotions that undermine one’s capacity to lead a eudaimôn life.

apatheia

freedom from passion, a constituent of the eudaimôn life.

aphormê

aversion; the opposite of hormê.

apoproêgmena

any ‘dispreferred’ indifferent, including such things as sickness, poverty, social exclusion, and so forth (conventionally ‘bad’ things). Suffering any of the dispreferred indifferents does not detract from the eudaimôn life enjoyed by the Stoic sophos. See proêgmena.

appropriate action

see kathêkon.

aretê

‘excellence’ or virtue; in the context of Stoic ethics the possession of ‘moral excellence’ will secure eudaimonia. For Epictetus, one acquires this by learning the correct use of impressions, following God, and following nature.

askesis

training or exercise undertaken by the Stoic prokoptôn striving to become a Stoic sophos.

assent

see sunkatathesis and phantasiai (impressions).

ataraxia

imperturbability, literally ‘without trouble’, sometimes translated as ‘tranquillity’; a state of mind that is a constituent of the eudaimôn life.

duty

see kathêkon.

ekklisis

avoidance; opposite of orexis.

ektos

‘external’; any of those things that fall outside the preserve of one’s prohairesis, including health, wealth, sickness, life, death, pain – what Epictetus calls aprohaireta, which are not in our power, the ‘indifferent‘ things.

emotion

see pathos.

end

see telos.

eph’ hêmin

what is in our power, or ‘up to us’ – namely, the correct use of impressions.

eudaimonia

‘happiness’ or ‘flourishing’ or ‘living well’. One achieves this end by learning the correct use of impressions following God, and following nature.

eupatheiai

‘good feelings’, possessed by the Stoic wise person (sophos) who experiences these special sorts of emotions, but does not experience irrational and disturbing passions.

excellence

see aretê.

external thing

see ektos.

God

see theos.

hêgemonikon

‘commanding faculty’ of the soul (psuchê); the centre of consciousness, the seat of all mental states, thought by the Stoics (and other ancients) to be located in the heart. It manifests four mental powers: the capacity to receive impressions, to assent to them, form intentions to act in response to them, and to do these things rationally. The Discourses talk of keeping the prohairesis in the right condition, and also of keeping the hêgemonikon in the right condition, and for Epictetus these notions are essentially interchangeable.

hormê

impulse to act; that which motivates an action.

impressions

see phantasiai.

indifferents

see adiaphora.

kathêkon

any ‘appropriate action’, ‘proper function’, or ‘duty’ undertaken by someone aiming to do what befits them as a responsible, sociable person. The appropriate actions are the subject of the second of the three topoi.

katorthôma

a ‘right action’ or ‘perfect action’ undertaken by the Stoic sophos, constituted by an appropriate action performed virtuously.

orexis

‘desire’ properly directed only at virtue.

passion

see pathos.

pathos

any of the disturbing emotions or ‘passions’ experienced by those who lack Stoic wisdom and believe that externals really are good or bad, when in fact they are ‘indifferent‘. A pathos according to the Stoics is a false judgement based on a misunderstanding of what is truly good and bad.

phantasiai

‘impressions’, what we are aware of in virtue of having experiences. Whereas non-rational animals respond to their impressions automatically (thus ‘using’ them), over and above using our impressions, human beings, being rational, can ‘attend to their use’ and, with practice, assent or not assent to them as we deem appropriate. The capacity to do this is what Epictetus strives to teach his students.

phusis

nature. To acquire eudaimonia one must ‘follow nature’, which means accepting our own fate and the fate of the world, as well as understanding what it means to be a rational being and strive for virtue. See aretê and God.

proêgmena

any ‘preferred’ indifferent, conventionally taken to be good, including such things as health and wealth, taking pleasure in the company of others, and so forth. Enjoying any of the preferred indifferents is not in itself constitutive of the eudaimôn life sought by the Stoic prokoptôn. See apoproêgmena.

prohairesis

‘moral character’, the capacity that rational beings have for making choices and intending the outcomes of their actions, sometimes translated as will, volition, intention, choice, moral choice, moral purpose. This faculty is understood by Stoics to be essentially rational. It is the faculty we use to ‘attend to impressions‘ and to give (or withhold) assent to impressions.

prokoptôn

one who is making progress (prokopê) in living as a Stoic, which for Epictetus means above all learning the correct use of impressions.

proper function

see kathêkon.

right action

see katorthôma.

sophos

the Stoic wise person who values only aretê and enjoys a eudaimôn life. The sophos enjoys a way of engaging in life that the prokoptôn strives to emulate and attain.

sunkatathesis

assent; a capacity of the prohairesis to judge the significance of impressions.

tarachê

disturbance, trouble; what one avoids when one enjoys ataraxia.

telos

end; that which we should pursue for its own sake and not for the sake of any other thing. For the Stoic, this is virtue. Epictetus formulates the end in several different but closely related ways. He says that the end is to maintain one’s prohairesis in proper order, to follow God, and to follow nature, all of which count as maintaining a eudaimôn life. The means by which this is to be accomplished is to apply oneself to the ‘three disciplines‘ assiduously.

theos

God, who is material, is a sort of fiery breath that blends with undifferentiated matter to create the forms that we find in the world around us. He is supremely rational, and despite our feelings to the contrary, makes the best world that it is possible to make. Epictetus says that we should ‘follow God’, that is, accept the fate that He bestows on us and on the world. Stoics understand that the rationality enjoyed by every human being (and any other rational beings, should there be any) is literally a fragment of God.

topoi

‘topics’. The ‘three topics’ or ‘fields of study’ which we find elucidated in the Discourses is an original feature of Epictetus’ educational programme. The three fields of study are: (1) The Discipline of Desire, concerned with desire and avoidance (orexis and ekklisis), and what is really good and desirable (virtue, using impressions properly, following God, and following nature); (2) The Discipline of Action, concerned with impulse and aversion (hormê and aphormê), and our ‘appropriate actions‘ or ‘duties’ with respect to living in our communities in ways that befit a rational being; and (3) The Discipline of Assent, concerned with how we should judge our impressions so as not to be carried away by them into anxiety or disturbing emotions with the likelihood of failing in the first two Disciplines.

virtue

from the Latin virtus which translates the Greek aretê, ‘excellence’.

Zeus

the name for God; Epictetus uses the terms ‘Zeus’, ‘God’, and ‘the gods’ interchangeably.

6. References and Further Reading

a. Translations of Epictetus

(Note: ‘Enchiridion‘, ‘Encheiridion‘, ‘Handbook‘, and ‘Manual‘ all refer to the same work. Items in print and currently available are indicated with an asterisk*.)

  • *Boter, Gerard. 1999. The Encheiridion of Epictetus & its Three Christian Adaptations: Transmission & Critical Editions. Leiden: Brill.
  • *Dobbin, Robert. 1998. Epictetus: Discourses Book 1. Oxford: Clarendon.
    • Includes commentary.
  • *Hard, Robin. 1995. The Discourses of Epictetus. ed. with introduction and notes by Christopher Gill. London: Everyman/Dent.
    • Includes the complete Discourses, The Handbook, and Fragments.
  • Higginson, Thomas Wentworth. 1890. The Works of Epictetus Consisting of His Discourses, in Four Books, The Enchiridion, and Fragments. Boston: Little, Brown, & Company.
  • Higginson, Thomas Wentworth. 1944. Epictetus: Discourses and Enchiridion. Roslyn, NY: Walter J. Black.
    • Reprint of the nineteenth-century translation with minor editorial alterations.
  • *Higginson, Thomas Wentworth. 1948. The Enchiridion. Upper Saddle River, NJ: Prentice Hall.
    • Reprint of nineteenth-century translation.
  • *Lobell, Sharon. 1995. Epictetus: The Art of Living. The Classic Manual on Virtue, Happiness, and Effectiveness: A New Interpretation. San Francisco: HarperSanFrancisco.
    • A free paraphrase of the Handbook.
  • Long, George. 1890. The Discourses of Epictetus with the Encheiridion and Fragments. London: George Bell.
    • First published 1848.
  • *Long, George. 1991. Enchiridion. Amherst, NY: Prometheus.
    • Reprint of nineteenth-century translation.
  • Matheson, P. E. 1916. Epictetus: The Discourses and Manual. 2 vols. Oxford: Clarendon Press.
  • *Matson, Wallace I. 1998. Epictetus: Encheiridion. in Louis P. Pojman. ed. Classics of Philosophy: Volume 1, Ancient and Medieval. New York: Oxford University Press.
  • *Oldfather, W. A. 1925, 1928. Epictetus: The Discourses as Reported by Arrian, The Manual, and Fragments. 2 vols. Cambridge, MA: Loeb Classical Library, Harvard University Press.
    • With original Greek text facing English translation.
  • *Saunders, Jason L. ed. 1996. Greek and Roman Philosophy after Aristotle. New York: Free Press.
    • Readings from Epicureanism, Stoicism, Scepticism, Philo, Plotinus, and early Christian thought. Includes P. E. Matheson’s translation of the Manual of Epictetus.
  • *White, Nicholas. 1983. Handbook of Epictetus. Indianapolis: Hackett.
    • A very competent and readable translation, with notes and a helpful, clear introduction.

b. Translations of Hellenistic Philosophers, including the Stoics

  • Inwood, Brad and L. P. Gerson. 1997. Hellenistic Philosophy: Introductory Readings. 2nd edition. Indianapolis: Hackett.
    • Readings from the main schools: Epicureanism, Stoicism and Scepticism.
  • Long, A. A. and D. N. Sedley. 1987. The Hellenistic Philosophers, Volume 1. Cambridge: Cambridge University Press.
    • Readings from the main schools: Epicureanism, Stoicism, Scepticism, and the Academics. Includes commentaries on the readings. This is the standard primary source text. Volume 2 contains the original Greek and Latin.

c. Items that Address Epictetus Specifically

  • Bonhöffer, Adolf Friedrich. 1996. The Ethics of the Stoic Epictetus. trans. William O. Stephens. New York: Peter Lang.
    • A very nicely done translation of this significant nineteenth-century work first published in 1894.
  • Hijmans, B. L. 1959. Askesis: Notes on Epictetus’ Educational System. Assen: Van Gorcum.
  • Long, A. A. 2002. Epictetus: A Stoic and Socratic Guide to Life. Oxford: Oxford University Press.
  • Stephens, William O. 1996. Epictetus on How the Stoic Sage Loves. Oxford Studies in Ancient Philosophy 14: 193–210.
    • A very clear, scholarly survey of Epictetus’ ethics.
  • Stockdale, James Bond. 1993. Courage Under Fire: Testing Epictetus’s Doctrines in a Laboratory of Human Behavior. Stanford: Hoover Institution/Stanford University.
    • An account of how the author used the principles of Stoic ethics to survive the rigors of a Vietnamese prisoner of war camp.
  • Xenakis, Jason. 1969. Epictetus: Philosopher–Therapist. The Hague: Martinus Nijhoff.

d. Items Addressing Stoic Philosophy and/or Hellenistic Ethics Generally

  • Annas, Julia. 1995. The Morality of Happiness. New York: Oxford University Press.
  • Gould, Josiah B. 1970. The Philosophy of Chrysippus. Albany, NY: State University of New York Press.
  • Hadot, Pierre. 1995. Philosophy as a Way of Life. Oxford: Blackwell.
    • Engaging essays on the notion of philosophy as a way of life, with focus on Stoic practice.
  • Hadot, Pierre. 1998. The Inner Citadel: The Mediations of Marcus Aurelius. Cambridge, MA: Harvard University Press.
    • Contains a very helpful chapter on Epictetus.
  • Inwood, Brad. 1985. Ethics and Human Action in Early Stoicism. Oxford: Clarendon Press.
  • Lesses, Glen. 1989. Virtue and the Goods of Fortune in Stoic Moral Theory. Oxford Studies in Ancient Philosophy 7: 95–127.
  • Lesses, Glen. 1993. Austere Friends: The Stoics and Friendship. Apeiron 26: 57–75.
  • Long, A. A. 1986. Hellenistic Philosophy: Stoics, Epicureans, Sceptics. 2nd ed. Berkeley and Los Angeles: University of California Press.
  • More, Paul Elmer. 1923. Hellenistic Philosophies. Princeton, NJ: Princeton University Press.
  • Nussbaum, Martha C. 1994. The Therapy of Desire: Theory and Practice in Hellenistic Ethics. Princeton, NJ: Princeton University Press.
    • Contains very helpful chapters on Stoic ethics from the view point of philosophy as therapy, as the ancients conceived it.
  • Reale, Giovanni. 1990. A History of Ancient Philosophy: 4. The Schools of the Imperial Age. ed. & trans. John R. Catan. Albany, NY: State University of New York Press.
  • Sandbach, F. H. 1989. The Stoics. London: Duckworth and Indianapolis: Hackett.
  • Sharples, R. W. 1996. Stoics, Epicureans, and Sceptics: An Introduction to Hellenistic Philosophy. London: Routledge.
  • Striker, Gisela. 1990. Ataraxia: Happiness as Tranquillity. The Monist 73–1: 97–110. also in Striker 1996
  • Striker, Gisela. 1991. Following Nature: A Study in Stoic Ethics. Oxford Studies in Ancient Philosophy 9: 1–73. also in Striker 1996.
  • Striker, Gisela. 1996. Essays on Hellenistic Epistemology and Ethics. Cambridge: Cambridge University Press.

e. Other items on Hellenistic philosophy generally

  • Algra, Keimpe, et al. eds. 1999. The Cambridge History of Hellenistic Philosophy. Cambridge: Cambridge University Press.
  • Annas, Julia. 1992. Hellenistic Philosophy of Mind. Berkeley and Los Angeles: University of California Press.

Author Information

Keith H. Seddon
Email: k.h.s@btinternet.com
Warnborough College
Ireland

Empedocles (c. 492—432 B.C.E.)

empedoclesEmpedocles (of Acagras in Sicily) was a philosopher and poet: one of the most important of the philosophers working before Socrates (the Presocratics), and a poet of outstanding ability and of great influence upon later poets such as Lucretius. His works On Nature and Purifications (whether they are two poems or only one – see below) exist in more than 150 fragments. He has been regarded variously as a materialist physicist, a shamanic magician, a mystical theologian, a healer, a democratic politician, a living god, and a fraud. To him is attributed the invention of the four-element theory of matter (earth, air, fire, and water), one of the earliest theories of particle physics, put forward seemingly to rescue the phenomenal world from the static monism of Parmenides. Empedocles’ world-view is of a cosmic cycle of eternal change, growth and decay, in which two personified cosmic forces, Love and Strife, engage in an eternal battle for supremacy. In psychology and ethics Empedocles was a follower of Pythagoras, hence a believer in the transmigration of souls, and hence also a vegetarian. He claims to be a daimôn, a divine or potentially divine being, who, having been banished from the immortals gods for ‘three times countless years’ for committing the sin of meat-eating and forced to suffer successive reincarnations in an purificatory journey through the different orders of nature and elements of the cosmos, has now achieved the most perfect of human states and will be reborn as an immortal. He also claims seemingly magical powers including the ability to revive the dead and to control the winds and rains.

Table of Contents

  1. Life
  2. Works
  3. Physics and Cosmology
    1. Physics
    2. Cosmology
  4. Biology
    1. Origin of Species
    2. Embryology
    3. Perception and Thought
  5. Ethics and the Journey of the Soul
    1. The Daimôns and Transmigration of Souls
    2. Meat-Eating and Sin
    3. Theology
    4. Physics and Theology
  6. References and Further Reading
    1. Texts and Commentaries
    2. Studies

1. Life

The most detailed source for Empedocles’ life is Diogenes Laertius’ Lives of the Eminent Philosophers 8.51-75. Perhaps because of his claims to divine status and magical powers a remarkable number of apocryphal stories gathered around the life of Empedocles in antiquity. His death in particular attracted attention and is reported to have occurred in several, clearly bathetic, ways: that he fell overboard from a ship and drowned; that he fell from his carriage, broke his leg and died; that he hanged himself; or the most famous account that, when he felt he was shortly to die and because he wished to appear to have been apotheosized, he leapt into the crater of Etna. In this story the ruse was unfortunately discovered when one of his trademark bronze sandals was thrown up by the volcano.

From more reliable sources it seems that he was born at Acragas in Sicily around 492 B.C.E. and died at the age of sixty. He was the son of a certain Meton, and was from an important and wealthy local aristocratic family: his grandfather, also called Empedocles, is reported to have been victorious in horse-racing at the Olympic Games in 496 B.C.E. It is not known where or with whom he studied philosophy, but various teachers are assigned to him by ancient sources, among them Parmenides, Pythagoras, Xenophanes, Anaxagoras and Anaximander (from whom he is said to have inherited his extravagant mode of dress). Whether or not he was his pupil, Empedocles was certainly very familiar with the work of Parmenides from whom he took the inspiration to write in hexameter verse, and whose physical system he adopts in part, and partly seeks to rectify.

He is reported to have been wealthy and to have kept a train of boy attendants and also to have provided dowries for many girls of Acragas. In dress he affected a purple robe with a golden girdle, bronze sandals, and a Delphic laurel-wreath, and in his manner he was grave and cultivated a regal public persona. These attributes contrast with his political outlook which is uniformly reported to have been actively pro-democratic. He began his political career with the prosecution of two state officials for their arrogant behaviour towards foreign guests which was seen as a sign of incipient tyrannical tendencies. He is also credited with activities against other anti-democratic citizens, and even with putting down an oligarchy and instituting a democracy at Acragas by use of his powers of rhetorical persuasion. Two speeches of his in favour of equality are also mentioned. His surviving poetry certainly shows considerable rhetorical skills, and indeed he is credited by Aristotle with the invention of rhetoric itself. Another report is of his breaking up a shadowy aristocratic political organisation called the ‘Thousand’. As a whole the tradition presents a picture of Empedocles as a popular politician, rhetorician, and champion of democracy and equality. This appears to fit in with the known history of Acragas where after the death of the popular and enlightened tyrant Theron in 473 B.C.E. his son Thrasydaeus proved to be a violent despot. After his forcible removal a democracy was established despite continuing political tensions.

As well as a being a philosopher, poet and politician, Empedocles was famous for his medical skills and healing powers. In his works he presents himself as a wandering healer offering to thousands of eager followers ‘prophecies’ and ‘words of healing for all kinds of illnesses’ (fr. 112 (Fragment numbers are those of Diels-Kranz)). He also promises his addressee Pausanias ‘you will learn remedies (pharmaka) for ills and help against old age’ and even ‘you will lead from Hades the life-force of a dead man’. To what degree this represents the real Empedocles is not known, but a tradition grew up of him as both a renowned physician and a practitioner of more magical cures, or as a charlatan. These stories however, may well derive from Empedocles’ own words in his poetry. On the other hand his work does show considerable interest in biology and especially in embryology and he was eminent enough as a writer on medicine to be attack ed by the writer of the Hippocratic treatise On Ancient Medicine who attempts to separate medicine from philosophy and rejects Empedocles’ work along with all philosophical medical works as irrelevant. The stories of his wonder-working such as curing entire plagues, reviving the dead and controlling the elements are clearly exaggerated at least, but it is becoming clearer, especially since the discovery of the Strasbourg fragments (see below), that, contrary to many former interpretations, Empedocles did not make a clear separation between his philosophy of nature and the more mystical, theological aspects of his philosophy, and so may well have seen no great difference in kind between healing ills through empirical understanding of human physiognomy and healing by means of sacred incantations and ritual purifications. His public as well may have made no great distinction between ‘scientific’ and sacred medicine as is suggested by the account of Empedocles curing a plague by restoring a fresh water-supply, after which he was venerated as a god.

2. Works

Empedocles work survives only in fragments, but luckily in a far greater number than any of the other Presocratics. These fragments are mostly quotations found in other authors such as Aristotle and Plutarch. Although many works, including tragedies and a medical treatise, are attributed to Empedocles by ancient sources no fragments of these have survived, and the extant fragments all come from a work of hexameter poetry traditionally entitled On Nature (Peri Phuseôs) or Physics (Phusika) and some from a possibly separate work called Purifications (Katharmoi). Of these two titles On Nature is by far the better attested and nearly all the fragments which are cited by ancient authors along with the title of the work they came from are attributed to On Nature, while only two are attributed to the Purifications. Because the fragments contain both material that clearly refers to physics and cosmology – the four elements, the cosmic cycle etc. – and also material concerning the fate of the soul, sin and purification, traditionally the former were placed in reconstructions of On Nature, and the latter in the Purifications. Indeed Empedocles’ writings contain ideas and themes that may seem quite incompatible with one another. On Natureas usually reconstructed seemed the work of a mechanist physicist which seeks to replace the traditional gods with four lifeless impersonal elements and two cosmic forces of attraction and repulsion, Love and Strife. The Purifications on the other hand seemed the work of a deeply religious Pythagorean mystic: it was often thought that Empedocles either wrote the Purifications as a move away from the mechanistic materialist position in On Nature, or that the Purifications were an addendum to On Nature, looking at the world from quite a different perspective.

However there have long been doubts about whether there were really two poems or only one poem (perhaps called On Nature and Purifications or with On Nature and Purifications as alternative titles for the same work) which contained both physical and religious material. First, although we may think of a poem called Physics as restricting itself to physical concerns alone, this may well be an anachronistic retrojection of modern rationalistic ideas of a gulf between physics and religion. Further, ancient book titles tend to be generic and there is a long tradition of works called either On Nature (Peri Phuseôs) or Physics (Physika) by various authors, with the earliest attested title for such works being On the Nature of the Universe (Peri Phuseôs tôn Ontôn ‘On the Nature of Things that Exist’), and so neither title may be Empedocles’ own and the two may perhaps be interchangeable different titles for the same work. Although there is still argument on this subject the Strasbourg fragments now suggest strongly that both physical and religious material was originally together in On Nature.

In 1990 the first ancient papyrus fragments of Empedocles were rediscovered at the University of Strasbourg and were published in 1999. Since these were also the first papyrus fragments of any of the Presocratics their discovery caused considerable excitement. Among other important new information they give about Empedocles’ philosophy, with great good fortune fr. a, the longest of the new fragments, was found to be a continuation of the longest of the previously known fragments (fr. 17) and thus now the two together form a continuous text of 69 lines. Fr. 17 is cited by Simplicius as being from book one of On Nature, and again very fortunately Strasbourg fr. a(ii) contains a marginal note by the manuscript copyist identifying line 30 of fr. a(ii) as line 300 of book one of On Nature. Since the Strasbourg fragments seem to have come from a single piece of papyrus, and they also overlap with a formerly known religious fragment usually placed in the Purifications (fr. 1 39) it now seems very likely that Empedocles introduced the themes of sin and purification early on in the physical poem. In fact it can now be argued that all of the fragments of the Purifications can be accommodated in the early part of book one of On Nature.

3. Physics and Cosmology

a. Physics

The foundations of Empedocles’ physics lie in the assumption that there are four ‘elements’ of matter, or ‘roots’ as he calls them, using a botanical metaphor that stresses their creative potential: earth, air, fire and water. These are able to create all things, including all living creatures, by being ‘mixed’ in different combinations and proportions. Each of the elements however, retains its own characteristics in the mixture, and each is eternal and unchanging. The positing of these four roots of matter forms part of a tradition of opposite material creative principles in Presocratic philosophy, but it also has its origins in an attempt to counter the theories of Parmenides who had argued that the world is single and unchanging since nothing can come from nothing and nothing can be destroyed into nothing: the theory known as Eleatic monism. Empedocles’ response was to appropriate Parmenides’ ideas and to use them against themselves. Nothing can come from nothing nor be destroyed into nothing (fr. 12), and therefore, in order to rescue the reality of the phenomenal world, there must be assumed to exist something eternal and unchanging beneath the constant change, growth and decay of the visible world. Empedocles then, transfers the changelessness that Parmenides attributes to the entire world to his four elements, and replaces the static singularity Parmenides’ world with a dynamic plurality. The four elements correspond closely to their expression at the macroscopic level of nature, with the traditional quadripartite division of the cosmos into earth, sea, air, and the fiery aether of the heavenly bodies: these four naturally occurring ‘elements’ of the cosmos clearly represent a fundamental natural division of matter at the largest scale. This division at the macroscopic level of reality is applied reductively at the microscopic level to produce a parallelism between the constituents of matter and the fundamental constituents of the cosmos, but the reduction of the world into four types of material particles does not deny the reality of the world we see, but instead validates it. Empedocles stresses this parallel between the elements at the different levels of reality by using the terms ‘sun’ ‘sea’ and ‘Earth’ interchangeably with ‘fire’, ‘water’ and ‘earth’. Of the four elements, although Empedocles stresses their equality of powers, fire is also granted a special role both in its hardening effect on mixtures of the other elements and also as the fundamental principle of living things.

b. Cosmology

Empedocles also posits two cosmic forces which work upon the elements in both creative and destructive ways. These he personifies as Love (Philia) – a force of attraction and combination – and Strife (Neikos) – a force of repulsion and separation. Whether these cosmic forces are to be envisaged in simply mechanistic terms as descriptions of the way things happen, or as expressions of internal properties of the elements, or as external forces that act upon the elements, is not clear. It is also unclear whether the two forces are to be seen as impersonal mechanistic physical forces or as intelligent divinities that act in purposive ways in creation and destruction. Evidence can be found for all these interpretations. What is clear is that these two forces are engaged in an eternal battle for domination of the cosmos and that they each prevail in turn in an endless cosmic cycle. The details of this cosmic cycle are also difficult to establish, but the most widely accepted interpretation is represented in the following diagram:

EmpGraph

Beginning from the top of the diagram and proceeding clockwise, when Love is completely dominant she draws all the elements fully together into a Sphere in which, although the elements are not fused together into a single mass, each is indistinguishable from the others. The Sphere then, is an a-cosmic state during which no matter can exist, and no life is possible. Then as Love’s power gradually weakens and Strife begins to grow in power, he gradually separates out the elements from the Sphere until there is enough separation for matter to come into existence, for the world to be created and for all life to be born. When Strife has achieved total domination we again get an a-cosmic state in which the elements are separated completely and the world and all life is destroyed in a Whirl. Then Love begins to increase in power and to draw the elements together again, and as she does so the world is again created and life is again born. When Love has achieved full dominan ce we return once more to the sphere. As Empedocles puts it in fr. 17.1-8:

A twofold tale I shall tell: at one time it grew to be one only from many, and at another again it divided to be many from one. There is a double birth of what is mortal, and a double passing away; for the uniting of all things brings one generation into being and destroys it, and the other is reared and scattered as they are again being divided. And these things never cease their continuous exchange of position, at one time all coming together into one through Love, at another again being borne away from each other by Strife’s repulsion.

The cosmos exists in a state of constant flux then, beneath which there is a certain sort of stability in the eternity of the elements. The world is in a constant state of organic evolution, and there appear to be two different creations and two different worlds which have no direct link between them. According the most widely accepted interpretation Empedocles considered that we ourselves inhabit the world under the increasing power of Strife.

4. Biology

Empedocles’ physics have a particularly biological focus as is indicated by his choice of the botanical metaphor of ‘roots’ for what were later called ‘elements’. The term ‘roots’ stresses the creative potential of the roots rather than illustrating the way they create things by being mixed in different combinations: ‘elements’ (stoicheia in Greek, elementa in Latin) is the word for the letters of the alphabet, and is a metaphor that stresses the ability of the elements of matter to form different types of matter by interchange of position just as a limited number of letters are able to form all sorts of different words on the page. To illustrate this aspect of the creative abilities of his roots Empedocles uses an analogy with the way painters can use a limited number of colours to create all sorts of different colours and represent all the different productions of nature.
Fr. 23:

As painters, men well taught by wisdom in the practice of their art, decorate temple offerings when they take in their hands pigments of various colours, and after fitting them in close combination – more of some and less of others – they produce from them shapes resembling all things, creating trees and men and women, animals and birds and water-nourished fish, and long-lived gods too, highest in honor; so let not error convince you in your mind that there is any other source for the countless perishables that are seen, but know this clearly, since the account you have heard is divinely revealed.

Among other aspects, this analogy exhibits Empedocles’ tendency to think about the creative abilities of the elements in terms of their biological products, here a characteristically Empedoclean list of creatures representing the different orders of nature: plants, humans, land animals, birds, and fish, as well as gods. If painters use a mixture of a small number of pigments to produce copies of the works of nature, then the same process is productive of those works of nature. In other ways as well in his presentation of the cosmic cycle and the endless combination and separation of the elements he tends to elide the distinction between the elements and the life-forms they produce. Just as in the parallel he draws between the elements of the cosmos on both microscopic and macroscopic levels, so a close parallel is drawn between living creatures and their constituent elements.

a. Origin of Species

Empedocles presents us with the earliest extant attempt at producing a detailed rational mechanism for the origin of species. Greek traditions include the aetiological myths of the origin of a particular species of animal by transformation from a human being (many of these ancient mythological aetiologies are collected by Ovid in the Metamorphoses). The origins of humans, or of particular heroes, founders of cities or of races is frequently explained by what I term a botanical analogy: they originally emerged autochthonously from the ground just as plants do today, and this is also standard in ancient scientific theories as well: the original spontaneous generation of life from the earth, with all creatures emerging in their present species. Empedocles attempts to provide a comprehensive mechanism for the origins not simply of humans or of a particular animal but of all animal life, including humans, and a rational mechanism that would seem to do away with the need for any design in creatures or any external agency to order them and separate them into their individual species.

In Strasbourg fr. a(ii) 23-30 we now find the following lines in which Empedocles seemingly introduces his account of zoogony:

I will show you to your eyes too, where they find a larger body: first the coming together and the unfolding of birth, and as many as are now remaining of this generation. This [is to be seen] among the wilder species of mountain-roaming beasts; this [is to be seen] in the twofold offspring of men, this [is to be seen] in the produce of the root-bearing fields and of the cluster of grapes mounting on the vine. From these convey to your mind unerring proofs of my account: for you will see the coming together and unfolding of birth.

Empedocles promises an exposition of zoogony and the origin of species which, from the examples he gives – wild animals, humans and plants – is clearly intended to encompass all animal and plant life, including humans. He appeals to present day species as proofs of his theories: we can see both the products of this process of zoogony around us in nature today and also, it seems, we can see the same processes still going on today. That the theory refers to present day species rather than creatures in some counter world is underlined by the stress Empedocles puts on ‘as many as are now remaining of this generation’. So the theory is intended to explain the origin and development of all life and refers specifically to the animals and plants around us today, both as examples of and as proofs of the theory he will propose. This process of generation he describes by the repeated ‘the coming together and the unfolding of birth’. This seems to posit two processes which work, either together or separately, to produce the life we see around us today: a process of coming together and also a process of unfolding or perhaps more strictly ‘unleafing’ since the metaphor originates from the leaves of plants. So the second part of this process of zoogony involves a botanical metaphor: just as in the traditional botanical analogy of the myths of autochthony, an appeal to the development and growth of plants is used to describe the process of the development of all life.

According to fragments B57, B59, B60, and B61, first of all individual limbs and organs were produced from the earth. These wandered separately at first and then under the combining power of Love they came together in all sorts of wild and seemingly random hybrid combinations, producing double fronted creatures, hermaphrodites, ox-faced man creatures and man-faced ox-creatures. This weird picture is explained by Aristotle in the Physics and later in more detail by Simplicius in his commentary on the Physics as a theory of the origin of species in which, as we would put it, a certain form of natural selection is operative. The creatures assembled wrongly from parts of disparate animals will die out, either immediately, or by being unable to breed, and only the creatures by chance put together from homogeneous limbs will survive and so go on to found the species that we see today. The production of species and their ordering then is explained by a mechanistic process long recognised as a forerunner of Darwin’s theory of natural selection. Unlike in Darwin’s theory however, there would seem to be no gradual evolution of one species into another, and all of the variety of nature is produced in a great burst of birth in the beginning and is then whittled down by extinctions into the creatures we see today. That this theory intends to account for the origins of both humans and animals is ensured by the component parts of the ox-headed man-creatures and man-headed ox-creatures. There will clearly also be created by this system man-headed man-creatures and ox-headed ox-creatures, that is to say normal oxen and normal humans, although they are not mentioned. Further evidence that this zoogony relates to present day creatures is given by Aristotle and Simplicius who tell us that this process is still going on today.

However, Empedocles also adds to this theory another explanation of the origins of humans very much along the lines of traditional myths of autochthony. In fr. B62 and Strasbourg fr. d he describes the ‘shoots’ of men and women arising from the earth, drawn up by fire as it separates out from the other elements during the creation under the power of increasing Strife. As his choice of the word ‘shoots’ indicates these are not yet fully articulated people with distinct limbs but ‘whole-nature forms’ that ‘did not as yet show the lovely shape of limbs, or voice or language native to man’. We may assume that as Strife increases in power these ‘shoots’ will, just as plant buds do, gradually become fully articulated with distinct limbs and features. So human origins are accounted for by a botanical analogy, with humans as biological productions of the earth itself. This theory is also intended to account for modern-day as humans, as Strasbourg fr. d tells us ‘even now daylight beholds their remains’. So both the creation under Love and the creation under Strife refer to the origins of modern plants, animals, and humans. This is problematic since according to the picture of the cosmic cycle given above the world created by Strife is quite separate from that created by Love, and two quite different explanations are given by Empedocles for each creation of life. Various attempts have been made to account for this, including a radical revision of the cosmic cycle in order to allow both creations of life to take place within the same world, and also seeing the two different worlds of the cosmic cycle as more useful devices for examining different aspects of creation separately than absolutely chronologically separate phases of a cycle: the work of Love in combining creatures and the work of Strife in articulating them would then actually take place at the same time, but are simply described as operative in chronologically separate phases.

b. Embryology

Empedocles is an exponent of the pangenetic theory of embryology. In this theory inheritance of characteristics from both mother and father is explained by each of the two parents’ limbs and organs creating tiny copies of themselves. These miniature limbs and organs then flow together in the generative seed and when the two seeds combine in the womb the father’s seed may provide the model for the nose, while the mother’s seed the model for the eyes and so on. This is an elegant way of accounting for inheritance of characteristics, but this is unlikely to be the whole story. As Aristotle points out there are strong conceptual similarities between Empedocles’ embryology and the creation under Love in which we see the coming together of pre-formed limbs creating life. So Empedocles thinks of the original formation of animals as a process analogous to the present day formation of the embryo in the womb. From his description in Strasbourg fr. a (ii) 23-30 ‘the coming together and unfolding of birth’ we seem to have two processes that are at work in the formation of both present day creatures and the original creation of life. The ‘coming together’ describes both the original coming together of the limbs of the first creatures and also the coming together of the tiny limbs in conception. The other side of the creative process, the ‘unfolding’ is illustrated by the creation under Strife of the ‘shoots of men and pitiable women’ whose limbs are at first not fully articulated or defined: they will undergo a process of ‘unfolding’ just like plant buds and become fully developed humans. This ‘unfolding’ is clearly paralleled in embryology by the gradual development and growth of the embryo in the womb. Therefore it may be best to think of the tiny limbs and organs contained in the generative seed not as fully developed limbs and organs, but as the genetic material that contains the potential for the development of limbs and organs. This is so mewhat speculative, but would provide Empedocles with a much more nearly truly evolutionary theory of the origin of species than had previously been ascribed to him. Certainly the differentiation into the two sexes is described in terms of potential: the warmth of the womb determines whether the embryo will be male or female, cf. fr B 65: ‘They were poured in pure places; some met with cold and became women’, fr. B 67: ‘For the male was warmer . . . this is the reason why men are dark, more powerfully built, and hairier’. It may be that other characteristics are also determined or informed by environmental factors as well.

c. Perception and Thought

Empedocles seems to have been the first philosopher to give a detailed explanation of the mechanism by which we perceive things. His theory, criticised by Aristotle and Theophrastus, is that all things give off effluences and that these enter pores in the sense organs. The pores and the effluences will be of varying shapes and sizes and so only certain effluences enter certain sense-organs if they meet pores of the correct size and shape to admit them. Further, perception is achieved by the attraction of similars: we perceive light colours with fire in the eye, dark colours with water, smell is achieved by the presence of breath in the nostrils etc.

As Theophrastus complains, perception is closely linked to thought by Empedocles, cf. fr. B109:

With earth, we perceive earth, with water water, with air divine fire, with fire destructive fire, with love love, and strife with baneful strife.

fr. B 107:

All things are fitted together and constructed out of these, and by means of them they think and feel pleasure and pain.

In B 109 Empedocles moves from perception of physical elements to ethical perceptions using the same theory of perception by similars, while in B 107 we can see the theory used to account more directly for thought itself. Hence for Empedocles there is a close link between what we perceive and what we think. Further our thoughts will also be affected by our own physical constitutions (B 108). This process of the attraction of like to like is operative from the most fundamental level with the parts of the roots of matter being attracted to their like, right up to the highest level of the purest mixture which is the highest form of thought. Hence it seems that everything in nature has a share in perception and intelligence, cf. fr. 110.10: ‘know that all things have intelligence and a share of thought’.

5. Ethics and the Journey of the Soul

a. The Daimôns and Transmigration of Souls

Plutarch cites the following fragment as coming from ‘the beginning of Empedocles’ philosophy’, fr. B 115:

There is a decree of necessity, ratified long ago by gods, eternal and sealed by broad oaths, that whenever one in error, from fear, defiles his own limbs, having by his error made false the oath he swore – daimôns to whom life long-lasting is apportioned – he wanders from the blessed ones for three-times countless years, being born throughout the time as all kinds of mortal forms, exchanging one hard way of life for another. For the force of air pursues him into the sea, and sea spits him out onto earth’s surface, earth casts him in the rays of blazing sun, and sun into the eddies of air; one takes him from another, and all abhor him. I too am now one of these, an exile from the gods and a wanderer, having put my trust in raving Strife.

Traditionally Plutarch’s seeming attribution of this fragment to On Naturewas assumed to be incorrect and it was placed in the Purifications instead. However from the evidence of the Strasbourg fragments it seems that it may well be that Plutarch was correct, since they contain a description of the details of the sin Empedocles accuses himself of in fr. 115, cf. Strasbourg fr. d lines 5-6:

‘Alas that merciless day did not destroy me sooner, before I devised with my claws terrible deeds for the sake of food’

In fr. 115 Empedocles describes himself as a ‘daimôn’, a being to whom long life has been granted, but who has committed the sin of meat-eating and bloodshed and consequently is punished by banishment from the company of the immortal gods. The banishment lasts three myriads of years, either ‘three-times countless years’ or thirty thousand years. In either case he must atone for his sin by being repeatedly reincarnated into all the different living forms of the different orders of nature. Elsewhere he says: ‘For before now I have been at some time boy and girl, bush, bird, and a mute fish in the sea’ (fr. B 117). Empedocles then, has already suffered this nearly endless cycle of reincarnations having been seemingly hurled down to the lowest rung of the scale of nature but has worked his way up, has been purified at last and, as he tells us in fr. B. 112, is himself now an immortal god. There are others too numbered among the daimôns, those who ‘at the end … come among men on earth as prophets, minstrels, physicians and leaders, and from these they arise as gods, highest in honour.’ (fr. 146). It is not entirely clear whether we are meant to imagine the daimôns as an entirely separate class of blessed being with a different creation and a different fate from ourselves, the ordinary mortals, or as people who began as ordinary mortals but who, having purified themselves and having achieved perfection, are now approaching divine status. The latter reading would perhaps make more sense in terms of Empedocles’ didactic ethical mission: if we are all potentially perfectable, then his purificatory teaching becomes much more crucial. Empedocles himself, as his life shows, has achieved all four of the states that qualify the daimôns for immortality, he is a prophet, a minstrel, a physician and a leader, and can now pass on his wisdom to those on earth whom he is about to leave behind when he rejoins the company of the immortals. As can be seen from the description above, there are strong similarities between Empedocles and the teachings of Pythagoras on the transmigration of souls. Empedocles is clearly a follower of Pythagoras, in his ethics and psychology at least, and shares his vegetarianism and pacifism.

b. Meat-Eating and Sin

Slaughter and meat-eating are the most terrible of sins, indeed for him animal slaughter is murder and meat-eating is cannibalism, as shown by fr. 137:

The father will lift up his dear son in changed form, and blind fool, as he prays he will slay him, and those who take part in the sacrifice bring the victim as he pleads. But the father, deaf to his cries, slays him in his house and prepares an evil feast. In the same way son seizes father, and children their mother, and having bereaved them of life devour the flesh of those they love.

Here, in terms reminiscent of Hesiod’s description of the coming horrors of the Iron Age in Works and Days, we see the appalling consequences of meat-eating: murder, cannibalism, the destruction of whole families and, by extrapolation, of entire societies. This is a radical position in both political and religious terms. Plato’s Protagoras in the eponymous dialogue can simply assume that all men agree that warfare is ‘a fine and noble thing’. For Empedocles warfare, one fundamental plank of the Greek city state, is the most appalling of all evils and is punished by the immortals by hurling the perpetrators not only out of their society, but out of human society and even down to the level of the lowest forms of nature.

c. Theology

In religious terms as well traditional animal sacrifice, another fundamental basis of Greek society, becomes the grossest impiety of all. A probably apocryphal tale reports that Empedocles sacrificed an ox made of honey and meal at Olympia, the religious heart of Greece: a pointed act of criticism of traditional religion. Further evidence for his radical theology lies in his appropriation of the names of the Olympian gods for his roots of matter and his cosmic forces. Implicitly he argues that the Olympian gods came into being as misinterpretations of the natural world: the real ‘gods’ are the elements of nature and the cosmic forces that direct their endless evolutionary cycle. His religious and ethical teachings then are of purification of the soul in an attempt to achieve perfection and unity with perfect Love. He pictures a time in the past, a sort of golden age, when this universal harmony existed, fr. B 128:

They did not have Ares as god or Kydoimos, nor king Zeus, nor Kronos, nor Poseidon but queen Kypris [Love]. Her they propitiated with holy images and painted animal figures, with perfumes of subtle fragrance and offerings of distilled myrrh and sweet-smelling frankincense, and pouring on the earth libations of golden honey. Their altar was not drenched by the unspeakable slaughter of bulls, but this was the greatest defilement among men – to bereave of life and eat noble limbs.

fr. B 130:

All creatures, both animals and birds, were tame and gentle to men, and bright was the flame of their friendship.

Originally people worshipped only one god, Love, and this resulted in universal harmony, even between humans and animals. Implicitly the argument runs that the worship of the Olympian gods he mentions, Ares, Zeus and Poseidon, and the sacrifices they demand have destroyed this harmony, resulting in worship also of Kydoimos, the personification of the noise of battle. Traditional religion with their sacrificial slaughter and meat-eating have had a degrading effect on society.

d. Physics and Theology

As I say above it now seems very likely that Empedocles discussed purificatory topics early on in his poem On Nature. Unlike for modern rationalists then, it seems that for Empedocles there was no fundamental divide between physics and religion. Indeed as can be seen from fr. B 115 above the sin of the daimôn results in an expiatory journey of the soul not only through the different orders of living creatures but through the physical elements of the cosmos. Empedocles draws a close analogy between the cycle of the soul and the cycle of the cosmos itself. This is a hallmark of his work: frequently he uses the same language whether describing the journey of the soul or the cycle of the elements. Sometimes in the Strasbourg fragments the description of the elements coming together under the power of Love is rendered as ‘we are coming together’. His sin, in fr. 115, he describes as resulting from having put his trust in raving Strife, one of his cosmic forces, and conversely in fr. 130 we see the people of the golden age worshipping the other cosmic force, Love. Clearly there is more than a little cross-over between physics and ethics for Empedocles. How this works in detail is hard to pin down but perhaps the best reading we can give of On Natureis that it represents the detailed expression of the cycle of the soul at the level of the entire cosmos. The endless evolutionary cycling of the elements is in fact part of the cycle of the soul.

(Note: all translations are by M. R. Wright except those of the Strasbourg fragments which are by O. Primavesi and A. Martin.)

6. References and Further Reading

a. Texts and Commentaries

  • Bollack, J. Empédocle, (Paris, 1965-9), 4 vols. With Greek text, French translation, and commentary.
  • Diels, H. and W. Kranz, Die Fragmente der Vorsokratiker (Berlin, 1952), vol. 1, ch. 31, 276-375. Greek text of both fragments (B) and testimonia (A) with German translation.
  • Wright, M. R. (2nd edn.), Empedocles the Extant Fragments (London, 1995). With Greek text, English translation, introduction and commentary.
  • Inwood, B. The Poem of Empedocles (Toronto, 1992). With Greek text, facing English translation and introduction.
  • Martin, A. and O. Primavesi, L’Empédocle de Strasbourg: (P. Strasb. gr. Inv. 1665-1666) (Berlin/Strasbourg, 1998). With Greek text, French and English translations, introduction, commentary, and English summary.

b. Studies

  • Gemelli Marciano, L. “Le ‘demonologie’ empedoclee: problemi di metodo e altro”, Aevum Antiquum 1 (2003), 205-35
  • Gemelli Marciano, L. Le metamorfosi della tradizione: mutamenti de significato e neologismi nel Peri physeos di Empedocle (Bari, 1999).
  • Guthrie, W.K.C. A History of Greek Philosophy vol. 2 (Cambridge 1969), ch. 3
  • Kingsley, P. Ancient Philosophy, Mystery and Magic: Empedocles and Pythagorean Tradition (Oxford, 1995)
  • Kirk, G. S. and J.E. Raven, M. Schofield, (2nd edn.), The Presocratic Philosophers (Cambridge 1983), ch. 10.
  • O’Brien, D. Empedocles’ Cosmic Cycle (Cambridge, 1969)
  • Osborne, C. ‘Empedocles recycled’, Classical Quarterly NS 37 (1987), 24-50
  • Osbourne, C. ‘Rummaging in the recycling bins of Upper Egypt: a discussion of A. Martin and O. Primavesi, ZL’ Empédocle de Strasbourg’, Oxford Studies in Ancient Philosophy 18 (Oxford, 2000), 329-56.
  • Sedley, D. N. ‘Empedocles’ life cycles’, in Proceedings of the Symposium Tertium Mykonense (forthcoming, 2004)
  • Solmsen, F. ‘Love and Strife in Empedocles’ cosmology’, Phronesis 10 (1965), 123-45; repr. in R.E. Allen and D.J. Furley (eds), Studies in Presocratic Philosophy, (London, 1975), vol. 2, 221-64.
  • Trépanier, S. ‘Empedocles on the ultimate symmetry of the world’, Oxford Studies in Ancient Philosophy 24 (2003), 1-57
  • Trépanier, S. Empedocles: An Interpretation (London, 2004)

Author Information

Gordon Campbell
Email: gordon.l.campbell@may.ie
National University of Ireland, Maynooth
Ireland

Ralph Waldo Emerson (1803—1882)

emersonIn his lifetime, Ralph Waldo Emerson became the most widely known man of letters in America, establishing himself as a prolific poet, essayist, popular lecturer, and an advocate of social reforms who was nevertheless suspicious of reform and reformers. Emerson achieved some reputation with his verse, corresponded with many of the leading intellectual and artistic figures of his day, and during an off and on again career as a Unitarian minister, delivered and later published a number of controversial sermons. Emerson’s enduring reputation, however, is as a philosopher, an aphoristic writer (like Friedrich Nietzsche) and a quintessentially American thinker whose championing of the American Transcendental movement and influence on Walt Whitman, Henry David Thoreau, William James, and others would alone secure him a prominent place in American cultural history. Transcendentalism in America, of which Emerson was the leading figure, resembled British Romanticism in its precept that a fundamental continuity exists between man, nature, and God, or the divine. What is beyond nature is revealed through nature; nature is itself a symbol, or an indication of a deeper reality, in Emerson’s philosophy. Matter and spirit are not opposed but reflect a critical unity of experience. Emerson is often characterized as an idealist philosopher and indeed used the term himself of his philosophy, explaining it simply as a recognition that plan always precedes action. For Emerson, all things exist in a ceaseless flow of change, and “being” is the subject of constant metamorphosis. Later developments in his thinking shifted the emphasis from unity to the balance of opposites: power and form, identity and variety, intellect and fate. Emerson remained throughout his lifetime the champion of the individual and a believer in the primacy of the individual’s experience. In the individual can be discovered all truths, all experience. For the individual, the religious experience must be direct and unmediated by texts, traditions, or personality. Central to defining Emerson’s contribution to American thought is his emphasis on non-conformity that had so profound an effect on Thoreau. Self-reliance and independence of thought are fundamental to Emerson’s perspective in that they are the practical expressions of the central relation between the self and the infinite. To trust oneself and follow our inner promptings corresponds to the highest degree of consciousness.

Emerson concurred with the German poet and philosopher Johann Wolfgang von Goethe that originality was essentially a matter of reassembling elements drawn from other sources. Not surprisingly, some of Emerson’s key ideas are popularizations of both European as well as Eastern thought. From Goethe, Emerson also drew the notion of “bildung,” or development, calling it the central purpose of human existence. From the English Romantic poet and critic Samuel Taylor Coleridge, Emerson borrowed his conception of “Reason,” which consists of acts of perception, insight, recognition, and cognition. The concepts of “unity” and “flux” that are critical to his early thought and never fully depart from his philosophy are basic to Buddhism: indeed, Emerson said, perhaps ironically, that “the Buddhist . . . is a Transcendentalist.” From his friend the social philosopher Margaret Fuller, Emerson acquired the perspective that ideas are in fact ideas of particular persons, an observation he would expand into his more general—and more famous—contention that history is biography.

On the other hand, Emerson’s work possesses deep original strains that influenced other major philosophers of the nineteenth and twentieth centuries. The German philosopher Friedrich Nietzsche read Emerson in German translations and his developing philosophy of the great man is clearly influenced and confirmed by the contact. Writing about the Greek philosopher Plato, Emerson asserted that “Every book is a quotation . . . and every man is a quotation,” a perspective that foreshadows the work of French Structuralist philosopher Roland Barthes. Emerson also anticipates the key Poststructuralist concept of différance found in the work of Jacques Derrida and Jacques Lacan—“It is the same among men and women, as among the silent trees; always a referred existence, an absence, never a presence and satisfaction.” While not progressive on the subject of race by modern standards, Emerson observed that the differences among a particular race are greater than the differences between the races, a view compatible with the social constructivist theory of race found in the work of contemporary philosophers like Kwame Appiah.

Table of Contents

  1. Biography
  2. Major Works
  3. Legacy
  4. References and Further Reading

1. Biography

Ralph Waldo Emerson was born on May 25, 1803, in Boston to Ruth Haskins Emerson and William Emerson, pastor of Boston’s First Church. The cultural milieu of Boston at the turn of the nineteenth century would increasingly be marked by the conflict between its older conservative values and the radical reform movements and social idealists that emerged in the decades leading up through the 1840s. Emerson was one of five surviving sons who formed a supportive brotherhood, the financial and emotional leadership of which he was increasingly forced to assume over the years. “Waldo,” as Emerson was called, entered Harvard at age fourteen, taught in the summer, waited tables, and with his brother Edward, wrote papers for other students to pay his expenses. Graduating in the middle of his class, Emerson taught in his brother William’s school until 1825 when he entered the Divinity School at Harvard. The pattern of Emerson’s intellectual life was shaped in these early years by the range and depth of his extracurricular reading in history, literature, philosophy, and religion, the extent of which took a severe toll on his eyesight and health. Equally important to his intellectual development was the influence of his paternal aunt Mary Moody Emerson. Though she wrote primarily on religious subjects, Mary Moody Emerson set an example for Emerson and his brothers with her wide reading in every branch of knowledge and her stubborn insistence that they form opinions on all of the issues of the day. Mary Moody Emerson was at the same time passionately orthodox in religion and a lover of controversy, an original thinker tending to a mysticism that was a precursor to her nephew’s more radical beliefs. His aunt’s influence waned as he developed away from her strict orthodoxy, but her relentless intellectual energy and combative individualism left a permanent stamp on Emerson as a thinker.

In 1829, he accepted a call to serve as junior pastor at Boston’s Second Church, serving only until 1832 when he resigned at least in part over his objections to the validity of the Lord’s Supper. Emerson would in 1835 refuse a call as minister to East Lexington Church but did preach there regularly until 1839. In 1830, Emerson married Ellen Tucker who died the following year of tuberculosis. Emerson married again in 1835 to Lydia Jackson. Together they had four children, the eldest of whom, Waldo, died at the age of five, an event that left deep scars on the couple and altered Emerson’s outlook on the redemptive value of suffering. Emerson’s first book Nature was published anonymously in 1836 and at Emerson’s own expense. In 1837 Emerson delivered his famous “American Scholar” lecture as the Phi Beta Kappa address at Harvard, but his controversial Harvard Divinity School address, delivered in 1838, was the occasion of a twenty-nine year breach with the university and signaled his divergence from even the liberal theological currents of Cambridge. Compelled by financial necessity to undertake a career on the lecture circuit, Emerson began lecturing in earnest in 1839 and kept a demanding public schedule until 1872. While providing Emerson’s growing family and array of dependents with a steady income, the lecture tours heightened public awareness of Emerson’s ideas and work. From 1840-1844, Emerson edited The Dial with Margaret Fuller. Essays: First Series was published in 1841, followed by Essays: Second Series in 1844, the two volumes most responsible for Emerson’s reputation as a philosopher. In 1844, Emerson also purchased the land on the shore of Walden Pond where he was to allow the naturalist and philosopher Henry David Thoreau to build a cabin the following year. While sympathetic to the experimental collective at Brook Farm, Emerson declined urgent appeals to join the group and maintained his own household in Concord with Lydia and their growing family. Emerson attempted to create his own community of kindred spirits, however, assembling in the neighborhood of Concord a group of writers including Thoreau, Nathaniel Hawthorne, the social thinker Margaret Fuller, the reformer Bronson Alcott, and the poet Ellery Channing. English Traits was inspired by a trip to Britain during 1847-1848. By the 1850s, Emerson was an outspoken advocate of abolition in lectures across New England and the Midwest and continued lecturing widely on a number of different topics—eighty lectures in 1867 alone. Emerson spent the final years of his life peacefully but without full use of his faculties. He died of pneumonia in 1882 at his home in Concord.

2. Major Works

As a philosopher, Emerson primarily makes use of two forms, the essay and the public address or lecture. His career began, however, with a short book, Nature, published anonymously in 1836. Nature touches on many of the ideas to which he would return to again and again over his lifetime, most significantly the perspective that nature serves as an intermediary between human experience and what lies beyond nature. Emerson expresses a similar idea in his claim that spirit puts forth nature through us, exemplary of which is the famous “transparent eye-ball” passage, in which he writes that on a particular evening, while “crossing a bare common . . . the currents of Universal Being circulate through me.” On the strength this passage alone, Nature has been widely viewed as a defining text of Transcendentalism, praised and satirized for the same qualities. Emerson invokes the “transparent eye-ball” to describe the loss of individuation in the experience of nature, where there is no seer, only seeing: “I am nothing; I see all.” This immersion in nature compensates us in our most difficult adversity and provides a sanctification of experience profoundly religious —the direct religious experience that Emerson was to call for all his life. While Emerson characterizes traversing the common with mystical language, it is also importantly a matter of knowledge. The fundamental knowledge of nature that circulates through him is the basis of all human knowledge but cannot be distinguished, in Emerson’s thought, from divine understanding.

The unity of nature is the unity of variety, and “each particle is a microcosm.” There is, Emerson writes “a universal soul” that, influenced by Coleridge, he named “reason.” Nature is by turns exhortative and pessimistic, like the work of the English Romantics, portraying man as a creature fallen away from a primordial connection with nature. Man ought to live in a original relation to the universe, an assault on convention he repeats in various formulas throughout his life; however, “man is the dwarf of himself . . . is disunited with himself . . . is a god in ruins.” Nature concludes with a version of Emerson’s permanent program, the admonition to conform your life to the “pure idea in your mind,” a prescription for living he never abandons.

“The American Scholar” and “The Divinity School Address” are generally held to be representative statements of Emerson’s early period. “The American Scholar,” delivered as the Phi Beta Kappa oration at Harvard in 1837, repeats a call for a distinctively American scholarly life and a break with European influences and models—a not original appeal in the 1830s. Emerson begins with a familiar critique of American and particularly New England culture by asserting that Americans were “a people too busy to give to letters any more.” What must have surprised the audience was his anti-scholarly theme, that “Books are for the scholar’s idle times,” an idea that aligns the prodigiously learned and widely read Emerson with the critique of excessive bookishness found in Wordsworth and English Romanticism. Continuing in this theme, Emerson argues against book knowledge entirely and in favor of lived experience: “Only so much do I know, as I have lived.” Nature is the most important influence on the mind, he told his listeners, and it is the same mind, one mind, that writes and reads. Emerson calls for both creative writing and “creative reading,” individual development being essential for the encounter with mind found in books. The object of scholarly culture is not the bookworm but “Man Thinking,” Emerson’s figure for an active, self-reliant intellectual life that thus puts mind in touch with Mind and the “Divine Soul.” Through this approach to the study of letters, Emerson predicts that in America “A nation of men will for the first time exist.”

“The Divinity School Address,” also delivered at Harvard in 1838, was considerably more controversial and marked in earnest the beginning of Emerson’s opposition to the climate of organized religion in his day, even the relatively liberal theology of Cambridge and the Unitarian Church. Emerson set out defiantly to insist on the divinity of all men rather than one single historical personage, a position at odds with Christian orthodoxy but one central to his entire system of thought. The original relation to nature Emerson insisted upon ensures an original relation to the divine, not copied from the religious experience of others, even Jesus of Nazareth. Emerson observes that in the universe there is a “justice” operative in the form of compensation: “He who does a good deed is instantly ennobled.” This theme he would develop powerfully into a full essay, “Compensation” (1841). Whether Emerson characterized it as compensation, retribution, balance, or unity, the principle of an automatic response to all human action, good or ill, was a permanent fixture of his thought. “Good is positive,” he argued to the vexation of many in the audience, “evil merely privative, not absolute.” Emerson concludes his address with a subversive call to rely on one’s self, to “go alone; to refuse the good models.”

Two of Emerson’s first non-occasional public lectures from this early period contain especially important expressions of his thought. Always suspicious of reform and reformers, Emerson was yet an advocate of reform causes. In “Man the Reformer” (1841), Emerson expresses this ambivalence by speculating that if we were to “Let our affection flow out to our fellows; it would operate in a day the greatest of all revolutions.” In an early and partial formulation of his theory that all people, times, and places are essentially alike, he writes in “Lecture on the Times” (1841) that “The Times . . . have their root in an invisible spiritual reality;” then more fully in “The Transcendentalist” (1842): “new views . . . are not new, but the very oldest of thoughts cast into the mould of these new times.” Such ideas, while quintessential Emerson, are nevertheless positions that he would qualify and complicate over the next twenty years.

Emerson brought out his Essays: First Series, in 1841, which contain perhaps his single most influential work, “Self-Reliance.” Emerson’s style as an essayist, not unlike the form of his public lectures, operates best at the level of the individual sentence. His essays are bound together neither by their stated theme nor the progression of argument, but instead by the systematic coherence of his thought alone. Indeed, the various titles of Emerson’s do not limit the subject matter of the essays but repeatedly bear out the abiding concerns of his philosophy. Another feature of his rhetorical style involves exploring the contrary poles of a particular idea, similar to a poetic antithesis. As a philosopher-poet, Emerson employs a highly figurative style, while his poetry is remarkable as a poetry of ideas. The language of the essays is sufficiently poetical that Thoreau felt compelled to say critically of the essays—”they were not written exactly at the right crisis [to be poetry] though inconceivably near it.” In “History” Emerson attempts to demonstrate the unity of experience of men of all ages: “What Plato has thought, he may think; what a saint has felt, he may feel; what at any time has befallen any man, he may understand.” Interestingly, for an idealist philosopher, he describes man as “a bundle of relations.” The experience of the individual self is of such importance in Emerson’s conception of history that it comes to stand for history: “there is properly no history; only biography.” Working back from this thought, Emerson connects his understanding of this essential unity to his fundamental premise about the relation of man and nature: “the mind is one, and that nature is correlative.” By correlative, Emerson means that mind and nature are themselves representative, symbolic, and consequently correlate to spiritual facts. In the wide-ranging style of his essays, he returns to the subject of nature, suggesting that nature is itself a repetition of a very few laws, and thus implying that history repeats itself consistently with a few recognizable situations. Like the Danish philosopher Soren Kierkegaard, Emerson disavowed nineteenth century notions of progress, arguing in the next essay of the book, “Society never advances . . . For everything that is given, something is taken.”

“Self-Reliance” is justly famous as a statement of Emerson’s credo, found in the title and perhaps uniquely among his essays, consistently and without serious digression throughout the work. The emphasis on the unity of experience is the same: “what is true for you in your private heart is true for all men.” Emerson rests his abiding faith in the individual—”Trust thyself”—on the fundamental link between each man and the divine reality, or nature, that works through him. Emerson wove this explicit theme of self-trust throughout his work, writing in “Heroism” (1841), “Self-trust is the essence of heroism.” The apostle of self-reliance perceived that the impulses that move us may not be benign, that advocacy of self-trust carried certain social risks. No less a friend of Emerson’s than Herman Melville parodied excessive faith in the individual through the portrait of Captain Ahab in his classic American novel, Moby-Dick. Nevertheless, Emerson argued that if our promptings are bad they come from our inmost being. If we are made thus we have little choice in any case but to be what we are. Translating this precept into the social realm, Emerson famously declares, “Whoso would be a man must be a nonconformist”—a point of view developed at length in both the life and work of Thoreau. Equally memorable and influential on Walt Whitman is Emerson’s idea that “a foolish consistency is the hobgoblin of small minds, adored by little statesmen and philosophers and divines.” In Leaves of Grass, Whitman made of his contradictions a virtue by claiming for himself a vastness of character that encompassed the vastness of the American experience. Emerson opposes on principle the reliance on social structures (civil, religious) precisely because through them the individual approaches the divine second hand, mediated by the once original experience of a genius from another age: “An institution,” as he explains, “is the lengthened shadow of one man.” To achieve this original relation one must “Insist on one’s self; never imitate” for if the relationship is secondary the connection is lost. “Nothing,” Emerson concludes, “can bring you peace but the triumph of principles,” a statement that both in tone and content illustrates the vocational drive of the former minister to speak directly to a wide audience and preach a practical philosophy of living.

Three years later in 1844 Emerson published his Essays: Second Series, eight essays and one public lecture, the titles indicating the range of his interests: “The Poet,” “Experience,” “Character,” “Manners,” “Gifts,” “Nature,” “Politics,” “Nominalist and Realist,” and “New England Reformers.” “The Poet” contains the most comprehensive statement on Emerson’s aesthetics and art. This philosophy of art has its premise in the Transcendental notion that the power of nature operates through all being, that it is being: “For we are not pans and barrows . . . but children of the fire, made of it, and only the same divinity transmuted.” Art and the products of art of every kind—poetry, sculpture, painting, and architecture—flow from the same unity at the root of all human experience. Emerson’s aesthetics stress not the object of art but the force that creates the art object, or as he characterizes this process in relation to poetry: “it is not metres, but a metre-making argument that makes a poem.” “The Poet” repeats anew the Emersonian dictum that nature is itself a symbol, and thus nature admits of being used symbolically in art. While Emerson does not accept in principle social progress as such, his philosophy emphasizes the progress of spirit, particularly when understood as development. This process he allies with the process of art: “Nature has a higher end . . . ascension, or the passage of the soul into higher forms.” The realm of art, ultimately for Emerson, is only an intermediary function, not an end itself: “Art is the path of the creator to his work.” On this and every subject, Emerson reveals the humanism at the core of his philosophy, his human centric perspective that posits the creative principle above the created thing. “There is a higher work for Art than the arts,” he argues in the essay “Art,” and that work is the full creative expression of human being. Nature too has this “humanism,” to speak figuratively, in its creative process, as he writes in “The Method of Nature:” “The universe does not attract us until housed in an individual.” Most notable in “The Poet” is Emerson’s call for an expressly American poetry and poet to do justice to the fact that “America is a poem in our eyes.” What is required is a “genius . . . with tyrannous eye, which knew the value of our incomparable materials” and can make use of the “barbarism and materialism of the times.” Emerson would not meet Whitman for another decade, only after Whitman had sent him anonymously a copy of the first edition of Leaves of Grass, in which—indicative of Emerson’s influence—Whitman self-consciously assumes the role of the required poet of America and asserts, like his unacknowledged mentor, that America herself is indeed a poem.

“Experience” remains one of Emerson’s best-known and often-anthologized essays. It is also an essay written out of the devastating grief that struck the Emerson household after the death of their five-year-old son, Waldo. He wrote, whether out of conviction or helplessness, “I grieve that grief can teach me nothing.” Emerson goes on, rocking back and forth between resignation and affirmation, establishing along the way a number of key points. In “Experience” he defines “spirit” as “matter reduced to an extreme thinness.” In keeping with the gradual shift in his philosophy from an emphasis on the explanatory model of “unity” to images suggesting balance, he describes “human life” as consisting of “two elements, power and form, and the proportion must be invariably kept.” Among his more quotable aphorisms is “The years teach us much which the days never know,” a memorable argument for the idea that experience cannot be reduced to the smallest observable events, then added back up again to constitute a life; that there is, on the contrary, an irreducible whole present in a life and at work through us. “Experience” concludes with Emerson’s hallmark optimism, a faith in human events grounded in his sense of the total penetration of the divine in all matter. “Every day,” he writes, and “every act betrays the ill-concealed deity,” a determined expression of his lifelong principle that the divine radiates through all being.

The early 1850s saw the publication of a number of distinctively American texts: Nathaniel Hawthorne’s The Scarlet Letter (1850); Melville’s Moby-Dick (1851); Harriet Beecher Stowe’s Uncle Tom’s Cabin (1852); and Whitman’s Leaves of Grass (1855). Emerson’s Representative Men (1850) failed to anticipate this flowering of a uniquely American literature in at least one respect: none of his representative characters were American—nevertheless, each biography yields an insight into some aspect of Emerson’s thought he finds in the man or in his work, so that Representative Men reads as the history of Emerson’s precursors in other times and places. Emerson structures the book around portraits of Plato, the Swedish mystic Emmanuel Swedenborg, the French essayist Montaigne, the poet William Shakespeare, the statesman Napoleon Bonaparte, and the writer Johann Wolfgang von Goethe. Each man stands in for a type, for example, Montaigne represents the “skeptic,” Napoleon the “man of the world.” Humanity, for Emerson, consisted of recognizable but overlapping personality types, types discoverable in every age and nation, but all sharing in a common humanity that has its source in divine being. Each portrait balances the particular feature of the representative man that illustrates the general laws inhabiting humanity along with an assessment of the great man’s shortcomings. Like Nietzsche, Emerson did not believe that great men were ends in themselves but served particular functions, notably for Emerson their capacity to “clear our eyes of egotism, and enable us to see other people in their works.” Emerson’s representative men are “great,” but “exist that there may be greater men.” As a gesture toward self-criticism about an entire book on great men by the champion of American individualism, Emerson concedes, “there are no common men,” and his biographical sketches ultimately balance both the limitations of each man with his—to use an oxymoron—distinctive universality, or in other words, the impact he has had on Emerson’s thought. While Plato receives credit for establishing the “cardinal facts . . . the one and the two.—1. Unity, or Identity; and, 2. Variety,” Emerson concedes that through Plato we have had no success in “explaining existence.” It was Swedenborg, according to Emerson, who discovered that the smallest particles in nature are merely replicated and repeated in larger organizations, and that the physical world is symbolic of the spiritual. But although he approves of the religion Swedenborg urged, a spirituality of each and every moment, Emerson complains the mystic lacks the “liberality of universal wisdom.” Instead, we are “always in a church.” From Montaigne, Emerson gained a heightened sense of the universal mind as he read the French philosophers’ Essays, for “It seemed to me as if I had myself written the book”—as well as an enduring imperative of style: “Cut these words, and they would bleed.” The “skeptic” Montaigne, however, lacks belief, which “consists in accepting the affirmations of the soul.” From Shakespeare, Emerson received confirmation that originality was a reassembly of existing ideas. The English poet possessed the rare capacity of greatness in that he allowed the spirit of his age to achieve representation through him. Nevertheless the world waits on “a poet-priest” who can see, speak, and act, with equal inspiration.” Reflection on Napoleon’s life teaches the value of concentration, one of Emerson’s chief virtues. In The Conduct of Life, Emerson describes “concentration,” or bringing to bear all of one’s powers on a single object, as the “chief prudence.” Likewise, Napoleon’s shrewdness consisted in allowing events to take their natural course and become representative of the forces of his time. The defect of the “man of the world” was that he possessed “the powers of intellect without conscience” and was doomed to fail. Emerson’s moral summary of Napoleon’s sounds a great deal like Whitman: “Only that good profits, which we can taste with all doors open, and which serves all men.” Goethe, “the writer,” like Napoleon, represents the countervailing force of nature against Emerson’s lifelong opponent, what he called “the morgue of convention.” Goethe is also exemplary of the man of culture whose sphere of knowledge, as Emerson himself tried to emulate with his wide and systematic reading, knows no limits or categorical boundaries. Yet, “the lawgiver of art is not an artist,” and repeating a call for an original relation to the infinite, foregoing even the venerable authority of Goethe, Emerson concludes, “We too must write Bibles.”

English Traits was published in 1856 but represented almost a decade of reflections on an invited lecture tour Emerson made in 1847-48 to Great Britain. English Traits presents an unusually conservative set of perspectives on a rather limited subject, that of a single nation and “race,” in place of human civilization and humanity as a whole. English Traits contains an advanced understanding of race, namely, that the differences among the members of a race are greater than the differences between races, but in general introduces few new ideas. The work is highly “occasional,” shaped by his travels and visits, and bore evidence of what seemed to be an erosion of energy and originality in his thought.

The Conduct of Life (1860), however, proved to be a work of startling vigor and insight and is Emerson’s last important work published in his lifetime. “Fate” is arguably the central essay in the book. The subject of fate, which Emerson defines as “An expense of means to end,” along with the relation of fate to freedom and the primacy of man’s vocation, come to be the chief subjects of the final years of his career. Some of Emerson’s finest poetry can be found in his essays. In “Fate” he writes: “A man’s power is hooped in by a necessity, which, by many experiments, he touches on every side, until he learns its arc.” Fate is balanced in the essay by intellect: “So far as a man thinks, he is free.” Emerson’s advice for the conduct of life is to learn to swim with the tide, to “trim your bark” (that is, sails) to catch the prevailing wind. He refines and redefines his conception of history as the interaction between “Nature and thought.” Emerson further refines his conception of the great man by describing him as the “impressionable” man, or the man who most perfectly captures the spirit of his time in his thought and action. Varying a biblical proverb to his own thought, Emerson argues that what we seek we will find because it is our fate to seek what is our own. Always a moderating voice in politics, Emerson writes in “Power” that the “evils of popular government appear greater than they are”—at best a lukewarm recommendation of democracy. On the subject of politics, Emerson consistently posited a faith in balance, the tendencies toward chaos and order, change and conservation always correcting each other. His late aesthetics reinforce this political stance as he veers in “Beauty” onto the subject of women’s suffrage: “Thus the circumstances may be easily imagined, in which woman may speak, vote, argue causes, legislate, and drive a coach, and all the most naturally in the world, if only it come by degrees.”

In his early work, Emerson emphasized the operation of nature through the individual man. The Conduct of Life uncovers the same consideration only now understood in terms of work or vocation. Emerson argued with increasing regularity throughout his career that each man is made for some work, and to ally himself with that is to render himself immune from harm: “the conviction that his work is dear to God and cannot be spared, defends him.” One step above simple concentration of force in Emerson’s scale of values we find his sense of dedication: “Nothing is beneath you, if it is in the direction of your life.” While in favor of many of the social and political reform movements of his time, Emerson never ventured far into a critique of laissez-faire economics. In “Wealth” we find the balanced perspective, one might say contradiction, to be found in all the late work. Emerson argues that to be a “whole man” one must be able to find a “blameless living,” and yet this same essay acknowledges an unsentimental definition of wealth: “He is the richest man who knows how to draw a benefit from the labors of the greatest numbers of men.” In the final essay of the book, “Illusions,” Emerson uses a metaphor—“the sun borrows his beams”—to reassert his pervasive humanism, the idea that we endow nature with its beauty, and that man is at the center of creation. Man is at the center, and the center will hold: “There is no chance, and no anarchy, in the universe.”

3. Legacy

Emerson remains the major American philosopher of the nineteenth century and in some respects the central figure of American thought since the colonial period. Perhaps due to his highly quotable style, Emerson wields a celebrity unknown to subsequent American philosophers. The general reading public knows Emerson’s work primarily through his aphorisms, which appear throughout popular culture on calendars and poster, on boxes of tea and breath mints, and of course through his individual essays. Generations of readers continue to encounter the more famous essays under the rubric of “literature” as well as philosophy, and indeed the essays, less so his poetry, stand undiminished as major works in the American literary tradition. Emerson’s emphasis on self-reliance and nonconformity, his championing of an authentic American literature, his insistence on each individual’s original relation to God, and finally his relentless optimism, that “life is a boundless privilege,” remain his chief legacies.

4. References and Further Reading

  • Baker, Carlos. Emerson Among the Eccentrics: A Group Portrait. New York: Penguin, 1997.
  • Emerson, Ralph Waldo: Essays and Lectures. Ed. Joel Porte. New York: Library of America, 1983.
  • Essays and Poems. Ed. Joel Porte et al. New York: Library of American, 1996.
  • The Complete Sermons of Ralph Waldo Emerson. Vol. 4. Ed Wesley T. Mott et al. Columbia, MO: University of Missouri Press, 1992.
  • The Selected Letters of Ralph Waldo Emerson. Ed. Joel Myerson. New York: Columbia, 1997.
  • The Heart of Emerson’s Journals. Ed. Bliss Perry. Minneola, NY: Dover Press, 1995.
  • Field, Peter. S. Ralph Waldo Emerson: The Making of a Democratic Intellectual. Lanham, MD: Rowman and Littlefield, 2002.
  • Porte, Joel. Representative Man: Ralph Waldo Emerson in His Time. New York: Columbia University Press, 1988.
  • Porte, Joel and Morris, Saundra. The Cambridge Companion to Ralph Waldo Emerson. Cambridge: Cambridge University Press, 1999.
  • Richardson, Robert D. Jr. Emerson: The Mind on Fire. Berkeley: University of California Press, 1995

Author Information

Vince Brewton
Email: vjbrewton@una.edu
University of North Alabama
U. S. A.

Gnosticism

Gnosticism (after gnôsis, the Greek word for “knowledge” or “insight”) is the name given to a loosely organized religious and philosophical movement that flourished in the first and second centuries CE. The exact origin(s) of this school of thought cannot be traced, although it is possible to locate influences or sources as far back as the second and first centuries BCE, such as the early treatises of the Corpus Hermeticum, the Jewish Apocalyptic writings, and especially Platonic philosophy and the Hebrew Scriptures themselves.

In spite of the diverse nature of the various Gnostic sects and teachers, certain fundamental elements serve to bind these groups together under the loose heading of “Gnosticism” or “Gnosis.” Chief among these elements is a certain manner of “anti-cosmic world rejection” that has often been mistaken for mere dualism. According to the Gnostics, this world, the material cosmos, is the result of a primordial error on the part of a supra-cosmic, supremely divine being, usually called Sophia (Wisdom) or simply the Logos. This being is described as the final emanation of a divine hierarchy, called the Plêrôma or “Fullness,” at the head of which resides the supreme God, the One beyond Being. The error of Sophia, which is usually identified as a reckless desire to know the transcendent God, leads to the hypostatization of her desire in the form of a semi-divine and essentially ignorant creature known as the Demiurge (Greek: dêmiourgos, “craftsman”), or Ialdabaoth, who is responsible for the formation of the material cosmos. This act of craftsmanship is actually an imitation of the realm of the Pleroma, but the Demiurge is ignorant of this, and hubristically declares himself the only existing God. At this point, the Gnostic revisionary critique of the Hebrew Scriptures begins, as well as the general rejection of this world as a product of error and ignorance, and the positing of a higher world, to which the human soul will eventually return. However, when all is said and done, one finds that the error of Sophia and the begetting of the inferior cosmos are occurrences that follow a certain law of necessity, and that the so-called “dualism” of the divine and the earthly is really a reflection and expression of the defining tension that constitutes the being of humanity—the human being.

Table of Contents

  1. The Philosophical Character of Gnosticism
    1. Psychology
    2. Existentialism
    3. Hermeneutics
      1. Reception and Revelation
  2. The Gnostic Mytho-Logos
    1. The Myth of Sophia
    2. Christian Gnosticism
      1. Basilides
      2. Marcion
      3. Valentinus and the Valentinian School
        1. The System of Ptolemy
    3. Mani and Manichaeism
  3. Platonism and Gnosticism
    1. Numenius of Apamea and Neo-Platonism
  4. Concluding Summary
  5. References and Further Reading
    1. Sources
    2. Suggestions for Further Reading

1. The Philosophical Character of Gnosticism

Gnosticism, as an intellectual product, is grounded firmly in the general human act of reflecting upon existence. The Gnostics were concerned with the basic questions of existence or “being-in-the-world” (Dasein)—that is: who we are (as human beings), where we have come from, and where we are heading, historically and spiritually (cf. Hans Jonas, The Gnostic Religion 1958, p. 334). These questions lie at the very root of philosophical thinking; but the answers provided by the Gnostics go beyond philosophical speculation toward the realm of religious doctrine and mysticism. However, it is impossible to understand fully the meaning of Gnosticism without beginning at the philosophical level, and orienting oneself accordingly. Since any orientation toward an ancient phenomenon must always proceed by way of contemporary ideas and habits of mind, an interpretative discussion of Gnostic thinking as it applies to Psychology, Existentialism, and Hermeneutics, is not amiss here. Once we have understood, to the extent of our ability, the philosophical import of Gnostic ideas, and how they relate to contemporary philosophical issues, then we may enter into the historical milieu of the Gnostics with some degree of confidence—a confidence devoid, to the extent that this is possible, of tainting exegetical presuppositions.

a. Psychology

Who are we? The answer to this question involves an account (logos) of the nature of the soul (psukhê or psyche); and the attempt to provide an answer has accordingly been dubbed the science or practice of “psychology”—an account of the soul or mind (psukhê, in ancient Greek, denoted both soul, as the principle of life, and mind, as the principle of intellect). Carl Jung, drawing upon Gnostic mythical schemas, identified the objectively oriented consciousness with the material or “fleshly” part of humankind—that is, with the part of the human being that is, according to the Gnostics, bound up in the cosmic cycle of generation and decay, and subject to the bonds of fate and time (cf. Apocryphon of John [Codex II] 28:30). The human being who identifies him/herself with the objectively existing world comes to construct a personality, a sense of self, that is, at base, fully dependent upon the ever-changing structures of temporal existence. The resulting lack of any sense of permanence, of autonomy, leads such an individual to experience anxieties of all kinds, and eventually to shun the mysterious and collectively meaningful patterns of human existence in favor of a private and stifling subjective context, in the confines of which life plays itself out in the absence of any reference to a greater plan or scheme. Hopelessness, atheism, despair, are the results of such an existence. This is not the natural end of the human being, though; for, according to Jung (and the Gnostics) the temporally constructed self is not the true self. The true self is the supreme consciousness existing and persisting beyond all space and time. Jung calls this the pure consciousness or Self, in contradistinction to the “ego consciousness” which is the temporally constructed and maintained form of a discrete existent (cf. C.G. Jung, “Gnostic Symbols of the Self,” in The Gnostic Jung 1992, pp. 55-92). This latter form of “worldly” consciousness the Gnostics identified with soul (psukhê), while the pure or true Self they identified with spirit (pneuma)—that is, mind relieved of its temporal contacts and context. This distinction had an important career in Gnostic thought, and was adopted by St. Paul, most notably in his doctrine of the spiritual resurrection (1 Corinthians 15:44). The psychological or empirical basis of this view, which soon turns into a metaphysical or onto-theological attitude, is the recognized inability of the human mind to achieve its grandest designs while remaining subject to the rigid law and order of a disinterested and aloof cosmos. The spirit-soul distinction (which of course translates into, or perhaps presupposes, the more fundamental mind-body distinction) marks the beginning of a transcendentalist and soteriological attitude toward the cosmos and temporal existence in general.

b. Existentialism

The basic experience of existence, described by the philosophy that has become known as “Existentialism,” involves a general feeling of loneliness or abandonment (Geworfenheit, “having been thrown”) in/to a world that is not amenable to the primordial desires of the human being (cf. Jonas, p. 336). The recognition that the first or primal desire of the human being is for the actualization or positing of a concrete self or “I” (an autonomous and discrete individual existing and persisting amidst the flux and flow of temporal and external “reality”) leads to the disturbing realization that this world is not akin to the human being; for this world (so it seems) follows it own course, a course already mapped out and set in motion long before the advent of human consciousness. Furthermore, that the essential activity of the human being—that is, to actualize an autonomous self within the world—is carried out in opposition to a power or “will” (the force of nature) that always seems to thwart or subvert this supremely human endeavor, leads to the acknowledgment of an anti-human and therefore anti-intellectual power; and this power, since it seems to act, must also exist. However, the fact that its act does not manifest itself as a communication between humanity and nature (or pure objectivity), but rather as a mechanical process of blind necessity occurring apart from the human endeavor, places the human being in a superior position. For even though the force of nature may arbitrarily wipe out an individual human existent, just as easily as it brings one into existence, this natural force is not conscious of its activity. The human mind, on the other hand, is. And so a gap or fissure—a product of reflection—is set up, by which the human being may come to orient him/herself with and toward the world in which s/he exists and persists, for a brief moment. Martin Heidegger has described this brief moment of orientation with/in (toward) the world as “care” (Sorge), which is always a care or concern for the “moment” (Augenblick) within which all existence occurs; this “care” is understood as the product of humankind’s recognition of their unavoidable being-toward-death. But this orientation is never completed, since the human soul finds that it cannot achieve its purpose or complete actualization within the confines set by nature.

While the thwarting necessity of nature is, for the Existentialist, a simple, unquestioned fact; for the Gnostics it is the result of the malignant designs of an inferior god, the Demiurge, carried out through and by this ignorant deity’s own law. In other words, nature is, for modern Existentialism, merely indifferent, while for the Gnostics it was actively hostile toward the human endeavor. “[C]osmic law, once worshipped as the expression of a reason with which man’s reason can communicate in the act of cognition, is now seen only in its aspect of compulsion which thwarts man’s freedom” (Jonas, p. 328). Time and history come to be understood as the provenance of the human mind, over-against futile idealistic constructions like law and order, nomos and cosmos. Knowledge, at this point, becomes a concrete endeavor—a self-salvific task for the human race.

Becoming aware of itself, the self also discovers that it is not really its own, but is rather the involuntary executor of cosmic designs. Knowledge, gnosis, may liberate man from this servitude; but since the cosmos is contrary to life and to spirit, the saving knowledge cannot aim at integration into the cosmic whole and at compliance with its laws. For the Gnostics … man’s alienation from the world is to be deepened and brought to a head, for the extrication of the inner self which only thus can gain itself (Jonas, p. 329).

The obvious question, then—Where did we come from? — only becomes intelligible alongside and within the more dynamic question of Where are we heading?

c. Hermeneutics

In the context of ancient Greek thinking, hermêneia was usually associated with tekhnê, giving us the tekhnê hermêneutikê or “art of interpretation” discussed by Aristotle in his treatise De Interpretatione [Peri Hermêneias]. Interpretation or hermeneutics, according to Aristotle, does not bring us to a direct knowledge of the meaning of things, but only to an understanding of how things come to appear before us, and thereby to provide us with an avenue toward empirical knowledge, as it were.

Moreover, discourse is hermêneia because a discursive statement is a grasp of the real by meaningful expression, not a selection of so-called impressions coming from the things themselves (Paul Ricoeur, The Conflict of Interpretations 1974, p. 4).

In this sense, we may say that the “art of interpretation” is a distinctly historical method of understanding or coming to terms with reality. In other words, since our “expression” is always an ex-position, a going-out from the given forms or patterns of reality toward a living use of these forms with/in Life, then we, as human beings persisting in a realm of becoming, are responsible, in the last analysis, not for any eternal truths or “things in themselves,” but only for the forms these things take on within the context of a living and thinking existence. Knowledge or understanding, then, is not of immutable and eternal things in themselves, but rather of the process by which things—that is, ideas, objects, events, persons, etc.—become revealed within the existential or ontological process of coming-to-know. The attention to process and the emergence of meaning occurs on the most immediate experiential level of human existence, and therefore contains about it nothing of the metaphysical. However, the birth of metaphysics may be located within this primordial or phenomenal structure of basic “brute” experience; for it is the natural tendency of the human mind to order and arrange its data according to rational principles.

The question will inevitably arise, though, as to whence these rational principles derive: are they a derivative product of the phenomenal realm of experience? or are they somehow endemic to the human mind as such, and hence eternal? If we take the first question as an answer, we are led to phenomenology, which “discovers, in place of an idealist subject locked within [a] system of meanings, a living being which from all time has, as the horizon of all its intentions, a world, the world” (Ricoeur, p. 9). According to the general contemporary or “post-modern” formulation, such a “living being” is directed, intentionally, always and only toward a multiplicitous world or realm in which human activity itself becomes the sole object of knowledge, apart from any “transcendent” metaphysical ideals or schemas. For the Gnostics, on the other hand, who worked within and upon the latter question, giving it a positive, if somewhat mytho-poetical answer, rational principles, which seem to be culled from a mere contact with sensible reality, are held to be reminders of a unified existence that is an eternal possibility, open to anyone capable of transcending and, indeed, transgressing this realm of experience and process —that is, of history. This “transgression” consists in the act of balancing oneself with/in, and orienting oneself toward, history as an interplay of past and present, in which the individual is poised for a decision—either to succumb to the flux and flow of an essentially decentered cosmic existence, or to strive for a re-integration into a godhead that is only barely recollected, and more obscure than the immediate perceptions of reality.

i. Reception and Revelation

Where are we heading? This question is at the very heart of Gnostic exegesis, and indeed colors and directs all attempts at coming to terms, not only with the Hebrew Scriptures, which served as the main text of Gnostic interpretation, but with existence in general.

The standard hermeneutical approach, both in our own era, and in Late Hellenistic times, is the receptive approach—that is, an engagement with texts of the past governed by the belief, on the part of the interpreter, that these texts have something to teach us. Whether we struggle to overcome our own “prejudices” or presuppositions, which are the inevitable result of our belonging to a particular tradition by way of the hermeneutical act (Gadamer), or allow our prejudices to shape our reading of a text, in an act of “creative misprision” (Bloom) we are still acknowledging, in some way, our debt to or dependence upon the text with which we are engaged. The Gnostics, in their reading of Scripture, acknowledged no such debt; for they believed that the Hebrew Bible was the written revelation of an inferior creator god (dêmiourgos), filled with lies intended to cloud the minds and judgment of the spiritual human beings (pneumatikoi) whom this Demiurge was intent on enslaving in his material cosmos.

Indeed, while the receptive hermeneutical method implies that we have something to learn from a text, the method employed by the Gnostics, which we may call the “revelatory” method, was founded upon the idea that they (the Gnostics) had received a supra-cosmic revelation, either in the form of a “call,” or a vision, or even, perhaps, through the exercise of philosophical dialectic. This “revelation” was the knowledge (gnôsis) that humankind is alien to this realm, and possesses a “home on high” within the plêrôma, the “Fullness,” where all the rational desires of the human mind come to full and perfect fruition. On this belief, all knowledge belonged to these Gnostics, and any interpretation of the biblical text would be for the purpose of explaining the true nature of things by elucidating the errors and distortions of the Demiurge. This approach treated the past as something already overcome yet still “present,” insofar as certain members of the human race were still laboring under the old law—that is, were still reading the Scriptures in the receptive manner. The Gnostic, insofar as he still remained within the world, as an existing being, was, on the other hand, both present and future. That is to say, the Gnostic embodied within himself the salvific dynamism of a history that had broken from the constraint of a tyrannical past, and found the freedom to invent itself anew. The Gnostic understood himself to be at once at the center and at the end or culmination of this history, and this idea or ideal was reflected most powerfully in ancient Gnostic exegesis. We must now turn to a discussion of the concrete results of this hermeneutical method.

2. The Gnostic Mytho-Logos

The Gnostic Idea or Notion was not informed by a philosophical world-view or procedure. Rather, the Gnostic vision of the world was based upon the intuition of a radical and seemingly irreparable rupture between the realm of experience (pathos) and the realm of true Being—that is, existence in its positive, creative, or authentic aspect.

The problem faced by the Gnostics was how to explain such a radical, pre-philosophical intuition. This intuition is “pre-philosophical” because the brute experience of existing in a world that is alien to humankind’s aspirations may submit itself to a variety of interpretations. And the attempt at an interpretation may take on the form of either muthos or logos—either a merely descriptive rendering of the experience, or a rationally ordered account of such an experience, including an explanation of its origins. The ancient Greek explanation of this experience was to call it a primal “awe” or “wonder” felt by the human being as he faces the world that stands so radically apart from him, and to posit this experience as the beginning of philosophy (cf. Aristotle, Metaphysics 982b 10-25 and Plato, Theaetetus 155d). But the Gnostics recognized this “awe” as the product of a radical disruption of the harmony of a realm persisting beyond becoming—that is, beyond “becoming” in the sense of pathos, or “that which is undergone.” The muthos always corresponds to the “first-hand” account rendered by one who has undergone, immediately, the effect of a certain event. The myth is always an explanation of something already known, and therefore carries its truth-claim along with it, just as the immediacy of an event forbids any doubt or questioning on the part of the one undergoing it. The logos, on the other hand, is the product of a careful reflection (dianoia), and refers, for its truth-value, not to the immediate moment of “grasping” a phenomenon (prolêpsis), but to the moment of reflection during which one attains a conceptual knowledge of the phenomenon, and first comes to “know” it as such—this is gnôsis: insight. The direct result of this gnôsis is the emergence from the sense of existence as pathos, to the actuality of being as aisthêsis—that is, reception and judgment of experience by way of purely rational or divine criteria. Such criteria proceeds directly from the logos, or divine “ordering principle,” to which the Gnostics believed themselves to be related, by way of a divine genealogy. Although Gnostic onto-theology proceeds by way of an elaborate myth, it is a myth informed always by the logos, and is, in this sense, a true mythology—that is, a rendering, in the immediacy of language, of that which is ever-present (to the Gnostic) as a product of privileged reflection.

a. The Myth of Sophia

According to Gnostic mythology (in general) We, humanity, are existing in this realm because a member of the transcendent godhead, Sophia (Wisdom), desired to actualize her innate potential for creativity without the approval of her partner or divine consort. Her hubris, in this regard, stood forth as raw materiality, and her desire, which was for the mysterious ineffable Father, manifested itself as Ialdabaoth, the Demiurge, that renegade principle of generation and corruption which, by its unalterable necessity, brings all beings to life, for a brief moment, and then to death for eternity. However, since even the Pleroma itself is not, according to the Gnostics, exempt from desire or passion, there must come into play a salvific event or savior—that is, Christ, the Logos, the “messenger,” etc.—who descends to the material realm for the purpose of negating all passion, and raising the innocent human “sparks” (which fell from Sophia) back up to the Pleroma (cf. Apocryphon of John [Codex II] 9:25-25:14 ff.).

This process of re-integration with/in the godhead is one of the basic features of the Gnostic myth. The purpose of this re-integration (implicitly) is to establish a series of existents that are ontologically posterior to Sophia, and are the concrete embodiment of her “disruptive” desire—within the unified arena of the Pleroma. Indeed, if the Pleroma is really the Fullness, containing all things, it must contain the manifold principles of Wisdom’s longing. In this sense, we must not view Gnostic salvation as a simply one-sided affair. The divine “sparks” that fell from Sophia, during her “passion,” are un-integrated aspects of the godhead. We may say, then, that in the Hegelian sense the Gnostic Supreme God is seeking, eternally, His own actualization by way of full self-consciousness (cf. G.W.F. Hegel, History of Philosophy vol. 2, pp. 396-399).

But it is not really this simple. The Supreme God of the Gnostics effortlessly generates the Pleroma, and yet (or for this very reason!) this Pleroma comes to act independently of the Father. This is because all members of the Pleroma (known as Aeons) are themselves “roots and springs and fathers” (Tripartite Tractate 68:10) carrying Time within themselves, as a condition of their Being. When the disruption, brought about by the desire of Sophia, disturbed the Pleroma, this was not understood as a disturbance of an already established unity, but rather as the disturbance of an insupportable stasis that had come to be observed as divine. Indeed, when the Greeks first looked to the sky and admired the regularity of the rotations of the stars and planets, what they were admiring, according to the Gnostics, was not the image of divinity, but the image or representation of a “divine” stagnancy, a law and order that stifled freedom, which is the root of desire (cf. Jonas, pp. 260-261). The passion of Sophia—her production of the Demiurge, his enslavement of the human “sparks” in the material cosmos, and the subsequent redemption and restoration—are but one episode in the infinite, unfolding drama of spiritual existence. We, as human beings, just happen to be the unwitting victims of this particular drama. But if, as the Gnostics hold, our salvation consists in our becoming gods (Poimandres 26) or “lord[s] over creation and all corruption” (Valentinus, Fragment F, Layton) then how are we to be confident that, in ages to come, one of us will not give birth to another damned cosmos, just as Sophia had done?

b. Christian Gnosticism

The Christian idea that God has sent his only “Son” (the Logos) to suffer and die for the sins of all humankind, and so make possible the salvation of all, had a deep impact on Gnostic thought. In the extensive and important collection of Gnostic writings discovered at Nag Hammadi, Egypt in 1945, only a handful present the possibility of having originated in a pre-Christian, mostly Hellenistic Jewish milieu. The majority of these texts are Christian Gnostic writings from the early second to late third centuries CE, and perhaps a bit later. When we consider the notion of salvation and its meaning for the early Gnostics, who stressed the creative aspect of our post-salvific existence, we are struck by the bold assertion that our need for salvation arose, in the first place, from an error committed by a divine being, Sophia (Wisdom), during the course of her own creative act (cf. Apocryphon of John [Codex II] 9:25-10:6). Since this is the case, how, we are led to ask, will our post-salvation existence be any less prone to error or ignorance, even evil? The radical message of early Christianity provided the answer to this problematical question; and so the Gnostics took up the Christian idea and transformed it, by the power of their singular mytho-logical technique, into a philosophically and theologically complex speculative schema.

i. Basilides

The Christian philosopher Basilides of Alexandria (fl. 132-135 CE) developed a cosmology and cosmogony quite distinct from the Sophia myth of classical Gnosticism, and also reinterpreted key Christian concepts by way of the popular Stoic philosophy of the era. Basilides began his system with a “primal octet” consisting of the “unengendered parent” or Father; Intellect (nous); the “ordering principle” or “Word” (logos); “prudence” (phronêsis); Wisdom (sophia); Power (dunamis) (Irenaeus, Against Heresies 1.24.3, in Layton, The Gnostic Scriptures 1987) and “justice” and “peace” (Basilides, Fragment A, Layton). Through the union of Wisdom and Power, a group of angelic rulers came into existence, and from these rulers a total of 365 heavens or aeons were generated (Irenaeus 1.24.3). Each heaven had its own chief ruler (arkhôn), and numerous lesser angels. The final heaven, which Basilides claimed is the realm of matter in which we all dwell, was said by him to be ruled by “the god of the Jews,” who favored the Jewish nation over all others, and so caused all manner of strife for the nations that came into contact with them—as well as for the Jewish people themselves. This behavior caused the rulers of the other 364 heavens to oppose the god of the Jews, and to send a savior, Jesus Christ, from the highest realm of the Father, to rescue the human beings who are struggling under the yoke of this jealous god (Irenaeus 1.24.4). Since the realm of matter is the sole provenance of this spiteful god, Basilides finds nothing of value in it, and states that “[s]alvation belongs only to the soul; the body is by nature corruptible” (Irenaeus 1.24.5). He even goes so far as to declare, contra Christian orthodoxy, that Christ’s death on the cross was only apparent, and did not actually occur “in the flesh” (Irenaeus 1.24.4)—this doctrine came to be called docetism.

The notion that material existence is the product of a jealous and corrupt creator god, who favors one race over all others, is really the “mythical” expression of a deeply rooted ethical belief that the source of all evil is material or bodily existence. Indeed, Basilides goes so far as to assert that sin is the direct outcome of bodily existence, and that human suffering is the punishment either for actual sins committed, or even just for the general inclination to sin, which arises from the bodily impulses (cf. Fragments F and G). In an adaptation of Stoic ethical categories, Basilides declares that faith (pistis) “is not the rational assent of a soul possessing free will” (Fragment C); rather, faith is the natural mode of existence, and consequently, anyone living in accordance with the “law of nature” (pronoia), which Basilides calls the “kingdom,” will remain free from the bodily impulses, and exist in a state of “salvation” (Fragment C). However, Basilides goes beyond simple Stoic doctrine in his belief that the “elect,” that is, those who exist by faith, “are alien to the world, as if they were transcendent by nature” (Fragment E); for unlike the Stoics, who believed in a single, material cosmos, Basilides held the view, as we have seen, that the cosmos is composed of numerous heavens, with the material realm as the final heaven, and consequently corrupt. Since this final heaven represents the “last gasp” of divine emanation, as it were, and is by no means a perfect image of true divinity, adherence to its laws can lead to no good. Further, since the body is the means by which the ruler of this material cosmos enforces his law, freedom can only be attained by abandoning or “becoming indifferent to” all bodily impulses and desires. This indifference (adiaphoria) to bodily impulses, however, does not lead to a simple stagnant asceticism. Basilides does not call upon his hearers to abandon the material realm only to dissolve into negativity; instead, he offers them a new life, by appealing to the grand hierarchy of rulers persisting above the material realm (cf. Fragment D). When one turns to the greater hierarchy of Being, there results a “creation of good things” (Fragment C, translation modified). Love and personal creation—the begetting of the Good—are the final result of Basilides’ vaguely dialectical system, and for this reason it is one of the most important early expressions of a truly Christian, if not “orthodox,” philosophy.

ii. Marcion

Marcion of Sinope, in Pontus, was a contemporary of Basilides. According to Tertullian, he started his career as an orthodox Christian—whatever that meant at such an early stage of development of Christian doctrine—but soon formulated the remarkable and radical doctrine that was to lead to his excommunication from the Roman Church in July 144 CE, the traditional date of the founding of the Marcionite Church (Tertullian, Against Marcion 1.1; cf. Kurt Rudolph, Gnosis 1984, p. 314). The teaching of Marcion is elegantly simple: “the God proclaimed by the law and the prophets is not the Father of Our Lord Jesus Christ. The God (of the Old Testament) is known, but the latter (the Father of Jesus Christ) is unknown. The one is just, but the other is good” (Irenaeus 1.27.1). Marcion believed that this cosmos in which we live bears witness to the existence of an inflexible, legalistic, and sometimes spiteful and vengeful God. This view arose from a quite literal reading of the Old Testament, which does contain several passages describing God in terms not quite conducive to divinity—or at least to the idea of the divine that was current in the Hellenistic era. Marcion then, following Paul (in Romans 1:20) declared that God is knowable through His creation; however, unlike Paul, Marcion did not take this “natural revelation” as evidence of God’s singularity and goodness. Quite the contrary, Marcion believed that he knew the God of this realm all too well, and that He was not worthy of the devotion and obedience that He demanded. Therefore, Marcion rejected the teaching of the orthodox Christian Church of his era, that Yahweh (or Jehovah) is the Father of Christ, and, through a creative excision of what he termed “Judaistic interpolations” in Luke and ten Pauline Epistles, Marcion simultaneously put forth his notion of the “alien God” and His act of salvation, and established the first Canon of Scripture used in a “Christian” Church (Jonas, pp. 145-146).

Marcion was not a philosopher in the sense that term has come to imply. He never developed, as far as we can tell from the surviving evidence, a systematic metaphysical, cosmological, or anthropological theory in the manner of a Basilides or a Valentinus (whom we shall discuss below), nor did he appeal to history as a witness for his doctrines. This latter point is the most important. Unlike the majority of Gnostics, who elaborated some sort of divine genealogy (e.g., the Sophia myth) to account for the presence of corruption and strife in the world, Marcion simply posited two opposed and irreducible Gods: the biblical god, and the unknown or “alien” God, who is the Father of Christ. According to Marcion, the god who controls this realm is a being who is intent on preserving his autonomy and power even at the expense of the (human) beings whom he created. The “alien” God, who is the Supremely Good, is a “god of injection,” for he enters this realm from outside, in order to gratuitously adopt the pitiful human beings who remain under the sway of the inferior god as His own children. This act is the origin of and reason for the Incarnation of Christ, according to Marcion.

In spite of the absence of any solid philosophical or theological foundation for this rather simple formulation, Marcion’s idea nevertheless expresses, in a somewhat crude and immediate form, a basic truth of human existence: that the desires of the Mind are incommensurable with the nature of material existence (cf. Irenaeus 1.27.2-3). Yet, if we follow Marcion’s argument to its logical (or perhaps “anti-logical”) conclusion, we discover an existential expression (not a philosophy) of the primal feeling of “abandonment” (Geworfenheit). This expression plays upon the subtle yet poignant opposition of “love of wisdom” (philosophia) and “complete wisdom” (plêrosophia). We are alone in a world that does not lend itself to our quest for unalterable truth, and so we befriend wisdom, which is the way of or manner in which we attain this intuited truth. According to Marcion, this truth is not to be found in this world—all that is to be found is the desire for this truth, which arises amongst human beings. However, since this desire, on the part of human beings, only produces various philosophies, none of which can hold claim to the absolute truth, Marcion concludes that the noetic beings (humans) of this realm are capable of nothing more than a shadow of wisdom. It is only by way of the guidance and grace of an alien and purely good God that humankind will rise to the level of plêrosophia or complete wisdom (cf. Colossians 2:2 ff.). Moreover, instead of attempting to discover the historical connection between the revelation of Christ and the teachings of the Old Testament, Marcion simply rejected the latter in favor of the former, on the belief that only the Gospel (thoughtfully edited by Marcion himself) points us toward complete wisdom (Irenaeus 1.27.2-3; Tertullian, Against Marcion 4.3).

While other Christian thinkers of the era were busy allegorizing the Old Testament in order to bring it into line with New Testament teaching, Marcion allowed the New Testament (albeit in his own special version) to speak to him as a singular voice of authority—and he formulated his doctrine accordingly. This doctrine emphasized not only humankind’s radical alienation from the realm of their birth, but also their lack of any genealogical relation to the God who sacrificed His own Son to save them—in other words, Marcion painted a picture of humanity as a race displaced, with no true home at all (cf. Giovanni Filoramo, A History of Gnosticism 1992, p. 164). The hope of searching for a lost home, or of returning to a home from which one has been turned out, was absent in the doctrine of Marcion. Like Pico della Mirandola, Marcion declared the nature of humankind to be that of an eternally intermediate entity, poised precariously between heaven and earth (cp. Pico della Mirandola, Oration on the Dignity of Man, 3). However, unlike Pico, Marcion called for a radical displacement of humankind—a “rupture”—in which humanity would awaken to its full (if not innate) possibilities.

iii. Valentinus and the Valentinian School

The great Christian teacher and philosopher Valentinus (ca. 100-175 CE) spent his formative years in Alexandria, where he probably came into contact with Basilides. Valentinus later went to Rome, where he began his public teaching career, which was so successful that he actually had a serious chance of being elected Bishop of Rome. He lost the election, however, and with it Gnosticism lost the chance of becoming synonymous with Christianity, and hence a world religion. This is not to say that Valentinus failed to influence the development of Christian theology—he most certainly did, as we shall see below. It was through Valentinus, perhaps more than any other Christian thinker of his time, that Platonic philosophy, rhetorical elegance, and a deep, interpretive knowledge of scripture became introduced together into the realm of Christian theology. The achievement of Valentinus remained unmatched for nearly a century, until the incomparable Origen came on the scene. Yet even then, it may not be amiss to suggest that Origen never would have “happened” had it not been for the example of Valentinus.

The cosmology of Valentinus began, not with a unity, but with a primal duality, a dyad, composed of two entities called “the Ineffable” and “Silence.” From these initial beings a second dyad of “Parent” and “Truth” was generated. These beings finally engendered a quaternity of “Word” (logos), “Life” (zôê), “Human Being” (anthropos), and “Church” (ekklêsia). Valentinus refers to this divine collectivity as the “first octet” (Irenaeus 1.11.1). This octet produced several other beings, one of which revolted or “turned away,” as Irenaeus tells us, and set in motion the divine drama that would eventually produce the cosmos. According to Irenaeus, who was writing only about five years after the death of Valentinus, and in whose treatise Against Heresies the outline of Valentinus’ cosmology is preserved, the entity responsible for initiating the drama is referred to simply as “the mother,” by which is probably meant Sophia (Wisdom). From this “mother” both matter (hulê) and the savior, Christ, were generated. The realm of matter is described as a “shadow,” produced from the “mother,” and from which Christ distanced himself and “hastened up into the fullness” (Irenaeus 1.11.1; cp. Poimandres 5). At this point the “mother” produced another “child,” the “craftsman” (dêmiourgos) responsible for the creation of the cosmos. In the account preserved by Irenaeus, we are told nothing of any cosmic drama in which “divine sparks” are trapped in fleshly bodies through the designs of the Demiurge. However, it is to be assumed that Valentinus did expound an anthropology similar to that of the classical Sophia myth (as represented, for example, in the Apocryphon of John; cf. also The Hypostasis of the Archons, and the Apocalypse of Adam), especially since his school, as represented most significantly by his star pupil Ptolemy (see below), came to develop a highly complex anthropological myth that must have grown out of a simpler model provided by Valentinus himself. The account preserved in Irenaeus ends with a description of a somewhat confused doctrine of a heavenly and an earthly Christ, and a brief passage on the role of the Holy Spirit (Irenaeus 1.11.1). From this one gets the idea that Valentinus was flirting with a primitive doctrine of the Trinity. Indeed, according to the fourth century theologian Marcellus of Ancyra, Valentinus was “the first to devise the notion of three subsistent entities (hypostases), in a work that he entitled On the Three Natures” (Valentinus, Fragment B, Layton).

Valentinus was certainly the most overtly Christian of the Gnostic philosophers of his era. We have seen how the thought of Basilides was pervaded by a Stoicizing tendency, and how Marcion felt the need to go beyond scripture to posit an “alien” redeemer God. Valentinus, on the other hand, seems to have been informed, in his speculations, primarily by Jewish and Christian scripture and exegesis, and only secondarily by “pagan” philosophy, particularly Platonism. This is most pronounced in his particular version of the familiar theological notion of “election” or “pre-destination,” in which it is declared (following Paul in Romans 8:29) that God chose certain individuals, before the beginning of time, for salvation. Valentinus writes, in what is probably a remnant of a sermon:

From the beginning you [the “elect” or Gnostic Christians] have been immortal, and you are children of eternal life. And you wanted death to be allocated to yourselves so that you might spend it and use it up, and that death might die in you and through you. For when you nullify the world and are not yourselves annihilated, you are lord over creation and all corruption (Valentinus, Fragment F).

This seems to be Valentinus’ response to the dilemma of the permanence of salvation: since Sophia or the divine “mother,” a member of the Pleroma, had fallen into error, how can we be sure that we will not make the same or a similar mistake after we have reached the fullness? By declaring that it is the role and task of the “elect” or Gnostic Christian to use up death and nullify the world, Valentinus is making clear his position that these elite souls are fellow saviors of the world, along with Jesus, who was the first to take on the sin and corruption inherent in the material realm (cf. Irenaeus 1.11.1; and Layton p. 240). Therefore, since “the wages of sin is death” (Romans 6:23), any being who is capable of destroying death must be incapable of sin. For Valentinus, then, the individual who is predestined for salvation is also predestined for a sort of divine stewardship that involves an active hand in history, and not a mere repose with God, or even a blissful existence of loving creation, as Basilides held. Like Paul, Valentinus demanded that his hearers recognize their createdness. However, unlike Paul, they recognized their creator as the “Ineffable Parent,” and not as the God of the Hebrew Scriptures. The task of Christian hermeneutics after Valentinus was to prove the continuity of the Old and New Testament. In this regard, as well as in the general spirituality of his teaching—not to mention his primitive trinitarian doctrine—Valentinus had an incalculable impact on the development of Christianity.

1) The System of Ptolemy

Ptolemy (or Ptolemaeus, fl. 140 CE) was described by St. Irenaeus as “the blossom of Valentinus’ school” (Layton, p. 276). We know next to nothing about his life, except the two writings that have come down to us: the elaborate Valentinian philosophical myth preserved in Irenaeus, and Ptolemy’s Epistle to Flora, preserved verbatim by St. Epiphanius. In the former we are met with a grand elaboration, by Ptolemy, of Valentinus’ own system, which contains a complex anthropological myth centering around the passion of Sophia. We also find, in both the myth and the Epistle, Ptolemy making an attempt to bring Hebrew Scripture into line with Gnostic teaching and New Testament allegorization in a manner heretofore unprecedented among the Gnostics.

In the system of Ptolemy we are explicitly told that the cause of Sophia’s fall was her desire to know the ineffable Father. Since the purpose of the Father’s generating of the Aeons (of which Sophia was the last) was to “elevate all of them into thought” (Irenaeus 1.2.1) it was not permitted for any Aeon to attain a full knowledge of the Father. The purpose of the Pleroma was to exist as a living, collective expression of the intellectual magnitude of the Father, and if any single being within the Pleroma were to attain to the Father, all life would cease. This idea is based on an essentially positive attitude toward existence—that is, existence understood in the sense of striving, not for a reposeful end, but for an ever-increasing degree of creative or “constitutive” insight. The goal, on this view, is to produce through wisdom, and not simply to attain wisdom as an object or end in itself. Such an existence is not characterized by desire for an object, but rather by desire for the ability to persist in creative, constitutive engagement with/in one’s own “circumstance” (circumscribed stance or individual arena). When Sophia desired to know the Father, then, what she was desiring was her own dissolution in favor of an envelopment in that which made her existence possible in the first place. This amounted to a rejection of the gift of the Father—that is, of the gift of individual existence and life. It is for this reason that Sophia was not permitted to know the Father, but was turned back by the “boundary” (horos) that separates the Pleroma from the “ineffable magnitude” of the Father (Irenaeus 1.2.2).

The remainder of Ptolemy’s account is concerned with the production of the material cosmos out of the hypostatized “passions” of Sophia, and the activity of the Savior (Jesus Christ) in arranging these initially chaotic passions into a structured hierarchy of existents (Irenaeus 1.4.5 ff., and cp. Colossians 1:16). Three classes of human beings come into existence through this arrangement: the “material” (hulikos), the “animate” (psukhikos), and the “spiritual” (pneumatikos). The “material” humans are those who have not attained to intellectual life, and so place their hopes only upon that which is perishable—for these there is no hope of salvation. The “animate” are those who have only a half-formed conception of the true God, and so must live a life devoted to holy works, and persistence in faith; according to Ptolemy, these are the “ordinary” Christians. Finally, there are the “spiritual” humans, the Gnostics, who need no faith, since they have actual knowledge (gnôsis) of intellectual reality, and are thus saved by nature (Irenaeus 1.6.2, 1.6.4). The Valentinian-Ptolemaic notion of salvation rests on the idea that the cosmos is the concrete manifestation or hypostatization of the desire of Sophia for knowledge of the Father, and the “passions” her failure produced. The history of salvation, then, for human beings, has the character of an external manifestation of the threefold process of Sophia’s own redemption: recognition of her passion; her consequent “turning back” (epistrophê); and finally, her act of spiritual production, whence arose Gnostic humanity (cf. Irenaeus 1.5.1). Salvation, then, in its final form, must imply a sort of spiritual creation on the part of the Gnostics who attain the Pleroma. The “animate” humans, however, who are composed partly of corruptible matter and partly of the spiritual essence, must remain content with a simple restful existence with the craftsman of the cosmos, since no material element can enter the Pleroma (Irenaeus 1.7.1).

In his Epistle to Flora (in Epiphanius 33.3.1-33.7.10), which is an attempt to convert an “ordinary” Christian woman to his brand of Valentinian Christianity, Ptolemy clearly formulates his doctrine of the relation between the God of the Hebrew Scriptures, who is merely “just,” and the Ineffable Father, who is the Supreme Good. Rather than simply declaring these two gods to be unrelated, as did Marcion, Ptolemy develops a complex, allegorical reading of the Hebrew Scriptures in relation to the New Testament in order to establish a genealogy connecting the Pleroma, Sophia and her “passion,” the Demiurge, and the salvific activity of Jesus Christ. The scope and rigor of Ptolemy’s work, and the influence it came to exercise on emerging Christian orthodoxy, qualifies him as one of the most important of the early Christian theologians, both proto-orthodox and “heretical.”

c. Mani and Manichaeism

The world religion founded by Mani (216-276 CE) and known to history as Manichaeism has its roots in the East, borrowing elements from Persian dualistic religion (Zoroastrianism), Jewish Christianity, Buddhism, and even Mithraism. The system developed by Mani was self-consciously syncretistic, which was a natural outgrowth of his desire to see his religion reach the ends of the earth. This desire was fulfilled, and until the late Middle Ages, Manichaeism remained a world religion, stretching from China to Western Europe. It is now completely extinct. The religion began when its founder experienced a series of visions, in which the Holy Spirit supposedly appeared to him, ordering him to preach the revelation of Light to the ends of the earth. Mani came to view himself as the last in a series of great prophets including Buddha, Zoroaster, Jesus, and Paul (Rudolph, p. 339). His highly complex myth of the origin of the cosmos and of humankind drew on various elements culled from these several traditions and teachings. The doctrine of Mani is not “philosophical,” in the manner of Basilides, Valentinus or Ptolemy; for Mani’s teaching was not the product of a more or less rational or systematic speculation about the godhead, resulting in Gnosis, but the wholly creative product of what he felt to be a revelation from the divinity itself. It is for this reason that Mani’s followers revered him as the redeemer and holy teacher of humankind (Rudolph, p. 339). Since Manichaeism belongs more to the history of religion than to philosophy proper (or even the fringes of philosophy, as does Western Gnosticism), it will suffice to say only a few words about the system, if for no other reason than that the great Christian philosopher Augustine of Hippo had followed the Manichaen religion for several years, before converting to Christianity (cf. Augustine, Confessions III.10).

The main point of distinction between the doctrine of Mani and the Western branch of Gnosticism (Basilides, Valentinus, etc.), is that in Manichaeism the “cosmology is subservient to the soteriology” (Rudolph, p. 336). This means, essentially, that Mani began with a fundamental belief about the nature of humanity and its place in the cosmos, and concocted a myth to explain the situation of humankind, and the dynamics of humanity’s eventual salvation. The details of the cosmology were apparently not important, their sole purpose being to illustrate, poetically, the dangers facing the souls dwelling in this “realm of darkness” as well as the manner of their redemption from this place. The Manichaean cosmology began with two opposed first principles, as in Zoroastrianism: the God of Light, and the Ruler of Darkness. This Darkness, being of a chaotic nature, assails the “Kingdom of Light” in an attempt to overthrow or perhaps assimilate it. The “King of the Paradise of Light,” then, goes on the defensive, as it were, and brings forth Wisdom, who in her turn gives birth to the Primal Man, also called Ohrmazd (or Ahura-Mazda). This Primal Man possesses a pentadic soul, consisting of fire, water, wind, light, and ether. Armored with this soul, the Primal Man descends into the Realm of Darkness to battle with its Ruler. Surprisingly, the Primal Man is defeated, and his soul scattered throughout the Realm of Darkness. However, the Manichaeans understood this as a plan on the part of the Ruler of Light to sow the seeds of resistance within the Darkness, making possible the eventual overthrow of the chaotic realm. To this end, a second “Living Spirit” is brought forth, who was also called Mithra. This being, and his partner, “Light-Adamas,” set in motion the history of salvation by putting forth the “call” within the realm of darkness, which recalls the scattered particles of light (from the vanquished soul of Ohrmazd). These scattered particles “answer” Mithra, and the result is the formation of the heavens and earth, the stars and planets, and finally, the establishment of the twelve signs of the zodiac and the ordered revolution of the cosmic sphere, through which, by a gradual process, the scattered particles of light will eventually be returned to the Realm of Light. The Manichaeans believed that these particles ascend to the moon, and that when the moon is full, it empties these particles into the sun, from whence they ascend to the “new Aeon,” also identified with Mithra, the “Living Spirit” (Rudolph, pp. 336-337). This process will continue throughout the ages of the world, until all the particles eventually reach their proper home and the salvation of the godhead is complete.

It should be clear from this brief exposition that humanity as such does not hold the prime place in the salvific drama of Manichaeism, but rather a part of the godhead itself—that is, the scattered soul of Ohrmazd. The purpose of humanity in this scheme is to aid the particles of light in their ascent to the godhead. Of course, these particles dwell within every living thing, and so the salvation of these particles is the salvation of humanity, but only by default, as it were; humanity does not hold a privileged position in Manichaeism, as it does in the Western or strictly Christian Gnostic schools. This belief led the Manichaeans to establish strict dietary and purity laws, and even to require selected members of their church to provide meals for the “Elect,” so that the latter would not become defiled by harming anything containing light particles. All of this, however, is a long way from philosophy. Hans Jonas was right to describe Manichaeism as representing “a more archaic level of gnostic thought” (Jonas, p. 206). Now that we have examined one of the non-philosophical directions taken by Gnostic thought, let us proceed to discuss its role in the philosophical development of the era.

3. Platonism and Gnosticism

Long before the advent of Gnosticism, Plato had posited two contrary World Souls: one “which does good” and one “which has the opposite capacity” (Plato, Laws X. 896e, tr. Saunders). For Plato, this did not imply that the cosmos is under the control of a corrupt or ignorant god, as it did for the Gnostics, but simply that this cosmos, like the human soul, possesses a rational and an irrational part, and that it is the task of the rational part to govern the irrational. The question arose, however, among Platonists, regarding Plato’s true position on this matter. Was he declaring that a part of the cosmos is evil? or that the divine Demiurge (who, in the highly influential Timaeus account, is said to have crafted the cosmos) actually produced an evil soul? Both of these conjectures flew in the face of everything that the ancient thinkers believed about the cosmos—that is, that it was divine, orderly, and perfect. A common solution, among both Platonists and Pythagoreans, was to interpret the second or “evil” Soul as Matter, that is, the material or generative principle, which is the opposite of the truly divine and unchanging Forms. The purpose of the Intellectual principle, or the “good” Soul, is to bring this disorderly principle under the control of reason, and thereby maintain an everlasting but not eternal cosmos (cf. Timaeus 37d). Since the cosmos, according to Plato in the Timaeus, cannot be as perfect as the eternal image upon which it is founded, a generative principle is necessary to maintain the “living creature” (which is precisely how the cosmos is described), and therefore not really “evil,” even though it possesses the “opposite capacity” (generation, and hence, corruption) from that of the Good or Rational Soul.

a. Numenius of Apamea and Neo-Platonism

Several centuries after Plato, around the time when the great Gnostic thinkers like Valentinus and Ptolemy were developing their systems, we encounter the Platonic philosopher Numenius of Apamea (fl. 150 CE). The main ideas of Numenius’ philosophy, preserved in the fragments of his writings that survive, bear clear traces of Gnostic influence. His cosmology describes, in language strikingly similar to that of the Gnostics, the degradation of the divine dêmiourgos upon his contact with pre-existent Matter (hulê, or the “indefinite” principle):

[I]n the process of coming into contact with Matter, which is the Dyad, [the Demiurge] gives unity to it, but is Himself divided by it, since Matter has a character prone to desire [epithumêtikon êthos] and is in flux. So in virtue of not being in contact with the Intelligible (which would mean being turned in upon Himself), by reason of looking towards Matter and taking thought for it, He becomes unregarding (aperioptos) of Himself. And he seizes upon the sense realm and ministers to it and yet draws it up to His own character, as a result of this yearning towards Matter [eporexamenos tês hulês] (Numenius, Fragment 11, in Dillon 1977, The Middle Platonists, pp. 367-368).

In this fragment, Numenius is transferring a basic Gnostic anthropological idea into the realm of cosmology. It is a common feature of Gnostic systems to describe the individual human soul’s contact with the material realm as resulting in a forgetting of the soul’s true origin. Platonism, also, warned against the soul’s becoming too attached to the realm of the senses, since this realm is changing and illusory, and does not accurately reflect the divinity. However, neither Platonism nor Gnosticism described such a danger as affecting, in any way, the Demiurge; for the Gnostics declared the Demiurge to be just as much a part of the cosmos as he was its ruler, and the orthodox Platonists located the Demiurge outside the cosmos, declaring the cosmos to be self-sufficient (following Timaeus 34b). Numenius, however, went further and bridged the gap between the sensible cosmos and the Intelligible Realm by linking the Demiurge to the latter by way of contemplation, and to the former by way of his “desire” (orexis) for matter. In Fragment 18, Numenius tells us that the Demiurge derives his “critical faculty” (kritikon) from his contemplation of the Good, and his “impulsive faculty” (hormêtikon) from his attachment to Matter (Dillon, p. 370). This idea seems to foreshadow Plotinus’ doctrine that the individual soul will always take on certain characteristics of Matter, and that these characteristics manifest themselves in the form of sense perceptions that must be brought under the controlling influence of rational judgment (cf. Enneads I.8.9 and I.1.7). Unlike Plotinus, however, who leaves the World-Soul or active part of the Demiurge safely beyond the affective cosmic realm, Numenius posits a Demiurge that is both transcendent and immanent, and arrives at a doctrine of a cosmos that, even on the highest level—the level of the celestial bodies—is not devoid of evil influence, since even the Demiurge, the highest cosmic deity, is infected by the tainting influence of Matter. “This importation of evil into the celestial realm is surely more Gnostic than Platonist, and did not comment itself to such successors as Plotinus or Porphyry, though it does seem to be accepted by Iamblichus” (Dillon, p. 374).

Plotinus, during the height of his teaching career at Rome (ca. 255 CE), composed a treatise “Against Those Who Declare the Creator of This World, and the World Itself, to be Evil,” also known, simply, as “Against the Gnostics” (Ennead II.9) in which he argues for the divinity and goodness of the cosmos, and upholds the ancient Greek belief in the divinity of the stars and planets, declaring them to be our “noble brethren,” and responsible only for the good things that befall humankind. Porphyry, in his Life of Plotinus, tells us that Plotinus commissioned him, along with his fellow student Amelius, to write more treatises attacking the Gnostics on points that Plotinus skipped over (Porphyry, Life of Plotinus 16). Porphyry also mentions by name two Gnostic treatises that were discovered in Egypt in 1945, and are now readily available to scholars: Zostrianos, and Allogenes, in the Nag Hammadi Collection of Codices. These texts, as well as the Tripartite Tractate (also in the Nag Hammadi Collection) show how tightly Platonism and Gnosticism were intertwined in the early centuries of our era.

4. Concluding Summary

Gnosticism began with the same basic, pre-philosophical intuition that guided the development of Greek philosophy—that there is a dichotomy between the realm of true, unchanging Being, and ever-changing Becoming. However, unlike the Greeks, who strived to find the connection between and overall unity of these two “realms,” the Gnostics amplified the differences, and developed a mytho-logical doctrine of humankind’s origin in the realm of Being, and eventual fall into the realm of darkness or matter, that is, Becoming. This general Gnostic myth came to exercise an influence on emerging Christianity, as well as upon Platonic philosophy, and even, in the East, developed into a world religion (Manichaeism) that spread across the known world, surviving until the late Middle Ages. In the twentieth century, there began a renewed interest in Gnostic ideas, particularly in the pioneering work of Hans Jonas, the Existentialist philosopher and student of Martin Heidegger. The psychologist Carl Jung, as well, drew upon Gnostic motifs in his theoretical work, and the increasing emphasis on Hermeneutics in late twentieth century thought owes something to the analyses of Gnostic myth and exegesis done by Harold Bloom, Paul Ricoeur, and others.

More than any of these accomplishments, however, it was the discovery in 1945, in Egypt, of a large collection of Coptic Gnostic codices, now known as the Nag Hammadi Collection, or the Nag Hammadi Library. This collection contains works of the Valentinian School, as well as of many earlier and contemporaneous sects, and sheds much needed light on the nature and structure of what to this day is still called, with some reservations, the Gnostic Religion. The study of this library has led certain scholars to question the existence of any unified movement called “Gnosticism” or the “Gnostic Religion.” Michael Allen Williams, in 1996, published a book entitled Rethinking “Gnosticism”: An Argument For Dismantling A Dubious Category (Princeton University Press 1996). Through a detailed study of numerous texts of the Nag Hammadi Collection, Williams attempts to show that the extreme diversity underlying the texts that many scholars have lumped together under the catch-all phrase of “Gnosticism,” casts doubt on the existence of anything like a Gnostic religion. Moreover, he argues, such a wholesale consignment of these texts to what is, in fact, a modern designation, blinds us to the deeper meaning of these diverse intellectual monuments. It should be noted, however, that the early Church Fathers, like Clement of Alexandria, Irenaeus, Origen, Hippolytus, Epiphanius, and even “pagan” philosophers like Plotinus and Porphyry, who have preserved for us accounts and occasionally some original documents of philosophers and theologians whom they term “Gnostic,” were also contemporaries or near contemporaries of many of the figures and schools that they criticize and interpret. The insights of these writers, then, who were living and working side by side, and almost always in conflict with, members of the Gnostic sects, should be given priority over any modern attempts to revise our understanding of what Gnosticism is.

5. References and Further Reading

a. Sources

  • Dillon, John (1977). “Numenius of Apamea” in The Middle Platonists (Cornell University Press).
  • Filoramo, Giovanni. A History of Gnosticism, tr. Anthony Alcock (Oxford: Blackwell Publishers 1990, 1992).
  • Hegel, G.W.F. “The Gnostics” in Lectures on the History of Philosophy, vol 2. “Plato and the Platonists,” tr. E.S. Haldane and Frances H. Simson (University of Nebraska Press; Bison Books Edition 1995).
  • Jonas, Hans (1958, 2001). The Gnostic Religion: The Message of the Alien God and the Beginnings of Christianity (Boston: Beacon Press).
  • Layton, Bentley (1987). The Gnostic Scriptures (Doubleday: The Anchor Bible Reference Library).
  • Plato. Laws, tr. Trevor J. Saunders, in Plato: Complete Works, ed. John M. Cooper (Indianapolis: Hackett Publishing 1997).
  • Plato. Timaeus, tr. Donald J. Zeyl, in Plato: Complete Works.
  • Plotinus. The Enneads, tr. A.H. Armstrong, in 7 volumes (Harvard: Loeb Classical Library 1966).
  • Ricoeur, Paul. The Conflict of Interpretations (Northwestern University Press 1974).
  • Rudolph, Kurt. Gnosis: The Nature and History of Gnosticism, tr. Robert McLachlan Wilson (Edinburgh: T. and T. Clark Ltd. 1984).
  • Segal, Robert A. (ed.) The Gnostic Jung (Princeton University Press 1992).

b. Suggestions for Further Reading

  • Barnstone, Willis (1984 ed.) The Other Bible (Harper San Francisco).
  • Bultmann, Rudolph (1956). Primitive Christianity in its Contemporary Setting (New York: Meridian Books).
  • Fideler, David (1993). Jesus Christ, Sun of God: Ancient Cosmology and Early Christian Symbolism (Wheaton, Illinois: Quest Books).
  • Pagels, Elaine (1975). The Gnostic Paul: Gnostic Exegesis of the Pauline Letters (Philadelphia: Trinity Press).
  • Williams, Michael Allen. Rethinking “Gnosticism”: An Argument For Dismantling A Dubious Category (Princeton University Press 1996).

Author Information

Edward Moore
Email: emoore@theandros.com
St. Elias School of Orthodox Theology
U. s. A.

The Garden of Epicurus

A garden near the city of Athens, owned and used by the philosopher Epicurus and his followers. It became a symbol of Epicurean philosophy.

Table of Contents

  1. Location and Use
  2. References and Further Reading

1. Location and Use

In 307/306 BCE the Athenian philosopher Epicurus bought a house with a garden just outside Athens along the road from the Dipylon gate to the Academy (CiceroDe Finibus 5.1.3). Other great founders of philosophical schools had chosen public areas for their teaching: Plato established his school near the Academy, Isocrates and Aristotle taught in the Lyceum, Zeno often met his students in the Stoa Poecile. In contrast, Epicurus’ hedonistic and materialistic philosophy flourished and grew amidst the privately owned groves of his Garden. The Garden itself – apart from the city, a private space, and pleasurable – became a symbol for the detachment and hedonism of the Epicurean school. Nothing of the Garden’s layout is known, but its closeness to the canalized Eridanus River must have provided plentiful water for irrigation of its trees and plants. After Epicurus’ death the Garden was passed down to his followers (Diogenes Laertius, 10.10 and 10.17). We may imagine that Epicureans seeking relief from the disturbances of the city gathered in the Garden’s groves for many centuries.

2. References and Further Reading

  • Furley, David John. “Epicurus” in the Oxford Classical Dictionary. Third Edition. Oxford 1996.
  • Wycherley, R.E. The Stones of Athens. Princeton 197

Author Information

William Morison
Email: morisonw@gvsu.edu
Grand Valley State University

Ge Hong (Ko Hung, 283—343 C.E.)

Ge_HongGe Hong was an eclectic philosopher who dedicated his life to searching for physical immortality, which he thought was attainable through alchemy. He lived during China’s tumultuous Period of Disunity (220-589 C.E.), a time in which alien conqueror regimes ruled northern China, the cradle of Chinese civilization, while a series of weak, transplanted Chinese states occupied recently colonized southern China. These political conditions, along with the social chaos they engendered, no doubt gave rise to Ge Hong’s ardent desire to establish order and permanency in both his spiritual and secular worlds. His most important contribution to Chinese philosophy was his attempt to reconcile an immortality-centered Daoism with Confucianism. Equally important, to establish political order, he also tried to reconcile Legalism with Confucianism. His penetrating insight was that the teachings of no one school could solve the problems that his world faced – only a combination of the best methods of each could do so.

Table of Contents

  1. The Life of Ge Hong
  2. Immortality
  3. Reconciliation of Daoism and Confucianism
  4. Confucianism and Legalism
  5. The Importance of Broad Knowledge
  6. Conclusion
  7. References and Further Reading

1. The Life of Ge Hong

In 283 C.E., Ge Hong was born into a southern magnate family whose native place was the Jurong district in Danyang prefecture, which was near Nanjing, in the southwest corner of present day Jiangsu province. Both his grandfather and father had reputations for broad learning and served as high ministers for the Wu state, which ruled over southeastern China from 220-280. Ge’s father continued to hold a number of middle level positions under the Western Jin dynasty (265-317) that briefly reunited China. Upon his father’s death in 296, Ge endured a period of relative poverty and lost his family’s extensive library due to civil strife. To educate himself, from this time on, he started copying books and reading voraciously. He began with the Confucian classics, but soon turned his attention to the various philosophical writings. Under the tutelage of Zheng Yin, who was both a Confucian classicist and a Daoist adept, Ge began his studies of the immortality arts. Zheng Yin himself was a disciple of Ge’s uncle, Ge Xuan (164-244), a Daoist adept who was reputed to have become an immortal.

Like other southern gentry, Ge Hong’s early career was spent in military positions. He had an extensive knowledge of martial affairs and was trained in the use of arms. In 303, at the age of twenty, he was called upon to organize and lead a militia in his native place against a rebel army, which he handily defeated. In a rare admission of violence for a Chinese literatus, he even relates that, with bow and arrow, two men and a horse died at his hands. In 305, after being promoted to the rank of a General Who Makes the Waves Submit, Ge tried to make his way to Luoyang, the capital. Although his autobiography tells us he did so “to look for unusual books,” he was probably also hoping to obtain a promotion. However, due to the Rebellion of the Eight Princes that was being fought throughout northern China, he never made it; instead, he wandered throughout southern China. To escape from the turmoil that was embroiling the rest of China, he finally accepted a position as a military councilor under his friend who was appointed to be the governor of Guangzhou (Canton), a port city in the far south.

After his patron was killed enroute to assuming the governorship, Ge refused many other military appointments and remained in Guangzhou for the next eight years, living the life of a recluse at nearby Mount Luofu. In 314, he returned to his native place of Jurong. There, he studied under another Daoist adept named Bao Qing (260-330), the former governor of Nanhai prefecture, who was so impressed that he gave Ge his eldest daughter in marriage. It was during this long period of reclusion that Ge wrote his two part magnum opus whose title bore his sobriquet: the Inner Chapters of the Master Who Embraces Simplicity and the Outer Chapters of the Master Who Embraces Simplicity. Ge used the Daoist Inner Chapters to substantiate the reality of immortality and convey the methods for realizing it, while the Confucian Outer Chapters describe the problems afflicting the secular world and his proposed solutions. In fact, until the fourteenth century, the Inner Chapters and Outer Chapters circulated independently from each other. With these two works, Ge aspired to establish his own school of philosophy. Also at this time, illustrative of his aspirations, Ge compiled hagiographical works entitled Biographies of Divine Transcedents and Biographies of Recluses.

With the establishment of the Eastern Jin dynasty in southern China in 317, the transplanted throne was eager to gain the allegiance of powerful southern gentry families; thus, men such as Ge Hong were showered with official appointments, which were usually more honorary than substantive in nature. In recognition of his past military successes, Ge himself was given the title of Marquis of the Region Within the Pass. Finally, in 326, Wang Dao (276-339), the prime minister, appointed him to a series of positions, ending with that of military advisor. By 332, due to his advanced age (he was 50) and desire to find ingredients for immortality elixirs, he begged to be given a post in northern part of present-day Vietnam. On his way there, the governor of Guangzhou, Deng Yue, detained him there indefinitely. He thus took up residence at nearby Mount Luofu where he engaged in immortality practices until his death in 343.

By his own admission, Ge Hong was a man that was out of sorts with his age. He was a southerner in an age where only northern émigrés were given posts of substance. Due to his lack of verbal eloquence, he could never obtain social prestige in the salons where men were prized for their ability to engage in “Pure Talk” – abstract philosophical discussions. Nor did his strong Confucian sense of morality sit well with the libertine tendencies of that prevailed among the northern émigrés. The outlet of his frustrations became his writings, in which he attacked the fashions and trends of his day and proposed his own vision of how people should obtain stability in an instable world.

2. Immortality

Ge Hong wholeheartedly believed that anyone, through unrelenting effort and study, could obtain immortality. One does not have to be either rich or powerful to do so; in fact, wealth and position are harmful because they inhibit one from attaining the necessary moral and physical serenity. Moreover, it is not up to the arbitrary decisions of deities to extend our lives – they are merely divine administrators who keep track of our sins and good deeds; consequently, sacrifices and prayers to them for this purpose are useless. Thus, whether one can obtain immortality is entirely based on his or her own diligence and determination. It was precisely for those educated people who wanted and were willing to work towards obtaining immortality that Ge wrote his Inner Chapters. The overriding importance that he attached to obtaining eternal life is evident in that the inner, and thereby his most important, chapters of his magnum opus were dedicated to the topic.

Ge Hong firmly believed that physical immortality was possible. This is because all things are permeated by the metaphysical oneness, xuan (the mystery), which creates and animates all things. Significantly, for Ge, xuan is synonymous with the words dao (the way, the ultimate reality) and yi (the one, the unity). In this light, he describes xuan in the following manner: it “carries within it the embryo of the Original One, it forms and shapes the two Principles (Yin and Yang); it exhales and absorbs the great Genesis, it inspires and transforms the multitude of species, it makes constellations go round, it shaped the primordial Darkness, it guides the wonderful mainspring of the universe, it exhales the four seasons … if one adds to it, it does not increase. If one takes away from it, it does not grow less. If something is given to it, it is not increased in glory. If something is taken from it, it does not suffer. Where the Mystery is present, joy is infinite; where the Mystery has departed, efficacy is exhausted and the spirit disappears” (Robinet, 82-83). In other words, the key to immortality is maintaining this everlasting oneness within oneself – if one cannot do so, he or she will soon die. The reason why people lose it is that they become attached through their desires to the outside world, thereby forgetting the jewel that resides within. As Ge put it, “The way of xuan is obtained within oneself, but is lost due to things outside oneself. Those who employ xuan are gods; those who forget it are merely [empty] vessels.”

How does one maintain the unity within oneself? For Ge Hong it had much to do with preserving, enhancing, and refining one’s qi, which for him embodied the metaphysical mystery. Qi, which originally meant “breath” or “vapor,” came to designate the vital energy that exists within and animates all things. As Ge Hong relates, “people reside within qi and qi resides within people. From heaven and earth down to the ten thousand things, each one requires qi to live. As for those who excel at circulating their qi, internally they are able to nourish their body; externally, they are able to repel illnesses.” Since each person receives a finite amount of qi at birth, he recommends various methods to retain and enhance it, which include breathing exercises, sexual techniques, calisthenics, dietary restrictions, and the ingestion of herbal medicines. Since none of these methods is infallible, he recommends that an adept should practice a number of them in combination with each other. By doing so, one protects oneself from manifold disasters, such as illnesses, demons, savage beasts and weapons, while also lessening desires, transforming the body, and extending one’s lifespan. These methods could even give their practitioners supernatural powers, such as curing illnesses, raising the dead, seeing the future, commanding gods and ghosts, forgoing food for years, and the ability to disappear.

Nevertheless, none of these techniques could permanently keep xuan within oneself. To do that, nothing was comparable with taking alchemically created medicines. Ge thus informs us that, “Even if one performs breathing exercises and calisthenics, as well as ingests herbal medicines, this can only extend the years of your lifespan, but it will not save you from death. Ingesting divine cinnabar will make your lifespan inexhaustible. You will last as long as heaven and earth, be able to travel on clouds and ride dragons, and ascend at will to the Heaven of Highest Clarity.” Alchemically derived medicines, the best of which contained either liquefied gold or reverted cinnabar, were able to have this marvelous effect because the substances from which they were made had shown themselves, through repeated firings in the alchemist’s stove, to be impervious to decay or dissolution. According to Ge, “As for forging of gold and cinnabar, the longer one burns them, the more marvelous their transformations. When gold enters the flames, even after one hundred firings, it will not disappear. If you bury it forever, it will never decay. If one ingests these two substances, they will refine that person’s body, and make it so that he or she will neither age nor die.” In other words, one makes one’s own body imperishable by ingesting imperishable things. Mechanically what happens is that, upon ingestion, these substances seep into one’s blood and qi, thereby making them stronger. Ge Hong calls this using an outside substance to fortify one’s self. The reason why herbs are inferior to gold and cinnabar is that they are perishable; thus, they lack the capacity to make the body imperishable. Unfortunately though, the ingredients for making these mineral medicines are difficult to obtain, the process of smelting them is arduous, and the ritual circumstances under which they must be made are elaborate; as a result, Ge Hong several times admits that he has never had enough resources to attempt to produce these superb formulas.

3.Reconciliation of Daoism and Confucianism

It is often said of premodern Chinese literati that they were Daoist at home while Confucian in the office. Ge Hong was in fact probably one of the first Chinese thinkers to consciously try to reconcile Confucianism and Daoism. As the division of his major work into inner and outer chapters indicates, he did so by asserting that Confucianism and Daoism addressed different aspects of life. Confucianism addressed the external world and provided means by which to ameliorate its many problems; Daoism concerned the inner world and provided means by which to attain immortality. As Ge succinctly put it, “For an extraordinarily talented person, what difficulty could there be in practicing both (Confucianism and Daoism) at the same time? Inwardly, such people treasure the way of nourishing life; outwardly, they exhibit their brilliance in the world. If they regulate their persons, their persons then will be cultivated for a prolonged time; if they rule the country, the country will achieve the state of great peace.” Cultivating one’s spirit for immortality thereby automatically enables one to rule a country well. Thus, if one becomes a terrestrial immortal, Ge Hong sees no reason why such a person cannot hold office and contribute to the welfare of his generation.

Nevertheless, even though both were important, Daoism was even more so. That is because in the far past the sage kings followed the Dao “the oneness” or “the natural order of things,” as a result, the people’s conduct was flawless and natural processes transpired smoothly without disruption or disaster. Later kings, however, no longer followed the Dao; consequently, natural disasters occurred frequently and people became evil and unruly. It was only at this point that Confucianism was introduced in an attempt to rectify this situation. Thus, Daoism is superior because it kept the world from becoming chaotic; Confucianim, on the other hand, only appeared when the world declined into disorder and its practitioners have often become entangled in the resulting mess. Thus, Daoists, like Confucians, provide the world with moral order, but they do so without becoming soiled in the process. As Ge Hong put it, “In regard to the Daoists, their making consists of excelling in cultivating the self to complete their duties; their repose consists in excelling in doing away with the impurities of people; their governance consists of excelling in cutting off misfortune before it occurs; their giving consists in excelling at saving things, but not considering it virtuous; their activity consist in excelling at using their heart-mind to urge the people [to do good]; their quiescence consists in excelling at being cautious and without rancor. These characteristics are why Daoism is the ruler and leader of the hundred schools of philosophy and why it is the ancestor of [Confucian] righteous and benevolence.” Nevertheless, since only a few people are able to correctly pursue Daoism and present times are disordered, Confucianism is necessary to maintain the social order that is embodied in the family and the state. Very much in a Confucian vein, he evaluates both philosophies through a moral lenses.

One of the ways in which Ge Hong connected Confucianism and Daoism is by stating that one needed to perfect oneself ethically to pursue immortality. In his Inner Chapters, Ge Hong makes it clear that none of the methods for prolonging one’s life will work unless one is morally pure, which can only happen by realizing Confucian virtues. Ge explicitly states that, “those who seek to become immortals must regard loyalty, filiality, peacefulness, obedience, benevolence and trustworthiness as fundamental. If one does not cultivate his or her moral behavior, and merely instead devotes oneself to esoteric methods, he or she will never obtain an extended lifespan.” Since these virtues, particularly that of benevolence (ren), emphasize putting the interests of others before one’s own, they cultivated a sense of selflessness and detachment that Ge viewed as essential for maintaining the mysterious oneness within oneself.

His strong emphasis on morality led him to systematize and quantify earlier ideas about how spirits punished immoral behavior. Ge Hong maintained that for each minor moral transgression one committed, the Director of Fates would subtract three days from his or her lifespan; for each major transgression, three hundred days would be deducted. To guide people’s behavior, he even listed sixty-four possible sins. Very few of these prohibitions are religious in nature – the overwhelming majority concern secular life and many are Confucian inspired. Furthermore, he posited that, to achieve spiritual benefits, one had to continuously accumulate good deeds: 300 were needed to become an earthbound immortal and 1200 to become a celestial immortal. One mishap and the balance would be canceled. Ge Hong even transformed the three corpses, evil entities within the body who endeavor to destroy it to earn their freedom, into ethical agents that try to decimate their host’s health by disclosing his sins to the celestial authorities. This system of measuring good and bad deeds would later giver rise to the Ledgers of Merit and Demerit, popular books that let people keep track of their moral progress by assigning numerical scores to virtuous and immoral behavior.

Ge Hong also attempted to reconcile Daoism with Confucianism by both emphasizing the importance and naturalness of hierarchy and attacking Daoism’s equalitarian tendencies. Although Laozi and Zhuangzi always assumed that kings would exist, their utopian vision of society was a small village society whose inhabitants never leave their hamlets, do not use contrivances, and have few material goods. In order to attack this line of thinking in his Outer Chapters, Ge Hong puts forth the views of a man named Bao Jingyan who extended the Daoist arguments to their logical conclusion. Bao maintains that the simple agrarian utopias in which people lived simply and equally were lost due to the creation of hierarchy, which was based on the strong oppressing the weak and the smart deceiving the foolish. With the lord/subject tie came a host of evils such as weapons, armies, rebellions, greed, thievery, deceit, extravagance, and crime. Thus, Bao advocated the abolishment of rulers as the key to securing peace and happiness. Incidentally, Bao is the earliest known Chinese advocate of anarchy. Ge Hong, on the contrary, thinks that in a state of nature people think only of their own desires, hence they vie with each other like beasts for scarce resources. Hierarchy was thus established to put an end to the strong oppressing the weak. Moreover, hierarchy is natural: as the oneness unfolds into the ten thousand things, it divides itself into high and low; hence, heaven is above and earth is below. Thus, it is only natural that some people are more important than others. This is true to the extent that even immortals are hierarchically organized: freshly minted immortals must occupy the lower rungs of the celestial bureaucracy and serve their superiors, while terrestrial immortals are inferior to their celestial counterparts. Ge also recognized that civilization could not be undone and that hierarchy had brought about material progress, as the following passage indicates: “Now, [would you be at ease] if I made you reside in the cramped quarters of a nest or cave? [Would you be at ease] if upon your death, your body was abandoned in the fields? [Would you be at ease] if upon being impeded by a river, you had to swim to cross it? [Would you be at ease] if upon traveling through the mountains, you had to walk and shoulder luggage? [Would you be at ease] if your cooking implements were cast away and you had to make do with raw and smelly food? [Would you be at ease] if you no longer had stone needles for acupuncture and had to merely rely on nature to [cure] your illness? [Would you be at ease] if nakedness was your only ornament and you had no clothes? [Would you be at ease] if you came across a female and made her your mate without an intermediary? You and I would both likely say, ‘to do these things would be impossible.’ How much less could we do without a lord!” In other words, progress and hierarchy are realities, and beneficial ones at that, which can neither be ignored nor abandoned.

4. Confucianism and Legalism

Since Ge Hong recognized that this world cannot be ignored, he believed that one must find a way to improve it. Given the corruption and chaos that ruled his age, like many of his contemporaries, he looked for answers beyond Confucianism to its arch nemesis, Legalism. His reform program was thus a synthesis of both Confucian and Legalist political ideas. First of all, even though he believed that the ruler, like a good Confucian sovereign, should cultivate his person, lead the people through his own moral example, and take their welfare as his overriding concern, he admitted that this was not sufficient to guide society. To govern well, one had to have clear laws to punish miscreants. He warns us that, “It is not that governing with benevolence is not wonderful, it is just that the black-haired masses can be crafty and deceitful. They hanker after profit and forget righteousness. If one does not order them with one’s authority and correct them with punishments, if one only admires the ways of Fuxi and Shennong (Confucian cultural heroes), then chaos cannot be avoided and the resulting calamities will be numerous. [Yet] to use killing to stop killing, how could anyone find joy in that?” In short, although leading through one’s moral example is preferable, it is not realistic: it is sometimes necessary to use the harsher methods. Since Confucian moral example was not enough, the ruler must turn to the law and mete out punishments. Following Legalist ideas, Ge argues that the laws must be clear, explicit, and fair; i.e., they must be applicable to everyone. Moreover, the punishments for misbehavior must be severe. It is precisely generous rewards and harsh punishments that will keep the strong from oppressing the weak. Regimes are weak because their laws are neither severe nor enforced. In line with this thinking, Ge was in favor of reviving punishments that mutilated the guilty. Convicts suffering from such punishments would be constant reminders to the people of the terrible price to be paid for violating the law. Lest the modern reader judge Ge harshly for supporting such draconian measures, since the death penalty largely replaced the mutilation punishments, Ge thought the latter was more humane, since at least the criminal would escape with his or her life.

Another way in which Ge Hong tried to reconcile Confucianism and Legalism was through the type of training officials should receive. Under the Legalist Qin dynasty (221-207 BCE), which unified China for the first time, officials were largely men who excelled in legal and administrative matters. During the Han dynasty (206 BCE – 220 CE), particularly during its latter half, Confucianism gradually became the dominant ideology, hence the education and knowledge of officials became more centered on the Confucian classics. Men who primarily specialized in legal matters were slighted and only given clerical positions. Due to this situation, Ge complained that officials, no matter what their level, no longer understood the laws, hence they often issued incorrect judgments and were deceived by their more legally savvy underlings. Consequently, he thought that aspirants to officialdom should be tested not only on the Confucian classics, but also on the law.

5. The Importance of Broad Knowledge

That officials should be knowledgeable in both the classics and the law highlights one of his most consistent teachings: a person must be broadly educated and that deep study leads to the mastery of all things. For Ge Hong, through the diligent acquisition of knowledge, anything was possible, whether it be ruling a country or attaining immortality. In this vein, he said, “When one peels away dark clouds, one exposes the sun; as a result, the ten thousand things cannot hide their shapes. By unrolling bamboo and silk (that is, by reading books) and investigating the past and present, heaven and earth thereby hide none of their facts. How much less so gods and demons? And how much less so the affairs of people?” Nevertheless, one could not just specialize in one kind of learning, but had to learn the teachings of many different schools. Likewise, in seeking immortality, one should study many techniques and never merely practice one exclusively. Similarly, in terms of book learning, one should not merely confine oneself to learning the classics because all written works had something of worth. Indeed, he propounded the revolutionary sentiment that the elaborate writings of his day were superior to the simplistic classics. The more widely one read, and the more techniques one acquired, the more one would be likely to excel in both the spiritual and secular worlds. Study was also a means of self-cultivation – through it one could eliminate desires by becoming indifferent to his or her physical circumstances.

According to Ge Hong, one of the primary reasons governance of his time was so inept and ineffective was that officials were not selected on the basis of their intelligence, but only due to their connections, bribery, or their ability to speak eloquently. Ge thought the solution to this problem would be to use examinations to select men on the basis of their knowledge of the classics and administrative matters. The examinations should be held in the palace, supervised by high officials, and their contents should be kept in the utmost secrecy. By this means, there would be little opportunity to pass the examinations through deceit or bribery. Moreover, when the only way to become an official is through examinations, everyone will value study. Although he admitted that passing the examinations would not guarantee that that person would be a good official, he thought that the ability to do so was a fair indicator of talent. In other words, Ge Hong was one of the earliest proponents of selecting officials through a vigorous and fair examination system, one of the hallmarks of Chinese civilization.

6. Conclusion

In sum, Ge Hong was a philosopher who, due to the topsy-turvy world in which he lived, was willing to look for solutions in the wisdom of any philosopher, regardless of his sectarian background. With Ge’s overriding sense of the importance of morality and his overwhelming urge for permanency in the form of immortality, he reconciled Confucian and Daoism by saying that both were trying to improve the condition of mankind and that practicing Confucian virtues was necessary for attaining immortality. Likewise, this search for concrete, no-nonsense answers also led him to reconcile his Confucian leanings with the real politick teachings of Legalism. Thus, although he maintained that the ruler must endeavor to mold his people’s behavior through his own example, generous rewards and severe punishments were even more important in regulating the affairs of the troublesome masses. In order to manifest both these philosophies, Ge advocated that officials be both experts in the classics and legal matters. Thus, Ge helped fashion the values that allowed latter Chinese to unproblematically simultaneously use Daoist, Confucian, and Legalist assumptions in both their public and private lives.

7. References and Further Reading

  • Balazs, Etienne. Chinese Civilization and Bureaucracy. New Haven: Yale University Press, 1964.
    • Besides giving a valuable introduction to the tumultuous intellectual and social milieu in which Ge Hong lived, this work also translates part of the chapter from his Outer Chapters in which Ge Hong critiques the anarchist Bao Jingyan.
  • Campany, Robert Ford. To Live as Long as Heaven and Earth: A Translation and Study of Ge Hong’s Traditions of Divine Transcendents. Berkeley: University of California Press, 2002.
    • This fine translation of Ge Hong’s biographies of immortals has an introduction that insightfully describes his religious ideas.
  • Lai Chi-Tim, “Ko Hung’s Discourse of Hsien-Immortality: A Daoist Configuration of an Alternate Ideal Self-Identity,” Numen 45 (1998): 183-220.
    • Although somewhat turgid, this article successfully delineates the novel aspects of Ge Hong’s views on immortality and situates his religious beliefs within the social and political context in which they were formed.
  • Robinet, Isabelle. Daoism: Growth of a Religion, translated by Phyllis Brooks. Stanford: Stanford University Press, 1997.
    • Robinet devotes an entire chapter of this work to Ge Hong and masterfully contextualizes his thought within the Daoist religious tradition.
  • Sailey, Jay. The Master Who Embraces Simplicity: A Study of the Philosopher Ko Hung, A.D. 283-343. San Francisco: Chinese Materials Center, Inc., 1978.
    • The author translates twenty-one chapters of Ge Hong’s Outer Chapters. He also deftly summarizes Ge Hong’s ideas as seen in this work.
  • Sivin, Nathan. “On the Pao P’u Tzu Nei Pien and the Life of Ko Hong (283-343),” Isis 60 (1976): 388-391.
    • In this short article, Sivin makes some important points about the circulation of his works and the length of his life.
  • Sivin, Nathan. “On the Word ‘Daoist’ as a Source of Perplexity.” History of Religions 17 (1978): 303-330.
    • This intellectually penetrating article challenges the idea that Ge Hong was a Daoist at all, in the sense that he was not at all connected with organized Daoist religion.
  • Ware, James R. Alchemy, Medicine & Religion in the China of A.D. 320: The Nei P’ien of Ko Hung. Rpt; New York: Dover Publications, Inc., 1981.
    • Originally published in 1966, this work is a complete translation of Ge Hong’s The Inner Chapters. The reader must beware, though, since the text is inaccurately translated through Judeo-Christian lenses.
  • Yu, David C. History of Chinese Daoism: Volume 1. Lanham: University Press of America, 2000.
    • This overview of the history of Daoism devotes a lengthy chapter to Ge Hong with extensive quotations to his views on immortality.

Author Information

Keith Knapp
Email: keith.knapp@citadel.edu
The Citadel
U. S. A.

Galen (130—200 C.E.)

galen-200x220

Galen was one of the most prominent ancient physicians as well as a philosopher (though most of his philosophical writings are lost). Nonetheless, his philosophical interests are quite evident in his practice of biological science. Galen made some key anatomical observations (though most of these were on other primates).  However, this inclination toward observation moved his theory into the class of critical empiricism.

Galen was also a well-read scholar who combined extensive erudition with ‘cutting edge’ observational practice to completely change the understanding and teaching of medicine. He frequently integrates his observational practice with the natural philosophy of Plato and Aristotle.  His position as the leading authority in medical theory extended for at least fourteen hundred years.

Galen correctly saw that there is a methodological difference between taking account of the patient in front of you in all of the patient’s particularity and, instead, understanding the patient in front of you as representing an instance of a general rule of biomedical science. The way that Galen sought to insert himself into this debate makes his conclusions relevant to medicine today.

 

Table of Contents

  1. Life
  2. Hellenistic Schools of Medicine
  3. Method
  4. Galen’s Critical Empiricism
  5. References and Further Reading

1. Life

Galen of Pergamum was a physician who was born in Pergamum was a bustling and vibrant city at the time and was particularly famous for its statue of Asclepius, a god of healing. Throughout Galen’s life, he avowed a devotion to Asclepius. The city also had a library that almost rivaled Alexandria’s in its size. Galen’s father, Nicon, was a prosperous architect. This allowed Galen the leisure to get an education and choose a path of life unencumbered by the need to earn money. However, this affluence did not mean that Galen was brought up “soft” (as per Plato’s discussion in the Republic 544b-570e in which he discusses the devolution of political systems due to the decay of personal arête). Galen’s education was broad and directed by his father. Galen studied in mathematics (a particular favorite of his father), grammar, logic, and philosophy–that included inquiry into the four major schools of the time: the Platonists, the Peripatetics, the Stoics, and the Epicureans. This pluralistic sensibility influenced the philosophical/scientific method of Galen. According to pluralism, one should look at all the prevalent theories and then make up one’s own mind choosing either one of the theories or perhaps a new mixture of those presented according to their strengths.

Galen began his study of medicine around the age of sixteen when his father had a dream suggesting this direction. Galen traveled to Smyrna and Corinth to study with both a Rationalist and with an Empiricist. When Galen’s father died, Galen traveled to Egypt (Alexandria) where he lived for perhaps five years (152-157). What Galen might have studied in Alexandria is highly speculative. However, Galen, himself, later declares that students should “look at the human skeleton with your own eyes. This is very easy in Alexandria, so that the physicians of that area instruct their pupils with the aid of autopsy” (Kühn II, 220, translation L. Edelstein). This quotation points to the practice of autopsy (dissection of cadavers) in Alexandria. Whether Galen also studied anatomy this way is unclear. It is clear that Galen (at least) engaged in comparative anatomy by dissecting monkeys.

In 157 Galen returned to his hometown to become a surgeon to the gladiators. When civil unrest broke out in 162, Galen left for Rome. The medical community in Rome was competitive and corrupt. In Rome, Galen’s ambition got the best of him with the result that his high profile created powerful enemies who caused him to depart secretly in 166. After a couple of years in obscurity, Galen was recalled by the Roman Emperors Marcus Aurelius and Lucius Verus to serve the army in their war against the Germans. When the plague hit Rome, Galen was made personal physician to Marcus Aurelius and Aurelius’ son, Commodus. For many years it has been held that Galen remained in Roman society until his death around 199-200 (based upon the Suda Lexicon written around 1000); however, new research by Vivian Nutton has persuasively set the date of Galen’s death much later. Nutton proposes that Galen may have lived into his eighties (possibly as old as 87). The source for this new information comes from Byzantine and Arab scholars from the sixth century onwards. On the basis of this, it seems that Galen died around 216, give or take several years, in the reign of Caracalla.

A great many of Galen’s works have survived. The Kühn edition of Galen (Greek with a Latin translation) runs over 20,000 pages. There are other Galenic works that only exist in Arabic translations. However, many of Galen’s works are lost, e.g., many of his treatises on philosophy (logic, physics, and ethics) perished in a fire that consumed the Temple of Peace in 191.

2. Hellenistic Schools of Medicine

During the end of the fourth century BCE and throughout the third century BCE there were enormous advances in medicine revolving around the principal practitioners: Diocles, Praxagoras, Herophilus, and Erasistratus. During this period the debate about the relative roles of theory and observation were central to these writers (Kühn X, 107). It is, in fact, a perennial question in the philosophy of science. What is at issue is when does one impose a theoretical structure on the world? Part of the answer concerns the origins of the theoretical structure. From whence did it arise? In part, this is a struggle for a logic of induction that might assist the practitioner. Without such a theory of inductive logic, it is unclear whether nature is revealing her nature to the careful observer or whether the observer is imposing his own ideas upon nature. Aristotle discusses some of these issues in Posterior Analytics II.19 and in The Parts of Animals I. However, this is not the end of the question. Some of this tension can be seen in the biomedical writers in the Hippocratic era. However, it is also true that in the construction of scientific theories there must, of necessity, be a tension between those who embrace theoretical structures and those who are skeptical of them. The latter group generally bases their misgivings upon a possible tendency among theorists to create an a priori science. What makes a priori science troublesome is that it breaks contact with the empirical world. It suggests that ratiocination about natural causes is sufficient for the production of scientific theories. For most natural philosophers such a stance is entirely unacceptable. Setting the proper balance between theory and observation was (and continues to me) an important question in the philosophy of science.

One group that added to the debate on the role of observation were the Empiricists. The origins of the Empiricist School might be found in Acron of Akragas, a fifth century BCE follower of Empedocles. This conjecture is based merely upon the testimony of later writers. It could certainly be the case that there was no real medical empiricism, as such, before Serapion, a third century BCE doctor .

Another interesting speculation on the origins of the empiricist physicians comes from Michael Frede. Frede has suggested that from a reference in Plato’s Laws 720a-c; 857c-d that there was a two-tired medical system with physicians for the wealthy (who employed theoretical principles) and physicians for the slaves (who relied merely upon trial-and-error experience). If this speculation is correct, then the burden of proof for the empiricists is to show that the theoretical “book learning” of upper class doctors could be reduced to mere experience. In other words, experience, itself, could generate competence. The result would be an elevation of the second-level physician. If Frede is correct on this, then perhaps social situation is partially responsible for the rise of the medical empiricists.

Sextus Empiricus (circa 160-210) set out a loosely woven doctrine of “consideration” or skepsis. Sextus is a key source of our knowledge of Pyrrhonism and is also said to have been a physician (though his writings on medicine have not survived). It is not clear whether Sextus was an original thinker or merely a reflection of his era. However, at the very least, one can garner background information of what might have influenced the empiricists through the doctrine of skepsis. Under this doctrine the theoretical structures of the philosophers (Dogmatists) would be held in abeyance (neither accepted nor rejected). What would rule the day would be the case before the physician right now. The case and the physician’s experience would dictate the treatment.

Against the Empiricists, on the other hand, were the philosophers (Dogmatists). In one important way the Dogmatists are not a “school” as such. They are often depicted by their detractors, such as the Empiricists, rather than being self-identifying. This may relate to the social class dynamics noted earlier. Thus, one should keep in mind that the group is not so much a school of practitioners but a depiction of a group by objectors to those who profess a foundation in medical theory. Perhaps the best way to characterize the Dogmatists would be on the issue of aetiology. The Empiricists attacked the Dogmatists for asserting that there might be hidden causes of disease, and that these hidden causes might be grasped via ratiocination. This was because (under this characterization) the Dogmatists were advocating reasoning and conjecture over experience. To the Empiricists, this was akin to creating a priori science.

The Dogmatists (even in this quasi-class depiction) were identified with one of the four prominent philosophical schools (Platonists, Aristotelians/Peripatetics, Stoics, and Epicureans). Detractors said that the Dogmatists honored theory over observation and experience. Of course, from the point of view of the philosophical schools, rational theories create a critical structure that aid in the interpretation and explanation of nature. The sense of explanation here harkens back to Aristotle, who distinguished knowing the fact (hoti) and the reasoned fact (dioti, APo II, i). It may not be enough to know that if I (as a physician) do x, then y will result (anecdotal correlation of two events). That sort of hoti (or merely event + consequence unit) is insufficient. The reason for this is that when circumstances alter slightly, how is the practitioner to know whether this alteration is significant unless he also has an appreciation of the mechanism that underlies the process? For example, anecdotal correlation might (in a non-medical modern example) suggest that every time I wash my car, it will rain. My personal experience may be almost perfect, but that does not mean that such a causal connection actually exists. The reluctance to embrace a non-observable causal mechanism leaves this dilemma to those who profess an aversion to theory in favor of experience.

Somewhat in the middle of these two schools were the Methodists. Aside from Soranus there are no surviving texts of the Methodists. Therefore most of what we have comes from the descriptions of Galen and pseudo-Galen on these writers. The following are cited as being Methodists: Thessalos, Themison, Proklos, Reginos, Antipatros, Eudemos, Mnaseas, Philon, Dionysios, Menemachos, Olympikos, Apollonides, Soranus, Julianus (Kühn X, 52-53, XIV, 684). There is some controversy about the characterization and origins of this school but many relate it to Themison of Laodicea a pupil of Asclepiades of Bithynia. However this attribution is disputed by Celsus and Soranus who state that Themison is not the first but merely a representative of Methodism. At any rate, the Methodists paid attention (in contrast to the Dogmatists and Empiricists) to the disease alone as opposed to the situation of the individual patient, that is, his medical history and personal situation. The disease alone dictates treatment (Kühn III, 14-20). Thus, the physician does not have to have anatomical or physiological knowledge of the body. Instead, he observes the body in a holistic manner (koinotetes). The three principle conditions of a body viewed in this way are: (a) the body’s dryness, (b) the body’s fluidity, and (c) the mixture of the two. The “method” to be followed was to follow the phenomena. Underlying this assumption was the notion about the status of pores in the mechanism of the body’s common balance. The body’s pores allowed atoms to enter and exit the body. When the atoms came and went freely health was the result. When there was a disruption, then sickness was the result. When the pores were either too small (constriction) or too large (dilatation) then an imbalance occurred in the normal atomic flow. Atoms are invisible to the naked eye. Pores are visible, but their subtle alterations are often not visibly detectable. Thus, on the face of it, the Methodists seem to be contra-Empiricist. However, the atomist tradition (upon which this theory rests) was taken to be Empiricist. (In principle, one could view an entirely physical event-if it were possible to witness it.) Thus, the Methodists seem to have affinities to both. This is evident in Themison (first century, BCE) and Thessalus (first century, AD). Disease was depicted as a community of constriction or dilatation (or some combination of the two) that, in principle, was observable even though, in practice, it couldn’t be observed except through its effects, namely, the disease. Thus, though the intent of the Methodists was probably to lean toward the Empiricists, the actual practice put them more in-between.

Galen often characterizes himself as an eclectic belonging to no school. It is true that Galen was an innovator in observation, for example he gave the first depiction of the four-chambered human heart. But his epistemology was grounded in his philosophical training. Over and over Galen relies on an over-arching medical theory to drive his aetiology (Kühn X, 123, 159, 246). In this way his practice is closest to Aristotelian critical empiricism that requires careful observation and a comprehensive theory that will make those observations meaningful.

3. Method

Because of Galen’s pluralistic method, it is appropriate that (for the most part) his own method draws upon his predecessors with additions and corrections. For example, Galen employed the four-element theory (earth, air, fire, and water) as well as the theories of the contraries (hot, cold, wet, and dry). Though Aristotle interrelated these two descriptive accounts in his work Generation and Corruption, it is Galen who attempts to create a more gradated form by making quasi-quantitative categories of the contraries to describe the material composition of the mixtures (On Mixtures). From the perspective of modern science, this is an advancement upon Aristotle. This work on mixtures is also used to account for the properties of drugs (On Simples). Drugs were supposed to counteract the disposition of the body. Thus, if a patient were suffering from cold and wet (upper respiratory infection), then the appropriate drug would be one that is hot and dry (such as certain molds and fungi-does this remind you of penicillin?). The use of broad-reaching natural principles enhanced the explanatory power of Galen’s theory of biological science.

Galen speaks at length about the philosophers Plato (from whom he accepts the tri-partite soul) and Aristotle (whose biological works are well known to him). In medicine, he is also greatly influenced by historical figures such as Hippocrates (who he describes as a single individual opposed to our modern understanding of a group of writers-even though Galen was aware of the Hippocratic Question), Herophilus, and especially Erasistratus. In his avowed work on biological theory, On the Natural Faculties, Galen goes to great lengths to refute the principles of Erasistratus and his followers.

Contemporary figures are also discussed such as Aclepiades, and the Methodists Themison and Thessalus. This thorough use of the context of medicine allows Galen to consider, for example, Eristrates’ theory of mechanical digestion via a vacuum principle and to supplant it with his own theory of attraction (holke). Galen’s theory of attraction may have had its roots in the theory of natural place that always lacked a material force to implement it. At any rate, when the mechanisms are inscrutable, it was important for Galen to offer an account that fits into other parts of his theory (such as the mixture of the contraries in the composition of the elements).

One of the most influential aspects of Galenic practice was his implementation of (or invention of-as per Wesley Smith) the Hippocratic theory of the four humours (phlegm, blood, black bile, and yellow bile). These points of focus relate to a theory of health as balance. Each of these four humours is related to the three principal points of the body: head (phlegm), heart (blood), black bile (liver) and yellow bile (the liver’s complement, the gall bladder). The three principal points of the body are also loosely linked to the Platonic tripartite soul: head (sophia, reason), heart (thumos, emotion or spiritedness), liver (epithumos, desire). Thus, the sort of just balance of the soul that Plato argues for in the Republic is also the ground of natural health. When one part of the soul/body is out of balance, then the individual becomes ill. The physician’s job is to assist the patient in maintaining balance. If a person is too full of uncontrollable emotion or spiritedness, for example, then he is suffering from too much blood. The obvious answer is to engage in bloodletting (guaranteed to calm a person down). As in the case of pharmacology and the contraries, the four humours provide a comprehensive account of what it means to obtain and maintain health via the balancing of various primary principles.

4. Galen’s Critical Empiricism

One of the striking features of ancient medicine is the extent that very limited observations had to be interpreted in order to explain natural function. For example, given that blood was considered to be nourishment, trophe, it seemed reasonable (following Aristotle) that the blood would be entirely consumed by the body’s tissue. Thus, the blood would be manufactured in the liver and heart and then would flow to the rest of the body and be consumed. The flow of blood went one-way. However, there was a problem: there were two sorts of blood vessels (veins and arteries). These were structurally distinct. This was known through dissection of primates. Then it is assumed that Nature does nothing in vain (discussed at length in On the Use of the Parts as a key biomedical explanatory principle). This means that the veins and arteries have different functions. But they cannot be too disparate. The answer to this dilemma for Galen is that the arteries carry blood mixed with aer or pneuma that acts as a vital force whereas the venous blood is ordinary-though Galen held (correctly) that the two systems were connected by tiny almost invisible vessels (capillaries).

Thus Galen began with a problem and a number of observations and sought to make sense of the seeming anomalies via his overarching biomedical principles. In this way, Galen was acting according to the mathematical training from his father and a desire to create a unified (quasi-axiomatic) explanatory system. Without observation, this could have led to a priori or “armchair” science. But when combined with careful observation, it leads to critical empiricism.

Another example of this mixture of observation and inference is in the area of conception theory. Galen says in his treatise, On Seed,

These things have been said by me because of some of the philosophers who call themselves Aristotelians and Peripatetics. I, at least, would not address these men so, they being so greatly ignorant of the opinion of Aristotle that they think it is pleasing to him that the sperm of the male being cast into the uterus of the female places the principle of motion in the katamenia (the female seed) and, after this is expelled, the principle of motion in the katamenia and, after it is expelled, does not any part become the corporeal substance of the fetus. They have been deceived by the first book of the Generation of Animals that alone of the five they seem to have read. These things are written there, “As we said, of the generation of the principles we may say that chiefly there are the male principle and the female principle. The male offers the motive principle and the efficient cause of generation while the female offers the material principle” [Galen quoting Aristotle, G.A. 716a 5].

These are not far after the beginning: in still later parts of the tract he writes as well, “But this may be well concluded that the male provides the form and the principle of motion and the female provides the body and the matter just as the example of curding milk. Here the body is the milk and the fig juice contains the principle that makes it curdle” [Galen quoting Aristotle, G.A. 729a 10; Kühn IV, 516-517, my tr.].

The biological accounts of human reproduction in the ancient world offer excellent examples of the interaction between observation and inference. There are a number of issues involved in this issue that pre-dates even the Hippocratic writers. The one that is mentioned here is the issue of whether there is one seed (the male’s only) or two (the male’s and the female’s). In the above example Galen seems to be saying that the first reading of Aristotle in which the male provides the efficient cause and the female provides the material cause, simpliciter, is a misreading of Aristotle. Instead, the event (conception) is depicted as a more involved process in which principles of both parents come into play. These principles revolve around the empirically observable facts that children as often as not resemble the mother as much as the father. The “one seed” theory in which the father’s seed, alone, fashions the child can only account for such an outcome by calling it a sort of mutation (agone, para physin). But regularity counts for something. It is odd when an event that may approach or exceed 50% is called a mutation. This turns the entire idea of mutation (a statistical anomaly) on its head.

Galen approaches the issue with a balanced approach beginning with anatomical observations. Galen did some of the most extensive work in the ancient world on the study of the female anatomy (albeit mostly upon apes, On Anatomical Procedures, I.2). Galen’s observation of a fluid in the horns of the uterus (Kühn IV, 594, 600-601) were the basis of his (mistaken) view that he had discovered female seed. However, in the midst of this mistake he was on the right track in viewing the ovaries as analogous to the male testes.

The point in this second example is that Galen wanted to combine his observations gained in dissections of apes to his pronouncements vis-à-vis the debate concerning “one seed conception” vs. “two seed conception.” This commitment to integrating observation and theory contributed to making Galen a towering figure in medicine and the philosophy of science.

5. Select Bibliography

Primary Texts

  • Galeni Opera Omnia. Basel: Par’Andrea to Kratandro, 1538.Kühn, C.G. Galeni Opera Omnia. Leipzig: C. Cnobloch, 1821-1833, rpt. Hildesheim, 1965.
    • This is still the standard edition though it is very gradually being supplanted by the Corpus Medicorum Graecorum Leipzig, 1914-present.

Key Texts in Translation

  • Abhandlung darüber, dass der vorzügliche Arzt Philosoph sein muss. [Quod optimus medicus sit idem philosophus] translated by Peter Bachmann. Göttingen: Vanderhoeck & Ruprecht, 1996.L’Áme et ses passions: Les passions et les erreurs de l’áme. Translated and notes by Vincent Barras. Paris: Les Belle Lettres, 1995.
  • Galen on Antecedent Causes. Edited and translated with introduction and commentary by R.J. Hankinson. Cambridge: Cambridge University Press, 1998.
  • Galen on Bloodletting. Translated by Peter Brain. Cambridge: Cambridge University Press, 1986.
  • Galen on Food and Diet. Translated and notes by Mark Grant. London: Routledge, 2000.
  • Galen’s Institutio logica. Translated with commentary by John Spangler Kieffer. Baltimore: Johns Hopkins University Press, 1964.
  • Galen on Language and Ambiguity (De captionibus). Translated with commentary by Robert Blair Edlow. Leiden: Brill, 1977.
  • Galen on the Natural Faculties. Translated by Arthur John Brock. London: Heineiman, Ltd., 1952. Loeb series.
  • Galen on the Usefulness of the Parts of the Body {De usu partium). Translated with commentary by Margaret Tallmadge May. Ithaca, NY: Cornell University Press, 1968.
  • Galen, The Therapeutic Method: Books 1 & 2 (De methodo medendi). Edited and translated by R.J. Hankinson. Oxford: Clarendon Press, 1991.

Selected Secondary Sources

  • Barnes, Jonathan. “A Third Sort of Syllogism: Galen and the Logic of Relations” in Modern Thinkers and Ancient Thinkers. R. W. Sharples, ed. Boulder, CO: Westview Press, 1993.Boylan, Michael. “Galen’s Conception Theory” Journal of the History of Biology 19.1 (1986): 44-77.
  • Boudon-Millot, ed, fr. tr. Introduction génerale; sur ses propres livres que l’excellent médecin devienne philosophe. Paris: Les Belles Lettres, 2007.
  • Boudon-Millot, And Alessia Guardasole, and Caroline Magdelaine, eds. La science médicale antique: nouveaux regards: etudes reunites en l’honneur de Jacques Jouanna. Paris: Beauchesne, 2007.
  • Boylan, Michael. “Galen on the Blood, Pulse, and Arteries” Journal of the History of Biology 40.2 (2007): 207-230.
  • Boylan, Michael. “The Hippocratic and Galenic Challenges to Aristotle’s Conception Theory” Journal of the History of Biology 15.1 (1984): 83-112.
  • Connell, Sophia. “Aristotle and Galen on Sex Difference and Reproduction: A New Approach to an Ancient Rivalry.” Studies in History and the Philosophy of Science. 31-a.3(2000):405-427.
  • Cosans, Christopher E. “The Experimental Foundations of Galen’s Teleology” Studies in History and Philosophy of Science. 29A.1 (1998): 63-90.
  • Crombie, A. C. Augustine to Galileo. Vol. 1. London: Heinemann, 1961.
  • DeLacy, Philip. “Galen’s Platonism” American Journal of Philology. 93 (1972): 27-39.
  • Durling, Richard. A Dictionary of Medical Terms. Leiden: Brill, 1993.
  • Edelstein, Ludwig. Ancient Medicine. Baltimore, MD: Johns Hopkins University Press, 1967.
  • Farrington, B. Greek Science: Theophrastus to Galen. Baltimore, MD: Penguin, 1953.
  • Fischer, Klaus-Dietrich ed., Text and Tradition: Studies in Ancient Greek Medicine and its Transmission: Presented to Jutta Kollesch Leiden: Brill, 1998.
  • Frede, Michael. “The Empiricist Attitude toward Reason and Theory” Apeiron. 21 (1988): 79-97.
  • Freudiger, Jurg. “Methodus resolutiva: Antikes und Neuzeitliches in Jacopo Acontios Methodenschrift” Freiburger Zeitschrift für Philosophie und Theologie. 45.3 (1998): 407-446.
  • Gill, Christopher. “Galen vs. Chrysippus on the Tripartite Psyche in ‘Timaeus’ 69-72” in Interpreting the ‘Timaeus-Critias. Tomas Calvo ed. Sankt Augustin: Academia: 1997.
  • Gill, Christopher. “Did Chrysippus Understand Medea?” Phronesis. 28.2 (1983): 136-149.
  • Hankinson, R. J. “Actions and Passions” in Passions and Perceptions. Martha Nussbaum, ed. Cambridge: Cambridge University Press, 1993.
  • Hankinson, R.J. The Cambridge Companion to Galen. Cambridge: Cambridge University Press, 2008.
  • Hankinson, R. J. “Galen’s Anatomy of the Soul” Phronesis 36.3 (1991): 197-233.
  • Hankinson, R. J. “A Purely Verbal Dispute? Galen on Stoic and Academic Epistemology” Revue Internationale de Philosophie. 45.178 (1991): 267-300.
  • Hankinson, R. J. “Evidence, Externality and Antecendence: Inquiries Into Later Greek Causal Concepts.” Phronesis 32.1 (1987): 80-100.
  • Hankinson, R. J. “Causes and Empiricism: A Problem in the Interpretation of Later Greek Medical Method.” Phronesis 32.4 (1987): 329-348.
  • Kagan, Jerome, Nancy Snidman, Doreen Ardus, J. Steven Rezinck. Galen’s Prophecy: Temperament in Human Nature. NY: Basic Books, 1994.
  • Kember, O. “Right and Left in the Sexual Theories of Parmenides” Journal of Hellenic Studies. 91 (1971): 70-79.
  • Kidd, I. G. “Posidonius on Emotions” in Problems in Stoicism. A. A. Long, ed. London: Athlone, 1971.
  • Kollesch, Jutta. Galen über das Riechorgan. Berlin: Akademie-Verlag, 1964.
  • Kollesch, Jutta and Diethard Nickel, eds. Galen und das hellenistische Erbe. Stuttgart: Franz Steiner, 1993.
  • Kudlien, Fridolf and Richard J. Durling. Galen’s Method of Healing. Leiden: E.J. Brill, 1991.
  • Lloyd, G.E.R. Methods and Problems in Greek Science. Cambridge: Cambridge University Press, 1991.
  • Lloyd, G.E.R. Greek Science After Aristotle. New York: Norton, 1973.
  • Lloyd, G.E.R. “Parmenides’ Sexual Theories: A Reply to MER Kember” Journal of Hellenic Studies. 92 (1972): 178-179.
  • Lumpe, Adolf. “Der logische Grundgedanke der vierten Schlussfigur.” Prima Philosophia. 11.4 (1998): 397-404.
  • Lumpe, Adolf. “Zur Anordnung der Pramissen des kategorischen Syllogismus bei Albinos, Galenus und Pseudo-Apuleius” Prima Philosophia 8.2 (1995): 115-124.
  • Mansfield, Jaap. “The Idea of the Will in Chrysippus, Posidonius, and Galen” Proceedings of the Boston Area Colloquium in Ancient Philosophy 7 (1991): 107-145.
  • Manuli, Paola. “Galien et le Stoicisme” Revue de Mataphysique et de Morale 97.3 (1992): 365-375.
  • Mowry, Bryan. “From Galen’s Theory to William Harvey’s Theory: A Case Study in the Rationality of Scientific Theory Change” Studies in History and the Philosophy of Science 16 (1985): 49-82.
  • Nutton, Vivian. Ancient Medicine. London: Routledge, 2004.
  • Nutton, Vivian. “The Chronology of Galen’s Early Career” Classical Quarterly 23 (1973): 158-171.
  • Nutton, Vivian. (ed.) Galen: Problems and Prospects. London: Wellcome Institute, 1981.
  • Nutton, Vivian. “Galen ad multos annos” Dynamis 15 (1995): 25-39.
  • Rescher, Nicholas. Galen and the Syllogism: An Examination of the Thesis that Galen Originated the Fourth Figure of the Syllogism in Light of New Data from the Arabic. Pittsburgh, PA: University of Pittsburgh Press, 1996.
  • Sarton, George. Galen of Pergamon. Lawrence, KS: University of Kansas Press, 1954.
  • Siegel, Rudolph. Galen’s System of Physiology and Medicine. Basel: Karger, 1968.
  • Smith, Wesley. The Hippocratic Tradition. Ithaca, NY: Cornell University Press, 1979.
  • Temkin, Owsei. Galenism: The Rise and Decline of a Medical Philosophy. Ithaca, NY: Cornell University Press, 1973.
  • Tieleman, Teun. “Plotinus on the Seat of the Soul: Reverberations of Galen and Alexander in Enn. IV, 3 27ESS, 23.” Phronesis. 43.4 (1998): 306-325.

Select Proceedings of Conferences on Galen

1981 English

  • Nutton, Vivian, Galen : Problems and Prospects. London: Wellcome Institute for the History of Medicine, 1981.

1982 English

  • Kudlien, F., & Durling, R. J. Galen’s method of healing : Proceedings of the 1982 Galen Symposium. Paper presented at the Galen Symposium (1982 : Christian-Albrechts Universität); Studies in Ancient Medicine,; v. 1, 205. Leiden: Brill, 1991.

1986 3rd Italian

  • Manuli, P., & Vegretti, M. (1988). Le Opere Psicologiche di Galeno : Atti del terzo Colloquio Galenico Internazionale, Pavia, 10-12 Settembre 1986. Paper presented at the Colloqio Galenico Internazionale (3d : 1986 : Pavia, Italy); Elenchos (Bibliopolis (Firm)) 13,

1989 4th German

  • Kollesch, J., Nickel, D., Humboldt-Universität zu Berlin, & Institut für Geschichte der Medizin. (1993). Galen und das Hellenistische Erbe : Verhandlungen des IV. Internationalen Galen-Symposiums veranstaltet vom Institut für Geschichte der Medizin am Bereich Medizin (charité) der Humboldt-Universität zu Berlin 18.-20. September 1989. Paper presented at the Galen Symposium (4th : 1989 : Humboldt-Universität Zu Berlin); Sudhoffs Archiv.; Beihefte,; Heft 32,

1995 5th English

  • Debru, A. (1997). Galen on Pharmacology : Philosophy, history, and medicine : Proceedings of the Vth International Galen Colloquium, Lille, 16-18 March 1995. Paper presented at the International Galen Colloquium (5th : 1995 : Lille, France); Studies in Ancient Medicine,; v. 16, 336. Leiden: Brill, 2007.

1988 Spanish

  • López Férez, J. A. (1991). Galeno, obra, pensamiento e influencia : Coloquio internacional celebrado en Madrid, 22-25 de marzo de 1988. Madrid : Universidad Nacional de Educación a Distancia, 1991.

2002 Italian

  • Garofalo, I., Roselli, A., Fischer, K., Galen, On the therapeutic method, & Book III. (2003). Galenismo e Medicina Tardoantica : Fonti greche, latine e arabe : Atti del Seminario Internazionale di Siena, Certosa di Pontignano, 9 e 10 Settembre 2002. Paper presented at the Annali Dell’Istituto Universitario Orientale Di Napoli.; Sezione Filologico-Letteraria.; Quaderni,; 7,

2002 English

  • Nutton, Vivian. The Unknown Galen. London : Institute of Classical Studies, School of Advanced Study, University of London, 2002.

Author Information

Michael Boylan
Email: michael.boylan@marymount.edu
Marymount University
U. S. A.

Gottlob Frege (1848—1925)

FregeGottlob Frege was a German logician, mathematician and philosopher who played a crucial role in the emergence of modern logic and analytic philosophy. Frege’s logical works were revolutionary, and are often taken to represent the fundamental break between contemporary approaches and the older, Aristotelian tradition. He invented modern quantificational logic, and created the first fully axiomatic system for logic, which was complete in its treatment of propositional and first-order logic, and also represented the first treatment of higher-order logic. In the philosophy of mathematics, he was one of the most ardent proponents of logicism, the thesis that mathematical truths are logical truths, and presented influential criticisms of rival views such as psychologism and formalism. His theory of meaning, especially his distinction between the sense and reference of linguistic expressions, was groundbreaking in semantics and the philosophy of language. He had a profound and direct influence on such thinkers as Russell, Carnap and Wittgenstein. Frege is often called the founder of modern logic, and he is sometimes even heralded as the founder of analytic philosophy.

Table of Contents

  1. Life and Works
  2. Contributions to Logic
  3. Contributions to the Philosophy of Mathematics
  4. The Theory of Sense and Reference
  5. References and Further Reading
    1. Frege’s Own Works
    2. Important Secondary Works

1. Life and Works

Frege was born on November 8, 1848 in the coastal city of Wismar in Northern Germany. His full christened name was Friedrich Ludwig Gottlob Frege. Little is known about his youth. His father, Karl Alexander Frege, and his mother, Auguste (Bialloblotzsky) Frege, both worked at a girl’s private school founded in part by Karl. Both were also principals of the school at various points: Karl held the position until his death 1866, when Auguste took over until her death in 1878. The German writer Arnold Frege, born in Wismar in 1852, may have been Frege’s younger brother, but this has not been confirmed. Frege probably lived in Wismar until 1869; in the years from 1864-1869 he is known to have studied at the Gymnasium in Wismar.

In Spring 1869, Frege began studies at the University of Jena. There, he studied chemistry, philosophy and mathematics, and must have solidly impressed Ernst Abbe in mathematics, who later become of Frege’s benefactors. After four semesters, Frege transferred to the University of Göttingen, where he studied mathematics and physics, as well as philosophy of religion under Hermann Lotze. (Lotze is sometimes thought to have had a profound impact on Frege’s philosophical views.) In late 1873, Frege finished his doctoral dissertation, under the guidance of Ernst Schering, entitled Über eine geometrische Darstellung der imaginären Gebilde in der Ebene (“On a Geometrical Representation of Imaginary Figures in a Plane”), and received his Ph.D.

In 1874, with the recommendation of Ernst Abbe, Frege received a lectureship at the University of Jena, where he stayed the rest of his intellectual life. His position was unsalaried during his first five years, and he was supported by his mother. Frege’s Habilitationsschrift, entitled Rechnungsmethoden, die auf eine Erweiterung des Grössenbegriffes gründen (“Methods of Calculation Based upon An Amplification of the Concept of Magnitude,”), was included with the material submitted to obtain the position. It involves the theory of complex mathematical functions, and contains seeds of Frege’s advances in logic and the philosophy of mathematics.

Frege had a heavy teaching load during his first few years at Jena. However, he still had time to work on his first major work in logic, which was published in 1879 under the title Begriffsschrift, eine der arithmetischen nachgebildete Formelsprache des reinen Denkens (“Concept-Script: A Formula Language for Pure Thought Modeled on That of Arithmetic”). Therein, Frege presented for the first time his invention of a new method for the construction of a logical language. Upon the publication of the Begriffsschrift, he was promoted to ausserordentlicher Professor, his first salaried position. However, the book was not well-reviewed by Frege’s contemporaries, who apparently found its two-dimensional logical notation difficult to comprehend, and failed to see its advantages over previous approaches, such as that of Boole.

Sometime after the publication of the Begriffsschrift, Frege was married to Margaret Lieseburg (1856-1905). They had at least two children, who unfortunately died young. Years later they adopted a son, Alfred. However, little else is known about Frege’s family life.

Frege had aimed to use the logical language of the Begriffsschrift to carry out his logicist program of attempting to show that all of the basic truths of arithmetic could be derived from purely logical axioms. However, on the advice of Carl Stumpf, and given the poor reception of the Begriffsschrift, Frege decided to write a work in which he would describe his logicist views informally in ordinary language, and argue against rival views. The result was his Die Grundlagen der Arithmetik (“The Foundations of Arithmetic”), published in 1884. However, this work seems to have been virtually ignored by most of Frege’s contemporaries.

Soon thereafter, Frege began working on his attempt to derive the basic laws of arithmetic within his logical language. However, his work was interrupted by changes to his views. In the late 1880s and early 1890s Frege developed new and interesting theories regarding the nature of language, functions and concepts, and philosophical logic, including a novel theory of meaning based on the distinction between sense and reference. These views were published in influential articles such as “Funktion und Begriff” (“Function and Concept”, 1891), “Über Sinn und Bedeutung” (“On Sense and Reference”, 1892) and “Über Begriff und Gegenstand” (“On Concept and Object”, 1892). This maturation of Frege’s semantic and philosophical views led to changes in his logical language, forcing him to abandon an almost completed draft of his work in logic and the foundations of mathematics. However, in 1893, Frege finally finished a revised volume, employing a slightly revised logical system. This was his magnum opus, Grundgesetze der Arithmetik (“Basic Laws of Arithmetic”), volume I. In the first volume, Frege presented his new logical language, and proceeded to use it to define the natural numbers and their properties. His aim was to make this the first of a three volume work; in the second and third, he would move on to the definition of real numbers, and the demonstration of their properties.

Again, however, Frege’s work was unfavorably reviewed by his contemporaries. Nevertheless, he was promoted once again in 1894, now to the position of Honorary Ordinary Professor. It is likely that Frege was offered a position as full Professor, but turned it down to avoid taking on additional administrative duties. His new position was unsalaried, but he was able to support himself and his family with a stipend from the Carl Zeiss Stiftung, a foundation that gave money to the University of Jena, and with which Ernst Abbe was intimately involved.

Because of the unfavorable reception of his earlier works, Frege was forced to arrange to have volume II of the Grundgesetze published at his own expense. It was not until 1902 that Frege was able to make such arrangements. However, while the volume was already in the publication process, Frege received a letter from Bertrand Russell, informing him that it was possible to prove a contradiction in the logical system of the first volume of the Grundgesetze, which included a naive calculus for classes. For more information, see the article on “Russell’s Paradox“. Frege was, in his own words, “thunderstruck”. He was forced to quickly prepare an appendix in response. For the next couple years, he continued to do important work. A series of articles entitled “Über die Grundlagen der Geometrie,” (“On the Foundations of Geometry”) was published between 1903 and 1906, representing Frege’s side of a debate with David Hilbert over the nature of geometry and the proper construction and understanding of axiomatic systems within mathematics.

However, around 1906, probably due to some combination of poor health, the early loss of his wife in 1905, frustration with his failure to find an adequate solution to Russell’s paradox, and disappointment over the continued poor reception of his work, Frege seems to have lost his intellectual steam. He produced very little work between 1906 and his retirement in 1918. However, he continued to influence others during this period. Russell had included an appendix on Frege in his 1903 Principles of Mathematics. It is from this that Frege came be to be a bit wider known, including to an Austrian student studying engineering in Manchester, England, named Ludwig Wittgenstein. Wittgenstein studied the work of Frege and Russell closely, and in 1911, he wrote to both of them concerning his own solution to Russell’s paradox. Frege invited him to Jena to discuss his views. Wittgenstein did so in late 1911. The two engaged in a philosophical debate, and while Wittgenstein reported that Frege “wiped the floor” with him, Frege was sufficiently impressed with Wittgenstein that he suggested that he go to Cambridge to study with Russell–a suggestion that had profound importance for the history of philosophy. Moreover, Rudolf Carnap was one of Frege’s students from 1910 to 1913, and doubtlessly Frege had significant influence on Carnap’s interest in logic and semantics and his subsequent intellectual development and successes.

After his retirement in 1918, Frege moved to Bad Kleinen, near Wismar, and managed to publish a number of important articles, “Der Gedanke” (“The Thought”, 1918), “Der Verneinung” (“Negation”, 1918), and “Gedankengefüge” (“Compound Thoughts”, 1923). However, these were not wholly new works, but later drafts of works he had initiated in the 1890s. In 1924, a year before his death, Frege finally returned to the attempt to understand the foundations of arithmetic. However, by this time, he had completely given up on his logicism, concluding that the paradoxes of class or set theory made it impossible. He instead attempted to develop a new theory of the nature of arithmetic based on Kantian pure intuitions of space. However, he was not able to write much or publish anything about his new theory. Frege died on July 26, 1925 at the age of 76.

At the time of his death, Frege’s own works were still not very widely known. He did not live to see the profound impact he would have on the emergence of analytic philosophy, nor to see his brand of logic–due to the championship of Russell–virtually wholly supersede earlier forms of logic. However, in bequeathing his unpublished work to his adopted son, Alfred, he wrote prophetically, “I believe there are things here which will one day be prized much more highly than they are now. Take care that nothing gets lost.” Alfred later gave Frege’s papers to Heinrich Scholz of the University of Münster for safekeeping. Unfortunately, however, they were destroyed in an Allied bombing raid on March 25, 1945. Although Scholz had made copies of some of the more important pieces, a good portion of Frege’s unpublished works were lost.

Although he was a fierce, sometimes even satirical, polemicist, Frege himself was a quiet, reserved man. He was right-wing in his political views, and like many conservatives of his generation in Germany, he is known to have been distrustful of foreigners and rather anti-semitic. Himself Lutheran, Frege seems to have wanted to see all Jews expelled from Germany, or at least deprived of certain political rights. This distasteful feature of Frege’s personality has gravely disappointed some of Frege’s intellectual progeny.

2. Contributions to Logic

Trained as a mathematician, Frege’s interests in logic grew out of his interests in the foundations of arithmetic. Early in his career, Frege became convinced that the truths of arithmetic are logical, analytic truths, agreeing with Leibniz, and disagreeing with Kant, who thought that arithmetical knowledge was grounded in “pure intuition”, as well as more empiricist thinkers such as J. S. Mill, who thought that arithmetic was grounded in observation. In other words, Frege subscribed to logicism. His logicism was modest in one sense, but very ambitious in others. Frege’s logicism was limited to arithmetic; unlike other important historical logicists, such as Russell, Frege did not think that geometry was a branch of logic. However, Frege’s logicism was very ambitious in another regard, as he believed that one could prove all of the truths of arithmetic deductively from a limited number of logical axioms. Indeed, Frege himself set out to demonstrate all of the basic laws of arithmetic within his own system of logic.

Frege concurred with Leibniz that natural language was unsuited to such a task. Thus, Frege sought to create a language that would combine the tasks of what Leibniz called a “calculus ratiocinator” and “lingua characterica“, that is, a logically perspicuous language in which logical relations and possible inferences would be clear and unambiguous. Frege’s own term for such a language, “Begriffsschrift” was likely borrowed from a paper on Leibniz’s ideas written by Adolf Trendelenburg. Although there had been attempts to fashion at least the core of such a language made by Boole and others working in the Leibnizian tradition, Frege found their work unsuitable for a number of reasons. Boole’s logic used some of the same signs used in mathematics, except with different logical meanings. Frege found this unacceptable for a language which was to be used to demonstrate mathematical truths, because the signs would be ambiguous. Boole’s logic, though innovative in some respects, was weak in others. It was divided into a “primary logic” and “secondary logic”, bifurcating its propositional and categorical elements, and could not deal adequately with multiple generalities. It analyzed propositions in terms of subject and predicate concepts, which Frege found to be imprecise and antiquated.

Frege saw the formulae of mathematics as the paradigm of clear, unambiguous writing. Frege’s brand of logical language was modeled upon the international language of arithmetic, and it replaced the subject/predicate style of logical analysis with the notions of function and argument. In mathematics, an equation such as “f(x) = x2 + 1″ states that f is a function that takes x as argument and yields as value the result of multiplying x by itself and adding one. In order to make his logical language suitable for purposes other than arithmetic, Frege expanded the notion of function to allow arguments and values other than numbers. He defined a concept (Begriff) as a function that has a truth-value, either of the abstract objects the True or the False, as its value for any object as argument. See below for more on Frege’s understanding of concepts, functions and objects. The concept being human is understood as a function that has the True as value for any argument that is human, and the False as value for anything else. Suppose that “H( )” stands for this concept, and “a” is a constant for Aristotle, and “b” is a constant for the city of Boston. Then “H(a)” stands for the True, while “H(b)” stands for the False. In Frege’s terminology, an object for which a concept has the True as value is said to “fall under” the concept.

The values of such concepts could then be used as arguments to other functions. In his own logical systems, Frege introduced signs standing for the negation and conditional functions. His own logical notation was two-dimensional. However, let us instead replace Frege’s own notation with more contemporary notation. For Frege, the conditional function, “→” is understood as a function the value of which is the False if its first argument is the True and the second argument is anything other than the True, and is the True otherwise. Therefore, “H(b) → H(a)” stands for the True, while “H(a) → H(b)” stands for the False. The negation sign “~” stands for a function whose value is the True for every argument except the True, for which its value is the False. Conjunction and disjunction signs could then be defined from the negation and conditional signs. Frege also introduced an identity sign, standing for a function whose value is the True if the two arguments are the same object, and the False otherwise, and a sign, which he called “the horizontal,” namely “—”, that stands for a function that has the True as value for the True as argument, and has the False as value for any other argument.

Variables and quantifiers are used to express generalities. Frege understands quantifiers as “second-level concepts”. The distinction between levels of functions involves what kind of arguments the functions take. In Frege’s view, unlike objects, all functions are “unsaturated” insofar as they require arguments to yield values. But different sorts of functions require different sorts of arguments. Functions that take objects as argument, such as those referred to by “( ) + ( )” or “H( )”, are called first-level functions. Functions that take first-level functions as argument are called second-level functions. The quantifier, “∀x(…x…)”, is understood as standing for a function that takes a first-level function as argument, and yields the True as value if the argument-function has the True as value for all values of x, and has the False as value otherwise. Thus, “∀xH(x)” stands for the False, since the concept H( ) does not have the True as value for all arguments. However, “∀x[H(x) → H(x)]” stands for True, since the complex concept H( ) → H( ) does have the True as value for all arguments. The existential quantifier, now written “∃x(…x…)” is defined as “~∀x~(…x…)”.

Those familiar with modern predicate logic will recognize the parallels between it and Frege’s logic. Frege is often credited with having founded predicate logic. However, Frege’s logic is in some ways different from modern predicate logic. As we have seen, a sign such as “H( )” is a sign for a function in the strictest sense, as are the conditional and negation connectives. Frege’s conditional is not, like the modern connective, something that flanks statements to form a statement. Rather, it flanks terms for truth-values to form a term for a truth-value. Frege’s “H(b) → H(a)” is simply a name for the True, by itself it does not assert anything. Therefore, Frege introduces a sign he called the “judgment stroke”, ⊢, used to assert that what follows it stands for the True. Thus, while “H(b) → H(a)” is simply a term for a truth-value, “⊢ H(b) → H(a)” asserts that this truth-value is the True, or in this case, that if Boston is human, then Aristotle is human. Moreover, Frege’s logical system was second-order. In addition to quantifiers ranging over objects, it also contained quantifiers ranging over first-level functions. Thus, “⊢∀xF[F(x)]” asserts that every object falls under at least one concept.

Frege’s logic took the form of an axiomatic system. In fact, Frege was the first to take a fully axiomatic approach to logic, and the first even to suggest that inference rules ought to be explicitly formulated and distinguished from axioms. He began with a limited number of fixed axioms, introduced explicit inference rules, and aimed to derive all other logical truths (including, for him, the truths of arithmetic) from them. Frege’s first logical system, that of the 1879 Begriffsschrift, had nine axioms (one of which was not independent), one explicit inference rule, and also employed a second and third inference rule implicitly. It represented the first axiomatization of logic, and was complete in its treatment of both propositional logic and first-order quantified logic. Unlike Frege’s later system, the system of the Begriffsschrift was fully consistent. (It has since been proven impossible to devise a system for higher-order logic with a finite number of axioms that is both complete and consistent.)

In order to make deduction easier, in the 1893 logical system of the Grundgesetze, Frege used fewer axioms and more inference rules: seven and twelve, respectively, this time leaving nothing implicit. The Grundgesetze also expanded upon the system of the Begriffsschrift by adding axioms governing what Frege called the “value-ranges” (Werthverlaüfe) of functions, understood as objects corresponding to the complete argument-value mappings generated by functions. In the case of concepts, their value-ranges were identified with their extensions. While Frege did sometimes also refer to the extensions of concepts as “classes“, he did not conceive of such classes as aggregates or collections. They were simply understood as objects corresponding to the complete argument-value mappings generated by concepts considered as functions. Frege then introduced two axioms dealing with these value-ranges. Most infamous was his Basic Law V, which asserts that the truth-value of the value-range of function F being identical to the value-range of function G is the same as the truth-value of F and G having the same value for every argument. If one conceives of value-ranges as argument-value mappings, then this certainly seems to be a plausible hypothesis. However, from it, it is possible to prove a strong theorem of class membership: that for any object x, that object is in the extension of concept F if and only if the value of F for x as argument is the True. Given that value-ranges themselves are taken to be objects, if the concept in question is that of being a extension of a concept not included in itself, one can conclude that the extension of this concept is in itself just in case it is not. Therefore, the logical system of the Grundgesetze was inconsistent due to Russell’s Paradox. See the entry on Russell’s Paradox for more details. However, the core of the system of the Grundgesetze, that is, the system minus the axioms governing value-ranges, is consistent and, like the system of the Begriffsschrift, is complete in its treatment of propositional logic and first-order predicate logic.

Given the extent to which it is taken granted today, it can be difficult to fully appreciate the truly innovative and radical approach Frege took to logic. Frege was the first to attempt to transcribe the old statements of categorical logic in a language employing variables, quantifiers and truth-functions. Frege was the first to understand a statement such as “all students are hardworking” as saying roughly the same as, “for all values of x, if x is a student, then x is hardworking”. This made it possible to capture the logical connection between statements such as “either all students are hardworking or all students are intelligent” and “all students are either hardworking or intelligent” (for example, that the first implies the second). In earlier logical systems such as that of Boole, in which the propositional and quantificational elements were bifurcated, the connection was wholly lost. Moreover, Frege’s logical system was the first to be able to capture statements of multiple generality, such as “every person loves some city” by using multiple quantifiers in the same logical formula. This too was impossible in all earlier logical systems. Indeed, Frege’s “firsts” in logic are almost too numerous to list. We have seen here that he invented modern quantification theory, presented the first complete axiomatization of propositional and first-order “predicate” logic (the latter of which he invented outright), attempted the first formulation of higher-order logic, presented the first coherent and full analysis of variables and functions, first showed it possible to reduce all truth-functions to negation and the conditional, and made the first clear distinction between axioms and inference rules in a formal system. As we shall see, he also made advances in the logic of mathematics. It is small wonder that he is often heralded as the founder of modern logic.

On Frege’s “philosophy of logic”, logic is made true by a realm of logical entities. Logical functions, value-ranges, and the truth-values the True and the False, are thought to be objectively real entities, existing apart from the material and mental worlds. (As we shall see below, Frege was also committed to other logical entities such as senses and thoughts.) Logical axioms are true because they express true thoughts about these entities. Thus, Frege denied the popular view that logic is without content and without metaphysical commitment. Frege was also a harsh critic of psychologism in logic: the view that logical truths are truths about psychology. While Frege believed that logic might prescribe laws about how people should think, logic is not the science of how people do think. Logical truths would remain true even if no one believed them nor used them in their reasoning. If humans were genetically designed to use regularly the so-called “inference rule” of affirming the consequent, etc., this would not make it logically valid. What is true or false, valid of invalid, does not depend on anyone’s psychology or anyone’s beliefs. To think otherwise is to confuse something’s being true with something’s being-taken-to-be-true.

3. Contributions to the Philosophy of Mathematics

Frege was an ardent proponent of logicism, the view that the truths of arithmetic are logical truths. Perhaps his most important contributions to the philosophy of mathematics were his arguments for this view. He also presented significant criticisms against rival views. We have seen that Frege was a harsh critic of psychologism in logic. He thought similarly about psychologism in mathematics. Numbers cannot be equated with anyone’s mental images, nor truths of mathematics with psychological truths. Mathematical truths are objective, not subjective. Frege was also a critic of Mill’s view that arithmetical truths are empirical truths, based on observation. Frege pointed out that it is not just observable things that can be counted, and that mathematical truths seem to apply also to these things. On Mill’s view, numbers must be taken to be conglomerations of objects. Frege rejects this view for a number of reasons. Firstly, is one conglomeration of two things the same as a different conglomeration of two things, and if not, in what sense are they equal? Secondly, a conglomeration can be seen as made up of a different number of things, depending on how the parts are counted. One deck of cards contains fifty two cards, but each card consists of a multitude of atoms. There is no one uniquely determined “number” of the whole conglomeration. He also reiterated the arguments of others: that mathematical truths seem apodictic and knowable a priori. He also argued against the Kantian view that arithmetic truths are based on the pure intuition of the succession of time. His main argument against this view, however, was simply his own work in which he showed that truths about the nature of succession and sequence can be proven purely from the axioms of logic.

Frege was also an opponent of formalism, the view that arithmetic can be understood as the study of uninterpreted formal systems. While Frege’s logical language represented a kind of formal system, he insisted that his formal system was important only because of what its signs represent and its propositions mean. The signs themselves, independently of what they mean, are unimportant. To suggest that mathematics is the study simply of the formal system, is, in Frege’s eyes, to confuse the sign and thing signified. To suggest that arithmetic is the study of formal systems also suggests, absurdly, that the formula “5 + 7 = 12”, written in Arabic numerals, is not the same truth as the formula, “V + VII = XII”, written in Roman numerals. Frege suggests also that this confusion would have the absurd result that numbers simply are the numerals, the signs on the page, and that we should be able to study their properties with a microscope.

Frege suggests that rival views are often the result of attempting to understand the meaning of number terms in the wrong way, for example, in attempting to understand their meaning independently of the contexts in which they appear in sentences. If we are simply asked to consider what “two” means independently of the context of a sentence, we are likely to simply imagine the numeral “2”, or perhaps some conglomeration of two things. Thus, in the Grundlagen, Frege espouses his famous context principle, to “never ask for the meaning of a word in isolation, but only in the context of a proposition.” The Grundlagen is an earlier work, written before Frege had made the distinction between sense and reference (see below). It is an active matter of debate and discussion to what extent and how this principle coheres with Frege’s later theory of meaning, but what is clear is that it plays an important role in his own philosophy of mathematics as described in the Grundlagen.

According to Frege, if we look at the contexts in which number words usually occur in a proposition, they appear as part of a sentence about a concept, specifically, as part of an expression that tells us how many times a certain concept is instantiated. Consider, for example, “I have six cards in my hand” or “There are 11 members of congress from Wisconsin.” These propositions seem to tell us how many times the concepts of being a card in my hand and being a member of congress from Wisconsin are instantiated. Thus, Frege concludes that statements about numbers are statements about concepts. This insight was very important for Frege’s case for logicism, as Frege was able to show that it is possible to define what it means for a concept to be instantiated a certain number of times purely logically by making use of quantifiers and identity. To say that the concept F is instantiated zero times is to say that there are no objects that instantiate F, or, equivalently, that everything does not instantiate F. To say that F is instantiated one time is to say there is an object x that instantiates F, and that for all objects y, either y does not instantiate F or y is x. To say that F is instantiated twice is to say that there are two objects, x and y, each of which instantiates F, but which are not the same as each other, and for all z, either z does not instantiate F, or z is x or z is y. One could then consider numbers as “second-level concepts”, or concepts of concepts, which can be defined in purely logical terms. (For more on the distinction of levels of concepts, see above.)

Frege, however, does not leave his analysis of numbers there. Understanding number-claims as involving second-level concepts does give us some insight into the nature of numbers, but it cannot be left at this. Mathematics requires that numbers be treated as objects, and that we be able to provide a definition of the number “two” simpliciter, without having to speak of two Fs. For this purpose, Frege appeals to his theory of the value-ranges of concepts. On the notion of a value-range, see above. We saw above that we can gain some understanding of number claims as involving second-level concepts, or concepts of concepts. In order to find a definition of numbers as objects, Frege treats them instead as value-ranges of value-ranges. Exactly, however, are they to be understood?

Frege notes that we have an understanding of what it means to say that there are the same number of Fs as there are Gs. It is to say that there is a one-one mapping between the objects that instantiate F and the objects instantiating G, i.e. that there is some function f from entities that instantiate F onto entities that instantiate G such that there is a different F for every G, and a different G for every F, with none left over. (In this, Frege’s views on the nature of cardinality were in part anticipated by Georg Cantor.) However, we must bear in mind that the propositions:

(1) There are equally many Fs as there are Gs.
(2) The number of Fs = the number of Gs

must obviously have the same truth-value, as they seem to express the same fact. We must, therefore, look for a way of understanding the phrase “the number of Fs” that occurs in (2) that makes clear how and why the whole proposition will be true or false for the same reason as (1) is true or false. Frege’s suggestion is that “the number of Fs” means the same as “the value-range of the concept being a value-range of a concept instantiated equally many times as F.” This means that the number of Fs is a certain value-range, containing value-ranges, and in particular, all those value-ranges that have as many members as there are Fs. Then (2) is understood as saying the same as “the value-range of the concept being a value-range of a concept instantiated equally many times as F = the value-range of the concept being a value-range of a concept instantiated equally many times as G“, which will be true if and only if there are equally many Fs as Gs, i.e. if every value-range of a concept instantiated equally many times as F is also a value-range of a concept instantiated equally many times as G.

To give some examples, if there are zero Fs, then the number of Fs, i.e. zero, is the value-range consisting of all value-ranges with no members. Recall that for Frege, classes are identified with value-ranges of concepts. (See above.) To rephrase the same point in terms of classes, zero is the class of all classes with no members. Since there is only one such class, zero is the class containing only the empty class. If there is one F, then the number of Fs, i.e. one, is the class consisting of all classes with one member (the extensions of concepts instantiated once). Here we can see the connection with the understanding of number expressions as being statements about concepts. Rather than understanding zero as the concept a concept has just in case it is not instantiated, zero is understood as the value-range consisting of value-ranges of concepts that are not instantiated. Rather than understanding one as the concept a concept has just in case it is instantiated by a unique object, it is understood as the value-range consisting of value-ranges of concepts instantiated by unique objects. This allows us to understand numbers as abstract objects, and provide a clear definition of the meaning of number signs in arithmetic such as “1”, “2”, “3”, etc.

Some of Frege’s most brilliant work came in providing definitions of the natural numbers in his logical language, and in proving some of their properties therein. After laying out the basic laws of logic, and defining axioms governing the truth-functions and value-ranges, etc., Frege begins by defining a relation that holds between two value-ranges just in case they are the value-ranges of concepts instantiated equally many times. This relation holds between value-ranges just in case they are the same size, i.e. just in case there is one-one correspondence between the entities that fall under their concepts. Using this, he then defines a function that takes a value-range as argument and yields as value the value-range consisting of all value-ranges the same size as it. The number zero is then defined as the value-range consisting of all value-ranges the same size as the value-range of the concept being non-self-identical. Since this concept is not instantiated, zero is defined as the value-range of all value-ranges with no members, as described above. There is only one such number zero. Since this is true, then the concept of being identical to zero is instantiated once. Frege then uses this to define one. One is defined as the value-range of all value-ranges equal in size to the value-range of the concept being identical to zero. Having defined one is this way, Frege is able to define two. He has already defined one and zero; they are each unique, but different from each other. Therefore, two can be defined as the value-range of all value-ranges equal in size to the value-range of the concept being identical to zero or identical to one. Frege is able to define all natural numbers in this way, and indeed, prove that there are infinitely many of them. Each natural number can be defined in terms of the previous one: for each natural number n, its successor (n + 1) can be defined as the value-range of all value-ranges equal in size to the value-range of the concept of being identical to one of the numbers between zero and n.

In the Begriffsschrift, Frege had already been able to prove certain results regarding series and sequences, and was able to define the ancestral of a relation. To understand the ancestral of a relation, consider the example of the relation of being the child of. A person x bears this relation to y just in case x is y‘s child. However, x falls in the ancestral of this relation with respect to y just in case x is the child of y, or is the child of y‘s child, or is the child of y‘s child’s child, etc. Frege was able to define the ancestral of relations logically even in his early work. He put this to use in the Grundgesetze to define the natural numbers. We have seen how the notion of successorship can be defined for Frege, i.e. the relation n + 1 bears to n. The natural numbers can be defined as the value-range of all value-ranges that fall under the ancestral of the successor relation with respect to zero. The natural numbers then consist of zero, the successor of zero (one), the successor of the successor of zero (two), and so on ad infinitum. Frege was then able to use this definition of the natural numbers to provide a logical analysis of mathematical induction, and prove that mathematical induction can be used validly to demonstrate the properties of the natural numbers, an extremely important result for making good on his logicist ambitions. Frege could then use mathematical induction to prove some of the basic laws of the natural numbers. Frege next turned his logicist method to an analysis of integers (including negative numbers) and then to the real numbers, defining them using the natural numbers and certain relations holding between them. We need not dwell on the details of this work here.

Frege’s approach to providing a logical analysis of cardinality, the natural numbers, infinity and mathematical induction were groundbreaking, and have had a lasting importance within mathematical logic. Indeed, prior to 1902, it must have seemed to him that he had been completely successful in showing that the basic laws of arithmetic could be understood purely as logical truths. However, as we have seen, Frege’s definition of numbers heavily involves the notion of classes or value-ranges, but his logical treatment of them is shown to be impossible due to Russell’s paradox. This presents a serious problem for Frege’s logicist approach. Another heavy blow came after Frege’s death. In 1931, Kurt Gödel discovered his famous incompleteness proof to the effect that there can be no consistent formal system with a finite number of axioms in which it is possible to derive all of the truths of arithmetic. This presents a serious blow to more ambitious forms of logicism, such as Frege’s, which aimed to provide precisely the sort of system Gödel showed impossible. Nevertheless, it cannot be denied that Frege’s work in the philosophy of mathematics was important and insightful.

4. The Theory of Sense and Reference

Frege’s influential theory of meaning, the theory of sense (Sinn) and reference (Bedeutung) was first outlined, albeit briefly, in his article, “Funktion und Begriff” of 1891, and was expanded and explained in greater detail in perhaps his most famous work, “Über Sinn und Bedeutung” of 1892. In “Funktion und Begriff”, the distinction between the sense and reference of signs in language is first made in regard to mathematical equations. During Frege’s time, there was a widespread dispute among mathematicians as to how the sign, “=”, should be understood. If we consider an equation such as, “4 x 2 = 11 – 3″, a number of Frege’s contemporaries, for a variety of reasons, were wary of viewing this as an expression of an identity, or, in this case, as the claim that 4 x 2 and 11 – 3 are one and the same thing. Instead, they posited some weaker form of “equality” such that the numbers 4 x 2 and 11 – 3 would be said to be equal in number or equal in magnitude without thereby constituting one and the same thing. In opposition to the view that “=” signifies identity, such thinkers would point out that 4 x 2 and 11 – 3 cannot in all ways be thought to be the same. The former is a product, the latter a difference, etc.

In his mature period, however, Frege was an ardent opponent of this view, and argued in favor of understanding “=” as identity proper, accusing rival views of confusing form and content. He argues instead that expressions such as “4 x 2″ and “11 – 3” can be understood as standing for one and the same thing, the number eight, but that this single entity is determined or presented differently by the two expressions. Thus, he makes a distinction between the actual number a mathematical expression such as “4 x 2″ stands for, and the way in which that number is determined or picked out. The former he called the reference (Bedeutung) of the expression, and the latter was called the sense (Sinn) of the expression. In Fregean terminology, an expression is said to express its sense, and denote or refer to its reference.

The distinction between reference and sense was expanded, primarily in “Über Sinn und Bedeutung” as holding not only for mathematical expressions, but for all linguistic expressions (whether the language in question is natural language or a formal language). One of his primary examples therein involves the expressions “the morning star” and “the evening star”. Both of these expressions refer to the planet Venus, yet they obviously denote Venus in virtue of different properties that it has. Thus, Frege claims that these two expressions have the same reference but different senses. The reference of an expression is the actual thing corresponding to it, in the case of “the morning star”, the reference is the planet Venus itself. The sense of an expression, however, is the “mode of presentation” or cognitive content associated with the expression in virtue of which the reference is picked out.

Frege puts the distinction to work in solving a puzzle concerning identity claims. If we consider the two claims:

(1) the morning star = the morning star

(2) the morning star = the evening star

The first appears to be a trivial case of the law of self-identity, knowable a priori, while the second seems to be something that was discovered a posteriori by astronomers. However, if “the morning star” means the same thing as “the evening star”, then the two statements themselves would also seem to have the same meaning, both involving a thing’s relation of identity to itself. However, it then becomes to difficult to explain why (2) seems informative while (1) does not. Frege’s response to this puzzle, given the distinction between sense and reference, should be apparent. Because the reference of “the evening star” and “the morning star” is the same, both statements are true in virtue of the same object’s relation of identity to itself. However, because the senses of these expressions are different–in (1) the object is presented the same way twice, and in (2) it is presented in two different ways–it is informative to learn of (2). While the truth of an identity statement involves only the references of the component expressions, the informativity of such statements involves additionally the way in which those references are determined, i.e. the senses of the component expressions.

So far we have only considered the distinction as it applies to expressions that name some object (including abstract objects, such as numbers). For Frege, the distinction applies also to other sorts of expressions and even whole sentences or propositions. If the sense/reference distinction can be applied to whole propositions, it stands to reason that the reference of the whole proposition depends on the references of the parts and the sense of the proposition depends of the senses of the parts. (At some points, Frege even suggests that the sense of a whole proposition is composed of the senses of the component expressions.) In the example considered in the previous paragraph, it was seen that the truth-value of the identity claim depends on the references of the component expressions, while the informativity of what was understood by the identity claim depends on the senses. For this and other reasons, Frege concluded that the reference of an entire proposition is its truth-value, either the True or the False. The sense of a complete proposition is what it is we understand when we understand a proposition, which Frege calls “a thought” (Gedanke). Just as the sense of a name of an object determines how that object is presented, the sense of a proposition determines a method of determination for a truth-value. The propositions, “2 + 4 = 6” and “the Earth rotates”, both have the True as their references, though this is in virtue of very different conditions holding in the two cases, just as “the morning star” and “the evening star” refer to Venus in virtue of different properties.

In “Über Sinn und Bedeutung”, Frege limits his discussion of the sense/reference distinction to “complete expressions” such as names purporting to pick out some object and whole propositions. However, in other works, Frege makes it quite clear that the distinction can also be applied to “incomplete expressions”, which include functional expressions and grammatical predicates. These expressions are incomplete in the sense that they contain an “empty space”, which, when filled, yields either a complex name referring to an object, or a complete proposition. Thus, the incomplete expression “the square root of ( )” contains a blank spot, which, when completed by an expression referring to a number, yields a complex expression also referring to a number, e.g., “the square root of sixteen”. The incomplete expression, “( ) is a planet” contains an empty place, which, when filled with a name, yields a complete proposition. According to Frege, the references of these incomplete expressions are not objects but functions. Objects (Gegenstände), in Frege’s terminology, are self-standing, complete entities, while functions are essentially incomplete, or as Frege says, “unsaturated” (ungesättigt) in that they must take something else as argument in order to yield a value. The reference of the expression “square root of ( )” is thus a function, which takes numbers as arguments and yields numbers as values. The situation may appear somewhat different in the case of grammatical predicates. However, because Frege holds that complete propositions, like names, have objects as their references, and in particular, the truth-values the True or the False, he is able to treat predicates also as having functions as their references. In particular, they are functions mapping objects onto truth-values. The expression, “( ) is a planet” has as its reference a function that yields as value the True when saturated by an object such as Saturn or Venus, but the False when saturated by a person or the number three. Frege calls such a function of one argument place that yields the True or False for every possible argument a “concept” (Begriff), and calls similar functions of more than one argument place (such as that denoted by “( ) > ( )”, which is doubly in need of saturation), “relations”.

It is clear that functions are to be understood as the references of incomplete expressions, but what of the senses of such expressions? Here, Frege tells us relatively little save that they exist. There is some amount of controversy among interpreters of Frege as to how they should be understood. It suffices here to note that just as the same object (e.g. the planet Venus), can be presented in different ways, so also can a function be presented in different ways. While “identity”, as Frege uses the term, is a relation holding only between objects, Frege believes that there is a relation similar to identity that holds between functions just in case they always share the same value for every argument. Since all and only those things that have hearts have kidneys, strictly speaking, the concepts denoted by the expressions “( ) has a heart”, and “( ) has a kidney” are one and the same. Clearly, however, these expressions do not present that concept in the same way. For Frege, these expressions would have different senses but the same reference. Frege also tells us that it is the incomplete nature of these senses that provides the “glue” holding together the thoughts of which they form a part.

Frege also uses the distinction to solve what appears to be a difficulty with Leibniz’s law with regard to identity. This law was stated by Leibniz as, “those things are the same of which one can be substituted for another without loss of truth,” a sentiment with which Frege was in full agreement. As Frege understands this, it means that if two expressions have the same reference, they should be able to replace each other within any proposition without changing the truth-value of that proposition. Normally, this poses no problem. The inference from:

(3) The morning star is a planet.

to the conclusion:

(4) The evening star is a planet.

in virtue of (2) above and Leibniz’s law is unproblematically valid. However, there seem to be some serious counterexamples to this principle. We know for example that “the morning star” and “the evening star” have the same customary reference. However, it is not always true that they can replace one another without changing the truth of a sentence. For example, if we consider the propositions:

(5) Gottlob believes that the morning star is a planet.

(6) Gottlob believes that the evening star is a planet.

If we assume that Gottlob does not know that the morning star is the same heavenly body as the evening star, (5) may be true while (6) false or vice versa.

Frege meets this challenge to Leibniz’s law by making a distinction between what he calls the primary and secondary references of expressions. Frege suggests that when expressions appear in certain unusual contexts, they have as their references what is customarily their senses. In such cases, the expressions are said to have their secondary references. Typically, such cases involve what Frege calls “indirect speech” or “oratio obliqua“, as in the case of statements of beliefs, thoughts, desires and other so-called “propositional attitudes”, such as the examples of (5) and (6). However, expressions also have their secondary references (for reasons which should already be apparent) in contexts such as “it is informative that…” or “… is analytically true”.

Let us consider the examples of (5) and (6) more closely. To Frege’s mind, these statements do not deal directly with the morning star and the evening star itself. Rather, they involve a relation between a believer and a thought believed. Thoughts, as we have seen, are the senses of complete propositions. Beliefs depend for their make-up on how certain objects and concepts are presented, not only on the objects and concepts themselves. The truth of belief claims, therefore, will depend not on the customary references of the component expressions of the stated belief, but their senses. Since the truth-value of the whole belief claim is the reference of that belief claim, and the reference of any proposition, for Frege, depends on the references of its component expressions, we are led to the conclusion that the typical senses of expressions that appear in oratio obliqua are in fact the references of those expressions when they appear in that context. Such contexts can be referred to as “oblique contexts”, contexts in which the reference of an expression is shifted from its customary reference to its customary sense.

In this way, Frege is able to actually retain his commitment in Leibniz’s law. The expressions “the morning star” and “the evening star” have the same primary reference, and in any non-oblique context, they can replace each other without changing the truth-value of the proposition. However, since the senses of these expressions are not the same, they cannot replace each other in oblique contexts, because in such contexts, their references are non-identical.

Frege ascribes to senses and thoughts objective existence. In his mind, they are objects every bit as real as tables and chairs. Their existence is not dependent on language or the mind. Instead, they are said to exist in a timeless “third realm” of sense, existing apart from both the mental and the physical. Frege concludes this because, although senses are obviously not physical entities, their existence likewise does not depend on any one person’s psychology. A thought, for example, has a truth-value regardless of whether or not anyone believes it and even whether or not anyone has grasped it at all. Moreover, senses are interpersonal. Different people are able to grasp the same senses and same thoughts and communicate them, and it is even possible for expressions in different languages to express the same sense or thought. Frege concludes that they are abstract objects, incapable of full causal interaction with the physical world. They are actual only in the very limited sense that they can have an effect on those who grasp them, but are themselves incapable of being changed or acted upon. They are neither created by our uses of language or acts of thinking, nor destroyed by their cessation.

Unfortunately, Frege does not tell us very much about exactly how these abstract objects pick out or present their references. Exactly what is it that makes a sense a “way of determining” or “mode of presenting” a reference? In the wake of Russell’s theory of descriptions, a Fregean sense is often interpreted as a set of descriptive information or criteria that picks out its reference in virtue of the reference alone satisfying or fitting that descriptive information. In giving examples, Frege implies that a person might attach to the name “Aristotle” the sense the pupil of Plato and teacher of Alexander the Great. This sense picks out Aristotle the person because he alone matches this description. Here, care must be taken to avoid misunderstanding. The sense of the name “Aristotle” is not the words “the pupil of Plato and teacher of Alexander the Great”; to repeat, senses are not linguistic items. It is rather that the sense consists in some set of descriptive information, and this information is best described by a descriptive phrase of this form. The property of being the pupil of Plato and teacher of Alexander is unique to Aristotle, and thus, it may be in virtue of associating this information with the name “Aristotle” that this name may be used to refer to Aristotle. As certain commentators have noted, it is not even necessary that the sense of the name be expressible by some descriptive phrase, because the descriptive information or properties in virtue of which the reference is determined may not be directly nameable in any natural language.

From this standpoint, it is easy to understand how there might be senses that do not pick out any reference. Names such as “Romulus” or “Odysseus”, and phrases such as “the least rapidly converging series” or “the present King of France” express senses, insofar as they lay out criteria that things would have to satisfy if they were to be the references of these expressions. However, there are no things which do in fact satisfy these criteria. Therefore, these expressions are meaningful, but do not have references. Because the sense of a whole proposition is determined by the senses of the parts, and the reference of a whole proposition is determined by the parts, Frege claims that propositions in which such expressions appear are able to express thoughts, but are neither true nor false, because no references are determined for them.

This interpretation of the nature of senses makes Frege a forerunner to what has since been come to be known as the “descriptivist” theory of meaning and reference in the philosophy of language. The view that the sense of a proper name such as “Aristotle” could be descriptive information as simple as the pupil of Plato and teacher of Alexander the Great, however, has been harshly criticized by many philosophers, and perhaps most notably by Saul Kripke. Kripke points out that this would make a claim such as “Aristotle taught Alexander” seem to be a necessary and analytic truth, which it does not appear to be. Moreover, he claims that many of us seem to be able to use a name to refer to an individual even if we are unaware of any properties uniquely held by that individual. For example, many of us don’t know enough about the physicist Richard Feynman to be able to identify a property differentiating him from other prominent physicists such as Murray Gell-Mann, but we still seem to be able to refer to Feynman with the name “Feynman”. John Searle, Michael Dummett and others, however, have proposed ways of expanding or altering Frege’s notion of a sense to circumvent Kripke’s worries. This has led to a very important debate in the philosophy of language, which, unfortunately, we cannot fully discuss here.

5. References and Further Reading

a. Frege’s Own Works

  • “Antwort auf die Ferienplauderei des Herrn Thomae.” Jahresbericht der Deutschen Mathematiker-Vereinigung 15 (1906): 586-90. Translated as “Reply to Thomae’s Holiday Causerie.” In Collected Papers on Mathematics, Logic and Philosophy [CP], 341-5. Translated by M. Black, V. Dudman, P. Geach, H. Kaal, E.-H. W. Kluge, B. McGuinness and R. H. Stoothoff. New York: Basil Blackwell, 1984.
  • “Über Begriff und Gegenstand.” Vierteljahrsschrift für wissenschaftliche Philosophie 16 (1892): 192-205. Translated as “On Concept and Object.” In >CP 182-94. Also in The Frege Reader [FR], 181-93. Edited by Michael Beaney. Oxford: Blackwell, 1997. And In Translations from the Philosophical Writings of Gottlob Frege [TPW], 42-55. 3d ed. Edited by Peter Geach and Max Black. Oxford: Blackwell, 1980.
  • Begriffsschrift, eine der arithmetischen nachgebildete Formelsprache des reinen Denkens. Halle: L. Nebert, 1879. Translated as Begriffsschrift, a Formula Language, Modeled upon that of Arithmetic, for Pure Thought. In From Frege to Gödel, edited by Jean van Heijenoort. Cambridge, MA: Harvard University Press, 1967. Also as Conceptual Notation and Related Articles. Edited and translated by Terrell W. Bynum. London: Oxford University Press, 1972.
  • “Über die Begriffsschrift des Herrn Peano und meine eigene.” Verhandlungen der Königlich Sächsischen Gesellschaft der Wissenschaften zu Leipzig 48 (1897): 362-8. Translated as “On Mr. Peano’s Conceptual Notation and My Own.” In CP 234-48.
  • “Über formale Theorien der Arithmetik.” Sitzungsberichte der Jenaischen Gesellschaft für Medizin und Naturwissenschaft 19 (1885): 94-104. Translated as “On Formal Theories of Arithmetic.” In CP 112-21.
  • Funktion und Begriff. Jena: Hermann Pohle, 1891. Translated as “Function and Concept.” In CP 137-56, TPW 21-41 and FR 130-48.
  • “Der Gedanke.” Beträge zur Philosophie des deutschen Idealismus 1 (1918-9): 58-77. Translated as “Thoughts.” In CP 351-72. Also as part I of Logical Investigations [LI], edited by P. T. Geach. Oxford: Blackwell, 1977. And as “Thought.” In FR 325-45.
  • “Gedankengefüge.” Beträge zur Philosophie des deutschen Idealismus 3 (1923): 36-51. Translated as “Compound Thoughts.” In CP 390-406, and as part III of LI.
  • Über eine geometrische Darstellung der imaginären Gebilde in der Ebene. Ph. D. Dissertation: University of Göttingen, 1873. Translated as “On a Geometrical Representation of Imaginary Forms in the Plane.” In CP 1-55.
  • Grundgesetze der Arithmetik. 2 vols. Jena: Hermann Pohle, 1893-1903. Translated in part as The Basic Laws of Arithmetic: Exposition of the System. Edited and translated by Montgomery Furth. Berkeley: University of California Press, 1964.
  • “Über die Grundlagen der Geometrie.” Jahresbericht der Deutschen Mathematiker-Vereinigung 12 (1903): 319-24, 368-75, 15 (1906): 293-309, 377-403, 423-30. Translated as “On the Foundations of Geometry.” In CP 273-340. Also as On the Foundations of Geometry and Formal Theories of Arithmetic. Translated by Eike-Henner W. Kluge. New York: Yale University Press, 1971.
  • Die Grundlagen der Arithmetik, eine logisch mathematische Untersuchung über den Begriff der Zahl. Breslau: W. Koebner, 1884. Translated as The Foundations of Arithmetic: A Logico-Mathematical Enquiry into the Concept of Number. 2d ed. Translated by J. L. Austin. Oxford: Blackwell, 1953.
  • “Kritische Beleuchtung einiger Punkte in E. Schröders Vorlesungen über die Algebra der Logik.Archiv für systematsche Philosophie 1 (1895): 433-56. Translated as “A Critical Elucidation of Some Points in E. Schröder, Vorlesungen über die Algebra der Logik.” In CP 210-28, and TPW 86-106.
  • Nachgelassene Schriften. Hamburg: Felix Meiner, 1969. Translated as Posthumous Writings. Translated by Peter Long and Roger White. Chicago: University of Chicago Press, 1979.
  • “Le nombre entier.” Revue de Métaphysique et de Morale 3 (1895): 73-8. Translated as “Whole Numbers.” In CP 229-33.
  • Rechnungsmethoden, die auf eine Erweiterung des Grössenbegriffes gründen. Habilitationsschrift: University of Jena, 1874. Translated as “Methods of Calculation based on an Extension of the Concept of Quantity.” In CP 56-92.
  • Review of Zur Lehre vom Transfiniten, by Georg Cantor. Zeitschrift für Philosophie und philosophische Kritik 100 (1892): 269-72. Translated in CP 178-181.
  • Review of Philosophie der Arithmetik, by Edmund Husserl. Zeitschrift für Philosophie und philosophische Kritik 103 (1894): 313-32. Translated in CP 195-209.
  • “Über Sinn und Bedeutung.” Zeitschrift für Philosophie und philosophische Kritik 100 (1892): 25-50. Translated as “On Sense and Meaning.” In CP 157-77. As “On Sinn and Bedeutung.” In FR 151-71. And as “On Sense and Reference.” In TPW 56-78.
  • “Über das Trägheitsgesetz.” Zeitschrift für Philosophie und philosophische Kritik 98 (1891): 145-61. Translated as “On the Law of Inertia.” In CP 123-36.
  • “Die Unmöglichkeit der Thomaeschen formalen Arithmetik aus Neue nachgewiesen.” Jahresbericht der Deutschen Mathematiker-Vereinigung 17 (1908): 52-5. Translated as “Renewed Proof of the Impossibility of Mr. Thomae’s Formal Arithmetic.” In CP 346-50.
  • “Der Verneinung.” Beträge zur Philosophie des deutschen Idealismus 1 (1918-9): 143-57. Translated as “Negation.” In CP 373-89, part II of LI, and FR 346-61.
  • “Was ist ein Funktion?” In Festschrift Ludwig Boltzmann gewidmet zum sechzigsten Geburtstage, 656-66. Leipzig: Amrosius Barth, 1904. Translated as “What is a Function?” In CP 285-92, and TPW 285-92.
  • Wissenschaftlicher Briefwechsel. Hamburg: Felix Meiner, 1976. Translated as Philosophical and Mathematical Correspondence. Translated by Hans Kaal. Chicago: University of Chicago Press, 1980.
  • Über die Zahlen des Herrn H. Schubert. Jena: Hermann Pohle, 1899. Translated as “On Mr. H. Schubert’s Numbers.” In CP 249-72.

b. Important Secondary Works

  • Angelelli, Ignacio. Studies on Gottlob Frege and Traditional Philosophy. Dordrecht: D. Reidel, 1967.
  • Baker, G. P. and P. M. S. Hacker. Frege: Logical Excavations. New York: Oxford University Press, 1984.
  • Beaney, Michael. Frege: Making Sense. London: Duckworth, 1996.
  • Beaney, Michael. Introduction to The Frege Reader, by Gottlob Frege. Oxford: Blackwell, 1997.
  • Bell, David. Frege’s Theory of Judgment. New York: Oxford University Press, 1979.
  • Bynum, Terrell W. “On the Life and Work of Gottlob Frege. ” Introduction to Conceptual Notation and Related Articles, by Gottlob Frege. London: Oxford University Press, 1972.
  • Carl, Wolfgang. Frege’s Theory of Sense and Reference. Cambridge: Cambridge University Press, 1994.
  • Carnap, Rudolph. Meaning and Necessity. 2d ed. Chicago: University of Chicago Press, 1956.
  • Church, Alonzo. “A Formulation of the Logic of Sense and Denotation.” In Structure, Method and Meaning: Essays in Honor of Henry M. Sheffer, edited by P. Henle, H. Kallen and S. Langer, 3- 24. New York: Liberal Arts Press, 1951.
  • Currie, Gregory. Frege: An Introduction to His Philosophy. Totowa, NJ: Barnes and Noble, 1982.
  • Dummett, Michael. Frege: Philosophy of Language. 2d ed. Cambridge, MA: Harvard University Press, 1981.
  • Dummett, Michael. Frege: Philosophy of Mathematics. Cambridge, MA: Harvard University Press, 1991.
  • Dummett, Michael. Frege and Other Philosophers. Oxford: Oxford University Press, 1991.
  • Dummett, Michael. The Interpretation of Frege’s Philosophy. Cambridge, MA: Harvard University Press, 1981.
  • Geach, Peter T. “Frege.” In Three Philosophers, edited by G. E. M. Anscombe and P. T. Geach, 127-62. Oxford: Oxford University Press, 1961.
  • Gödel, Kurt. “On Formally Undecidable Propositions of Principia Mathematica and Related Systems I.” In From Frege to Gödel, edited by Jan van Heijenoort, 596-616. Cambridge, MA: Harvard University Press, 1967. Originally published as “Über formal unentscheidbare Sätze der Principia Mathematica und verwandter Systeme I.” Monatshefte für Mathematik und Physik 38 (1931): 173-98.
  • Grossmann, Reinhardt. Reflections on Frege’s Philosophy. Evanston: Northwestern University Press, 1969.
  • Haaparanta, Leila and Jaakko Hintikka, eds. Frege Synthesized. Boston: D. Reidel, 1986.
  • Kaplan, David. “Quantifying In.” Synthese 19 (1968): 178-214.
  • Klemke, E. D., ed. Essays on Frege. Urbana: University of Illinois Press, 1968.
  • Kluge, Eike-Henner W. The Metaphysics of Gottlob Frege. Boston: Martinus Nijhoff, Boston, 1980.
  • Kneale, William and Martha Kneale. The Development of Logic. London: Oxford University Press, 1962.
  • Kripke, Saul. Naming and Necessity. Cambridge, MA: Harvard University Press, 1980. First published in Semantics of Natural Languages. Edited by Donald Davidson and Gilbert Harman. Dordrecht: D. Reidel, 1972.
  • Linsky, Leonard. Oblique Contexts. Chicago: University of Chicago Press, 1983.
  • Resnik, Michael D. Frege and the Philosophy of Mathematics. Ithaca: Cornell University Press, 1980.
  • Ricketts, Thomas G., ed. The Cambridge Companion to Frege. Cambridge: Cambridge University Press, forthcoming.
  • Russell, Bertrand. “The Logical and Arithmetical Doctrines of Frege.” In The Principles of Mathematics, Appendix A. 1903. 2d. ed. Reprint, New York: W. W. Norton & Company, 1996.
  • Russell, Bertrand. “On Denoting.” Mind 14 (1905): 479-93.
  • Salmon, Nathan. Frege’s Puzzle. Cambridge: MIT Press, 1986.
  • Schirn. Matthias, ed. Logik und Mathematik: Frege Kolloquium 1993. Hawthorne: de Gruyter, 1995.
  • Schirn. Matthias, ed. Studien zu Frege. 3 vols. Stuttgart-Bad Cannstatt: Verlag-Holzboog, 1976.
  • Searle, John R. Intentionality: An Essay in the Philosophy of Mind. Cambridge: Cambridge University Press, 1983.
  • Sluga, Hans. “Frege and the Rise of Analytic Philosophy.” Inquiry 18 (1975): 471-87.
  • Sluga, Hans. Gottlob Frege. Boston: Routledge & Kegan Paul, 1980.
  • Sluga, Hans. The Philosophy of Frege. 4 vols. New York: Garland Publishing, 1993.
  • Sternfeld, Robert. Frege’s Logical Theory. Carbondale: Southern Illinois University Press, 1966.
  • Thiel, Christian. Sense and Reference in Frege’s Logic. Translated by T. J. Blakeley. Dordrecht: D. Reidel, 1968.
  • Tichý, Pavel. The Foundations of Frege’s Logic. New York: Walter de Gruyter, 1988.
  • Walker, Jeremy D. B. A Study of Frege. London: Oxford University Press, 1965.
  • Weiner, Joan. Frege in Perspective. Ithaca: Cornell University Press, 1990.
  • Wright, Crispin. Frege’s Conception of Numbers as Objects. Aberdeen: Aberdeen University Press, 1983.
  • Wright, Crispin. Frege: Tradition and Influence. Oxford: Blackwell, 1984.

Author Information

Kevin C. Klement
Email: klement@philos.umass.edu
University of Massachusetts, Amherst
U. S. A.

Michel Foucault: Feminism

Michel FoucaultPoststructuralism and contemporary feminism have emerged as two of the most influential political and cultural movements of the late twentieth century. The recent alliance between them has been marked by an especially lively engagement with the work of French philosopher Michel Foucault. Although Foucault makes few references to women or to the issue of gender in his writings, his treatment of the relations between power, the body and sexuality has stimulated extensive feminist interest. Foucault’s idea that the body and sexuality are cultural constructs rather than natural phenomena has made a significant contribution to the feminist critique of essentialism. While feminists have found Foucault’s analysis of the relations between power and the body illuminating, they have also drawn attention to its limitations. From the perspective of a feminist politics that aims to promote women’s autonomy, the tendency of a Foucauldian account of power to reduce social agents to docile bodies seems problematic. Although many feminist theorists remain critical of Foucault’s questioning of the categories of the subject and agency on the grounds that such questioning undermines the emancipatory aims of feminism, others have argued that in his late work he develops a more robust account of subjectivity and resistance which, while not without its problems from a feminist perspective, nevertheless has a lot to offer a feminist politics. The affinities and tensions between Foucault’s thought and contemporary feminism are discussed below.

Table of Contents

  1. Background: Foucault’s Genealogy of Power, Knowledge and the Subject
  2. Between Foucault and Feminism: Convergence and Critique
  3. Power, the Body and Sexuality
  4. Subjectivity, Identity and Resistance
  5. Freedom, Power and Politics
  6. References and Further Reading

1. Background: Foucault’s Genealogy of Power, Knowledge and the Subject

In the works of his middle years – Discipline and Punish and The History of Sexuality, Vol. 1 – Foucault traces the emergence of some of the practices, concepts, forms of knowledge, social institutions and techniques of government which have contributed to shaping modern European culture. He calls the method of historical analysis he employs ‘genealogical’. Genealogy is a form of critical history in the sense that it attempts a diagnosis of ‘the present time, and of what we are, in this very moment’ in order ‘to question … what is postulated as self-evident … to dissipate what is familiar and accepted’ (Foucault 1988a: 265). What distinguishes genealogical analysis from traditional historiography is that it is ‘a form of history which can account for the constitution of knowledges, discourses, domains of objects etc. without having to make reference to a subject which is either transcendental in relation to the field of events or runs in its empty sameness throughout history’ (Foucault 1980: 149). Rather than assuming that the movement of history can be explained by the intentions and aims of individual actors, genealogy investigates the complex and shifting network of relations between power, knowledge and the body which produce historically specific forms of subjectivity. Foucault links his genealogical studies to a modality of social critique which he describes as a ‘critical ontology of the present’. In a late paper, he explains that an ontology of the present involves ‘an analysis of the historical limits that are imposed on us’ in order to create the space for ‘an experiment with the possibility of going beyond them’ (Foucault 1984: 50). Thus, genealogy is a form of social critique that seeks to determine possibilities for social change and ethical transformation of ourselves.

One of the central threads of Foucault’s genealogy of the present is an analysis of the transformations in the nature and functioning of power which mark the transition to modern society. Foucault’s genealogy of modern power challenges the commonly held assumption that power is an essentially negative, repressive force that operates purely through the mechanisms of law, taboo and censorship. According to Foucault, this ‘juridico-discursive’ conception of power (Foucault 1978: 82) has its origins in the practices of power characteristic of pre-modern societies. In such societies, he claims, power was centralized and coordinated by a sovereign authority who exercised absolute control over the population through the threat or open display of violence. From the seventeenth century onwards, however, as the growth and care of populations increasingly became the primary concerns of the state, new mechanisms of power emerged which centered around the administration and management of ‘life’. In the complex story that Foucault tells, this new form of ‘bio-power’ coalesced around two poles. One pole is concerned with the efficient government of the population as a whole and focuses on the management of the life processes of the social body. It involves the regulation of phenomena such as birth, death, sickness, disease, health, sexual relations and so on. The other pole, which Foucault labels ‘disciplinary power’, targets the human body as an object to be manipulated and trained. In Discipline and Punish Foucault studies the practices of discipline and training associated with disciplinary power. He suggests that these practices were first cultivated in isolated institutional settings such as prisons, military establishments, hospitals, factories and schools but were gradually applied more broadly as techniques of social regulation and control. The key feature of disciplinary power is that it is exercised directly on the body. Disciplinary practices subject bodily activities to a process of constant surveillance and examination that enables a continuous and pervasive control of individual conduct. The aim of these practices is to simultaneously optimize the body’s capacities, skills and productivity and to foster its usefulness and docility: ‘What was then being formed was a policy of coercions that act on the body, a calculated manipulation of its elements, its gestures, its behavior. The human body was entering a machinery of power that explores it, breaks it down and rearranges it…Thus, discipline produces subjected and practiced bodies, “docile” bodies’ (Foucault 1977: 138-9). It is not, however, only the body that disciplinary techniques target. Foucault presents disciplinary power as productive of certain types of subject as well. In Discipline and Punish he describes the way in which the central technique of disciplinary power – constant surveillance – which is initially directed toward disciplining the body, takes hold of the mind as well to induce a psychological state of ‘conscious and permanent visibility’ (Foucault 1977: 201). In other words, perpetual surveillance is internalized by individuals to produce the kind of self-awareness that defines the modern subject. With the idea that modern power operates to produce the phenomena it targets Foucault challenges the juridical notion of power as law which assumes that power is simply the constraint or repression of something that is already constituted. On Foucault’s account the transition to modernity entails the replacement of the law by the norm as the primary instrument of social control. Foucault links the importance assumed by norms in modern society to the development of the human or social sciences. In the first volume of The History of Sexuality he describes how, in the eighteenth and nineteenth centuries, sex and sexuality became crucial political issues in a society concerned with managing and directing the life of individuals and of populations. On Foucault’s account, the spread of bio-power is intimately connected to the social science discourses on sex and sexuality which proliferated during this period. These discourses, he claims, tended to understand sex as an instinctual biological and psychic drive with deep links to identity and, thus, with potentially far-reaching effects on the sexual and social behavior of individuals. The idea that the sexual drive could function in a normal, healthy manner or could be warped and perverted into pathological forms led to a project of classification of behavior along a scale of normalization and pathologization of the sexual instinct (Dreyfus & Rabinow 1982: 173). Once the social (and sexual) science categories of normalcy and deviancy were established, various political technologies aimed at treating and reforming ‘deviant’ behavior could be sanctioned as in the interests of both the individual and society. Thus, Foucault suggests that in modern society the behavior of individuals and groups is increasingly pervasively controlled through standards of normality which are disseminated by a range of assessing, diagnostic, prognostic and normative knowledges such as criminology, medicine, psychology and psychiatry. Modern individuals, moreover, become the agents of their own ‘normalization’ to the extent that they are subjected to, and become invested in, the categories, classifications and norms propagated by scientific and administrative discourses which purport to reveal the ‘truth’ of their identities. Modern disciplinary society can, therefore, dispense with direct forms of repression and constraint because social control is achieved by means of subtler strategies of normalization, strategies which produce self-regulating, ‘normalized’ individuals. It is Foucault’s insight into the productivity of the practices and technologies characteristic of normalizing bio-power that underpins his general conclusion that power in modern societies is a fundamentally creative rather than repressive force (Foucault 1977: 194). Above all, Foucault claims that modern regimes of power operate to produce us as subjects who are both the objects and vehicles of power. He explains that: ‘The individual is not to be conceived as a sort of elementary nucleus, a primitive atom, a multiple and inert material on which power comes to fasten or against which it happens to strike, and in so doing subdues or crushes individuals. In fact, it is already one of the prime effects of power that certain bodies, certain gestures, certain discourses, certain desires, come to be identified and constituted as individuals. The individual, that is, is not the vis-à-vis of power; it is … one of its prime effects.’ (Foucault 1980: 98). Foucault’s analysis of productive bio-power points to a complex interaction between modern forms of power and knowledge: ‘the exercise of power perpetually creates knowledge and, conversely, knowledge constantly induces effects of power’ (Foucault 1980: 52). For Foucault, power can be said to create knowledge in two related senses. Firstly, in the sense that particular institutions of power make certain forms of knowledge historically possible. In the case of the social sciences, for example, it is the refinement of disciplinary techniques for observing and analyzing the body in various institutional settings that facilitates the expansion of new areas of social research. Power can also be said to create knowledge in the sense that institutions of power determine the conditions under which scientific statements come to be counted as true or false (Hacking 1986). According to Foucault, then, ‘truth is a thing of this world: it is produced only by virtue of multiple forms of constraint. And it induces regular effects of power’ (Foucault 1980: 131). This description suggests that the production of ‘truth’ is never entirely separable from technologies of power. On the other hand, Foucault maintains that knowledge induces effects of power in so far as it constitutes new objects of inquiry – ‘objects’ like ‘the delinquent’, ‘the homosexual’ or ‘the criminal type’ – which then become available for manipulation and control (Rouse 1994: 97). For example, he claims that it is the knowledge generated by the human sciences which enables modern power to circulate through finer channels, ‘gaining access to individuals themselves, to their bodies, their gestures, and all their daily actions’ (Foucault 1980: 151). It is in order to signal the mutually conditioning operations of power and knowledge that Foucault speaks of regimes of ‘power/knowledge’ or ‘discourses’; that is, structured ways of knowing and exercising power.

2. Between Foucault and Feminism: Convergence and Critique

From the perspective of contemporary social and political theory, the originality of Foucault’s genealogies of power/knowledge resides in the challenge they pose to traditional ways of thinking about power. It is this challenge that has made Foucault’s work both a significant resource for feminist theory and generated heated debate amongst feminist social and political theorists. While there is broad agreement that Foucault’s redefinition of how we think about power in contemporary societies contains important insights for feminism, feminists remain divided over the implications of this redefinition for feminist theory and practice.

An analysis of power relations is central to the feminist project of understanding the nature and causes of women’s subordination. Drawing on the traditional model of power as repression, many types of feminist theory have assumed that the oppression of women can be explained by patriarchal social structures which secure the power of men over women. Increasingly, however, this assumption is being called into question by other feminists who are concerned to counter what they regard as the oversimplified conception of power relations this view entails, as well as its problematic implication that women are simply the passive, powerless victims of male power. In the context of this debate, Foucault’s work on power has been used by some feminists to develop a more complex analysis of the relations between gender and power which avoids the assumption that the oppression of women is caused in any simple way by men’s possession of power. On the basis of Foucault’s understanding of power as exercised rather than possessed, as circulating throughout the social body rather than emanating from the top down, and as productive rather than repressive (Sawicki 1988: 164), feminists have sought to challenge accounts of gender relations which emphasize domination and victimization so as to move towards a more textured understanding of the role of power in women’s lives. Foucault’s redefinition of power has made a significant and varied contribution to this project. Foucault’s notion that power is constitutive of that upon which it acts has enabled feminists to explore the often complicated ways in which women’s experiences, self-understandings, comportment and capacities are constructed in and by the power relations which they are seeking to transform. The idea that modern power is involved in producing rather than simply repressing individuals has also played a part in a controversial move within feminism away from traditional liberationist political orientations. Eschewing a liberationist political program which aims for total emancipation from power, Foucauldian-influenced feminism concentrates on exposing the localized forms that gender power relations take at the micro-political level in order to determine concrete possibilities for resistance and social change. In pursuing this project, feminist scholars have drawn on Foucault’s analysis of the productive dimension of disciplinary power which is exercised outside of the narrowly defined political realm in order to examine the workings of power in women’s everyday lives. Some feminists have also found Foucault’s contention that the body is the principal site of power in modern society useful in their explorations of the social control of women through their bodies and sexuality. Finally, feminists have taken up Foucault’s analytic of power/knowledge, with its emphasis on the criteria by which claims to knowledge are legitimated, in order to develop a theory which avoids generalizing from the experiences of Western, white, heterosexual, middle-class feminisms. Drawing on Foucault’s questioning of fixed essences and his relativist notion of truth, feminists have sought to create a theoretical space for the articulation of hitherto marginalized subject positions, political perspectives and interests. While there is considerable overlap between Foucault’s analytic of power/knowledge and feminist concerns, his work has also been subject to strong criticism by feminists. This more critical body of work takes issue with precisely those aspects of Foucault’s conception of power that Foucauldian feminists have found useful. The most commonly cited feminist objections center around two issues: his view of subjectivity as constructed by power and his failure to outline the norms which inform his critical enterprise. Nancy Fraser argues that the problem with Foucault’s claim that forms of subjectivity are constituted by relations of power is that it leaves no room for resistance to power. If individuals are simply the effects of power, mere ‘docile bodies’ shaped by power, then it becomes difficult to explain who resists power. Thus, Fraser finds Foucault’s assertion that power always generates resistance incoherent. She argues, moreover, that Foucault’s refusal to articulate independently justified norms which would enable him to distinguish acceptable from unacceptable forms of power means that he cannot answer crucial questions about why domination ought to be resisted. According to Fraser, ‘only with the introduction of normative notions could he begin to tell us what is wrong with the modern power/knowledge regime and why we ought to oppose it’ (Fraser 1989: 29). In Fraser’s view, Foucault’s normatively neutral stance on power limits the value of his work for feminism because it fails to provide the normative resources required to criticize structures of domination and to guide programs for social change. Echoing and extending Fraser’s criticisms, Nancy Hartsock contends that Foucault’s questioning of the categories of subjectivity and agency should be treated with suspicion by feminists. She asks: ‘Why is it that just at the moment when so many of us who have been silenced begin to demand the right to name ourselves, to act as subjects rather than objects of history, that just then the concept of subjecthood becomes problematic?’ (Hartsock 1990: 164). Like Fraser, Hartsock finds Foucault’s conception of modern power problematic in so far as it reduces individuals to ‘docile bodies’ rather than subjects with the capacity to resist power. She claims that Foucault’s understanding of the subject as an effect of power threatens the viability of a feminist politics because it denies the liberatory subject and, thus, condemns women to perpetual oppression. Hartsock argues, moreover, that Foucault’s rejection of the Enlightenment belief that truth is intrinsically opposed to power (and, therefore, inevitably plays a liberating role) undermines the emancipatory political aims of feminism. By insisting on the mutually conditioning operations of knowledge and power, Hartsock contends that Foucault denies the possibility of liberatory knowledge; that is, he denies the possibility that increased and better knowledge of patriarchal power can lead to liberation from oppression. For this reason she believes that his work is incompatible with the fundamentally emancipatory political orientation of feminism. These criticisms of Foucault are directed at the conception of the subject and power developed in his middle years. Some feminists have argued, however, that in his late work Foucault modifies his theoretical perspective in ways that make it more useful to the project of articulating a coherent feminist ethics and politics. Feminist responses to Foucault’s late work are discussed in the final section.

3. Power, the Body and Sexuality

There are a number of aspects of Foucault’s analysis of the relations between power, the body and sexuality that have stimulated feminist interest. Firstly, Foucault’s analyses of the productive dimensions of disciplinary powers which is exercised outside the narrowly defined political domain overlap with the feminist project of exploring the micropolitics of personal life and exposing the mechanics of patriarchal power at the most intimate levels of women’s experience. Secondly, Foucault’s treatment of power and its relation to the body and sexuality has provided feminist social and political theorists with some useful conceptual tools for the analysis of the social construction of gender and sexuality and contributed to the critique of essentialism within feminism. Finally, Foucault’s identification of the body as the principal target of power has been used by feminists to analyze contemporary forms of social control over women’s bodies and minds.

Rather than focusing on the centralized sources of societal power in agencies such as the economy or the state, Foucault’s analysis of power emphasizes micro level power relations. Foucault argues that, since modern power operates in a capillary fashion throughout the social body, it is best grasped in its concrete and local effects and in the everyday practices which sustain and reproduce power relations. This emphasis on the everyday practices through which power relations are reproduced has converged with the feminist project of analyzing the politics of personal relations and altering gendered power relations at the most intimate levels of experience ‘in the institutions of marriage, motherhood and compulsory heterosexuality, in the ‘private’ relations between the sexes and in the everyday rituals and regimens that govern women’s relationships to themselves and their bodies (Sawicki 1998: 93). Nancy Fraser notes that Foucault’s work gives renewed impetus to what is often referred to as ‘the politics of everyday life’ in so far as it provides ‘the empirical and conceptual basis for treating phenomena such as sexuality, the school, psychiatry, medicine and social science as political phenomena.’ She argues that because Foucault’s approach to the analysis of power sanctions the treatment of problems in these areas as political problems it ‘widens the arena within which people may collectively confront, understand and try to change the character of their lives’ (Fraser 1989: 26). One of Foucault’s most fertile insight into the workings of power at the micro-political level is his identification of the body and sexuality as the direct locus of social control. Foucault insists on the historical specificity of the body. It is this emphasis on the body as directly targeted and formed by historically variable regimes of bio-power that has made Foucault’s version of poststructuralist theory the most attractive to feminist social and political theorists. The problem of how to conceive of the body without reducing its materiality to a fixed biological essence has been one of the key issues for feminist theory. At a fundamental level, a notion of the body is central to the feminist analysis of the oppression of women because biological differences between the sexes are the foundation that has served to ground and legitimize gender inequality. By means of an appeal to ahistorical biological characteristics, the idea that women are inferior to men is naturalized and legitimized. This involves two related conceptual moves. Firstly, women’s bodies are judged inferior with reference to norms and ideals based on men’s physical capacities and, secondly, biological functions are collapsed into social characteristics. While traditionally men have been thought to be capable of transcending the level of the biological through the use of their rational faculties, women have tended to be defined entirely it terms of their physical capacities for reproduction and motherhood. In an effort to avoid this conflation of the social category of woman with biological functions (essentialism), earlier forms of feminism developed a theory of social construction based on the distinction between sex and gender. The sex/gender distinction represents an attempt by feminists to sever the connection between the biological category of sex and the social category of gender. According to this view of social construction, gender is the cultural meaning that comes to be contingently attached to the sexed body. Once gender is understood as culturally constructed it is possible to avoid the essentialist idea that gender derives from the natural body in any one way. However, while the distinction between ahistorical biological sexes and culturally constructed gender roles challenges the notion that a woman’s biological makeup is her social destiny, it entails a problematic dissociation of culturally constructed genders from sexed bodies. The effect of this dissociation is that the sexed body comes to be seen as irrelevant to an individual’s gendered cultural identity. It is this disconcerting consequence of drawing a distinction between sex and gender that has led some feminists to appropriate Foucault’s theory of the body and sexuality. In the first volume of The History of Sexuality, Foucault develops an anti-essentialist account of the sexual body, which, however, doesn’t deny its materiality. At the heart of Foucault’s history of sexuality is an analysis of the production of the category of sex and its function in regimes of power aimed at controlling the sexual body. Foucault argues that the construct of a supposedly ‘natural’ sex functions to disguise the productive operation of power in relation to sexuality: ‘The notion of sex brought about a fundamental reversal; it made it possible to invert the representation of the relationships of power to sexuality, causing the latter to appear, not in its essential and positive relation to power, but as being rooted in a specific and irreducible urgency which power tries as best it can to dominate’ (Foucault 1978: 155). Foucault’s claim here is that the relationship between power and sexuality is misrepresented when sexuality is viewed as an unruly natural force that power simply opposes, represses or constrains. Rather, the phenomenon of sexuality should be understood as constructed through the exercise of power relations. Drawing on Foucault’s account of the historical construction of sexuality and the part played by the category of sex in this construction, feminists have been able to rethink gender, not as the cultural meanings that are attached to a pregiven sex, but, in Judith Butler’s formulation, ‘as the … cultural means by which “sexed nature” or “a natural sex” is produced and established as…prior to culture’ (Butler 1990: 7). Following Foucault, Butler argues that the notion of a ‘natural’ sex that is prior to culture and socialization is implicated in the production and maintenance of gendered power relations because it naturalizes the regulatory idea of a supposedly natural heterosexuality and, thus, reinforces the reproductive constraints on sexuality. In addition to his anti-essentialist view of the body and sexuality, Foucault insists on the corporeal reality of bodies. He argues that this rich and complex reality is oversimplified by the biological category of sex which groups together in an ‘artificial unity’ a range of disparate and unrelated biological functions and bodily pleasures. Thus, in The History of Sexuality, Foucault explains that: ‘The purpose of the present study is in fact to show how deployments of power are directly connected to the body – to bodies, functions, physiological processes, sensations, and pleasures; far from the body having to be effaced, what is needed is to make it visible through an analysis in which the biological and the historical are not consecutive to one another … but are bound together in an increasingly complex fashion in accordance with the development of the modern technologies of power that take life as their objective. Hence I do not envisage a “history of mentalities” that would take account of bodies only through the manner in which they have been perceived and given meaning and value; but a “history of bodies” and the manner in which what is most material and most vital in them has been invested’ (Foucault 1978: 151-2). Because Foucault’s anti-essentialist account of the body is nevertheless attentive to the materiality of bodies it has been attractive to feminists concerned to expose the processes through which the female body is transformed into a feminine body. Thus, in claiming that the body is directly targeted and ‘produced’ by power and, thus, unknowable outside of its cultural significations, Foucault breaks down the distinction between a natural sex and a culturally constructed gender. Elizabeth Grosz argues that, unlike some other versions of poststructuralist theory which analyze the representation of bodies without due regard for their materiality, Foucault’s insistence on the corporeal reality of the body which is directly molded by social and historical forces avoids the traditional gendered opposition between the body and culture. For this reason, she believes that, while Foucault fails to consider the issue of sexual difference, his thought may contribute to the feminist project of exploring the relation between social power and the production of sexually differentiated bodies (Grosz 1994). Not all feminists, however, are comfortable with Foucault’s anti-naturalistic rhetoric. Kate Soper argues that by jettisoning the idea of a natural body, Foucault’s anti-essentialism might ‘lend itself to the forces of reaction in so far as it offers itself as a pre-emptive warning against any politics which aims at the removal of the constraining and distorting effects of cultural stereotyping’ (Soper 1993: 33). Here Soper articulates a common feminist concern about the potentially conservative political consequences of Foucault’s version of social constructivism. By contrast, Lois McNay argues that although Foucault’s model of the relation between the body and power precludes the view that the body and sexuality might be liberated from power, it leaves room for the possibility that existing forms of sexuality and gendered power relations might be transformed. According to McNay, Foucault’s history of sexuality ‘exposes the contingent and socially determined nature of sexuality and, thereby, frees the body from the regulatory fiction of heterosexuality and opens up new realms in which bodily pleasures can be explored’ (McNay 1992: 30). In another fruitful engagement with Foucault’s work on the body and power, feminist scholars have embraced the notion of normalizing-disciplinary power for its potential to shed light on the social control of women in a contemporary context. For example, Sandra Bartky’s appropriation of Foucault takes the form of a detailed examination of the subjection of the female body to disciplinary practices such as dieting, exercise and beauty regimens that produce a form of embodiment which conforms to prevailing norms of feminine beauty and attractiveness. On her account these disciplinary practices subjugate women, not by taking power away from them, but by generating skills and competencies that depend on the maintenance of a stereotypical form of feminine identity. Bartky suggests that women’s seemingly willing acceptance of the various norms and practices that promote their larger disempowerment is due to the fact that challenging ‘the patriarchal construction of the female body… may call into question that aspect of personal identity that is tied to the development of a sense of competence’ (Bartky 1988: 77; Sawicki 1994: 293). In a similar vein, Susan Bordo brings Foucauldian insights to bear in her analysis of predominantly female eating disorders such as anorexia nervosa and bulimia (Bordo 1988). Following Foucault, she argues that these disorders might be understood as disciplinary technologies of the body. The anorexic woman takes to an extreme the practices to which women subject themselves in their efforts to conform to cultural norms of an ideal feminine form. In the figure of the anorexic Bordo sees an association of power and self-control with the achievement of a potentially fatal slenderness. For Bordo, this association is a stark illustration of the way in which disciplinary power is linked to the social control of women. Disciplinary technologies are particularly effective forms of social control because they take hold of individuals at the level of their bodies, gestures, desires and habits to create individuals who are attached to and, thus, the unwitting agents of their own subjection. In other words, disciplinary power fashions individuals who ‘voluntarily’ subject themselves to self-surveillance and self-normalization. Thus, like Bartky, Bordo finds Foucault’s work useful to explain women’s collusion with patriarchal standards of femininity.

4. Subjectivity, Identity and Resistance

Although the use that Bartky and Bordo make of Foucault’s insights into the operation of normalizing disciplinary power is a corrective to his failure to recognize the gendered nature of disciplinary techniques, some feminists have argued that their work reproduces a problematic dimension of Foucault’s account of modern disciplinary power. Jana Sawicki explains that the problem faced by this kind of feminist appropriation of Foucault is its inability to account for effective resistance to disciplinary practices. Like Foucault, Bartky and Bordo envisage modern disciplinary power as ubiquitous and inescapable. Foucauldian power reduces individuals to docile and subjected bodies and thus seems to deny the possibility of freedom and resistance. According to Sawicki, ‘Bartky and Bordo have portrayed forms of patriarchal power that insinuate themselves within subjects so profoundly that it is difficult to imagine how they (we) might escape. They describe our complicity in patriarchal practices of victimization without providing suggestions about how we might resist it’ (Sawicki 1988: 293).

Feminist critics of Foucault like Nancy Hartsock argue that his failure to develop an adequate notion of resistance is a consequence of his reduction of individuals to effects of power relations. Hartsock echoes a widespread feminist concern that Foucault’s understanding of power reduces individuals to docile bodies, to victims of disciplinary technologies or objects of power rather than subjects with the capacity to resist (Hartsock 1990: 171-2). The problem for Hartsock and others is that without the assumption of a subject or individual that pre-exists its construction by technologies of power, it becomes difficult to explain who resists power? If there are no ready-made individuals with interests that are defined prior to their construction by power, then what is the source of our resistance? Some feminists have responded to these concerns by claiming that, although Foucault rejects the idea that resistance can be grounded in a subject or self who pre-exists its construction by power, he does not deny the possibility of resistance to power. In his later work Foucault explains that his theory of power implies both the possibility and existence of forms of resistance. According to Foucault: ‘there are no relations of power without resistances; the latter are all the more real and effective because they are formed right at the point where relations of power are exercised’ (Foucault 1980: 142). Foucauldian resistance neither predates the power it opposes nor issues from a site external to power. Rather it relies upon and grows out of the situation against which it struggles. Foucault’s understanding of resistance as internal to power refuses the utopian dream of achieving total emancipation from power. In the place of total liberation Foucault envisages more specific, local struggles against forms of subjection aimed at loosening the constraints on possibilities for action. He suggests that a key struggle in the present is against the tendency of normalizing-disciplinary power to tie individuals to their identities in constraining ways. It is, Foucault contends, because disciplinary practices limit the possibilities of what we can be by fixing our identities that the object of resistance must be ‘to refuse what we are’ – that is, to fracture the limitations imposed on us by normalizing identity categories. Foucault’s notion of resistance as consisting, at least in the first instance, in a refusal of fixed, stable or naturalized identity has been met with some suspicion by feminists. Many feminists are reluctant to abandon a commitment ‘to some essential, liberatory subject rooted in “women’s experience” (or nature), as the starting point for emancipatory theory’ (Sawicki 1994: 289). For Hartsock, Foucault’s perspective functions to preclude the possibility of feminist politics which, she claims, is necessarily an identity-based politics grounded in a conception of the identity, needs and interests of women. Some of the most exciting feminist appropriations of Foucault converge around this issue of identity and its role in politics. Judith Butler argues that Foucault’s work provides feminists with the resources to think beyond the strictures of identity politics. According to Butler, feminists should be wary of the idea that politics needs to be based on a fixed idea of women’s nature and interests. She argues that: ‘The premature insistence on a stable subject of feminism, understood as a seamless category of women, inevitably generates multiple refusals to accept the category. These domains of exclusion reveal the coercive and regulatory consequences of that construction, even when the construction has been elaborated for emancipatory purposes. Indeed, the fragmentation within feminism and the paradoxical opposition to feminism from “women” whom feminism claims to represent suggest the necessary limits of identity politics’ (Butler 1990: 4). Butler discerns at least two problems in the attempt to ground politics in an essential, naturalized female identity. She argues that the assertion of the category ‘woman’ as the ground for political action excludes, marginalizes and inevitably misrepresents those who do not recognize themselves within the terms of that identity. For Butler the appeal to identity both overlooks the differences in power and resources between, for example, third world and Western women, and tends to make these differences a source of conflict rather than a source of strength. She claims, moreover, that a feminist identity politics that appeals to a fixed ‘feminist subject,’ ‘presumes, fixes and constrains the very ‘subjects’ that it hopes to represent and liberate’ (Butler 1990: 148). In Foucault’s presentation of identity as an effect Butler sees new possibilities for feminist political practice, possibilities that are precluded by positions that take identity to be fixed or foundational. One of the distinct advantages of Foucault’s understanding of the constituted character of identity is, in Butler’s view, that it enables feminism to politicize the processes through which stereotypical forms of masculine and feminine identity are produced. Butler’s own work represents an attempt to explore these processes for the purposes of loosening the heterosexual restrictions on identity formation. In pursuing this project she argues that Foucault’s characterization of identity as constructed does not mean that it is completely determined or artificial and arbitrary. Rather, a Foucauldian approach to identity production demonstrates the role played by cultural norms in regulating how we embody or perform our gender identities. According to Butler, gender identity is simply ‘a set of repeated acts within a highly rigid regulatory frame that congeal over time to produce the appearance of substance, of a natural sort of being’ (Butler 1990: 33). The regulatory power of the norms that govern our performances of gender is both disguised and strengthened by the assumption that gendered identities are natural and essential. Thus, for Butler, one of the most important feminist aims should be to challenge dominant gender norms by exposing the contingent acts that produce the appearance of an underlying ‘natural’ gender identity. Against the claim that feminist politics is necessarily an identity politics, Butler suggests that: ‘If identities were no longer fixed as the premises of a political syllogism, and politics no longer understood as a set of practices derived from the alleged interests that belong to a set of ready-made subjects, a new configuration of politics would surely emerge from the ruins of the old’ (Butler 1990: 149). Butler envisages this new configuration of politics as an anti-foundational coalition politics that would accept the need to act within the tensions produced by contradiction, fragmentation and diversity. While Butler’s political vision emphasises strategies for resisting and subverting identity, Wendy Brown argues that contemporary feminism should be wary of both identity politics and the ‘politics of resistance’ associated with the work of Foucault and Butler. Brown argues that identity politics entails a commitment to the authenticity of women’s experiences which functions to secure political authority. At the same time, however, most feminists wish to acknowledge that feminine identity and experience are constructed under patriarchal conditions. Brown suggests that this inconsistency in feminist political thought – acknowledging social construction on the one hand and attempting to preserve a realm of authentic experience free from construction on the other – might be explained by the fact that feminists are reluctant to give up the claim to moral authority that the appeal to the truth and innocence of woman’s experience secures. By appealing to the silenced truth of women’s experience, feminists have been able to condemn the repressive effects of patriarchal power. For Brown the attempt to establish moral authority by asserting the hidden truth of women’s experience and identity represents a rejection of politics. She argues that this kind of move in feminism: ‘… betrays a preference for extrapolitical terms and practices: for Truth (unchanging and incontestable) over politics (flux, contest, instability); for certainty and security (safety; immutability, privacy) over freedom (vulnerability, publicity); for discoveries (science) over decisions (judgments); for separable subjects armed with established rights over unwieldy and shifting pluralities adjudicating for themselves and their future on the basis of nothing more than their own habits and arguments’ (Brown 1995: 37). Brown finds a similar failure to meet the challenges confronting contemporary politics in the ‘politics of resistance’ inspired by Foucault. As she sees it, the problem with resistance-as-politics is that it does not ‘contain a critique, a vision, or grounds for organized collective efforts to enact either… [resistance] goes nowhere in particular, has no inherent attachments and hails no particular vision’ (Brown 1995: 49). In light of these inadequacies, Brown calls for the politics of resistance to be supplemented by a political practices aimed at cultivating ‘political spaces for posing and questioning political norms [and] for discussing the nature of “the good” for women’ (Brown 1995: 49). The creation of such democratic spaces for discussion will, Brown argues, contribute to teaching us how to have public conversations with each other and enable us to argue from our diverse perspectives about a vision of the common good (“what I want for us”) rather than from some assumed common identity (“who I am”).

5. Freedom, Power and Politics

The key problems identified by feminist critics as preventing too close a convergence between Foucault’s work and feminism – his reduction of social agents to docile bodies and the lack of normative guidance in his model of power and resistance – are indirectly addressed by Foucault in his late work on ethics. Whereas in his earlier genealogies Foucault emphasized the processes through which individuals were subjected to power, in his later writings he turned his attention to practices of self-constitution or ‘practices of freedom’ which he called ethics.

The idea of practicing freedom is central to Foucault’s exploration and analysis of the ethical practices of Antiquity. It refers to the ways in which individuals in Antiquity were led to exercise power over themselves in the attempt to constitute or transform their identity and behavior in the light of specific goals. What interests Foucault about these ethical practices and ancient ‘arts of existence’ is the kind of freedom they presuppose. He suggests that the freedom entailed in practicing the art of self-fashioning consists neither in resisting power nor in seeking to liberate the self from regulation. Rather, it entails the active and conscious arrogation of the power of regulation by individuals for the purposes of ethical and aesthetic self-transformation. In her reflections on Foucault’s positive account of freedom, Sawicki notes that it offers a more affirmative alternative to his earlier emphasis on the reactive strategy of resistance to normalization (Sawicki 1998: 104). For the late Foucault, individuals are still understood to be shaped by their embeddedness in power relations, which means that their capacities for freedom and autonomous action are necessarily limited. However, he suggests that by actively deploying the techniques and models of self-formation that are ‘proposed, suggested, imposed’ upon them by society (Foucault 1988b: 291), individuals may creatively transform themselves and in the process supplant the normalization operating in pernicious modern technologies of the self (Sawicki 1998: 105). Sawicki sees a link between Foucault’s notion of practices of freedom and Donna Haraway’s call for a cyborg politics that emphasizes the conscious creation of marginalized subjects capable of resisting domination. In a more critical vein, feminists like Jean Grimshaw and McNay argue that Foucault’s promising turn to a more active model of subjectivity still leaves crucial issues unresolved. In Grimshaws formulation, Foucault evades the vital question of ‘when forms of self-discipline or self-surveillance can … be seen as exercises of autonomy or self-creation, or when they should be seen, rather, as forms of discipline to which the self is subjected, and by which autonomy is constrained’ (Grimshaw 1993: 66; McNay 1992: 74). In response to this criticism, Moya Lloyd suggests that it is Foucault’s earlier notion of genealogy as critique which allows us to distinguish between autonomous practices of the self and technologies of normalization. For Lloyd, the Foucauldian practice of critique – a practice which involves the effort to recognize, decipher and problematize the ways in which the self is produced – generates possibilities for alternative practices of the self and, thus, for more autonomous experiments in self-formation. Lloyd explains that ‘it is not the activity of self-fashioning in itself that is crucial. It is the way in which that self-fashioning, when allied to critique, can produce sites of contestation over the meanings and contours of identity, and over the ways in which certain practices are mobilized’ (Lloyd: 1988: 250). With the introduction of a notion of freedom in his late work, Foucault also clarifies the normative grounds for his opposition to certain forms of power. In his discussion of ethics, Foucault suggests that individuals are not limited to reacting against power, but may alter power relationships in ways that expand their possibilities for action. Thus, Foucault’s work on ethics can be linked to his concern to counter domination, that is, forms of power that limit the possibilities for the autonomous development of the self’s capacities. By distinguishing power relations that are mutable, flexible and reversible, from situations of domination in which resistance is foreclosed, Foucault seeks to encourage practices of liberty ‘that will allow us to play … games of power with as little domination as possible’ (Foucault 1988b: 298). Sawicki argues that Foucault’s notion of practices of freedom has the potential to broaden our understanding of what it is to engage in emancipatory politics. In Foucault’s conception of freedom as a practice aimed at minimizing domination, Sawicki discerns an implicit critique of traditional emancipatory politics which tends to conceive of liberty as a state free from every conceivable social constraint. Following Foucault, Sawicki argues that the problem with this notion of emancipation is that it does not go far enough: ‘Reversing power positions without altering relations of power is rarely liberating. Neither is it a sufficient condition of liberation to throw off the yoke of domination’ (Sawicki 1998: 102). If, as Foucault suggests, freedom exists only in being exercised and is, thus, a permanent struggle against what will otherwise be done to and for individuals, it is dangerous to imagine it as a state of being that can be guaranteed by laws and institutions. By insisting that liberation from domination is not enough to guarantee freedom, Foucault points to the importance of establishing new patterns of behaviour, attitudes and cultural forms that work to empower the vulnerable and, in this way, to ensure that mutable relations of power do not congeal into states of domination. Thus, for Sawicki, the value of Foucault’s late work for feminism consists in the conceptual tools that it provides to think beyond traditional emancipatory theories and practices.

6. References and Further Reading

  • Bartky, S., ‘Foucault, femininity and the modernization of patriarchal power’ in I. Diamond & L. Quinby (eds), Feminism and Foucault: Reflections on Resistance, Boston: Northeastern University Press, 1988.
  • Bordo, S., ‘Anorexia Nervosa: Psychopathology as the Crystallization of Culture’ in I. Diamond & L. Quinby (eds) Feminism and Foucault: Reflections on Resistance, Boston: Northeastern University Press, 1988.
  • Brown, W., ‘Postmodern Exposures, Feminist Hesitations’ in States of Injury: power and freedom in late modernity, Princeton, N.J.: Princeton University Press, 1995.
  • Butler, J., Gender Trouble: Feminism and the Subversion of Identity, NY: Routledge, 1990.
  • Butler, J., Bodies that Matter: On the Discursive Limits of “Sex”, NY: Routledge, 1993.
  • Diamond, I. & Quinby, L., (eds.) Feminism and Foucault: Reflections on Resistance, Boston: Northeastern University Press, 1988.
  • Dreyfus, H. and Rabinow, P., Michel Foucault: Beyond Structuralism and Hermeneutics, Sussex: The Harvester Press, 1982.
  • Foucault, M., Discipline and Punish: The Birth of the Prison, trans. A. Sheridan, Harmondsworth: Peregrine, 1977.
  • Foucault, M., The History of Sexuality, translated by R. Hurley, Penguin Books, 1978.
  • Foucault, M., ‘Body/Power’ and ‘Truth and Power’ in C. Gordon (ed.) Michel Foucault: Power/Knowledge, U.K.: Harvester, 1980.
  • Foucault, M., ‘The subject and power’ in H. Dreyfus and P. Rabinow, Michel Foucault: Beyond Structuralism and Hermeneutics, Chicago: Chicago University Press, 1982.
  • Foucault, M., ‘What is Enlightenment?’ in The Foucault Reader, P. Rabinow (ed.) NY: Pantheon, 1984a.
  • Foucault, M., ‘On the genealogy of ethics: an overview of work in progress’ in The Foucault Reader, P. Rabinow (ed.) NY: Pantheon, 1984b.
  • Foucault, M., Politics, Philosophy, Culture: Interviews and Other Writings, 1977-1984, L. Kritzman (ed.), London: Routledge, 1988a.
  • Foucault, M., ‘The ethic of care for the self as a practice of freedom’ in J. Bernhauer and D. Rasmussen (eds), The Final Foucault, Cambridge: Mass.: MIT Press, 1988b.
  • Fraser, N., Unruly Practices: power, discourse and gender in contemporary social theory, Cambridge: Polity Press, 1989.
  • Grimshaw, J., ‘Practices of Freedom’ in Up Against Foucault, C. Ramazanoglu (ed.), London and NY: Routledge, 1993.
  • Grosz, E., Volatile Bodies: Toward a Corporeal Feminism, Bloomington: Indiana University Press, 1994.
  • Gutting, G., (ed.) The Cambridge Companion to Foucault, Cambridge: Cambridge University Press, 1994.
  • Hacking, I., ‘The Archaeology of Knowledge’ in D. Couzens Hoy (ed.), Foucault: a critical reader, NY: Basil Blackwell, 1986.
  • Hartsock, N., ‘Foucault on power: a theory for women?’ in L. Nicholson (ed.), Feminism/Postmodernism, London & NY: Routledge, 1990.
  • Hekman, S. (ed.) Feminist Interpretations of Michel Foucault, Pennsylvania: Pennsylvania University Press, 1996.
  • Lloyd, M., ‘A Feminist Mapping of Foucauldian Politics’ in Feminism and Foucault: Reflections on Resistance, I. Diamond & L. Quinby (eds), Boston: Northeastern University Press, 1988.
  • McNay, L., Foucault: a critical introduction, Cambridge: Polity Press, 1994.
  • McNay, L., Foucault and Feminism: Power, Gender and the Self, Polity Press, 1992.
  • Ramazanoglu, C., Up Against Foucault: Explorations of Some Tensions Between Foucault and Feminism, London & NY: Routledge, 1993.
  • Rouse, J., ‘Power/Knowledge’ in Gary Gutting (ed) The Cambridge Companion to Foucault, Cambridge: Cambridge University Press, 1994.
  • Sawicki, J., ‘Feminism and the Power of Discourse’ in J. Arac (ed.) After Foucault: Humanistic Knowledge, Postmodern Challenges, New Brunswick and London: Rutgers University Press, 1988, pp. 161-178.
  • Sawicki, J., ‘Foucault, feminism, and questions of identity’ in ed. G. Gutting, The Cambridge Companion to Foucault, Cambridge: Cambridge University Press, 1994.
  • Sawicki, J., ‘Feminism, Foucault and “Subjects” of Power and Freedom’ in The Later Foucault: politics and philosophy, J. Moss (ed.), London; Thousand Oaks: Sage Publications, 1998.
  • Soper, K., ‘Productive contradictions’, Up Against Foucault: Explorations of Some Tensions Between Foucault and Feminism, London & NY: Routledge, 1993.

Author Information

Aurelia Armstrong
Email: a.armstrong@uq.edu.au
University of Queensland
Australia

Nikolai Fedorovich Fedorov (1829—1903)

FedorovFedorov’s thoughts have been variously described as bold, culminating, curious, easily-misunderstood, extreme, hazy, idealist, naive, of-value, scientifico-magical, special, unexpected, unique, and utopian. Many of the small number of philosophers familiar with Fedorov admit his originality, his independence, his human concern, perhaps even his logic — up to a point. But his resurrection project is viewed with understandable skepticism and often dismissed as an impossible fantasy. Interestingly, the harshest criticism has come from Christian thinkers such as Florovsky and Ustryalov whose objections bear religious overtones; some materialists such as Muravyov and Setnitsky have been quite benign and favorable by comparison. Perhaps all would agree, however, on Fedorov’s single-mindedness. Looked at positively, this is simply another term for purity-of-heart, a quality of saintliness. With his strong emphasis on kinship and brotherhood demanding, ultimately, a world in which all must mutually benefit, Fedorov perhaps anticipates Rawls who says: “Thus what we are doing is to combine into one conception the totality of conditions that we are ready upon due reflection to recognize as reasonable in our conduct with regard to one another. … all persons … even … persons who are not contemporaries but who belong to many generations. Thus to see our place in society from the perspective of this position is … to regard the human situation not only from all social but also from all temporal points of view. The perspective of eternity is not a perspective from a certain place beyond the world, nor the point of view of a transcendent being; rather it is a certain form of thought and feeling that rational persons can adopt within the world. … Purity of heart, if one could attain it, would be to see clearly and to act with grace and self-command from this point of view.” Fedorov wrote: “By refusing to grant ourselves the right to set ourselves apart … we are kept from setting any goal for ourselves that is not the common task of all.” But Fedorov’s thought soars beyond the present world to a world of its own, in his insistence that we can become immortal and godlike through rational efforts, and that our moral obligation is to create a heaven to be shared by all who ever lived.

Table of Contents

  1. Life
  2. Philosophy
  3. Further Reading

1. Life

Russian philosopher, teacher, and librarian Nikolai Fedorovich Fedorov was born June 9, 1829, and died December 28, 1903. He was founder of an immortalist (anti-death) philosophy emphasizing “the common task” of resurrecting the dead through scientific means. Since the end of the Cold War, his thought has received renewed interest and advocacy in Russia and elsewhere — for example, in connection with cryonics (cryonic hibernation) and prolongevity. Nikolai Fedorovich Fedorov (alternative romanized spellings are possible — for example: Nicholas Fyodorovich Fyodorov) advocated the ethical priority of a research and development project he called “the common task,” by which he meant the universal physical resurrection of the dead by future advances in science and technology. He was highly praised by such people as Fyodor Dostoevsky and Leo Tolstoy (literature), Afanasi Fet (poetry), and Konstantin Tsiolkowsky (astronautics), yet he is not well known in the West, despite some limited interest. The illegitimate son of Prince Pavel Ivanovich Gagarin and Elisaveta Ivanova, a woman of lower-class nobility, Nikolai (with his mother and her other children) had to leave his father’s home at age four, due to the prince’s death. The family continued to be well cared for, however. Beginning in 1868, he worked for 25 years as a librarian with the Rumiantsev Museum (now the Russian State Library), Moscow; during this period, he was teacher-mentor of the young Konstantin Tsiolkowsky. After retiring, and until his death, he worked in the Archives of the Ministry of Foreign Affairs. His works, published posthumously, were available (in accordance with the Christian spirit of Fedorov’s philosophy) only free of charge from the publisher, who renounced all rights.

2. Philosophy

Due to his Christian perspective, Fedorov found the widespread lack of love among people appalling. He divided these non-loving relations into two kinds. One is alienation among people: “non-kindred relations of people among themselves.” The other is isolation of the living from the dead: “nature’s non-kindred relation to men.” “[O]ne should live not for oneself nor for others but with all and for all” (Filosofiya Obshchago Dela vol. I, 118, n. 5, as quoted in Zakydalsky, 55). Fedorov is referring to all people of all time (past, present, future). He is speaking of a project to unite humankind, the colonization (“spiritualization”) of the universe, the quest for the Kingdom of God, the creation of cosmos from chaos, the death of death, even resurrection of the dead. Fedorov believed, and passionately felt, that resignation in the face of death and separation of knowledge from action was false Christianity. He cautioned against being fooled into worshipping the blind forces of Satan. Rather, one should actively participate in changing what is into what ought to be.

The division between the learned and the unlearned was, in Fedorov’s view, worse than the separation of the rich and the poor. The unlearned are more concerned with work than thought. The learned (philosophers and scientists) are less concerned with work than thought. The learned seem unaware that ideas “are not subjective, nor are they objective; they are projective.” Philosophers and scientists, because they have separated ideas from moral action, are simply slaves to the imperfect present order. It is a root dogma of the learned that paradise is not possible. The unlearned should demand that the learned (because only they have the necessary knowledge) become a temporary task force for the Kingdom of God. The learned, however, will attempt to persuade us that problems like crop failures, disease, and death are not general questions but matters for a narrow discipline, questions for only a very small (or nonexistent) minority of the learned. Separation of the learned from the masses turns them into a seemingly permanent class, producing non-lovers of humankind. The “transformation of the blind course of nature into one that is rational … is bound to appear to the learned as a disruption of order, although this order of theirs brings only disorder among men, striking them down with famine, plague, and death.”

A citizen, a comrade, or a team-member can be replaced by another. However a person loved, one’s kin, is irreplaceable. Moreover, memory of one’s dead kin is not the same as the real person. Pride in one’s forefathers is a vice, a form of egotism. On the other hand, love of one’s forefathers means sadness in their death, requiring the literal raising of the dead. Politics must be replaced by physics. The politics of egoism and altruism must be replaced by Christianity which “knows only all men.” Pride is a Tower of Babel that separates us from one another. Love is a “fusion as opposed to a confusion.” For Fedorov, “complete and universal salvation” is preferable to “incomplete or non-universal salvation in which some men — the sinners — are condemned to eternal torments and others — the righteous — to an eternal contemplation of these torments.” That is to say, Fedorov’s bold science project, “the common task,” is not the only possible route to salvation. “Salvation may also occur without the participation of men … if they do not unite in the common task”; “if we do not unite to accomplish our salvation, if we do not accept the Gospel message,” then a “purely transcendent resurrection will save only the elect; for the rest it will be an expression of God’s wrath,” “eternal punishment.” “I believe this literally.” “Christianity has not fully saved the world, because it has not been fully assimilated.” Christianity “is not simply a doctrine of redemption, but the very task of redemption.”

Fedorov’s thoughts have been variously described as bold, culminating, curious, easily-misunderstood, extreme, hazy, idealist, naive, of-value, scientifico-magical, special, unexpected, unique, and utopian. Many of the small number of philosophers familiar with Fedorov admit his originality, his independence, his human concern, perhaps even his logic — up to a point. But his resurrection project is viewed with understandable skepticism and often dismissed as an impossible fantasy. Interestingly, the harshest criticism has come from Christian thinkers such as Florovsky and Ustryalov whose objections bear religious overtones; some materialists such as Muravyov and Setnitsky have been quite benign and favorable by comparison. Perhaps all would agree, however, on Fedorov’s single-mindedness. Looked at positively, this is simply another term for purity-of-heart, a quality of saintliness. With his strong emphasis on kinship and brotherhood demanding, ultimately, a world in which all must mutually benefit, Fedorov perhaps anticipates Rawls who says: “Thus what we are doing is to combine into one conception the totality of conditions that we are ready upon due reflection to recognize as reasonable in our conduct with regard to one another. … all persons … even … persons who are not contemporaries but who belong to many generations. Thus to see our place in society from the perspective of this position is … to regard the human situation not only from all social but also from all temporal points of view. The perspective of eternity is not a perspective from a certain place beyond the world, nor the point of view of a transcendent being; rather it is a certain form of thought and feeling that rational persons can adopt within the world. … Purity of heart, if one could attain it, would be to see clearly and to act with grace and self-command from this point of view.” Fedorov wrote: “By refusing to grant ourselves the right to set ourselves apart … we are kept from setting any goal for ourselves that is not the common task of all.” But Fedorov’s thought soars beyond the present world to a world of its own, in his insistence that we can become immortal and godlike through rational efforts, and that our moral obligation is to create a heaven to be shared by all who ever lived. “[D]eath is merely the result or manifestation of our infantilism, lack of independence and self-reliance, and of our incapacity for mutual support and the restoration of life. People are still minors, half-beings, whereas the fullness of personal existence, personal perfection, is possible. However, it is possible only within general perfection. Coming of age will bring perfect health and immortality, but for the living [living contemporaries of Fedorov] immortality is impossible without the resurrection of the dead”(What Was Man Created For?, 76).

3. Further Reading

(Collected Works in Russian)

  • Fedorov, N. F. Filosofiya Obshchago Dela: Stat’i, Mysli, i Pis’ma Nikolaia Fedorovicha Fedorova, ed. V. A. Kozhevnikov and N. P. Peterson, 2 vols. originally published by Fedorov’s friends and followers after his death, 1906, 1913; reprint London: Gregg Press, 1970.
  • Fedorov, N. F. Sobranie Sochineniy, 4 vols. + supp. Moscow: Traditsiya, 2000.

(Works in English)

  • Berdyaev, N. A. “N. F. Fyodorov.” The Russian Review 9 (1950) 124-130.
    • Fedorov’s thought was not without influence on Berdyaev’s existentialism.
  • Berdyaev, N. A. The Russian Idea. New York: Macmillan Co., 1948.
    • Fedorov and other original Russian thinkers are discussed.
  • Fedorov, N. F. “The Question of Brotherhood or Kinship, of the Reasons for the Unbrotherly, Unkindred, or Unpeaceful State of the World, and of the Means for the Restoration of Kinship” in Edie, J. M.; Scanlan, J. P.; Zeldin, M.; and Kline, G. L., eds. Russian Philosophy. Chicago: Quadrangle Books, 1965. 16-54.
    • This is one place to begin if you want to read Fedorov directly (in English translation).
  • Fedorov, N. F. What Was Man Created For? The Philosophy of the Common Task: Selected Works. Koutiassov, E.; and Minto, M., eds. Lausanne, Switzerland: Honeyglen/L’Age d’Homme, 1990.
    • A good source of Fedorov in English translation; includes a list of Russian language works in the bibliography.
  • Lossky, N. O. History of Russian Philosophy. New York: International Universities Press, 1951.
    • Fedorov is included in this history.
  • Lukashevich, S. N. F. Fedorov (1828-1903): A Study in Russian Eupsychian and Utopian Thought. Newark: University of Delaware Press, 1977.
    • The methodology used in this study may not insure full appreciation of Fedorov’s thought, but it does demonstrate that his thought was indeed a detailed, coherent philosophy in which the various pieces fit together.
  • Schmemann, A., ed. Ultimate Questions: An Anthology of Modern Russian Religious Thought. New York: Holt, Rinehart and Winston, 1965; reprint Crestwood, NY: St. Vladimir’s Seminary Press, 1977.
    • Selections (translations) from Russian religious thinkers, including Fedorov, concerned with eschatology or other “ultimate” questions. The Fedorov material is from vol. 1 of Filosofiya Obshchago Dela and deals with “the restoration of kinship among mankind.”
  • Soloviov, M. “The ‘Russian Trace’ in the History of Cryonics,” Cryonics 16:4 (4th Quarter, 1995) 20-23.
    • Closing paragraph describes author’s then-current (post-cold-war) and perhaps unprecedented efforts promoting cryonics and immortalism in the former Soviet Union; the article itself acknowledges a debt to Fedorov.
  • Young, G. M. Nikolai F. Fedorov: An Introduction. Belmont, Mass.: Nordland Publishing Co., 1979.
    • Not only an excellent introduction, but a mine of references and information inviting further Fedorovian research, including Russian language works, many of which are not yet translated (or not fully translated) into English.
  • Zakydalsky, T. D. N. F. Fyodorov’s Philosophy of Physical Resurrection. Ann Arbor, Mich.: UMI, 1976.
    • A Ph.D. dissertation (Bryn Mawr) of 531 pages. Bibliography has a list of Russian language works.
  • Zenkovsky, V. V. A History of Russian Philosophy. New York: Columbia University Press, 1953.
    • Fedorov is included in this history.

Author Information

Charles Tandy
Email: cetandy@gmail.com
Ria University
U. S. A.

R. Michael Perry
Email: mike@alcor.org
U. S. A.

Ignacio Ellacuría (1930—1989)

EllacuriaIgnacio Ellacuría, a naturalized citizen of El Salvador, was born in Spain in 1930. He joined the Jesuits in 1947 and was quickly sent to El Salvador, where he lived and worked for the next forty-two years, except for periods when he was pursuing his education in Ecuador, Spain, and West Germany. He developed an important and novel contribution to Latin American Liberation Philosophy. The body of thought known as Liberation Philosophy developed in Latin America in the second half of the Twentieth Century. It grew out of the works of philosophers working in Peru (A. Salazar Bondy) and Mexico (Leopoldo Zea), and quickly spread throughout Latin America. It resulted from efforts by these philosophers to create a Latin American philosophy by looking at how the discipline could help to make sense of Latin American reality. That reality, as distinct from the European (and later North American) context in which the modern Western philosophical tradition developed, is one of dependence on economic and political (and to some extent cultural) factors that are beyond one’s control. In thematizing dependency, Latin American philosophy developed a liberation philosophy that focused on the social and personal imperative to overcome dependency as the path toward the fullness of one’s humanity, given the conditions of dependency. There are at least five different schools within Latin American liberation philosophy (see Cerutti in the Bibliography below), but all are grounded in the attempt to use philosophy to understand the Latin American reality of dependency and the need to overcome it.

Table of Contents

  1. Life
  2. Ellacuría’s Philosophy of Liberation
  3. References and Further Reading

1. Life

Ellacuría’s initial training in philosophy was in the Neo-Scholasticism required at that time of all Jesuits. Later he studied Ortega, Bergson, Heidegger, phenomenology, and the existentialists. All of these influenced him, but the key influences in the make up of his mature philosophical thought were Hegel, Marx, and the Basque philosopher Xavier Zubiri (1898-1983). Ellacuría worked on his doctorate under Zubiri from 1962 to 1965, writing a dissertation that reached some 1100 pages on the concept of essence in Zubiri’s thought. He also studied theology with the great Heideggerian Jesuit, Karl Rahner, and had finished all the requirements for a second PhD, but did not write the dissertation (which he was also going to write under Zubiri). For the next 18 years, until Zubiri’s death in 1983, they were close collaborators, with Ellacuría returning to Spain from El Salvador for a few months each year to facilitate their work. The two worked together on most of Zubiri’s texts and talks, eventually reaching the point where Zubiri would not publish something, or even present a lecture, without first showing the material to Ellacuría.

Zubiri is a major figure in 20th century Spanish philosophy and has had a lot of influence in Latin America, largely through the efforts of Ellacuría, but his work is not well known in the countries more traditionally associated with Continental Philosophy (France and Germany) or in the Anglo-American tradition. By the age of 23, Zubiri had finished both a PhD in theology at the Gregorian University and a PhD in philosophy at the University of Madrid. At 28 he was named to the prestigious chair in the history of philosophy at the University of Madrid, and for the next few years he traveled widely in Europe to study with experts in many different fields: philosophy with Husserl and Heidegger, physics with Schrödinger and De Broglie, as well as biology and mathematics with luminaries of the day. Zubiri also taught in Paris at the Institut Catholique and at the University of Barcelona, but in 1942 he left formal academia and for the rest of his life conducted seminars on his own.

From among a large number of very important publications, his two most important are On Essence (1963) and the three-volume work, Sentient Intelligence (1980-83). Ellacuría, who knew all of Zubiri’s work, was particularly familiar with these two works: his doctoral dissertation was on the former, and he worked very closely with Zubiri to bring the latter to publication before Zubiri’s death.

Zubiri created a systematic philosophy grounded in a re-configuring and overcoming of the distinction between epistemology and metaphysics, between the knower and the known (for more, see the section below on Ellacuría’s philosophy). There are now various interpretations of Zubiri’s work (among others, phenomenological, Nietzschean, praxical) with Ellacuría heading up the historical/metaphysical interpretation. Although there is no agreement among Zubirian scholars as to which among these is the better interpretation, the fact that Zubiri adopted Ellacuría as his closest collaborator for the last 20 years of his life has to lend some weight to Ellacuría’s interpretation

Ellacuría was murdered in 1989 – along with five other Jesuits with whom he lived, their housekeeper and her daughter – at the hands of an elite, US-trained squadron of the Salvadoran army. The murders came towards the end of El Salvador’s long civil war (1980-1992) between a right-wing government and leftist guerillas. At the time of his death, Ellacuría was president of the country’s prestigious Jesuit university, the University of Central America (UCA), as well as chair of its philosophy department and editor of many of its scholarly publications. In his quarter century with the UCA, the last ten years as its president, he had played a principle role in molding it into a university whose full institutional power – that is, through its research, teaching and publications – was directed towards uncovering the causes of poverty and oppression in El Salvador. In addition, he spoke out frequently on these topics as a regular contributor to the country’s newspapers, radio and television programs. He also addressed these topics frequently in his scholarly publications on philosophy and theology. These were the reasons behind his murder.

During his lifetime Ellacuría was known, primarily, as one of the principle contributors to Latin American liberation theology. However, he also spent the last two decades of his life elaborating a liberation philosophy. The latter work was left, at the time of his murder, unfinished, unpublished, and scattered across many different writings. In the years since his death, a number of scholars have pieced together his philosophical thought, and it is now possible to argue that Ellacuría had a well-developed philosophy that represents an important contribution to Latin American liberation philosophy.

2. Ellacuría’s Philosophy of Liberation

Ellacuría argued that philosophy, in order to remain true to itself, must be a philosophy of liberation. He begins with the assertion that it is the responsibility of philosophy to help us in figuring out what reality is and in situating ourselves within reality. For Ellacuría, human reality is historical and social: the range of possibilities in which the freedom of any given individual’s life must be exercised is the result of both past human actions and the society in which the individual lives. Human actions accrete as history, and within this reality individuals and societies are able to realize some of the possibilities handed over by the past, in the process creating new possibilities to hand over to future generations. There is progress in reality, from the physical to the biological to the praxical, each of these representing a further unfolding of an ever more complex reality. In the realm of praxis (his word for human action to change reality), human beings act to realize a wider range of possibility: praxis seeks to realize a fuller praxis. Thus, praxis realizes a gradual increase in liberty: praxis gradually liberates liberty.

Human beings, as praxical beings, are responsible for the further unfolding of reality, i.e., for the realization of a reality in which all praxical beings can fully realize themselves as such. Ellacuría argues that the vantage point from which one can see most clearly what reality unfolding as history has and has not delivered, is the perspective of the marginalized. Thus, the philosophy of history must make a preferential option for the marginalized, i.e., it must be a philosophy of liberation.

Ellacuría’s liberation philosophy begins with a critique grounded in a Zubirian metaphysics that is radically critical of all forms of idealism, including most of what has passed for realism in the history of Western philosophy. This critique argues that the Western tradition made a fundamental error, from Parmenides on, in separating sensation and the intellect, an error which distorted all subsequent philosophy. This error resulted in the “logification of intelligence” and the “entification of reality.” By the former, Zubiri means that the full powers of the intellect have been reduced to a predicative logos, i.e., a logos whose function is to determine what things are, in themselves and in relation to other things. Zubiri argues that while this is a vital part of intelligence, it is not the only part and not the most fundamental part, but Western philosophy reduced intelligence to this predicative logos. In doing so, the object of logos, i.e., the being of entities, became the sum total of reality: reality became entified. These two distortions (the logification of intelligence and entification of reality) can only be overcome by the recognition that sensation and intellection are not separate, that they are two aspects of a single faculty. Zubiri called this faculty the sentient intellect. By this term he meant that, for human beings, the intellect is always sentient and sensation is always intelligent. The two faculties of sensation and intelligence are, for human beings, one and the same faculty. This new, human faculty, the “sentient intellect,” is Zubiri’s candidate for the specific difference of human beings as a species: a new type of sensation that is essentially different from the sense faculty of other animals, different by the addition of intelligence.

In what way is human sensation essentially different than the sensation of other animals? For Zubiri, part of every human sensation, but absent in animal sensation, is the awareness that the object sensed is real, i.e., that it is has the property of being something in and of itself, independent from me, that it is not a willful extension of me. This recognition of the real as real is the fundamental act of the intelligence; it is the intellectual act that is part and parcel, structurally, inextricably, of every act of human sensation. Thus, through the unitary faculty of the sentient intellect we apprehend reality as real. The consequence of this is that we are always already installed in reality. There is no question about how the mind reaches what is real, no need to build a bridge between the mind and reality.

The intellect, like the rest of the body, evolved as a response to challenges posed by the environment. Animals respond to stimuli while humans are confronted with possible realities. Animals are faced with a predetermined cast of responses to a given stimuli. But human beings in any given situation have an open spectrum of options from among which we must choose. We are, in effect, faced with the possibilities of many different realities, and our choices contribute to the determination of reality as it is realized; thus the name that Zubiri gives to human beings: the “reality animal.” The openness of the options facing us is the structural basis of our freedom. Freedom is not something mysterious but a result of the evolutionary pressures that lead to the emergence of a sentient intelligence. The evolutionary niche occupied by human beings is one in which the cast of responses to a stimulus grew to the point where there was no longer anything automatic about which possible response would be enacted. Our niche is the one where the huge number of possible responses opened up different potential realities, allowing us more fully to exploit reality’s possibilities. In other words, our niche is precisely the freedom to choose from among the huge number of possible responses, i.e., from among the huge number of possible realities. To manage this operation of choosing, animal sensation evolved into the sentient intellect.

So, according to Zubirian metaphysics, human beings are always already installed in reality as the part of reality whose actions determine future reality: humans are the part of reality that now unfolds further reality. In previous eras, the unfolding of reality took place by physical and biological forces, but now it is human forces (praxis) that unfolds reality. This is not to say that physical and biological forces are no longer present. They are present, and continue to form the foundation of praxis, but praxis outstrips them. An authentic praxis, however, must recognize its foundation in biology and physics – that is why the physical and biological needs of human beings must be met in order for the fullness of human praxis to be realizable. Thus, an authentic praxis must strive for a reality in which the physical and biological needs of all humans are met.

Ellacuría concludes from all of this that the primary question facing human beings – metaphysical and ethical at once – is: given that we are always already in reality, what is the proper way to engage it? Ellacuría characterizes Zubiri’s intellectual motto as “to come as close as possible, intellectually, to the reality of things.” Western philosophy “had not found an adequate way to shoulder responsibility for reality [hacerse cargo de la realidad].” The search for the right way to engage reality was the motivation for Ellacuría’s work. For Ellacuría, humans are now shouldered with responsibility for reality in the sense of being charged with the task of figuring out what is the proper way of exercising the fundamental freedom opened up by the advent, within evolution, of the sentient intellect. In this sense, human beings are the responsible part of reality, i.e., the part of reality whose task it is to figure out how to respond to reality thereby creating a new reality unfolded out of the previous reality. In order for humans to properly exercise this responsibility, we must discern the direction in which reality needs to be taken.

The sentient intellect evolved to enable us to act more effectively in insuring our own survival. This is not selfish, as it may at first sound, given the element of responsibility that comes along with the sentient intellect. As the reality animal, our actions decide between various possible future realities. Thus, as the responsible part of reality, we are now charged with assisting in the further realization of reality. Ellacuría gives the special name of “praxis” to this action that determines reality.

If we look at the development of reality, we can discern a progression from matter, to life, to human life. This progression has been under the control of, first, physical forces, then biological forces, and now, with the evolution of the being with sentient intelligence, the progressive unfolding of reality is subject to the force of praxis. Thus there is a gradual liberation of more developed forces. Subsequent forces do not erase the earlier ones, but rather subsume them dialectically. Thus, human praxis cannot ignore the physical and biological needs of reality: these are the imperatives that must be satisfied on the way to the full realization of praxis itself. Reality has delivered, liberated, successively more developed forces, each layered over the previous: the biological on top of the physical, and the praxical on top of the biological. The direction of this process can be seen: praxis is the most advanced force reality has developed, and praxis must now take its place as the force that most drives the further unfolding of reality (just as physical and biological forces had, successively, taken that place previously). Since the essence of praxis is freedom, human beings must now exercise our freedom such that we further the proper development of reality. To remain true to our essence, and true to the essence of reality, we must act so as to further the development, the spread, of praxis. Thus, the direction of this process of liberation is the liberation of liberty itself, a process for which the reality animal, the praxical being, is responsible. Thus the full realization of reality entails this: praxical beings acting to bring about the realization of the reality in which all praxical beings (that is, all human beings) can realize the fullness of their praxical essence. In other words, physical and biological forces brought about human beings; but the nature of human beings is such that we are now responsible for the further and fuller realization of reality, which realization is precisely the liberation of all human beings such that they can realize the fullness of their essence. Thus Ellacuría is able to argue that the metaphysics of reality demands a liberatory praxis from us: liberation, because of the essence of human beings and the nature of reality, is a metaphysical imperative.

We can begin to see the prescriptions that emerge from the foregoing analysis. Ellacuría’s liberation philosophy allows him to argue that the essence of being human demands that society be structured in such a way as to meet the physical and biological needs of human beings at an adequate level, i.e., a level that frees us to pursue our essence as praxical beings. Further, his analysis suggests that it is the duty of those of us who enjoy a wider exercise of freedom to dedicate our talents and efforts towards the construction of such a society: our essence as the leading edge of reality that is now responsible for the further unfolding of reality demands that we assist in the establishment of a reality in which praxis is more fully realized, i.e., a reality in which more people (ultimately, all people) are freed from basic wants (inflicted on them by poverty) so that they can exercise their praxis. In other words, the full self-realization of the privileged lies in their enlisting themselves in the struggles of the oppressed. This does not mean that the privileged have to become oppressed. Rather, it means that they should use the education and power delivered to them by their socially and historically conditioned privilege to further the struggles of the oppressed. Note that this is not paternalistic. The struggles of the oppressed represent the leading edge of reality’s further development. The endeavors of the privileged apart from these struggles represent dead-end dilly-dallying (no matter how important they seem to those engaged in them) that does not further the humanization of reality and, thus, will not become an enduring part of human history. Far from paternalism, what saves the privileged from the meaningless pursuits with which they are wont to fill their time, and thus from a meaningless life, is the decision to lend their efforts to further the cause of the oppressed.

Thus, with Zubirian realism and in creative dialogue with Marx, Ellacuría undertook, from the perspective of the poor of the Third World, the project of forging a philosophy that recognized the material nature of being human – and thus the need to take into account the structures of poverty and oppression – while holding open the possibility of a transcendent realm, a realm one and the same with the material realm (actually part of the material realm) in which can exist human freedom and perhaps even God. Ellacuría was constructing a liberation philosophy in the service of the concrete needs of the Latin American people and of the Third World in general. It is a project in the service of which Ellacuría took great strides, but which remained unfinished at his death.

3. References and Further Reading

There still remain a number of unpublished pieces that are important to Ellacuría’s liberation philosophy. These consist primarily of extensive notes he took for the courses he taught at the UCA. These, and all of Ellacuría’s published and unpublished writings, are located in the Ignacio Ellacuría Archives at the Universidad Centroamericana (UCA) in San Salvador, El Salvador.

  • Burke, Kevin (2000). The Ground Beneath the Cross: The Theology of Ignacio Ellacuría, Washington, DC: Georgetown University Press.
    • In English, this book contains good chapters (chs. 2-4) on the philosophical foundation of Ellacuría’s theological thought.
  • Cerutti, Horacio (1992). La Filosofia de la Liberación Latinoamericana, Mexico City: FCE.
    • The best overview of Latin American liberation philosophy, though the book was written before Ellacuría’s contributions to the topic were widely known. Thus, Cerutti charts four main currents of Latin American liberation philosophy. Ellacuría’s contributions represent a fifth current.
  • Ellacuría, Ignacio (2000-2002). Escritos Teológicos [ET], four volumes, San Salvador: UCA Editores.
    • Some philosophically important pieces are also collected here.
  • Ellacuría, Ignacio (1996-2001). Escritos Filosóficos [EF], three volumes, San Salvador: UCA Editores.
    • His scores of important philosophical essays have been collected here.
  • Ellacuría, Ignacio (1999). Escritos Universitarios [EU], San Salvador: UCA Editores.
    • Some philosophically important pieces are also collected here.
  • Ellacuría, Ignacio (1993). Veinte Años de Historia en El Salvador: Escritos Políticos [VA], three volumes, second edition, San Salvador: UCA Editores.
    • Some philosophically important pieces are also collected here.
  • Ellacuría, Ignacio (1990). Filosofía de la Realidad Histórica, San Salvador: UCA Editores.
    • Ellacuría’s main philosophical work. This 600-page book was written and revised a couple of times in the early 1970s. It was never finished (there are indications in his notes that he intended to write more chapters) but it is fairly polished and the best indication of the scope and force of his argument for liberation philosophy.
  • Hassett, John & Hugh Lacey, eds. (1991). Towards a Society that Serves Its People: The Intellectual Contribution of El Salvador’s Murdered Jesuits [TSSP], Washington, DC: Georgetown University Press.
    • English translations of eight of his essays (philosophical, theological and political).
  • Samour, Héctor (2002). Voluntad de Liberación: El Pensamiento Filosófico de Ignacio Ellacuría, San Salvador: UCA Editores.
    • The most thorough presentation of Ellacuría’s philosophical thought. Samour is the scholar who has done the most to pull together, from the thousands of pages of unpublished and published material, Ellacuría’s liberation philosophy and this comprehensive book is the result of his labors.
  • Whitfield, Teresa (1995). Paying the Price: Ignacio Ellacuría and the Murdered Jesuits of El Salvador, Philadelphia: Temple University Press.
    • The best intellectual biography on Ellacuría.

From among all of the collected essays, the most important for understanding Ellacuría’s liberation philosophy are the following:

  • “Filosofía y Política” [1972], VA-1, pp. 47-62.
  • “Liberación: Misión y Carisma de la Iglesia” [1973], ET-2, pp. 553-584.
  • “Diez Años Después: ¿Es Posible una Universidad Distinta?” [1975], EU, pp. 49-92 (an English translation is available in TSSP, pp. 177-207).
  • “Hacia una Fundamentación del Método Teológico Latinoamericana” [1975], ET-1, pp. 187-218.
  • “Filosofía, ¿Para Qué?” [1976], EF-3, pp. 115-132.
  • “Fundamentación Biológica de la Ética” [1979], EF-3, pp, 251-269.
  • “Universidad y Política” [1980], VA-1, pp. 17-46.
  • “El Objeto de la Filosofía” [1981], VA-1, pp. 63-92.
  • “Función Liberadora de la Filosofía” [1985], VA-1, pp. 93-122.
  • “La Superación del Reduccionismo Idealista en Zubiri” [1988], EF-3, pp. 403-430.
  • “El Desafío de las Mayorías Populares” (1989), EU, pp. 297-306 (an English translation is available in TSSP, pp. 171-176).
  • “En Torno al Concepto y a la Idea de Liberación” [1989], ET-1, pp. 629-657.
  • “Utopía y Profetismo en América Latina” [1989], ET-2, pp. 233-294 (an English translation is available in TSSP, pp. 44-88).

Author Information

David I. Gandolfo
Email: david.gandolfo@furman.edu
Furman University
U. S. A.

Embodied Cognition

Embodied Cognition is a growing research program in cognitive science that emphasizes the formative role the environment plays in the development of cognitive processes. The general theory contends that cognitive processes develop when a tightly coupled system emerges from real-time, goal-directed interactions between organisms and their environment; the nature of these interactions influences the formation and further specifies the nature of the developing cognitive capacities. Since embodied accounts of cognition have been formulated in a variety of different ways in each of the sub-fields comprising cognitive science (that is, developmental psychology, artificial life/robotics, linguistics, and philosophy of mind), a rich interdisciplinary research program continues to emerge. Yet, all of these different conceptions do maintain that one necessary condition for cognition is embodiment, where the basic notion of embodiment is broadly understood as the unique way an organism’s sensorimotor capacities enable it to successfully interact with its environmental niche. In addition, all of the different formulations of the general embodied cognition thesis share a common goal of developing cognitive explanations that capture the manner in which mind, body, and world mutually interact and influence one another to promote an organism’s adaptive success.

Table of Contents

  1. Motivation for the Movement
  2. General Characteristics of Embodied Cognition
    1. Primacy of Goal-Directed Actions Occurring In Real-Time
      1. Developmental Psychology
      2. Robotics/Artificial Life
    2. Form of Embodiment Constrains Kinds of Cognitive Processes
    3. Cognition is Constructive
  3. Embodied Cognition vs. Classicism/Cognitivism
  4. Philosophical Implications of the Embodied Cognition Research Program
    1. The Compatibilist Approach
    2. The Purist Approach
  5. References and Further Reading

1. Motivation for the Movement

Although ideas applied in the embodied cognition research program can be traced back to the seminal works of Heidegger, Piaget, Vygotsky, Merleau-Ponty, and Dewey, the current thesis can be seen as a direct response and, in some cases, a proposed alternative to the cognitivist/classicist view of the mind, which conceptualizes cognitive functions in terms of a computer metaphor. The cognitivist/classicist research program can be defined as a rule-based, information-processing model of cognition that 1) characterizes problem-solving in terms of inputs and outputs, 2) assumes the existence of symbolic, encoded representations which enable the system to devise a solution by means of computation, and 3) maintains that cognition can be understood by focusing primarily on an organism’s internal cognitive processes (that is, specifically those involving computation and representation). Although this research program is still prevalent, a number of problems have been raised about its viability, including the symbol-grounding problem (Searle 1980, Harnad 1990), the frame problem, the common-sense problem (Horgan and Tienson 1989), and the rule-described/expertise problem (Dreyfus 1992).

Embodied cognition theorists view cognitivist/classicist accounts as problematic for many reasons, but they are especially concerned that these accounts result in an isolationist assumption that attempts to understand cognition by focusing almost exclusively on an organism’s internal cognitive processes. Specifically, the concern is that if an isolationist assumption rests at the heart of the cognitivist/classicist research program, then the resulting explanations are inaccurate because they either underplay or completely overlook environmental factors that are essential to the formation of an accurate explanation of cognitive development. Consequently, this isolationist assumption is perceived to result in decreased explanatory power since it de-emphasizes two crucial factors that are needed to understand cognitive development: 1) the exact way organisms are embodied, and 2) the manner in which this embodied form simultaneously constrains and prescribes certain interactions within the environment. In its place, embodied cognition theorists favor a relational analysis that views the organism, the action it performs, and the environment in which it performs it as inextricably linked. Yet, before one can fully appreciate why embodied cognition theorists favor a relational over an isolationist analysis, it is necessary to discuss the theoretical assumptions that comprise the general embodied cognition framework.

2. General Characteristics of Embodied Cognition

Since the present embodied cognition research program is in its early stages, the general approach does not yet have hard and fast tenets that are agreed upon by all embodied cognition theorists. Consequently, this program is rather fluid, in that even the central researchers are striving to understand further exactly what is meant by embodied cognition. Yet, this should not prevent the characterization of the common assumptions found in most embodied cognition theories. The goal of this section is to highlight some of the most common theoretical assumptions shared by embodied accounts of cognition. The viewing of these assumptions together will provide a clearer picture of what embodied cognition roughly entails as a research program.

Once again, the central claim of embodied cognition is that an organism’s sensorimotor capacities, body and environment not only play an important role in cognition, but the manner in which these elements interact enables particular cognitive capacities to develop and determines the precise nature of those capacities. Developmental psychologist Esther Thelen (2001) further clarifies the central claim of this research program in the following passage:

To say that cognition is embodied means that it arises from bodily interactions with the world. From this point of view, cognition depends on the kinds of experiences that come from having a body with particular perceptual and motor capacities that are inseparably linked and that together form the matrix within which memory, emotion, language, and all other aspects of life are meshed. The contemporary notion of embodied cognition stands in contrast to the prevailing cognitivist stance which sees the mind as a device to manipulate symbols and is thus concerned with the formal rules and processes by which the symbols appropriately represent the world (xx).

Although embodied cognition accounts vary significantly across disciplines in terms of the specific ways in which they attempt to apply the general theory, a few common theoretical assumptions can be found in just about any embodied view one examines. These further theoretical assumptions help to flesh out the central thesis, and include 1) the primacy of goal-directed actions occurring in real-time; 2) the belief that the form of embodiment determines the type of cognition; and 3) the view that cognition is constructive. Each theoretical assumption will be explained by considering the work of a theorist whose research exemplifies the particular theoretical assumption under investigation. The first theoretical assumption, the primacy of goal-directed actions occurring in real time, is explained by considering research in robotics/artificial life and developmental psychology.

a. Primacy of Goal-Directed Actions Occurring In Real-Time

Embodied cognition theorists contend that thought results from an organism’s ability to act in its environment. More precisely, what this means is that as an organism learns to control its own movements and perform certain actions, it develops an understanding of its own basic perceptual and motor-based abilities, which serve as an essential first step toward acquiring more complex cognitive processes, such as language. Thus, goal-directed actions are described as primary for embodied theorists because these theorists argue that thought and language would not occur without the initial performance of these actions. In essence these low-level actions and movements are viewed as necessary for higher cognitive capacities to develop. In order to consider evidence in support of this initial theoretical assumption, one need only turn to the research of developmental psychologists Esther Thelen and Linda Smith (Thelen and Smith 1994, Thelen 1995). By briefly summarizing one of their numerous experiments on infant development, we can consider why many embodied cognition theorists characterize Thelen and Smith’s research as some of the most influential and convincing developmental evidence in support of this assumption that “thought grows from action and that activity is the engine of change” (Thelen 1995: 69). This discussion will highlight why the primacy of actions unfolding in real time is one of the defining theoretical assumptions of embodied accounts of cognition.

i. Developmental Psychology

In order to understand how infants learn to reach, Thelen and Smith (1994) examined four different infants from the time the babies were 3 weeks old until they were 1 year old. What Thelen and Smith conclude is that each of the four infants faced unique problems in learning to reach based on their individual energy level, body mass and the different ways in which they initially tried to reach (that is, their pre-reaching behaviors). Given these different pre-reaching movements, each of the infants had to learn a different set of strategies for controlling their arms so that the ultimate solution was specifically tailored to address the unique problem the particular infant was encountering. Thus, each infant was eventually able to overcome these developmental obstacles and learn to reach the toys, but the specific ways in which they learned this behavior varied depending upon the specific problem they were encountering. To understand how these different reaching problems translated into unique reaching solutions, let’s consider two of the infants whose reaching approaches varied considerably: Gabriel and Hannah.

Thelen and Smith describe Gabriel as an extremely active infant who was initially unable to successfully reach the toy because he would excitedly flap his arms, in seemingly random movements that were not focused enough to enable him to obtain the toy. Consequently, he had to learn to control these energetic movements so that this energy would become more focused. By learning to control these excited movements, he would then be able to produce a more controlled reaching-action that would propel his hand to the desired location. Gabriel eventually learned to reach toys after multiple unsuccessful attempts; however, these unsuccessful reaching attempts were instrumental in helping him realize how to adjust his muscle patterns so that a successful reaching pattern finally emerged that enabled him to focus his energy in the direction of the toy.

In contrast to Gabriel’s need to control wildly energetic movements, Hannah encountered quite the opposite problem. Unlike Gabriel, Hannah is described as “a quiet, contemplative infant who was visually alert and socially responsive, but motorically less active” (Thelen and Smith 1994: 259). Consequently, she did not encounter control problems, but suffered from the inability to generate enough force to overcome gravitational forces and propel her arm forward. Like Gabriel, Hannah learned to exert the proper amount of force needed to successfully reach an object through trial and error. However, her initial reaches were closer to an adult pattern than Gabriel’s because her slow movements enabled her to have more control over where her hand would encounter the toy. Thelen and Smith (1994) conclude that:

Hannah’s problem was different from Gabriel’s, but it was also the same. She, like Gabriel, had to adjust the energy of forces moving her arm—in her case to make her arm sufficiently stiff or forceful to lift it off her lap. What Gabriel and Hannah had in common, therefore, was the ability to modulate the forces they delivered to the arms to change their ongoing, but non-functional patterns to movements that brought their hands close enough to the toys for them to make contact. Their solutions were discovered in relation to their own situations, carved out of their individual landscapes, and not pre-figured by a synergy known ahead by the brain or the genes (260).

The importance of Thelen and Smith’s research becomes clear when we contrast their conclusions with the manner in which change is explained in other leading developmental theories. Thelen notes that in other theories change is explained by appealing to “some deus ex machina—’the genes,’ ‘maturation of the brain,’ ‘a shift into a new stage,’ or ‘an increase in information-processing capacity’” (Thelen 1995: 91). Such moves are problematic, Thelen argues since they merely push the level of explanation back a step so that in order to fully understand how change occurs this new theoretical mechanism must also be explained. Moreover, Thelen notes that the unique problems encountered and solved by individual infants make it extremely unlikely that the solutions were innate, since no internal mechanism could know in advance the specific “energy parameters of the system” (Thelen 1995: 90).

In contrast to these ungrounded attempts at explanation, Thelen and Smith claim to provide a theoretically-grounded, emergent conception of change by explaining change in terms of a dynamical systems framework, in which the challenge is “to understand how the system can generate its own change, through its own activity, and within its own continuing dynamics, be it the spring-like attractors of the limbs or the neural dynamics of the brain” (Thelen 1995: 91).

One advantage of a dynamic systems analysis is that it can account for how different infants must learn unique pre-reaching strategies based on their specific energy level, body mass and the different ways in which they initially tried to reach (that is, their pre-reaching behaviors). Yet, despite these different techniques, Thelen and Smith’s account still identifies the common factors that all of the infants had to learn to control: the various forces surrounding arm control, such as gravitational resistance. By developing a dynamical systems analysis of reaching behavior, Thelen and Smith provide a theoretical mechanism that tries to explain the exact way in which these different forces interact. The resulting analysis tracks how activity brings about changes in the system, so that new types of behavior emerge from behaviors the system already knows. This means of generating new patterns from those that already exist results in ‘environmental scaffolding’, since a new behavior is generated from the current resources of the system. Moreover, this dynamic systems analysis enables the researcher to track how the different movements/actions change and evolve over time. Consequently, behaviors, such as reaching, are explained in terms of interactive forces, which are mathematically understood since they are grounded in the physics of action.

One possible objection to a dynamic systems analysis of development is that this research program is limited because it will only be able to account for low-level, goal-directed action (that is, walking, reaching, etc.). Although this in itself would be a step forward, the ultimate goal is to also explain the diachronic emergence of higher-level cognitive abilities. Thus, in order to even have a chance at explaining cognitive complexity, a dynamical systems approach must bridge the gap between explaining how individuals acquire new lower-order activity patterns and explaining how they acquire higher order activity patterns, such as learning to categorize. In answer to this concern, Thelen argues that the infant’s ability to gain control over its body in order to perform various activities enables the infant to simultaneously learn certain categories. More specifically, the infant learns “that a certain category of force dynamics is appropriate for a certain class of tasks” (Thelen 1995: 95). For instance, infants learn that objects in front of them can be fun to play with. Therefore, these infants work to remember the ways in which they must change their muscle patterns in order to manipulate forces, which enables them to reach the object. Consequently, after a certain number of experiences with particular perceptual events (e.g., the toy in front of them), infants begin to recognize that action oriented solutions to these events are also generalizable (e.g., class of reaching toy behaviors). It is in this way that infants begin to associate particular patterns of force with particular events in the world. Thelen further explains that:

These early movements often look to be entirely without form or meaning. But if what neuroscientists tell us about the plasticity of the brain and how it changes is correct, infants are also continually learning something about the perceptual-motor systems and their relations to the world in their repeated spontaneous activity. That is, what infants sense and what they feel in their ordinary looking and moving are teaching their brains about their bodies and about their worlds. They are in fact exploring what range of forces delivered to their muscles get their arms in particular places and then learning from their exploration, remembering how certain categories of forces get their hands forward toward some-thing interesting (90).

Consequently, infants must learn how to perform certain activity patterns, such as reaching, and then remember when it is appropriate to generate those patterns again to achieve a desired goal. In order to effectively perform these behaviors at the appropriate times, the infant must learn to categorize particular situations and correctly apply the action solution that corresponds with that situation. For example, if a baby learns how to control its arm muscles so that it can reach a toy it desires, then it will not take long for the infant to realize that the same type of reaching behavior can also be used to grasp food. It is in this sense that the behaviors become generalized as the infant learns to use its body to explore its environment. Moreover, one might argue that the generalized categories formulated to perform these reaching behaviors could be viewed as one instance of intentional categorization emerging from action of a dynamical system.

Next, an examination of research conducted in the growing field of robotics/artificial life will further clarify why the primacy of action occurring in real time is a defining theoretical assumption that guides research in all areas of embodied cognition.

ii. Robotics/Artificial Life

Until recently, almost all of the robots built in the field of artificial intelligence were constructed according to the stored-description model. Building systems, according to the stored-description technique, requires programmers to guess at the conditions the robot will encounter, and then to spell out all of the relevant information that is needed for the system to generate an appropriate response in its environment. Determining what information to include in the system is difficult, since the programmer must anticipate everything the robot will need to know to perform its task as well as providing the robot a response to any unexpected environmental features that might throw it off task. This process of explicitly stating all of the necessary information is further complicated by the fact that the system does not start with any prior knowledge, or even a simplistic understanding of the kinds of things existing in the world. So, even if all of the relevant information is correctly represented in the system, there are still no guarantees the robot will correctly perform its task, since it must then determine what makes a piece of information relevant in one situation and not in another. Given these challenges, robots utilizing the stored description model are very brittle and tend to malfunction in environments when they encounter unexpected events, or multiple soft constraints.

In the early 1980’s, MIT roboticist Rodney Brooks became dissatisfied with the stored-description approach as well as with the general direction of artificial intelligence research. Although systems were being built that could play chess and calculate taxes, behaviors commonly associated with higher cognitive functions, Brooks argued that little progress was being made on developing systems that could quickly perform simple environmental tasks. After all, if one of the goals of robotics is to simulate how human cognitive processes work, then constructing robots only according to the stored description approach becomes problematic if these robots cannot adapt and change with their environment; abilities attributed to even simpler organisms, like insects. Therefore, Brooks decided to try to build a robot that could thrive in an environment without utilizing a central planning facility; the result was Herbert.

Herbert was designed to wander around the MIT lab disposing of empty soda cans. Although Herbert’s task might seem relatively simple, to accomplish it successfully he had to perform a number sub-tasks; including identifying empty soda cans from full ones, avoiding the stationary tables and chairs in his path, and maneuvering around the seldom-stationary people who also inhabit the lab. In order to efficiently accomplish his task of can removal, Herbert relied on what Brook’s called a “subsumption architecture,” which consisted of a number of connected layers, each responsible for performing a specific task; actions emerged from the suppression or activation of various sub-systems. As Herbert moved through his environment, he continuously encountered stimuli, which dictated which layer was activated at any given time. For instance, once Herbert’s object-detection layer successfully detected a wall obstructing its path, it activated the object-avoidance layer, which shut down the layer responsible for forward motion. The various connected layers plus the environmental stimuli ultimately determine the suppression or activation of a particular layer. Brooks argued that the subsumption architecture enables Herbert to “use the world as its own best representation” since Herbert does not need to refer to a detailed map of his surroundings before determining how to react. Instead, in systems such as Herbert, an effective interface is continually recreated between the system and the world without relying on a central planning facility to dictate commands, or encoding classicist representations.

Brook’s subsumption architecture provided an alternative to the stored-description architecture by demonstrating that a robot could quickly react in its environment without the aid of a formal plan. From a design perspective, this development was an important accomplishment since a smart tradeoff was achieved; a fast reaction time was gained by developing sub-systems/layers that generated behaviors that reacted to types of phenomena (that is, avoiding walls in general) instead of tokens (that is, avoiding wall #3). Since Herbert’s task could be successfully executed without needing to re-identify one wall from the next, Herbert’s wall avoidance layer reacts to every wall in the same manner—by avoiding it. Consequently, knowledge of tokens was traded for knowledge of types in a manner that promoted speed.

In summary, Brooks’ research in artificial life, as well as the research of many other roboticists (see also Mataric 1992, Agre and Chapman 1997, Tilden 1999, Mataric, Clancey 1997), helps to clarify the first theoretical assumption of embodied cognition: the primacy of goal-directed action occurring in real time. One reason that Brooks’ research is an excellent example of this theoretical assumption is his emphasis on developing robots that employ quick, cost-effective solutions to “everyday” problems encountered in an environment. Although much more progress needs to occur in Artificial Life before architectures are developed that are capable of explaining behaviors associated with higher cognitive processes, these early architectures are still able to do something the classicist/cognitivist systems have not: provide a preliminary attempt at modeling some of the simple, low-level behaviors that are necessary for survival.

In addition, the earlier examination of Thelen and Smith’s research provides us with another example of why embodied cognition accounts maintain that action occurring in real time is the essential to understanding cognitive development. Specifically, a dynamic systems analysis is capable of tracking the way in which behaviors evolve and unfold over time; this real-time analysis is completely missing from current classicist/cognitivist accounts of developmental change.

b. Form of Embodiment Constrains Kinds of Cognitive Processes

The next theoretical assumption to which most embodied cognition theorists ascribe is the belief that the embodiment of an organism simultaneously limits and prescribes the types of cognitive processes that are available to it. In other words, the particular way in which an organism is embodied (e.g., whether it has feet, fins, eyes, a tail, etc.) will influence how it performs goal-directed actions in the world, and the particular sensorimotor experiences connected with these actions will serve as the basis for category and concept formation.

To illustrate this point, consider how two very different organisms, a child and a puppy, will try to play with a ball. If the child wishes to get the ball, she will most likely use her hands, but she could also use her feet. Yet, she will not normally use her mouth to get the ball, even if the size of the ball does not preclude this option. This is because, aside from being culturally frowned upon, the other options enable greater control, are easier to perform, and are culturally sanctioned. However, a puppy has fewer options, and will most likely grab the ball with its mouth, since its particular form of embodiment will not enable it to grasp the ball with its paws. Although there are further differences related to how the child and puppy can perceive and interact with the ball, including the fact that the child’s visual system will include color cues, while the dog’s visual system will only enable it to see the ball in black and white, the important point is that, in each case, the way the organism is embodied constrains the options available to it.

A further point is that each of these different types of interactions (that is, grabbing with one’s hands, clutching with one’s mouth, pouncing with one’s paws, etc.) has its own set of corresponding sensorimotor experiences, which directly influence how the organism interacts with the object. This is because the continuous feedback from these sensorimotor experiences serves as the basis for how the organism understands a specific interaction. Moreover, since activities always take place in a specific environmental context, such as when a child plays soccer with a friend on a spring day, the sensorimotor driven understanding of the situation that is gained from performing the activity in these circumstances can further inform how the organism might carry out future attempts at performing the same activity.

In general, environmental factors are very important because they can influence not only what options are available to a particular organism, but also why an organism might choose one option over another when performing a particular goal-directed activity. For instance, weather conditions, the size of the ball, the rules of the game, and whether or not an individual has any broken limbs will most likely factor into their decision to throw the ball, or kick it. Yet, all of this person’s past experiences with an object in these varied activity-based contexts will in some way contribute to their current understanding of the activity. The individual’s understanding of these past experiences is directly informed by the kinds of sensorimotor experiences their form of embodiment allows.

The various sensorimotor experiences that occur while performing an action in a particular environmental context further specify the type of categories/concepts the organism is capable of forming. For instance, it is common for a small child to have a basic understanding of concepts related to macroscopic objects, such as grass, that are likely to exist in her immediate environment, while having little to no real understanding of concepts related to microscopic objects, such as bacteria, that might be found in the same environment. It is not surprising that the child gains an understanding of the macroscopic first, because these objects are the ones that she can see, taste, feel, hear, and smell unaided. In other words, she has sensorimotor experiences that are directly linked to the macroscopic objects in her environment, and these experiences serve as the foundation for concept formation. Not surprisingly, direct experience of microscopic entities will most likely occur later in the child’s life, when she is introduced to tools, such as a microscope, that will enable the detection of these entities. The child can also acquire indirect knowledge of microscopic entities if the explanation is cast in terms of those things that she already does understand, namely entities found on the macroscopic level.

In conclusion, the way in which we are embodied determines the type of action patterns we can perform and these action patterns shape our cognitive functions (that is, the way in which we can conceptualize and categorize). This is because most embodied cognition theorists argue that category and concept formation is made possible and constrained by the particular sensorimotor experiences of the organism. It is in this sense that the form of embodiment partly determines the kind of cognitive processes available to the organism. Psychologists, such as Barsalou (1983, 1997), Glenberg (1997,1999), and Thelen and Smith (1994), are but a few of the cognitive scientists who adopt this theoretical assumption even though the specific content of their individual views varies. For instance, Glenberg (1997) illustrates how cognition results from embodiment due to’mesh,’ which refers to the particular way in which affordances, knowledge, and goals combine. Yet, Barsalou (1997) develops a theory of simulation, and as demonstrated earlier, Thelen and Smith (1994) explain the emergence of this theoretical assumption according to a dynamical systems framework. Thus, all of these individuals agree with the theoretical assumption that the form of embodiment partly determines the cognitive processes available to the organism, but they still debate precisely how this occurs.

c. Cognition is Constructive

If the way we conceptualize and categorize is based on the way we are embodied, then according to embodied cognition theorists these concepts and categories are actively constructed and not merely apprehended wholesale from an observer-independent environment. The point here is that the way in which we are embodied not only constrains the way we can interact in the world, but our particular form of embodiment also partly determines the way the world appears to us. In effect, it does not follow from the existence of an observer-independent world that this world is seen in the same manner by all organisms. Instead, the claim is that certain environmental features are re-constructed depending upon a number of relevant factors, including the task at hand (that is, the goal-oriented action being performed), the functioning sensorimotor modalities, the vantage point of the organism, the form of embodiment, etc. The basic idea is that the organism actively constructs a sensorimotor representation that is based on those environmental features that are directly relevant to the goal-directed action it is currently performing. Consequently, environmental space X could be viewed differently by the same organism depending on the type of task the organism is performing in that space, primarily because the goal-directed activity determines which environmental features are relevant to the successful performance of the activity. For instance, individuals attend to different features when they are preparing to mow a stretch of grass with a lawn mover, than when they are playing soccer on it later the same day. This is because the environmental features one must observe to successfully mow the lawn are different from those that impact playing soccer well.

In direct contrast to viewing cognition as actively constructed from select environmental features, the cognitivist/classicist assumption is that the world has a set of pre-given features that are passively retrieved from the environment through representations that mirror the world; the way the organism is built and its particular goal-directed actions are not viewed as integral to the cognitivist/classicist analysis. Yet, embodied cognition theorists question the evolutionary viability of viewing cognition as passive retrieval; they maintain it is too time-consuming and unnecessary for organisms to formulate representations that completely mirror environmental features that are unrelated to the goal-directed activity the organism is currently performing. In response, the classicist/cognitivist might argue that a more serious problem results if you do claim that the embodiment of an organism determines how it will view the world; the very existence of an observer-independent world is called into question if an organism’s understanding of the world is constructed.

The embodied cognition theorist might respond that the classicist/cognitivist has misinterpreted what it means to claim that cognition is a constructive process. By constructive, Embodied theorists do not mean to imply that there is no objective, external reality and that everything is subjective. Instead, the point is that a type of mutual specification occurs between the organism and its environment, so that the way the world looks and the way in which the organism can interact in the world is primarily determined by the way the organism is embodied. So, an observer-independent world can be granted, but embodied cognition theorists claim that an organism will understand this world in terms of the unique sensorimotor relations it experiences. These fundamental sensorimotor experiences achieved through acting in the world are actively constructed to facilitate concept formation. For instance, we view our bodies as having distinct fronts and backs. Due to the characteristics we associate with each of these bodily spatial relations, linguist George Lakoff and philosopher Mark Johnson (1999) argue that we also characterize objects in the world according to these assignments (that is, go to the front of the house, that is the back of her shirt, etc.). This process is considered to be constructive because we project these characteristics onto the world because they reflect the foundational understanding we have of our own bodies.

Consequently, if we were embodied differently then we would not see the world in this particular way, but in terms of our new set of defining bodily characteristics. However, by taking into account the bodies that we do have, our actual projected spatial assignments can be traced back to sensorimotor experience, which enables the formation of spatial schemas that are projected onto a scene to facilitate reasoning without the use of deductive logic. These schemas are constructive because they do not mirror what exists in the world. Instead, these schemas structure elements within the world in such a way that the individuals can understand their environment quickly. Given this, it should not be surprising that one way for an organism to interpret its environment is in terms of something it already knows well: its own bodily interactions.

A number of arguments in support of the constructive nature of cognition are also offered In The Embodied Mind, in which cognitive scientist Francisco Varela, philosopher Evan Thompson and psychologist Eleanor Rosch argue at length that color “provides a paradigm of a cognitive domain that is neither pre-given nor represented but rather experiential and enacted” (1991:171). Specifically, Varela, Thompson, and Rosch maintain that our ability to see colors results from the active interplay of various sensorimotor modalities. The interconnected way in which these different sensorimotor modalities mutually affect one another is clearly demonstrated in the case of the colorblind painter; a neurological case study from which Varela et al are not merely arguing that color is constructive as a result of the visual system, but they are making the stronger claim that “color perception partakes of both other visual and sensory modalities” (164).

In this case study, a painter (hereafter Mr. I) who completely lost his ability to see colors after a car accident finds that this loss directly affected the way he experienced other sensorimotor experiences, such as taste and sound. As a result of his accident, he was only able to see the world in varying degrees of black, white and gray. Moreover, Mr. I was not able to imagine colors, dream in colors, or remember what colors looked like. Since he was no longer viewing the world as colored in any of these ways, Mr. I reported that the nature of his experience of the world was also affected dramatically. Reportedly, everything around him “had a distasteful, ‘dirty’ look, the whites glaring, yet discolored and off white, the black cavernous-everything wrong, unnatural, stained, and impure.” Due to this abrupt change in the way he was viewing his environment, he stated that he was no longer able to have sex or enjoy food. Moreover, Mr. I was not able to enjoy music to the degree he had before the accident since he was no longer able to visually transform musical notes into color sequences.

After living with this condition for some time, Mr. I remarked that while he was initially upset about his inability to perceive color, he now no longer misses it. In fact, he reported that his actions, tastes and behaviors have naturally adjusted over time to reflect that of a night person. He stated that “I love the night time….I often wonder about people who work at night. They never see the sunlight. They prefer it….It’s a different world: there’s a lot of space—you’re not hemmed in by streets, people….It’s a whole new world. Gradually I am becoming a night person. At one time I felt kindly toward color, very happy about it….Now I don’t even know it exists—it’s not even a phantom” (164). Varela et al. concluded that:

This description provides rare insight into how our perceived world, which we usually take for granted, is constituted through complex and delicate patterns of sensorimotor activity. Our colored world is brought forth by complex processes of structural coupling. When these processes are altered, some forms of behavior are no longer possible. One’s behavior changes as one learns to cope with new conditions and situations. And, as one’s actions change, so too does one’s sense of the world. If these changes are dramatic enough—as in Mr. I’s loss of color—then a different perceived world will be enacted (164).

This case is meant to illustrate that if one’s ability to see color is completely removed, then other sensorimotor experiences are also affected. Varela et al. argue that since vision is not the only modality affected by Mr. I’s accident, his condition provides some insight into the way in which “perception and action, sensorium and motorium, are linked together as successively emergent and mutually selecting patterns” (163).

Although color is but one example of the way in which cognition is constructive, the above case study might prompt one to ask what is the proper or correct way to view the world? According to Embodied theorists, the answer is that there is no single proper or correct way of viewing the world, since being able to correctly see the world translates into using whatever sensorimotor modalities one has to act successfully in one’s environment. Moreover, since an organism’s sensorimotor apparatus determines the way it will experience the world, many embodied theorists argue that instead of assuming that every organism shares the exact same view of the world (that is, we all view an objective reality in the same way), it makes more sense to acknowledge that an organism’s particular view of the world is the direct result of its functioning sensorimotor experiences. The point is that an organism’s knowledge of the world is primarily through its experiences within the world and these experiences are constrained by the types of functioning sensorimotor modalities it has. When one of these modalities is impaired, then its experience of the world will similarly be affected on multiple levels, since these modalities influence one another. The case of the colorblind painter illustrates the cross-modal natures of sensori-motor experience by showing that the impairment of one modality (color) affected the way the world was experienced in other modalities (taste, sound, etc.) to the point that certain previously performed actions suddenly no longer make sense. Therefore, the type of structural coupling that enables color perception to occur is a paradigm example of constructive cognition.

The theoretical assumption that at least some forms of cognition are constructive is supported by a growing number of theorists from a variety of disciplines. Varela et al. argue that the coupling that occurs between organism and environment results in constructive cognition. Lakoff and Johnson (1999) argue that cognition is constructive since it involves projecting schemas (e.g., bodily) and combining these schemas to create a metaphorical understanding of the world. Glenberg (1997, 1999), Damasio (1994), and Fauconnier and Turner (2002) are but a few of the cognitive scientists who maintain that cognition is in some way constructive. Thus, this theoretical assumption is becoming more widely supported in the embodied cognition literature.

3. Embodied Cognition vs. Classicism/Cognitivism

Based on the analysis of the above theoretical assumptions of embodied cognition, it is now possible to directly contrast the central themes of the embodied cognition research program with those commonly expressed in the classicist/cognitivist research program:

Classicist/Cognitivist View Embodied Cognition View
1. Computer metaphor of mind; rule-based, logic driven. 1. Coupling metaphor of mind; form of embodiment + environment + action constrain cognitive processes.
2. Isolationist analysis – cognition can be understood by focusing primarily on an organism’s internal processes. 2. Relational analysis-interplay among mind, body, and environment must be studied to understand cognition.
3. Primacy of computation. 3. Primacy of goal-directed action unfolding in real time.
4. Cognition as passive retrieval. 4. Cognition as active construction based upon an organism’s embodied, goal-directed actions
5. Symbolic, encoded representations 5. Sensorimotor representations

Although most embodied cognition accounts do adhere to the theoretical assumptions outlined in this entry, it is important to recognize that this rapidly changing research program encompasses a diverse group of theorists, who are continuing to refine and revise the preliminary theoretical assumptions associated with the embodied cognition view. Consequently, some accounts may reject one of the outlined assumptions, yet still identify as an embodied account of cognition.

4. Philosophical Implications of the Embodied Cognition Research Program

The ultimate claim of embodied theorists is that new insights into previously unanswered questions concerning cognitive development will be attained if cognitive scientists re-orient their approach and conduct research in a manner that acknowledges the crucial links existing among an organism’s brain, body, and world. Yet, this immediately begs the question: what does it mean for researchers to re-orient their approach? Once again, there is no consensus among the embodied cognition theorists as to what this re-orientation entails; however; there are currently two distinct views concerning how cognitive scientists should apply the general embodied cognition thesis, each with different methodological implications.

a. The Compatibilist Approach

The Compatibalist Approach to Embodied Cognition involves using a variety of methods to explain cognitive processes. In some cases, the phenomena will call for a classicist/cognitivist analysis and in other cases the methods associated with the embodied cognition framework will make more sense. Researchers who endorse this compatibalist view, such as philosopher Andy Clark (1997), argue that it would be a mistake to completely dispense with the theoretical tools associated with classicist/cognitivist models, especially since it is unclear if embodied cognition accounts will be able to adequately explain higher level processes (e.g., meta-cognitive states such as the ability to think about one’s own thoughts) without invoking on some level a computational or representational analysis. In short, embodied cognition theorists who endorse a compatibalist view to research are hedging their bets, and leaving open the possibility of utilizing tools from multiple theoretical frameworks. A potential problem with compatibalist conceptions is that it is not clear how mechanisms/tools derived from opposing theoretical frameworks can be successfully linked together, since these frameworks employ at best different, and at times mutually exclusive, assumptions about the world (that is, cognition is constructive vs. cognition is passive). Given this, one might question how mechanisms derived from a cognitivist framework can hook-up and mutually inform mechanisms derived from embodied frameworks so that a theoretically viable explanation emerges despite the fundamental theoretical differences. Perhaps it is this very concern that has led some embodied cognition theorists to endorse a more stringent form of embodied cognition: the purist approach to embodied cognition.

b. The Purist Approach

The Purist Approach to Embodied Cognition is often characterized as the radical version of the embodied cognition thesis because researchers who adopt it argue that the classicist/cognitivist thesis is incorrect. Consequently, they claim that any tools or theoretical mechanisms developed from classicist/cognitivist assumptions are also flawed. Instead, these classicist/cognitivist tools cannot be augmented, but must be completely replaced with a diverse set of tools/mechanisms that are consistent with the central embodied cognition thesis. One problem with the purist view of embodied cognition is that there is no guarantee that the necessary tools/mechanisms will be developed to enable embodied theorists to explain these higher cognitive processes, especially those specific to human cognition. Even though a number of promising theoretical tools currently exist (that is, dynamic systems theory, schemas, conceptual blending, mesh, etc.), those researchers who are adopting the purist approach are clearly gambling that more sophisticated theoretical tools/mechanisms will be developed in the near future to adequately explain the emergence of higher cognitive processes. Although it is too early to say definitively what the outcome will be, it is clear that the general embodiment thesis can no longer be ignored by researchers in cognitive science, including philosophers of mind, since the very thesis calls into question widely-held assumptions about cognition.

5. References and Further Reading

  • Brooks, R. (1991). “Intelligence without representation.” Artificial Intelligence, 47, 139-159.
  • Clancey, W. (1997). Situated Cognition: On Human Knowledge and Computer Representations. Cambridge, MA: Cambridge University Press.
  • Clark, A. (1997). Being There: Putting Brain Body and World Together Again. Cambridge, MA: MIT Press. (Recommended.)
  • Clark, A. (1999). “Embodied, situated, and distributed cognition.” In W. Betchel and G. Graham (eds), A Companion to Cognitive Science, Malden, MA: Blackwell Publishing.
  • Clark, A. and Chalmers, D. (1998). The extended mind. Analysis, 58, 7-19.
  • Cisek, P. (1999). “Beyond the Computer Metaphor: Behavior as Interaction.” In Nunez, R. and Freeman, W., Reclaiming Cognition: the primacy of action intention and emotion, Bowling Green, OH: Imprint Academic.
  • Dreyfus, H. (1972/92). What Computers Can’t Do: A Critique of Artificial Reason. New York: Harper and Row. (Third edition: What Computers Still Can’t Do. 1992. Cambridge, MA: MIT)
  • Fauconnier, G. and Turner, M. (2002). The Way We Think: Conceptual Blending and the Mind’s Hidden Complexities. New York, NY: Basic Books.
  • Glenberg, A. (1997). “What memory is for: Creating meaning in the service of action.” Behavioral and Brain Sciences, 20, 1-55.
  • Glenberg, A. (1999). “Why Mental Models Must Be Embodied.” In Mental Models in Discourse Processing and Reasoning, Rickheit, G. and Habel, C. (eds). New York: Elsevier.
  • Harnad, S. (1990). “The symbol grounding problem.” Physica D, 42,335-346.
  • Horgan, T and Tienson, J. (1989). “Representations Without Rules.” Philosophical Topics, 17 (Spring), 147-174.
  • Hutchins, E. (1995). Cognition in the Wild. Cambridge, MA: MIT Press. (Recommended.)
  • Lakoff, G., and Johnson, M. (1999). Philosophy In the Flesh: The Embodied Mind And Its Challenge To Western Thought. New York, NY: Basic Books. (Recommended.)
  • Mataric, M. J. (1992). “Integration of representation into goal-driven behavior based robots.” IEEE Transactions on Robotics and Automation, 8 (3): 304-312.
  • Searle, J. (1980). “Minds, brains, and programs.” Behavioral and Brain Sciences, 1, 417-424.
  • Thelen, E.,and Smith, L. (1994). A Dynamic Systems Approach to the Development of Cognition and Action. Cambridge, MA: MIT Press.
  • Thelen, E. (1995). “Time-scale dynamics in the development of an embodied cognition.” In Mind In Motion, ed. R. Port and T. van Gelder. Cambridge, MA: MIT Press.
  • Thelen, E., Schoner, G., Scheier, C., and Smith, L.B.(2001). “The Dynamics of Embodiment: A Field Theory of Infant Perservative Reaching.” Behavioral and Brain Sciences 24: 1-86.
  • Varela, F., Thompson, E., Rosch, E. (1991). The Embodied Mind. Cambridge, MA: MIT Press.

Author Information

Monica Cowart
Email: Monica.Cowart@merrimack.edu
Merrimack College
U. S. A.

Benedict De Spinoza (1632—1677)

SpinozaBenedict de Spinoza  was among the most important of the post-Cartesian philosophers who flourished in the second half of the 17th century. He made significant contributions in virtually every area of philosophy, and his writings reveal the influence of such divergent sources as Stoicism, Jewish Rationalism, Machiavelli, Hobbes, Descartes, and a variety of heterodox religious thinkers of his day. For this reason he is difficult to categorize, though he is usually counted, along with Descartes and Leibniz, as one of the three major Rationalists. Given Spinoza’s devaluation of sense perception as a means of acquiring knowledge, his description of a purely intellectual form of cognition, and his idealization of geometry as a model for philosophy, this categorization is fair. But it should not blind us to the eclecticism of his pursuits, nor to the striking originality of his thought.

Among philosophers, Spinoza is best known for his Ethics, a monumental work that presents an ethical vision unfolding out of a monistic metaphysics in which God and Nature are identified. God is no longer the transcendent creator of the universe who rules it via providence, but Nature itself, understood as an infinite, necessary, and fully deterministic system of which humans are a part. Humans find happiness only through a rational understanding of this system and their place within it. On account of this and the many other provocative positions he advocates, Spinoza has remained an enormously controversial figure. For many, he is the harbinger of enlightened modernity who calls us to live by the guidance of reason. For others, he is the enemy of the traditions that sustain us and the denier of what is noble within us. After a review of Spinoza’s life and works, this article examines the main themes of his philosophy, primarily as they are set forth in the Ethics.

Table of Contents

  1. Life and Works
  2. Geometric Method and the Ethics
  3. Metaphysics
    1. Substance Monism
      1. Definitions
      2. Preliminary Propositions
      3. Substance Monism Demonstrated
    2. The Modal System
      1. Natura naturans and Natura naturata
      2. Two Types of Mode
      3. Causal Determinism
      4. Causal Parallelism
  4. Mind and Cognition
    1. The Mind as the Idea of the Body
    2. Imagination
      1. Sense Perception
    3. Inadequate Ideas
    4. Adequate Ideas
    5. Three Kinds of Knowledge
  5. Psychology
    1. Rejection of Free-Will
    2. The Conatus Principle
    3. The Affects
    4. Bondage
  6. Ethics
    1. Freedom from the Passions
    2. Conatus and the Guidance of Reason
    3. Knowledge of God as the Highest Good
    4. Intellectual Love of God and Human Blessedness
    5. Eternity of the Mind
    6. Conclusion
  7. References and Further Reading
    1. Texts and Translations of Spinoza
    2. General Studies Suitable as Introductions
    3. More Advanced and Specialized Studies
    4. Collected Essays on Spinoza

1. Life and Works

Spinoza came into the world a Jew. Born in 1632, he was the son of Marrano parents. They had immigrated to Amsterdam from Portugal in order to escape the Inquisition that had spread across the Iberian Peninsula and live in the relatively tolerant atmosphere of Holland. Spinoza’s father, Michael, was a successful merchant and a respected member of the community. His mother, Hanna, the second of Michael’s three wives, died in 1638, just before Spinoza was to turn six.

The young Spinoza, given the name Baruch, was educated in his congregation’s academy, the Talmud Torah school. There he received the kind of education that the community deemed necessary to constitute one as an educated Jew. This largely consisted of religious study , including instruction in Hebrew, liturgy, Torah, prophetic writings, and rabbinical commentaries. Although Spinoza no doubt excelled in these, he did not move on to the higher levels of study which focused on the Talmud and were typically undertaken by those preparing for the rabbinate. Whether by desire or by necessity, Spinoza left the school in order to work in his father’s business, which he eventually took over with his half-brother, Gabriel.

The Jewish community in Amsterdam was by no means a closed one , but Spinoza’s commercial activities put him in touch with more diverse currents of thought than those to which he had hitherto been exposed. Most significantly, he came into contact with so-called ‘free-thinking’ Protestants – dissenters from the dominant Calvinism – who maintained a lively interest in a wide range of theological issues, as well as in the latest developments in philosophy and science. This naturally included the work of Descartes, which was regarded by many in Holland to be the most promising of several alternatives to scholasticism that had emerged in recent decades. In order to discuss their interests, these free-thinkers organized themselves into small groups, they called colleges, which met on a regular basis. Spinoza may have attended such meetings as early as the first half of the 1650’s, and it is most likely here that he received his first exposure to Cartesian thought.

This is not to say that Spinoza ceased to mine the resources of his own tradition – he became steeped, for example, in the writings of such philosophically important figures as Maimonides and Gersonides – but his intellectual horizons were expanding and he was experiencing a restlessness that drove him to look further afield. It was at this time that he placed himself under the tutelage of an ex-Jesuit, Franciscus Van den Enden, who had recently set up a Latin school in Amsterdam. Van den Enden turned out to be the perfect teacher for Spinoza. In addition to having an excellent reputation as a Latinist, he was a medical doctor who kept abreast of all that was new in the sciences. He was also notorious for his allegedly irreligious cast of mind, and he was a passionate advocate of democratic political ideals. It is safe to say that Spinoza’s studies with Van den Enden included more than lessons on how to decline nouns.

Spinoza’s intellectual reorientation, however, came at a cost. His increasingly unorthodox views and, perhaps, laxity in his observance of the Jewish law strained his relations with the community. Tensions became so great that, in 1656, the elders of the synagogue undertook proceedings to excommunicate him. Without providing details, the writ of excommunication accuses him of ‘abominable heresies’ and ‘monstrous deeds’. It then levels a series of curses against him and prohibits others from communicating with him, doing business with him, reading anything he might write, or even coming into close proximity with him. Spinoza may still have been a Jew, but he was now an outcast.

Little is known about Spinoza’s activities in the years immediately following his excommunication. He continued his studies with Van den Enden and occasionally took up residence in his teacher’s home. As it was now impossible for him to carry on in commerce, it was most likely at this time that he took up lens grinding as an occupation. There is also evidence that he traveled periodically to Leiden to study at the university. There he would have received formal instruction in Cartesian philosophy and become familiar with the work of prominent Dutch Cartesians. In 1661, he settled near Leiden, in the town of Rijnsburg.

It was during this same period, in the late 1650’s, that Spinoza embarked upon his literary career. His first work, the Treatise on the Emendation of the Intellect, is an attempt to formulate a philosophical method that would allow the mind to form the clear and distinct ideas that are necessary for its perfection. It contains, in addition, reflection upon the various kinds of knowledge, an extended treatment of definition, and a lengthy analysis of the nature and causes of doubt. For reasons that are unknown, the Treatise was left unfinished, though it appears that Spinoza always intended to complete it. Shortly thereafter, while in Rijnsburg, Spinoza set to work on his Short Treatise on God, Man, and His Well-Being. This work, circulated privately among friends, foreshadows many of the themes of his mature work, the Ethics. Most notably, it contains an unambiguous statement of the most famous of Spinoza’s theses – the identity of God and Nature.

Spinoza’s stay in Rijnsburg was brief. In 1663 he moved to the town of Voorburg, not far from The Hague, where he settled into a quiet, but busy, life. At the behest of friends, he immediately set about preparing for publication a set of lessons that he had given to a student in Leiden on Descartes’s Principles of Philosophy. The result was the only work that he was to publish under his own name, now Latinized to Benedict: René Descartes’s Principles of Philosophy, Parts I and II, Demonstrated According to the Geometric Method by Benedict de Spinoza of Amsterdam. As a condition of publication, Spinoza had his friend, Lodewijk Meyer, write a preface to the work, warning the reader that his aim was exposition only and that he did not endorse all of Descartes’s conclusions. He also appended a short piece, entitled Metaphysical Thoughts, in which he sketched some of his own views. Despite his admiration for Descartes, Spinoza did not want to be seen as a Cartesian.

Spinoza’s work on Descartes shows him to have been interested from early on in the use of geometric method in philosophy. In addition to putting parts of the Principles into geometric form, he began experimenting with geometric demonstrations of material taken from his own Short Treatise. It was out of this experimentation that the idea arose for a fully geometric presentation of his thought. He began work on this sometime in the early 1660’s, and by 1665 substantial portions of what was to become the Ethics were circulating in draft form among his friends back in Amsterdam. Though he was well into the project by then, the political and religious climate of the day made Spinoza hesitant to complete it . He chose to exercise caution and suspended work on it, turning instead to a book that would prepare an audience receptive to the Ethics. This was the Theological-Political Treatise, which he completed and published anonymously in 1670.

Spinoza’s aim in the Theological-Political Treatise was to argue that the stability and security of society is not undermined but, rather, enhanced by freedom of thought, meaning primarily the freedom to philosophize. As is clear from the text, he considered the primary threat to this freedom emanated from the clergy, whom he accused of playing upon the fears and superstitions of people in order to maintain power. His solution was to divest the clergy of all political power, even to the point of placing authority over the practice of religion in the hands of the sovereign. The sovereign, Spinoza argued, should extend broad liberties within this domain, requiring adherence to no more than a minimal creed that was neutral with respect to competing sects and the meaning of which was open to a variety of interpretations. This, he hoped, would allow philosophers the freedom to do their work unencumbered by the constraints of sectarianism.

As was to be expected, the Theological-Political Treatise was met with a firestorm of criticism. It was condemned as a work of evil, and its author was accused of having nefarious intentions in writing it. Even some of Spinoza’s closest friends were deeply unsettled by it. Though he had assiduously tried to avoid it, Spinoza found himself embroiled in heated religious controversy and saddled with a reputation for atheism, something he greatly resented.

Spinoza’s last move, in 1670, was to The Hague, where he was to live out his remaining years. Besides having to deal with fallout from his Theological-Political Treatise, he witnessed a political revolution that culminated in the murder of the Grand Pensionary of Holland, Jan De Witt, along with his brother, Cornelius, by an angry mob of Orangist-Calvinists. Spinoza admired De Witt for his liberal policies and was horrified at the murder. With the ascent of the Orangist-Calvinist faction, he felt his own situation to be tenuous.

Despite these distractions, Spinoza pressed on. He undertook new projects, including the writing of a Hebrew grammar, and he turned back to work on the Ethics. Given the hostility with which the Theological-Political Treatise was met and the realities of the new political landscape, he must have done so with a deep sense of pessimism about its chances for success. By 1675 it was complete. As he perceived his enemies to have grown in influence and opportunity, however, Spinoza decided against publishing it. Public viewing of the definitive statement of his philosophy would have to wait until after his death.

By this time Spinoza was in a state of failing health. Weakened by a respiratory illness, he devoted the last year of his life to writing a work of political philosophy, his Political Treatise. Though left unfinished at his death, Spinoza’s intention was to show how governments of all types could be improved and to argue for the superiority of democracy over other forms of political organization. Following the lead of Machiavelli and Hobbes, his argument was to be non-utopian, based on a realistic assessment of human nature drawn from the psychological theory set forth in the Ethics. In the part he did finish, Spinoza showed himself to be an astute analyst of diverse constitutional forms and an original thinker among liberal social contract theorists.

Spinoza died peacefully in his rented room in The Hague in 1677. He left no will, but the manuscripts of his unpublished works—the Treatise on the Emendation of the Intellect, the Ethics, the Hebrew Grammar, and the Political Treatise along with his correspondence—were found in in his desk. These were immediately shipped to Amsterdam for publication, and in short order they appeared in print as B.D.S. Opus Posthuma. But even in death Spinoza could not escape controversy; in 1678, these works were banned throughout Holland.

2. Geometric Method and the Ethics

Upon opening Spinoza’s masterpiece, the Ethics, one is immediately struck by its form. It is written in the style of a geometrical treatise, much like Euclid’s Elements, with each book comprising a set of definitions, axioms, propositions, scholia, and other features that make up the formal apparatus of geometry. One wonders why Spinoza would have employed this mode of presentation. The effort it required must have been enormous, and the result is a work that only the most dedicated of readers can make their way through.

Some of this is explained by the fact that the seventeenth century was a time in which geometry was enjoying a resurgence of interest and was held in extraordinarily high esteem, especially within the intellectual circles in which Spinoza moved. We may add to this the fact that Spinoza, though not a Cartesian, was an avid student of Descartes’s works. As is well known, Descartes was the leading advocate of the use of geometric method within philosophy, and his Meditations was written more geometrico, in the geometrical style. In this respect the Ethics can be said to be Cartesian in inspiration.

While this characterization is true, it needs qualification. The Meditations and the Ethics are very different works, not just in substance, but also in style. In order to understand this difference one must take into account the distinction between two types of geometrical method, the analytic and the synthetic. Descartes explains this distinction as follows:

Analysis shows the true way by means of which the thing in question was discovered methodically and as it were a priori, so that if the reader is willing to follow it and give sufficient attention to all points, he will make the thing his own and understand it just as perfectly as if he had discovered it for himself. . . . . Synthesis, by contrast, employs a directly opposite method where the search is, as it were, a posteriori . . . . It demonstrates the conclusion clearly and employs a long series of definitions, postulates, axioms, theorems and problems, so that if anyone denies one of the conclusions it can be shown at once that it is contained in what has gone before, and hence the reader, however argumentative or stubborn he may be, is compelled to give his assent. (CSM II,110-111)

The analytic method is the way of discovery. Its aim is to lead the mind to the apprehension of primary truths that can serve as the foundation of a discipline. The synthetic method is the way of invention. Its aim is to build up from a set of primary truths a system of results, each of which is fully established on the basis of what has come before. As the Meditations is a work whose explicit aim is to establish the foundations of scientific knowledge, it is appropriate that it employs the analytic method. The Ethics, however, has another aim, one for which the synthetic method is appropriate.

As its title indicates, the Ethics is a work of ethical philosophy. Its ultimate aim is to aid us in the attainment of happiness, which is to be found in the intellectual love of God. This love, according to Spinoza, arises out of the knowledge that we gain of the divine essence insofar as we see how the essences of singular things follow of necessity from it. In view of this, it is easy to see why Spinoza favored the synthetic method. Beginning with propositions concerning God, he was able to employ it to show how all other things can be derived from God. In grasping the order of propositions as they are demonstrated in the Ethics, we thus attain a kind of knowledge that approximates the knowledge that underwrites human happiness. We are, as it were, put on the road towards happiness. Of the two methods it is only the synthetic method that is suitable for this purpose.

3. Metaphysics

Although the Ethics is not principally a work of metaphysics, the system it lays out stands as one of the great monuments in the tradition of grand metaphysical speculation. What is perhaps most noteworthy about this system is that it is a species of monism – the doctrine that all of reality is in some significant sense one. In Spinoza’s case, this is exemplified by the claim that there is one and only one substance. This substance he identifies as God. While monism has had its defenders in the west, they have been few and far between. Spinoza is arguably the greatest among them.

a. Substance Monism

Spinoza builds his case for substance monism in a tightly reasoned argument that culminates in IP14. We may best follow the course of this argument by taking it in three parts. First, we examine four definitions that play a crucial role in the argument. Second, we look at two propositions to which the demonstration of IP14 appeals. And third, we turn to the demonstration of IP14 itself.

i. Definitions

Among the eight definitions that open Book One of the Ethics, the following four are most important to the argument for substance monism:

ID3: By substance I understand what is in itself and is conceived through itself, that is, that whose concept does not require the concept of another thing, from which it must be formed.

This definition has two components. First, a substance is what exists in itself. This is to say that it is an ultimate metaphysical subject. While other things may exist as features of a substance, substance does not exist as a feature of anything else. Second, a substance is what is conceived through itself. This is to say that the idea of a substance does not involve the idea of any other thing. Substances are both ontologically and conceptually independent.

ID4: By attribute I understand what the intellect perceives of a substance, as constituting its essence.

An attribute is not just any property of a substance – it is its very essence. So close is the association of an attribute and the substance of which it is an attribute that Spinoza denies that there is a real distinction between them.

ID5: By mode I understand the affections of a substance, or that which is in another through which it is also conceived.

A mode is what exists in another and is conceived through another. Specifically, it exists as a modification or an affection of a substance and cannot be conceived apart from it. In contrast to substances, modes are ontologically and conceptually dependent.

ID6: By God I understand a being absolutely infinite, that is, a substance consisting of an infinity of attributes, of which each one expresses an eternal and infinite essence.

God is an infinite substance. By this Spinoza means both that the number of God’s attributes is unlimited and that there is no attribute that God does not possess. As we make our way through the Ethics, we learn that only two of these attributes can be known by the human mind. These are thought and extension.

ii. Preliminary Propositions

Spinoza moves from these definitions to demonstrate a series of propositions concerning substance in general and God in particular on the basis of which he will demonstrate that God is the one and only substance. The following two propositions are landmarks in the overall argument and are explicitly invoked in the demonstration of IP14:

IP5: In Nature there cannot be two or more substances of the same nature or attribute.

In support of this proposition, Spinoza argues that if two or more substances were to exist they would be differentiated either by a difference in modes or by a difference in attributes. However, they could not be differentiated by a difference in modes, for substances are prior in nature to their modes. Thus, they would have to be differentiated by a difference in attributes. Controversially, Spinoza takes this to entail that no two substances can have exactly the same set of attributes, nor can they have a common attribute. Substances must be entirely dissimilar to one another.

IP11: God, or a substance consisting of infinite attributes, each of which expresses eternal and infinite essence, necessarily exists.

In support of this proposition, Spinoza offers a variant of the so-called Ontological Argument. The basic consideration upon which this variant rests is that it pertains to the nature of substance to exist. Spinoza establishes this earlier, in IP7, by appealing to the fact that substances, being entirely dissimilar to one another, cannot produce one another. Since nothing else can produce a substance, substances must be self-caused, which is to say that it pertains to the nature of substance to exist. To imagine that God does not exist is thus absurd. As a substance consisting of infinite attributes, it pertains to the divine nature to exist.

iii. Substance Monism Demonstrated

With these propositions in place, Spinoza has everything he needs to demonstrate that there is one and only one substance and that this substance is God:

IP14: Except God, no substance can be or be conceived.

The demonstration of this proposition is exceedingly simple. God exists (by IP11). Since God possesses every attribute (by ID6), if any substance other than God were to exist, it would possess an attribute in common with God. But, since there cannot be two or more substances with a common attribute (by IP5), there can be no substance other than God. God is the one and only substance.

The implications of this proposition are startling, and Spinoza can be seen to be working them out through the remainder of the Ethics. Most obviously, this proposition marks a break with the substance pluralism advocated by the majority of philosophers in the west. Even Descartes, from whom Spinoza learned much in the area of metaphysics, posited a plurality of mental and physical substances, along with God, whom he regarded as the paradigm of a substance. More importantly, it signals a rejection of classical theism, the idea that God is the creator of the universe who remains ontologically distinct from it and governs it according to his sovereign will. Spinoza has nothing but scorn for this idea and dismisses it as a product of the imagination. How it is that he reconceptualizes the relation between God, the infinite substance, and the order of finite things, becomes clear only as we turn to his account of the modal system.

b. The Modal System

In line with his rejection of classical theism, Spinoza famously identifies God with Nature. Nature is no longer seen as a power that is distinct from and subordinate to God, but as a power that is one and the same with divine power. Spinoza’s phrase ‘Deus sive Natura’ (‘God or Nature’) captures this identification and is justly celebrated as a succinct expression of his metaphysics. In isolation, however, the phrase is relatively uninformative. It tells us nothing about how Spinoza, having rejected the creator/creation relation posited by the classical model, conceives of the relation between God and the system of modes.

i. Natura naturans and Natura naturata

To fill out his thoughts on this matter, Spinoza distinguishes between Nature taken in its active or productive aspect, which he identifies with God or the divine attributes, and Nature taken in its derivative or produced aspect, which he identifies with the system of modes. The former he calls Natura naturans(literally: Nature naturing) and the latter he calls Natura naturata (literally: Natura natured). Spinoza’s use of these formulas is revealing in two respects. First, his double employment of ‘Natura‘ signals the ontological unity that exists between God and the system of modes. Each mode within the system is a modification of nothing other than the very substance that is God. Second, his employment of the active ‘naturans‘ in the first and the passive ‘naturata‘ in the second signals a causal relation between God and the modal system. God is not merely the subject of modes; he is an active power that produces and sustains them.

In view of the ontological unity that exists between God and the modal system, Spinoza is careful to specify that the divine causality is immanent rather that transitive. What this means is that God’s causal activity does not pass outside of the divine substance to produce external effects, as it would if God were a creator in the traditional sense. Rather, it remains wholly within the divine substance to produce the multitude of modes that constitute the modal system. Spinoza likens this to the way in which the nature of a triangle is productive of its own essential properties: “From God’s supreme power, or infinite nature, infinitely many things in infinitely many modes, that is, all things, have necessarily flowed, or always follow, by the same necessity and in the same way as from the nature of a triangle it follows, from eternity and to eternity, that its three angles are equal to two right angles” (IP17S1). The entire modal system, Natura naturata, follows immanently from the divine nature, Natura naturans.

ii. Two Types of Mode

Into this relatively simple picture, Spinoza introduces a complication. There are, he says, two types of mode. The first consists in what he calls infinite and eternal modes. These are pervasive features of the universe, each of which follows from the divine nature insofar as it follows from the absolute nature of one or another of God’s attributes. Examples include motion and rest under the attribute of extension and infinite intellect under the attribute of thought. The second consists in what may be called finite and temporal modes, which are simply the singular things that populate the universe. Modes of this type follow from the divine nature as well, but do so only as each follows from one or another of God’s attributes insofar as it is modified by a modification that is itself finite and temporal. Examples include individual bodies under the attribute of extension and individual ideas under the attribute of thought.

Unfortunately, Spinoza does little to explain either what these infinite and eternal modes are or what relation they have to finite and temporal modes. Taking their cue from a statement in the Treatise on the Emendation of the Intellect that the laws of nature are embedded in the infinite and eternal modes, many commentators have suggested that Spinoza thought of these modes as governing the manner in which finite modes affect one another. For example, if laws of impact are somehow embedded in the infinite and eternal mode motion and rest, then the outcome of any particular collision will be determined by that mode together with the relevant properties (speed, direction, size, etc) of the bodies involved. If this is correct, then Spinoza envisions every finite mode to be fully determined by intersecting lines of causality: a horizontal line that stretches back through the series of antecedent finite modes and a vertical line that moves up through the series of infinite modes and terminates in one or another of the attributes of God.

iii. Causal Determinism

However it may be that Spinoza ultimately conceives of the relation between infinite and finite modes, he is clear about one thing – the system of modes is an entirely deterministic system in which everything is fully determined to be and to act:

IP29: In nature there is nothing contingent, but all things have been determined from the necessity of the divine nature to exist and produce an effect in a certain way.

Spinoza reminds us that God’s existence is necessary. It pertains to the very nature of substance to exist. Furthermore, since each and every mode follows from the necessity of the divine nature, either from the absolute nature of one or another of God’s attributes, as is the case with the infinite and eternal modes, or from one or another of God’s attributes insofar as it is modified by a modification that is finite, as is the case with the finite modes, they are all necessary as well. Since there is nothing other than the divine substance and its modes, there is nothing that is contingent. Any appearance of contingency is the result of a defect in knowledge, either of God or of the order of causes. Accordingly, Spinoza makes it central to his theory of knowledge that to know a thing adequately is to know it in its necessity, as it has been fully determined by its causes.

iv. Causal Parallelism

An obvious question to ask at this point is whether it is possible for finite modes falling under one attribute to act upon and determine finite modes falling under another attribute. Spinoza’s answer is an unambiguous no. Causal relations exist only among modes falling under the same attribute. His explanation for this may be traced back to an axiom set forth at the beginning of Book One:

IA4: The knowledge of an effect depends on, and involves, the knowledge of its cause.

Given this axiom, if a finite mode falling under one attribute were to have God as its cause insofar as he is considered under a different attribute, i.e., if it were to be caused by a finite mode falling under a different attribute, then the knowledge of that mode would involve the knowledge of that other attribute. Since it does not, that mode cannot have God as its cause insofar as he is considered under some other attribute. In other words, it cannot be caused by a finite mode falling under some other attribute.

When applied to modes falling under those attributes of which we have knowledge – thought and extension – this has an enormously important consequence. There can be no causal interaction between ideas and bodies. This does not mean that ideas and bodies are unrelated to one another. Indeed, it is one of the best-known theses in the Ethics that the lines of causation that run among them are strictly parallel:

IIP7: The order and connection of ideas is the same as the order and connection of things.

In the demonstration of this proposition Spinoza says that it is a consequence of IA4 and leaves it at that. Nevertheless, it is apparent that this proposition has deep foundations in his substance monism. As thought and extension are not attributes of distinct substances, so ideas and bodies are not modes of distinct substances. They are “one and the same thing, but expressed two ways” (IIP7S). If ideas and bodies are one and the same thing, however, their order and connection must be the same. The doctrine of substance monism in this way insures that ideas and bodies, though causally independent, are causally parallel.

4. Mind and Cognition

It is at this point that Spinoza’s metaphysics touches upon his theory of mind and yields some of its most profound consequences. Most obviously, substance monism prohibits him from affirming the kind of dualism that Descartes affirmed, one in which mind and body are conceived as distinct substances. What is more, his contention that modes falling under different attributes have no causal interaction but are causally parallel to one another prohibits him from affirming that mind and body interact. Because he takes seriously the reality of the mental while rejecting dualism and eliminating interaction, Spinoza’s views on the mind are generally given a sympathetic hearing in a way that Descartes’s views are not.

a. The Mind as the Idea of the Body

To understand Spinoza’s account of the mind we must begin with IIP7. This proposition, together with its scholium, commits him to the thesis that for each finite mode of extension there exists a finite mode of thought that corresponds to it and from which it is not really distinct. More elaborately, it commits him to the thesis that (1) for each simple body there exists a simple idea that corresponds to it and from which it is not really distinct and (2) for each composite body there exists a composite idea that corresponds to it and from which it is not really distinct, composed, as it were, of ideas corresponding to each of the bodies of which the composite body is composed. Spinoza counts all of these ideas, whether simple or composite, as minds. In this respect he does not consider the human mind to be unique. It is simply the idea that corresponds to the human body.

In taking this position, Spinoza does not mean to imply that all minds are alike. As minds are expressions of the bodies to which they correspond in the domain of thought, some have abilities that others do not. Simply put, the greater the capacity of a body for acting and being acted upon, the greater the capacity of the mind that corresponds to it for perception. Spinoza elaborates:

[I]n proportion as a body is more capable than others of doing many things at once, or being acted on in many ways at once, so its mind is more capable than others of perceiving many things at once. And in proportion as the actions of a body depend more on itself alone, and as other bodies concur with it less in acting, so its mind is more capable of understanding distinctly. And from these [truths] we know the excellence of one mind over the others. (IIP13S)

Herein lies the explanation of the excellence of the human mind. The human body, as a highly complex composite of many simple bodies, is able to act and be acted upon in myriad ways that other bodies cannot. The human mind, as an expression of that body in the domain of thought, mirrors the body in being a highly complex composite of many simple ideas and is thus possessed of perceptual capacities exceeding those of other, non-human minds. Only a mind that corresponds to a body of complexity comparable to that of the human body can have perceptual abilities comparable to those of the human mind.

b. Imagination

A perceptual ability that is of particular interest to Spinoza is imagination. This he takes to be a general capacity of representing external bodies as present, whether they are actually present or not. Imagination thus includes more than the capacity to form those mental constructs that we normally consider to be imaginative. It includes memory and sense perception as well. Since it is clearly impossible to get around in the world without this, Spinoza concedes that it is “in this way [that] I know almost all the things that are useful in life” (TIE 22).

That being said, Spinoza consistently opposes imagination to intellect and views it as providing no more than confused perception. To use his preferred terminology, the ideas of the imagination are inadequate. They may be essential for getting around in the world, but they give us a distorted and incomplete picture of the things in it. To understand why, it is useful to begin with sense perception. This is the most important form of imaginative perception, and it is from this form that all others derive.

i. Sense Perception

On Spinoza’s account, sense perception has its origin in the action of an external body upon one or another of the sensory organs of one’s own body. From this there arises a complex series of changes in what amounts to the body’s nervous system. As the mind is the idea of the body, it will represent these changes. This, Spinoza contends, is what constitutes sense perception.

In order to explain how this act of representation yields perception of an external body, Spinoza appeals to the fact that the changed state of one’s body is a function both of the nature of one’s body and the nature of the external body that caused that state. Because of this, the mind’s representation of that state will express something more than the nature of one’s own body. It will express the nature of the external body as well:

IIP16: The idea of any mode in which the human body is affected by external bodies must involve the nature of the human body and at the same time the nature of the external body.

It is this feature of the mind’s act of representation – that it expresses the nature of an external body – that explains how such an act constitutes sense perception.

c. Inadequate Ideas

In view of this it is not difficult to see why Spinoza judges sense perception to be inadequate. Grounded as it is in the mind’s representation of the state of one’s own body rather than in the direct representation of external bodies, sense perception is indirect. Since this goes for all imaginative ideas, the problem with them all is the same:

IIP16C2: It follows, second, that the ideas which we have of external bodies indicate the condition of our own body more than the nature of the external bodies.

It is because of this that Spinoza refers to the ideas of the imagination as confused. The vision they give of external bodies is unavoidably colored, so to speak, by the lens of one’s own body.

Confusion, however, is just one aspect of the inadequacy of imaginative ideas. Such ideas are also mutilated. The reason for this lies in IA4, which states that the knowledge of an effect depends upon and involves the knowledge of its causes. This is a condition that imaginative ideas can never satisfy. The mind may contain the idea of an external body, but it cannot contain ideas of all of the causes of that body. These, being infinite, fall outside of its scope and are fully contained only in God’s infinite intellect. God’s ideas of bodies may be adequate, but ours are not. They are cut off from those ideas that are necessary in order to render them adequate.

d. Adequate Ideas

Although imaginative ideas of external bodies are the most important examples of inadequate ideas, they are not the only examples. Spinoza goes on to show that the mind’s ideas of the body, its duration, and its parts are all inadequate. So too is the mind’s idea of itself. Even so, he remains optimistic about the possibility of adequate ideas.

This optimism becomes evident as Spinoza shifts his attention from imaginative ideas of singular things to intellectual ideas of common things. These common things are things that are either common to all bodies or common to the human body and certain bodies by which the human body is regularly affected. Spinoza tells us little else about these common things, except to say that they are fully present in the whole and in each of the parts of every body in which they are present. Nevertheless, it is fairly certain that the class of things common to all bodies includes the attribute of extension and the infinite and eternal mode of motion and rest. What is included in the class of things common to the human body and those bodies by which the human body is regularly affected is not so certain. Whatever they turn out to be, however, Spinoza assures us that our ideas of them can only be adequate.

To see why, consider some thing, A, that is common to the human body and some body by which the human body is affected. A, Spinoza contends, will be fully present in the affection that arises in the human body as a result of the action of the external body, just as it is in the two bodies themselves. As a result, the mind, in possessing the idea of that affection, not only will have the idea of A, but its idea will be neither confused nor mutilated. The mind’s idea of A will be adequate.

This result is of utmost importance. Because any idea that follows from an adequate idea is itself adequate, these ideas, appropriately called common notions, can serve as axioms in a deductive system. When working out this system, the mind engages in a fundamentally different kind of cognition than when it engages in any of the various forms of imaginative perception. In all forms of imaginative perception the order of ideas mirrors the order of bodily affections, and this order, depending as it does upon the chance encounters of the body with external bodies, is entirely fortuitous. By contrast, the derivation of adequate ideas from common notions within a deductive system follows a wholly different order. This Spinoza calls the order of reason. The paradigm case is geometry.

e. Three Kinds of Knowledge

With this distinction between adequate and inadequate perception in place, Spinoza introduces a set of further distinctions. He begins with inadequate perception, which he now calls knowledge of the first kind, and divides it into two parts. The first consists of knowledge from random experience (experientia vaga). This is knowledge “from singular things which have been represented to us through the senses in a way which is mutilated, confused, and without order for the intellect”(P40S2). The second consists of knowledge from signs (ex signis), “for example, from the fact that, having heard or read certain words, we recollect things, and form certain ideas of them, like those through which we imagine the things”(P40S2). What links both of these forms of knowledge is that they lack a rational order. It is obvious that knowledge from random experience follows the order of the affections of the human body, but so does knowledge from signs. A Roman who hears the word ‘pomum‘, for instance, will think of an apple, not because there is any rational connection between the word and the object, but only because they have been associated in his or her experience.

When we reach what Spinoza calls the second kind of knowledge, reason (ratio), we have ascended from an inadequate to an adequate perception of things. This type of knowledge is gained “from the fact that we have common notions and adequate ideas of the properties of things” (P40S2). What Spinoza has in mind here is what was just indicated, namely, the formation of adequate ideas of the common properties of things and the movement by way of deductive inference to the formation of adequate ideas of other common properties. Unlike in the case of knowledge of the first kind, this order of ideas is rational.

We might think that in attaining this second kind of knowledge we have attained all that is available to us. However, Spinoza adds a third type, which he regards as superior. He calls this intuitive knowledge (scientia intuitiva) and tells us that it “proceeds from an adequate idea of the formal essence of certain attributes of God to the adequate knowledge of the [formal] essence of things”(P40S2). Unfortunately, Spinoza is once again obscure at a crucial junction, and it is difficult to know what he has in mind here. He seems to be envisioning a type of knowledge that gives insight into the essence of some singular thing together with an understanding of how that essence follows of necessity from the essence of God. Furthermore, the characterization of this kind of knowledge as intuitive indicates that the connection between the individual essence and the essence of God is grasped in a single act of apprehension and is not arrived at by any kind of deductive process. How this is possible is never explained.

Problems of obscurity aside, we can still see something of the ideal at which Spinoza is aiming. Inadequate ideas are incomplete. Through them we perceive things without perceiving the causes that determine them to be, and it is for this reason that we imagine them to be contingent. What Spinoza is offering with the third kind of knowledge is a way of correcting this. It is important to note, however, that he is not proposing that we can have this knowledge with respect to the durational existence of any particular item. As we have already seen, this would require having ideas of all of the temporal causes of a thing, which are infinite. Rather, he is proposing that we can have it with respect to the essence of a singular thing as it follows from the essence of God. To have this kind of knowledge is to understand the thing as necessary rather than contingent. It is, to use Spinoza’s famous phrase, to regard it sub quadam specie aeternitatis, under a certain aspect of eternity.

5. Psychology

One of the most interesting but understudied areas of Spinoza’s thought is his psychology, the centerpiece of which is his theory of the affects. Spinoza, of course, was not the first philosopher to take an interest in the affects. He had only to look to the work of Descartes and Hobbes in the previous generation and to the work of the Stoics before them to find sustained discussions of the topic. His own work shows that he learned much from these thinkers.

Despite his debts, Spinoza expressed deep dissatisfaction with the views of those who had preceded him. His dissatisfaction reflects the naturalistic orientation that he wished to bring to the subject:

Most of those who have written about the affects, and men’s way of living, seem to treat, not of natural things, which follow the common laws of Nature, but of things which are outside Nature. Indeed they seem to conceive man in Nature as a dominion within a dominion. For they believe that man disturbs, rather than follows, the order of Nature, that he has absolute power over his actions, and that he is determined only by himself. (III Preface)

In opposition to what he saw as a tendency on the part of previous philosophers to treat humans as exceptions to the natural order, Spinoza proposes to treat them as subject to the same laws and causal determinants as everything else. What emerges can best be described as a mechanistic theory of the affects.

a. Rejection of Free-Will

In working out this new perspective, the first thing on Spinoza’s agenda is to clear away what he sees as the most pervasive confusion that we as humans have about ourselves. This is the belief in free-will. Spinoza has nothing but scorn for this belief and treats it as a delusion that arises from the fact that the ideas we have of our actions are inadequate. “[M]en believe themselves to be free,” he writes, “because they are conscious of their own actions and are ignorant of the causes by which they are determined” (IIIP2S). If we were to acquire adequate ideas of our actions, since these would carry with them knowledge of their causes, we would immediately see this belief as the delusion that it is.

Spinoza’s position on this matter is quite obviously dictated by the determinism of his metaphysics. The mind, as a finite mode, is fully determined to be and to act by other finite modes. To posit a faculty of will by which it is made autonomous and independent of external causal determinants is to remove it from nature. Spinoza will have none of this. As it is fully part of nature, the mind must be understood according to the same principles that govern all modes.

b. The Conatus Principle

The first and most important of these principles is what has come to be known as the Conatus Principle:

IIIP6: Each thing, as far as it can by its own power, strives to persevere in being.

The correct interpretation of this principle is far from clear, but it appears to posit a kind of existential inertia within modes. Each mode, to the extent of its power, so acts as to resist the destruction or diminution of its being. Spinoza expresses this by saying that each mode has an innate striving (conatus) to persevere in being. This striving is so central to what a mode is that he identifies it as a mode’s very essence:

IIIP7: The striving by which each thing strives to persevere in its being is nothing but the actual essence of the thing.

Though a bit mysterious as to what it means to say that the striving of a mode is its essence, this identification will play a key role in Spinoza’s ethical theory. Among other things, it will provide the basis upon which he can determine what is involved in living by the guidance of reason.

c. The Affects

Spinoza begins his account of the affects with those that result from the action of external causes upon the mind. These are the passive affects, or passions. He identifies three as primary – joy, sadness, and desire – and characterizes all others as involving a combination of one or more of these together with some kind of cognitive state. Love and hate, for example, are joy and sadness coupled with an awareness of their respective causes. Longing, for example, is desire coupled with a memory of the desired object and an awareness of its absence. All remaining passions are characterized in a similar fashion.

Although joy, sadness, and desire are primitive, they are each defined in relation to the mind’s striving for perseverance. Joy is that affect by which the mind passes to a greater perfection, understood as an increased power of striving. Sadness is that affect by which the mind passes to a lesser perfection, understood as a decreased power of striving. And desire is the striving for perseverance itself insofar as the mind is conscious of it. Because all passions are derived from these primary affects, the entire passional life of the mind is thus defined in relation to the striving for perseverance.

This may seem paradoxical. Insofar as the mind strives to persevere in being it would appear to be active rather than passive. This is true, but we must realize that the mind strives both insofar as it has adequate ideas and insofar as it has inadequate ideas. The passions are defined only in relation to the mind’s striving insofar as it has inadequate ideas. In fact, the passions are themselves a species of inadequate ideas. And since all inadequate ideas are caused from without, so too are the passions. It is in this respect that they must be considered to be passive rather than active.

This, however, is not the case with those affects that are defined in relation to the mind’s striving insofar as it has adequate ideas. All such affects, being themselves a species of adequate ideas, are active. Mirroring his analysis of the passions, Spinoza takes two of these as primitive – active joy and active desire – and treats the remainder as derivative. (He does not acknowledge the possibility of an active form of sadness, since the diminishment of the mind’s perfection, which is what is involved in sadness, can only occur through the action of external causes.) In doing so, he posits an element within the affective life that is not only active, but, because it is grounded in the mind’s striving insofar as it has adequate ideas, is fully rational. It is a central concern of Spinoza’s ethical program to maximize this element.

d. Bondage

That Spinoza would wish to maximize the active affects is understandable in light of his characterization of life led under the sway of the passions. Such a life is one in which the individual exercises little effective self-control and is buffeted by external circumstances in ways that are largely random. “The man who is subject to the [passive] affects,” Spinoza writes, “is under the control, not of himself, but of fortune, in whose power he so greatly is that often, though he sees the better for himself, he is still forced to follow the worse” (IV Preface). Life under the sway of the passions is a life of bondage.

Unfortunately, the extent to which we can extricate ourselves from the sway of the passions is limited. There are two reasons for this. The first is that the mind is a mode of limited power, yet it is inserted into an order of nature in which there exists an infinite number of modes whose power surpasses its own. To think that the mind can exist unaffected within this order is to assume, falsely, that it is endowed with infinite power or that nothing in nature acts upon it. The second, which is a specification of the first, is that an affect is not restrained merely because it is opposed by reason. It must be opposed by an affect that is stronger than it. The trouble is that reason often lacks this affective power. This is because the strength of the active affects, which pertain to reason, is a function of the strength of the mind alone, whereas the strength of the passive affects, the passions, is a function of the strength of their external causes, which in many cases is greater. In such cases reason is unable to overrule passion and is impotent as a guide. “With this,” Spinoza concludes, “I have shown the cause why men are moved more by opinion than by true reason, and why the true knowledge of good and evil arouses disturbances of the mind, and often yields to lust of every kind” (IV17S). Such is the life of bondage.

6. Ethics

It is from this rather pessimistic diagnosis of the human condition that Spinoza’s ethical theory takes off. In view of this, it is not at all surprising that his ethics is largely one of liberation, a liberation that is directly tied to the cultivation of reason. In this respect, Spinoza’s ethical orientation is much more akin to that of the ancients than to that of his fellow moderns. Like the ancients, he sought not so much to analyze the nature and source of moral duty as to describe the ideal human life. This is the life that is lived by the so-called ‘free-man’. It is a life of one who lives by the guidance of reason rather than under the sway of the passions.

a. Freedom from the Passions

In the opening propositions of Book Five, Spinoza lists a number of respects in which the mind, despite its condition of bondage, is able to weaken the hold that the passions have over it. Generally speaking, it is able to do this insofar as it acquires adequate ideas. This, Spinoza tells us, is due to the fact that “the power of the mind is defined by knowledge alone, whereas lack of power, or passion, is judged solely by the privation of knowledge, that is, by that through which ideas are called inadequate” (VP20S). Two examples illustrate this liberating power of adequate ideas.

First, Spinoza claims that the mind is able to form adequate ideas of its affects. It can thus form adequate ideas of the passions, which are themselves inadequate ideas. Since there is no real distinction between an idea and the idea of that idea, those passions of which the mind forms adequate ideas are thereby dissolved.

Second, the effect of a thing upon the mind is lessened to the extent that it is understood to be necessary rather than contingent. We tend, for example, to be saddened less by the loss of a good when we understand that its loss was inevitable. Similarly, we tend to be angered less by another person’s actions when we understand that he or she could not have done otherwise. Since adequate ideas present things as necessary rather than as contingent, the acquisition of such ideas thereby lessens their effect upon the mind.

As these examples illustrate, the mind’s power over the passions is a function of the adequate ideas that it possess. Liberation lies in the acquisition of knowledge, which empowers the mind and renders it less susceptible to external circumstances. In taking this position, Spinoza places himself in a long tradition that stretches back to the Stoics and ultimately to Socrates.

b. Conatus and the Guidance of Reason

Spinoza tells us that the model human life – the life lived by the ‘free-man’ – is one that is lived by the guidance of reason rather than under the sway of the passions. This tells us very little, however, unless we know what it is that reason prescribes. In order to make this determination, Spinoza falls back upon the mind’s striving for perseverance:

Since reason demands nothing contrary to Nature, it demands that everyone love himself, seek his own advantage, what is really useful to him, want what will really lead a man to greater perfection, and absolutely, that everyone should strive to preserve his own being as far as he can. This, indeed, is as necessarily true as that the whole is greater than its part. (IVP18S)

Reason’s prescription is egoistic. We are to act in accordance with our nature. But since our nature is identical to our striving to persevere in being, reason prescribes that we do whatever is to our advantage and seek whatever aids us in our striving. To act this way, Spinoza insists, is to act virtuously.

This does not mean that in living by the guidance of reason we necessarily place ourselves at odds with others. Reason prescribes that individuals seek whatever aids in the striving for perseverance. But since the goods that are necessary in order to persevere in being are attainable only within the context of social life, reason dictates that we act in ways that are conducive to the stability and harmony of society. Spinoza goes so far as to say that in a society in which everyone lives by the guidance of reason, there would be no need of political authority to restrict action. It is only insofar as individuals live under the sway of the passions that they come into conflict with one another and are in need of political authority. Those who live by the guidance of reason understand this and recognize that authority as legitimate.

c. Knowledge of God as the Highest Good

Spinoza’s contention that those who live by the guidance of reason will naturally live in harmony with one another receives some support from his view of the highest good for a human. This is the knowledge of God. Since this knowledge can be possessed equally by all who seek it, it can be sought by all without drawing any into conflict.

To establish that the knowledge of God is the highest good, Spinoza again appeals to the fact that the mind’s striving is its essence. Since what follows from the mind’s essence alone are adequate ideas, this allows him to construe the mind’s striving as a striving for adequate ideas. It is a striving for understanding:

IVP26: What we strive for from reason is nothing but understanding; nor does the mind, insofar as it uses reason, judge anything else useful to itself except what leads to understanding.

From here it is but an easy step to show that the knowledge of God is the mind’s greatest good. As an infinite substance, God is the greatest thing that can be conceived. Moreover, since everything other than God is a mode of God, and since modes can neither be nor be conceived without the substance of which they are modes, nothing else can be or be conceived apart from God. Spinoza concludes:

IVP28: Knowledge of God is the mind’s greatest good: its greatest virtue is to know God.

The knowledge of God is the fulfillment of the mind’s striving to persevere in being.

d. Intellectual Love of God and Human Blessedness

In elaborating this thesis, Spinoza specifies this knowledge as knowledge of the third kind. This is the knowledge that proceeds from the adequate idea of one or another of God’s attributes to the adequate idea of the formal essence of some singular thing that follows from that attribute. When we possess knowledge of the third kind, we possess adequate perception of God’s essence considered not only in itself, but as the immanent causal power of the particular modifications to which it is subject. Knowledge of the first kind, because it is inadequate, and knowledge of the second kind, because it is restricted to the common properties of things, both fail to give us this.

In attaining the third kind of knowledge the mind passes to the highest state of perfection that is available to it. As a result, it experiences active joy to the greatest possible degree. More importantly, since it is by this kind of knowledge that the mind understands God to be the cause of its own perfection, it gives rise to an active love for God as well. This Spinoza refers to as the intellectual love of God. It is the affective correlate to the third kind of knowledge.

The intellectual love of God turns out to have a great many unique properties. Among other things, it is entirely constant, it has no contraries, and it is the very love by which God loves himself. Most significantly, it constitutes the blessedness of the one who possesses it. When such a love dominates one’s affective life, one attains the serenity and freedom from passion that is the mark of wisdom. Spinoza thus writes of the person who has attained this love that he “is hardly troubled in spirit, but being, by a certain eternal necessity, conscious of himself, and of God, and of things, he never ceases to be, but always possess true peace of mind” (VP42S). This is human blessedness.

e. Eternity of the Mind

Spinoza’s comment that a person who has attained the intellectual love of God “never ceases to be” is perplexing to say the least. It signals a commitment to the view that in some fashion or another the mind, or some part of it, survives the death of the body:

VP23: The human mind cannot be absolutely destroyed with the body, but something of it remains which is eternal.

At first sight, this appears to be in violation of Spinoza’s anti-dualist contention that mind and body are one and the same thing conceived under two different attributes. On the basis of this contention, one would expect him to reject the survival of the mind in any fashion. That he asserts it instead has understandably been a source of great controversy among his commentators.

At least some of the problem can be cleared away by taking account of a crucial distinction that Spinoza makes between the existence of the body and its essence. The existence of the body is its actual duration through time. This involves its coming to be, the changes it undergoes within its environment, and its eventual destruction. By contrast, the essence of the body is non-durational. It is grounded in the timeless essence of God, specifically as one among the innumerable particular ways of being extended.

The importance of this distinction lies in the fact that, by appealing to the parallelism doctrine, Spinoza can conclude that there is a corresponding distinction with respect to the mind. There is an aspect of the mind that is the expression of the existence of the body, and there is an aspect of the mind that is the expression of the essence of the body. Spinoza readily concedes that the aspect of the mind that expresses the existence of the body cannot survive the destruction of the body. It is destroyed with the destruction of the body. Such, however, is not the fate of the aspect of the mind that expresses the essence of the body. Like its object, this aspect of the mind is non-durational. Since only what is durational ceases to be, this aspect of the mind is unaffected by the destruction of the body. It is eternal.

Here we must be careful not to misunderstand what Spinoza is saying. In particular, we should not take him to be offering anything approaching a full-blooded doctrine of personal immortality. In fact, he dismisses the belief in personal immortality as arising from confusion: “If we attend to the common opinion of men, we shall see that they are indeed conscious of the eternity of their mind, but that they confuse it with duration, and attribute it to the imagination, or memory, which they believe remains after death” (VP34S). Individuals have some awareness of the eternity of their own minds. But they mistakenly believe that this eternity pertains to the durational aspect of the mind, the imagination. As it is the imagination, inclusive of memory, that constitutes one’s unique identity as a person, the belief in personal immortality is similarly mistaken.

None of this is to say that Spinoza’s doctrine of the eternity of the mind has no relevance to ethics. Although the imagination is not eternal, the intellect is. And since the intellect is constituted by the mind’s store of adequate ideas, the mind is eternal precisely to the extent that it has these ideas. As a consequence, a person whose mind is constituted largely by adequate ideas participates more fully in eternity than a person whose mind is constituted largely by inadequate ideas. So, while Spinoza offers us no hope of personal immortality, we may take consolation in the fact that “death is less harmful to us, the greater the mind’s clear and distinct knowledge, and hence, the more the mind loves God” (VP38S).

f. Conclusion

Spinoza does not pretend that any of this is easy. The acquisition of adequate ideas, especially those by which we attain knowledge of the third kind, is difficult, and we can never completely escape the influence of the passions. Nevertheless, Spinoza holds out to those who make the effort the promise, not of personal immortality, but of participation in eternity within this life. He closes the Ethics with these words:

If the way I have shown to lead to these things now seems very hard, still, it can be found. And of course, what is found so rarely must be hard. For if salvation were at hand, and could be found without great effort, how could nearly everyone neglect it? But all things excellent are as difficult as they are rare. (VP42S)

7. References and Further Reading

All passages from the texts of Spinoza are taken from the translations appearing in The Collected Works of Spinoza. Vol.I. Edited and translated by Edwin Curley. (Princeton: Princeton University Press, 1985). Passages from the Ethics are cited according to Book (I – V), Definition (D), Axiom (A), Proposition (P), Corollary (C), and Scholium (S). (IVP13S) refers to Ethics, Book IV, Proposition 13, Scholium. Passages from the Treatise on the Emendation of the Intellect are cited according to paragraph number. (TIE 35) refers to Treatise on the Emendation of the Intellect, paragraph 35.

All passages from the texts of Descartes are taken from the translations appearing in The Philosophical Writings of Descartes. 2 Vols. Edited and translated by John Cottingham, Robert Stoothoff & Dugald Murdoch (Cambridge: Cambridge University Press, 1985). Passages are cited according to volume and page number. (CSM II,23) refers to Cottingham, Stoothoff & Murdoch, Volume II, page 23.

a. Texts and Translations of Spinoza

  • Spinoza Opera. 4 Vols. Edited by Carl Gebhart. (Heidelberg: Carl Winter, 1925).
    • Standard critical edition of Spinoza’s writings and correspondence in Latin and Dutch.
  • The Collected Works of Spinoza. Vol. I. Edited and translated by Edwin Curley. (Princeton University Press, 1985).
    • First of two volumes (the second is not yet complete) in what, when complete, will become the standard translation into English of Spinoza’s writings and correspondence.
  • A Spinoza Reader: The Ethics and Other Works. Edited and translated by Edwin Curley. (Princeton: Princeton University Press, 1994).
    • Useful reader that contains the entire text of the Ethics, as well as substantial selections from the Treatise on the Emendation of the Intellect, the Short Treatise, and theTheological-Political Treatise. Also contains helpful selections from Spinoza’s correspondence.
  • Baruch Spinoza: The Complete Works. Edited by Michael L. Morgan and translated by Samuel Shirley. (Indianapolis: Hackett, 2002).
    • Only modern translation into English of Spinoza’s complete works, including his correspondence.

b. General Studies Suitable as Introductions

  • Allison, Henry. Benedict de Spinoza: An Introduction. (New Haven: Yale UP, 1987).
  • Curley, Edwin. Behind the Geometrical Method. (Princeton: Princeton University Press, 1988).
  • Lloyd, Genevieve. Spinoza and the “Ethics”. (London: Routledge, 1996).
  • Hampshire, Stuart. Spinoza. (London: Penguin, 1951).
  • Steinberg, Diane, On Spinoza. (Belmont, CA: Wadsworth, 2000).

c. More Advanced and Specialized Studies

  • Bennett, Jonathan. A Study of Spinoza’s “Ethics”. (Indianapolis: Hackett, 1984).
  • De Dijn, Herman. Spinoza: The Way to Wisdom. (West Lafayette, IN: Purdue University Press, 1996).
  • Della Rocca, Michael. Representation and the Mind-Body Problem in Spinoza. (Oxford: Oxford University Press, 1996).
  • Donagan, Alan. Spinoza. (Chicago: University of Chicago Press, 1988). Curley, Edwin. Spinoza’s Metaphysics: An Essay in Interpretation. (Cambridge: Harvard University Press, 1969).
  • Delahunty, R.J. Spinoza. (London: Routledge & Kegan Paul, 1985).
  • Lloyd, Genevieve, Part of Nature: Self-Knowledge in Spinoza’s Ethics. (Ithaca, NY: Cornell University Press, 1994).
  • Mark, Thomas Carson. Spinoza’s Theory of Truth. (New York: Columbia University Press, 1972).
  • Mason, Richard. The God of Spinoza: A Philosophical Study. (Cambridge: Cambridge University Press, 1997).
  • Nadler, Steven. Spinoza: A Life. (Cambridge: Cambridge University Press, 1999).
  • Nadler, Steven. Spinoza’s Heresy. (Oxford: Oxford University Press, 2001).
  • Wolfson, Harry Austryn. The Philosophy of Spinoza. 2 Vols. (Cambridge: Harvard University Press, 1934).
  • Woolhouse, R.S. Descartes, Spinoza, Leibniz: The Concept of Substance in Seventeenth Century Metaphysics. (London: Routledge, 1993).
  • Yovel, Yrmiyahu, Spinoza and Other Heretics. Vol.I: The Marrano of Reason; Vol.II: The Adventures of Immanence. (Princeton: Princeton University Press, 1989).

d. Collected Essays on Spinoza

  • Chappell, Vere (ed.). Baruch de Spinoza. (New York: Garland Publishing, 1992).
  • Curley, Edwin and Pierre-François Moreau (eds.). Spinoza: Issues and Directions. (Leiden: E.J. Brill, 1990).
  • Freeman, Eugene and Maurice Mandelbaum (eds.). Spinoza: Essays in Interpretation. (LaSalle, IL: Open Court, 1975).
  • Garrett, Don (ed.). The Cambridge Companion to Spinoza. (Cambridge: Cambridge University Press, 1996).
  • Grene, Marjorie (ed.). Spinoza: A Collection of Critical Essays. (Garden City, NY: Doubleday/Anchor Press, 1973).
  • Grene, Marjorie and Debra Nails (eds.). Spinoza and the Sciences. (Dordrecht: Reidel, 1986).
  • Kennington, Richard (ed.). The Philosophy of Baruch Spinoza. (Washington DC: Catholic University Press, 1980).
  • Lloyd, Genevieve (ed.). Spinoza Critical Assessments, 4 Vols. (London: Routledge, 2001).
  • Shanan, Robert and J.I. Biro (eds.). Spinoza: New Perspectives. (Norman, OK: University of Oklahoma Press, 1978).

Author Information

Blake D. Dutton
Email: bdutton@luc.edu
Loyola University Chicago
U. S. A.

Western Concepts of God

Western concepts of God have ranged from the detached transcendent demiurge of Aristotle to the pantheism of Spinoza. Nevertheless, much of western thought about God has fallen within some broad form of theism. Theism is the view that there is a God which is the creator and sustainer of the universe and is unlimited with regard to knowledge (omniscience), power (omnipotence), extension (omnipresence), and moral perfection. Though regarded as sexless, God has traditionally been referred to by the masculine pronoun.

Concepts of God in philosophy are entwined with concepts of God in religion. This is most obvious in figures like Augustine and Aquinas, who sought to bring more rigor and consistency to concepts found in religion. Others, like Leibniz and Hegel, interacted constructively and deeply with religious concepts. Even those like Hume and Nietzsche, who criticized the concept of God, dealt with religious concepts. While Western philosophy has interfaced most obviously with Christianity, Judaism and Islam have had some influence. The orthodox forms of all three religions have embraced theism, though each religion has also yielded a wide array of other views. Philosophy has shown a similar variety. For example, with regard to the initiating cause of the world, Plato and Aristotle held God to be the crafter of uncreated matter. Plotinus regarded matter as emanating from God. Spinoza, departing from his judaistic roots, held God to be identical with the universe, while Hegel came to a similar view by reinterpreting Christianity.

Issues related to Western concepts of God include the nature of divine attributes and how they can be known, if or how that knowledge can be communicated, the relation between such knowledge and logic, the nature of divine causality, and the relation between the divine and the human will.

Table of Contents

  1. Sources of Western Concepts of God
  2. Historical Overview
    1. Greeks
    2. Early Christian Thought
    3. Medieval Thought
    4. Renaissance Thought
    5. Enlightenment
    6. Modern Period
  3. Divine Attributes
    1. Incorporeality
    2. Simplicity
    3. Unity
    4. Eternity
    5. Immutability
    6. Omnipotence
    7. Omniscience
    8. Impassibility
    9. Goodness
  4. References and Further Reading

1. Sources of Western Concepts of God

Sources of western concepts of the divine have been threefold: experience, revelation, and reason. Reported experiences of God are remarkably varied and have produced equally varied concepts of the divine being. Experiences can be occasioned by something external and universally available, such as the starry sky, or by something external and private, such as a burning bush. Experiences can be internal and effable, such as a vision, or internal and ineffable, as is claimed by some mystics. Revelation can be linked to religious experience or a type of it, both for the person originally receiving it and the one merely accepting it as authoritative. Those who accept its authority typically regard it as a source of concepts of the divine that are more detailed and more accurate than could be obtained by other means. Increasingly, the modern focus has been on the complexities of the process of interpretation (philosophical hermeneutics) and the extent to which it is necessarily subjective. Revelation can be intentionally unconnected to reason such that it is accepted on bare faith (fideism; compare Kierkegaard), or at the other extreme, can be grounded in reason in that it is accepted because and only insofar as it is reasonable (compare Locke). Reason has been taken as ancillary to religious experience and revelation, or on other accounts, as independent and the sole reliable source of concepts of God.

Each of the three sources of concepts of God has had those who regard it as the sole reliable basis of our idea of the divine. By contrast, others have regarded two or three of the sources as interdependent and mutually reinforcing. Regardless of these differing approaches, theism broadly construed has been a dominant theme for much of the history of Western thought.

2. Historical Overview

a. Greeks

At the dawn of philosophy, the Ionian Greeks sought to understand the true nature of the cosmos and its manifestations of both change and permanence. To Heraclitus, all was change and nothing endured, whereas to Parmenides, all change was apparent. The Pythagoreans found order and permanence in mathematics, giving it religious significance as ultimate being. The Stoics identified order with divine reason.

To Plato, God is transcendent-the highest and most perfect being-and one who uses eternal forms, or archetypes, to fashion a universe that is eternal and uncreated. The order and purpose he gives the universe is limited by the imperfections inherent in material. Flaws are therefore real and exist in the universe; they are not merely higher divine purposes misunderstood by humans. God is not the author of everything because some things are evil. We can infer that God is the author of the punishments of the wicked because those punishments benefit the wicked. God, being good, is also unchangeable since any change would be for the worse. For Plato, this does not mean (as some later Christian thought held) that God is the ground of moral goodness; rather, whatever is good is good in an of itself. God must be a first cause and a self-moved mover otherwise there will be an infinite regress to causes of causes. Plato is not committed to monotheism, but suggests for example that since planetary motion is uniform and circular, and since such motion is the motion of reason, then a planet must be driven by a rational soul. These souls that drive the planets could be called gods.

Aristotle made God passively responsible for change in the world in the sense that all things seek divine perfection. God imbues all things with order and purpose, both of which can be discovered and point to his (or its) divine existence. From those contingent things we come to know universals, whereas God knows universals prior to their existence in things. God, the highest being (though not a loving being), engages in perfect contemplation of the most worthy object, which is himself. He is thus unaware of the world and cares nothing for it, being an unmoved mover. God as pure form is wholly immaterial, and as perfect he is unchanging since he cannot become more perfect. This perfect and immutable God is therefore the apex of being and knowledge. God must be eternal. That is because time is eternal, and since there can be no time without change, change must be eternal. And for change to be eternal the cause of change-the unmoved mover-must also be eternal. To be eternal God must also be immaterial since only immaterial things are immune from change. Additionally, as an immaterial being, God is not extended in space.

The Neo-Platonic God of Plotinus (204/5-270 A.D.) is the source of the universe, which is the inevitable overflow of divinity. In that overflow, the universe comes out of God (ex deo) in a timeless process. It does not come by creation because that would entail consciousness and will, which Plotinus claimed would limit God. The first emanation out of God (nous) is the highest, successive emanations being less and less real. Finally, evil is matter with no form at all, and as such has no positive existence. God is an impersonal It who can be described only in terms of what he is not. This negative way of describing God (the via negativa) survived well into the middle ages. Though God is beyond description, Plotinus (perhaps paradoxically) asserted a number of things, such as that virtue and truth inhere in God. Because for Plotinus God cannot be reached intellectually, union with the divine is ecstatic and mystical. His thought influenced a number of Christian mystics, such as Meister Eckhart (1260-1327).

b. Early Christian Thought

Early Christians regarded Greek religion as holding views unworthy of God, but they were divided as to Greek philosophy. Christian philosopher Justin Martyr (c. 100-c. 165) saw Christianity as compatible with the highest and best Greek thought, whereas Tertullian (c. 160-c. 225) dismissed philosophy, saying that Jerusalem (faith) could have nothing to do with Athens (philosophy).

Having been born out of Judaism, Christianity was unambiguously monotheistic and affirmed that God created the material of the universe out of nothing (ex nihilo). But it also affirmed the Trinity as multiplicity within unity, a view it regarded as implicit in Judaism.

Consistent with theism, Augustine (354-430) regarded God as omniscient, omnipotent, omnipresent, morally good, the creator (ex nihilo) and sustainer of the universe. Despite these multiple descriptors, God is uniquely simple. Being entirely free, he did not have to create, but did so as an act of love. As his creation, it reflects his mind. Time and space began at creation, and everything in creation is good. Evil is uncreated, being a lack of good and without positive existence. Though God is not responsible for evil even it has a purpose: to show forth what is good, especially what is good within God. Augustine developed a theme found as early as Plato, Aristotle, and Zeno of Citium, that God is a perfect being. After enumerating a hierarchy of excellencies (things to be “preferred”) Augustine affirms that God “lives in the highest sense” and is “the most powerful, most righteous, most beautiful, most good, most blessed” (On the Trinity, XV, 4). When we think of God, we “attempt to conceive something than which nothing more excellent or sublime exists” (Christian Doctrine, I, 7, 7). But where Aristotle concluded that the greatest being must be aware only of himself, Augustine emphasized an opposite and distinctly Christian theme: God loves creatures supremely to the point of becoming incarnate in Christ in order to be revealed to them and to reconcile them to himself. Moreover, God is providentially active in history, from an individual level (Confessions) on up to dealings with entire nations (City of God). So as to the important subject of God’s relationship to the world, Christian thought could not be more opposite Aristotle’s view of a Being who contemplates only himself.

John Scotus Erigena (c. 810-c.877) had stronger affinities for Neo-Platonic thought. God created the universe according to eternal patterns in his mind and it is an expression of his thought, however incomplete an expression the cosmos may be. Erigena’s pantheistic tendencies can be seen in his notion that God creates out of himself and “God is in all things.” Creation is not in time but is eternal. In the process God used universals and made them particulars (e.g., humanity became individual persons). Immortality is the reverse process of particulars going back to universals. In Erigena’s terms, division is the process of differentiating universals into particulars; analysis is the reverse, a return to unity and thus to God. These are not mere mental activities but mirror reality and God’s relationship to the world. God is ultimately unknowable, being beyond all language and categories. Aristotle’s predicates and categories cannot apply to God because they assume some type of substance. Nevertheless God can be described, albeit inadequately, using both positive and negative statements. Positive statements are only approximate but can be made more exact by adding negative statements. For example, it can be said that God is good (positive), but also that he is not good (negative) in that he is above goodness. These can be combined in the statement that he is “supergood.” In spite of these approximations, God must be reached by mystical experience.

c. Medieval Thought

Islamic Neoplatonist al-Farabi (875-950) held that universals are in things and have no existence apart from particulars. Objects are contingent in that they may or may not exist; they do not have to exist. Therefore there must be something that has to exist-that exists necessarily-to ground the existence of all other (contingent) things. This being is God. The world evolves by emanation, and matter is a phase of that process. The potential in matter is made actual, and over time God brings out its form. Thought is one emanation from God, and through it knowledge arises in humans. The actualized human intellect becomes an immortal substance.

Avicenna (Ibn Sina; 980-1037), a Muslim, also distinguished between God as the one necessary being and all other things, which are contingent. The world is an emanation from God as the outworking of his self-knowledge. As such it is eternal and necessary. God must be eternal and simple, existing without multiplicity. In their essence, things do not contain anything that accounts for their existence. They are hierarchically arranged such that the existence of each thing is accounted for by something ontologically higher. At the top is the one being whose existence is necessary. From contingent things we come to know universals, whereas God knows universals prior to their existence in things.

Al-Ghazali (1058-1111) challenged any joining of theology and philosophy, holding that because the mind and senses are subject to error, truth must come by divine grace. Rather than the world existing necessarily in a Neoplatonic sense, it exists by the will of God alone. It is in no way autonomous, and even causal relationships are non-necessary. He rejected as un-Islamic Avicenna’s view that things like souls or intellects could be eternal.

Anselm (1033-1109), archbishop of Canterbury, raised the perfect being concept to a new level by making it the foundation of his celebrated ontological argument. He accepted that God is the highest level of being under which there are, by degrees, lesser and lesser beings. Similar to Plato, Anselm assumes the realist view that entities which share an attribution, such as “good,” also share in being. And somewhere there must be a perfection of that being (e.g., perfect goodness). That perfection is God.

Though a Muslim and an Aristotelian, Averroes (Ibn Rushd; 1126-1198) added to the growing concept of emanation by claiming that the universal mind is an emanation from God. Humans participate in this universal mind and only it, not the soul, is immortal. The mind of the common person understands religious symbols in a literal way, whereas the philosopher interprets them allegorically. Consequently, something understood as true philosophically may be untrue theologically, and vice versa.

Working from Judaism, Maimonides (1135-1204) accepted creation rather than an eternal universe. He drew from philosophic traditions to formulate three proofs based on the nature of God, and these were developed further by Aquinas. Following Aristotle Maimonides demonstrated the existence of a Prime Mover, and with some inspiration from Avicenna, the existence of a necessary being. He also showed God to be a primary cause. Though he considered God’s existence demonstrable, he held that nothing positive could be said about God.

Bonaventura (John of Fidanza, c. 1221-1274) argued that the Aristotlean denial of Platonic ideas would entail that God knows himself but not the world. As such God could not be its creator. Furthermore, because some change in the universe is cyclic and therefore unexplainable by chance, change would have to be deterministic. But this would deny God’s providence as well as human moral responsibility. So a proper concept of God must include Platonic ideas. Reason can prove God as creator since an eternal universe entails both that the amount of time of its existence is infinite and that it is increasing. Yet there cannot be both an infinite and a larger infinite (a view not held in modern times).

Thomas Aquinas (1225-1274) accepted both Aristotle and Christian revelation. He accepted both reason and revelation as sources of knowledge of God. Over the neo-Platonic notion of a hierarchy of reality in which lower existences are less real and a mere shadow of the divine, Aquinas accepted gradations of form and matter. Atop the hierarchy is God as pure form and no matter. As pure actuality and no potentiality, he is perfect and therefore changeless. He is also pure intelligence and pure activity. To these Aristotelian concepts Aquinas added Christian convictions that God is loving, providential, and ruler of the universe. Reason and revelation are in harmony because they have the same divine source, and revelation is not unreasonable. Perception is also in harmony because the world’s origins are divine. This being the case, God as cause can be known through the world as effect. For this reason empirical facts ground Aquinas’s theistic proofs.

The God that can be known in part from the universe is fundamentally different from it. Only God is identical to his essence, being neither more nor less than it. By contrast, a being such as Socrates is transcended by humanity because there are other people. On the other hand, Socrates has qualities (“accidents”) that are not part of his essence; for example, he may be sitting. So unlike God, Socrates is both greater than and less than his essence. There is nothing that transcends God so nothing is greater than his essence. And there are no accidents in God because accidents are caused by something else (just as part of the cause of Socrates sitting is a chair).

God is not (completely) knowable because he is not material, whereas our knowledge is normally dependent on our senses. Furthermore, we normally know things by knowing their genus and species, yet God is unique and so cannot be known in that way. We can know something of God the negative way (via negativa) by removing limits, concluding for example, that God is unmoved, and unlimited by space. What we can know of God positively is neither exactly like our knowledge of temporal things (univocal) nor entirely different (equivocal). Rather, it is analogical, being in some ways the same and in other ways different. God knows x in a way that is both like and unlike the way in which Socrates knows x. God knows, but in a way that is, among other things, complete, immediate, and timeless.

That God created is evident (though not provable) because a material universe cannot emanate from an immaterial being. The universe exists to manifest God, who created the fullest possible range of beings because in them he can be revealed to the fullest extent. Beings range from angels, who are immaterial; to humans, who are material and immaterial; to animals, who are purely material (and both eat and move); to plants, to inanimate objects.

God as primary cause works through such created things as secondary causes. Nevertheless, creatures with a will remain free and responsible. God can also work apart from secondary causes in what we call miracles. Being good, God created the best possible world in the sense that it has the best kinds of things. Evil is a privation or lack of good and as such God did not cause it the way he causes other things. So we cannot ask why God brought about evil, but we can ask why he did not bring about more good. He did not bring about more good in order that he could be revealed through the greatest range of things, and as well, to allow for certain types of good (such as compassion, which can exist only where there is some suffering).

Aquinas and others grounded the scholastic synthesis of knowledge in the view that truth, morality, and God himself could be known by reason because the divine will itself is guided by reason. What is reasonable is therefore what is true and right. But John Duns Scotus (1265-1308) claimed that in humans and in God it is the will–not the intellect–that is primary. Evidence of this is that a being must will what to think about, thus something must act on the intellect; whereas nothing need act on the will. The view entails that there is no reason why God acts or wills as he does. This makes truth and morality essentially arbitrary and thereby unknowable through reason. God could have willed different moral standards. Scotus’s view makes our knowledge of God a matter of revelation and faith, not of reason.

Another concept about God’s will further destabilized the medieval world view. William of Ockham (1285-1347) held that omnipotence means God can do literally anything. Accordingly, a person could perceive something by sheer act of divine will, without the object being there at all. On his view, faith and reason can be contradictory. Ockham’s “razor” sought to cut from explanations those entities that are unverifiable thereby making simpler explanations preferred. This was later used to cut out of world views such things as divine purposes, which had been central to explanations since the Greeks. Eventually, even concepts of a divine being would be optional–or even unnecessary–to explanations and world views.

The connection between reason and God was further undermined by Meister Eckhart’s (1260-1327/28) view that God is “above being” and that human unity with the divine must be suprarational. Knowledge is a matter of proceeding from particulars to unity, beyond which is a unity with the divine surpassing all differences, “a silent desert.” The divine being is therefore inexpressible. God knows all things in their unity, timelessly; but on our temporal level it makes sense to differentiate time as well as events.

d. Renaissance Thought

God moved out of the intellectual center of knowledge as faith was no longer grounded in reason and reason was no longer supervised by faith. The power of the church waned and society found inspiration in the classical world. Interest in this life and the world drove interest in science, which soon uncovered mathematically describable physical regularities. This development shaped the concept of God in a way that further undermined the Aristotelian world view, with its emphasis on such things as divine purpose. Regularities such as those discovered in Kepler’s laws of planetary motion and Newton’s laws implied a supreme engineer. Early in these developments, Giordano Bruno (1548-1600) emphasized God as immanent in the universe as an active principle, a trend in the conception of God that would increase along with the ever more detailed understanding of natural processes to be achieved in the scientific revolution.

The Reformation period saw an emphasis on divine sovereignty over human affairs as a corollary to its emphasis on fallen humanity’s inability to achieve a right standing with God. If humans cannot come to God unaided, then it is God who must choose some to be right with him. Since the Reformers affirmed that divine choice cannot be based on merit, love must be the central divine attribute operating in salvation. This view of divine predestination brought new questions, both theological and philosophical, about the relationship between the human and divine wills. The question of how people could be free and responsible if predestination ultimately determines fate was resolved in John Calvin’s (1509-64) tradition partly by distinguishing between God’s irresistible and resistible will. The latter consists of human choices which God allows (for a higher divine purpose) to run counter to his perfect will. Thus God is entirely sovereign and humans are responsible for their deeds. James Arminius (1560-1609) objected that Calvinism made God responsible for sin, and he proposed instead that God predestined those whom he foresaw would repent.

The Reformers’ emphasis on the fallenness of the will led to their distrust in reason as a source of information about the spiritual realm, including God. An unfallen mind would see God everywhere through His creation, but our fallen minds cannot find God. Being therefore hidden, as Martin Luther emphasized (1483-1546), God must reveal Himself in revelation and deed. Humanity must resist the temptation to go beyond what is revealed, especially since God reveals only what we need to know, not all that we wish to know. The Reformers’ reluctance to use reason to narrow the gap between the spiritual and physical realms continued the Augustinian tradition (which faintly echoed Plato’s two realms), challenging the Scholastics’ high view of reason and of Aristotle. That reason has a limited role in the spiritual realm was later emphasized by Soren Kierkegaard (1813-55) and Karl Barth (1886-1968).

e. Enlightenment

Philosophy began splitting from religion as the two moved in opposite directions with regard to reason. Religion was retreating from reason both by emphasizing the divine will over the divine intellect, and in the human realm, by emphasizing faith over reason. Meanwhile, broad elements in the culture turned away from the authority of the church and Aristotle to regard reason as the main source of knowledge. The wisdom of this seemed confirmed in the discoveries of scientists like Newton and Kepler, who had great success using observations to find mathematical regularities in nature. Discoveries were revealing a highly ordered universe, implying a highly reasonable God.

Deism rose as a philosophical form of theism that used reason as its source of knowledge of God. Without revelation to give detail to natural theology, knowledge of God was minimal. Lord Herbert of Cherbury (1583-1648) claimed simply that there is one supreme God, who should be worshiped; virtuous living constitutes worship, people should repent, and God rewards good and punishes evil. The emerging Newtonian universe was one of mechanical precision and predictability, with no room for outside causes. Accordingly, there seemed to be little or no room for divine intervention. Deism, then, held that God caused the universe but did not intervene thereafter. Prayer and miracles were deemed unnecessary because of God’s superior engineering.

The emphasis on God as a perfect designer entailed that waste and suffering were only apparently pointless. The plan and wisdom of God were seen in the grand scheme of the universe, hence God is known best in generality and abstraction.

In a time of upheaval, Rene Descartes (1596-1650) famously sought to ground all knowledge on a foundation he could not doubt: that he was a thinking being. The success of his approach depended crucially on God’s benevolence: because we can be sure that the divine being would not mislead us, we can trust that our clear and distinct ideas are true. God’s character thus forms the basis for our certainty that there is indeed a reality corresponding to our ideas. God’s omnipotence entails the ability to do even what is logically impossible. Descartes also regarded God as not merely uncaused, but somehow the cause of himself.

John Locke (1632-1704) held a view reminiscent of scholasticism, that revelation reveals about God what cannot be known by reason alone–yet neither does revelation violate reason. He went beyond the scholastics to affirm that what violates reason cannot be accepted as revelation. His motive was to rule out what he called “enthusiasm,” which would include supposed private revelations about God held on the sole authority of an individual’s intuition that a revelation is true. Reason must judge whether a supposed revelation is true. His view further welded the concept of God to reason.

Baruch Spinoza (1632-1677) agreed with Descartes that clear and distinct ideas indeed reflect reality, but he thought that philosophy must start with God, not the self. This is because God is first in the order of things. God’s primacy is also the reason Spinoza rejected Bacon’s method of beginning with observation. He abandoned his judaistic roots by affirming that God is the whole of reality, and neither transcendent nor personal.

Aquinas had concluded that God exists on grounds that the universe needs something outside itself as a cause. But Spinoza believed that there can be only one thing–God–because wholes alone are independent and there can be only one whole (or “substance”). There is nothing outside the whole on which the whole can depend. That whole is a network of truths connected by implication. That being the case, everything is either necessary or impossible. Since to be free is to be undetermined by anything outside oneself, God is free because nothing can be outside him; and God alone is free because everything within the whole is the way it is by necessity. There is no need to prove the existence of God beyond the need to prove the existence of the one substance. For Spinoza, God is not an external initiating cause of the world and so is not demonstrable as such. He is nonetheless an immanent and continuing cause of the world. Nor could God be the world’s designer or one who imbues it with purpose. That is because wanting to bring something about implies lack, and God can lack nothing. Lacking purposes, God can have no moral goals for humanity. God is the network of all truths, not a personal being who gives revelation. Still, to know God-which is necessarily a matter of reason-is an essential good. As Spinoza said, “the highest virtue of the mind is to understand or to know God” (Ethics, Part 4, prop. 28; trans. Elwes).

Where Spinoza explained reality in terms of a singular substance that is divine, Gottfried Leibniz (1646-1716) proposed innumerable instances of the same types of substance. These monads as he called them, are centers of psychic energy. They do not act causally on each other but are coordinated in a grand harmony preestablished by God. That so many diverse elements act in harmony is proof for God’s existence. Because God operates on a principle of sufficient reason, there must be a reason why he chose to create just this world: it must be the best one possible. While many things are possible individually, even God is limited in what can be brought about together (just as a man can be a father or childless, but not both). Since God alone is perfect, created things have limitations, which is a source of evil. Nevertheless, we find that evil is often a prerequisite for some types of good. God’s choice to create this particular world is a matter of his internal moral necessity. He made this world because it has the greatest variety and can, as an act of love, reveal his nature in the greatest possible way.

Leibniz made God the source of causality, George Berkeley (1685-1753) made God the source of perception. He denied the existence of physical substances (because he regarded belief in the physical world as a root of atheism) and claimed that God directly gives us our ideas of the world. The orderliness of our ideas is testimony to the power of God.

David Hume (1711-1776) accepted Berkeley’s empiricism, which claimed that our ideas are of particular things and not universal things; but Hume’s empiricism led him to skeptical conclusions. He held that our observations about the world do not warrant belief in the God of theism. Design, for example, is manifestly imperfect; furthermore, a good God would not allow evil. If our observations point beyond the world at all it might be to a finite god, or even a number of gods. So the concept of God must be rooted not in reason but in emotion and the will.

f. Modern Period

Immanuel Kant (1724-1804) also rejected empirical knowledge as a way of knowing God. In fact, he maintained that God cannot be demonstrated at all, yet neither can his existence be disproved. As humans we typically go beyond what we can rightly infer, and our idea that God can be objectively known is an example. Nevertheless, as an idea, God has regulative value for our thinking in that it acts heuristically and gives a sense of unity to our experience. Practically, too, the idea of God grounds important moral beliefs. Specifically, it is fitting that those who do what is right are happy; and since that is not reliably attained in this life, we can rightly posit that there is life in a sphere beyond this one. We can make the practical assumption too that God exists to ensure the connection between virtue and happiness.

God was considered to be an objective issue before Kant. After him there was a greater tendency to consider it a subjective issue, one that is irreducibly a matter of interpretation. It was associated with discussions of ethics and values rather than of science and facts. This accompanied a change from the Enlightenment’s emphasis on objective knowledge of God as a transcendent engineer, to Romanticism’s emphasis on personal experience of God as a Spirit immanent in everything. Friedrich Schleiermacher (1768-1834) accordingly emphasized a feeling of dependence on God, while Albrect Ritschl (1822-1889) emphasized God as a source of moral freedom and values.

Whereas Kant and those he affected regard God as elusive to our rationality, for G. W. F. Hegel (1770-1831) God is the essence of rationality. Furthermore, Spirit reveals itself and its development through the world, being visible for all to see in the very events of history. Thus the categories which Kant regarded as being limited to the human mind Hegel regarded as part of the Absolute Mind. As such, the very structure of that Mind (or Spirit) can be known. Hegel challenged views that had been dominant since Aristotle, that God and truth are unchanging, and that logic deals with dichotomies that are properly kept apart by the principle of non-contradiction (according to which A cannot also be non-A). For Hegel, dichotomies are united in a higher reality. For example, Being and Nothing are transcended in Becoming. That is because Being is a general term and has no qualities, so it passes over into the concept of Nothing. That passing over is Becoming. The original opposition is thereby transcended.

Hegel believed that reality divides into dichotomies and contradictions that are resolved in a dynamic synthesis. Spirit thus moves from homogeneity to differentiation to unity in diversity. He therefore rejected Schelling’s idea that the Absolute is undifferentiated. Because for Hegel Spirit is more than matter, he rejected Spinoza’s view that the Absolute is substance only. For Hegel it is more than that; it is developing consciousness. In this process God comes to self-awareness through mankind’s awareness of him–God thinking of himself through human consciousness.

Kant had claimed that ultimate reality (the thing-in-itself) is unknowable, but Arthur Schopenhauer (1788-1860) said it is knowable because it is will. We can know it directly because we can know our own will. Will manifests itself with increasing sophistication in the physical world (through gravity, for example), in plants and animals, and in human nature. But because the will is completely free it is irrational and blind. He rejected Hegel’s optimistic belief in the ultimate victory of rationality, and in contrast to Leibniz, he held that this is the worst of all possible worlds.

Hegel’s view that Spirit is in process and not a static state was continued in Alfred N. Whitehead (1861-1947). Whitehead held that God is necessary to each act of becoming, and in turn God develops through each act of becoming. God strives to enrich the world as well as himself by nurturing harmony and order while preserving values that enhance truth, beauty, and goodness. He strives to eliminate evil from the world using persuasive (rather than coercive) power. In this sense, “He does not create the world, he saves it.” He leads it by means of his vision, rather like a poet.

The so called right wing Hegelians rejected pantheism and interpreted Hegel in a way consistent with theism. Left wing Hegelians associated the Absolute with material reality. Ludwig Feuerbach (1804-1872) said that people create the concept of God and project it onto reality. Karl Marx (1818-1883) made religion both a product and a tool of oppression, the “opium of the people.” People formulate religion in response to the sufferings caused by society’s inequities. Like a narcotic, it insulates them from the pain but it also makes people incapable of dealing with the cause of that pain. Furthermore, religion legitimates the status quo.

Friedrich Nietzsche (1884-1900) rejected belief in God as weak and untenable. He believed his times witnessed the death of God as a cultural force, yet at the same time he feared the outcome. He did not think that God died in the sense that He once existed and at some point ceased to exist, but that modern society regarded God as irrelevant.

Sigmund Freud (1856-1939) regarded God as a projection of the mind, a product of wishful thinking. The pre-scientific mind, for example, finds it easier to cope with an anthropomorphized universe. It is easier to suppose that a personal being is in control than to face seemingly capricious forces of nature. But when humanity grows into a more scientific understanding of the universe, such beliefs will be discarded.

Feuerbach, Marx, Nietzsche, Freud and others thus did not try to rationally defeat belief in God. Rather, they sought to explain its origins and the personal motives of believers.

In the early twentieth century, logical positivism narrowed the scope of meaning in a way that made belief in God subjective by definition. Besides tautologies only empirically verifiable statements were said to be true or false.

Ludwig Wittgenstein (1889-1951) was initially sympathetic to linking meaning to verifiability. He held that language is static and pictures reality. This limits what can be meaningfully expressed in language and excludes propositions about such things as ethics, aesthetics, and the meaning of life. On such topics, “one must be silent.” Wittgenstein later came to the view that meaning comes not from a link to the world but from usage. In this way language is more like doing than picturing. Because this necessarily gives language and meaning a social dimension, concepts of God are bound to their use within, for example, a believing community. On this view it is possible to claim that to know “God” is not to know the existence and attributes of a metaphysical being, but the use of a term and its connections to a life style.

3. Divine Attributes

Classical theism is found in the Greeks since Plato; in the Judaism of Philo, Maimonides, and others; in Christian orthodoxy generally, and in Islam as early as al-Kindi. Discussions of God in classical theism have centered on a number of specific attributes. The working assumption from the Greeks onward has been that God is the most perfect possible being. There is an implicit question as to whether perfections are coherent such that they can exist in one person. If they are not, God would have all perfections possible for a single being. In more theologically oriented thinkers, the assumption that God is a perfect being serves not to formulate the concept of God but only to fill in what is given in revelation. The Reformers, for example, depended heavily on revelation because of their conviction that the human mind is darkened by corruption and therefore is inadequate to shape concepts of God.

a. Incorporeality

Incorporeality. God has no body (from Latin, incorporale), or is non-physical. This is a central tenet of monotheistic religions, which insist that any references to God’s eyes, ears, mind, and the like are anthropomorphic. Christian belief in the incarnation is a unique case in which God takes on human form in Christ.

While some regard God’s incorporeality as true analytically (that is, true by the very definition of the word “God”), others derive it from one or more other attributes. Accordingly, God cannot be corporeal because that would preclude his being eternal, immutable, and simple, for example. Furthermore, if God were corporeal and omnipresent, it would seem that all physical things would be part of God. Others derive divine incorporeality from an apparent incorporeal element of human nature, termed the soul or spirit.

b. Simplicity

Simplicity. God has no parts or real distinctions. The neo-Platonist Plotinus regarded God as therefore characterless, but Christianity generally recognizes the legitimacy of talk of attributes. For Aquinas, to be simple God must be (among other things) incorporeal as well as identical to his nature, not a member of a class that shares a common nature. Aquinas said that God has the perfections we ascribe to him, but that they exist in him in an incomprehensible unity such that we cannot understand the reality behind our statements. When we ascribe goodness to God, goodness does not mean exactly what it does when we ascribe it to a creature (univocal meaning), nor does it mean something entirely different (eqivocal meaning). Its meaning is analogical: in some sense the same and in some sense different. Maimonides insisted on equivocal meaning only, with the result that negative attributes alone can be ascribed to God. Yet he recognized that even negative attribution gives some understanding of the divine being. In Islam, most philosophers (such as al-Farabi) accepted divine simplicity, whereas most theologians rejected it. Some used it to reject the Trinity. Augustine had recognized a potential conflict between simplicity and the Trinity, but believed the resolution lay in proper understanding of the Trinity.

c. Unity

Unity. Monotheism maintains that there is one God. To this Christianity adds that there is a threefold distinction within one God. Stated roughly, God is one substance in three persons. Aquinas argued that there cannot be two gods because neither would be absolutely perfect since one would have a quality that the other lacked (Summa Theologica Ia, 11, 3). Richard Swinburne says that theism is a simpler hypothesis than polytheism, the latter positing more beings with various capabilities and relations. Theism is therefore more likely since simpler hypotheses turn out to be true more often. Moreover, the universe exhibits a unity, in its universal natural laws for example. This unity argues for one deity as its originator (The Existence of God, 1991, pp. 141-2).

d. Eternity

Eternity. Biblical authors spoke of God remembering the past, knowing the future, and acting in the present. According to early Christian thought, God exists forever, without beginning or end. For him events are past, present, and future. Later Christian thought, under the influence of Platonism it is said, held that God exists not inside time, but outside it. God is atemporal in that for him everything is simultaneous, there being no past, present, or future. This later view was held by Augustine, Anselm, and Aquinas; and classically expressed by Boethius, “Eternity is the complete and total possession of unending life all at once” (Consolation of Philosophy, V, vi). Boethius regarded a timeless being as superior because it does not lack a past and future; its entire existence is in a timeless present.

In modern times the timeless view has been defended by E. L. Mascall, Norman Kretzmann, Eleanor Stump, Paul Helm, and Brian Leftow. Arguments in favor include: it makes God more transcendent, it simplifies foreknowledge, it proposes the same divine relationship to time as to space–God is outside it; furthermore it allows for the creation of time along with matter. Arguments for the earlier view, that God is eternal but exists within time, include: personhood requires existence in time because only in time can there be intending, acting, knowing, remembering, and the like; it is difficult to explain how a timeless God can know or respond to events; and the notion of timeless eternity is incoherent.

e. Immutability

Immutability. Those who accept the view that God is outside time are able to argue that God cannot change because any change would have to take place inside time. The view that God is an absolutely perfect being can also lead to the conclusion that he cannot change: if he is perfect he could change neither for the better nor for the worse. Simplicity can be grounds for accepting divine immutability since the only things subject to change are things with parts. Immutability has been taken in a strong sense to mean that if a predicate p applies to God at any time then it must apply at every time. But this is so broad that it brings into the discussion of immutability things that, while changing, are in no way changing within God. For example, “Smith believes in God” could be false yesterday and true today, yet nothing within God has changed. God is immutable in a weaker and less problematic sense if it is required only that he does not change in his character and purpose. The weaker sense fits well with the view that God exists in time, since he could be considered immutable yet begin an action, forgive a person, and so on. Thus, predicates like, “God is protecting r from harm” could be the case at one time but not another and God would still be immutable. The stronger sense of immutability fits well with a God outside of time.

f. Omnipotence

Omnipotence. The claim that God can do anything has been the subject of a number of qualifications. First, many affirm the biblical view that God cannot do what is morally contrary to his nature. Similar to Anselm (Proslogion 7), Aquinas says that God cannot sin because he is omnipotent, since sin is a falling short of perfection (Summa Theologica, Ia.25.3). Nelson Pike says that it is logically possible for God to sin but he would not do what is against his nature. Aquinas also says that God cannot do other things that corporeal beings can do. And, he cannot do what is logically impossible, such as make a square circle. Descartes is one of the few to hold the contrary view, that the laws of mathematics and logic are subject to the will of God (Descartes’ Conversation with Burman, 22, 90). Perhaps the most significant challenge to omnipotence involves the existence of evil. It seems evil would not exist if God is both good and omnipotent. Process theology denies omnipotence, Christian Science denies the ultimate reality of evil, and some post-Holocaust thinking seems to question the goodness of God. Augustine defends the orthodox Christian concept of God on grounds that he did what was good in creating free beings yet they used their freedom to do evil. Some suffering is the just consequence of sin. Furthermore, where evil is a lack of good we cannot ask why God created it since it is merely the absence of something. Aquinas, Leibniz and others recognize that some good things exist only in the presence of certain types of evil. For example, forgiveness exists only where there is sin. In the light of these secondary goods, Leibniz argues that out of all the possible worlds God created the one with the best possible balance of good and evil. Some thinkers appeal to a future life to settle apparent discrepancies in the balance of good over evil. God’s future blessing, it is said, can more than make up for suffering in this world. William Alston develops the idea that as limited beings we are incapable of discerning-and therefore questioning-whether God has sufficient reasons for allowing the evil that exists.

g. Omniscience

Omniscience. While a few like Avicenna and Averroes seem to have held that a God who lacks certain types of knowledge would be more perfect, most have claimed that God knows everything. This is sometimes refined, for example, to the claim that God knows everything that is logically possible to know. An area of concern going back to Aristotle (On Interpretation 9) is the claim that propositions about future contingent events (that is, those whose causes are not determined by past events) have no truth value. If so they are unknowable, even by an omniscient being (a view held in modern times by so called Open Theism). Some have claimed that even if future events have a truth value, they are logically unknowable. Of special concern is the relationship between omniscience and human free will: if yesterday God knew infallibly that I would do x today, it seems I have no alternative but to do x today–a conclusion that seems to violate free will. To solve this, Boethius and Aquinas appealed to the concept of God’s timelessness, which entails that none of God’s knowledge is past or future. Aquinas also said that God determines all events and determines that they will be done freely. De Molina objected that this amounts to removing free will. He constructed his own view, which said that God’s knowledge is logically prior to his decree of what will be. God knows what an individual will do in all possible circumstances (a capacity called middle knowledge), and he decrees those circumstances in which a person freely cooperates with the divine plan. Thus foreknowledge is compatible with free will. Others have conceded that foreknowledge is incompatible with free will but claim that God voluntarily limits his knowledge of future events so that there can still be freedom. This makes omniscience a matter of having an ability to know rather than having specific knowledge. Another solution to the problem of omniscience and freedom challenges the idea that God’s knowledge limits future free actions in any way. While God knows necessarily that I will do x tomorrow that does not entail that it is necessary I do x. What God knows is what I will freely choose to do. So God knows today that I will do x tomorrow because tomorrow I will freely choose to do x. But if tomorrow I choose to do y, then today God knows that tomorrow I will do y. This view is consistent with what we know about less than infallible knowledge of future events. I may know that a person will choose steak over bologna though I in no way influenced their choice.

h. Impassibility

Impassibility. Various views have been held as to whether God can be affected by outside influences. Because Aristotle regarded change as inconsistent with perfection, he concluded that God could not be affected by anything outside himself. Furthermore, God engages not in feeling, but thinking, and he himself is the object of his contemplation. God is thus unaffected by the world in any way. The Stoics ruled out divine passibility because they regarded imperturbability as a virtue, and God must be the supreme example of it. John of Damascus agreed that God is imperturbable, but stressed it is because he is sovereign, not because he is uncaring. Aquinas accepted Aristotle’s view that God cannot change and is impassible. He can act, but nothing can act upon him. So emotions that proceed from God, such as love and joy, are in God; but other emotions such as anger and sadness can be ascribed to him only metaphorically. Early, medieval, and Reformation Christianity generally affirmed that because God could not suffer, Christ suffered in his humanity but not in his divine nature. However, the idea that God is unaffected by the world is being rethought in modern times. Moltmann, who was for a time a German prisoner of war, and Kitamori, a Japanese thinker, both witnessed World War II and its aftermath. They concluded that God must be moved by suffering. Richard Creel defends impassibility as being uncontrolled by outside influences. He says, among other things, that: God has emotions but they are not controlled by anything outside himself, he takes into account the ultimate good that will come from suffering, suffering does not make love more admirable, a God who suffers would be more appropriately an object of pity than of worship, justice does not require passibility because it need not be based on emotion; and omniscience does not require passibility because God need know only that a person has an emotion, he does not need to experience it. A mediating position would allow emotion in God but not control of him in any way by creatures. God would be affected by the world but only in the way and to the extent he allows.

i. Goodness

Goodness. Whereas classical Greek religion ascribed to the gods very human foibles, theism from Plato onward has affirmed that God is purely good and could not be the author of anything evil (Republic). In Judaism divine goodness is thought to be manifested especially in the giving of the law (Torah). In Islam it is thought to be manifested in divine revelation of truth through the prophets, especially as revealed in the Qur’an. And in Christianity it is manifested in the gracious granting of Christ as the way of salvation.

While goodness encompasses all moral perfection (e.g., truth telling, justice), benevolence is that particular aspect of goodness that wills the benefit of another. The Reformers, and Protestantism generally, stressed that God’s desire for the benefit of creatures is dependent not on their merits but purely on divine love. Divine love is not only irrespective of merit but it is shown most clearly where it is entirely unmerited, as in grace shown to fallen humanity. Therefore divine forgiveness and redemption are taken as the highest expressions of benevolence. Benevolence intersects with omnipotence in providence, wherein God orders events for good ends. It also raises the possibility of a clash between the divine and human wills, as when a person spurns God’s action in the world.

Divine goodness raises the question of whether God wills x because it is good, or x is good because God wills it. The former seems to weaken divine sovereignty, but the latter seems to make goodness arbitrary. The arbitrariness may be somewhat relieved if God’s will is understood as bounded by his unchanging character. God would not, for example, decide to make torturing for enjoyment right since his nature forever condemns it. The issue has implications for divine command ethics, according to which acts are right or wrong because God commands or forbids them (as opposed to, for example, a competing view that acts are right or wrong according to whether they promote the greatest happiness).

As to our knowledge of divine goodness, Aquinas separates the order of being from the order of knowing: all goodness derives from God but we understand divine goodness by extrapolating from the goodness of creatures. For Aquinas, this requires an analogical (as opposed to an equivocal) relationship between divine and human goodness. For Kant, divine goodness is known as a postulate of pure practical reason: God must be there to reward virtue and punish evil.

The greatest challenge to belief in divine goodness has been the fact that evil exists, or more recently, the amount and type of evil rather than the mere fact of it. The problem is lessened if it is acknowledged that divine goodness does not require that each creature always be made to experience as much happiness as it is capable of experiencing. Reasons may include, for example, that: it is impossible that all creatures collectively experience maximal happiness (e.g., because the maximal happiness of one precludes the maximal happiness of another), or that there is some higher good than the happiness of all creatures (e.g., John Hick’s view that maturity is that higher good, and acquiring it may entail some displeasure), or that some forms of good are manifested only when certain types of evil exist (for example, forgiveness requires wrongdoing; mentioned in “6,” above); or because God’s favor is undeserved and not given in response to merit, it cannot be owed and God cannot be faulted for not giving it.

4. References and Further Reading

  • Davis, Stephen T., Logic and the Nature of God (Grand Rapids, MI: Eerdmans, 1983).
    • Deals with challenges to the logical consistency of theism.
  • Fiddes, Paul S., The Creative Suffering of God (Oxford, 1988).
    • In-depth treatment of impassibility.
  • Hasker, W., God, Time and Knowledge (Ithaca, NY: Cornell University Press, 1989).
  • Hick, John, Evil and the God of Love, rev. ed (San Francisco, CA: Harper &Row, 1978).
    • Overview of major historical views on evil; concludes that the world is a place of soul-making.
  • Kelly, Joseph F., The Problem of Evil in the Western Tradition: From the Book of Job to Modern Genetics (Collegeville, MN: The Liturgical Press, 2002).
    • Comprehensive and accessible survey of western thought on the subject.
  • Kenny, A. The God of the Philosophers (Oxford, 1979).
  • Morris, Thomas V., Our Idea of God: An Introduction to Philosophical Theology (Downers Grove, IL: InterVarsity, 1991).
    • Basic introduction to issues such as perfect being theology; God’s goodness, power, and knowledge.
  • Quinn, Philip and Charles Taliaferro eds. A Companion to Philosophy of Religion (Malden, MA: Blackwell, 1997).
    • Contains 620 pages of articles by authorities; many of them introduce various aspects of theism, including attributes of God, pluralism, theism and modern science, and the problem of evil.
  • Swinburne, Richard, The Coherence of Theism (Oxford, 1977; rev. 1993).
    • Discusses many aspects of theism to show its logical consistency.

Author Information

Brian Morley
Email: bmorley@masters.edu
The Master’s College
U. S. A.

Paul Ricoeur (1913—2005)

RicoeurPaul Ricoeur was among the most impressive philosophers of the 20th century continental philosophers, both in the unusual breadth and depth of his philosophical scholarship and in the innovative nature of his thought. He was a prolific writer, and his work is essentially concerned with that grand theme of philosophy: the meaning of life. Ricoeur’s “tensive” style focuses on the tensions running through the very structure of human being. His constant preoccupation was with a hermeneutic of the self, fundamental to which is the need we have for our lives to be made intelligible to us. Ricoeur’s flagship in this endeavor is his narrative theory. Though a Christian philosopher whose work in theology is well-known and respected, his philosophical writings do not rely upon theological concepts, and are appreciated by non-Christians and Christians alike. His most widely read works are The Rule of Metaphor, From Text to Action, and Oneself As Another, and the three volumes of Time and Narrative. His other significant books include Hermeneutics and the Human Sciences, Conflict of Interpretations, The Symbolism of Evil, Freud and Philosophy, and Freedom and Nature: The Voluntary and the Involuntary.

Table of Contents

  1. Life and Works
  2. Style
  3. Influences
  4. The Philosophy
  5. Time and Narrative
  6. Ethics
  7. References and Further Reading
    1. Selected Ricoeur Bibliography
    2. Further Reading

1. Life and Works

Jean Paul Gustave Ricoeur was born on February 27, 1913, at Valence, France, and he died in Chatenay-Malabry, France on May 20, 2005. He lost both his parents within his first few years of his life and was raised with his sister Alice by his paternal grandparents, both of whom were devout Protestants. Ricoeur was a bookish child and successful student. He was awarded a scholarship to study at the Sorbonne in 1934, and afterwards was appointed to his first teaching position at Colmar, Alsace. While at the Sorbonne he first met Gabriel Marcel, who was to become a lifelong friend and philosophical influence. In 1935 he was married to Simone Lejas, with whom he has raised five children.

Ricoeur served in World War II – spending most of it as a prisoner of war – and was awarded the Croix de Guerre. He was interred with Mikel Dufrenne, with whom he later wrote a book on the work of Karl Jaspers. After the war Ricoeur returned to teaching, taking positions at the University of Strasbourg, the Sorbonne, University of Paris at Nanterre, the University of Louvain and University of Chicago. Ricoeur is a traditional philosopher in the sense that his work is highly systematic and steeped in the classics of Western philosophy. His is a reflective philosophy, that is, one that considers the most fundamental philosophical problems to concern self-understanding. While Ricoeur retains subjectivity at the heart of philosophy, his is no abstract Cartesian-style subject; the subject is always a situated subject, an embodied being anchored in a named and dated physical, historical and social world. For this reason his work is sometimes described as philosophical anthropology. Ricoeur is a post-structuralist hermeneutic philosopher who employs a model of textuality as the framework for his analysis of meaning, which extends across writing, speech, art and action. Ricoeur considers human understanding to be cogent only to the extent that it implicitly deploys structures and strategies characteristic of textuality. It is Ricoeur’s view that our self-understandings, and indeed history itself , are “fictive”, that is, subject to the productive effects of the imagination through interpretation. For Ricoeur, the human subjectivity is primarily linguistically designated and mediated by symbols. He states that the “problematic of existence” is given in language and must be worked out in language and discourse. Ricoeur refers to his hermeneutic method as a “hermeneutics of suspicion” because discourse both reveals and conceals something about the nature of being. Unlike post-structuralists such as Foucault and Derrida, for whom subjectivity is nothing more than an effect of language, Ricoeur anchors subjectivity in the human body and the material world, of which language is a kind of second order articulation. In the face of the fragmentation and alienation of post-modernity, Ricoeur offers his narrative theory as the path to a unified and meaningful life; indeed, to the good life.

2. Style

Ricoeur has developed a theoretical style that can best be described as “tensive”. He weaves together heterogeneous concepts and discourses to form a composite discourse in which new meanings are created without diminishing the specificity and difference of the constitutive terms. Ricoeur’s work on metaphor and on the human experience of time are perhaps the best examples of this method, although his entire philosophy is explicitly such a discourse. For example, in What Makes Us Think? Ricoeur discusses the nature of mental life in terms of the tension between our neurobiological conceptions of mind and our phenomenological concepts. Similarly, in the essay “Explanation and Understanding” he discusses human behavior in terms of the tension between concepts of material causation, and the language of actions and motives. The tensive style is in keeping with what Ricoeur regards as basic, ontological tensions inherent in the peculiar being that is human existence, namely, the ambiguity of belonging to both the natural world and the world of action (through freedom of the will). Accordingly, Ricoeur insists that philosophy find a way to contain and express those tensions, and so his work ranges across diverse schools of philosophical thought, bringing together insights and analysis from both the Anglo-American and European traditions, as well as from literary studies, political science and history.

The tensions are played out in our ability to take different perspectives on ourselves and so to formulate diverse approaches and methods in understanding ourselves. The different theoretical frameworks employed in philosophy and the sciences are not simply the result of ignorance or power. They are the result of tensions that run through the very structure of human being; tensions which Ricoeur describes as “fault lines.” Ricoeur’s entire body of work is an attempt to identify and map out the intersections of these numerous and irreducible lines that comprise our understandings of the human world. Ricoeur calls these “fault lines” because they are lines that can intersect in different ways in all the different aspects of human lives, giving lives different meanings. However, as points of intersection of discourses, these meanings can come apart. Ricoeur argues that the stability we enjoy with respect to the meanings of our lives is a tentative stability, subject to the influences of the material world, including the powers and afflictions of one’s body, the actions of other people and institutions, and one’s own emotional and cognitive states. Given the fundamental nature of these tensions, Ricoeur argues that it is ultimately poetics (exemplified in narrative), rather than philosophy that provides the structures and synthetic strategies by which understanding and a coherent sense of self and life is possible.

3. Influences

Ricoeur acknowledges his indebtedness to several key figures in the tradition, most notably, Aristotle, Kant, Hegel and Heidegger. Aristotelian teleology pervades Ricoeur’s textual hermeneutics, and is most obvious in his adoption of a narrative approach. The concepts of “muthos” and “mimesis” in Aristotle’s Poetics form the basis for Ricoeur’s account of narrative “emplotment,” which he enjoins with the innovative powers of the Kantian productive imagination within a general theory of poetics.

The influence of Hegel is manifest in Ricoeur’s employment of a method he describes as a “refined dialectic.” For Ricoeur, the dialectic is a “relative moment[s] in a complex process called interpretation” (Explanation and Understanding”, 150). Like Hegel, the dialectic involves identifying key oppositional terms in a debate, and then proceeding to articulate their synthesis into a new, more developed concept. However, this synthesis does not have the uniformity of a Hegelian synthesis. Ricoeur’s method entails showing how the meanings of two seemingly opposed terms are implicitly informed by, and borrow from, each other. Within the dialectic, the terms maintain their differences at the same time that a common “ground” is formed. However, the common ground is simply the ground of their mutual presupposition. Ricoeur’s dialectic, then, is a unity of continuity and discontinuity. For example, in “Explanation and Understanding” Ricoeur argues that scientific explanation implicitly deploys a background hermeneutic understanding that exceeds the resources of explanation. At the same time, hermeneutic understanding necessarily relies upon the systematic process of explanation. Neither the natural sciences nor the human sciences are fully autonomous disciplines. A key dialectic that runs through Ricoeur’s entire corpus is the dialectic of same and other. This is a foundational dialectic for him, and so, as might be expected, it structures his discussions and dissections of every field of philosophy he enters: selfhood, justice, love, morality, personal identity, knowledge, time, language, metaphor, action, aesthetics, metaphysics, and so on. Unlike the Hegelian dialectic, for Ricoeur, there is no absolute culminating point. There is, nevertheless, a kind of absolute, an objective existence that is revealed indirectly through the dialectic. This is most evident in the third volume of Time and Narrative, where he argues that phenomenological time presupposes an objective order of time (cosmological time), and in The Rule of Metaphor, where he argues that language belongs to, and is expressive of, extra-linguistic reality. Despite this apparent concession to realism, Ricoeur insists that the objective cannot be known as such, but merely grasped indirectly and analytically. Here, the Kantian influence comes to the fore. For Ricoeur, objective reality is the contemporary equivalent of Kantian noumena: although it can never itself become an object of knowledge, it is a kind of necessary thought, a limiting concept, implied in objects of knowledge. This view informs Ricoeur’s “tensive” style. Although we can know, philosophically that there is an objective reality, and, in that sense, a metaphysical constraint on human existence, we can never understand human existence simply in terms of this objectivity. What we must appeal to in order to understand our existence are our substantive philosophical and ethical concepts and norms. This sets up an inevitable tension between the contingency of those norms and the brute fact of objective reality, evidenced in our experience of the involuntary, for example, as aging and dying. Again, Kant looms large. We necessarily regard ourselves from two perspectives: as the author of our actions in the practical world, and as part of, or passive to, cause and effect in the natural world. Such is the inherently ambiguous and tensive nature of human, mortal subjects. It is this condition, then, with which philosophy must grapple. And it is to this condition that Ricoeur offers narrative as the appropriate framework.

4. The Philosophy

There are two closely related questions that animate all of Ricoeur’s work, and which he considers to be fundamental to philosophy: “Who am I?” and “How should I live?” The first question has been neglected by much of contemporary analytical and post-modern philosophy. Consequently, those philosophies lack the means to address the second question. Postmodernism self-consciously rejects traditional processes of identity formation, depicting them as familial and political power relations premised upon dubious metaphysical assumptions about gender, race and mind. At the same time, contemporary philosophy of mind reduces questions of “who?” to questions of “what?”, and in doing so, closes down considerations of self while rendering the moral question one of mere instrumentality or utility. In relation to the question “Who am I?”, Ricoeur acknowledges a long-standing debt to Marcel and Heidegger, and to a lesser extent to Merleau-Ponty. To the moral question, the debt is to Aristotle and Kant. In addressing the question “who am I?” Ricoeur sets out first to understand the nature of selfhood – to understand the being whose nature it is to enquire into itself.

In this endeavor, Ricoeur’s philosophy is driven by the desire to provide an account that will do justice to the tensions and ambiguities which make us human, and which underpin our fallibility. Ricoeur’s interest here can be noted as early as The Voluntary and The Involuntary, drafted during his years as a prisoner of war. There he explores the involuntary constraints to which we are necessarily subject in virtue of our being bodily mortal creatures, and the voluntariness necessary to the idea of ourselves as the agents of our actions. We have, as he later describes it, a “double allegiance”, an allegiance to the material world of cause and effect, and to the phenomenal world of the freedom of the will by which we tear ourselves away from the laws of nature through action. This conception of the double nature of the self lies at the core of Ricoeur’s philosophy. Ricoeur rejects the idea that a self is a metaphysical entity; there is no entity, “the self,” there is only selfhood. Selfhood is an intersubjectively constituted capacity for agency and self-ascription that can be had by individual human beings. Selfhood proper is neither simply an abstract nor an animal self-awareness, but both. It essentially involves an active grasp of oneself as a “who”–that is, as a person who is the subject of a concrete situation, a situation characterized by material and phenomenal qualities. This entails understanding oneself as a named person with a time and place of birth, linked to other similarly named persons and to certain ethnic and cultural traditions, living in a dated and named place. In Oneself As Another Ricoeur describes how the complexity of the question of “who?” opens directly onto a certain way of articulating the question of personal identity: “how the self can be at one and the same time a person of whom we speak and a subject who designates herself in the first person while addressing a second person. . . The difficulty will be . . . understanding how the third person is designated in discourse as someone who designates himself as a first person (34-5)”. Drawing on Heidegger’s notion of Dasein, Ricoeur goes on to write that “To say self is not to say myself . . . the passage from selfhood to mineness is marked by the clause “in each case” . . . The self . . . is in each case mine” (OAA 180). What he means by this is that each person has to take one’s selfhood as one’s own; each must take oneself as who one is; one must “attest” to oneself. Subjectivity, or selfhood, is for Ricoeur, a dialectic of activity and passivity because we are beings with a “double nature,” structured along the fault lines of the voluntary and the involuntary, beings given to ourselves as something to be known. Ricoeur shares Marcel’s view that the answer to the question “Who am I?” can never be fully explicated. This is because, in asking “Who am I?”, “I” who pose the question necessarily fall within the domain of enquiry; I am both seeker and what is sought. This peculiar circularity gives a “questing” and dialectical character to selfhood, which now requires a hermeneutic approach. This circularity has its origins in the nature of embodied subjectivity. Ricoeur’s account is built upon Marcel’s conception of embodied subjectivity as a “fundamental predicament”(Marcel, 1965). The predicament lies in the anti-dualist realization that “I” and my body are not metaphysically distinct entities. My body cannot be abstracted from its being mine. Whatever states I may attribute to my body as its states, I do so only insofar as they are attributes of mine. My body is both something that I am and something that I have: it is “my body” that imagines, perceives and experiences. The unity of “my body” is a unity sui generis. Yet my body is also that over which I exercise a certain instrumentality through my agency. However, the agency that effects that instrumentality is nothing other than “my body.” There is no I-body relation; the primitive term here is “my body.” The inherent ambiguity of the “carnate body” or “corps-sujet” can be directly experienced by clasping one’s own hands (an example often employed by Marcel and Merleau-Ponty). In this experience the distinction between subject and object becomes blurred: it isn’t clear which hand is being touched and which is touching; each hand oscillates between the role of agent and object, without ever being both simultaneously. One cannot feel oneself feeling. This example is supposed to demonstrate two points: first, that the ambiguity of my body prevents the complete objectification of myself, and second, that ambiguity extends to all perception. Perception is not simply passive, but rather, involves an active reception (a concept that Ricoeur takes up and develops in his account of the ontology of the self and one’s own body in Oneself As Another, see 319–329). In other words, my body has an active role in structuring my perceptions, and so, the meaning of my perceptions needs to be interpreted in the context of my bodily situation. The non-coincidence of myself and my body constitutes a “fault line” within the structure of subjectivity. The result is that knowledge of myself and the world is not constituted by more or less accurate facts, but rather, is a composite discourse–a discourse which charts the intersection of the objective, intersubjective and subjective aspects of lived experience. On this view, all knowledge, including my knowledge of my own existence, is mediate and so calls for interpretation. This also means that self-understanding can never be grasped by the kind of introspective immediacy celebrated by Descartes. Instead, as human beings we are never quite “at one” with ourselves; we are fallible creatures. Thus, who I am is not an objective fact to be discovered, but rather something that I must achieve or create, and to which I must attest. On Ricoeur’s view, the question “Who am I ?” is a question specific to a certain kind of being, namely, being a subject of a temporal, material, linguistic and social unity. The ability to grasp oneself as a concrete subject of such a world requires a complex mode of understanding capable of integrating discourses of quite heterogenous kinds, including, importantly, different orders of time. It is to the temporal dimension of selfhood that Ricoeur has most directly addressed his hermeneutic philosophy and narrative model of understanding.

5. Time and Narrative

Central to Ricoeur’s defense of narrative is its capacity to represent the human experience of time. Such a capacity is an essential requisite for a reflective philosophy. Ricoeur sets out his account of “human time” in Time and Narrative, Volume 3. He points out that we experience time in two different ways. We experience time as linear succession, we experience the passing hours and days and the progression of our lives from birth to death. This is cosmological time–time expressed in the metaphor of the “river” of time. The other is phenomenological time; time experienced in terms of the past, present and future. As self-aware embodied beings, we not only experience time as linear succession, but we are also oriented to the succession of time in terms of what has been, what is, and what will be. Ricoeur’s concept of “human time” is expressive of a complex experience in which phenomenological time and cosmological time are integrated. For example, we understand the full meaning of “yesterday” or “today” by reference to their order in a succession of dated time. To say “Today is my birthday” is to immediately invoke both orders of time: a chronological date to which is anchored the phenomenological concept of “birthday.” Ricoeur describes this anchoring as the “inscription” of phenomenological time on cosmological time (TN3 109).

These two conceptions of time have traditionally been seen in opposition, but Ricoeur argues that they share a relation of mutual presupposition. The order of “past-present-future” within phenomenological time presupposes the succession characteristic of cosmological time. The past is always before the present which is always after the past and before the future. The order of succession is invariable, and this order is not part of the concepts of past, present or future considered merely as existential orientations. On the other hand, within cosmological time, the identification of supposedly anonymous instants of time as “before” or “after” within the succession borrows from the phenomenological orientation to past and future. Ricoeur argues that any philosophical model for understanding human existence must employ a composite temporal framework. The only suitable candidate here is the narrative model. Ricoeur links narrative’s temporal complexity to Aristotle’s characterization of narrative as “the imitation of an action”. Ricoeur’s account of the way in which narrative represents the human world of acting (and, in its passive mode, suffering) turns on three stages of interpretation that he calls mimesis1 (prefiguration of the field of action), mimesis2 (configuration of the field of action), and mimesis3 (refiguration of the field of action). Mimesis1 describes the way in which the field of human acting is always already prefigured with certain basic competencies, for example, competency in the conceptual network of the semantics of action (expressed in the ability to raise questions of who, how, why, with whom, against whom, etc.); in the use of symbols (being able to grasp one thing as standing for something else); and competency in the temporal structures governing the syntagmatic order of narration (the “followability” of a narrative). Mimesis2 concerns the imaginative configuration of the elements given in the field of action at the level of mimesis1. Mimesis2 concerns narrative “emplotment.” Ricoeur describes this level as “the kingdom of the as if” Narrative emplotment brings the diverse elements of a situation into an imaginative order, in just the same way as does the plot of a story. Emplotment here has a mediating function. It configures events, agents and objects and renders those individual elements meaningful as part of a larger whole in which each takes a place in the network that constitutes the narrative’s response to why, how, who, where, when, etc. By bringing together heterogeneous factors into its syntactical order emplotment creates a “concordant discordance,” a tensive unity which functions as a redescription of a situation in which the internal coherence of the constitutive elements endows them with an explanatory role. A particularly useful feature of narrative which becomes apparent at the level mimesis2 is the way in which the linear chronology of emplotment is able to represent different experiences of time. What is depicted as the “past” and the “present” within the plot does not necessarily correspond to the “before” and “after” of its linear, episodic structure. For example, a narrative may begin with a culminating event, or it may devote long passages to events depicted as occurring within relatively short periods of time. Dates and times can be disconnected from their denotative function; grammatical tenses can be changed, and changes in the tempo and duration of scenes create a temporality that is “lived” in the story that does not coincide with either the time of the world in which the story is read, nor the time that the unfolding events are said to depict. In Volume 2 of Time and Narrative, Ricoeur’s analyses of Mrs. Dalloway, The Magic Mountain and Remembrance of Things Past centre on the diverse variations of time produced by the interplay of a three tiered structure of time: the time of narrating; the narrated time; and the fictive experience of time produced through “the conjunction/disjunction of the time it takes to narrate and narrated time” (TN2 77). Narrative configuration has at hand a rich array of strategies for temporal signification. Another key feature of mimesis2 is the ability of the internal logic of the narrative unity (created by emplotment) to endow the connections between the elements of the narrative with necessity. In this way, emplotment forges a causal continuity from a temporal succession, and so creates the intelligibility and credibility of the narrative. Ricoeur argues that the temporal order of the events depicted in the narrative is simultaneous with the construction of the necessity that connects those elements into a conceptual unity: from the structure of one thing after another arises the conceptual relation of one thing because of another. It is this conversion that so well “imitates” the continuity demanded by a life, and makes it the ideal model for personal identity and self-understanding. Mimesis3 concerns the integration of the imaginative or “fictive” perspective offered at the level of mimesis2 into actual, lived experience. Ricoeur’s model for this is a phenomenology of reading, which he describes as “the intersection of the world of the text and the world of the reader”(TN1 71). Not only are our life stories “written,” they must be “read,” and when they are read they are taken as one’s own and integrated into one’s identity and self-understanding. Mimesis3 effects the integration of the hypothetical to the real by anchoring the time depicted (or recollected or imputed) in a dated “now” and “then” of actual, lived time. Mimesis is a cyclical interpretative process because it is inserted into the passage of cosmological time. As time passes, our circumstances give rise to new experiences and new opportunities for reflection. We can redescribe our past experiences, bringing to light unrealized connections between agents, actors, circumstances, motives or objects, by drawing connections between the events retold and events that have occurred since, or by bringing to light untold details of past events. Of course, narrative need not have a happy ending. The concern of narrative is coherence and structure, not the creation of a particular kind of experience. Nevertheless, the possibility of redescription of the past offers us the possibility of re-imagining and reconstructing a future inspired by hope. It is this potentially inexhaustible process that is the fuel for philosophy and literature.

6. Ethics

Besides the metaphysical complexity and heterogeneity of the human situation, one of Ricoeur’s deepest concerns is the tentative, even fragile status of the coherence of a life. His conception of ethics is directly tied to his conception of the narrative self. Because selfhood is something that must be achieved and something dependent upon the regard, words and actions of others, as well as chancy material conditions, one can fail to achieve selfhood, or one’s sense of who one is can fall apart. The narrative coherence of one’s life can be lost, and with that loss comes the inability to regard oneself as the worthy subject of a good life; in other words, the loss of self-esteem.

Ricoeur’s ethics is teleological. He argues that human life has an ethical aim, and that aim is self-esteem: “the interpretation of ourselves mediated by the ethical evaluation of our actions. Self-esteem is itself an evaluation process indirectly applied to ourselves as selves” (The Narrative Path, 99). In short, self-esteem means being able to attest to oneself as being the worthy subject of a good life, where “good” is an evaluation informed not simply by one’s own subjective criteria, but rather by intersubjective criteria to which one attests. This entails another moral concept: that of imputation. As the subject of my actions, I am responsible for what I do; I am the subject to whom my actions can be imputed and whose character is to be interpreted in the light of those actions. Ricoeur describes the ethical perspective that arises from this view of the subject as “aiming at the good life” with and for others, in just institutions” (OAA 172). Such a perspective merely spells out the premise of this practical and material conception of selfhood, with its presupposition of the world of action, lived with others. For Ricoeur, a life can have an aim because the teleological structure of action extends over a whole life, understood within the narrative framework. The ethical life is achieved by aiming to live well with others in just institutions. Ricoeur’s view of selfhood has it that we are utterly reliant upon each other. While Ricoeur emphasizes the importance of the first person perspective and the notion of personal responsibility, his is no philosophy of the radical individual. He emphasizes that we are “mutually vulnerable”, and so the fate (self-esteem) of each of us is tied up with the fate of others. This situation has a normative dimension: we have an indebtedness to each other, a duty to care for each other and to engender self-respect and justice, all of which are necessary to the creation and preservation of self-esteem. While duty runs deep, Ricoeur argues that it is nevertheless preceded by a certain reciprocity. In order to feel commanded by duty, one must first have the capacity to hear and respond to the demand of the Other. That is, there must be some fundamental, primordial openness and orientation to others for the power of duty to be felt. Prior to duty there must be a basic reciprocity, which underlies our mutual vulnerability and from which duty, as well as the possibility of friendship and justice, arises. Here, Ricoeur emphasizes the ethical primacy of acting and suffering. Ricoeur calls this phenomenon “solicitude” or “benevolent spontaneity” (OAA 190). It makes the relation of self and Other (and thus, ethics) primordial, or ontological – hence the title of Ricoeur’s book on ethics, Oneself As Another. Self-esteem is said to arise from a primitive reciprocity of spontaneous, benevolent feelings, feelings which one is also capable of directing toward oneself, but only through the benevolence of others. This fundamental reciprocity is prior to the activity of giving. This can be demonstrated in the situation of sympathy, where it is the Other’s suffering (not acting) that one shares. Here, Ricoeur argues that “from the suffering Other there comes a giving that is no longer drawn from the power of acting and existing, but precisely from weakness itself” (OAA 188-9). In this case, the suffering Other is unable to act, and yet gives. What the suffering Other gives to he or she who shares this suffering is precisely the knowledge of their shared vulnerability and the experience of the spontaneous benevolence required to bear that knowledge. As might be supposed from Ricoeur’s view of embodied subjectivity, one is always already an Other to oneself. So, love and understanding for others, and love and understanding for oneself, are two sides of the same sheet of paper, so to speak. One becomes who one is through relations with the Other, whether in the instance of one’s own body or another’s. Reciprocity forms the basis of those productive and self-affirming relations central to so much of ethics, namely friendship and justice. Its corruption leads to self-loathing and the destruction of self-esteem, which goes hand-in-hand with harm to others and injustice. For Ricoeur, friendship and justice become the chief virtues because of their crucial role in the well-being of selfhood, and thus, in maintaining the conditions of possibility of selfhood. Friends and just institutions not only protect against the suffering of self-destruction to which one is always vulnerable, they provide the means for reconstructing and redeeming damaged lives. The theme of redemption runs right through Ricoeur’s work, and no doubt it has a religious origin. However, the notion of redemption can be viewed in secular terms as the counterpart to the constructive nature of one’s identity, and the temporal complexity of the human situation which calls for interpretation.

7. References and Further Reading

  • Marcel, Gabriel. Being and Having: an existentialist diary (New York: Harper and Row, 1965).
  • Marcel, Gabriel. The Mystery of Being: 1, Reflection and Mystery (Chicago: Henry Regnery, 1960).
  • Merleau-Ponty, Maurice.  The Visible and The Invisible, trans. Alphonso Lingis (Evanston: Northwestern University Press, 1968).
  • Ricoeur, Paul. “Explanation and Understanding” in From Text to Action, trans. Kathleen Blamey and John Thompson (Evanston, Ill: Northwestern University Press, 1991).
  • Ricoeur, Paul. “Humans as the Subject Matter of Philosophy” in The Narrative Path, The Later Works of Paul Ricoeur, eds. T. Peter Kemp and David Rasmussen (Cambridge, Mass: MIT Press, 1988).
  • Ricoeur, Paul. “Intellectual Autobiography” in Lewis Edwin Hahn, ed., The Philosophy of Paul Ricoeur, The Library of Living Philosophers Volume XXII (Chicago, Illinois: Open Court, 1995).
  • Ricoeur, Paul. “What is Dialectical?” in Freedom and Morality ed. John Bricke, (Lawrence: University of Kansas, 1976).

a. Selected Ricoeur Bibliography

  • History and Truth, trans. Charles A Kelbley, (Evanston, Illinois: Northwestern University Press, 1965)
  • Fallible Man, trans. Charles A Kelbley (New York: Fordham University Press, 1986)
  • Freedom and Nature: The Voluntary and the Involuntary (Evanston, Illinois: Northwestern University Press, 1966)
  • Husserl: An Analysis of his Phenomenology, trans. E. G. Ballard and L. E. Embree (Evanston, Illinois: Northwestern University Press, 1966)
  • The Symbolism of Evil, trans. E. Buchanan (New York and Evanston: Harper-Row, 1967)
  • Freud and Philosophy: an essay on interpretation, trans. D. Savage (New Haven and London: Yale University Press, 1970)
  • Tragic Wisdom and Beyond, with Gabriel Marcel, trans. P. McCormick and S. Jolin (Evanston, Ill: Northwestern University Press, 1973)
  • The Conflict of Interpretations. Essays in Hermeneutics, trans. D. Ihde (Evanston, Ill: Northwestern University Press, 1974)
  • The Rule of Metaphor, multidisciplinary studies in the creation of meaning in language (London: Routledge & Kegan Paul, 1978)
  • Hermeneutics and the Human Sciences. Essays on Language, Action and Interpretation edited and trans. J. B. Thompson (Cambridge and New York: Cambridge University Press, 1981)
  • Time and Narrative, Volumes 1-3, trans. Kathleen Blamey and David Pellauer (Chicago and London: University of Chicago Press, 1984 -1988)
  • From Text to Action, trans. Kathleen Blamey and John Thompson (Evanston, Ill: Northwestern University Press, 1991)
  • Oneself as Another, trans. Kathleen Blamey (Chicago: University of Chicago Press, 1992)
  • Tolerance between intolerance and the intolerable (Providence: Berghahn Books, 1996)
  • Critique and conviction : conversations with FranÁois Azouvi and Marc de Launay trans. Kathleen Blamey (New York: Columbia University Press, 1998)
  • Thinking Biblically: Exegetical and Hermeneutical Studies, with Andre LeCocque (Chicago and London: University of Chicago Press, 1998)
  • The Just, trans. David Pellauer (Chicago and London: University of Chicago Press, 2000)
  • What Makes Us Think? A Neuroscientist and a Philosopher Argue About Ethics, Human Nature and the Brain, with Jean-Pierre Changeux, trans. M. B. DeBevoise (Princeton and Oxford: Princeton University Press, 2000)

b. Further Reading

  • Henry Isaac Venema: Identifying selfhood : imagination, narrative, and hermeneutics in the thought of Paul Ricoeur (Albany, N.Y. : State University of New York Press, 2000)
  • Bernard P. Dauenhauer : Paul Ricoeur : the promise and risk of politics (Lanham, MD: Rowman & Littlefield Publishers, 1998)
  • Charles E. Regan, Paul Ricoeur, his life and his work (Chicago & London: University of Chicago Press, 1996)
  • Lewis Edwin Hahn, ed. The Philosophy of Paul Ricoeur, The Library of Living Philosophers Volume XXII (Chicago, Illinois: Open Court, 1995)
  • David Wood, ed. On Paul Ricoeur (London & New York: Routledge, 1991)
  • S.H. Clark: Paul Ricoeur (London and New York: Routledge, 1990)
  • Patrick L. Bourgeois and Frank Schalow: Traces of understanding: a profile of Heidegger’s and Ricoeur’s hermeneutics (Amsterdam and Atlanta, GA : Rodopi, 1990)
  • T. Peter Kemp and David Rasmussen: The Narrative Path: The Later Works of Paul Ricoeur (Cambridge, Mass: MIT Press, 1989)
  • John B. Thompson: Critical hermeneutics : a study in the thought of Paul Ricoeur and Jurgen Habermas (Cambridge: Cambridge University Press, 1981)
  • Charles E. Reagan ed: Studies in the Philosophy of Paul Ricoeur (Athens: Ohio University Press, 1979)
  • Don Ihde, Hermeneutic Phenomenology: The Philosophy of Paul Ricoeur (Evanston: Northwestern University Press, 1971)

Author Information

Kim Atkins
Email: kim.atkins@utas.edu.au
University of Tasmania
Tasmania

Thomas Hobbes: Moral and Political Philosophy

hobbesThe English philosopher Thomas Hobbes (1588-1679) is best known for his political thought, and deservedly so. His vision of the world is strikingly original and still relevant to contemporary politics. His main concern is the problem of social and political order: how human beings can live together in peace and avoid the danger and fear of civil conflict. He poses stark alternatives: we should give our obedience to an unaccountable sovereign (a person or group empowered to decide every social and political issue). Otherwise what awaits us is a state of nature that closely resembles civil war – a situation of universal insecurity, where all have reason to fear violent death and where rewarding human cooperation is all but impossible.

One controversy has dominated interpretations of Hobbes. Does he see human beings as purely self-interested or egoistic? Several passages support such a reading, leading some to think that his political conclusions can be avoided if we adopt a more realistic picture of human nature. However, most scholars now accept that Hobbes himself had a much more complex view of human motivation. A major theme below will be why the problems he poses cannot be avoided simply by taking a less selfish view of human nature.

Table of Contents

  1. Introduction
  2. Life and Times
  3. Two Intellectual Influences
  4. Ethics and Human Nature
    1. Materialism Versus Self-Knowledge
    2. The Poverty of Human Judgment and our Need for Science
    3. Motivation
    4. Political Philosophy
  5. The Natural Condition of Mankind
    1. The Laws of Nature and the Social Contract
    2. Why Should we Obey the Sovereign?
    3. Life Under the Sovereign
  6. Conclusion
  7. References and Further Reading

1. Introduction

Hobbes is the founding father of modern political philosophy. Directly or indirectly, he has set the terms of debate about the fundamentals of political life right into our own times. Few have liked his thesis, that the problems of political life mean that a society should accept an unaccountable sovereign as its sole political authority. Nonetheless, we still live in the world that Hobbes addressed head on: a world where human authority is something that requires justification, and is automatically accepted by few; a world where social and political inequality also appears questionable; and a world where religious authority faces significant dispute. We can put the matter in terms of the concern with equality and rights that Hobbes’s thought heralded: we live in a world where all human beings are supposed to have rights, that is, moral claims that protect their basic interests. But what or who determines what those rights are? And who will enforce them? In other words, who will exercise the most important political powers, when the basic assumption is that we all share the same entitlements?

We can see Hobbes’s importance if we briefly compare him with the most famous political thinkers before and after him. A century before, Nicolo Machiavelli had emphasized the harsh realities of power, as well as recalling ancient Roman experiences of political freedom. Machiavelli appears as the first modern political thinker, because like Hobbes he was no longer prepared to talk about politics in terms set by religious faith (indeed, he was still more offensive than Hobbes to many orthodox believers), instead, he looked upon politics as a secular discipline divorced from theology. But unlike Hobbes, Machiavelli offers us no comprehensive philosophy: we have to reconstruct his views on the importance and nature of freedom; it remains uncertain which, if any, principles Machiavelli draws on in his apparent praise of amoral power politics.

Writing a few years after Hobbes, John Locke had definitely accepted the terms of debate Hobbes had laid down: how can human beings live together, when religious or traditional justifications of authority are no longer effective or persuasive? How is political authority justified and how far does it extend? In particular, are our political rulers properly as unlimited in their powers as Hobbes had suggested? And if they are not, what system of politics will ensure that they do not overstep the mark, do not trespass on the rights of their subjects?

So, in assessing Hobbes’s political philosophy, our guiding questions can be: What did Hobbes write that was so important? How was he able to set out a way of thinking about politics and power that remains decisive nearly four centuries afterwards? We can get some clues to this second question if we look at Hobbes’s life and times.

2. Life and Times

Hobbes’s biography is dominated by the political events in England and Scotland during his long life. Born in 1588, the year the Spanish Armada made its ill-fated attempt to invade England, he lived to the exceptional age of 91, dying in 1679. He was not born to power or wealth or influence: the son of a disgraced village vicar, he was lucky that his uncle was wealthy enough to provide for his education and that his intellectual talents were soon recognized and developed (through thorough training in the classics of Latin and Greek). Those intellectual abilities, and his uncle’s support, brought him to university at Oxford. And these in turn—together with a good deal of common sense and personal maturity—won him a place tutoring the son of an important noble family, the Cavendishes. This meant that Hobbes entered circles where the activities of the King, of Members of Parliament, and of other wealthy landowners were known and discussed, and indeed influenced. Thus intellectual and practical ability brought Hobbes to a place close to power—later he would even be math tutor to the future King Charles II. Although this never made Hobbes powerful, it meant he was acquainted with and indeed vulnerable to those who were. As the scene was being set for the Civil Wars of 1642-46 and 1648-51—wars that would lead to the King being executed and a republic being declared—Hobbes felt forced to leave the country for his personal safety, and lived in France from 1640 to 1651. Even after the monarchy had been restored in 1660, Hobbes’s security was not always certain: powerful religious figures, critical of his writings, made moves in Parliament that apparently led Hobbes to burn some of his papers for fear of prosecution.

Thus Hobbes lived in a time of upheaval, sharper than any England has since known. This turmoil had many aspects and causes, political and religious, military and economic. England stood divided against itself in several ways. The rich and powerful were divided in their support for the King, especially concerning the monarch’s powers of taxation. Parliament was similarly divided concerning its own powers vis-à-vis the King. Society was divided religiously, economically, and by region. Inequalities in wealth were huge, and the upheavals of the Civil Wars saw the emergence of astonishingly radical religious and political sects. (For instance, the Levellers called for much greater equality in terms of wealth and political rights; the Diggers, more radical still, fought for the abolition of wage labor.) Civil war meant that the country became militarily divided. And all these divisions cut across one another: for example, the army of the republican challenger, Cromwell, was the main home of the Levellers, yet Cromwell in turn would act to destroy their power within the army’s ranks. In addition, England’s recent union with Scotland was fragile at best, and was almost destroyed by King Charles I’s attempts to impose consistency in religious practices. We shall see that Hobbes’s greatest fear was social and political chaos—and he had ample opportunity both to observe it and to suffer its effects.

Although social and political turmoil affected Hobbes’s life and shaped his thought, it never hampered his intellectual development. His early position as a tutor gave him the scope to read, write and publish (a brilliant translation of the Greek writer Thucydides appeared in 1629), and brought him into contact with notable English intellectuals such as Francis Bacon. His self-imposed exile in France, along with his emerging reputation as a scientist and thinker, brought him into contact with major European intellectual figures of his time, leading to exchange and controversy with figures such as Descartes, Mersenne and Gassendi. Intensely disputatious, Hobbes repeatedly embroiled himself in prolonged arguments with clerics, mathematicians, scientists and philosophers—sometimes to the cost of his intellectual reputation. (For instance, he argued repeatedly that it is possible to square the circle  It is no accident that the phrase is now proverbial for a problem that cannot be solved!) His writing was as undaunted by age and ill health as it was by the events of his times. Though his health slowly failed—from about sixty, he began to suffer shaking palsy, probably Parkinson’s disease, which steadily worsened—even in his eighties he continued to dictate his thoughts to a secretary, and to defend his quarter in various controversies.

Hobbes gained a reputation in many fields. He was known as a scientist (especially in optics), as a mathematician (especially in geometry), as a translator of the classics, as a writer on law, as a disputant in metaphysics and epistemology; not least, he became notorious for his writings and disputes on religious questions. But it is for his writings on morality and politics that he has, rightly, been most remembered. Without these, scholars might remember Hobbes as an interesting intellectual of the seventeenth century; but few philosophers would even recognize his name.

What are the writings that earned Hobbes his philosophical fame? The first was entitled The Elements of Law (1640); this was Hobbes’s attempt to provide arguments supporting the King against his challengers. De Cive [On the Citizen] (1642) has much in common with Elements, and offers a clear, concise statement of Hobbes’s moral and political philosophy. His most famous work is Leviathan, a classic of English prose (1651; a slightly altered Latin edition appeared in 1668). Leviathan expands on the argument of De Cive, mostly in terms of its huge second half that deals with questions of religion. Other important works include: De Corpore [On the Body] (1655), which deals with questions of metaphysics; De Homine [On Man] (1657); and Behemoth (published 1682, though written rather earlier), in which Hobbes gives his account of England’s Civil Wars. But to understand the essentials of Hobbes’s ideas and system, one can rely on De Cive and Leviathan. It is also worth noting that, although Leviathan is more famous and more often read, De Cive actually gives a much more straightforward account of Hobbes’s ideas. Readers whose main interest is in those ideas may wish to skip the next section and go straight to ethics and human nature.

3. Two Intellectual Influences

As well as the political background just stressed, two influences are extremely marked in Hobbes’s work. The first is a reaction against religious authority as it had been known, and especially against the scholastic philosophy that accepted and defended such authority. The second is a deep admiration for (and involvement in) the emerging scientific method, alongside an admiration for a much older discipline, geometry. Both influences affected how Hobbes expressed his moral and political ideas. In some areas it is also clear that they significantly affected the ideas themselves.

Hobbes’s contempt for scholastic philosophy is boundless. Leviathan and other works are littered with references to the “frequency of insignificant speech” in the speculations of the scholastics, with their combinations of Christian theology and Aristotelian metaphysics. Hobbes’s reaction, apart from much savage and sparkling sarcasm, is twofold. In the first place, he makes very strong claims about the proper relation between religion and politics. He was not (as many have charged) an atheist, but he was deadly serious in insisting that theological disputes should be kept out of politics. (He also adopts a strongly materialist metaphysics, that—as his critics were quick to charge—makes it difficult to account for God’s existence as a spiritual entity.) For Hobbes, the sovereign should determine the proper forms of religious worship, and citizens never have duties to God that override their duty to obey political authority. Second, this reaction against scholasticism shapes the presentation of Hobbes’s own ideas. He insists that terms be clearly defined and relate to actual concrete experiences—part of his empiricism. (Many early sections of Leviathan read rather like a dictionary.) Commentators debate how seriously to take Hobbes’s stress on the importance of definition, and whether it embodies a definite philosophical doctrine. What is certain, and more important from the point of view of his moral and political thought, is that he tries extremely hard to avoid any metaphysical categories that do not relate to physical realities (especially the mechanical realities of matter and motion). Commentators further disagree whether Hobbes’s often mechanical sounding definitions of human nature and human behavior are actually important in shaping his moral and political ideas—see Materialism versus self-knowledge below.

Hobbes’s determination to avoid the “insignificant” (that is, meaningless) speech of the scholastics also overlaps with his admiration for the emerging physical sciences and for geometry. His admiration is not so much for the emerging method of experimental science, but rather for deductive science—science that deduces the workings of things from basic first principles and from true definitions of the basic elements. Hobbes therefore approves a mechanistic view of science and knowledge, one that models itself very much on the clarity and deductive power exhibited in proofs in geometry. It is fair to say that this a priori account of science has found little favor after Hobbes’s time. It looks rather like a dead-end on the way to the modern idea of science based on patient observation, theory-building and experiment. Nonetheless, it certainly provided Hobbes with a method that he follows in setting out his ideas about human nature and politics. As presented in Leviathan, especially, Hobbes seems to build from first elements of human perception and reasoning, up to a picture of human motivation and action, to a deduction of the possible forms of political relations and their relative desirability. Once more, it can be disputed whether this method is significant in shaping those ideas, or merely provides Hobbes with a distinctive way of presenting them.

4. Ethics and Human Nature

Hobbes’s moral thought is difficult to disentangle from his politics. On his view, what we ought to do depends greatly on the situation in which we find ourselves. Where political authority is lacking (as in his famous natural condition of mankind), our fundamental right seems to be to save our skins, by whatever means we think fit. Where political authority exists, our duty seems to be quite straightforward: to obey those in power.

But we can usefully separate the ethics from the politics if we follow Hobbes’s own division. For him ethics is concerned with human nature, while political philosophy deals with what happens when human beings interact. What, then, is Hobbes’s view of human nature?

a. Materialism Versus Self-Knowledge

Reading the opening chapters of Leviathan is a confusing business, and the reason for this is already apparent in Hobbes’s very short Introduction. He begins by telling us that the human body is like a machine, and that political organization (the commonwealth) is like an artificial human being. He ends by saying that the truth of his ideas can be gauged only by self-examination, by looking into our selves to adjudge our characteristic thoughts and passions, which form the basis of all human action. But what is the relationship between these two very different claims? For obviously when we look into our selves we do not see mechanical pushes and pulls. This mystery is hardly answered by Hobbes’s method in the opening chapters, where he persists in talking about all manner of psychological phenomena—from emotions to thoughts to whole trains of reasoning – as products of mechanical interactions. (As to what he will say about successful political organization, the resemblance between the commonwealth and a functioning human being is slim indeed. Hobbes’s only real point seems to be that there should be a head that decides most of the important things that the body does.)

Most commentators now agree with an argument made in the 1960’s by the political philosopher Leo Strauss. Hobbes draws on his notion of a mechanistic science, that works deductively from first principles, in setting out his ideas about human nature. Science provides him with a distinctive method and some memorable metaphors and similes. What it does not provide—nor could it, given the rudimentary state of physiology and psychology in Hobbes’s day—are any decisive or substantive ideas about what human nature really is. Those ideas may have come, as Hobbes also claims, from self-examination. In all likelihood, they actually derived from his reflection on contemporary events and his reading of classics of political history such as Thucydides.

This is not to say that we should ignore Hobbes’s ideas on human nature—far from it. But it does mean we should not be misled by scientific imagery that stems from an in fact non-existent science (and also, to some extent, from an unproven and uncertain metaphysics). The point is important mainly when it comes to a central interpretative point in Hobbes’s work: whether or not he thinks of human beings as mechanical objects, programmed as it were to pursue their self-interest. Some have suggested that Hobbes’s mechanical world-view leaves no room for the influence of moral ideas, that he thinks the only effective influence on our behavior will be incentives of pleasure and pain. But while it is true that Hobbes sometimes says things like this, we should be clear that the ideas fit together only in a metaphorical way. For example, there is no reason why moral ideas should not “get into” the mechanisms that drive us round (like so many clock-work dolls perhaps?). Likewise, there is no reason why pursuing pleasure and pain should work in our self-interest. (What self-interest is depends on the time-scale we adopt, and how effectively we might achieve this goal also depends on our insight into what harms and benefits us). If we want to know what drives human beings, on Hobbes’s view, we must read carefully all he says about this, as well as what he needs to assume if the rest of his thought is to make sense. The mechanistic metaphor is something of a red herring and, in the end, probably less useful than his other starting point in Leviathan, the Delphic epithet: nosce teipsum (know thyself).

b. The Poverty of Human Judgment and our Need for Science

There are two major aspects to Hobbes’s picture of human nature. As we have seen, and will explore below, what motivates human beings to act is extremely important to Hobbes. The other aspect concerns human powers of judgment and reasoning, about which Hobbes tends to be extremely skeptical. Like many philosophers before him, Hobbes wants to present a more solid and certain account of human morality than is contained in everyday beliefs. Plato had contrasted knowledge with opinion. Hobbes contrasts science with a whole raft of less reliable forms of belief—from probable inference based on experience, right down to “absurdity, to which no living creature is subject but man” (Leviathan, v.7).

Hobbes has several reasons for thinking that human judgment is unreliable, and needs to be guided by science. Our judgments tend to be distorted by self-interest or by the pleasures and pains of the moment. We may share the same basic passions, but the various things of the world affect us all very differently; and we are inclined to use our feelings as measures for others. It becomes dogmatic through vanity and morality, as with “men vehemently in love with their own new opinions…and obstinately bent to maintain them, [who give] their opinions also that reverenced name of conscience” (Leviathan, vii.4). When we use words which lack any real objects of reference, or are unclear about the meaning of the words we use, the danger is not only that our thoughts will be meaningless, but also that we will fall into violent dispute. (Hobbes has scholastic philosophy in mind, but he also makes related points about the dangerous effects of faulty political ideas and ideologies.) We form beliefs about supernatural entities, fairies and spirits and so on, and fear follows where belief has gone, further distorting our judgment. Judgment can be swayed this way and that by rhetoric, that is, by the persuasive and “colored speech” of others, who can deliberately deceive us and may well have purposes that go against the common good or indeed our own good. Not least, much judgment is concerned with what we should do now, that is, with future events, “the future being but a fiction of the mind” (Leviathan, iii.7) and therefore not reliably known to us.

For Hobbes, it is only science, “the knowledge of consequences” (Leviathan, v.17), that offers reliable knowledge of the future and overcomes the frailties of human judgment. Unfortunately, his picture of science, based on crudely mechanistic premises and developed through deductive demonstrations, is not even plausible in the physical sciences. When it comes to the complexities of human behavior, Hobbes’s model of science is even less satisfactory. He is certainly an acute and wise commentator of political affairs; we can praise him for his hard-headedness about the realities of human conduct, and for his determination to create solid chains of logical reasoning. Nonetheless, this does not mean that Hobbes was able to reach a level of scientific certainty in his judgments that had been lacking in all previous reflection on morals and politics.

c. Motivation

The most consequential aspect of Hobbes’s account of human nature centers on his ideas about human motivation, and this topic is therefore at the heart of many debates about how to understand Hobbes’s philosophy. Many interpreters have presented the Hobbesian agent as a self-interested, rationally calculating actor (those ideas have been important in modern political philosophy and economic thought, especially in terms of rational choice theories). It is true that some of the problems that face people like this—rational egoists, as philosophers call them—are similar to the problems Hobbes wants to solve in his political philosophy. And it is also very common for first-time readers of Hobbes to get the impression that he believes we are all basically selfish.

There are good reasons why earlier interpreters and new readers tend to think the Hobbesian agent is ultimately self-interested. Hobbes likes to make bold and even shocking claims to get his point across. “I obtained two absolutely certain postulates of human nature,” he says, “one, the postulate of human greed by which each man insists upon his own private use of common property; the other, the postulate of natural reason, by which each man strives to avoid violent death” (De Cive, Epistle Dedicatory). What could be clearer?—We want all we can get, and we certainly want to avoid death. There are two problems with thinking that this is Hobbes’s considered view, however. First, quite simply, it represents a false view of human nature. People do all sorts of altruistic things that go against their interests. They also do all sorts of needlessly cruel things that go against self-interest (think of the self-defeating lengths that revenge can run to). So it would be uncharitable to interpret Hobbes this way, if we can find a more plausible account in his work. Second, in any case Hobbes often relies on a more sophisticated view of human nature. He describes or even relies on motives that go beyond or against self-interest, such as pity, a sense of honor or courage, and so on. And he frequently emphasizes that we find it difficult to judge or appreciate just what our interests are anyhow. (Some also suggest that Hobbes’s views on the matter shifted away from egoism after De Cive, but the point is not crucial here.)

The upshot is that Hobbes does not think that we are basically or reliably selfish; and he does not think we are fundamentally or reliably rational in our ideas about what is in our interests. He is rarely surprised to find human beings doing things that go against self-interest: we will cut off our noses to spite our faces, we will torture others for their eternal salvation, we will charge to our deaths for love of country. In fact, a lot of the problems that befall human beings, according to Hobbes, result from their being too little concerned with self-interest. Too often, he thinks, we are too much concerned with what others think of us, or inflamed by religious doctrine, or carried away by others’ inflammatory words. This weakness as regards our self-interest has even led some to think that Hobbes is advocating a theory known as ethical egoism. This is to claim that Hobbes bases morality upon self-interest, claiming that we ought to do what it is most in our interest to do. But we shall see that this would over-simplify the conclusions that Hobbes draws from his account of human nature.

d. Political Philosophy

This is Hobbes’s picture of human nature. We are needy and vulnerable. We are easily led astray in our attempts to know the world around us. Our capacity to reason is as fragile as our capacity to know; it relies upon language and is prone to error and undue influence. When we act, we may do so selfishly or impulsively or in ignorance, on the basis of faulty reasoning or bad theology or others’ emotive speech.

What is the political fate of this rather pathetic sounding creature—that is, of us? Unsurprisingly, Hobbes thinks little happiness can be expected of our lives together. The best we can hope for is peaceful life under an authoritarian-sounding sovereign. The worst, on Hobbes’s account, is what he calls the natural condition of mankind, a state of violence, insecurity and constant threat. In outline, Hobbes’s argument is that the alternative to government is a situation no one could reasonably wish for, and that any attempt to make government accountable to the people must undermine it, so threatening the situation of non-government that we must all wish to avoid. Our only reasonable option, therefore, is a “sovereign” authority that is totally unaccountable to its subjects. Let us deal with the “natural condition” of non-government, also called the “state of nature,” first of all.

5. The Natural Condition of Mankind

The state of nature is “natural” in one specific sense only. For Hobbes political authority is artificial: in the “natural” condition human beings lack government, which is an authority created by men. What is Hobbes’s reasoning here? He claims that the only authority that naturally exists among human beings is that of a mother over her child, because the child is so very much weaker than the mother (and indebted to her for its survival). Among adult human beings this is invariably not the case. Hobbes concedes an obvious objection, admitting that some of us are much stronger than others. And although he is very sarcastic about the idea that some are wiser than others, he does not have much difficulty with the idea that some are fools and others are dangerously cunning. Nonetheless, it is almost invariably true that every human being is capable of killing any other. “Even the strongest must sleep; even the weakest might persuade others to help him kill another”. (Leviathan, xiii.1-2) Because adults are equal in this capacity to threaten one another’s lives, Hobbes claims there is no natural source of authority to order their lives together. (He is strongly opposing arguments that established monarchs have a natural or God-given right to rule over us.)

Thus, as long as human beings have not successfully arranged some form of government, they live in Hobbes’s state of nature. Such a condition might occur at the “beginning of time” (see Hobbes’s comments on Cain and Abel, Leviathan, xiii.11, Latin version only), or in “primitive” societies (Hobbes thought the American Indians lived in such a condition). But the real point for Hobbes is that a state of nature could just as well occur in seventeenth century England, should the King’s authority be successfully undermined. It could occur tomorrow in every modern society, for example, if the police and army suddenly refused to do their jobs on behalf of government. Unless some effective authority stepped into the King’s place (or the place of army and police and government), Hobbes argues the result is doomed to be deeply awful, nothing less than a state of war.

Why should peaceful cooperation be impossible without an overarching authority? Hobbes provides a series of powerful arguments that suggest it is extremely unlikely that human beings will live in security and peaceful cooperation without government. (Anarchism, the thesis that we should live without government, of course disputes these arguments.) His most basic argument is threefold. (Leviathan, xiii.3-9) (i) He thinks we will compete, violently compete, to secure the basic necessities of life and perhaps to make other material gains. (ii) He argues that we will challenge others and fight out of fear (“diffidence”), so as to ensure our personal safety. (iii) And he believes that we will seek reputation (“glory”), both for its own sake and for its protective effects (for example, so that others will be afraid to challenge us).

This is a more difficult argument than it might seem. Hobbes does not suppose that we are all selfish, that we are all cowards, or that we are all desperately concerned with how others see us. Two points, though. First, he does think that some of us are selfish, some of us cowardly, and some of us “vainglorious” (perhaps some people are of all of these!). Moreover, many of these people will be prepared to use violence to attain their ends—especially if there is no government or police to stop them. In this Hobbes is surely correct. Second, in some situations it makes good sense, at least in the short term, to use violence and to behave selfishly, fearfully or vaingloriously. If our lives seem to be at stake, after all, we are unlikely to have many scruples about stealing a loaf of bread; if we perceive someone as a deadly threat, we may well want to attack first, while his guard is down; if we think that there are lots of potential attackers out there, it is going to make perfect sense to get a reputation as someone who should not be messed with. In Hobbes’s words, “the wickedness of bad men also compels good men to have recourse, for their own protection, to the virtues of war, which are violence and fraud”. (De Cive, Epistle Dedicatory) As well as being more complex than first appears, Hobbes’s argument becomes very difficult to refute.

Underlying this most basic argument is an important consideration about insecurity. As we shall see Hobbes places great weight on contracts (thus some interpreters see Hobbes as heralding a market society dominated by contractual exchanges). In particular, he often speaks of “covenants,” by which he means a contract where one party performs his part of the bargain later than the other. In the state of nature such agreements are not going to work. Only the weakest will have good reason to perform the second part of a covenant, and then only if the stronger party is standing over them. Yet a huge amount of human cooperation relies on trust, that others will return their part of the bargain over time. A similar point can be made about property, most of which we cannot carry about with us and watch over. This means we must rely on others respecting our possessions over extended periods of time. If we cannot do this, then many of the achievements of human society that involve putting hard work into land (farming, building) or material objects (the crafts, or modern industrial production, still unknown in Hobbes’s time) will be near impossible.

One can reasonably object to such points: Surely there are basic duties to reciprocate fairly and to behave in a trustworthy manner? Even if there is no government providing a framework of law, judgment and punishment, do not most people have a reasonable sense of what is right and wrong, which will prevent the sort of contract-breaking and generalized insecurity that Hobbes is concerned with? Indeed, should not our basic sense of morality prevent much of the greed, pre-emptive attack and reputation-seeking that Hobbes stressed in the first place? This is the crunch point of Hobbes’s argument, and it is here (if anywhere) that one can accuse Hobbes of pessimism. He makes two claims. The first concerns our duties in the state of nature (that is, the so-called “right of nature”). The second follows from this, and is less often noticed: it concerns the danger posed by our different and variable judgments of what is right and wrong.

On Hobbes’s view the right of nature is quite simple to define. Naturally speaking—that is, outside of civil society – we have a right to do whatever we think will ensure our self-preservation. The worst that can happen to us is violent death at the hands of others. If we have any rights at all, if (as we might put it) nature has given us any rights whatsoever, then the first is surely this: the right to prevent violent death befalling us. But Hobbes says more than this, and it is this point that makes his argument so powerful. We do not just have a right to ensure our self-preservation: we each have a right to judge what will ensure our self-preservation. And this is where Hobbes’s picture of humankind becomes important. Hobbes has given us good reasons to think that human beings rarely judge wisely. Yet in the state of nature no one is in a position to successfully define what is good judgment. If I judge that killing you is a sensible or even necessary move to safeguard my life, then—in Hobbes’s state of nature – I have a right to kill you. Others might judge the matter differently, of course. Almost certainly you will have quite a different view of things (perhaps you were just stretching your arms, not raising a musket to shoot me). Because we are all insecure, because trust is more-or-less absent, there is little chance of our sorting out misunderstandings peacefully, nor can we rely on some (trusted) third party to decide whose judgment is right. We all have to be judges in our own causes, and the stakes are very high indeed: life or death.

For this reason Hobbes makes very bold claims that sound totally amoral. “To this war of every man against every man,” he says, “this also is consequent [i.e., it follows]: that nothing can be unjust. The notions of right and wrong, justice and injustice have no place [in the state of nature]”. (Leviathan, xiii.13) He further argues that in the state of nature we each have a right to all things, “even to one another’s body’ (Leviathan, xiv.4). Hobbes is dramatizing his point, but the core is defensible. If I judge that I need such and such—an object, another person’s labor, another person’s death—to ensure my continued existence, then in the state of nature, there is no agreed authority to decide whether I’m right or wrong. New readers of Hobbes often suppose that the state of nature would be a much nicer place, if only he were to picture human beings with some basic moral ideas. But this is naïve: unless people share the same moral ideas, not just at the level of general principles but also at the level of individual judgment, then the challenge he poses remains unsolved: human beings who lack some shared authority are almost certain to fall into dangerous and deadly conflict.

There are different ways of interpreting Hobbes’s view of the absence of moral constraints in the state of nature. Some think that Hobbes is imagining human beings who have no idea of social interaction and therefore no ideas about right and wrong. In this case, the natural condition would be a purely theoretical construction, and would demonstrate what both government and society do for human beings. (A famous statement about the state of nature in De Cive (viii.1) might support this interpretation: “looking at men as if they had just emerged from the earth like mushrooms and grown up without any obligation to each other…”) Another, complementary view reads Hobbes as a psychological egoist, so that—in the state of nature as elsewhere – he is merely describing the interaction of ultimately selfish and amoral human beings.

Others suppose that Hobbes has a much more complex picture of human motivation, so that there is no reason to think moral ideas are absent in the state of nature. In particular, it is historically reasonable to think that Hobbes invariably has civil war in mind, when he describes our “natural condition.” If we think of civil war, we need to imagine people who have lived together and indeed still do live together—huddled together in fear in their houses, banded together as armies or guerrillas or groups of looters. The problem here is not a lack of moral ideas—far from it – rather that moral ideas and judgments differ enormously. This means (for example) that two people who are fighting tooth and nail over a cow or a gun can both think they are perfectly entitled to the object and both think they are perfectly right to kill the other—a point Hobbes makes explicitly and often. It also enables us to see that many Hobbesian conflicts are about religious ideas or political ideals (as well as self-preservation and so on)—as in the British Civil War raging while Hobbes wrote Leviathan, and in the many violent sectarian conflicts throughout the world today.

In the end, though, whatever account of the state of nature and its (a) morality we attribute to Hobbes, we must remember that it is meant to function as a powerful and decisive threat: if we do not heed Hobbes’s teachings and fail to respect existing political authority, then the natural condition and its horrors of war await us.

a. The Laws of Nature and the Social Contract

Hobbes thinks the state of nature is something we ought to avoid, at any cost except our own self-preservation (this being our “right of nature,” as we saw above). But what sort of ought is this? There are two basic ways of interpreting Hobbes here. It might be a counsel of prudence: avoid the state of nature, if you’re concerned to avoid violent death. In this case Hobbes’s advice only applies to us (i) if we agree that violent death is what we should fear most and should therefore avoid; and (ii) if we agree with Hobbes that only an unaccountable sovereign stands between human beings and the state of nature. This line of thought fits well with an egoistic reading of Hobbes, but it faces serious problems, as will be seen.

The other way of interpreting Hobbes is not without problems either. This takes Hobbes to be saying that we ought, morally speaking, to avoid the state of nature. We have a duty to do what we can to avoid this situation arising, and a duty to end it, if at all possible. Hobbes often makes his view clear, that we have such moral obligations. But then two difficult questions arise: Why these obligations? And why are they obligatory?

Hobbes frames the issues in terms of an older vocabulary, using the idea of natural law that many ancient and medieval philosophers had relied on. Like them, he thinks that human reason can discern some eternal principles to govern our conduct. These principles are independent of (though also complementary to) whatever moral instruction we might get from God or religion. In other words, they are laws given by nature rather than revealed by God. But Hobbes makes radical changes to the content of these so-called laws of nature. In particular, he does not think that natural law provides any scope whatsoever to criticize or disobey the actual laws made by a government. He thus disagrees with those Protestants who thought that religious conscience might sanction disobedience of immoral laws, and with Catholics who thought that the commandments of the Pope have primacy over those of national political authorities.

Although he sets out nineteen laws of nature, it is the first two that are politically crucial. A third, that stresses the important of keeping to contracts we have entered into, is important in Hobbes’s moral justifications of obedience to the sovereign. (The remaining sixteen can be quite simply encapsulated in the formula, do as you would be done by. While the details are important for scholars of Hobbes, they do not affect the overall theory and will be ignored here.)

The first law reads as follows:

Every man ought to endeavor peace, as far as he has hope of obtaining it, and when he cannot obtain it, that he may seek and use all helps and advantages of war. (Leviathan, xiv.4)

This repeats the points we have already seen about our right of nature, so long as peace does not appear to be a realistic prospect. The second law of nature is more complicated:

That a man be willing, when others are so too, as far-forth as for peace and defense of himself he shall think it necessary, to lay down this right to all things, and be contented with so much liberty against other men, as he would allow other men against himself. (Leviathan, xiv.5)

What Hobbes tries to tackle here is the transition from the state of nature to civil society. But how he does this is misleading and has generated much confusion and disagreement. The way that Hobbes describes this second law of nature makes it look as if we should all put down our weapons, give up (much of) our “right of nature,” and jointly authorize a sovereign who will tell us what is permitted and punish us if we do not obey. But the problem is obvious. If the state of nature is anything like as bad as Hobbes has argued, then there is just no way people could ever make an agreement like this or put it into practice.

At the end of Leviathan, Hobbes seems to concede this point, saying “there is scarce a commonwealth in the world whose beginnings can in conscience be justified” (Review and Conclusion, 8). That is: governments have invariably been foisted upon people by force and fraud, not by collective agreement. But Hobbes means to defend every existing government that is powerful enough to secure peace among its subjects—not just a mythical government that’s been created by a peaceful contract out of a state of nature. His basic claim is that we should behave as if we had voluntarily entered into such a contract with everyone else in our society—everyone else, that is, except the sovereign authority.

In Hobbes’s myth of the social contract, everyone except the person or group who will wield sovereign power lays down their “right to all things.” They agree to limit drastically their right of nature, retaining only a right to defend their lives in case of immediate threat. (How limited this right of nature becomes in civil society has caused much dispute, because deciding what is an immediate threat is a question of judgment. It certainly permits us to fight back if the sovereign tries to kill us. But what if the sovereign conscripts us as soldiers? What if the sovereign looks weak and we doubt whether he can continue to secure peace…?) The sovereign, however, retains his (or her, or their) right of nature, which we have seen is effectively a right to all things—to decide what everyone else should do, to decide the rules of property, to judge disputes and so on. Hobbes concedes that there are moral limits on what sovereigns should do (God might call a sovereign to account). However, since in any case of dispute the sovereign is the only rightful judge—on this earth, that is – those moral limits make no practical difference. In every moral and political matter, the decisive question for Hobbes is always: who is to judge? As we have seen, in the state of nature, each of us is judge in our own cause, part of the reason why Hobbes thinks it is inevitably a state of war. Once civil society exists, the only rightful judge is the sovereign.

b. Why Should we Obey the Sovereign?

If we had all made a voluntary contract, a mutual promise, then it might seem half-way plausible to think we have an obligation to obey the sovereign (although even this requires the claim that promising is a moral value that overrides all others). If we have been conquered or, more fortunately, have simply been born into a society with an established political authority, this seems quite improbable. Hobbes has to make three steps here, all of which have seemed weak to many of his readers. First of all, he insists that promises made under threat of violence are nonetheless freely made, and just as binding as any others. Second, he has to put great weight on the moral value of promise keeping, which hardly fits with the absence of duties in the state of nature. Third, he has to give a story of how those of us born and raised in a political society have made some sort of implied promise to each other to obey, or at least, he has to show that we are bound (either morally or out of self-interest) to behave as if we had made such a promise.

In the first place, Hobbes draws on his mechanistic picture of the world, to suggest that threats of force do not deprive us of liberty. Liberty, he says, is freedom of motion, and I am free to move whichever way I wish, unless I am literally enchained. If I yield to threats of violence, that is my choice, for physically I could have done otherwise. If I obey the sovereign for fear of punishment or in fear of the state of nature, then that is equally my choice. Such obedience then comes, for Hobbes, to constitute a promise that I will continue to obey.

Second, promises carry a huge moral weight for Hobbes, as they do in all social contract theories. The question, however, is why we should think they are so important. Why should my (coerced) promise oblige me, given the wrong you committed in threatening me and demanding my valuables? Hobbes has no good answer to this question (but see below, on egoistic interpretations of Hobbes’s thinking here). His theory suggests that (in the state of nature) you could do me no wrong, as the right of nature dictates that we all have a right to all things. Likewise, promises do not oblige in the state of nature, inasmuch as they go against our right of nature. In civil society, the sovereign’s laws dictate what is right and wrong; if your threat was wrongful, then my promise will not bind me. But as the sovereign is outside of the original contract, he sets the terms for everyone else: so his threats create obligations.

As this suggests, Hobbesian promises are strangely fragile. Implausibly binding so long as a sovereign exists to adjudicate and enforce them, they lose all power should things revert to a state of nature. Relatedly, they seem to contain not one jot of loyalty. To be logically consistent, Hobbes needs to be politically implausible. Now there are passages where Hobbes sacrifices consistency for plausibility, arguing we have a duty to fight for our (former) sovereign even in the midst of civil war. Nonetheless the logic of his theory suggests that, as soon as government starts to weaken and disorder sets in, our duty of obedience lapses. That is, when the sovereign power needs our support, because it is no longer able to coerce us, there is no effective judge or enforcer of covenants, so that such promises no longer override our right of nature. This turns common sense on its head. Surely a powerful government can afford to be challenged, for instance by civil disobedience or conscientious objection? But when civil conflict and the state of nature threaten, in other words when government is failing, then we might reasonably think that political unity is as morally important as Hobbes always suggests. A similar question of loyalty also comes up when the sovereign power has been usurped—when Cromwell has supplanted the King, when a foreign invader has ousted our government. Right from the start, Hobbes’s critics saw that his theory makes turncoats into moral heroes: our allegiance belongs to whoever happens to be holding the gun(s). Perversely, the only crime the makers of a coup can commit is to fail.

Why does this problem come about? To overcome the fact that his contract is a fiction, Hobbes is driven to construct a “sort of” promise out of the fact of our subjugation to whatever political authority exists. He stays wedded to the idea that obedience can only find a moral basis in a “voluntary” promise, because only this seems to justify the almost unlimited obedience and renunciation of individual judgment he is determined to prove. It is no surprise that Hobbes’s arguments creak at every point: nothing could bear the weight of justifying such an overriding duty.

All the difficulties in finding a reliable moral obligation to obey might tempt us back to the idea that Hobbes is some sort of egoist. However, the difficulties with this tack are even greater. There are two sorts of egoism commentators have attributed to Hobbes: psychological and ethical. The first theory says that human beings always act egoistically, the second that they ought to act egoistically. Either view might support this simple idea: we should obey the sovereign, because his political authority is what keeps us from the evils of the natural condition. But the basic problem with such egoistic interpretations, from the point of view of Hobbes’s system of politics, is shown when we think about cases where selfishness seems to conflict with the commands of the sovereign—for example, where illegal conduct will benefit us or keep us from danger. For a psychologically egoist agent, such behavior will be irresistible; for an ethically egoist agent, it will be morally obligatory. Now, providing the sovereign is sufficiently powerful and well-informed, he can prevent many such cases arising by threatening and enforcing punishments of those who disobey. Effective threats of punishment mean that obedience is in our self-interest. But such threats will not be effective when we think our disobedience can go undetected. After Orwell’s 1984 we can imagine a state that is so powerful that no reasonable person would ever think disobedience could pay. But for Hobbes, such a powerful sovereign was not even conceivable: he would have had to assume that there would be many situations where people could reasonably hope to “get away with it.” (Likewise, under non-totalitarian, liberal politics, there are many situations where illegal behavior is very unlikely to be detected or punished.) So, still thinking of egoistic agents, the more people do get away with it, the more reason others have to think they can do the same. Thus the problem of disobedience threatens to “snowball,” undermining the sovereign and plunging selfish agents back into the chaos of the state of nature.

In other words, sovereignty as Hobbes imagined it, and liberal political authority as we know it, can only function where people feel some additional motivation apart from pure self-interest. Moreover, there is strong evidence that Hobbes was well aware of this. Part of Hobbes’s interest in religion (a topic that occupies half of Leviathan) lies in its power to shape human conduct. Sometimes this does seem to work through self-interest, as in crude threats of damnation and hell-fire. But Hobbes’s main interest lies in the educative power of religion, and indeed of political authority. Religious practices, the doctrines taught in the universities (!), the beliefs and habits inculcated by the institutions of government and society: how these can encourage and secure respect for law and authority seem to be even more important to Hobbes’s political solutions than his theoretical social contract or shaky appeals to simple self-interest.

What are we to conclude, then, given the difficulties in finding a reliable moral or selfish justification for obedience? In the end, for Hobbes, everything rides on the value of peace. Hobbes wants to say both that civil order is in our “enlightened” self-interest, and that it is of overwhelming moral value. Life is never going to be perfect for us, and life under the sovereign is the best we can do. Recognizing this aspect of everyone’s self-interest should lead us to recognize the moral value of supporting whatever authority we happen to live under. For Hobbes, this moral value is so great—and the alternatives so stark – that it should override every threat to our self-interest except the imminent danger of death. The million-dollar question is then: is a life of obedience to the sovereign really the best human beings can hope for?

c. Life Under the Sovereign

Hobbes has definite ideas about the proper nature, scope and exercise of sovereignty. Much that he says is cogent, and much of it can reduce the worries we might have about living under this drastically authoritarian sounding regime. Many commentators have stressed, for example, the importance Hobbes places upon the rule of law. His claim that much of our freedom, in civil society, “depends on the silence of the laws” is often quoted (Leviathan, xxi.18). In addition, Hobbes makes many points that are obviously aimed at contemporary debates about the rights of King and Parliament—especially about the sovereign’s rights as regards taxation and the seizure of property, and about the proper relation between religion and politics. Some of these points continue to be relevant, others are obviously anachronistic: evidently Hobbes could not have imagined the modern state, with its vast bureaucracies, massive welfare provision and complicated interfaces with society. Nor could he have foreseen how incredibly powerful the state might become, meaning that sovereigns such as Hitler or Stalin might starve, brutalize and kill their subjects, to such an extent that the state of nature looks clearly preferable.

However, the problem with all of Hobbes’s notions about sovereignty is that—on his account – it is not Hobbes the philosopher, nor we the citizens, who decide what counts as the proper nature, scope or exercise of sovereignty. He faces a systematic problem: justifying any limits or constraints on the sovereign involves making judgments about moral or practical requirements. But one of his greatest insights, still little recognized by many moral philosophers, is that any right or entitlement is only practically meaningful when combined with a concrete judgment as to what it dictates in some given case. Hobbes’s own failure, however understandable, to foresee the growth of government and its powers only supports this thought: that the proper nature, scope or exercise of sovereignty is a matter of complex judgment. Alone among the people who comprise Hobbes’s commonwealth, it is the sovereign who judges what form he should appear in, how far he should reach into the lives of his subjects, and how he should exercise his powers.

It should be added that the one part of his system that Hobbes concedes not to be proven with certainty is just this question: who or what should constitute the sovereign power. It was natural for Hobbes to think of a King, or indeed a Queen (he was born under Elizabeth I). But he was certainly very familiar with ancient forms of government, including aristocracy (government by an elite) and democracy (government by the citizens, who formed a relatively small group within the total population). Hobbes was also aware that an assembly such as Parliament could constitute a sovereign body. All have advantages and disadvantages, he argues. But the unity that comes about from having a single person at the apex, together with fixed rules of succession that pre-empt dispute about who this person should be, makes monarchy Hobbes’s preferred option.

In fact, if we want to crack open Hobbes’s sovereign, to be able to lay down concrete ideas about its nature and limits, we must begin with the question of judgment. For Hobbes, dividing capacities to judge between different bodies is tantamount to letting the state of nature straight back in. “For what is it to divide the power of a commonwealth, but to dissolve it; for powers divided mutually destroy each other”. (Leviathan, xxix.12; cf De Cive, xii.5) Beyond the example of England in the 1640s, Hobbes hardly bothers to argue the point, although it is crucial to his entire theory. Always in his mind is the Civil War that arose when Parliament claimed the right to judge rules of taxation, and thereby prevented the King from ruling and making war as he saw fit, and when churches and religious sects claimed prerogatives that went against the King’s decisions.

Especially given modern experiences of the division of powers, however, it is easy to see that these examples are extreme and atypical. We might recall the American constitution, where powers of legislation, execution and case-by-case judgment are separated (to Congress, President and the judiciary respectively) and counter-balance one another. Each of these bodies is responsible for judging different questions. There are often, of course, boundary disputes, as to whether legislative, executive or judicial powers should apply to a given issue, and no one body is empowered to settle this crucial question of judgment. Equally obviously, however, such disputes have not led to a state of nature (well, at least if we think of the US after the Civil War). For Hobbes it is simply axiomatic that disputation as to who should judge important social and political issues spells the end of the commonwealth. For us, it is equally obvious that only a few extreme forms of dispute have this very dangerous power. Dividing the powers that are important to government need not leave a society more open to those dangerous conflicts. Indeed, many would now argue that political compromises which provide different groups and bodies with independent space to judge certain social or political issues can be crucial for preventing disputes from escalating into violent conflict or civil war.

6. Conclusion

What happens, then, if we do not follow Hobbes in his arguments that judgment must, by necessity or by social contract or both, be the sole province of the sovereign? If we are optimists about the power of human judgment, and about the extent of moral consensus among human beings, we have a straightforward route to the concerns of modern liberalism. Our attention will not be on the question of social and political order, rather on how to maximize liberty, how to define social justice, how to draw the limits of government power, and how to realize democratic ideals. We will probably interpret Hobbes as a psychological egoist, and think that the problems of political order that obsessed him were the product of an unrealistic view of human nature, or unfortunate historical circumstances, or both. In this case, I suggest, we might as well not have read Hobbes at all.

If we are less optimistic about human judgment in morals and politics, however, we should not doubt that Hobbes’s problems remain our problems. But hindsight shows grave limitations to his solutions. Theoretically, Hobbes fails to prove that we have an almost unlimited obligation to obey the sovereign. His arguments that sovereignty—the power to judge moral and political matters, and enforce those judgments—cannot be divided are not only weak; they are simply refuted by the (relatively) successful distribution of powers in modern liberal societies. Not least, the horrific crimes of twentieth century dictatorships show beyond doubt that judgment about right and wrong cannot be a question only for our political leaders.

If Hobbes’s problems are real and his solutions only partly convincing, where will we go? It might reasonably be thought that this is the central question of modern political thought. We will have no doubt that peaceful coexistence is one of the greatest goods of human life, something worth many inconveniences, sacrifices and compromises. We will see that there is moral force behind the laws and requirements of the state, simply because human beings do indeed need authority and systems of enforcement if they are to cooperate peacefully. But we can hardly accept that, because human judgment is weak and faulty, that there can be only one judge of these matters—precisely because that judge might turn out to be very faulty indeed. Our concern will be how we can effectively divide power between government and people, while still ensuring that important questions of moral and political judgment are peacefully adjudicated. We will be concerned with the standards and institutions that provide for compromise between many different and conflicting judgments. And all the time, we will remember Hobbes’s reminder that human life is never without inconvenience and troubles, that we must live with a certain amount of bad, to prevent the worst: fear of violence, and violent death.

7. References and Further Reading

  • Edwards, Alistair (2002) “Hobbes” in Interpreting Modern Political Philosophy: From Machiavelli to Marx, eds. A Edwards and J Townshend (Palgrave Macmillan, Houndmills)
    • A very helpful overview of key interpretative debates about Hobbes in the twentieth century.
  • Hill, Christopher (1961/1980) The Century of Revolution, 1603-1714, second ed (Routledge, London)
    • The classic work on the history and repercussions of England’s civil war.
  • Hobbes, Thomas (1998 [1642]) On the Citizen, ed & trans Richard Tuck and Michael Silverthorne (Cambridge University Press, Cambridge)
    • The best translation of Hobbes’s most straightforward book,De Cive.
  • Hobbes, Thomas (1994 [1651/1668]) Leviathan, ed Edwin Curley (Hackett, Indianapolis)
    • The best edition of Hobbes’s magnum opus, including extensive additional material and many important variations (ignored by all other editions) between the English text and later Latin edition.
  • Sorrell, Tom (1986) Hobbes (Routledge & Kegan Paul, London)
    • A concise and well-judged account of Hobbes’s life and works.
  • Sorrell, Tom, ed (1996) The Cambridge Companion to Hobbes (Cambridge University Press, Cambridge)
    • An excellent set of essays on all aspects of Hobbes’s intellectual endeavors.

Author Information

Garrath Williams
Email: g.d.williams@lancaster.ac.uk
Lancaster University
United Kingdom

Hippocrates (c. 450—c. 380 B.C.E.)

HippocratesHippocrates of Cos was said to have lived sometime between 450 BCE to 380 BCE. He was a physician, and the writings of the Corpus Hippocraticum provide a wealth of information on biomedical methodology and offer one of the first reflective codes of professional ethics. Though Plato (a contemporary) makes reference to Hippocrates (Phaedrus 270a and elsewhere), it is generally believed that most of the writings in the Corpus Hippocraticum are actually the work of a number of different writers. By convention of time, place and general approach a common name of ‘Hippocrates’ was assigned to the lot (without distinguishing those of the historical Hippocrates). Hippocrates and the other associated writers provide the modern student with a number of different sorts of insights.

On the biomedical methodology side, these writings provide the most detailed biomedical observations to date in the Western world. They also offer causal speculations that can be knitted together to form a theoretical framework for diagnosis and treatment. On the ethical side, their code of professional ethics is so well structured that it continues to stand as a model for other professions.

Table of Contents

  1. Biomedical Methodology
    1. The Four Humors
    2. An Ancient Debate: Are General Causal Theories Beneficial?
    3. Prognosis and Treatment
    4. The Hippocratic Writings and Hellenistic Medicine
  2. Ethics
    1. The Oath
    2. The Oath and Modern Codes of Conduct
  3. References and Further Reading
    1. Primary Sources
    2. Secondary Sources
    3. Published Conferences on Hippocrates

1. Biomedical Methodology

One way to parse the groups of Hippocratic writers revolves around their geographical origins: Cos vs. Cnidos. Though this classification is controversial, it is useful (whether one accepts the literal geographical demarcation) to mark some clear distinctions in the Hippocratic body of writing. It appears to be the case that the Cos writers sought to create general biomedical “laws” that for the most part would give the explanation for why someone was sick. Any physician might make reference to these “laws” and thereby have an etiology for the disease, and by extension a strategy for treatment.

a. The Four Humors

The most historically prominent theoretical scheme of the Coan writers was the doctrine of the four humors of the body: blood, phlegm, black bile, and yellow bile (or sometimes serum). Health was defined as the balance of the four humors. Disease was defined as the imbalance of the humors. When imbalance occurred, then the physician might intervene by making a correction to bring the body back into balance. For example, if the individual were too full of phlegm (making her phlegmatic or lethargic), then the phlegm must be countered. Citrus fruit was thought to be a counter-acting agent. Thus, if one feels lethargic, increasing one’s citrus intake will re-create balance. The treatment is, in fact, generally effective. Moderns might describe the therapy differently by ascribing the effect to vitamin-C, phosphorus, and natural sugar. This example illustrates the scope of the Hippocratic physician in this context: something like a cross between the modern roles of an herbalist dietician and a personal trainer. Nonetheless, the cures that were dictated by the four humor theory seemed to work well enough for this theory to extend to the nineteenth century (in various guises).

b. An Ancient Debate: Are General Causal Theories Beneficial?

Other biomedical writers–some say from Cnidos–held that strict empirical principles did not allow scientists to go far beyond the data. It was a better methodology for the biomedical practitioner to stay as close as possible to the data that were before him. This meant that each patient would be seen in her particularity. Such a method required careful trial and error observation and only slight manipulation of the patient in the form of treatment.

There was a great conflict in the ancient world concerning the status of observational conclusions (the empirically concrete). Should they be given in their specificity and remain as disparate, individual accounts, or should they be grouped and more general principles drawn from them? In this instance it was very much in dispute whether it was better to set out individual reports of particular illnesses (case studies) or to try to draw general rules from the particulars.

Take, for example Epidemics III:

THE MORTIFICATION OF THE GANGRENE. If the gangrene mortifies itself there is a head pain and frequently a scratchy throat; the sick limb loses sensation, a feeling of cold comes to the head and the affected limb sweats. He suddenly loses his speech and blows blood from his nose as he becomes pale. If the disease takes hold of the patient with a weak force, he recovers the discharged blood. If the disease takes him with a strong force, he dies promptly. In this case one induces sneezing by pleasant substances; one evacuates by the upper and lower. Alternatively those odors will be a little active. The soup will be light and hot. Wine is absolutely forbidden. (Epidemics III, Littré 7, p. 123)

In this passage one is left merely with symptoms and treatment. But when one practices medicine in this way there are severe restrictions. For the disease is seen as a collection of symptoms. The cure can only be guessed at unless it has been previously written down in a manual. When a physician is confronted with a novel disease he must find a similar set of symptoms and use that treatment. This aspect of the “trial and error” method brought harsh rebuke from Galen.

The point is that they [the Cnidians] looked at the varieties of symptoms which change for many reasons and failed to consider the specificity of the dispositions, as did Hippocrates, who used for their discovery a method only by using which, one can find the number of diseases . . . . Hippocrates censures the Cnidian physicians for their ignorance of the genera and species of diseases, and he points out the divisions by which what seems to be one becomes many by being divided. (Corpus Medicorum Graecorum 5.9.1, pp. 121-22; Claudii Galeni De Placitis Hippocratis et Platonis, ed. I. Mueller (Lipsiae, 1874), p. 776)

c. Prognosis and Treatment

What was it that made the Cnidians different from the Coan writers? This can be found by examining the two steps in any medical practice: Prognosis and Treatment. In the Coan work, On Prognosis, the writer suggests that prognosis consists in knowing the patient’s condition in the past, present, and the future. Now how could a physician know this? Well, this could also have been part of a handbook catalogued through similar case studies. The practitioner could memorize each individual description. Next, the practitioner could add to this his own experience. But the problem is that each case is individual. It possesses “nature” only in the sense of possessing a unique set of properties. The practitioner would not be in a good position to treat novel cases. When confronted with a novel case, the practitioner is left with seeking similar cases. The implied premise is that similar cases call for similar remedies. The more the experience, the more refined the practitioner can be in balancing similar cases with the remedies.

Obviously, much rides on the word, ‘similar.’ Is a rich body of knowledge enough? Is it not also requisite to have a classification procedure, which itself implies rules of classification. And how does one select and justify such rules? It would seem that we are pressed backwards toward archai, starting points for some axiomatic system (à la Aristotle’s Posterior Analytics, I, i-ii).

Such an alternative to the empiricist program aims at establishing a theory of causes that underlie individual cases. These causes such as the “hot,” “cold,” “wet,” and “dry” or “the four humors” are more general because they seek to describe a different sense of the nature of disease. ‘Nature’ in this context refers to the sort of condition that comes from observations based upon the individuality of actual patients. For here we are interested in the genera and species of the disease in question. Such an exercise creates a classification of types of diseases.

But for this classification not to be based upon accidental characteristics, it is requisite that it include the causal factors that operate to bring about the disease in the first place. This is really the foundational or causal network that is responsible for the disease’s very existence. Such an understanding of “nature” moves away from individuals and their “similarities” toward the theoretical. Understood in this way, the nature of disease is a regulating factor upon the prognosis of the physician. This nature must be understood in order to offer treatment. In this sense, nature is the overarching principles that give an account of the mechanism of the disease. What made the Coan writers so attractive to Galen was that they investigated various senses of nature while the Cnidians confined themselves only to the data as they presented themselves.

d. The Hippocratic Writings and Hellenistic Medicine

The Hippocratic writings were influential in the development of later biomedical practitioners. The three principal Hellenistic schools: Dogmatists, Methodists, and the Empirics all hearken back in various ways to the Hippocratic writings. Many debates in the Hippocratic writings (such as the “preformation” vs. “epigenesis” debate) are picked-up again and given a twist according to the predilections of the Hellenistic schools. Galen, himself, often cites Hippocrates, aka “the Hippocratic writers,” as the point of departure for his own theory building. Thus, it would be fair to say that not only were the Hippocratic writers the first systematic biomedical writers in the Western tradition, but also the most influential to later writers.

2. Ethics

In the time of Hippocrates (and the other associated writers) there were many who wanted to pass themselves off as physicians. These individuals had not gone through an apprenticeship and thus had no specialized (professional) knowledge. Because of this, these con men went about fleecing customers. This created a problem for those who entered the study of medicine the traditional way. These more careful practitioners had to distinguish themselves from the charlatans. The way most professions try to deal with this sort of problem and the legitimate problems that arise during practice is to create codes of conduct and structures of accreditation. The most famous of these in the biomedical tradition is: The Oath of Hippocrates.

a. The Oath

By Apollo (the physician), by Asclepius (god of healing), by Hygeia (god of health), by Panacea (god of remedy), and all the gods and goddesses, together as witnesses, I hereby swear that I will carry out, inasmuch as I am able and true to my considered judgment, this oath and the ensuing duties:

  1. To hold my teacher in this art on a par with my parents. To make my teacher a partner in my livelihood To look after my teacher and financially share with her/him when s/he is in need. To consider him/her as a brother/sister along with his/her family. To teach his/her family the art of medicine, if they want to learn it, without tuition or any other conditions of service. To impart all the lessons necessary to practice medicine to my own sons and daughters, the sons and daughters of my teacher and to my own students, who have taken this oath-but to no one else.
  2. I will help the sick according to my skill and judgment, but never with an intent to do harm or injury to another.
  3. I will never administer poison to anyone-even when asked to do so. Nor will I ever suggest a way that others (even the patient) could do so. Similarly, I will never induce an abortion. Instead, I will keep holy my life and art.
  4. I will not engage in surgery–not even upon suffers from stone, but will withdraw in favor of others who do this work.
  5. Whoever I visit, rich or poor, I will concern myself with the well being of the sick. I will commit no intentional misdeeds, nor any other harmful action such as engaging in sexual relations with my patients (regardless of their status).
  6. Whatever I hear or see in the course of my professional duties (or even outside the course of treatment) regarding my patients is strictly confidential and I will not allow it to be spread about. But instead, will hold these as holy secrets.

Now if I carry out this oath and not break its injunctions, may I enjoy a good life and may my reputation be pure and honored for all generations. But if I fail and break this oath, then may the opposite befall me.

Within this oath are both a moral code for the profession of medicine and the outlines of a system of accreditation for new physicians via an apprenticeship. These two functions went a long way to establishing medicine as a profession that ordinary people could trust.

b. The Oath and Modern Codes of Conduct

In the modern world there are many professional codes of conduct. One could look at the American Medical Association Code, the American Bar Association Code, et al. However, the Hippocratic Oath set the standard of what a professional code is. A few key features that will tell why one should accept or reject such codes as solutions to the problems that have been outlined.

It is this author’s opinion that among professional codes, the Hippocratic Oath is a good one. It balances between very specific prohibitions such as not administering poison or not having sexual relations with one’s patients, to more general principles such as “I will concern myself with the well being of the sick.” and “do no harm.” These general principles are very useful because they govern a larger domain than simply prohibiting a particular action. These principles are not set out without context. Instead they are put into the context medicine’s mission.

Beginning in #1 the tone is set that medicine is an art that is “given by the gods.” It is an esoteric art that is to be reserved for those who are willing to commit to the provisions of the code. Thus, it is not open to everyone. This fulfills the condition of specialized knowledge mentioned earlier. It is for the sake of doing good to others and always avoiding harm. This fulfills the condition of providing a service for others.

Thirdly, the code ties itself to the larger moral tradition, “I will commit no intentional misdeeds.” Whereas “harm” has a direct link to manner in which medicine is practiced, “misdeeds” links the physician to the larger moral tradition. There is no possible hiding in the shared community perspective alone.

These three factors are the basis of any good professional code.

A Good Professional Code Should Contain

  1. A specific listing of common abuses.
  2. A few general guidelines that tie behavior to the mission of the profession.
  3. A link to general theories of morality.

Where codes of professional ethics fail is in overemphasizing one of these elements too highly or in ignoring an element entirely. If codes of ethics exist in order to remedy the “inward perspective” problem described above, then they must create links to more general “shared worldviews.” This would put them in the realm of common morality.

This is the most important point from my perspective. So often the “practice” of the profession defines its excellence in an introspective way such that the achievement of these functional requirements is all that matters-divorced from any other visions, namely, moral visions.

In the modern arena, many professional codes have evolved from a legal perspective. The practitioners of the profession do not want to go to jail or to be sued. Thus, they create certain codes that will make this possible situation less probable. These sorts of codes are defensive in nature and stand at the opposite end of the spectrum from the Hippocratic Oath. Their mission is not to set internal standards and link to common morality, rather they seek to “shave” as close as possible to maximizing an egoistic bottom line at the expense of the pillars of professionalism: one’s specialized education and one’s mission to serve others.

Any code that takes as its basis merely a negative approach designed to protect the practitioner from going to jail or being sued is fundamentally inadequate. This is not where one should set her sights. Rather, we should dream about what the profession may be-in the best of all possible worlds. The Oath of Hippocrates thus properly sets the mission that should drive all codes of ethics.

3. References and Further Reading

a. Primary Sources

  • Corpus Medicorum Graecorum (Berlin: Akademie-Verlag, on-going).
    • New editions of selected texts with excellent notes and apparatus by various editors.
  • Hippocrate, Oeuvres. Paris: Les Belles Lettres, 1967-2008.
  • Hippocrates, selected works Loeb series. 8 vols. Cambridge, MA: Harvard University Press, 1923-1995.
  • Littré, E. Oeuvres complète d’hippocrate 10 vols. (Paris: J. B. Billière, 1851).
    • The standard edition.

b. Secondary Sources

  • Bourgey, Louis, Observation et experience chez les médecins de la collection hippocratique. (Paris: J. Vrin, 1953).
    • A fine connection to principles in the philosophy of science.
  • Edelstein, Ludwig Ancient Medicine. (Baltimore, MD: Johns Hopkins University Press, 1967).
    • Still the best single treatment of ancient biomedical practitioners.
  • Jouanna, Jacques. Hipporcrate Translated as Hippocrates by M. B. DeBevoise (Baltimore, MD: Johns Hopkins University Press, 1999).
    • A broad and speculative treatment.
  • Jouanna, Jacques. Hippocrate: pour une archéologie d l’école de Cnide. (Paris: Belles Lettres, 1974).
    • A fine detailed analysis.
  • Lloyd, G.E.R. Early Greek Science: Thales to Aristotle. (New York: Norton, 1970).
    • An overview for the student interested in an introduction.
  • Lloyd, G.E.R. Magic, Reason, and Experience. (Cambridge: Cambridge University Press, 1979).
    • Perhaps Lloyd’s best analytical work by one of the finest practitioners of ancient scientific history.
  • Smith, Wesley. The Hippocratic Tradition. (Ithaca, NY: Cornell University Press, 1979). Second edition revised 2002 – available online at: http://www.bium.univ-paris5.fr/amn/Hippo2.pdf (accessed on August 26, 2009).
    • A solid overview by an excellent scholar.
  • Temkin, Owsei. Hippocrates in a World of Pagans and Christians. (Baltimore, MD: Johns Hopkins University Press, 1991).
    • A wide-ranging work that stimulates in the tradition of the history of ideas.

c. Published Conferences on Hippocrates

#1 French

  • La Collection Hippocratique et son role dans l’histoire medicine : Colloque de Strasbourg (23-27 Octobre 1972). (1975). Paper presented at the Colloque Sur La Collection Hippocratique Et Son Role Dans l’Histoire Medicale (1972 : Strasbourg); Universite Des Sciences Humaines De Strasbourg. Centre De Recherche Sur La Proche-Orient Et La Grece Antiques. Travaux, 2.

#2 French

  • Joly, R., (1977). Corpus hippocraticum : Actes du Colloque Hippocratique de Mons (22-26 Septembre 1975). Paper presented at the Colloque Hippocratique, 2nd, Mons, 1975.; Éditions Universitaires De Mons. Série Sciences Humaines ; 4.

#3 French

  • Grmek, M. D. (1980). Hippocratica : Actes du Colloque Hippocratique de Paris, 4-9 Septembre 1978. Paper presented at the Colloque Hippocratique De Paris (1978); Colloques Internationaux Du Centre National De La Recherche Scientifique ; no 583; Variation: Colloques Internationaux Du Centre National De La Recherche Scientifique ; no 583.

#4 French

  • Lasserre, F., & Mudry, P. (1983). Formes de pensée dans la collection Hippocratique : Actes du IVe Colloque International Hippocratique : Lausanne, 21-26 Septembre 1981. Paper presented at the International Hippocrates Colloquium (4th : 1981 : Lausanne, Switzerland); Publications De La Faculté Des Lettres ; 26; Variation: Publications De La Faculté Des Lettres (Université De Lausanne. Faculté Des Lettres) ; 26.

#5 German

  • Baader, G., Winau, R., Berliner Gesellschaft für Geschichte der Medizin, Freie Universität Berlin, & Institut für Geschichte der Medizin. (1989). Die Hippokratischen epidemien : Theorie-praxis-tradition : Verhandlungen des ve Colloque International Hippocratique. Paper presented at the International Hippocrates Colloquium (5th : 1984 : Berlin, Germany); Sudhoffs Archiv,; Beihefte ; Heft 27, 441.

#6 French

  • Potter, P., Maloney, G., & Desautels, J. (1990). La maladie et les maladies dans la Collection Hippocratique : Actes du VIe Colloque International Hippocratique, Québec du 28 Septembre au 3 Octobre 1987. Paper presented at the Colloque International Hippocratique (6e : 1987 : Québec, Québec),

#7 Spanish

  • López Férez, J. A. (1992). Tratados hipocráticos : Estudios acerca de su contenido, forma e influencia : Actas del VIIe Colloque International Hippocratique, Madrid, 24-29 de Septiembre de 1990. Paper presented at the Colloque International Hippocratique (7th : 1990 : Madrid, Spain),

#8 German

  • Wittern, R., & Pellegrin, P. (1996). Hippokratische medizin und antike philosophie : Verhandlungen des VIII. Internationalen Hippokrates-Kolloquiums in Kloster Banz/Staffelstein vom 23.-28. Sept. 1993. Paper presented at the International Hippocrates Colloquium (8th : 1993 : Kloster Banz/Staffelstein); Medizin Der Antike; Bd. 1, 654.

#9 multiple languages

  • Garofalo, I. (1999). Aspetti della terapia nel corpus hippocraticum : Atti del IXe Colloque International Hippocratique, Pisa, 25-29 Settembre 1996. Paper presented at the International Hippocrates Colloquium (9th : 1996 : Pisa, Italy); Studi / Accademia Toscana Di Scienze e Lettere La Colombaria; 183; Variation: Studi (Accademia Toscana Di Scienze e Lettere La Colombaria); 183. 716.

#10 French

  • Thivel, A., & Zucker, A. (2002). Le normal et le pathologique dans la collection Hippocratique : Actes du xème Colloque International Hippocratique, Nice, 6-8 Octobre 1999. Paper presented at the Colloque International Hippocratique (10th : 1999 : Nice, France),

#11 English

Author Information

Michael Boylan
Email: michael.boylan@marymount.edu
Marymount University
U. S. A.

Hipparchia (fl. 300 B.C.E.)

HipparchiaHipparchia is notable for being one of the few women philosophers of Ancient Greece.  Drawn to the doctrines and the self-imposed hardships of the Cynic lifestyle, Hipparchia lived in poverty with her husband, Crates the Cynic. While no existing writings are directly attributed to Hipparchia, recorded anecdotal accounts emphasize both her direct, Cynic rhetoric and her nonconformity to traditional gendered roles. Entering into marriage is a traditional social role that Cynics would normally reject; yet with her marriage to Crates, Hipparchia raised  Greek cultural expectations regarding the role of women in marriage, as well as the Cynic doctrine itself. With her husband, Hipparchia publicly embodied fundamental Cynic principles, specifically that the path toward virtue was the result of rational actors living in accordance with a natural law that eschewed conventional materialism and embraced both self-sufficiency and mental asperity.  Written accounts of Hipparchia’s life reference in particular both her belief in human shamelessness or anaideia, and her rhetorical acuity at Greek symposiums traditionally attended only by men.  Along with Crates, Hipparchia is considered a direct influence on the later school of Stoicism.

Table of Contents

  1. Life and Philosophy
  2. References and Further Reading

1. Life and Philosophy

Hipparchia was a Cynic philosopher from Maroneia in Thrace, who flourished around 300 B.C.E. She became famous for her marriage to Crates the Cynic, and infamous for supposedly consummating the marriage in public. Hipparchia was likely born between 340 and 330 B.C.E., and was probably in her mid-teens when she decided to adopt the Cynic mantle. She may have been introduced to philosophy by her brother, Metrocles, who was a pupil in Aristotle’s Lyceum and later began to follow Crates. Most of our knowledge about Hipparchia comes from anecdotes and sayings repeated by later authors. Diogenes Laertius reports that she wrote some letters, jokes and philosophical refutations, which are now lost (see Diogenes Laertius, Lives of Eminent Philosophers, Vol. II, tr. R. D. Hicks, Cambridge: Harvard University Press, 1925, reprint 1995, VI.96-98). He adds that myriad stories were told about “the female philosopher”.

Diogenes Laertius claims that Hipparchia was so eager to marry Crates that she threatened to kill herself rather than live in any other way. (DL VI.96.7-8) Although Crates was by this time an old man, she rejected her other youthful suitors because she had fallen in love with “both the discourses and the life” of Crates, and was said to be “captured” by the logos of the Cynics. (VI.96.1 and 4-5) At the request of her parents, Crates tried to talk Hipparchia out of the marriage. (VI.96.9-10) When he failed in this task, he disrobed in front of her and said, “this is the groom, and these are his possessions; choose accordingly.” (VI.96.11-15) This tale should be taken with the proverbial grain of salt, given that Diogenes Laertius is writing centuries later, and that his account may include ‘apt’ stories that are technically false, but which arose and were transmitted because they were taken to be revealing illustrations. Given the interest and controversy generated by the female Cynic, it is easy to imagine stories of this kind being told about her. In any event, we know that Hipparchia chose to marry Crates and share his philosophical pursuits.

Hipparchia’s decision to become a Cynic was surprising, on account of both the Cynic disregard for conventional institutions and the extreme hardship of the lifestyle. Cynics attempted to live “according to nature” by rejecting artificial social conventions and refusing all luxuries, including any items not absolutely required for survival. They gave up their possessions, carrying what few they needed in a wallet. They wore only a simple mantle or cloak, and begged to obtain their basic needs. Crates’ willingness to marry was also unusual, considering that marriage is a social institution of the sort normally rejected by Cynics, and earlier Cynics like Diogenes and Antisthenes had maintained that the philosopher would never marry. A few centuries later, while arguing that marriage is generally unsuitable for the Cynic (or Stoic) philosopher, Epictetus allows for exceptions specifically because of the philosophical marriage of Hipparchia and Crates (Epictetus, Discourses, tr. C. H. Oldfather, Cambridge: Harvard University Press, 1928). By marrying a Cynic and becoming one herself, Hipparchia thus performed the characteristically Cynic feat of “changing the currency,” both of her culture and the Cynic tradition itself. The Cynic motto of “change the currency” (parakrattein to nomismata), first adopted by Diogenes of Sinope, implied rejection of the prevailing social and political order in favor of an unconventional, self-sufficient life as a “citizen of the universe” (kosmopolites). (It had been said, perhaps falsely, that Diogenes or his father had been driven from Sinope when found guilty of literally defacing the coins and changing their values, but it is also likely that the counterfeiting story arose after he adopted the metaphorical motto.)

Some later authors, such as Apuleius and Augustine, report that Hipparchia and Crates consummated their marriage by having sex on a public porch. Whether the tale is accurate or not, they were known to conduct themselves in all respects according to the Cynic value of anaideia, or shamelessness. The story of Hipparchia’s Cynic marriage quickly became the premiere example of that virtue, which is based on the Cynic belief that any actions virtuous enough to be done in private are no less virtuous when performed in public. As exemplars of anaideia, Hipparchia and Crates influenced their pupil Zeno of Citium, the founder of Stoicism. His Republic advocates the equality of the sexes, co-ed public exercise and training, and a version of “free love” wherein those wishing to have sex will simply satisfy their desires wherever they happen to be at the moment, even in public. Stoic ethics were generally influenced by Cynic values, such as self-sufficiency, the importance of practice in achieving virtue, and the rejection of the conventional values attached to pleasure and pain. The Stoics also advocated living according to nature in the sense of conforming one’s own reason to the dictates of the rational natural law.

Eratosthenes reports that Hipparchia and Crates had a son named Pasicles, and Diogenes Laertius’ account of the life of Crates also refers to their son. The Cynic Letters, a collection of pseudographic letters attributed to various Cynic figures and probably written by a several different authors a few centuries after Hipparchia lived, mention that she bore and raised children according to her Cynic values. Whatever the actual details of her practices might have been, her example influenced later Cynic attitudes towards pregnancy and child-rearing. For example, one of the letters attributed to Crates suggests that Hipparchia has given birth “without trouble” because she believes that her usual “labor is the cause of not laboring” during the birth itself (33.14-15). The birth was easier because she continued to work “like an athlete” during her pregnancy (33.17), which the author notes is unusual. The letters also mention Hipparchia’s use of a tortoise shell cradle, cold water for the baby’s bath, and continued adherence to an austere diet.

Hipparchia is also famous for an exchange with Theodorus the Atheist, a Cyrenaic philosopher, who had challenged the legitimacy of her presence at a symposium. She was reported to have regularly attended such functions with Crates. According to Diogenes Laertius, Theodorus quoted a verse from Euripides’ Bacchae, asking if this is she “abandoning the warp and woof and the shuttle” (like Agave returning home from the “hunt” with the head of her son Pentheus). (VI.98.2) Hipparchia affirms that yes, it is she, but asks Theodorus whether she has had the wrong understanding of herself, if she spent her time on education rather than wasting it on the loom. (VI.98.3-6) In the ancient Greek cultural context, women of her social class typically would have been occupied with weaving and organizing the household servants, and Hipparchia’s rejection of the conventional expectations for women was quite radical.

Diogenes Laertius also reports the syllogism that Hipparchia used to put down Theodorus during the same symposium mentioned above: Premise 1: “Any action which would not be called wrong if done by Theodorus, would not be called wrong if done by Hipparchia.” Premise 2: “Now Theodorus does no wrong when he strikes himself”. Conclusion: “therefore neither does Hipparchia do wrong when she strikes Theodorus.” (VI.97.6-9) This is a classic example of the Cynic rhetorical trope of spoudogeloion: a deliberately comic syllogism which nevertheless makes a serious point. Diogenes Laertius says that since Theodorus “had no reply wherefore to meet the argument,” he “tried to strip her of her cloak. But Hipparchia showed no sign of alarm or of the perturbation natural in a woman” (VI.97), as befitted her Cynic commitment to anaideia.

2. References and Further Reading

Diogenes Laertius, Lives of Eminent Philosophers, Vol. II, tr. R. D. Hicks (Cambridge: Harvard University Press) 1925 (reprint 1995), VI.96-98.

Abraham J. Malherbe, The Cynic Epistles (Atlanta: Scholar’s Press) 1997, 78-83.

Discussions in the modern period of Hipparchia’s encounter with Theodorus are found in Bayle’s Historical and Critical Dictionary and in Menage’s History of Women Philosophers. See Pierre Bayle, Historical and Critical Dictionary: Selections, ed. Richard H. Popkin and Craig Bush (Indianapolis: Hackett) 1991, 102-103, and Gilles Menage, The History of Women Philosophers, tr. Beatrice H. Zedler (Lanham, MD: University Press of America, 1984), 103.

For further information about Cynic philosophy, see Diogenes Laertius Book VI, as well as D. R. Dudley, A History of Cynicism: From Diogenes to the Sixth Century AD (London) 1937 (reprint Ares Publishing, 1980), and R. Bracht Branham and Marie Odile Goulet-Caze, eds., The Cynics: The Cynic Movement in Antiquity and its Legacy (Berkeley: University of California Press) 2000.

Author Information

Laura Grams
Email: lgrams@mail.unomaha.edu
University of Nebraska at Omaha
U. S. A.

Heraclitus (fl. c. 500 B.C.E.)

HeraclitusA Greek philosopher of the late 6th century BCE, Heraclitus criticizes his predecessors and contemporaries for their failure to see the unity in experience. He claims to announce an everlasting Word (Logos) according to which all things are one, in some sense. Opposites are necessary for life, but they are unified in a system of balanced exchanges. The world itself consists of a law-like interchange of elements, symbolized by fire. Thus the world is not to be identified with any particular substance, but rather with an ongoing process governed by a law of change. The underlying law of nature also manifests itself as a moral law for human beings. Heraclitus is the first Western philosopher to go beyond physical theory in search of metaphysical foundations and moral applications.

Table of Contents

  1. Life and Times
  2. Theory of Knowledge
  3. The Doctrine of Flux and the Unity of Opposites
  4. Criticism of Ionian Philosophy
  5. Physical Theory
  6. Moral and Political Theory
  7. Accomplishments and Influence
  8. References and Further Reading

1. Life and Times

Heraclitus lived in Ephesus, an important city on the Ionian coast of Asia Minor, not far from Miletus, the birthplace of philosophy. We know nothing about his life other than what can be gleaned from his own statements, for all ancient biographies of him consist of nothing more than inferences or imaginary constructions based on his sayings. Although Plato thought he wrote after Parmenides, it is more likely he wrote before Parmenides. For he criticizes by name important thinkers and writers with whom he disagrees, and he does not mention Parmenides. On the other hand, Parmenides in his poem arguably echoes the words of Heraclitus.  Heraclitus criticizes the mythographers Homer and Hesiod, as well as the philosophers Pythagoras and Xenophanes and the historian Hecataeus. All of these figures flourished in the 6th century BCE or earlier, suggesting a date for Heraclitus in the late 6th century. Although he does not speak in detail of his political views in the extant fragments, Heraclitus seems to reflect an aristocratic disdain for the masses and favor the rule of a few wise men, for instance when he recommends that his fellow-citizens hang themselves because they have banished their most prominent leader (DK22B121 in the Diels-Kranz collection of Presocratic sources).

2. Theory of Knowledge

Heraclitus sees the great majority of human beings as lacking understanding:

Of this Word’s being forever do men prove to be uncomprehending, both before they hear and once they have heard it. For although all things happen according to this Word they are like the unexperienced experiencing words and deeds such as I explain when I distinguish each thing according to its nature and declare how it is. Other men are unaware of what they do when they are awake just as they are forgetful of what they do when they are asleep. (DK22B1)

Most people sleep-walk through life, not understanding what is going on about them. Yet experience of words and deeds can enlighten those who are receptive to their meaning. (The opening sentence is ambiguous: does the ‘forever’ go with the preceding or the following words? Heraclitus prefigures the semantic complexity of his message.)

On the one hand, Heraclitus commends sense experience: “The things of which there is sight, hearing, experience, I prefer” (DK22B55). On the other hand, “Poor witnesses for men are their eyes and ears if they have barbarian souls” (DK22B107). A barbarian is one who does not speak the Greek language. Thus while sense experience seems necessary for understanding, if we do not know the right language, we cannot interpret the information the senses provide. Heraclitus does not give a detailed and systematic account of the respective roles of experience and reason in knowledge. But we can learn something from his manner of expression.

Describing the practice of religious prophets, Heraclitus says, “The Lord whose oracle is at Delphi neither reveals nor conceals, but gives a sign” (DK22B93). Similarly, Heraclitus does not reveal or conceal, but produces complex expressions that have encoded in them multiple messages for those who can interpret them. He uses puns, paradoxes, antitheses, parallels, and various rhetorical and literary devices to construct expressions that have meanings beyond the obvious. This practice, together with his emphasis on the Word (Logos) as an ordering principle of the world, suggests that he sees his own expressions as imitations of the world with its structural and semantic complexity. To read Heraclitus the reader must solve verbal puzzles, and to learn to solve these puzzles is to learn to read the signs of the world. Heraclitus stresses the inductive rather than the deductive method of grasping the world, a world that is rationally structured, if we can but discern its shape.

For those who can discern it, the Word has an overriding message to impart: “Listening not to me but to the Word it is wise to agree that all things are one” (DK22B50). It is perhaps Heraclitus’s chief project to explain in what sense all things are one.

3. The Doctrine of Flux and the Unity of Opposites

According to both Plato and Aristotle, Heraclitus held extreme views that led to logical incoherence. For he held that (1) everything is constantly changing and (2) opposite things are identical, so that (3) everything is and is not at the same time. In other words, Universal Flux and the Identity of Opposites entail a denial of the Law of Non-Contradiction. Plato indicates the source of the flux doctrine: “Heraclitus, I believe, says that all things go and nothing stays, and comparing existents to the flow of a river, he says you could not step twice into the same river” (Cratylus 402a = DK22A6).

What Heraclitus actually says is the following:

On those stepping into rivers staying the same other and other waters flow. (DK22B12)

There is an antithesis between ‘same’ and ‘other.’ The sentence says that different waters flow in rivers staying the same. In other words, though the waters are always changing, the rivers stay the same. Indeed, it must be precisely because the waters are always changing that there are rivers at all, rather than lakes or ponds. The message is that rivers can stay the same over time even though, or indeed because, the waters change. The point, then, is not that everything is changing, but that the fact that some things change makes possible the continued existence of other things. Perhaps more generally, the change in elements or constituents supports the constancy of higher-level structures.As for the alleged doctrine of the Identity of Opposites, Heraclitus does believe in some kind of unity of opposites. For instance, “God is day night, winter summer, war peace, satiety hunger . . .” (DK22B67). But if we look closer, we see that the unity in question is not identity:

As the same thing in us is living and dead, waking and sleeping, young and old. For these things having changed around are those, and conversely those having changed around are these. (DK22B88)

The second sentence in B88 gives the explanation for the first. If F is the same as G because F turns into G, then the two are not identical. And Heraclitus insists on the common-sense truth of change: “Cold things warm up, the hot cools off, wet becomes dry, dry becomes wet” (DK22B126). This sort of mutual change presupposes the non-identity of the terms. What Heraclitus wishes to maintain is not the identity of opposites but the fact that they replace each other in a series of transformations: they are interchangeable or transformationally equivalent.

Thus, Heraclitus does not hold Universal Flux, but recognizes a lawlike flux of elements; and he does not hold the Identity of Opposites, but the Transformational Equivalence of Opposites. The views that he does hold do not, jointly or separately, entail a denial of the Law of Non-Contradiction. Heraclitus does, to be sure, make paradoxical statements, but his views are no more self-contradictory than are the paradoxical claims of Socrates. They are, presumably, meant to wake us up from our dogmatic slumbers.

4. Criticism of Ionian Philosophy

Heraclitus’ theory can be understood as a response to the philosophy of his Ionian predecessors. The philosophers of the city of Miletus (near Ephesus), Thales, Anaximander, and Anaximenes, believed some original material turns into all other things. The world as we know it is the orderly articulation of different stuffs produced out of the original stuff. For the Milesians, to explain the world and its phenomena was just to show how everything came from the original stuff, such as Thales’ water or Anaximenes’ air.

Heraclitus seems to follow this pattern of explanation when he refers to the world as “everliving fire” (DK22B30, quoted in full in next section) and makes statements such as “Thunderbolt steers all things,” alluding to the directive power of fire (DK22B64). But fire is a strange stuff to make the origin of all things, for it is the most inconstant and changeable. It is, indeed, a symbol of change and process. Heraclitus observes,

All things are an exchange for fire, and fire for all things, as goods for gold and gold for goods. (DK22B90)

We can measure all things against fire as a standard; there is an equivalence between all things and gold, but all things are not identical to gold. Similarly, fire provides a standard of value for other stuffs, but it is not identical to them. Fire plays an important role in Heraclitus’ system, but it is not the unique source of all things, because all stuffs are equivalent.

Ultimately, fire may be more important as a symbol than as a stuff. Fire is constantly changing-but so is every other stuff. One thing is transformed into another in a cycle of changes. What is constant is not some stuff, but the overall process of change itself. There is a constant law of transformations, which is, perhaps, to be identified with the Logos. Heraclitus may be saying that the Milesians correctly saw that one stuff turns into another in a series, but they incorrectly inferred from this that some one stuff is the source of everything else. But if A is the source of B and B of C, and C turns back into B and then A, then B is likewise the source of A and C, and C is the source of A and B. There is no particular reason to promote one stuff at the expense of the others. What is important about the stuffs is that they change into others. The one constant in the whole process is the law of change by which there is an order and sequence to the changes. If this is what Heraclitus has in mind, he goes beyond the physical theory of his early predecessors to arrive at something like a process philosophy with a sophisticated understanding of metaphysics.

5. Physical Theory

Heraclitus’ criticisms and metaphysical speculations are grounded in a physical theory. He expresses the principles of his cosmology in a single sentence:

This world-order, the same of all, no god nor man did create, but it ever was and is and will be: everliving fire, kindling in measures and being quenched in measures. (DK22B30)

This passage contains the earliest extant philosophical use of the word kosmos, “world-order,” denoting the organized world in which we live, with earth, sea, atmosphere, and heavens. While ancient sources understand Heraclitus as saying the world comes to be and then perishes in a fiery holocaust, only to be born again (DK22A10), the present passage seems to contradict this reading: the world itself does not have a beginning or end. Parts of it are being consumed by fire at any given time, but the whole remains. Almost all other early cosmologists before and after Heraclitus explained the existence of the ordered world by recounting its origin out of elemental stuffs. Some also predicted the extinction of the world. But Heraclitus, the philosopher of flux, believes that as the stuffs turn into one another, the world itself remains stable. How can that be?

Heraclitus explains the order and proportion in which the stuffs change:

The turnings of fire: first sea, and of sea, half is earth, half firewind (prêstêr: some sort of fiery meteorological phenomenon). (DK22B31a)

Sea is liquefied and measured into the same proportion as it had before it became earth. (DK22B31b)

Fire is transformed into water (“sea”) of which half turns back into fire (“firewind”) and half into earth. Thus there is a sequence of stuffs: fire, water, earth, which are interconnected. When earth turns back into sea, it occupies the same volume as it had before it turned into earth. Thus we can recognize a primitive law of conservation-not precisely conservation of matter, at least the identity of the matter is not conserved, nor of mass, but at least an equivalence of matter is maintained. Although the fragments do not give detailed information about Heraclitus’ physics, it seems likely that the amount of water that evaporates each day is balanced by the amount of stuff that precipitates as water, and so on, so that a balance of stuffs is maintained even though portions of stuff are constantly changing their identity.

For Heraclitus, flux and opposition are necessary for life. Aristotle reports,

Heraclitus criticizes the poet who said, ‘would that strife might perish from among gods and men’ [Homer Iliad 18.107]’ for there would not be harmony without high and low notes, nor living things without female and male, which are opposites. (DK22A22)

Heraclitus views strife or conflict as maintaining the world:

We must recognize that war is common and strife is justice, and all things happen according to strife and necessity. (DK22B80)

War is the father of all and king of all, who manifested some as gods and some as men, who made some slaves and some freemen. (DK22B53)

In a tacit criticism of Anaximander, Heraclitus rejects the view that cosmic justice is designed to punish one opposite for its transgressions against another. If it were not for the constant conflict of opposites, there would be no alternations of day and night, hot and cold, summer and winter, even life and death. Indeed, if some things did not die, others would not be born. Conflict does not interfere with life, but rather is a precondition of life.

As we have seen, for Heraclitus fire changes into water and then into earth; earth changes into water and then into fire. At the level of either cosmic bodies (in which sea turns into fiery storms on the one hand and earth on the other) or domestic activities (in which, for instance, water boils out of a pot), there is constant flux among opposites. To maintain the balance of the world, we must posit an equal and opposite reaction to every change. Heraclitus observes,

The road up and down is one and the same. (DK22B60)

Here again we find a unity of opposites, but no contradiction. One road is used to pursue two different routes. Daily traffic carries some travelers out of the city, while it brings some back in. The image applies equally to physical theory: as earth changes to fire, fire changes to earth. And it may apply to psychology and other domains as well.

6. Moral and Political Theory

There has been some debate as to whether Heraclitus is chiefly a philosopher of nature (a view championed by G. S. Kirk) or a philosopher concerned with the human condition (C. H. Kahn). The opening words of Heraclitus’ book (DK22B1, quoted above) seem to indicate that he will expound the nature of things in a way that will have profound implications for human life. In other words, he seems to see the theory of nature and the human condition as intimately connected. In fact, recently discovered papyri have shown that Heraclitus is concerned with technical questions of astronomy, not only with general theory. There is no reason, then, to think of him as solely a humanist or moral philosopher. On the other hand, it would be wrong to think of him as a straightforward natural philosopher in the manner of other Ionian philosophers, for he is deeply concerned with the moral implications of physical theory.

Heraclitus views the soul as fiery in nature:

To souls it is death to become water, to water death to become earth, but from earth water is born, and from water soul. (DK22B36)

Soul is generated out of other substances just as fire is. But it has a limitless dimension:

If you went in search of it, you would not find the boundaries of the soul, though you traveled every road-so deep is its measure [logos]. (DK22B45)

Drunkenness damages the soul by causing it to be moist, while a virtuous life keeps the soul dry and intelligent. Souls seem to be able to survive death and to fare according to their character.

The laws of a city-state are an important principle of order:

The people [of a city] should fight for their laws as they would for their city wall. (DK22B44)

Speaking with sense we must rely on a common sense of all things, as a city relies on its wall, and much more reliably. For all human laws are nourished by the one divine law. For it prevails as far as it will and suffices for all and overflows. (DK22B114)

The laws provide a defense for a city and its way of life. But the laws are not merely of local interest: they derive their force from a divine law. Here we see the notion of a law of nature that informs human society as well as nature. There is a human cosmos that like the natural cosmos reflects an underlying order. The laws by which human societies are governed are not mere conventions, but are grounded in the ultimate nature of things. One cannot break a human law with impunity. The notion of a law-like order in nature has antecedents in the theory of Anaximander, and the notion of an inherent moral law influences the Stoics in the 3rd century BCE.

Heraclitus recognizes a divine unity behind the cosmos, one that is difficult to identify and perhaps impossible to separate from the processes of the cosmos:

The wise, being one thing only, would and would not take the name of Zeus [or: Life]. (DK22B32)

God is day night, winter summer, war peace, satiety hunger, and it alters just as when it is mixed with incense is named according to the aroma of each. (DK22B67)

Evidently the world either is god, or is a manifestation of the activity of god, which is somehow to be identified with the underlying order of things. God can be thought of as fire, but fire, as we have seen, is constantly changing, symbolic of transformation and process. Divinity is present in the world, but not as a conventional anthropomorphic being such as the Greeks worshiped.

7. Accomplishments and Influence

Heraclitus goes beyond the natural philosophy of the other Ionian philosophers to make profound criticisms and develop far-reaching implications of those criticisms. He suggests the first metaphysical foundation for philosophical speculation, anticipating process philosophy. And he makes human values a central concern of philosophy for the first time. His aphoristic manner of expression and his manner of propounding general truths through concrete examples remained unique.

Heraclitus’s paradoxical exposition may have spurred Parmenides’ rejection of Ionian philosophy. Empedocles and some medical writers echoed Heraclitean themes of alteration and ongoing process, while Democritus imitated his ethical observations. Influenced by the teachings of the Heraclitean Cratylus, Plato saw the sensible world as exemplifying a Heraclitean flux. Plato and Aristotle both criticized Heraclitus for a radical theory that led to a denial of the Law of Non-Contradiction. The Stoics adopted Heraclitus’s physical principles as the basis for their theories.

8. References and Further Reading

  • Barnes, Jonathan. The Presocratic Philosophers. London: Routledge & Kegan Paul, 1982, vol. 1, ch. 4.
    • Uses modern arguments to defend the traditional view, going back to Plato and Aristotle, that Heraclitus’ commitment to the flux doctrine and the identity of opposites results in an incoherent theory.
  • Graham, Daniel W. “Heraclitus’ Criticism of Ionian Philosophy.” Oxford Studies in Ancient Philosophy 15 (1997): 1-50.
    • Defends Heraclitus against the traditional view held by Barnes and others, and argues that his theory can be understood as a coherent criticism of earlier Ionian philosophy.
  • Hussey, Edward. “Epistemology and Meaning in Heraclitus.” Language and Logos. Ed. M. Schofield and M. C. Nussbaum. Cambridge: Cambridge UP, 1982. 33-59.
    • Studies Heraclitus’ theory of knowledge.
  • Kahn, Charles H. The Art and Thought of Heraclitus. Cambridge: Cambridge UP, 1979.
    • An important reassessment of Heraclitus that recognizes the literary complexity of his language as a key to interpreting his message. Focuses on Heraclitus as a philosopher of the human condition.
  • Kirk, G. S. Heraclitus: The Cosmic Fragments. Cambridge: Cambridge UP, 1954.
    • Focuses on Heraclitus as a natural philosopher.
  • Marcovich, Miroslav. Heraclitus: Greek Text with a Short Commentary. Merida, Venezuela: U. of the Andes, 1967.
    • A very thorough edition of Heraclitus, which effectively sorts out fragments from reports and reactions.
  • Mourelatos, Alexander P. D. “Heraclitus, Parmenides, and the Naive Metaphysics of Things.” Exegesis and Argument. Ed. E. N. Lee et al. Assen: Van Gorcum, 1973. 16-48.
    • Examines Heraclitus’ response to the pre-philosophical understanding of things.
  • Nussbaum, Martha C. “Psychê in Heraclitus.” Phronesis 17 (1972): 1-16, 153-70.
    • Good treatment of Heraclitus’ conception of soul.
  • Robinson, T. M. Heraclitus: Fragments. Toronto: U of Toronto P, 1987.
    • Good brief edition with commentary.
  • Vlastos, Gregory. “On Heraclitus.” American Journal of Philology 76 (1955): 337-68. Reprinted in G. Vlastos, Studies in Greek Philosophy, vol. 1, Princeton: Princeton U. Pr., 1995.
    • Vigorous defense of the traditional interpretation of Heraclitus against Kirk and others.

Author Information

Daniel W. Graham
Email: daniel_graham@byu.edu
Brigham Young University
U. S. A.

Human Rights

Human rights are certain moral guarantees. This article examines the philosophical basis and content of the doctrine of human rights. The analysis consists of five sections and a conclusion. Section one assesses the contemporary significance of human rights, and it argues that the doctrine of human rights has become the dominant moral doctrine for evaluating the moral status of the contemporary geo-political order. Section two proceeds to chart the historical development of the concept of human rights, beginning with a discussion of the earliest philosophical origins of the philosophical bases of human rights and culminating in some of most recent developments in the codification of human rights. Section three considers the philosophical concept of a human right and analyses the formal and substantive distinctions philosophers have drawn between various forms and categories of rights. Section four addresses the question of how philosophers have sought to justify the claims of human rights and specifically charts the arguments presented by the two presently dominant approaches in this field: interest theory and will theory. Section five then proceeds to discuss some of the main criticisms currently leveled at the doctrine of human rights and highlights some of the main arguments of those who have challenged the universalist and objectivist bases of human rights. Finally, a brief conclusion is presented, summarising the main themes addressed.

Table of Contents

  1. Introduction: The Contemporary Significance of Human Rights
  2. Historical Origins and Development of the Theory and Practice of Human Rights
  3. Philosophical Analysis of the Concept of Human Rights
    1. Moral vs. Legal Rights
    2. Claim Rights & Liberty Rights
    3. Substantive Categories of Human Rights
    4. Scope of Human Rights Duties
  4. Philosophical Justifications of Human Rights
    1. Do Human Rights Require Philosophical Justification?
    2. The interests Theory Approach
    3. The Will Theory Approach
  5. Philosophical Criticisms of Human Rights
    1. Moral Relativism
    2. Epistemological Criticisms of Human Rights
  6. Conclusion
  7. References and Further Reading

1. Introduction: The Contemporary Significance of Human Rights

Human rights have been defined as

basic moral guarantees that people in all countries and cultures allegedly have simply because they are people. Calling these guarantees “rights” suggests that they attach to particular individuals who can invoke them, that they are of high priority, and that compliance with them is mandatory rather than discretionary. Human rights are frequently held to be universal in the sense that all people have and should enjoy them, and to be independent in the sense that they exist and are available as standards of justification and criticism whether or not they are recognized and implemented by the legal system or officials of a country. (Nickel, 1992:561-2)

The moral doctrine of human rights aims at identifying the fundamental prerequisites for each human being leading a minimally good life. Human rights aim to identify both the necessary negative and positive prerequisites for leading a minimally good life, such as rights against torture and rights to health care. This aspiration has been enshrined in various declarations and legal conventions issued during the past fifty years, initiated by the Universal Declaration of Human Rights (1948) and perpetuated by, most importantly, the European Convention on Human Rights (1954) and the International Covenant of Civil and Political Rights (1966). Together these three documents form the centrepiece of a moral doctrine that many consider to be capable of providing the contemporary geo-political order with what amounts to an international bill of rights. However, the doctrine of human rights does not aim to be a fully comprehensive moral doctrine. An appeal to human rights does not provide us with a fully comprehensive account of morality per se. Human rights do not, for example, provide us with criteria for answering such questions as whether telling lies is inherently immoral, or what the extent of one’s moral obligations to friends and lovers ought to be? What human rights do primarily aim to identify is the basis for determining the shape, content, and scope of fundamental, public moral norms. As James Nickel states, human rights aim to secure for individuals the necessary conditions for leading a minimally good life. Public authorities, both national and international, are identified as typically best placed to secure these conditions and so, the doctrine of human rights has become, for many, a first port of moral call for determining the basic moral guarantees all of us have a right to expect, both of one another but also, primarily, of those national and international institutions capable of directly affecting our most important interests. The doctrine of human rights aspires to provide the contemporary, allegedly post-ideological, geo-political order with a common framework for determining the basic economic, political, and social conditions required for all individuals to lead a minimally good life. While the practical efficacy of promoting and protecting human rights is significantly aided by individual nation-states’ legally recognising the doctrine, the ultimate validity of human rights is characteristically thought of as not conditional upon such recognition. The moral justification of human rights is thought to precede considerations of strict national sovereignty. An underlying aspiration of the doctrine of human rights is to provide a set of legitimate criteria to which all nation-states should adhere. Appeals to national sovereignty should not provide a legitimate means for nation-states to permanently opt out of their fundamental human rights-based commitments. Thus, the doctrine of human rights is ideally placed to provide individuals with a powerful means for morally auditing the legitimacy of those contemporary national and international forms of political and economic authority which confront us and which claim jurisdiction over us. This is no small measure of the contemporary moral and political significance of the doctrine of human rights. For many of its most strident supporters, the doctrine of human rights aims to provide a fundamentally legitimate moral basis for regulating the contemporary geo-political order.

2. Historical Origins and Development of the Theory and Practice of Human Rights

The doctrine of human rights rests upon a particularly fundamental philosophical claim: that there exists a rationally identifiable moral order, an order whose legitimacy precedes contingent social and historical conditions and applies to all human beings everywhere and at all times. On this view, moral beliefs and concepts are capable of being objectively validated as fundamentally and universally true. The contemporary doctrine of human rights is one of a number of universalist moral perspectives. The origins and development of the theory of human rights is inextricably tied to the development of moral universalism. The history of the philosophical development of human rights is punctuated by a number of specific moral doctrines which, though not themselves full and adequate expressions of human rights, have nevertheless provided a number of philosophical prerequisites for the contemporary doctrine. These include a view of morality and justice as emanating from some pre-social domain, the identification of which provides the basis for distinguishing between ‘true’ and merely ‘conventional’ moral principles and beliefs. The essential prerequisites for a defence of human rights also include a conception of the individual as the bearer of certain ‘natural’ rights and a particular view of the inherent and equal moral worth of each rational individual. I shall discuss each in turn.

Human rights rest upon moral universalism and the belief in the existence of a truly universal moral community comprising all human beings. Moral universalism posits the existence of rationally identifiable trans-cultural and trans-historical moral truths. The origins of moral universalism within Europe are typically associated with the writings of Aristotle and the Stoics. Thus, in his Nicomachean Ethics, Aristotle unambiguously expounds an argument in support of the existence of a natural moral order. This natural order ought to provide the basis for all truly rational systems of justice. An appeal to the natural order provides a set of comprehensive and potentially universal criteria for evaluating the legitimacy of actual ‘man-made’ legal systems. In distinguishing between ‘natural justice’ and ‘legal justice’, Aristotle writes, ‘the natural is that which has the same validity everywhere and does not depend upon acceptance.’ (Nicomachean Ethics, 189) Thus, the criteria for determining a truly rational system of justice pre-exist social and historical conventions. ‘Natural justice’ pre-exists specific social and political configurations. The means for determining the form and content of natural justice is the exercise of reason free from the distorting effects of mere prejudice or desire. This basic idea was similarly expressed by the Roman Stoics, such as Cicero and Seneca, who argued that morality originated in the rational will of God and the existence of a cosmic city from which one could discern a natural, moral law whose authority transcended all local legal codes. The Stoics’ argued that this ethically universal code imposed upon all of us a duty to obey the will of god. The Stoics thereby posited the existence of a universal moral community effected through our shared relationship with god. The belief in the existence of a universal moral community was maintained in Europe by Christianity over the ensuing centuries. While some have discerned intimations towards the notion of rights in the writings of Aristotle, the Stoics, and Christian theologians, a concept of rights approximating that of the contemporary idea of human rights most clearly emerges during the 17th. And 18th. Centuries in Europe and the so-called doctrine of natural law.

The basis of the doctrine of natural law is the belief in the existence of a natural moral code based upon the identification of certain fundamental and objectively verifiable human goods. Our enjoyment of these basic goods is to be secured by our possession of equally fundamental and objectively verifiable natural rights. Natural law was deemed to pre-exist actual social and political systems. Natural rights were thereby similarly presented as rights individuals possessed independently of society or polity. Natural rights were thereby presented as ultimately valid irrespective of whether they had achieved the recognition of any given political ruler or assembly. The quintessential exponent of this position was the 17th. Century philosopher John Locke and, in particular, the argument he outlined in his Two Treatises of Government (1688). At the centre of Locke’s argument is the claim that individuals possess natural rights, independently of the political recognition granted them by the state. These natural rights are possessed independently of, and prior to, the formation of any political community. Locke argued that natural rights flowed from natural law. Natural law originated from God. Accurately discerning the will of God provided us with an ultimately authoritative moral code. At root, each of us owes a duty of self-preservation to God. In order to successfully discharge this duty of self-preservation each individual had to be free from threats to life and liberty, whilst also requiring what Locke presented as the basic, positive means for self-preservation: personal property. Our duty of self-preservation to god entailed the necessary existence of basic natural rights to life, liberty, and property. Locke proceeded to argue that the principal purpose of the investiture of political authority in a sovereign state was the provision and protection of individuals’ basic natural rights. For Locke, the protection and promotion of individuals’ natural rights was the sole justification for the creation of government. The natural rights to life, liberty, and property set clear limits to the authority and jurisdiction of the State. States were presented as existing to serve the interests, the natural rights, of the people, and not of a Monarch or a ruling cadre. Locke went so far as to argue that individuals are morally justified in taking up arms against their government should it systematically and deliberately fail in its duty to secure individuals’ possession of natural rights.

Analyses of the historical predecessors of the contemporary theory of human rights typically accord a high degree of importance to Locke’s contribution. Certainly, Locke provided the precedent of establishing legitimate political authority upon a rights foundation. This is an undeniably essential component of human rights. However, the philosophically adequate completion of theoretical basis of human rights requires an account of moral reasoning, that is both consistent with the concept of rights, but which does not necessarily require an appeal to the authority of some super-human entity in justifying human beings’ claims to certain, fundamental rights. The 18th. Century German philosopher, Immanuel Kant provides such an account.

Many of the central themes first expressed within Kant’s moral philosophy remain highly prominent in contemporary philosophical justifications of human rights. Foremost amongst these are the ideals of equality and the moral autonomy of rational human beings. Kant bestows upon contemporary human rights’ theory the ideal of a potentially universal community of rational individuals autonomously determining the moral principles for securing the conditions for equality and autonomy. Kant provides a means for justifying human rights as the basis for self-determination grounded within the authority of human reason. Kant’s moral philosophy is based upon an appeal to the formal principles of ethics, rather than, for example, an appeal to a concept of substantive human goods. For Kant, the determination of any such goods can only proceed from a correct determination of the formal properties of human reason and thus do not provide the ultimate means for determining the correct ends, or object, of human reason. Kant’s moral philosophy begins with an attempt to correctly identify those principles of reasoning that can be applied equally to all rational persons, irrespective of their own specific desires or partial interests. In this way, Kant attaches a condition of universality to the correct identification of moral principles. For him, the basis of moral reasoning must rest upon a condition that all rational individuals are bound to assent to. Doing the right thing is thus not determined by acting in pursuit of one’s own interests or desires, but acting in accordance with a maxim which all rational individuals are bound to accept. Kant terms this the categorical imperative, which he formulates in the following terms, ‘act only on that maxim through which you can at the same time will that it should become a universal law.’ (1948:84). Kant argues that this basic condition of universality in determining the moral principles for governing human relations is a necessary expression of the moral autonomy and fundamental equality of all rational individuals. The categorical imperative is self-imposed by morally autonomous and formally equal rational persons. It provides the basis for determining the scope and form of those laws which morally autonomous and equally rational individuals will institute in order to secure these very same conditions. For Kant, the capacity for the exercise of reason is the distinguishing characteristic of humanity and the basis for justifying human dignity. As the distinguishing characteristic of humanity, formulating the principles of the exercise of reason must necessarily satisfy a test of universality; they must be capable of being universally recognized by all equally rational agents. Hence, Kant’s formulation of the categorical imperative. Kant’s moral philosophy is notoriously abstract and resists easy comprehension. Though often overlooked in accounts of the historical development of human rights, his contribution to human rights has been profound. Kant provides a formulation of fundamental moral principles that, though exceedingly formal and abstract, are based upon the twin ideals of equality and moral autonomy. Human rights are rights we give to ourselves, so to speak, as autonomous and formally equal beings. For Kant, any such rights originate in the formal properties of human reason, and not the will of some super-human being.

The philosophical ideas defended by the likes of Locke and Kant have come to be associated with the general Enlightenment project initiated during the 17th. and 18th. Centuries, the effects of which were to extend across the globe and over ensuing centuries. Ideals such as natural rights, moral autonomy, human dignity and equality provided a normative bedrock for attempts at re-constituting political systems, for overthrowing formerly despotic regimes and seeking to replace them with forms of political authority capable of protecting and promoting these new emancipatory ideals. These ideals effected significant, even revolutionary, political upheavals throughout the 18th. Century, enshrined in such documents as the United States’ Declaration of Independence and the French National Assembly’s Declaration of the Rights of Man and Citizen. Similarly, the concept of individual rights continued to resound throughout the 19th. Century exemplified by Mary Wollstencraft’s Vindication of the Rights of Women and other political movements to extend political suffrage to sections of society who had been denied the possession of political and civil rights. The concept of rights had become a vehicle for effecting political change. Though one could argue that the conceptual prerequisites for the defence of human rights had long been in place, a full Declaration of the doctrine of human rights only finally occurred during the 20th. Century and only in response to the most atrocious violations of human rights, exemplified by the Holocaust. The Universal Declaration of Human Rights (UDHR) was adopted by the UN General Assembly on 10th. December 1948 and was explicitly motivated to prevent the future occurrence of any similar atrocities. The Declaration itself goes far beyond any mere attempt to reassert all individuals’ possession of the right to life as a fundamental and inalienable human right. The UDHR consists of a Preamble and 30 articles which separately identify such things as the right not to be tortured (article 5), a right to asylum (article 14), a right to own property (article 17), and a right to an adequate standard of living (article 25) as being fundamental human rights. As I noted earlier, the UDHR has been further supplemented by such documents as the European Convention for the Protection of Human Rights and Fundamental Freedoms (1953) and the International Covenant on Economic, Social and Cultural Rights (1966). The specific aspirations contained within these three documents have themselves been reinforced by innumerable other Declarations and Conventions. Taken together these various Declarations, conventions and covenants comprise the contemporary human rights doctrine and embody both the belief in the existence of a universally valid moral order and a belief in all human beings’ possession of fundamental and equal moral status, enshrined within the concept of human rights. It is important to note, however, that the contemporary doctrine of human rights, whilst deeply indebted to the concept of natural rights, is not a mere expression of that concept but actually goes beyond it in some highly significant respects. James Nickel ( 1987: 8-10) identifies three specific ways in which the contemporary concept of human rights differs from, and goes beyond that of natural rights. First, he argues that contemporary human rights are far more concerned to view the realization of equality as requiring positive action by the state, via the provision of welfare assistance, for example. Advocates of natural rights, he argues, were far more inclined to view equality in formalistic terms, as principally requiring the state to refrain from ‘interfering’ in individuals’ lives. Second, he argues that, whereas advocates of natural rights tended to conceive of human beings as mere individuals, veritable ‘islands unto themselves’, advocates of contemporary human rights are far more willing to recognize the importance of family and community in individuals’ lives. Third, Nickel views contemporary human rights as being far more ‘internationalist’ in scope and orientation than was typically found within arguments in support of natural rights. That is to say, the protection and promotion of human rights are increasingly seen as requiring international action and concern. The distinction drawn by Nickel between contemporary human rights and natural rights allows one to discern the development of the concept of human rights. Indeed, many writers on human rights agree in the identification of three generations of human rights. First generation rights consist primarily of rights to security, property, and political participation. These are most typically associated with the French and US Declarations. Second generation rights are construed as socio-economic rights, rights to welfare, education, and leisure, for example. These rights largely originate within the UDHR. The final and third generation of rights are associated with such rights as a right to national self-determination, a clean environment, and the rights of indigenous minorities. This generation of rights really only takes hold during the last two decades of the 20th. Century but represents a significant development within the doctrine of human rights generally.

While the full significance of human rights may only be finally dawning on some people, the concept itself has a history spanning over two thousand years. The development of the concept of human rights is punctuated by the emergence and assimilation of various philosophical and moral ideals and appears to culminate, at least to our eyes, in the establishment of a highly complex set of legal and political documents and institutions, whose express purpose is the protection and promotion of the fundamental rights of all human beings everywhere. Few should underestimate the importance of this particular current of human history.

3. Philosophical Analysis of the Concept of Human Rights

Human rights are rights that attach to human beings and function as moral guarantees in support of our claims towards the enjoyment of a minimally good life. In conceptual terms, human rights are themselves derivative of the concept of a right. This section focuses upon the philosophical analysis of the concept of a ‘right’ in order to clearly demonstrate the various constituent parts of the concept from which human rights emerges. In order to gain a full understanding of both the philosophical foundations of the doctrine of human rights and the different ways in which separate human rights function, a detailed analysis is required.

a. Moral vs. Legal Rights

The distinction drawn between moral rights and legal rights as two separate categories of rights is of fundamental importance to understanding the basis and potential application of human rights. Legal rights refer to all those rights found within existing legal codes. A legal right is a right that enjoys the recognition and protection of the law. Questions as to its existence can be resolved by simply locating the relevant legal instrument or piece of legislation. A legal right cannot be said to exist prior to its passing into law and the limits of its validity are set by the jurisdiction of the body which passed the relevant legislation. An example of a legal right would be my daughter’s legal right to receive an adequate education, as enshrined within the United Kingdom’s Education Act (1944). Suffice it to say, that the exercise of this right is limited to the United Kingdom. My daughter has no legal right to receive an adequate education from a school board in Southern California. Legal positivists argue that the only rights that can be said to legitimately exist are legal rights, rights that originate within a legal system. On this view, moral rights are not rights in the strict sense, but are better thought of as moral claims, which may or may not eventually be assimilated within national or international law. For a legal positivist, such as the 19th. Century legal philosopher Jeremy Bentham, there can be no such thing as human rights existing prior to, or independently from legal codification. For a positivist determining the existence of rights is no more complicated than locating the relevant legal statute or precedent. In stark contrast, moral rights are rights that, it is claimed, exist prior to and independently from their legal counterparts. The existence and validity of a moral right is not deemed to be dependent upon the actions of jurists and legislators. Many people argued, for example, that the black majority in apartheid South Africa possessed a moral right to full political participation in that country’s political system, even though there existed no such legal right. What is interesting is that many people framed their opposition to apartheid in rights terms. What many found so morally repugnant about apartheid South Africa was precisely its denial of numerous fundamental moral rights, including the rights not to be discriminated against on grounds of colour and rights to political participation, to the majority of that country’s inhabitants. This particular line of opposition and protest could only be pursued because of a belief in the existence and validity of moral rights. A belief that fundamental rights which may or may not have received legal recognition elsewhere, remained utterly valid and morally compelling even, and perhaps especially, in those countries whose legal systems had not recognized these rights. A rights-based opposition to apartheid South Africa could not have been initiated and maintained by appeal to legal rights, for obvious reasons. No one could legitimately argue that the legal political rights of non-white South Africans were being violated under apartheid, since no such legal rights existed. The systematic denial of such rights did, however, constitute a gross violation of those peoples’ fundamental moral rights.

From the above example it should be clear that human rights cannot be reduced to, or exclusively identified with legal rights. The legal positivist’s account of justified law excludes the possibility of condemning such systems as apartheid from a rights perspective. It might, therefore, appear tempting to draw the conclusion that human rights are best identified as moral rights. After all, the existence of the UDHR and various International Covenants, to which South Africa was not a signatory in most cases, provided opponents of apartheid with a powerful moral argument. Apartheid was founded upon the denial of fundamental human rights. Human rights certainly share an essential quality of moral rights, namely, that their valid existence is not deemed to be conditional upon their being legally recognized. Human rights are meant to apply to all human beings everywhere, regardless of whether they have received legal recognition by all countries everywhere. Clearly, there remain numerous countries that wholly or partially exclude formal legal recognition to fundamental human rights. Supporters of human rights in these countries insist that the rights remain valid regardless, as fundamental moral rights. The universality of human rights positively entails such claims. The universality of human rights as moral rights clearly lends greater moral force to human rights. However, for their part, legal rights are not subject to disputes as to their existence and validity in quite the way moral rights are. It would be a mistake to exclusively identify human rights with moral rights. Human rights are better thought of as both moral rights and legal rights. Human rights originate as moral rights and their legitimacy is necessarily dependent upon the legitimacy of the concept of moral rights. A principal aim of advocates of human rights is for these rights to receive universal legal recognition. This was, after all, a fundamental goal of the opponents of apartheid. Human rights are best thought of, therefore, as being both moral and legal rights. The legitimacy claims of human rights are tied to their status as moral rights. The practical efficacy of human rights is, however, largely dependent upon their developing into legal rights. In those cases where specific human rights do not enjoy legal recognition, such as in the example of apartheid above, moral rights must be prioritised with the intention that defending the moral claims of such rights as a necessary prerequisite for the eventual legal recognition of the rights in question.

b. Claim Rights & Liberty Rights

To gain an understanding of the functional properties of human rights it is necessary to consider the more specific distinction drawn between claim rights and liberty rights. It should be noted that it is something of a convention to begin such discussions by reference to W.N. Hohfeld’s (1919) more extended classification of rights. Hohfeld identified four categories of rights: liberty rights, claim rights, power rights, and immunity rights. However, numerous scholars have subsequently tended to collapse the last two within the first two and hence to restrict attention to liberty rights and claim rights. The political philosopher Peter Jones (1994) provides one such example.

Jones restricts his focus to the distinction between claim rights and liberty rights. He conforms to a well-established trend in rights’ analysis in viewing the former as being of primary importance. Jones defines a claim right as consisting of being owed a duty. A claim right is a right one holds against another person or persons who owe a corresponding duty to the right holder. To return to the example of my daughter. Her right to receive an adequate education is a claim right held against the local education authority, which has a corresponding duty to provide her with the object of the right. Jones identifies further necessary distinctions within the concept of a claim right when he distinguishes between a positive claim right and a negative claim right. The former are rights one holds to some specific good or service, which some other has a duty to provide. My daughter’s claim right to education is therefore a positive claim right. Negative claim rights, in contrast, are rights one holds against others’ interfering in or trespassing upon one’s life or property in some way. My daughter could be said to possess a negative claim right against others attempting to steal her mobile phone, for example. Indeed, such examples lead on to the final distinction Jones identifies within the concept of claim rights: rights held ‘in personam’ and rights held ‘in rem’. Rights held in personam are rights one holds against some specifically identified duty holder, such as the education authority. In contrast, rights held in rem are rights held against no one in particular, but apply to everyone. Thus, my daughter’s right to an education would be practically useless were it not held against some identifiable, relevant, and competent body. Equally, her right against her mobile phone being stolen from her would be highly limited if it did not apply to all those capable of potentially performing such an act. Claim rights, then, can be of either a positive or a negative character and they can be held either in personam or in rem.

Jones defines liberty rights as rights which exist in the absence of any duties not to perform some desired activity and thus consist of those actions one is not prohibited from performing. In contrast to claim rights, liberty rights are primarily negative in character. For example, I may be said to possess a liberty right to spend my vacations lying on a particularly beautiful beach in Greece. Unfortunately, no one has a duty to positively provide for this particular exercise of my liberty right. There is no authority or body, equivalent to an education authority, for example, who has a responsibility to realize my dream for me. A liberty right can be said, then, to be a right to do as one pleases precisely because one is not under an obligation, grounded in others’ claim rights, to refrain from so acting. Liberty rights provide for the capacity to be free, without actually providing the specific means by which one may pursue the objects of one’s will. For example, a multi-millionaire and a penniless vagrant both possess an equal liberty right to holiday in the Caribbean each year.

c. Substantive Categories of Human Rights

The above section was concerned to analyse what might be termed the ‘formal properties’ of rights. This section, in contrast, proceeds to consider the different categories of substantive human rights. If one delves into all of the various documents that together form the codified body of human rights, one can identify and distinguish between five different categories of substantive human rights. These are as follows: rights to life; rights to freedom; rights to political participation; rights to the protection of the rule of law; rights to fundamental social, economic, and cultural goods. These rights span the so-called three generations of rights and involve a complex combination of both liberty and claim rights. Some rights, such as for example the right to life, consist of both liberty and claim rights in roughly equal measure. Thus, the adequate protection of the right to life requires the existence of liberty rights against others trespassing against one’s person and the existence of claim rights to have access to basic prerequisites to sustaining one’s life, such as an adequate diet and health-care. Other rights, such as social, economic, and cultural rights, for example, are weighted more heavily towards the existence of various claim rights, which requires the positive provision of the objects of such rights. The making of substantive distinctions between human rights can have controversial, but important, consequences. Human rights are typically understood to be of equal value, each right is conceived of as equally important as every other. On this view, there can exist no potential for conflict between fundamental human rights. One is simply meant to attach equal moral weight to each and every human right. This prohibits arranging human rights in order of importance. However, conflict between rights can and does occur. Treating all human rights as of equal importance prohibits any attempts to address or resolve such conflict when it arises. Take the example of a hypothetical developing world country with severely limited financial and material resources. This country is incapable of providing the resources for realising all of the human rights for all of its citizens, though it is committed to doing so. In the meantime, government officials wish to know which human rights are more absolute than others, which fundamental human rights should it immediately prioritise and seek to provide for? This question, of course, cannot be answered if one sticks to the position that all rights are of equal importance. It can only be addressed if one allows for the possibility that some human rights are more fundamental than others and that the morally correct action for the government to take would be to prioritise these rights. A refusal to do so, no matter how consistent it may be philosophically would be tantamount to dogmatically sticking one’s head in the metaphorical sands. Attempting to make such distinctions is, of course, a philosophically fraught exercise. It clearly requires the existence of some more ultimate criteria against which one can ‘measure’ the relative importance of separate human rights. This is a highly controversial issue within the philosophy of human rights and one which I shall return to when I consider how philosophers attempt to justify the doctrine of human rights. What remains to be addressed in our analysis of the concept of a human right are the questions of what adequately implementing human rights generally requires, and upon whom does this task fall; who has responsibility for protecting and promoting human rights and what is required of them to do so?

d. Scope of Human Rights Duties

Human rights are said to be possessed equally, by everyone. A conventional corollary of this claim is that everyone has a duty to protect and promote the human rights of everyone else. However, in practice, the onus for securing human rights typically falls upon national governments and international, inter-governmental bodies. Philosophers such as Thomas Pogge (1995) argue that the moral burden for securing human rights should fall disproportionately upon such institutions precisely because they are best placed and most able to effectively perform the task. On this reading, non-governmental organizations and private citizens have an important role to play in supporting the global protection of human rights, but the onus must fall upon the relevant national and international institutions, such as the governments of nation-states and such bodies as the United Nations and the World Bank. One might wish to argue that, for example, human rights can be adequately secured by the existence of reciprocal duties held between individuals across the globe. However, ‘privatizing’ human rights in this fashion would ignore two particularly salient factors: individuals have a tendency to prioritise the moral demands of those closest to them, particularly members of their own family or immediate community; individuals’ ability to exercise their duties is, to a large extent, determined by their own personal financial circumstances. Thus, global inequalities in the distribution of wealth fundamentally undermine the ability of those in the poorer countries to reciprocate assistance provided them by those living in wealthier countries. Reasons such as these underlie Pogge’s insistence that the onus of responsibility lies at the level of national and international institutions. Adequately protecting and promoting human rights requires both nation-states ensuring the adequate provision of services and institutions for their own citizens and the co-operation of nation-states within international institutions acting to secure the requisite global conditions for the protection and promotion of everyone’s human rights.

What must such bodies actively do to adequately secure individuals’ human rights? Does my daughter’s human right to receive an adequate education require the education authority to do everything possible to assist and enhance my child’s education? Does it require the provision of a world-class library, frequent study trips abroad, and employing the most able and best-qualified teachers? The answer is, of course, no. Given the relative scarcity of resources and the demands placed upon those resources, we are inclined to say that adequately securing individuals’ human rights extends to the establishment of decent social and governmental practice so as to ensure that all individuals have the opportunity of leading a minimally good life. In the first instance, national governments are typically held to be primarily responsible for the adequate provision of their own citizens’ human rights. Philosophers such as Brian Orend (2002) endorse this aspiration when he writes that the object of human rights is to secure ‘minimal levels of decent and respectful treatment.’ It is important to note, however, that the duty ensure the provision of even minimal levels of decent and respectful treatment cannot be strictly limited by national boundaries. The adequate protection and promotion of everyone’s human rights does require, for example, the more affluent and powerful nation-states providing sufficient assistance to those countries currently incapable of adequately ensuring the protection of their own citizens’ basic human rights. While some may consider Orend’s aspirations for human rights to be unduly cautious, even the briefest survey of the extent of human suffering and deprivation in many parts of the world today is sufficient to demonstrate just how far we are from realizing even this fairly minimal standard.

National and international institutions bear the primary responsibility of securing human rights and the test for successfully fulfilling this responsibility is the creation of opportunities for all individuals to lead a minimally good life. The realization of human rights requires establishing the conditions for all human beings to lead minimally good lives and thus should not be confused as an attempt to create a morally perfect society. The impression that many have of human rights as being unduly utopian testifies less to the inherent demands of human rights and more to the extent to which even fairly modest aspirations are so far from being realized in the world today. The actual aspirations of human rights are, on the face of it, quite modest. However, this should not distract from a full appreciation of the possible force of human rights. Human rights call for the creation of politically democratic societies in which all citizens have the means of leading a minimally good life. While the object of individual human rights may be modest, the force of that right is intended to be near absolute. That is to say, the demands of rights are meant to take precedence over other possible social goals. Ronald Dworkin has coined the term ‘rights as trumps’ to describe this property. He writes that, ‘rights are best understood as trumps over some background justification for political decisions that states a goal for the community as a whole.’ (1977:153) In general, Dworkin argues, considerations of rights claims must take priority over alternative considerations when formulating public policy and distributing public benefits. Thus, for example, a minority’s possession of rights against discriminatory treatment should trump any and all considerations of the possible benefits that the majority would derive from discriminating against the minority group. Similarly, an individual’s right to an adequate diet should trump other individuals’ desires to eat lavish meals, despite the aggregate gain in pleasure these individuals would derive. For Dworkin, rights as trumps expresses the fundamental ideal of equality upon which the contemporary doctrine of human rights rests. Treating rights as trumps is a means for ensuring that all individuals are treated in an equal and like fashion in respect of the provision of fundamental human rights. Fully realizing the aspirations of human rights may not require the provision of ‘state of the art’ resources, but this should not detract from the force of human rights as taking priority over alternative social and political considerations.

4. Philosophical Justifications of Human Rights

We have established that human rights originate as moral rights but that the successful passage of many human rights into international and national law enables one to think of human rights as, in many cases, both moral rights and legal rights. Furthermore, human rights may be either claim rights or liberty rights, and have a negative or a positive complexion in respect of the obligations imposed by others in securing the right. Human rights may be divided into five different categories and the principal object of securing human rights is the creation of the conditions for all individuals to have the opportunity to lead a minimally good life. Finally, human rights are widely considered to trump other social and political considerations in the allocation of public resources. Broadly speaking, philosophers generally agree on such issues as the formal properties of human rights, the object of human rights, and the force of human rights. However, there is much less agreement upon the fundamental question on how human rights may be philosophically justified. It would be fair to say that philosophers have provided many different, at times even conflicting, answers to this question. Philosophers have sought to justify human rights by appeal to single ideals such as equality, autonomy, human dignity, fundamental human interests, the capacity for rational agency, and even democracy. For the purposes of clarity and relative simplicity I will focus upon the two, presently most prominent, philosophical attempts to justify human rights: interests theory and will theory. Before I do that, it is necessary to address a prior question.

a. Do Human Rights Require Philosophical Justification?

Many people tend to take the validity of human rights for granted. Certainly, for many non-philosophers human rights may all too obviously appear to rest upon self-evidently true and universally valid moral principles. In this respect, human rights may be perceived as empirical facts about the contemporary world. Human rights do exist and many people do act in accordance with the correlative duties and obligations respecting human rights entails. No supporter of human rights could possibly complain about such perceptions. If nothing else, the prevalence of such views is pragmatically valuable for the cause of human rights. However, moral philosophers do not enjoy such licence for epistemological complacency. Moral philosophers remain concerned by the question of the philosophical foundations of human rights. There is a good reason why we should all be concerned with such a question. What might be termed the ‘philosophically naïve’ view of human rights effectively construes human rights as legal rights. The validity of human rights is closely tied to, and dependent upon, the legal codification of human rights. However, as was argued earlier, such an approach is not sufficient to justify human rights. Arguments in support of the validity of any moral doctrine can never be settled by simply pointing to the empirical existence of particular moral beliefs or concepts. Morality is fundamentally concerned with what ought to be the case, and this cannot be settled by appeals to what is the case, or is perceived to be the case. From such a basis, it would have been very difficult to argue that apartheid South Africa, to take an earlier example, was a morally unjust regime. One must not confuse the law with morality, per se. Nor consider the two to be simply co-extensional. Human rights originate as moral rights. Human rights claim validity everywhere and for everyone, irrespective of whether they have received comprehensive legal recognition, and even irrespective of whether everyone is agreement with the claims and principles of human rights. Thus, one cannot settle the question of the philosophical validity of human rights by appealing to purely empirical observations upon the world. As a moral doctrine, human rights have to be demonstrated to be valid as norms and not facts. In order to achieve this, one has to turn to moral philosophy. Presently, two particular approaches to the question of the validity of human rights predominate: what might be loosely termed the ‘interests theory approach’ and the ‘will theory approach’.

b. The Interests Theory Approach

Advocates of the interests theory approach argue that the principal function of human rights is to protect and promote certain essential human interests. Securing human beings’ essential interests is the principal ground upon which human rights may be morally justified. The interests approach is thus primarily concerned to identify the social and biological prerequisites for human beings leading a minimally good life. The universality of human rights is grounded in what are considered to be some basic, indispensable, attributes for human well-being, which all of us are deemed necessarily to share. Take, for example, an interest each of us has in respect of our own personal security. This interest serves to ground our claim to the right. It may require the derivation of other rights as prerequisites to security, such as the satisfaction of basic nutritional needs and the need to be free from arbitrary detention or arrest, for example. The philosopher John Finnis provides a good representative of the interests theory approach. Finnis (1980) argues that human rights are justifiable on the grounds of their instrumental value for securing the necessary conditions of human well-being. He identifies seven fundamental interests, or what he terms ‘basic forms of human good’, as providing the basis for human rights. These are: life and its capacity for development; the acquisition of knowledge, as an end in itself; play, as the capacity for recreation; aesthetic expression; sociability and friendship; practical reasonableness, the capacity for intelligent and reasonable thought processes; and finally, religion, or the capacity for spiritual experience. According to Finnis, these are the essential prerequisites for human well-being and, as such, serve to justify our claims to the corresponding rights, whether they be of the claim right or liberty right variety.

Other philosophers who have defended human rights from an interests-based approach have addressed the question of how an appeal to interests can provide a justification for respecting and, when necessary, even positively acting to promote the interests of others. Such questions have a long heritage in western moral and political philosophy and extend at least as far back as the 17th. Century philosopher Thomas Hobbes. Typically, this approach attempts to provide what James Nickel (1987:84) has termed ‘prudential reasons’ in support of human rights. Taking as the starting point the claim that all human beings possess basic and fundamental interests, advocates of this approach argue that each individual owes a basic and general duty to respect the rights of every other individual. The basis for this duty is not mere benevolence or altruism, but individual self-interest. As Nickel writes, ‘a prudential argument from fundamental interests attempts to show that it would be reasonable to accept and comply with human rights, in circumstances where most others are likely to do so, because these norms are part of the best means for protecting one’s fundamental interests against actions and omissions that endanger them.’ (ibid). Protecting one’s own fundamental interests requires others’ willingness to recognize and respect these interests, which, in turn, requires reciprocal recognition and respect of the fundamental interests of others. The adequate protection of each individual’s fundamental interests necessitates the establishment of a co-operative system, the fundamental aim of which is not to promote the common good, but the protection and promotion of individuals’ self-interest.

For many philosophers the interests approach provides a philosophically powerful defence of the doctrine of human rights. It has the apparent advantage of appealing to human commonality, to those attributes we all share, and, in so doing, offers a relatively broad-based defence of the plethora of human rights considered by many to be fundamental and inalienable. The interests approach also provides for the possibility of resolving some of the potential disputes which can arise over the need to prioritise some human rights over others. One may do this, for example, by hierarchically ordering the corresponding interests identified as the specific object, or content, of each right.

However, the interests approach is subject to some significant criticisms. Foremost amongst these is the necessary appeal interests’ theorists make to some account of human nature. The interests-approach is clearly operating with, at the very least, an implicit account of human nature. Appeals to human nature have, of course, proven to be highly controversial and typically resist achieving the degree of consensus required for establishing the legitimacy of any moral doctrine founded upon an account of human nature. For example, combining the appeal to fundamental interests with the aspiration of securing the conditions for each individual leading a minimally good life would be complicated by social and cultural diversity. Clearly, as the economic philosopher Amartya Sen (1999) has argued, the minimal conditions for a decent life are socially and culturally relative. Providing the conditions for leading a minimally good life for the residents of Greenwich Village would be significantly different to securing the same conditions for the residents of a shanty town in Southern Africa or South America. While the interests themselves may be ultimately identical, adequately protecting these interests will have to go beyond the mere specification of some purportedly general prerequisites for satisfying individuals’ fundamental interests. Other criticisms of the interests approach have focused upon the appeal to self-interest as providing a coherent basis for fully respecting the rights of all human beings. This approach is based upon the assumption that individuals occupy a condition of relatively equal vulnerability to one another. However, this is simply not the case. The model cannot adequately defend the claim that a self-interested agent must respect the interests of, for example, much less powerful or geographically distant individuals, if she wishes to secure her own interests. On these terms, why should a purely self-interested and over-weight individual in, say, Los Angeles or London, care for the interests of a starving individual in some distant and impoverished continent? In this instance, the starving person is not in a position to affect their overweight counterpart’s fundamental interests. The appeal to pure self-interest ultimately cannot provide a basis for securing the universal moral community at the heart of the doctrine of human rights. It cannot justify the claims of universal human rights. An even more philosophically oriented vein of criticism focuses upon the interests’ based approach alleged neglect of constructive human agency as a fundamental component of morality generally. Put simply, the interests-based approach tends to construe our fundamental interests as pre-determinants of human moral agency. This can have the effect of subordinating the importance of the exercise of freedom as a principal moral ideal. One might seek to include freedom as a basic human interest, but freedom is not constitutive of our interests on this account. This particular concern lies at the heart of the so-called ‘will approach’ to human rights.

c. The Will Theory Approach

In contrast to the interests approach, the will theory attempts to establish the philosophical validity of human rights upon a single human attribute: the capacity for freedom. Will theorists argue that what is distinctive about human agency is the capacity for freedom and that this ought to constitute the core of any account of rights. Ultimately, then, will theorists view human rights as originating in, or reducible to, a single, constitutive right, or alternatively, a highly limited set of purportedly fundamental attributes. H.L.A. Hart, for example, inferentially argues that all rights are reducible to a single, fundamental right. He refers to this as ‘equal right of all men to be free.’ (1955:77). Hart insists that rights to such things as political participation or to an adequate diet, for example, are ultimately reducible to, and derivative of, individuals’ equal right to liberty. Henry Shue (1996) develops upon Hart’s inferential argument and argues that liberty alone is not ultimately sufficient for grounding all of the rights posited by Hart. Shue argues that many of these rights imply more than mere individual liberty and extend to include security from violence and the necessary material conditions for personal survival. Thus, he grounds rights upon liberty, security, and subsistence. The moral philosopher Alan Gewirth (1978, 1982) has further developed upon such themes. Gewirth argues that the justification of our claims to the possession of basic human rights is grounded in what he presents as the distinguishing characteristic of human beings generally: the capacity for rationally purposive agency. Gewirth states that the recognition of the validity of human rights is a logical corollary of recognizing oneself as a rationally purposive agent since the possession of rights are the necessary means for rationally purposive action. Gewirth grounds his argument in the claim that all human action is rationally purposive. Every human action is done for some reason, irrespective of whether it be a good or a bad reason. He argues that in rationally endorsing some end, say the desire to write a book, one must logically endorse the means to that end; as a bare minimum one’s own literacy. He then asks what is required to be a rationally purposive agent in the first place? He answers that freedom and well-being are the two necessary conditions for rationally purposive action. Freedom and well-being are the necessary means to acting in a rationally purposive fashion. They are essential prerequisites for being human, where to be human is to possess the capacity for rationally purposive action. As essential prerequisites, each individual is entitled to have access to them. However, Gewirth argues that each individual cannot simply will their own enjoyment of these prerequisites for rational agency without due concern for others. He bases the necessary concern for others’ human rights upon what he terms the ‘principle of generic consistency’ (PGC). Gewirth argues that each individual’s claim to the basic means for rationally purposive action is based upon an appeal to a general, rather than, specific attribute of all relevant agents. I cannot logically will my own claims to basic human rights without simultaneously accepting the equal claims of all rationally purposive agents to the same basic attributes. Gewirth has argued that there exists an absolute right to life possessed separately and equally by all of us. In so claiming, Gewirth echoes Dworkin’s concept of rights as trumps, but ultimately goes further than Dworkin is prepared to do by arguing that the right to life is absolute and cannot, therefore, be overridden under any circumstances. He states that a ‘right is absolute when it cannot be overridden in any circumstances, so that it can never be justifiably infringed and it must be fulfilled without any exceptions.’ (1982:92). Will theorists then attempt to establish the validity of human rights upon the ideal of personal autonomy: rights are a manifestation of the exercise of personal autonomy. In so doing, the validity of human rights is necessarily tied to the validity of personal autonomy. On the face of it, this would appear to be a very powerful, philosophical position. After all, as someone like Gewirth might argue, critics of this position would themselves necessarily be acting autonomously and they cannot do this without simultaneously requiring the existence of the very means for such action: even in criticizing human rights one is logically pre-supposing the existence of such rights.

Despite the apparent logical force of the will approach, it has been subjected to various forms of criticism. A particularly important form of criticism focuses upon the implications of will theory for so-called ‘marginal cases’; human beings who are temporarily or permanently incapable of acting in a rationally autonomous fashion. This would include individuals who have diagnosed from suffering from dementia, schizophrenia, clinical depression, and, also, individuals who remain in a comatose condition, from which they may never recover. If the constitutive condition for the possession of human rights is said to be the capacity for acting in a rationally purposive manner, for example, then it seems to logically follow, that individuals incapable of satisfying this criteria have no legitimate claim to human rights. Many would find this conclusion morally disturbing. However, a strict adherence to the will approach is entailed by it. Some human beings are temporarily or permanently lacking the criteria Gewirth, for instance, cites as the basis for our claims to human rights. It is difficult to see how they could be assimilated within the community of the bearers of human rights on the terms of Gewirth’s argument. Despite this, the general tendency is towards extending human rights considerations towards many of the so-called ‘marginal cases’. To do otherwise would appear to many to be intuitively wrong, if not ultimately defensible by appeal to practical reason. This may reveal the extent to which many peoples’ support of human rights includes an ineluctable element of sympathy, taking the form of a general emotional concern for others. Thus, strictly applying the will theorists’ criteria for membership of the community of human rights bearers would appear to result in the exclusion of some categories of human beings who are presently recognized as legitimate bearers of human rights.

The interests theory approach and the will theory approach contain strengths and weaknesses. When consistently and separately applied to the doctrine of human rights, each approach appears to yield conclusions that may limit or undermine the full force of those rights. It may be that philosophical supporters of human rights need to begin to consider the potential philosophical benefits attainable through combining various themes and elements found within these (and other) philosophical approaches to justifying human rights. Thus, further attempts at justifying the basis and content of human rights may benefit from pursuing a more thematically pluralist approach than has typically been the case to date.

5. Philosophical Criticisms of Human Rights

The doctrine of human rights has been subjected to various forms of fundamental, philosophical criticism. These challenges to the philosophical validity of human rights as a moral doctrine differ from critical appraisals of the various philosophical theories supportive of the doctrine for the simple reason that they aim to demonstrate what they perceive to the philosophical fallacies upon which human rights are founded. Two such forms of critical analysis bear particular attention: one which challenges the universalist claims of human rights, and another which challenges the presumed objective character of human rights principles.

a. Moral Relativism

Philosophical supporters of human rights are necessarily committed to a form of moral universalism. As moral principles and as a moral doctrine, human rights are considered to be universally valid. However, moral universalism has long been subject to criticism by so-called moral relativists. Moral relativists argue that universally valid moral truths do not exist. For moral relativists, there is simply no such thing as a universally valid moral doctrine. Relativists view morality as a social and historical phenomenon. Moral beliefs and principles are therefore thought of as socially and historically contingent, valid only for those cultures and societies in which they originate and within which they are widely approved. Relativists point to the vast array of diverse moral beliefs and practices apparent in the world today as empirical support for their position. Even within a single, contemporary society, such as the United States or Great Britain, one can find a wide diversity of fundamental moral beliefs, principles, and practices. Contemporary, complex societies are thus increasingly considered to be pluralist and multicultural in character. For many philosophers the multicultural character of such societies serves to fundamentally restrict the substance and scope of the regulative political principles governing those societies. In respect of human rights, relativists have tended to focus upon such issues as the presumed individualist character of the doctrine of human rights. It has been argued by numerous relativists that human rights are unduly biased towards morally individualist societies and cultures, at the necessary expense of the communal moral complexion of many Asian and African societies. At best, some human rights’ articles may be considered to be redundant within such societies, at worse they may appear to be positively harmful if fully implemented, replacing the fundamental values of one civilization with those of another and thereby perpetuating a form of cultural and moral imperialism.

The philosophical debate between universalists and relativists is far too complex to adequately summarise here. However, certain immediate responses to the relativist critique of human rights are immediately available. First, merely pointing to moral diversity and the presumed integrity of individual cultures and societies does not, by itself, provide a philosophical justification for relativism, nor a sufficient critique of universalism. After all, there have existed and continue to exist many cultures and societies whose treatment of their own people leaves much to be desired. Is the relativist genuinely asking us to recognize and respect the integrity of Nazi Germany, or any other similarly repressive regime? There can be little doubt that, as it stands, relativism is incompatible with human rights. On the face of it, this would appear to lend argumentative weight to the universalist support of human rights. After all, one may speculate as to the willingness of any relativist to actually forego their possession of human rights if and when the social surroundings demanded it. Similarly, relativist arguments are typically presented by members of the political elites within those countries whose systematic oppression of their peoples has attracted the attention of advocates of human rights. The exponential growth of grass-roots human rights organizations across many countries in the world whose cultures are alleged to be incompatible with the implementation of human rights, raises serious questions as to the validity and integrity of such ‘indigenous’ relativists. At its worst, the doctrine of moral relativism may be being deployed in an attempt to illegitimately justify oppressive political systems. The concern over the presumed incompatibility between human rights and communal moral systems appears to be a more valid issue. Human rights have undeniably conceived of the principal bearer of human rights as the individual person. This is due, in large part, to the Western origins of human rights. However, it would be equally fair to say that the so-called ‘third generation’ of human rights is far more attuned to the communal and collective basis of many individuals’ lives. In keeping with the work of political philosophers such as Will Kymlicka, there is increasing awareness of the need to tailor human rights principles to such things as the collective rights of minorities and, for example, these minorities’ claims to such things as communal land rights. While human rights remain philosophically grounded within an individualist moral doctrine, there can be no doubt that attempts are being made to adequately apply and human rights to more communally oriented societies. Human rights can no longer be accused of being ‘culture-blind’.

b. Epistemological Criticisms of Human Rights

The second most important contemporary philosophical form of human rights’ criticism challenges the presumed objective basis of human rights as moral rights. This form of criticism may be thought of as a river into which run many philosophical tributaries. The essence of these attempts to refute human rights consists in the claim that moral principles and concepts are inherently subjective in character. On this view moral beliefs do not emanate from a correct determination of a rationally purposive will, or even gaining insight into the will of some divine being. Rather, moral beliefs are fundamentally expressions of individuals’ partial preferences. This position therefore rejects the principal ground upon which the concept of moral rights rests: that there exist rational and a priori moral principles upon which a correct and legitimate moral doctrine is to be founded. In modern, as opposed to ancient, philosophy this argument is most closely associated with the 18th. Century Scottish philosopher David Hume. More recently versions of it have been defended by the likes of C.L.Stevenson, Ludwig Wittgenstein, J.L.Mackie, and Richard Rorty. Indeed, Rorty (1993) has argued that human rights are based not upon the exercise of reason, but a sentimental vision of humanity. He insists that human rights are not rationally defensible. He argues that one cannot justify the basis of human rights by appeal to moral theory and the canons of reason since, he insists, moral beliefs and practices are not ultimately motivated by an appeal to reason or moral theory, but emanate from a sympathetic identification with others: morality originates in the heart, and not in the head. Interestingly, though unambiguously sceptical about the philosophical basis of human rights, Rorty views the existence of human rights as a ‘good and desirable thing’, something whose existence we all benefit from. His critique of human rights is this not motivated by an underlying hostility to the doctrine. For Rorty, human rights are better served by emotional appeals to identify with the unnecessary suffering of others, than by arguments over the correct determination of reason.

Rorty’s emphasis upon the importance of an emotional identification with others is a legitimate concern. It may, for example, provide additional support for the philosophical arguments presented by the likes of Gewirth. However, as Michael Freeman has recently pointed out, ‘Rorty’s argument…confuses motivation and justification. Sympathy is an emotion. Whether the action we take on the basis of our emotions is justified depends on the reasons for the action. Rorty wishes to eliminate unprovable metaphysical theories from philosophy, but in his critique of human-rights theory he goes too far, and eliminates reasoning.’ (2002:56) Rorty’s own account of the basis and scope of moral knowledge ultimately prohibits him from claiming that human rights is a morally desirable phenomenon, since he explicitly rules out the validity of appealing to the independently verifiable criteria required to uphold any such judgement. What we require from Rorty is an independent reason for accepting his conclusion. It is precisely this that he denies may be legitimately provided by moral philosophy.

Rorty aside, the general critique of moral objectivity has a long and very well-established heritage in modern moral philosophy. It would be false to claim that either the objectivists or the subjectivists have scored any ultimate ‘knock-down’ over their philosophical opponents. Human rights are founded upon the claim to moral objectivity, whether by appeal to interests or the will. Any critique of moral objectivism is bound, therefore, to have repercussions for the philosophical defence of human rights. As I noted above, philosophers such as Alan Gewirth and John Finnis, in their separate and different ways, have attempted to establish the rational and objective force of human rights. The reader interested in pursuing this particular theme further is therefore recommended to pursue a close philosophical analysis of either, or both, of these two philosophers.

6. Conclusion

Human rights have a long historical heritage. The principal philosophical foundation of human rights is a belief in the existence of a form of justice valid for all peoples, everywhere. In this form, the contemporary doctrine of human rights has come to occupy centre stage in geo-political affairs. The language of human rights is understood and utilized by many peoples in very diverse circumstances. Human rights have become indispensable to the contemporary understanding of how human beings should be treated, by one another and by national and international political bodies. Human rights are best thought of as potential moral guarantees for each human being to lead a minimally good life. The extent to which this aspiration has not been realized represents a gross failure by the contemporary world to institute a morally compelling order based upon human rights. The philosophical basis of human rights has been subjected to consistent criticism. While some aspects of the ensuing debate between philosophical supporters and opponents of human rights remain unresolved and, perhaps, irresolvable, the general case for human rights remains a morally powerful one. Arguably, the most compelling motivation for the existence of human may rest upon the exercise of imagination. Try imagining a world without human rights!

7. References and Further Reading

  • Dworkin, Ronald. Taking Rights Seriously, (London: Duckworth, 1978)
  • Freeman, Michael. Human Rights: An Interdisciplinary Approach, (Cambridge: Polity, 2002)
  • Finnis, John. Natural Law and Natural Rights, (Oxford; Clarendon Press, 1980)
  • Gewirth, Alan. Reason and Morality, (Chicago: Chicago University Press, 1978)
  • Gewirth, Alan. Human Rights: Essays on Justification and Applications, (Chicago; University of Chicago Press, 1982)
  • Jones, Peter. Rights, (Basingstoke; Macmillan, 1994)
  • Mackie, J.L. Ethics: Inventing Right and Wrong, (Harmondsworth; Penguin, 1977)
  • Nickel, James. Making Sense of Human Rights: Philosophical Reflections on the Universal Declaration of Human Rights, (Berkeley; University of California Press, 1987)
  • Rorty, Richard. “Human rights, rationality, and sentimentality”. In S.Shute & S. Hurley (eds.) On Human Rights: the Oxford Amnesty Lectures 1993, (New York; Basic Books, 1993)
  • Waldron, Jeremy. Theories of Rights, (Oxford; Oxford University Press, 1984) Chapters by Ronald Dworkin, Alan Gewirth, and H.L.A.Hart

Author Information

Andrew Fagan
Email: fagaaw@essex.ac.uk
University of Essex
United Kingdom

St. Louis Hegelians

The common name given to a group of amateur philosophers founded and led by William Torrey Harris (1835-1909) and Henry Conrad Brokmeyer (1828-1906). Harris, a New Englander born in Connecticut and educated at Yale, first became acquainted with idealism through the Transcendentalists, mainly from his attendance in 1857 at the Orphic Seer’s Conversations of Amos Bronson Alcott (1799-1888). The experience inspired Harris to leave Yale before obtaining a degree, and set off west to St. Louis to seek his vocation. Initially he took a position teaching shorthand in the St. Louis Public Schools, but he quickly advanced through the system, eventually becoming Superintendent of Schools, a position he held from 1867 to 1880. Brokmeyer was a Prussian immigrant who arrived in New York as a young man of sixteen. Bold and restless in temperament, he made his way westward, acquiring a small fortune by running a shoe factory in Mississippi. Desiring to further his education, he abandoned his business pursuits to enter Georgetown University in Kentucky, but his quarrelsome character led to his departure for Brown University in Providence, Rhode Island, only to leave that institution as well after a heated debate with President Wayland. The venture to New England, however, did give him an exposure to Transcendentalism, which inspired him, like Harris, once again to head west–first to the back country of Warren County Missouri, where he expended his energy in a close study of German thought, particularly Hegel, and then, in 1856, to St. Louis.

It was there that Harris and Brokmeyer met in 1858 at the St. Louis Mercantile Library, where Harris was offering a public lecture. Brokmeyer convinced Harris of the significance of Hegel’s system, and its relevance to the historical trends of American society. They immediately joined forces, attracting a number of other youthful followers with intellectual ambitions, many of whom were, like Harris, teachers in the public schools. The nascent Hegelian movement was temporarily stalled when Brokmeyer went off to serve as a Colonel in the Union Army during the Civil War, but it rebounded in full force upon his return with the formation of the St. Louis Philosophical Society in 1866, and the launching of the Journal of Speculative Philosophy, the official organ of the Society, in 1867.

Brokmeyer was the acknowledged intellectual leader of the movement. He published little, but his charismatic personality, quixotic meliorism, and extraordinary skills in argument and debate, consistently employed in the application of Hegelian dialectical logic, established his status as the framer of the ideals and aims of the movement. The manuscript of his translation of Hegel’s Logic, although never published, became the theoretical text of the group, copied and distributed not only in St. Louis, but to sympathetic thinkers in other parts of the United States. Harris was, more than any other, the movement’s public voice and organizing genius. He edited the Journal, contributing many of its articles himself. He also orchestrated a number of attempts to bring about a rapprochement between the western and New England idealists, first by inviting Alcott, Harris’s former mentor, and Ralph Waldo Emerson to St. Louis, later by his participation in the formation of the Concord School of Philosophy, a summer school headed by Alcott that merged the two groups within its faculty. (Harris taught for all nine of the sessions of the Concord School’s existence, from 1879 to 1887, and his disquisitions on Hegel became the most popular of the faculty’s offerings.) But although these efforts furthered the influence of the St. Louisians, they were not, because of philosophical differences, wholly successful.

Even though Harris and Brokmeyer were first inspired to philosophical pursuits by the Transcendentalists, the thought of the St. Louis group was distinguished from the latter by its greater concentration on philosophical understanding guided by Hegelian method, without the literary and theological concerns of the New England movement, and a greater stress on social responsibility and reform. The emerging views of the various members of the group varied somewhat in details, but they shared a common conviction in the relevance of a Hegelian social philosophy, inspired mainly by Hegel’s The Philosophy of Right and The Philosophy of History, to the problems and challenges facing the American society of their day, and the importance of education as a means of effecting necessary social change. Brokmeyer insisted on the necessity that thought issue in practical action directed to the social good, and the St. Louisians took this imperative to heart. The emphasis on education is evident in the pages of their journal, which were largely dedicated to the dissemination of European idealism, either through translations of Hegel and other German writers or summations of their work. They also shared a common enthusiasm for the prospects of their home city, divining by a clever but highly questionable use of the Hegelian dialectic what they believed to be historical forces that would propel St. Louis into an era of cultural supremacy in American society.

Gradually the group dissolved during the 1870s and 1880s as the core members of the group struck out on their own to pursue separate interests and aims. Characteristically, education and moral advancement were the themes of many of these individual pursuits. Denton Snider (1841-1925), a central figure within the movement who eventually became its historian, set upon a course of freelance teaching and lecturing as well as pursuing literary ambitions. In addition to offering lectures throughout the eastern and midwestern United States, including the Concord School, he founded or played a leading role in the operation of a number of visionary educational projects, such as the Communal University in Chicago and later St. Louis, the Chicago Kindergarten College, and the Goethe School in Milwaukee. Thomas Davidson (1840-1900), another key player in the original St. Louis movement, established the Breadwinner’s College in New York City, a school devoted to the education of the working class, and later established a summer school at his home in Glenmore, New York.

The theme is echoed in the careers of the St. Louis movement’s founders, Harris and Brokmeyer, during and after the dissipation of the movement itself. During his years as Superintendent of Schools in St. Louis, Harris was a strong proponent for the advancement of public education in Missouri. After his involvement at the Concord School he was appointed the United States Commissioner of Education in 1889. Brokmeyer entered the political arena in Missouri, and played a key role in the state’s Constitutional Convention of 1875, which established a legal guarantee of education for all between the ages of six and twenty. Brokmeyer eventually served a term as Lieutenant Governor of the state, and acting Governor during 1876 and 1877, but when his political prospects turned against him, he returned to the wilderness life in numerous sojourns to the west. For a time he lived with the Creek Indians in Oklahoma. In 1896 he settled back in St. Louis, returning to a quiet life of scholarship and reflection until his death in 1906.

Despite the fact that the members of the group produced an extraordinary output of published writing, both in their journal and independently, the movement’s ideas had little lasting influence on American philosophy, due in large part to the orthodoxy of their Hegelianism, which was soon overshadowed by the emerging naturalism of American thought during the first decades of the twentieth century. The one exception was George H. Howison (1834-1916), who came under the influence of the group while teaching mathematics at Washington University in St. Louis. Howison later settled in Berkeley, California, and developed a pluralistic form of idealism that survived as the twentieth century school of thought known as Personalism. The most significant contribution of the group to American thought was their journal, which offered a much needed vehicle for the publication of the early work of some of the most prominent figures of the next generation of American philosophy, such as John Dewey, William James, Charles Sanders Peirce, and Josiah Royce. In fact, Harris’s encouragement when a young John Dewey timidly submitted his first philosophical essay for publication was crucial in the budding philosopher’s decision to continue his studies. Although the ideas of the movement had little enduring influence, the St. Louis Hegelians represent an important chapter in the history of American philosophical thought and the developing relationship between intellectual and popular culture in the nineteenth century.

Suggestions for Further Reading

  • Elizabeth Flower and Murray G. Murphy, “The Absolute Immigrates to America: The St. Louis Hegelians” in A History of Philosophy in America, vol. 2 (New York: G. P. Putnam’s Sons, 1977), pp. 463-514.
  • William H. Goetzmann, ed., The American Hegelians: An Intellectual Episode in the History of Western America (New York: Alfred A. Knopf, 1973).
  • Frances A. Harmon, The Social Philosophy of the St. Louis Hegelians (New York: Columbia University Press, 1943).
  • Henry A. Pochmann, German Culture in America, Philosophical and Literary Influences, 1600-1900 (Madison, WS: University of Wisconsin Press, 1961).
  • Denton J. Snider, The St. Louis Movement in Philosophy, Literature, Education, Psychology, with Chapters of Autobiography (St. Louis: Sigma Publishing, 1920).

Author Information

Richard Field
Email: RFIELD(at)nwmissouri.edu
Northwest Missouri State University
U. S. A.

The IEP is actively seeking an author who will write a replacement article.

Hegel: Social and Political Thought

hegelGeorg Wilhelm Friedrich Hegel (1770-1831) is one of the greatest systematic thinkers in the history of Western philosophy. In addition to epitomizing German idealist philosophy, Hegel boldly claimed that his own system of philosophy represented an historical culmination of all previous philosophical thought. Hegel’s overall encyclopedic system is divided into the science of Logic, the philosophy of Nature, and the philosophy of Spirit. Of most enduring interest are his views on history, society, and the state, which fall within the realm of Objective Spirit. Some have considered Hegel to be a nationalistic apologist for the Prussian State of the early 19th century, but his significance has been much broader, and there is no doubt that Hegel himself considered his work to be an expression of the self-consciousness of the World Spirit of his time. At the core of Hegel’s social and political thought are the concepts of freedom, reason, self-consciousness, and recognition. There are important connections between the metaphysical or speculative articulation of these ideas and their application to social and political reality, and one could say that the full meaning of these ideas can be grasped only with a comprehension of their social and historical embodiment. The work that explicates this concretizing of ideas, and which has perhaps stimulated as much controversy as interest, is the Philosophy of Right (Philosophie des Rechts), which will be a main focus of this essay.

Table of Contents

  1. Biography
  2. Political Writings
  3. The Jena Writings (1802-06)
  4. The Phenomenology of Spirit
  5. Logic and Political Theory
  6. The Philosophy of Right
    1. Abstract Right
    2. Morality
    3. Ethical Life
      1. The Family
      2. Civil Society
      3. The State
        1. Constitutional Law
        2. International Law
        3. World History
  7. Closing Remarks
  8. References and Further Reading
    1. Works by Hegel in German and in English Translation
    2. Works on Hegel’s Social and Political Philosophy

1. Biography

G.W.F. Hegel was born in Stuttgart in 1770, the son of an official in the government of the Duke of Württemberg. He was educated at the Royal Highschool in Stuttgart from 1777-88 and steeped in both the classics and the literature of the European Enlightenment. In October, 1788 Hegel began studies at a theological seminary in Tübingen, the Tüberger Stift, where he became friends with the poet Hölderlin and philosopher Friedrich Schelling, both of whom would later become famous. In 1790 Hegel received an M.A. degree, one year after the fall of the Bastille in France, an event welcomed by these young idealistic students. Shortly after graduation, Hegel took a post as tutor to a wealthy Swiss family in Berne from 1793-96. In 1797, with the help of his friend Hölderlin, Hegel moved to Frankfurt to take on another tutorship. During this time he wrote unpublished essays on religion which display a certain radical tendency of thought in his critique of orthodox religion.

In January 1801, two years after the death of his father, Hegel finished with tutoring and went to Jena where he took a position as Privatdozent (unsalaried lecturer) at the University of Jena, where Hegel’s friend Schelling had already held a university professorship for three years. There Hegel collaborated with Schelling on a Critical Journal of Philosophy (Kritisches Journal der Philosophie) and he also published a piece on the differences between the philosophies of Fichte and Schelling (Differenz des Fichte’schen und Schelling’schen Systems der Philosophie) in which preference was consistently expressed for the latter thinker. After having attained a professorship in 1805, Hegel published his first major work, the Phenomenology of Spirit (Phänomenologie des Geistes, 1807) which was delivered to the publisher just at the time of the occupation of Jena by Napoleon’s armies. With the closing of the University, due to the victory of the French in Prussia, Hegel had to seek employment elsewhere and so he took a job as editor of a newspaper in Bamberg, Bavaria in 1807 (Die Bamberger Zeitung) followed by a move to Nuremberg in 1808 where Hegel became headmaster of a preparatory school (Gymnasium), roughly equivalent to a high school, and also taught philosophy to the students there until 1816. During this time Hegel married, had children, and published his Science of Logic (Wissenschaft der Logik) in three volumes.

One year following the defeat of Napoleon at Waterloo (1815), Hegel took the position of Professor of Philosophy at the University of Heidelberg where he published his first edition of the Encyclopedia of the Philosophical Sciences in Outline (Encyklopädie der philosophischen Wissenschaften im Grundrisse, 1817). In 1818 he became Professor of Philosophy at the University of Berlin, through the invitation of the Prussion minister von Altenstein (who had introduced many liberal reforms in Prussia until the fall of Napoleon), and Hegel taught there until he died in 1831. Hegel lectured on various topics in philosophy, most notably on history, art, religion, and the history of philosophy and he became quite famous and influential. He held public positions as a member of the Royal Examination Commission of the Province of Brandenberg and also as a councellor in the Ministry of Education. In 1821 he published the Philosophy of Right (Philosophie des Rechts) and in 1830 was given the honor of being elected Rector of the University. On November 14, 1831 Hegel died of cholera in Berlin, four months after having been decorated by Friedrich Wilhelm III of Prussia.

2. Political Writings

Apart from his philosophical works on history, society, and the state, Hegel wrote several political tracts most of which were not published in his lifetime but which are significant enough in connection to the theoretical writings to deserve some mention. (These are published in English translation in Hegel’s Political Writings and Political Writings, listed in the bibliography of works by Hegel below.)

Hegel’s very first political work was on “On the Recent Domestic Affairs of Wurtemberg” (Über die neuesten innern Verhältnisse Württembergs…, 1798) which was neither completed nor published. In it Hegel expresses the view that the constitutional structure of Wurtemberg requires fundamental reform. He condemns the absolutist rule of Duke Ferdinand along with the narrow traditionalism and legal positivism of his officials and welcomes the convening of the Estates Assembly, while disagreeing with the method of election in the Diet. In contrast to the existing system of oligarchic privilege, Hegel argues that the Diet needs to be based on popular election through local town councils, although this should not be done by granting suffrage to an uneducated multitude. The essay ends inconclusively on the appropriate method of political representation.

A quite long piece of about 100 pages, The German Constitution (Die Verfassung Deutchlands) was written and revised by Hegel between 1799 and 1802 and was not published until after his death in 1893. This piece provides an analysis and critique of the constitution of the German Empire with the main theme being that the Empire is a thing of the past and that appeals for a unified German state are anachronistic. Hegel finds a certain hypocrisy in German thinking about the Empire and a gap between theory and practice in the German constitution. Germany was no longer a state governed by law but rather a plurality of independent political entities with disparate practices. Hegel stresses the need to recognize that the realities of the modern state necessitate a strong public authority along with a populace that is free and unregimented. The principle of government in the modern world is constitutional monarchy, the potentialities of which can be seen in Austria and Prussia. Hegel ends the essay on an uncertain note with the idea that Germany as a whole could be saved only by some Machiavellian genius.

The essay “Proceedings of the Estates Assembly in the Kingdom of Württemberg, 1815-1816” was published in 1817 in the Heidelbergische Jahrbücher. In it Hegel commented on sections of the official report of the Diet of Württemberg, focusing on the opposition by the Estates to the King’s request for ratification of a new constitutional charter that recognized recent liberalizing changes and reforms. Hegel sided with King Frederick and criticized the Estates as being reactionary in their appeal to old customary laws and feudal property rights. There has been controversy over whether Hegel here was trying to gain favor with the King in order to attain a government position. However, Hegel’s favoring a sovereign kingdom of Wurtemberg over the German Empire and the need for a constitutional charter that is more rational than the previous are quite continuous with the previous essays. A genuine state needs a strong and effective central public authority, and in resisting the Estates are trying to live in the feudal past. Moreover, Hegel is not uncritical of the King’s constitutional provisions and finds deficiencies in the exclusion of members of professions from the Estates Assembly as well as in the proposal for direct suffrage in representation, which treats citizens like unintegrated atomic units rather than as members of a political community.

The last of Hegel’s political tracts, “The English Reform Bill,” was written in installments in 1831 for the ministerial newspaper, the Preussische Staatszeitung, but was interrupted due to censure by the Prussian King because of the perception of its being overly critical and anti-English. As a result, the remainder of the work was printed independently and distributed discretely. Hegel’s main line of criticism is that the proposed English reforms of suffrage will not make much of a difference in the distribution of political power and may only create a power struggle between the rising group of politicians and the traditional ruling class. Moreover, there are deep problems in English society that cannot be addressed by the proposed electoral reforms, including political corruption in the English burroughs, the selling of seats in parliament, and the general oligarchic nature of social reality including the wide disparities between wealth and poverty, Ecclesiastical patronage, and conditions in Ireland. While Hegel supports the idea of reform with its appeal to rational change as against the “positivity” of customary law, traditionalism and privilege, he thinks that universalizing suffrage with a property qualification without a thorough reform of the system of Common Law and the existing social conditions will only be perceived as token measures leading to greater disenchantment among the newly enfranchised and possibly inclinations to violent revolution. Hegel claims that national pride keeps the English from studying and following the reforms of the European Continent or seriously reflecting upon and grasping the nature of government and legislation.

There are several overall themes that reoccur in these political writings and that connect with some of the main lines of thought in Hegel’s theoretical works. First, there is the contrast between the attitude of legal positivism and the appeal to the law of reason. Hegel consistently displays a “political rationalism” which attacks old concepts and attitudes that no longer apply to the modern world. Old constitutions stemming from the Feudal era are a confused mixture of customary laws and special privileges that must give way to the constitutional reforms of the new social and political world that has arrived in the aftermath of the French Revolution. Second, reforms of old constitutions must be thorough and radical, but also cautious and gradual. This might sound somewhat inconsistent, but for Hegel a reform is radical due to a fundamental change in direction, not the speed of such change. Hegel suggests that customary institutions not be abolished too quickly for there must be some congruence and continuity with the existing social conditions. Hegel rejects violent popular action and sees the principal force for reform in governments and the estates assemblies, and he thinks reforms should always stress legal equality and the public welfare. Third, Hegel emphasizes the need for a strong central government, albeit without complete centralized control of public administration and social relations. Hegel here anticipates his later conception of civil society (bürgerliche Gesellschaft), the social realm of individual autonomy where there is significant local self-governance. The task of government is not to thoroughly bureaucratize civil society but rather to provide oversight, regulation, and when necessary intervention. Fourth, Hegel claims that representation of the people must be popular but not atomistic. The democratic element in a state is not its sole feature and it must be institutionalized in a rational manner. Hegel rejects universal suffrage as irrational because it provides no means of mediation between the individual and the state as a whole. Hegel believed that the masses lacked the experience and political education to be directly involved in national elections and policy matters and that direct suffrage leads to electoral indifference and apathy. Fifth, while acknowledging the importance of a division of powers in the public authority, Hegel does not appeal to a conception of separation and balance of powers. He views the estates assemblies, which safeguard freedom, as essentially related to the monarch and also stresses the role of civil servants and members of the professions, both in ministerial positions and in the assemblies. The monarchy, however, is the central supporting element in the constitutional structure because the monarch is invested with the sovereignty of the state. However, the power of the monarch is not despotical for he exercises authority through universal laws and statutes and is advised and assisted by a ministry and civil service, all members of which must meet educational requirements.

3. The Jena Writings (1802-06)

Hegel wrote several pieces while at the University of Jena that point in the direction of some of the main theses of the Philosophy of Right. The first was entitled “On the Scientific Modes of Treatment of Natural Law–Its Place in Practical Philosophy and Its Relationship to the Positive Science of Law” (Über die wissenschaftlichen Behandlungsarten des Naturrechts…), published originally in the Kritisches Journal der Philosophie in 1802, edited jointly by Hegel and Schelling. In this piece, usually referred to as the essay on Natural Law, Hegel criticizes both the empirical and formal approaches to natural law, as exemplified in British and Kantian philosophy respectively. Empiricism reaches conclusions that are limited by the particularities of its contexts and materials and thus cannot provide universally valid propositions regarding the concepts of various social and political institutions or of the relation of reflective consciousness to social and political experience. Formalist conclusions, on the other hand, are too insubstantial and abstract in failing to properly link human reason concretely to human experience. Traditional natural law theories are based on an abstract rationalism and the attempts of Rousseau, Kant, and Fichte to remedy this through their various ethical conceptions fail to overcome abstractness. For Hegel, the proper method of philosophical science must link concretely the development of the human mind and its rational powers to actual experience. Moreover, the concept of a social and political community must transcend the instrumentalizing of the state.

Hegel’s work entitled “The System of Ethical Life” (System der Sittlichkeit) was written in 1802-03 and first published in its entirety by Georg Lasson in 1913 in a volume entitled Schriften zur Politik und Rechtsphilosophie. In this work, Hegel develops a philosophical theory of social and political development that correlates with the self-development of essential human powers. Historically, humans begin in an immediate relation to nature and their social existence takes the form of natürliche Sittlichkeit, i.e., a non-selfconscious relation to nature and to others. However, the satisfaction of human desires leads to their reproduction and multiplication and leads to the necessity for labor, which induces transformation in the human world and people’s connections to it. This process leads to a self-realization that undermines the original naïve unity with nature and others and to the formation of overtly cooperative endeavors, e.g., in the making and use of tools. Another result of labor is the emergence of private property as an embodiment of human personality as well as of sets of legal relationships that institutionalize property ownership, exchange, etc., and deal with crimes against property. Furthermore, disparities in property and power lead to relationships of subordination and the use of the labor of others to satisfy one’s increasingly complex and expanded desires. Gradually, a system of mutual dependence, a “system of needs,” develops, and along with the increasing division of labor there also develops class differentiations reflecting the types of labor or activity taken up by members of each class, which Hegel classifies into the agricultural, acquisitive, and administerial classes. However, despite relations of interdependence and cooperation the members of society experience social connections as a sort of blind fate without some larger system of control which is provided by the state which regulates the economic life of society. The details of the structure of the state are unclear in this essay, but what is clear is that for Hegel the state provides an increased rationality to social practices, much in the sense that the later German sociologist Max Weber (1864-1920) would articulate how social practices become more rational by being codified and made more predictable.

The manuscripts entitled Realphilosophie are based on lectures Hegel delivered at Jena University in 1803-04 (Realphilosophie I) and 1805-06 (Realphilosophie II), and were originally published by Johannes Hoffmeister in 1932. These writings cover much of the same ground as the System der Sittlichkeit in explicating a philosophy of mind and human experience in relation to human social and political development. Some of the noteworthy ideas in these writings are the role and significance of language for social consciousness, for giving expression to a people (Volk) and for the comprehending of and mastery of the world, and the necessity and consequences of the fragmentation of primordial social relationships and patterns as part of the process of human development. Also, there is a reiteration of the importance of property relations as crucial to social recognition and how there would be no security of property or recognition of property rights if society were to remain a mere multitude of families. Such security requires a system of control over the “struggle for recognition” through interpersonal norms, rules, and juridical authority provided by the nation state. Moreover, Hegel repeats the need for strong state regulation of the economy, which if left to its own workings is blind to the needs of the social community. The economy, especially through the division of labor, produces fragmentation and diminishment of human life (compare Marx on alienation) and the state must not only address this phenomenon but also provide the means for the people’s political participation to further the development of social self-consciousness. In all of this Hegel appears to be providing a philosophical account of modern developments both in terms of the tensions and conflicts that are new to modernity as well as in the progressive movements of reform found under the influence of Napoleon.

Finally, Hegel also discusses the forms of government, the three main types being tyranny, democracy, and hereditary monarchy. Tyranny is found typically in primitive or undeveloped states, democracy exists in states where there is the realization of individual identity but no split between the public and private person, and hereditary monarchy is the appropriate form of political authority in the modern world in providing strong central government along with a system of indirect representation through Estates. The relation of religion to the state is undeveloped in these writings, but Hegel is clear about the supereminent role of the state that stands above all else in giving expression to the Spirit (Geist) of a society in a sort of earthly kingdom of God, the realization of God in the world. True religion complements and supports this realization and thus cannot properly have supremacy over or be opposed to the state.

4. The Phenomenology of Spirit

The Phenomenology of Spirit (Die Phänomenologie des Geistes), published in 1807, is Hegel’s first major comprehensive philosophical work. Originally intended to be the first part of his comprehensive system of science (Wissenschaft) or philosophy, Hegel eventually considered it to be the introduction to his system. This work provides what can be called a “biography of spirit,” i.e., an account of the development of consciousness and self-consciousness in the context of some central epistemological, anthropological and cultural themes of human history. It has continuity with the works discussed above in examining the development of the human mind in relation to human experience but is more wide-ranging in also addressing fundamental questions about the meaning of perceiving, knowing, and other cognitive activities as well as of the nature of reason and reality. Given the focus of this essay, the themes of the Phenomenology to be discussed here are those directly relevant to Hegel’s social and political thought.

One of the most widely discussed places in the Phenomenology is the chapter on “The Truth of Self-Certainty” which includes a subsection on “Independence and Dependence of Self-Consciousness: Lordship and Bondage.” This section treats of the (somewhat misleadingly named) “master/slave” struggle which is taken by some, especially the Marxian-inspired, as a paradigm of all forms of social conflict, in particular the struggle between social classes. It is clear that Hegel intended the scenario to typify certain features of the struggle for recognition (Anerkennung) overall, be it social, personal, etc. The conflict between master and slave (which shall be referred to hereafter as lord and bondsman as more in keeping with Hegel’s own terminology and the intended generic meaning) is one in which the historical themes of dominance and obedience, dependence and independence, etc., are philosophically introduced. Although this specific dialectic of struggle occurs only at the earliest stages of self-consciousness, it nonetheless sets up the main problematic for achieving realized self-consciousness–the gaining of self-recognition through the recognition of and by another, through mutual recognition.

According to Hegel, the relationship between self and otherness is the fundamental defining characteristic of human awareness and activity, being rooted as it is in the emotion of desire for objects as well as in the estrangement from those objects, which is part of the primordial human experience of the world. The otherness that consciousness experiences as a barrier to its goal is the external reality of the natural and social world, which prevents individual consciousness from becoming free and independent. However, that otherness cannot be abolished or destroyed, without destroying oneself, and so ideally there must be reconciliation between self and other such that consciousness can “universalize” itself through the other. In the relation of dominance and subservience between two consciousnesses, say lord and bondsman, the basic problem for consciousness is the overcoming of its otherness, or put positively, the achieving of integration with itself. The relation between lord and bondsman leads to a sort of provisional, incomplete resolution of the struggle for recognition between distinct consciousnesses.

Hegel asks us to consider how a struggle between two distinct consciousnesses, let us say a violent “life-or-death” struggle, would lead to one consciousness surrendering and submitting to the other out of fear of death. Initially, the consciousness that becomes lord or master proves its freedom through willingness to risk its life and not submit to the other out of fear of death, and thus not identify simply with its desire for life and physical being. Moreover, this consciousness is given acknowledgement of its freedom through the submission and dependence of the other, which turns out paradoxically to be a deficient recognition in that the dominant one fails to see a reflection of itself in the subservient one. Adequate recognition requires a mirroring of the self through the other, which means that to be successful it must be mutual. In the ensuing relationship of lordship and bondage, furthermore, the bondsman through work and discipline (motivated by fear of dying at the hands of the master or lord) transforms his subservience into a mastery over his environment, and thus achieves a measure of independence. In objectifying himself in his environment through his labor the bondsman in effect realizes himself, with his transformed environment serving as a reflection of his inherently self-realizing activity. Thus, the bondsman gains a measure of independence in his subjugation out of fear of death. In a way, the lord represents death as the absolute subjugator, since it is through fear of this master, of the death that he can impose, that the bondsman in his acquiescence and subservience is placed into a social context of work and discipline. Yet despite, or more properly, because of this subjection the bondsman is able to attain a measure of independence by internalizing and overcoming those limitations which must be dealt with if he is to produce efficiently. However, this accomplishment, the self-determination of the bondsman, is limited and incomplete because of the asymmetry that remains in his relation to the lord. Self-consciousness is still fragmented, i.e., the objectification through labor that the bondsman experiences does not coincide with the consciousness of the lord whose sense of self is not through labor but through power over the bondsman and enjoyment of the fruits of the bondsman’s labor. Only in a realm of ethical life can self-determination be fully self-conscious to the extent that universal freedom is reflected in the life of each individual member of society.

Thus, in the Phenomenology consciousness must move on through the phases of Stoicism, Skepticism, and the Unhappy Consciousness before engaging in the self-articulation of Reason, and it is not until the section “Objective Spirit: The Ethical Order” that the full universalization of self-consciousness is in principle to be met with. Here we find a shape of human existence where all men work freely, serving the needs of the whole community rather than of masters, and subject only to the “discipline of reason.” This mode of ethical life, typified in ancient Greek democracy, also eventually disintegrates, as is expressed in the conflict between human and divine law and the tragic fate that is the outcome of this conflict illustrated in the story of Antigone. However, the ethical life described here is still in its immediacy and is therefore at a level of abstractness that falls short of the mediation of subjectivity and universality which is provided spiritually in revealed Christianity and politically in the modern state, which purportedly provides a solution to human conflict arising from the struggle for recognition. In any case, the rest of the Phenomenology is devoted to examinations of culture (including enlightenment and revolution), morality, religion, and finally, Absolute Knowing.

The dialectic of self-determination is, for Hegel, inherent in the very structure of freedom, and is the defining feature of Spirit (Geist). The full actualization of Spirit in the human community requires the progressive development of individuality which effectively begins with the realization in self-consciousness of the “truth of self-certainty” and culminates in the shape of a shared common life in an integrated community of love and Reason, based upon the realization of truths of incarnation, death, resurrection, and forgiveness as grasped in speculative Religion. The articulation Hegel provides in the Phenomenology, however, is very generic and is to be made concrete politically with the working out of a specific conception of the modern nation-state with its particular configuration of social and political institutions. It is to the latter that we must turn in order to see how these fundamental dialectical considerations take shape in the “solution” to the struggle for recognition in self-consciousness. However, before moving directly to Hegel’s theory of the state, and history, some discussion of his Logic is in order.

5. Logic and Political Theory

The Logic constitutes the first part of Hegel’s philosophical system as presented in his Encyclopedia. It was preceded by his larger work, The Science of Logic (Wissenschaft der Logik), published in 1812-16 in two volumes. The “Encyclopedia Logic” is a shorter version intended to function as part of an “outline,” but it became longer in the course of the three published versions of 1817, 1827, and 1830. Also, the English translation by William Wallace contains additions from the notes of students who heard Hegel’s lectures on this subject. (Reference to the paragraphs of the Encyclopedia will be made with the “¶” character.)

The structure of the Logic is triadic, reflecting the organization of the larger system of philosophy as well as a variety of other motifs, both internal and external to the Logic proper. The Logic has three divisions: the Doctrine of Being, the Doctrine of Essence, and the Doctrine of the Notion (or Concept). There are a number of logical categories in this work that are directly relevant to social and political theorizing. In the Doctrine of Being, for example, Hegel explains the concept of “being-for-self” as the function of self-relatedness in the resolving of opposition between self and other in the “ideality of the finite” (¶ 95-96). He claims that the task of philosophy is to bring out the ideality of the finite, and as will be seen later Hegel’s philosophy of the state is intended to articulate the ideality of the state, i.e., its affirmative and infinite or rational features. In the Doctrine of Essence, Hegel explains the categories of actuality and freedom. He says that actuality is the unity of “essence and existence” (¶ 142) and argues that this does not rule out the actuality of ideas for they become actual by being realized in external existence. Hegel will have related points to make about the actuality of the idea of the state in society and history. Also, he defines freedom not in terms of contingency or lack of determination, as is popular, but rather as the “truth of necessity,” i.e., freedom presupposes necessity in the sense that reciprocal action and reaction provide a structure for free action, e.g., a necessary relation between crime and punishment.

The Doctrine of the Notion (Begriff) is perhaps the most relevant section of the Logic to social and political theory due to its focus on the various dynamics of development. This section is subdivided into three parts: the subjective notion, the objective notion, and the idea which articulates the unity of subjective and objective. The first part, the subjective notion, contains three “moments” or functional parts: universality, particularity, and individuality (¶ 163ff). These are particularly important as Hegel will show how the functional parts of the state operate according to a progressive “dialectical” movement from the first to the third moments and how the state as a whole, as a functioning and integrated totality, gives expression to the concept of individuality (in ¶198 Hegel refers to the state as “a system of three syllogisms”). Hegel treats these relationships as logical judgments and syllogisms but they do not merely articulate how the mind must operate (subjectivity) but also explain actual relationships in reality (objectivity). In objective reality we find these logical/dialectical relationships in mechanism, chemism, and teleology. Finally, in the Idea, the correspondence of the notion or concept with objective reality, we have the truth of objects or objects as they ought to be, i.e., as they correspond to their proper concepts. The logical articulation of the Idea is very important to Hegel’s explanation of the Idea of the state in modern history, for this provides the principles of rationality that guide the development of Spirit in the world and that become manifested in various ways in social and political life.

6. The Philosophy of Right

In 1821, Hegel’s Philosophy of Right orginally appeared under the double title Naturrecht und Staatswissenschaften in Grundrisse; Grundlinien der Philosophie des Rechts (Natural Law and the Science of the State; Elements of the Philosophy of Right). The work was republished by Eduard Gans in 1833 and 1854 as part of Hegel’s Werke, vol. viii and included additions from notes taken by students at Hegel’s lectures. The English language translation of this work by T. M. Knox refers to these later editions as well as to an edition published in 1923 by Georg Lasson, which included corrections from previous editions.

The Philosophy of Right constitutes, along with Hegel’s Philosophy of History, the penultimate section of his Encyclopedia, the section on Objective Spirit, which deals with the human world and its array of social rules and institutions, including the moral, legal, religious, economic, and political as well as marriage, the family, social classes, and other forms of human organization. The German word Recht is often translated as ‘law’, however, Hegel clearly intends the term to have a broader meaning that captures what we might call the good or just society, one that is “rightful” in its structure, composition, and practices.

In the Introduction to this work Hegel explains the concept of his philosophical undertaking along with the specific key concepts of will, freedom, and right. At the very beginning, Hegel states that the Idea of right, the concept together with its actualization, is the proper subject of the philosophical science of right (¶ 1). Hegel is emphatic that the study is scientific in that it deals in a systematic way with something essentially rational. He further remarks that the basis of scientific procedure in a philosophy of right is explicated in philosophical logic and presupposed by the former (¶ 2). Furthermore, Hegel is at pains to distinguish the historical or legal approach to “positive law” (Gesetz) and the philosophical approach to the Idea of right (Recht), the former involving mere description and compilation of laws as legal facts while the latter probes into the inner meaning and necessary determinations of law or right. For Hegel the justification of something, the finding of its inherent rationality, is not a matter of seeking its origins or longstanding features but rather of studying it conceptually.

However, there is one sense in which the origin of right is relevant to philosophical science and this is the free will. The free will is the basis and origin of right in the sense that mind or spirit (Geist) generally objectifies itself in a system of right (human social and political institutions) that gives expression to freedom, which Hegel says is both the substance and goal of right (¶ 4). This ethical life in the state consists in the unity of the universal and the subjective will. The universal will is contained in the Idea of freedom as its essence, but when considered apart from the subjective will can be thought of only abstractly or indeterminately. Considered apart from the subjective or particular will, the universal will is “the element of pure indeterminacy or that pure reflection of the ego into itself which involves the dissipation of every restriction and every content either immediately presented by nature, by needs, desires, and impulses, or given and determined by any means whatever” (¶ 5). In other words, the universal will is that moment in the Idea of freedom where willing is thought of as state of absolutely unrestrained volition, unfettered by any particular circumstances or limitations whatsoever–the pure form of willing. This is expressed in the modern libertarian view of completely uncoerced choice, the absence of restraint (or “negative liberty” as understood by Thomas Hobbes). The subjective will, on the other hand, is the principle of activity and realization that involves “differentiation, determination, and positing of a determinacy as a content and object” (¶ 6). This means that the will is not merely unrestrained in acting but that it actually can give expression to the doing or accomplishing of certain things, e.g., through talent or expertise (sometimes called “positive freedom”). The unity of both the moments of abstract universality (the will in-itself) and subjectivity or particularity (the will for-itself) is the concrete universal or true individuality (the will in-and-for-itself). According to Hegel, preservation of the distinction of these two moments in the unity (identity-in-difference) between universal and particular will is what produces rational self-determination of an ego, as well as the self-consciousness of the state as a whole. Hegel’s conception of freedom as self-determination is just this unity in difference of the universal and subjective will, be it in the willing by individual persons or in the expressions of will by groups of individuals or collectivities. The “negative self-relation” of this freedom involves the subordination of the natural instincts, impulses, and desires to conscious reflection and to goals and purposes that are consciously chosen and that require commitment to rational principles in order to properly guide action.

The overall structure of the Philosophy of Right is quite remarkable in its “syllogistic” organization. The main division of the work corresponds to what Hegel calls the stages in the development of “the Idea of the absolutely free will,” and these are Abstract Right, Morality, and Ethical Life. Each of these divisions is further subdivided triadically: under Abstract Right there is Property, Contract, and Wrong; under Morality falls Purpose and Responsibility, Intention and Welfare, and Good and Conscience; finally, under Ethical Life comes the Family, Civil Society, and the State. These last subdivisions are further subdivided into triads, with fourth level subdivisions occurring under Civil Society and the State. This triadic system of rubrics is no mere description of a static model of social and political life. Hegel claims that it gives expression to the conceptual development of Spirit in human society based upon the purely logical development of rationality provided in his Logic. Thus, it is speculatively based and not derivable from empirical survey, although the particularities of the system do indeed correspond to our experience and what we know about ourselves anthropologically, culturally, etc.

The transition in the Logic from universality to particularity to individuality (or concrete universality) is expressed in the social and political context in the conceptual transition from Abstract Right to Morality to Ethical Life. In the realm of Abstract Right, the will remains in its immediacy as an abstract universal that is expressed in personality and in the universal right to possession of external things in property. In the realm of Morality, the will is no longer merely “in-itself,” or restricted to the specific characteristics of legal personality, but becomes free “for-itself,” i.e., it is will reflected into itself so as to produce a self-consciousness of the will’s infinity. The will is expressed, initially, in inner conviction and subsequently in purpose, intention, and conviction. As opposed to the merely juridical person, the moral agent places primary value on subjective recognition of principles or ideals that stand higher than positive law. At this stage, universality of a higher moral law is viewed as something inherently different from subjectivity, from the will’s inward convictions and actions, and so in its isolation from a system of objectively recognized legal rules the willing subject remains “abstract, restricted, and formal” (¶ 108). Because the subject is intrinsically a social being who needs association with others in order to institutionalize the universal maxims of morality, maxims that cover all people, it is only in the realm of Ethical Life that the universal and the subjective will come into a unity through the objectification of the will in the institutions of the Family, Civil Society, and the State.

In what follows, we trace through Hegel’s systematic development of the “stages of the will,” highlighting only the most important points as necessary to get an overall view of this work.

a. Abstract Right

The subject of Abstract Right (Recht) is the person as the bearer or holder of individual rights. Hegel claims that this focus on the right of personality, while significant in distinguishing persons from mere things, is abstract and without content, a simple relation of the will to itself. The imperative of right is: “Be a person and respect others as persons” (¶ 36). In this formal conception of right, there is no question of particular interests, advantages, motives or intentions, but only the mere idea of the possibility of choosing based on the having of permission, as long as one does not infringe on the right of other persons. Because of the possibilities of infringement, the positive form of commands in this sphere are prohibitions.

(1) Property (the universality of will as embodied in things)

A person must translate his or her freedom into the external world “in order to exist as Idea” (¶ 41), thus abstract right manifests itself in the absolute right of appropriation over all things. Property is the category through which one becomes an object to oneself in that one actualizes the will through possession of something external. Property is the embodiment of personality and of freedom. Not only can a person put his or her will into something external through the taking possession of it and of using it, but one can also alienate property or yield it to the will of another, including the ability to labor for a restricted period of time. One’s personality is inalienable and one’s right to personality imprescriptible. This means one cannot alienate all of one’s labor time without becoming the property of another.

(2) Contract (the positing of explicit universality of will)

In this sphere, we have a relation of will to will, i.e., one holds property not merely by means of the subjective will externalized in a thing, but by means of another’s person’s will, and implicitly by virtue of one’s participation in a common will. The status of being an independent owner of something from which one excludes the will of another is thus mediated in the identification of one’s will with the other in the contractual relation, which presupposes that the contracting parties “recognize each other as persons and property owners” (¶ 71). (Note the significant development here beyond the dialectic of lord and bondsman.) Moreover, when contract involves the alienation or giving up of property, the external thing is now an explicit embodiment of the unity of wills. In contractual relations of exchange, what remains identical as the property of the individuals is its value, in respect to which the parties to the contract are on an equal footing, regardless of the qualitative external differences between the things exchanged. “Value is the universal in which the subjects of the contract participate” (¶ 77).

(3) Wrong (the particular will opposing itself to the universal)

In immediate relations of persons to one another it is possible for a particular will to be at variance with the universal through arbitrariness of decision and contingency of circumstance, and so the appearance (Erscheinung) of right takes on the character of a show (Schein), which is the inessential, arbitrary, posing as the essential. If the “show” is only implicit and not explicit also, i.e., if the wrong passes in the doer’s eyes as right, the wrong is non-malicious. In fraud a show is made to deceive the other party and so in the doer’s eyes the right asserted is only a show. Crime is wrong both in itself and from the doer’s point of view, such that wrong is willed without even the pretense or show of right. Here the form of acting does not imply a recognition of right but rather is an act of coercion through exercise of force. It is a “negatively infinite judgement” in that it asserts a denial of rights to the victim, which is not only incompatible with the fact of the matter but also self-negating in denying its own capacity for rights in principle.

The penalty that falls on the criminal is not merely just but is “a right established within the criminal himself, i.e., in his objectively embodied will, in his action,” because the crime as the action of a rational being implies appeal to a universal standard recognized by the criminal (¶ 100). The annulling of crime in this sphere of immediate right occurs first as revenge, which as retributive is just in its content, but in its form it is an act of a subjective will and does not correspond with its universal content and hence as a new transgression is defective and contradictory (¶ 102). All crimes are comparable in their universal property of being injuries, thus, in a sense it is not something personal but the concept itself which carries out retribution.

Crime, as the will which is implicitly null, contains its negation in itself, which is its punishment.

The nullity of crime is that it has set aside right as such, but since right is absolute it cannot be set aside. Thus, the act of crime is not something positive, not a first thing, but is something negative, and punishment is the negation of crime’s negation.

b. Morality

The demand for justice as punishment rather than as revenge, with regard to wrong, implies the demand for a will which, though particular and subjective, also wills the universal as such. In wrong the will has become aware of itself as particular and has opposed itself to and contradicted the universal embodied in rights. At this stage the universally right is abstract and one-sided and thus requires a move to a higher level of self-consciousness where the universally right is mediated by the particular convictions of the willing subject. We go beyond the criminal’s defiance of the universal by substituting for the abstract conception of personality the more concrete conception of subjectivity. The criminal is now viewed as breaking his own law, and his crime is a self-contradiction and not only a contradiction of a right outside him. This recognition brings us to the level of morality (Moralität) where the will is free both in itself and for itself, i.e., the will is self-conscious of its subjective freedom.

At the level of morality the right of the subjective will is embodied in immediate wills (as opposed to immediate things like property). The defect of this level, however, is that the subject is only for itself, i.e., one is conscious of one’s subjectivity and independence but is conscious of universality only as something different from this subjectivity. Therefore, the identity of the particular will and the universal will is only implicit and the moral point of view is that of a relation of “ought-to-be,” or the demand for what is right. While the moral will externalizes itself in action, its self-determination is a pure “restlessness” of activity that never arrives at actualization.

The right of the moral will has three aspects. First, there is the right of the will to act in its external environment, to recognize as its actions only those that it has consciously willed in light of an aim or purpose (purpose and responsibility). Second, in my intention I ought to be aware not simply of my particular action but also of the universal which is conjoined with it. The universal is what I have willed and is my intention. The right of intention is that the universal quality of the action is not merely implied but is known by the agent, and so it lies from the start in one’s subjective will. Moreover, the content of such a will is not only the right of the particular subject to be satisfied but is elevated to a universal end, the end of welfare or happiness (intention and welfare). The welfare of many unspecified persons is thus also an essential end and right of subjectivity. However, right as an abstract universal and welfare as abstract particularity, may collide, since both are contingent on circumstances for their satisfaction, e.g., in cases where claims of right or welfare by someone may endanger the life of another there can be a counter-claim to a right of distress. “This distress reveals the finitude and therefore the contingency of both right and welfare” (¶ 128). This “contradiction” between right and welfare is overcome in the third aspect of the moral will, the good which is “the Idea as the unity of the concept of the will with the particular will” (¶ 129).

In addition to the right of the subjective will that whatever it recognizes as valid shall be seen by it as good, and that an action shall be imputed to it as good or evil in accordance with its knowledge of the worth which the action has in its external objectivity (¶ 132), which together constitute a “right of insight,” the will also must recognize the good as its duty, which is, to begin with, duty for duty’s sake, or duty formally and without content (e.g., as expressed in the Kantian “categorical imperative”). Because of this lack of content, the subjective will in its abstract reflection into itself is “absolute inward certainty (Gewißheit) of self,” or conscience (Gewissen). While true or authentic conscience is the disposition to will what is absolutely good, and thus correspond with what is objectively right, purely formal conscience lacks an objective system of principles and duties. Although conscience is ideally supposed to mean the identity of subjective knowing and willing with the truly good, when it remains the subjective inner reflection of self-consciousness into itself its claim to this identity is deficient and one-sided. Moreover, when the determinate character of right and duty reduces to subjectivity, the mere inwardness of the will, there is the potentiality of elevating the self-will of particular individuals above the universal itself, i.e., of “slipping into evil” (¶ 139). What makes a person evil is the choosing of natural desires in opposition to the good, i.e., to the concept of the will. When an individual attempts to pass off his or her action as good, and thus imposing it on others, while being aware of the discrepancy between its negative character and the objective universal good, the person falls into hypocrisy. This is one of several forms of perverse moral subjectivity that Hegel discusses at length in his remarks (¶ 140).

c. Ethical Life

Hegel’s analysis of the moral implications of “good and conscience” leads to the conclusion that a concrete unity of the objective good with the subjectivity of the will cannot be achieved at the level of personal morality since all attempts at this are problematic. The concrete identity of the good with the subjective will occurs only in moving to the level of ethical life (Sittlichkeit), which Hegel says is “the Idea of freedom…the concept of freedom developed into the existing world and the nature of self-consciousness” (¶ 142). Thus, ethical life is permeated with both objectivity and subjectivity: regarded objectively it is the state and its institutions, whose force (unlike abstract right) depends entirely on the self-consciousness of citizens, on their subjective freedom; regarded subjectively it is the ethical will of the individual which (unlike the moral will) is aware of objective duties that express one’s inner sense of universality. The rationality of the ethical order of society is thus constituted in the synthesis of the concept of the will, both as universal and as particular, with its embodiment in institutional life.

The synthesis of ethical life means that individuals not only act in conformity with the ethical good but that they recognize the authority of ethical laws. This authority is not something alien to individuals since they are linked to the ethical order through a strong identification which Hegel says “is more like an identity than even the relation of faith or trust” (¶ 147). The knowledge of how the laws and institutions of society are binding on the will of individuals entails a “doctrine of duties.” In duty the individual finds liberation both from dependence on mere natural impulse, which may or may not motivate ethical actions, and from indeterminate subjectivity which cannot produce a clear view of proper action. “In duty the individual acquires his substantive freedom” (¶ 149). In the performance of duty the individual exhibits virtue when the ethical order is reflected in his or her character, and when this is done by simple conformity with one’s duties it is rectitude. When individuals are simply identified with the actual ethical order such that their ethical practices are habitual and second nature, ethical life appears in their general mode of conduct as custom (Sitten). Thus, the ethical order manifests its right and validity vis-à-vis individuals. In duty “the self-will of the individual vanishes together with his private conscience which had claimed independence and opposed itself to the ethical substance. For when his character is ethical, he recognizes as the end which moves him to act the universal which is itself unmoved but is disclosed in its specific determinations as rationality actualized. He knows that his own dignity and the whole stability of his particular ends are grounded in this same universal, and it is therein that he actually attains these” (¶ 152). However, this does not deny the right of subjectivity, i.e., the right of individuals to be satisfied in their particular pursuits and free activity; but this right is realized only in belonging to an objective ethical order. The “bond of duty” will be seen as a restriction on the particular individual only if the self-will of subjective freedom is considered in the abstract, apart from an ethical order (as is the case for both Abstract Right and Morality). “Hence, in this identity of the universal will with the particular will, right and duty coalesce, and by being in the ethical order a man has rights in so far as he has duties, and duties in so far as he has rights” (¶ 155).

In the realm of ethical life the logical syllogism of self-determination of the Idea is most clearly applied. The moments of universality, particularity, and individuality initially are represented respectively in the institutions of the family, civil society, and the state. The family is “ethical mind in its natural or immediate phase” and is characterized by love or the feeling of unity in which one is not conscious of oneself as an independent person but only as a member of the family unit to which one is bound. Civil society, on the other hand, comprises an association of individuals considered as self-subsistent and who have no conscious sense of unity of membership but only pursue self-interest, e.g., in satisfying needs, acquiring and protecting property, and in joining organizations for mutual advantage. Finally, the constitution of the political state brings together in a unity the sense of the importance of the whole or universal good along with the freedom of particularity of individual pursuits and thus is “the end and actuality of both the substantial order and the public life devoted thereto” (¶ 157).

i. The Family

The family is characterized by love which is “mind’s feeling of its own unity,” where one’s sense of individuality is within this unity, not as an independent individual but as a member essentially related to the other family members. Thus, familial love implies a contradiction between, on the one hand, not wanting to be a self-subsistent and independent person if that means feeling incomplete and, on the other hand, wanting to be recognized in another person. Familial love is truly an ethical unity, but because it is nonetheless a subjective feeling it is limited in sustaining unity (pars. 158-59, and additions).

(A) Marriage

The union of man and woman in marriage is both natural and spiritual, i.e., is a physical relationship and one that is also self-conscious, and it is entered into on the basis of the free consent of the persons. Since this consent involves bringing two persons into a union, there is the mutual surrender of their natural individuality for the sake of union, which is both a self-restriction and also a liberation because in this way individuals attain a higher self-consciousness.

(B) Family Capital

The family as a unit has its external existence in property, specifically capital (Vermögen) which constitutes permanent and secured possessions that allow for endurance of the family as “person” (¶ 170). This capital is the common property of all the family members, none of whom possess property of their own, but it is administered by the head of the family, the husband.

(C) Education of Children & Dissolution of the Family

Children provide the external and objective basis for the unity of marriage. The love of the parents for their children is the explicit expression of their love for each other, while their immediate feelings of love for each other are only subjective. Children have the right to maintenance and education, and in this regard a claim upon the family capital, but parents have the right to provide this service to the children and to instill discipline over the wishes of their children. The education of children has a twofold purpose: the positive aim of instilling ethical principles in them in the form of immediate feeling and the negative one of raising them out of the instinctive physical level. Marriage can be dissolved not by whim but by duly constituted authority when there is total estrangement of husband and wife. The ethical dissolution of the family results when the children have been educated to be free and responsible persons and they are of mature age under the law. The natural dissolution of the family occurs with the death of the parents, the result of which is the passing of inheritance of property to the surviving family members. The disintegration of the family exhibits its immediacy and contingency as an expression of the ethical Idea (pars. 173-80).

ii. Civil Society

With civil society (bürgerliche Gesellschaft) we move from the family or “the ethical idea still in its concept,” where consciousness of the whole or totality is focal, to the “determination of particularity,” where the satisfaction of subjective needs and desires is given free reign (pars. 181-182). However, despite the pursuit of private or selfish ends in relatively unrestricted social and economic activity, universality is implicit in the differentiation of particular needs insofar as the welfare of an individual in society is intrinsically bound up with that of others, since each requires another in some way to effectively engage in reciprocal activities like commerce, trade, etc. Because this system of interdependence is not self-conscious but exists only in abstraction from the individual pursuit of need satisfaction, here particularity and universality are only externally related. Hegel says that “this system may be prima facie regarded as the external state, the state based on need, the state as the Understanding (Verstand) envisages it” (¶ 183). However, civil society is also a realm of mediation of particular wills through social interaction and a means whereby individuals are educated (Bildung) through their efforts and struggles toward a higher universal consciousness.

(A) The System of Needs

This dimension of civil society involves the pursuit of need satisfaction. Humans are different from animals in their ability to multiply needs and differentiate them in various ways, which leads to their refinement and luxury. Political economy discovers the necessary interconnections in the social and universalistic side of need. Work is the mode of acquisition and transformation of the means for satisfying needs as well as a mode of practical education in abilities and understanding. Work also reveals the way in which people are dependent upon one another in their self-seeking and how each individual contributes to the need satisfaction of all others. Society generates a “universal permanent capital” (¶ 199) that everyone in principle can draw upon, but the natural inequalities between individuals will translate into social inequalities. Furthermore, labor undergoes a division according to the complexities of the system of production, which is reflected in social class divisions: the agricultural (substantial or immediate); the business (reflecting or formal); and the civil servants (universal). Membership in a class is important for gaining status and recognition in a civil society. Hegel says that “A man actualizes himself only in becoming something definite, i.e., something specifically particularized; this means restricting himself exclusively to one of the particular spheres of need. In this class-system, the ethical frame of mind therefore is rectitude and esprit de corps, i.e., the disposition to make oneself a member of one of the moments of civil society by one’s own act … in this way gaining recognition both in one’s own eyes and in the eyes of others” (¶ 207).

The “substantial” agricultural class is based upon family relationships whose capital is in the products of nature, such as the land, and tends to be patriarchial, unreflective, and oriented toward dependence rather than free activity. In contrast to this focus on “immediacy,” the business class is oriented toward work and reflection, e.g., in transforming raw materials for use and exchange, which is a form of mediation of humans to one another. The main activities of the business class are craftsmanship, manufacture, and trade. The third class is the class of civil servants, which Hegel calls the “universal class” because it has the universal interests of society as its concern. Members of this class are relieved from having to labor to support themselves and maintain their livelihood either from private resources such as inheritance or are paid a salary by the state as members of the bureaucracy. These individuals tend to be highly educated and must qualify for appointment to government positions on the basis of merit.

(B) Administration of Justice

The principle of rightness becomes civil law (Gesetz) when it is posited, and in order to have binding force it must be given determinate objective existence. To be determinately existent, laws must be made universally known through a public legal code. Through a rational legal system, private property and personality are given legal recognition and validity in civil society, and wrongdoing now becomes an infringement, not merely of the subjective right of individuals but also of the larger universal will that exists in ethical life. The court of justice is the means whereby right is vindicated as something universal by addressing particular cases of violation or conflict without mere subjective feeling or private bias. “Instead of the injured party, the injured universal now comes on the scene, and … this pursuit consequently ceases to be the subjective and contingent retribution of revenge and is transformed into the genuine reconciliation of right with itself, i.e, into punishment” (¶ 220). Moreover, court proceedings and legal processes must take place according to rights and rules of evidence; judicial proceedings as well as the laws themselves must be made public; trial should be by jury; and punishment should fit the crime. Finally, in the administration of justice, “civil society returns to its concept, to the unity of the implicit universal with the subjective particular, although here the latter is only that present in single cases and the universality in question is that of abstract right” (¶ 229).

(C) The Police and the Corporation

The Police (Polizei) for Hegel is understood broadly as the public authorities in civil society. In addition to crime fighting organizations, it includes agencies that provide oversight over public utilities as well as regulation of and, when necessary, intervention into activities related to the production, distribution, and sale of goods and services, or with any of the contingencies that can affect the rights and welfare of individuals and society generally (e.g., defense of the public’s right not to be defrauded, and also the management of goods inspection). Also, the public authority superintends education and organizes the relief of poverty. Poverty must be addressed both through private charity and public assistance since in civil society it constitutes a social wrong when poverty results in the creation of a class of “penurious rabble” (¶ 245). Society looks to colonization to increase its wealth but poverty remains a problem with no apparent solution.

The corporation (Korporation) applies especially to the business class, since this class is concentrated on the particularities of social existence and the corporation has the function of bringing implicit similarities between various private interests into explicit existence in forms of association. This is not the same as our contemporary business corporation but rather is a voluntary association of persons based on occupational or various social interests (such as professional and trade guilds, educational clubs, religious societies, townships, etc.) Because of the integrating function of the corporation, especially in regard to the social and economic division of labor, what appear as selfish purposes in civil society are shown to be at the same time universal through the formation of concretely recognized commonalities. Hegel says that “a Corporation has the right, under the surveillance of the public authority, (a) to look after its own interests within its own sphere, (b) to co-opt members, qualified objectively by requisite skill and rectitude, to a number fixed by the general structure of society, (c) to protect its members against particular contingencies, (d) to provide the education requisite to fit other to become members. In short, the right is to come on the scene like a second family for its members …” (¶ 252). Furthermore, the family is assured greater stability of livelihood insofar as its providers are corporation members who command the respect due to them in their social positions. “Unless he is a member of an authorized Corporation (and it is only by being authorized that an association becomes a Corporation), an individual is without rank or dignity, his isolation reduces his business to mere self-seeking, and his livelihood and satisfaction become insecure” (¶ 253). Because individual self-seeking is raised to a higher level of common pursuits, albeit restricted to the interest of a sectional group, individual self-consciousness is raised to relative universality. Hence, “As the family was the first, so the Corporation is the second ethical root of the state, the one planted in civil society” (¶ 255).

iii. The State

The political State, as the third moment of Ethical Life, provides a synthesis between the principles governing the Family and those governing Civil Society. The rationality of the state is located in the realization of the universal substantial will in the self-consciousness of particular individuals elevated to consciousness of universality. Freedom becomes explicit and objective in this sphere. “Since the state is mind objectified, it is only as one of its members that the individual has objectivity, genuine individuality, and an ethical life … and the individual’s destiny is the living of a universal life” (¶ 258). Rationality is concrete in the state in so far as its content is comprised in the unity of objective freedom (freedom of the universal or substantial will) and subjective freedom (freedom of everyone in knowing and willing of particular ends); and in its form rationality is in self-determining action or laws and principles which are logical universal thoughts (as in the logical syllogism).

The Idea of the State is itself divided into three moments: (a) the immediate actuality of the state as a self-dependent organism, or Constitutional Law; (b) the relation of states to other states in International Law; (c) the universal Idea as Mind or Spirit which gives itself actuality in the process of World-History.

1) Constitutional Law

(1) The Constitution (internally)

Only through the political constitution of the State can universality and particularity be welded together into a real unity. The self-consciousness of this unity is expressed in the recognition on the part of each citizen that the full meaning of one’s actual freedom is found in the objective laws and institutions provided by the State. The aspect of identity comes to the fore in the recognition that individual citizens give to the ethical laws such that they “do not live as private persons for their own ends alone, but in the very act of willing these they will the universal in the light of the universal, and their activity is consciously aimed at none but the universal end” (¶ 260). The aspect of differentiation, on the other hand, is found in “the right of individuals to their particular satisfaction,” the right of subjective freedom which is maintained in Civil Society. Thus, according to Hegel, “the universal must be furthered, but subjectivity on the other hand must attain its full and living development. It is only when both these moments subsist in their strength that the state can be regarded as articulated and genuinely organized” (¶ 260, addition).

As was indicated in the introduction to the concept of Ethical Life above, the higher authority of the laws and institutions of society requires a doctrine of duties. From the vantage point of the political State, this means that there must be a correlation between rights and duties. “In the state, as something ethical, as the inter-penetration of the substantive and the particular, my obligation to what is substantive is at the same time the embodiment of my particular freedom. This means that in the state duty and right are united in one and the same relation” (¶ 261). In fulfilling one’s duties one is also satisfying particular interests, and the conviction that this is so Hegel calls “political sentiment” (politische Gesinnung) or patriotism. “This sentiment is, in general, trust (which may pass over into a greater or lesser degree of educated insight), or the consciousness that my interest, both substantive and particular, is contained and preserved in another’s (that is, the state’s) interest and end, i.e., in the other’s relation to me as an individual” (¶ 268).

Thus, the “bond of duty” cannot involve being coerced into obeying the laws of the State. “Commonplace thinking often has the impression that force holds the state together, but in fact its only bond is the sense of order which everybody possesses” (¶ 268, addition).

According to Hegel, the political state is rational in so far as it inwardly differentiates itself according to the nature of the Concept (Begriff). The principle of the division of powers expresses inner differentiation, but while these powers are distinguished they must also be built into an organic whole such that each contains in itself the other moments so that the political constitution is a concrete unity in difference. Constitutional Law is accordingly divided into three moments: (a) the Legislature which establishes the universal through lawmaking; (b) the Executive which subsumes the particular under the universal through administering the laws; (c) the Crown which is the power of subjectivity of the state in the providing of the act of “ultimate decision” and thus forming into unity the other two powers. Despite the syllogistic sequence of universality, particularity, and individuality in these three constitutional powers, Hegel discusses the Crown first followed by the Executive and the Legislature respectively. Hegel understands the concept of the Crown in terms of constitutional monarchy.

(a) The Crown

“The power of the crown contains in itself the three moments of the whole, namely, (a) the universality of the constitution and the laws; (b) counsel, which refers the particular to the universal; and (g) the moment of ultimate decision, as the self-determination to which everything else reverts and from which everything else derives the beginning of its actuality” (¶ 275). The third moment is what gives expression to the sovereignty of the state, i.e., that the various activities, agencies, functions and powers of the state are not self-subsistent but rather have their basis ultimately in the unity of the state as a single self or self-organized organic whole. The monarch is the bearer of the individuality of the state and its sovereignty is the ideality in unity in which the particular functions and powers of the state subsist. “It is only as a person, the monarch, that the personality of the state is actual. Personality expresses the concept as such; but the person enshrines the actuality of the concept, and only when the concept is determined as a person is it the Idea or truth” (¶ 279).

The monarch is not a despot but rather a constitutional monarch, and he does not act in a capricious manner but is bound by a decision-making process, in particular to the recommendations and decisions of his cabinet (supreme advisory council). The monarch functions solely to give agency to the state, and so his personal traits are irrelevant and his ascending to the throne is based on hereditary succession, and thus on the accident of birth. “In a completely organized state, it is only a question of the culminating point of formal decision … he has only to say ‘yes’ and dot the ‘i’ …. In a well organized monarchy, the objective aspect belongs to law alone, and the monarch’s part is merely to set to the law the subjective ‘I will'” (¶ 280, addition). The “majesty of the monarch” lies in the free asserting of ‘I will’ as an expression of the unity of the state and the final step in establishing law.

(b) The Executive

The executive has the task of executing and applying the decisions formally made by the monarch. “This task of merely subsuming the particular under the universal is comprised in the executive power, which also includes the powers of the judiciary and the police” (¶ 287). Also, the executive is the higher authority that oversees the filling of positions of responsibilities in corporations. The executive is comprised of the civil servants proper and the higher advisory officials organized into committees, both of which are connected to the monarch through their supreme departmental heads. Overall, government has its division of labor into various centers of administration managed by special officials. Individuals are appointed to executive functions on the basis of their knowledgibility and proof of ability and tenure is conditional on the fulfillment of duties, with the offices in the civil service being open to all citizens.

The executive is not an unchecked bureaucratic authority. “The security of the state and its subjects against the misuse of power by ministers and their officials lies directly in their hierarchical organization and their answerability; but it lies too in the authority given to societies and corporations …” (¶ 295). However, civil servants will tend to be dispassionate, upright, and polite in part as “a result of direct education in thought and ethical conduct” (¶ 296). Civil servants and the members of the executive make up the largest section of the middle class, the class with a highly developed intelligence and consciousness of right. Moreover, “The sovereign working on the middle class at the top, and Corporation-rights working on it at the bottom, are the institutions which effectively prevent it from acquiring the isolated position of an aristocracy and using its education and skill as a means to an arbitrary tyranny” (¶ 297).

(c) The Legislature

For Hegel, “The legislature is concerned (a) with the laws as such in so far as they require fresh and extended determination; and (b) with the content of home affairs affecting the entire state” (¶ 298). Legislative activity focuses on both providing well-being and happiness for citizens as well as exacting services from them (largely in the form of monetary taxes). The proper function of legislation is distinguished from the function of administration and state regulation in that the content of the former are determinate laws that are wholly universal whereas in administration it is application of the law to particulars, along with enforcing the law. Hegel also says that the other two moments of the political constitution, the monarchy and the executive, are the first two moments of the legislature, i.e., are reflected in the legislature respectively through the ultimate decision regarding proposed laws and an advising function in their formation. Hegel rejects the idea of independence or separation of powers for the sake of checks and balances, which he holds destroys the unity of the state (¶ 300, addition). The third moment in the legislature is the estates (Stände), which are the classes of society given political recognition in the legislature.

In the legislature, the estates “have the function of bringing public affairs into existence not only implicitly, but also actually, i.e., of bringing into existence the moment of subjective formal freedom, the public consciousness as an empirical universal, of which the thoughts and opinions of the Many are particulars” (¶ 301). Not only do the estates guarantee the general welfare and public freedom, but they are also the means by which the state as a whole enters the subjective consciousness of the people through their participation in the state. Thus, the estates incorporate the private judgment and will of individuals in civil society and give it political significance.

The estates have an important integrating function in the state overall. “Regarded as a mediating organ, the Estates stand between the government in general on the one hand, and the nation broken up into particulars (people and associations) on the other. … [I]n common with the organized executive, they are a middle term preventing both the extreme isolation of the power of the crown … and also the isolation of the particular interests of persons, societies and Corporations” (¶ 302). Also, the organizing function of the estates prevents groups in society from becoming formless masses that could form anti-government feelings and rise up in blocs in opposition to the state.

The three classes of civil society, the agricultural, the business, and the universal class of civil servants, are each given political voice in the Estates Assembly in accordance with their distinctiveness in the lower spheres of civil life. The legislature is divided into two houses, an upper and lower. The upper house comprises the agricultural estate (including the peasant farmers and landed aristocracy), a class “whose ethical life is natural, whose basis is family life, and, so far as its livelihood is concerned, the possession of land. Its particular members attain their position by birth, just as the monarch does, and, in common with him, they possess a will which rests on itself alone” (¶ 305). Landed gentry inherit their estates and so owe their position to birth (primogeniture) and thus are free from the exigencies and uncertainties of the life of business and state interference. The relative independence of this class makes it particularly suited for public office as well as a mediating element between the crown and civil society.

The second section of the estates, the business class, comprises the “fluctuating and changeable element in civil society” which can enter politics only through its deputies or representatives (unlike the agricultural estate from which members can present themselves to the Estates Assembly in person). The appointment of deputies is “made by society as a society” both because of the multiplicity of members but also because representation must reflect the organization of civil society into associations, communities, and corporations. It is only as a member of such groups that an individual is a member of the state, and hence rational representation implies that consent to legislation is to be given not directly by all but only by “plenipotentiaries” who are chosen on the basis of their understanding of public affairs as well as managerial and political acumen, character, insight, etc. Moreover, their charge is to further the general interest of society and not the interest of a particular association or corporation instead (¶ 308-10).

The deputies of civil society are selected by the various corporations, not on the basis of universal direct suffrage which Hegel believed inevitably leads to electoral indifference, and they adopt the point of view of society. “Deputies are sometimes regarded as ‘representatives’; but they are representatives in an organic, rational sense only if they are representatives not of individuals or a conglomeration of them, but of one of the essential spheres of society and its large-scale interests. Hence, representation cannot now be taken to mean simply the substitution of one man for another; the point is that the interest itself is actually present in its representative, while he himself is there to represent the objective element of his own being” (¶ 311).

The debates that take place in the Estates Assembly are to be open to the public, whereby citizens can become politically educated both about national affairs and the true character of their own interests. “The formal subjective freedom of individuals consists in their having and expressing their own private judgements, opinions, and recommendations as affairs of state. This freedom is collectively manifested as what is called ‘public opinion’, in which what is absolutely universal, the substantive and the true, is linked with its opposite, the purely particular and private opinions of the Many” (¶ 316). Public opinion is a “standing self-contradiction” because, on the one hand, it gives expression to genuine needs and proper tendencies of common life along with common sense views about important matters and, on the other, is infected with accidental opinion, ignorance, and faulty judgment. “Public opinion therefore deserves to be as much respected as despised — despised for its concrete expression and for the concrete consciousness it expresses, respected for its essential basis, a basis which only glimmers more or less dimly in that concrete expression” (¶ 318). Moreover, while there is freedom of public communication, freedom of the press is not totally unrestricted as freedom does not mean absence of all restriction, either in word or deed.

Hegel calls the class of civil servants the “universal class” not only because as members of the executive their function is to “subsume the particular under the universal” in the administration of law, but also because they reflect a disposition of mind (due perhaps largely from their education) that transcends concerns with selfish ends in the devotion to the discharge of public functions and to the public universal good. As one of the classes of the estates, civil servants also participate in the legislature as an “unofficial class,” which seems to mean that as members of the executive they will attend legislative assemblies in an advisory capacity, but this is not entirely clear from Hegel’s text. Also, given that the monarch and the classes of civil society when conceived in abstraction are opposed to each other as “the one and the many,” they must become “fused into a unity” or mediated together through the civil servant class. From the point of view of the crown the executive is such a middle term, because it carries out the final decisions of the crown and makes it “particularized” in civil society. On the other hand, in order for the classes of civil society to actually sense this unity with the crown a mediation must occur from the other direction, so to speak, where the upper house of the estates, in virtue of certain likenesses to the Crown (e.g., role of birth for one’s position) is able to mediate between the Crown and civil society as a whole.

(2) Sovereignty vis-à-vis foreign States

The interpenetration of the universal with the particular will through a complex system of social and political mediations is what produces the self-consciousness of the nation-state considered as an organic (internally differentiated and interrelated) totality or concrete individual. In this system, particular individuals consciously pursue the universal ends of the State, not out of external or mechanical conformity to law, but in the free development of personal individuality and the expression of the unique subjectivity of each. However, individuality is not something possessed by particular persons alone, or even primarily by such persons. The state as a whole, i.e., the nation-state as distinct from the political state as one of its moments, constitutes a higher form of individuality. In principle, Mind or Spirit possesses a singleness in its “negative self-relation,” i.e., in the sense that unity in a being is a function of setting itself off from other beings. “Individuality is awareness of one’s existence as a unit in sharp distinction from others. It manifests itself here in the state as a relation to other states, each of which is autonomous vis-à-vis the others. This autonomy embodies mind’s actual awareness of itself as a unit and hence it is the most fundamental freedom which a people possesses as well as its highest dignity” (¶ 322). For any being to have self-conscious independence requires distinguishing the self from any of its contingent characteristics (inner self-negation), which externally is a distinction from another being. This consciousness of what one is not is for the nation-state its negative relation to itself embodied externally in the world as the relation of one state to another. However, this is not a mere externality, “But in fact this negative relation is that moment in the state which is most supremely its own, the state’s actual infinity as the ideality of everything finite within it” (¶ 323).

According to Hegel, war is an “ethical moment” in the life of a nation-state and hence is neither purely accidental nor an inherent evil. Because there is no higher earthly power ruling over nation-states, and because these entities are oriented to preserving their existence and sovereignty, conflicts leading to war are inevitable. Also, defense of one’s nation is an ethical duty and the ultimate test of one’s patriotism is war. “Sacrifice on behalf of the individuality of the state is the substantial tie between the state and all its members and so is a universal duty” (¶ 325). In making a sacrifice for the sake of the state individuals prove their courage, which involves a transcendence of concern with egoistic interests and mere material existence. “The intrinsic worth of courage as a disposition of mind is to be found in the genuine absolute, final end, the sovereignty of the state. The work of courage is to actualize this final end, and the means to this end is the sacrifice of personal actuality” (¶ 328). Moreover, war, along with catastrophy, disease, etc, highlights the finitude, insecurity, and ultimate transitoriness of human existence and puts the health of a state to a test. Hegel does not consider the ideal of “perpetual peace,” as advocated by Kant, a realistic goal towards which humanity can strive. Not only is the sovereignty of each state imprescriptible, but any alliance or league of states will be established in opposition to others.

2) International Law

“International law springs from the relations between autonomous states. It is for this reason that what is absolute in it retains the form of an ought-to-be, since its actuality depends on different wills each of which is sovereign” (¶ 330). States are not private persons in civil society who pursue their self-interest in the context of universal interdependence but rather are completely autonomous entities with no relations of private right or morality. However, since a state cannot escape having relations with other states, there must be at least some sort of recognition of each by the other. International law prescribes that treaties between states ought to be kept, but this universal proviso remains abstract because the sovereignty of a state is its guiding principle, hence states are to that extent in a state of nature in relation to each other (in the Hobbesian sense of there being natural rights to one’s survival with no natural duties to others). “Their rights are actualized only in their particular wills and not in a universal will with constitutional powers over them. This universal proviso of international law therefore does not go beyond an ought-to-be, and what really happens is that international relations in accordance with treaty alternate with the severance of these relations” (¶ 333). Obviously, if states come to disagree about the nature of their treaties, etc., and there is no acceptable compromise for each party, then matters will ultimately be settled by war.

States recognize their own welfare as the highest law governing their relations to one another, however, the claim by a state to recognition of this welfare is quite different from claims to welfare by individual person in civil society. “The ethical substance, the state, has its determinate being, i.e., its right, directly embodied in something existent … and the principle of its conduct and behavior can only be this concrete existent and not one of many universal thoughts supposed to be moral commands” (¶ 337). States recognize each other as states, and even in war there is awareness of the possibility that peace can be restored and that therefore war ought to come to an end, as well as understandings about the proper limitations on the waging of war. However, at most this translates into the jus gentium, the law of nations understood as customary relationships, which remains a “maelstrom of external contingency.” The principles of the mind or spirit (Volksgeist) of a nation-state are wholly restricted because its particularity is already that of realized individuality, possessing objective actuality and self-consciousness. Hence, the reciprocal relations of states to one another partake of a “dialectic of finitude” out of which arises the universal mind, “the mind of the world, free from all restriction, producing itself as that which exercises its right–and its right is the highest right of all–over these finite minds in the ‘history of the world which is the world’s court of judgment'” (¶ 340).

3) World History

To say that history is the world’s court of judgment is to say that over and above the nation-states, or national “spirits,” there is the mind or Spirit of the world (Weltgeist) which pronounces its verdict through the development of history itself. The verdicts of world history, however, are not expressions of mere might, which in itself is abstract and non-rational. Rather than blind destiny, “world history is the necessary development, out of the concepts of mind’s freedom alone, of the moments of reason and so of the self-consciousness and freedom of mind” (¶ 342). The history of Spirit is the development through time of its own self-consciousness through the actions of peoples, states, and world historical actors who, while absorbed in their own interests, are nonetheless the unconscious instruments of the work of Spirit. “All actions, including world-historical actions, culminate with individuals as subjects giving actuality to the substantial. They are the living instruments of what is in substance the deed of the world mind and they are therefore directly at one with that deed though it is concealed from them and is not their aim and object” (¶ 348). The actions of great men are produced through their subjective willing and their passion, but the substance of these deeds is actually the accomplishment not of the individual agent but of the World Spirit (e.g., the founding of states by world-historical heroes).

Hegel says that in the history of the world we can distinguish several important formations of the self-consciousness of Spirit in the course of its free self-development, each corresponding to a significant principle. More specifically, there are four world-historical epochs, each manifesting a principle of Spirit as expressed through a dominant culture. In the Philosophy of Right, Hegel discusses these in a very abbreviated way in paragraphs 253-260, which brings this work to an end. Here we will draw from the more elaborated treatment in the appendix to the introduction to Hegel’s lectures on the Philosophy of World History.

(1) The Oriental Realm (mind in its immediate substance)

Here Spirit exists in its substantiality (objectivity) without inward differentiation. Individuals have no self-consciousness of personality or of rights–they are still immersed in external nature (and their divinities are naturalistic as well). Hegel characterizes this stage as one of consciousness in its immediacy, where subjectivity and substantiality are unmediated. In his Philosophy of History Hegel discusses China, India, and Persia specifically and suggests that these cultures do not actually have a history but rather are subject to natural cyclical processes. The typical governments of these cultures are theocratic and more particularly despotism, aristocracy, and monarchy respectively. Persia and Egypt are seen as transitional from these “unhistorical” and “non-political” states. Hegel calls this period the “childhood” of Spirit.

(2) The Greek Realm (mind in the simple unity of subjective and objective)

In this realm, we have the mixing of subjective freedom and substantiality in the ethical life of the Greek polis, because the ancient Greek city-states give expression to personal individuality for those who are free and have status. However, the relation of individual to the state is not self-conscious but is unreflective and based on obedience to custom and tradition. Hence, the immediate union of subjectivity with the substantial mind is unstable and leads to fragmentation. This is the period of the “adolescence” of Spirit.

(3) The Roman Realm (mind in its abstract universality)

At this stage, individual personality is recognized in formal rights, thus including a level of reflection absent in the Greek realm of “beautiful freedom.” Here freedom is difficult because the universal subjugates individuals, i.e., the state becomes an abstraction over above its citizens who must be sacrificed to the severe demands of a state in which individuals form a homogeneous mass. A tension between the two principles of individuality and universality ensues, manifesting itself in the formation of political despotism and insurgency against it. This realm gives expression to the “manhood” of Spirit.

(4) The Germanic Realm (reconciled unity of subjective and objective mind)

This realm comprises along with Germany and the Nordic peoples the major European nations (France, Italy, Spain) along with England. The principle of subjective freedom comes to the fore in such a way as to be made explicit in the life of Spirit and also mediated with substantiality. This involves a gradual development that begins with the rise of Christianity and its spiritual reconciliation of inner and outer life and culminates in the appearance of the modern nation-state, the rational Idea of which is articulated in the Philosophy of Right. (Along the way there are several milestones Hegel discusses in his Philosophy of History that are especially important in the developing of the self-consciousness of freedom, in particular the Reformation, the Enlightenment, and the French Revolution.) One of the significant features of the modern world is the overcoming of the antithesis of church and state that developed in the Medieval period. This final stage of Spirit is mature “old age.”

In sum, for Hegel the modern nation-state can be said to manifest a “personality” and a self-consciousness of its inherent nature and goals, indeed a self-awareness of everything which is implicit in its concept, and is able to act rationally and in accordance with its self-awareness. The modern nation-state is a “spiritual individual,” the true historical individual, precisely because of the level of realization of self-consciousness that it actualizes. The development of the perfected nation-state is the end or goal of history because it provides an optimal level of realization of self-consciousness, a more comprehensive level of realization of freedom than mere natural individuals, or other forms of human organization, can produce.

7. Closing Remarks

In closing this account of Hegel’s theory of the state, a few words on a “theory and practice” problem of the modern state. In the preface to the Philosophy of Right Hegel is quite clear that his science of the state articulates the nature of the state, not as it ought to be, but as it really is, as something inherently rational. Hegel’s famous quote in this regard is “What is rational is actual and what is actual is rational,” where by the ‘actual’ (Wirklich) Hegel means not the merely existent, i.e., a state that can be simply identified empirically, but the actualized or realized state, i.e., one that corresponds to its rational concept and thus in some sense must be perfected. Later in the introduction of the Idea of the state in paragraph 258, Hegel is at pains to distinguish the Idea of the state from a state understood in terms of its historical origins and says that while the state is the way of God in the world we must not focus on particular states or on particular institutions of the state, but only on the Idea itself. Furthermore he says, “The state is no ideal work of art; it stands on earth and so in the sphere of caprice, chance, and error, and bad behavior may disfigure it in many respects. But the ugliest of men, or a criminal, or an invalid, or a cripple, is still always a living man. The affirmative, life, subsists despite his defects, and it is this affirmative factor which is our theme here” (¶ 258, addition). The issue, then, is whether the actual state — the subject of philosophical science — is only a theoretical possibility and whether from a practical point of view all existing states are in some way disfigured or deficient. Our ability to rationally distill from existing states their ideal characteristics does not entail that a fully actualized state does, or will, exist. Hence, there is perhaps some ambiguity in Hegel’s claim about the modern state as an actualization of freedom.

8. References and Further Reading

a. Works by Hegel in German and in English Translation

Below are works by Hegel that relate most directly to his social and political philosophy.

  • Encyklopädie der philosophischen Wissenschaften im Grundrisse, Berlin 1830; ed. G. Lasson & O. Pöggler (Hamburg, 1959).
    • In the third volume of this work, The Philosophy of Spirit, the section on Objective Spirit corresponds to Hegel’s Philosophy of Right.
  • Grundlinien der Philosophie des Rechts, ed. J. Hoffmeister. Hamburg, 1955.
  • Hegels Grundlinien der Philosophie des Rechts, 2nd edn. hrsg. G. Lasson. Leipzig, 1921.
    • This is the most recent edition referred to in T. M. Knox’s translation of 1952.
  • Hegel’s Logic, trans. William Wallace. Oxford University Press, 1892.
  • Hegel’s Phenomenology of Spirit, trans. A.V. Miller. Oxford University Press, 1977.
  • Hegel’s Philosophy of Mind, trans. William Wallace & A. V. Miller. Oxford University Press, 1971.
  • Hegel’s Philosophy of Right, trans. T. M. Knox. Clarendon Press, 1952; Oxford University Press, 1967.
  • Hegel’s Political Writings, trans. T. M. Knox, with an introductory essay by Z. A. Pelczynski. Oxford: Clarendon Press, 1964.
    • This contains the following pieces: “The German Constitution,” “On the Recent Domestic Affairs of Wurtemberg …,” “The Proceedings of the Estates Assembly in the Kingdom of Wurtemberg, 1815-1816,” and “The English Reform Bill.”
  • Hegels sämtliche Werke, vol. VIII, ed. E. Gans. Berlin: 1833, 1st ed.; 1854, 2nd ed..
    • These were the first editions of the material of The Philosophy of Right to incorporate additions culled from notes taken at Hegel’s lectures. T. M. Knox reproduces these in his 1952 translation.
  • Jenaer Realphilosophie I: Die Vorlesungen von 1803/4, ed. J. Hoffmeister. Leipzig, 1913.
  • Jenaer Realphilosophie II: Die Vorlesungen von 1805/6, ed. J. Hoffmeister. Hamburg, 1967.
  • Lectures on the Philosophy of World History: Introduction, trans. H. B. Nisbet, with an introduction by Duncan Forbes. Cambridge University Press, 1975.
    • This is based on the 1955 German edition by J. Hoffmeister.
  • Natural Law, trans. T. M. Knox, with an introduction by H. B. Acton. Philadelphia, PA: University of Pennsylvania Press, 1977.
  • Phänomenologie des Geistes, ed. J. Hoffmeister. Hamburg: Felix Meiner, 1952.
  • The Philosophy of History, trans. J. B. Sibree. New York: Dover Publications Inc., 1956.
    • This is a reprint of the 1899 translation (the first was done in 1857) of Hegel’s Lectures on the Philosophy of History, published by Colonial House Press. The Dover edition has a new introduction by C. J. Friedrich.
  • Political Writings. Eds. L. Dickie & H. B. Nisbet. Cambridge Texts in the History of Political Thought. Cambridge: Cambridge University Press, 1999.
  • Politische Schriften, Nachwort von Jürgen Habermas. Frankfurt/Main, 1966. A more recent edition of the material of the Schriften zur Politik (see below).
  • Reason in History, trans. R. S. Hartman. New York, 1953. The introduction to Hegel’s lectures on the Philosophy of World History.
  • Schriften zur Politik und Rechtsphilosophie, 2nd ed. hrsg. Georg Lasson. Leipzig, 1923. This is the basis of T. M. Knox’s translations in Hegel’s Political Writings, 1964.
  • System of Ethical Life and First Philosophy of Spirit, trans. H. S. Harris & T. M. Knox. Albany, NY: State University of New York Press, 1979.
  • Die Vernunft in der Geschichte, ed. J. Hoffmeister. Hamburg, 1955.
    • This is the fourth edition of Hegel’s lectures on the Philosophy of World History given in Berlin from 1822-1830; the previous editions were done by Eduard Gans (1837), Karl Hegel (1840), and Georg Lasson (1917, 1920, 1930). In the 1930 edition, Lasson added additional manuscript material by Hegel as well as lecture notes from students, which are preserved in Hoffmeister’s edition.
  • Werke. Frankfurt: Suhrkamp Verlag, 1970.
    • This is the most recent and comprehensive collection of Hegel’s works. His social and political writings are contained in various volumes.

b. Works on Hegel’s Social and Political Philosophy

The books listed below either focus on one or more aspects of Hegel’s social and political thought or include some discussion in this area and, moreover, are significant enough works on Hegel to be included. The most comprehensive bibliography on Hegel is Hegel-Bibliographie (München: K. G Saur Verlag, 1980). For books and articles in the last 25 years, consult the Philosopher’s Index.

  • Avineri, Shlomo. Hegel’s Theory of the Modern State. Cambridge: Cambridge University Press, 1972.
  • Bosanquet, Bernard. The Philosophical Theory of the State. 4th edition, London: Macmillan, 1930, 1951.
  • Cullen, Bernard. Hegel’s Social and Political Thought: An Introduction. New York: St. Martin’s Press, 1979.
  • Findlay, John. Hegel: A Re-examination (1958). Oxford: Oxford University Press, 1976.
  • Foster, Michael B. The Political Philosophies of Plato and Hegel. Oxford: Clarendon Press, 1935/1968.
  • Dickey, Laurence. Religion, Economics, and the Politics of Spirit. Cambridge: Cambridge University Press, 1987.
  • Franco, Paul. Hegel’s Philosophy of Freedom. New Haven, CT: Yale University Press, 2000.
  • Gray, Jesse Glen. Hegel And Greek Thought. New York: Harper & Row, 1968.
  • Hardimon, Michael O. Hegel’s Social Philosophy: The Project of Reconciliation. Cambridge University Press, 1994.
  • Harris, H. S. Hegel’s Development, vols. 1 & 2. Oxford: Clarendon Press, 1972, 1983.
  • Haym, Rudolf. Hegel und seine Zeit. Berlin, 1857; Hildenshine, 1962).
  • Henrich, Dieter & R. P. Horstman. Hegels Philosophie des Rechts. Stuttgart: Klett-Catta, 1982.
  • Hicks, Steven V. International Law and the Possibility of a Just World Order: An Essay on Hegel’s Universalism. Value Inquiry Book Series 78. Amsterdam/Atlanta, GA: Rodopi, 1999.
  • Hyppolite, Jean. Genesis and Structure of Hegel’s Phenomenology of Spirit (1946). Trans. S. Cherniak & J. Heckman. Evanston, IL: Northwestern University Press, 1974.
  • Kainz, Howard P. Hegel’s Philosophy of Right with Marx’s Commentary. The Hague: Nijhoff, 1974.
  • Kaufman, Walter A. Hegel’s Political Philosophy. New York: Atherton Press, 1970.
  • ________. Hegel: A Reinterpretation. New York: Anchor Books, 1966.
  • Kelly, George Armstrong. Hegel’s Retreat From Eleusis: Studies In Political Thought. Princeton: Princeton University Press, 1978.
  • Kojeve, Alexander. Introduction to the Reading of Hegel (1947). Ed. Allen Bloom, trans. J. H. Nichols. New York: Basic Books, 1969.
  • Lakeland, Paul. The Politics of Salvation: The Hegelian Idea of the State. Albany, NY: SUNY Press, 1984.
  • MacGregor, David. The Communist Ideal in Hegel and Marx. Toronto: University of Toronto Press, 1984.
  • ___________. Hegel, Marx, and the English State. University of Toronto Press, 1996.
  • Marcuse, Herbert. Reason and Revolution: Hegel and the Rise of Social Theory. Boston: Beacon Press, 1960.
  • Mehta, V.R. Hegel and the Modern State. New Delhi: Associated Publishing House, 1968.
  • Mitias, Michael. Moral Foundation of the State in Hegel’s Philosophy of Right. Amsterdam: Rodopi, 1984.
  • Morris, George S. Hegel’s Philosophy of the State and of History. Chicago: S. C. Griggs & Co., 18871, 18922.
  • O’Brien, George Dennis. Hegel On Reason and History. Chicago: Chicago University Press, 1975.
  • O’Neil, John, ed. Hegel’s Dialectic of Desire and Recognition: Texts and Commentary. Albany, NY: SUNY Press, 1996.
  • Paolucci, Henry. The Political Thought of G. W. F. Hegel. Whitestone, NY: Griffon House, 1978.
  • Pelczynski, Z. A. (ed.). Hegel’s Political Philosophy: Problems and Perspectives. London: Cambridge University Press, 1971.
  • ___________. The State and Civil Society: Studies in Hegel’s Political Philosophy. Cambridge: Cambridge University Press, 1984.
  • Perkins, Robert L. (ed.). History and System: Hegel’s Philosophy of History. Albany: State University of New York Press, 1984.
  • Plamenatz, John. Man and Society, vol. II. London: Longman, 1963.
  • Plant, Raymond. Hegel: An Introduction. London: Allen & Unwin Ltd., 1972; Basil Blackwell, 1983.
  • Pepperzak, Adriaan T. Philosophy and Politics: A Commentary to the Preface of Hegel’s Philosophy of Right. Dordrecht: Martinus Nijhoff Publishers, 1987.
  • Popper, Karl. The Open Society and Its Enemies. Princeton: Princeton University Press, 1966.
  • Reyburn, Hugh A. The Ethical Theory of Hegel: A Study of the Philosophy of Right. Oxford: Clarendon Press, 1921.
  • Riedel, Manfred. Between Tradition and Revolution: The Hegelian Transformation of Political Philosophy. Cambridge: Cambridge University Press, 1984.
  • Ritter, Joachim. Hegel and the French Revolution: Essays on ‘The Philosophy of Right’. trans. Richard Dien Winfield, Cambridge, MA: The MIT Press, 1982.
  • Rosenkranz, Karl. Hegel As The National Philosopher of Germany. trans. G. S. Hall, St. Louis: Gray, Baker, 1874.
  • Rosenweig, Franz. Hegel und der Staat. Berlin/München, 1920; Aalen: Scientia Verlag, 1982.
  • Shanks, Andrew. Hegel’s Political Theology. Cambridge: Cambridge University Press, 1991.
  • Shklar, Judith N. Freedom and Independence: A Study of the Political Ideas of Hegel’s ‘Phenomenology of Mind’. Cambridge: Cambridge University Press, 1976.
  • Siebert, Rudolf J. Hegel’s Concept of Marriage and Family: The Origin of Subjective Freedom. Washington, D.C.: The University Press of America, 1979.
  • _______. Hegel’s Philosophy of History: Theological, Humanistic and Scientific Elements. Washington: University Press of America, 1979.
  • Siep, Ludwig. Anerkennung als Prinzip der praktische Philosophie: Zur Hegels Jenaer Philosophie des Geistes. München, Alber, 1979
  • Singer, Peter. Hegel. Past Masters Series (Oxford University Press, 1983).
  • Smith, Steven B. Hegel’s Critique of Liberalism: Rights in Context. Chicago: Chicago University Press, 1989.
  • Steinberger, Peter J. Logic and Politics: Hegel’s Philosophy of Right. New Haven: Yale University Press, 1982.
  • Stepelevich, L. S. & D. Lamb, (eds.). Hegel’s Philosophy of Action. Atlantic Highlands, NJ: Humanities Press, 1983.
  • Taylor, Charles. Hegel and Modern Society. New York and London: Cambridge University Press, 1979.
  • Tunick, Mark. Hegel’s Political Philosophy. Princeton University Press, 1992.
  • Verene, Donald Phillip (ed.). Hegel’s Social and Political Thought: The Philosophy of Objective Spirit. Atlantic Highlands, NJ: Humanities Press/Sussex: Harvester Press, 1980.
  • Walsh, William Henry. Hegelian Ethics. London/Melbourne: Macmillan; New York: St. Martin’s Press, 1969.
  • Wazek, Norbert. The Scottish Enlightenment and Hegel’s Account of ‘Civil Society‘. Boston: Kluwer Academic Publishers, 1988.
  • Weil, Eric. Hegel et L’Etat. Paris, 1950.
  • Westphal, Merold. History and Truth in Hegel’s Phenomenology. Atlantic Highlands, NJ: Humanities Press, 1979.
  • Wilkins, Burleigh Taylor. Hegel’s Philosophy of History. Ithaca: Cornell University Press, 1974.
  • Williams, Robert R. (ed.). Beyond Liberalism and Communitarianism: Studies in Hegel’s Philosophy of Right. Proceedings of the 15th Biennial Meeting of the Hegel Society of America. SUNY Press, 2000.
  • Wood, Allen. Hegel’s Ethical Thought. Cambridge University Press, 1982.

Author Information

David A. Duquette
Email: david.duquette@snc.edu
St. Norbert College
U. S. A.

Johann Georg Hamann (1730—1788)

HamannJohann Georg Hamann was the philosophically most sophisticated thinker of the German Counter-enlightenment. Born in 1730 in Königsberg in eastern Prussia, Hamann was a contemporary and friendly acquaintance of the philosopher Immanuel Kant, and in many ways Hamann’s career can be seen in parallel to that of his great friend. Like Kant, Hamann attended the University of Königsberg, and in his early life was a devoted partisan of the Enlightenment, the philosophical and literary movement that emphasized the clearing away of outdated prejudice and the application of scientific reason to every area of human life. But during a business trip to London (on behalf of the firm of the Berens family, who also published Kant’s works), Hamann underwent a sort of conversion that involved giving up his commitment to the secular Enlightenment in favor of a more orthodox view of Protestant Christianity. As a consequence, he embarked on a career of trenchant and often scathing criticism of the Enlightenment. This change in world-views coincided with his reading of the British empiricist philosophers George Berkeley and David Hume. Hamann saw the idealism of the former and the skepticism of the latter as constituting a reductio ad absurdum of Enlightenment thought: Scientific reason leads us inevitably either to doubt or to deny the reality of the world around us. Three of Hamann’s intellectual achievements are of particular significance: His writings Sokratische Denkwürdigkeiten (Socratic Memorabilia) and Aesthetica in nuce (Aesthetics in a Nutshell), in which he opposed Enlightenment thought with an indirect and ironic mode of discourse emphasizing the importance of aesthetic experience and the role of genius in intuiting nature; his views on language; and his influential criticisms of Kant’s critical thought, expressed in his “Metakritik über den Purismum der Vernunft” and in his commentary, in a letter to Johann Gottfried Herder, on Kant’s essay “What is Enlightenment?”

Table of Contents

  1. Sokratische Denkwürdigkeiten and Aesthetica in nuce
  2. Hamann’s Views on Language
  3. “Metacritique” of Kant
  4. References and Further Reading
    1. Works in German
    2. Works in English
    3. Works in English that Discuss Hamann

1. Sokratische Denkwürdigkeiten and Aesthetica in nuce

Hamann’s rejection of the Enlightenment was greeted with distress by his friends Kant and Berens. Although they hoped that he could be won back to the cause of reason, these hopes were dashed with the publication in 1759 of Sokratische Denkwürdigkeiten, and the following year of Aesthetica in nuce. Together these two works offer a world-view that might be described as antirationalist but not irrationalist.

Hamann’s intention in the Sokratische Denkwürdigkeiten is to offer a defense of religious faith that renders such faith immune against rational attack while in no way accepting the rationalist’s terms of debate. In order to do this, however, he faces a seemingly insoluble problem: He must undermine the grounds of the Enlightenment view of reason and religion without committing himself to other, opposed positions that are subject to rational criticism and refutation. Several aspects of how he goes about this were very influential in German thought in the 18th century. First, the work is written under a pseudonym, or rather, not under any name at all: The title page says that the Denkwürdigkeiten were “assembled for the boredom of the public by a lover of boredom,” most likely a reference to the Enlighteners’ desire to educate the public in the name of reason. By distancing himself from the authorship of what was probably his most important work, Hamann makes clear that any arguments offered or positions taken in the book ought to be viewed as moves in a game rather than as expressions of his rational faculty. Second, Hamann makes crucial use of irony, specifically Socratic irony, in his attack on the Enlightenment. “I have,” says Hamann at the beginning of the work, “written about Socrates in a Socratic manner. Analogy was the soul of his syllogisms, and he gave them irony as their body.” Specifically, Hamann holds up Socrates, the philosophers’ secular saint, in order to draw an unfavorable contrast between him and the Enlightenment. Despite his wisdom, Socrates explicitly renounced his claim to know the answers to the questions he asked; rather than taking and defending determinate positions on the issues he was interested in, Socrates engaged his listeners in conversation so as to bring them to realize that they did not know the answers to these questions any more than Socrates did. Similarly, Hamann intends the Sokratische Denkwürdigkeiten to show that Berens and Kant are (at least) as far from genuine knowledge as he is. Finally, like all of Hamann’s works, the style of the Sokratische Denkwürdigkeiten is intentionally opaque: In contrast to the Enlightenment emphasis on universal truths that transcend the time and place in which they are expressed, Hamann fills his text with oblique allusions to a wide variety of texts in several languages; moves from one point to another with little indication of how the various passages are supposed to hang together; and shifts without warning from careful argumentative analysis to citation of texts to something like oracular declamation. As a result, it is impossible for the reader to forget that the text she is reading is the work of a particular individual writing in a particular time and place, rather than expressing timeless deliverances of reason.

How then does the text of the Sokratische Denkwürdigkeiten accomplish the defense of religious faith Hamann desires? The chief contention of the work is that religious faith is neither based on nor subject to reason. Here Hamann relies not so much on Socrates but rather on David Hume, whose skeptical writings had affected him so deeply a few years before. Hume would doubtless have found little to his liking in Hamann’s rejection of the Enlightenment, but Hamann found much in Hume to serve his purposes. Specifically, Hamann adapts Hume’s important claim that “belief… [etc.].” Hume intends this as a way of answering the worst sort of skepticism: If our beliefs are not based on reasoning, then reasoning cannot threaten them, either. Hamann makes use of the fact that in German there is one word, ‘Glaube,’ that corresponds both to ‘belief’ and to ‘faith’ in English. Thus in his hands Hume’s claim is extended to religious faith as well, making it immune from rational criticism. But this is not to be understood as a position taken with debates about the philosophical foundations of religion. Instead, Hamann again makes use of the figure of Socrates. He compares Socrates to someone refusing to join a game of cards: If this person didn’t know how to play, Hamann observes, we might take their refusal as an expression of incapacity, much as we would take an expression of ignorance from an ordinary person as a genuine indication that he lacks knowledge. But in the case of Socrates, who was manifestly a deep thinker and great philosopher, professions of ignorance must be read as refusals to participate in a game in which the other players “break the rules of the game and steal its joy [das Glück desselben stehlen]. Socrates’ ignorance thus became a “thorn in the eyes” of the sophists (here again Kant and Berens are clearly intended) and serve as “testimony” against the “new Athenians” of Hamann’s time, who deified Socrates “in order to be better able to mock the carpenter’s son [Jesus].”

But if Socrates was a great philosopher, as Hamann emphasizes, what can he be said to know? Hamann’s answer to this question is ‘genius.’

What in Homer makes up for the ignorance of artistic rules, that Aristotle thought up after him, and what in Shakespeare makes up for the ignorance or violation of these rules? Genius (Genie) is the unambiguous answer. Socrates could thus well have been ignorant; he had a genius (Genius) on whose knowledge he could rely, and who he feared as his God.Hamann’s use of the notion of genius in the Sokratische Denkwürdigkeiten serves as a bridge to his second major work, the Aesthetica in nuce. His target in the Aesthetica is Enlightenment thought as it applies specifically to art and beauty. Aesthetics in the Enlightenment alternated between attempts to reduce art to rules, more specifically rules for the accurate and morally uplifting imitation of nature, and attempts to explain art as a response to the subjective human capacity for feeling and sensation. Hamann emphatically rejects both of these tendencies, along with the devaluation of the aesthetic he seems them as implying. Far from being reducible to rational principles, in his view aesthetic experience is a fundamental and immediate experience of nature, which he encapsulates (both in the Aesthetica and in Sokratische Denkwürdigkeiten) under the term ‘genius.’

The chief philosophical significance of the Aesthetica in nuce is that Hamann here deepens his conception of the connection between artistic genius, nature, and God. Nature, he says, is “a speech through creation to creation.” That is, nature is a text written by God, which, being creatures ourselves, we are able to understand through His grace. But this understanding is of course not a rational one, through concepts and scientific investigation. Rather, in aesthetic experience we grasp nature in a manner that precedes, and indeed forms the basis for, rational thought: “Poetry [Poesie] is the native tongue of the human race, just as gardening is older than agriculture, painting older than writing, chant older than declamation, similes older than conclusions, and barter than trade.” This view has radical consequences for the Enlightenment. Whereas the task of philosophical aesthetics in the early modern period was to incorporate aesthetic experience into the rational worldview, Hamann now argues that we must instead do the former, that is, view reason as one aspect of our aesthetic experience of the world. It is thus pointless to try to formulate rational standards for beauty. Second, giving art priority over reason threatens reason’s claim to be the proper form for representing nature, which is crucial to the central role given in the Enlightenment to natural science. Finally, if reason is subordinated to art rather than the reverse, then in so far as there is a tension between artistic and rational views of the world the value placed on reason in the 18th-century represents not progress but regress. Hamann’s early writings inspired thinkers such as Friedrich Heinrich Jacobi, who appropriated the skeptical arguments of David Hume to argue that reason is based entirely on faith, and Johann Gottfried Herder, who offered an account of human thought that emphasized the continuous historical development of humanity from its original natural state. More immediately, Hamann’s thought had an enormous impact on the literary movement known as the Sturm und Drang– literally, “storm and stress.” Works of the Sturm und Drang emphasized nature and human passion. Indeed these two themes were closely linked, in that passion was seen as closer to nature. More distantly, Hamann’s thought was instrumental in the rise, around the turn of the century, of the Romantic movement in Germany.

2. Hamann’s Views on Language

From his earliest works onward, language was a central theme in all of Hamann’s writings. Here too his opposition to the Enlightenment was influential not only in his time but also in present-day philosophy and literary theory. Hamann’s account of language can best be understood by contrast with an admittedly too-simple sketch of the sort of view he opposed. Much Enlightenment thought on language was naturalistic, that is, it saw language as a useful tool invented by human beings. The original humans were thinking, rational beings who invented symbols, attaching names things in the world around them for purposes of communication and learning. Thus both reason and the world precede, and are independent of, language. Hamann rejects this view in all its particulars.

Important elements of Hamann’s account of language are already visible in the Aesthetica in nuce, in particular in the claim that the world is “a speech through creation to creation.” Here it is clear that language for Hamann is not something projected onto the world by human reason, but instead is as it were embedded in the things themselves by God the creator. At some points in his writings on language, Hamann maintains the position that language is simultaneously the work of both God and humans, while at other places he seems to lean more toward the view that God alone is the source of language. In any case, he clearly holds the view that neither thought nor reason is possible independently of language. Indeed, since God’s act of creation is in a sense inherently linguistic, he must hold that language precedes, or at least is contemporaneous with reason in particular and thought in general. As we will see, this is an idea that is very important for his critique of the philosophy of Immanuel Kant.

3. “Metacritique” of Kant

In 1781 Hamann’s friend but philosophical opponent Immanuel Kant published his Critique of Pure Reason. Kant’s project in the Critique has two sides. On the one hand, Kant argues that reason is incapable of attaining knowledge of the existence of, for example, God and the immortality of the soul; however, these beliefs are also incapable of being refuted through reason. This much, of course, Hamann could gladly agree with. But Kant also undertakes to defend both reason and the claim of natural science to offer a privileged description of the world. The latter task is accomplished in the Transcendental Deduction of the Categories, in which Kant argues that our experience requires us to understand the natural world as being composed of substances interacting according to necessary causal laws discoverable by natural science. The former task (which is Hamann’s chief target) is accomplished by reinterpreting reason as the ability to set goals for human cognition and moral action. This alarmed Hamann because it put reason in the place of religious faith, along with the tradition and culture he thought essential to human understanding. In response to Kant’s work, which was the most important event in German philosophy in the 18th century, Hamann penned a short essay entitled “Metakritik über den Purismum der Vernunft” (“Metacritique on the Purism of Reason”). Although the Metacritique was never published in Hamann’s lifetime, he included it in a letter to his friend Johann Gottfried Herder (who was also a student of Kant’s), and Herder passed it on to Friedrich Heinrich Jacobi, thus enabling this small but interesting text to exert what one commentator has called a “subterranean influence” on German thought after Kant.

Hamann’s thesis in the Metacritique is that “language is the center of reason’s misunderstanding with itself.” More specifically, Hamann thinks that Kant’s critical philosophy, while maintaining that everything in the world must submit to rational questioning and appraisal, nevertheless overlooks the crucial fact that all use of reason, including Kant’s reason, depends on language: Kant imagines, he says, that he can simply “invent” a “universal philosophical language,” whereas here and elsewhere Hamann maintains that words have meaning only in relation to the time and place where they are appropriate. Hamann is clearly on to something important here, because the force of Kant’s conclusions in the Critique of Pure Reason requires that we accept his quite substantial body of terminology, such as the distinctions between a priori and a posteriori, and between analytic and synthetic propositions. But, one might ask, why can’t one simply invent terms of art and stipulate their meanings? This is probably, in fact, what Kant saw himself as doing. Hamann answers this question indirectly, by appealing to the empiricists Berkeley and Hume. Both Berkeley and Hume reject the existence of so-called “abstract ideas,” arguing that there is no philosophical justification for referring to anything in the world other than particular sensible things, whereas abstract ideas are things that can exist only in the privacy of human minds. Since Kant himself accepts the quasi-empiricist view that our knowledge is limited to possible experience, Hamann’s point is that Kant cannot justify his own philosophical enterprise unless he can offer a justification for the very language in which the enterprise is couched- a demand that seems impossible for Kant to fulfill.

Quite late in his life, Hamann participated in another intellectual dispute involving Kant, this one centering on the question, “What is enlightenment.” Although Kant was not the first to contribute to this debate, his was the most prominent and influential statement on the question. In his essay, also entitled “What is Enlightenment?,” Kant defines enlightenment as “the departure of human beings from their self-incurred incapacity.” Its slogan, he says, is sapere aude!— Dare to think! Ignorance on this view is a sort of moral failing in human beings who have neglected to exercise their rational faculties to the fullest extent possible. Hamann responded to Kant’s essay not in print, but rather in a letter to a former student of Kant’s, Christian Jacob Kraus. Again, his target is the Enlightenment’s belief that reason rather than culture, tradition, or religious faith, is the proper guide for human life. His response to Kant turns on an important change in Kant’s language: For Kant’s word “incapacity” [Unmündigkeit] he substitutes the word “domination” [Vormundschaft]. Failure to be fully enlightened results, Hamann suggests, not from a failure to think for oneself, but rather from the fact that people are told what to think by people-like Kant– who see themselves as more rational and thus closer to the truth than ordinary mortals. Hamann thus rejects Kant’s view that the incapacity he bemoans is “self-incurred.” Instead, the “enlightened” state replaces one dominant group (say, the aristocracy) with another (“Enlighteners” such as Kant). Here Hamann anticipates, at least in broad strokes, the late 20th-century suspicion that liberal democracy cannot live up to its own pretensions to universal tolerance, because viewing oneself as a citizen in a liberal democracy requires many of us to subordinate some of our most passionately held beliefs to the demands of citizenship.

Johann Georg Hamann died in 1788.

4. References and Further Reading

a. Works in German

  • Samtliche Werke, ed. by Josef Nadler (Vienna: Verlag Herder, 1951).
  • Schriften zur Sprache, ed. by Josef Simon (Frankfurt: Suhrkamp Verlag, 1967).
  • Sokratische Denkwurdigkeiten/Aesthetica in nuce, ed. by Sven-Aage Jorgenson (Stuttgart: Philipp Reclam Verlag, 1968).
  • Hamann and others, Was ist Aufklarung?, ed. by Ehrhard Bahr (Stuttgart: Philipp Reclam Verlag, 1974).

b. Works in English

  • Hamann’s Socratic Memorabilia. A Translation and Commentary, trans. and ed. by James C. O’Flaherty (Baltimore: The Johns Hopkins Press, 1967).
  • What is Enlightenment? 18th Century Answers, 20th Century Questions
  • ed. by James B. Schmidt (Berkeley, CA: University of California Press, 1996).

c. Works in English that Discuss Hamann

  • Beiser, Frederick C., The Fate of Reason: German Philosophy from Kant to Fichte (Cambridge, MA: Harvard University Press, 1987).
  • Berlin, Isaiah, The Magus of the North: J. G. Hamann and the Origins of Modern Irrationalism (New York: Farrar Straus and Giroux, 1993).
  • Dickson, Gwen G., Johann Georg Hamann’s Relational Metacriticism (Berlin and New York: De Gruyter, 1995).
  • Kühn, Manfred, Kant: A Biography (New York : Cambridge University Press, 2001).
  • O’ Flaherty, James C., The Quarrel of Reason with Itself: Essays on Hamann, Nietzsche, Lessing, & Michaelis (Rochester, NY: Camden House, 1991).

Author Information

Ted Kinnaman
Email: mailto:tkinnama@gmu.edu
George Mason University
U. S. A.

Luce Irigaray (1930—)

Luce IrigarayLuce Irigaray is a prominent author in contemporary French feminism and Continental philosophy. She is an interdisciplinary thinker who works between philosophy, psychoanalysis, and linguistics. Originally a student of the famous analyst Jacques Lacan, Irigaray’s departure from Lacan in Speculum of the Other Woman, where she critiques the exclusion of women from both philosophy and psychoanalytic theory, earned her recognition as a leading feminist theorist and continental philosopher. Her subsequent texts provide a comprehensive analysis and critique of the exclusion of women from the history of philosophy, psychoanalytic theory and structural linguistics.

Irigaray alleges that women have been traditionally associated with matter and nature to the expense of a female subject position. While women can become subjects if they assimilate to male subjectivity, a separate subject position for women does not exist. Irigaray’s goal is to uncover the absence of a female subject position, the relegation of all things feminine to nature/matter, and, ultimately, the absence of true sexual difference in Western culture. In addition to establishing this critique, Irigaray offers suggestions for altering the situation of women in Western culture. Mimesis, strategic essentialism, utopian ideals, and employing novel language, are but some of the methods central to changing contemporary culture. Irigaray’s analysis of women’s exclusion from culture and her use of strategic essentialism have been enormously influential in contemporary feminist theory. Her work has generated productive discussions about how to define femininity and sexual difference, whether strategic essentialism should be employed, and assessing the risk involved in engaging categories historically used to oppress women. Irigaray’s work extends beyond theory into practice. Irigaray has been actively engaged in the feminist movement in Italy. She has participated in several initiatives in Italy to implement a respect for sexual difference on a cultural and, in her most recent work, governmental level. Her contributions to feminist theory and continental philosophy are many and her complete works present her readers with a rewarding challenge to traditional conceptions of gender, self, and body.

Table of Contents

  1. Biography
  2. Irigaray’s Project
  3. Influences
    1. Psychoanalysis
    2. Philosophy
  4. Major Themes
    1. Mimesis
    2. Novel Language and Utopian Ideals
    3. Mother/Daughter Relationships
    4. Language
    5. Ethics
    6. Politics
  5. Criticisms
    1. Strategic Essentialism
    2. Privileges Psychological Oppression
    3. Elides Differences
    4. Opaque Writing Style
    5. Exclusive Ethics
    6. Later Work
  6. References and Further Reading
    1. English Translations
    2. Suggested Further Reading

1. Biography

In a 1993 interview with Margaret Whitford, Luce Irigaray specifically says that she does not like to be asked personal questions. She does not want opinions about her everyday life to interfere with interpretations of her ideas. Irigaray believes that entrance into intellectual discussions is a hard won battle for women and that reference to biographical material is one way in which women’s credibility is challenged. It is no surprise that detailed biographical information about Irigaray is limited and that different accounts conflict.

What remains constant between accounts is that Luce Irigaray was born in Belgium in 1930. She holds two doctoral degrees-one in Philosophy and the other in Linguistics. She is also a trained and practicing psychoanalyst. She has held a research post at the Centre National de la Recherche Scientifique de Paris since 1964. She is currently the Director of Research in Philosophy at the center, and also continues her private practice. Perhaps the most well known fact of Irigaray’s life-which Irigaray herself refers to in the opening of je, tu, nous-is her education at, and later expulsion from, the Ecole Freudienne de Paris (Freudian School of Paris). The Ecole Freudienne was founded by the famous psychoanalyst Jacques Lacan. Irigaray trained at the school in the sixties. In 1974, she published the thesis she wrote while studying at the school, Speculum, de l’autre femme, translated into English as Speculum of the Other Woman. This thesis criticized-among philosophical topics-the phallocentrism of Freudian and Lacanian psychoanalysis. The publication of this thesis gained her recognition, but also negatively affected Irigaray’s career. She was relieved of her teaching post at the University of Vincennes and was ostracized by the Lacanian community. In spite of these early hardships, Irigaray went on to become an influential and prolific author in contemporary feminist theory and continental philosophy. In addition to her intellectual accomplishments, Irigaray is committed to active participation in the women’s movement in both France and internationally-especially in Italy. Several of her later texts are dedicated to her work in the women’s movement of Italy. She is still actively researching and publishing.

2. Irigaray’s Project

Irigaray argues that, since ancient times, mothers have been associated with nature and unthinking matter. Further, Irigaray believes that all women have historically been associated with the role of “mother” such that, whether or not a woman is a mother, her identity is always defined according to that role. This is in contrast to men who are associated with culture and subjectivity. While excluded from culture and subjectivity, women serve as their unacknowledged support. In other words, while women are not considered full subjects, society itself could not function without their contributions. Irigaray ultimately states that Western culture itself is founded upon a primary sacrifice of the mother, and all women through her.

Based on this analysis, Irigaray says that sexual difference does not exist. True sexual difference would require that men and women are equally able to achieve subjectivity. As is, Irigaray believes that men are subjects (e.g. self-conscious, self-same entities) and women are “the other” of these subjects (e.g. the non-subjective, supporting matter). Only one form of subjectivity exists in Western culture and it is male. While Irigaray is influenced by both psychoanalytic theory and philosophy, she identifies them both as influential discourses that exclude women from a social existence as mature subjects. In many of her texts, Irigaray seeks to unveil how both psychoanalytic theory and philosophy exclude women from a genuine social existence as autonomous subjects, and relegate women to the realm of inert, lifeless, inessential matter. With this critique in place, Irigaray suggests how women can begin to reconfigure their identity such that one sex does not exist at the expense of the other. However, she is unwilling to definitively state what that new identity should be like. Irigaray refrains from prescribing a new identity because she wants women to determine for themselves how they want to be defined. While both philosophy and psychoanalytic theory are her targets, Irigaray identifies philosophy as the master discourse. Irigaray’s reasons for this designation are revealed in Speculum of the Other Woman where she demonstrates how philosophy-since Ancient times-has articulated fundamental epistemological, ontological, and metaphysical truths from a male perspective that excludes women. While she is not suggesting that philosophy is single-handedly responsible for the history of women’s oppression, she wants to emphasize that the similar type of exclusion manifest in both philosophy and psychoanalysis predates the birth of psychoanalysis. As the companion discourse to philosophy, psychoanalysis plays a unique role. While Irigaray praises psychoanalysis for utilizing the method of analysis to reveal the plight of female subjectivity, she also thinks that it reinforces it. Freud attempts to explain female subjectivity and sexuality according to a male model. From this perspective, female subjectivity looks like a deformed or insufficiently developed form of male subjectivity. Irigaray argues that if Freud had turned the tools of analysis onto his own discourse, then he would have seen that female subjectivity cannot be understood through the lenses of a one-sex model. In other words, negative views of women exist because of theoretical bias-not because of nature. Through her critiques of both philosophy and psychoanalytic theory, Irigaray argues that women need to attain a social existence separate from the role of mother. However, this alone will not change the current state of affairs. For Irigaray is not suggesting that the social role of women will change if they merely step over the line of nature into culture. Irigaray believes that true social change will occur only if society challenges its perception of nature as unthinking matter to be dominated and controlled. Thus, while women must attain subjectivity, men must become more embodied. Irigaray argues that both men and women have to reconfigure their subjectivity so that they both understand themselves as belonging equally to nature and culture. Irigaray’s discussions of mimesis, novel language and utopian ideals, reconfiguring the mother/daughter relationship, altering language itself, ethics, and politics are all central to achieving this end.

3. Influences

Irigaray’s interdisciplinary interests in philosophy, psychoanalysis, and linguistics underscore that her work has more than one influence. Two main discourses that maintain a strong presence throughout her work are psychoanalysis, with Sigmund Freud and Jacques Lacan as its representatives, and philosophy. Insofar as Lacanian psychoanalysis works out of a background in structural linguistics, both Lacan and Irigaray also focus on language. Irigaray engages with philosophy, psychoanalysis and linguistics in order to uncover the lack of true sexual difference in Western culture.

a. Psychoanalysis

Irigaray states on the opening page of An Ethics of Sexual Difference that each age is defined by a philosophical issue that calls to be thoroughly examined-ours is sexual difference. Sexual difference is often associated with the anatomical differences between the sexes. However, Irigaray follows the French psychoanalyst Jacques Lacan in understanding sexual difference as a difference that is assigned in language. While Irigaray is critical of Lacan, she is influenced by Lacan’s interpretation of Freud’s theory of subject formation.

Freud’s work has served as a starting point for diverse psychoanalytic theories such as drive theory, object relations theory, and ego psychology. Lacan interprets Freud’s work from a background in structural linguistics, philosophy, and, of course, psychoanalysis. Of particular importance to Irigaray’s work is Lacan’s claim that there are two key moments in the formation of a child’s identity: the formation of an imaginary body and the assignation of sexual difference in language. Freud introduces the idea of an imaginary body in The Ego and the Id, in the section of the same name, when he describes the ego (self-consciousness) as neither strictly a psychic phenomenon nor a bodily phenomenon. Freud believes that an ego is formed in reference to a body, such that the manner in which an infant understands his or her selfhood is inseparable from his or her bodily existence. However, the body that an infant attributes to him or herself is not objectively understood-it is the mind’s understanding of the body. This means that a person’s understanding of his or her own body is imbued with a degree of fantasy and imagination. In his famous essay “The Mirror Stage as Formative of the I,” Lacan expands Freud’s comments on the bodily ego into a theory about imaginary anatomy. Lacan states that the first of two key moments in subject formation is the projection of an imaginary body. This occurs in the mirror stage at roughly six months. As a being who still lacks mobility and motor control, an infant who is placed in front of a mirror (another person can serve here as well, typically the mother) will identify with the unified, idealized image that is reflected back in the mirror. While the image in the mirror does not match the infant’s experience, it is a key moment in the development of his or her ego. Rather than identify with him or herself as a helpless being, the child choose to identify with the idealized image of him or herself. Lacan believes that the element of fantasy and imagination involved in the identification with the mirror image marks the image as simultaneously representative and misrepresentative of the infant. While the body of the mirror stage is key to the infant’s identity, it is also only an interpretation of his or her biological existence. In other words, according to Lacan, one’s understanding of one’s body occurs only in conjunction with an organization in language and image that begins in the mirror stage, and is further complicated by the next stage of ego formation-entrance into the Symbolic order. Irigaray agrees with Lacan that how we understand our biology is largely culturally influenced-thus does she accept the idea of an imaginary body. Irigaray employs the Lacanian imaginary body in her discussions about Western culture’s bias against women. Irigaray argues that, like people, cultures project dominant imaginary schemes which then affect how that culture understands and defines itself. According to Irigaray, in Western culture, the imaginary body which dominates on a cultural level is a male body. Irigaray thus argues that Western culture privileges identity, unity, and sight-all of which she believes are associated with male anatomy. She believes that fields such as philosophy, psychoanalysis, science and medicine are controlled by this imaginary. Three examples from her work illustrate her view. In Speculum of the Other Woman, Irigaray addresses Freud’s claim in his essay “Femininity” that little girls are only little men. She argues that Freud could not understand women because he was influenced by the one-sex theory of his time (men exist and women are a variation of men), and expanded his own, male experience of the world into a general theory applicable to all humans. According to Irigaray, since Freud was unable to imagine another perspective, his reduction of women to male experience resulted in viewing women as defective men. Another example is found in “Cosi Fan Tutti,” (in This Sex Which Is Not One) where Irigaray argues that Lacan’s ahistorical master signifier of the Symbolic order-the Phallus-is a projection of the male body. Irigaray argues that Lacan failed to diagnose the error of his predecessor, Freud, and similarly understood the world-and especially language-in terms of a one-sex model of sexuality and subjectivity. Although Lacan claims that the Phallus is not connected to male biology, his appropriation of Freud renders this claim false. A final example is found in “The Mechanics of ‘Fluids'” (also in This Sex Which Is Not One) where Irigaray argues that science itself is biased towards categories typically personified as masculine (e.g. solids as opposed to fluids). Irigaray believes that if women are not understood in Western culture, it is because Western culture has yet to accept alternate paradigms for understanding them. While selfhood begins in the mirror stage with the imaginary body, it is not solidified until one enters the Symbolic order. According to Lacan, the Symbolic order is an ahistorical system of language that must be entered for a person to have a coherent social identity. The Phallus is the privileged master signifier of the Symbolic order. One must have a relationship to the Phallus if one is to attain social existence. According to Lacan, infants in the mirror stage do not differentiate between themselves and the world. For example, an infant views him or herself as continuous with his or her mother, and this understanding of the mother-child relationship organizes the infant’s world. However, as the infant matures, he or she becomes aware that his or her mothers’ attention is not wholly directed toward the infant in a reciprocal manner. The mother participates in a larger social context dominated by the Symbolic order. The infant fantasizes that if he or she could occupy the role of the Phallus-the master signifer of that Symbolic order-he or she could regain the full attention of the mother. However, this is impossible. In exchange for giving up this fantasy-which the Father demands of the child in the Oedipus complex-the infant gains his or her own relationship to the Phallus. The infant must break with the mother (nature, pre-symbolic) in order to become a subject (culture, symbolic order). One among many unique claims of Lacan’s is that the infant acquires sexual difference in his or her relationship to the Phallus. According to Lacan, sexual difference is not about biological imperative (e.g. if you have a penis you are male, if you have a vagina you are female), it is about having one of two types of relationship to the Phallus-having or being the Phallus. Hence, in the Lacanian view, the body as humans understand it is something that is constructed in the mirror stage, and sexually differentiated in the entrance to the Symbolic order. Irigaray critically appropriates this radical description of sexual difference. She discusses the linguistic character of sexual difference in a manner similar to Lacan in This Sex Which Is Not One. Irigaray is more concerned with how culture-and language as a product of culture-understands sexual difference and subjectivity than with arguing that truths about sexual difference or subjectivity emerge out of biology itself. However she distances herself from Lacan in two key manners. First, Irigaray disagrees with Lacan’s depiction of the Symbolic order as ahistorical and unchanging. Irigaray believes that language systems are malleable, and largely determined by power relationships that are in flux. Second, Irigaray remains unconvinced by Lacan’s claims that the Phallus is an ahistorical master signifier of the Symbolic order that has no connection to male anatomy. In “Cosi Fan Tutti,” she argues that the Phallus is not a purely symbolic category, but is ultimately an extension of-and reinforcement of-Freud’s description of the world according to a one-sex model. According to Irigaray, the Phallus as the master signifier (that can be traced back to male anatomy) is evidence that the Symbolic order is constructed and not ahistorical.

b. Philosophy

Irigaray is also influenced by her extensive study of the history of philosophy. Texts such as Speculum of the Other Woman and An Ethics of Sexual Difference demonstrate her command of the philosophical canon. Speculum of the Other Woman discusses the elision of all things feminine in traditional thinkers such as Aristotle, Descartes, Kant, and Hegel. An Ethics of Sexual Difference also discusses the elision of the feminine, but specifically from the perspective of ethical relationships between men and women. An Ethics of Sexual Difference addresses thinkers as diverse as Plato, Merleau-Ponty, Spinoza, and Levinas. Irigaray is also writing a series of texts devoted to the four elements. The elemental works Marine Lover of Friedrich Nietzsche and The Forgetting of Air in Martin Heidegger are sustained discussions of the exclusions implemented by key male philosophers.

No one philosopher can be identified as influencing Irigaray. She appropriates from various thinkers while maintaining a critical distance. For example, her method of mimesis resembles Derridian deconstruction. However, she also criticizes Derrida’s deconstruction of the category “woman” (see Derrida’s Spurs) in Marine Lover. As another example, she agrees with Heidegger that every age has a concept that underlies and informs its beliefs, but is radically unknown to it. For Heidegger it was “Being,” for Irigaray it is “sexual difference.” Like Heidegger, she wants to investigate the concept that Western culture takes to be self-evident in order to show that it is unknown to us. However she is critical in The Forgetting of Air in Martin Heidegger of Heidegger’s exclusion of women. One can also find Levinasian (An Ethics of Sexual Difference), Hegelian (I love to you) or Marxist (This Sex Which Is Not One, “Women on the Market”) undertones in Irigaray’s discussions of ethics and dialectical thinking. While she is clearly influenced by the history of philosophy, her own project of creating a new space for redefining women does not permit her to privilege any one philosophical approach.

4. Major Themes

a. Mimesis

Irigaray describes herself as analyzing both the analysts and the philosophers. Perhaps the most famous critical tool employed by Irigaray is mimesis. Mimesis is a process of resubmitting women to stereotypical views of women in order to call the views themselves into question. Key to mimesis is that the stereotypical views are not repeated faithfully. One example is that if women are viewed as illogical, women should speak logically about this view. According to Irigaray, the juxtaposition of illogical and logical undermines the claim that women are illogical. Or if women’s bodies are viewed as multiple and dispersed, women should speak from that position in a playful way that suggests that this view stems from a masculine economy that values identity and unity (e.g. the penis or the Phallus) and excludes women as the other (e.g. lack, dispersed, or “nothing to see”). This type of mimesis is also known as strategic essentialism. Irigaray’s essay “This Sex Which Is Not One,” in the text of the same name, provides several clear examples of this method.

According to Irigaray, the very possibility of repeating a negative view unfaithfully suggests that women are something other than the view expressed. Irigaray repeats the views because she believes that overcoming harmful views of women cannot occur through simply ignoring the views. True to the methodology of psychoanalysis, she believes that negative views can only be overcome when they are exposed and demystified. When successfully employed, mimesis repeats a negative view-without reducing women to that view-and makes fun of it such that the view itself must be discarded. Irigaray’s wager in utilizing mimesis with regard to female subjectivity is as follows. Male dominance has defined Western culture for centuries. If a new form of subjectivity comes into being out of the death of the modern, transcendental subject, and we have never really investigated or mimetically engaged with the deformed, female form of subjectivity that accompanied and sustained the male form, then what would prevent the logic of master/subject/male and slave/other/female from repeating itself? According to Irigaray, the logic will not be altered until we call attention to the fact that subjectivity has changed before when male dominance has not. We must ask after the feminine other. Irigaray believes that only by asking after the other through mimesis will it be possible to affect a paradigm shift. Irigaray therefore speaks from the silenced position of women in order to (a) challenge the authority of either the negative view or the repression by revealing that position to be nothing more than a fabrication (b) show how the woman/body has been excluded by either revealing the stereotypical view to be false or by inciting the excluded woman/body to speak and (c) thereby force a shift in the conception of female subjectivity and the body. Irigaray employs mimesis because she believes that a ‘second sex’ cannot exist in its own right (or with a positive form of identity as opposed to being viewed as a deformed version of male identity) until we have not only challenged, but also passed back through the oppressive formulation of sexual difference in contemporary Western culture.

b. Novel Language and Utopian Ideals

While the goal of mimesis is to problematize the male definition of femininity to such a degree that a new definition of and, ultimately, an embodied subject position for women can emerge, Irigaray says in her earlier work that she will not prescribe in advance either the definition or the subject position. In This Sex Which Is Not One, Irigaray clearly indicates that she will not redefine femininity because it would interfere with women redefining themselves for themselves. Further, she believes that she cannot describe the feminine (e.g. female subjectivity, the female imaginary body) outside of the current, male definitions without further disrupting the male definitions of women. A new definition for women has to emerge out of a mimetic engagement with the old definitions, and it is a collective process.

Irigaray is, however, willing to provide material to help ignite the process of redefinition. The material she offers varies from new concepts about religion and bodies-expressed through both the novel use of existing words and the creation of new words-to utopian ideals. One example of a new concept that she puts into play through novel language is her discussion of the sensible/transcendental and female divinity. Irigaray introduces these concepts in order to disrupt male dominance in religion. Irigaray follows Feuerbach in interpreting the divine as an organizing principle for both identity and culture. Religion is thus viewed as caught up in power and culture. Irigaray specifically targets male dominated religions that posit a transcendental God. She believes that these religions reinforce male dominance and the division of the world into male/subject and female/body. She suggests that in place of a religion that focuses on a transcendent God, we construct a divinity that is both sensible and transcendental. In other words, given the connection between religion and culture, and the manner in which the mind/body split has fallen out along gender lines, why not propose a vision of divinity that will help Western culture overcome its dualisms and prejudices about those dualisms. Irigaray is not prescribing the sensible/transcendental as a new religion to be implemented and followed, but merely placing it in circulation as a creative impetus for change. An example of utopian ideals can be found in Sexes and Genealogies, thinking the difference, and je, tu, nous. In these texts, Irigaray describes civil laws that she believes would help women achieve social existence (mature subjectivity) in Western culture. In one law she suggests that virginity needs to be protected under the law so that women have control over their own sexuality. She also describes new ways in which the mother/daughter relationship should be legally protected, and outlines how mothers and daughters can communicate with each other so that female subjectivity can be further developed. When these texts were first published, these views were widely interpreted as suggestions intended to initiate discussions between women (utopian ideals) and not as prescriptions for social change. While Irigaray’s later work has complicated this interpretation, it is still widely accepted.

c. Mother/Daughter Relationships

According to Irigaray, while it is necessary to alter cultural norms, it is equally as important to address the problematic nature of individual relationships between women-especially the mother/daughter relationship. To emphasize how mother/daughter relationships are sundered in contemporary Western culture, Irigaray turns to Greek mythology. For example, she discusses the myth of Demeter, the goddess of the earth (agriculture), and her daughter Persephone. In the myth, Zeus, Persephone’s father, aids his brother Hades, king of the underworld, to abduct the young Perspephone. Hades has fallen in love with Persephone and wants her to be queen of the underworld. When Demeter learns that her daughter is missing, she is devastated and abandons her role as goddess of the earth. The earth becomes barren. To reestablish harmony in the world, Zeus needs Demeter to return to her divine responsibilities. Zeus orders Hades to return Persephone. However, Persephone is tricked into eating a pomegranate seed that binds her to Hades forever. Under the persuasion of Zeus, Hades agrees to release Persephone from the underworld for half of each year. Irigaray reads this myth as an example of both a positive mother/daughter relationship, and the success of men at breaking it apart. Demeter and Persephone love each other and Demeter strives to protect her daughter. However, in this myth they are ultimately at the mercy of the more powerful males. The myth is also an example of men exchanging women as if they were commodities. Zeus conspires with his brother and, in effect, gives his daughter away without consulting either Persephone or Demeter. Irigaray believes that myths tell us something about the deterioration of the mother/daughter relationship and the manner in which men have traditionally controlled the fate of women-whether they are wives, daughters, sisters, or mothers. Irigaray utilizes myth to suggest that mothers and daughters need to protect their relationships and strengthen their bonds to one another.

The need to alter the mother/daughter relationship is a constant theme in Irigaray’s work. While she believes that women’s social and political situation has to be addressed on a global level, she also thinks that change begins in individual relationships between women. Thus she stresses the need for mothers to represent themselves differently to their daughters, and to emphasize their daughter’s subjectivity. For example, in je, tu, nous, Irigaray offers suggestions for developing mother-daughter relationships such as displaying images of the mother-daughter couple, or consciously emphasizing that the daughter and the mother are both subjects in their own right. Changing relationships between mothers and daughters also requires language work.

d. Language

Since Irigaray agrees with Lacan that one must enter language (culture) in order to be a subject, she believes that language itself must change if women are to have their own subjectivity that is recognized at a cultural level. She believes that language typically excludes women from an active subject position. Further, inclusion of women in the current form of subjectivity is not the solution. Irigaray’s goal is for there to be more than one subject position in language.

In order to prove that language excludes women from subjectivity, Irigaray conducted research that links the exclusion of women from subjectivity in Western culture to the speech patterns of men and women. She concluded that general speech patterns specific to each sex do exist and that women often do not occupy the subject position in language. She argues that in language experiments, women were less willing to occupy the subject position. Referring to the French language as a clear example-even though she believes that the structure of the English language does not exempt it from sexism-she discusses the dominance of the masculine in both the plural and the neuter, which takes the same form as the masculine. Irigaray argues that objects of value, such as the sun or God, are typically marked with the masculine gender while less important objects are feminine. Since language and society mutually affect each other, Irigaray believes that language must change along with society. Failure to see the importance of changing language is an impediment to real change. According to Irigaray, it is crucial that women learn to occupy the position of “I” and “you” in language. Irigaray views the “I” and the “you” as markers of subjectivity. In her text I love to you, Irigaray describes how she determined that women do not occupy the subject position. She conducted an experiment where she gave her subjects a noun (e.g. enfant) and asked her test subjects to use the noun in a sentence as a pronoun (il or elle). The majority of both men and women consistently chose “il”. She noted in another experiment, where she gave a sequence that implied the use of “elle” (e.g. robe-se-voir), that both sexes avoided using “elle” (she) and “elle se” (she herself) as an active subject. In contrast, when she gave a sequence that implied the use of il as a subject, it was almost always used. Further, Irigaray discovered that young girls seek an intersubjective dialogue with their mothers, but that their mothers did not reciprocate. Irigaray concludes from her research that women are not subjects in language in the same way as men. She believes that men and women do not produce the same sentences with similar cues, they use prepositions differently, and they represent temporality in language differently. Irigaray seeks for men and women to recognize each other in language as irreducible others. She argues that this cannot happen until women occupy the subject position, and men learn to communicate with other subjects. Irigaray believes that a language of ‘indirection’ could help bring this to fruition. She describes this in her book I love to you. The title itself is an example of this language of indirection. Saying “I love to you” rather than “I love you” is a way of symbolizing a respect for the other. The “to” is a verbal barrier against appropriating or subjugating the other. Speaking differently in this manner is an integral part of Irigaray’s general project to cultivate true intersubjectivity between the genders. However, she does not put forth a definitive plan for implementing this change in language.

e. Ethics

While ethics is a constant theme throughout her work, Irigaray’s text An Ethics of Sexual Difference is devoted to this theme. In this text, Irigaray intertwines essays of her own on the ethics of sexual difference with dialogues that she has created between herself and six male philosophers: Plato, Aristotle, Descartes, Spinoza, Merleau-Ponty and Levinas. Irigaray groups the dialogues into four sections that each begin with an essay of her own about sexual difference and love. Her own essay signals what themes she will address with regard to each of the philosophers she discusses. Irigaray utilizes her analyses of the male philosophers to discuss the following themes which are essential to her ethics: creative relationships between men and women that are not based in reproduction, separate ‘places’ for men and women (emotional and embodied), wonder at the difference of the other, acknowledgement of finiteness and intersubjectivity, and an embodied divinity.

In the first section, which engages Plato and Aristotle, Irigaray emphasizes that an ethical love relationship must be creative independent of procreation, and that both men and women need to have a place for themselves (be embodied individuals) that is open to, but not subsumable by, the other. In the second section, using Descartes and Spinoza, she argues that ethical love cannot occur between men and women until there is respect and wonder for the irreducible difference of the other, and an admittance and acceptance of one’s finiteness. In the third section, in which there is no engagement with a male philosopher, Irigaray describes how the infinite is essential to love between men and women. She believes that it is unethical that women have not had access to subjectivity, and that the universals of our culture have been dominated by a male imaginary. She says that ethics requires that men and women understand themselves as embodied subjects. In the fourth and final section, Irigaray discusses Merleau-Ponty and Levinas. She argues that if ethical relationships are to occur between men and women, men must overcome nostalgia for the womb. Thus will they develop their identity, and open up a space for women to create their own. Further, Irigaray believes that we must think both otherness and divinity in conjunction with embodiment. She believes that separating mind and body is unethical insofar as it perpetuates the division in culture between man/mind and woman/body. Ethics involves thinking of otherness and divinity in terms of the sensible/transcendental. At the end of her An Ethics of Sexual Difference, it is clear that Irigaray does not believe that Western culture is ethical, and that the primary reason is its treatment of women and nature. She believes that nothing short of altering our views of subjectivity, science, and religion can change this situation. Men and women must work together to learn to respect the irreducible difference between them. Women must become full subjects, and men must recognize that they are embodied. Further, ethical love relationships are based in respect for alterity and creativity outside of reproduction. Her text I love to you, which focuses on both language and ethics, is a clear example of how her discussion of ethics can also be developed from a Hegelian perspective.

f. Politics

Irigaray refuses to belong to any one group in the feminist movement because she believes that there is a tendency for groups to set themselves up against each other. When groups within the women’s movement fight each other, this detracts from the overall goal of trying to positively alter the social, political, and symbolic position of women. Irigaray models solidarity among women in her unwillingness to belong exclusively to one group.

Irigaray is particularly active in the feminist movement in Italy. Texts such as I love to you, Democracy Begins Between Two, and Two Be Two were all inspired by and, at various moments, give accounts of Irigaray’s experience with the Italian women’s movement. An example of Irigaray’s most recent collaborations with Italy, and a testimony to her commitment to her ideas, is her collaboration with the Commission for Equal Opportunities for the region of Emilia-Romagna. She was invited by this region to educate its citizens about her political ideals. Her text, Democracy Begins Between Two, was a part of that collaboration insofar as it was the theoretical work behind her role as adviser. In that text she also describes how she and Renzo Imbeni co-authored a “Report on Citizenship of the Union.” This report argued for rights based on sexual difference and was submitted to the European Parliament for ratification.

5. Criticisms

a. Strategic Essentialism

Irigaray’s use of strategic essentialism has been criticized as essentialism itself-or of endorsing the belief that social behavior follows from biology. The appearance of her translated work in the United States was met with great opposition. She was read as further naturalizing women at a time when women were benefiting both politically and socially from arguing that biology did not matter. Irigaray and her supporters defended her engagement with essentialist views as a strategy. They argued that when Irigaray seeks to alter the exclusion of the feminine by repeating or reiterating naturalizing discourses about female bodies, she is not suggesting a return to a lost female body that pre-exists patriarchy. Rather, she is employing her strategy of mimesis. While many contemporary interpreters now accept this view, strategic essentialism remains a controversial aspect of Irigaray’s work.

b. Privileges Psychological Oppression

Irigaray has been criticized-especially by materialist feminists-on the grounds that she privileges questions of psychological oppression over social/material oppression. The concern is that the psychoanalytic discourse that Irigaray relies upon-even though she is critical of it-universalizes and abstracts away from material conditions that are of central concern to feminism. Materialist feminists do not believe that definitive changes in the structure of politics can result from the changes Irigaray proposes in psychoanalytic theories of subject formation. However, Irigaray’s goal to challenge psychoanalytic theory and to change the definition of femininity evinces an agreement with the materialist position. Both agree that the ahistorical, overly universalized character of traditional psychoanalytic theory must be rejected. Further, Irigaray argues that focusing on language work and on altering allegedly intractable structures does not mean that women have to ignore material conditions. In This Sex Which Is Not One, Irigaray says that simultaneous with her challenges to the symbolic order, women must fight for equal wages, and against discrimination in employment and education. Irigaray recognizes that it is important to find ways to challenge the social and economic position in which women find themselves. But focusing exclusively on women’s material or economic situation as the key to change will only-at best-grant women access to a male social role insofar as it will not change the definition of women. Irigaray’s response to first changing material conditions would be that it would leave the question of a non-patriarchal view of female identity untouched. Due to the force of the oppression of women, it is the definitions that have to be changed before women, as distinct from men, will attain a social existence.

c. Elides Differences

Related to the materialist critique is the question of whether or not Irigaray’s psychoanalytic approach can account for real differences between women. Irigaray often discusses a subject position for women and a new definition of women. A common question asked of Irigaray is whether or not a universal definition for women is desirable considering the real differences between women. More specifically, if Irigaray insists on a universal subject position for women, will it be exclusively determined by first world, white, middle class women? Can her universal successfully include the experiences of minority women, second and third world women, and economically disadvantaged women? Or does it create further exclusion among the excluded themselves? Irigaray’s interpreters remain divided on this question.

d. Opaque Writing Style

Irigaray is often criticized along with other French feminists, such as Julia Kristeva, for the opacity of her writing style. Based on her writing style, she has been dismissed as elitist. Irigaray’s writing is undeniably challenging and complex. But, the difficulty of her work can be equally productive as it is labor intensive. Irigaray’s opacity can be viewed as fruitful when understood in conjunction with one mode of writing that she assumes-that of an analyst. In this style of writing, Irigaray not only will not assume the position of a master-knower who imparts knowledge in a linear manner, she also considers her readers’ reactions to her work to be an integral part of that work. Her alleged failure to be clear, or to give a concrete, linear feminist theory, are invitations for readers to imagine their own vision for the future. Like the psychoanalytic session, her texts are a collaboration between writer (analyst) and reader (analysand). Irigaray believes that, through writing in this style, she can take culture as a whole as her analysand.

e. Exclusive Ethics

Irigaray’s view of ethics is criticized because she describes the quintessential ethical relationship using a man and a woman. The question arises of whether or not Irigaray is suggesting that the heterosexual couple is the model for ethical relationships. Since it is unclear whether or not Irigaray’s view can be applied to other types of relationships (e.g. same sex friendships or same sex love relationships), this point of criticism remains unresolved. Related to this critique is a concern that Irigaray’s emphasis on sexual difference and male/female relationships also prevent her from accounting for non-traditional family arrangements.

f. Later Work

Irigaray’s most recent work raises the final point of controversy. In her earlier work, Irigaray refuses to give a new definition of women because she thinks that women must give it to themselves. However, in her most recent work she has developed laws that she submitted to the European Parliament for ratification. Irigaray’s interpreters debate about the relationship between her early work and her most recent texts. Is there continuity between the early and the later position? Or has Irigaray abandoned her earlier project? A spectrum of interpretations are available with no final answer.

6. References and Further Reading

a. English Translations

  • Irigaray, Luce. An Ethics of Sexual Difference. Trans. Carolyn Burke and Gillian C. Gill. Ithaca: Cornell UP, 1993.
    • Mimetic engagement with Plato, Aristotle, Descartes, Spinoza, Merleau-Ponty, and Levinas on the question of ethics. Irigaray elaborates here her own vision for ethical relationships.
  • Between East and West: From Singularity to Community. Trans. Stephen Pluhácek. New York: Columbia UP, 2002.
    • Draws on Eastern philosophy and meditative techniques such as yoga to suggest new approaches to the question of sexual difference.
  • Democracy Begins Between Two. Trans. Kirsteen Anderson. New York: Routledge, 2000.
    • Inspired by a partnership with the Commission for Equal Opportunities for the region of Emilia-Romagna in Italy, this text describes civil rights for women that would grant them an equal social position to men. This text also includes the Report on Citizenship of the Union by Renzo Imbeni. This report was written in collaboration with Irigaray and submitted to the European Parliament for ratification.
  • Elemental Passions. Trans. Joanne Collie and Judith Still. New York: Routledge, 1992.
    • One text in Irigaray’s series of elemental works. Addresses the relationship between men and women within the context of the elements and the senses.
  • je, tu, nous: towards a culture of difference. Trans. Alison Martin. New York: Routledge, 1993.
    • A series of essays that address diverse issues such as civil rights for women and prejudices in biology about the mother-fetus relationship.
  • I love to you: sketch of a possible felicity in history. Trans. Alison Martin. New York: Routledge, 1996.
    • Strategic engagement with Hegel in which Irigaray appropriates his use of dialectic in order to describe how men and women are both individuals and members of their gender. Also includes an extensive discussion of the language of indirection that Irigaray believes facilitates ethical relationships between men and women.
  • The Irigaray Reader. Ed. Margaret Whitford. Cambridge: Blackwell, 1991.
    • Useful compilation of essays, some of which are found in the texts listed here.
  • Marine Lover of Friedrich Nietzsche. Trans. Gillian C. Gill. New York: Columbia University Press, 1991.
    • One text in Irigaray’s elemental series, this text is a strategic engagement with Nietzsche and Derrida on the elision of femininity.
  • Sexes and Genealogies. Trans. Gillian C. Gill. New York: Columbia University Press, 1993.
    • Compilation of essays that address themes as diverse as how to alter the psychoanalytic session to descriptions of the sensible/transcendental.
  • Speculum of the Other Woman. Trans. Gillian C. Gill. Ithaca: Cornell University Press, 1985.
    • Irigaray’s doctoral dissertation. This text is a complex engagement with the history of philosophy and psychoanalytic theory.
  • The Forgetting of Air in Martin Heidegger. Trans. Mary Beth Mader. Austin: University of Texas Press, 1999.
    • One text in Irigaray’s elemental series. This text is a strategic engagement with the philosopher Martin Heidegger.
  • Thinking the Difference: For a Peaceful Revolution. Trans. Karin Montin. New York: Routledge, 1994.
    • Compilation of essays on diverse themes. Similar in structure to je, tu, nous.
  • This Sex Which Is Not One. Trans. Catherine Porter. New York: Cornell University Press, 1985.
    • Compilation of essays that discuss themes as diverse as where Lacanian theory went wrong, what mimesis is, and how to give a Marxist critique of the exchange of women in Western culture.
  • To Be Two. Trans. Monique M. Rhodes and Marco F. Cocito-Monoc. New York: Routledge, 2001.
    • Later work. Further exploration of the question of difference and alterity.
  • To Speak Is Never Neutral. New York: Routledge, 2000.
    • Sustained discussion of language. Studying the language of both mentally ill and normal subjects, Irigaray argues that language is never deployed in a completely neutral manner.
  • Why Different?. Trans. Camille Collins. Ed. Luce Irigaray and Sylvere Lotinger. New York: Semiotext(e) Foreign Agent Series, 2000.
    • A compilation of interviews with Irigaray about select work written in the 80’s and 90’s such as Sexes and Genealogies and Language is Never Neutral.

b. Suggested Further Reading

  • Chanter, Tina. Ethics of Eros: Irigaray’s Re-Writing of the Philosophers. New York: Routledge, 1995.
    • Thoroughly discusses philosophical influences on Irigaray’s work. Argues that comprehending the philosophical influences on Irigaray highlights her innovative ideas about the now passe sex/gender distinction.
  • Cheah, Pheng and Elizabeth Grosz. “The Future of Sexual Difference: An Interview with Judith Butler and Drucilla Cornell.” Diacritics, no. 28.1 (1998): 19-41.
    • Highlights central disagreements between prominent feminist thinkers about Irigaray’s work.
  • Freud, Sigmund. Standard Edition of the Complete Psychological Works of Sigmund Freud. Trans. James Strachey in collaboration with Anna Freud. 24 vols. London: Hogarth Press and the Institute of Psychoanalysis, 1953-1974.
  • Freud, Sigmund. The Freud Reader. Ed. Peter Gay. NewYork: W.W. Norton & Co., 1989.
    • Accessible compilation of Freud’s work. Of particular interest are “The Ego and the Id,” “Femininity,” “Mourning and Melancholia,” and “Three Essays On The Theory of Sexuality.” For unabridged versions of texts, consult the standard edition listed above.
  • Fuss, Diana. Essentially Speaking: Feminism, Nature and Difference. New York: Routledge, 1989.
    • Interesting discussion of strategic essentialism. Includes a discussion of Irigaray, pp. 55-72.
  • Gatens, Moira. Imaginary Bodies: Ethics, Power, and Corporeality. New York: Routledge, 1996.
    • Useful discussion of how the imaginary body plays out at a cultural level.
  • Grosz, Elizabeth. Volatile Bodies: Towards a Corporeal Feminism. Indianapolis: Indiana University Press, 1994.
    • A central text in philosophy of the body and the overcoming of dualisms.
  • Lacan, Jacques. Ecrits. Trans. Alan Sheridan. New York: W.W. Nortion & Co., 1977.
    • An accessible compilation of key essays in Lacanian thought.
  • Feminine Sexuality. Ed. Mitchell, Juliet and Jacqueline Rose. Trans. Jaqueline Rose. New York: W.W. Norton & Co., 1985.
    • An accessible compilation of key essays by Lacan on feminine sexuality.
  • Lorraine, Tamsin. Irigaray and Deleuze: Experiments in Visceral Philosophy. Ithaca: Cornell University Press, 1999.
    • Very clear description of difficult aspects of Irigaray’s thought. Interesting thesis about connections with Deleuze and Guatarri.
  • Schor, Naomi. “This Essentialism Which is Not One.” Ed. Burke, Carolyn, Naomi Schor, and Margaret Whitford. New York: Columbia University Press, 1994.
    • Very famous and useful discussion of the different kinds of essentialism.
  • Whitford, Margaret. Luce Irigaray: Philosophy in the Feminine. New York: Routledge, 1991.
    • Whitford writes about the psychoanalytic influence on Irigaray’s work. Whitford fleshes out Irigaray’s appropriation of key psychoanalytic themes and clearly explains complex aspects of Irigaray’s work.

Author Information

Sarah K. Donovan
Email: Sarah.Donovan@villanova.edu
Villanova University
U. S. A.

Interventionism

The theory of interventionism examines the nature and justifications of interfering with another polity (that is, political organization) or with choices made by individuals. Interventionism is characterized by the use or threat of force or coercion to alter a political or cultural situation nominally outside the intervenor’s moral or political jurisdiction. It commonly deals with a government’s interventions in other governments’ affairs–and is thus an aspect of political philosophy, but it can also be extended to interventions in others’ cultures, religions, lifestyles, and economic activities–and thus can fit into applied ethics, covering such issues as paternalism, imperialism, and topics in business, medical, and environmental ethics.

Table of Contents

  1. A Note on Methodological Considerations
  2. What Does Interventionism Deal With?
  3. Arguments for Interventionism
    1. Epistemological Reasons
    2. Political Realism
    3. Rights Theories
  4. Non-Interventionist Doctrines
  5. Legal Positivism and Non-Interventionism
  6. Isolationism
  7. Economic Interventionism

1. A Note on Methodological Considerations

The context of interventionism requires an epistemological consideration. A methodological individualist will argue that it involves interventions in the lives of individuals; that essentially it does not matter whether the individuals are part of one’s political entity or belonging to another–interventionism applies solely to individuals. A methodological holist on the other hand will identify the object of interventionism as groups–cultural, political, religious, national, and so on. Whilst the methodological individualist will focus on issues that infringe or attempt to alter individuals’ rights or choices, the holist will draw attention to issues affecting groups and their identities. Methodological compatibilism holds that interventions do affect individual rights or choices but individuals also identify themselves with groups who can also be separately affected by interference. For example, demanding that all female bank employees wear blue dresses affects the individual’s choice of clothes in the workplace but also interferes with the banking corporation’s right to determine its own standard of dress.

2. What Does Interventionism Deal With?

Beyond epistemological considerations interventionism commonly deals with the justifications of governments to interfere in (a) the lives of its own civilian population–domestic interventions, and (b) the activities of other nations–foreign interventions. In the case of domestic interventionism that apparatus is the police force (or the army acting as a domestic policing force as with the British army in Northern Ireland 1969-date); in the case of international interventionism it is the army. In either scenario interventionism implies the potential or actual use of coercion.

Reasoning or persuading another group of people that a chosen policy, or a certain tradition, is wrong either morally (given a certain standard) or on consequentialist considerations (the policy will not achieve what it’s meant to achieve) are not examples of interventionism. Reasoning includes all forms of rhetoric, example, persuasion, exhortation, counseling, discourse, and so on. The other group changes policy or tradition only if it desires –is persuaded– to change. They do so voluntarily. On the other hand, it may be claimed that in attempting to persuade others to change their minds is a form of interventionism. But this definition then becomes too broad to be of use–merely speaking to another or judging their behavior in the absence of any threats, coercion, or force, cannot be termed interventionist, for its goal is not to interfere but to explain possible choices.

Breaking diplomatic relations also does not imply the use of force and hence is not a form of interventionism. This is an essentially peaceful attempt to alter another government’s actions in effect by removing acknowledgement of its international political status.

Voluntary decisions on the part of a people may change a nation’s values. Trading in goods and ideas can change a society, yet such changes should not, for the most part, be deemed interventionist. Changes in culture and language that result from the voluntary decisions of many individuals cannot be tied to any form of interventionism, for the policy of interventionism is a policy of threatening or using coercion or force of some description. Whether such examples exist is hard to ascertain, for commonly the expansion of freedom of trade that has led to an exchange of ideas and hence of cultures is historically almost universally connected with imperialist policies that do aim at explicit forms of intervention. Following World War Two (1939-45) when Western imperialism dwindled as a political value, it can be argued that various societies (e.g., Taiwan, Malaysia) voluntarily took up what are referred to as ‘Western values’ through the influence of non-violent commercial ventures. However, critics may point out that previous military interventions could be considered as necessary precursors to changes in the culture of the people.

Coercion is a form of interventionism. Coercion implies offering choices that normally would not be accepted, but which leave the individual to choose the option preferred by the coercer, or by default one that is less acceptable. For example: if a knife is held to your throat and you are given the option to hand over your car keys or die, you are being coerced; if a government demands that you open up your borders to a free trade in opium or face armed conflict (China, Opium Wars with Britain) your nation is being coerced.

Domestic interventions entail restricting the choices of individuals or groups or altering their activities through legislative coercion. Limiting freedom of speech or trade, restricting occupational access to certain religious groups, or enforcing the draft are examples of interventions in the choices of individuals or groups, while increasing beer taxes are examples of altering choices through legislative frameworks; failure to comply may incur penalties.

On the international level, interventionist activities involve threatening, coercing, or forcing another group or nation to alter its behavior or change its government or policies. International interventionism can incorporate direct activities such as the use or threat of war, as well as indirect activities such as assassination, subversion, and economic embargoes of all descriptions (complete or partial blockades, transport restrictions, etc.).

General goals of international interventionism include attempting to change: governments (e.g., Iran, 1979); people’s expectations of governmental activities; general attitudes of just conduct not held as appropriate in the wider international community (e.g., South African Apartheid). Specific goals can include changing a state apparatus or its personnel (the government), to remove a particular statesperson or group, to change specific or general policies, to alter cultural or political beliefs, or even to alter patterns of economic and population distributions.

3. Arguments for Interventionism

Utilitarian or consequentialist prescriptions are open-ended: they could support interventions either generally or in particular circumstances, depending on expected results. Other positions offer more principled cases for interventionism, for example on epistemological grounds, political realism or rights analyses.

a. Epistemological Reasons

Intervening can be justified on grounds of the government possessing better knowledge than individual agents, or from paternalistic reasons, which presume the target agents are incapable of making informed choices themselves. To that extent, governments may legislate a range of programs from ensuring that people take out adequate insurance or invest sufficiently into pensions to requiring health checks or continued education; or economic interventions could be justified on the grounds that economic agents (investors, corporations, banks) do not act in the long term interest of the nation, whereas civil servants who are deemed above the profit motive can take the longer view (as held by John Maynard Keynes 1883-1946, for example).

b. Political Realism

Political realism is defined by the primacy of national interest in international affairs. This can be viewed as either a moral duty or as a description of the ruling state of affairs. Policy prescriptions involve pursuing interventions as they benefit the national interest. The theory implies that states should be left alone to seek and to defend their own interests. In the realist tradition, of which there are many shades, such supporters include Thucydides, Machiavelli, and Hobbes.

Political realism offers a broad interventionist doctrine that can justify intervening for reasons of economic profit as well as for balance of power considerations. The history of the British Empire provides many examples of both justifications (Cf. its interventions in European politics in the War of the Spanish Succession 1702-13 and the War of the Austrian Succession 1740-8), whilst post-war US foreign policy offers more recent case studies (Vietnam War 1961-73 and the Gulf War 1990-91). It is captured by Thucydides’ description of the Pelopennesian War, that it was Spartan “fear of Athenian growth” that caused the war. Realists often invoke consequentialist concerns regarding the developing international state of affairs–that should the foreign power to grow unchecked, a war would ensue, or economic resource bases would be lost, or an invasion could occur. The Schlieffen Plan, prior to the First World War (1914-18) is another useful example of balance of power considerations.

Political realism assumes that interests are to be maintained through the exercise of power, and that the world is characterized by competing power bases (nation states [Hegel], for example, or classes [Marx]). Political realism is essence reduces to the ethical principle that might is right.

c. Rights Theories

Some claim that rights only pertain to individuals, and that nations and governments only acquire any rights or privileges by virtue of the civilians giving them power. Rights theorists thus argue that individual rights supersede or ‘trump’ the rights or privileges of governments. On this basis, interventions in support of rights are morally justifiable. For example, if a foreign government tyrannizes its civilians, an intervention to support their rights can be justified, for the moral status of rights does not end at political borders. However, what needs to be considered is at what point do rights violations justify an intervention, or would an intervention do more harm than good? Second is the argument from hypocrisy–can a nation be justified in intervening in another’s affairs when it does not have a clean slate of its own? Finally, given that rights are being violated, is a government guilty of moral failure if it fails to intervene, and if so, is that moral failure a failure of its duty or of virtuous behavior?

4. Non-Interventionist Doctrines

Non-interventionism is the theory that one does not have any moral justification in intervening in others’ affairs. On a rights based analysis, or from Kantian considerations of duty, this may be considered an absolutist prohibition on the grounds that it either violates others’ rights to freedom or respect due them as individual moral entities. Consequentialists may infer from evidence that interventionism is always counter-productive and should not be practiced. In contemporary ethical analysis, a rule utilitarian may claim that since interventions never work (an empirical, testable hypothesis), ethical considerations aimed at maximizing the greatest good for the greatest number should employ non-interventionism on principle. However, act utilitarians may agree that historically interventions have not worked, but that does not mean that they will not in a future situation, and hence non-interventionism should not be held categorically.

As a political-economical doctrine, non-interventionism includes the economic doctrine of laissez-faire, which holds that governments should not intervene in the economic activities of individuals or corporations. Some thinkers, notably Herbert Spencer (1820-1903) have extended the doctrine to moral issues too, arguing, for example, that intervening in the plight of the poor only makes their condition worse by creating an atmosphere of dependency, rather than leaving them to independently struggle and find their own values. Other supporters of the economic laissez-faire doctrine do not go as far as Spencer; Friedrich von Hayek argues (Constitution of Liberty, 1956) that governments do have responsibilities to the poor resulting from their duty to provide a general framework to ensure the smooth operation of the free market system.

On a broader view, non-interventionism is applied by John Stuart Mill in On Liberty; he claims that responsibility to others only goes so far as ensuring they know of the dangers that may befall them, but does not extend to actually physically restraining those who would knowingly injure themselves. In the international sphere, Mill (“Notes on Intervention” Collected Works) argues for a policy of self-determination: that other people be allowed to make their own mistakes, and hence forge their own paths to freedom; intervening paternalistically on their behalf will not be conducive to their learning the value of freedom in its own right. Such a stance can be used in a variety of issues including freedom of press and expression. For example, John Milton in Areopagitica argues: “And though all the winds of doctrine were let loose to play on the earth, so Truth be in the field, we do injuriously by licensing and prohibiting to misdoubt her strength. Let her and Falsehood grapple; who ever knew Truth put to the worse in a free and open encounter?”

5. Legal Positivism and Non-Interventionism

In the international sphere, legal positivists are commonly non-interventionists. Legal positivists, following Christian Wolff (1679-1754), argue that nation states possess absolute rights to political sovereignty and territorial integrity, which implies that national borders be inviolable. Wolff writes: “Nations are regarded as individual free persons living in a state of nature. For they consist of a multitude of men united into a state. Therefore since states are regarded as individual free persons living in a state of nature, nations must also be regarded in relation to each other as individual free persons living in a state of nature.” (Jus Gentium Methodo Scientifica Pertractatum Trans. Joseph Drake. Clarendon Press: Oxford, 1934, §2, p.9) The positivist theory of international relations implies that interventions would violate international borders; this position itself resolves into an absolutist doctrine that deems interventions should never be condoned and more pragmatic positions that permit some exceptions to the rule.

Positivist exceptions to non-interventionism emanate from humanitarian considerations that overwhelm nominally sacrosanct national borders, if the target state is violating basic human rights to such an extent that it can no longer be deemed a proper representative of its people. The type of interventionism supported depends on the theory of the state entertained.

If governments are viewed as instrumental institutions that exist to uphold the domestic rights of civilians, then a violation of its remit can warrant an intervention on behalf of the citizens. Michael Walzer in Just and Unjust Wars (1977) entertains this position, arguing that only in extreme cases of rights violations “that shock the moral conscience of mankind” (p.107), can interventions be supported. He gives the examples of genocide, mass murder or enslavement. Rights violations above this level, he implies, are not grounds for interventionism (e.g., removal of free movement, freedom of the press, etc).

A Hobbesian case for interventionism can be maintained by those who consider governments the sole and proper moral and legal authorities. Hobbes claims that individuals give up the rights that they possess in the state of nature (except the right of self-preservation) to the state (the ‘Leviathan’). He argues the State should be obeyed, even it is acting quite tyrannically, for the alternative –and the greater evil– is the state of war in which justice and morality do not hold. However, if a state acts to takes its civilians into the state of nature by governing incompetently or unjustly then the people have a right to form a new state. This allows the legal positivist to condone interventions where governments have obviously failed in their obligations and have brought war to the people through their ineptitude.

The third possible justification for the positivist is when a supra-legal body legislates in favor of an intervention. For example, the United Nations has the jurisdiction to pass a resolution of intervention, but it does not condone unilateral interventions. Positivists draw parallels here between governments arbitrating in domestic disputes and a world body acting to dissolve international disputes.

6. Isolationism

Isolationism is the political doctrine of non-involvement in foreign affairs. The state, it is argued, should confine its activities to its own jurisdiction, and therefore, what happens abroad is of no concern. Isolationism can be argued from a consequentialist perspective: that getting involved would only make matters (whatever those matters are) worse; or from an intrinsicist perspective similar to the legal positivist case, that national jurisdiction (and hence moral and political concerns) ends at the political borders.

7. Economic Interventionism

Government intervention in the economy was noted above. Whilst the effects and the principles are the subject matter of economics, philosophers can fruitfully examine the nature of the epistemological arguments used in the debates which involve considerations of methodological individualism versus holism, and a-priori versus a-posteriori reasoning.

Author Information

Alexander Moseley
Email: alexandermoseley@icloud.com
United Kingdom

Jain Philosophy

Jain_handJainism is properly the name of one of the religious traditions that have their origin in the Indian subcontinent. According to its own traditions, the teachings of Jainism are eternal, and hence have no founder; however, the Jainism of this age can be traced back to Mahavira, a teacher of the sixth century BCE, a contemporary of the Buddha. Like those of the Buddha, Mahavira’s doctrines were formulated as a reaction to and rejection of the Brahmanism (religion based on the Hindu scriptures, the Vedas and Upanisads) then taking shape. The brahmans taught the division of society into rigidly delineated castes, and a doctrine of reincarnation guided by karma, or merit brought about by the moral qualities of actions. Their schools of thought, since they respected the authority of the Vedas and Upanisads, were known as orthodox darsanas (‘darsanas‘ means literally, ‘views’). Jainism and Buddhism, along with a school of materialists called Carvaka, were regarded as the unorthodox darsanas, because they taught that the Vedas and Upanisads, and hence the brahman caste, had no authority.

Table of Contents

  1. Metaphysics
  2. Epistemology and Logic
  3. Ethics
  4. References and Further Reading

1. Metaphysics

According to Jain thought, the basic constituents of reality are souls (jiva), matter (pudgala), motion (dharma), rest (adharma), space (akasa), and time (kala). Space is understood to be infinite in all directions, but not all of space is inhabitable. A finite region of space, usually described as taking the shape of a standing man with arms akimbo, is the only region of space that can contain anything. This is so because it is the only region of space that is pervaded with dharma, the principle of motion (adharma is not simply the absence of dharma, but rather a principle that causes objects to stop moving). The physical world resides in the narrow part of the middle of inhabitable space. The rest of the inhabitable universe may contain gods or other spirits.

While Jainism is dualistic—that is, matter and souls are thought to be entirely different types of substance—it is frequently said to be atheistic. What is denied is a creator god above all. The universe is eternal, matter and souls being equally uncreated. The universe contains gods who may be worshipped for various reasons, but there is no being outside it exercising control over it. The gods and other superhuman beings are all just as subject to karma and rebirth as human beings are. By their actions, souls accumulate karma, which is understood to be a kind of matter, and that accumulation draws them back into a body after death. Hence, all souls have undergone an infinite number of previous lives, and—with the exception of those who win release from the bondage of karma—will continue to reincarnate, each new life determined by the kind and amount of karma accumulated. Release is achieved by purging the soul of all karma, good and bad.

Every living thing has a soul, so every living thing can be harmed or helped. For purposes of assessing the worth of actions (see Ethics, below), living things are classified in a hierarchy according to the kinds of senses they have; the more senses a being has, the more ways it can be harmed or helped. Plants, various one-celled animals, and ‘elemental’ beings (beings made of one of the four elements—earth, air, fire, or water) have only one sense, the sense of touch. Worms and many insects have the senses of touch and taste. Other insects, like ants and lice, have those two senses plus the sense of smell. Flies and bees, along with other higher insects, also have sight. Human beings, along with birds, fish, and most terrestrial animals, have all five senses. This complete set of senses (plus, according to some Jain thinkers, a separate faculty of consciousness) makes all kinds of knowledge available to human beings, including knowledge of the human condition and the need for liberation from rebirth.

2. Epistemology and Logic

Underlying Jain epistemology is the idea that reality is multifaceted (anekanta, or ‘non-one-sided’), such that no one view can capture it in its entirety; that is, no single statement or set of statements captures the complete truth about the objects they describe. This insight, illustrated by the famous story of the blind men trying to describe an elephant, grounds both a kind of fallibilism in epistemology and a sevenfold classification of statements in logic.

Every school of Indian thought includes some judgment about the valid sources of knowledge (pramanas). While their lists of pramanas differ, they share a concern to capture the common-sense view; no Indian school is skeptical. The Jain list of pramanas includes sense perception, valid testimony (including scriptures), extra-sensory perception, telepathy, and kevala, the state of omniscience of a perfected soul. Notably absent from the list is inference, which most other Indian schools include, but Jain discussion of the pramanas seem to indicate that inference is included by implication in the pramana that provides the premises for inference. That is, inference from things learned by the senses is itself knowledge gained from the senses; inference from knowledge gained by testimony is itself knowledge gained by testimony, etc. Later Jain thinkers would add inference as a separate category, along with memory and tarka, the faculty by which we recognize logical relations.

Since reality is multi-faceted, none of the pramanas gives absolute or perfect knowledge (except kevala, which is enjoyed only by the perfected soul, and cannot be expressed in language). As a result, any item of knowledge gained is only tentative and provisional. This is expressed in Jain philosophy in the doctrine of naya, or partial predication (sometimes called the doctrine of perspectives or viewpoints). According to this doctrine, any judgment is true only from the viewpoint or perspective of the judge, and ought to be so expressed. Given the multifaceted nature of reality, no one should take his or her own judgments as the final truth about the matter, excluding all other judgments. This insight generates a sevenfold classification of predications. The seven categories of claim can be schematized as follows, where ‘a’ represents any arbitrarily selected object, and ‘F’ represents some predicate assertible of it:

  1. Perhaps a is F.
  2. Perhaps a is not-F.
  3. Perhaps a is both F and not-F.
  4. Perhaps a is indescribable.
  5. Perhaps a is indescribable and F.
  6. Perhaps a is indescribable and not-F.
  7. Perhaps a is indescribable, and both F and not-F.

Each predication is preceded by a marker of uncertainty (syat), which I have rendered here as ‘perhaps.’ Some render it as ‘from a perspective,’ or ‘somehow.’ However it is translated, it is intended to mark respect for the multifaceted nature of reality by showing a lack of conclusive certainty.

Early Jain philosophical works (especially the Tattvartha Sutra) indicate that for any object and any predicate, all seven of these predications are true. That is to say, for every object a and every predicate F, there is some circumstance in which, or perspective from which, it is correct to make claims of each of these forms. These seven categories of predication are not to be understood as seven truth-values, since they are all seven thought to be true. Historically, this view has been criticized (by Sankara, among others) on the obvious ground of inconsistency. While both a proposition and its negation may well be assertible, it seems that the conjunction, being a contradiction, can never be even assertible, never mind true, and so the third and seventh forms of predication are never possible. This is precisely the kind of consideration that leads some commentators to understand the ‘syat’ operator to mean ‘from a perspective.’ Since it may well be that from one perspective, a is F, and from another, a is not-F, then one and the same person can appreciate those facts and assert them both together. Given the multifaceted nature of the real, every object is in one way F, and in another way not-F. An appreciation of the complexity of the real also can lead one to see that objects are, as they are in themselves, indescribable (as no description can capture their entirety). This yields the fourth form of predication, which can then be combined with other insights to yield the last three forms.

Perhaps the deepest problem with this doctrine is one that troubles all forms of skepticism and fallibilism to one degree or another; it seems to be self-defeating. After all, if reality is multifaceted, and that keeps us from making absolute judgments (since my judgment and its negation will both be equally true), the doctrines that underlie Jain epistemology are themselves equally tentative. For example, take the doctrine of anekantevada. According to that doctrine, reality is so complex that any claim about it will necessarily fall short of complete accuracy. The doctrine itself must then fall short of complete accuracy. Therefore, we should say, “Perhaps (or “from a perspective”) reality is multifaceted.” At the same time, we have to grant the propriety, in some circumstances, of saying, “Perhaps reality is not multifaceted.” Jain epistemology gains assertibility for its own doctrine, but at the cost of being unable to deny contradictory doctrines. What begins as a laudable fallibilism ends as an untenable relativism.

3. Ethics

Given that the proper goal for a Jain is release from death and rebirth, and rebirth is caused by the accumulation of karma, all Jain ethics aims at purging karma that has been accumulated, and ceasing to accumulate new karma. Like Buddhists and Hindus, Jains believe that good karma leads to better circumstances in the next life, and bad karma to worse. However, since they conceive karma to be a material substance that draws the soul back into the body, all karma, both good and bad, leads to rebirth in the body. No karma can help a person achieve liberation from rebirth. Karma comes in different kinds, according to the kind of actions and intentions that attract it. In particular, it comes from four basic sources: (1) attachment to worldly things, (2) the passions, such as anger, greed, fear, pride, etc., (3) sensual enjoyment, and (4) ignorance, or false belief. Only the first three have a directly ethical or moral upshot, since ignorance is cured by knowledge, not by moral action.

The moral life, then, is in part the life devoted to breaking attachments to the world, including attachments to sensual enjoyment. Hence, the moral ideal in Jainism is an ascetic ideal. Monks (who, as in Buddhism, live by stricter rules than laymen) are constrained by five cardinal rules, the “five vows”: (1) ahimsa, frequently translated “non-violence,” or “non-harming,” satya, or truthfulness, asteya, not taking anything that is not given, brahmacharya, chastity, and aparigraha, detachment. This list differs from the rules binding on Buddhists only in that Buddhism requires abstention from intoxicants, and has no separate rule against attachment to the things of the world. The cardinal rule of interaction with other jivas is the rule of ahimsa. This is because harming other jivas is caused by either passions like anger, or ignorance of their nature as living beings. Consequently, Jains are required to be vegetarians. According to the earliest Jain documents, plants both are and contain living beings, although one-sensed beings, so even a vegetarian life does harm. This is why the ideal way to end one’s life, for a Jain, is to sit motionless and starve to death. Mahavira himself, and other great Jain saints, are said to have died this way. That is the only way to be sure you are doing no harm to any living being.

While it may seem that this code of behavior is not really moral, since it is aimed at a specific reward for the agent—and is therefore entirely self-interested—it should be noted that the same can be said of any religion-based moral code. Furthermore, like the Hindus and Buddhists, Jains believe that the only reason that personal advantage accrues to moral behavior is that the very structure of the universe, in the form of the law of karma, makes it so.

4. References and Further Reading

  • Battacharya, Hari Mohan. Jaina Logic and Epistemology. Calcutta: K. P. Gagchi and Company, 1994.
    • A full explanation and critical examination of Jain theory of knowledge.
  • Battacharya, Narendra Nath. Jain Philosophy: Historical Outline. Delhi: Munshiram Manoharlal Publishers, 1976.
  • Benesch, Walter. An Introduction to Comparative Philosophy. London: Macmillan Press, 1997.
    • A systematic comparison of Western philosophical systems with Asian, including Jain, systems.
  • Jacobi, Herman, trans. Jaina Sutras. Sacred Books of the East, vols 22 and 45. London: Oxford University Press, 1884.
    • The only English translation of the Jain scriptures.
  • Sharma, Arvind. A Jaina Perspective on the Philosophy of Religion. Delhi: Motilal Banarsidas, 2001.
  • Tatia, Nathmal, trans. That Which Is: Tattvartha Sutra. San Francisco: HarperCollins Publishers, 1994.
    • An early Jain handbook.

Author Information

Mark Owen Webb
Email: mark.webb@ttu.edu
Texas Tech University
U. S. A.

Ismaili Philosophy

Ismailism belongs to the Shi‘a main stream of Islam. Recent scholarship, based on a more judicious analysis of primary sources, has shown how Ismaili thought was in constant interaction with and to a certain extent influenced well-known currents of Islamic philosophy, theology, and mysticism.

Shi‘i and Ismaili philosophy use ta’wil as a tool of interpretation of scripture. This Qur’anic term connotes going back to the original meaning of the Qur’an. The objective of Ismaili thought is to create a bridge between Hellenic philosophy and religion. The human intellect is engaged to retrieve and disclose that which is interior or hidden (batin).

Ismailism presents a cosmology within an adapted Neoplatonic framework but tries to create an alternative synthesis. The starting point of such a synthesis is the doctrine of ibda‘ (derived from Qur’an 2:117). In its verbal form it is taken to mean ‘eternal existentiation’ to explain the notion in the Qur’an of God’s timeless command (Kun: ‘Be!’). The process of creation can be said to take place at several levels. Ibda‘ represents the initial level. The human intellect eventually relates to creation and tries to penetrate the mystery of the unknowable God.

Human history operates cyclically. The function of the Prophet is to reveal the religious law (shari‘a) while the Imam unveils gradually to his disciples the inner meaning (batin) of the revelation through the ta’wil.

Table of Contents

  1. Introduction
  2. Language and Meaning: The Stance of Ismaili Philosophy
  3. Manifesting Transcendence: Knowledge of the Cosmos

1. Introduction

Ismailism belongs to the Shi‘a branch of Islam, and, in common with various Muslim interpretive communities, has been concerned with developing a philosophical discourse to elucidate foundational Qur’anic and Islamic beliefs and principles. It would, however, be misleading to label Ismaili and other Muslim philosophical stances, as has been done by some scholars in the past, simplistically as manifestations of “Ismaili/Muslim Neoplatonism,” and “Ismaili/Muslim gnosticism,” and so forth. While elements of these philosophical and spiritual schools were certainly appropriated, and common features may be evident in the expression and development of Ismaili as well as other ideas, it must be noted that they were applied within very different historical and intellectual contexts and that such ideas came to be quite dramatically transformed in their meaning, purpose and significance in Islamic philosophy.

By those who were hostile to it or opposed its philosophical and intellectual stance, the Ismailis were regarded as heretical; legends were fabricated about them and their teachings. Early Western scholarship on Islamic philosophy inherited some of the biases of some medieval Muslim anti-philosophical stances, which tended to project a negative image of Ismailism, perceiving its philosophical contribution as having been derived from sources and tendencies ‘alien’ to Islam. Recent scholarship, based on a more judicious analysis of primary sources, provides a balanced perspective, and has shown how Ismaili thought was in constant interaction with and to a certain extent influenced well-known currents of Islamic philosophy and theology. Their views represent a consensus that it is inappropriate to treat Ismailism as a marginal school of Islamic thought; rather it constitutes a significant philosophical branch, among others, in Islamic philosophy.

Early Ismaili philosophy works dating back to the Fatimid period (fourth/tenth to sixth/twelfth century) are in Arabic; Nasir Khusraw (d. 471/1078) was the only Ismaili writer of the period to write in Persian. The Arabic tradition was continued in Yemen and India by the Musta‘li branch and in Syria by the Nizaris. In Persia and in Central Asia, the tradition was preserved and elaborated in Persian. Elsewhere among the Ismailis, local oral languages and literatures played an important part, though no strictly philosophical writings were developed in these languages.

2. Language and Meaning: The Stance of Ismaili Philosophy

Among the tools of interpretation of scripture that are associated particularly with Shi‘i and Ismaili philosophy is that of ta’wil. The application of this Qur’anic term, which connotes “going back to the first/the beginning,” marks the effort in Ismaili thought of creating a philosophical and hermeneutical discourse that establishes the intellectual discipline for approaching revelation and creates a bridge between philosophy and religion.

Philosophy as conceived in Ismaili thought thus seeks to extend the meaning of religion and revelation to identify the visible and the apparent (zahir) and also to penetrate to the roots, to retrieve and disclose that which is interior or hidden (batin). Ultimately, this discovery engages both the intellect (‘aql) and the spirit (ruh), functioning in an integral manner to illuminate and disclose truths (haqa’iq).

The appropriate mode of language which serves us best in this task is, according to Ismaili philosophers, symbolic language. Such language, which employs analogy, metaphor and symbols, allows one to make distinctions and to establish differences in ways that a literal reading of language does not permit. Such language employs a special system of signs, the ultimate meaning of which can be ‘unveiled’ by the proper application of hermeneutics (ta’wil).

3. Manifesting Transcendence: Knowledge of the Cosmos

It has been argued that Ismaili cosmology, integrates a manifestational cosmology (analogous to some aspects of Stoic thought) within an adapted Neoplatonic framework to create an alternative synthesis. The starting point of such a synthesis is the doctrine of ibda (derived from Qur’an 2:117). In its verbal form it is taken to mean ‘eternal existentiation’ to explain the notion in the Qur’an of God’s timeless command (Kun: Be!). Ibda therefore connotes not a specific act of creation but the dialogical mode through which a relationship between God and His creation can be affirmed – it articulates the process of beginning and sets the stage for developing a philosophy of the manifestation of transcendence in creation.

In sum the process of creation can be said to take place at several levels. Ibda represents the initial level – one transcends history, the other creates it. The spiritual and material realms are not dichotomous, since in the Ismaili formulation, matter and spirit are united under a higher genus and each realm possesses its own hierarchy. Though they require linguistic and rational categories for definition, they represent elements of a whole, and a true understanding of God must also take account of His creation. Such a synthesis is crucial to how the human intellect eventually relates to creation and how it ultimately becomes the instrument for penetrating through history the mystery of the unknowable God implied in the formulation of tawhid.

Human history, as conceived in Ismailism, operates cyclically. According to this typological view, the epoch of the great prophets mirrors the cosmological paradigm, unfolding to recover the equilibrium and harmony inherent in the divine pattern of creation. Prophets and, after them, their appointed successors, the imams, have as their collective goal the establishment of a just society. The function of the Prophet is to initiate the cycle for human society and of the Imam to complement and interpret the teaching to sustain the just order at the social and individual levels.

As Nasir Khusraw, the best known of the Ismaili writers in Persian, states in a passage paraphrased by Corbin:

Time is eternity measured by the movements of the heavens,
whose name is day, night, month, year. Eternity is Time not
measured, having neither beginning nor end…The cause of Time
is the Soul of the World….; it is not in time, for time is
in the horizon of the soul as its instrument, as the duration
of the living mortal who is “the shadow of the soul”, while
eternity is the duration of the living immortal – that is to
say of the Intelligence and of the Soul.

This synthesis of time as cycle and time as arrow lies at the heart of an Ismaili philosophy of active engagement in the world.

Author Information

Azim Nanji
Email: info@iis.ac.uk
The Institute of Ismaili Studies
United Kingdom

Hamid al-Din al-Kirmani (d. 1020)

Hamid al-Din al-Kirmani was a prominent Ismaili missionary during the reign of the Fatimid caliph-imam al-Hakim (996-1021). He was of Persian origin and was probably born in the province of Kirman. He seems to have spent the greater part of his life as a Fatimid da‘i (missionary) in Iraq (in Baghdad and Basra) and in central and western parts of Iran.Al-Kirmani was part of the official Fatimid campaign against the dissident da‘is, who had also proclaimed al-Hakim’s divinity. In Cairo he produced several works in refutation of the Druze movement and religion. Subsequently, al-Kirmani returned to Iraq where he completed his last and magnum opus, Rahat al-‘aql.

A prolific writer, al-Kirmani was one of the most learned Ismaili theologians of the Fatimid times. He was well-acquainted with the Hebrew text of the Old Testament, the Syriac version of the New Testament, and the post-Biblical Jewish writings. He expounded the Ismaili Shi‘i doctrine of the imamate in numerous writings. In a few treatises, al-Kirmani refuted the theological views of the Zaydis, the Twelver Shi‘is, and other Muslim opponents of the Fatimid Ismaili imams. Al-Kirmani was also an accomplished philosopher belonging to that select group of Ismaili da‘is of the Iranian lands who amalgamated in an original manner their Ismaili theology with different philosophical traditions, notably a type of Neoplatonism then current in the Muslim world.

Hamid al-Din al-Kirmani was a prominent Ismaili da‘i or missionary and one of the most learned Ismaili theologians and philosophers of the Fatimid period. As in the case of other prominent missionaries who observed strict secrecy in their activities in the midst of hostile milieus, few biographical details are available on al-Kirmani, who flourished during the reign of the Fatimid caliph-imam al-Hakim (996-1021). Al-Kirmani is not mentioned in any contemporary Muslim historical sources, but highlights of his life and career can be gathered from his own numerous extant works as well as the writings of the later Musta‘li-Tayyibi Ismaili authors of Yaman.

Al-Kirmani’s date of birth remains unknown, but he was of Persian origin and was probably born in the province of Kirman. He seems to have spent the greater part of his life as a Fatimid da‘i in Iraq, having been particularly active in Baghdad and Basra. In Iraq, al-Kirmani successfully concentrated his efforts on local rulers and influential tribal chiefs, with whose support the Ismailis aimed to bring about the downfall of the ‘Abbasids. Alarmed by the successes of the Fatimid da‘wa or mission in Iraq, the ‘Abbasid caliph al-Qadir took retaliatory measures. In 1011, he sponsored the so-called Baghdad manifesto to discredit the Fatimids, also refuting their ‘Alid ancestry. The honorific title hujjat al-Iraqayn, meaning the hujja or chief da‘i of both Iraqs (al-Iraq al-Arabi and al-Iraq al-Ajami), which is often added to al-Kirmani’s name and may be of a late origin, implies that he was also active in central and western parts of Iran.

Al-Kirmani rose to prominence during the reign of al-Hakim, when the central headquarters of the Fatimid da‘wa in Cairo considered him as the most learned Ismaili theologian of the time. It was in that capacity that al-Kirmani played an important role in refuting the extremist ideas of some dissident da‘is, who were then founding what was to become known as the Druze movement and religion. As part of the official Fatimid campaign against the dissident da‘is, who had also proclaimed al-Hakim’s divinity, al-Kirmani was summoned in 1014 or shortly earlier to Cairo where he produced several works in refutation of the extremist doctrines. Al-Kirmani’s writings, which were widely circulated, were to some extent successful in checking the spread of the extremist doctrines associated with the initiation of the Druze movement. Subsequently, al-Kirmani returned to Iraq where he completed his last and magnum opus, Rahat al-‘aql, in 1020 and where he died soon afterwards.

A prolific writer, al-Kirmani was one of the most learned Ismaili theologians of the Fatimid times. He was well-acquainted with the Hebrew text of the Old Testament, the Syriac version of the New Testament, and the post-Biblical Jewish writings. He expounded the Ismaili Shi‘i doctrine of the imamate in numerous writings. In a few treatises, al-Kirmani refuted the theological views of the Zaydis, the Twelver Shi‘is, and other Muslim opponents of the Fatimid Ismaili imams. In his al-Aqwal al-dhahabiya, al-Kirmani refuted the ideas of Abu Bakr Mohammad b. Zakariya al-Razi (d. 934), who had argued for the necessity of revelation and prophethood while tracing all sciences to revelational origins. Al-Kirmani was also an accomplished philosopher belonging to that select group of Ismaili da‘is of the Iranian lands who amalgamated in an original manner their Ismaili theology (kalam) with different philosophical traditions, notably a type of Neoplatonism then current in the Muslim world. As a philosopher, al-Kirmani was fully acquainted with Aristotelian and Neoplatonic philosophies as well as the metaphysical systems of the Muslim philosophers (falasifa), notably al-Farabi, and Ibn Sina (Avicenna) who was his contemporary. In his Kitab al-riyad, al-Kirmani acted as an arbiter in a philosophical debate that had taken place earlier among some Iranian da‘is, notably Muhammad al-Nasafi, Abu YaRahat al-‘aql, which is written for the advanced adepts. In this book, al-Kirmani also propounded what may be regarded as the third stage in the development of Ismaili cosmology in medieval times. Al-Kirmani replaced the Neoplatonic dyad of the Intellect (‘aql) and Soul (nafs) in the spiritual world, which had been adopted by his Iranian Ismaili predecessors, by a series of ten separate Intellects in partial adaptation of al-Farabi’s Aristotelian cosmic system. Al-Kirmani’s cosmology, representing an original synthesis of different philosophical traditions, was not however adopted by the Fatimid Ismailis; it later provided the basis for the development of the fourth and final stage of Ismaili cosmology at the hands of the Musta‘li-Tayyibi scholars in Yaman.

References and Further Reading

  • W. Ivanow, Ismaili Literature: A Bibliographical Survey, Tehran, 1963, pp. 40-45. Contains a survey of al-Kirmani’s known works and their manuscripts, preserved mainly in Yaman and India.
  • I. K. Poonawala, Biobibliography of Ismaili Literature Malibu, Calif., 1977, pp. 94-102. Also contains a survey of al-Kirmani’s known works and their manuscripts, preserved mainly in Yaman and India.
  • J. van Ess, “Bibliographische Notizen zur islamischen Theologie. I. Zur Chronologie der Werke des Hamidaddin al-Kirmani”, Die Welt des Orients, 9, 1978, pp. 255-261. A partial chronology of al-Kirmani’s works.
  • W. Madelung, “Das Imamat in der frühen ismailitischen Lehre”, Der Islam, 37, 1961, pp. 114-127.
  • H. Corbin, Cyclical Time and Ismaili Gnosis, London, 1983, index.
  • F. Daftary, The Ismailis: Their History and Doctrines, Cambridge, 1990, pp. 113, 192-193, 196-197, 218, 227, 229-230, 235-236, 240, 245-246, 287, 291, 298.
  • Paul E. Walker, Early Philosophical Shiism, Cambridge, 1993, index.
  • Paul. E. Walker, Hamid al-Din al-Kirmani: Ismaili Thought in the Age of al-Hakim, London, 1999.
  • Daniel De Smet, La Quiétude de l’intellect: Néoplatonisme et gnose ismaélienne dans l’oeuvre de Hamid ad-Din al-Kirmani, Louvain, 1995.

Author Information

Farhad Daftary
Email: info@iis.ac.uk
The Institute of Ismaili Studies
United Kingdom

Søren Kierkegaard (1813—1855)

KierkegaardSøren Kierkegaard is an outsider in the history of philosophy. His peculiar authorship comprises a baffling array of different narrative points of view and disciplinary subject matter, including aesthetic novels, works of psychology and Christian dogmatics, satirical prefaces, philosophical “scraps” and “postscripts,” literary reviews, edifying discourses, Christian polemics, and retrospective self-interpretations. His arsenal of rhetoric includes irony, satire, parody, humor, polemic and a dialectical method of “indirect communication” – all designed to deepen the reader’s subjective passionate engagement with ultimate existential issues. Like his role models Socrates and Christ, Kierkegaard takes how one lives one’s life to be the prime criterion of being in the truth. Kierkegaard’s closest literary and philosophical models are Plato, J.G. Hamann, G.E. Lessing, and his teacher of philosophy at the University of Copenhagen Poul Martin Møller, although Goethe, the German Romantics, Hegel, Kant and the logic of Adolf Trendelenburg are also important influences. His prime theological influence is Martin Luther, although his reactions to his Danish contemporaries N.F.S. Grundtvig and H.L. Martensen are also crucial. In addition to being dubbed “the father of existentialism,” Kierkegaard is best known as a trenchant critic of Hegel and Hegelianism and for his invention or elaboration of a host of philosophical, psychological, literary and theological categories, including: anxiety, despair, melancholy, repetition, inwardness, irony, existential stages, inherited sin, teleological suspension of the ethical, Christian paradox, the absurd, reduplication, universal/exception, sacrifice, love as a duty, seduction, the demonic, and indirect communication.

Table of Contents

  1. Life (1813-55)
    1. Father and Son: Inherited Melancholy
    2. Regina Olsen: The Sacrifice of Love
    3. The Master of Irony and the Seductions of Writing
    4. The “Authorship”: From Melancholy to Humor
    5. The “Second Authorship”: Self-Sacrifice, Love, Despair, and the God-Man
    6. The Attack on the Danish People’s Church
  2. The “Aesthetic Authorship”
    1. On the Concept of Irony and Either/Or
    2. Fear and Trembling and Repetition
    3. Philosophical Fragments, The Concept of Anxiety, and Prefaces
    4. Stages on Life’s Way and Concluding Unscientific Postscript
  3. The Edifying Discourses
    1. Sermons, Deliberations, and Edifying Discourses
    2. Direct and Indirect Communication
    3. That Single Individual, My Reader
  4. The “Second Authorship”
    1. Works of Love
    2. Anti-Climacus
    3. The Attack on the Church
  5. References and Further Reading

1. Life (1813-55)

a. Father and Son: Inherited Melancholy

Søren Aabye Kierkegaard was born on May 5th 1813 in Copenhagen. He was the seventh and last child of wealthy hosier, Michael Pedersen Kierkegaard and Ane Sørensdatter Lund, a former household servant and distant cousin of Michael Kierkegaard. This was Michael Kierkegaard’s second marriage, which came within a year of his first wife’s death and four months into Ane Lund’s first pregnancy. Michael Kierkegaard was a deeply melancholic man, sternly religious and carried a heavy burden of guilt, which he imposed on his children. Søren Kierkegaard often lamented that he had never had a childhood of carefree spontaneity, but that he had been “born old.” As a starving shepherd boy on the Jutland heath Michael had cursed God. His surname derived from the fact that his family was indentured to the parish priest, who provided a piece of the church (Kirke) farm (Gaard) for the family’s use. The name Kirkegaard (in older spelling Kierkegaard) more commonly means ‘churchyard’ or ‘cemetery.’ A sense of doom and death seemed to hover over Michael Kierkegaard for most of his 82 years. Although his material fortunes soon turned around dramatically, he was convinced that he had brought a curse on his family and that all his children were doomed to die by the age attained by Jesus Christ (33). Of Michael’s seven children, only Peter Christian and Søren Aabye survived beyond this age.

At age 12 Michael Kierkegaard was summoned to Copenhagen to work for his uncle as a journeyman in the cloth trade. Michael turned out to be an astute businessman and by the age of 24 had his own flourishing business. He subsequently inherited his uncle’s fortune, and augmented his wealth by some felicitous investments during the state bankruptcy of 1813 (the year, as Søren later put it, in which so many bad notes were put into circulation). Michael retired young and devoted himself to the study of theology, philosophy and literature. He bequeathed to his surviving sons Peter and Søren not only material wealth, but also supremely sharp intellect, a fathomless sense of guilt, and a relentless burden of melancholy. Although his father was wealthy, Søren was brought up rather stringently. He stood out at school because of his plain, unfashionable apparel and spindly stature. He learned to avoid teasing only by honing a caustic wit and a canny appreciation of other people’s psychological weaknesses. He was sent to one of Copenhagen’s best schools, The School of Civic Virtue [Borgerdydskolen], to receive a classical education. More than twice as much time was devoted to Latin in this school than to any other subject. Søren distinguished himself academically at school, especially in Latin and history, though according to his classmates he struggled with Danish composition. This became a real problem later, when he tried desperately to break into the Danish literary scene as a writer. His early publications were characterized by complex Germanic constructions and excessive use of Latin phrases. But eventually he became a master of his mother tongue, one of the two great stylists of Danish in his time, together with Hans Christian Andersen. Kierkegaard’s father is a constant presence in his authorship. He appears in stories of sacrifice, of inherited melancholy and guilt, as the archetypal patriarch, and even in explicit dedications at the beginning of several edifying discourses. Kierkegaard’s mother, on the other hand, never gets a mention in any of the writings – not even in his journal on the day of her death. His mother-tongue, though, is omnipresent. If we conjoin this fact with the remark in Concluding Unscientific Postscript (1846) that “… an omnipresent person should be recognizable precisely by being invisible,” we could speculate that the mother is even more present than the father, pervading all but the foreign language insertions in the texts. But whether or not there is any substance in this speculation, the invisibility of the mother and the treatment of women in general are indicative of Kierkegaard’s uneasy relationship with the opposite sex.

b. Regina Olsen: The Sacrifice of Love

Søren drifted into the study of theology at the University of Copenhagen, but soon broadened his study to include philosophy and literature. He started rather desultorily, and enjoyed a relatively dissolute time, even aspiring to cut the figure of a dandy. He ran up debts, which his father reluctantly paid, but eventually knuckled down to finish his degree when his father died in 1838. It seemed he was destined for a life as a pastor in the Danish People’s Church. In 1840, just before he enrolled at the Pastoral Seminary, he became engaged to Regina Olsen. This engagement was to form the basis of a great literary love story, propagated by Kierkegaard through his published writings and his journals. It also provided an occasion for Kierkegaard to define himself further as an outsider. For several years (at least since 1835) Kierkegaard had been dabbling with the idea of becoming a writer. The wealth he had inherited from his father enabled him to support himself comfortably without the need to work for a living. But it was not really enough to support a wife, let alone a wife and children. Furthermore, Kierkegaard harbored an undisclosed secret, something dark and personal, which he thought it his duty to confide to a wife, but which he dared not. Whether it was some sexual indiscretion, an inherited sexual disease, his innate melancholy, an egotistical mania to become a writer, or something else, we can only speculate. But when it came to the crunch, it seemed sufficient to make him break off the engagement rather than to reveal it to Regina. Thereafter, Kierkegaard frequently used marriage as a trope for “the universal” – especially for the universal demands made by social mores. Correlatively, becoming an “exception” was both a task and constantly in need of justification. The tortuous dialectic of universal and exception, worked out in terms of the sacrifices of love, subsequently informs much of Either/Or, Repetition, Fear and Trembling, Prefaces, and Stages on Life’s Way. A frequent foil for the trope of marriage as the universal is the figure of a young man “poeticized” by a broken engagement, who thereby becomes “an exception.” Only when the young man is “poeticized” in the direction of the religious, however, is there any question of his being a “justified exception.” Kierkegaard’s ultimate justification for breaking off his own engagement was his dedication to a life of writing as a religious poet, under the direction of divine Governance. As a measure of the importance the relationship to Regina had for his life, Kierkegaard adapted a line from Virgil’s Aeneid II,3 as “a motto for part of his life’s suffering”: Infandum me jubes Regina renovare dolorem (“Queen [Regina], the sorrow you bid me revive is unspeakable”).

c. The Master of Irony and the Seductions of Writing

During the period of his engagement Kierkegaard was also busy writing his Master’s dissertation in philosophy, On the Concept of Irony: with constant reference to Socrates (1841). This was later automatically converted to a doctorate (1854). Kierkegaard had petitioned the king to write his dissertation in Danish – only the third such request to be granted. Usually academic dissertations had to be written and defended in Latin. Kierkegaard was allowed to write his dissertation in Danish, but had to condense it into a series of theses in Latin, to be defended publicly in Latin, before the degree would be awarded. Almost immediately after his dissertation defense, Kierkegaard broke off his engagement to Regina. He then undertook the first of four journeys to Berlin – his only trips abroad apart from a brief trip to Sweden. During this first trip to Berlin Kierkegaard completed most of the first volume of Either/Or (much of the second volume already having been completed).

Throughout the second half of the 1830s Kierkegaard had aspired to become part of the pre-eminent literary set in Copenhagen. This centered on Professor J.L. Heiberg, playwright, philosopher, aesthetician, journal publisher, and doyen of Copenhagen’s literati. Heiberg had been credited with introducing Hegel’s philosophy to Denmark, though in fact there had already been lectures on Hegel by the Norwegian philosopher Henrik Steffens among others. Nevertheless, the fact that Heiberg gave Hegel’s work his imprimatur accelerated its acceptance into mainstream Danish intellectual life. By the end of the 1830s Hegelianism dominated Copenhagen’s philosophy, theology and aesthetics. Of course this engendered some resistance, including that from Kierkegaard’s professors of philosophy F.C. Sibbern and Poul Martin Møller. One of Hegelianism’s most illustrious local exponents was Kierkegaard’s archrival H.L. Martensen (professor of theology at Copenhagen University, later Bishop Primate of the Danish People’s Church). Martensen, just five years senior to Kierkegaard, was firmly entrenched in the Heiberg literary set, and anticipated at least one of Kierkegaard’s pet literary projects – an analysis of the figure of Faust. In his journals, as part of his practice at becoming a writer, Kierkegaard had been fascinated with three great literary figures from the Middle Ages, who he thought embodied the full range of modern aesthetic types. These figures were Don Juan, Faust, and the Wandering Jew. They embodied sensuality, doubt and despair respectively. Martensen’s publication on Faust pre-empted Kierkegaard’s budding literary project, though the latter eventually found expression in the first volume of Either/Or (1843). Meanwhile, Kierkegaard continued to seek Heiberg’s seal of approval. His first major breakthrough was an address to the University of Copenhagen’s Student Association on the issue of freedom of the press. This was a satirical conservative riposte to a previous address in favor of more liberal press laws, and was the first broadside by Kierkegaard in a long career of lambasting the popular press, especially insofar as it supported political agitation for democracy. In this instance, however, it seemed motivated more by a desire to showcase his wit and erudition than by any deeper engagement with the political issues. The freedom of the press had been severely undermined by King Frederik VI’s ordinance of 1799, and was threatened with full censorship by his press legislation of 1834. The Society for the Proper Use of Press Freedom was formed in 1835 to combat this development. Kierkegaard followed up his speech with an article in Heiberg’s paper, The Copenhagen Flying Post (1836). The article, published pseudonymously, was so clever and polished that some people mistook it for the work of Heiberg himself. This amounted to his calling card for invitation to the Heiberg literary salon. Kierkegaard followed this with further pseudonymous articles on the same topic. But his first monograph was a 70-page review of Hans Christian Andersen’s novel, Only a Fiddler. This too was a strategic move to break into the inner sanctum of Heiberg’s circle. Andersen was emerging as a major talent in Danish letters, having published poetry, plays and two novels, which had almost immediately been translated into German. Only a Fiddler was on a topic dear to Kierkegaard’s heart – genius. Andersen’s prime claim was that genius needs nurturing, and can succumb to circumstance and disappear without trace. Kierkegaard, in his book-length review From the Papers of One Still Living (1838), disagreed stridently, maintaining that the spark of genius could never be extinguished, but only augmented by adversity. Furthermore, he developed a theory of the novel in which he asserted that to be worth its salt, a novel had to be informed by a “life-view” and a “life-development.” He criticized Andersen’s novel for its dependence on contingent features from Andersen’s own life, rather than being transfigured by a mature philosophy of life with clarity of purpose. He contrasted Andersen’s novel unfavorably in this respect with the novel by Heiberg’s mother, Thomasine Gyllembourg, A Story of Everyday Life. Kierkegaard was to return to Gyllembourg as a novelist in his review of her Two Ages in A Literary Review (1846). He was also to write a review of the work of Heiberg’s wife Louise, Denmark’s leading actress, in The Crisis and A Crisis in the Life of an Actress (1848).

d. The “Authorship”: From Melancholy to Humor

Neither the articles in Heiberg’s papers, nor the monograph on Andersen as novelist had gained Kierkegaard secure membership of Heiberg’s circle – though he was an occasional visitor there. With the breaking of his engagement to Regina, the completion of a major academic book (The Concept of Irony), his decision to devote himself to writing, and the trip to Berlin both to audit Schelling’s lectures (along with Karl Marx, Jacob Burckhardt and other luminaries) and to concentrate on his new literary project (Either/Or), Kierkegaard was about to embark on what he later, retrospectively, called his “authorship.” This was eventually to comprise all the “aesthetic” pseudonymous works from Victor Eremita’s Either/Or to Johannes Climacus’s Concluding Unscientific Postscript, the Edifying Discourses under Kierkegaard’s own name (up to 1846), and Two Ages: The Age of Revolution and the Present Age: A Literary Review (by S. Kierkegaard). In short, these were the works published between Kierkegaard’s first and final visits to Berlin.

Either/Or burst upon the Copenhagen reading public with great force. It was immediately understood to be a major literary event. It was also regarded as scandalous by some, since its first volume portrayed the cynical, bored aestheticism of the modern flâneur, culminating in “The Seducer’s Diary.” Many, including Heiberg, took this to be a thinly disguised account of Kierkegaard’s own treatment of Regina Olsen. Most of the reviews, including Heiberg’s, concentrated on the scurrilous content of the first volume of the book. But other reviews read the two-volume work as a whole, and discovered the edifying and ethical framework in which the aesthetic point of view was to be assessed. Nevertheless, Heiberg’s review deeply offended Kierkegaard, and marked the point at which his relationship to Heiberg changed from aspiring associate to embittered critic. Hereafter in the “authorship” Heiberg became the target of unrelenting satire. He and Martensen were the main representatives of Danish Hegelianism, which is attacked at various points in the “authorship” – particularly in Prefaces (1844) and in Concluding Unscientific Postscript. It is worth noting that Hegel himself comes in for much less criticism, and much more positive endorsement, in Kierkegaard’s work than is commonly assumed. It is the Christian Hegelianism of Danish intellectuals that is the main target of his critiques. The “authorship” comprises two parallel series of texts. On the one hand are the pseudonymous works, which purportedly follow a dialectical trajectory of existential “stages” from the aesthetic, through the ethical, to the religious, and ultimately to the paradoxical religious stage of Christian faith. On the other hand are the Edifying Discourses, which are published under Kierkegaard’s own name, which resemble sermons on biblical texts, and which are addressed to a readership already presumed to be Christian. The pseudonymous authorship starts with an existential type modeled on the German Romantic aesthete – the ironic, urbane flâneur whose main concern is to avoid boredom and to maintain a cerebral spectator’s interest in life and its sensuous pleasures. Ironically, this aesthete is beset with melancholy. His greatest happiness is his unhappiness, as the section of Either/Or entitled “The Unhappiest One” concludes. Although boredom is stated to be the negative motivation for the aesthete’s actions, at a deeper level we can discern that it is escape from melancholy and despair that are the real motivators. As part of the dialectical framework of the “authorship,” Kierkegaard says there are also intermediate states between the discrete existential stages. These he calls “confinia” or border areas. Between the aesthetic and ethical stages lies the confinium of irony. Between the ethical and religious stages lies the confinium of humor. Humor is defined as “irony to a higher power” – so it does not wear its meaning on its sleeve. It is also to be understood as an inclusive, magnanimous state of affirming “both/and” (both the aesthetic and the ethical, both the tragic and the comic) rather than the ethically exclusive “either/or.” The author of Concluding Unscientific Postscript, Johannes Climacus is a self-professed “humorist” in this sense. Although he purports to give the reader the truth about Christianity, he also “revokes” all he has said in that book. The religious humorist purports to go beyond the aesthetic and the ethical by choosing the religious exclusively, yet by virtue of the absurd, gets the aesthetic and the ethical back again within the religious. In terms of his own psychological economy, Kierkegaard seems to have been struggling to lose his melancholy and have it at the same time. It seems to have served him as an essential motor of aesthetic productivity, but was also a constant source of suffering from which he sought escape. For a long time Kierkegaard reconciled himself to his life of aesthetic self-indulgence as an author with the idea that it was all for a limited time. Once his “authorship’ was complete, he would retire from writing and become a country pastor ministering to the souls of simple folk. Authorship was both a demonic temptation and a means of self-justification as an exception to the universal demands of society’s ethics. But just as he was on the point of completing the “authorship,” Kierkegaard managed to provoke an attack on himself by the press, which demanded further work as an author in response.

e. The “Second Authorship”: Self-Sacrifice, Love, Despair, and the God-Man

Kierkegaard provoked an attack on himself by the journal The Corsair. The journal, edited by the talented Jewish author Meïr Goldschmidt, specialized in ruthless satirical attacks on contemporary Danish authors. Yet, perhaps because of the esteem in which Goldschmidt held him, Kierkegaard had been spared. Kierkegaard found this favorable treatment offensive (partly out of vanity, ostensibly because of his ongoing critique of the press’s influence on public opinion). So he publicly challenged The Corsair to do its worst. It did. It launched a series of attacks on Kierkegaard, more personal than literary, and focused on his odd appearance and his relationship with Regina. In some wicked caricatures it portrayed him with one trouser leg shorter than the other, with a sway back, and riding on a woman’s (Regina’s) back with stick in hand. These caricatures made a laughing stock of Kierkegaard in Copenhagen, to the extent that he was mocked in the street and had to give up his habit of walking around the inner city to talk with all and sundry.

But it galvanized him to begin a “second authorship.” This time the edifying discourses under his own name were supplemented with works by the pseudonym Anti-Climacus. Anti-Climacus represents an idealized Christian point of view – one that Kierkegaard professed is higher than he had been able to achieve in his own life. The only other pseudonyms to appear in this “second authorship” were Inter et Inter, author of The Crisis and A Crisis in the Life of an Actress, and “H.H.” author of “Two Ethical-Religious Essays.” In addition the “second authorship” comprises: Works of Love (1847), The Sickness Unto Death (1849), Practice in Christianity (1850), as well as various edifying discourses, including Edifying Discourses in Various Spirits (1847), The Lily of the Field and the Bird of the Air (1849), Three Discourses at the Communion on Fridays (1849), Two Discourses at the Communion on Fridays (1851), and For Self-Examination (1851). He also published a retrospective self-interpretation of his writings to date, On My Work as an Author (under his own name – 1851). In addition Kierkegaard wrote various works at this time which he decided not to publish. The most significant of these are: The Book on Adler and The Point of View for My Work as an Author. The former gives a detailed analysis of the “phenomenon” of Adolph Adler, a pastor in the Danish People’s Church who claimed to have had a divine revelation. He was deemed mad by the church authorities and pensioned off. Adler had been a leading Hegelian in the 1840s, but on Kierkegaard’s analysis ends up being “a Satire on Hegelian Philosophy and the Present Age.” Kierkegaard makes an immanent critique of Adler’s writings to demonstrate their confusion and the absence of revelation. Kierkegaard published only the addendum to The Book on Adler as “The Difference between a Genius and an Apostle” in “Two Ethical Religious Essays.” The Point of View for My Work as an Author sets out Kierkegaard’s (retrospective) interpretation of his authorship. It is subtitled: “A Direct Communication, Report to History.” It explains in direct terms the dialectic of indirect communication, but Kierkegaard was uncertain whether its directness at that time was dialectically correct for the authorship and refrained from publishing it. The “second authorship” reintroduces various concepts from the “aesthetic authorship,” but “transfigured” by the light of Christian faith. One of the most significant of these is “despair,” which is a transfigured version of “anxiety.” Both concepts are illuminated by reference to the notion of sin, and both are constitutive of the dialectic of selfhood. Only by acknowledging our ultimate dependence on God’s grace is it possible to overcome despair, and to become a self (paradoxically by becoming as “nothing” before God). Another concept transfigured in the “second authorship” is “love.” In the “aesthetic authorship” “love” is understood in pagan terms, primarily as eros – or desire. Desire is preferential, based on a lack (we only desire what we don’t have, according to Plato’s Symposium), and is ultimately selfish. Christian love is understood as agape. It is self-sacrificing, directed to the neighbor (without personal preference), is conceived as a spiritual duty rather than a psychological feeling, and comes as a gift from God rather than from the attraction between human beings. Its only perfect model is in the person of Jesus Christ, the God-man. We can see in the journey from eros in the “aesthetic authorship” to agape in the “second authorship” a personal attempt by Kierkegaard to sublimate his selfish desire for Regina into a self-sacrificing universal duty to love the neighbor. On his own terms this is impossible for a human being to achieve alone. It is only possible if love as agape is received as a gift by the grace of God.

f. The Attack on the Danish People’s Church

The “authorship” and “second authorship” had been governed by Kierkegaard’s elaborate method of “indirect communication.” This method, inspired by Socrates and Christ, is designed to elicit self-examination from the reader in order to start the process of existential transfiguration that is entailed by Christian faith. It is designed to make it harder for the reader to appropriate the text objectively and dispassionately. Instead, the text is folded back on itself, layered with riddles and paradoxes, and designed to be a mirror in which the way the reader judges the text amounts to a self-judgment on the reader. The different works in the “authorships” are related to one another dialectically, so that a reader has to traverse a complicated journey to arrive at the threshold of Christian faith. The method of indirect communication requires meticulous attention to each word, and to the dialectical trajectory of the whole oeuvre. At times, the subtlety of the method nearly drove Kierkegaard to distraction, and he had to rely on the intervention of “Governance” [Styrelse], to let him know whether it was appropriate to publish the works he had written. On the Point of View for My Work as an Author: A Report to History, and The Book on Adler, failed to get Governance’s stamp of approval for publication.

But ultimately Kierkegaard began to think that this elaborate method of indirect communication, and his obsession with linguistic detail were temptations to the demonic. Besides, time was running out and some direct, decisive intervention in Danish church politics was necessary. This was precipitated by the death of the Bishop Primate of the Danish People’s Church, J.P. Mynster (1854). Mynster had been the family pastor in Michael Kierkegaard’s day, and Søren Kierkegaard had always had a filial respect for him. But when the new Bishop Primate elect, H.L. Martensen, announced that Mynster had been “a witness to the truth” Kierkegaard could not restrain himself. He launched a stinging attack on the established church in a series of articles in the newspaper Fædrelandet [The Fatherland], and by means of a broadsheet called The Instant [or more literally “The Glint of an Eye”](1855) and in a series of other short, sharp pieces including This Must Be Said, So Let It Be Said (1855), and What Christ Judges of Official Christianity (1855). On September 28th 1855 Kierkegaard collapsed in the street. A few days later he was admitted to Frederiksberg Hospital in Copenhagen, where he died on November 11th.

2. The “Aesthetic Authorship”

a. On the Concept of Irony and Either/Or

Although Kierkegaard explicitly leaves On the Concept of Irony out of his “authorship,” it functions as an important preface to that body of work. According to the theory of existential stages contained in the authorship, irony functions as a “confinium” [border area] between the aesthetic and the ethical. But it also functions as a point of entry to the aesthetic. As Kierkegaard argues in On the Concept of Irony, irony is a midwife at the birth of individual subjectivity. It is a distancing device, which folds immediate experience back on itself to create a space of self-reflection. In Socrates it is incarnated as “infinite negativity” – a force that undermines all received opinion to leave Socrates’ interlocutors bewildered – and responsible for their own thoughts and values. That is, Socratic irony forces his interlocutors to reflect on themselves, to distance themselves critically from their immediate beliefs and values.

Although the aesthetic can consist in immediate immersion in sensuous experience, as in the case of Don Juan, Kierkegaard’s most developed portrait is of the reflective aesthete in Either/Or volume 1. Faust is the first example of a reflective aesthete. He is lost in reflective ennui and craves a return to immediate experience. This is the basis of his attraction to Margarete, who embodies innocent immediacy. At its most extreme, the aesthete is unhappily and utterly self-alienated by means of temporal dislocation. “The Unhappiest One” – an echo of Hegel’s “unhappy consciousness” – hopes for that which can only be remembered, and remembers that which can only be hoped. He or she lives only in the modality of possibility and never in the modality of actuality, and therefore fails to be self-present. Yet, by means of reflective self-knowledge, the prudent rotation of moods and the arbitrary focus of interest, this “unhappiness” can be transformed into the greatest happiness for the aesthete. The “infinitizing” element of possibility becomes the realm of freedom, where even the most banal events can be “poeticized” by aesthetic sensibility. Actuality is transformed into nothing more than an occasion for generating reflective possibilities, rather than being an obstacle or a task. Johannes the seducer need see only a dainty ankle descending from a carriage to reconstruct the whole woman – just as Cuvier reconstructs the whole dinosaur from a single bone. The reconstruction, in the case of Johannes however, is not for the sake of knowing what’s real, but is for the sake of his own aesthetic titillation. If the actual doesn’t fit Johannes’ reflective desires, he manipulates it and himself until he generates a story that satisfies him. His seduction of Cordelia is not aimed at mere sexual consummation, but more at narrative consummation – she is to be used as an occasion, and manipulated in whatever ways Johannes deems necessary, to become the character in the story of seduction he has predetermined. But this detachment from the actual, by self-centered immersion in reflective possibility, is exactly what On the Concept of Irony had accused the German Romantics of achieving with their use of irony. The first volume of Either/Or just gives us a more developed version, artistically construed from the point of view of German Romantic irony. On the Concept of Irony had already argued for the necessity to go beyond immersion in irony, or mere possibility – to become a “master of irony,” so that irony could be used strategically for ethical and religious ends. The title Either/Or presents us with a choice between the aesthetic and the ethical. The first volume is written from the point of view of the reflective aesthete, who has run astray in possibility. Although its main theme is love, this is conceived selfishly as erotic desire. The papers that comprise volume 1 are written ad se ipsum [to himself]. The aesthete’s brilliant pyrotechnics are demonically self-enclosed, ironically cutting him off from genuine communication. The second volume, on the other hand, is written by a judge, who advocates transparency and openness in communication. It is written in the form of letters, as a direct communication to the aesthetic author of the first volume. The letters implore him to realize the limitations of his demonic self-enclosure, and to embrace his ethical duties to others. Whereas the paradigm of love in volume 1 is seduction, the paradigm of love in volume 2 is marriage. Marriage is a trope for the universal claims of civic duty. It requires an open, intimate, transparent, honest relation to an other. Yet the first section of volume 2 argues for the aesthetic validity of marriage. Judge Wilhelm wants to persuade the aesthete that ethical love is compatible with aesthetic love – that love in marriage does not exclude sensual enjoyment and love of beauty as such, but only the selfishness of lust for “the flesh.” The latter is a category excluded by Christianity. It pertains to the body and psyche, to the exclusion of spirit, which is the definitive Christian category. Yet the claims of the judge ring hollow. Either/Or is presented as a whole book, edited by Victor Eremita (the victorious hermit). It presents us with a radical, exclusive choice between the aesthetic and the ethical, yet the judge tries to show their compatibility in marriage. The final word of the book belongs neither to the aesthete, the judge, nor even to the pseudonymous editor, but to an anonymous parson. His sermon, “The Edification Which Lies In The Fact That In Relation To God We Are Always In The Wrong,” alerts the reader to the impossibility of escaping sin through ethics. The assumption shared by both the aesthete and the ethicist is that love can provide a means for ascent to the divine. Whereas erotic desire provides a means for the aesthete to ascend to a state of reflective possibility unconstrained by actuality, in which he becomes his own creator-god, the judge conceives ethical love to be a dialectical advance on aesthetic selfishness – in the direction of God. The whole pseudonymous authorship, from Either/Or to Concluding Unscientific Postscriptcan be read as a parody of the notion of a scala paradisi by means of which humans can ascend to the divine. The original model for this ladder to paradise is Plato’s account of love [eros] in the Symposium. But the model is appropriated by many subsequent writers, including Augustine and Johannes Climacus, a sixth century monk from Mt. Sinai, who wrote a book called Scala Paradisi. Kierkegaard borrows this name for his pseudonymous author of Philosophical Fragments and Concluding Unscientific Postscript. But it is in order to parody the notion that humans can ascend to the divine under their own power. Each of the pseudonymous books in the “authorship” makes a gesture of movement from human to divine, whether by means of the aesthetic sublime, ethical virtue, the religious leap of faith, or philosophical dialectics. But in each case the apparent movement is “revoked” in some way. Ultimately Kierkegaard endorses the Lutheran view that human beings are radically dependent on God to descend to us. Human beings have no inherent capacity for transcending their own immanence, but are completely reliant on God’s grace to connect with alterity.

b. Fear and Trembling and Repetition

The next two books in the pseudonymous authorship, Fear and Trembling and Repetition, are supposed to represent a higher stage on the dialectical ladder – the religious. They are supposed to have moved beyond the aesthetic and the ethical. Fear and Trembling explicitly problematizes the ethical, while Repetition problematizes the notion of movement. Fear and Trembling reconstructs the story of Abraham and Isaac from the Old Testament. It tries to understand psychologically, ethically and religiously what Abraham was doing in obeying an apparent command from God to sacrifice his son. It apparently concludes that Abraham is “a knight of faith” who is religiously justified in his “teleological suspension of the ethical.” The ethic in question here is the civic virtue championed by Judge Wilhelm in Either/Or – corresponding to Hegel’s Sittlichkeit [customary morality]. The end for which this ethic is suspended is the unconditional command of God. But such obedience raises difficult epistemological questions – how do we distinguish the voice of God from, say, a delusional hallucination? The answer, which induces fear and trembling, is that we can only do so by faith. Abraham can say nothing to justify his actions – to do so would return him to the realm of human immanence and the sphere of ethics. The difference between Agamemnon, who sacrificed his daughter Iphigenia, and Abraham is that Agamemnon could justify his action in terms of customary morality. The sacrifice, however painful, was demanded for the sake of the success of the Greek military mission against Troy. Such sacrifices, for purposes greater than the individuals involved, were intelligible to the society of the time. Abraham’s sacrifice would have served no such purpose. It was unjustifiable in terms of prevailing morality, and was indistinguishable from murder. The ineffability of Abraham’s action is underscored by the pseudonym Kierkegaard chose as author of Fear and Trembling, namely, Johannes de silentio. But while Fear and Trembling is supposed to have moved beyond the aesthetic and the ethical, its subtitle is “a dialectical lyric.” Although its subject matter is ineffable and its author silent, it effuses aesthetically on its theme. It ends with an “Epilogue” that asserts that, as far as love and faith go, we cannot build on what the previous generation has achieved. We have to begin from the beginning. We can never “go further.”

Repetition begins with a discussion of the analysis of motion by the Eleatic philosophers. It goes on to distinguish two forms of movement with respect to knowledge of eternal truth: recollection and repetition. Recollection is understood on the model of Plato’s anamnesis – a recovery of a truth already present in the individual, which has been repressed or forgotten. This is a movement backwards, since it is retrieving knowledge from the past. It can never discover eternal truth with which it was previously unacquainted. In contrast, repetition is defined as “recollection forwards.” It is supposed to be the definitive movement of Christian faith. The pseudonym Constantin Constantius congratulates the Danish language on the word “Gjentagelse” [repetition], which more literally means “taking again.” The emphasis in the Danish, then, is on the action involved in the repetition of faith rather than on the intellection involved in recollection. Christian faith is not a matter of intellectual reflection, but of living a certain sort of life, namely, imitating [repeating] the life of Christ. Despite this verbal analysis of the difference between recollection and repetition, the characters in Repetition fail to achieve religious repetition. The pseudonymous author fails in his attempt to repeat a journey to Berlin, and the “young man” who has been “poeticized” by love seems to move in the direction of the religious, but ultimately gets no further than religious poetry. He becomes obsessed with Job, the biblical paradigm of repetition. He substitutes the book of Job for the beloved he has rejected, even taking it to bed with him. But in the end the “young man” turns out to be no more than a fiction invented by Constantius as a psychological experiment. He falls back into the realm of aesthetics, of mere possibility, a figment for the psyche rather than the spirit.

c. Philosophical Fragments, The Concept of Anxiety, and Prefaces

In June 1844 Kierkegaard published three pseudonymous books: Philosophical Fragments, The Concept of Anxiety, and Prefaces. Philosophical Fragments, the first book by the pseudonym Johannes Climacus, tackles the question of how there can be an historical point of departure for an eternal truth. This picks up from Constantius’ discussion of the difference between repetition and recollection. But Johannes uses the perspective and vocabulary of philosophy, rather than Constantius’ aesthetic irony. He introduces the paradox of the Christian incarnation as the stumbling block for any attempts by reason to ascend logically to the divine. The idea that the eternal, infinite, transcendent God could simultaneously be incarnated as a finite human being, in time, to die on the cross is an offense to reason. It is even too absurd an idea for humans to have invented, according to Climacus, so the idea itself must have a transcendent origin. In order for humans to encounter transcendent, eternal truth other than through recollection, the condition for reception of that truth must also have come from outside. If we have Christian faith, it is Christ as teacher who is the condition for receiving this truth – and he is conceived, precisely, as an incursion of the transcendent deity into the realm of human immanence. There can be no ascent to this truth by reason and logic, contra Hegel, who tries to demonstrate that “universal philosophical science” ultimately reveals “the Absolute.”

The emphasis Climacus places on the paradox of the Christian incarnation, together with his assertion that this causes offense to reason, have prompted many to the view that Kierkegaard is an “irrationalist” about Christian faith. Some take this to mean that his view of faith is contrary to reason, or transcendent of reason – in either case, exclusive of reason. Others have sought to find means of reconciling Climacus’ claims with some more extended notion of reason. It is important in considering these issues to distinguish Kierkegaard’s position from that of his pseudonym, and to take into account the point of view from which this consideration is made. Kierkegaard’s main aim in having Climacus make these claims is to undermine the idea that philosophical reason can be used as a scala paradisi. His principle target is Hegelianism, but he is also trying to distinguish pagan (especially Platonic) epistemology from Christian epistemology. We must also bear in mind that under the influence of Christian faith, all experience is transfigured (“everything is new in Christ”). This includes the experience of reason, as well as ethics and aesthetics. Ethics, for example, might be teleologically suspended in faith, but is recouped within Christian faith – though it comes to have another meaning. It is no longer merely customary morality, but is the morality sanctioned by Christian love, which is deontological, centered on spirit rather than sympathy, self-sacrificing, and is mediated by God (the “third” in every love relation). Similarly aesthetics is transfigured under Christian faith, from self-serving reflections confined to the realm of possibility, to the beauty inherent in altruistic self-effacing acts of love. Reason itself comes to have another meaning under Christian faith, so that it no longer takes offense at the paradox, but recognizes its necessity given the exigencies of relating the transcendent to the immanent without reduction. Reason is recontextualized within existence, rather than being elevated to absorb the whole of existence. Prefaces: Light Reading for Certain Classes as the Occasion May Require reinforces the polemic against Hegel’s speculative ladder of reason. Although much of its content is devoted to satirical broadsides at J.L. Heiberg, H.L. Martensen, and the popular press in Copenhagen, its starting point is the paradox of philosophical prefaces articulated in the preface to Hegel’s The Phenomenology of Spirit. Hegel’s assumption is that a philosophical work should be a sort of Bildungsroman – a narrative by means of which the reader’s consciousness is dialectically developed in the course of reading. If we assume the reader is to learn something from the process of reading the book, then he or she will not be in a position to understand the conclusions of the book until they have worked their way through the content. By the time they reach the end they will be conditioned by what they have read to understand the conclusion. But a preface presents the conclusions to the book at the outset. It is really an anticipatory postface rather than a preface. The reader will really only be able to understand it after having read the book. It is meant for orientation of the reader on embarking on the voyage of self-development represented by the book. But if it is a direct bridge into the book, the subject matter itself, then it is really part of the book rather than a preface. If, on the other hand, it stands radically outside the book, then it can’t be a bridge into the book and is redundant. This gap between preface and book parallels the gap Hegel draws between “particular philosophical sciences” (such as aesthetics, and history of philosophy) and “universal philosophical science” (logic). The former must be used as a contingent starting point, commensurate with the limited knowledge of the reader, as a point of induction into logic. The particular can retrospectively be subsumed within the universal, but cannot be expanded to become the universal. It has been claimed, in accordance with this position, that if the reader understands the preface to Hegel’s Phenomenology of Spirit, he or she understands the whole of Hegel’s philosophy. But the condition for understanding the preface is already to understand the whole of Hegel’s philosophy. The pseudonymous author of Prefaces, Nicholas Notabene, is a pedant whose wife has forbidden him to be an author. He takes an author to be a writer of books, and with cunning sophistry decides to write nothing but prefaces “which are not the prefaces to any books.” Notabene’s prefaces are analogues of human immanence – no amount of expansion will make them bridges to the transcendent. All human immanence is a “preface” to the divine. Only once the divine has come to us (in the incarnation or through direct revelation) can we retrospectively understand the status of our prefatory lives as mere prefaces. For Kierkegaard there is only one book – the bible. We are never “authors” of books, but only readers of “the old familiar text handed down from the fathers.” On the same day as he published Prefaces Kierkegaard also published On the Concept of Anxiety by Vigilius Haufniensis [Watchman of the Harbor – namely, Copenhagen]. Its subtitle is “A Simple Psychologically Orienting Deliberation on the Dogmatic Issue of Hereditary Sin.” It is supposed to be a serious counterweight to the “light reading” of Prefaces. But it forms part of the same polemic against immanent human efforts to reach the divine. From the points of view of psychology and theological dogmatics it elaborates the theme of the sermon appended to Either/Or – that against God we are always in the wrong. Sin is inescapable. Sin ultimately consists in being outside of God. Only Jesus Christ, the God-man, is not in sin. Sin consciousness comes into being as part of human psychological development. It is absent from the innocent immediacy of childhood. It awakens with sexual desire – when we want to possess another. Desire is here understood as a lack that we want to fill. Possession, or incorporation of the other, is thought to be the way to fulfill the desire. In erotic love it feels as though part of ourselves is outside of us, and needs to be reintegrated (as in Aristophanes’ explanation of love in Plato’s Symposium). This is the beginning of self-alienation and the loss of innocent immediacy. Self-alienation is a necessary stage on the way to becoming a self. A self is a synthesis of finite and infinite, temporal and eternal, body and soul, held together by spirit. Only with the diremption of these aspects of the self, through self-alienation, does spirit arise. But spirit can only achieve the synthesis of self if it acknowledges its absolute dependence in this task on God (“the power that posits it”). Long before it gets to this stage, the person feels anxiety in the face of self-alienation. Anxiety is an ambivalent state, “a sympathetic antipathy and an antipathetic sympathy.” It is the intimation of the delights of freedom, but also of the dread responsibility that is a consequence of freedom. Like vertigo, it is the simultaneous fascination and fear of the abyss – a hypnotic possibility of falling that induces the dizziness to actually fall. The main arena for the exercise of freedom is in becoming a self. But this requires alienation from one’s immediate sensate being, taking ethical responsibility for one’s relations to other people, and acknowledgement of one’s ultimate dependence on God. Each of these entails risk – and hence anxiety. One of the risks involved is the possibility of falling prey to the demonic. A key definition of this notion is “self-enclosed reserve” [Indesluttethed] – a state in which the individual fails to relate to an other as other, but returns into him or herself in narcissism or solipsism. Kierkegaard feared that his convoluted, indirect writing could be his own form of the demonic, and ultimately opted for more direct forms of communication.

d. Stages on Life’s Way and Concluding Unscientific Postscript

Like many of Kierkegaard’s pseudonymous works, Stages on Life’s Way repeats elements from earlier pseudonymous works. In particular, it repeats the device of nesting narrators within narrators, it repeats characters from Either/Or and Repetition, and it “repeats” “The Seducer’s Diary” in “Quidam’s Diary.” The latter was originally conceived at the same time as “The Diary of the Seducer” but was to differ by having the seducer undermined by his own depression once he had won the girl. Stages also repeats the idea built up over the sequence of pseudonymous works that human existence can be conceived as falling into distinct “stages” or “spheres,” which are related in a dialectical progression. Stages repeats the same stages that have already been traversed in the preceding works, apparently without making any progress.

It is another example of the false ladder to paradise, exemplified by Plato’s ladder of eros. The first major section of Stages, “In Vino Veritas,” borrows its title from Plato’s Symposium and is modeled explicitly on that work, both structurally and thematically. It consists in a group of men at a banquet, each discoursing in turn on the nature of (erotic) love. This section of the book is followed by “Some Reflections on Marriage” by Judge Wilhelm, to give an ethical perspective on love. This is followed by “Quidam’s Diary,” which is supposed to follow a trajectory from erotic love to religious consciousness. But Quidam’s diary is framed by the words of Frater Taciturnus (a distorted repetition of Johannes de silentio), in which he tells us that Quidam’s diary was retrieved from the bottom of a lake. It was enclosed in a box with the key locked inside – a symbol of the demonic. Later Frater Taciturnus tells the reader explicitly that Quidam is demonic “in the direction of the religious.” Furthermore, like the “young man” from Repetition, Quidam is only a fiction invented by Frater Taciturnus to illustrate a point. As we read through Stages it looks as though we are progressing from the aesthetic, through the ethical to the religious. But Frater Taciturnus pulls the ladder out from under our feet in his “Letter to the Reader.” He even suggests that there might not be any reader, in which case he is content to talk to himself – i.e. return demonically into himself, rather than relate himself earnestly to an actual other. Concluding Unscientific Postscript repeats these movements of Stages. It proclaims itself to be only a postscript to the Philosophical Fragments, which any attentive reader of that book could have written, and contains an extensive review of the pseudonymous authorship to date. The self-proclaimed humorist, Johannes Climacus takes up the problematic of Philosophical Fragments of whether there can be an historical point of departure for eternal truth. He seems to conclude that since it is impossible to demonstrate the objective truth of Christianity’s claims, the most the individual can do is to concentrate on the how of appropriation of those claims. This issues in the extensive discussion of inwardness and subjectivity, which is usually taken as the basis for the accusation that Kierkegaard is an “irrationalist.” Climacus, but not Kierkegaard, proclaims that “truth is subjectivity” (as well as “subjectivity is untruth”). Climacus also makes a distinction between two types of religiousness: “Religiousness A” and “Religiousness B.” The former is the pagan conception of religion and is characterized by intelligibility, immanence, and recognition of continuity between temporality and eternity. Religiousness B is dubbed “paradoxical religiousness” and is supposed to represent the essence of Christianity. It posits a radical divide between immanence and transcendence, a discontinuity between temporality and eternity, yet also claims that the eternal came into existence in time. This is a paradox and can only be believed “by virtue of the absurd.” The distinction between “Religiousness A” and “Religiousness B” is another expression of the distinction between recollection and repetition, or between eros and agape, or between immanence and transcendence. It is supposed to mark the gulf between Christianity and all other forms of faith. The paradox of the Christian incarnation is presented as an offense to reason, which can only be overcome by a leap of faith. But even a leap is under the control of the individual. It might take more courage and induce more anxiety than the steady step-by-step ascension of a ladder. One is out over 70000 fathoms. But Climacus is a humorist. Humor is characterized as a means of “revoking” existence. Although Climacus writes about Christian faith, he doesn’t live it. He represents in the modality of possibility what can only be experienced in the modality of actuality. At the end of Concluding Unscientific Postscript, Climacus explicitly revokes everything he has said – though he is careful to add that to say something and revoke it is not the same as never having said it at all. That is, at the end of the pseudonymous scala paradisi, the pseudonymous author proclaims that what he has said is misleading – because it presents a continuity between immanent human categories of thought and the divine in the form of analogy. But there is no analogy to the divine. It is sui generis. It is “the book” to human life as “preface.”

3. The Edifying Discourses

a. Sermons, Deliberations, and Edifying Discourses

Simultaneously with the publication of the aesthetic pseudonymous works, Kierkegaard published a series of works he called “Edifying Discourses” [Opbyggelige Taler]. These were written under his own name and most of them were dedicated “To the Late Michael Pedersen Kierkegaard, Formerly a Clothing Merchant Here in the City, My Father.” Although they typically take a New Testament theme as their point of departure, Kierkegaard explicitly denies that they are sermons. This is because he had not been ordained, and so wrote “without authority.” They are also addressed to “that single individual” and not to a congregation.

Kierkegaard distinguishes his “edifying discourses” as a genre from other works he calls “deliberations” [Overveielser]. Edifying discourses “build up” whereas deliberations are a “weighing up.” Edifying discourses presuppose Christian faith and terminology as given and understood, and build on that. They are meant to augment the faith and love of the Christian reader. Deliberations, while they may ostensibly deal with the same subject matter, imply that the reader stands outside the matter being weighed. But this is in a particular sense. In weighing something on a scale, we measure two weights against one another. In deliberating, the reader weighs the temporal significance of the subject matter against its eternal significance. The deliberation, as a type of writing, weighs into the reader’s balance of temporal and eternal with polemical force. It is meant to turn the normal, worldly view topsy-turvy. Works of Love is subtitled “Some Christian Deliberations in the Form of Discourses.” It has the polemical, topsy-turvy nature of deliberation, but contains within it the form of the discourse. Furthermore, one of the explicit themes of these discourses is edification. But because of the framework of deliberation, the discourses about edification are not necessarily for edification. They don’t presuppose an understanding of the Christian categories, but are meant to lead the reader to an understanding – through deliberation. The earlier pseudonymous book, The Concept of Anxiety is subtitled “A Simple Psychologically Orienting Deliberation on the Dogmatic Issue of Hereditary Sin.” Like Works of Love it is a serious weighing up of various Christian concepts, in a manner designed to provoke readers to rethink the relation between the temporal and eternal in their lives. Kierkegaard uses yet other related genres besides deliberations and edifying discourses. The pseudonym Anti-Climacus uses the subtitles “A Christian Psychological Exposition [Udvikling] for Edification and Awakening” (The Sickness Unto Death) and “For Awakening and Making Inward” (Practice in Christianity). These are written from an idealized Christian point of view, so not only presuppose an understanding of the Christian categories, but seek to raise the level of awareness to the highest level of Christian faith.

b. Direct and Indirect Communication

Kierkegaard struggled to find appropriate means of communication that would address the inward nature of Christian faith. He thought his contemporaries had too much (objective) knowledge, which needed stripping away, before they could achieve awareness of individual inwardness. Everything was made too easy for people, with the press providing ready-made opinions, popular culture providing ready-made values, and speculative philosophy providing promissory notes in place of real achievements. Kierkegaard’s task as a communicator was, initially, to make things more difficult. In order to do this, he devised a method of indirect communication. This was designed to confront the reader with paradox, contradiction, and difficulty by means of refraction of the narrative point of view through pseudonyms, prefaces, postscripts, interludes, preliminary expectorations, repetitions, irony, revocation and other devices that obscure the author’s intention. These devices are meant to undermine the authority of the author, so any “truths” contained in the text cannot merely be learned by rote or appropriated “objectively.” Instead, the text is meant to supply a polished surface in which the reader comes to see him or herself. The manner in which the reader appropriates the text, understands it, and judges it will disclose more about the reader than about the text.

Part of the method of indirect communication was to juxtapose two series of texts: the pseudonymous texts and the “edifying discourses.” The latter were published under Kierkegaard’s own name, and were co-extensive with the pseudonymous authorship. They are evidence that he was a religious author from the outset. The indirect method of the pseudonymous works is often convoluted, obscure, and a combination of personal confession and obfuscation (of those confessions). The whole of the pseudonymous authorship from Either/Or to Concluding Unscientific Postscript can be read as a parody of Hegel’s Encyclopedia of the Philosophical Sciences – an enormously baroque conceit that threatens to become demonic in its obscurity and labyrinthine complexity. This complexity is balanced by the relatively simple thematic variations on biblical texts to be found in the edifying discourses. The latter were direct communications – but addressed only to Christians who could understand them. The indirect works, on the other hand, were designed to seduce or deceive into the truth those who stand outside it – such as the Danish Hegelians and their followers. By parodying Hegel’s Encyclopedia, Kierkegaard was undermining the whole system on which the Danish Hegelians placed so much faith. He supplemented his parody of Hegel with more specific jibes at particular Danish Hegelians throughout the “authorship.” Kierkegaard continued to write edifying discourses in conjunction with the “second authorship,” to accompany the works of the pseudonym Anti-Climacus. After the “second authorship” he wrote Christian discourses that were more polemical and strident than the edifying discourses. They were equally “direct” – being published under his own name, but addressed different emotions and values.

c. That Single Individual, My Reader

Kierkegaard’s edifying discourses are addressed to “that single individual, my reader.” When he first used this address he meant it to apply to Regina Olsen. But he came to see that it had a wider application. He had polemicized from his earliest writings against the press, and against cultural and political tendencies to “level” individuals into homogeneous masses. His term of loathing for the depersonalized, de-individualized instrument of leveling was “the crowd.” It corresponds to Nietzsche’s notion of “the herd” and to Heidegger’s notion of “das Man.” One subset of “the crowd” that especially attracted Kierkegaard’s ire was “the reading public.” This was the anonymous mass, consumer of the secondhand literary opinion of “reviewers.” Most reviewers, in Kierkegaard’s opinion, were hasty, ill-informed panderers to public opinion, so that reviewers and public fed off each other in a vicious circle. Reviews were even written without the reviewer having read the book, then circulated through gossip by “the reading public” as final judgment on the book. The anonymous circulation of public gossip is the antithesis of serious engagement with truth on a personal level.

Christianity addresses the single individual. Its truths, according to Kierkegaard, must be appropriated inwardly, seriously and with infinite passion. Just as we cannot die another’s death, we cannot live another’s faith. Existing inwardly in passion as an individual is a prerequisite for Christian faith. Having Christian faith is a prerequisite for understanding the edifying discourses. So the edifying discourses are addressed to each single individual. The pseudonymous works in the aesthetic authorship often have letters addressed to the reader too. But, as in the case of the letters of Constantine Constantius and Frater Taciturnus, they turn out to be soliloquies addressed to themselves more than direct, open communications to a reader posited as genuinely other.

4. The “Second Authorship”

a. Works of Love

Works of Love was written under Kierkegaard’s own name. Its subtitle places it within the genre of “Christian deliberations” – i.e. polemical weighings-up of Christian notions. It does not presuppose an existential understanding of Christian love, as it would were it an “edifying discourse,” but challenges the reader to open him or herself to the specifically Christian understanding of love. For a reader who understands love principally in terms of eros, the Christian notion of love as agape is counterintuitive. Whereas eros is a preferential feeling of desire, agape is a spiritual duty to serve the neighbor (without discrimination in terms of preference). Whereas eros is ultimately selfish, aimed at satisfying the lover’s desire, agape is selfless, requiring self-sacrifice. Whereas eros is often built on the visual objectification of the beloved, agape requires the individual to become “transparent” and “as nothing” before God. Whereas eros is typically a relation between two people, agape always involves God as the “third” in the relation.

Works of Love concentrates not so much on the understanding of love as such, but on the understanding of works of love. Love will be known as the fruit of these works of love. Since God is love, it can only be known through the existential commitment of Christian faith. This faith is only lived in the attempt to imitate the life of Christ. Christ’s life was itself God’s principal work of love for human beings. It is only through this work of love that we can know God as love. The only true work of love is helping someone else achieve autonomy through Christian love. But if that person sees that he or she was dependent on some other human being to achieve autonomy, that autonomy will be undone. The human author of a work of love must disappear in the act of love, so that only the love is perceived and only God is recognized as its author. This presents Kierkegaard with a difficult task in writing Works of Love. If it helps its readers achieve autonomy through an understanding of Christian love, and the readers recognize Kierkegaard to be the author, it will fail to be a work of love. Kierkegaard has to disappear as author in order for the book to function as a work of love. He resorts to the device of the dash [Tankestreg] to achieve his disappearance. He explicitly talks about this use of the dash during the course of Works of Love, and ends the penultimate section of the book with a dash (unfortunately omitted from the English translation). The conclusion that follows the dash is a presentation of the words of the Apostle John. As an Apostle, John presents the word of God. The word of God is a record of the life of Christ, which is God’s work of love. So God’s word is the work of love. Kierkegaard, by means of the dash, erases his ego as an author to allow the word of God to shine through – thereby preserving Works of Love as a work of love.

b. Anti-Climacus

Anti-Climacus is the pseudonymous author of two of Kierkegaard’s mature works: The Sickness Unto Death (1849) and Practice in Christianity (1850). As his name indicates, Anti-Climacus represents the antithesis of Johannes Climacus. As we have seen, Climacus derives his name from the monk who wrote Scala Paradisi, thereby embracing the idea that it is possible for human beings to ascend to heaven under their own power. The “aesthetic” authorship, culminating in Concluding Unscientific Postscript, explores a number of possible modes of scaling heaven – by means of erotic love, the Babel tower of aesthetic poetry, ethical works, or speculative reason. All are found wanting. Having established the absolute nature of transcendence through repeated parodies of these vain attempts in the aesthetic authorship, Kierkegaard proceeds to show through Anti-Climacus how various aesthetic concepts are transfigured from an ideal Christian point of view.

The central notions explored in The Sickness Unto Death are “despair” and “the self.” In this respect it is a Christian repetition of the central themes of The Concept of Anxiety, with “despair” supplanting “anxiety.” Both explore the task of becoming a self from the points of view of psychology and Christian faith. Both invoke sin as the greatest obstacle to becoming a self. Yet paradoxically, becoming conscious of sin is a prerequisite for faith and selfhood. Anti-Climacus distinguishes between “human being” and “self.” The human being is a synthesis, of infinite and finite, temporal and eternal, freedom and necessity, body and soul. The self, on the other hand, is the process of relating these elements of synthesis to one another. The self is the task of maintaining the proper equilibrium of the synthesis. But this task is beyond the capacity of a mere human being alone. Willing to be a self is itself a form of despair. Not willing to be a self is also a form of despair. Being unaware of the possibility of being a self is also a form of despair. The only antidote to despair is Christian faith. Faith provides the missing element in the synthesis, namely, an acknowledgement of God as the necessary underpinning of the self-relation. But to become aware of God, one first has to become aware of one’s absolute difference from God. This is the function of sin-consciousness. Sin-consciousness presupposes God-consciousness. The ultimate form of despair is despairing over one’s sin, and thereby failing to accept God’s forgiveness. Only through the movement of faith can God’s grace be received and accepted, thereby acknowledging God’s absolute alterity as well as our absolute dependence on God to be selves. Practice in Christianity complements The Sickness Unto Death thematically. It deals with the appropriate Christian response to divine grace, and with healing through penitence. But it also repeats some of the themes of Philosophical Fragments and Concluding Unscientific Postscript. In particular it revisits the themes of offense and the historical point of departure for eternal truth. The latter is explored under the rubric of becoming contemporary with the absolute. Christian faith is the only means for the immanent, temporal human being to have contact with the transcendent, eternal truth, since that faith consists in believing that Christ was the incarnation of God. That faith consists not merely in intellectual belief, but in willingness to imitate the life of Christ to the utmost of one’s powers. Anti-Climacus catalogues various ways in which we might take offense at someone claiming to be the “God-man.” In the process he discusses the necessity for God, as transcendent, to use a method of indirect communication. The God-man needs to be “incognito” – to arrive in the unrecognizable form of a servant. He needs to suffer, to be spurned, to avoid any possible direct revelation of His exalted status. Only by means of indirect communication, rather than by direct revelation, will the individual come to relate to the God-man through faith. The possibility of faith is the obverse of the possibility of offense. Offense is underscored by means of the Almighty’s lowly incognito and indirect method of communication.

c. The Attack on the Church

Kierkegaard came to think that perhaps indirect communication should be the exclusive provenance of the God-man. He came increasingly to regard his own indirection, and his love affair with language, to be demonic temptations. When the Bishop Primate of the Danish People’s Church, his father’s old pastor J.P. Mynster, died in January 1854, Kierkegaard felt free to attack the established church more directly and stridently. He had suppressed some critical and potentially offensive writings while Mynster was still alive. But he was precipitated into a full frontal attack when the new Bishop Primate, H.L. Martensen, Kierkegaard’s old rival, publicly described the late Mynster as “a witness to the truth.” Kierkegaard had respected Mynster as a pastor and a man, but found his administration of the church wanting. Mynster had steered the church into closer relations with the state, and had shored up the values of “Christendom” rather than “Christianity.” The former was a phenomenon of cultural history; the latter was the vehicle of passionate, inward individual faith. Given the leveling tendencies of “the present age,” Christendom as a cultural phenomenon was on a collision course with Christian faith. It threatened to replace “the single individual” with “the crowd” (under the guise of “the congregation”), struggle with mediation, revolution with reflection, and works of love with the welfare state. Worst, it threatened to usurp eternal truth with temporal gossip. Therefore, to call its chief spokesman a “witness to the truth” provoked an extreme reaction from Kierkegaard.

His discourses changed from gentle edifications to strident calls to arms. He moved from a position of “armed neutrality” with respect to church politics, to one of decisive intervention in “the instant.” “The Instant” [Øieblikket – literally ‘the glint of an eye’] was Kierkegaard’s final frenetic publication. The Concept of Anxiety had identified “the instant” as the point of intersection of time and eternity. It is the moment of decision, the moment of transfiguring vision, the moment of contemporaneity with Christ. It was also the moment to let go of indirect communication and to speak directly. “The Instant” was the name of a broadsheet Kierkegaard published to continue his attack on the state church. He published ten issues between its inception in May 1855 and the last in September 1855, when he collapsed and was admitted to hospital. But to speak directly, having spoken for so long indirectly, is not the same as the “objective” direct communication he originally resisted. It was not a direct communication about eternal truth, but a timely intervention in contemporary politics. It was a verbal act, rather than a measured contribution to literature. Another important part of the “second authorship” consists in the self-reflections Kierkegaard wrote on his own work as an author. In 1851 he published On My Work as an Author, but had also written several other works that were only published posthumously. These include The Point of View for my Work as an Author: A Report to History (1859), Armed Neutrality, or My Position as a Christian Author in Christendom (1880), and “Three Notes Concerning my Activity as an Author” (1859). He also withheld from publication The Book on Adler, an extended study of Adolph Adler, a prominent Hegelian and pastor in the Danish People’s Church. Adler claimed to have received divine revelation, but Kierkegaard’s analysis of his writings tries to demonstrate Adler’s confusion. Adler becomes, in Kierkegaard’s words, “a Satire on Hegelian Philosophy and the Present Age.” Kierkegaard also used Adler’s case to distinguish between “a genius” and “an apostle.” Another work, also published posthumously, was “The Ethical and Ethico-religious Dialectic of Communication” (1877). Kierkegaard agonized over whether to publish these direct communications about his own strategies of communication and how he saw his activity as an author. Of particular concern was how these direct writings would affect the complex dialectic of direct and indirect communications he had set up in his “authorships.” Ultimately he relied on the guidance of “Governance” [Styrelse] to decide whether or not to publish – much as Socrates had relied on the warnings of his daimonion about whether to engage people in philosophical cross-examination. Retrospectively, Kierkegaard regarded his activity as an author to have been under the direction of Governance. He had not had a clear view at the outset about the structure of his authorships, but had come to see that what he had been directed to write was what was required for a religious poet in the present age. He was a writer who overflowed with ideas – far too many to write down. Therefore Governance had to sit him down like a schoolboy, and make him write as though he were writing “a work assignment.” In much the same way as he disappeared under the dash in works of love, Kierkegaard “disappears” in these accounts of his own activity as a writer under the sign of “Governance.”

5. References and Further Reading

Kierkegaard’s Writings

Danish

  • Breve og Aktstykker vedrørende Søren Kierkegaaard, ed. Niels Thulstrup, Copenhagen: Munksgaard, 1953-4.
  • Søren Kierkegaards Papirer, ed. P.A. Heiberg, V. Kuhr & E. Torsting, second edition Niels Thulstrup, Copenhagen: Gyldendal, 1968-78.
  • Søren Kierkegaards Samlede Værker, ed. A.B. Drachmann, J.L. Heiberg & H.D. Lange, second edition, Copenhagen: Nordisk Forlag, 1920-36.
  • Søren Kierkegaards Skrifter, ed. N.J. Cappelørn, et.al., Copenhagen: Gad, 1997-.
  • English Kierkegaard’s Writings volumes 1-XXVI, ed. & trans. H.V. Hong, et.al. Princeton University Press: 1978-2000.

Commentary

  • Cappelørn, Niels Jørgen, Hermann Deuser, et.al. (eds), Kierkegaard Studies Yearbook 1996-, Berlin & New York: Walter de Gruyter, 1996-
  • Ferreira, M. Jamie, Love’s Grateful Striving: A Commentary on Kierkegaard’s Works of Love, Oxford University Press, 2001
  • Garff, Joakim, SAK: Søren Aabye Kierkegaard: en biografi, Copenhagen: Gad, 2000
  • Hannay, Alastair, Kierkegaard: A Biography, Cambridge University Press, 2001
  • Hannay, Alastair & Gordon Marino (eds), The Cambridge Companion to Kierkegaard, Cambridge University Press, 1998
  • Kirmmse, Bruce, Encounters With Kierkegaard, Princeton University Press, 1996
  • Kirmmse, Bruce, Kierkegaard in Golden Age Denmark, Bloomington: Indiana University Press, 1990
  • Mackey, Louis, Points of View: Readings of Kierkegaard, Tallahassee: Florida State University Press, 1986
  • Malantschuk, Gregor, Kierkegaard’s Thought, ed. & trans. H.V. Hong & E.H. Hong, Princeton University Press, 1971
  • Pattison, George, Kierkegaard: The Aesthetic and the Religious, New York: St. Martin’s Press, 1992
  • Perkins, Robert L (ed.), International Kierkegaard Commentary, Macon: Mercer University Press
    • This is a series of anthologies of essays, with each volume designed to accompany the volumes comprising Kierkegaard’s Writings, op.cit.

Author Information

William McDonald
Email: wmcdonal@metz.une.edu.au
University of New England
Australia

Nasir Khusraw (1004—1060)

Abu Mo’in Hamid al-Din Nasir ibn Khusraw is an important figure in the development of Ismaili philosophy. Much of his biography and philosophical ideology has been obtained through fragmented texts, both in poetry and prose.  Born into a politically connected family, Khusraw was well-educated and in the sciences and humanities.  Having spent most of his life occupying prestigious positions within the Sajuq court, Khusraw converted to the Ismaili faith at the age of forty after careful study.  He spent the rest of his life writing and advocating for the Ismaili faith, and eventually was forced into exile by Sunni authorities.

Consistent with other Ismaili philosopher, Khusraw’s cosmology is heavily inspired by Neoplatonism.  His metaphysics describes a God from which everything emanates and consistently strives back towards.  Through God, existence is cast into being through Universal Soul and Universal Intellect.  Each of these concepts provides the foundation for material objects, ascending from minerals to human beings.  Within each human being exists a soul and intellect, imperfect in form but existing within the Universals.   Khusraw interweaves his metaphysics within the Shi’i doctrine, requiring a divinely inspired guide to assist us in our journey to reconnect with Universal Intellect and Soul.  In holding to this cosmogonic description, Khusraw distinguishes his philosophy from previous Ismaili thought introduced by al-Farabi and picked up by Ibn Sina and al-Kirmani.

Table of Contents

  1. Life
  2. Philosophy
  3. References and Further Reading

1. Life

In striking contrast to other Ismaili writers of the time (s.v., Hamid ai-din al Kirmani; Abu Ya‘qub al-Sijistani), many sources of information exist pertaining to Khusraw’s life.   Documentation was recorded,  with vary degrees of accuracy, by Khusraw himself, a (hostile) contemporary, and by later historians.  Since his death, Khusraw has been included in every major literary or historical survey of Ismailism.  Khusraw’s life can be divided into four periods: his early years up to the age of forty (discernible from fragments of various texts); his conversion to Ismailism (of which he has left two different versions in the form of prose and poetry); his seven-year journey (documented in Safarnama); and his years of preaching followed by persecution and exile (drawn primarily from his poetry, but also a few statements in his philosophical works).

In 1004, Abu Mo’in Hamid al-Din Nasir ibn Khusraw was born in Qobadiyan, the district of Marv, in the eastern Iranian province of Khurasan. Along with two of his brothers, Khusraw occupied a high position in the administrative ranks of the Saljuq court – reportedly in the revenue department.  Evidence also suggests that he was familiar with the court of previous dynasty, the Ghaznavids.  Based on the quality of his writings, he received an excellent education in the sciences, literatures and philosophies of his time, including the study of Greek and Neoplatonic philosophy.  In his writing, Khusraw reportes examining the doctrines of the different Islamic schools and not being satisfied until he found and understood the Ismaili faith.  As a result of his conversion to Ismailism he embarked on a seven-year journey, during which time he spent three years in the Ismaili court in Cairo under the Fatimid caliph, al-Mustansir (1029-1094). The Fatimid dynasty (909-1171) aimed at creating an Islamic state based on Ismaili tenets, and thus presented a direct theological and military challenge to the Sunni ‘Abbasid caliphate based in Baghdad. Khusraw left Cairo as the head (hujjat) of Ismaili missionary activities in his home province of Khurasan.  After leaving Cairo, Khusraw was forced into exile by the Sunni authorities.  He spent the rest of his life exiled in the Pamir Mountains in Badakhshan, located in modern-day Tajikistan and Afghanistan.

2. Philosophy

Khusraw’s philosophical works reveal a strong Neoplatonic structure and vocabulary.  For example, his cosmogony closely follows Plotinus, moving from God and God’s word (logos) to Intellect, Soul, and the world of Nature.  Underlying each of the Ismaili cosmogonic systems is a fundamental division of the world into two realms, the esoteric (batin) and the exoteric (zahir).  From this division, everything in the physical world points to its counterpart in the spiritual, which is seen as its source, or true form.  The cosmogonic structure itself reveals a purposeful, providential unfolding from the spiritual realm into the physical world.  Conversely, as a reflection, the physical world seeks to grasp the spiritual realm and comprehend it.    In holding to this cosmogonic description, Khusraw follows his fellow Ismailis (Nasafi and al-Sijistani) while differentiating his theory from the structure introduced by al-Farabi and later adopted by Ibn Sina and the Ismaili philosopher al-Kirmani.

Khusraw begins with a discussion of tawhid (oneness, God’s unity), the clear understanding of which is the only way to achieve spiritual perfection. For Nasir, God Himself is indescribable beyond all categories of being and non-being (nothing which has an opposite can be ascribed to Him, since that would be limiting Him to human concepts).   However, from God emerges his Word (kalmia), ‘Be!’, which brings into existence Universal Intellect, perfect in potentiality and actuality.  Universal Intellect transcends time and space,  containing all being within itself.  Universal Intellect enjoys a worshipful intimacy with God and derives perfection from this intimacy.  From this worship emerges Universal Soul, perfect in potentiality but not in actuality because it is separated from God by Intellect.  Universal Soul recognizes its separation from God, and moves closer to God in a desire for the perfection enjoyed by Intellect.  Through its search for perfection, Universal Soul introduces the first movement into the entire structure, manifest in time and space.

The entire cosmos is set into motion through the movement of Universal Soul.  As a corollary, being is differentiated into two sets of opposites:  hot and cold, wet and dry.  Derived from these sets of opposites are the four elements: earth, air, fire, and water.  From these four elements arise the successive development of   minerals, plants, and animals.  Finally, as the summit of physical creation, human beings arise.  Within each human being exists an individual intellect and individual soul manifesting the same characteristics (but on a smaller level) as the universals.  In fact, the entire cosmos is formed on a matrix of Intellect and Soul; everything within the cosmos displays original intelligence and the search for perfection exhibited by the soul.

Khusraw’s ethics grow from and reflect this cosmogony. Each individual’s task is to recognize his or her own imperfections and then move to correct them, seeking the closest relationship possible with God.  For Khusraw, this is achieved by stringent and repeated application of the intellect to both physical and spiritual matters.  In order to correct these imperfections a believer must find a guide and study dilligently, perform all required religious acts with a full understanding, and supplement new understanding with higher levels of worldly activity.  As an Ismaili, Khusraw held the Shi‘i doctrine that God would not send a revelation without a guide to interpret it.  For the Ismailis, this guide must be a living person, the Imam of the Time.  As a living bridge between the two realms, this person must be divinely inspired, infallible, and perfectly capable of providing guidance in spiritual and worldly affairs.

3. References and Further Reading

The following sources elucidate Khusraw’s philosophy:

  • H. Corbin, “Nasir-i Khusrau and Iranian Ismailism,” in The Cambridge History of Iran: Volume 4, ed., R. N. Frye (Cambridge 1975), pp. 520-42 and 689-90;
  • A. Hunsberger, “Nasir Khusraw: Fatimid Intellectual,” in F. Daftary, ed., Intellectual Traditions in Islam (London 2000), pp. 112-29;
  • A. Hunsberger, Nasir Khusraw’s Doctrine of the Soul: From the Universal Intellect to the Physical World in Ismaili Philosophy, PhD thesis, Columbia University, New York, 1992;
  • S. Meskoob, Shahrokh, “The Origin and Meaning of ‘Aql (Reason) in the View of Nasir Khusraw,” Iran Nameh, 6 (1989), pp. 239-57, and 7 (1989), pp. 405-29.

For a full bibliography of Nasir Khusraw’s works and ideas, see:

  • A. C. Hunsberger, Nasir Khusraw, the Ruby of Badakhshan: A Portrait of the Persian Poet, Traveller and Philosopher (London 2000).

For works still in manuscript, see:

  • I. K. Poonawala, Bibibliography of Ismaili Literature, Malibu, Calif., 1977, p. 123.

Author Information

Alice C. Hunsberger
Email: info@iis.ac.uk
Institute of Ismaili Studies
United Kingdom

Immanuel Kant: Aesthetics

kant2Immanuel Kant is an 18th century German philosopher whose work initated dramatic changes in the fields of epistemology, metaphysics, ethics, aesthetics, and teleology. Like many Enlightenment thinkers, he holds our mental faculty of reason in high esteem; he believes that it is our reason that invests the world we experience with structure. In his works on aesthetics and teleology, he argues that it is our faculty of judgment that enables us to have experience of beauty and grasp those experiences as part of an ordered, natural world with purpose. After the Introduction, each of the above sections commences with a summary. These will give the reader an idea of what topics are discussed in more detail in each section. They can also be read together to form a brief bird’s-eye-view of Kant’s theory of aesthetics and teleology.

Kant believes he can show that aesthetic judgment is not fundamentally different from ordinary theoretical cognition of nature, and he believes he can show that aesthetic judgment has a deep similarity to moral judgment. For these two reasons, Kant claims he can demonstrate that the physical and moral universes – and the philosophies and forms of thought that present them – are not only compatible, but unified.

Table of Contents

  1. Introduction
    1. Kant’s Life and Works
    2. The Central Problems of the Critique of Judgment
  2. Kant’s Aesthetics
    1. The Judgment of the Beautiful
    2. The Deduction of Taste
    3. The Sublime
    4. Fine Art and Genius
    5. Idealism, Morality and the Supersensible
  3. Kant’s Teleology
    1. Objective Purposiveness and Science
    2. ‘The Peculiarity of the Human Understanding’
    3. The Final Purpose and Kant’s Moral Argument for the Existence of God
  4. The Problem of the Unity of Philosophy and its Supersensible Objects
  5. References and Further Reading
    1. Works by Kant
    2. Other Primary and Secondary Works

1. Introduction

a. Kant’s Life and Works

Immanuel Kant is often said to have been the greatest philosopher since the Greeks. Certainly, he dominates the last two hundred years in the sense that – although few philosophers today are strictly speaking Kantians – his influence is everywhere. Moreover, that influence extends over a number of different philosophical regions: epistemology, metaphysics, aesthetics, ethics, politics, religion. Because of Kant’s huge importance, and the variety of his contributions and influences, this encyclopedia entry is divided into a number of subsections. What follows here will be a brief account of Kant’s life and works, followed by an overview of those themes that Kant felt bridged his philosophical works, and made them into one ‘critical philosophy’.

Kant was born in Königsberg, Prussia (now Kalingrad in Russia) in 1724 to Pietist Lutheran parents. His early education first at a Pietist school and then at the University of Königsberg was in theology, but he soon became attracted by problems in physics, and especially the work of Isaac Newton. In 1746 financial difficulties forced him to withdraw from the University. After nine years supporting himself as a tutor to the children of several wealthy families in outlying districts, he returned to the University, finishing his degree and entering academic life, though at first (and for many years) in the modest capacity of a lecturer. (Only in 1770 was he given a University chair in logic and metaphysics at Königsberg.) He continued to work and lecture on, and publish widely, on a great variety of issues, but especially on physics and on the metaphysical issues behind physics and mathematics. He rarely left his home city, and gradually became a celebrity there for his brilliant, witty but eccentric character.

Kant’s early work was in the tradition (although not dogmatically even then) of the great German rationalist philosopher Leibniz, and especially his follower Wolff. But by the 1760s, he was increasingly admiring Leibniz’s great rival Newton, and was coming under the additional influences of the empiricist skepticism of Hume and the ethical and political thought of Rousseau. In this period he produced a series of works attacking Leibnizian thought. In particular, he now argued that the traditional tools of philosophy – logic and metaphysics – had to be understood to be severely limited with respect to obtaining knowledge of reality. (Similar, apparently skeptical, claims were relatively common in the Enlightenment.)

It was only in the late 1760s, and especially in his Inaugural Dissertation of 1770 that Kant began to move towards the ideas that would make him famous and change the face of philosophy. In the Dissertation, he argued for three key new ideas: first, that sensible and conceptual presentations of the world (for example, my seeing three horses, and my concept of three) must be understood to be two quite distinct sources of possible knowledge. Second, it follows that knowledge of sensible reality is only possible if the necessary concepts (such as substance) are already available to the intellect. This fact, Kant argued, also limits the legitimate range of application of these concepts. Finally, Kant claimed that sensible presentations were of only appearances’, and not things as they are in themselves. This was because space and time, which describe the basic structure of all sensible appearances, are not existent in things in themselves, but are only a product of our organs of sense. Perceiving things in space and time is a function of the mind of the perceiver. The hypothesis that both key concepts, and the basic structure of space and time, are a priori in the mind, is a basic theme of Kant’s idealism (see the entry on ‘Kant’s Metaphysics’). It is important to recognize that this last claim about space and time also exacerbates the limitation imposed above by proposing a whole realm of ‘noumena’ or ‘things in themselves’ which necessarily lies beyond knowledge in any ordinary sense. These new and often startling ideas, with a few important modifications, would form the basis of his philosophical project for the rest of his life.

After publishing quite often in the preceding 15 years, the Dissertation ushered in an apparently quiet phase in Kant’s work. Kant realized that he had discovered a new way of thinking. He now needed rigorous demonstrations of his new ideas, and had to pursue their furthest implications. He even needed to find a new philosophical language to properly express such original thoughts! This took more than a decade of his life. Except for a remarkable set of correspondence during this period, Kant published nothing until the massive first edition of the Critique of Pure Reason, in 1781 (revised second edition, 1787).

Over the next two decades, however, he furiously pursued his new philosophy into different territories, producing books or shorter publications on virtually every philosophical topic under the sun. This new philosophy came to be known as ‘critical’ or ‘transcendental’ philosophy. Of particular importance were the so called three Critiques: The Critique of Pure Reason (1781/1787), Critique of Practical Reason (1788), and the Critique of Judgment (1790). Kant quickly became famous in the German speaking world, and soon thereafter elsewhere. This fame did not mean universal praise, however. Kant’s work was feverishly debated in all circles – his work on religion and politics was even censored. And by the time of his death in 1804, philosophers such as Fichte, Schelling and the Hegel were already striking out in new philosophical directions. Directions, however, that would have been unthinkable without Kant.

b. The Central Problems of the Critique of Judgment

Kant’s Critique of Judgment (the third Critique) was and continues to be a surprise – even to Kant, for it emerged out of Kant’s philosophical activity having not been a part of the original plan. (For an account of Kant’s first two Critiques, please see the entry on ‘Kant’s Metaphysics’.) Some philosophers have even claimed that it is the product of the onset of senility in Kant. After initial enthusiasm during the romantic period, the book was relatively ignored until work such as Cassirer’s in the early 20th Century. Especially in the last few decades, however, the Critique of Judgment is being increasingly seen as a major and profound work in Kant’s output.

Part of the surprise lies in the diversity of topics Kant deals with. For much of the previous two centuries the book was read – and it still is largely read in this way – as a book about aesthetics (the philosophy of the beautiful and the sublime). In fact this type of reading by no means adequately reflects Kant’s explicit themes, and is forced to ignore much of the text. Here, we shall try to sketch out the range of topics and purposes (including aesthetics) Kant gives to his third Critique.

There are several commonly available translations of the Critique of Judgment. Here, we will use Werner S. Pluhar’s (Hackett, 1987), but will make reference alternative translations of key terms, especially as found in the widely used James Creed Meredith translation. To facilitate the use of the variety of available editions, passages in Kant’s text will be indicated by section number, rather than page number.

The basic, explicit purpose of Kant’s Critique of Judgment is to investigate whether the ‘power’ (also translated as ‘faculty’ – and we will use the latter here) of judgment provides itself with an priori principle. In earlier work, Kant had pretty much assumed that judgment was simply a name for the combined operation of other, more fundamental, mental faculties. Now, Kant has been led to speculate that the operation of judgment might be organized and directed by a fundamental a priori principle that is unique to it. The third Critique sets out to explore the validity and implications of such a hypothesis.

In the third Critique, Kant’s account of judgment begins with the definition of judgment as the subsumption of a particular under a universal (Introduction IV). If, in general, the faculty of understanding is that which supplies concepts (universals), and reason is that which draws inferences (constructs syllogisms, for example), then judgment ‘mediates’ between the understanding and reason by allowing individual acts of subsumption to occur (cf. e.g. Introduction III). This leads Kant to a further distinction between determinate and reflective judgments (Introduction IV). In the former, the concept is sufficient to determine the particular – meaning that the concept contains sufficient information for the identification of any particular instance of it. In such a case, judgment’s work is fairly straightforward (and Kant felt he had dealt adequately with such judgments in the Critique of Pure Reason). Thus the latter (where the judgment has to proceed without a concept, sometimes in order to form a new concept) forms the greater philosophical problem here. How could a judgment take place without a prior concept? How are new concepts formed? And are there judgments that neither begin nor end with determinate concepts? This explains why a book about judgment should have so much to say about aesthetics: Kant takes aesthetic judgments to be a particularly interesting form of reflective judgments.

As we shall see, the second half of Kant’s book deals with teleological judgments. Broadly speaking, a teleological judgment concerns an object the possibility of which can only be understood from the point of view of its purpose. Kant will claim that teleological judgments are also reflective, but in a different way – that is, having a different indeterminacy with respect to the concepts typical of natural science.

Reflective judgments are important for Kant because they involve the judgment doing a job for itself, rather than being a mere co-ordinator of concepts and intuitions; thus, reflective judgments might be the best place to search for judgment’s a priori legislating principle. The principle in question (if it exists), Kant claims, would assert the suitability of all nature for our faculty of judgment in general. (In the narrower case of determinate judgments, Kant believes he has demonstrated the necessity of this ‘suitability’ – please see the entry on ‘Kant’s Metaphysics’.) This general suitability Kant calls the finality or purposiveness/ purposefulness of nature for the purposes of our judgment. Kant offers a number of arguments to prove the existence and validity of this principle. First, he suggests that without such a principle, science (as a systematic, orderly and unified conception of nature) would not be possible. All science must assume the availability of its object for our ability to judge it. (A similar argument is used by Kant in the Critique of Pure Reason in discussing the regulative role of rational ideas (see A642-668=B670-696)). Second, without such a principle our judgments about beauty would not exhibit the communicability, or tendency to universality even in the absence of a concept, that they do. It is this second argument that dominates the first half of the Critique of Judgment.

As we shall see, Kant uses the particular investigation into judgments about art, beauty and the sublime partly as a way of illuminating judgment in general. Aesthetic judgments exhibit in an exemplary fashion precisely those features of judgment in general which allow one to explore the transcendental principles of judgment. But Kant has still higher concerns. The whole problem of judgment is important because judgment, Kant believes, forms the mediating link between the two great branches of philosophical inquiry (the theoretical and the practical). It had been noted before (for example, by Hume) that there seems to be a vast difference between what is, and what ought to be. Kant notes that these two philosophical branches have completely different topics, but these topics, paradoxically, have as their object the very same sensible nature. Theoretical philosophy has as its topic the cognition of sensible nature; practical philosophy has as its topic the possibility of moral action in and on sensible nature.

This problem had arisen before in Kant’s work, in the famous Antinomies in both the first and second Critiques. A key version of the problem Kant poses in the Antinomies concerns freedom: how can nature be both determined according to the laws of science, and yet have ‘room’ for the freedom necessary in order for morality to have any meaning? Ultimately, for Kant this would be a conflict of our faculty of reason against itself. For, in its theoretical employment, reason absolutely demands the subjection of all objects to law; but in its practical (moral) employment, reason equally demands the possibility of freedom. The problem is solved by returning to the idealism we discussed in previous section of the introduction. Every object has to be conceived in a two-fold manner: first as an appearance, subject to the necessary jurisdiction of certain basic concepts (the Categories) and to the forms of space and time; second, as a thing in itself, about which nothing more can be said. Even if appearances are rigorously law-governed, it is still possible that things in themselves can act freely. Nevertheless, although this solution eliminates the conflict, it does not actually unify the two sides of reason, nor the two objects (what is and what ought) of reason.

Judgment seems to relate to both sides, however, and thus (Kant speculates) can form the third thing that allows philosophy to be a single, unified discipline. Kant thus believes that judgment may be the mediating link that can unify the whole of philosophy, and correlatively, also the link that discovers the unity among the objects and activities of philosophy. Unfortunately, Kant never makes explicit exactly how the bulk of his third Critique is supposed to solve this problem; understandably, it is thus often ignored by readers of Kant’s text. Thus, the central problem of the Critique of Judgment is a broad one: the unity of philosophy in general. This problem is investigated by that mental faculty which Kant believes is the key to this unity, namely judgment. And judgment is investigated by the critical inquiry into those types of judgment in which the a priori principle of judgment is apparent: on the beautiful, on the sublime, and on teleology. We shall return to the grand issue of the unity of philosophy at the end of this article.

The various themes of the Critique of Judgment have been enormously influential in the two centuries since its publication. The accounts of genius, and of the significance of imagination in aesthetics, for example, became basic pillars of Romanticism in the early 19th Century. The formalism of Kant’s aesthetics in general inspired two generations of formalist aesthetics, in the first half of the 20th Century; the connection between judgment and political or moral communities has been similarly influential from Schiller onwards, and was the main subject of Hanna Arendt’s last, uncompleted, project; and Kant’s treatment of the sublime has been a principle object of study by several recent philosophers, such as J.-F. Lyotard. Kant’s discussion, in the second half of the book, of the distinction between the intellectus ectypus and the intellectus archetypus was an extremely important in the decades immediately after Kant in the development of German Idealism. And his moral proof for the existence of God is often ranked alongside the great arguments of Anselm and Aquinas.

The following entry is divided into two sections, which correspond for the most part to the major division of Kant’s book between the ‘Critique of Aesthetic Judgment’ and the ‘Critique of Teleological Judgment’. Part A deals with Kant’s account of beauty, the sublime, and fine art. In the first two of these subjects, Kant’s concern is with what features an aesthetic judgment exhibits, how such a judgment is possible, and is there any transcendental guarantee of the validity of such a judgment. The treatment of fine art shifts the focus onto the conditions of possibility of the production of works of art. Part B deals with Kant’s account of teleological judgment, and its relation to the natural science of biology. However, if the discussion above of the ‘Central Problems’ of the Critique of Judgment is correct, a major part of Kant’s interest is less in these particular analyses, than in their broader implications for e.g. morality, the nature of human thought, our belief in the existence of God, and ultimately for the unity of philosophy itself. We will be dealing with these implications throughout, but especially in sections A5, B2, B3 and B4.

2. Kant’s Aesthetics

a. The Judgment of the Beautiful

Overview: The Critique of Judgment begins with an account of beauty. The initial issue is: what kind of judgment is it that results in our saying, for example, ‘That is a beautiful sunset’. Kant argues that such aesthetic judgments (or ‘judgments of taste’) must have four key distinguishing features. First, they are disinterested, meaning that we take pleasure in something because we judge it beautiful, rather than judging it beautiful because we find it pleasurable. The latter type of judgment would be more like a judgment of the ‘agreeable’, as when I say ‘I like doughnuts’.

Second and third, such judgments are both universal and necessary. This means roughly that it is an intrinsic part of the activity of such a judgment to expect others to agree with us. Although we may say ‘beauty is in the eye of the beholder’, that is not how we act. Instead, we debate and argue about our aesthetic judgments – and especially about works of art -and we tend to believe that such debates and arguments can actually achieve something. Indeed, for many purposes, ‘beauty’ behaves as if it were a real property of an object, like its weight or chemical composition. But Kant insists that universality and necessity are in fact a product of features of the human mind (Kant calls these features ‘common sense’), and that there is no objective property of a thing that makes it beautiful.

Fourth, through aesthetic judgments, beautiful objects appear to be ‘purposive without purpose’ (sometimes translated as ‘final without end’). An object’s purpose is the concept according to which it was made (the concept of a vegetable soup in the mind of the cook, for example); an object is purposive if it appears to have such a purpose; if, in other words, it appears to have been made or designed. But it is part of the experience of beautiful objects, Kant argues, that they should affect us as if they had a purpose, although no particular purpose can be found.

Having identified the major features of aesthetic judgments, Kant then needs to ask the question of how such judgments are possible, and are such judgments in any way valid (that is, are they really universal and necessary).

It is useful to see the aesthetics here, as with Kant’s epistemology and to a certain extent his ethics also, as being a leap over the terms of the debate between British (and largely empiricist) philosophy of art and beauty (Shaftesbury, Hutcheson, Hume and Burke) and Continental rationalist aesthetics (especially Baumgarten, who invented the modern use of the term aesthetics’ in the mid-18th century). The key ideas of the former group were (i) the idea of a definite human nature, such that studies of beauty could, within limits, be universal in scope; (ii) the assertion that beautiful objects and our responses to them were essentially involved in sense or feeling, and were not cognitive; (iii) that any ‘natural’ responses to beauty were generally overlaid by individual and communal experiences, habits and customs. The main disagreement with rationalist thought on the matter was in the second of these ideas. Baumgarten, following Leibniz, argued that all sense perception was merely ‘confused’ cognition, or cognition by way of sensible images. Thus, although beauty certainly appears to our senses, this by no means demonstrates that beauty is non-cognitive! Beauty, for Baumgarten, has more to do with rational ideas such as harmony, rather than with the physiological.

Kant asserted the basic distinction between intuitive or sensible presentations on the one hand, and the conceptual or rational on the other. (See ‘Kant’s Transcendental Idealism’ in the article on ‘Kant’s Metaphysics’.) Therefore, despite his great admiration for Baumgarten, it is impossible for Kant to agree with Baumgarten’s account of aesthetic experience. (By ‘aesthetic’ here we mean in Baumgarten’s sense of a philosophy of the beautiful and related notions, and not in Kant’s original usage of the term in the Critique of Pure Reason to mean the domain of sensibility.) In addition, Kant holds that aesthetic experience, like natural experience leading to determinate judgments, is inexplicable without both an intuitive and a conceptual dimension. Thus, for example, beauty is also by no means non-cognitive, as the British tradition had held.

Thus, Kant begins to analyze the experience of beauty, in order to ask as precisely as possible the question ‘how are judgments about beauty possible’. Kant’s initial focus is on judgments about beauty in nature, as when we call a flower, a sunset, or an animal ‘beautiful’. What, at bottom, does such a judgment mean, and how does it take place as a mental act? In order to begin to answer these questions, Kant needs to clarify the basic features of such judgments. On Kant’s analysis, aesthetic judgments are still more strange even than ordinary reflective judgments, and must have a number of peculiar features which at first sight look like nothing other than paradoxes. We will now describe those features using Kant’s conceptual language.

Taking up roughly the first fifth of the Critique of Judgment, Kant discusses four particular unique features of aesthetic judgments on the beautiful (he subsequently deals with the sublime). These he calls ‘moments’, and they are structured in often obscure ways according to the main divisions of Kant’s table of categories (See article on Kant’s Metaphysics).

The First Moment. Aesthetic judgments are disinterested. There are two types of interest: by way of sensations in the agreeable, and by way of concepts in the good. Only aesthetic judgment is free or pure of any such interests. Interest is defined as a link to real desire and action, and thus also to a determining connection to the real existence of the object. In the aesthetic judgment per se, the real existence of the beautiful object is quite irrelevant. Certainly, I may wish to own the beautiful painting, or at least a copy of it, because I derive pleasure from it – but that pleasure, and thus that desire, is distinct from and parasitic upon the aesthetic judgment (see sect;9). The judgment results in pleasure, rather than pleasure resulting in judgment. Kant accordingly and famously claims that the aesthetic judgment must concern itself only with form (shape, arrangement, rhythm, etc.) in the object presented, not sensible content (color, tone, etc.), since the latter has a deep connection to the agreeable, and thus to interest. Kant is thus the founder of all formalism in aesthetics in modern philosophy. This claim of the disinterestedness of all aesthetic judgments is perhaps the most often attacked by subsequent philosophy, especially as it is extended to include fine art as well as nature. To pick three examples, Kant’s argument is rejected by those (Nietzsche, Freud) for whom all art must always be understood as related to will; by those for whom all art (as a cultural production) must be political in some sense (Marxism); by those for whom all art is a question of affective response expressionists).

The Second Moment. Aesthetic judgments behave universally, that is, involve an expectation or claim on the agreement of others – just ‘as if’ beauty were a real property of the object judged. If I judge a certain landscape to be beautiful then, although I may be perfectly aware that all kinds of other factors might enter in to make particular people in fact disagree with me, never-the-less I at least implicitly demand universality in the name of taste. The way that my aesthetic judgments ‘behave’ is key evidence here: that is, I tend to see disagreement as involving error somewhere, rather than agreement as involving mere coincidence. This universality is distinguished first from the mere subjectivity of judgments such as ‘I like honey’ (because that is not at all universal, nor do we expect it to be); and second from the strict objectivity of judgments such as ‘honey contains sugar and is sweet’, because the aesthetic judgment must, somehow, be universal ‘apart from a concept’ (sect;9). Being reflective judgments, aesthetic judgments of taste have no adequate concept (at least to begin with), and therefore can only behave as if they were objective. Kant is quite aware that he is flying in the face of contemporary (then and now!) truisms such as ‘beauty is in the eye of the beholder’. Such a belief, he argues, first of all can not account for our experience of beauty itself, insofar as the tendency is always to see ‘beauty’ as if it were somehow in the object or the immediate experience of the object. Second, Kant argues that such a relativist view can not account for the social ‘behavior’ of our claims about what we find beautiful. In order to explore the implications of ‘apart from a concept’, Kant introduces the idea of the ‘free play’ of the cognitive faculties (here: understanding and imagination), and the related idea of communicability. In the case of the judgment of the beautiful, these faculties no longer simply work together (as they do in ordinary sensible cognition) but rather each ‘furthers’ or ‘quickens’ the other in a kind of self-contained and self-perpetuating cascade of thought and feeling. We will return to these notions below.

The Third Moment. The third introduces the problem of purpose and purposiveness (also translated ‘end’ and ‘finality’). An object’s purpose is the concept according to which it was manufactured; purposiveness, then, is the property of at least appearing to have been manufactured or designed. Kant claims that the beautiful has to be understood as purposive, but without any definite purpose. A ‘definite purpose’ would be either the set of external purposes (what the thing was meant to do or accomplish), or the internal purpose (what the thing was simply meant to be like). In the former case, the success of the process of making is judged according to utility; in the latter, according to perfection. Kant argues that beauty is equivalent neither to utility nor perfection, but is still purposive. Beauty in nature, then, will appear as purposive with respect to our faculty of judgment, but its beauty will have no ascertainable purpose – that is, it is not purposive with respect to determinate cognition. Indeed, this is why beauty is pleasurable since, Kant argues, pleasure is defined as a feeling that arises on the achievement of a purpose, or at least the recognition of a purposiveness (Introduction, VI).

The purposiveness of art is more complicated. Although such works may have had purposes behind their production (the artist wished to express a certain mood, or communicate a certain idea), nevertheless, these can not be sufficient for the object to be beautiful. As judges of art, any such knowledge we do have about these real purposes can inform the judgment as background, but must be abstracted from to form the aesthetic judgment properly. It is not just that the purpose for the beauty of the beautiful happens to be unknown, but that it cannot be known. Still, we are left with the problem of understanding how a thing can be purposive, without having a definite purpose.

The Fourth Moment. Here, Kant is attempting to show that aesthetic judgments must pass the test of being ‘necessary’, which effectively means, ‘according to principle’. Everyone must assent to my judgment, because it follows from this principle. But this necessity is of a peculiar sort: it is ‘exemplary’ and ‘conditioned’. By exemplary, Kant means that the judgment does not either follow or produce a determining concept of beauty, but exhausts itself in being exemplary precisely of an aesthetic judgment. With the notion of condition, Kant reaches the core of the matter. He is asking: what is it that the necessity of the judgment is grounded upon; that is, what does it say about those who judge?

Kant calls the ground ‘common sense’, by which he means the a priori principle of our taste, that is of our feeling for the beautiful. (Note: by ‘common sense’ is not meant being intelligent about everyday things, as in: ‘For a busy restaurant, it’s just common sense to reserve a table in advance.’) In theoretical cognition of nature, the universal communicability of a representation, its objectivity, and its basis in a priori principles are all related. Similarly, Kant wants to claim that the universal communicability, the exemplary necessity and the basis in an a priori principle are all different ways of understanding the same subjective condition of possibility of aesthetic judgment that he calls common sense. (As we shall see, on the side of the beautiful object, this subjective principle corresponds to the principle of the purposiveness of nature.) Thus Kant can even claim that all four Moments of the Beautiful are summed up in the idea of ‘common sense’ (CJ sect.22). Kant also suggests that common sense in turn depends upon or is perhaps identical with the same faculties as ordinary cognition, that is, those features of humans which (as Kant showed in the Critique of Pure Reason) make possible natural, determinative experience. Here, however, the faculties are merely in a harmony rather than forming determinate cognition.

b. The Deduction of Taste

Overview: There are two aspects to Kant’s basic answer to the question of how aesthetic judgments happen. First, some of Kant’s earlier work seemed to suggest that our faculty or ability to judge consisted of being a mere processor of other, much more fundamental mental presentations. These were concepts and intuitions (‘intuition’ being Kant’s word for our immediate sensible experiences – see entry on ‘Kant’s Metaphysics’). Everything interesting and fundamental happened in the formation of concepts, or in the receiving of intuitions. But now Kant argues that judgment itself, as a faculty, has an fundamental principle that governs it. This principle asserts the purposiveness of all phenomena with respect to our judgment. In other words, it assumes in advance that everything we experience can be tackled by our powers of judgment. Normally, we don’t even notice that this assumption is being made, we just apply concepts, and be done with it. But in the case of the beautiful, we do notice. This is because the beautiful draws particular attention to its purposiveness; but also because the beautiful has no concept of a purpose available, so that we cannot just apply a concept and be done with it. Instead, the beautiful forces us to grope for concepts that we can never find. And yet, nevertheless, the beautiful is not an alien and disturbing experience – on the contrary, it is pleasurable. The principle of purposiveness is satisfied, but in a new and unique way.

Asking what this new and unique way is takes us to the second aspect. Kant argues that the kinds of ‘cognition’ (i.e. thinking) characteristic of the contemplation of the beautiful are not, in fact, all that different from ordinary cognition about things in the world. The faculties of the mind are the same: the ‘understanding’ which is responsible for concepts, and the ‘sensibility’ (including our imagination) which is responsible for intuitions. The difference between ordinary and aesthetic cognition is that in the latter case, there is no one ‘determinate’ concept that pins down an intuition. Instead, intuition is allowed some ‘free play’, and rather than being subject to one concept, it instead acts in ‘harmony’ with the lawfulness in general of the understanding. It is this ability of judgment to bring sensibility and understanding to a mutually reinforcing harmony that Kant calls ‘common sense’. This account of common sense explains how the beautiful can be purposive with respect to our ability to judge, and yet have no definite purpose. Kant believes common sense also answers the question of why aesthetic judgments are valid: since aesthetic judgments are a perfectly normal function of the same faculties of cognition involved in ordinary cognition, they will have the same universal validity as such ordinary acts of cognition.

The idea of a harmony between or among the faculties of cognition is turning out to be the key idea. For such a harmony, Kant claims, will be purposive, but without purpose. Moreover, it will be both universal and necessary, because based upon universal common sense, or again, because related to the same cognitive faculties which enable any and all knowledge and experience. Lastly, because of the self-contained nature of this harmony, it must be disinterested. So, what does Kant think is going on in such ‘harmony’, or in common sense for that matter, and does he have any arguments which make of these idea more than mere metaphors for beauty?

Up to now, we have had no decent argument for the existence of common sense as a principle of taste. At best, common sense was plausible as a possible explanation of, for example, the tendency to universality observed in aesthetic judgments. (As Kant admits in sect.17). Such a demand for universality could be accounted for nicely if we assumed an a priori principle for taste, which might also explain the idea of universal communicability. This argument, however, is rather weak. Kant believes he has an ingenious route to proving the case with much greater certainty.

Throughout the Four Moments of the Beautiful, Kant has dropped many important clues as to the transcendental account of the possibility of aesthetic judgment: in particular, we have talked about communicability, common sense and the harmony of the cognitive sub-faculties. Kant then cuts off to turn to the sublime, representing a different problem within aesthetic judgment. He returns to beauty in sect.30, which forms the transition to the passages tantalizingly called the Deduction. These transitional passages feel much like a continuation of the Four Moments; we will treat them as such here, since also Kant claims that the sublime does not need a Deduction.

The Deduction in fact appears in two versions in Kant’s texts (sect.9 and 21 being the first; sect.30-40 the second, with further important clarification in the ‘Dialectic’ sect.55-58). Here, we will discuss only the second. Both explicitly are attempting to demonstrate the universal communicability and thus intersubjective validity of judgments of taste. Which for Kant is the same as saying that there is a ‘common sense’ – by which he means that humans all must have a kind of sensing ability which operates the same way.

Briefly, the argument begins by asserting that aesthetic judgments must be judgments in some sense; that is, they are mental acts which bring a sensible particular under some universal (Kant’s Introduction, IV). The four moments of the beautiful are then explicitly seen as being limitations on the conditions under which this judgment can take place (no interest, purposive without determining purpose, etc.); all these Kant summarizes by saying that the judgments are formal only, lacking all ‘matter’. By this, he means that although the judgment is a judgment of the presentation of a particular (singular) object, no particular determination of either sensible intuition, or understanding forms a necessary part of the judgment. (In ordinary cognition of the world, this lack of restriction would be entirely out of place. It would be nonsense to judge whether a particular thing was a sofa without restricting my judgment to that particular thing, and to the concept of a sofa.) However, considered in general (that is, in their essence as sub-faculties) the faculties of imagination and understanding are likewise not restricted to any presentation or kind of sense, or any concept. This means that Kant is describing the ‘proportion’ between understanding and intuition as something like the always present possibility of the faculties being freed to mutually enact their essence.

Because such faculties in general are required for all theoretical cognition whatsoever, regardless of its object (as Kant claims to have proven in the first Critique), they can be assumed present a priori, in the same form and in the same way, in all human beings. The presence of the cognitive sub-faculties in their various relations is equivalent with the principle of the universal communicability and validity (i.e. common sense) of any mental states in which these faculties are involved a priori. Therefore, an aesthetic judgment must be seen to be an expression of this principle. The key move is obviously to claim that the aesthetic judgment rests upon the same unique conditions as ordinary cognition, and thus that the former must have the same universal communicability and validity as the latter. It is just that, presented with the beautiful, our cognitive faculties are released from the limitations that characterize ordinary thought, and produce what above we called a cascade of thoughts and feelings.

It is difficult to know what to make of this argument (with the various other versions of it scattered throughout the text) and the hypothesis it purports to prove. For one thing, Kant’s work here is so heavily reliant upon the results of the first Critique as to not really be able to stand on its own, while at the same time it is not clear at several points whether the first and third Critiques are fully compatible. For another, does not all this talk about the faculties ‘in general’ seem as if Kant is hypostatising these faculties, as really existent things in the mind that act, rather than simply as an expression for certain capacities? However, there is no doubting the fascinating and profound implications of what Kant is proposing. For example, the notions of common sense and communicability are closely akin to key political ideas, leading several commentators to propose that what Kant is really writing about are the foundations of any just politics (see e.g. sect.60). Or again, the ‘freedom’ of the imagination is explicitly linked by Kant to the freedom characteristic of the moral will, allowing Kant to construct a deeply rooted link between beauty and the moral (sect.59). Finally, of course, there is K

c. The Sublime

Overview: For Kant, the other basic type of aesthetic experience is the sublime. The sublime names experiences like violent storms or huge buildings which seem to overwhelm us; that is, we feel we ‘cannot get our head around them’. This is either mainly ‘mathematical’ – if our ability to intuit is overwhelmed by size (the huge building) – or ‘dynamical’ – if our ability to will or resist is overwhelmed by force (e.g. the storm). The problem for Kant here is that this experience seems to directly contradict the principle of the purposiveness of nature for our judgment. And yet, Kant notes, one would expect the feeling of being overwhelmed to also be accompanied by a feeling of fear or at least discomfort. Whereas, the sublime can be a pleasurable experience. All this raises the question of what is going on in the sublime

Kant’s solution is that, in fact, the storm or the building is not the real object of the sublime at all. Instead, what is properly sublime are ideas of reason: namely, the ideas of absolute totality or absolute freedom. However huge the building, we know it is puny compared to absolute totality; however powerful the storm, it is nothing compared to absolute freedom. The sublime feeling is therefore a kind of ‘rapid alternation’ between the fear of the overwhelming and the peculiar pleasure of seeing that overwhelming overwhelmed. Thus, it turns out that the sublime experience is purposive after all – that we can, in some way, ‘get our head around it’.

Since the ideas of reason (particularly freedom) are also important for Kant’s moral theory, there seems to be an interesting connection between the sublime and morality. This Kant discusses under the heading of ‘moral culture’, arguing for example that the whole sublime experience would not be possible if humans had not received a moral training that taught them to recognize the importance of their own faculty of reason.

Traditionally, the sublime has been the name for objects inspiring awe, because of the magnitude of their size/height/depth (e.g. the ocean, the pyramids of Cheops), force (a storm), or transcendence (our idea of God). Vis-à-vis the beautiful, the sublime presents some unique puzzles to Kant. Three in particular are of note. First, that while the beautiful is concerned with form, the sublime may even be (or even especially be) formless. Second, that while the beautiful indicates (at least for judgment) a purposiveness of nature that may have profound implications, the sublime appears to be ‘counter-purposive’. That is, the object appears ill-matched to, does ‘violence’ to, our faculties of sense and cognition. Finally, although from the above one might expect the sublime experience to be painful in some way, in fact the sublime does still involve pleasure – the question is ‘how?’.

Kant divides the sublime into the ‘mathematical’ (concerned with things that have a great magnitude in and of themselves) and the ‘dynamically’ (things that have a magnitude of force in relation to us, particularly our will). The mathematical sublime is defined as something ‘absolutely large‘ that is, ‘large beyond all comparison‘ (sect.25). Usually, we apply some kind of standard of comparison, although this need not be explicit (e.g. ‘Mt. Blanc is large’ usually means ‘compared with other mountains (or perhaps, with more familiar objects), Mt. Blanc is large’). The absolutely large, however, is not the result of a comparison

Now, of course, any object is measurable – even the size of the universe, no less a mountain on Earth. But Kant then argues that measurement not merely mathematical in nature (the counting of units), but fundamentally relies upon the ‘aesthetic’ (in the sense of ‘intuitive’ as used in the first Critique) grasp of a unit of measure. Dealing with a unit of measure, whether it be a millimeter or a kilometer, requires a number (how many units) but also a sense of what the unit is. This means that there will be absolute limits on properly aesthetic measurement because of the limitations of the finite, human faculties of sensibility. In the first place, there must be an absolute unit of measure, such that nothing larger could be ‘apprehended’; in the second place, there must be a limit to the number of such units that can be held together in the imagination and thus ‘comprehended’ (sect.26). An object that exceeds these limits (regardless of its mathematical size) will be presented as absolutely large – although of course it is still so with respect to our faculties of sense.

However, we must return to the second and third peculiar puzzles of the sublime. As we saw above with respect to the beautiful, pleasure lies in the achievement of a purpose, or at least in the recognition of a purposiveness. So, if the sublime presents itself as counter-purposive, why and how is pleasure associated with it? In other words, where is the purposiveness of the sublime experience? Kant writes,

[W]e express ourselves entirely incorrectly when we call this or that object of nature sublime … for how can we call something by a term of approval if we apprehend it as in itself contrapurposive? (sect.23)

This problem constitutes Kant’s principle argument that something else must be going on in the sublime experience other than the mere overwhelmingness of some object. As Kant will later claim, objects of sense (oceans, pyramids, etc.) are called ‘sublime’ only by a kind of covert sleight-of-hand, what he calls a ‘subreption’ (sect.27). In fact, what is actually sublime, Kant argues, are ideas of our own reason. The overwhelmingness of sensible objects leads the minds to these ideas.

Now, such presentations of reason are necessarily unexhibitable by sense. Moreover, the faculty of reason is not merely an inert source of such ideas, but characteristically demands that its ideas be presented. (This same demand is what creates all the dialectical problems that Kant analyses in, for example, the Antinomies.) Kant claims that the relation of the overwhelming sensible object to our sense is in a kind of ‘harmony’ (sect.27) or analogy to the relation of the rational idea of absolute totality to any sensible object or faculty. The sublime experience, then, is a two-layer process. First, a contrapurposive layer in which our faculties of sense fail to complete their task of presentation. Second, a strangely purposive layer in which this very failure constitutes a ‘negative exhibition’ (‘General Comment’ following sect.29) of the ideas of reason (which could not otherwise be presented). This ‘exhibition’ thus also provides a purposiveness of the natural object for the fulfillment of the demands of reason. Moreover, and importantly, it also provides a new and ‘higher’ purposiveness to the faculties of sense themselves which are now understood to be properly positioned with respect to our ‘supersensible vocation’ (sect.27) – i.e. in the ultimately moral hierarchy of the faculties. Beyond simply comprehending individual sensible things, our faculty of sensibility, we might say, now knows what it is for. We will return to this point shortly. The consequence of this purposiveness is exactly that ‘negative pleasure’ (sect.23) for which we had be searching. The initial displeasure of the ‘violence’ against our apparent sensible interests is now matched by a ‘higher’ pleasure arising from the strange purposiveness Kant has discovered. Interestingly, on Kant’s description, neither of these feelings wins out – instead, the sublime feeling consists of a unique ‘vibration’ or ‘rapid alternation’ of these feelings (sect.27).

The dynamically sublime is similar. In this case, a ‘might’ or power is observed in nature that is irresistible with respect to our bodily or sensible selves. Such an object is ‘fearful’ to be sure, but (because we remain disinterested) is not an object of fear. (Importantly, one of Kant’s examples here is religion: God is fearful but the righteous man is not afraid. This is the difference, he says, between a rational religion and mere superstition.) Again, the sublime is a two-layered experience. Kant writes that such objects ‘raise the soul’s fortitude above its usual middle range and allow us to discover in ourselves an ability to resist which is of a quite different kind…’ (sect.28). In particular, nature is called ‘sublime merely because it elevates the imagination to the exhibition of those cases wherein the mind can be made to feel [sich fühlbar machen] the sublimity, even above nature, that is proper to its vocation’ (sect.28, translation modified). In particular, the sublimity belongs to human freedom which is (by definition) unassailable to the forces of nature. Such a conception of freedom as being outside the order of nature, but demanding action upon that order, is the core of Kant’s moral theory. Thus we can begin to see the intimate connection between the sublime (especially here the dynamically sublime) and morality

This connection (for the sublime in general) becomes even more explicit in Kant’s discussion of what he calls ‘moral culture’. (sect.29) The context is to ask about the modality of judgments on the sublime – that is, to they have the same implicit demand on the necessary assent of others that judgments on the beautiful have? Kant’s answer is complicated. There is an empirical factor which is required for the sublime: the mind of the experiencer must be ‘receptive’ to rational ideas, and this can only happen in a culture that already understands morality as being a function of freedom or, more generally, conceives of human beings as having a dimension which in some way transcends nature. The sublime, properly speaking, is possible only for members of such a moral culture (and, Kant sometimes suggests, may reciprocally contribute to the strengthening of that culture). So, the sublime is subjected to an empirical contingency. However, Kant claims, we are justified in demanding from everyone that they necessarily have the transcendental conditions for such moral culture, and thus for the sublime, because these conditions are (as in the case of the beautiful) the same as for theoretical and practical thought in general. The claims about moral culture show that, for Kant, aesthetics in general is not an isolated problem for philosophy but intimately linked to metaphysical and moral questions. This is one more reason why it is important not to assume that the Critique of Aesthetic Judgment is a book merely about beauty and sublimity. Moreover, this ‘link’ has an even greater significance for Kant: it shows reflective judgment in action as it were relating together both theoretical and practical reason, for this was the grand problem he raised in his Introduction.

Kant’s treatment of the sublime raises many difficulties. For example, only the dynamically sublime has any strict relationship to the moral idea of freedom. This raises the question of whether the mathematical and dynamically sublime are in fact radically different, both in themselves as experiences, and in their relation to ‘moral culture’. Again, Kant gives an interesting account of how magnitude is estimated in discussing the mathematical sublime, but skips the parallel problem in the dynamically sublime (how does one estimate force?). Finally, many readers have found the premise of the whole discussion implausible: that in the sublime experience, what is properly sublime and the object of respect should be the idea of reason, rather than nature.

d. Fine Art and Genius

Overview: Thus far, Kant’s main focus for the discussion of beauty and the sublime has been nature. He now turns to fine art. Kant assumes that the cognition involved in judging fine art is similar to the cognition involved in judging natural beauty. Accordingly, the problem that is new to fine art is not how it is judged by a viewer, but how it is created. The solution revolves around two new concepts: the ‘genius’ and ‘aesthetic ideas’.

Kant argues that art can be tasteful (that is, agree with aesthetic judgment) and yet be ‘soulless’ – lacking that certain something that would make it more than just an artificial version of a beautiful natural object. What provides soul in fine art is an aesthetic idea. An aesthetic idea is a counterpart to a rational idea: where the latter is a concept that could never adequately be exhibited sensibly, the former is a set of sensible presentations to which no concept is adequate. An aesthetic idea, then, is as successful an attempt as possible to ‘exhibit’ the rational idea. It is the talent of genius to generate aesthetic ideas, but that is not all. First, the mode of expression must also be tasteful – for the understanding’s ‘lawfulness’ is the condition of the expression being in any sense universal and capable of being shared. The genius must also find a mode of expression which allows a viewer not just to ‘understand’ the work conceptually, but to reach something like the same excited yet harmonious state of mind that the genius had in creating

Starting in sect.43, Kant addresses himself particularly to fine art for the first time. The notion of aesthetic judgment already developed remains central. But unlike the investigation of beauty in nature, the focus shifts from the transcendental conditions for judgment of the beautiful object to the transcendental conditions of the making of fine art. In other words: how is it possible to make art? To solve this, Kant will introduce the notion of genius.

But that is not the only shift. Kant stands right in the middle of a complete historical change in the central focus of aesthetics. While formerly, philosophical aesthetics was largely content to take its primary examples of beauty and sublimity from nature, after Kant the focus is placed squarely on works of art. Now, in Kant, fine art seems to ‘borrow’ its beauty or sublimity from nature. Fine art is therefore a secondary concept. On the other hand, of course, in being judged aesthetically, nature is seen ‘as if’ purposeful, designed, or a product of an intelligence. So, in this case at least, the notion of ‘nature’ itself can be seen as secondary with respect to the notions of design or production, borrowed directly from art. Thus, the relation between nature and art is much more complex than it seems at first. Kant’s work thus forms an important part of the historical change mentioned above. Moreover, it is clear from a number of comments that Kant makes about ‘genius’ that he is an aesthetic conservative reacting against, for example, the emphasis on the individual, impassioned artist characteristic of the ‘Sturm und Drang‘ movement. But, historically, his discussion of the concept contributed to the escalation of the concept in the early 19th Century.

So, in order to understand how art is possible, we have to first understand what art is, and what art production is, vis-á-vis natural objects and natural ‘production’. First, then, what does Kant mean by ‘nature’? (1) On the one hand, in expressions like ‘the nature of X’ (e.g. ‘the nature of human cognition’), it means those properties which belong essentially to X. This can either be an empirical claim or, more commonly in Kant, a priori. On the other hand, nature as itself an object has several meanings for Kant. Especially: (2) If I say ‘nature as opposed to art’ I mean that realm of objects not presented as the objects of sensible will – that is, which are quite simply not made or influenced by human hands. (3) If I say ‘nature as an object of cognition’ I mean any object capable of being dealt with ‘objectively’ or ‘scientifically’. This includes things in space outside of us, but also aspects of sensible human nature that are the objects of sciences such as psychology. (4) Nature is also the object of reflective judgments and is that which is presupposed to be purposive or pre-adapted with respect to judgment.

Kant begins by giving a long clarification of art. As a general term, again, art refers to the activity of making according to a preceding notion. If I make a chair, I must know, in advance, what a chair is. We distinguish art from nature because (though we may judge nature purposive) we know in fact there is no prior notion behind the activity of a flower opening. The flower doesn’t have an idea of opening prior to opening – the flower doesn’t have a mind or a will to have or execute ideas with.

Art also means something different from science – as Kant says, it is a skill distinguished from a type of knowledge. Art involves some kind of practical ability, irreducible to determinate concepts, which is distinct from a mere comprehension of something. The latter can be fully taught; the former, although subject to training to be sure, relies upon native talent. (Thus, Kant will later claim, there can be no such thing as a scientific genius, because a scientific mind can never be radically original. See sect.46.) Further, art is distinguished from labor or craft – the latter being something satisfying only for the payoff which results and not for the mere activity of making itself. Art (not surprisingly, like beauty) is free from any interest in the existence of the product itself.

Arts are subdivided into mechanical and aesthetic. The former are those which, although not handicrafts, never-the-less are controlled by some definite concept of a purpose to be produced. The latter are those wherein the immediate object is merely pleasure itself. Finally, Kant distinguishes between agreeable and fine art. The former produces pleasure through sensation alone, the latter through various types of cognitions

This taxonomy of fine art defines more precisely the issue for Kant. What, then, ‘goes on’ in the mind of the artist? It is clearly not just a matter of applying good taste, otherwise all art critics would be artists, all musicians composers, and so forth. Equally, it is not a question of simply expressing oneself using whatever means come to hand, since such productions might well lack taste. We feel reasonably secure that we know how it is possible for, for example, clockmakers to make clocks, or glass-blowers to blow glass (which doesn’t mean that we can make clocks or blow glass, but that as a kind of activity, we understand it). We have also investigated how it is for someone looking at a work of beauty to judge it. But it is not yet clear how, on the side of production, fine art gets made.

Kant sums up the problem in two apparent paradoxes. The first of these is easy to state. Fine art is a type of purposeful production, because it is made; art in general is production according to a concept of an object. But fine art can have no concept adequate to its production, else any judgment on it will fail one of the key features of all aesthetic judgments: namely purposiveness without a purpose. Fine art therefore must both be, and not be, an art in general.

To introduce the second paradox, Kant notices that we have a problem with the overwrought – that which draws attention to itself as precisely an artificial object or event. ‘Over-the-top’ acting is a good example. Kant expresses this point by saying that, in viewing a work of art we must be aware of it as art, but it must never-the-less appear natural. Where ‘natural’ here stands for the appearance of freedom from conventional rules of artifice; this concept is derived from the second sense of ‘nature’ given above. The paradox is that art (the non-natural) must appear to be natural.

Kant must overcome these paradoxes and explain how fine art can be produced at all. In sect.46, the first step is taken when Kant, in initially defining ‘genius’, conflates ‘nature’ in the first sense above with nature in the third sense. He writes,

Genius is the talent (natural endowment) that gives the rule to art. Since talent is an innate productive ability of the artist and as such belongs itself to nature, we could also put it this way: Genius is the innate mental predisposition (ingenium) through which nature gives the rule to art. (sect.46)

In other words, that which makes it possible to produce (fine art) is not itself produced – not by the individual genius, nor (we should add) through his or her culture, history, education, etc. From the definition of genius as that talent through which nature gives the rule to art follows (arguably!) the following key propositions. First, fine art is produced by individual humans, but not as contingent individuals. That is, not by human nature in the empirically known sense. Second, fine art as aesthetic (just like nature as aesthetic) can have no definite rules or concepts for producing or judging it. But genius supplies a rule, fully applicable only in the one, concrete instance, precisely by way of the universal structures of the genius’ mental abilities (which again, is ‘natural’ in sense one).

Third, the rule supplied by genius is more a rule governing what to produce, rather than how. Thus, while all fine art is a beautiful ‘presentation’ of an object (sect.48), this partly obscures the fact that genius is involved in the original creation of the object to be presented. The ‘how’ is usually heavily informed by training and technique, and is governed by taste. Taste, Kant claims, is an evaluative faculty, not a productive one (sect.48). Thus, the end of sect.47, he will distinguish between supplying ‘material’ and elaborating the ‘form’. Fourth, because of this, originality is a characteristic of genius. This means also that fine art properly is never an imitation of previous art, though it may ‘follow’ or be ‘inspired by’ previous art (sect.47). Fifth, as we mentioned above, fine art must have the ‘look of nature’ (sect.45). This is because the rule of its production (that concept or set of concepts of an object and of the ‘how’ of its production which allows the genius to actually make some specific something) is radically original. Thus, fine art is ‘natural’ in sense two, in that it lies outside the cycle of production and re-production within which all other arts in general are caught up (and thus, again, cannot be imitated). This leads Kant to make some suggestive, but never fully worked out, comments about artistic influences and schools, the role of culture, of technique and education, etc. (See e.g. sect.49-50)

Having made the various distinctions between the matter and the form of expression in genius’ work, or again between the object and its presentation, Kant applies these to a brief if eccentric comparative study of the varieties of fine art (sect.51-53). According to the manner of presentation, he divides all fine arts into the arts of speech (especially poetry, which Kant ranks the highest of the arts), the arts of visual form (sculpture, architecture and painting), and the arts involving a play of sensible tones (music). The last pages of this part of Kant’s book are taken up with a curious collection of comments on the ‘gratifying’ (non-aesthetic but still relatively free activities), especially humor.

However, we have not yet clarified what kind of thing the ‘rule’ supplied by genius is; therefore we have not yet reached an understanding of the nature of the ‘talent’ for the production of fine art that is genius.

Genius provides the matter for fine art, taste provides the form. The beautiful is always formal, as we have already discovered. So, what distinguishes one ‘matter’ from another, such that genius might be required? What genius does, Kant says, is to provide ‘soul’ or ‘spirit’ (‘Seele‘, sect.49) to what would otherwise be uninspired. This peculiar idea seems to be used in a sense analogous to saying that someone ‘has soul’, meaning to have nobility or a deep and exemplary moral character, as opposed to being shallow or even in a sense animal-like; but Kant also, following the Aristotelian tradition, means that which makes something alive rather than mere material. There can be an uninspired fine art, but it is not very interesting (pure beauty, mentioned above, may be an example). There can also, Kant warns, be inspired nonsense, which is also not very interesting. Genius inspires art works – gives them spirit – and does so by linking the work of art to what Kant will call aesthetic ideas.

This is defined in the third paragraph of sect.49. The aesthetic idea is a presentation of the imagination to which no thought is adequate. This is a ‘counterpart’ to rational ideas (which we encountered above in talking of the sublime), which are thoughts to which nothing sensible or imagined can be adequate. Each is excessive, we might say, but on different sides of our cognitive apparatus. Aesthetic ideas are seen to be ‘straining’ after the presentation of rational ideas – this is what gives them their excess over any set of ordinary determinate concepts.

In the judgment of the beautiful, we had a harmony between the imagination and the understanding, such that each furthered the extension of the other. Kant is now saying: certainly that is true for all judgments of taste, whether of natural or artificial objects. And yet we can distinguish between such a harmony which happens on the experiencing of a beautiful form simply, or a harmony which happens on the experiencing of a beautiful form that itself is the expression of something yet higher but that cannot in any other way be expressed. (The notion of ‘expression’ is important: what Kant is describing is an aesthetic process, rather than a process of understanding something with concepts, and then communicating that understanding.) Inspired fine art is beautiful, but in addition is an expression of the state of mind which is generated by an aesthetic idea.

The relevant passages in sect.49 are both confused and compressed. Kant seems to have two different manners in which aesthetic ideas can be the spirit of fine art. First, the aesthetic idea is a presentation of a rational idea (one of Kant’s examples is the moral idea of cosmopolitan benevolence). Of course, we know that there is no such adequate presentation. An obvious example might be a novelist or playwright’s attempt to portray a morally upright character: because, for Kant, an important part of our moral being transcends the world of phenomena, there must always be a mis-match between the idea and the portrayal of the character. Here the aesthetic idea seems to function by prompting an associated or coordinated surplus of thought that is directly analogous to the associated surplus of imaginative presentations demanded by rational ideas. (We saw a similar relation between the demand of rational ideas and imaginative activity in Kant’s analysis of the sublime. Indeed, arguably there is an analogy here to the concept of ‘negative exhibition’.) In practice, this will often involve what Kant calls ‘aesthetic attributes’: more ordinary, intermediate images: ‘Thus Jupiter’s eagle with the lightning in its claws is an attribute of the mighty king of heaven’.

Second, the aesthetic idea can be an impossibly perfect or complete presentation of a possible empirical experience and its concept (death, envy, love, fame are Kant’s examples). Here the aesthetic idea is not presenting a particular rational idea so much as a general function of reason: the striving for a maximum, a totality or the end of a series (as in Kant’s account of the mathematical sublime). And again, the effect is an associated ‘expansion’ of the concept beyond its determinate bounds. In either case, the aesthetic idea is not merely a presentation, but one which will set the imagination and understanding into a harmony, creating the same kind of self-sustaining and self-contained feeling of pleasure as the beautiful.

Kant’s theory of genius – for all its vagueness and lack of philosophical rigor – has been enormously influential. In particular, the radical separation of the aesthetic genius from the scientific mind; the emphasis on the near-miraculous expression (through aesthetic ideas and attributes) of the ineffable, excited state of mind; the link of fine art to a ‘metaphysical’ content; the requirement of radical originality; the raising of poetry to the head of all arts – all these claims (though not all of them entirely unique to Kant) were commonplaces and wide-spread for well over a century after Kant. Indeed, when modernists protested (often paradoxically) against the concept of the artist by using ‘automatic writing’ or ‘found objects’ it is, for the most part, this concept of the artist-genius that they are reacting against.

e. Idealism, Morality and the Supersensible

Overview: Let us return to the notion of beauty as tackled in sections A1 and A2. Viewed from the position of our knowledge of nature, the supposed purposiveness of nature looks like nonsense. Not only does our scientific knowledge seem to have no room for the concept of a purpose, but many and perhaps all beautiful natural objects can be accounted for on purely scientific terms. Thus, any principle of purposiveness can only be understood as ideal. That is, such a principle says more about the particular nature of our cognitive faculties than it says about what nature really is.

But the principle of purposiveness is still valid from the point of view of the activities of judgment. This in turn means that, for judgment, the question is valid as to how this natural purposiveness is to be explained. The only possible account is that the appearance of purposiveness in nature is conditioned by the supersensible realm underlying nature. But this means that beauty is a kind of revelation of the hidden substrate of the world, and that this substrate has a necessary sympathy with our highest human projects. To this, Kant adds a series of important analogies between the activity of aesthetic judgment and the activity of moral judgment. These analyses lead Kant to claim that beauty is the ‘symbol of morality’.

Above, at the end of section A1, we saw Kant claim that his whole account of the transcendental possibility of judgments on the beautiful could be summed up in the notion of common sense. This principle of common sense is the form that the general a priori principle of the purposiveness of nature for judgment takes when we are trying to understand the subjective conditions of aesthetic judgments of beauty. That is, where the principle is taken as a rule governing the conditions of aesthetic judgments in the subject, then it is properly called ‘common sense’. But where the principle is taken to be functioning like a concept of an object (the beautiful thing), then it is to be seen as the principle of the purposiveness of all nature for our judgment (see sect.55-58). But nature, understood scientifically, is not purposive. This strange situation gives rise to what Kant calls a ‘dialectic’ – merely apparent knowledge claims or paradoxes that arise from the misuse of a faculty. Just as in the ‘dialectic’ sections in the first two Critiques (see the entry on ‘Kant’s Metaphysics’), he Kant solves the problem by way of an appeal to the rational idea of the supersensible. Dialectical problems, for Kant, always involves a confusion between the rational ideas of the supersensible (which have at best a merely regulative validity) and natural concepts (which have a validity guaranteed but restricted to appearances). This particular form of dialectical problem involves two contradictory, but apparently necessary, truth claims – Kant calls such a situation an ‘antinomy’. (See Introduction 2 above, and the entry on ‘Kant’s Metaphysics’.) A similar dialectical problem will arise in the ‘Critique of Teleological Judgment’ where we will resume our discussion of these issues. For the moment it is enough to observe that the Antinomy of Taste seems to involve two contradictory claims about the origin of beautiful objects.

However, it could be the case that nature as the object of scientific laws (‘nature’, as Kant is fond of saying, according to the ‘immanent’ principles of the understanding), is itself responsible for the beautiful forms in nature (Kant’s example is the formation of beautiful crystals, understood perfectly through the science of chemistry). This possibility demonstrates the idealism of the principle of purposiveness. Kant thus writes, ‘we … receive nature with favor, [it is] not nature that favors us’ (sect.58).

He writes,

Just as we must assume that objects of sense as appearances are ideal if we are to explain how we can determine their forms a priori, so we must presuppose an idealistic interpretation of purposiveness in judging the beautiful in nature and in art… (sect.58)

But at the same time, this idealism also necessarily raises the question of what conditions beautiful appearances: if we are asking for a concept that accounts (on the side of the ideal object) for this purposiveness, it must be what Kant calls the realm of the ‘supersensible’ that is ‘underlying’ all nature and all humanity. As we know, no other concept (e.g. a natural concept) is adequate to grasping the beautiful object as beautiful. So, in forming an aesthetic judgment, which judges a beautiful object as purposive without purpose, we must assume the legitimacy of the rational concept of an underlying supersensible realm in order to account for that purposiveness. This assumption is valid only within and only for that judgment, and thus is certainly not a matter of knowledge. Thus, Kant can borrow the notion of aesthetic idea from his account of fine art and, speaking from the point of view of reflective judgment, say that beauty in general is always the expression of aesthetic ideas (sect.51). From the point of view of judgment, everything happens as if the unfolding beauty of the natural world is like the product of a genius. This piques the interest of reason – for judgment has, as it were, found phenomenal evidence of the reality of reason’s more far-reaching claims about the supersensible (see B3 below). The profundity of beauty, for Kant, consists of precisely this assumption by judgment; it allows him to make further connections between beauty and morality, and (as we shall see) ultimately to suggest the unity of all the disciplines of philosophy.

The last major section of the Critique of Aesthetic Judgment famously considers the relation between beauty and morality, which recalls the earlier treatment of the sublime and moral culture. Here, Kant claims that beauty is the ‘symbol’ of morality (sect.59). A symbol, he argues, is to be defined as a kind of presentation of a rational idea in an intuition. The ‘presentation’ in question is an analogy between how judgment deals with or reflects upon the idea and upon the symbolic intuition. Thus, if ‘justice’ is symbolized by a blind goddess with a scale, it is not because all judges are blind! Rather, ‘blindness’ and ‘weighing’ function as concepts in judgments in a way analogous to how the concept of ‘justice’ functions. In showing how beauty in general is the symbol of morality, Kant lists four points: (1) Both please directly and not through consequences; (2) Both are disinterested; (3) Both involve the idea of a free conformity to law (free conformity of the imagination in the case of beauty, of the will in the case of morality); (4) Both are understood to be founded upon a universal principle. The importance of this section is two-fold: first, historically, Kant is giving a philosophical underpinning to the notion that taste should be related to and, through cultivation, also promotes morality. This is a claim that is often rolled out even today. Second, the link to morality is a detailing out of the basic link between aesthetics in general and the pure concepts of reason (ideas). First aesthetic judgments (both the sublime and the beautiful), and then teleological judgments will form the bridge between theoretical and practical reason, and (Kant hopes) bring unity to philosophy. We shall return to this in section B4.

3. Kant’s Teleology

a. Objective Purposiveness and Science

Overview: The second part of Kant’s book deals with a special form of judgment called ‘teleological judgment’. The word ‘teleology’ comes from the Greek word ‘telos’ meaning end or purpose. A teleological judgment, on Kant’s account, is a judgment concerning an object the possibility of which can only be grasped from the point of view of its purpose. The purpose in question Kant calls an ‘intrinsic purpose’. In such a case, we have to say that, strictly speaking, the object was not made according to a purpose that is different from the object (as the idea of vegetable soup in the mind of the cook is different from the soup itself), but that the object itself embodies its purpose. Kant is talking mainly about living organisms (which he calls ‘natural purposes’), which are both cause and effect, both blueprint and product, of themselves. The problem here is that such a notion is paradoxical for human thought in general, and certainly incompatible with scientific thought.

This raises two issues. First, the paradoxical nature of any concept of a natural purpose means that our minds necessarily supplement judgment with the concept of causation through purposes – i.e. the concept of art, broadly speaking. In other words, for lack of any more adequate resources, we think natural purposes on an analogy with the production of man-made objects according to their purpose. Second, just as with aesthetic judgments, Kant does not claim that such judgments ever achieve knowledge. Kant argues that teleological judgments are required, even in science – but not to explain organisms, rather simply to recognize their existence, such that biological science can then set about trying to understanding them on its own terms.

The word ‘teleology’ comes from the Greek word ‘telos’ meaning end or purpose. A teleological judgment, on Kant’s account, is a judgment concerning an object the possibility of which can only be grasped from the point of view of its purpose.

The second half of Kant’s book (the ‘Critique of Teleological Judgment’) is much less often studied and referred to. This is of course related to the fact that Kant’s aesthetics has been hugely influential, while his teleology has sparked less contemporary interest; and also the fact that, in the Introduction to the whole text, Kant writes that ‘In a critique of judgment, [only] the part that deals with aesthetic judgment belongs to it essentially.’ (Introduction VIII). This is because, as we saw above, in aesthetic judgment the faculty of judgment is, as it were, on its own – although certainly the action of judgment there has implications for our faculty of reason. In teleological judgment, on the other hand, the action of judgment – although still reflective – is much more closely linked to ordinary theoretical cognition of nature. Judgment in its teleological function is not, let us say, laid bare in its purity. However, it would be wrong to ignore the ‘Critique of Teleological Judgment’ either on the grounds of its lesser influence, or especially on the assumption that its content is intrinsically less interesting.

The main difference between aesthetic and teleological judgments is the ‘reality’ of the purpose for the object. Whereas the object of aesthetic judgment was purposive without a purpose, the objects of teleological judgment do have purposes for which a concept or idea is to hand. There are, Kant claims, two types of real purposes: first, an ‘extrinsic purpose’ which is the role a thing may play in being a means to some end. An example would be an object of art in the general sense: a shoe for example, or a landscaped garden – something that was made for a purpose, and where the purpose is the reason behind it being made.

However, just as in the critique of aesthetic judgment, such ordinary examples are not (apparently) troubling and are thus not what Kant has in mind. So, Kant notes that there is a second type of real purpose, an ‘intrinsic purpose’. In this case, rather than the purpose being primarily understood as ‘behind’ the production of a thing, a thing embodies its own purpose. These are what Kant calls ‘natural purposes’ (also translated as ‘physical ends’), and the key examples are living organisms (sect.65).

Such an organism is made up of parts – individual organs, and below that, individual cells. These parts, however, are ‘organized’ – they are determined to be the parts that they are – according to the form or ‘purpose’ which is the whole creature. The parts reciprocally produce and are produced by the form of the whole. Nor is the idea of the whole separate to the organism and its cause (for then the creature would be an art product.) A mechanical clock may be made up or organized parts, but this organization is not the clock itself, but rather the concept of the clock in the mind of the craftsperson who made it. The organism is such insofar as it intrinsically and continually produces itself; the clock is not an organism because it has to be made according to a concept of it.

But how does this principle relate to the sciences of nature? Such an account of organisms as teleological is not original to Kant. It extends back to Aristotle, and, despite increasing hostility to Aristotle’s physics since the Renaissance, remained a commonplace in European biology through the 18th century and beyond. Kant is very careful to distinguish himself from the rationalist position which, he claims, takes teleology as a constitutive principle – that is, as a principle of scientific knowledge. Importantly, Kant claims that such a teleological causation is utterly alien to natural causation as our understanding is able to conceive it. However, since natural mechanical causal connections are necessary, this means that a physical end has to be understood to be contingent with respect to such ‘mechanical’ natural laws. Reason, however, always demands necessity in its objects (the principle of reason here is akin to Leibniz’s notion of the principle of sufficient reason; see entry on Leibniz’s Metaphysics). Accordingly, reason provides the idea of causation according to ends (on the analogy of art being the product of a will). As we know, however, a purely rational concept has no constitutive validity with respect to objects of experience. Instead, Kant claims, teleological judgment is merely reflective, and its principle merely regulative. The teleological judgment gives no knowledge, in other words, but simply allows the cognitive faculty to recognize a certain class of empirical objects (living organisms) that then might be subjected (so far as that is possible) to further, empirical, study. In effect, Kant is saying that, were it not for the reflective judgment and the principle of its functioning here (the rational idea of an ‘intrinsic’ end or purpose), the ability to experience something as alive (and thus subsequently to study it as the science of biology) would be impossible. Ordinary scientific judgments will be unable to fully explore and explain certain biological phenomena, and thus teleological judgments have a limited scientific role.

Such judgments only apply (with the above mentioned constraints) to individual things on the basis of their inner structure, and are not an attempt to account for their existence per se. Nevertheless, even this suggests to reason by analogy the idea of the whole of nature as a purposive system, which could only be explained if based upon some supersensible foundation – although it is hardly necessary in every instance to take the investigation so far (sect.85). In fact, the whole of nature is not given to us in this way, Kant admits, and therefore this extended idea is not as essential to science as the narrower one of natural purposes (sect.75). Nevertheless, the idea may be useful in discovering phenomena and laws in nature that might not have been recognized on a mechanical understanding alone. (Recent ecological thought, for example, has often tended to think of whole eco-systems as if they were in themselves organisms, and whole species of plants and animals (as well as the physical environment they inhabit) are their ‘organs’. Such an approach may be fruitful for understanding the inter-connectedness of the system, but also may be dangerous if taken too far – when it begins to see as necessary what in fact has to be considered as contingent.)

Thus Kant believes he has discovered a role, albeit a limited one, for teleological judgments within natural science. In fact, of course, the whole conception of biological science was moving away from such notions, first with the theory of evolution, and subsequently with the idea of genetics. Nevertheless, there is something fascinating about Kant’s conception of a natural purpose, which seems to capture something of the continuing scientific and philosophical difficulties in understanding what ‘life’ in general is.

b. ‘The Peculiarity of the Human Understanding’

Overview: Why is it the case that a proper concept of a natural purpose is impossible for us, and has to be supplemented with the concept of production according to a separate purpose? It is because of a fundamental ‘peculiarity’ of the human understanding, according to Kant. Our minds he describes as ‘intellectus ectypus’, cognition only by way of ‘images’. That is why it is impossible for us to understand something that is at the same time object and purpose. Kant then claims that this characterization of the human intellect raises the possibility of another form of intellect, the ‘intellectus archetypus‘, or cognition directly through the original. In such a case, there would be no distinction between perceiving a thing, understanding a thing, and the thing existing. This is as close as our finite minds can get to understanding the mind of God.

However, in dealing with the limited role discussed above, there is an implicit danger. If reason does not pay sufficient critical attention to the reflection involved the result is an antinomy (sect.70) between the basic scientific principle of the understanding – to seek to treat everything as necessary in being subject to natural laws – and the teleological principle – that there are some objects that are cannot be treated according to these laws, and are thus radically contingent with respect to them. Kant’s basic solution to this antinomy is given immediately (sect.71): the problem is simply that reason has forgotten that the second of these principles is not constitutive of its object – that is, does not account of the object’s existence. There could only be an antinomy if both principles were understood to be so constitutive. Kant, however, continues for several sections the discussion of the antinomy and its solution, in the end proposing a remarkable new solution.

In sect.77, Kant is at pains to point out that the teleological, reflective judgment is a necessity for human minds because of a peculiarity of such minds. (This discussion recalls the treatment of idealism in the ‘Critique of Aesthetic Judgment’ above.) In our understanding of the world (and for any other understanding we could imagine the workings of), the universal principle (law of nature) never fully determines any particular thing in all its real detail. Thus these details, although necessary in themselves as part of the order of nature, must be contingent with respect to our universal concept. It is simply beyond our understanding that there should be a concept that, in itself, determines as necessary all the features of any particular thing. (At this point, Kant is clearly influenced by Leibniz’s idea of the ‘complete concept’ – please see the entry on Leibniz’s Metaphysics.) As Kant explains it, an object so understood would be a whole that conditions all its parts.

But a living organism would be just such a whole. As we have seen, to understand its possibility we have to apply (through reflective judgment) the rational idea of an intrinsic purpose. Here, as we have just seen, the problem of the contingency with respect to natural law is exacerbated. But this idea is of a presentation of such a whole, and the presentation is conceived of as a purpose which conditions or leads to the production of the parts. Ours, in other words, is an understanding which always ‘requires images (it is an intellectus ectypus)’ (sect.77).

This peculiarity of our understanding poses the possibility of another form of intelligence, the intellectus archetypus, an intelligence which is not limited to this detour of presentations in its thinking and acting. Such an understanding would not function in a world of appearances, but directly in the world of things-in-themselves. Its power of giving the universal (concepts and ideas) would not be a separate power from its power of forming intuitions of particular things; concept and thing, thought and reality would be one. From the point of view of such an understanding, what we humans must conceive as the contingency of natural purposes with respect to the universal concept, is only an appearance. For the intellectus archetypus, such natural purposes would indeed be necessary, in the same sense as events subject to mechanical natural law. Thus, the notion of an intellectus archetypus – and the corresponding distinction for us between appearances and things-in-themselves – gives Kant a more complete way of solving the above antinomy. Because of the limitation of our understanding, we are incapable of knowing the details of the necessity of all natural processes. The idea of a natural purpose is an essential additional principle which partly corrects for this limitation, but also produces the antinomy. But the contingency introduced by the new principle is (or, rather, may be) only a contingency for us (as intellectus ectypus), and therefore the principle of natural purposes does not contradict the demand of reason for necessity.

Such an idea clearly takes us in the direction of theology – the study of the divine being, and that being’s relation to creation. But it is above all important to remember that, at this point, Kant is not claiming that there is, or must be, or that he can prove there to be, such a being. Thus, for example, given Kant’s concern with purposiveness and design, one might think he would make a case for the so-called ‘argument from design’ (the argument to the existence of a creator from the apparently designed quality of creation). But, in fact, Kant believes this to be an extraordinarily weak argument (see for example sect.sect.85, 90 and ‘General Comment on Teleology’), though interesting. Kant, however, thinks he has an argument which is related to it, and which (within certain limits) works much better. It is this argument which occupies most of the second half of the ‘Critique of Teleological Judgment’.

c. The Final Purpose and Kant’s Moral Argument for the Existence of God

Overview: The notion of the intellectus archetypus is clearly heading in the direction of philosophical theology. Kant’s book culminates with his most sustained presentation and discussion of his Moral Proof for the Existence of God.

Kant’s work already included some very famous critiques of other such proofs. In the Critique of Pure Reason, he provides some of the standard attacks on the cosmological and especially the ontological arguments. And in the Critique of Judgment, he argues that the argument from design, at least as normally stated, is very weak. Kant’s own proof, he thinks, avoids the problems typical of other arguments, precisely because it does not conclude by stating that we know the existence of God. This is because Kant is quite happy with the idea that God’s existence could never be necessary for theoretical reason. But he then asks whether practical reason – i.e. the moral side of our intellect – has the same limitation.

In Kant’s account of practical reason, the moral law is conceived of as duty. Acting from the mere pure and universal form of the moral law is everything, the consequences of action do not enter into the equation (see entry on ‘Kant’s Metaphysics’). However, Kant claims that the moral law obligates us to consider the final purpose or aim of all moral action. This final purpose of moral action Kant calls the ‘highest good’ (summum bonum). This means the greatest possible happiness for all moral beings. Importantly, this goal is not the ground of morality – unlike ordinary instances of desire or action, wherein I act precisely because I want to reach the goal. Moral action is grounded in duty – but, subsequently, so to speak, we must be assured that the final purpose is actually possible.

Just as moral action must be possible through freedom, so the summum bonum must be possible through moral action. But the possibility of the summum bonum as the final purpose in nature appears to be questionable. Therefore, if our moral action is to make sense, there must be someone working behind the scenes. This could only be activity of a ‘moral author of the world’ which would make it at least possible for the summum bonum to be reached. Moral action, therefore, assumes the existence of a God. But that the postulation of God lies ‘within’ moral action in this way automatically discounts the ‘moral proof’ from any theoretical validity.

After an extended discussion of the ins and outs of the role of teleological judgments in science, from sect.78 to around sect.82, Kant’s discussion begins to shift to a quite different topic. In sect.82 he argues in this way: it might seem, he says, that certain features of nature have as an extrinsic purpose their relations to other features: the nectar for the honey, the river for the irrigation of land near its bank, etc. (Ultimately, again, these might be seen as part of the intention or design of the intelligent cause of creation.) This, Kant says, is a perfectly understandable way of speaking sometimes, and even helps us to cognize certain natural processes, but has no objective foundation in science. There is always another way of looking at things for which what we thought was a purpose is in fact only a means to something else entirely (e.g. the nectar is simply a way of attracting bees for the purposes of pollination).

It is sometimes even claimed (often on a religious basis) that human beings are the real, ‘ultimate’ purpose of nature, and all other things have, in the end, the benefit and use of humans as an extrinsic end. But ‘in the chain of purposes man is never more than a link’ (sect.83). Nature per se does not, then, contain or pursue any such purposes, not even for man. But Kant is not quite yet finished with these kinds of problems, and introduces in sect.84 the idea of a ‘final purpose’.
Kant defines a ‘final purpose’ as ‘a purpose that requires no other purpose as a condition of its possibility’ (sect.84). This is no longer an extrinsic purpose that nature might have. Still, it is clear that, again, there can be no intrinsic final purpose in nature -all natural products and events are conditioned, including the world around us, our own bodies and even our mental life. (And living beings, qua natural purposes, are conditioned by themselves.) So, what kind of thing would such a final purpose be? Kant writes, ‘… the final purpose of an intelligent cause must be of such a kind that in the order of purposes it depends upon no condition other than just the idea of it’ (sect.84).

As we have discovered on several previous occasions, for Kant human beings are not merely natural beings. The human capacity for freedom is both a cause which acts according to purposes (the moral law) represented as necessary, and yet which has to be thought as independent of the chain of natural causation/purposes. Kant then writes, carefully, ‘… if things in the world … require a supreme cause that acts in terms of purposes, then man [qua free] is the final purpose of creation’ (sect.84). (As Kant emphasizes on several occasions – e.g. in the last part of sect.91 – it is the fact of freedom that forms the incontrovertible first premise of the argument he is about to put forward.) Put more grandly, ‘without man [as a moral being] all of creation would be a mere wasteland, gratuitous and without a final purpose’ (sect.86). Thus, the question that really ‘matters’, Kant writes, ‘is whether we do have a basis, sufficient for reason (whether speculative or practical), for attributing a final purpose to the supreme cause [in its] acting in terms of purposes’ (sect.86). Certainly, the argument will not involve a ‘speculatively’ (i.e. theoretically) sufficient basis.

Kant’s ‘moral proof for the existence of God’ is given beginning in sect.87. Actually, this proof first appeared in the Critique of Practical Reason a few years previously (see entry on Kant’s Metaphysics), and is in fact assumed through the Critique of Pure Reason. But Kant’s most detailed discussion is in the third Critique.

The rational idea of purposiveness, although never constitutive, seems to be relevant everywhere so far: in Kant’s account of the possibility of science in his Introduction, in the account of beauty (and in a different way in the sublime), and in the treatment of teleological judgments. Because these are one and all reflective judgments, they entail neither a theoretical nor a practical conclusion as to what might be behind these purposes. Even where teleological judgments about purposes in nature leads us to consider the possibility of a world author, this approach leaves quite indeterminate (and thus useless for the purposes of religion or theology) our idea of that world author (thus Kant’s ultimate criticism of what he calls ‘physicotheology’ in sect.85). But, Kant asks, is there any reason requiring us to assume nature is purposive with respect to practical reason?

In Kant’s account of practical reason, the moral law is conceived of as duty. Acting from the mere pure and universal form of the moral law is everything, the consequences of action do not enter into the equation. However, as Kant makes clear in the Introduction to the Critique of Judgment, the practical faculties in general have to do with desire – i.e. purposes motivating action – and the free will is termed the ‘higher’ faculty of desire. Kant claims that the moral law necessarily obligates us to consider the final purpose of moral action. However, it is not to be considered as the ground of morality, as would normally be the case in desire, when the presentation of the result (my aim) causes the action (action leading to that aim). This final purpose linked to the higher, moral, faculty of desire Kant calls the ‘highest good’ (summum bonum). Conceived of as a state of natural beings, this means the greatest possible happiness for all moral beings.

Kant is using this inter-implication of moral law and final purpose of moral action as a premise of his argument. The obvious question that arises is why, given the stress Kant always makes on the absolutely unconditioned nature of moral freedom, he should feel able to make this claim. It would seem as if precisely the purity of the free will would make any connection to purposes immoral. Kant writes that, even speaking practically, we must consider ourselves

… as beings of the world and hence as beings connected with other things in the world; and those same moral laws enjoin us to direct our judging to those other things [regarded] either as purposes or as objects for which we ourselves are the final purpose (sect.87).

In other words, practical reason is a human faculty – where, as always for Kant, being human is defined in terms of a unity of a lower, sensible nature together with a higher, supersensible dimension. Our sensibly conditioned will is not a different thing from our free will, but is the same faculty considered now as phenomenal psychology, now as noumenal activity. This must be the case if our actions in the phenomenal world are to be considered moral in any sense of the word. But this sensibly conditioned will does require attention to be paid to consequences – to the object of our action. Free will may determine itself unconditionally through the mere form of the moral law, but it remains the faculty of will, that is the higher faculty of desire, and thus retains the essential link to purposes.

Just as moral action must be possible through freedom, so the summum bonum must be possible through moral action. The impossibility of achieving this end would make a nonsense of moral action, because it would in effect mean that free will was no longer will, that practical reason was no longer practical (because it could not be said to act). Kant is claiming that it is just part of the meaning of an action – even a purely and formally determined action, i.e. one not conditioned by its purpose – to also posit the possibility of achieving its purpose.

But the possibility of the summum bonum as the final purpose in nature is not at all obvious. Indeed, a cynic might claim that moral action makes no difference at all – that the good man is no more happy for it, and that ‘nice guys finish last’. Kant writes,

.. the concept of the practical necessity of [achieving] such a purpose by applying our forces does not harmonize with the theoretical concept of the physical possibility its being achieved, if the causality of nature is the only causality (of a means [for achieving it]) that we connect with our freedom. (sect.87)

The obvious inference then is that the ‘causality of nature’ cannot be the ‘only causality’ – and there must also be the moral causality of a moral author of the world which would make it at least possible for the summum bonum to be reached. Without the postulate of such a moral author – who, as we saw above, must have our free morality in mind as a final purpose, if anything – our free moral action could not be represented as possible. Moral action, precisely as both moral and as action, within itself assumes the existence of a God. Of course, in acting morally we may not be conscious either of the summum bonum as final purpose, nor of the necessary postulation of God as moral author of the world – we are just doing what is right. Nevertheless, when that duty is fully understood, these necessary implications will be found within it.

But that the postulation of God is ‘within’ moral action in this way automatically discounts the ‘moral proof’ from any theoretical validity. Theoretical philosophy must continue to operate within its legitimate grounds, treating so far as possible all of nature as intelligible in terms of mechanical cause and effect and requiring neither purpose nor creator. This distinction is extremely important for Kant, as despite the link to morality and the ‘fact’ of our freedom, the ‘moral proof’ does not make of religion anything but a matter of faith (e.g. sect.91). This involves noting that the conception of God involved in the moral proof is and must be bound up with how things are cognizable by us. (This of course continues the treatment of the intellectus ectypus begun in sect.77 and of the idealism of reflective judgment in sect.58.) Kant writes, As for objects that we have to think a priori (either as consequences or as grounds) in reference to our practical use of reason in conformity with duty, but that are transcendent for the theoretical use of reason: they are mere matters of faith. […] To have faith … is to have confidence that we shall reach an aim that we have a duty to further, without our having insight into whether achieving it is possible. (sect.91)

The summum bonum, God as moral author (and the immortality of the soul, treated in the Critique of Practical Reason) are all such objects of faith. For Kant, this stress on faith keeps religion pure of the misunderstandings involved in, for example, fanaticism, demonology or idolatry (sect.89). Kant spends the last fifth of the ‘Critique of Teleological Judgment’ dealing with how his proof is to be understood, the nature and limitations of its validity, and various metaphysical and religious implications, including those for his own conception of critical philosophy.

Kant’s argument and later variations are generally considered to be one of the great arguments for the existence of a God. Obviously, questions can be raised about its validity. For example, whether the possibility of the final purpose is somehow necessarily linked to any moral action. However, the typical objection – that the argument is insufficient to give any knowledge – is just irrelevant, since Kant is not interested in knowledge at this point.

4. The Problem of the Unity of Philosophy and its Supersensible Objects

Overview: Let us conclude by looking at Kant’s grand conception for his Critique of Judgment.

The problem of the unity of philosophy is the problem of how thought oriented towards knowledge (theoretical reason) can be a product of the same faculty as thought oriented towards moral duty (practical reason). The problem of the unity of the objects of philosophy is the problem of how the ground of that which we know (the supersensible ground of nature) is the same as the ground of moral action (the supersensible ground of that nature in which the summum bonum is possible – together with freedom within the subject). Kant only makes some rather vague suggestions about how proof of these unities is to be established – but it is clear that he believes the faculty of judgment is the key

We will briefly look at the second of these problems. The central move is the a priori principle of nature’s purposiveness for judgment. This amounts to the assumption that judgment will always be possible, even in cases like aesthetic judgment where no concept can be found. As we discussed in A5, this principle makes a claim (though only from the ‘point of view’ of judgment) about the supersensible ground of nature. This claim leads to two assertions. First, that the supersensible ground of beauty in nature is the same as the undetermined ground of nature as an object of science. Second, it is also capable of moral determination and thus also the same as the supersensible ground of moral nature. Together, these two prove the unity of the supersensible objects of philosophy.

Let us very briefly look at the grand problem Kant poses for himself in the Critique of Judgment. The problem comes down to the implications of the ‘abyss’ that Kant opened up between theoretical and practical philosophy; or, as we may as well put it, between the side of our being that knows or tries to know the world, and the side that wills (or fails to will) according to moral law. Although this issue dominates Kant’s two introductions to his book, the book itself contains only occasional references to it, and certainly no clear statement of a solution. But arguably there is sufficient material to suggest what Kant’s solution might have been.

The following quotation contains the kernel.” The understanding, inasmuch as it can give laws to nature a priori, proves that we cognize nature only as appearance, and hence at the same time points to a supersensible substrate of nature; but it leaves this substrate entirely undetermined” (Introduction IX, translation modified). Kant is referring to the first Critique and especially to his solution to the Antinomies therein. The solution there merely required that we recognize the distinction between appearances and things-in-themselves. But this solution required nothing further of the latter other than its mere negative definition: that it not be subject to the conditions of appearance.

Kant continues, ‘Judgment, through its a priori principle of judging nature [purposively; in other words judging nature] in terms of possible particular laws of nature, provides nature’s supersensible substrate (within as well as outside us) with determinability by the intellectual faculty [i.e. reason].’ He is referring here particularly to the principle of reflective judgment (and especially aesthetic judgments on the beautiful) that nature will exhibit a purposiveness with respect to our faculty of judgment, that ‘particular’ laws of nature will always be ‘possible’. This purposiveness can only be accounted for if judgment assumes a supersensible that determines this purposiveness. This supersensible is the ‘same’ supersensible substrate underlying nature as the object of theoretical reason. It is no longer merely indeterminate. But because the particular laws are as yet only ‘possible’ – and this is exacerbated in aesthetic judgment with the notion of purposiveness ‘without purpose’ – the substrate remains left open, it is ‘determinable’ but not ‘determined’. That is to say, judgment conceives of the supersensible as capable of receiving a determinate purpose, should there be good reasons for assuming there to be such a purpose.

Kant continues, ‘But reason, through its a priori practical law, gives this same substrate determination.’ The determination in question is the one Kant introduced in the moral proof for the existence of God: that is, from the point of view of our moral selves, the ‘same’ supersensible is the ground of phenomenal nature’s co-operation in our moral projects. It carries the summum bonum as its final purpose.

Kant accordingly concludes: ‘Thus judgment makes the transition from the domain of the concept of nature to that of the concept of freedom.’ Judgment has also made the transition such that the supersensible objects of reason have to been seen as ‘the same’. Moreover, Judgment has, on the side of the subjective mind, made it conceivable to reason that its theoretical and practical employments are not only compatible (that was proved already in the Antinomy concerning freedom) but also capable of co-ordination towards moral purposes. Because, on the one hand, aesthetic judgment were found to be not fundamentally different from ordinary theoretical cognition of nature (see A2 above), and on the other hand, aesthetic judgment has a deep similarity to moral judgment (A5). Thus, Kant has demonstrated that the physical and moral universes – and the philosophies and forms of thought that present them – are not only compatible, but unified.

5. References and Further Reading

a. Works by Kant

The standard edition of the collected works in German is Kant’s gesammelte Schriften, Edited by the Deutsche Akademie der Wissenshaften, Berlin: Walter de Gruyter. Equally widely available is the Werkausgabe in zwölf Bänden, edited by Wilhelm Weischedel, Frankfurt am Mein: Suhrkamp. There are alternative, perfectly acceptable, translations of most of the following. Cambridge University Press, at the time of writing, is about half-way through publishing the complete works in English.

  • Aesthetics and Teleology. Ed., Eric Matthews and Eva Schaper. (Cambridge: Cambridge University Press, forthcoming)
  • Critique of Judgment. Trans., Werner Pluhar. (Indianapolis: Hackett, 1987)
  • Critique of Judgment. Trans., James Creed Meredith. (Oxford: Clarendon, 1988)
  • Critique of Practical Reason. Trans., Ed., Lewis White Beck. (Oxford: Maxwell Macmillan International, 1993)
  • Critique of Pure Reason. Trans., Werner Pluhar. (Indianapolis: Hackett, 1996)

b. Other Primary and Secondary Works

For a treatment of various themes in Kant, please also see the introductions to the above editions.

  • Burnham, Douglas. An Introduction to Kant’s Critique of Judgment. (Edinburgh: Edinburgh University Press [in the US, Columbia University Press], 2000)
  • Caygill, Howard. The Art of Judgement.(Oxford: Blackwell, 1989)
  • Cohen, Ted and Guyer, Paul. Essays in Kant’s Aesthetics. (Chicago: Chicago University Press, 1982)
  • Crawford, Donald. Kant’s Aesthetic Theory. (Madison: Wisconsin University Press, 1974)
  • Crawford, Paul. The Kantian Sublime. (Oxford: Clarendon Press, 1991)
  • Gibbons, Sarah L. Kant’s Theory of Imagination.(Oxford: Oxford University Press, 1994)
  • Guyer, Paul, ed. The Cambridge Companion to Kant.(Cambridge: Cambridge University Press, 1992)
  • Guyer, Paul. Kant and the Claims of Taste. (Cambridge, Mass: Harvard University Press, 1979)
  • Guyer, Paul. Kant and the Experience of Freedom.(Cambridge: Cambridge University Press, 1996)
  • Henrich, Dieter. Aesthetic Judgment and the Moral Image of the World. (Stanford: Stanford University Press, 1992)
  • Kemal, Salim. Kant and Fine Art. (Oxford: Clarendon Press, 1986)
  • Kemal, Salim. Kant’s Aesthetic Theory. (London: St Martin’s Press, 1992)
  • Makkreel, Rudi. Imagination and Understanding in Kant. (Chicago: University of Chicago Press, 1994)
  • McCloskey, Mary. Kant’s Aesthetic. (London: Macmillan, 1987) Schaper, Eva. Studies in Kant’s Aesthetics.(Edinburgh: Edinburgh University Press, 1979)
  • Zammito, John H. The Genesis of Kant’s Critique of Judgement.(Chicago: University of Chicago Press, 1992)

Author Information

Douglas Burnham
Email: H.D.Burnham@staffs.ac.uk
Staffordshire University
United Kingdom

Immanuel Kant: Metaphysics

kant2Immanuel Kant (1724-1804) is one of the most influential philosophers in the history of Western philosophy. His contributions to metaphysics, epistemology, ethics, and aesthetics have had a profound impact on almost every philosophical movement that followed him. This article focuses on his metaphysics and epistemology in one of his most important works, The Critique of Pure Reason.  A large part of Kant’s work addresses the question “What can we know?” The answer, if it can be stated simply, is that our knowledge is constrained to mathematics and the science of the natural, empirical world. It is impossible, Kant argues, to extend knowledge to the supersensible realm of speculative metaphysics. The reason that knowledge has these constraints, Kant argues, is that the mind plays an active role in constituting the features of experience and limiting the mind’s access only to the empirical realm of space and time.

Kant responded to his predecessors by arguing against the Empiricists that the mind is not a blank slate that is written upon by the empirical world, and by rejecting the Rationalists’ notion that pure, a priori knowledge of a mind-independent world was possible.  Reason itself is structured with forms of experience and categories that give a phenomenal and logical structure to any possible object of empirical experience.  These categories cannot be circumvented to get at a mind-independent world, but they are necessary for experience of spatio-temporal objects with their causal behavior and logical properties.  These two theses constitute Kant’s famous transcendental idealism and empirical realism.

Kant’s contributions to ethics have been just as substantial, if not more so, than his work in metaphysics and epistemology.  He is the most important proponent in philosophical history of deontological, or duty based,  ethics. In Kant’s view, the sole feature that gives an action moral worth is not the outcome that is achieved by the action, but the motive that is behind the action.  And the only motive that can endow an act with moral value, he argues, is one that arises from universal principles discovered by reason.  The categorical imperative is Kant’s famous statement of this duty: “Act only according to that maxim by which you can at the same time will that it should become a universal law.”

Table of Contents

  1. Historical Background to Kant
    1. Empiricism
    2. Rationalism
  2. Kant’s Answers to his Predecessors
  3. Kant’s Copernican Revolution: Mind Making Nature
  4. Kant’s Transcendental Idealism
  5. Kant’s Analytic of Principles
  6. Kant’s Dialectic
  7. The Ideas of Reason
  8. Kant’s Ethics
    1. Reason and Freedom
    2. The Duality of the Human Situation
    3. The Good Will
    4. Duty
  9. Kant’s Criticisms of Utilitarianism
  10. References and Further Reading

1. Historical Background to Kant

In order to understand Kant’s position, we must understand the philosophical background that he was reacting to. First, this article presents a brief overview of his predecessor’s positions with a brief statement of Kant’s objections, then I will return to a more detailed exposition of Kant’s arguments. There are two major historical movements in the early modern period of philosophy that had a significant impact on Kant: Empiricism and Rationalism. Kant argues that both the method and the content of these philosophers’ arguments contain serious flaws. A central epistemological problem for philosophers in both movements was determining how we can escape from within the confines of the human mind and the immediately knowable content of our own thoughts to acquire knowledge of the world outside of us. The Empiricists sought to accomplish this through the senses and a posteriori reasoning. The Rationalists attempted to use a priori reasoning to build the necessary bridge. A posteriori reasoning depends upon experience or contingent events in the world to provide us with information. That “Bill Clinton was president of the United States in 1999,” for example, is something that I can know only through experience; I cannot determine this to be true through an analysis of the concepts of “president” or “Bill Clinton.” A priori reasoning, in contrast, does not depend upon experience to inform it. The concept “bachelor” logically entails the ideas of an unmarried, adult, human male without my needing to conduct a survey of bachelors and men who are unmarried. Kant believed that this twofold distinction in kinds of knowledge was inadequate to the task of understanding metaphysics for reasons we will discuss in a moment.

a. Empiricism

Empiricists, such as Locke, Berkeley, and Hume, argued that human knowledge originates in our sensations. Locke, for instance, was a representative realist about the external world and placed great confidence in the ability of the senses to inform us of the properties that empirical objects really have in themselves. Locke had also argued that the mind is a blank slate, or a tabula rasa, that becomes populated with ideas by its interactions with the world. Experience teaches us everything, including concepts of relationship, identity, causation, and so on. Kant argues that the blank slate model of the mind is insufficient to explain the beliefs about objects that we have; some components of our beliefs must be brought by the mind to experience.

Berkeley’s strict phenomenalism, in contrast to Locke, raised questions about the inference from the character of our sensations to conclusions about the real properties of mind-independent objects. Since the human mind is strictly limited to the senses for its input, Berkeley argued, it has no independent means by which to verify the accuracy of the match between sensations and the properties that objects possess in themselves. In fact, Berkeley rejected the very idea of mind-independent objects on the grounds that a mind is, by its nature, incapable of possessing an idea of such a thing. Hence, in Kant’s terms, Berkeley was a material idealist. To the material idealist, knowledge of material objects is ideal or unachievable, not real. For Berkeley, mind-independent material objects are impossible and unknowable. In our sense experience we only have access to our mental representations, not to objects themselves. Berkeley argues that our judgments about objects are really judgments about these mental representations alone, not the substance that gives rise to them. In the Refutation of Material Idealism, Kant argues that material idealism is actually incompatible with a position that Berkeley held, namely that we are capable of making judgments about our experience.

David Hume pursued Berkeley’s empirical line of inquiry even further, calling into question even more of our common sense beliefs about the source and support of our sense perceptions. Hume maintains that we cannot provide a priori or a posteriori justifications for a number of our beliefs like, “Objects and subjects persist identically over time,” or “Every event must have a cause.” In Hume’s hands, it becomes clear that empiricism cannot give us an epistemological justification for the claims about objects, subjects, and causes that we took to be most obvious and certain about the world.

Kant expresses deep dissatisfaction with the idealistic and seemingly skeptical results of the empirical lines of inquiry. In each case, Kant gives a number of arguments to show that Locke’s, Berkeley’s, and Hume’s empiricist positions are untenable because they necessarily presuppose the very claims they set out to disprove. In fact, any coherent account of how we perform even the most rudimentary mental acts of self-awareness and making judgments about objects must presuppose these claims, Kant argues. Hence, while Kant is sympathetic with many parts of empiricism, ultimately it cannot be a satisfactory account of our experience of the world.

b. Rationalism

The Rationalists, principally Descartes, Spinoza, and Leibniz, approached the problems of human knowledge from another angle. They hoped to escape the epistemological confines of the mind by constructing knowledge of the external world, the self, the soul, God, ethics, and science out of the simplest, indubitable ideas possessed innately by the mind. Leibniz in particular, thought that the world was knowable a priori, through an analysis of ideas and derivations done through logic. Supersensible knowledge, the Rationalists argued, can be achieved by means of reason. Descartes believed that certain truths, that “if I am thinking, I exist,” for example, are invulnerable to the most pernicious skepticism. Armed with the knowledge of his own existence, Descartes hoped to build a foundation for all knowledge.

Kant’s Refutation of Material Idealism works against Descartes’ project as well as Berkeley’s. Descartes believed that he could infer the existence of objects in space outside of him based on his awareness of his own existence coupled with an argument that God exists and is not deceiving him about the evidence of his senses. Kant argues in the Refutation chapter that knowledge of external objects cannot be inferential. Rather, the capacity to be aware of one’s own existence in Descartes’ famous cogito argument already presupposes that existence of objects in space and time outside of me.

Kant had also come to doubt the claims of the Rationalists because of what he called Antinomies, or contradictory, but validly proven pairs of claims that reason is compelled toward. From the basic principles that the Rationalists held, it is possible, Kant argues, to prove conflicting claims like, “The world has a beginning in time and is limited as regards space,” and “The world has no beginning, and no limits in space.” (A 426/B 454) Kant claims that antinomies like this one reveal fundamental methodological and metaphysical mistakes in the rationalist project. The contradictory claims could both be proven because they both shared the mistaken metaphysical assumption that we can have knowledge of things as they are in themselves, independent of the conditions of our experience of them.

The Antinomies can be resolved, Kant argues, if we understand the proper function and domain of the various faculties that contribute to produce knowledge. We must recognize that we cannot know things as they are in themselves and that our knowledge is subject to the conditions of our experience. The Rationalist project was doomed to failure because it did not take note of the contribution that our faculty of reason makes to our experience of objects. Their a priori analysis of our ideas could inform us about the content of our ideas, but it could not give a coherent demonstration of metaphysical truths about the external world, the self, the soul, God, and so on.

2. Kant’s Answers to his Predecessors

Kant’s answer to the problems generated by the two traditions mentioned above changed the face of philosophy. First, Kant argued that that old division between a priori truths and a posteriori truths employed by both camps was insufficient to describe the sort of metaphysical claims that were under dispute. An analysis of knowledge also requires a distinction between synthetic and analytic truths. In an analytic claim, the predicate is contained within the subject. In the claim, “Every body occupies space,” the property of occupying space is revealed in an analysis of what it means to be a body. The subject of a synthetic claim, however, does not contain the predicate. In, “This tree is 120 feet tall,” the concepts are synthesized or brought together to form a new claim that is not contained in any of the individual concepts. The Empiricists had not been able to prove synthetic a priori claims like “Every event must have a cause,” because they had conflated “synthetic” and “a posteriori” as well as “analytic” and “a priori.” Then they had assumed that the two resulting categories were exhaustive. A synthetic a priori claim, Kant argues, is one that must be true without appealing to experience, yet the predicate is not logically contained within the subject, so it is no surprise that the Empiricists failed to produce the sought after justification. The Rationalists had similarly conflated the four terms and mistakenly proceeded as if claims like, “The self is a simple substance,” could be proven analytically and a priori.

Synthetic a priori claims, Kant argues, demand an entirely different kind of proof than those required for analytic a priori claims or synthetic a posteriori claims. Indications for how to proceed, Kant says, can be found in the examples of synthetic a priori claims in natural science and mathematics, specifically geometry. Claims like Newton’s, “the quantity of matter is always preserved,” and the geometer’s claim, “the angles of a triangle always add up to 180 degrees” are known a priori, but they cannot be known merely from an analysis of the concepts of matter or triangle. We must “go outside and beyond the concept. . . joining to it a priori in thought something which I have not thought in it.” (B 18) A synthetic a priori claim constructs upon and adds to what is contained analytically in a concept without appealing to experience. So if we are to solve the problems generated by Empiricism and Rationalism, the central question of metaphysics in the Critique of Pure Reason reduces to “How are synthetic a priori judgments possible?” (19) (All references to The Critique of Pure Reason will be to the A (1781) and B(1787) edition pages in Werner Pluhar’s translation. Indianapolis: Hackett, 1996.) If we can answer that question, then we can determine the possibility, legitimacy, and range of all metaphysical claims.

3. Kant’s Copernican Revolution: Mind Making Nature

Kant’s answer to the question is complicated, but his conclusion is that a number of synthetic a priori claims, like those from geometry and the natural sciences, are true because of the structure of the mind that knows them. “Every event must have a cause” cannot be proven by experience, but experience is impossible without it because it describes the way the mind must necessarily order its representations. We can understand Kant’s argument again by considering his predecessors. According to the Rationalist and Empiricist traditions, the mind is passive either because it finds itself possessing innate, well-formed ideas ready for analysis, or because it receives ideas of objects into a kind of empty theater, or blank slate. Kant’s crucial insight here is to argue that experience of a world as we have it is only possible if the mind provides a systematic structuring of its representations. This structuring is below the level of, or logically prior to, the mental representations that the Empiricists and Rationalists analyzed. Their epistemological and metaphysical theories could not adequately explain the sort of judgments or experience we have because they only considered the results of the mind’s interaction with the world, not the nature of the mind’s contribution. Kant’s methodological innovation was to employ what he calls a transcendental argument to prove synthetic a priori claims. Typically, a transcendental argument attempts to prove a conclusion about the necessary structure of knowledge on the basis of an incontrovertible mental act. Kant argues in the Refutation of Material Idealism that the fact that “There are objects that exist in space and time outside of me,” (B 274) which cannot be proven by a priori or a posteriori methods, is a necessary condition of the possibility of being aware of one’s own existence. It would not be possible to be aware of myself as existing, he says, without presupposing the existing of something permanent outside of me to distinguish myself from. I am aware of myself as existing. Therefore, there is something permanent outside of me.

This argument is one of many transcendental arguments that Kant gives that focuses on the contribution that the mind itself makes to its experience. These arguments lead Kant to reject the Empiricists’ assertion that experience is the source of all our ideas. It must be the mind’s structuring, Kant argues, that makes experience possible. If there are features of experience that the mind brings to objects rather than given to the mind by objects, that would explain why they are indispensable to experience but unsubstantiated in it. And that would explain why we can give a transcendental argument for the necessity of these features. Kant thought that Berkeley and Hume identified at least part of the mind’s a priori contribution to experience with the list of claims that they said were unsubstantiated on empirical grounds: “Every event must have a cause,” “There are mind-independent objects that persist over time,” and “Identical subjects persist over time.” The empiricist project must be incomplete since these claims are necessarily presupposed in our judgments, a point Berkeley and Hume failed to see. So, Kant argues that a philosophical investigation into the nature of the external world must be as much an inquiry into the features and activity of the mind that knows it.

The idea that the mind plays an active role in structuring reality is so familiar to us now that it is difficult for us to see what a pivotal insight this was for Kant. He was well aware of the idea’s power to overturn the philosophical worldviews of his contemporaries and predecessors, however. He even somewhat immodestly likens his situation to that of Copernicus in revolutionizing our worldview. In the Lockean view, mental content is given to the mind by the objects in the world. Their properties migrate into the mind, revealing the true nature of objects. Kant says, “Thus far it has been assumed that all our cognition must conform to objects” (B xvi). But that approach cannot explain why some claims like, “every event must have a cause,” are a priori true. Similarly, Copernicus recognized that the movement of the stars cannot be explained by making them revolve around the observer; it is the observer that must be revolving. Analogously, Kant argued that we must reformulate the way we think about our relationship to objects. It is the mind itself which gives objects at least some of their characteristics because they must conform to its structure and conceptual capacities. Thus, the mind’s active role in helping to create a world that is experiencable must put it at the center of our philosophical investigations. The appropriate starting place for any philosophical inquiry into knowledge, Kant decides, is with the mind that can have that knowledge.

Kant’s critical turn toward the mind of the knower is ambitious and challenging. Kant has rejected the dogmatic metaphysics of the Rationalists that promises supersensible knowledge. And he has argued that Empiricism faces serious limitations. His transcendental method will allow him to analyze the metaphysical requirements of the empirical method without venturing into speculative and ungrounded metaphysics. In this context, determining the “transcendental” components of knowledge means determining, “all knowledge which is occupied not so much with objects as with the mode of our knowledge of objects in so far as this mode of knowledge is to be possible a priori.” (A 12/B 25)

The project of the Critique of Pure Reason is also challenging because in the analysis of the mind’s transcendental contributions to experience we must employ the mind, the only tool we have, to investigate the mind. We must use the faculties of knowledge to determine the limits of knowledge, so Kant’s Critique of Pure Reason is both a critique that takes pure reason as its subject matter, and a critique that is conducted by pure reason.

Kant’s argument that the mind makes an a priori contribution to experiences should not be mistaken for an argument like the Rationalists’ that the mind possesses innate ideas like, “God is a perfect being.” Kant rejects the claim that there are complete propositions like this one etched on the fabric of the mind. He argues that the mind provides a formal structuring that allows for the conjoining of concepts into judgments, but that structuring itself has no content. The mind is devoid of content until interaction with the world actuates these formal constraints. The mind possesses a priori templates for judgments, not a priori judgments.

4. Kant’s Transcendental Idealism

With Kant’s claim that the mind of the knower makes an active contribution to experience of objects before us, we are in a better position to understand transcendental idealism.  Kant’s arguments are designed to show the limitations of our knowledge. The Rationalists believed that we could possess metaphysical knowledge about God, souls, substance, and so forth; they believed such knowledge was transcendentally real. Kant argues, however, that we cannot have knowledge of the realm beyond the empirical. That is, transcendental knowledge is ideal, not real, for minds like ours. Kant identifies two a priori sources of these constraints. The mind has a receptive capacity, or the sensibility, and the mind possesses a conceptual capacity, or the understanding.

In the Transcendental Aesthetic section of the Critique, Kant argues that sensibility is the understanding’s means of accessing objects. The reason synthetic a priori judgments are possible in geometry, Kant argues, is that space is an a priori form of sensibility. That is, we can know the claims of geometry with a priori certainty (which we do) only if experiencing objects in space is the necessary mode of our experience. Kant also argues that we cannot experience objects without being able to represent them spatially. It is impossible to grasp an object as an object unless we delineate the region of space it occupies. Without a spatial representation, our sensations are undifferentiated and we cannot ascribe properties to particular objects. Time, Kant argues, is also necessary as a form or condition of our intuitions of objects. The idea of time itself cannot be gathered from experience because succession and simultaneity of objects, the phenomena that would indicate the passage of time, would be impossible to represent if we did not already possess the capacity to represent objects in time.

Another way to understand Kant’s point here is that it is impossible for us to have any experience of objects that are not in time and space. Furthermore, space and time themselves cannot be perceived directly, so they must be the form by which experience of objects is had. A consciousness that apprehends objects directly, as they are in themselves and not by means of space and time, is possible—God, Kant says, has a purely intuitive consciousness—but our apprehension of objects is always mediated by the conditions of sensibility. Any discursive or concept using consciousness (A 230/B 283) like ours must apprehend objects as occupying a region of space and persisting for some duration of time.

Subjecting sensations to the a priori conditions of space and time is not sufficient to make judging objects possible. Kant argues that the understanding must provide the concepts, which are rules for identifying what is common or universal in different representations.(A 106) He says, “without sensibility no object would be given to us; and without understanding no object would be thought. Thoughts without content are empty; intuitions without concepts are blind.” (B 75) Locke’s mistake was believing that our sensible apprehensions of objects are thinkable and reveal the properties of the objects themselves. In the Analytic of Concepts section of the Critique, Kant argues that in order to think about the input from sensibility, sensations must conform to the conceptual structure that the mind has available to it. By applying concepts, the understanding takes the particulars that are given in sensation and identifies what is common and general about them. A concept of “shelter” for instance, allows me to identify what is common in particular representations of a house, a tent, and a cave.

The empiricist might object at this point by insisting that such concepts do arise from experience, raising questions about Kant’s claim that the mind brings an a priori conceptual structure to the world. Indeed, concepts like “shelter” do arise partly from experience. But Kant raises a more fundamental issue. An empirical derivation is not sufficient to explain all of our concepts. As we have seen, Hume argued, and Kant accepts, that we cannot empirically derive our concepts of causation, substance, self, identity, and so forth. What Hume had failed to see, Kant argues, is that even the possibility of making judgments about objects, to which Hume would assent, presupposes the possession of these fundamental concepts. Hume had argued for a sort of associationism to explain how we arrive at causal beliefs. My idea of a moving cue ball, becomes associated with my idea of the eight ball that is struck and falls into the pocket. Under the right circumstances, repeated impressions of the second following the first produces a belief in me that the first causes the second.

The problem that Kant points out is that a Humean association of ideas already presupposes that we can conceive of identical, persistent objects that have regular, predictable, causal behavior. And being able to conceive of objects in this rich sense presupposes that the mind makes several a priori contributions. I must be able to separate the objects from each other in my sensations, and from my sensations of myself. I must be able to attribute properties to the objects. I must be able to conceive of an external world with its own course of events that is separate from the stream of perceptions in my consciousness. These components of experience cannot be found in experience because they constitute it. The mind’s a priori conceptual contribution to experience can be enumerated by a special set of concepts that make all other empirical concepts and judgments possible. These concepts cannot be experienced directly; they are only manifest as the form which particular judgments of objects take. Kant believes that formal logic has already revealed what the fundamental categories of thought are. The special set of concepts is Kant’s Table of Categories, which are taken mostly from Aristotle with a few revisions:

Of Quantity
Unity
Plurality
Totality
Of Quality Of Relation
Reality Inherence and Subsistence
Negation Causality and Dependence
Limitation Community
Of Modality
Possibility-Impossibility
Existence-Nonexistence
Necessity-Contingency

While Kant does not give a formal derivation of it, he believes that this is the complete and necessary list of the a priori contributions that the understanding brings to its judgments of the world. Every judgment that the understanding can make must fall under the table of categories. And subsuming spatiotemporal sensations under the formal structure of the categories makes judgments, and ultimately knowledge, of empirical objects possible.

Since objects can only be experienced spatiotemporally, the only application of concepts that yields knowledge is to the empirical, spatiotemporal world. Beyond that realm, there can be no sensations of objects for the understanding to judge, rightly or wrongly. Since intuitions of the physical world are lacking when we speculate about what lies beyond, metaphysical knowledge, or knowledge of the world outside the physical, is impossible. Claiming to have knowledge from the application of concepts beyond the bounds of sensation results in the empty and illusory transcendent metaphysics of Rationalism that Kant reacts against.

It should be pointed out, however, that Kant is not endorsing an idealism about objects like Berkeley’s. That is, Kant does not believe that material objects are unknowable or impossible. While Kant is a transcendental idealist–he believes the nature of objects as they are in themselves is unknowable to us–knowledge of appearances is nevertheless possible. As noted above, in The Refutation of Material Idealism, Kant argues that the ordinary self-consciousness that Berkeley and Descartes would grant implies “the existence of objects in space outside me.” (B 275) Consciousness of myself would not be possible if I were not able to make determinant judgments about objects that exist outside of me and have states that are independent of my inner experience. Another way to put the point is to say that the fact that the mind of the knower makes the a priori contribution does not mean that space and time or the categories are mere figments of the imagination. Kant is an empirical realist about the world we experience; we can know objects as they appear to us. He gives a robust defense of science and the study of the natural world from his argument about the mind’s role in making nature. All discursive, rational beings must conceive of the physical world as spatially and temporally unified, he argues. And the table of categories is derived from the most basic, universal forms of logical inference, Kant believes. Therefore, it must be shared by all rational beings. So those beings also share judgments of an intersubjective, unified, public realm of empirical objects. Hence, objective knowledge of the scientific or natural world is possible. Indeed, Kant believes that the examples of Newton and Galileo show it is actual. So Berkeley’s claims that we do not know objects outside of us and that such knowledge is impossible are both mistaken.

In conjunction with his analysis of the possibility of knowing empirical objects, Kant gives an analysis of the knowing subject that has sometimes been called his transcendental psychology. Much of Kant’s argument can be seen as subjective, not because of variations from mind to mind, but because the source of necessity and universality is in the mind of the knowing subject, not in objects themselves. Kant draws several conclusions about what is necessarily true of any consciousness that employs the faculties of sensibility and understanding to produce empirical judgments. As we have seen, a mind that employs concepts must have a receptive faculty that provides the content of judgments. Space and time are the necessary forms of apprehension for the receptive faculty. The mind that has experience must also have a faculty of combination or synthesis, the imagination for Kant, that apprehends the data of sense, reproduces it for the understanding, and recognizes their features according to the conceptual framework provided by the categories. The mind must also have a faculty of understanding that provides empirical concepts and the categories for judgment. The various faculties that make judgment possible must be unified into one mind. And it must be identical over time if it is going to apply its concepts to objects over time. Kant here addresses Hume’s famous assertion that introspection reveals nothing more than a bundle of sensations that we group together and call the self. Judgments would not be possible, Kant maintains, if the mind that senses is not the same as the mind that possesses the forms of sensibility. And that mind must be the same as the mind that employs the table of categories, that contributes empirical concepts to judgment, and that synthesizes the whole into knowledge of a unified, empirical world. So the fact that we can empirically judge proves, contra Hume, that the mind cannot be a mere bundle of disparate introspected sensations. In his works on ethics Kant will also argue that this mind is the source of spontaneous, free, and moral action. Kant believes that all the threads of his transcendental philosophy come together in this “highest point” which he calls the transcendental unity of apperception.

5. Kant’s Analytic of Principles

We have seen the progressive stages of Kant’s analysis of the faculties of the mind which reveals the transcendental structuring of experience performed by these faculties. First, in his analysis of sensibility, he argues for the necessarily spatiotemporal character of sensation. Then Kant analyzes the understanding, the faculty that applies concepts to sensory experience. He concludes that the categories provide a necessary, foundational template for our concepts to map onto our experience. In addition to providing these transcendental concepts, the understanding also is the source of ordinary empirical concepts that make judgments about objects possible. The understanding provides concepts as the rules for identifying the properties in our representations.

Kant’s next concern is with the faculty of judgment, “If understanding as such is explicated as our power of rules, then the power of judgment is the ability to subsume under rules, i.e., to distinguish whether something does or does not fall under a given rule.” (A 132/B 172). The next stage in Kant’s project will be to analyze the formal or transcendental features of experience that enable judgment, if there are any such features besides what the previous stages have identified. The cognitive power of judgment does have a transcendental structure. Kant argues that there are a number of principles that must necessarily be true of experience in order for judgment to be possible. Kant’s analysis of judgment and the arguments for these principles are contained in his Analytic of Principles.

Within the Analytic, Kant first addresses the challenge of subsuming particular sensations under general categories in the Schematism section. Transcendental schemata, Kant argues, allow us to identify the homogeneous features picked out by concepts from the heterogeneous content of our sensations. Judgment is only possible if the mind can recognize the components in the diverse and disorganized data of sense that make those sensations an instance of a concept or concepts. A schema makes it possible, for instance, to subsume the concrete and particular sensations of an Airedale, a Chihuahua, and a Labrador all under the more abstract concept “dog.”

The full extent of Kant’s Copernican revolution becomes even more clear in the rest of the Analytic of Principles. That is, the role of the mind in making nature is not limited to space, time, and the categories. In the Analytic of Principles, Kant argues that even the necessary conformity of objects to natural law arises from the mind. Thus far, Kant’s transcendental method has permitted him to reveal the a priori components of sensations, the a priori concepts. In the sections titled the Axioms, Anticipations, Analogies, and Postulates, he argues that there are a priori judgments that must necessarily govern all appearances of objects. These judgments are a function of the table of categories’ role in determining all possible judgments, so the four sections map onto the four headings of that table. I include all of the a priori judgments, or principles, here to illustrate the earlier claims about Kant’s empirical realism, and to show the intimate relationship Kant saw between his project and that of the natural sciences:

Axioms of Intuition
All intuitions are extensive magnitudes.
Anticipations of Perception Analogies of Experience
In all appearances the real that is an object of sensation has intensive magnitude, i.e., a degree. In all variations by appearances substance is permanent, and its quantum in nature is neither increased nor decreased.
All changes occur according to the law of the connection of cause and effect.
All substances, insofar as they can be perceived in space as simultaneous, are in thoroughgoing interaction.
Postulates of Empirical Thought
What agrees (in terms of intuition and concepts) with the formal conditions of experience is possible.
What coheres with the material conditions of experience (with sensation) is actual.
That whose coherence with the actual is determined according to universal conditions of experience is necessary (exists necessarily)

6. Kant’s Dialectic

The discussion of Kant’s metaphysics and epistemology so far (including the Analytic of Principles) has been confined primarily to the section of the Critique of Pure Reason that Kant calls the Transcendental Analytic. The purpose of the Analytic, we are told, is “the rarely attempted dissection of the power of the understanding itself.” (A 65/B 90). Kant’s project has been to develop the full argument for his theory about the mind’s contribution to knowledge of the world. Once that theory is in place, we are in a position to see the errors that are caused by transgressions of the boundaries to knowledge established by Kant’s transcendental idealism and empirical realism. Kant calls judgments that pretend to have knowledge beyond these boundaries and that even require us to tear down the limits that he has placed on knowledge, transcendent judgments. The Transcendental Dialectic section of the book is devoted to uncovering the illusion of knowledge created by transcendent judgments and explaining why the temptation to believe them persists. Kant argues that the proper functioning of the faculties of sensibility and the understanding combine to draw reason, or the cognitive power of inference, inexorably into mistakes. The faculty of reason naturally seeks the highest ground of unconditional unity. It seeks to unify and subsume all particular experiences under higher and higher principles of knowledge. But sensibility cannot by its nature provide the intuitions that would make knowledge of the highest principles and of things as they are in themselves possible. Nevertheless, reason, in its function as the faculty of inference, inevitably draws conclusions about what lies beyond the boundaries of sensibility. The unfolding of this conflict between the faculties reveals more about the mind’s relationship to the world it seeks to know and the possibility of a science of metaphysics.

Kant believes that Aristotle’s logic of the syllogism captures the logic employed by reason. The resulting mistakes from the inevitable conflict between sensibility and reason reflect the logic of Aristotle’s syllogism. Corresponding to the three basic kinds of syllogism are three dialectic mistakes or illusions of transcendent knowledge that cannot be real. Kant’s discussion of these three classes of mistakes are contained in the Paralogisms, the Antinomies, and the Ideals of Reason. The Dialectic explains the illusions of reason in these sections. But since the illusions arise from the structure of our faculties, they will not cease to have their influence on our minds any more than we can prevent the moon from seeming larger when it is on the horizon than when it is overhead. (A 297/B 354).

In the Paralogisms, Kant argues that a failure to recognize the difference between appearances and things in themselves, particularly in the case of the introspected self, leads us into transcendent error. Kant argues against several conclusions encouraged by Descartes and the rational psychologists, who believed they could build human knowledge from the “I think” of the cogito argument. From the “I think” of self-awareness we can infer, they maintain, that the self or soul is 1) simple, 2) immaterial, 3) an identical substance and 4) that we perceive it directly, in contrast to external objects whose existence is merely possible. That is, the rational psychologists claimed to have knowledge of the self as transcendentally real. Kant believes that it is impossible to demonstrate any of these four claims, and that the mistaken claims to knowledge stem from a failure to see the real nature of our apprehension of the “I.” Reason cannot fail to apply the categories to its judgments of the self, and that application gives rise to these four conclusions about the self that correspond roughly to the four headings in the table of categories. But to take the self as an object of knowledge here is to pretend to have knowledge of the self as it is in itself, not as it appears to us. Our representation of the “I” itself is empty. It is subject to the condition of inner sense, time, but not the condition of outer sense, space, so it cannot be a proper object of knowledge. It can be thought through concepts, but without the commensurate spatial and temporal intuitions, it cannot be known. Each of the four paralogisms explains the categorical structure of reason that led the rational psychologists to mistake the self as it appears to us for the self as it is in itself.

We have already mentioned the Antinomies, in which Kant analyzes the methodological problems of the Rationalist project. Kant sees the Antinomies as the unresolved dialogue between skepticism and dogmatism about knowledge of the world. There are four antinomies, again corresponding to the four headings of the table of categories, that are generated by reason’s attempts to achieve complete knowledge of the realm beyond the empirical. Each antinomy has a thesis and an antithesis, both of which can be validly proven, and since each makes a claim that is beyond the grasp of spatiotemporal sensation, neither can be confirmed or denied by experience. The First Antinomy argues both that the world has a beginning in time and space, and no beginning in time and space. The Second Antinomy’s arguments are that every composite substance is made of simple parts and that nothing is composed of simple parts. The Third Antinomy’s thesis is that agents like ourselves have freedom and its antithesis is that they do not. The Fourth Antinomy contains arguments both for and against the existence of a necessary being in the world. The seemingly irreconcilable claims of the Antinomies can only be resolved by seeing them as the product of the conflict of the faculties and by recognizing the proper sphere of our knowledge in each case. In each of them, the idea of “absolute totality, which holds only as a condition of things in themselves, has been applied to appearances” (A 506/B534).

The result of Kant’ analysis of the Antinomies is that we can reject both claims of the first two and accept both claims of the last two, if we understand their proper domains. In the first Antinomy, the world as it appears to us is neither finite since we can always inquire about its beginning or end, nor is it infinite because finite beings like ourselves cannot cognize an infinite whole. As an empirical object, Kant argues, it is indefinitely constructable for our minds. As it is in itself, independent of the conditions of our thought, it should not be identified as finite or infinite since both are categorical conditions of our thought. Kant’s resolution of the third Antinomy (A 445/B 473) clarifies his position on freedom. He considers the two competing hypotheses of speculative metaphysics that there are different types of causality in the world: 1) there are natural causes which are themselves governed by the laws of nature as well as uncaused causes like ourselves that can act freely, or 2) the causal laws of nature entirely govern the world including our actions. The conflict between these contrary claims can be resolved, Kant argues, by taking his critical turn and recognizing that it is impossible for any cause to be thought of as uncaused itself in the realm of space and time. But reason, in trying to understand the ground of all things, strives to unify its knowledge beyond the empirical realm. The empirical world, considered by itself, cannot provide us with ultimate reasons. So if we do not assume a first or free cause we cannot completely explain causal series in the world. So for the Third Antinomy, as for all of the Antinomies, the domain of the Thesis is the intellectual, rational, noumenal world. The domain of the Antithesis is the spatiotemporal world.

7. The Ideas of Reason

The faculty of reason has two employments. For the most part, we have engaged in an analysis of theoretical reason which has determined the limits and requirements of the employment of the faculty of reason to obtain knowledge. Theoretical reason, Kant says, makes it possible to cognize what is. But reason has its practical employment in determining what ought to be as well. (A 633/B 661) This distinction roughly corresponds to the two philosophical enterprises of metaphysics and ethics. Reason’s practical use is manifest in the regulative function of certain concepts that we must think with regard to the world, even though we can have no knowledge of them.

Kant believes that, “Human reason is by its nature architectonic.” (A 474/B 502). That is, reason thinks of all cognitions as belonging to a unified and organized system. Reason is our faculty of making inferences and of identifying the grounds behind every truth. It allows us to move from the particular and contingent to the global and universal. I infer that “Caius is mortal” from the fact that “Caius is a man” and the universal claim, “All men are mortal.” In this fashion, reason seeks higher and higher levels of generality in order to explain the way things are. In a different kind of example, the biologist’s classification of every living thing into a kingdom, phylum, class, order, family, genus, and species, illustrates reason’s ambition to subsume the world into an ordered, unified system. The entire empirical world, Kant argues, must be conceived of by reason as causally necessitated (as we saw in the Analogies). We must connect, “one state with a previous state upon which the state follows according to a rule.” Each cause, and each cause’s cause, and each additional ascending cause must itself have a cause. Reason generates this hierarchy that combines to provide the mind with a conception of a whole system of nature. Kant believes that it is part of the function of reason to strive for a complete, determinate understanding of the natural world. But our analysis of theoretical reason has made it clear that we can never have knowledge of the totality of things because we cannot have the requisite sensations of the totality, hence one of the necessary conditions of knowledge is not met. Nevertheless, reason seeks a state of rest from the regression of conditioned, empirical judgments in some unconditioned ground that can complete the series (A 584/B 612). Reason’s structure pushes us to accept certain ideas of reason that allow completion of its striving for unity. We must assume the ideas of God, freedom, and immortality, Kant says, not as objects of knowledge, but as practical necessities for the employment of reason in the realm where we can have knowledge. By denying the possibility of knowledge of these ideas, yet arguing for their role in the system of reason, Kant had to, “annul knowledge in order to make room for faith.” (B xxx).

8. Kant’s Ethics

It is rare for a philosopher in any era to make a significant impact on any single topic in philosophy. For a philosopher to impact as many different areas as Kant did is extraordinary. His ethical theory has been as influential as, if not more influential than, his work in epistemology and metaphysics. Most of Kant’s work on ethics is presented in two works. The Foundations of the Metaphysics of Morals (1785) is Kant’s “search for and establishment of the supreme principle of morality.” In The Critique of Practical Reason (1787) Kant attempts to unify his account of practical reason with his work in the Critique of Pure Reason. Kant is the primary proponent in history of what is called deontological ethics. Deontology is the study of duty. On Kant’s view, the sole feature that gives an action moral worth is not the outcome that is achieved by the action, but the motive that is behind the action. The categorical imperative is Kant’s famous statement of this duty: “Act only according to that maxim by which you can at the same time will that it should become a universal law.”

a. Reason and Freedom

For Kant, as we have seen, the drive for total, systematic knowledge in reason can only be fulfilled with assumptions that empirical observation cannot support. The metaphysical facts about the ultimate nature of things in themselves must remain a mystery to us because of the spatiotemporal constraints on sensibility. When we think about the nature of things in themselves or the ultimate ground of the empirical world, Kant has argued that we are still constrained to think through the categories, we cannot think otherwise, but we can have no knowledge because sensation provides our concepts with no content. So, reason is put at odds with itself because it is constrained by the limits of its transcendental structure, but it seeks to have complete knowledge that would take it beyond those limits.

Freedom plays a central role in Kant’s ethics because the possibility of moral judgments presupposes it. Freedom is an idea of reason that serves an indispensable practical function. Without the assumption of freedom, reason cannot act. If we think of ourselves as completely causally determined, and not as uncaused causes ourselves, then any attempt to conceive of a rule that prescribes the means by which some end can be achieved is pointless. I cannot both think of myself as entirely subject to causal law and as being able to act according to the conception of a principle that gives guidance to my will. We cannot help but think of our actions as the result of an uncaused cause if we are to act at all and employ reason to accomplish ends and understand the world.

So reason has an unavoidable interest in thinking of itself as free. That is, theoretical reason cannot demonstrate freedom, but practical reason must assume it for the purpose of action. Having the ability to make judgments and apply reason puts us outside that system of causally necessitated events. “Reason creates for itself the idea of a spontaneity that can, on its own, start to act–without, i.e., needing to be preceded by another cause by means of which it is determined to action in turn, according to the law of causal connection,” Kant says. (A 533/B 561) In its intellectual domain, reason must think of itself as free.

It is dissatisfying that he cannot demonstrate freedom; nevertheless, it comes as no surprise that we must think of ourselves as free. In a sense, Kant is agreeing with the common sense view that how I choose to act makes a difference in how I actually act. Even if it were possible to give a predictive empirical account of why I act as I do, say on the grounds of a functionalist psychological theory, those considerations would mean nothing to me in my deliberations. When I make a decision about what to do, about which car to buy, for instance, the mechanism at work in my nervous system makes no difference to me. I still have to peruse Consumer Reports, consider my options, reflect on my needs, and decide on the basis of the application of general principles. My first person perspective is unavoidable, hence the deliberative, intellectual process of choice is unavoidable.

b. The Duality of the Human Situation

The question of moral action is not an issue for two classes of beings, according to Kant. The animal consciousness, the purely sensuous being, is entirely subject to causal determination. It is part of the causal chains of the empirical world, but not an originator of causes the way humans are. Hence, rightness or wrongness, as concepts that apply to situations one has control over, do not apply. We do not morally fault the lion for killing the gazelle, or even for killing its own young. The actions of a purely rational being, by contrast, are in perfect accord with moral principles, Kant says. There is nothing in such a being’s nature to make it falter. Its will always conforms with the dictates of reason. Humans are between the two worlds. We are both sensible and intellectual, as was pointed out in the discussion of the first Critique. We are neither wholly determined to act by natural impulse, nor are we free of non-rational impulse. Hence we need rules of conduct. We need, and reason is compelled to provide, a principle that declares how we ought to act when it is in our power to choose

Since we find ourselves in the situation of possessing reason, being able to act according to our own conception of rules, there is a special burden on us. Other creatures are acted upon by the world. But having the ability to choose the principle to guide our actions makes us actors. We must exercise our will and our reason to act. Will is the capacity to act according to the principles provided by reason. Reason assumes freedom and conceives of principles of action in order to function.

Two problems face us however. First, we are not wholly rational beings, so we are liable to succumb to our non-rational impulses. Second, even when we exercise our reason fully, we often cannot know which action is the best. The fact that we can choose between alternate courses of actions (we are not determined to act by instinct or reason) introduces the possibility that there can be better or worse ways of achieving our ends and better or worse ends, depending upon the criteria we adopt. The presence of two different kinds of object in the world adds another dimension, a moral dimension, to our deliberations. Roughly speaking, we can divide the world into beings with reason and will like ourselves and things that lack those faculties. We can think of these classes of things as ends-in-themselves and mere means-to-ends, respectively. Ends-in-themselves are autonomous beings with their own agendas; failing to recognize their capacity to determine their own actions would be to thwart their freedom and undermine reason itself. When we reflect on alternative courses of action, means-to-ends, things like buildings, rocks, and trees, deserve no special status in our deliberations about what goals we should have and what means we use to achieve them. The class of ends-in-themselves, reasoning agents like ourselves, however, do have a special status in our considerations about what goals we should have and the means we employ to accomplish them. Moral actions, for Kant, are actions where reason leads, rather than follows, and actions where we must take other beings that act according to their own conception of the law into account.

c. The Good Will

The will, Kant says, is the faculty of acting according to a conception of law. When we act, whether or not we achieve what we intend with our actions is often beyond our control, so the morality of our actions does not depend upon their outcome. What we can control, however, is the will behind the action. That is, we can will to act according to one law rather than another. The morality of an action, therefore, must be assessed in terms of the motivation behind it. If two people, Smith and Jones, perform the same act, from the same conception of the law, but events beyond Smith’s control prevent her from achieving her goal, Smith is not less praiseworthy for not succeeding. We must consider them on equal moral ground in terms of the will behind their actions.

The only thing that is good without qualification is the good will, Kant says. All other candidates for an intrinsic good have problems, Kant argues. Courage, health, and wealth can all be used for ill purposes, Kant argues, and therefore cannot be intrinsically good. Happiness is not intrinsically good because even being worthy of happiness, Kant says, requires that one possess a good will. The good will is the only unconditional good despite all encroachments. Misfortune may render someone incapable of achieving her goals, for instance, but the goodness of her will remains.

Goodness cannot arise from acting on impulse or natural inclination, even if impulse coincides with duty. It can only arise from conceiving of one’s actions in a certain way. A shopkeeper, Kant says, might do what is in accord with duty and not overcharge a child. Kant argues, “it is not sufficient to do that which should be morally good that it conform to the law; it must be done for the sake of the law.” (Foundations of the Metaphysics of Morals, Akademie pagination 390) There is a clear moral difference between the shopkeeper that does it for his own advantage to keep from offending other customers and the shopkeeper who does it from duty and the principle of honesty.(Ibid., 398) Likewise, in another of Kant’s carefully studied examples, the kind act of the person who overcomes a natural lack of sympathy for other people out of respect for duty has moral worth, whereas the same kind act of the person who naturally takes pleasure in spreading joy does not. A person’s moral worth cannot be dependent upon what nature endowed them with accidentally. The selfishly motivated shopkeeper and the naturally kind person both act on equally subjective and accidental grounds. What matters to morality is that the actor think about their actions in the right manner.

We might be tempted to think that the motivation that makes an action good is having a positive goal–to make people happy, or to provide some benefit. But that is not the right sort of motive, Kant says. No outcome, should we achieve it, can be unconditionally good. Fortune can be misused, what we thought would induce benefit might actually bring harm, and happiness might be undeserved. Hoping to achieve some particular end, no matter how beneficial it may seem, is not purely and unconditionally good. It is not the effect or even the intended effect that bestows moral character on an action. All intended effects “could be brought about through other causes and would not require the will of a rational being, while the highest and unconditional good can be found only in such a will.” (Ibid., 401) It is the possession of a rationally guided will that adds a moral dimension to one’s acts. So it is the recognition and appreciation of duty itself that must drive our actions.

d. Duty

What is the duty that is to motivate our actions and to give them moral value? Kant distinguishes two kinds of law produced by reason. Given some end we wish to achieve, reason can provide a hypothetical imperative, or rule of action for achieving that end. A hypothetical imperative says that if you wish to buy a new car, then you must determine what sort of cars are available for purchase. Conceiving of a means to achieve some desired end is by far the most common employment of reason. But Kant has shown that the acceptable conception of the moral law cannot be merely hypothetical. Our actions cannot be moral on the ground of some conditional purpose or goal. Morality requires an unconditional statement of one’s duty.

And in fact, reason produces an absolute statement of moral action. The moral imperative is unconditional; that is, its imperative force is not tempered by the conditional “if I want to achieve some end, then do X.” It simply states, do X. Kant believes that reason dictates a categorical imperative for moral action. He gives at least three formulations of the Categorical Imperative.

  1. “Act only according to that maxim by which you can at the same time will that it should become a universal law.” (Ibid., 422)
  2. “Act as though the maxim of your action were by your will to become a universal law of nature.” (Ibid)
  3. Act so that you treat humanity, whether in your own person or in that of another, always as an end and never as a means only.” (Ibid., 429)

What are Kant’s arguments for the Categorical Imperative? First, consider an example. Consider the person who needs to borrow money and is considering making a false promise to pay it back. The maxim that could be invoked is, “when I need of money, borrow it, promising to repay it, even though I do not intend to.” But when we apply the universality test to this maxim it becomes clear that if everyone were to act in this fashion, the institution of promising itself would be undermined. The borrower makes a promise, willing that there be no such thing as promises. Thus such an action fails the universality test.

The argument for the first formulation of the categorical imperative can be thought of this way. We have seen that in order to be good, we must remove inclination and the consideration of any particular goal from our motivation to act. The act cannot be good if it arises from subjective impulse. Nor can it be good because it seeks after some particular goal which might not attain the good we seek or could come about through happenstance. We must abstract away from all hoped for effects. If we remove all subjectivity and particularity from motivation we are only left with will to universality. The question “what rule determines what I ought to do in this situation?” becomes “what rule ought to universally guide action?” What we must do in any situation of moral choice is act according to a maxim that we would will everyone to act according to.

The second version of the Categorical Imperative invokes Kant’s conception of nature and draws on the first Critique. In the earlier discussion of nature, we saw that the mind necessarily structures nature. And reason, in its seeking of ever higher grounds of explanation, strives to achieve unified knowledge of nature. A guide for us in moral matters is to think of what would not be possible to will universally. Maxims that fail the test of the categorical imperative generate a contradiction. Laws of nature cannot be contradictory. So if a maxim cannot be willed to be a law of nature, it is not moral.

The third version of the categorical imperative ties Kant’s whole moral theory together. Insofar as they possess a rational will, people are set off in the natural order of things. They are not merely subject to the forces that act upon them; they are not merely means to ends. They are ends in themselves. All means to an end have a merely conditional worth because they are valuable only for achieving something else. The possessor of a rational will, however, is the only thing with unconditional worth. The possession of rationality puts all beings on the same footing, “every other rational being thinks of his existence by means of the same rational ground which holds also for myself; thus it is at the same time an objective principle from which, as a supreme practical ground, it must be possible to derive all laws of the will.” (Ibid., 429)

9. Kant’s Criticisms of Utilitarianism

Kant’s criticisms of utilitarianism have become famous enough to warrant some separate discussion. Utilitarian moral theories evaluate the moral worth of action on the basis of happiness that is produced by an action. Whatever produces the most happiness in the most people is the moral course of action. Kant has an insightful objection to moral evaluations of this sort. The essence of the objection is that utilitarian theories actually devalue the individuals it is supposed to benefit. If we allow utilitarian calculations to motivate our actions, we are allowing the valuation of one person’s welfare and interests in terms of what good they can be used for. It would be possible, for instance, to justify sacrificing one individual for the benefits of others if the utilitarian calculations promise more benefit. Doing so would be the worst example of treating someone utterly as a means and not as an end in themselves.

Another way to consider his objection is to note that utilitarian theories are driven by the merely contingent inclination in humans for pleasure and happiness, not by the universal moral law dictated by reason. To act in pursuit of happiness is arbitrary and subjective, and is no more moral than acting on the basis of greed, or selfishness. All three emanate from subjective, non-rational grounds. The danger of utilitarianism lies in its embracing of baser instincts, while rejecting the indispensable role of reason and freedom in our actions.

10. References and Further Reading

  • Anthropology from a Pragmatic Point of View, trans. Victor Lyle Dowden. Southern Illinois University Press, 1996.
  • The Conflict of the Faculties, trans. Mary Gregor. Lincoln: University of Nebraska Press, 1992.
  • Correspondence. ed. Arnulf Zweig. New York: Cambridge University Press, 1999.
  • Critique of Judgment, trans. Werner S. Pluhar. Indianapolis: Hackett, 1987.
  • Critique of Practical Reason, trans. Mary Gregor. New York: Cambridge University Press, 1997.
  • Critique of Pure Reason, trans. Werner Pluhar. Indianapolis: Hackett, 1996.
  • Groundwork of the Metaphysics of Morals. ed. Mary Gregor. New York: Cambridge University Press, 1998.
  • Kant’s Latin Writings, Translations, Commentaries, and Notes, trans. Lewis White Beck in collaboration with Mary Gregor, Ralf Meerbote, John Reuscher. New York: Peter Lang, 1986
  • Kant: Philosophical Correspondence 1759-1799, ed. and trans. Arnulf Zweig. Chicago: Chicago University Press, 1967.
  • Logic, trans. Robert S. Hartman and Wolfgang Schwarz. New York: Dover Publications, 1974.
  • Metaphysical Foundations of Natural Science, trans. James Ellington. Indianapolis: Hackett, 1975.
  • The Metaphysics of Morals. trans. Mary Gregor. New York: Cambridge University Press, 1996.
  • Opus Postumum, ed. Eckart Forster, trans. Eckart Forster and Michael Rosen. Cambridge: Cambridge University Press, 1993.
  • Prolegomena to Any Future Metaphysics, trans. Gary Hatfield. New York: Cambridge University Press, 1997.
  • Religion within the Limits of Reason Alone. trans. T.M. Greene and H.H. Hudson. New York: Harper and Row, 1960.
  • Theoretical Philosophy, trans. David Walford and Ralf Meerbote. Cambridge: Cambridge University Press, 1992.
  • What Real Progress Has Metaphysics Made in Germany Since the Time of Leibniz and Wolff?(1804). trans. T. Humphrey. New York: Abaris, 1983 (Ak. XX).

Author Information

Matt McCormick
Email: mccormick@csus.edu
California State University, Sacramento
U. S. A.

Jean-François Lyotard (1924—1998)

LyotardFrench post-structuralist philosopher, best known for his highly influential formulation of postmodernism in The Postmodern Condition. Despite its popularity, however, this book is in fact one of his more minor works. Lyotard’s writings cover a large range of topics in philosophy, politics, and aesthetics, and experiment with a wide variety of styles. His works can be roughly divided into three categories: early writings on phenomenology, politics, and the critique of structuralism, the intermediate libidinal philosophy, and later work on postmodernism and the “differend.” The majority of his work, however, is unified by a consistent view that reality consists of singular events which cannot be represented accurately by rational theory. For Lyotard, this fact has a deep political import, since politics claims to be based on accurate representations of reality. Lyotard’s philosophy exhibits many of the major themes common to post-structuralist and postmodernist thought. He calls into question the powers of reason, asserts the importance of nonrational forces such as sensations and emotions, rejects humanism and the traditional philosophical notion of the human being as the central subject of knowledge, champions heterogeneity and difference, and suggests that the understanding of society in terms of “progress” has been made obsolete by the scientific, technological, political and cultural changes of the late twentieth century. Lyotard deals with these common themes in a highly original way, and his work exceeds many popular conceptions of postmodernism in its depth, imagination, and rigor. His thought remains pivotal in contemporary debates surrounding philosophy, politics, social theory, cultural studies, art and aesthetics.

Table of Contents

  1. Biography
  2. Early Works
    1. Phenomenology
    2. Algeria
    3. Discourse, Figure
  3. Libidinal Philosophy
  4. Postmodernism
    1. Paganism
    2. The Postmodern Condition
    3. The Differend
  5. Reason and Representation
  6. The Subject and the Inhuman
  7. Science and Technology
  8. Politics
  9. Art and Aesthetics
  10. Late Works
    1. Malraux
    2. Augustine
  11. References and Further Reading
    1. Books by Lyotard
    2. Books about Lyotard

1. Biography

Jean-François Lyotard was born in Vincennes, France, on August 10, 1924. His father, Jean-Pierre Lyotard, was a sales representative. His mother’s maiden name was Madeleine Cavalli. He was schooled at the Paris Lycées Buffon and Louis-le-Grand, and his youthful aspirations to be a Dominican monk, a painter, an historian, or a novelist eventually gave way to a career in philosophy. He studied philosophy and literature at the Sorbonne (after twice failing the entrance exam to the Ecole Normale Supérieure), where he became friends with Gilles Deleuze. His early interest in philosophies of indifference resulted in his M.A. dissertation Indifference as an Ethical Notion. Lyotard describes his existence up until the Second World War as a ‘poetic, introspective and solitary way of thinking and living.’ The war disrupted both his way of life and his thought; he acted as a first-aid volunteer in the fight for liberation in the Paris streets in August 1944, and gave up the idea of indifference for a commitment to the investigation of reality in terms of social interactions. Lyotard became a husband and father at a young age, marrying Andrée May in 1948 and subsequently having two children, Corinne and Laurence. Lyotard passed the agrégation (the examination required in order to teach in France) and took up a position teaching philosophy at a boy’s lycée (school) in Constantine in French-occupied East Algeria in 1950. From 1952-59 he taught at a school for the sons of military personnel at La Flèche. In Constantine Lyotard read Marx and became acquainted with the Algerian political situation, which he believed was ripe for socialist revolution. In 1954 Lyotard joined the socialist revolutionary organisation Socialisme ou Barbarie (Socialism or Barbarism). Other members of the organisation included Cornelius Castoriadis, Claude Lefort, and Pierre Souyris. Lyotard had met Souyris at a union meeting late in 1950, and they had a long and close friendship, eventually troubled by political and theoretical differences.

Lyotard became an intellectual militant, and asserts that for fifteen years he was so dedicated to the cause of socialist revolution that no other aspect of life (with the sole exception of love) diverted him from this task. His writings in this period are solely concerned with ultra-left revolutionary politics, with a sharp focus on the Algerian situation (the war of independence had broken out in 1954). He contributed to and edited the Socialisme ou Barbarie journal, and wrote pamphlets to distribute to workers at protests and at factory gates. In 1964 a schism erupted in Socialisme ou Barbarie over Castoriadis’ new theoretical direction for the group. Lyotard, along with Souyris, became a member of the splinter group Pouvoir Ouvrier (Worker’s Power), but resigned in 1966. He had lost belief in the legitimacy of Marxism as a totalising theory, and returned to the study and writing of philosophy. From 1959 to 1966 Lyotard was maître-assistant at the Sorbonne, and then gained a position in the philosophy department at the University of Paris X, Nanterre. There he took part in the May 1968 political actions, organising demonstrations for the “March 22 Movement.”

Lyotard attended the radical psychoanalyst Jacques Lacan’s seminars in the mid-60s, and his reaction to Lacan’s theories resulted in Discours, figure, for which he received the degree of doctorat d’état. From 1968 to 1970 Lyotard was chargé de recherches at the Centre National de la Recherche Scientifique. In the early 1970s Lyotard was appointed to the University of Paris VIII, Vincennes, where he was a popular teacher and a prolific writer. In 1972 he was made maître de conferences, and in 1987 he became Professor Emeritus at Vincennes. The 1979 publication of The Postmodern Condition brought Lyotard worldwide fame, and in the 1980s and 90s he lectured widely outside of France. Lyotard was professor of French and Italian at the University of California, Irvine, Robert W. Woodruff Professor of French at Emory University, and a founding member and sometime president of the Collège International de Philosophie. Lyotard was a visiting professor at numerous universities, including John Hopkins, the University of California, Berkeley and San Diego, the University of Minnesota, the Université de Montréal, Canada, the Universität Siegen, West Germany, and the University of Saõ Paulo, Brazil. Lyotard married his second wife Dolorès Djidzek in 1993 and had a son, David. Lyotard died of leukaemia in Paris on April 21, 1998.

2. Early Works

a. Phenomenology

Lyotard’s first book, published in 1954, is a short introduction to and examination of phenomenology. The first part introduces phenomenology through the work of Edmund Husserl, and the second part evaluates phenomenology’s relation to the human sciences (particularly psychology, sociology, and history). In the second part the focus shifts from Husserl to the work of Maurice Merleau-Ponty. Throughout, Lyotard is concerned with phenomenology’s attempt to find a “third way” between subjectivism and objectivism, avoiding the problems of each. In particular, he is interested in the bearing this problem has on the question of whether phenomenology can think history politically, thus potentially contributing to Marxism. This theme (the relation of phenomenology to Marxism) was a prime concern for French thinkers of the fifties, and Lyotard’s book is a useful documentation of the issues at stake. Much of his exposition and discussion is positive, and Lyotard argues that phenomenology can make valuable contributions to the social sciences, where it should serve two functions: firstly, to define the object of the science eidetically (i.e. in its essence) prior to all experimentation, and secondly, to philosophically reassess the results of experimentation. Lyotard argues, for example, that sociology has need of a phenomenological definition of the essence of the social before it can proceed effectively as a science. While he sees the usefulness of phenomenology in many disciplines, however, Lyotard’s conclusions about the usefulness of phenomenology to Marxism are largely negative. He argues that phenomenology does not represent progress on Marxism, but is in fact a step backwards. For Lyotard phenomenology cannot properly formulate a materialist worldview and the objective nature of the relations of production; it ends up interpreting class struggle as taking place in consciousness. Lyotard rejects phenomenology’s attempt to find a third way between subjectivism and objectivism, and asserts Marxism’s superiority in viewing subjectivity as already contained in objectivity.

b. Algeria

In the fifteen years between his first two books of philosophy, Lyotard devoted all his writing efforts to the cause of revolutionary politics. His most substantial writings of this time were his contributions to the Socialisme ou Barbarie journal on the political situation in Algeria [many of which are collected in Political Writings]. The project of Socialisme ou Barbarie was to provide theoretical resources to contribute to socialist revolution, critiquing other existing socialist strands (particularly Stalinism and the French communist party) as a hindrance to revolution, and with a particular emphasis on the critique of bureaucracy. In the essays on Algeria, Lyotard applies this project to the French occupation, trying to determine the potential for socialist revolution arising from this situation. He pays close attention to the economic forces at work in occupied Algeria, arguing that it is in the economic interests of France to keep Algerians in a state of underdevelopment and poverty. Furthermore, Lyotard introduces a notion of ‘terror’ that he develops more fully in his later works, indicating the suppression of Algerian culture by the imposition of foreign (French) cultural forms. The conclusion Lyotard comes to is that the occupation must end if the Algerian people are to prosper, but he remains ambivalent about the possibility of revolution. He surmises that a nationalist, democratic revolution will only lead to new forms of social inequality and domination, and insists that a socialist revolution is necessary. This ambivalence was reflected in Socialisme ou Barbarie‘s debate about whether or not to support the Algerian war of independence, fearing that its democratic and nationalistic leanings would not bring about the result they desired. In “Algeria Evacuated,” written after the end of the occupation, Lyotard regretfully asks why a socialist revolution did not take place, concluding that the social and political upheavals resulted in an opportunistic struggle for power rather than a class-based action. The end result of Lyotard’s work on Algeria and the disappointment at the failure of socialist revolution to take place led him to an abandonment of revolutionary socialism and traditional Marxism on the grounds that social reality is too complex to describe accurately with any master-discourse.

c. Discourse, Figure

Lyotard’s second book of philosophy is long and difficult. It covers a wide variety of topics, including phenomenology, psychoanalysis, structuralism, poetry and art, Hegelian dialectics, semiotics, and philosophy of language. The main thrust of this work, however, is a critique of structuralism, particularly as it manifests itself in Lacan’s psychoanalysis. The book is divided into two parts: the first uses Merleau-Ponty’s phenomenology to undermine structuralism, and the second uses Freudian psychoanalysis to undermine both Lacanian psychoanalysis and certain aspects of phenomenology. Lyotard begins with an opposition between discourse, related to structuralism and written text, and figure (a visual image), related to phenomenology and seeing. He suggests that structured, abstract conceptual thought has dominated philosophy since Plato, denigrating sensual experience. The written text and the experience of reading are associated with the former, and figures, images and the experience of seeing with the latter. Part of Lyotard’s aim is to defend the importance of the figural and sensual experience such as seeing. He proceeds to deconstruct this opposition, however, and attempts to show that discourse and figure are mutually implicated. Discourse contains elements of the figural (poetry and illuminated texts are good examples), and visual space can be structured like discourse (when it is broken up into ordered elements in order for the world to be recognisable and navigable by the seeing subject). He develops an idea of the figural as a disruptive force which works to interrupt established structures in the realms of both reading and seeing. Ultimately, the point is not to privilege the figural over the discursive, but to show how these elements must negotiate with each other. The mistake of structuralism is to interpret the figural in entirely discursive terms, ignoring the different ways in which these elements operate. In the second part of Discours, figure, structure and transgression are related to Freudian libidinal forces, paving the way for the libidinal philosophy developed in Libidinal Economy.

3. Libidinal Philosophy

In the early 1970s Lyotard developed a philosophy based around Sigmund Freud’s theory of the libido. For Lyotard, libidinal energy can be used as a “theoretical fiction” to describe the transformations that take place in society. After his break with Marxism and rejection of totalising theory, he sought to develop a theory that will take account of multiple and different forces and desires at work in any political or social situation, from the writing of theory to revolutionary politics to global economics. Lyotard’s libidinal philosophy is developed in the major work Libidinal Economy and in two sets of essays, Dérive à partir de Marx et Freud [some of which is translated in Driftworks] and Des Dispositifs Pulsionnels. Libidinal Economy is an unusual and difficult work, and encompasses a complex set of theories concerning politics, economics, theory, academic style, and readings of Marx and Freud. It is written in a bewildering combination of styles (at times reading more like an avant-garde novel than a philosophical text), a method Lyotard uses in an attempt to overcome the limitations he sees in traditional academic theory.

The libidinal philosophy begins Lyotard’s general commitment to an ontology of events, which also underlies his later postmodern philosophy. Lyotard sees reality in terms of unpredictable happenings (events), rather than structured regularities. These events can be interpreted in different ways, and no single interpretation will capture events accurately. Events always exceed interpretation; there is always something “left over” that an interpretation does not account for. In the libidinal philosophy Lyotard uses the idea of libidinal energy to describe events and the way they are interpreted or exploited, and he develops a philosophy of society and theory in terms of the economy of libidinal energies. Lyotard uses the terms “libidinal intensities,” and “affects” to refer to events. These intensities and affects are, in more common terminology, feelings and desires. In the terms of Freudian psychoanalysis, they are the “primary processes” of the libido, the forces that exist in the body on a more basic level than the “secondary processes” of the conscious mind. In particular, Lyotard focuses on sexual desire. He uses these terms metaphorically, however, to describe the workings of reality and society as a whole, divorcing them from their usual attachments to human beings. Lyotard describes the wholly impersonal as well as the personal in terms of feelings and desires, and paints a picture of the world that moves and is moved in the ways that feelings move people. Lyotard admits that this description of everything in libidinal terms is a “theoretical fiction,” merely a way of speaking which gives us useful terms for theorizing about what happens in the world. Metaphysically, Lyotard is a materialist, and for him affects must be understood as concrete material entities. An affect might be a sound, a color, a smile or a caress: anything which has an ability to “move,” to produce feelings and desires. Affects are structured and interpreted in systems made up of dispositifs, libidinal dispositions or set-ups, and society is composed of multitudes of different dispositions that compete to exploit the energies of libidinal events. Lyotard develops a complex set of figures to describe how this process takes place.

Libidinal Economy begins with the figure of a body (ambivalently sexed), being cut open and spread out to form a flat, band-like surface. Lyotard is here beginning to describe a region on which libidinal intensities take place and on which they meet with the dispositifs that channel libidinal energy. This region is material like the body, but it is not yet organized, thus the figure of dismemberment. The flat band that the body has become is then given a twist and joined end to end, forming a moebius strip (a circular figure which has only one surface due to the twist it contains; a line traced along one side of the strip will end up on the other side without breaking contact with the surface). This strip is then set in motion, circulating so fast it glows red with heat. This is the libidinal band (sometimes called the libidinal skin). It represents the “primary processes” of desire and libidinal intensity in which libidinal energy circulates in an aleatory fashion, not yet investing anything. Because the libidinal band is a moebius strip, desire circulates on only one surface; there is no inside or outside. In time the band begins to slow and cool, and forms what Lyotard calls “the (disjunctive) bar.”

As the bar slows, sometimes it invests this region, sometimes that. It becomes disjunctive, distinguishing this from not-this. This stage in the transformation of the libidinal band represents the formation of rational thought, dominated by binary logic and the law of noncontradiction. Finally the bar stops and forms a stable disjunction. Lyotard describes the bar as then turning around on itself and creating an enclosed space, a theatrical volume. This is the particular transformation of the libidinal band – or the particular dispositif on the libidinal band – that gives rise to representation and theory. The theatrical space has an inside and an outside, a clear disjunction between this and not-this. Lyotard’s image of theory as theatre is based on the etymological relationship between the two terms; they are both derived from the Greek theasthai, meaning to look at, contemplate, or behold. The theorist is like a spectator who views the representation of the world (outside the theatre) on the stage (inside the theatre).

Lyotard’s description of the transformations of the libidinal band is a theoretical fiction which provides an account of how the world works through the interplay of intense, excited libidinal energies and the stable structures which exploit them and dampen their intensity. The band is the space on which libidinal intensities meet dispositifs, or libidinal set-ups. These set-ups channel energy into more or less stable systems and structures, and therefore all dispositifs, all systems and structures, can be described in terms of the slowing and cooling of the band. An example would be the way political institutions channel desires to change society away from violent, disruptive eruptions towards more moderate, less disruptive modes of action. Systems exploit libidinal intensities by channeling them into stable structures. And yet, these systems deny their own origins in intense and aleatory libidinal energy, taking themselves to be permanent and stable. Systems hide, or dissimulate, affects (libidinal intensities). Conversely, however, affects dissimulate systems. Systems and affects dissimulate each other. This means that systems contain and hide affects, and that affects contain and hide the possibility for forming systems. Dissimulation is a concept that allows us to see the elements of the libidinal economy as duplicitous. That is, they have more than one possibility. It is always possible for intensities to channel into a stable system, or to disrupt a system by destabilising it through intense investment.

Lyotard develops a critical but nuanced approach towards theory, politics and economics within the terms of the libidinal philosophy. His prime concern is that the structures that exploit libidinal intensities tend to become hegemonic. That is, they tend to claim sole right to the exploitation or interpretation of intensities. At the same time, they often deny libidinal intensities themselves, taking themselves to be primary and stable structures. Lyotard sees these tendencies as limiting and nihilistic, in the sense that they deny the full possibilities of the expression of intensities. In theory, politics, and cultural conventions, structured dispositions take themselves to be the actual structures of reality or “correct” interpretations, thus limiting the possibilities of change. For Lyotard change is life affirming, whereas the stable structures that inhibit change are nihilistic and life denying. However, Lyotard does not simply assert libidinal intensity as an affirmative “other” to nihilism. For Lyotard, there is no affirmative region, no pure outside to nihilism. Lyotard does not propose that we champion affects, singularities, intensities and libidinal energy over systems, structures, theory, concepts and representation. This is because the only way libidinal energies can exist is within structures. Lyotard does not advocate a simple liberation of desire and does not attempt to set up a place beyond representation which would be immune to the effects of nihilism. Lyotard presents us, rather, with a metaphysical system in which intensities and structures are both essential elements of the libidinal economy.

Lyotard’s response to the nihilism of structure takes place through the concept of dissimulation, which suggests that libidinal energy must work within structures. All structures contain libidinal energy as an under-exploited potentiality, waiting to be released and to flow into new structures. This libidinal energy is the event, which always contains more possibilities for interpretation and exploitation than any single structure can give it. Lyotard’s libidinal philosophy prescribes a “freeing up” of structures, so that events may be allowed their maximum potentiality of expression in competing interpretations and dispositions. Releasing the energy in structures in turn creates new events, with their own energetic potentialities. Because the event is unpredictable, we cannot actively control the way it will be released and form new structures. However, we can “act passively” so as to encourage the maximum release of intensity within structures. Lyotard’s own style of writing in Libidinal Economy is one attempt to do this: by multiplying genres of discourse, there is no overall dominant structure in the text and it is open to several competing modes of reading, interpretation and application. Ultimately, libidinal philosophy suggests a method of subversion from within existing structures through experimentation with the forms of those structures.

4. Postmodernism

Lyotard abandoned his libidinal philosophy in the later years of the seventies, beginning a philosophy of paganism that developed, by the eighties, into his unique version of postmodernism. The turn from the libidinal to the pagan and the postmodern continued a concern with events and the limits of representation, but concerned two key changes: 1. A change in the mode of analysis from libidinal forces to language, and 2. a new focus on justice. Whereas in the libidinal philosophy the focus was to see that a single interpretation of an event did not become hegemonic, in Lyotard’s later philosophy he is primarily concerned with the problems of justice that arise between competing interpretations of events. Lyotard’s philosophy of language and justice is most fully developed through the concept of the differend, in the book of the same name.

a. Paganism

Lyotard develops the notion of paganism in “Lessons in Paganism” (reprinted in The Lyotard Reader), Just Gaming and various other short works of the late seventies. The term “paganism” refers to a way of thinking that takes into account and strives to do justice to incommensurable differences. Just as pagan religions believe in a number of different gods rather than just one God, Lyotard’s pagan philosophy represents a concern for pluralism and multiplicity (terms he uses synonymously to oppose the idea of universality). This concern for difference, multiplicity and pluralism is related to Lyotard’s basic commitment to an ontology of singular events: if reality is constituted by unique happenings, then there will be no universal law of judgement which will be able to take account of each and every event in a way which does them all justice. Paganism suggests that there are irreducible differences in the order of things, and that we must take things on their own terms without attempting to reduce them to universals. In his writings on paganism, Lyotard analyses politics in the form of a justice of rhetoric. In “Lessons in Paganism” he claims that all discourse is narrative; all theory, all politics, all law, are merely a collection of stories. In Just Gaming, he analyses situations where questions of justice and judgement arise in terms of language games. Lyotard rejects the claims of any discourse to be grounded in truth. He rejects the idea of a master-discourse (later called a metanarrative) that is thought to provide the basis for judgement in all situations. (Marxism, Hegelian philosophy, and Kant’s ideal of unity or totality as regulating justice are examples of master-discourses that have dominated the philosophical tradition.) Instead, Lyotard suggests that paganism is the most appropriate response to the desire for justice. Paganism is godless politics; it is the abandonment of universal judgement for specific, plural judgements. This means giving up the idea of a single, law-like theoretical schema which could be applied to any situation in which judgment is required. Lyotard asserts that a justice of multiplicities requires a multiplicity of justices. Paganism is the attempt to judge without pre-existing criteria, in matters of truth, beauty, politics and ethics.

Paganism rejects any universal criteria for judgement, yet Lyotard claims that we must judge, that justice demands this of us. So how do we judge, without criteria? Lyotard invokes both Kant and Nietzsche in his answer. In Kantian terms, we judge through the constitutive imagination. For Kant, this ability to judge, and to invent criteria, is mysterious, and there is little we can say about it. In Nietzschean terms, Lyotard says that judgement is an expression of the will to power. It is perhaps misleading of Lyotard to say that paganism is judgement without criteria; for it is judgement only without universal criteria. What he is denying is the possibility of a discourse that will give us adequate criteria for judgement in each and every case. Instead, what we must do (as pagans) is meet every circumstance that requires judgement anew, and create criteria specific to that case by an affirmative act of the imaginative will. Thus we will get a plurality of criteria, a plurality of judgements, a plurality of justices. In this sense, paganism can be thought of as a plurality of rules of judgement (gods), as opposed to belief in just one rule or set of rules (God). Somewhat paradoxically, perhaps (as Lyotard himself admits), the justice of this pluralism is assured by a prescriptive of universal value – the prescriptive that the rules of individual language games be respected; that they are not subsumed under a single criterion of judgement.

b. The Postmodern Condition

Lyotard soon abandoned the term ‘paganism’ in favour of ‘postmodernism.’ He presents his initial and highly influential formulation of postmodernism in The Postmodern Condition: A Report on Knowledge, commissioned by the government of Quebec and published in 1979. Lyotard famously defines the postmodern as ‘incredulity towards metanarratives,’ where metanarratives are understood as totalising stories about history and the goals of the human race that ground and legitimise knowledges and cultural practises. The two metanarratives that Lyotard sees as having been most important in the past are (1) history as progressing towards social enlightenment and emancipation, and (2) knowledge as progressing towards totalisation. Modernity is defined as the age of metanarrative legitimation, and postmodernity as the age in which metanarratives have become bankrupt. Through his theory of the end of metanarratives, Lyotard develops his own version of what tends to be a consensus among theorists of the postmodern – postmodernity as an age of fragmentation and pluralism.

The Postmodern Condition is a study of the status of knowledge in computerized societies. It is Lyotard’s view that certain technical and technological advancements have taken place since the Second World War (his historical pin-pointing of the beginning of postmodernity) which have had and are still having a radical effect on the status of knowledge in the world’s most advanced countries. As a defining element with which to characterise these technical and technological advancements, Lyotard chooses computerization. Lyotard identifies the problem with which he is dealing – the variable in the status of knowledge – as one of legitimation. For Lyotard, this is a question of both knowledge and power. Knowledge and power are simply two sides of the same question: who decides what knowledge is, and who knows what needs to be decided? According to Lyotard, in the computer age the question of knowledge is now more than ever a question of government. With vast amounts of knowledge stored digitally in databases, who decides what knowledge is worth storing (what is legitimate knowledge) and who has access to these databases? Lyotard points a suspicious finger at multinational corporations. Using IBM as an example, he suggests a hypothetical in which the corporation owns a certain belt in the Earth’s orbital field in which circulate satellites for communication and/or for storing data banks. Lyotard then asks, ‘who will have access to them? Who will determine which channels or data are forbidden? The State? Or will the State simply be one user among others?’

The method Lyotard chooses to use in his investigations is that of language games. Lyotard writes that the developments in postmodernity he is dealing with have been largely concerned with language: ‘phonology and theories of linguistics, problems of communication and cybernetics, modern theories of algebra and informatics, computers and their languages, problems of translation and the search for areas of compatibility among computer languages, problems of information storage and data banks, telematics and the perfection of intelligent terminals, paradoxology.’ Lyotard’s use of language games is derived from Ludwig Wittgenstein. The theory of language games means that each of the various categories of utterance can be defined in terms of rules specifying their properties and the uses to which they can be put. Lyotard makes three particularly important observations about language games. Firstly, the rules of language games do not carry within themselves their own legitimation, but are subject to a “contract” between “players” (interlocutors). Secondly, if there are no rules there is no game and even a small change in the rules changes the game. Thirdly, every utterance should be thought of as a “move” in a game. Different types of utterances, as identified by Wittgenstein, pertain to different types of language games. Lyotard gives us a few examples of types of utterances. The “denotative” is an utterance which attempts to correctly identify the object or referent to which it refers (such as “Snow is white”). The “performative” is an utterance which is itself a performance of an act to which it refers (such as “I promise”). The “prescriptive” is an utterance which instructs, recommends, requests, or commands (such as “Give me money”). For both Wittgenstein and Lyotard, language games are incommensurable, and moves in one language game cannot be translated into moves in another language game. For example, we cannot judge what ought to be the case (a prescriptive) from what is the case (a denotative.)

Lyotard’s choice of language games is primarily political in motivation, and relates to the close links between knowledge and power. In examining the status of knowledge in postmodernity, Lyotard is examining the political as well as epistemological aspects of knowledge (legitimation), and he sees the basic social bond – the minimum relation required for society to exist – as moves within language games. Lyotard needs a methodological representation to apply to society in order to examine the status of knowledge in postmodern societies. He presents us with two alternative views of society that have been popular in this century: society as a unitary whole (“traditional” theory) or society as a binary division (“critical” theory). Lyotard rejects both of these alternatives on the grounds that the choice seems difficult or arbitrary, and also rejects a third alternative – that we might distinguish two kinds of equally legitimate knowledge, one based on the view of society as unitary and the other on the view of society as binary. This division of knowledge is caught within a type of oppositional thinking that Lyotard believes is out of step with postmodern modes of knowledge.

Instead of the recently popular or “modern” models of society, Lyotard argues that even as the status of knowledge has changed in postmodernity, so has the nature of the social bond, particularly as it is evident in society’s institutions of knowledge. Lyotard presents a postmodern methodological representation of society as composed of multifarious and fragmented language games, but games which strictly (but not rigidly – the rules of a game can change) control the moves which can be made within them by reference to narratives of legitimation which are deemed appropriate by their respective institutions. Thus one follows orders in the army, prays in church, questions in philosophy, etc., etc. In his analysis of the state of knowledge in postmodernity, Lyotard firstly distinguishes between two types of knowledge – “narrative” knowledge and “scientific” knowledge. Narrative knowledge is the kind of knowledge prevalent in “primitive” or “traditional’ societies, and is based on storytelling, sometimes in the form of ritual, music and dance. Narrative knowledge has no recourse to legitimation – its legitimation is immediate within the narrative itself, in the “timelessness” of the narrative as an enduring tradition – it is told by people who once heard it to listeners who will one day tell it themselves. There is no question of questioning it. Indeed, Lyotard suggests that there is an incommensurability between the question of legitimation itself and the authority of narrative knowledge.

In scientific knowledge, however, the question of legitimation always arises. Lyotard says that one of the most striking features of scientific knowledge is that it includes only denotative statements, to the exclusion of all other kinds (narrative knowledge includes other kinds of statements, such as prescriptives). According to the “narrative” of science, however, only knowledge which is legitimated is legitimate – i.e. is knowledge at all. Scientific knowledge is legitimated by certain scientific criteria – the repeatability of experiments, etc. If the entire project of science needs a metalegitimation, however (and the criteria for scientific knowledge would itself seem to demand that it does) then science has no recourse but to narrative knowledge (which according to scientific criteria is no knowledge at all). This narrative has usually taken the form of a heroic epic of some kind, with the scientist as a “hero of knowledge” who discovers scientific truths. The distinction between narrative and scientific knowledge is a crucial point in Lyotard’s theory of postmodernism, and one of the defining features of postmodernity, on his account, is the dominance of scientific knowledge over narrative knowledge. The pragmatics of scientific knowledge do not allow the recognition of narrative knowledge as legitimate, since it is not restricted to denotative statements). Lyotard sees a danger in this dominance, since it follows from his view that reality cannot be captured within one genre of discourse or representation of events that science will miss aspects of events which narrative knowledge will capture. In other words, Lyotard does not believe that science has any justification in claiming to be a more legitimate form of knowledge than narrative. Part of his work in The Postmodern Condition can be read as a defence of narrative knowledge from the increasing dominance of scientific knowledge. Furthermore, Lyotard sees a danger to the future of academic research which stems from the way scientific knowledge has come to be legitimated in postmodernity (as opposed to the way it was legitimated in modernity).

In modernity the narrative of science was legitimated by one of a number of metanarratives, the two principal ones being respectively Hegelian and Marxist in nature. The Hegelian metanarrative speculates on the eventual totality and unity of all knowledge; scientific advancement is legitimated by the story that it will one day lead us to that goal. The Marxist metanarrative gives science a role in the emancipation of humanity. According to Lyotard, postmodernity is characterised by the end of metanarratives. So what legitimates science now? Lyotard’s answer is – performativity. This is what Lyotard calls the “technological criterion” – the most efficient input/output ratio. The technical and technological changes over the last few decades – as well as the development of capitalism – have caused the production of knowledge to become increasingly influenced by a technological model. It was during the industrial revolution, Lyotard suggests, that knowledge entered into the economic equation and became a force for production, but it is in postmodernity that knowledge is becoming the central force for production. Lyotard believes that knowledge is becoming so important an economic factor, in fact, that he suggests that one day wars will be waged over the control of information.

Lyotard calls the change that has taken place in the status of knowledge due to the rise of the performativity criterion the mercantilization of knowledge. In postmodernity, knowledge has become primarily a saleable commodity. Knowledge is produced in order to be sold, and is consumed in order to fuel a new production. According to Lyotard knowledge in postmodernity has largely lost its truth-value, or rather, the production of knowledge is no longer an aspiration to produce truth. Today students no longer ask if something is true, but what use it is to them. Lyotard believes that computerization and the legitimation of knowledge by the performativity criterion is doing away with the idea that the absorption of knowledge is inseparable from the training of minds. In the near future, he predicts, education will no longer be given “en bloc” to people in their youth as a preparation for life. Rather, it will be an ongoing process of learning updated technical information that will be essential for their functioning in their respective professions.

Lyotard does not believe that the innovations he predicts in postmodern education will necessarily have a detrimental effect on erudition. He does, however, see a problem with the legitimation of knowledge by performativity. This problem lies in the area of research. Legitimation by performativity lends itself to what Lyotard calls “terror” – the exclusion of players from language games or the exclusion of certain games entirely. Most true “discoveries,” Lyotard argues, are discoveries by virtue of the fact that they are so radical that they change the rules of the game – they cannot even be articulated within the rules of the “dominant” game (which is dominant because it draws the consensus of opinions). Many discoveries are not found to have a use until quite some time after they are made; therefore they seem to be of little value by the performativity criterion. Furthermore, for economic reasons, legitimation by performativity tends to follow the consensus opinion – that which is perceived by the majority of experts to have the most efficient input/output ratio is considered most likely in fact to be most performatively efficient, and hence the safest investment.

Lyotard argues that legitimation by performativity is against the interests of research. He does not claim that research should be aimed at production of “the truth”; he does not try to re-invoke the metanarratives of modernity to legitimate research. Rather, he sees the role of research as the production of ideas. Legitimation of knowledge by performativity terrorises the production of ideas. What, then, is the alternative? Lyotard proposes that a better form of legitimation would be legitimation by paralogy. The etymology of this word resides in the Greek words para – beside, past, beyond – and logos in its sense as “reason.” Thus paralogy is the movement beyond or against reason. Lyotard sees reason not as a universal and immutable human faculty or principle but as a specific and variable human production; “paralogy” for him means the movement against an established way of reasoning. In relation to research, this means the production of new ideas by going against or outside of established norms, of making new moves in language games, changing the rules of language games and inventing new games. Lyotard argues that this is in fact what takes place in scientific research, despite the imposition of the performativity criterion of legitimation. This is particularly evident in what Lyotard calls “postmodern science” – the search for instabilities [see Science and Technology]. For Lyotard, knowledge is not only the known but also the “revelation” or “articulation” of the unknown. Thus he advocates the legitimation of knowledge by paralogy as a form of legitimation that would satisfy both the desire for justice and the desire for the unknown.

c. The Differend

Lyotard develops the philosophy of language that underlies his work on paganism and postmodernism most fully in The Differend: Phrases in Dispute. This book is, by Lyotard’s own estimation, both his most philosophical and most important. Here he analyses how injustices take place in the context of language. A differend is a case of conflict between parties that cannot be equitably resolved for lack of a rule of judgement applicable to both. In the case of a differend, the parties cannot agree on a rule or criterion by which their dispute might be decided. A differend is opposed to a litigation – a dispute which can be equitably resolved because the parties involved can agree on a rule of judgement. Lyotard distinguishes the victim from the plaintiff. The later is the wronged party in a litigation; the former, the wronged party in a differend. In a litigation, the plaintiff’s wrong can be presented. In a differend, the victim’s wrong cannot be presented. A victim, for Lyotard, is not just someone who has been wronged, but someone who has also lost the power to present this wrong. This disempowerment can occur in several ways: it may quite literally be a silencing; the victim may be threatened into silence or in some other way disallowed to speak. Alternatively, the victim may be able to speak, but that speech is unable to present the wrong done in the discourse of the rule of judgement. The victim may not be believed, may be thought to be mad, or not be understood. The discourse of the rule of judgement may be such that the victim’s wrong cannot be translated into its terms; the wrong may not be presentable as a wrong.

Lyotard presents various examples of the differend, the most important of which is Auschwitz. He uses the example of the revisionist historian Faurisson’s demands for proof of the Holocaust to show how the differend operates as a sort of double bind or “catch-22.” Faurisson will only accept proof of the existence of gas chambers from eyewitnesses who were themselves victims of the gas chambers. But of course, any such eyewitnesses are dead and are not able to testify. Faurisson concludes from this that there were no gas chambers. The situation is this: either there were no gas chambers, in which case there would be no eyewitnesses to produce evidence, or there were gas chambers, in which case there would still be no eyewitnesses to produce evidence (since they would be dead). Since Faurisson will accept no evidence for the existence of gas chambers except the testimony of actual victims, he will conclude from both possibilities (i.e. gas chambers existed; gas chambers didn’t exist) that gas chambers didn’t exist. The situation is a double bind because there are two alternatives – either there were gas chambers or there were not – which lead to the same conclusion: there were no gas chambers (and no Final Solution). The case is a differend because the harm done to the victims cannot be presented in the standard of judgment upheld by Faurisson. Lyotard presents the logic of the double bind involved in the differend in general as follows: either p or not p; if not-p, then Fp; if p, then not-p, then Fp. The two possibilities (p or not-p) both lead to the same conclusion (Fp). Lyotard gives a further example of the logic of the double bind: it is like saying both either it is white, or it is not white; and if it is white it is not white.

Another example of the differend which commentators on Lyotard often invoke is that of indigenous peoples’ claims to land rights in colonised countries. This example shows the relevance of Lyotard’s work for practical problems of justice in the contemporary world. Let us take Australian Aborigines as an example. Many tribal groups claim that land which they traditionally inhabited is now owned and controlled by the descendants of European colonists. They claim that the land was taken from them wrongfully, and that it should be given back to them. There is a differend in this case because Aboriginal land rights are established by tribal law, and evidence for such rights may not be presentable in the law of the Australian government. The court of appeal in which claims to land rights are heard functions entirely according to government law, and tribal law is not considered a valid system of judgment. In the case of a dispute over a certain area of land by farmers who are descendants of colonists on the one hand, and a tribe of Aborigines on the other hand, the court of appeal will be the one which involves the law that the farmers recognise (government law), while the law that the Aborigines recognise (tribal law) will not be considered valid. It may be the case that the only evidence for the claim to land rights that the Aborigines have will not be admissible as evidence in the court of government law (though it is perfectly acceptable in tribal law). Hence, we have a case of a wrong which cannot be presented as a wrong; a differend.

Lyotard develops the theory of the differend through a complex analysis of language, drawing heavily on analytic philosophers as well as ancients and early moderns. Lyotard’s ontology of events is developed here in terms of the phrase as event, and the limits of representation are seen in the indeterminacy involved in the linking of phrases. Phrases, on Lyotard’s account, may be extralinguistic, and can include signs, gestures, or anything that happens. Every event is to be understood as a phrase in the philosophy of the differend. This characterisation of events as phrases may be understood as a theoretical fiction or “a way of speaking” which allows Lyotard to develop a theory of events through the analysis of language, just as the libidinal philosophy does using libidinal energy. Lyotard calls the way phrases are linked together in series, one after the other, the concatenation of phrases. The law of concatenation states that these linkages must be made – that is, a phrase must be followed by another phrase – but that how to link is never determinate. There are many possible ways of linking on to a phrase, and no way is the right way.

In order to characterise phrases as events which are beyond full understanding and accurate representation, Lyotard undermines the common view that the meanings of phrases can be determined by what they refer to (the referent). That is, for Lyotard the meaning of a phrase as event (something happens) cannot be fixed by appealing to reality (what actually happened). He develops this view of language by appealing to Saul Kripke’s concept of the proper name as a “rigid designator” and by defining “reality” in an original way. Proper names pick our referents in a way that is rigid and consistent but, according to Lyotard, empty of sense. For example, the name Fred may consistently pick out a particular person, but there are many different senses or meanings which may be attached to this person. Only phrases carry sense (i.e. tell us something meaningful about Fred). The proper name may fix reference, but does nothing to fix sense. The name acts as a point which links the referent and the many senses which may be attached to it. Lyotard then defines reality as this complex of possible senses attached to a referent through a name. The correct sense of a phrase cannot be determined by a reference to reality, since the referent itself does not fix sense and reality itself is defined as the complex of competing senses attached to a referent. The phrase event remains indeterminate.

Lyotard uses the concepts of a phrase universe and of the difference between presentation and situation in order to show how phases can carry meanings and yet be indeterminate. Every phrase presents a universe, composed of the following four elements or, as Lyotard calls them, instances:

  1. The sense (the possible meanings of the phrase)
  2. The referent (the thing to which the phrase refers)
  3. The addressor (that from which the phrase comes)
  4. The addressee (that to which the phrase is sent)

In the initial presentation of the phrase, the instances of the universe are equivocal. That is, there are many possible ways in which the instances may be situated in relation to each other. Who or what uttered the phrase, and to whom? To what does the phrase refer? What sense of the phrase is meant? This equivocation means that the meaning of the phrase is not fixed in the initial presentation, and only becomes fixed through what Lyotard calls situation. Situation takes place when the instances of the phrase universe are fixed through the concatenation of phrases. That is, when the phrase is followed by another phrase. When phrases are concatenated, they follow rules for linking called phrase regimens. Phrase regimens fix the instances of the phrase universe within a concatenation; these regimens are syntactic types of phrases such as the cognitive, the descriptive, the prescriptive, the interrogative, the evaluative, and so on. Any situation of a phrase within a concatenation will only be one possible situation of the initial presentation of the phrase, however. It is always possible to situate the phrase in a different way by concatenating with a different phrase regimen. In other words, the presentation of the phrase event is not able to be accurately represented by any particular situation. This also means that there is no “correct” way of concatenating a phrase, no correct phrase regimen to be employed in following one phrase with another.

Lyotard insists that phrase regimens are heterogenous and incommensurable. That is, they are of radically different types and cannot be meaningfully compared through an initial presentation of the phrase event of which they are situations. However, different phrase regimens can be brought together through genres. Genres supply rules for the linking of phrases, but rather than being syntactic rules as phrase regimens are, genres direct how to concatenate through ends, goals, or stakes. What is at stake in the genre of comedy, for example, is to be humorous, to make people laugh. This goal directs how phrases are linked on from one to another. As an example, Lyotard suggests that the phrase “To arms!” might be followed by the phrase “You have just formulated a prescription” if the goal is to make people laugh, but not if the goal implied by the genre is to inspire direct action (such as may be the case if it is uttered by a military commander on a battlefield). Genres of discourse can bring heterogenous phrase regimens together in a concatenation, but genres themselves are heterogenous and incommensurable. This means that there is no “correct” genre in which to situate the initial phrase which is presented, and no genre has more validity than others. The differend arises on this level of genres when the phrase event gives rise to different genres, but one genre claims validity over the others. That is, one genre claims the exclusive right to impose rules of concatenation from the initial phrase.

How do we know when a differend has occurred? Lyotard says that it is signalled by the difficulty of linking on from one phrase to another. A differend occurs when a discourse does not allow the linkages which would enable the presentation of a wrong. Lyotard insists that phrases must, of necessity, follow other phrases – even silence is a kind of phrase, with its own generic effects. A silent phrase in the context of a dispute may be covering four possible states of affairs, corresponding to each of the instances in the phrase universe:

  1. The sense: The meaning of the referent cannot be signified.
  2. The referent: The referent (the wrong, etc.) did not take place.
  3. The addressor: The addressor does not believe that the referent falls within the competence of him/her self to present.
  4. The addressee: The addressor does not believe that the referent (the wrong, etc.) falls within the competence (to hear, to understand, to judge, etc.) of the addressee.

In order for the referent to be expressed, these four silent negations must be withdrawn. The referent must have reality, must be presentable in the rules of the discourse, and the addressor must have confidence in the competence of both him/her self and the addressee. Through the idea of the differend, Lyotard has drawn particular attention to the problems of the presentability of the referent when the parties in dispute cannot agree on a common discourse, or rule of judgement (i.e. cannot agree on the genre(s) of phrase linkage). Justice demands, however, that wrongs be presented – we must at least try to “present the unpresentable.” How is this possible? Lyotard does not believe that there is any easy answer. But for the sake of justice, we must try. We must identify differends as best we can – sometimes, no more than vague feelings attest to the existence of a differend. It may be the feeling of “not being able to find the words.” Lyotard associates the identification of a differend with the feeling of the sublime, the mixture of pleasure and pain which accompanies the attempt to present the unpresentable. He privileges art as the realm which is best able to provide testimony to differends through its sublime effects [see Reason and Representation; Politics; Art and Aesthetics].

5. Reason and Representation

Lyotard’s philosophy frequently calls into question the powers of reason, rejecting many of the claims that have been made about it in the history of philosophy. The limitations of reason are particularly evident for Lyotard in regard to the problems of representation. Since Descartes, the dominant model of rational thought in Western philosophy has been that of the human subject representing the objective world to its self. It has frequently been claimed that in this way complete and certain knowledge is possible, at least in theory. Lyotard calls such claims into doubt through his thesis that events exceed representation. Furthermore, Lyotard draws attention to the fact that reason tends to operate with structured systems of concepts which exclude the sensual and emotional, but that these exclusions can never be entirely maintained. On the one hand, any representation will miss something of the event, and on the other, non-rational forces such as feelings and desires will arise to disrupt rational schemas of thought.

Lyotard’s analysis of the limits of reason and representation is played out in Discours, figure through the terms of the discursive and the figural. The discursive is the term used for reason and representation here; it is the rational system of representation by concepts that forms a system of oppositions. The figural is what exceeds rational representation; it appeals to sensual experience, emotions and desires. Lyotard uses the metaphors of flatness and depth to refer to discourse and figure, respectively. The opposition between discourse and figure is deconstructed, however, since to maintain it as an opposition would be to remain within the logic of discourse (and to retain discourse as primary). Lyotard introduces a distinction between opposition and difference to account for the differing ways in which the discursive and the figural function. Difference corresponds to figure, and the distinction between discourse and figure itself is said to be one of difference rather than opposition. In opposition, two terms are rigidly opposed and quite distinct; in difference, the two terms are mutually implicated, yet ultimately irreconcilable. Difference is a disruptive force at the limits of discourse, indicating that no rational system of representation can ever be closed or complete, but is always opened up to forces (sensual, emotional, figural) that it cannot enclose within itself.

In Discours, figure, Lyotard takes structuralism (still a dominant intellectual trend in France in the early seventies when the book was written) as an example of the excesses of reason and representation. Structuralism seeks to explain everything in terms of underlying, conditioning structures that take the form of rigid systems of oppositions. His aim is to show that structuralism ignores the figural elements at work both outside and within representational structures. Lyotard shows that discourse and figure are mutually implicated (thus deconstructing the opposition) by examining the relationship of Ferdinand de Saussure’s linguistics and Merleau-Ponty’s phenomenology. For Saussure, language is a “flat” system of opposing terms that gain meaning from each other, rather than from referents outside the system. Merleau-Ponty’s phenomenology suggests that we experience the world on a pre-cognitive level as ambiguous and somewhat chaotic sense data which must be synthesized by the perceiving subject in order to structure the world in a meaningful way. Saussure’s linguistics suggests that our understanding of the world is given as a structure to begin with, while Merleau-Ponty’s phenomenology suggests that we first encounter an unstructured world, which we must work to structure. Drawing on Merleau-Ponty’s phenomenological analysis of the depth of the visual field, Lyotard posits an interruption of the supposedly flat system of language by this depth. This takes place through the deictic terms in language (such as here, now, I, you, this) which gain meaning by referring to temporal and spatial specificities in the world of the language-user. The discursive structure of language, therefore, needs reference at some points to sensual experience. The opposition is further deconstructed by Lyotard’s insistence that our experience of space may also be structured in a discursive fashion. Space can be broken into ordered elements related to each other in a structured and organised way, such as by mapping it with a three dimensional grid. A rigid theory of how the body interacts with space, as Merleau-Ponty may arguably be accused of developing, also exhibits structuralist tendencies. This leads Lyotard to a criticism of phenomenology as well, on the grounds that its descriptions of the body in the world are also too structural and do not account for the disruptive force of the figural. Lyotard sees Lacan’s application of Saussurean linguistics to psychoanalysis as particularly worrisome. He attacks Lacan’s famous dictum that ‘the unconscious is structured like a language’ on the grounds that it is an over-rationalisation that posits representational structures to the exclusion of the figural. Returning to Freud, Lyotard develops a theory of libidinal forces as figural, as disruptive of reason and representation.

Reason and representation are further “critiqued” in the libidinal philosophy of Libidinal Economy and the related essays, although here the very idea of critique itself is called into question, since insofar as it remains theory, it remains within the oppositional logic of representational rationality. Rather than opposing the libidinal to the rational, then, Lyotard develops his theory of dissimulation, the mutual enfoldment of the libidinal and the rational which is similar to the deconstructive logic of difference worked out in Discours, figure. Lyotard’s main criticism of representation in the libidinal philosophy is that it is nihilistic. He draws an analogy between representational structures and Friedrich Nietzsche’s characterisation of religion and transcendental philosophy as forms of nihilism. For Nietzsche religion is nihilistic because it places the highest values (as the ground for all values) in a transcendent realm which cannot be accessed, thereby cutting us off from the highest values and devaluing the realm of our actual experience. According to Lyotard, representational theory follows this model by placing the reality that representation refers to in a transcendent realm. Lyotard expresses this nihilism in terms of what he calls “the Great zero.” This zero is the divide between representation and what it represents. Representation is nihilistic because it can never close the divide between representation and reality, effectively cutting off representational thought from access to reality. What is represented is constantly deferred. For Lyotard semiotics is a prime example of representational nihilism, because the definition of the sign is that it replaces something (negating that which it replaces).

In the libidinal philosophy Lyotard does not reject theory and representation itself as necessarily nihilistic; rather, it is representational theory’s own understanding of itself – how it represents itself – that is the focus of Lyotard’s attack. Instead of opposing theory with alternative practises which are more libidinal, Lyotard asserts that theory itself is a libidinal practice which denies that it is libidinal. The nihilistic aspect of representational theory is this denial of the libidinal. Theory attempts to be detached and “cold,” and takes itself to be a stable and consistent structure which represents stable structures in the world. Lyotard’s response to the nihilism of representational theory is not to propose an “other” to it (which he believes is impossible), but to inscribe theory itself into the libidinal economy. It is the concept of dissimulation which makes this possible. Systems dissimulate affects. Representational theory is itself a libidinal dispositif, and Lyotard accentuates the libidinal aspects of theory in order to combat its nihilistic tendencies. Against the nihilism of the semiotic sign Lyotard proposes a reinterpretation of the sign: the tensor. The tensor is a duplicitous sign. One of its sides (or potentialities) is the semiotic sign; this side is the potential to be inscribed in an existing structure of meaning. The other side of the tensor contains residual potentialities for other meanings. This side of the tensor disrupts and escapes the system, flowing into new systems and structures. The tensor expresses the theory of dissimulation at work in the sign. We might think of the tensor as the semiotic sign dissimulating affects which might disrupt its meaning and flow into new systems.

The critique of reason and representation shift in Lyotard’s postmodern philosophy from a focus on the figurative and libidinal forces which disrupt systems to an analysis of incommensurability in language and the limits of the rational faculty. Lyotard uses Wittgenstein’s idea of language games to show that reason and representation cannot be totalizing. The end of metanarratives means that no single overarching theory can pretend to account for everything. Rather, the postmodern condition is composed of fragmented language games attached to incommensurable forms of life. For Lyotard language is composed of a multiplicity of phrase regimes which cannot be translated into each other. Some are descriptive, some prescriptive, etc. These phrase regimes have no outside criteria for comparison. Between them lies the differend, an absolute difference which cannot be reconciled. In Lyotard’s postmodern philosophy, then, reason and representation are set limits by the incommensurability of language games; it is not possible for reason to understand everything through a representational system. In the postmodern philosophy events are analyzed as phrases, and again Lyotard asserts that events exceed representation in that no representational system can account for all phrases.

Furthermore, Lyotard’s postmodernism draws attention to the limits of reason through its focus on the sublime. The differend is experienced as a feeling of not being able to find the words to express something; it signals the limits of one language game or phrase regime and the attempt to move on to another one. Lyotard analyses this experience in terms of Kant’s idea of the sublime, which is itself an experience of the limits of reason. In Kant’s philosophy, the sublime is the mixed feeling of pleasure and pain that we feel in the face of something of great magnitude and grandeur. We can have an idea of such things, but we cannot match up that idea with a direct sensory intuition since sublime objects surpass our sensory abilities. An example of a sublime object for Kant would be a mountain; we can have an idea of a mountain, but not a sensory intuition of it as a whole. We feel pain at the frustration of our faculties to fully grasp the sublime object, but a pleasure as well in the attempt to do so. Lyotard extends the notion of the sublime from that which is absolutely great to all things which confound our abilities to synthesize them into knowledge. Thus the sublime is situated at the differend between language games and phrase regimes; we feel a mixture of pleasure and pain in the frustration of not knowing how to follow on from a phrase but feeling that there is something important that must be put into words. In Lyotard’s postmodern philosophy the sublime is the feeling that indicates the limits of reason and representation.

6. The Subject and the Inhuman

Like many other prominent French thinkers of his generation (such as Michel Foucault, Jacques Derrida and Gilles Deleuze), Lyotard develops critiques of the subject and of humanism. Lyotard’s misgivings about the subject as a central epistemological category can be understood in terms of his concern for difference, multiplicity, and the limits of organisational systems. For Lyotard the subject as traditionally understood in philosophy acts as a central point for the organisation of knowledge, eliminating difference and disorderly elements. Lyotard seeks to dethrone the subject from this organisational role, which in effect means decentring it as a philosophical category. He sees the subject not as primary, foundational, and central, but as one element among others which should be examined by thought. Furthermore, he does not see the subject as a transcendent and immutable entity, but as produced by wider social and political forces. In the libidinal philosophy, the subject is construed as one organisational structure or dispositif which channels and exploits libidinal energies. Like other structures which threaten to be hegemonic, Lyotard proposes its disruption through the release of the libidinal forces it contains which are not consistent with it. That is, the opening of the subject to forces which are deemed irrational, such as feelings and desires. Furthermore, Lyotard’s insistence that the freeing of dissimulated libidinal forces can only be passively done and not actively controlled is motivated by his identification of wilful acts with the organisational subject.

In Lyotard’s postmodern philosophy, the fragmentation of language games also means the social subject fragments and seems to dissolve. The subject cannot be seen as a master of language games, a unifying power, but is rather a node at which different incommensurable language games intersect. Lyotard furthermore asserts that avant-garde art works of the twentieth century do not reinforce the subject, but call it into question through the unsettling effect of the sublime. Humanism is also called into question in Lyotard’s later philosophy through the term “Inhuman.” Lyotard objects to humanism on the grounds that it depends upon a definition of the human which is exclusionary of difference. He asks why, if humanism is correct that there is a human nature, we are not born human but rather have to go through a terroristic education in order to become acceptably human. The term “Inhuman” has two meanings for Lyotard. Firstly, it refers to the dehumanising effects of science and technology in society. Secondly, it refers to those potentially positive forces that the idea of the human tries to repress or exclude, but which inevitably return with disruptive effects. Lyotard tries to show the limit of the humanistic ideal by imagining a science-fiction-like scenario in which, in 4.5 billion years time when our sun explodes, the human race will have developed the ability to survive without the Earth. In one sense this survival is the humanist dream (since survival is essential for the central importance of the human race in the universe), but in another sense it might constitute the end of the human, since the changes required to survive in space would be so radical as to erase anything we currently recognise as human. On the one hand Lyotard criticises the dehumanising effects of the progress of science and technology that are themselves bound up with the idea of human progress, and on the other he affirms the dehumanising forces that open up our thinking to more than a simple definition of the human.

7. Science and Technology

Lyotard develops some reflections on science and technology within the scope of his postmodern philosophy [see The Postmodern Condition]. The changing status of science and technology is a primary feature of the postmodern condition, and Lyotard calls certain new forms of science postmodern. His concern with an ontology of events and a politics of competing representations of those events underlies his theorization of science and technology in postmodernity, in which the collapse of metanarratives has meant the proliferation of multiple, incommensurable language games (of which science is only one). We should interpret Lyotard as taking this to be a good thing, since such a proliferation more accurately reflects his general ontological view of the world as composed of events which give rise to multiple interpretations, and which can never be accurately captured by a single narrative. Metanarratives do violence to alternative representations of events that are valid in their own right. Lyotard sees the rise of capital, science and technology linked through legitimation by performativity as a similar threat, however. He calls this threat “terrorism”: the threat of exclusion from playing a language game.

The principle of legitimation functioning in capitalism is efficiency or performativity [see The Postmodern Condition], and this principle attempts to be hegemonic. Science and technology are prime candidates for this attempted hegemony, since they contribute to the growth of capital. Lyotard accepts that performativity is a legitimate criterion for technology, but argues that it is not proper to science. He develops his argument around what he calls postmodern science, by which he means recent sciences such as Benoit Mandelbrot’s fractal theory and Rene Thom’s catastrophe theory that search for instabilities rather than regularities in systems. Following to some extent philosophers of science Thomas Kuhn and Paul Feyerabend, Lyotard argues that the performativity criterion does not accurately capture the kind of knowledge developed in the sciences nor the way such knowledge develops. For Lyotard, science is a language game to which legitimation by performativity is not proper. Such performativity merely subordinates science to capital. According to Lyotard, it is the idea of a deterministic system that allows performativity in science, since determinism allows the prediction and calculation of input/output values.

Postmodern science, however, does not function according to a legitimation by performativity precisely because it undermines determinism. Postmodern science searches for instabilities in systems, undermining predictability. Lyotard cites thermodynamics as the beginning of performativity in terms of determinism, and suggests that quantum mechanics and atomic physics have limited the applicability of this principle. Postmodern sciences, which concern themselves with undecidables, the limits of precise control, conflicts characterized by incomplete information, “fracta,” catastrophes, and pragmatic paradoxes, continue to undermine performativity in the form of determinism. Furthermore, postmodern science is undermining legitimation by performativity by retheorizing the way science itself develops: science does not develop in a progressive fashion and towards a unified knowledge, but in a discontinuous and paradoxical manner, undermining previous paradigms by the development of new ones. This is what Lyotard calls legitimation by paralogy. He suggests that science may be undergoing a paradigm shift from deterministic performativity to the paralogy of instabilities. Yet this is only a possibility: performativity still looms large on the horizon. Lyotard suggests science could go either way. He champions paralogy over performativity, since it contributes to healthy research in the sciences and undermines the hegemonic control capital attempts to have. Postmodern science is about the generation of new ideas rather than the efficient application of existing knowledge.

Lyotard is also concerned about the social impact of science and technology in postmodernity. He sees the performativity criterion as applying not just to science, technology, and capital, but to the State as well. According to the performativity criterion, society is seen as a system which must aim for efficient functioning, and this efficiency is a kind of terror which threatens to exclude inefficient elements. Furthermore, in post-industrial society information has become a primary mode of production, and Lyotard is concerned that in the interests of maximising profits information will become increasingly privatised by corporations. He proposes the possibility of IBM having exclusive control of databases and satellites. In response to these threats, Lyotard proposes that the public be given free access to memory and data banks. This will allow computerization to contribute to knowledge functioning by paralogy rather than by performativity, and to the free functioning of society as a set of heterogenous elements rather than an efficient system, removing the threat of terror.

8. Politics

Lyotard’s early political commitments were to revolutionary socialism and a relatively orthodox Marxism (see Biography and Early Works (b) Algeria). Despite his radical disillusion with these early political commitments, however, a strong political concern remains a central feature of all of Lyotard’s mature works. Lyotard’s notion of the political, however, must be understood as quite distinct from that employed in much traditional and contemporary politics and political theory. Having rejected the possibility of a politics based on a single theory that will accurately capture the truth of all social events (such as Marxism), Lyotard’s later concern is to do justice to multiple social realities. He is concerned with the free proliferation of heterogenous elements in society, and for him the institutions of politics and traditional political theory limit multiplicities and differences. Lyotard’s politics can be traced back to his general concern for events and the limits of representation. There is a strong correlation between his concern that events are not done justice by any one theoretical, representational system, and his concern that events of political import are not done justice by the way any particular political party or philosophy represents them.

The politics of the libidinal philosophy revolves around a nuanced reading of Marx and a duplicitous relation to capitalism. While Lyotard has given up on the possibility and desirability of a socialist revolution, he is still interested in the deployment of revolutionary desires. Libidinal Economy contains a reading of Marx’s texts as works of art, an emphasis which seeks to release the libidinal aspects of Marx, the desire for revolution. Lyotard’s interpretation of capitalism in the libidinal economy sees two possibilities inherent in capitalism, each entwined and inextricable. On the one hand, capitalism is a good system for the circulation of libidinal energies; it encourages enterprising explorations of and investments in new areas. On the other hand, capitalism tends to hoard up libidinal energy into structured and regulated systems, restricting its flow. This latter tendency is at work in the capitalist exploitation that Marx rallied against. Lyotard interprets these two tendencies of capitalism in terms of the theory of dissimulation. For Lyotard, there is no possible society that is not open to the desire to exploit and hoard libidinal energy in the way the capitalist does. This means that there is no utopian society free from exploitation, either pre-capitalist or post-revolutionary. Lyotard’s libidinal politics is not aimed at overthrowing capitalism, then, but of working within it to release the libidinal energies dissimulated within its structures. Practically, this also means working within existing political institutions, but “passively,” so as to release as much desire dissimulated within those institutions as possible, without constraining desires through planned outcomes.

Lyotard’s postmodern politics involves the attempt to rethink the political after the death of metanarratives such as Marxism and liberalism. Lyotard rejects all dominant political ideologies as master-narratives which exclude minorities and do violence to the heterogenous nature of social reality. This rejection is manifested in the philosophy of paganism that preceded Lyotard’s postmodernism. Here, the notion of “impiety” associated with the pagan is a rejection of “pious” political ideologies which unquestioningly assert principles and values as universally and unquestioningly true. In its mature form, Lyotard’s postmodern politics deals with the concern for justice and the need to bear witness to the differend. In the case of a differend, a wrong is done to a party who cannot phrase their hurt (See Postmodernism (c) The Differend). For Lyotard, no just resolution of a differend is possible. Because of the radical incommensurability of phrase regimes in the case of a differend, any “resolution” would only assert the legitimacy of one phrase regime at the cost of silencing the other, thus deepening the wrong. Justice demands a witnessing and a remembering of the fact that there is a differend. This means presenting the fact that a wrong has been done which cannot itself be presented. This is then the contradictory task of presenting the unpresentable, a task Lyotard sees as best accomplished in the arena of art.

9. Art and Aesthetics

Lyotard was a prolific writer on both art and philosophical aesthetics. An aesthetic theory focusing on the avant-garde deeply informs both major phases of his philosophical thought (the libidinal and the postmodern). Examples from particular movements in art and individual artists and writers are common in his philosophical works, and in addition he wrote a number of books on individual artists, including Georges Guiffrey, Albert Ayme, Gian-franco Baruchello, Jacques Monory, Valerio Adami, Shusaku Arakawa, and Daniel Buren. Lyotard also organised an art exhibition, Les immatériaux, at the Centre Georges Pompidou in 1985. The exhibition collected works which explored connections between the media, art, space, and matter.

Art has a privileged place in Lyotard’s philosophy of events, since it calls attention to the limits of representation. In the earlier phase of his work, art is celebrated for its figural and libidinal aspects that oppose and deregulate systems of discourse and rational thought. In Lyotard’s postmodern period, art is privileged for its sublime effects and the attention it calls to the differend. It is not all kinds of art that Lyotard celebrates; he is particularly interested in the avant-garde. Some forms of art can reinforce structured systems of meaning, but the special feature of avant-garde art is to disrupt expectations, conventions, and established orders of reception. In Discours, figure, visual arts are associated with the figural and the process of seeing. However, poetry is also privileged as a manifestation of the figural in the way it upsets established orders of meaning, following Lyotard’s move from the figural as simply sensuous to the figural as disruptive force in any system. The libidinal philosophy engages with art on the level of its affective force: shapes and colours act as tensors within the system of signification that the artwork forms, and unlike more rigidly structured systems, artworks more readily release their affective energy into different systems of interpretation, reception, and influence. Furthermore, the process of painting exemplifies the ambiguously passive yet active way in which Lyotard sees the release of libidinal energies as most effective. A painting is not a rigidly pre-planned structured piece of work in which the outcome is determined beforehand, but a process of experimentation. In this process, affects are inscribed on a surface without being strictly controlled by an actively willing and organising subject. The most important artists for Lyotard in this period include Paul Cézanne, Marcel Duchamp, and Robert Delaunay.

In Lyotard’s philosophy of postmodernism and the differend, he develops an aesthetic theory of postmodern art. It is essential to distinguish Lyotard’s concept of postmodern art from other ideas of postmodern art. There are many theories of postmodernism in the arts, literature, architecture, and other areas of cultural practise. Other theorists (such as Jean Baudrillard) have also proposed aesthetic theories of postmodernism which differ from Lyotard’s understanding of postmodernism in the arts. In particular, Lyotard’s postmodern art must be distinguished from the stylistic trends often called postmodern in the art world (such as the anti-modern return to representational realism or the simulationism of Peter Halley, Sherrie Levine, Jeff Koons and others). Lyotard’s concept of postmodernism in the arts relates more to what is usually called modernism in the arts. It focuses on the experimentation of the avant-garde, and Lyotard takes as privileged examples Abstract Expressionism and particularly the work of Barnett Newman. Lyotard makes his own distinction between the categories of modern and postmodern in art, however, in a couple of ways. Firstly, postmodernism is said to be the avant-garde movement always at work within modernism itself. It is that which is so new and different it can only be called modern in retrospect. In this sense, postmodernism is the spirit of experimentation that drives modernism into ever-changing forms; it is the disruptive force that unsettles accepted rules for reception and meaning. For Lyotard something must be postmodern before it can become modern. That is, it must be unsettling before it becomes an accepted norm.

Secondly, however, according to Lyotard postmodern avant-garde art never entirely loses its ability to disturb. This power of disturbance is related to the feeling of the sublime, and it is an indication of the differend. In this context, modern and postmodern art can be distinguished in the following way. Both are concerned with the unpresentable: that which cannot be presented (or represented) in art. Modern art, however, presents the fact that there is an unpresentable, while postmodern art attempts to present the unpresentable. This is a paradoxical task, and arouses in the viewer the mixture of pleasure and pain that is the sublime. Lyotard takes Barnett Newman’s work as a paragon of postmodern, avant-garde art. Newman consciously seeks to achieve the sublime in his paintings, and Lyotard believes he achieves this by making his viewers feel that something profound and important is going on in his works, but without being able to identify what this is. Postmodern art has a political importance for Lyotard, since it can call attention to differends through the feeling of the sublime, showing us that a wrong has been done. Bearing witness to the differend is the primary focus of Lyotard’s postmodern politics, and art is the privileged arena in which this witnessing takes place.

10. Late Works

a. Malraux

Two of Lyotard’s latest works were on the French writer, activist, and politician, André Malraux. Signed, Malraux is an unconventional autobiography. Lyotard’s philosophical commitments distance him from the presuppositions underlying the traditional genre of biography, where the subject is assumed to be unified and the text is taken to represent the truth about that subject. Lyotard instead takes Malraux as a set of heterogenous elements (texts, political activities, personal relationships, etc), which he, as author, consciously unifies through the creation of a fictional character. Lyotard’s interest in Malraux may be explained through the commonalities they share, in particular a problematic relation to the political and an attempted solution to this problem through art. Soundproof Room: Malraux’s Anti-Aesthetics situates Malraux’s work in a nihilist and abjectivist tradition of writers that includes Louis Céline, Georges Bataille, Antonin Artaud, and Albert Camus. What these writers share is a concern with the decline of belief in objective values (the “death of God”) and the strangeness and nausea of the human body.

b. Augustine

The Confession of Augustine was incomplete at the time of Lyotard’s death, and has been published posthumously in partial form, with working notes appended. At first glance this somewhat cryptic, poetic, and quasi-religious work seems to bear little resemblance to any other piece in Lyotard’s oeuvre. On closer inspection, however, the themes Lyotard works through in his reading of Augustine’s Confessions can be recognised as those already touched on in earlier works. The discussion of signs recalls Lyotard’s analysis of the nihilism of semiotics in Libidinal Economy, where he refers to Augustine, and what is perhaps the main theme of this work – Augustine’s writing as a study in the phenomenology of time – is referred to in the earlier paper “The Sublime and the Avant-Garde.” Lyotard reads Augustine as the precursor to the phenomenological studies of time developed by Edmund Husserl, Martin Heidegger, and Jean-Paul Sartre. This study problematises the temporal mode of the ‘now’, the present, in its relations to the past and the future. The problematic of time is a recurring feature in Lyotard’s work, and thus The Confession of Augustine can be seen as a further investigation into one of Lyotard’s ongoing concerns.

11. References and Further Reading

The following is a list of books by and about Lyotard available in English. For further bibliographical references, including further original French editions, journal articles, and contributions by and about Lyotard, see Lyotard’s  Peregrinations and Joan Nordquist’s Jean-François Lyotard: A Bibliography.

a. Books by Lyotard

  • Phenomenology, trans. Brian Beakley (Albany: State University of New York Press, 1991). Trans. of La Phénoménology (Paris: Presses Universitaires de France, 1986). [1st. ed. 1954]
  • Discourse, Figure, trans. Antony Hudek and Mary Lydon (Minneapolis: University of Minnesota Press, 2011). Trans. of Discours, figure (Paris: Klincksieck, 1971).
  • Driftworks, ed. Roger McKeon (New York: Semiotext(e), 1984). Trans. of several essays from Dérive à partir de Marx et Freud (Paris: Union Général d’Editions, 1973) and Des Dispositifs Pulsionnels (Paris: Union Général d’Editions, 1973).
  • Libidinal Economy, trans. Iain Hamilton Grant (London: Athlone, 1993). Trans. of Économie libidinale (Paris: Minuit, 1974).
  • Duchamp’s TRANS/formers, ed. Herman Parret, trans. Ian McLeod. Jean-François Lyotard: Writings on Contemporary Art and Artists vol. III (Leuven: Leuven University Press, 2010). Bilingual edition with trans. of Les Transformateurs Duchamp (Paris: Galilée, 1977).
  • Pacific Wall, trans. Bruce Boone (Venice: Lapis Press, 1990). Trans. of Le Mur du Pacifique (Paris: Galilée, 1979).
  • (With Jean-Loup Thébaud) Just Gaming, trans. Wlad Godzick (Minneapolis: University of Minnesota Press, 1985). Trans. of Au juste: conversations. (Paris: Bourgois, 1979).
  • The Postmodern Condition: A Report on Knowledge, trans. Geoff Bennington and Brian Massumi (Manchester: Manchester University Press, 1984). Trans. of La Condition postmoderne: rapport sur le savoir (Paris: Minuit, 1979).
  • The Differend: Phrases in Dispute, trans. Georges Van Den Abbeele (Manchester: Manchester University Press, 1988). Trans. of Le Différend (Paris: Minuit, 1983).
  • The Assassination of Experience by Painting, Monory, trans. Rachel Bowlby and Jeanne Bouniort, ed. Sarah Wilson (London: Black Dog, 1998). Bilingual edition with trans. of L’Assassinat de l’expérience par la peinture, Monory (Paris: Le Castor Astral, 1984). Also forthcoming as Vol. VI of Jean-François Lyotard: Writings on Contemporary Art and Artists (Leuven: Leuven University Press).
  • Enthusiasm: The Kantian Critique of History, trans. Georges Van Den Abbeele (Stanford: Stanford University Press, 2009). Trans. of L’Enthousiasme: la critique kantienne de l’histoire (Paris: Galilée, 1986).
  • The Postmodern Explained to Children, ed. Julian Pefanis and Morgan Thomas (Sydney: Power Publications, 1992). Trans. of Le Postmoderne expliqué aux enfants: correspondance 1982-1985 (Paris: Galilée, 1986).
  • What to Paint? Adami, Arakawa, Buren, ed. Herman Parret. Jean-François Lyotard: Writings on Contemporary Art and Artists vol. V (Leuven: Leuven University Press, forthcoming). Bilingual edition with trans. of Que peindre? Adami. Arakawa. Buren (Paris: Éditions de la Différence, 1987).
  • Peregrinations: Law, Form, Event (New York: Columbia University Press, 1988).
  • Heidegger and “The Jews”, trans. Andreas Michel and Mark S. Roberts (Minneaplis: University of Minnesota Press, 1990). Trans. of Heidegger et “les juifs” (Paris: Galilée, 1988).
  • The Inhuman: Reflections on Time, trans. Geoffrey Bennington and Rachel Bowlby (Cambridge: Polity Press, 1991). Trans. of L’Inhumain: causeries sur le temps (Paris: Galilée, 1988).
  • Lessons on the Analytic of the Sublime: Kant’s Critique of Judgment, 23-29, trans. Elizabeth Rottenberg (Stanford: Stanford University Press, 1994). Trans. of Leçons sur l’Analytique du sublime: Kant, Critique de la faculté de juger, 23- 29 (Paris: Galilée, 1991).
  • Sam Francis: Lesson of Darkness, trans. Geoffery Bennington, ed. Herman Parret. Jean-François Lyotard: Writings on Contemporary Art and Artists vol. II (Leuven: Leuven University Press, 2010). Bilingual edition. First edition: Sam Francis: Lesson of Darkness, trans. Geoffery Bennington (Los Angeles, CA: Lapis Press, 1993).
  • Postmodern Fables, trans. Georges Van Den Abbeele (Minneapolis: University of Minnesota Press, 1997). Trans. of Moralités postmodernes (Paris: Galilée, 1993).
  • (with Eberhard Gruber) The Hyphen: Between Judaism and Christianity, trans. Pascale-Anne Brault and Michael Naas (Atlantic Highlands, NJ: Humanity Books, 1999). Trans. of Un trait d’union (Sainte-Foy, Quebec: Editions Le Griffon d’argile, 1994).
  • Signed Malraux, trans. Robert Harvey (Minneapolis: University of Minnesota Press, 1999). Trans. of Signé Malraux: biographie (Paris: Grasset, 1996).
  • Karel Appel: A Gesture of Colour, ed. Herman Parret, trans. Vlad Ionescu and Peter W. Milne. Jean-François Lyotard: Writings on Contemporary Art and Artists vol. I. (Leuven: Leuven University Press, 2009). Bilingual edition. Original German edition: Karel Appel: Ein Farbgestus, Essays zur Kunst Karel Appels mit einer Bildauswahl des Autors (Berlin: Gachnang & Springer, Bern, 1998).
  • Soundproof Room: Malraux’s Anti-aesthetics, trans. Robert Harvey (Stanford: Stanford University Press, 2001). Trans. of La Chambre sourde: L’Antiésthetique de Malraux (Paris: Galilée, 1998).
  • The Confession of Augustine, trans. Richard Beardsworth (Stanford: Stanford University Press, 2000). Trans. of La Confession d’Augustin (Paris: Galilée, 1998).
  • The Lyotard Reader, ed. Andrew Benjamin (Oxford: Blackwell, 1989).
  • Toward the Postmodern, ed. Robert Harvey and Mark S. Roberts (New Jersey: Humanities Press, 1993).
  • Jean-François Lyotard: Political Writings, trans. and ed. Bill Readings and Kevin Paul Geiman (London: UCL, 1993).
  • The Lyotard Reader and Guide, Ed. Keith Crome and James Williams (Edinburgh: Edinburgh University Press, 2006).
  • Miscellaneous Texts I: Aesthetics and Theory of Art, ed. Herman Parret, trans. Vlad Ionescu, Erica Harris and Peter W. Milne. Jean-François Lyotard: Writings on Contemporary Art and Artists vol. IVa (Leuven: Leuven University Press, 2012).
  • Miscellaneous Texts II: Contemporary Artists, ed. Herman Parret, trans. Vlad Ionescu, Erica Harris and Peter W. Milne. Jean-François Lyotard: Writings on Contemporary Art and Artists vol. IVb (Leuven: Leuven University Press, 2012).

b. Books about Lyotard

  • Benjamin, Andrew (ed.), Judging Lyotard (London: Routledge, 1992).
  • Bennington, Geoffrey, Lyotard: Writing the Event (Manchester: Manchester University Press, 1988).
  • Bennington, Geoffrey, Late Lyotard (CreateSpace, 2008).
  • Browning, Gary K., Lyotard and the End of Grand Narratives (Cardiff: University of Wales Press, 2000).
  • Carrol, David, Paraesthetics: Foucault, Lyotard, Derrida (London: Routledge, 1987).
  • Crome, Keith, Lyotard and Greek Thought: Sophistry (Houndmills, Basingstoke: Palgrave Macmillan, 2004).
  • Curtis, Neal, Against Autonomy: Lyotard, Judgement and Action (Aldershot, Hants & Burlington, VT: Ashgate, 2001).
  • Dhillon, Pradeep A. and Paul Standish, eds., Lyotard: Just Education (London & New York: Routledge, 2000).
  • Grebowicz, Margaret (ed.), Gender After Lyotard (Albany: SUNY, 2007).
  • Haber, Honi Fern, Beyond Postmodern Politics : Lyotard, Rorty, Foucault (New York : Routledge, 1994).
  • Harvey, Robert, ed., Afterwords: Essays in Memory of Jean-François Lyotard (Stony Brook, NY: Humanities Institute, 2000).
  • Harvey, Robert and Lawrence R. Schehr, eds., Jean-François: Time and Judgment (New Haven & London: Yale University Press, 2001).
  • Jones, Graham, Lyotard Reframed (London: I. B. Tauris, forthcoming).
  • Kearney, Richard, Poetics of Imagining: From Husserl to Lyotard (London: HarperCollins Academic, 1991).
  • Kilian, Monika, Modern and Postmodern Strategies: Gaming and the Question of Morality: Adorno, Rorty, Lyotard, and Enzensberger (New York: Lang, 1998).
  • Malpas, Simon, Jean-François Lyotard (New York: Routledge, 2002).
  • Nordquist, Joan, Jean-François Lyotard: A Bibliography (Santa Cruz, CA: Reference and Research Services, 1991).
  • Nouvet, Claire, Zrinka Stahuljak and Kent Still (eds.), Minima Memoria: Essays in the Wake of Jean-François Lyotard (Stanford, CA: Stanford University Press, 2006).
  • Pefanis, Julian, Heterology and the Postmodern Bataille, Baudrillard, and Lyotard (Durham: Duke University Press, 1991).
  • Peters, Michael (ed.), Education and the Postmodern Condition (Wesport, Connecticut & London: Bergin & Garvey, 1995).
  • Raffel, Stanley, Habermas, Lyotard and the Concept of Justice (London: Macmillan Press, 1992).
  • Readings, Bill, Introducing Lyotard: Art and Politics (London: Routledge, 1991).
  • Robbins, Derek (ed.), Jean-François Lyotard. 3 vols. (London; Thousand Oaks: Sage, 2004).
  • Rojeck, Chris and Turner, Bryan S. (ed.) The Politics of Jean-François Lyotard. (London: Routledge, 1998).
  • Sheilds, Rob and Heidi Bickis (eds.), Listening to the Late: New Encounters with Jean-François Lyotard (Surrey: Ashgate, forthcoming).
  • Silverman, Hugh J. (ed.), Lyotard: Philosophy, Politics and the Sublime (New York: Routledge, 2002).
  • Sim, Stuart, Jean-François Lyotard (New York: Prentice Hall/Harvester Wheatsheaf, 1995).
  • Sim, Stuart, Lyotard and the Inhuman (Cambridge: Icon/Totem, 2000).
  • Sim, Stuart (ed.), The Lyotard Dictionary (Edinburgh: Edinburgh University Press, 2011).
  • Slade, Andrew, Lyotard, Beckett, Duras, and the Postmodern Sublime (New York: Peter Lang, 2007).
  • Steuerman, Emilia, The Bounds of Reason: Habermas, Lyotard, and Melanie Klein on Rationality (London & New York: Routledge, 2000).
  • Taylor, Victor E. and Gregg Lambert (eds.), Jean-François Lyotard: Critical Evaluations in Cultural Theory (London; New York: Routledge, 2005).
  • Williams, James, Lyotard: Towards a Postmodern Philosophy (Cambridge: Polity Press, 1998).
  • Williams, James, Lyotard and the Political (London: Routledge, 2000).
  • Woodward, Ashley, Nihilism in Postmodernity: Lyotard, Baudrillard, Vattimo (Aurora, Colorado: The Davies Group, 2009).

Author Information

Ashley Woodward
Email: phallacy@tpg.com.au
The Melbourne School of Continental Philosophy
Australia

Martin Luther (1483—1546)

lutherGerman theologian, professor, pastor, and church reformer.  Luther began the Protestant Reformation with the publication of his Ninety-Five Theses on October 31, 1517.  In this publication, he attacked the Church’s sale of indulgences.  He advocated a theology that rested on God’s gracious activity in Jesus Christ, rather than in human works.  Nearly all Protestants trace their history back to Luther in one way or another.  Luther’s relationship to philosophy is complex and should not be judged only by his famous statement that “reason is the devil’s whore.”

Given Luther’s critique of philosophy and his famous phrase that philosophy is the “devil’s whore,” it would be easy to assume that Luther had only contempt for philosophy and reason. Nothing could be further from the truth. Luther believed, rather, that philosophy and reason had important roles to play in our lives and in the life of the community. However, he also felt that it was important to remember what those roles were and not to confuse the proper use of philosophy with an improper one.

Properly understood and used, philosophy and reason are a great aid to individuals and society. Improperly used, they become a great threat to both. Likewise, revelation and the gospel when used properly are an aid to society, but when misused also have sad and profound implications.

Table of Contents

  1. Biography
  2. Theology
    1. Theological Background: William of Occam
    2. Theology of the Cross
    3. The Law and the Gospel
    4. Deus Absconditus – The Hidden God
  3. Relationship to Philosophy
  4. References and Further Reading
    1. Primary Sources
    2. Secondary Sources

1. Biography

Martin Luther was born to peasant stock on November 10, 1483 in Eisleben in the Holy Roman Empire – in what is today eastern Germany.  Soon after Luther’s birth, his family moved from Eisleben to Mansfeld. His father was a relatively successful miner and smelter and Mansfeld was a larger mining town. Martin was the second son born to Hans and Magarete (Lindemann) Luther. Two of his brothers died during outbreaks of the plague.  One other brother, James, lived to adulthood.

Luther’s father knew that mining was a cyclical occupation, and he wanted more security for his promising young son.  Hans Luther decided that he would do whatever was necessary to see that Martin could become a lawyer. Hans saw to it that Martin started school in Mansfeld probably around seven. The school stressed Latin and a bit of logic and rhetoric.  When Martin was 14 he was sent to Magdeburg to continue his studies. He stayed only one year in Magdeburg and then enrolled in Latin school in Eisenach until 1501. In 1501 he enrolled in the University of Erfurt where he studied the basic course for a Master of Arts (grammar, logic, rhetoric, metaphysics, etc.). Significant to his spiritual and theological development was the principal role of William of Occam’s theology and metaphysics in Erfurt’s curriculum. In 1505, it seemed that Han’s Luther’s plans were about to finally be realized.  His son was on the verge of becoming a lawyer.  Han’s Luther’s plans were interrupted by a thunderstorm and vow.

In July of 1505, Martin was caught in a horrific thunderstorm.  Afraid that he was going to die, he screamed out a vow, “Save me, St. Anna, and I shall become a monk.” St. Anna was the mother of the Virgin Mary and the patron saint of miners. Most argue that this commitment to become a monk could not have come out of thin air and instead represents an intensification experience in which an already formulated thought is expanded and deepened. On July 17th Luther entered the Augustinian Monastery at Erfurt.

The decision to enter the monastery was a difficult one. Martin knew that he would greatly disappoint his parents (which he did), but he also knew that one must keep a promise made to God. Beyond that, however, he also had strong internal reasons to join the monastery. Luther was haunted by insecurity about his salvation (he describes these insecurities in striking tones and calls them Anfectungen or Afflictions.) A monastery was the perfect place to find assurance.

Assurance evaded him however. He threw himself into the life of a monk with verve. It did not seem to help. Finally, his mentor told him to focus on Christ and him alone in his quest for assurance. Though his anxieties would plague him for still years to come, the seeds for his later assurance were laid in that conversation.

In 1510, Luther traveled as part of delegation from his monastery to Rome (he was not very impressed with what he saw.) In 1511, he transferred from the monastery in Erfurt to one in Wittenberg where, after receiving his doctor of theology degree, he became a professor of biblical theology at the newly founded University of Wittenberg.

In 1513, he began his first lectures on the Psalms.  In these lectures, Luther’s critique of the theological world around him begins to take shape. Later, in lectures on Paul’s Epistle to the Romans (in 1515/16) this critique becomes more noticeable. It was during these lectures that Luther finally found the assurance that had evaded him for years. The discovery that changed Luther’s life ultimately changed the course of church history and the history of Europe.  In Romans, Paul writes of the “righteousness of God.” Luther had always understood that term to mean that God was a righteous judge that demanded human righteousness. Now, Luther understood righteousness as a gift of God’s grace. He had discovered (or recovered) the doctrine of justification by grace alone. This discovery set him afire.

In 1517, he posted a sheet of theses for discussion on the University’s chapel door. These Ninety-Five Theses set out a devastating critique of the church’s sale of indulgences and explained the fundamentals of justification by grace alone. Luther also sent a copy of the theses to Archbishop Albrecht of Mainz calling on him to end the sale of indulgences. Albrecht was not amused. In Rome, cardinals saw Luther’s theses as an attack on papal authority. In 1518 at a meeting of the Augustinian Order in Heidelberg, Luther set out his positions with even more precision. In the Heidelberg Disputation, we see the signs of a maturing in Luther’s thought and new clarity surrounding his theological perspective – the Theology of the Cross.

After the Heidelberg meeting in October 1518, Luther was told to recant his positions by the Papal Legate, Thomas Cardinal Cajetan. Luther stated that he could not recant unless his mistakes were pointed out to him by appeals to “scripture and right reason” he would not, in fact, could not recant. Luther’s refusal to recant set in motion his ultimate excommunication.

Throughout 1519, Luther continued to lecture and write in Wittenberg. In June and July of that year, he participated in another debate on Indulgences and the papacy in Leipzig. Finally, in 1520, the pope had had enough. On June 15th the pope issued a bull (Exsurge Domini – Arise O’Lord) threatening Luther with excommunication. Luther received the bull on October 10th. He publicly burned it on December 10th.

In January 1521, the pope excommunicated Luther.  In March, he was summonsed by Emperor Charles V to Worms to defend himself. During the Diet of Worms, Luther refused to recant his position. Whether he actually said, “Here I stand, I can do no other” is uncertain. What is known is that he did refuse to recant and on May 8th was placed under Imperial Ban.

This placed Luther and his duke in a difficult position. Luther was now a condemned and wanted man. Luther hid out at the Wartburg Castle until May of 1522 when he returned to Wittenberg. He continued teaching. In 1524, Luther left the monastery. In 1525, he married Katharina von Bora.

From 1533 to his death in 1546 he served as the Dean of the theology faculty at Wittenberg. He died in Eisleben on 18 February 1546.

2. Theology

a. Theological Background: William of Occam

The medieval worldview was rational, ordered, and synthetic. Thomas Aquinas embodied it. It survived until the acids of war, plague, poverty, and social discord began to eat away its underlying presupposition – that the world rested on the being of God.

All of life was grounded in the mind of God. In the hierarchy of Being that establishes justice, the church was understood as the connection between the secular and divine. However, as the crises of the late middle ages increased, this reassurance no longer assuaged.

William of Occam recognized the shortcomings of Thomas’s system and cut away most of the ontological grounding of existence. In its place, Occam posited revelation and covenant. The world does not need to be grounded in some artificial, unknowable, ladder of Being.  Instead, one must rely on God’s faithfulness. We are contingent upon God alone.

This contingency would be terrible and unbearable without the assurance of God’s covenant. In terms of God’s absolute power (potentia absoluta), God can do anything.  He can make a lie the truth, he can make adultery a virtue and monogamy a vice. The only limit to this power is consistency—God cannot contradict his own essence. To live in a world ordered by whim would be terrible; one would never know if one was acting justly or unjustly. However, God has decided on a particular way of acting (potentia ordinata). God has covenanted with creation, and committed himself to a particular way of acting.

While rejecting some of Thomas, Occam did not reject the entire scholastic project.  He, too, synthesized and depended heavily upon Aristotle. This dependence becomes significant in the covenantal piety of justification. The fundamental question of justification is where does one find fellowship with God, i.e., how does one know one is accepted by God?  The logic of Aristotle taught Thomas and Occam that “like is known by like.”  Thus, union or fellowship with God must take place on God’s level. How does this happen? Practice.

All people are born, it was argued, with potential. Even though all creation suffers under the condemnation of the Fall of Adam and Eve, there remains a divine spark of potentiality, a syntersis. This potential must be actualized. It must be habituated. Habituation was important for both Thomas and Occam; however, Occam slightly modifies Thomas and that modification has important implications in Luther’s search for a gracious God.

From Thomas’s perspective the divine spark is infused with God’s grace, giving one the power to be contrite (contritio) and co-operate with God. This co-operation with God’s grace merits God’s reward (meritum de condign).  However, Occam asked an important question: if the process begins with God’s infusion of grace, can it truly merit anything? He answered, no! Therefore you should do the best you can. By doing your best, even as minimal as it is, this will merit (meritum de congruo) an infusion of grace: facienti quod in se est Deus non denegat gratiam (God will not deny his grace to anyone who does what lies within him.) Doing one’s best meant rejecting evil and doing good.

Within this context of covenant Luther struggled to prove that he was good enough to merit God’s grace. However, he failed to convince himself. He might have been contrite, but was he contrite enough?  This uncertainty afflicted (Anfectungen) him for years.

b. Theology of the Cross

Luther’s attempts to prove his worthiness failed.  He continued to be plagued by uncertainty and doubt concerning his salvation. Finally, during his Lectures on Paul’s Epistle to the Romans he found solace.  Instead of storehouses of merit, indulgences, habituation, and “doing what is within one,” God accepts the sinner in spite of the sin. Acceptance is based on who one is rather than what one does. Justification is bestowed rather than achieved. Justification is not based on human righteousness, but on God’s righteousness—revealed and confirmed in Christ.

In St. Paul, Luther finally found a word of hope. He finally found a word of assurance and discovered the graciousness of God. The discovery of God’s graciousness pro me (for me) revolutionizes all aspects of Luther’s life and thought. From now on, Luther’s response to the trials of his life and the crises of the late medieval period was to be certain of God, but never to be secure in human society.

A tautology of Luther’s theology becomes: one must always “Let God be God.”  This frees human beings to be human.  We do not have to achieve salvation; rather, it is a gift to be received.  Salvation thus is the presupposition of the life of the Christian and not its goal.  This belief engendered his rejection of indulgences and his movement to a theologia crucis (Theology of the Cross).

Why were indulgences rejected? Simply put, they epitomize everything that from Luther’s perspective was wrong with the church. Instead of dependence upon God, they placed salvation in the hands of traveling salesmen hocking indulgences. They embody his rejection of all types of theology that are based in models of covenant.

The import of the Theology of the Cross was the discovery of God’s passive righteousness and theological models based in Testament.  From the author of Hebrews, Luther takes an understanding of Jesus Christ as the last will and testament of God. God has written humanity in the will as heirs of God and co-heirs with Christ (See Romans 8).

The rejection of covenant model theologies and the movement to testament is a fundamental aspect of Luther’s theologia crucis. It is a rejection of any type of a theology of glory (theologia gloriae). The rejection of the theology of glory has a profound impact on Luther’s anthropology of a Christian.

This rejection is illustrated by Luther’s small but significant alteration of Augustinian anthropology. In that system, human beings are partim bonnum, partim malum or partim iustus, partim peccare (partly good/just, partly bad/sinner). The goal of a Christian’s life is to grow in righteousness. In other words, one must work to decrease the side of the equation that is bad and sinful. As one decreases the sin in oneself, the good and just aspects of one’s being increase.

Luther’s anthropology, however, is an outright and total rejection of progress; because no matter how one understands it, it is a work and thus must be rejected. Luther’s alternative characterization of Christian anthropology was simul iustus et peccator (at once righteous and sinful.) Now, he begins to speak of righteousness in two ways: coram deo (righteousness before God) and coram hominibus (before man). Instead of a development in righteousness based in the person, or an infusion of merit from the saints, a person is judged righteous before God because of the works of Christ. But, absent the perspective of God and the righteousness of Christ, based on one’s own merit—a Christian still looks like a sinner.

c. The Law and the Gospel

The distinction between the Law and the Gospel is a fundamental dialectic in Luther’s thought. He argues that God interacts with humanity in two fundamental ways – the law and the gospel. The law comes to humanity as the commands of God – such as the Ten Commandments. The law allows the human community to exist and survive because it limits chaos and evil and convicts us of our sinfulness. All humanity has some grasp of the law through the conscience. The law convicts us our sin and drives us to the gospel, but it is not God’s avenue for salvation.

Salvation comes to humanity through the Good News (Gospel) of Jesus Christ. The Good News is that righteousness is not a demand upon the sinner but a gift to the sinner. The sinner simply accepts the gift through faith. For Luther the folly of indulgences was that they confused the law with the gospel. By stating that humanity must do something to merit forgiveness they promulgated the notion that salvation is achieved rather than received. Much of Luther’s career focused on deconstructing the idea of the law as an avenue for salvation.

d. Deus Absconditus – The Hidden God

Another fundamental aspect of Luther’s theology is his understanding of God. In rejecting much of scholastic thought Luther rejected the scholastic belief in continuity between revelation and perception. Luther notes that revelation must be indirect and concealed. Luther’s theology is based in the Word of God (thus his phrase sola scriptura – scripture alone). It is based not in speculation or philosophical principles, but in revelation.

Because of humanity’s fallen condition, one can neither understand the redemptive word nor can one see God face to face. Here Luther’s exposition on number twenty of his Heidelberg Disputation is important. It is an allusion to Exodus 33, where Moses seeks to see the Glory of the Lord but instead sees only the backside. No one can see God face to face and live, so God reveals himself on the backside, that is to say, where it seems he should not be. For Luther this meant in the human nature of Christ, in his weakness, his suffering, and his foolishness.

Thus revelation is seen in the suffering of Christ rather than in moral activity or created order and is addressed to faith. The Deus Absconditus is actually quite simple. It is a rejection of philosophy as the starting point for theology. Why? Because if one begins with philosophical categories for God one begins with the attributes of God: i.e., omniscient, omnipresent, omnipotent, impassible, etc. For Luther, it was impossible to begin there and by using syllogisms or other logical means to end up with a God who suffers on the cross on behalf of humanity. It simply does not work. The God revealed in and through the cross is not the God of philosophy but the God of revelation. Only faith can understand and appreciate this, logic and reason – to quote St. Paul become a stumbling block to belief instead of a helpmate.

3. Relationship to Philosophy

Given Luther’s critique of philosophy and his famous phrase that philosophy is the “devil’s whore,” it would be easy to assume that Luther had only contempt for philosophy and reason. Nothing could be further from the truth. Luther believed, rather, that philosophy and reason had important roles to play in our lives and in the life of the community. However, he also felt that it was important to remember what those roles were and not to confuse the proper use of philosophy with an improper one.

Properly understood and used, philosophy and reason are a great aid to individuals and society. Improperly used, they become a great threat to both. Likewise, revelation and the gospel when used properly are an aid to society, but when misused also have sad and profound implications.

The proper role of philosophy is organizational and as an aid in governance. When Cardinal Cajetan first demanded Luther’s recantation of the Ninety-Five Theses, Luther appealed to scripture and right reason. Reason can be an aid to faith in that it helps to clarify and organize, but it is always second-order discourse. It is, following St. Anselm, fides quarenes intellectum (faith seeking understanding) and never the reverse. Philosophy tells us that God is omnipotent and impassible; revelation tells us that Jesus Christ died for humanity’s sin. The two cannot be reconciled. Reason is the devil’s whore precisely because it asks the wrong questions and looks in the wrong direction for answers. Revelation is the only proper place for theology to begin. Reason must always take a back-seat.

Reason does play a primary role in governance and in most human interaction. Reason, Luther argued, is necessary for a good and just society. In fact, unlike most of his contemporaries, Luther did not believe that a ruler had to be Christian, only reasonable. Here, opposite to his discussion of theology, it is revelation that is improper. Trying to govern using the gospel as one’s model would either corrupt the government or corrupt the gospel. The gospel’s fundamental message is forgiveness, government must maintain justice. To confuse the two here is just as troubling as confusing them when discussing theology. If forgiveness becomes the dominant model in government, people being sinful, chaos will increase. If however, the government claims the gospel but acts on the basis of justice, then people will be misled as to the proper nature of the gospel.

Luther was self-consciously trying to carve out proper realms for revelation and philosophy or reason. Each had a proper role that enables humanity to thrive. Chaos only became a problem when the two got confused.One cannot understand Luther’s relationship to philosophy and his discussions of philosophy without understanding that key concept.

4. References and Further Reading

a. Primary Sources

Key Primary Sources in English:

  • Luther’s Works (LW), ed. J. Pelikan and H.T. Lehmann. St. Louis, MO: Concordia, and Philadelphia, PA: Fortress Press, 1955 -1986. 55 vols.
    • Of all the major works of Luther, this is the best edition in English. It will soon be out on CD-Rom.
  • 1513-1515, Lectures on the Psalms (LW: 10 -11).
    • Luther’s earliest lectures. These are important because we begin to see themes that will eventually become the Theology of the Cross.
  • 1515-1516, Lectures on Romans (LW: 25).
    • The patterns of the Theology of the Cross become a bit more evident. Many scholars believe that Luther made his final discovery of the doctrine of Justification by Faith while giving these lectures.
  • 1517, Ninety Five Theses (LW: 31).
    • The seminal document of the Reformation in Germany. These theses led to the eventual break with Rome over indulgences and grace.
  • 1518, Heidelberg Disputation (LW: 31)
    • The best example of Luther’s emerging Theology of the Cross.He contrasts human works to God’s works in and through the Cross and shows the emptiness of human achievement and the importance of grace.
  • 1519, Two Kinds of Righteousness (LW:31).
    • Summary of his position that righteousness is received rather than achieved.
  • 1520, Freedom of a Christian (LW: 31).
    • Luther’s ethics, in which he explains that “A Christian is a perfectly free lord of all, subject to none. A Christian is perfectly dutiful servant of all, subject to all.”
  • 1520, To the German Nobility (LW: 44).
    • A call for reform in Germany, it highlights some of the complexity of Luther’s thought on church and state relations.
  • 1521, Concerning the Letter and the Spirit (LW:39).
    • A summary of the Law and Gospel.
  • 1522, Preface to Romans (LW: 35).
    • A summary of Luther’s understanding of Justification by Faith.
  • 1523, On Temporal Authority (LW 45).
    • Sets out Luther’s doctrine of the Two Kingdom’s most clearly.
  • 1525, The Bondage of the Will (LW: 33).
    • In a debate with Erasmus about human freedom and bondage to sin. Luther argues that humanity is bound to sin completely and only freed from that bondage by God’s Grace.
  • 1525, Against the Robbing and Murdering Hordes of Peasants (LW:45).
    • Written before the Peasant’s War, it was published afterward.
  • 1530, Larger Catechism (LW:34).
    • A summary of Christian doctrine, to be used in instruction.
  • 1531, Dr. Martin Luther’s Warning to His Dear German People (LW:45).
    • Luther’s first expression of a right to resist tyranny.
  • 1536, Disputation Concerning Justification (LW: 34).
    • A mature presentation of Luther’s doctrine on Justification.
  • 1536, Disputation Concerning Man (LW: 34).
    • His anthropology, but also gives a glimpse of his understanding of the proper role of philosophy and reason.

b. Secondary Sources

Key Secondary Sources in English on the Life and Thought of Luther:

  • Bainton,Roland H.Here I Stand: A Life of Martin Luther.  New York: Abingdon-Cokesbury Press, 1950.
    • The most popular biography of Luther, it is readeable and very thorough.
  • Brecht, Martin. Martin Luther. Three Volumes. Translated by James L. Schaaf. Philadelphia: Fortress Press, 1985-1993.
    • The authoritative biography of Luther.
  • Cameron, Euan. The European Reformation.Oxford: Clarendon Press, 1991.
    • An excellent introduction to the Reformation era.
  • Cargill Thompson,W.D.J. The Political Thought of Martin Luther.  Edited by Philip Broadhead. Totowa, NJ: Barnes & Noble Books, 1984.
    • The best work on Luther’s political theology.
  • Edwards, Mark U., Jr. Luther’s Last Battles: Politics and Polemics, 1531-1546.Ithaca: Cornell University Press, 1983.
    • One of the few books to focus on the older Luther. It is an excellent study in Luther after the Diet of Augsburg.
  • Forde, Gerhard, O.On Being a Theologian of the Cross: Reflections on Luther’s Heidelberg Disputation, 1518. Grand Rapids, MI: Eerdmans, 1997.
    • The Theology of the Cross is a fundamental doctrine in Luther. Forde takes an new look at the doctrine in light of Luther’s role as pastor.
  • George, Timothy. Theology of the Reformers.  Nashville: Broadman Press, 1988.
    • This is an excellent introduction to Luther and puts his thought in dialogue with other major reformers, i.e., Zwingli and Calvin.
  • Lindberg, Carter. The European Reformations Oxford: Blackwell Publishers, Ltd., 1996.
    • The best introduction to the Reformation era, it covers not only the reformers but the context and culture of the era as well.
  • Loewenich, Walter von. Luther’s Theology of the Cross, trans. Herber J.A. Bouman. Minneapolis: Augsburg Publishing House, 1976.
    • The classic work on the Theology of the Cross.
  • Lohse, Bernhard. Martin Luther:An Introduction to his Life and Work.  Translated by Robert C. Schultz.Philadelphia: Fortress Press, 1986.
    • In a handbook format, this is an essential ready-reference to Luther and his works.
  • McGrath, Alister E. The Intellectual Origins of the European Reformation. Oxford: Blackwell Press, 1987.
    • This book covers the scholastic and nominalist background of the reformation.
  • Oberman,Heiko. The Dawn of the Reformation: Essays in Late Medieval and Early Reformation Thought. Edinburgh: T & T Clark, 1986.
    • A classic that places the reformation era within the wider context of the late medieval era and the early modern era.
  • Luther: Man between God and the Devil.  Translated by Eileen Walliser-Schwarzbart. New York: Image Books, Doubleday:1982.
    • An excellent biography of Luther that examines Luther in light of his quest for a gracious God and his fight against the Devil.
  • Ozment, Steven. The Age of Reform:1250-1550:An Intellectual and Religious History of Late Medieval and Reformation Europe.  New Haven:Yale University Press, 1980.
    • Ozment places the reformation in a wider context and sees the impetus for reform stretching back into what is normally considered the High Medieval Era.
  • Pelikan, Jaroslav. The Christian Tradition: A History of the Development of Doctrine. Volume 4: Reformation of Church and Dogma (1300-1700). Chicago: University of Chicago Press, 1984.
    • Part of a five volume history of doctrine, Pelikan looks at the doctrinal issues at work in the reformation. He is not as concerned with history as he is with theological development.
  • Rupp,Gordon. Patterns of Reformation.  Philadelphia: Fortress Press,1969.
    • A thorough study of the wider issues raised by the reformation.
  • Watson,Philip S. Let God be God!: An Interpretation of the Theology of Martin Luther. London: Epworth Press, 1947.
    • A classic study stressing the theocentric nature of Luther’s thought.

Author Information

David M. Whitford
Claflin University
U. S. A.

Bernard Lonergan (1904—1984)

LonerganWhen we try to reconcile opposing moral opinions we usually appeal to shared ethical principles. Yet often enough the principles themselves are opposed. We may then try to reconcile opposing principles by clarifying how we arrived at them. But since most of our principles are cultural inheritances, discussions halt at a tolerant mutual respect, even when we remain convinced that the other person is wrong. What is needed is a method in ethics that can uncover the sources of error. After all, even culturally inherited principles first occurred to someone, and that someone may or may not have been biased. So there is considerable merit to investigating the innate methods of our minds and hearts by which we construe – and sometimes misconstrue – ethical principles. The work of Bernard Lonergan can guide this investigation. His opus covers methodological issues in the natural sciences, the human sciences, historical scholarship, aesthetics, economics, philosophy and theology. He begins with an invitation to consider in ourselves what occurs when we come to knowledge. He then defines a corresponding epistemological meaning of objectivity. From there he lays out basic metaphysical categories applicable in the sciences. Finally, he proposes a methodical framework for collaboration in resolving basic differences in all these disciplines.

This review will begin by tracing the origins of Lonergan’s approach. Following that will be the four steps of a cognitional theory, an epistemology, a metaphysics, and a methodology, particularly as they apply to resolving differences in moral opinions and in ethical principles. Finally, there will be a reexamination of several fundamental categories in ethics.

Table of Contents

  1. Origins
  2. Cognitional Theory
  3. Epistemology (Objectivity)
  4. Metaphysics
    1. Genetic Intelligibility
    2. Dialectical Intelligibility
    3. Radical Unintelligibility
  5. Methodology
  6. Categories
    1. Action, Concepts, and Method
    2. Good and Bad
    3. Better and Worse
    4. Authority and Power
    5. Principles and People
    6. Duties and Rights
  7. Summary
  8. References and Further Reading
    1. Major Works of Lonergan
    2. Shorter Works Relevant to Ethics
    3. Other Works

1. Origins

Bernard Lonergan, a preeminent Canadian philosopher, theologian and economist, (1904-1984) was the principal architect of what he named a “generalized empirical method.” Born in Buckingham, Quebec, Lonergan received a typical Catholic education and eventually entered the Society of Jesus (Jesuits), leading to his ordination to the priesthood in 1936. He specialized in both theology and economics at this time, having been deeply influenced by his doctoral work on Thomas Aquinas and by his long-standing interest in the philosophy of culture and history, honed by his reading of Hegel and Marx. In the early 1950s, while teaching theology in Toronto, Lonergan wrote Insight: A Study of Human Understanding – his groundbreaking philosophical work. Then, in the early 70s, he published his equally fundamental work, Method in Theology. Throughout his career, he lectured and wrote on topics related to theology, philosophy, and economics. The University of Toronto has undertaken the publication of The Collected Works of Bernard Lonergan, for which 20 volumes are projected.

Lonergan aimed to clarify what occurs in any discipline – science, math, historiography, art, literature, philosophy, theology, or ethics. The need for clarification about methods has been growing over the last few centuries as the world has turned from static mentalities and routines to the ongoing management of change. Modern languages, modern architecture, modern art, modern science, modern education, modern medicine, modern law, modern economics, the modern idea of history and the modern idea of philosophy all are based on the notion of ongoing creativity. Where older philosophies sought to understand unchanging essentials, logic and law were the rule. With the emergence of modernity, philosophies have turned to understanding the innate methods of mind by which scientists and scholars discover what they do not yet know and create what does not yet exist.

The success of the empirical methods of the natural sciences confirms that the mind reaches knowledge by an ascent from data, through hypothesis, to verification. To account for disciplines that deal with humans as makers of meanings and values, Lonergan generalized the notion of data to include the data of consciousness as well as the data of sense. From that compound data, one may ascend through hypothesis to verification of the operations by which humans deal with what is meaningful and what is valuable. Hence, a “generalized empirical method” (GEM).

Lonergan also referred to GEM as a critical realism. By realism, in line with the Aristotelian and Thomist philosophies, he affirmed that we make true judgments of fact and of value, and by critical, he aimed to ground knowing and valuing in a critique of the mind similar to that proposed by Kant.

GEM traces to their roots in consciousness the sources of the meanings and values that constitute personality, social orders, and historical developments. GEM also explores the many ways these meanings and values are distorted, identifies the elements that contribute to recovery, and proposes a framework for collaboration among disciplines to overcome these distortions and promote better living together.

These explorations are conducted in the manner of personal experiments. In Insight and Method in Theology, Lonergan leads readers to discover what happens when they reach knowledge, evaluate options, and make decisions. He expects that those who make these discoveries about themselves reach an explicit knowledge of how anyone reaches knowledge and values, how inquiries are guided by internal criteria, and how therefore any inquiry may be called “objective.” Such objectivity implies structural parallels between the processes of inquiry and the structures of what any inquirer, in any place or time, can know and value. Lonergan proposes that these structures, in turn, provide a personally verified clarification of the methods specific to the natural and human sciences, historiography and hermeneutics, economics, aesthetics, theology, ethics, and philosophy itself.

So there are four questions, as it were, that GEM proposes for anyone seeking to ground the methods of any discipline. (1) A cognitional theory asks, “What do I do when I know?” It encompasses what occurs in our judgments of fact and value. (2) An epistemology asks, “Why is doing that knowing?” It demonstrates how these occurrences may appropriately be called “objective.” (3) A metaphysics asks “What do I know when I do it?” It identifies corresponding structures of the realities we know and value. (4) A methodology asks, “What therefore should we do?” It lays out a framework for collaboration, based on the answers to the first three questions.

In the following sections, a review of how ethicists familiar with GEM deal with each of these four questions will reveal dimensions that directly affect one’s method in ethics.

2. Cognitional Theory

GEM relies on a personal realization that we know in two different manners – commonsense and theoretical. In both we experience insights, which are acts of understanding. In the commonsense mode, we grasp how things are related to ourselves because we are concerned about practicalities, our interpersonal relations, and our social roles. In the theoretical mode, we grasp how things are related to each other because we want to understand the nature of things, such as the law of gravity in physics or laws of repression in psychology. Theoretical insights may not be immediately practical, but because they look at the always and everywhere, their practicality encompasses any brand of common sense with its preoccupation with the here and now.

The theoretical terms defined in GEM should not be confused with their commonsense usage. To take a basic distinction, GEM defines morality as the commonsense assessments and behaviors of everyday living and ethics as the theoretical constructs that shape morality.

Each mode of knowing has its proper criteria, although not everyone reputed to have either common sense or theoretical acumen can say what these criteria are. A recurring theme throughout Lonergan’s opus is that the major impediment in theoretical pursuits is the assumption that understanding must be something like picturing. For example, mathematicians who blur understanding with picturing will find it difficult to picture how 0.999… can be exactly 1.000…. Now most adults understand that 1/3 = 0.333…, and that when you triple both sides of this equation, you get exactly 1.000… and 0.999…. But only those who understand that an insight is not an act of picturing but rather an act of understanding will be comfortable with this explanation. Among them are the physicists who understand what Einstein and Heisenberg discovered about subatomic particles and macroastronomical events – it is not by picturing that we know how they function but rather by understanding the data.

Lonergan also notes that philosophers who blur the difference between picturing and the theoretical modes of knowing will be confused about objectivity. When it comes to understanding how the mind knows, they typically picture a thinker in here and reality out there, and ask how one gets from in here to out there – failing to notice that it is not by any picture but by verifying one’s understanding of data that the thinker already knows that he or she really thinks.

GEM’s goal of a theory of cognition, therefore, is not a set of pictures. It is a set of insights into the data of cognitive activities, followed by a personal verification of those insights. In disciplines that study humans, GEM incorporates the moral dimension by addressing how we know values that lead to moral decisions. So, in GEM’s model of the thinking and choosing person, consciousness has four levels – experience of data, understanding the data, judgment that one’s understanding is correct, and decision to act on the resulting knowledge. These are referred to as levels of self-transcendence, meaning that they are the principal set of operations by which we transcend the solitary self and deal with the world beyond ourselves through our wonder and care.

GEM builds on these realizations by the further personal discovery of certain innate norms at each of the four levels. On the level of experience, our attention is prepatterned, shifting our focus, often desultorily, among at least seven areas of interest – biological, sexual, practical, dramatic, aesthetic, intellectual, and mystical. On the level of understanding, our intellects pursue answers to questions of why and how and what for, excluding irrelevant data and half-baked ideas. On the level of judgment, our reason tests that our understanding makes sense of experience. On the level of decision, our consciences make value judgments and will bother us until we conform our actions to these judgments. Lonergan names these four innate norming processes “transcendental precepts.” Briefly expressed, they are: Be attentive, Be Intelligent, Be reasonable, and Be responsible. But these expressions are not meant as formulated rules; they are English words that point to the internal operating norms by which anyone transcends himself or herself to live in reality. GEM uses the term authenticity to refer to the quality in persons who follow these norms.

Any particular rules or principles or priorities or criteria we formulate about moral living stem ultimately from these unformulated, but pressing internal criteria for better and worse. Whether our formulations of moral stances are objectively good, honestly mistaken, or malevolently distorted, there are no more fundamental criteria by which we make moral judgments. Maxims, such as “Treat others as you want to be treated,” cannot be ultimately fundamental, since it is not on any super-maxim that we selected this one. Nor do authorities provide us with our ultimate values, since there is no super-authority to name the authorities we ought to follow. Rather, we rely on the normative criteria of being attentive, intelligent, reasonable and responsible; howsoever they may have matured in us, by which we select all maxims and authorities.

GEM includes many other elements in this analysis, including the roles of belief and inherited values, the dynamics of feelings and our inner symbolic worlds, the workings of bias, the rejection of true value in favor of mere satisfaction, and the commitment to love rather than hate.

3. Epistemology (Objectivity)

GEM may be characterized as a systems approach that correlates the subject’s operations of knowing and choosing to their corresponding objects. Hence it understands objectivity as a correlation between the subject’s intentionality and the realities and values intended. A subject’s intention of objectivity functions as an ideal to be continuously approached. That ideal may be defined as the totality of correct judgments, supported by understanding, and verified in experience. Because our knowledge and values are mostly inherited, objectivity is the intended cumulative product of all successful efforts to know what is truly so and appreciate what is truly good. Clearly, we never know everything real or appreciate everything good. But despite any shortfalls, this principal notion of objectivity – the totality of correct judgments — remains the recurring desire and the universal goal of anyone who wonders. In GEM’s correlation-based, theoretical definition, such objectivity is a progressively more intelligent, reasonable and responsible worldview. Briefly put, an objective worldview is the fruit of subjective authenticity.

Confusion about objectivity may be traced to confusion about knowing. GEM proposes that any investigator who realizes that knowing is a compound of experience, understanding, and judgment may also recognize a persistent tendency to reduce objectivity to only one of these components.

There is an experiential component of objectivity in the sheer givenness of data. In commonsense discourse, we imagine that what we experience through our five senses is really “out there.” But we also may refer to what we think is true or good as really “out there.” Unfortunately, such talk stifles curiosity about the criteria we use to come to this knowledge. Knowing reality is easily reduced to a mental look. Similarly, the notion of moral objectivity collapses into a property of objects, detached from occurrences in subjects, so that we deem certain acts or people as “objectively evil” or “objectively good,” where “objectively” means “out there for anyone to see.” This naiveté about objectivity condenses the criteria regarding the morality of an act to what we picture, overlooking the meanings that the actors attach to the act.

Beyond this experiential component, which bows to the data as “objectively” given, there is a normative component, which bows to the inner norming processes to be attentive, intelligent, reasonable, and responsible. When we let these norms have their way, we raise relevant questions, assemble a coherent set of insights, avoid rash judgments, and test whether our ideas make sense of the data. This normative component is not a property of objects; it is a property of subjects. We speak of it when we say, “You’re not being objective” or “Objectively speaking, I say….” It guards us against wishful thinking and against politicizing what should be an impartial inquiry. Still, while this view incorporates the subject in moral assessments, some philosophers tend to collapse other aspects of objectivity into this subjective normativity. For them, thorough analysis, strict logic, and internal coherence are sufficient for objectivity. They propose their structural analyses not as hypotheses that may help us understand concrete experience correctly but as complete explanations of concrete realities. The morality of an act is determined by its coherence with implacable theory, suppressing further questions about actual cases that fall outside their conceptual schemes.

Beyond the experiential and normative components of objectivity, there is an absolute component, by which all inquiry bows to reality as it is. The absolute component lies in our intention to affirm what is true or good independent of the fact that we happen to affirm it. It is precisely what is absent when what we affirm as real or good is not real or good. The absolute component lies neither in the object alone nor the subject alone but in a linking of the two. It exists when the subject’s normative operations correctly confirm that the given experiential data meet all the conditions to make the judgment that X is so or Y is good. As a correlation between objective data and subjective acts, it corresponds to Aristotle’s understanding of truth as a relation between what we affirm and what really is so. Moralists who collapse knowing into judgment alone typically overlook the conditions set by experience and understanding that make most moral judgments provisional. The result is the dogmatist, out of touch with experience and incapable of inviting others to reach moral judgments by appeal to their understanding.

4. Metaphysics

In popular use, metaphysics suggests a cloud of speculations about invisible forces on our lives. Among philosophers, metaphysics is the science that identifies the basic concepts about the structures of reality. GEM not only identifies basic concepts, but also traces them to their sources in the subject. Thus, concepts issue from insights, and insights issue from questions, and questions have birthdates, parented by answers to previous generations of questions. Moreover, the so-called raw data are already shaped by the questions that occur to an inquirer. These questions, in turn, contain clues to their answers insofar as the insight we expect is related to the kind of judgment we expect. It could be a logical conclusion, a judgment of fact, a judgment that an explanation is correct, or a judgment of value.

Because these complexities of human wonder are part of reality, GEM’s metaphysics encompasses the relationship between the processes that guide our wonder and the realities we wonder about. The assumption is that when they operate successfully, the processes of wonder form an integrated set isomorphic to the integral dimensions of reality. For example, the scientific movement from data to hypothesis to verification corresponds to Lonergan’s view that knowing moves from experience to understanding to judgment, as well as to Aristotle’s view that reality consists of potency, form, and act. In GEM, then, metaphysics comprises both the processes of knowing and the corresponding features of anything that can be known.

This metaphysics is latent but operative before it is conceptualized and named. People who consistently tackle the right question and sidestep the wrong ones already possess latent abilities to discern some structured features of the object of their inquiry. With moral questions, their heuristic anticipations show up as seemingly innate strategies: Don’t chisel your moral principles in stone. Consider historical circumstances. A bright idea is not necessarily a right idea. And so forth.

Eventually, these canny men and women may conceptualize and name their latent metaphysics. Should they ask themselves how they ever learned to discern the difference between good thinking and bad thinking, they may look beneath what they think about and wonder how their thinking works. They may realize what GEM takes as fundamental: Any philosophy will rest upon the operative methods of cognitional activity, either as correctly conceived or as distorted by oversights and mistaken orientations. Then, insofar as they correctly understand their cognitional activity, they may begin to make their latent metaphysics explicit.

In the remainder of this article, some of Lonergan’s metaphysical terms particularly relevant to ethics are highlighted in bold face.

When we expect to understand anything, our insights fall into two classes. We can understand things as they currently function, or we can understand things as they develop over time. Regarding things as they currently function, we may notice that we have both direct insights and “inverse” insights. These correspond to two different kinds of intelligibilities that may govern what we aim to understand. Lonergan’s use of “intelligibility” here corresponds to what Aristotle referred to as “form” and what modern science calls “the nature of.”

A classical intelligibility (corresponding to the “classical” scientific insights of Galileo, Newton and Bacon) is grasped by a direct insight into functional correlations among elements. We understand the phases of the moon, falling bodies, pushing a chair – any events that result necessarily from prior events, other things being equal. A statistical intelligibility is grasped by an inverse insight that there is no direct insight available. But while we often understand that many events cannot be functionally related to each other, we also may understand that an entire set of such events within a specific time and place will cluster about some average. For if any subset of events we consider random varies regularly from this average, we will look for regulating factors in this subset, governed by a classical intelligibility to be grasped through a direct insight. Statistical intelligibility, then, does not regard events resulting necessarily from prior events. It regards sets of events, in place P during time T, resulting under probability from multiple and shifting events.

This distinction affects moral appeals to a “natural law.” For example, those who hold that artificial birth control is morally wrong typically appeal to a direct, functional relationship between intercourse and conception. However, the nature of this relationship is not one conception per intercourse but the probability of one conception for many acts of intercourse – a relationship of statistical intelligibility. If this is the nature of births, then the natural law allows that each single act of intercourse need not be open to conception.

Regarding things as they develop over time, there are two basic kinds of development, again based on the distinction between direct and inverse insights.

A genetic intelligibility is grasped by a direct insight into some single driving factor that keeps the development moving through developmental phases, such as found in developmental models of stars, plants, human intelligence, and human morality. A dialectical intelligibility is grasped by an inverse insight that there is no single driving factor that keeps the development moving. Instead, there are at least two driving factors that modify each other while simultaneously modifying the developing entity.

These anticipations are key to understanding moral developments. Inquiry into a general pattern of moral development will anticipate a straight-line, genetic unfolding of a series of stages. Inquiry into a specific, actual moral development will anticipate a dialectical unfolding wherein the drivers of development modify each other at every stage, whether improving or worsening.

a. Genetic Intelligibility

Genetic intelligibility is what we expect to grasp when we ask how new things emerge out of old. In this perspective, the metaphysical notion of potency takes on a particularly important meaning for ethics. Potency covers all the possibilities latent in given realities to become intelligible elements of higher systems. What distinguishes creative thinkers is not just their habit of finding uses in things others find useless. They expect that nature brings about improvements even without their help as, for example, when floating clouds of interstellar dust congeal into circulating planets or when damaged brains develop alternate circuits around scar tissue.

In this universe characterized by the potency for successive higher systems, the field of ethics extends to anything we can know. Hence, the “goodness” of the universe lies partly in its potentials for more intelligible organization. Human concern is an instance, indeed a most privileged instance, of a burgeoning universe. A sense of this kind of finality commands respect for whatever naturally comes to be even if no immediate uses come to mind.

An ethics whose field covers universal potentials will trace how morality is about allowing better. It means allowing not only the potentials of nature to reveal themselves but also a maximum freedom to the innate human imperative to do better. It means thinking of any moral option as essentially a choice between preventing and allowing the exercise of a pure desire for the better. Thus, the work of moral living is largely preventive – preventing our neurotic fixations or egotism from narrowing our horizons, preventing our loyalties from suppressing independent thinking, or preventing our mental impatience from abandoning the difficult path toward complete understanding. The rest feels less like work and more like allowing a natural exuberance to a moral creativity whose range has not been artificially narrowed by bias.

In contrast, a commonsense view of the universe imagines only the dimensions studied by physicists. The rule is simple: Any X either does or does not exist. Without this rule, scientists could never build up knowledge of what is and what is not. However, in cases like ourselves, where the universal potency for higher forms has produced responsible consciousness, this rule does not cover all possibilities. We also make the value judgments that some Xs should or should not exist. To recognize that the universe produces normative acts of consciousness is to recognize that the universe is more than a massive factual conglomeration. It is a self-organizing, dynamic and improving entity. Its moral character emerges most clearly with us, in raising moral objections when things get worse, in anticipating that any existing thing may potentially be part of something better, and, sadly, in acting against our better judgment.

Another key metaphysical element within the dynamism of reality toward fuller being is the notion of development. GEM rejects the mechanist view that counts on physics alone to explain the appearance of any new thing. It also rejects the vitalist view that pictures a wondrous life force driving everything from atoms, molecules, and cells, to psyches, minds and hearts. The reality of development, particularly moral development, involves a historical sequence of notions about better and worse. We inherit moral standards, subtract what we think is nonsense and add what we think makes sense. Our inheritance is likewise a sum of our previous generation’s inheritance, what they subtracted from it and added to it. Any moral tradition is essentially a sequence of moral standards, each linked to the past by an impure inheritance and to the future by the bits added and subtracted by a present generation.

Not every tradition is a morally progressing sequence, of course, but those that make progress alternate between securing past gains and opening the door to future improvements. GEM names the routines that secure gains a higher system as integrator. It names the routines within the emerged system that open the door to a better system a higher system as operator. Within a developing moral tradition, value judgments perform the integrator functions, while value questions perform the operator functions. The integrating power of value judgments will be directly proportional to the absence of operator functions — specifically, any further relevant value questions. So we regard some values as rock solid because no one has raised any significant questions about them. Value judgments that are provisional will function as limited integrators – limited, to be exact, to the extent that lingering value questions function as operators, scrutinizing value judgments for factual errors, misconceived theories, or bias in the investigator.

Feelings may function as either operators or integrators. As operators, they represent our initial response to possible values, moving us to pose value questions. As integrators they settle us in our value judgments as our psyches link our affects to an image of the valued object. Lonergan names this linkage of affect and image a symbol. (This is a term that identifies an event in consciousness; it is not to be confused with the visible flags and icons we also call “symbols.”) The concrete, functioning symbols that suffuse our psyches can serve as integrator systems for how we view our social institutions, various classes of people, and our natural environment, making it easy for us to respond smoothly without having to reassess everything at every moment. Symbols can also serve as operators insofar as the affect-image pair may disturb our consciousness, alerting us to danger or confusion, and prompting the questions we pose about values.

Although the operators that improve a community’s tradition involve the questions that occur to its members, not all questions function as operators. Some value questions are poorly expressed, even to ourselves. We experience disturbing symbols, but have yet to pose a value question in a way that actually results in a positive change. Some value questions are posed by biased investigators, which degrade a community’s moral heritage. Only those individuals who pose the questions that actually add values or remove disvalues will function as operators in an improving tradition. What makes any tradition improve, then, is neither the number of cultural institutions, nor governmental support of the arts, nor legal protections for freedom of thought, nor freedom of religion. These support the operators, and need to be regulated as such. But the operators themselves are the questions raised by the men and women who put true values above mere satisfactions.

The same alternating dynamic is evident in the moral development of an individual. While psychotherapists expect that an individual’s age is not a reliable measure of moral maturity, those who understand development as an alternation of operators and integrators may pose their questions about a patient’s maturity much more precisely: How successfully did this person meet the sequence of operator questions at turning points in his or her life? And what are the resultant integrator symbols guiding this person today? Similarly, in theories of individual development, what counts is what the operators may be at any stage. Where some theorists only describe the various stages, GEM looks for an account of a prior stage as integrator that connects directly to the operator questions to which an emerging stage is an answer.

b. Dialectical Intelligibility

The foregoing genetic model of development gives a gross view of stages and a first approximation to actual development. But actual development is the bigger story. Who we are is a unique weaving of the mutual impacts of external challenges and our internal decisions. So we come to the kind of intelligibility that accounts for concrete historical growth or decline – dialectical intelligibility. We expect this kind of understanding when we anticipate a tension among drivers of development and changes in these very drivers, depending on the path that the actual development takes.

Friendship, for example, has been compared to a garden that needs tending, but the analogy is misleading. What we understand about gardens falls under genetic intelligibility. Seeds will produce their respective vegetables, fruits or flowers; all we do is provide the nutrients. In a friendship, however, each partner is changed with each compromise, accommodation, resistance or refusal. So the inner dynamic of any friendship is a concrete unfolding of two personalities, each linked to the other yet able to oppose the other.

A community, too, is a dialectical reality. Its members’ perceptions, their patterns of behavior, their ways of collaborating and disputing, and all their shared purposes are the concrete result of three linked but opposed principles: their spontaneous intersubjectivity, their practical intelligence, and their values.

Spontaneous Intersubjectivity: Our spontaneous needs and wants constitute the primitive, intersubjective dimensions of community. We nest; we take to our kind; we share the unreflective social routines of the birds and bees, seeking one particular good after another.

Practical Intelligence: We also get insights into how to meet our needs and wants more efficiently. We design our houses to fit our circumstances and pay others to build them. In exchange, others pay us to make their bread, drive them to work, or care for their sick. Here is where the intelligent dimensions of a community emerge, comprising all the linguistic, technological, economic, political and social systems springing from human insight that constitute a society.

Values: Where practical intelligence sets up what a community does, values ground why they do it. Here is where the moral dimensions of community emerge – the shoulds and should-nots conveyed in laws, agreements, education, art, public opinion and moral standards. They embody all the commitments and priorities that constitute a culture.

These three principles are linked. Spontaneously, we pursue the particular goods that we need or want. Intellectually, we discover the technical, economic, political and social means to ensure the continuing flow of these particular goods, and we adapt our personal skills and habits to work within these systems. Morally, we decide whether the particular goods and the systems that deliver them actually improve our lives. Yet the principles are forever opposed. Insight often suppresses the urges of passion, while passion unmoored from insight would carry us along its undertow. Conscience, meanwhile, passes judgment on both our choices of particular goods and the systems we set up to keep them coming.

A dialectical anticipation regards a community as a moving, concrete resultant of the mutual conditioning of these three principles. When spontaneous intersubjectivity dominates a community, its members’ intellects are deformed by animal passion. When practical intelligence ignores spontaneous intersubjectivity, a society becomes stratified into an elite with its grand plans and a proletariat living from hand to mouth. Where members prefer mere satisfactions over values, intelligences are biased, and deeper human needs for authenticity are ignored. In any case, communities move, pushed and pulled by these principles, now converging toward, now diverting away from genuine progress.

c. Radical Unintelligibility

The idea of development implies a lack of intelligibility, namely, the intelligibility yet to be realized. Likewise, there is a lack of intelligibility in the distorted socio-cultural institutions and self-defeating personal habits that pose the everyday problems confronting us. Yet even these are intelligibly related to the events that created them.

What lacks intelligibility it itself, however, is the refusal to make a decision that one deems one ought to make. GEM follows the Christian tradition of the apostle Paul, of Augustine, and of Aquinas in recognizing the phenomenon that we can act against our better judgment. This tradition is aware that much wrongdoing results from coercion, or conditioning, or invincible ignorance, but it asserts nonetheless that we can refuse to choose what we know is worth choosing. Lonergan refers to these events as “basic sin” to distinguish them from the effects of such refusals on one’s socio-cultural institutions and personal habits. Their unintelligibility is radical, in the sense that a deliberate refusal to obey a dictate of one’s deliberation cannot be explained, even if, as often happens, later deliberation dictates something else. It is radical also in the etymological sense of a root that branches into the actions, habits and institutions that we consider “bad.”

5. Methodology

Different media subdivide ethics in different ways. News media divide it according to the positions people take on moral issues. Many college textbooks divide it into three related disciplines: metaethics (methods), normative ethics (principles), and applied ethics (case studies). This division implies that we first settle issues of method, then establish general moral principles, and finally apply those principles straightaway into practice. GEM proposes that moral development is not the straight line of genetic development nourished solely by principles but rather a dialectical interplay of spontaneous intersubjectivity, practical intelligence, and values. So, instead of a deductive, three-step division of moral process, GEM expects moral reflection to spiral forward inductively, assessing new situations with new selves at every turn. The question then becomes how ethicists might collaborate in wending the way into the future.

In his Method in Theology, Lonergan grouped the processes by which theology reflects on religion into eight specializations, each with functional relationships to the other seven. As illustrated in the chart below, the four levels of human self-transcendence – being attentive, intelligent, reasonable, and responsible – function in the two phases of understanding the past and planning for the future. Thus, we learn about the past by moving upward through research, interpretation, history, and a dialectical evaluation. We move into the future by moving downward through foundational commitments, basic doctrines, systematic organizations of doctrines, and communication of the resulting meanings and values. Our future slips into our past soon enough, and the process continues, turn after turn, reversing or advancing the forces of decline, meeting ever new challenges or buckling under the current ones.

While Lonergan presented this view primarily to meet problems in theology, he extended the notion of functional specialties to ethics, historiography and the human sciences by associating doctrines, systematics, and communications with policies, plans and implementations, respectively. These eight functional specialties are not distinct professions or separate university departments. They represent Lonergan’s grouping of the operations of mind and heart by which we actually do better. That is, he is not suggesting a recipe for better living; he is proposing a theoretical explanation of how the mind and heart work whenever we actually improve life, along with a proposal for collaboration in light of this explanation.

lonergan-fig

The bottom three rows of functions will be initially familiar to anyone involved in practically any enterprise. The top row of functions is less familiar, but it represents Lonergan’s clarification of the evaluative moments that occur in any collaboration that improves human living.

The functional specialty dialectic occurs when investigators explicitly sort out and evaluate the basic elements in any human situation. They evaluate the data of research, the explanations of interpreters, and the accounts of historians. To ensure that all the relevant questions are met, they bring together different people with different evaluations with a view to clarifying and resolving any differences that may appear.

From a GEM perspective, the most radical differences result from the presence or absence of conversion. Three principal types have been identified. There is an intellectual conversion by which a person has personally met the challenges of a cognitional theory, an epistemology, a metaphysics, and a methodology. There is a moral conversion by which a person is committed to values above mere satisfactions. And there is an affective conversion by which a person relies on the love of neighbor, community, and God to heal bias and prioritize values.

By attending to these radical differences, GEM rejects the typical liberal assumption that (1) people always lie, cheat and steal; (2) realistically, nothing can be done about these moral shortcomings; and (3) social institutions can do no more than balance conflicting interests. This assumption constricts moral vision to a pragmatism that may look promising in the short run but fails to deal with the roots of moral shortcomings in the long run. Dialectic occurs when investigators explicitly deal with each other’s intellectual, moral and affective norms, under the assumption that converted horizons are objectively better than unconverted horizons.

The functional specialty foundations occurs when investigators make their commitments and make them explicit. Relying on the evaluations and mutual encounters that occur in the specialty, dialectic, investigators deliberately select the horizons and commitments upon which they base any proposed improvements. These foundations are expressed in explanatory categories insofar as investigators make explicit their latent metaphysics and the horizons opened by their intellectual, moral and affective conversions.

Regarding ethics, investigators use a number of categories to formulate ethical systems, to track developments, to propose moral standards, and to express specific positions on issues. By way of illustration below, there are six sets of categories that seem particularly important: (1) action, concepts and method, (2) good and bad, (3) better and worse, (4) authority and power, (5) principles and people, and (6) duties and rights.

While commonsense discourse uses these terms descriptively, GEM’s theoretical approach defines them as correlations between subjective operations and their objective correlatives. An ethics based on GEM assumes that if science is to take seriously the data of consciousness, then it is necessary to deal explicitly with the normative elements that make consciousness moral. Because these subjective operations include moral norms and because their objective correlatives involve concrete values, the categories will not be empirically indifferent. Their power to support explanations of moral situations and proposals will derive from normative elements in their definitions, which, in turn are openly grounded in the innate norms to be attentive, intelligent, reasonable, and responsible.

6. Categories

a. Action, Concepts, and Method

Interest in method may be considered as a third plateau in humanity’s progressive enlargement of what has become meaningful.

  • A first plateau regards action. What is meaningful is practicality, technique, and palpable results.
  • A second plateau regards concepts. What counts are the language, the logic, and the conceptual systems that give a higher and more permanent control over action.
  • The third plateau regards method. As modern disciplines shift from fixed conceptual systems to the ongoing management of change, the success of any conceptual system depends on a higher control over its respective methods.

Morality initially regards action, but it has expanded into a variety of conceptual systems under the heading of ethics. It is these systems, and their associated categories, which are the focus of the third-plateau methodological critique. On the third plateau, concepts lose their rigidity. As long as investigators are explicit about their cognitional theory, epistemology and metaphysics, they will continually refine or replace concepts developed in previous historical contexts.

Although the second plateau emerged from the first and the third is currently emerging from the second, GEM anticipates that any investigator today may be at home with action only, with both action and concepts, or with action, concepts, and method. The effort of foundations is for investigators to include all three plateaus in their investigations. The effort of dialectic is to invite all dialog partners to do the same.

b. Good and Bad

Where second-plateau minds would typically name things good or bad insofar as they fall under preconceived concepts such as heroism or murder, liberation or oppression, philanthropy or robbery, third-plateau minds look to concrete assessments of situations. To ensure that this assessment is sufficiently grounded in theory, GEM requires an understanding of certain correlations between intentional acts and their objects. This requires more than a notional assent to concepts; it requires personally verified insight into what minds and hearts intend and how they intend it.

The relevant correlations that constitute anything called bad or good may be viewed according to the three levels of intentionality that dialectically shape any community. (1) Spontaneously, our interests, actions and passions intend particular goods. (2) Intelligently and reasonably, our insights and judgments intend the vast, interlocking set of systems that give us these particular goods regularly. (3) Responsibly and affectively, our decisions and loves intend what is truly worthwhile among these particular goods and the systems that deliver them.

In authentic persons, affectivity and responsibility shape reasonable and intelligent operations, which in turn govern otherwise spontaneous interests, actions and passions. This hierarchy in intentionality correlates with a priority of cultural values over social systems, and social systems over the ongoing particular activities of a populace. Thus, GEM regards human intelligence and reason as at the service of moral and affective orientations. This turns upside down the view of “materialistic” economic and educational institutions that dedicate intelligence and reason to serving merely spontaneous interests, actions, and passions.

At the same time, moral and affective orientations rely on intelligent and reasonable analyses of situations to produce moral precepts – an approach that contrasts with ethics that look chiefly to virtue and good will for practical guidance. Lonergan demonstrated how intelligent and reasonable analyses produce moral precepts in his works on the economy (Macroeconomic Dynamics: An Essay in Circulation Analysis) and on marriage (“Finality, Love, Marriage”).

So GEM regards the concepts of good and bad as useful for expressing moral conclusions, provide they are rooted in intelligent analysis, dialectical encounter, and personal conversion. GEM relies on dialectical encounter to expose the oversights when “good” and “bad” are used to categorize actions in the abstract.

c. Better and Worse

The complexities of one’s situation involve not only its history, but the views of history embraced by its participants. Darwinian, Hegelian and Marxist views of history are largely genetic, insofar as they support the liberal thesis that life automatically improves, and that wars, disease, and economic crashes are necessary steps in the forward march of history. GEM declares an end to this age of scientific innocence. It regards this thesis of progress as simply a first of three successively more thorough approximations toward a full understanding of actual situations. A second approximation takes in the working of bias and the resulting dynamics of historical decline. A third approximation takes in the factors of recovery by which bias and its objective disasters may be reversed.

First Approximation: What drives progress. We experience a situation and feel the impulse to improve it. We spot what’s missing, or some overlooked potentials. We express our insight to others, getting their validation or refinement. We make a plan and put it into effect. The situation improves, bringing us back to feeling yet further impulses to improve things. The odds of spotting new opportunities grow as, with each turn of the cycle, more and more of what doesn’t make sense is replaced by what does. Such is the nature of situations that improve.

Second Approximation: What drives decline. Again, we experience a situation and an impulse to improve it. But we do not, or will not, spot what’s missing. We express our oversight to others, making it out to be an insight. If they lack any critical eye, they take us at our word rather than notice our oversight. We make a plan, put it into effect, and discover later the inevitable worsening of the situation. Now the odds of spotting ways to improve things decrease, owing to the additional complexity and cross-purposes of the anomalies. With each turn of the cycle, less and less makes sense. Such is the nature of situations that worsen.

Lonergan proposed that such oversights might be rooted in any of four biases endemic to consciousness: (1) Neurosis resists insight into one’s psyche. (2) Egoism resists insight into what benefits others. (3) Loyalism resists insights into the good of other groups. (4) Anti-intellectualism resists insights that require any thorough investigation, theory-based analyses, long-range planning, and broad implementation. In each type, one’s intelligence is selectively suppressed and one’s self-image is supported by positive affects that reinforce the bias and by negative affects toward threats to the bias.

Third Approximation: What drives recovery. GEM offers an analysis of love to show how it functions to reverse the dynamics of decline.

  • Love liberates the subject to see values: Some values result not from logical analyses of pros and cons but rather from being in love. Love impels friends of the neurotic and egoist to draw them out of their self-concern, freeing their intelligence to consider the value of more objective solutions. Love of humanity frees loyalists to regard other groups with the same intelligence, reason and responsibility as they do their own. Love of humanity frees the celebrated person of common sense to appreciate the more comprehensive viewpoints of critical history, science, philosophy and theology. Love of a transcendent, unreservedly loving God frees a person from blinding hatred, greed and power mongering, liberating him or her to a divinely shared commitment to what is unreservedly intelligible, reasonable, responsible and loving.
  • Love brings hope: There is a power in the human drama by which we cling to some values no matter how often our efforts are frustrated. Our hopes may be dashed, but we still hope. This hope is a desire rendered confident by love. Those who are committed to self-transcendence trust their love to strengthen their resolve, not only to act against the radical unintelligibility of basic sin, but also to yield their personal advantage for the sake of the common good. Such love-based hope works directly against biased positive self-images as well as negative images of fate that give despair the last word. To feel confident about the order we hope for, we do not look to theories or logic. We rely on the symbols that link our imagination and affectivity. These inner symbols are secured through the external media of aesthetics, ritual, and liturgy.
  • Love opposes revenge: There is an impulse in us to take an eye for an eye, a tooth for a tooth. While any adolescent can see that this strategy cannot be the foundation of a civil society, it is difficult to withhold vengeance on those who harm us. It is the nature of love, however, to resist hurting others and to transcend vengeance. It is because of such transcendent love that we move beyond revenge to forgiveness and beyond forgiveness to collaboration.

GEM’s perspective on moral recovery aims to help historians and planners understand how any situation gets better or worse. It helps historians locate the causes of problems in biases as opposed to merely deploring the obvious results. It helps planners propose solutions based on the actual drivers of progress and recovery, as opposed to mere cosmetic changes.

d. Authority and Power

Common sense typically thinks of authority as the people in power. GEM roots the meaning of authority in the normative functions of consciousness and defines the expression of authority in terms of legitimate power.

An initial meaning of power is physical, and physical power is multiplied by collaboration. But in the world of social institutions, a normative meaning of power emerges – the power produced by insights and value judgments. Insights are expressed in words; words raise questions of value; judgments of value lead to decisions; decisions result in cooperation; and this kind of cooperation vastly reduces the physical power needed while achieving vastly better results. The social power of a community grows as it consolidates the gains of the past, restricts behaviors that would diminish the community’s effectiveness, organizes labors for specific tasks, and spells out moral guidelines for the future. As normative, the memory and commitments involved in this heritage constitute a community’s “word of authority.”

The community appoints “authorities” to implement these tasks. Authorities are the spokespersons, delegates, and caretakers of a community’s spiritual and material assets. Winning the vote does not confer an authority upon them; it confers a responsibility upon them to speak and embody the community’s word of authority. The honor owed to them by titles and ceremony does not derive from any virtue of their persons but rather from the honorable heritage and common purpose with which they have been entrusted.

While the community’s social power resides in its ways and means, not all its ways and means are legitimate. A community’s heritage is a mixed bag of sense and nonsense. To the extent that authorities lack the authenticity of being attentive, intelligent, reasonable and responsible, their power to build up is diminished. Even if everyone does what they say, inauthentic authorities will be blind to the higher viewpoints and better ideas needed to stave off chaos and seize opportunities for improving life together. Their power is justifiably called naked because it is stripped of the intelligent, reasonable, and responsible contributions their subjects are quite capable of making. Similarly, to the extent that the subjects lack authenticity, they will cripple their own creativity, which otherwise would foresee problems, overcome obstacles, and open new lines of development. At the extremes, a noble leader of egotistical followers has no more effective power than an egotistical leader of noble followers. Between these extremes, the typical dynamic is an ongoing dialectic between an incomplete authenticity of the community and an incomplete authenticity of its authorities.

In this concrete perspective, GEM defines authority as power legitimated by authenticity. That is, authority is that portion of a heritage produced by attention, intelligence, reason, and responsibility. As only a portion of a heritage, authority is a dialectical reality, to be worked out in mutual encounter, rather than a dictatorial iron law (a classical reality), an anarchical or libertarian social order (a statistical reality), or a natural, evolutionary dynasty (a genetic reality).

This definition of authority as the power legitimated by authenticity offers historians defensible explanations for their distinctions between legitimate and illegitimate exercises of power within a historical period. It offers policymakers the normative categories they need to explain to their constituents the reasons for proposed changes in the community’s constitution, laws, and sanctions. It reminds authorities that they have been entrusted with the maintenance and refinement of a heritage created by the community.

e. Principles and People

A commonsense use of “moral principles” usually means any set of conceptualized standards, such as, “The punishment should fit the crime” or “First, do no harm.”

When ethicists consider how moral principles should be used, disagreements arise. Some scorn them because principles are only abstract generalizations that do not apply in concrete situations. When we try to apply them, disputes arise about the meaning of terms such as “crime” or “harm.” Particular cases always require further value judgments on the relative importance of mitigating factors, which generalizations omit. What counts is a thorough assessment of the concrete situation, which will result in an intuition of what seems best.

Others reject such situation-based ethics because people have different intuitions about what seems best in particular situations. What is needed is a general principle that supports the common good. Moreover, history proves that formulated principles are good things. Because they represent wisdom gained by others who met threats to their well being, to neglect them is to unknowingly expose oneself to the same threats. We codify principles in our laws, appeal to them in our debates, and teach them to our children. For children in particular, and for adults whose moral intelligence has not matured, principles are firm anchors in a stormy sea.

GEM regards principles as concepts that need the critique of a third-plateau reflection on the methods used to develop them. They are not really principles in the sense of starting points. That is, they are not the source of normative demands. The actual sources of normative demands are self-transcending people being attentive, intelligent, reasonable, and responsible. Formulated principles are the products of people shaped by an ambiguous heritage, exposed to a dialectic of opinions, and directed by personal commitments within intellectual, moral and affective horizons. These horizons may complement each other; they may develop from earlier stages; or they may be dialectically opposed, as when people who mouth the same principles attach opposite meanings to them, or when people espouse the principle but act otherwise.

GEM grants no exception for moral principles proposed by religions. A religious revelation is considered neither a delivery from the sky of inscribed tablets nor a dictation heard from unseen divinities. In its data of consciousness perspective, GEM considers revelation as a person’s judgment of value regarding known proposals, whether inscribed or spoken or imagined. Its religious sanction is based on a person’s claim that this judgment is prompted by a transcendent love from a transcendent source in his or her heart.

Those who formulate specific moral principles need to understand that there are distinct methodological issues associated with each of the eight specialties that form a group in consciousness. This understanding begins with men and women who think about their intellectual, moral and affective commitments in explanatory categories (foundations). It is first expressed in these categories as judgments of fact or value (doctrines/policies). It expands through understanding the relationships these principles have with other principles (systematics/planning). It becomes effective thorough adaptations that take into account the current worldview of a community, the media used, and the values implicit in the community’s language (communications/implementation). These adaptations become data (research) for further understanding (interpretation) within historical contexts (history) to be evaluated (dialectic.)

GEM’s strategy for resolving differences among principles is to exercise the functional specialty dialectic to reveal their true source. Investigators evaluate not only the historical accounts of how any principle arose, but also the principle itself. GEM proposes that where investigators overcome disagreements, the parties have lain open their basic horizons, particularly the intellectual, moral and affective horizons that reveal the radical grounds of disagreements and agreements. In this mutual encounter, people concerned about morality are already familiar with normative elements in their consciousness and may only lack the insights and language to make them intelligible parts of how they present their views. The strategy is not to prove one’s principle or disprove another’s but to tap one another’s experience of a desire for authenticity. GEM counts on the probability that those people with more effective intellectual, moral and affective horizons will, by laying bare the roots of any differences, attract and guide those whose horizons are less effective.

Besides people who appreciate authenticity, there are people who crave its opposite, as the history of hatred amply demonstrates. If GEM has accurately identified the dialectic of decline as driven by an increasingly degraded authenticity, with its increasingly narrow and unconnected solutions to problems, then the reversal of moral evil must appeal to any remnants of authenticity in the hater. The appeal involves enlargements of horizons at many levels. For communities of hatred, this enlargement will require moving from legends about their heritage to a critical history, revising the rhetoric and rituals that secure commitment, and rewriting their laws. At the same time, there is also an enlargement to be expected of the communities who seek to convert communities of hatred. This is because more comprehensive political protocols and moral standards will be required to achieve a yet higher integration of those portions of both heritages that resulted from authenticity.

f. Duties and Rights

In the perspective of GEM, the elemental meaning of duty is found in the originating set of “oughts” in the impulses to be attentive, intelligent, reasonable, and responsible, plus the overriding “ought” to maintain consistency between what one knows and how one acts. The oughts issued by conscience not only provide all the norms expressed in written rules, but also issue far more commands and prohibitions than parents, police, and public policy ever could. It is this inner duty that enables one to break from a minor authenticity that obeys the written rule and to exercise a major authenticity that may expose a written rule as illegitimate.

At first glance, the GEM view of morality may appear sympathetic to “deontological” theories that base all moral obligation on duty rather than consequences. While it is true that GEM traces all specific obligations to an underlying, universal duty, it goes deeper than concept-based maxims by identifying the dynamic originating duty in every person to be attentive, intelligent, reasonable and responsible. By tracing the source of any maxims about duty to their historical origins, GEM leaves open the possibility that new historical circumstances may require new maxims.

Moreover, insofar as any formulations of duty are consequences of past historical situations, and as new formulations will be consequences of new situations, GEM supports the consideration of consequences in ethical theory. What this approach adds, however, is the requirement that all consequences pass under the scrutiny of dialectic, which aims to filter merely satisfying consequences from the truly valuable, and to consider how specific consequences contribute to historical progress, decline, or recovery. These consequences include not only changes in observable behaviors and social standards but also any shifts in the intellectual, moral and affective horizons of a community.

As adults juggle their customary duties to social norms and their originating duty to be authentic, many discover that the best parts of these social norms arose from the authenticity of forebears. With this discovery comes a recognition of a present duty to preserve those portions of one’s heritage based on authenticity, to critique those portions based on bias, and to create the social and economic institutions that facilitate authenticity.

Lonergan depicted such preservation, critique, and creativity as an ongoing experiment of history. The success of the race, and of any particular peoples, depends on collaborative efforts to conduct this experiment rather than serve as its guinea pigs. Collaboration, in turn, requires authenticity of all collaborators.

Any collaboration that successfully makes life more intelligible will require a freedom to speak one’s mind, to associate, to maintain one’s health, and to be educated. The notion of human rights, therefore, is a derivative of this intelligibility intrinsic to nourishing a heritage. While “rights” usually appear as one-way demands by one party upon others, their essential meaning is that they are expressions of the mutual demands intrinsic to any collaborative process aimed at improving life. Any individual’s claim in the name of rights is essentially an assumption that others will honor his or her duty to contribute to the experiment to improve a common heritage.

Conflicts of rights are often the ordinary conflicts involved in any compromise. More seriously, they may be differences between plateaus of meaning among a community’s members. First-plateau minds, focused on action, will think of rights as the behaviors and entitlements that lawmakers allow to citizens. Many will conclude that they have a right to do wrong. In contrast, GEM views lawmakers as responsible for protecting the liberty of citizens to live authentically. Thus, while the law lets every dog have a free bite, GEM repudiates the conclusion that anyone has a right to do wrong.

Second-plateau minds promote the ancient and honorable notion that rights are a set of immutable, universal properties of human nature. GEM considers that the strength of the modern notion of rights has been based mainly on logical consistency and permanent validity. However, from the methods perspective of the third plateau of meaning, GEM also recovers elements in the ancient notion of natural right that include personal authenticity and defines these elements in terms of personal conversion. On that basis, GEM proposes a collaborative superstructure driven by the functional specialties, dialectic and foundations.

In any case, GEM considers rights as historically conditioned means for authentic ends. As historically conditioned means, rights may take any number of legal and social forms. So, for example, the historical expansion from civil rights (speech, assembly, suffrage) to social rights (work, education, health care), to group rights (women, homosexuals, ethnic groups) is evidence of the ongoing emergence of new kinds of claims on each other’s duty to replenish a heritage. As oriented toward authentic ends, the validity of any rights claim depends on how well it enables authentic living, a question addressed through the mutual exposures that occur in the functional specialty dialectic. Consequently, ethicists familiar with GEM rely less on the language of rights and more on the language of dialog, encounter, and heritage.

7. Summary

A generalized empirical method in ethics clarifies the subject’s operations regarding values. The effort relies on a personal appropriation of what occurs when making value judgments, on a discovery of innate moral norms, and on a grasp of the meaning of moral objectivity. These innate methods of moral consciousness are expressed in explanatory categories, to be used both for conceptualizing for oneself what occurs regarding value judgments and for expressing to others the actual grounds for one’s value positions.

GEM is based on a gamble that the odds of genuine moral development are best when the players lay these intellectual, moral and affective cards on the table. Concretely, this implies a duty to acknowledge the historicity of one’s moral views as well as a readiness to admit oversights in one’s self-knowledge. Moreover, given the proliferation of moral issues that affect confronting cultures with different histories today, it also implies a duty to meet the stranger in a place where this openness can occur.

8. References and Further Reading

a. Main Works of Lonergan

  • Insight: A Study of Human Understanding. Volume 3 of the Collected Works of Bernard Lonergan. Toronto: University of Toronto Press, 1997. Originally published 1957.
  • Method in Theology. New York: Herder & Herder, 1972.
  • “Cognitional Structure,” Collection. Montreal: Palm, 1967, pp 221-239.
  • “Dimensions of Meaning,” Ibid., pp 252-267.
  • “The Subject,” A Second Collection. London: Darton, Longman & Todd, 1974, pp. 69-87.
  • Macroeconomic Dynamics: An Essay in Circulation Analysis. Volume 15 of the Collected Works of Bernard Lonergan. Toronto: University of Toronto Press, 1999.

b. Shorter Works Relevant to Ethics

  • “Finality, Love, Marriage.” Collection, op. cit., pp 16-55.
  • “The Example of Gibson Winter,” A Second Collection, op. cit., pp 189-192.
  • “The Dialectic of Authority,” A Third Collection. New York: Paulist Press, 1985, pp 5-12.
  • “Method: Trend and Variations,” ibid., pp 13-22.
  • “Healing and Creating in History,” ibid., pp. 100-109.
  • “The Ongoing Genesis of Methods,” ibid., pp. 146-165.
  • “Natural Right and Historical Mindedness,” ibid., pp. 169-183.
  • “Lectures on Existentialism,” Part Three of Phenomenology and Logic: The Boston College Lectures on Mathematical Logic and Existentialism, Volume 18 of the Collected Works of Bernard Lonergan, op.cit., pp. 219-317.

c. Other Works

  • Melchin, Kenneth R. Living with Other People. Ottawa: St. Paul University Press, 1998.
  • Morelli, Mark D. and Morelli, Elizabeth A. The Lonergan Reader. Toronto: University of Toronto Press, 1997.

Author Information

Tad Dunne
Email: tdunne@sienaheights.edu
U. S. A.

The Lyceum

The Lyceum was a gymnasium near Athens and the site of a philosophical school founded by Aristotle.

Table of Contents

  1. Location, Structures, and Layout of the Lyceum
    1. Apodyterion
    2. Dromoi and Peripatoi
    3. Gymnasium Building
    4. Palaistra
    5. Sanctuaries
    6. Seats
    7. Stoas
    8. Trees and Streams
  2. History of the Use of the Lyceum
  3. References and Further Reading

1. Location, Structures, and Layout of the Lyceum

Archaeological exploration of the topography of the Lyceum has been hampered by the sprawl of buildings in modern Athens. The general location of the Lyceum outside and East of the ancient city wall is well-attested (Strabo 9.1.24, Cleidemus, FGrH 323F18, and Pausanias 1.19.3). Ancient literary and epigraphic sources and modern archaeological investigation provide an occasional glimpse into the layout and use of the Lyceum area in antiquity. While most often connected with philosophical teaching and discourse, the Lyceum was used for military exercises, meetings of the Athenian assembly, and cult practice as well as athletic training.

This multiplicity of use had a direct impact on the types of structures in the area and on the general development of the Lyceum. From the earliest times, the area was characterized by large open spaces and shady groves of trees, bounded roughly by the Ilissos river to the south and the Eridanus river and Mt. Lykabettos to the north. A series of roads led to the Lyceum from in and around the city. From the sixth century BC to the sixth century AD the area saw ever increasing numbers of buildings constructed to serve its multiple functions.

Some literary references to the Lyceum give a fuller picture. For example, in the first lines of Plato’sLysis, Socrates is walking along a road from the Academy to the Lyceum that ran under the city wall when he meets his friends Hippothales and Ktesippos near the Panops springhouse (Lysis203a-204a). This springhouse may be the one mentioned by Strabo (9.1.19), who adds that these springs were “near the Lyceum.” Strabo also tells us that the Ilissus river flowed down “from above Agrai and the Lyceum” (9.1.24). In addition, Xenophon records that during a raid by the Spartans against the city from their encampment at Dekeleia to the East of the city, the Athenians came out and drew up their troops “immediately near the Lyceum gymnasium” (Hellenica 1.1.33).

Recent excavations by the Greek Archaeological Service in the area of modern Syntagma have revealed that the area immediately to the East of the ancient city wall was filled with ancient cemeteries and factories, and an immense bathing complex of the Roman period. In addition, sections of a broad, ancient road running East -West through this area have been uncovered. These finds merely add to the list of similar buildings, baths, and graves previously found in the Syntagma area.

In 1996 excavations in the area of modern Rigillis Street uncovered a structure that has been identified by the excavator as a palaistra in the Lyceum. The site continues to be excavated and studied and has not yet been fully published.

In sum, the ancient literary, epigraphic, and archaeological evidence indicates that the area known as the “Lyceum” probably covered a large area to the East of the ancient city wall, but was not immediately adjacent to the wall. It may have begun just to the West of the modern Amalias Blvd. and continued East through the modern National Gardens with the Olympieion and Ilissos river forming its southern boundaries; it may have extended northward as far as modern Kolonaki plateia. If further excavation at the site near Rigillis St. confirms the excavator’s assertions that the ancient buildings there were located in the Lyceum, then we may at least have an indication of the eastern extent of the gymnasium area.

A number of different types of construction are mentioned in the literary and epigraphic sources as being in the Lyceum: an apodyterion (dressing room), dromoi (roads or running tracks) andperipatoi (walks), a gymnasium building, and a palaistra (wrestling school), cult sanctuaries, seating areas, and stoas. Irrigation channels were constructed to keep the area green and wooded.

a. Apodyterion

Literally an “an undressing room.” The building in which the scene of Plato’s Euthydemus (272e-273b) begins. This structure may be either a part of the gymnasium building or the palaistra, or it may have stood independently to serve the covered dromos (racetrack) mentioned in the passage.

b. Dromoi and Peripatoi

Dromoi and Peripatoi: The sources refer to two different types of dromoi: 1) the main East-West road leading up to the city wall through the Lyceum (Xenophon, Hellenica 2.4.27, Xenophon, The Cavalry Commander 3.6, Callimachus, fr. 261 Pfeiffer), and 2) the running tracks (Plato,Euthydemus 272e-273b). Stretches of the main road, which ran through modern Syntagma square and past the modern Parliament building parallel to the present Vasilis Sophias Blvd. have been uncovered and should be the main dromos to which our sources refer. Excavation has not produced any evidence for the location of the dromoi for foot or horse races.

c. Gymnasium Building

An honorary decree for the Athenian statesman Lycurgus (IG II2 457) states that this building was repaired in the 330s BC. The original structure may have been built either by Pericles (Philochoros,FGrH 328F37) in the fifth century BC or Pisistratus (Theopompus, FGrH 115F136) in the sixth century BC. It is the only gymnasium building that is attested in Athens during the Classical period.

d. Palaistra

The scene of Plato’s Euthydemus, which was also perhaps repaired in the 330s BC by Lycurgus ([Plutarch], Lives of the Orators 841d).

e. Sanctuaries

The Lyceum itself was a sanctuary to Apollo, but it also contained a shrine to the Muses that housed many dedications and a portrait bust of Aristotle (Diogenes Laertius 5.51, also cf. IG II2 2613). There was also a cult of Hermes on the Lyceum grounds (IG II2 1357b4).

f. Seats

There were seats for the athlothetai (judges in an athletic contest) somewhere in the Lyceum grounds (Aischin. Soc. fr. 2.2 Dittmar=Demetr. Elocut. 205).

g. Stoas

A small stoa stood near the sanctuary to the Muses, and another stoa had maps of the earth displayed on tablets on the walls (Diogenes Laertius 5.51-52; Lucian, Anachar. 33).

h. Trees and Streams

Parts of the Lyceum were apparently wooded, and channels were dug from the Ilissus and Eridanus rivers to keep the area green. Theophrastus notes one enormous plane tree in particular that “sent out roots a distance of 33 cubits” (Theophrastus, On Plants 1.7.1). The woods were said to have been chopped down during the siege by Sulla in 86 BC (Plutarch, Sulla 12.3).

2. History of the Use of the Lyceum

The Lyceum, like the other famous Athenian gymnasia (the Academy and Cynosarges) was more than a space for physical exercise and philosophical discussion, reflection, and study. It contained cults of Hermes, the Muses, and Apollo, to whom the area was dedicated and belonged. It was also used for military exercises, the marshaling of troops, and for military displays. The Lyceum thus encompassed a fairly large area, including large open spaces, buildings, and cult sites.

The Lyceum was named after Apollo Lyceus, Apollo “the wolf-god.” From at least the sixth-century BC the Lyceum is said to have been the place where the polemarch (head of the army) had his office (Hesychius, “Epilykeion” and Suda, “ArchÙn”). The area was also used for military exercises (Suda, “Lykeion”) and for the marshaling of troops before departing on campaign (Aristophanes, Peace 351-357); it was also the site of cavalry displays (Xenophon, The Cavalry Commander 3.1). The Lyceum was also the place for meetings of the Athenian assembly before the establishment of a permanent meeting area on the Pnyx hill during the fifth century BC (IG I3 105).

The Lyceum was a place of philosophical discussion and debate well before Aristotle founded his school there in 335 BC. Socrates (Euthydemus 271a, Euthyphro 2a, Symposium 223d), Prodicus of Chios ([Plato], Eryxias 397c-d), and Protagoras (Diogenes Laertius 9.54) all apparently frequented the Lyceum to debate, discuss, and teach during the last third of the fifth-century BC. Plato’s great rival Isocrates taught rhetoric in the Lyceum during the first half of the fourth century BC, as did other sophists and philosophers. Rhapsodes were said to teach there as well (Alexis, PCG fr. 25, Antiphanes, PCG fr. 120, and Isocrates, Panathenaecus 33.5).

Upon his return to Athens in 335 BC, Aristotle rented some buildings in the Lyceum and established a school there. For nearly the remainder of his life, it was here that Aristotle lectured, wrote most of his philosophical treatises and dialogues, and systematically collected books for the first library in European history. After Aristotle’s death in 322 BC the headship of the school passed to Theophrastus, who continued his master’s program of research and teaching in the Lyceum. Theophrastus also purchased buildings and land that he bequeathed to the school in his will. However, the quality of the school’s library may have declined after Theophrastus’ death in 287 BC with the apparent loss of many of Aristotle’s works to Neleus of Scepsis (Strabo 13.1.54).

From the time of Aristotle until 86 BC there was a continuous succession of philosophers in charge of the school in the Lyceum. The common name for the school, Peripatetic, was derived either from the peripatos in the Lyceum grounds or from Aristotle’s habit of lecturing while walking. The school was a part of the military/educational institution for the city’s elite, the ephebeia. This program of study and military service provided eighteen- to twenty-year-old Athenian males with a curriculum of philosophy, knowledge of the ancestral cults, and instruction in the art of war. The Lyceum’s fame-and the fame of other schools in Athens-attracted increasing numbers of philosophers and students from all over the Mediterranean world.

The brutal sack of Athens by the Roman general Sulla in 86 BC destroyed much of the Lyceum and disrupted the life of the school considerably. The school may have been refounded later in the first century BC by Andronicus of Rhodes, but this is uncertain. By the second century AD, the Lyceum was again a flourishing center of philosophical activity. The Roman emperor Marcus Aurelius appointed teachers to all the main philosophical schools in Athens, including the Lyceum. The utter destruction of Athens in AD 267 probably ended this renaissance of scholarly activity. The work of Peripatetic philosophers continued elsewhere, but it is unclear whether they returned to the Lyceum. Nothing certain is known about the Lyceum during the remainder of the third through early sixth centuries AD. Any remaining philosophical activity would certainly have ended in AD 529, when the emperor Justinian closed all the philosophical schools in Athens.

3. References and Further Reading

  • J.P. Lynch, Aristotle’s School: A Study of a Greek Educational Institution. Berkeley 1972.
  • C.E. Ritchie, “The Lyceum, the Garden of Theophrastos and the Garden of the Muses. A Topographical Reevaluation”
  • J. Travlos, Pictorial Dictionary of Ancient Athens. Athens 1971.
  • Wycherley, R.E. The Stones of Athens. Princeton 1978.

Author Information

William Morison
Email: morisonw@gvsu.edu
Department of History

Lucretius (c. 99—c. 55 B.C.E.)

LucretiusLucretius (Titus Lucretius Carus) was a Roman poet and the author of the philosophical epic De Rerum Natura (On the Nature of the Universe), a comprehensive exposition of the Epicurean world-view. Very little is known of the poet’s life, though a sense of his character and personality emerges vividly from his poem. The stress and tumult of his times stands in the background of his work and partly explains his personal attraction and commitment to Epicureanism, with its elevation of intellectual pleasure and tranquility of mind and its dim view of the world of social strife and political violence. His epic is presented in six books and undertakes a full and completely naturalistic explanation of the physical origin, structure, and destiny of the universe. Included in this presentation are theories of the atomic structure of matter and the emergence and evolution of life forms – ideas that would eventually form a crucial foundation and background for the development of western science. In addition to his literary and scientific influence, Lucretius has been a major source of inspiration for a wide range of modern philosophers, including Gassendi, Bergson, Spencer, Whitehead, and Teilhard de Chardin.

Table of Contents

  1. Life
    1. Italy during the First Century BCE
    2. Lucretius’ Personality and Outlook
  2. Philosophy
    1. Epicurus
    2. Epicureanism
      1. Physics
      2. Canonic
      3. Ethics
    3. The Design of the Poem
    4. Lucretius as a Philosopher
    5. Influence and Legacy
    6. Conclusion
  3. References and Further Reading
    1. Texts
    2. English Translations
    3. Critical and Scholarly Studies

1. Life

Of Lucretius’ life remarkably little is known: he was an accomplished poet; he lived during the first century BC; he was devoted to the teachings of Epicurus; and he apparently died before his magnum opus, De Rerum Natura, was completed. Almost everything else we know (or think we know) about this elusive figure is a matter of conjecture, rumor, legend, or gossip.

Some scholars have imagined that this lack of information is the result of a sinister plot – a conspiracy of silence supposedly conducted by pious Roman and early Christian writers bent on suppressing the poet’s anti-religious sentiments and materialist blasphemies. Yet perhaps more vexing for our understanding of Lucretius than any conspiracy of silence has been the single lurid item about his death that appears in a fourth century chronicle history by St. Jerome:

94

[sic] BC. . . The poet Titus Lucretius is born. He was later driven mad by a love philtre and, having composed between bouts of insanity several books (which Cicero afterwards corrected), committed suicide at the age of 44.

Certainly the possibility that Lucretius (whose blistering, two hundred line denunciation of sexual love comprises one of the memorable highlights of the poem) may himself have fallen victim to a love potion is a superb irony. Unfortunately, there is not a shred of evidence to support the claim. Nor is it highly likely that Cicero (a skeptical-minded thinker with sympathies toward Stoicism) would have assisted to any large degree in the publication of an epic celebrating the Epicurean creed. As for the suggestion that Lucretius produced De Rerum Natura in lucid periods between intervals of raging insanity, the poem itself stands as a strong argument to the contrary. At the very least it must be considered improbable that a work of such scope and complexity, of such intellectual depth and sustained reasoning power, could have been the product of fitful composition and a diseased mind.

Fortunately, even if we dismiss Jerome’s account as little more than an edifying fable and resign ourselves to the absence of even a scrap of reliable biographical information on Lucretius, there is still one source we can turn to for valuable insights into the poet’s character, personality, and habits of mind, and that is De Rerum Natura itself. For although the poem tells us almost nothing about the day to day affairs of Lucretius the man, it nevertheless furnishes a large and revealing portrait of Lucretius the poet, philosopher, social commentator, critic of religion, and observer of the world.

Indeed one does not have to read very far into the poem to discover that not only is Lucretius a serious student of philosophy and science, but that above all he is a great poet of nature. He reveals himself as a lover of woods, fields, streams, and open spaces, acutely sensitive to the beauties of landscape and the march of seasons. He proves a keen observer of plants and animals and at least as knowledgeable and interested in crops, weather, soil, and horticulture as in the existence of gods or the motion of atoms. The preponderance of natural descriptions and images in the poem has led some readers to suppose that the author must have led some form of rural existence, perhaps as the owner of a country estate. True or not, it is clearly not the city, with its hurly-burly of commerce, money grubbing, social climbing, and political strife, but the quiet countryside with its contemplative retreats, solitude, and simple pleasures that inspires his poetry and (as was the case with his master Epicurus in his garden at Athens) his philosophical reveries.

It is generally assumed that the poet, as his name implies, was a member of the aristocratic clan of the Lucretii. On the other hand, it is also possible that he was a former slave and freedman of that same noble family. Support for the idea of his nobility comes in part from his suave command of learning and the polished mastery of his style, but mostly from the easy and natural way (friend to friend, rather than subordinate to superior) in which he addresses Memmius, his literary patron and the addressee of the poem.

Gaius Memmius was a Roman patrician who was at one time married to Sulla’s daughter, Fausta. In 54 BC (one year after Lucretius’ death), he stood for consul, but was defeated owing to an electoral violation, which he himself revealed but was afterwards condemned for. In 52 BC he went into exile at Athens, and it is unknown whether he ever returned to Rome. Lucretius dedicated his poem to him, and throughout the epic the poet is at pains to remind Memmius of the sweet rewards of the Epicurean lifestyle and the bitter tribulations of public life. No doubt it would have distressed the poet deeply to know that his chief literary sponsor, instead of following the lofty path to Epicurean tranquilitas, ended his career with a vain descent into the tarnishing world of power politics and personal ambition.

Literary tradition has supplied Lucretius with a wife, Lucilla. However, except for a line or two in the poem suggesting the author’s personal familiarity with marital discord and the bedroom practices of “our Roman wives” (4. 1277), there is no evidence that he himself was ever married.

a. Italy during the First Century BCE

For the most part, the forty-four years of Lucretius’ lifetime was a period of nearly non-stop violence: a time of civil wars, grueling overseas campaigns, political assassinations, massacres, revolts, conspiracies, mass executions, and social and economic chaos. Even a brief chronology of the times paints a grim picture of devastation, with each decade bearing witness to some new disturbance or uprising:

100 BC: riots erupt in the streets of Rome; two public officials, the tribune L. Appuleius Saturninus and praetor C. Servilius Glaucia, are murdered. 91 BC: the so-called Social War (between Rome and her Italian allies) breaks out. No sooner is this bitter struggle ended (88 BC) than Lucius Cornelius Sulla, a ruthless politician and renegade army commander, marches on Rome, and an even more convulsive and bloody Civil War begins. 82 BC: Sulla becomes dictator. His infamous proscription results in the arrest and execution of more than 4000 leading citizens, including 40 senators. 71 BC: Spartacus’ massive slave revolt (involving an army of 90,000 former slaves and outlaws) is finally put down by Cassius and Pompey. More than 6000 of the captured rebels are crucified and their bodies left for display along the Appian Way. 62 BC: Defeat and death of Catiline. By this point in his career this former lieutenant of Sulla had become a living plague upon Roman politics and a virtual byword for scandal, intrigue, conspiracy, demagoguery, and vain ambition.Such was Rome from the rise of Sulla to the fall of Catiline, a period of seemingly endless bloodshed and civil unrest. With such a background, it is little wonder that the precepts of Epicurus – with their emphasis on contemplative pursuits and quiet pleasures and severe strictures against ambition, fame, and the world of politics – struck a responsive chord in the heart of a young Roman poet. To a sensitive intellectual like Lucretius, the teachings of Epicurus must have had the force of a philosophical revelation. In this respect, it is noteworthy (and ironic) that throughout De Rerum Natura whenever the poet writes about Epicurus he praises him not simply as a great teacher and brilliant philosopher, but virtually as a kind of oracle and even a god. Meanwhile, he seems to have viewed his own role as that of an Epicurean evangelist: he is a poetic apostle dedicated to spreading the master’s gospel of liberation from the bondage of superstition and error, of inner peace attained through the study of philosophy and the enjoyment of modest pleasures.

b. Lucretius’ Personality and Outlook

Unlike his hero Epicurus, who had a reputation for being gentle and self-effacing, Lucretius’ excitable personality springs vividly from his pages. Though naturally passionate and intellectually contentious, he also reveals himself as reflective and prone to melancholy. Like his master, he detests war, strife, and social tumult and favors a life quietly devoted to sweet friendship (suavis amicitia) and intellectual pleasures.

At the beginning of Book 2 of his poem, the poet compares the prospect of a person armed with the insights of Epicurus to that of a secure spectator looking down upon a scene of strife:

Pleasant it is, when over the great sea the winds shake the waters,
To gaze down from shore on the trials of others;
Not because seeing other people struggle is sweet to us,
But because the fact that we ourselves are free from such ills strikes us as pleasant.
Pleasant it is also to behold great armies battling on a plain,
When we ourselves have no part in their peril.
But nothing is sweeter than to occupy a lofty sanctuary of the mind,
Well fortified with the teachings of the wise,
Where we may look down on others as they stumble along,
Vainly searching for the true path of life. . . . (2. 1-10)

This idea of philosophy as a private citadel or quiet refuge in a world of anxiety and turmoil, or of some form of contemplation as the true path to enlightenment, has been a recurrent theme in world literature from the Buddha to Boethius, from Socrates to Schopenhauer. The idea is a central component of Epicurean doctrine and a favorite theme and image of Lucretius, whose characteristic vantage point throughout the poem is that of a critical observer above the fray. As narrator, he stands aloof, a scornful yet at the same time sympathetic witness to mankind’s dark strivings and tribulations:

Lo, see them: contending with their wits, fighting for precedence,
Struggling night and day with unending effort,Climbing, clawing their way up the pinnacles of wealth and power.
O miserable minds of men! O blind hearts!
In what darkness, among how many perils,
You pass your short lives! Do you not see
That our nature requires only this:
A body free from pain, and a mind, released from worry and fear,
Free to enjoy feelings of delight? (2. 11-19.)

Like his master, Lucretius obviously feels that the true purpose of moral philosophy is not merely to diagnose human miseries; but to heal them.

2. Philosophy

a. Epicurus

From the very start of the poem, and especially in the opening lines of Book 3 (a ringing tribute to Epicurus), Lucretius makes it clear that his main purpose is not so much to display his own talents as to render accurately in a suitably sublime style the glorious philosophy of his master:

O you who out of the vast darkness were the first to raise
A shining light, illuminating the blessings of life,
O glory of the Grecian race, it is you I follow,
Tracing in your clearly marked footprints my own firm steps,
Not as a contending rival, but out of love, for I yearn to imitate you.
For why should the swallow vie with the swan?
Why should a young kid on spindly limbs
Dare to match strides with a mighty steed? (3. 1-8.)

The poetry, Lucretius keeps reminding his readers, is secondary, a sugar coating to sweeten Epicurus’ healing medicine. The Epicurean system is what is important, and the poet pledges all his skill to presenting it as clearly, as faithfully, and as persuasively as possible. In his view nothing less than universal enlightenment and the liberation of mankind is at stake.

Epicurus was born at Samos, an Athenian colony, in 341 BC. Reduced to its simplest level, the goal of his teaching was to free humanity from needless cares and anxieties (especially the fear of death) . By furnishing a complete explanation of the origin and structure of the universe, he sought to open men’s eyes to a true understanding of their condition and liberate them from ignorant fears and superstitions. Though by all accounts he was a voluminous writer, only a tiny fraction of his original output has survived, with the result that Lucretius’ poem has served as one of the primary vehicles for conveying his thought.

b. Epicureanism

The Epicurean system consists of three linked components: Physics, Ethics, and Canonic. These three elements are designed to be interdependent, each one supposedly uniting with and reinforcing the other two. (To cite just one example, Epicurus’ physics supposedly validates both the existence of free will and the fact that the soul disintegrates with the body, ideas that are crucial to Epicurean ethics. The canonic claims to validate the authority and reliability of sensation, which in turn serves as a basis for Epicurean physical theories and ethical views relating to pleasure and pain.) In actual fact, however, the three components are quite separable, and it is certainly possible, for example, to accept Epicurus’ ethical doctrines while entirely denying his canonic teachings and physics.

i. Physics

One of the great achievements of the scientific imagination, the Epicurean cosmos is based on three fundamental principles: materialism, mechanism, and atomism. According to Epicurus the universe covers an infinitude of space and consists entirely of matter and void. For the most part the philosopher upholds Democritus’ theory that all matter is composed of imperishable atoms, tiny indivisible particles that can neither be created or destroyed. He also shares Democritus’ view that the atoms are infinite in number and homogenous in substance, while differing in shape and size. However, whereas Democritus held that the number of atomic sizes and shapes is infinite, Epicurus argued that their number, while large, is nevertheless finite. (As Lucretius notes, if atoms could be any size, some would be visible, and possibly even immense.) As for atomic motion, Democritus had claimed that the atoms move in straight lines in all directions and always in accordance with the iron laws of “necessity” (anangke). Epicurus, on the other hand, contends that their natural motion is to travel straight downwards at a uniform high velocity. At random and unpredictable moments, moreover, they deviate ever so slightly from their regular course, their resulting collisions thus occurring not by strict necessity but always with some element of chance. This theory of atomic “swerve” or clinamen is a crucial feature of the Epicurean world-view, providing (so Lucretius and other adherents believed) a firm physical foundation supporting the existence of free will.

Armed with these basic principles, Epicurus is able to explain the universe as an ongoing cosmic event – a never-ending binding and unbinding of atoms resulting in the gradual emergence of entire new worlds and the gradual disintegration of old ones. Our world, our bodies, our minds are but atoms in motion. They did not occur because of some purpose or final cause. Nor were they created by some god for our special use and benefit. They simply happened, more or less randomly and entirely naturally, through the effective operation of immutable and eternal physical laws.

Here it should be noted that Epicurus is a materialist, not an atheist. Although he argues that not only our earth and all its life forms, but also all human civilizations and arts came into being and evolved without any aid or sponsorship from the gods, he does not deny their existence. He merely denies that they have any knowledge of or interest in human affairs. They live on immune to destruction in their perfectly compounded material bodies in the serene and cloudless spaces between the worlds (intermundia), perfectly oblivious of human anxieties and cares. Lucretius imagines that Epicurus rivaled them in their divine tranquility.

ii. Canonic

The so-called canonic teachings of Epicurus (from the Greek kanon, “rule”) include his epistemological theories and especially his theories of sensation and perception. In certain respects, these theories represent Epicurus’ thought at its most original and prescient – and in one or two instances at its most fanciful and absurd.

The central principle of the canonic is that our sense data provide a true and accurate picture of external reality. Sensation is the ultimate source and criterion of truth, and its testimony is incontrovertible. Epicurus considered the reliability of the senses a bulwark of his philosophy, and Lucretius refers to trust in sensation as a “holdfast,” describing it as the only thing preventing our slide into the abyss of skepticism (4. 502-512).

But if our sensory input is always true and dependable, how are we to account for hallucinations, fantasies, dreams, delusions, and other forms of perceptual error? According to Epicurus, such errors are always due to some higher mental process. They arise, for example, when we apply judgment or reasoning or some confused product of memory to the actual data presented to us by sensation. As Lucretius remarks, we deceive ourselves because we tend to “see some things with our mind that have not been seen by the senses”:

For nothing is harder than to distinguish the real things of sense
From those doubtful versions of them that the mind readily supplies. (4. 466-468.)

Epicurus’ theory of sensory perception is consistent with and follows from his materialism and atomism. Like Democritus, he postulates that external objects send off emanations or “idols” (eidola) of themselves that travel through the air and impinge upon our senses. In effect, these subtle atomic images or films imprint themselves on the senses, leaving behind trace versions of the external world (auditory and olfactory as well as visual) that can be apprehended and stored in memory. Once again, perceptual errors can occur in this process, but not because of any inherent problem with sensation itself. Instead, mistakes arise due either to the contamination of the “idols” by other atoms or because of the “false opinions” that we ourselves, through defects in our higher mental operations, introduce.

In short, unless it is distorted by some form of external “noise” or by some processing error attributable to reason, all information conveyed through the senses is true. This is Epicurus’ core canonic teaching. Unfortunately, this belief in the infallibility of sense perception and the unreliability of logic and reason led him and his followers (including Lucretius) into a number of strange conclusions – such as the absurd claim that the sun, moon, and stars are exactly the size and shape that they appear to be to our naked eye. Thus (as strict Epicurean doctrine would have it) the moon truly is a small, silver disc, the sun is a slightly larger golden fire, and the stars are but tiny points of light.

iii. Ethics

Epicurus’ ethics represents the true goal and raison d’etre of his philosophical mission, the capstone atop the impressive (though hardly flawless) pillars of his physics and epistemology. Like Socrates, he considered moral questions (What is virtue? What is happiness?) rather than cosmological speculations to be the ultimate concerns of philosophical inquiry.

As mentioned earlier, it is possible to accept one component of the Epicurean system without necessarily subscribing to the others. But from Epicurus’ (and Lucretius’) point of view, it is the ethical component that is of vital importance.

As many commentators have noted, the term “Epicure” (in the sense of a self-indulgent bon vivant or luxurious pleasure-seeker) is entirely out of place when applied to Epicureanism in general and to its founder in particular. By all accounts, Epicurus’ own living habits were virtually Spartan, and it is said that he attracted many of his disciples more by his solid character and agreeable temper than by his philosophical arguments. His moral philosophy is a form of hedonism, meaning that it is a system based on the pursuit of pleasure (Gr. ēdonewhich it identifies as the greatest good. But Epicurean hedonism is hardly synonymous with sensual extravagance; nor is it a matter (in St. Paul’s disparaging terms) of “let us eat and drink; for tomorrow we die.” It is instead a system that requires severe self-denial and moral discipline. For Epicurus places a much greater emphasis on the avoidance of pain than on the pursuit of pleasure, and he favors intellectual pleasures (which are long-lasting and never cloying) over physical ones (which are short-lived and lead to excess). As for self-indulgence, he argued that it is better to abstain from coarse or trivial pleasures if they prevent our enjoyment of richer, more satisfying ones.

In Epicurean ethics physical pain is the great enemy of happiness and is to be avoided in almost all cases. Mental anguish is even more threatening and potentially debilitating. It follows that the fear of death – and especially the superstitious belief in an after-life of eternal torment – can be particularly devastating source of anxiety and take a terrible toll on humanity, which is why Epicurus sets out so determinedly to crush it.

c. The Design of the Poem

De Rerum Natura is an epic in six books and is expertly organized to provide both expository clarity as well as powerful narrative and lyric effects. In one respect, the poem represents the unfolding of a complex philosophical argument, and in many places the poet is challenged to explain abstract and often extremely prosaic technical material in a lucid and lively way. (At times during the poem he complains about the relative poverty of Latin as a philosophical medium compared to the technical richness of Greek.) At the same time, he must be careful not to overwhelm or upstage his philosophical presentation with a surplus of brilliant literary devices and gaudy stylistic displays. The basic organization is as follows:

Book 1: The poem begins with a justly famous invocation to Venus (the poet’s symbol for the forces of cohesion, integration, and creative energy in the universe). Presented as a kind of life principle, the Lucretian Venus is associated with the figure of Love (Gr. philia, the unifying or binding force in the philosophy of Empedocles, and also identified with her mythical role as Venus Genetrix, the patron goddess and mother of the Roman people. In the remainder of the book the poet begins the work of explaining the Epicurean system and refuting the systems of other philosophers. He starts by setting forth the major principles of Epicurean physics and cosmology, including atomism, the infinity of the universe, and the existence of matter and void.

Book 2. This book begins with a lyric passage celebrating the “serene sanctuaries” of philosophy and lamenting the condition of those poor human beings who struggle vainly outside its protective walls. The poet explains atomic motion and shapes and argues that the atoms do not have secondary qualities (color, smell, heat, moisture, etc.).

Book 3. After a glowing opening apostrophe to Epicurus (“O glory of the Greeks!”), the poet proceeds with an extended explanation and proof of the materiality – and mortality – of the mind and soul. This explanation culminates in the climactic declaration, “Nil igitur mors est ad nos. . .” (“Therefore death is nothing to us.”), a stark, simple statement which effectively epitomizes the main message and central doctrine of Epicureanism.

Book 4. Following introductory verses on the art of didactic poetry, this book begins with a full account of Epicurus’ theory of vision and sensation. It concludes with one of Lucretius’ greatest passages of verse, his famous (and caustic) analysis of the biology and psychology of sexual love.

Book 5. Lucretius begins this book with another tribute to the genius of Epicurus, whose heroic intellectual achievements, it is argued, exceed even the twelve labors of Hercules. The remainder of the book is devoted to a full account of Epicurean cosmology and sociology, with the poet explaining the stages of life on earth and the origin and development of civilization. This book includes the remarkable passage (837-886) in which the poet offers his own evolutionary hypothesis on the proliferation and extinction of life forms.

Book 6. Though partly unfinished, this book contains some of Lucretius’ greatest poetry, with effective technical explanations of meteorological and geologic phenomena and vivid descriptions of thunderstorms, lightning, and volcanic eruptions. The poem closes with a horrifying account of the great plague of Athens (430 BC), a grim reminder of universal mortality.

d. Lucretius as a Philosopher

Critics universally recognize Lucretius as a major poet and the author of one of the great classics of world literature. But in part because of his accepted role as a spokesperson for Epicureanism rather than an originator, it has been more difficult to assess his merit as a philosopher.

In this respect, it is noteworthy that at least two important philosophers have voiced strong support for Lucretius’ status as a philosophical innovator and original thinker. In 1884, while still a young faculty member at the Blaise Pascal Lycee in Paris, the French philosopher Henri Bergson (1859-1941) published an edition of De Rerum Natura with notes, commentary, and an accompanying critical essay. Throughout this work, Bergson commends Lucretius not only as a poet of genius, but also as an inspired and “singularly original” thinker. In particular, he points out that in his view the poet’s instinctive grasp of the physical operations of nature and his comprehensive, truly scientific world-view exceed anything found in the theories of Democritus and Epicurus.

The Spanish poet and Harvard philosopher George Santayana (1863-1952) held a similarly high opinion of Lucretius’ power as a scientific thinker. Democritus and Epicurus, he argues, are mere sketch artists who offer no more than bare hints and vague outlines of a thoroughly imagined and truly scientifically conceived universe. It thus remained for the deeper, more visionary poet not just to flesh out their rough drafts in fine words, but in essence to actually create and give body to the entire Epicurean system. In Santayana’s view, Epicurus was but a supplier of half-baked ideas; it was Lucretius who was the true creator of scientific materialism and the real founder of Epicureanism.

Hyperbole aside, what both Bergson and Santayana are pointing to is the frequently underrated and misunderstood role of imagination in the production of almost all major systems of philosophy. Great philosophers from Plato and Aristotle to Kant and Nietzsche (and Bergson himself) have never been simply logic mills or thinking machines, but bold thinkers with an imaginative “feel” for abstract reality. In this respect, even if we dismiss the assessments of Bergson and Santayana as extravagant, we can still accept Lucretius as a bona fide philosopher and not just as a poetical embellisher and interpreter.

Every philosopher has strengths and weaknesses; those of Lucretius are conspicuous. In addition to his powerful imagination, his main strength (not surprisingly) is his verbal skill and force of expression. He is one of the most quotable of philosophers, with a flair for striking images and tightly packed statements. A few samples:

On superstition:

“So powerful is religion at persuading to evil.” 1. 101.

On luxuries:

“Hot fevers do not depart your body more quickly
If you toss about on pictured tapestries or rich purple coverlets
Than if you lie sick under a poor man’s blanket.” 2. 34-36.

On life without philosophy:

“All life is a struggle in the dark.” 2. 54.
“After a while the life of a fool is hell on earth.” 3. 1023.

On new truths:

“No fact is so obvious that it does not at first produce wonder,
Nor so wonderful that it does not eventually yield to belief.” 2. 1026-27.

On reason:

“Such is the power of reason to overcome inborn vices
That nothing prevents our living a life worthy of gods.” 3. 321-22.

On the language of love:

“We say a foul, dirty woman is ‘sweetly disordered,’
If she is green-eyed, we call her ‘my little Pallas’;
If she’s flighty and tightly strung, she’s ‘a gazelle’;
A squat, dumpy dwarf is ‘a little sprite,’
While a hulking giantess is ‘divinely statuesque.’
If she stutters or lisps, she speaks ‘musically.’
If she’s dumb, she’s ‘modest’; and if she’s hot-tempered
And a chatterbox, she’s ‘a ball of fire.’
When she’s too skinny to live, she’s ‘svelte,’
And she’s ‘delicate’ when she’s dying of consumption. . .
It would be wearisome to run through the whole list.” 4. 1159-1171.

Of all Lucretius’ intellectual strengths, perhaps none is more characteristic or stands out more impressively than his hard, clear commitment to naturalism. Throughout the poem he consistently attacks supernatural explanations of phenomena and resists the temptation to give in to some form of natural religion or “scientific” supernaturalism. The world, he argues, was not created by divine intelligence, nor is it imbued with any form of mind or purpose. Instead, it must be understood as an entirely natural phenomenon, the outcome of a random (though statistically inevitable and lawful) process. In short, whatever happens in the universe is not the product of design, but part of an ongoing sequence of purely physical events.

Lucretius’ principal philosophical shortcoming is that not only will he occasionally follow Epicurean doctrine to the point of absurdity (e.g., the supposedly tiny size of the sun and moon) but he will also introduce logical fallacies or scientific errors of his own (such as his claim that the atoms travel faster than light – 2. 144ff.). As Bergson points out, these howlers can usually be attributed to the defective method of ancient science, which, because it did not require that hypotheses be confirmed by experimentation, allowed even the wildest conjectures to pass as plausible truths. One further problem is that, for all his reliance on naturalistic explanations and his attempted reduction of metaphysics to physics, Lucretius at times seems to back away, if only ever so slightly, from a purely materialist world view. Indeed in his effusive descriptions of the creative power of nature, effectively symbolized by the figure of Venus, he seems almost (like Bergson) to postulate an immaterial life-force surging through the universe and operating above or beyond raw nature. To read this romantic streak into him is clearly a mistake. Lucretius remains a thorough-going naturalist. Yet when his verse is in high gear, one almost gets the impression that somewhere inside this staunchly scientific, fiercely anti-religious poet there is a romantic nature-worshipper screaming to get out.

e. Influence and Legacy

Lucretius’ literary influence has been long-lasting and widespread, especially among poets with epic ambitions or cosmological interests, from Virgil and Milton to Whitman and Wordsworth. Not surprisingly, as one of the main proponents and principal sources of Epicurean thought, his philosophical influence has also been considerable. The extent of his communication with and influence on his contemporaries, including other Epicurean writers, is not known. What is known is that by the end of the first century A.D. De Rerum Natura was hardly read and its author had already begun a long, slow descent into philosophical oblivion. It was not until the Renaissance, with the recovery of lost Lucretian manuscripts, that a true revival of the poet became possible.

It is probably an exaggeration to say that the restoration and study of Lucretius’ poem was crucial to the rise of Renaissance “new philosophy” and the birth of modern science. On the other hand, one must not ignore its importance as a spur to innovative sixteenth- and seventeenth-century scientific thought and cosmological speculation. Greek atomism and Lucretius’ account of the universe as an infinite, lawfully integrated whole provided an important background stimulus not only for Newtonian science, but also (if only in a negative or contrary way) for Spinoza’s pantheism and Leibniz’s monadology.

While admitting that “one poem by itself was certainly not responsible for an entire intellectual, moral, and social transformation,” Renaissance scholar Stephen Greenblatt has nevertheless argued convincingly that Lucretius’s epic had a decisive and lasting historical impact. The subtitle of Greenblatt’s study “How the World Became Modern” summarizes his point as he shows how the poem effectively influenced a wide range of Renaissance scientists, philosophers, and literary intellectuals.

As expected, the first figures to spread and expound upon the recently rediscovered poem and its Epicurean gospel were the Italian humanists of the 15th century. Chief among them was the Catholic priest and expert Latinist Lorenzo Valla, author of “On Pleasure,” a fictional debate on whether the best way to achieve a virtuous and happy life is to follow the tenets of Christianity or those of Epicureanism.

Niccolo Machiavelli and Michel de Montaigne were avid readers of the poem. Lucretius appears to have played a major role in shaping Machiavelli’s political thought, while Montaigne quotes abundantly from De Rerum Natura throughout his Essays. No doubt Lucretius’s skeptical outlook and withering critique of religious dogma and political violence appealed to the French writer’s own skepticism, particularly at a time when bloody religious wars were in full fury not far outside the walls of his chateau retreat. Meanwhile, in England, Sir Thomas More proposed a version of Epicurean hedonism as a moral ideal in his fictional fantasy Utopia. However, it remains uncertain whether the Catholic martyr and saint’s account of a pleasure-based, non-religious ethics was sincere or ironical.

Lucretius’ influence on early modern thought is most directly visible in the work of the French scientist and neo-Epicurean philosopher Pierre Gassendi (1592-1655). In 1649 Gassendi published his Syntagma Philosophiae Epicuri, a theoretical refinement and elaboration of Epicurean science. A Catholic priest with a remarkably independent mind, Gassendi seemingly had no problem reconciling his personal philosophical commitment to atomism and materialism with his Christian beliefs in the immortality of the soul and the doctrine of divine providence. Lucretius’s poem also inspired some of the cosmological speculations of Giordano Bruno (1548-1600) and especially his idea of an unbounded cosmos with an infinitude of suns and planets. Bruno was condemned by the Inquisition and burned at the stake for his heretical opinions. However, his views appear to have been at least as deeply rooted in mysticism and pantheism as in Lucretian materialism and atomic theory.

Every modern reader of De Rerum Natura has been struck by the extent to which Lucretius seems to have anticipated modern evolutionary theories in the fields of geology, biology, and sociology. However, to acknowledge this connection is not to say that the poet deserves accredited status as some kind of scientific “evolutionist” or pre-Darwinian precursor. It is merely to point out that, however we choose to define and evaluate its influence, De Rerum Natura was from the 17th century onward a massive cultural presence and hence a ready source of evolutionary ideas. The poem formed part of the cultural heritage and intellectual background of virtually every evolutionary theorist in Europe from Lamarck to Herbert Spencer (whose hedonistic ethics also owed a debt to the poet) – including (though he claimed never to have read Lucretius’ epic) Darwin himself.

Bergson’s early study of Lucretius obviously played an important role in the foundation and development of his own philosophy. In 1907 Bergson published Creative Evolution, outlining his bold, new vitalistic theory of evolution, in opposition to both the earlier vitalism of Lamarck and the naturalism of Darwin, and Spencer. It is hard not to see in the French philosophers’ concept of the élan vital a powerful life force akin to and strongly influenced by the immortal Venus of his great Latin predecessor. Bergson’s evolutionary philosophy influenced the later “process” philosophy of Alfred North Whitehead (1861-1947) and the teleological scientific theories of Pierre Teilhard de Chardin (1881-1955), with the interesting result that it is possible to trace out a fairly direct, if unlikely, line of descent from Greek atomism through the pagan anti-spiritualist Lucretius to the Catholic naturalist Gassendi and then on, via the Jewish-Catholic Bergson, to the highly abstract theism of Whitehead and the “spiritualized” evolutionism of Father Teilhard. That Lucretius’ ideas wound up two thousand years after his death influencing those of a godly British mathematical theorist and a highly original and even eccentric French scientist-priest is remarkable testimony to their durability, adaptability, and persuasive power.

f. Conclusion

In conclusion, it seems fair to say that, far from being a mere conduit for earlier Greek thought, the poet Titus Lucretius Carus was a bold innovator and original thinker who fully deserves the appellation of philosopher. While his literary fame clearly (and properly) comes first, and although his philosophical reputation is based largely (and again properly) on his role as one of the principle sources and prime exponents of Epicureanism, his own ideas, especially his evolutionary theories and his entirely naturalistic explanation of all universal phenomena, have exerted a long and important influence on western science and philosophy and should not be underestimated.

3. References and Further Reading

The most authoritative manuscripts of De Rerum Natura are the so-called O and Q codices in Leiden. Both date from the 9th century. Recently, however, scholars have deciphered a much older and previously illegible manuscript, consisting of papyri discovered in Herculaneum and possibly dating from as early as the first century AD. All other Lucretian manuscripts date from the 15th and 16th century and are based on the one (no longer extant) discovered in a monastery by the Italian humanist Poggio Bracciolini in 1417. Bracciolini’s discovery and the philosophical revolution it helped to bring about is the subject of Greenblatt’s study The Swerve.

a. Texts

  • Lucretius: On the Nature of Things. W.H.D. Rouse, trans. Revised and edited by Martin F. Smith. Cambridge, MA: Harvard University Press, 1992.
  • Bailey, C. ed. De Rerum Natura. 3 volumes with commentary. Oxford, 1947.

b. English Translations

  • Munro, H.A.J. (prose). Cambridge, 1864.Latham, R.E. (prose). Harmondsworth, UK: Penguin, 1951.
  • Humphries, Rolphe. (verse). Bloomington: Indiana University Press, 1968.
  • Copley, Frank O. (verse). New York: Norton, 1977.

c. Critical and Scholarly Studies

  • Bergson, Henri. Philosophy of Poetry: The Genius of Lucretius. Wade Baskin, trans. New York: Philosophical Library, 1959.
  • Brown, Allison. “Lucretius and the Epicureans in the Social and Political Context of Renaissance Florence.” In I Tatti Studies in the Italian Renaissance. Vol. 9 (2001), pp. 11-62.
  • Clay, D. Lucretius and Epicurus. Ithaca, NY, 1983.
  • Greenblatt, Stephen. The Swerve: How the World Became Modern. New York: Norton, 2011.
  • Hendrick, PJ. “Montaigne, Lucretius, and Scepticism: An Interpretation of the ‘Apologie de Raimond Sebond.” Proceedings of the Royal Irish Academy: Archeology, Culture, History, Literature. Vol. 79 (1979), pp. 139-52.
  • Jones, H. The Epicurean Tradition. London: 1989.
  • Kenney, E. J. Lucretius. Oxford, 1977.
  • Rahe, Paul A. “In the Shadow of Lucretius: Epicurean Foundations of Machiavelli’s Political Thought.” History of Political Thought, Vol. 28, No. 1 (Spring 2007), pp. 30-55.
  • Santayana, George. Three Philosophical Poets. Cambridge, MA: Harvard University Press.
  • Sikes, E.E. Lucretius: Poet and Philosopher. Cambridge, 1936.

Author Information

David Simpson
Email: dsimpson@condor.depaul.edu
DePaul University
U. S. A.

Logical Consequence

Logical consequence is arguably the central concept of logic. The primary aim of logic is to tell us what follows logically from what. In order to simplify matters we take the logical consequence relation to hold for sentences rather than for abstract propositions, facts, state of affairs, etc. Correspondingly, logical consequence is a relation between a given class of sentences and the sentences that logically follow. One sentence is said to be a logical consequence of a set of sentences, if and only if, in virtue of logic alone, it is impossible for the sentences in the set to be all true without the other sentence being true as well. If sentence X is a logical consequence of a set of sentences K, then we may say that K implies or entails X, or that one may correctly infer the truth of X from the truth of the sentences in K. For example, Kelly is not at work is a logical consequence of Kelly is not both at home and at work and Kelly is at home. However, the sentence Kelly is not a football fan does not follow from All West High School students are football fans and Kelly is not a West High School student. The central question to be investigated here is: What conditions must be met in order for a sentence to be a logical consequence of others?

One popular answer derives from the work of Alfred Tarski, one of the preeminent logicians of the twentieth century, in his famous 1936 paper, “The Concept of Logical Consequence.” Here Tarski uses his observations of the salient features of what he calls the common concept of logical consequence to guide his theoretical development of it. Accordingly, we begin by examining the common concept focusing on Tarski’s observations of the criteria by which we intuitively judge what follows from what and which Tarski thinks must be reflected in any theory of logical consequence. Then two theoretical definitions of logical consequence are introduced: the model theoretic and the deductive theoretic definitions. They represent two major approaches to making the common concept of logical consequence more precise. The article concludes by highlighting considerations relevant to evaluating model theoretic and deductive theoretic characterizations of logical consequence.

Table of Contents

  1. Introduction
  2. The Concept of Logical Consequence: Model-Theoretic and Deductive-Theoretic Conceptions of Logic
    1. Tarski’s Characterization of the Common Concept of Logical Consequence
      1. The Logical Consequence Relation Has a Modal Element
      2. The Logical Consequence Relation is Formal
      3. The Logical Consequence Relation is A Priori
    2. Logical and Non-Logical Terminology
      1. The Nature of Logical Constants Explained in Terms of Their Semantic Properties
      2. The Nature of Logical Constants Explained in Terms of Their Inferential Properties
  3. Model-Theoretic and Deductive-Theoretic Conceptions of Logic
  4. Conclusion
  5. References and Further Reading

1. Introduction

For a given language, a sentence is said to be a logical consequence of a set of sentences, if and only if, in virtue of logic alone, the sentence must be true if every sentence in the set were to be true. This corresponds to the ordinary notion of a sentence “logically following” from others. Logicians have attempted to make the ordinary concept more precise relative to a given language L by sketching a deductive system for L, or by formalizing the intended semantics for L. Any adequate precise characterization of logical consequence must reflect its salient features such as those highlighted by Alfred Tarski: (1) that the logical consequence relation is formal, that is, depends on the forms of the sentences involved, (2) that the relation is a priori, that is, it is possible to determine whether or not it holds without appeal to sense-experience, and (3) that the relation has a modal element.

For more comprehensive presentations of the two definitions of logical consequence, as well as further critical discussion, see the entries Logical Consequence, Model-Theoretic Conceptions and Logical Consequence, Deductive-Theoretic Conceptions.

2. The Concept of Logical Consequence

a. Tarski’s Characterization of the Common Concept of Logical Consequence

Tarski begins his article, “On the Concept of Logical Consequence,” by noting a challenge confronting the project of making precise the common concept of logical consequence.

The concept of logical consequence is one of those whose introduction into a field of strict formal investigation was not a matter of arbitrary decision on the part of this or that investigator; in defining this concept efforts were made to adhere to the common usage of the language of everyday life. But these efforts have been confronted with the difficulties which usually present themselves in such cases. With respect to the clarity of its content the common concept of consequence is in no way superior to other concepts of everyday language. Its extension is not sharply bounded and its usage fluctuates. Any attempt to bring into harmony all possible vague, sometimes contradictory, tendencies which are connected with the use of this concept, is certainly doomed to failure. We must reconcile ourselves from the start to the fact that every precise definition of this concept will show arbitrary features to a greater or less degree. (Tarski 1936, p. 409)

Not every feature of the technical account will be reflected in the ordinary concept, and we should not expect any clarification of the concept to reflect each and every deployment of it in everyday language and life. Nevertheless, despite its vagueness, Tarski believes that there are identifiable, essential features of the common concept of logical consequence.

…consider any class K of sentences and a sentence X which follows from this class. From an intuitive standpoint, it can never happen that both the class K consists of only true sentences and the sentence X is false. Moreover, since we are concerned here with the concept of logical, that is, formal consequence, and thus with a relation which is to be uniquely determined by the form of the sentences between which it holds, this relation cannot be influenced in any way by empirical knowledge, and in particular by knowledge of the objects to which the sentence X or the sentences of class K refer. The consequence relation cannot be affected by replacing designations of the objects referred to in these sentences by the designations of any other objects. (Tarski 1936, pp. 414-415)

According to Tarski, the logical consequence relation as it is employed by typical reasoners is (1) necessary, (2) formal, and (3) not influenced by empirical knowledge. I now elaborate on (1)-(3) in order to shape two preliminary characterizations of logical consequence.

i. The Logical Consequence Relation Has a Modal Element

Tarski countenances an implicit modal notion in the common concept of logical consequence. If X is a logical consequence of K, then not only is it the case that not all of the elements of K are true and X is false, but also this is necessarily the case. That is, X follows from K only if it is not possible for all of the sentences in K to be true with X false. For example, the supposition that All West High School students are football fans and that Kelly is not a West High School student does not rule out the possibility that Kelly is a football fan. Hence, the sentences All West High School students are football fans and Kelly is not a West High School student do not entail Kelly is not a football fan, even if she, in fact, isn’t a football fan. Also, Most of Kelly’s male classmates are football fans does not entail Most of Kelly’s classmates are football fans. What if the majority of Kelly’s class is composed of females who are not fond of football?

We said above that Kelly is not both at home and at work and Kelly is at home jointly imply Kelly is not at work. Note that it doesn’t seem possible for the first two sentences to be true and Kelly is not at work false. But it is hard to see what this comes to without further clarification of the relevant notion of possibility. For example, consider the following pairs of sentences.

Kelly kissed her sister at 2:00pm.
2:00pm is not a time during which Kelly
and her sister were 100 miles apart.
Kelly is a female.
Kelly is not the US President.
There is a chimp in Paige’s house.
There is a primate in Paige’s house.
Ten is a prime number.
Ten is greater than nine.

For each pair of sentences, there is a sense in which it is not possible for the first to be true and the second false. At the very least an account of logical consequence must distinguish logical possibility from other types of possibility. Should truths about physical laws, US political history, zoology, and mathematics constrain what we take to be possible in determining whether or not the first sentence of each pair could logically be true with the second sentence false? If not, then this seems to mystify logical possibility (e.g., how could ten be a prime number?). To paraphrase questions asked by G.E. Moore (1959, pp. 231-238), given that I know that George W. Bush is US President and that he is not a female named Kelly, isn’t it inconsistent for me to grant the logical possibility of the truth of Kelly is a female and the falsity of Kelly is not the US President? Or should I ignore my present state of knowledge in considering what is logically possible? Tarski does not derive a clear notion of logical possibility from the common concept of logical consequence. Perhaps there is none to be had, and we should seek the help of a proper theoretical development in clarifying the notion of logical possibility. Towards this end, let’s turn to the other features of logical consequence highlighted by Tarski, starting with the formality criterion of logical consequence.

ii. The Logical Consequence Relation is Formal

Tarski observes that logical consequence is a formal consequence relation. And he tells us that a formal consequence relation is a consequence relation that is uniquely determined by the form of the sentences between which it holds. Consider the following pair of sentences

(1) Some children are both lawyers and peacemakers.
(2) Some children are peacemakers

Intuitively, (2) is a logical consequence of (1). It appears that this fact does not turn on the subject matter of the sentences. Replace ‘children’, ‘lawyers’, and ‘peacemakers’ in (1) and (2) with the variables S, M, and P to get the following.

(1′) Some S are both M and P
(2′) Some S are P

(1′) and (2′) are forms of (1) and (2), respectively. Note that there is no interpretation of S, M, and P according to which the sentence that results from (1′) is true and the resulting instance of (2′) is false. Hence, (2) is a formal consequence of (1) and on each interpretation of S, M, and P the resulting (2′) is a formal consequence of the sentence that results from (1′) (e.g., Some clowns are sad is a formal consequence of Some clowns are both lonely and sad). Tarski’s observation is that for any sentence X and set K of sentences, X is a logical consequence of K only if X is a formal consequence of K. The formality criterion of logical consequence can work in explaining why one sentence doesn’t entail another in cases where it seems impossible for the first to be true and the second false. For example, (3) is false and (4) is true.

(3) Ten is a prime number
(4) Ten is greater than nine

Does (4) follow from (3)? One might think that (4) does not follow from (3) because being a prime number does not necessitate being greater than nine. However, this does not require one to think that ten could be a prime number and less than or equal to nine, which is probably a good thing since it is hard to see how this is possible. Rather, we take

(3′) a is a P
(4′) a is R than b

to be the forms of (5) and (6) and note that there are interpretations of ‘a’, ‘b’, ‘P’, and ‘R’ according to which the first is true and the second false (e.g. let ‘a’ and ‘b’ name the numbers two and ten, respectively, and let ‘P’ mean prime number, and ‘R’ greater). Note that the claim here is not that formality is sufficient for a consequence relation to qualify as logical but only that it is a necessary condition. I now elaborate on this last point by saying a little more about forms of sentences (that is, sentential forms) and formal consequence.

Distinguishing between a term of a sentence replaced with a variable and one held constant determines a form of the sentence. In Some children are both lawyers and peacemakers we may replace ‘Some’ with a variable and treat all the other terms as constant. Then

(1”) D children are both lawyers and peacemakers

is a form of (1), and each sentence generated by assigning a meaning to D shares this form with (1). For example, the following three sentences are instances of (1”), produced by interpreting D as ‘No’, ‘Many’, and ‘Few’.

No children are both lawyers and peacemakers
Many children are both lawyers and peacemakers
Few children are both lawyers and peacemakers

Whether X is a formal consequence of K then turns on a prior selection of terms as constant and others replaced with variables. Relative to such a determination, X is a formal consequence of K if and only if (iff) there is no interpretation of the variables according to which each of the K are true and X is false. So, taking all the terms, except for ‘Some’, in (1) Some children are both philosophers and peacemakers and in (2) Some children are peacemakers as constants makes the following forms of (1) and (2).

(1”) D children are both lawyers and peacemakers
(2”) D children are peacemakers

Relative to this selection, (2) is not a formal consequence of (1) because replacing ‘D’ with ‘No’ yields a true instance of (1”) and a false instance of (2”).

Consider the following pair.

(5) Kelly is female
(6) Kelly is not US President

(6) is a formal consequence of (5) relative to replacing ‘Kelly’ with a variable. Given current U.S. political history, there is no individual whose name yields a true (5) and a false (6) when it replaces ‘Kelly’. This is not, however, sufficient reason for seeing (6) as a logical consequence of (5). There are two ways of thinking about why, a metaphysical consideration and an epistemological one. First the metaphysical consideration. It seems possible for (5) to be true and (6) false. The course of U.S. political history could have turned out differently. One might think that the current US President could–logically–have been a female named, say, ‘Sally’. Using ‘Sally’ as a replacement for ‘Kelly’ would yield in that situation a true (5) and a false (6). Also, it seems possible that in the future there will be a female US President. In order for a formal consequence relation from K to X to qualify as logical it has to be the case that it is necessary that there is no interpretation of the variables in K and X according to which the K-sentences are true and X is false.

The epistemological consideration is that one might think that knowledge that X follows logically from K should not essentially depend on being justified by experience of extra-linguistic states of affairs. Clearly, the determination that (6) follows formally from (5) essentially turns on empirical knowledge, specifically knowledge about the current political situation in the US. This leads to the final highlight of Tarski’s rendition of the intuitive concept of logical consequence: that logical consequence cannot be influenced by empirical knowledge.

iii. The Logical Consequence Relation is A Priori

Tarski says that by virtue of being formal, knowledge that X follows logically from K cannot be affected by knowledge of the objects that X and the sentences of K are about. Hence, our knowledge that X is a logical consequence of K cannot be influenced by empirical knowledge. However, as noted above, formality by itself does not insure that the extension of a consequence relation is not influenced by empirical knowledge. So, let’s view this alleged feature of logical consequence as independent of formality. We characterize empirical knowledge in two steps as follows. First, a priori knowledge is knowledge “whose truth, given an understanding of the terms involved, is ascertainable by a procedure which makes no reference to experience” (Hamlyn 1967, p. 141). Empirical, or a posteriori, knowledge is knowledge that is not a priori, that is, knowledge whose validation necessitates a procedure that does make reference to experience. We can safely read Tarski as saying that a consequence relation is logical only if knowledge that something falls in its extension is a priori, that is, only if the relation is a priori. Knowledge of physical laws, a determinant in people’s observed sizes, is not a priori and such knowledge is required to know that there is no interpretation of k, h, and t according to which (7) is true and (8) false.

(7) k kissed h at time t
(8) t is not a time during which k and h were 100 miles apart

So (8) cannot be a logical consequence of (7). However, my knowledge that Kelly is not Paige’s only friend follows from Kelly is taller than Paige’s only friend is a priori since I know a priori that nobody is taller than herself.

Let’s summarize and tie things together. We began by asking, for a given language L, what conditions must be met in order for a sentence X of L to be a logical consequence of a class K of L-sentences? Tarski thinks that an adequate response must reflect the common concept of logical consequence, that is, the concept as it is ordinarily employed. By the lights of this concept, an adequate account of logical consequence must reflect the formality and necessity of logical consequence, and must also reflect the fact that knowledge of what follows logically from what is a priori. Tying the criteria together, in order to fix what follows logically from what in a given language L, we must select a class of constants that determines a formal consequence relation that is both necessary and known, if at all, a priori. Such constants are called logical constants, and we say that the logical form of a sentence is a function of the logical constants that occur in the sentence and the pattern of the remaining expressions. As was illustrated above, the notion of formality does not presuppose a criterion of logical constancy. A consequence relation based on any division between constants and terms replaced with variables will automatically be formal with respect to the latter.

b. Logical and Non-Logical Terminology

Tarski’s basic move from his rendition of the common concept of logical consequence is to distinguish between logical terms and non-logical terms and then say that X is a logical consequence of K only if there is no possible interpretation of the non-logical terms of the language L that makes all of the sentences in K true and X false. The choice of the right terms as logical will reflect the modal element in the concept of logical consequence, that is, will insure that there is no ‘possible’ interpretation of the variable, non-logical terms of the language L that makes all of the K true and X false, and will insure that this is known a priori. Of course, we have yet to spell out the modal notion in the concept of logical consequence. Tarski pretty much left this underdeveloped in his (1936). Lacking such an explanation hampers our ability to clarify the rationale for a selection of terms to serve as the logical ones.

Traditionally, logicians have regarded sentential connectives such as and, not, or, ifthen, the quantifiers all and some, and the identity predicate ‘=’ as logical terms. Remarking on the boundary between logical and non-terms, Tarski (1936, p. 419) writes the following.

Underlying this characterization of logical consequence is the division of all terms of the language discussed into logical and extra-logical. This division is not quite arbitrary. If, for example, we were to include among the extra-logical signs the implication sign, or the universal quantifier, then our definition of the concept of consequence would lead to results which obviously contradict ordinary usage. On the other hand, no objective grounds are known to me which permit us to draw a sharp boundary between the two groups of terms. It seems to be possible to include among logical terms some which are usually regarded by logicians as extra-logical without running into consequences which stands in sharp contrast to ordinary usage.

Tarski seems right to think that the logical consequence relation turns on the work that the logical terminology does in the relevant sentences. It seems odd to say that Kelly is happy does not logically follow from All are happy because the second is true and the first false when All is replaced with Few. However, by Tarski’s version of the ordinary concept of logical consequence there is no reason not to treat say taller than as a logical term along with not and, therefore, no reason not to take Kelly is not taller than Paige as following logically from Paige is taller than Kelly. Also, it seems plausible to say that I know a priori that there is no possible interpretation of Kelly and is mortal according to which it is necessary that Kelly is mortal is true and Kelly is mortal is false. This makes Kelly is mortal a logical consequence of it is necessary that Kelly is mortal. Given that taller than and it is necessary that, along with other terms, were not generally regarded as logical terms by logicians of Tarski’s day, the fact that they seem to be logical terms by the common concept of logical consequence, as observed by Tarski, highlights the question of what it takes to be a logical term. Tarski says that future research will either justify the traditional boundary between the logical and the non-logical or conclude that there is no such boundary and the concept of logical consequence is a relative concept whose extension is always relative to some selection of terms as logical (p. 420). For further discussion of Tarski’s views on logical terminology and contemporary views see Logical Consequence, Model-Theoretic Conceptions: Section 5.3.

How, exactly, does the terminology usually regarded by logicians as logical work in making it the case that one sentence follows from others? In the next two sections two distinct approaches to understanding the nature of logical terms are sketched. Each approach leads to a unique way of characterizing logical consequence and thus yields a unique response to the above question.

i. The Nature of Logical Constants Explained in Terms of Their Semantic Properties

Consider the following metaphor, borrowed from Bencivenga (1999).

The locked room metaphor

Suppose that you are locked in a dark windowless room and you know everything about your language but nothing about the world outside. A sentence X and a class K of sentences are presented to you. If you can determine that X is true if all the sentences in K are, X is a logical consequence of K.

Ignorant of US politics, I couldn’t determine the truth of Kelly is not US President solely on the basis of Kelly is a female. However, behind such a veil of ignorance I would be able to tell that Kelly is not US President is true if Kelly is female and Kelly is not US President is true. How? Short answer: based on my linguistic competence; longer answer: based on my understanding of the semantic contribution of and to the determination of the truth conditions of a sentence of the form P and Q. For any sentences P and Q, I know that P and Q is true just in case P is true and Q is true. So, I know, a priori, if P and Q is true, then Q is true. As noted by one philosopher, “This really is remarkable since, after all, it’s what they mean, together with the facts about the non-linguistic world, that decide whether P or Q are true” (Fodor 2000, p.12).

Taking not and and to be the only logical constants in (9) Kelly is not both at home and at work, (10) Kelly is at home, and (11) Kelly is not at work, we formalize the sentences as follows, letting k mean Kelly, H mean is at home, and W mean is at work.

(9′) not-(Hk and Wk)
(10′) Hk
(11′) not-Wk

There is no interpretation of k, H, and W according to which (9′) and (10′) are true and (11′) is false. The reason why turns on the semantic properties of and and not, which are knowable a priori. Suppose (9′) and (10′) are true on some interpretation of the variable terms. Then the meaning of not in (9′) makes it the case that Hk and Wk is false, which, by the meaning of and requires that Hk is false or Wk is false. Given (10′), it must be that Wk is false, that is, not-Wk is true. So, there can’t be an interpretation of the variable terms according to which (9′) and (10′) are true and (11′) is false, and, as the above reasoning illustrates, this is due exclusively to the semantic properties of not and and. So the reason that it is impossible that an interpretation of k, H, and W make (9′) and (10′) true and (11′) false is that the supposition otherwise is inconsistent with the semantic functioning of not and and. Compare: the supposition that there is an interpretation of k according to which k is a female is true and k is not US President is false does not seem to violate the semantic properties of the constant terms. If we identify the meanings of the predicates with their extensions in all possible worlds, then the supposition that there is a female U.S. President does not violate the meanings of female and US President for surely it is possible that there be a female US President. But, supposing that (9′) and (10′) could be true with (11′) false on some interpretation of k, H, and W, violates the semantic properties of either and or not.

In sum, our first-step characterization of logical consequence is the following. For a given language L,

X is a logical consequence of K if and only if there is no possible interpretation of the non-logical terminology of L according to which all the sentence in K are true and X is false.

A possible interpretation of the non-logical terminology of the language L according to which sentences are true or false is a reading of the non-logical terms according to which the sentences receive a truth-value (that is, is either true or false) in a situation that is not ruled out by the semantic properties of the logical constants. The philosophical locus of the technical development of ‘possible interpretation’ in terms of models is Tarski (1936). A model for a language L is the theoretical development of a possible interpretation of non-logical terminology of L according to which the sentences of L receive a truth-value. Models have become standard tools for characterizing the logical consequence relation, and the characterization of logical consequence in terms of models is called the Tarskian or model-theoretic characterization of logical consequence. We say that X is a model-theoretic consequence of K if and only if all models of K are models of X. This relation may be represented as K ⊨ X. If model-theoretic consequence is adequate as a representation of logical consequence, then it must reflect the salient features of the common concept, which, according to Tarski means that it must be necessary, formal and a priori.

For further discussion of this conception of logical consequence, see the article, Logical Consequence, Model-Theoretic Conceptions.

ii. The Nature of Logical Constants Explained in Terms of Their Inferential Properties

We now turn to a second approach to understanding logical constants. Instead of understanding the nature of logical constants in terms of their semantic properties as is done on the model-theoretic approach, on the second approach we appeal to their inferential properties conceived of in terms of principles of inference, that is, principles justifying steps in deductions. We begin with a remark made by Aristotle. In his study of logical consequence, Aristotle comments that

A syllogism is discourse in which, certain things being stated, something other than what is stated follows of necessity from their being so. I mean by the last phrase that they produce the consequence, and by this, that no further term is required from without in order to make the consequence necessary. (Prior Analytics, p. 24b)

Adapting this to our X and K, we may say that X is a logical consequence of K when the sentences of K are sufficient to produce X. How are we to think of a sentence being produced by others? One way of developing this is to appeal to a notion of an actual or possible deduction. X is a deductive consequence of K if and only if there is a deduction of X from K. In such a case, we say that X may be correctly inferred from K or that it would be correct to conclude X from K. A deduction is associated with a pair ; the set K of sentences is the basis of the deduction, and X is the conclusion. A deduction from K to X is a finite sequence S of sentences ending with X such that each sentence in S (that is, each intermediate conclusion) is derived from a sentence (or more) in K or from previous sentences in S in accordance with a correct principle of inference.

For example, intuitively, the following inference seems correct.

  Kelly is not both at home and at work
  Kelly is at home
(therefore) Kelly is not at work

The set K of sentences above the line is the basis of the inference and the sentence X below is the conclusion. We represent their logical forms, again, as follows.

  (9′) not-(Hk and Wk)
  (10′) Hk
(therefore) (11′) not-Wk

Consider the following deduction of (11′) from (10′) and (9′).

Deduction: Assume that (12′) Wk. Then from (10′) and (12′) we may deduce that (13′) Hk and Wk. (13′) contradicts (9′) and so (12′), our initial assumption, must be false. We have deduced not-Wk from not-(Hk and Wk) and Hk.

Since the deduction of not-Wk from not-(Hk and Wk) and Hk did not depend on the interpretation of k, W, and H, the deductive relation is formal. Furthermore, my knowledge of this is a priori because my knowledge of the underlying principles of inference in the above deduction is not empirical. For example, letting P and Q be any sentences, we know a priori that P and Q may be inferred from the set K={P, Q} of basis sentences. This principle grounds the move from (10′) and (12′) to (13′). Also, the deduction appeals to the principle that if we deduce a contradiction from an assumption, then we may infer that the assumption is false. The correctness of this principle seems to be an a priori matter. Let’s look at another example of a deduction.

  (1) Some children are both lawyers and peacemakers
(therefore) (2) Some children are peacemakers

The logical forms are, again, the following.

  (1′) Some S are both M and P
(therefore) (2′) Some S are P

Again, intuitively, (2′) is deducible from (1′).

Deduction: The basis tells us that at least one S–let’s call this Sa‘–is both an M and a P. Clearly, a is a P may be deduced from a is both an M and a P. Since we’ve assumed that a is an S, what we derive with respect to a we derive with respect to some S. So our derivation of a is a P is a derivation of Some S is a P, which is our desired conclusion.

Since the deduction is formal, we have shown not merely that (2) can be correctly inferred from (1), but we have shown that for any interpretation of S, M, and P it is correct to infer (2′) from (1′).

Typically, deductions leave out steps (perhaps because they are too obvious), and they usually do not justify each and every step made in moving towards the conclusion (again, obviousness begets brevity). The notion of a deduction is made precise by describing a mechanism for constructing deductions that are both transparent and rigorous (each step is explicitly justified and no steps are omitted). This mechanism is a deductive system (also known as a formal system or as a formal proof calculus). A deductive system D is a collection of rules that govern which sequences of sentences, associated with a given , are allowed and which are not. Such a sequence is called a proof in D (or, equivalently, a deduction in D) of X from K. The rules must be such that whether or not a given sequence associated with qualifies as a proof in D of X from K is decidable purely by inspection and calculation. That is, the rules provide a purely mechanical procedure for deciding whether a given object is a proof in D of X from K.

We say that a deductive system D is correct when for any K and X, proofs in D of X from K correspond to intuitively valid deductions. For example, intuitively, there are no correct principles of inference according to which it is correct to conclude

Some animals are both mammals and reptiles

on the basis of the following two sentences.

Some animals are mammals
Some animals are reptiles

Hence, a proof in a deductive system of the former sentence from the latter two is evidence that the deductive system is incorrect. The point here is that a proof in D may fail to represent a deduction if D is incorrect.

A rich variety of deductive systems have been developed for registering deductions. Each system has its advantages and disadvantages, which are assessed in the context of the more specific tasks the deductive system is designed to accomplish. Historically, the general purpose of the construction of deductive systems was to reduce reasoning to precise mechanical rules (Hodges 1983, p. 26). Some view a deductive system defined for a language L as a mathematical model of actual or possible chains of correct reasoning in L. Sundholm (1983) offers a thorough survey of three main types of deductive systems. For a shorter, excellent introduction to the concept of a deductive system see Henkin (1967). A deductive system is developed in detail in the accompanying article, Logical Consequence, Deductive-Theoretic Conceptions.

If there is a proof of X from K in deductive system D, then we may say that X is a deductive consequence in D of K, which is sometimes expressed as K ⊢D X. Relative to a correct deductive system D, we characterize logical consequence in terms of deductive consequence as follows.

X is a logical consequence of K if and only if X is a deductive consequence in D of K, that is, there is an actual or possible proof in D of X from K.

This is called the deductive-theoretic (or proof-theoretic) characterization of logical consequence.

3. Model-Theoretic and Deductive-Theoretic Conceptions of Logic

We began with Tarski’s observations of the common or ordinary concept of logical consequence that we employ in daily life. According to Tarski, if X is a logical consequence of a set of sentences, K, then, in virtue of the logical forms of the sentences involved, if all of the members of K are true, then X must be true, and furthermore, we know this a priori. The formality criterion makes the logical constants the essential determinant of the logical consequence relation. The logical consequence relation is fixed exclusively in terms of the nature of the logical terminology. We have highlighted two different approaches to the nature of a logical constant: (1) in terms of its semantic contribution to sentences in which it occurs and (2) in terms of its inferential properties. The two approaches yield distinct conceptions of the notion of necessity inherent in the common concept of logical consequence, and lead to the following characterizations of logical consequence.

(1) X is a logical consequence of K if and only if there is no possible interpretation of the non-logical terminology of the language according to which all the sentences in K are true and X is false.

(2) X is a logical consequence of K if and only if X is deducible from K.

We make the notions of possible interpretation in (1) and deducibility in (2) precise by appealing to the technical notions of model and deductive system. This leads to the following theoretical characterizations of logical consequence.

(1) The model-theoretic characterization of logical consequence: X is a logical consequence of K iff all models of K are models of X.

(2) The deductive- theoretic characterization of logical consequence: X is a logical consequence of K iff there is a deduction in a correct deductive system of X from K.

Following Shapiro (1991, p. 3) define a logic to be a language L plus either a model-theoretic or a deductive-theoretic account of logical consequence. A language with both characterizations is a full logic just in case both characterizations coincide. A soundness proof establishes K ⊢D X only if K ⊨ X, and a completeness proof establishes K ⊢D X if K ⊨ X. These proofs together establish that the two characterizations coincide, and in such a case the deductive system D is said to be complete and sound with respect to the model-theoretic consequence relation defined for the relevant language L.

We said that the primary aim of logic is to tell us what follows logically from what. These two characterizations of logical consequence lead to two different orientations or conceptions of logic (see Tharp 1975, p. 5).

Model-theoretic approach: Logic is a theory of possible interpretations. For a given language the class of situations that can–logically–be described by that language.

Deductive-theoretic approach: Logic is a theory of formal deductive inference.

The article now concludes by highlighting three considerations relevant to evaluating a particular deployment of the model-theoretic or deductive-theoretic definition in defining logical consequence. These considerations emerge from the above development of the two theoretic definitions from the common concept of logical consequence.

4. Conclusion

The two theoretical characterizations of logical consequence do not provide the means for drawing a boundary in a language L between logical and non-logical terms. Indeed, their use presupposes that a list of logical terms is in hand. Hence, in evaluating a model-theoretic or deductive-theoretic definition of logical consequence for a language L the issue arises whether or not the boundary in L between logical and non-logical terms has been correctly drawn. This requires a response to a central question in the philosophy of logic: what qualifies as a logical constant? Tarski gives a well-reasoned response in his (1986). (For more recent discussion see McCarthy 1981 and 1998, Hanson 1997, and Warbrod 1999.)

A second thing to consider in evaluating a theoretical account of logical consequence is whether or not its characterization of the logical terminology is accurate. For example, model-theoretic and deductive accounts of logical consequence are inadequate unless they reflect the semantic and inferential properties of the logical terms, respectively. So a model-theoretic account is inadequate unless it gets right the semantic contributions of the logical terms to the truth conditions of the sentences formed using them. For a particular deductive system D, the question arises whether or not D’s rules of inference reflect the inferential properties of the logical terms. (For further discussion of the semantic and inferential properties of logical terms see Haack 1978 and 1996, Read 1995, and Quine 1986.)

A third consideration in assessing the success of a theoretical definition of logical consequence is whether or not the definition, relative to a selection of terms as logical, reflects the salient features of the common concept of logical consequence. There are criticisms of the theoretical definitions that claim that they are incapable of reflecting the common concept of logical consequence. Typically, such criticisms are used to question the status of the model-theoretic and deductive-theoretic approaches to logic.

For example, there are critics who question the model-theoretic approach to logic by arguing that any model-theoretic account lacks the conceptual resources to reflect the notion of necessity inherent in the common concept of logical consequence because such an account does not rule out the possibility of there being logically possible situations in which sentences in K are true and X is false even though every model of K is a model of X. Kneale (1961) is an early critic, Etchemendy (1988, 1999) offers a sustained and multi-faceted attack. Also, it is argued that the model-theoretic approach to logic makes knowledge of what follows from what depend on knowledge of the existence of models, which is knowledge of worldly matters of fact. But logical knowledge should not depend on knowledge about the extra-linguistic world (recall the locked room metaphor in 2.2.1). This standard logical positivist line has been recently challenged by those who see logic penetrated and permeated by metaphysics (e.g., Putnam 1971, Almog 1989, Sher 1991, Williamson 1999).

The status of the deductive-theoretic approach to logic is not clear for, as Tarski argues in his (1936), deductive-theoretic accounts are unable to reflect the fact that, according to the common concept, logical consequence is not compact. Relative to any deductive system D, the ⊢D-consequence relation is compact if and only if for any sentence X and set K of sentences, if K ⊢D X, then K’ ⊢D X, where K’ is a finite subset of sentences from K. But there are intuitively correct principles of inference according to which one may infer a sentence X from a set K of sentences, even though it is incorrect to infer X from any finite subset of K. This suggests that the intuitive notion of deducibility is not completely captured by any compact consequence relation. We need to weaken

X is a logical consequence of K if and only if there is a proof in a correct deductive system of X from K,

given above, to

X is a logical consequence of K if there is a proof in a correct deductive system of X from K.

In sum, the issue of the nature of logical consequence, which intersects with other areas of philosophy, is still a matter of debate. Tarski’s analysis of the concept is not universally accepted; philosophers and logicians differ over what the features of the common concept are. For example, some offer accounts of the logical consequence relation according to which it is not a priori (e.g., see Koslow 1999, Sher 1991 and see Hanson 1997 for criticism of Sher) or deny that it even need be strongly necessary (Smiley 1995, 2000, section 6). The entry Logical Consequence, Model-Theoretic Conceptions gives a model-theoretic definition of logical consequence. For a detailed development of a deductive system see the entry Logical Consequence, Deductive-Theoretic Conceptions. The critical discussion in both articles deepens and extends points made in the conclusion of this article.

5. References and Further Reading

  • Almog, J. (1989): “Logic and the World”, pp. 43-65 in Themes From Kaplan, ed. J. Almog, J. Perry, J., and H. Wettstein. New York: Oxford UP.
  • Aristotle. (1941): Basic Works, ed. R. McKeon. New York: Random House.
  • Bencivenga, E. (1999): “What is Logic About?”, pp. 5-19 in Varzi (1999).
  • Etchemendy, J. (1983): “The Doctrine of Logic as Form”, Linguistics and Philosophy 6, pp. 319-334.
  • Etchemendy, J. (1988): “Tarski on truth and logical consequence”, Journal of Symbolic Logic 53, pp. 51-79.
  • Etchemendy, J. (1999): The Concept of Logical Consequence. Stanford: CSLI Publications.
  • Fodor, J. (2000): The Mind Doesn’t Work That Way. Cambridge: The MIT Press.
  • Gabbay, D. and F. Guenthner, eds. (1983): Handbook of Philosophical Logic, Vol 1. Dordrecht: D. Reidel Publishing Company.
  • Haack, S. (1978): Philosophy of Logics . Cambridge: Cambridge University Press.
  • Haack, S. (1996): Deviant Logic, Fuzzy Logic. Chicago: The University of Chicago Press.
  • Hodges, W. (1983): “Elementary Predicate Logic”, in Gabbay, D. and F. Guenthner (1983).
  • Hamlyn, D.W. (1967): “A Priori and A Posteriori”, pp.105-109 in The Encyclopedia of Philosophy, Vol. 1, ed. P. Edwards. New York: Macmillan & The Free Press.
  • Hanson, W. (1997): “The Concept of Logical Consequence”, The Philosophical Review 106, pp. 365-409.
  • Henkin, L. (1967): “Formal Systems and Models of Formal Systems”, pp. 61-74 in The Encyclopedia of Philosophy, Vol. 8, ed. P. Edwards. New York: Macmillan & The Free Press.
  • Kneale, W. (1961): “Universality and Necessity”, British Journal for the Philosophy of Science 12, pp. 89-102.
  • Koslow, A. (1999): “The Implicational Nature of Logic: A Structuralist Account”, pp. 111-155 in Varzi (1999).
  • McCarthy, T. (1981): “The Idea of a Logical Constant”, Journal of Philosophy 78, pp. 499-523.
  • McCarthy, T. (1998): “Logical Constants”, pp. 599-603 in Routledge Encyclopedia of Philosophy, Vol. 5, ed. E. Craig. London: Routledge.
  • McGee, V. (1999): “Two Problems with Tarski’s Theory of Consequence”, Proceedings of the Aristotelean Society 92, pp. 273-292.
  • Moore, G.E., (1959): “Certainty”, pp. 227-251 in Philosophical Papers. London: George Allen & Unwin.
  • Priest. G. (1995): “Etchemendy and Logical Consequence”, Canadian Journal of Philosophy 25, pp. 283-292.
  • Putnam, H. (1971): Philosophy of Logic. New York: Harper & Row.
  • Quine, W.V. (1986): Philosophy of Logic, 2nd ed.. Cambridge: Harvard UP.
  • Read, S. (1995): Thinking About Logic. Oxford: Oxford UP.
  • Shapiro, S. (1991): Foundations without Foundationalism: A Case For Second-Order Logic. Oxford: Clarendon Press.
  • Shapiro, S. (1993): ” Modality and Ontology”, Mind 102, pp. 455-481.
  • Shapiro, S. (1998): “Logical Consequence: Models and Modality”, pp. 131-156 in The Philosophy of Mathematics Today, ed. Matthias Schirn. Oxford, Clarendon Press.
  • Shapiro, S. (2000): Thinking About Mathematics , Oxford: Oxford University Press.
  • Sher, G. (1989): “A Conception of Tarskian Logic”, Pacific Philosophical Quarterly 70, pp. 341-368.
  • Sher, G. (1991): The Bounds of Logic: A Generalized Viewpoint, Cambridge, MA: The MIT Press.
  • Sher, G. (1996): “Did Tarski commit ‘Tarski’s fallacy’?” Journal of Symbolic Logic 61, pp. 653-686.
  • Sher, G. (1999): “Is Logic a Theory of the Obvious?”, pp. 207-238 in Varzi (1999).
  • Smiley, T. (1995): “A Tale of Two Tortoises”, Mind 104, pp. 725-36.
  • Smiley, T. (1998): “Consequence, Conceptions of”, pp. 599-603 in Routledge Encyclopedia of Philosophy, vol. 2, ed. E. Craig. London: Routledge.
  • Sundholm, G. (1983): “Systems of Deduction”, in Gabbay and Guenthner (1983).
  • Tarski, A. (1933): “Pojecie prawdy w jezykach nauk dedukeycyjnych”, translated as “On the Concept of Truth in Formalized Languages”, pp. 152-278 in Tarski (1983).
  • Tarski, A. (1936): “On the Concept of Logical Consequence”, pp. 409-420 in Tarski (1983).
  • Tarski, A. (1983): Logic, Semantics, Metamathematics, 2nd ed. Indianapolis: Hackett Publishing.
  • Tarski, A. (1986): “What are logical notions?” History and Philosophy of Logic 7, pp. 143-154.
  • Tharp, L. (1975): “Which Logic is the Right Logic?” Synthese 31, pp. 1-21.
  • Warbrod, K., (1999): “Logical Constants” Mind 108, pp. 503-538.
  • Williamson, T. (1999): “Existence and Contingency”, Proceedings of the Aristotelian Society Supplementary Vol. 73, pp. 181-203.
  • Varzi, A., ed. (1999): European Review of Philosophy, Vol. 4: The Nature of Logic, Stanford: CSLI Publications.

Author Information

Matthew McKeon
Email: mckeonm@msu.edu
Michigan State University
U. S. A.

Literary Theory

“Literary theory” is the body of ideas and methods we use in the practical reading of literature. By literary theory we refer not to the meaning of a work of literature but to the theories that reveal what literature can mean. Literary theory is a description of the underlying principles, one might say the tools, by which we attempt to understand literature. All literary interpretation draws on a basis in theory but can serve as a justification for very different kinds of critical activity. It is literary theory that formulates the relationship between author and work; literary theory develops the significance of race, class, and gender for literary study, both from the standpoint of the biography of the author and an analysis of their thematic presence within texts. Literary theory offers varying approaches for understanding the role of historical context in interpretation as well as the relevance of linguistic and unconscious elements of the text. Literary theorists trace the history and evolution of the different genres—narrative, dramatic, lyric—in addition to the more recent emergence of the novel and the short story, while also investigating the importance of formal elements of literary structure. Lastly, literary theory in recent years has sought to explain the degree to which the text is more the product of a culture than an individual author and in turn how those texts help to create the culture.

Table of Contents

  1. What Is Literary Theory?
  2. Traditional Literary Criticism
  3. Formalism and New Criticism
  4. Marxism and Critical Theory
  5. Structuralism and Poststructuralism
  6. New Historicism and Cultural Materialism
  7. Ethnic Studies and Postcolonial Criticism
  8. Gender Studies and Queer Theory
  9. Cultural Studies
  10. References and Further Reading
    1. General Works on Theory
    2. Literary and Cultural Theory

1. What Is Literary Theory?

“Literary theory,” sometimes designated “critical theory,” or “theory,” and now undergoing a transformation into “cultural theory” within the discipline of literary studies, can be understood as the set of concepts and intellectual assumptions on which rests the work of explaining or interpreting literary texts. Literary theory refers to any principles derived from internal analysis of literary texts or from knowledge external to the text that can be applied in multiple interpretive situations. All critical practice regarding literature depends on an underlying structure of ideas in at least two ways: theory provides a rationale for what constitutes the subject matter of criticism—”the literary”—and the specific aims of critical practice—the act of interpretation itself. For example, to speak of the “unity” of Oedipus the King explicitly invokes Aristotle’s theoretical statements on poetics. To argue, as does Chinua Achebe, that Joseph Conrad’s The Heart of Darkness fails to grant full humanity to the Africans it depicts is a perspective informed by a postcolonial literary theory that presupposes a history of exploitation and racism. Critics that explain the climactic drowning of Edna Pontellier in The Awakening as a suicide generally call upon a supporting architecture of feminist and gender theory. The structure of ideas that enables criticism of a literary work may or may not be acknowledged by the critic, and the status of literary theory within the academic discipline of literary studies continues to evolve.

Literary theory and the formal practice of literary interpretation runs a parallel but less well known course with the history of philosophy and is evident in the historical record at least as far back as Plato. The Cratylus contains a Plato’s meditation on the relationship of words and the things to which they refer. Plato’s skepticism about signification, i.e., that words bear no etymological relationship to their meanings but are arbitrarily “imposed,” becomes a central concern in the twentieth century to both “Structuralism” and “Poststructuralism.” However, a persistent belief in “reference,” the notion that words and images refer to an objective reality, has provided epistemological (that is, having to do with theories of knowledge) support for theories of literary representation throughout most of Western history. Until the nineteenth century, Art, in Shakespeare’s phrase, held “a mirror up to nature” and faithfully recorded an objectively real world independent of the observer.

Modern literary theory gradually emerges in Europe during the nineteenth century. In one of the earliest developments of literary theory, German “higher criticism” subjected biblical texts to a radical historicizing that broke with traditional scriptural interpretation. “Higher,” or “source criticism,” analyzed biblical tales in light of comparable narratives from other cultures, an approach that anticipated some of the method and spirit of twentieth century theory, particularly “Structuralism” and “New Historicism.” In France, the eminent literary critic Charles Augustin Saint Beuve maintained that a work of literature could be explained entirely in terms of biography, while novelist Marcel Proust devoted his life to refuting Saint Beuve in a massive narrative in which he contended that the details of the life of the artist are utterly transformed in the work of art. (This dispute was taken up anew by the French theorist Roland Barthes in his famous declaration of the “Death of the Author.” See “Structuralism” and “Poststructuralism.”) Perhaps the greatest nineteenth century influence on literary theory came from the deep epistemological suspicion of Friedrich Nietzsche: that facts are not facts until they have been interpreted. Nietzsche’s critique of knowledge has had a profound impact on literary studies and helped usher in an era of intense literary theorizing that has yet to pass.

Attention to the etymology of the term “theory,” from the Greek “theoria,” alerts us to the partial nature of theoretical approaches to literature. “Theoria” indicates a view or perspective of the Greek stage. This is precisely what literary theory offers, though specific theories often claim to present a complete system for understanding literature. The current state of theory is such that there are many overlapping areas of influence, and older schools of theory, though no longer enjoying their previous eminence, continue to exert an influence on the whole. The once widely-held conviction (an implicit theory) that literature is a repository of all that is meaningful and ennobling in the human experience, a view championed by the Leavis School in Britain, may no longer be acknowledged by name but remains an essential justification for the current structure of American universities and liberal arts curricula. The moment of “Deconstruction” may have passed, but its emphasis on the indeterminacy of signs (that we are unable to establish exclusively what a word means when used in a given situation) and thus of texts, remains significant. Many critics may not embrace the label “feminist,” but the premise that gender is a social construct, one of theoretical feminisms distinguishing insights, is now axiomatic in a number of theoretical perspectives.

While literary theory has always implied or directly expressed a conception of the world outside the text, in the twentieth century three movements—”Marxist theory” of the Frankfurt School, “Feminism,” and “Postmodernism”—have opened the field of literary studies into a broader area of inquiry. Marxist approaches to literature require an understanding of the primary economic and social bases of culture since Marxist aesthetic theory sees the work of art as a product, directly or indirectly, of the base structure of society. Feminist thought and practice analyzes the production of literature and literary representation within the framework that includes all social and cultural formations as they pertain to the role of women in history. Postmodern thought consists of both aesthetic and epistemological strands. Postmodernism in art has included a move toward non-referential, non-linear, abstract forms; a heightened degree of self-referentiality; and the collapse of categories and conventions that had traditionally governed art. Postmodern thought has led to the serious questioning of the so-called metanarratives of history, science, philosophy, and economic and sexual reproduction. Under postmodernity, all knowledge comes to be seen as “constructed” within historical self-contained systems of understanding. Marxist, feminist, and postmodern thought have brought about the incorporation of all human discourses (that is, interlocking fields of language and knowledge) as a subject matter for analysis by the literary theorist. Using the various poststructuralist and postmodern theories that often draw on disciplines other than the literary—linguistic, anthropological, psychoanalytic, and philosophical—for their primary insights, literary theory has become an interdisciplinary body of cultural theory. Taking as its premise that human societies and knowledge consist of texts in one form or another, cultural theory (for better or worse) is now applied to the varieties of texts, ambitiously undertaking to become the preeminent model of inquiry into the human condition.

Literary theory is a site of theories: some theories, like “Queer Theory,” are “in;” other literary theories, like “Deconstruction,” are “out” but continue to exert an influence on the field. “Traditional literary criticism,” “New Criticism,” and “Structuralism” are alike in that they held to the view that the study of literature has an objective body of knowledge under its scrutiny. The other schools of literary theory, to varying degrees, embrace a postmodern view of language and reality that calls into serious question the objective referent of literary studies. The following categories are certainly not exhaustive, nor are they mutually exclusive, but they represent the major trends in literary theory of this century.

2. Traditional Literary Criticism

Academic literary criticism prior to the rise of “New Criticism” in the United States tended to practice traditional literary history: tracking influence, establishing the canon of major writers in the literary periods, and clarifying historical context and allusions within the text. Literary biography was and still is an important interpretive method in and out of the academy; versions of moral criticism, not unlike the Leavis School in Britain, and aesthetic (e.g. genre studies) criticism were also generally influential literary practices. Perhaps the key unifying feature of traditional literary criticism was the consensus within the academy as to the both the literary canon (that is, the books all educated persons should read) and the aims and purposes of literature. What literature was, and why we read literature, and what we read, were questions that subsequent movements in literary theory were to raise.

3. Formalism and New Criticism

“Formalism” is, as the name implies, an interpretive approach that emphasizes literary form and the study of literary devices within the text. The work of the Formalists had a general impact on later developments in “Structuralism” and other theories of narrative. “Formalism,” like “Structuralism,” sought to place the study of literature on a scientific basis through objective analysis of the motifs, devices, techniques, and other “functions” that comprise the literary work. The Formalists placed great importance on the literariness of texts, those qualities that distinguished the literary from other kinds of writing. Neither author nor context was essential for the Formalists; it was the narrative that spoke, the “hero-function,” for example, that had meaning. Form was the content. A plot device or narrative strategy was examined for how it functioned and compared to how it had functioned in other literary works. Of the Russian Formalist critics, Roman Jakobson and Viktor Shklovsky are probably the most well known.

The Formalist adage that the purpose of literature was “to make the stones stonier” nicely expresses their notion of literariness. “Formalism” is perhaps best known is Shklovsky’s concept of “defamiliarization.” The routine of ordinary experience, Shklovsky contended, rendered invisible the uniqueness and particularity of the objects of existence. Literary language, partly by calling attention to itself as language, estranged the reader from the familiar and made fresh the experience of daily life.

The “New Criticism,” so designated as to indicate a break with traditional methods, was a product of the American university in the 1930s and 40s. “New Criticism” stressed close reading of the text itself, much like the French pedagogical precept “explication du texte.” As a strategy of reading, “New Criticism” viewed the work of literature as an aesthetic object independent of historical context and as a unified whole that reflected the unified sensibility of the artist. T.S. Eliot, though not explicitly associated with the movement, expressed a similar critical-aesthetic philosophy in his essays on John Donne and the metaphysical poets, writers who Eliot believed experienced a complete integration of thought and feeling. New Critics like Cleanth Brooks, John Crowe Ransom, Robert Penn Warren and W.K. Wimsatt placed a similar focus on the metaphysical poets and poetry in general, a genre well suited to New Critical practice. “New Criticism” aimed at bringing a greater intellectual rigor to literary studies, confining itself to careful scrutiny of the text alone and the formal structures of paradox, ambiguity, irony, and metaphor, among others. “New Criticism” was fired by the conviction that their readings of poetry would yield a humanizing influence on readers and thus counter the alienating tendencies of modern, industrial life. “New Criticism” in this regard bears an affinity to the Southern Agrarian movement whose manifesto, I’ll Take My Stand, contained essays by two New Critics, Ransom and Warren. Perhaps the enduring legacy of “New Criticism” can be found in the college classroom, in which the verbal texture of the poem on the page remains a primary object of literary study.

4. Marxism and Critical Theory

Marxist literary theories tend to focus on the representation of class conflict as well as the reinforcement of class distinctions through the medium of literature. Marxist theorists use traditional techniques of literary analysis but subordinate aesthetic concerns to the final social and political meanings of literature. Marxist theorist often champion authors sympathetic to the working classes and authors whose work challenges economic equalities found in capitalist societies. In keeping with the totalizing spirit of Marxism, literary theories arising from the Marxist paradigm have not only sought new ways of understanding the relationship between economic production and literature, but all cultural production as well. Marxist analyses of society and history have had a profound effect on literary theory and practical criticism, most notably in the development of “New Historicism” and “Cultural Materialism.”

The Hungarian theorist Georg Lukacs contributed to an understanding of the relationship between historical materialism and literary form, in particular with realism and the historical novel. Walter Benjamin broke new ground in his work in his study of aesthetics and the reproduction of the work of art. The Frankfurt School of philosophers, including most notably Max Horkheimer, Theodor Adorno, and Herbert Marcuse—after their emigration to the United States—played a key role in introducing Marxist assessments of culture into the mainstream of American academic life. These thinkers became associated with what is known as “Critical theory,” one of the constituent components of which was a critique of the instrumental use of reason in advanced capitalist culture. “Critical theory” held to a distinction between the high cultural heritage of Europe and the mass culture produced by capitalist societies as an instrument of domination. “Critical theory” sees in the structure of mass cultural forms—jazz, Hollywood film, advertising—a replication of the structure of the factory and the workplace. Creativity and cultural production in advanced capitalist societies were always already co-opted by the entertainment needs of an economic system that requires sensory stimulation and recognizable cliché and suppressed the tendency for sustained deliberation.

The major Marxist influences on literary theory since the Frankfurt School have been Raymond Williams and Terry Eagleton in Great Britain and Frank Lentricchia and Fredric Jameson in the United States. Williams is associated with the New Left political movement in Great Britain and the development of “Cultural Materialism” and the Cultural Studies Movement, originating in the 1960s at Birmingham University’s Center for Contemporary Cultural Studies. Eagleton is known both as a Marxist theorist and as a popularizer of theory by means of his widely read overview, Literary Theory. Lentricchia likewise became influential through his account of trends in theory, After the New Criticism. Jameson is a more diverse theorist, known both for his impact on Marxist theories of culture and for his position as one of the leading figures in theoretical postmodernism. Jameson’s work on consumer culture, architecture, film, literature and other areas, typifies the collapse of disciplinary boundaries taking place in the realm of Marxist and postmodern cultural theory. Jameson’s work investigates the way the structural features of late capitalism—particularly the transformation of all culture into commodity form—are now deeply embedded in all of our ways of communicating.

5. Structuralism and Poststructuralism

Like the “New Criticism,” “Structuralism” sought to bring to literary studies a set of objective criteria for analysis and a new intellectual rigor. “Structuralism” can be viewed as an extension of “Formalism” in that that both “Structuralism” and “Formalism” devoted their attention to matters of literary form (i.e. structure) rather than social or historical content; and that both bodies of thought were intended to put the study of literature on a scientific, objective basis. “Structuralism” relied initially on the ideas of the Swiss linguist, Ferdinand de Saussure. Like Plato, Saussure regarded the signifier (words, marks, symbols) as arbitrary and unrelated to the concept, the signified, to which it referred. Within the way a particular society uses language and signs, meaning was constituted by a system of “differences” between units of the language. Particular meanings were of less interest than the underlying structures of signification that made meaning itself possible, often expressed as an emphasis on “langue” rather than “parole.” “Structuralism” was to be a metalanguage, a language about languages, used to decode actual languages, or systems of signification. The work of the “Formalist” Roman Jakobson contributed to “Structuralist” thought, and the more prominent Structuralists included Claude Levi-Strauss in anthropology, Tzvetan Todorov, A.J. Greimas, Gerard Genette, and Barthes.

The philosopher Roland Barthes proved to be a key figure on the divide between “Structuralism” and “Poststructuralism.” “Poststructuralism” is less unified as a theoretical movement than its precursor; indeed, the work of its advocates known by the term “Deconstruction” calls into question the possibility of the coherence of discourse, or the capacity for language to communicate. “Deconstruction,” Semiotic theory (a study of signs with close connections to “Structuralism,” “Reader response theory” in America (“Reception theory” in Europe), and “Gender theory” informed by the psychoanalysts Jacques Lacan and Julia Kristeva are areas of inquiry that can be located under the banner of “Poststructuralism.” If signifier and signified are both cultural concepts, as they are in “Poststructuralism,” reference to an empirically certifiable reality is no longer guaranteed by language. “Deconstruction” argues that this loss of reference causes an endless deferral of meaning, a system of differences between units of language that has no resting place or final signifier that would enable the other signifiers to hold their meaning. The most important theorist of “Deconstruction,” Jacques Derrida, has asserted, “There is no getting outside text,” indicating a kind of free play of signification in which no fixed, stable meaning is possible. “Poststructuralism” in America was originally identified with a group of Yale academics, the Yale School of “Deconstruction:” J. Hillis Miller, Geoffrey Hartmann, and Paul de Man. Other tendencies in the moment after “Deconstruction” that share some of the intellectual tendencies of “Poststructuralism” would included the “Reader response” theories of Stanley Fish, Jane Tompkins, and Wolfgang Iser.

Lacanian psychoanalysis, an updating of the work of Sigmund Freud, extends “Postructuralism” to the human subject with further consequences for literary theory. According to Lacan, the fixed, stable self is a Romantic fiction; like the text in “Deconstruction,” the self is a decentered mass of traces left by our encounter with signs, visual symbols, language, etc. For Lacan, the self is constituted by language, a language that is never one’s own, always another’s, always already in use. Barthes applies these currents of thought in his famous declaration of the “death” of the Author: “writing is the destruction of every voice, of every point of origin” while also applying a similar “Poststructuralist” view to the Reader: “the reader is without history, biography, psychology; he is simply that someone who holds together in a single field all the traces by which the written text is constituted.”

Michel Foucault is another philosopher, like Barthes, whose ideas inform much of poststructuralist literary theory. Foucault played a critical role in the development of the postmodern perspective that knowledge is constructed in concrete historical situations in the form of discourse; knowledge is not communicated by discourse but is discourse itself, can only be encountered textually. Following Nietzsche, Foucault performs what he calls “genealogies,” attempts at deconstructing the unacknowledged operation of power and knowledge to reveal the ideologies that make domination of one group by another seem “natural.” Foucaldian investigations of discourse and power were to provide much of the intellectual impetus for a new way of looking at history and doing textual studies that came to be known as the “New Historicism.”

6. New Historicism and Cultural Materialism

“New Historicism,” a term coined by Stephen Greenblatt, designates a body of theoretical and interpretive practices that began largely with the study of early modern literature in the United States. “New Historicism” in America had been somewhat anticipated by the theorists of “Cultural Materialism” in Britain, which, in the words of their leading advocate, Raymond Williams describes “the analysis of all forms of signification, including quite centrally writing, within the actual means and conditions of their production.” Both “New Historicism” and “Cultural Materialism” seek to understand literary texts historically and reject the formalizing influence of previous literary studies, including “New Criticism,” “Structuralism” and “Deconstruction,” all of which in varying ways privilege the literary text and place only secondary emphasis on historical and social context. According to “New Historicism,” the circulation of literary and non-literary texts produces relations of social power within a culture. New Historicist thought differs from traditional historicism in literary studies in several crucial ways. Rejecting traditional historicism’s premise of neutral inquiry, “New Historicism” accepts the necessity of making historical value judgments. According to “New Historicism,” we can only know the textual history of the past because it is “embedded,” a key term, in the textuality of the present and its concerns. Text and context are less clearly distinct in New Historicist practice. Traditional separations of literary and non-literary texts, “great” literature and popular literature, are also fundamentally challenged. For the “New Historicist,” all acts of expression are embedded in the material conditions of a culture. Texts are examined with an eye for how they reveal the economic and social realities, especially as they produce ideology and represent power or subversion. Like much of the emergent European social history of the 1980s, “New Historicism” takes particular interest in representations of marginal/marginalized groups and non-normative behaviors—witchcraft, cross-dressing, peasant revolts, and exorcisms—as exemplary of the need for power to represent subversive alternatives, the Other, to legitimize itself.

Louis Montrose, another major innovator and exponent of “New Historicism,” describes a fundamental axiom of the movement as an intellectual belief in “the textuality of history and the historicity of texts.” “New Historicism” draws on the work of Levi-Strauss, in particular his notion of culture as a “self-regulating system.” The Foucaldian premise that power is ubiquitous and cannot be equated with state or economic power and Gramsci’s conception of “hegemony,” i.e., that domination is often achieved through culturally-orchestrated consent rather than force, are critical underpinnings to the “New Historicist” perspective. The translation of the work of Mikhail Bakhtin on carnival coincided with the rise of the “New Historicism” and “Cultural Materialism” and left a legacy in work of other theorists of influence like Peter Stallybrass and Jonathan Dollimore. In its period of ascendancy during the 1980s, “New Historicism” drew criticism from the political left for its depiction of counter-cultural expression as always co-opted by the dominant discourses. Equally, “New Historicism’s” lack of emphasis on “literariness” and formal literary concerns brought disdain from traditional literary scholars. However, “New Historicism” continues to exercise a major influence in the humanities and in the extended conception of literary studies.

7. Ethnic Studies and Postcolonial Criticism

“Ethnic Studies,” sometimes referred to as “Minority Studies,” has an obvious historical relationship with “Postcolonial Criticism” in that Euro-American imperialism and colonization in the last four centuries, whether external (empire) or internal (slavery) has been directed at recognizable ethnic groups: African and African-American, Chinese, the subaltern peoples of India, Irish, Latino, Native American, and Philipino, among others. “Ethnic Studies” concerns itself generally with art and literature produced by identifiable ethnic groups either marginalized or in a subordinate position to a dominant culture. “Postcolonial Criticism” investigates the relationships between colonizers and colonized in the period post-colonization. Though the two fields are increasingly finding points of intersection—the work of bell hooks, for example—and are both activist intellectual enterprises, “Ethnic Studies and “Postcolonial Criticism” have significant differences in their history and ideas.

“Ethnic Studies” has had a considerable impact on literary studies in the United States and Britain. In W.E.B. Dubois, we find an early attempt to theorize the position of African-Americans within dominant white culture through his concept of “double consciousness,” a dual identity including both “American” and “Negro.” Dubois and theorists after him seek an understanding of how that double experience both creates identity and reveals itself in culture. Afro-Caribbean and African writers—Aime Cesaire, Frantz Fanon, Chinua Achebe—have made significant early contributions to the theory and practice of ethnic criticism that explores the traditions, sometimes suppressed or underground, of ethnic literary activity while providing a critique of representations of ethnic identity as found within the majority culture. Ethnic and minority literary theory emphasizes the relationship of cultural identity to individual identity in historical circumstances of overt racial oppression. More recently, scholars and writers such as Henry Louis Gates, Toni Morrison, and Kwame Anthony Appiah have brought attention to the problems inherent in applying theoretical models derived from Euro-centric paradigms (that is, structures of thought) to minority works of literature while at the same time exploring new interpretive strategies for understanding the vernacular (common speech) traditions of racial groups that have been historically marginalized by dominant cultures.

Though not the first writer to explore the historical condition of postcolonialism, the Palestinian literary theorist Edward Said’s book Orientalism is generally regarded as having inaugurated the field of explicitly “Postcolonial Criticism” in the West. Said argues that the concept of “the Orient” was produced by the “imaginative geography” of Western scholarship and has been instrumental in the colonization and domination of non-Western societies. “Postcolonial” theory reverses the historical center/margin direction of cultural inquiry: critiques of the metropolis and capital now emanate from the former colonies. Moreover, theorists like Homi K. Bhabha have questioned the binary thought that produces the dichotomies—center/margin, white/black, and colonizer/colonized—by which colonial practices are justified. The work of Gayatri C. Spivak has focused attention on the question of who speaks for the colonial “Other” and the relation of the ownership of discourse and representation to the development of the postcolonial subjectivity. Like feminist and ethnic theory, “Postcolonial Criticism” pursues not merely the inclusion of the marginalized literature of colonial peoples into the dominant canon and discourse. “Postcolonial Criticism” offers a fundamental critique of the ideology of colonial domination and at the same time seeks to undo the “imaginative geography” of Orientalist thought that produced conceptual as well as economic divides between West and East, civilized and uncivilized, First and Third Worlds. In this respect, “Postcolonial Criticism” is activist and adversarial in its basic aims. Postcolonial theory has brought fresh perspectives to the role of colonial peoples—their wealth, labor, and culture—in the development of modern European nation states. While “Postcolonial Criticism” emerged in the historical moment following the collapse of the modern colonial empires, the increasing globalization of culture, including the neo-colonialism of multinational capitalism, suggests a continued relevance for this field of inquiry.

8. Gender Studies and Queer Theory

Gender theory came to the forefront of the theoretical scene first as feminist theory but has subsequently come to include the investigation of all gender and sexual categories and identities. Feminist gender theory followed slightly behind the reemergence of political feminism in the United States and Western Europe during the 1960s. Political feminism of the so-called “second wave” had as its emphasis practical concerns with the rights of women in contemporary societies, women’s identity, and the representation of women in media and culture. These causes converged with early literary feminist practice, characterized by Elaine Showalter as “gynocriticism,” which emphasized the study and canonical inclusion of works by female authors as well as the depiction of women in male-authored canonical texts.

Feminist gender theory is postmodern in that it challenges the paradigms and intellectual premises of western thought, but also takes an activist stance by proposing frequent interventions and alternative epistemological positions meant to change the social order. In the context of postmodernism, gender theorists, led by the work of Judith Butler, initially viewed the category of “gender” as a human construct enacted by a vast repetition of social performance. The biological distinction between man and woman eventually came under the same scrutiny by theorists who reached a similar conclusion: the sexual categories are products of culture and as such help create social reality rather than simply reflect it. Gender theory achieved a wide readership and acquired much its initial theoretical rigor through the work of a group of French feminist theorists that included Simone de Beauvoir, Luce Irigaray, Helene Cixous, and Julia Kristeva, who while Bulgarian rather than French, made her mark writing in French. French feminist thought is based on the assumption that the Western philosophical tradition represses the experience of women in the structure of its ideas. As an important consequence of this systematic intellectual repression and exclusion, women’s lives and bodies in historical societies are subject to repression as well. In the creative/critical work of Cixous, we find the history of Western thought depicted as binary oppositions: “speech/writing; Nature/Art, Nature/History, Nature/Mind, Passion/Action.” For Cixous, and for Irigaray as well, these binaries are less a function of any objective reality they describe than the male-dominated discourse of the Western tradition that produced them. Their work beyond the descriptive stage becomes an intervention in the history of theoretical discourse, an attempt to alter the existing categories and systems of thought that found Western rationality. French feminism, and perhaps all feminism after Beauvoir, has been in conversation with the psychoanalytic revision of Freud in the work of Jacques Lacan. Kristeva’s work draws heavily on Lacan. Two concepts from Kristeva—the “semiotic” and “abjection”—have had a significant influence on literary theory. Kristeva’s “semiotic” refers to the gaps, silences, spaces, and bodily presence within the language/symbol system of a culture in which there might be a space for a women’s language, different in kind as it would be from male-dominated discourse.

Masculine gender theory as a separate enterprise has focused largely on social, literary, and historical accounts of the construction of male gender identities. Such work generally lacks feminisms’ activist stance and tends to serve primarily as an indictment rather than a validation of male gender practices and masculinity. The so-called “Men’s Movement,” inspired by the work of Robert Bly among others, was more practical than theoretical and has had only limited impact on gender discourse. The impetus for the “Men’s Movement” came largely as a response to the critique of masculinity and male domination that runs throughout feminism and the upheaval of the 1960s, a period of crisis in American social ideology that has required a reconsideration of gender roles. Having long served as the de facto “subject” of Western thought, male identity and masculine gender theory awaits serious investigation as a particular, and no longer universally representative, field of inquiry.

Much of what theoretical energy of masculine gender theory currently possesses comes from its ambiguous relationship with the field of “Queer theory.” “Queer theory” is not synonymous with gender theory, nor even with the overlapping fields of gay and lesbian studies, but does share many of their concerns with normative definitions of man, woman, and sexuality. “Queer theory” questions the fixed categories of sexual identity and the cognitive paradigms generated by normative (that is, what is considered “normal”) sexual ideology. To “queer” becomes an act by which stable boundaries of sexual identity are transgressed, reversed, mimicked, or otherwise critiqued. “Queering” can be enacted on behalf of all non-normative sexualities and identities as well, all that is considered by the dominant paradigms of culture to be alien, strange, unfamiliar, transgressive, odd—in short, queer. Michel Foucault’s work on sexuality anticipates and informs the Queer theoretical movement in a role similar to the way his writing on power and discourse prepared the ground for “New Historicism.” Judith Butler contends that heterosexual identity long held to be a normative ground of sexuality is actually produced by the suppression of homoerotic possibility. Eve Sedgwick is another pioneering theorist of “Queer theory,” and like Butler, Sedgwick maintains that the dominance of heterosexual culture conceals the extensive presence of homosocial relations. For Sedgwick, the standard histories of western societies are presented in exclusively in terms of heterosexual identity: “Inheritance, Marriage, Dynasty, Family, Domesticity, Population,” and thus conceiving of homosexual identity within this framework is already problematic.

9. Cultural Studies

Much of the intellectual legacy of “New Historicism” and “Cultural Materialism” can now be felt in the “Cultural Studies” movement in departments of literature, a movement not identifiable in terms of a single theoretical school, but one that embraces a wide array of perspectives—media studies, social criticism, anthropology, and literary theory—as they apply to the general study of culture. “Cultural Studies” arose quite self-consciously in the 80s to provide a means of analysis of the rapidly expanding global culture industry that includes entertainment, advertising, publishing, television, film, computers and the Internet. “Cultural Studies” brings scrutiny not only to these varied categories of culture, and not only to the decreasing margins of difference between these realms of expression, but just as importantly to the politics and ideology that make contemporary culture possible. “Cultural Studies” became notorious in the 90s for its emphasis on pop music icons and music video in place of canonical literature, and extends the ideas of the Frankfurt School on the transition from a truly popular culture to mass culture in late capitalist societies, emphasizing the significance of the patterns of consumption of cultural artifacts. “Cultural Studies” has been interdisciplinary, even antidisciplinary, from its inception; indeed, “Cultural Studies” can be understood as a set of sometimes conflicting methods and approaches applied to a questioning of current cultural categories. Stuart Hall, Meaghan Morris, Tony Bennett and Simon During are some of the important advocates of a “Cultural Studies” that seeks to displace the traditional model of literary studies.

10. References and Further Reading

a. General Works on Theory

  • Culler, Jonathan. Literary Theory: A Very Short Introduction. Oxford: Oxford University Press, 1997.
  • During, Simon. Ed. The Cultural Studies Reader. London: Routledge, 1999.
  • Eagleton, Terry. Literary Theory. Minneapolis, MN: University of Minnesota Press, 1996.
  • Lentricchia, Frank. After the New Criticism. Chicago: University of Chicago Press, 1980.
  • Moore-Gilbert, Bart, Stanton, Gareth, and Maley, Willy. Eds. Postcolonial Criticism. New York: Addison, Wesley, Longman, 1997.
  • Rice, Philip and Waugh, Patricia. Eds. Modern Literary Theory: A Reader. 4th edition.
  • Richter, David H. Ed. The Critical Tradition: Classic Texts and Contemporary Trends. 2nd Ed. Bedford Books: Boston, 1998.
  • Rivkin, Julie and Ryan, Michael. Eds. Literary Theory: An Anthology. Malden, Massachusetts: Blackwell, 1998.

b. Literary and Cultural Theory

  • Adorno, Theodor. The Culture Industry: Selected Essays on Mass Culture. Ed. J. M. Bernstein. London: Routledge, 2001.
  • Althusser, Louis. Lenin and Philosophy: And Other Essays. Trans. Ben Brewster. New York: Monthly Review Press, 1971.
  • Auerbach, Erich. Mimesis: The Representation of Reality in Western Literature. Trans.
  • Willard R. Trask. Princeton, NJ: Princeton University Press, 1953.
  • Bakhtin, Mikhail. The Dialogic Imagination. Trans. Caryl Emerson and Michael Holquist. Austin, TX: University of Texas Press, 1981.
  • Barthes, Roland. Image—Music—Text. Trans. Stephen Heath. New York: Hill and Wang, 1994.
  • Barthes, Roland. The Pleasure of the Text. Trans. Richard Miller. New York: Hill and Wang, 1975.
  • Beauvoir, Simone de. The Second Sex. Tr. H.M. Parshley. New York: Knopf, 1953.
  • Benjamin, Walter. Illuminations. Ed. Hannah Arendt. Trans. Harry Zohn. New York: Schocken, 1988.
  • Brooks, Cleanth. The Well-Wrought Urn: Studies in the Structure of Poetry. New York: Harcourt, 1947.
  • Derrida, Jacques. Of Grammatology. Trans. Gayatri C. Spivak. Baltimore: Johns Hopkins, 1976.
  • Dubois, W.E.B. The Souls of Black Folk: Essays and Sketches. Chicago: A. C. McClurg & Co., 1903.
  • Fish, Stanley. Is There a Text in This Class? The Authority of Interpretive Communities. Harvard, MA: Harvard University Press, 1980.
  • Foucault, Michel. The History of Sexuality. Volume 1. An Introduction. Trans. Robert Hurley. Harmondsworth, UK: Penguin, 1981.
  • Foucault, Michel. The Order of Things: An Archaeology of the Human Sciences. New York: Vintage, 1973.
  • Gates, Henry Louis. The Signifying Monkey: A Theory of African-American Literary Criticism. New York: Oxford University Press, 1989.
  • hooks, bell. Ain’t I a Woman: Black Women and Feminism. Boston: South End Press, 1981.
  • Horkheimer, Max and Adorno, Theodor. Dialectic of Enlightenment: Philosophical Fragments. Ed. Gunzelin Schmid Noerr. Trans. Edmund Jephcott. Stanford, CA: Stanford University Press, 2002.
  • Irigaray, Luce. This Sex Which Is Not One. Ithaca, NY: Cornell University Press, 1985.
  • Jameson, Frederic. Postmodernism: Or the Cultural Logic of Late Capitalism. Durham, NC: Duke University Press, 1999.
  • Lacan, Jacques. Ecrits: A Selection. London: Routledge, 2001.
  • Lemon Lee T. and Reis, Marion J. Eds. Russian Formalist Criticism: Four Essays. Lincoln, NE: University of Nebraska Press, 1965.
  • Lukacs, Georg. The Historical Novel. Trans. Hannah and Stanley Mitchell. Lincoln, NE: University of Nebraska Press, 1962.
  • Marcuse, Herbert. Eros and Civilization. Boston: Beacon Press, 1955.
  • Nietzsche, Friedrich. The Genealogy of Morals. Trans. Walter Kaufmann. New York: Vintage, 1969.
  • Plato. The Collected Dialogues. Ed. Edith Hamilton and Huntington Cairns. Princeton, NJ: Princeton University Press, 1961.
  • Proust, Marcel. Remembrance of Things Past. Trans. C.K. Scott Moncrieff and Terence Kilmartin. New York: Vintage, 1982.
  • Said, Edward. Orientalism. New York: Pantheon, 1978.
  • Sedgwick, Eve Kosofsky. Between Men. Between Men: English literature and Male Homosocial Desire. New York: Columbia University Press, 1985.
  • Sedgwick, Eve Kosofsky Epistemology of the Closet. London: Penguin, 1994.
  • Showalter, Elaine. Ed. The New Feminist Criticism: Essays on Women, Literature, and Theory. London: Virago, 1986.
  • Tompkins, Jane. Sensational Designs: the Cultural Work of American Fiction, 1790-1860. New York: Oxford University Press, 1986.
  • Wellek, Rene and Warren, Austin. Theory of Literature. 3rd ed. New York: Harcourt Brace, 1956.
  • Williams, Raymond. The Country and the City. New York: Oxford University Press, 1973.

Author Information

Vince Brewton
Email: vbrewton@unanov.una.edu
University of North Alabama
U. S. A.

Justus Lipsius (1547—1606)

LipsiusJustus Lipsius, a Belgian classical philologist and Humanist, wrote a series of works designed to revive ancient Stoicism in a form that would be compatible with Christianity. The most famous of these is De Constantia (‘On Constancy’) in which he advocated a Stoic-inspired ideal of constancy in the face of unpleasant external events, but also carefully distinguished those parts of Stoic philosophy that the orthodox Christian should reject or modify. This modified form of Stoicism influenced a number of contemporary thinkers, creating an intellectual movement that has come to be known as Neostoicism.

Lipsius has been described as the greatest Renaissance scholar of the Low Countries after Erasmus. The role that he played in the revival of interest in Stoicism during the late Renaissance was similar to that performed by Marsilio Ficino with regard to Platonism and Pierre Gassendi with regard to Epicureanism. As such, he stands as a key figure in the history of Renaissance philosophy and the Renaissance revival of ancient thought.

Table of Contents

  1. Background
  2. Life
  3. Works
    1. Politicorum sive Civilis Doctrinae Libri Sex
    2. De Constantia Libri Duo
      1. Form
      2. Analysis of Contents
      3. Definition of constantia
      4. Four Arguments Concerning Public Evils
      5. Four Modifications of Ancient Stoicism
      6. Summary
    3. Later Stoic Works
  4. Conclusion
  5. References and Further Reading

1. Background

Justus Lipsius’s philosophical reputation rests upon his status as the principal figure in the Renaissance revival of Stoicism. Stoicism was one of the great Hellenistic schools of philosophy and dominated ancient intellectual life for at least 400 years. Founded by Zeno of Citium around 300 B.C.C., the school developed under Cleanthes, Chrysippus, Panaetius, and Posidonius. In the first century B.C. it appealed to high-ranking Romans including Cicero and Cato. In the first two centuries C.E. it reached its height of popularity under the influence of Musonius Rufus and Epictetus. In the second century C.E. it found its most famous exponent in the form of the Roman Emperor Marcus Aurelius. However, after the second century Stoicism was soon eclipsed in popularity by Neoplatonism.

Despite this decline in late antiquity, Stoicism continued to exert an influence. Its ideas were discussed by Church Fathers such as St. Augustine, Lactantius, and Tertullian. In the Middle Ages its impact can be seen in the ethical works of Peter Abelard and his pupil John of Salisbury, transmitted via the readily available Latin works of Seneca and Cicero. In the fourteenth century Stoicism attracted the attention of Petrarch who produced a substantial ethical work entitled De Remediis Utriusque Fortunae (‘On the Remedies of Both Kinds of Fortune’) inspired by Seneca and drawing upon an account of the Stoic theory of the passions made by Cicero. With the rediscovery of the works of the Stoic philosopher Epictetus by famous Humanists such as Perotti and Politian in the fifteenth century, interest in Stoicism continued to develop. However, the Renaissance revival of Stoicism remained somewhat limited until Justus Lipsius.

2.Life

Justus Lipsius (the Latinized version of Joest Lips) was born in Overyssche, a village near Brussels and Louvain, in 1547. He studied first with the Jesuits in Cologne and later at the Catholic University of Louvain. After completing his education he visited Rome, in his new position as secretary to Cardinal Granvelle, staying for two years in order to study the ancient monuments and explore the unsurpassed libraries of classical literature. In 1572 Lipsius’s property in Belgium was taken by Spanish troops during the civil war while he was away on a trip to Vienna (a trip that would later be used as the backdrop for the dialogue in De Constantia over a decade later). Without property, Lipsius applied for a position at the Lutheran University of Jena. This was the first of a number of institutional moves that required Lipsius to change his publicly professed faith. His new colleagues at Jena remained sceptical of this radical transformation and Lipsius was eventually forced to leave Jena after only two years in favour of Cologne. While at Cologne he prepared notes on Tacitus that he used in his critical edition of 1574.

In 1576 Lispius returned to Catholic Louvain. However after his property was looted by soldiers a second time he fled again in 1579, this time to the Calvinist University of Leiden. He remained at Leiden for thirteen years and it is to this period that his two most famous books – De Constantia Libri Duo (1584) and Politicorum sive Civilis Doctrinae Libri Sex (1589) – belong. However, Lipsius was by upbringing a Catholic and eventually he sought to return to Louvain, via a brief period in Liège. In 1592 Lipsius accepted the Chair of Latin History and Literature at Louvain. To this final period belong his editorial work on Seneca and his two detailed studies of Stoicism, the Manuductio ad Stoicam Philosophiam and Physiologia Stoicorum. The two studies were published first in 1604 and the edition of Seneca in 1605. Lipsius died in Louvain in 1606.

Among Lipsius’s friends was his publisher, the famous printer Christopher Plantin, with whom he often stayed in Antwerp. Among his pupils was Philip Rubens, brother of the painter Peter Paul Rubens who portrayed Lipsius after his death in ‘The Four Philosophers’ (c. 1611, now in the Pitti Palace, Florence). Among his admirers was Michel de Montaigne who described him as one of the most learned men then alive (Essais 2.12).

3. Works

Lipsius was a prolific author, publishing his first work Variarum Lectionum Libri IV (‘Four Books of Various Readings’) – a collection of philological comments and conjectures – in 1569, while still in his twenties. His reputation today is primarily as a Latin philologist and stands upon his critical editions of Tacitus and Seneca. He also produced a number of philological studies and a large correspondence, some of which he published. His principal philosophical works are De Constantia Libri Duo and Politicorum sive Civilis Doctrinae Libri Sex, complementing his editions of Seneca and Tacitus respectively.

a. Politicorum sive Civilis Doctrinae Libri Sex

In his Politicorum sive Civilis Doctrinae Libri Sex (‘Six Books on Politics or Civil Doctrine’) Lipsius drew upon a wide range of classical sources, with a particular emphasis upon Tacitus, and the work has been characterized, not unfairly, as not much more than a compendium of quotations. In it he argued that no State should permit more than one religion within its borders and that all dissent should be punished without mercy. Experience had taught him that civil conflict enflamed by religious intolerance was far more dangerous and destructive than despotism.

The treatise is concerned with the creation of civil life, defined as ‘that which we lead in the society of men, one with another, to mutual commodity and profit, and common use of all’ (Pol. 1.1). Such a life has two necessary conditions, virtue (virtute) and prudence (prudentia). Book One is devoted to an analysis of these two conditions: virtue requires piety and goodness; prudence is dependent upon use and memory. Book Two opens by arguing that government is necessary for civil life and that the best form of government is a principality. Civil concord requires all to submit to the will of one. ‘Principality’ (principatus) is defined as ‘rule by one for the good of all’ (Pol. 2.3). For the Prince to achieve this he himself must have both virtue and prudence. The remainder of Book Two is devoted to princely virtues, the most important being justice and clemency. Book Three moves on to consider princely prudence, and this remains the theme for the rest of the work. There are two types of prudence, one’s own and the advice of others. Book Three focuses upon prudent advisors in the form of counsellors and ministers. Book Four is concerned with a Prince’s own prudence, which must be carefully developed in the light of experience. This itself may be divided into civil and military prudence. The rest of Book Four outlines two types of civil prudence, that concerned with matters divine and that concerned with matters human. Military Prudence is the subject of Books Five and Six. Book Five deals with external military prudence (war with foreign powers), while Book Six deals with internal military prudence (civil war).

The central theme of the work is clear from the outset. Lipsius – pre-empting Hobbes – places order and peace far above civil liberties and personal freedom. Individual political rights are little consolation when surrounded by violent anarchy. The first task for politics is to secure peace for all and this can only be done if power is concentrated in one individual. It can also only be achieved if only one religion is allowed in any particular State. If one has concerns about such a concentration of power, the proper way to reduce them is to educate the holder of power, to develop his virtue and prudence, and to remind him that he holds power in order to secure peace, not to create terror. If a Prince forgets this last point and turns into a tyrant, there may be grounds to challenge his position. However Lipsius emphasizes that there is nothing more miserable than civil war which should be avoided at all costs.

b. De Constantia Libri Duo

Lipsius’s principal philosophical work is De Constantia Libri Duo (‘Two Books on Constancy’), published in 1584. The title is borrowed from Seneca’s dialogue De Constantia Sapientis. This work was immensely popular and went through numerous editions. It was translated into English four times between 1594 and 1670. It for this work that Lipsius became famous in the succeeding centuries, inspiring the intellectual movement that has come to be known as Neostoicism. This work was conceived as an attempt to revive Stoic philosophy as a living movement as it had been in antiquity and, in particular, as a practical antidote to public evils.

i. Form

The work takes the form of a dialogue between Lipsius and his friend Langius (Charles de Langhe, Canon of Liège). This no doubt fictional conversation is set within the context of a visit to Langius by Lipsius during the course of a trip to Vienna that Lipsius had actually undertaken in 1572. While some distance from his troubled homeland, the dialogue’s character Lipsius reflects upon the nature of public evils (mala publica) and is guided by the older and wiser Langius into whose mouth the positive content of the dialogue is placed.

ii. Analysis of Contents

The dialogue is divided into two books. However a single structure operates throughout the entire work. The opening chapters of Book One introduce the idea that in order to escape public evils one must change one’s mind, not one’s location (Const. 1.1-3). The concept of constancy is introduced as that which must be cultivated in the mind in order to achieve such a change (Const. 1.4-7). After a brief survey of the enemies of constancy (Const. 1.8-12), the four central arguments of the work, concerning the nature of public evil, are introduced (Const. 1.13). The first two of these arguments occupy the remainder of Book One (Const. 1.14 and 15-22). After a brief interlude at the beginning of Book Two on the nature of the philosophical project at hand (Const. 2.1-5), the remaining two arguments follow (Const. 2.6-17 and 18-26). The final chapter functions as a summary (Const. 2.27).

iii. Definition of constantia

The central concept in this work is, not surprisingly, constancy (constantia). It is introduced in Const. 1.4 and defined as a right and immovable strength of mind, neither elated nor depressed by external or chance events. The mother of constancy is patience (patientia), defined as a voluntary endurance without complaint of all things that can happen to or in a man.

However key to both of these concepts is the distinction between reason (ratio) and opinion (opinio). While opinion leads to inconstancy, it is reason that is able to form the foundation for constancy. Cultivating reason is thus the way in which one can reach the goal of constancy. Here Lipsius draws upon relatively common Stoic ideas concerning the passions or emotions (affectus; in Greek, pathê). Emotions are the product of mere opinions and lead to distress and imbalance. Analysing and rejecting those opinions in favour of rational understanding will free one from emotions and thus the inconstancy that they create. The wise man who enjoys constancy will be free from emotions such as desire (cupiditas), joy (gaudium), fear (metus), and sorrow (dolor).

iv. Four Arguments Concerning Public Evils

The core of De Constantia is the series of four arguments concerning the nature of public evils. These are outlined in Const. 1.13 and then developed, in turn, in Const. 1.14, 1.15-22, 2.6-17, and 2.18-26. It is argued that public evils are (a) imposed by God; (b) the product of necessity; (c) in reality profitable to us; (d) neither grievous nor unusual.

The first argument claims that all public evils form part of God’s divine plan. They derive form the same source as all those profitable parts of nature and it would be impious to take only part of God’s creation and criticise Him for the remainder. We are born into God’s creation and it is our duty to obey Him by accepting all of His works. In any case, even if one does not follow God’s will freely, one will nevertheless be drawn along forcibly (echoing the famous Stoic donkey and cart analogy reported in Hippolytus Refutatio 1.21). Thus the only option is to obey God (deo parere).

The second argument claims that the continual cycle of creation and destruction are the inevitable consequence of the necessary laws of Nature. If even the stars in the heavens are subject to the processes of creation and destruction, then it is only natural that man-made cities will rise and fall, for “all things run into this fatal whirlpool of ebbing and flowing” (Const. 1.16). However Lipsius is careful here to distance himself from Stoic materialism and outlines four points where Stoic doctrine must be modified in the light of Christian truth (see the next section).

The third argument is merely a variation upon traditional Christian responses to the problem of evil. Those terrible things that happen must in some sense be good if they are part of God’s divine plan and Lipsius attempts to show this by claiming that public evils constitute exercise (exercendi) for the good, correction (castigandi) for the weak-willed, and punishment (puniendi) for the bad.

The fourth argument focuses upon the particular public evils that Lipsius wanted to avoid, namely the religious civil wars in the Low Countries. He argues that these wars are neither particularly grievous nor uncommon. In order to place these present conflicts into perspective Lipsius, drawing upon his extensive classical learning, cites numerous examples of wars, plagues, and acts of cruelty from Jewish, Greek, and Roman history. The conflict from which Lipsius has fled is neither excessively brutal nor particularly unusual. What would be unusual would be an individual insulated and exempted from the cycles of birth and death, creation and destruction. It is the human lot to suffer at the hands of this continual change; the philosophical task, however, is to decide how one will face that suffering. One can do so either with sorrow (dolor) or with constancy (constantia).

v. Four Modifications of Ancient Stoicism

During the course of the second argument concerning the nature of public evils, Lipsius outlines four points where Stoicism and Christianity diverge. He is careful to distance himself from these parts of Stoic philosophy and the modification of Stoicism that he makes here (Const. 1.20) in order to reconcile it with Christianity forms the basis for the intellectual movement that has come to be known as Neostoicism. The four points in question are the Stoic claims that (a) God is submitted to fate; (b) that there is a natural order of causes (and thus no miracles); (c) that there is no contingency; (d) that there is no free will. All four of these points derive from the Stoic theory of determinism and it is this to which Lipsius primarily objects.

Stoic determinism is itself built upon Stoic materialism, which affirms that only bodies exist. These bodies act as causes and so anything that acts, including the soul, must be corporeal. Aulus Gellius reports that the Stoic Chrysippus defined fate as a natural and everlasting order of causes in which each event follows from another in an unalterable interconnection (Noctes Atticae 7.2.3). Thus, as Cicero notes, the Stoic doctrine of fate, conceived as an order and sequence of material causes, is “not the fate of superstition but rather that of physics” (De Divinatione 1.126). By rejecting this doctrine, Lipsius attempts to disengage the Stoic ethical ideas to which he is drawn from their foundations in Stoic physics. This is absolutely essential if he is to be able to present Stoic ethics in a form acceptable to a Christian audience.

vi. Summary

The central theme of De Constantia – that public evils are the product of the mind and thus must be treated rather than fled – contrasts sharply with Lipsius’s own earlier behaviour when faced with the religious wars then raging. Perhaps experience had taught him that, no matter how many geographical moves he made, he would not be able to escape the evils surrounding him until he examined himself. Only wisdom and constancy – the products of philosophical reflection – can bring true peace of mind.

c. Later Stoic Works

De Constantia was not Lipsius’s only work devoted to Stoicism. He also produced two studies of Stoic philosophy during the course of the preparation of his 1605 edition of Seneca; the Manuductio ad Stoicam Philosophiam (‘Digest of Stoic Philosophy’) and the Physiologia Stoicorum (‘Physics of the Stoics’), both published in 1604. These works offer an interpretation of every aspect of Stoic philosophy and draw together under subject headings large numbers of quotations and doxographical reports preserved in a wide range of ancient authors. These two works may be seen as the precursors to the, now standard, edition of the fragments of the early Stoics compiled by Hans von Arnm (Stoicorum Veterum Fragmenta, 1903-24).

These later studies of Stoicism – based upon a more systematic survey of the surviving sources – are marked by two features which distinguish them from De Constantia. The first is a more developed awareness of the systematic inter-relation between ethics and physics in Stoic philosophy; the second is a revised and more positive attitude towards the Stoic theory of determinism. In Phys. 1.12, for instance, Lipsius demonstrates a more thorough understanding of the Stoic theory of fate, and on the basis of this he suggests that it can in fact be reconciled with Christian doctrine without modification. In order to do this, he draws upon St. Augustine’s discussion of Stoic definitions of fate in De Civitate Dei 5.8 where it is argued that fate does not impinge upon the power of God but rather is the expression of the will of God.

While De Constantia was a popular and highly readable dialogue, these later studies were primarily works of classical scholarship. They were conceived as supplementary volumes designed to complement – and perhaps even justify – Lipsius’s final great work, his 1605 critical edition of the philosophical works of Seneca. This handsome folio edition included all of Seneca’s prose works, detailed summaries for each, commentary, and a biography of the great Roman Stoic. In this final publication, Lipsius’s admiration of Stoic philosophy and his talents as a classical philologist are united so as to form a highly appropriate culmination to his intellectual career.

4. Conclusion

Lipsius has been described as the greatest Renaissance scholar of the Low Countries after Erasmus. The role that he played in the revival of interest in Stoicism during the late Renaissance was similar to that performed by Marsilio Ficino with regard to Platonism and Pierre Gassendi with regard to Epicureanism. As such, he stands as a key figure in the history of Renaissance philosophy and the Renaissance revival of ancient thought.

5. References and Further Reading

a. The Works of Justus Lipsius

All of Lipsius’s works are gathered together in his Opera Omnia of 1637. Another edition appeared in 1675. Full bibliographical details for all of his works can be found in F. Van Der Haeghen’s Bibliographie Lipsienne: Oeuvres de Juste Lipse, 2 vols (Ghent: Université de Gand, 1886).

i) Politicorum sive Civilis Doctrinae Libri Sex

  • Politicorum sive Civilis Doctrinae Libri Sex (Leiden: Plantin, 1589) – the first edition.
  • Sixe Bookes of Politickes or Civil Doctrine, Done into English by William Jones (London: Richard Field, 1594) – there is also a facsimile reprint of this edition (Amsterdam: Theatrum Orbis Terrarum, 1970).

ii) De Constantia Libri Duo

  • De Constantia Libri Duo, Qui alloquium praecipue continent in Publicis malis(Antwerp: Plantin, 1584) – the first edition.
  • Traité de la constance, Traduction nouvelle précédée d’une notice sur Juste Lipse par Lucien du Bois (Brussels & Leipzig: Merzbach, 1873) – still the most recent edition of the Latin text, with a facing French translation.
  • Two Bookes of Constancie Written in Latine by Iustus Lipsius, in English by Sir John Stradling, Edited with an Introduction by Rudolf Kirk (New Brunswick: Rutgers University Press, 1939) – the most recent edition in English, reprinting a translation first published in 1594.

iii) Later Stoic Works

  • Manuductionis ad Stoicam Philosophiam Libri Tres, L. Annaeo Senecae, aliisque scriptoribus illustrandis (Antwerp: Plaintin-Moretus, 1604) – extracts reprinted and translated into French in Lagrée (below) – extracts also translated into English in J. Kraye, ed. Cambridge Translations of Renaissance Philosophical Texts 1: Moral Philosophy (Cambridge: Cambridge University Press, 1997), 200-09.
  • Physiologiae Stoicorum Libri Tres, L. Annaeo Senecae, aliisque scriptoribus illustrandis (Antwerp: Plantin-Moretus, 1604) – extracts reprinted and translated into French in Lagrée (below).
  • Annaei Senecae Philosophi Opera, Quae Existant Omnia, A Iusto Lipsio emendata, et Scholiis illustrata (Antwerp: Plantin-Moretus, 1605) – Lipsius’s ‘Life of Seneca’ and his summaries are translated by Thomas Lodge in his The Workes of Lucius Annaeus Seneca (London: William Stansby, 1620), which is based upon Lipsius’s edition.

b. Studies

  • ANDERTON, B., ‘A Stoic of Louvain: Justus Lipsius’, in Sketches from a Library Window (Cambridge: Heffer, 1922), 10-30.
  • GERLO, A., ed., Juste Lipse (1547-1606), Travaux de l’Institut Interuniversitaire pour l’étude de la Renaissance et de l’Humanisme IX (Brussels: University Press, 1988)
  • LAGRÉE, J., Juste Lipse et la restauration du stoïcisme: Étude et traduction des traités stoïciens De la constance, Manuel de philosophie stoïcienne, Physique des stoïciens (Paris: Vrin, 1994)
  • LAGRÉE, J. ‘Juste Lipse: destins et Providence’, in P.-F, Moreau, ed., Le stoïcisme au XVIe et au XVIIe siècle (Paris: Albin Michel, 1999), 77-93.
  • LAGRÉE, J. ‘La vertu stoïcienne de constance’, in P.-F, Moreau, ed., Le stoïcisme au XVIe et au XVIIe siècle (Paris: Albin Michel, 1999), 94-116.
  • LAUREYS, M., ed., The World of Justus Lipsius: A Contribution Towards his Intellectual Biography, Bulletin de l’Institut Historique Belge de Rome LXVIII (Brussels & Rome: Brepols, 1998)
  • LEVI, A. H. T., ‘The Relationship of Stoicism and Scepticism: Justus Lipsius’, in J. Kraye and M. W. F. Stone, eds, Humanism and Early Modern Philosophy (London: Routledge, 2000), 91-106.
  • MARIN, M., ‘L’influence de Sénèque sur Juste Lipse’, in A. Gerlo, ed., Juste Lipse: 1547-1606 (Brussels: University Press, 1988), 119-26.
  • MORFORD, M., Stoics and Neostoics: Rubens and the Circle of Lipsius (Princeton: Princeton University Press, 1991)
  • MORFORD, M. ‘Towards an Intellectual Biography of Justus Lipsius – Pieter Paul Rubens’, Bulletin de l’Institut Historique Belge de Rome 68 (1998), 387-403.
  • OESTREICH, G., Neostoicism and the Early Modern State, trans. D. McLintock (Cambridge: Cambridge University Press, 1982)
  • SAUNDERS, J. L., Justus Lipsius: The Philosophy of Renaissance Stoicism (New York: The Liberal Arts Press, 1955)
  • ZANTA, L., La renaissance du stoïcisme au XVIe siècle (Paris: Champion, 1914)

References to further works dealing with Neostoicism may be found at the end of the IEP article Neostoicism.

Author Information

John Sellars
Email: john.sellars (at) wolfson.ox.ac.uk
University of the West of England
United Kingdom

Deductive-Theoretic Conceptions of Logical Consequence

According to the deductive-theoretic conception of logical consequence, a sentence X is a logical consequence of a set K of sentences if and only if X is a deductive consequence of K, that is, X is deducible or provable from K. Deductive consequence is clarified in terms of the notion of proof in a correct deductive system. Since, arguably, logical consequence conceived deductive-theoretically is not a compact relation and deducibility in a deductive system is, there are languages for which deductive consequence cannot be defined in terms of deducibility in a correct deductive system. However, it is true that if a sentence is deducible in a correct deductive system from other sentences, then the sentence is a deductive consequence of them. A deductive system is correct only if its rules of inference correspond to intuitively valid principles of inference. So whether or not a natural deductive system is correct brings into play rival theories of valid principles of inference such as classical, relevance, intuitionistic, and free logics.

Table of Contents

  1. Introduction
  2. Linguistic Preliminaries: the Language M
    1. Syntax of M
    2. Semantics for M
  3. What is a Logic?
  4. Deductive System N
  5. The Status of the Deductive Characterization of Logical Consequence in Terms of N
    1. Tarski’s argument that the model-theoretic characterization of logical consequence is more basic than its characterization in terms of a deductive system
    2. Is deductive system N correct?
      1. Relevance logic
      2. Intuitionistic logic
      3. Free logic
  6. Conclusion
  7. References and Further Reading

1. Introduction

According to the deductive-theoretic conception of logical consequence, a sentence X is a logical consequence of a set K of sentences if and only if X is a deductive consequence of K, that is, X is deducible from K. X is deducible from K just in case there is an actual or possible deduction of X from K. In such a case, we say that X may be correctly inferred from K or that it would be correct to conclude X from K. A deduction is associated with a pair ; the set K of sentences is the basis of the deduction, and X is the conclusion. A deduction from K to X is a finite sequence S of sentences ending with X such that each sentence in S (that is, each intermediate conclusion) is derived from a sentence (or more) in K or from previous sentences in S in accordance with a correct principle of inference. The notion of a deduction is clarified by appealing to a deductive system. A deductive system D is a collection of rules that govern which sequences of sentences, associated with a given , are allowed and which are not. Such a sequence is called a proof in D (or, equivalently, a deduction in D) of X from K. The rules must be such that whether or not a given sequence associated with qualifies as a proof in D of X from K is decidable purely by inspection and calculation. That is, the rules provide a purely mechanical procedure for deciding whether a given object is a proof in D of X from K. We write

K ⊢D X

to mean

X is deducible in deductive system D from K.

See the entry Logical Consequence, Philosophical Considerations for discussion of the interplay between the concepts of logical consequence and deductive consequence, and deductive systems. We say that a deductive system D is correct when for any K and X, proofs in D of X from K correspond to intuitively valid deductions. For a given language the deductive consequence relation is defined in terms of a correct deductive system D only if it is true that

X is a deductive consequence of K if and only if X is deducible in D from K.

Sundholm (1983) offers a thorough survey of three main types of deductive systems. In this article, a natural deductive system is presented that originates in the work of the mathematician Gerhard Gentzen (1934) and the logician Fredrick Fitch (1952). We will refer to the deductive system as N (for ‘natural deduction’). For an in-depth introductory presentation of a natural deductive system very similar to N see Barwise and Etchemendy (2001). N is a collection of inference rules. A proof of X from K that appeals exclusively to the inference rules of N is a formal deduction or formal proof. We shall take a formal proof to be associated with a pair where K is a set of sentences from a first-order language M, which will be introduced below, and X is an M-sentence. The set K of sentences is the basis of the deduction, and X is the conclusion. We say that a formal deduction from K to X is a finite sequence S of sentences ending with X such that each sentence in S is either an assumption, deduced from a sentence (or more) in K, or deduced from previous sentences in S in accordance with one of N’s inference rules.

Formal proofs are not only epistemologically significant for securing knowledge, but also the derivations making up formal proofs may serve as models of the informal deductive reasoning performed using sentences from language M. Indeed, a primary value of a formal proof is that it can serve as a model of ordinary deductive reasoning that explains the force of such reasoning by representing the principles of inference required to get to X from K.

Gentzen, one of the first logicians to present a natural deductive system, makes clear that a primary motive for the construction of his system is to reflect as accurately as possible the actual logical reasoning involved in mathematical proofs. He writes,

My starting point was this: The formalization of logical deduction especially as it has been developed by Frege, Russell, and Hilbert, is rather far removed from the forms of deduction used in practice in mathematical proofs…In contrast, I intended first to set up a formal system which comes as close as possible to actual reasoning. The result was a ‘calculus of natural deduction’. (Gentzen 1934, p. 68)

Natural deductive systems are distinguished from other deductive systems by their usefulness in modeling ordinary, informal deductive inferential practices. Paraphrasing Gentzen, we may say that if one is interested in seeing logical connections between sentences in the most natural way possible, then a natural deductive system is a good choice for defining the deductive consequence relation.

The remainder of the article proceeds as follows. First, an interpreted language M is given. Next, we present the deductive system N and represent the deductive consequence relation in M. After discussing the philosophical significance of the deductive consequence relation defined in terms of N, we consider some standard criticisms of the correctness of deductive system N.

2. Linguistic Preliminaries: the Language M

Here we define a simple language M, a language about the McKeon family, by first sketching what strings qualify as well-formed formulas (wffs) in M. Next we define sentences from formulas, and then give an account of truth in M, that is we describe the conditions in which M-sentences are true.

a. Syntax of M

Building blocks of formulas

Terms

Individual names—’beth’, ‘kelly’, ‘matt’, ‘paige’, ‘shannon’, ‘evan’, and ‘w1‘, ‘w2‘, ‘w3‘, etc.

Variables—’x’, ‘y’, ‘z’, ‘x1‘, ‘y1‘, ‘z1‘, ‘x2‘, ‘y2‘, ‘z2‘, etc.

Predicates

1-place predicates—’Female’, ‘Male’

2-place predicates—’Parent’, ‘Brother’, ‘Sister’, ‘Married’, ‘OlderThan’, ‘Admires’, ‘=’.

Blueprints of well-formed formulas (wffs)

Atomic formulas: An atomic wff is any of the above n-place predicates followed by n terms which are enclosed in parentheses and separated by commas.

Formulas: The general notion of a well-formed formula (wff) is defined recursively as follows:

(1) All atomic wffs are wffs.
(2) If α is a wff, so is ''.
(3) If α and β are wffs, so is '(α & β)'.
(4) If α and β are wffs, so is 'v β)'.
(5) If α and β are wffs, so is '(α → β)'.
(6) If Ψ is a wff and v is a variable, then 'vΨ' is a wff.
(7) If Ψ is a wff and v is a variable, then 'vΨ' is a wff.
Finally, no string of symbols is a well-formed formula of M unless the string can be derived from (1)-(7).

The signs ‘~’, ‘&’, ‘v‘, and ‘→’, are called sentential connectives. The signs ‘∀’ and ‘∃’ are called quantifiers.

It will prove convenient to have available in M an infinite number of individual names as well as variables. The strings ‘Parent(beth, paige)’ and ‘Male(x)’ are examples of atomic wffs. We allow the identity symbol in an atomic formula to occur in between two terms, e.g., instead of ‘=(evan, evan)’ we allow ‘(evan = evan)’. The symbols ‘~’, ‘&’, ‘v‘, and ‘→’ correspond to the English words ‘not’, ‘and’, ‘or’ and ‘if…then’, respectively. ‘∃’ is our symbol for an existential quantifier and ‘∀’ represents the universal quantifier. 'vΨ' and 'vΨ' correspond to for some v, Ψ, and for all v, Ψ, respectively. For every quantifier, its scope is the smallest part of the wff in which it is contained that is itself a wff. An occurrence of a variable v is a bound occurrence iff it is in the scope of some quantifier of the form 'v' or the form 'v', and is free otherwise. For example, the occurrence of ‘x’ is free in ‘Male(x)’ and in ‘∃y Married(y, x)’. The occurrences of ‘y’ in the second formula are bound because they are in the scope of the existential quantifier. A wff with at least one free variable is an open wff, and a closed formula is one with no free variables. A sentence is a closed wff. For example, ‘Female(kelly)’ and ‘∃y∃x Married(y, x)’ are sentences but ‘OlderThan(kelly, y)’ and ‘(∃x Male(x) & Female(z))’ are not. So, not all of the wffs of M are sentences. As noted below, this will affect our definition of truth for M.

b. Semantics for M

We now provide a semantics for M. This is done in two steps. First, we specify a domain of discourse, that is, the chunk of the world that our language M is about, and interpret M’s predicates and names in terms of the elements composing the domain. Then we state the conditions under which each type of M-sentence is true. To each of the above syntactic rules (1-7) there corresponds a semantic rule that stipulates the conditions in which the sentence constructed using the syntactic rule is true. The principle of bivalence is assumed and so ‘not true’ and ‘false’ are used interchangeably. In effect, the interpretation of M determines a truth-value (true, false) for each and every sentence of M.

Domain D—The McKeons: Matt, Beth, Shannon, Kelly, Paige, and Evan.

Here are the referents and extensions of the names and predicates of M.

Terms: ‘matt’ refers to Matt, ‘beth’ refers to Beth, ‘shannon’ refers to Shannon, etc.

Predicates. The meaning of a predicate is identified with its extension, that is the set (possibly empty) of elements from the domain D the predicate is true of. The extension of a one-place predicate is a set of elements from D, the extension of a two-place predicate is a set of ordered pairs of elements from D.

The extension of ‘Male’ is {Matt, Evan}.

The extension of ‘Female’ is {Beth, Shannon, Kelly, Paige}.

The extension of ‘Parent’ is {<Matt, Shannon>, <Matt, Kelly>, <Matt, Paige>, <Matt, Evan>, <Beth, Shannon>, <Beth, Kelly>, <Beth, Paige>, <Beth, Evan>}.

The extension of ‘Married’ is {<Matt, Beth>, <Beth, Matt>}.

The extension of ‘Sister’ is {<Shannon, Kelly>, <Kelly, Shannon>, <Shannon, Paige>, <Paige, Shannon>, <Kelly, Paige>, <Paige, Kelly>, <Kelly, Evan>, <Paige, Evan>, <Shannon, Evan>}.

The extension of ‘Brother’ is {<Evan, Shannon>, <Evan, Kelly>, <Evan, Paige>}.

The extension of ‘OlderThan’ is {<Beth, Matt>, <Beth, Shannon>, <Beth, Kelly>, <Beth, Paige>, <Beth, Evan>, <Matt, Shannon>, <Matt, Kelly>, <Matt, Paige>, <Matt, Evan>, <Shannon, Kelly>, <Shannon, Paige>, <Shannon, Evan>, <Kelly, Paige>, <Kelly, Evan>, <Paige, Evan>}.

The extension of ‘Admires’ is {<Matt, Beth>, <Shannon, Matt>, <Shannon, Beth>, <Kelly, Beth>, <Kelly, Matt>, <Kelly, Shannon>, <Paige, Beth>, <Paige, Matt>, <Paige, Shannon>, <Paige, Kelly>, <Evan, Beth>, <Evan, Matt>, <Evan, Shannon>, <Evan, Kelly>, <Evan, Paige>}.

The extension of ‘=’ is {<Matt, Matt>, <Beth, Beth>, <Shannon, Shannon>, <Kelly, Kelly>, <Paige, Paige>, <Evan, Evan>}.

The atomic sentence ‘Female(kelly)’ is true because, as indicated above, the referent of ‘kelly’ is in the extension of the property designated by ‘Female’. The atomic sentence ‘Married(shannon, kelly)’ is false because the ordered pair is not in the extension of the relation designated by ‘Married’.

(I) An atomic sentence with a one-place predicate is true iff the referent of the term is a member of the extension of the predicate, and an atomic sentence with a two-place predicate is true iff the ordered pair formed from the referents of the terms in order is a member of the extension of the predicate.
(II) '' is true iff α is false.
(III) '(α & β)' is true when both α and β are true; otherwise '(α & β)' is false.
(IV) 'v β)' is true when at least one of α and β is true; otherwise 'v β)' is false.
(V) '(α → β)' is true if and only if (iff) α is false or β is true. So, '(α → β)' is false just in case α is true and β is false.

The meanings for ‘~’ and ‘&’ roughly correspond to the meanings of ‘not’ and ‘and’ as ordinarily used. We call '' and '(α & β)' negation and conjunction formulas, respectively. The formula '(~α v β)' is called a disjunction and the meaning of ‘v‘ corresponds to inclusive or. There are a variety of conditionals in English (e.g., causal, counterfactual, logical), with each type having a distinct meaning. The conditional defined by (V) above is called the material conditional. One way of following (V) is to see that the truth conditions for '(α → β)' are the same as for '~(α & ~β)'.

By (II) ‘~Married(shannon, kelly)’ is true because, as noted above, ‘Married(shannon, kelly)’ is false. (II) also tells us that ‘~Female(kelly)’ is false since ‘Female(kelly)’ is true. According to (III), ‘(~Married(shannon, kelly) & Female(kelly))’ is true because ‘~Married(shannon, kelly)’ is true and ‘Female(kelly)’ is true. And ‘(Male(shannon) & Female(shannon))’ is false because ‘Male(shannon)’ is false. (IV) confirms that ‘(Female(kelly) v Married(evan, evan))’ is true because, even though ‘Married(evan, evan)’ is false, ‘Female(kelly)’ is true. From (V) we know that the sentence ‘(~(beth = beth) → Male(shannon))’ is true because ‘~(beth = beth)’ is false. If α is false then '(α → β)' is true regardless of whether or not β is true. The sentence ‘(Female(beth) → Male(shannon))’ is false because ‘Female(beth)’ is true and ‘Male(shannon)’ is false.

Before describing the truth conditions for quantified sentences we need to say something about the notion of satisfaction. We’ve defined truth only for the formulas of M that are sentences. So, the notions of truth and falsity are not applicable to non-sentences such as ‘Male(x)’ and ‘((x = x) → Female(x))’ in which ‘x’ occurs free. However, objects may satisfy wffs that are non-sentences. We introduce the notion of satisfaction with some examples. An object satisfies ‘Male(x)’ just in case that object is male. Matt satisfies ‘Male(x)’, Beth does not. This is the case because replacing ‘x’ in ‘Male(x)’ with ‘Matt’ yields a truth while replacing the variable with ‘beth’ yields a falsehood. An object satisfies ‘((x = x) → Female(x))’ if and only if it is either not identical with itself or is a female. Beth satisfies this wff (we get a truth when ‘beth’ is substituted for the variable in all of its occurrences), Matt does not (putting ‘matt’ in for ‘x’ wherever it occurs results in a falsehood). As a first approximation, we say that an object with a name, say ‘a’, satisfies a wff 'Ψv' in which at most v occurs free if and only if the sentence that results by replacing v in all of its occurrences with ‘a’ is true. ‘Male(x)’ is neither true nor false because it is not a sentence, but it is either satisfiable or not by a given object. Now we define the truth conditions for quantifications, utilizing the notion of satisfaction. For a more detailed discussion of the notion of satisfaction, see the article, “Logical Consequence, Model-Theoretic Conceptions.”

Let Ψ be any formula of M in which at most v occurs free.

(VI) 'vΨ' is true just in case there is at least one individual in the domain of quantification (e.g. at least one McKeon) that satisfies Ψ.
(VII) 'vΨ' is true just in case every individual in the domain of quantification (e.g. every McKeon) satisfies Ψ.

Here are some examples. ‘∃x(Male(x) & Married(x, beth))’ is true because Matt satisfies ‘(Male(x) & Married(x, beth))’; replacing ‘x’ wherever it appears in the wff with ‘matt’ results in a true sentence. The sentence ‘∃xOlderThan(x, x)’ is false because no McKeon satisfies ‘OlderThan(x, x)’, that is replacing ‘x’ in ‘OlderThan(x, x)’ with the name of a McKeon always yields a falsehood.

The universal quantification ‘∀x( OlderThan(x, paige) → Male(x))’ is false for there is a McKeon who doesn’t satisfy ‘(OlderThan(x, paige) → Male(x))’. For example, Shannon does not satisfy ‘(OlderThan(x, paige) → Male(x))’ because Shannon satisfies ‘OlderThan(x, paige)’ but not ‘Male(x)’. The sentence ‘∀x(x = x)’ is true because all McKeons satisfy ‘x = x’; replacing ‘x’ with the name of any McKeon results in a true sentence.

Note that in the explanation of satisfaction we suppose that an object satisfies a wff only if the object is named. But we don’t want to presuppose that all objects in the domain of discourse are named. For the purposes of an example, suppose that the McKeons adopt a baby boy, but haven’t named him yet. Then, ‘∃x Brother(x, evan)’ is true because the adopted child satisfies ‘Brother(x, evan)’, even though we can’t replace ‘x’ with the child’s name to get a truth. To get around this is easy enough. We have added a list of names, ‘w1′, ‘w2′, ‘w3′, etc. to M, and we may say that any unnamed object satisfies 'Ψv' iff the replacement of v with a previously unused wi assigned as a name of this object results in a true sentence. In the above scenerio, ‘∃xBrother(x, evan)’ is true because, ultimately, treating ‘w1‘ as a temporary name of the child, ‘Brother(w1, evan)’ is true. Of course, the meanings of the predicates would have to be amended in order to reflect the addition of a new person to the domain of McKeons.

3. What is a Logic?

We have characterized an interpreted formal language M by defining what qualifies as a sentence of M and by specifying the conditions under which any M-sentence is true. The received view of logical consequence entails that the logical consequence relation in M turns on the nature of the logical constants in the relevant M-sentences. We shall regard just the sentential connectives, the quantifiers of M, and the identity predicate as logical constants (the language M is a first-order language). For discussion of the notion of a logical constant see Logical Consequence, Philosophical Considerations and Logical Consequence, Model-Theoretic Conceptions. Intuitively, one M-sentence is a logical consequence of a set of M-sentences if and only if it is impossible for all the sentences in the set to be true without the former sentence being true as well. A model-theoretic conception of logical consequence in M clarifies this intuitive characterization of logical consequence by appealing to the semantic properties of the logical constants, represented in the above truth clauses (I)-(VII). The entry Logical Consequence, Model-Theoretic Conceptions formalizes the account of truth in language M and gives a model-theoretic characterization of logical consequence in M. In contrast to the model-theoretic conception, the deductive-theoretic conception clarifies logical consequence, conceived of in terms of deducibility, by appealing to the inferential properties of logical constants portrayed as intuitively valid principles of inference, that is, principles justifying steps in deductions. See Logical Consequence, Philosophical Considerations for discussion of the relationship between the logical consequence relation and the model-theoretic and deductive-theoretic conceptions of it.

Deductive system N’s inference rules, introduced below, are introduction and elimination rules, defined for each logical constant of our language M. An introduction rule introduces a logical constant into a proof and is useful for deriving a sentence that contains the constant. An elimination rule for the constant makes it possible to derive a sentence that has at least one less occurrence of the logical constant. Elimination rules are useful for deriving a sentence from another in which the constant appears.

Following Shapiro (1991, p. 3), we define a logic to be a language L plus either a model-theoretic or a deductive-theoretic account of logical consequence. A language with both characterizations is a full logic just in case both characterizations coincide. For discussion on the relationship between the model-theoretic and deductive-theoretic accounts of logical consequence, see Logical Consequence, Philosophical Considerations. The logic for M developed below may be viewed as a classical logic or a first-order theory.

4. Deductive System N

In stating N’s rules, we begin with the simpler inference rules and give a sample formal deduction of them in action. Then we turn to the inference rules that employ what we shall call sub-proofs. In the statement of the rules, we let P and Q be any sentences from our language M. We shall number each line of a formal deduction with a positive integer. We let k, l, m, n, o, p and q be any positive integers such that k < m, and l < m, and m < n < o < p < q.

&-Intro

k. P
l. Q
m. (P & Q) &-Intro: k, l

&-Elim

k. (P & Q) k. (P & Q)
m. P &-Elim: k m. Q &-Elim: k

&-Intro allows us to derive a conjunction from both of its two parts (called conjuncts). According to the &-Elim rule we may derive a conjunct from a conjunction. To the right of the sentence derived using an inference rule is the justification. Steps in a proof are justified by identifying both the lines in the proof used and by citing the appropriate rule. The vertical lines serve as proof margins, which, as you will shortly see, help in portraying the structure of a proof when it contains embedded sub-proofs.

~-Elim

k. ~~P
m. P ~-Elim: k

The ~-Elim rule allows us to drop double negations and infer what was subject to the two negations.

v-Intro

k. P k. P
m. (P v Q) v-Intro: k m. (Q v P) v-Intro: k

By v-Intro we may derive a disjunction from one of its parts (called disjuncts).

-Elim

k. (P → Q)
l. P
m. Q →-Elim: k, l

The →- Elim rule corresponds to the principle of inference called modus ponens: from a conditional and its antecedent one may infer the consequent.

Here’s a sample deduction using the above inference rules. The formal deduction–the sequence of sentences 4-11—is associated with the pair

<{(Female(paige) & Female (kelly)), (Female(paige) → ~~Sister(paige, kelly)), (Female(kelly) → ~~Sister(paige, shannon))}, ((Sister(paige, kelly) & Sister(paige, shannon)) v Male(evan))>.

The first element is the set of basis sentences and the second element is the conclusion. We number the basis sentences and list them (beginning with 1) ahead of the deduction. The deduction ends with the conclusion.

1. (Female(paige) & Female (kelly)) Basis
2. (Female(paige) → ~~Sister(paige, kelly)) Basis
3. (Female(kelly) → ~~Sister(paige, shannon)) Basis
4. Female(paige) &-Elim: 1
5. Female(kelly) &-Elim: 1
6. ~~Sister(paige, kelly) →-Elim: 2, 4
7. Sister(paige, kelly) ~-Elim: 6
8. ~~Sister(paige, shannon) →-Elim: 3, 5
9. Sister(paige, shannon) ~-Elim: 8
10. (Sister(paige, kelly) & Sister(paige, shannon)) &-Intro: 7, 9
11. ((Sister(paige, kelly) & Sister(paige, shannon)) v Male(evan)) v-Intro: 10

Again, the column all the way to the right gives the explanations for each line of the proof. Assuming the adequacy of N, the formal deduction establishes that the following inference is correct.

(Female(paige) & Female (kelly))
(Female(paige) → ~~Sister(paige, kelly))
(Female(kelly) → ~~Sister(paige, shannon))


(therefore) ((Sister(paige, kelly) & Sister(paige, shannon)) v Male(evan))

For convenience in building proofs, we expand M to include ‘⊥’, which we use as a symbol for a contradiction (e.g., ‘(Female(beth) & ~Female(beth))’).

⊥-Intro

k. P
l. ~P
m. ⊥-Intro: k, l

⊥-Elim

k.
m. P ⊥-Elim: k

If we have derived a sentence and its negation we may derive ⊥ using ⊥-Intro. The ⊥-Elim rule represents the idea that any sentence P is deducible from a contradiction. So, from ⊥ we may derive any sentence P using ⊥-Elim.

Here’s a deduction using the two rules.

1. (Parent(beth, evan) & ~Parent(beth, evan)) Basis
2. Parent(beth, evan) &-Elim: 1
3. ~Parent(beth, evan) &-Elim: 1
4. ⊥-Intro: 2, 3
5. Parent(beth, shannon) ⊥-Elim: 4

For convenience, we introduce a reiteration rule that allows us to repeat steps in a proof as needed.

Reit

k. P
.
.
.
m. P Reit: k

We now turn to the rules for the sentential connectives that employ what we shall call sub-proofs. Consider the following inference.

1. ~(Married(shannon, kelly) & OlderThan(shannon, kelly))
2. Married(shannon, kelly)


(therefore) ~Olderthan(shannon, kelly)

Here is an informal deduction of the conclusion from the basis sentences.

Proof: Suppose that ‘Olderthan(shannon, kelly)’ is true. Then, from this assumption and basis sentence 2 it follows that ‘((Shannon is married to Kelly) & (Shannon is taller than Kelly))’ is true. But this contradicts the first basis sentence ‘~((Shannon is married to Kelly) & (Shannon is taller than Kelly))’, which is true by hypothesis. Hence our initial supposition is false. We have derived that ‘~(Shannon is married to Kelly)’ is true.

Such a proof is called a reductio ad absurdum proof (or reductio for short). Reductio ad absurdum is Latin for ‘reduction to the absurd’. (For more information, see the article “Reductio ad absurdum“.) In order to model this proof in N we introduce the ~-Intro rule.

~-Intro

k. P Assumption
.
.
.
m.
n. ~P ~-Intro: k-m

The ~-Intro rule allows us to infer the negation of an assumption if we have derived a contradiction, symbolized by ‘⊥’, from the assumption. The indented proof margin (k-m) signifies a sub-proof. In a sub-proof the first line is always an assumption (and so requires no justification), which is cancelled when the sub-proof is ended and we are back out on a line that sits on a wider proof margin. The effect of this is that we can no longer appeal to any of the lines in the sub-proof to generate later lines on wider proof margins. No deduction ends in the middle of a sub-proof.

Here is a formal analogue of the above informal reductio.

1. ~(Married(shannon, kelly) & OlderThan(shannon, kelly)) Basis
2. Married(shannon, kelly) Basis
3. OlderThan(shannon, kelly) Assumption
4. (Married(shannon, kelly) & OlderThan(shannon, kelly)) &-Intro: 2, 3
5. ⊥-Intro: 1, 4
6. ~Olderthan(shannon, kelly) ~-Intro: 3-5

We signify a sub-proof with the indented proof margin line; the start and finish of a sub-proof is indicated by the start and break of the indented proof margin. An assumption, like a basis sentence, is a supposition we suppose true for the purposes of the deduction. The difference is that whereas a basis sentence may be used at any step in a proof, an assumption may only be used to make a step within the sub-proof it heads. At the end of the sub-proof, the assumption is discharged. We now look at more sub-proofs in action and introduce another of N’s inference rules. Consider the following inference.

1. (Male(kelly) v Female(kelly))
2. (Male(kelly) → ~Sister(kelly, paige))
3. (Female(kelly) → ~Brother(kelly, evan))


(therefore) (~Sister(kelly, paige) v ~Brother(kelly, evan))

Informal Proof:

By assumption ‘(Male(kelly) v Female(kelly))’ is true, that is, by assumption at least one of the disjuncts is true.

Suppose that ‘Male(kelly)’ is true. Then by modus ponens we may derive that ‘~Sister(kelly, paige)’ is true from this assumption and the basis sentence 2. Then ‘(~Sister(kelly, paige) v ~Brother(kelly, evan))’ is true.

Suppose that ‘Female(kelly)’ is true. Then by modus ponens we may derive that ‘~Brother(kelly, evan)’ is true from this assumption and the basis sentence 3. Then ‘(~Sister(kelly, paige) v ~Brother(kelly, evan))’ is true.

So in either case we have derived that ‘(~Sister(kelly, paige) v ~Brother(kelly, evan))’ is true. Thus we have shown that this sentence is a deductive consequence of the basis sentences.

We model this proof in N using the v-Elim rule.

v-Elim

k. (P v Q)
m. P Assumption
.
.
.
n. R
o. Q Assumption
.
.
.
p. R
q. R v-Elim: k, m-n, o-p

The v-Elim rule allows us to derive a sentence from a disjunction by deriving it from each disjunct, possibly using sentences on earlier lines that sit on wider proof margins.

The following formal proof models the above informal one.

1. (Male(kelly) v Female(kelly)) Basis
2. (Male(kelly) → ~Sister(kelly, paige)) Basis
3. (Female(kelly) → ~Brother(kelly, evan)) Basis
4. Male(kelly) Assumption
5. ~Sister(kelly, paige) →-Elim: 2, 4
6. (~Sister(kelly, paige) v ~Brother(kelly, evan)) v-Intro: 5
7. Female(kelly) Assumption
8. ~Brother(kelly, evan) →-Elim: 3, 7
9. (~Sister(kelly, paige) v ~Brother(kelly, evan)) v-Intro: 8
10. (~Sister(kelly, paige) v ~Brother(kelly, evan)) v-Elim: 1, 4-6, 7-9

1. (P v Q) Basis
2. ~P Basis
3. P Assumption
4. ⊥-Intro: 2, 3
5. Q ⊥-Elim: 4
6. Q Assumption
7. Q Reit: 6
8. Q v-Elim: 1, 3-5, 6-7

Now we introduce the →-Intro rule by considering the following inference.

1. (Olderthan(shannon, kelly) → OlderThan(shannon, paige))
2. (OlderThan(shannon, paige) → OlderThan(shannon, evan))


(therefore) (Olderthan(shannon, kelly) → OlderThan(shannon, evan))

Informal proof:

Suppose that OlderThan(shannon, kelly). From this assumption and basis sentence 1 we may derive, by modus ponens, that OlderThan(shannon, paige). From this and basis sentence 2 we get, again by modus ponens, that OlderThan(shannon, evan). Hence, if OlderThan(shannon, kelly), then OlderThan(shannon, evan).

The structure of this proof is that of a conditional proof: a deduction of a conditional from a set of basis sentence which starts with the assumption of the antecedent, then a derivation of the consequent, and concludes with the conditional. To build conditional proofs in N, we rely on the →-Intro rule.

-Intro

k. P Assumption
.
.
.
m. Q
n. (P → Q) →-Intro: k-m

According to the →-Intro rule we may derive a conditional if we derive the consequent Q from the assumption of the antecedent P, and, perhaps, other sentences occurring earlier in the proof on wider proof margins. Again, such a proof is called a conditional proof.

We model the above informal conditional proof in N as follows.

1. (Olderthan(shannon, kelly) → OlderThan(shannon, paige)) Basis
2. (Olderthan(shannon, paige) → OlderThan(shannon, evan)) Basis
3. OlderThan(shannon, kelly) Assumption
4. OlderThan(shannon, paige) →-Elim: 1, 3
5. OlderThan(shannon, evan) →-Elim: 2, 4
6. (OlderThan(shannon, kelly) → OlderThan(shannon, evan)) →-Intro: 3-5

Mastery of a deductive system facilitates the discovery of proof pathways in hard cases and increases one’s efficiency in communicating proofs to others and explaining why a sentence is a logical consequence of others. For example, suppose that (1) if Beth is not Paige’s parent, then it is false that if Beth is a parent of Shannon, Shannon and Paige are sisters. Further suppose (2) that Beth is not Shannon’s parent. Then we may conclude that Beth is Paige’s parent. Of course, knowing the type of sentences involved is helpful for then we have a clearer idea of the inference principles that may be involved in deducing that Beth is a parent of Paige. Accordingly, we represent the two basis sentences and the conclusion in M, and then give a formal proof of the latter from the former.

1. (~Parent(beth, paige) → ~(Parent(beth, shannon) → Sister(shannon, paige))) Basis
2. ~Parent(beth, shannon) Basis
3. ~Parent(beth, paige) Assumption
4. ~(Parent(beth, shannon) → Sister(shannon, paige)) →-Elim: 1, 3
5. Parent(beth, shannon) Assumption
6. ⊥-Intro: 2, 5
7. Sister(shannon, paige) ⊥-Elim: 6
8. (Parent(beth, shannon) → Sister(shannon, paige)) →-Intro: 5-7
9. ⊥-Intro: 4, 8
10. ~~Parent(beth, paige) ~-Intro: 3-9
11. Parent(beth, paige) ~-Elim: 10

Because we derived a contradiction at line 9, we got ‘~~Parent(beth, paige)’ at line 10, using ~-Intro, and then we derived ‘Parent(beth, paige)’ by ~-Elim. Look at the conditional proof (lines 5-7) from which we derived line 8. Pretty neat, huh? Lines 2 and 5 generated the contradiction from which we derived ‘Sister(shannon, paige)’ at line 7 in order to get the conditional at line 8. This is our first example of a sub-proof (5-7) embedded in another sub-proof (3-9). It is unlikely that independent of the resources of a deductive system, a reasoner would be able to readily build the informal analogue of this pathway from the basis sentences to the sentence at line 11. Again, mastery of a deductive system such as N can increase the efficiency of our performances of rigorous reasoning and cultivate skill at producing elegant proofs (proofs that take the least number of steps to get from the basis to the conclusion).

We now introduce the Intro and Elim rules for the identity symbol and the quantifiers. Let n and n’ be any names, and 'Ωn' and 'Ωn’ ' be any well-formed formulas in which n and n’ appear and that have no free variables.

=-Intro

k. (n = n) =-Intro

=-Elim

k. Ωn
l. (n = n’ )
m. Ωn’ =-Elim: k, l

The =-Intro rule allows us to introduce '(n = n)' at any step in a proof. Since '(n = n)' is deducible from any sentence, there is no need to identify the lines from which line k is derived. In effect, the =-Intro rule confirms that ‘(paige = paige)’, ‘(shannon = shannon)’, ‘(kelly = kelly)’, etc… may be inferred from any sentence(s). The =-Elim rule tells us that if we have proven 'Ωn' and '(n = n’ )', then we may derive 'Ωn’ ' which is gotten from 'Ωn' by replacing n with n’ in some but possibly not all occurrences. The =-Elim rule represents the principle known as the indiscernibility of identicals, which says that if '(n = n’ )' is true, then whatever is true of the referent of n is true of the referent of n’. This principle grounds the following inference

1. ~Sister(beth, kelly)
2. (beth = shannon)


(therefore) ~Sister(shannon, kelly)

The indiscernibility of identicals is fairly obvious. If I know that Beth isn’t Kelly’s sister and that Beth is Shannon (perhaps ‘Shannon’ is an alias) then this establishes, with the help of the indiscernibility of identicals, that Shannon isn’t Kelly’s sister. Now we turn to the quantifier rules.

Let 'Ωv' be a formula in which v is the only free variable, and let n be any name.

∃-Intro

k. Ωn
m. vΩv ∃-Intro: k

∃-Elim

k. vΩv
[n] m. Ωn Assumption
.
.
.
n. P
o. P ∃-Elim: k, m-n

Here, n must be unique to the subproof, that is, n doesn’t occur on any of the lines above m and below n.

The ∃-Intro rule, which represents the principle of inference known as existential generalization, tells us that if we have proven 'Ωn', then we may derive 'vΩv' which results from 'Ωn' by replacing n with a variable v in some but possibly not all of its occurrences and prefixing the existential quantifier. According to this rule, we may infer, say, ‘∃x Married(x, matt)’ from the sentence ‘Married(beth, matt)’. By the ∃-Elim rule, we may reason from a sentence that is produced from an existential quantification by stripping the quantifier and replacing the resulting free variable in all of its occurrences by a name which is new to the proof. Recall that the language M has an infinite number of constants, and the name introduced by the ∃-Elim rule may be one of the wi. We regard the assumption at line l, which starts the embedded sub-proof, as saying “Suppose n names an arbitrary individual from the domain of discourse such that 'Ωn' is true.” To illustrate the basic idea behind the ∃-Elim rule, if I tell you that Shannon admires some McKeon, you can’t infer that Shannon admires any particular McKeon such as Matt, Beth, Shannon, Kelly, Paige, or Evan. Nevertheless we have it that she admires somebody. The principle of inference corresponding to the ∃-Elim rule, called existential instantiation, allows us to assign this ‘somebody’ an arbitrary name new to the proof, say, ‘w1‘ and reason within the relevant sub-proof from ‘Shannon admires w1‘. Then we cancel the assumption and infer a sentence that doesn’t make any claims about w1. For example, suppose that (1) Shannon admires some McKeon. Let’s call this McKeon ‘w1‘, that is, assume (2) that Shannon admires a McKeon named ‘w1‘. By the principle of inference corresponding to v-Intro we may derive (3) that Shannon admires w1 or w1 admires Kelly. From (3), we may infer by existential generalization (4) that for some McKeon x, Shannon admires x or x admires Kelly. We now cancel the assumption (that is, cancel (2)) by concluding (5) that for some McKeon x, Shannon admires x or x admires Kelly from (1) and the subproof (2)-(4), by existential instantiation. Here is the above reasoning set out formally.

1. ∃x Admires(shannon, x) Basis
[w1] 2. Admires(shannon, w1) Assumption
3. (Admires(shannon, w1) v Admires(w1, kelly)) v-Intro: 2
4. ∃x(Admires(shannon, x) v Admires(x, kelly)) ∃-Intro: 3
5. ∃x(Admires(shannon, x) v Admires(x, kelly)) ∃-Elim: 1, 2-4

The string at the assumption of the sub-proof (line 2) says “Suppose that ‘w1 ‘ names an arbitrary McKeon such that ‘Admires(shannon, w1)’ is true.” This is not a sentence of M, but of the meta-language for M, that is, the language used to talk about M. Hence, the ∃-Elim rule (as well as the ∀-Intro rule introduced below) has a meta-linguistic character.

∀-Intro

[n] k. Assumption
.
.
.
m. Ωn
n. vΩv ∀-Intro: k-m
n must be unique to the subproof

∀-Elim

k. vΩv
m. Ωn ∀-Elim: k

The ∀-Elim rule corresponds to the principle of inference known as universal instantiation: to infer that something holds for an individual of the domain if it holds for the entire domain. The ∀-Intro rule allows us to derive a claim that holds for the entire domain of discourse from a proof that the claim holds for an arbitrary selected individual from the domain. The assumption at line k reads in English “Suppose n names an arbitrarily selected individual from the domain of discourse.” As with the ∃-Elim rule, the name introduced by the ∀-Intro rule may be one of the wi. The ∀-Intro rule corresponds to the principle of inference often called universal generalization.

For example, suppose that we are told that (1) if a McKeon admires Paige, then that McKeon admires himself/herself, and that (2) every McKeon admires Paige. To show that we may correctly infer that every McKeon admires himself/herself we appeal to the principle of universal generalization, which (again) is represented in N by the ∀-Intro rule. We begin by assuming that (3) a McKeon is named ‘w1‘. All we assume about w1 is that w1 is one of the McKeons. From (2), we infer that (4) w1 admires Paige. We know from (1), using the principle of universal instantiation (the ∀-Elim rule in N), that (5) if w1 loves Paige then w1 loves w1. From (4) and (5) we may infer that (6) w1 loves w1 by modus ponens. Since w1 is an arbitrarily selected individual (and so what holds for w1 holds for all McKeons) we may conclude from (3)-(6) that (7) every McKeon loves himself/herself follows from (1) and (2) by universal generalization. This reasoning is represented by the following formal proof.

1. ∀x(Admires(x, paige) → Admires(x, x)) Basis
2. ∀x Admires(x, paige) Basis
[w1] 3. Assumption
4. Admires(w1, paige) ∀-Elim: 2
5. (Admires(w1, paige) → Admires(w1, w1)) ∀-Elim: 1
6. Admires(w1, w1) →-Elim: 4, 5
7. ∀x Admires(x, x) ∀-Intro: 3-6

Line 3, the assumption of the sub-proof, corresponds to the English sentence “Let ‘w1‘ refer to an arbitrary McKeon.” The notion of a name referring to an arbitrary individual from the domain of discourse, utilized by both the ∀-Intro and ∃-Elim rules in the assumptions that start the respective sub-proofs, incorporates two distinct ideas. One, relevant to the ∃-Elim rule, means “some specific object, but I don’t know which”, while the other, relevant to the ∀-Intro rule means “any object, it doesn’t matter which” (See Pelletier 1999, pp. 118-120 for discussion.)

Consider:

K = {All McKeons admire those who admire somebody, Some McKeon admires a McKeon}
X = Paige admires Paige

Here’s a proof that X is deducible from K.

1. ∀x(∃y Admires(x, y) → ∀z Admires(z, x)) Basis
2. ∃x∃y Admires(x, y) Basis
[w1] 3. ∃y Admires(w1, y) Assumption
4. (∃y Admires(w1, y) → ∀z Admires(z, w1)) ∀-Elim: 1
5. ∀z Admires(z, w1) →-Elim: 3, 4
6. Admires(paige, w1) ∀-Elim: 5
7. ∃y Admires(paige, y) ∃-Intro: 6
8. (∃y Admires(paige, y) → ∀z Admires(z, paige)) ∀-Elim: 1
9. ∀z Admires(z, paige) →-Elim: 7, 8
10. Admires(paige, paige) ∀-Elim: 9
11. Admires(paige, paige) ∃-Elim: 2, 3-10

An informal correlate put somewhat succinctly, runs as follows.

Let’s call the unnamed admirer, mentioned in (2), w1. From this and (1), every McKeon admires w1 and so Paige admires w1. Hence, Paige admires somebody. From this and (1) it follows that everybody admires Paige. So, Paige admires Paige. This is our desired conclusion

Even though the informal proof skips steps and doesn’t mention by name the principles of inference used, the formal proof guides its construction.

5. The Status of the Deductive Characterization of Logical Consequence in Terms of N

We began the article by presenting the deductive-theoretic characterization of logical consequence: X is a logical consequence of a set K of sentences if and only if X is deducible from K, that is, there is a deduction of X from K. To make it official, we now characterize the deductive consequence relation in M in terms of deducibility in N.

X is a deductive consequence of K if and only if K ⊢N X, that is, X is deducible in N from K

We now inquire into the status of this characterization of deductive consequence.

The first thing to note is that deductive system N is complete and sound with respect to the model-theoretic consequence relation defined in Logical Consequence, Model-Theoretic Conceptions: Section 4.4. Let

K ⊢N X

abbreviate

X is deducible in N from K

Similarly, let

K ⊨ X

abbreviate

X is a model-theoretic consequence of K, that is, every M-structure that is a model of K is also a model of X. (For more information on structures and models, see Logical Consequence, Model-Theoretic Conceptions.)

The completeness and soundness of N means that for any set K of M sentences and M-sentence X, K ⊢N X if and only if K ⊨ X. A soundness proof establishes K ⊢N X only if K ⊨ X, and a completeness proof establishes K ⊢N X if K ⊨ X. So, the ⊢N and ⊨ relations, defined on sentences of M, are extensionally equivalent. The question arises: which characterization of the logical consequence relation is more basic or fundamental?

a. Tarski’s argument that the model-theoretic characterization of logical consequence is more basic than its characterization in terms of a deductive system

The first thing to note is that the ⊢N-consequence relation is compact. For any deductive system D and pair there is a K’ such that, K ⊢D X if and only if K’ ⊢D X, where K’ is a finite subset of sentences from K. As pointed out by Tarski (1936), among others, there are intuitively correct principles of inference reflected in certain languages according to which one may infer a sentence X from a set K of sentences, even though it is incorrect to infer X from any finite subset of K. Here’s a rendition of his reasoning, focusing on the ⊢N-consequence relation defined on a language for arithmetic, which allows us to talk about the natural numbers 0, 1, 2, 3, and so on. Let ‘P’ be a predicate defined over the domain of natural numbers and let ‘NatNum(x)’ abbreviate ‘x is a natural number’. According to Tarski, intuitively,

∀x(NatNum(x) → P(x))

is a logical consequence of the infinite set S of sentences

P(0)
P(1)
P(2)
.
.
.

However, the universal quantification is not a ⊢N-consequence of the set S. The reason why is that the ⊢N-consequence relation is compact: for any sentence X and set K of sentences, X is a ⊢N-consequence of K, if and only if X is a ⊢N-consequence of some finite subset of K. Proofs in N are objects of finite length; a deduction is a finite sequence of sentences. Since the universal quantification is not a ⊢N-consequence of any finite subset of S, it is not a ⊢N-consequence of S. By the completeness of system N, it follows that

∀x(NatNum(x) → P(x))

is not a ⊨-consequence of S either. Consider the structure U* whose domain is the set of McKeons. Let all numerals name Beth. Let the extension of ‘NatNum’ be the entire domain, and the extension of ‘P’ be just Beth. Then each element of S is true in U*, but ‘∀x (NatNum(x) → P(x))’ is not true in U*. (See Logical Consequence, Model-Theoretic Conceptions for further discussion of structures.) Note that the sentences in S only say that P holds for 0, 1, 2, and so on, and not also that 0,1, 2, etc., are all the elements of the domain of discourse. The above interpretation takes advantage of this fact by reinterpreting all numerals as names for Beth.

However, we can reflect model-theoretically the intuition that ‘∀x(NatNum(x) → P(x))’ is a logical consequence of set S by doing one of two things. We can add to S the functional equivalent of the claim that 1, 2, 3, etc., are all the natural numbers there are on the basis that this is an implicit assumption of the view that the universal quantification follows from S. Or we could add ‘NatNum’ and all numerals to our list of logical terms. On either option it still won’t be the case that ‘∀x(NatNum(x) → P(x))’ is a ⊢N-consequence of the set S. There is no way to accommodate the intuition that ‘∀x(NatNum(x) → P(x))’ is a logical consequence of S in terms of a compact consequence relation. Tarski takes this to be a reason to think that the model-theoretic account of logical consequence is definitive as opposed to an account of logical consequence in terms of a compact consequence relation such as ⊢N.

Tarski’s illustration shows that what is called the ω-rule is a correct inference rule.

The ω-rule is that from:

{P(0), P(1), P(2), …}

one may infer

∀x(NatNum(x) → P(x))

with respect to any predicate P. Any inference guided by this rule is correct even though it can’t be represented in a deductive system as this notion has been used here and discussed in Logical Consequence, Philosophical Considerations.

Compactness is not a salient feature of logical consequence conceived deductive theoretically. This suggests, by the third criterion of a successful theoretical definition of logical consequence mentioned in Logical Consequence, Philosophical Considerations, that no compact consequence relation is definitive of the intuitive notion of deducibility. So, assuming that deductive system N is correct (that is, deducibility is co-extensive in M with the ⊢N-relation), we can’t treat

X is intuitively deducible from K if and only if K ⊢N X.

as a definition of deducibility in M since

X is a deductive consequence of K if and only if X is deducible in a correct deductive system from K.

is not true with respect to languages for which deducibility is not captured by any compact consequence relation (that is, not captured by any deduction-system account of it ). Some (e.g., Quine) demur using a language for purposes of science in which deducibility is not completely represented by a deduction-system account because of epistemological considerations. Nevertheless, as Tarski (1936) argues, the fact that there cannot be deduction-system accounts of some intuitively correct principles of inference is reason for taking a model-theoretic characterization of logical consequence to be more fundamental than any characterization in terms of a deductive system sound and complete with respect to the model-theoretic characterization.

b. Is deductive system N correct?

In discussing the status of the characterization of logical consequence in terms of deductive system N, we assumed that N is correct. The question arises whether N is, indeed, correct. That is, is it the case that X is intuitively deducible from K if and only if K ⊢N X? The biconditional holds only if both (1) and (2) are true.

(1) If sentence X is intuitively deducible from set K of sentences, then K ⊢N X.
(2) If K ⊢N X, then sentence X is intuitively deducible from set K of sentences.

So N is incorrect if either (1) or (2) is false. The truth of (1) and (2) is relevant to the correctness of the characterization of logical consequence in terms of system N, because any adequate deductive-theoretic characterization of logical consequence must identify the logical terms of the relevant language and account for their inferential properties (for discussion, see Logical Consequence, Philosophical Considerations: Section 4). (1) is false if the list of logical terms in M is incomplete. In such a case, there will be a sentence X and set K of sentences such that X is intuitively deducible from set K because of at least one inferential property of logical terminology unaccounted for by N and so false that K ⊢N X (for discussion of some of the issues surrounding what qualifies as a logical term see Logical Consequence, Model-theoretic Conceptions: Section 5.3). In this case, N would be incorrect because it wouldn’t completely account for the inferential machinery of language M. (2) is false if there are deductions in N that are intuitively incorrect. Are there such deductions? In order to fine-tune the question note that the sentential connectives, the identity symbol, and the quantifiers of M are intended to correspond to or, and, not, if…then (the indicative conditional), is identical with, some, and all. Hence, N is a correct deductive system only if the Intro and Elim rules of N reflect the inferential properties of the ordinary language expressions. In what follows, we sketch three views that are critical of the correctness of system N because they reject (2).

i. Relevance logic

Not everybody accepts it as a fact that any sentence is deducible from a contradiction, and so some question the correctness of the ⊥-Elim rule. Consider the following informal proof of Q from 'P & ~P', for sentences P and Q, as a rationale for the ⊥-Elim rule.

From (1) P and not-P, we may correctly infer (2) P, from which it is correct to infer (3) P or Q. We derive (4) not-P from (1). (5) P follows from (3) and (4).

The proof seems to be composed of valid modes of inference. Critics of the ⊥-Elim rule are obliged to tell us where it goes wrong. Here we follow the relevance logicians Anderson and Belnap (1962, pp.105-108; for discussion, see Read 1995, pp. 54-60). In a nutshell, Anderson and Belnap claim that the proof is defective because it commits a fallacy of equivocation. The move from (2) to (3) is correct only if or has the sense of at least one. For example, from Kelly is female it is legit to infer that at least one of the two sentences Kelly is female and Kelly is older than Paige is true. On this sense of or given that Kelly is female, one may infer that Kelly is female or whatever you like. However, in order for the passage from (3) and (4) to (5) to be legitimate the sense of or in (3) is if not-…then. For example from if Kelly is not female, then Kelly is not Paige’s sister and Kelly is not female it is correct to infer Kelly is not Paige’s sister. Hence, the above “support” for the ⊥-Elim rule is defective for it equivocates on the meaning of or.

Two things to highlight. First, Anderson and Belnap think that the inference from (2) to (3) on the if not-…then reading of or is incorrect. Given that Kelly is female it is problematic to deduce that if she is not then Kelly is older than Paige—or whatever you like. Such an inference commits a fallacy of relevance for Kelly not being female is not relevant to her being older than Paige. The representation of this inference in system N appeals to the ⊥-Elim rule, which is rejected by Anderson and Belnap. Second, the principle of inference underlying the move from (3) and (4) to (5)—from P or Q and not-P to infer Q—is called the principle of the disjunctive syllogism. Anderson and Belnap claim that this principle is not generally valid when or has the sense of at least one, which it has when it is rendered by ‘v‘ (e.g., see above). If Q is relevant to P, then the principle holds on this reading of or.

It is worthwhile to note the essentially informal nature of the debate. It calls upon our pre-theoretic intuitions about correct inference. It would be quite useless to cite the proof in N of the validity of disjunctive syllogism (given above) against Anderson and Belnap for it relies on the ⊥-Elim rule whose legitimacy is in question. No doubt, pre-theoretical notions and original intuitions must be refined and shaped somewhat by theory. Our pre-theoretic notion of correct deductive reasoning in ordinary language is not completely determinant and precise independently of the resources of a full or partial logic. (See Shapiro 1991, chaps. 1 and 2 for discussion of the interplay between theory and pre-theoretic notions and intuitions.) Nevertheless, hardcore intuitions regarding correct deductive reasoning do seem to drive the debate over the legitimacy of deductive systems such as N and over the legitimacy of the ⊥-Elim rule in particular. Anderson and Belnap (1962, p. 108) write that denying the principle of the disjunctive syllogism, regarded as a valid mode of inference since Aristotle, “… will seem hopelessly naïve to those logicians whose logical intuitions have been numbed through hearing and repeating the logicians fairy tales of the past half century, and hence stand in need of further support”. The possibility that intuitions in support of the general validity of the principle of the disjunctive syllogism have been shaped by a bad theory of inference is motive enough to consider argumentative support for the principle and to investigate deductive systems for relevance logic.

A natural deductive system for relevance logic has the means for tracking the relevance quotient of the steps used in a proof and allows the application of an introduction rule in the step from A to B “only when A is relevant to B in the sense that A is used in arriving at B” (Anderson and Belnap 1962, p. 90). Consider the following proof in system N.

1. Admires(evan, paige) Basis
2. ~Married(beth, matt) Assumption
3. Admires(evan, paige) Reit: 1
4. (~Married(beth, matt) → Admires(evan, paige)) →-Intro: 2-3

Recall that the rationale behind the →-Intro rule is that we may derive a conditional if we derive the consequent Q from the assumption of the antecedent P, and, perhaps, other sentences occurring earlier in the proof on wider proof margins. The defect of this rule, according to Anderson and Belnap is that “from” in “from the assumption of the antecedent P” is not taken seriously. They seem to have a point. By the lights of the → -Intro rule, we have derived line 4 but it is hard to see how we have derived the sentence at line 3 from the assumption at step 2 when we have simply reiterated the basis at line 3. Clearly, ‘~Married(beth, matt)’ was not used in inferring ‘Admires(evan, beth)’ at line 3. The relevance logician claims that the →-Intro rule in a correct natural deductive system should not make it possible to prove a conditional when the consequent was arrived at independently of the antecedent. A typical strategy is to use classes of numerals to mark the relevance conditions of basis sentences and assumptions and formulate the Intro and Elim rules to tell us how an application of the rule transfers the numerical subscript(s) from the sentences used to the sentence derived with the help of the rule. Label the basis sentences, if any, with distinct numerical subscripts. Let a, b, c, etc., range over classes of numerals. The →-rules for a relevance natural deductive system may be represented as follows.

→-Elim

k. (P → Q)a
l. Pb
m. Qab →-Elim: k, l

→-Intro

k. P{k} Assumption
.
.
.
m. Qb
n. (P → Q)b – {k} →-Intro: k-m, provided kb
The numerical subscript of the assumption
at line k must be new to the proof.
This is insured by using the line number
for the subscript.

In the directions for the →-Intro rule, the proviso that kb insures that the antecedent P is used in deriving the consequent Q. Anderson and Belnap require that if the line that results from the application of either rule is the conclusion of the proof the relevance markers be discharged. Here is a sample proof of the above two rules in action.

1. Admires(evan, paige)1 Assumption
2. (Admires(evan, paige) → ~Married(beth, matt))2 Assumption
3. ~Married(beth, matt)1, 2 →-Elim: 1,2
4. ((Admires(evan, paige) → ~Married(beth, matt)) → ~Married(beth, matt))1 →-Intro: 2-3
5. (Admires(evan, paige) → ((Admires(evan, paige) → ~Married(beth, matt)) → ~Married(beth, matt))) →-Intro: 1-4

For further discussion see Anderson and Belnap (1962). For a comprehensive discussion of relevance deductive systems see their (1975). For a more up-to-date review of the relevance logic literature see Dunn (1986).

ii. Intuitionistic logic

We now consider the correctness of the ~-Elim rule and consider the rule in the context of using it along with the ~-Intro rule.

~-Intro

k. P Assumption
.
.
.
m.
n. ~P ~-Intro: k-m

~-Elim

k. ~~P
m. P ~-Elim: k

Here is a typical use in classical logic of the ~-Intro and ~-Elim rules. Suppose that we derive a contradiction from the assumption that a sentence P is true. So, if P were true, then a contradiction would be true which is impossible. So P cannot be true and we may infer that not-P. Similarily, suppose that we derive a contradiction from the assumption that not-P. Since a contradiction cannot be true, not-P is not true. Then we may infer that P is true by ~-Elim.

The intuitionist logician rejects the reasoning given in bold. If a contradiction is derived from not-P we may infer that not-P is not true, that is, that not-not-P is true, but it is incorrect to infer that P is true. Why? Because the intuitionist rejects the presupposition behind the ~-Elim rule, which is that for any proposition P there are two alternatives: P and not-P. The grounds for this are the intuitionistic conceptions of truth and meaning.

According to intuitionistic logic, truth is an epistemic notion: the truth of a sentence P consists of our ability to verify it. To assert P is to have a proof of P, and to assert not-P is to have a refutation of P. This leads to an epistemic conception of the meaning of logical constants. The meaning of a logical constant is characterized in terms of its contribution to the criteria of proof for the sentences in which it occurs. Compare with classical logic: the meaning of a logical constant is semantically characterized in terms of its contribution to the determination of the truth conditions of the sentences in which it occurs. For example, the classical logician accepts a sentence of the form 'P v Q' only when she accepts that at least one of the disjuncts is true. On the other hand, the intuitionistic logician accepts ' P v Q' only when she has a method for proving P or a method for proving Q. But then the Law of Excluded Middle no longer holds, because a sentence of the form P or not-P is true, that is assertible, only when we are in a position to prove or refute P, and we lack the means for verifying or refuting all sentences. The alleged problem with the ~-Elim rule is that it illegitimately extends the grounds for asserting P on the basis of not-not-P since a refutation of not-P is not ipso facto a proof of P.

Since there are finitely many McKeons and the predicates of language M seem well defined, we can work through the domain of the McKeons to verify or refute any M-sentence and so there doesn’t seem to be an M-sentence that is neither verifiable nor refutable. However, consider a language about the natural numbers. Any sentence that results by substituting numerals for the variables in ‘x = y + z’ is decidable. This is to say that for any natural numbers x, y, and z, we have an effective procedure for determining whether or not x is the sum of y and z. Hence, for all x, y, and z either we may assert that x = y + z or we may assert the contrary. Let ‘A(x)’ abbreviate ‘if x is even and greater than 2 then there exists primes y and z such that x = y + z’. Since there are algorithms for determining of any number whether or not it is even, greater than 2, or prime, the hypothesis that the open formula ‘A(x)’ is satisfied by a given natural number is decidable for we can effectively determine for all smaller numbers whether or not they are prime. However, there is no known method for verifying or refuting Goldbach’s conjecture, for all x, A(x). Even though, for each numeral n standing for a natural number, the sentence 'A(n)' is decidable (that is, we can determine which of 'A(n)' or 'not-A(n)' is true), the sentence ‘for all x, A(x)’ is not. That is, we are not in a position to hold that either Goldbach’s conjecture is true or that it is not. Clearly, verification of the conjecture via an exhaustive search of the domain of natural numbers is not possible since the domain is non-finite. Minus a counterexample or proof of Goldbach’s conjecture, the intuitionist demurs from asserting that either Goldbach’s conjecture is true or it is not. This is just one of many examples where the intuitionist thinks that the law of excluded middle fails.

In sum, the legitimacy of the ~-Elim rule requires a realist conception of truth as verification transcendent. On this conception, sentences have truth-values independently of the possibility of a method for verifying them. Intuitionistic logic abandons this conception of truth in favor of an epistemic conception according to which the truth of a sentence turns on our ability to verify it. Hence, the inference rules of an intuitionistic natural deductive system must be coded in such a way to reflect this notion of truth. For example, consider an intuitionistic language in which a, b, … range over proofs, ‘a: P’ stands for ‘a is a proof of P’, and ‘(a, b)’ stands for some suitable pairing of the proofs a and b. The &-rules of an intuitionistic natural deductive system may look like the following:

&-Intro

k. a: P
l. b: Q
m. (a, b): (P & Q) &-Intro: k, l

&-Elim

k. (a, b): (P & Q) & nbsp; k. (a, b): (P & Q)
m. a: P &-Elim: k m. b: Q &-Elim: k

Apart from the negation rules, it is fairly straightforward to dress the Intro and Elim rules of N with a proof interpretation as is illustrated above with the &-rules. For the details see Van Dalen (1999). For further introductory discussion of the philosophical theses underlying intuitionistic logic see Read (1995) and Shapiro (2000). Tennant (1997) offers a more comprehensive discussion and defense of the philosophy of language underlying intuitionistic logic.

iii. Free Logic

We now turn to the ∃-Intro and ∀-Elim rules. Consider the following two inferences.

(1) Male(evan)


(3) ∀x Male(x)


(therefore) (2) ∃x Male(x) (therefore) (4) Male(evan)

Both are correct by the lights of our system N. Specifically, (2) is derivable from (1) by the ∃-Intro rule and we get (4) from (3) by the ∀-Elim rule. Note an implicit assumption required for the legitimacy of these inferences: every individual constant refers to an element of the quantifier domain. If this existence assumption, which is built into the semantics for M and reflected in the two quantifier rules, is rejected, then the inferences are unacceptable. What motivates rejecting the existence assumption and denying the correctness of the above inferences?

There are contexts in which singular terms are used without assuming that they refer to existing objects. For example, it is perfectly reasonable to regard the individual constants of a language used to talk about myths and fairy tales as not denoting existing objects. It seems inappropriate to infer that some actually existing individual is jolly on the basis that the sentence Santa Claus is jolly is true. Also, the logic of a language used to debate the existence of God should not presuppose that God refers to something in the world. The atheist doesn’t seem to be contradicting herself in asserting that God does not exist. Furthermore, there are contexts in science where introducing an individual constant for an allegedly existing object such as a planet or particle should not require the scientist to know that the purported object to which the term allegedly refers actually exists. A logic that allows non-denoting individual constants (terms that do not refer to existing things) while maintaining the existential import of the quantifiers (‘∀x’ and ‘∃x’ mean something like ‘for all existing individuals x’ and ‘for some existing individuals x’, respectively) is called a free logic. In order for the above two inferences to be correct by the lights of free logic, the sentence Evan exists must be added to the basis. Correspondingly, the ∃-Intro and ∀-Elim rules in a natural deductive system for free logic may be portrayed as follows. Again, let 'Ωv' be a formula in which v is the only free variable, and let n be any name.

∀-Elim ∃-Intro
k. vΩv k. Ωn
l. E!n l. E!n
m. Ωn ∀-Elim: k, l m. vΩv ∃-Intro: k, l

'E!n' abbreviates n exists and so we suppose that ‘E!’ is an item of the relevant language. The ∀-Intro and ∃-Elim rules in a free logic deductive system also make explicit the required existential presuppositions with respect to individual constants (for details see Bencivenga 1986, p. 387). Free logic seems to be a useful tool for representing and evaluating reasoning in contexts such as the above. Different types of free logic arise depending on whether we treat terms that do not denote existing individuals as denoting objects that do not actually exist or as simply not denoting at all.

In sum, there are contexts in which it is appropriate to use languages whose vocabulary and syntactic formation rules are independent of our knowledge of the actual existence of the entities the language is about. In such languages, the quantifier rules of deductive system N sanction incorrect inferences, and so at best N represents correct deductive reasoning in languages for which the existential presupposition with respect to singular terms makes sense. The proponent of system N may argue that only those expressions guaranteed a referent (e.g., demonstratives) are truly singular terms. On this view, advocated by Bertrand Russell at one time, expressions that may not have a referent such as Santa Claus, God, Evan, Bill Clinton, the child abused by Michael Jackson are not genuinely singular expressions. For example, in the sentence Evan is male, Evan abbreviates a unique description such as the son of Matt and Beth. Then Evan is male comes to

There exists a unique x such that x is a son of Matt and Beth and x is male.

From this we may correctly infer that some are male. The representation of this inference in N appeals to both the ∃-Intro and &exists;-Elim rules, as well as the &-Elim rule. However, treating most singular expressions as disguised definite descriptions at worst generates counter-intuitive truth-value assignments (Santa Claus is jolly turns out false since there is no Santa Claus) and seems at best an unnatural response to the criticism posed from the vantagepoint of free logic.

For a short discussion of the motives behind free logic and a review of the family of free logics see Read (1995, chap. 5). For a more comprehensive discussion and a survey of the relevant literature see Bencivenga (1986). Morscher and Hieke (2001) is a collection of recent essays devoted to taking stock of the past fifty years of research in free logic and outlining new directions.

6. Conclusion

This completes our discussion of the deductive-theoretic conception of logical consequence. Since, arguably, logical consequence conceived deductive-theoretically is not compact it cannot be defined in terms of deducibility in a correct deductive system. Nevertheless correct deductive systems are useful for modeling deductive reasoning and they have applications in areas such as computer science and mathematics. Is deductive system N correct? In other words: Do the Intro and Elim rules of N represent correct principles of inference? We sketched three motives for answering in the negative, each leading to a logic that differs from the classical one developed here and which requires altering Intro and Elim rules of N. It is clear from the discussion that any full coverage of the topic would have to engage philosophical issues, still a matter of debate, such as the nature of truth, meaning and inference. For a comprehensive and very readable survey of proposed revisions to classical logic (those discussed here and others) see Haack (1996). For discussion of related issues, see also the entries, “Logical Consequence, Philosophical Considerations” and “Logical Consequence, Model-Theoretic Conceptions” in this encyclopedia.

7. References and Further Reading

  • Anderson, A.R. and N. Belnap (1962): “Entailment”, pp. 76-110 in Logic and Philosophy, ed. G. Iseminger. New York: Appleton-Century-Crofts, 1968.
  • Anderson, A.R., and N. Belnap (1975): Entailment: The Logic of Relevance and Necessity. Princeton: Princeton University Press.
  • Barwise, J. and J. Etchemendy (2001): Language, Proof and Logic. Chicago: University of Chicago Press and CSLI Publications.
  • Bencivenga, E. (1986): “Free logics”, pp. 373-426 in Gabbay and Geunthner (1986).
  • Dunn, M. (1986): “Relevance Logic and Entailment”, pp. 117-224 in Gabbay and Geunthner (1986).
  • Fitch, F.B. (1952): Symbolic Logic: An Introduction. New York: The Ronald Press.
  • Gabbay, D. and F. Guenthner, eds. (1983): Handbook of Philosophical Logic, Vol 1. Dordrecht: D. Reidel.
  • Gabbay, D. and F. Guenthner, eds. (1986): Handbook of Philosophical Logic, Vol. 3. Dordrecht: D. Reidel.
  • Gentzen, G. (1934): “Investigations Into Logical Deduction”, pp. 68-128 in Collected Papers, ed. M.E. Szabo. Amsterdam: North-Holland, 1969.
  • Haack, S. (1978): Philosophy of Logics. Cambridge: Cambridge University Press.
  • Haack, S. (1996): Deviant Logic, Fuzzy Logic. Chicago: The University of Chicago Press.
  • Morscher E. and A. Hieke, eds. (2001): New Essays in Free Logic: In Honour of Karel Lambert, Dordrecht: Kluwer.
  • Pelletier, F.J. (1999): “A History of Natural Deduction and Elementary Logic Textbooks”, pp.105-138 in Logical Consequence: Rival Approaches, ed. J. Woods and B. Brown. Oxford: Hermes Science Publishing, 2001.
  • Read, S. (1995): Thinking About Logic. Oxford: Oxford University Press.
  • Shapiro, S. (1991): Foundations without Foundationalism: A Case For Second-Order Logic. Oxford: Clarendon Press.
  • Shapiro, S. (2000): Thinking About Mathematics. Oxford: Oxford University Press.
  • Sundholm, G. (1983): “Systems of Deduction”, in Gabbay and Guenthner (1983).
  • Tarski, A. (1936): “On the Concept of Logical Consequence”, pp. 409-420 in Tarski (1983).
  • Tarski, A. (1983): Logic, Semantics, Metamathematics, 2nd ed. Indianapolis: Hackett Publishing.
  • Tennant, N. (1997): The Taming of the True. Oxford: Clarendon Press.
  • Van Dalen, D. (1999): “The Intuitionistic Conception of Logic”, pp. 45-73 in Varzi (1999).
  • Varzi, A., ed. (1999): European Review of Philosophy, Vol. 4, The Nature of Logic, Stanford: CSLI Publications.

Author Information

Matthew McKeon
Email: mckeonm@msu.edu
Michigan State University
U. S. A.

Clarence Irving Lewis (1883—1964)

C. I. Lewis was a major American pragmatist. He was educated at Harvard,  taught at the University of California from 1911 to 1919 and at Harvard from 1920 until his retirement in 1953. Known as the father of modern modal logic and as a proponent of the given in epistemology, he also was an influential figure in value theory and ethics.

Lewis’s philosophy as a whole reveals a systematic unity in which logic, epistemology, value theory and ethics all take their place as forms of rational conduct in its broadest sense of self-directed agency. In his first major work, Mind and the World Order (MWO), published in 1929, Lewis put forward a position he called “conceptualistic pragmatism” according to which empirical knowledge depends upon a sensuous ‘given’, the constructive activity of a mind and a set of a priori concepts which the agent brings to, and thereby interprets, the given. These concepts are the product of the agent’s social heritage and cognitive interests, so they are not a priori in the sense of being given absolutely: they are pragmatically a priori. They admit of alternatives and the choice among them rests on pragmatic considerations pertaining to cognitive success.

His 1932 Symbolic Logic presented his system of strict implication and a set of successively stronger modal logics, the S systems. He showed that there are many alternative systems of logic, each self-evident in its own way, a fact which undermines the traditional rationalistic view of metaphysical first principles as being logically undeniable. As a result, he concluded that the choice of first principles and of deductive systems must be grounded in extra-logical or pragmatic considerations.

Table of Contents

  1. Introduction
  2. The Early Years
  3. Logical Investigations
  4. Mind and the World Order
  5. The Conversation with Positivism
  6. Analysis of Knowledge and Valuation
  7. Valuation and Rightness
  8. The Late Ethics
  9. References and Further Reading
    1. Major Works by Lewis
    2. Secondary Sources

1. Introduction

Lewis’s philosophy as a whole reveals a systematic unity in which logic, epistemology, value theory and ethics all take their place as forms of rational conduct in its broadest sense of self-directed agency. In his first major work, Mind and the World Order (MWO), published in 1929, Lewis put forward a position he called “conceptualistic pragmatism” according to which empirical knowledge depends upon a sensuous ‘given’, the constructive activity of a mind and a set of a priori concepts which the agent brings to, and thereby interprets, the given. These concepts are the product of the agent’s social heritage and cognitive interests, so they are not a priori in the sense of being given absolutely: they are pragmatically a priori. They admit of alternatives and the choice among them rests on pragmatic considerations pertaining to cognitive success.

His 1932 Symbolic Logic presented his system of strict implication and a set of successively stronger modal logics, the S systems. He showed that there are many alternative systems of logic, each self- evident in its own way, a fact which undermines the traditional rationalistic view of metaphysical first principles as being logically undeniable. As a result, he concluded that the choice of first principles and of deductive systems must be grounded in extra-logical or pragmatic considerations.

After the War his work played an important part in giving shape to academic philosophy as a profession. His 1946 Carus Lectures, An Analysis of Knowledge and Valuation (AKV) which represents a refinement of the doctrines of MWO and their extension to a theory of value, set the issues of postwar epistemology. The thoroughness of his discussion, and the technicalities of his writing were important models for postwar analytic philosophy. A student of Josiah Royce, William James and Ralph Barton Perry, a contemporary of Reichenbach, Carnap and the logical empiricists of the 30’s and 40’s, and the teacher of Quine, Frankena, Goodman, Chisholm, Firth and others, C.I. Lewis played a pivotal role in shaping the marriage between pragmatism and empiricism which has come to dominate much of current analytic philosophy.

After AKV, Lewis directed the final 20 years of his life to the foundation of ethics, giving numerous public lectures. He died in 1964 leaving a vast collection of unpublished manuscripts on ethical theory which are housed at the Stanford University Library.

2. The Early Years

Lewis was born on April 12, 1883, in relative poverty at Stoneham, Massachusetts. He enrolled in Harvard in 1902 , working part time as a tutor and a waiter, and received his B.A. degree three years later, taking an appointment to teach high school English in Quincy, Massachusetts. The following year he was appointed Instructor in English at the University of Colorado, moved to Boulder, and that winter married his high school sweetheart, Mabel Maxwell Graves. They stayed in Boulder for two years and in 1908 he enrolled in the PhD program, receiving his degree two years later in 1910, in part because financial concerns precluded a more leisurely pace. His thesis, The Place of Intuition in Knowledge prefigured important themes in his later work.

As an undergraduate, Lewis’s principal influences were James and Royce. When he returned to Harvard as a graduate student, James had retired, and the absolute idealism of Royce and Bradley was under attack by the New Realism of Moore and Russell in Great Britain and of W.P. Montague and Ralph Barton Perry at Harvard. The debate between Royce and James over monism and pluralism had been replaced by a debate between Royce and Perry over realism and idealism. Lewis studied metaphysics with Royce, and he studied Kant and epistemology with Perry. The debate between Royce and Perry framed Lewis’s dissertation and in it he attempted to forge a neo-Kantian middle road.

It is worth briefly discussing his dissertation because in many way it prefigures his later views. In his dissertation Lewis argued that the possibility of valid, justified, knowledge requires both givenness (or intuition) and the mind’s legislative or constructive activity. Lewis used the egocentric predicament in a dialectical argument against both the realist and idealist solutions to the problem of knowledge. Against Perry’s direct realism, he argued that what is known transcends what is present to the mind in the act of knowledge and that the real object is thus never given in consciousness; since knowledge requires that what is given to the mind be interpreted by our purposeful activity the real object of knowledge is made instead of given.

Against Royce, Lewis asserted the necessity of a given sensuous element that is neither a product of willing nor necessarily implicit in the cognitive aim of ideas. The mind’s activity is not constitutive of the known object because it does not make the given. Its purpose is rather to understand, or interpret, the given by referring it to an object which is real in some category or another. To be real is a matter of classification and only future experience will confirm or disconfirm the correctness of our classification, but some classification of the given will necessarily be correct. Whatever is unreal is so only relative to a certain way of understanding it Relative to some other purpose of understanding it will be real; the contents of a dream, for example are unreal only relative to a misclassification of them as a veridical perception. All knowledge contains a given element which shapes possible interpretation but the object known also transcends present experience.

It is remarkable how many themes in his mature work are already mobilized in his dissertation. Lewis’s solution to the problem of knowledge had both realist and idealist elements in an unstable equilibrium and his position would change several times over the next few years. Under the influence of Royce and Hume’s skepticism, Lewis came to believe that no realist answer to the problem of knowledge could work, and only an idealist solution would suffice. “How could the given be intelligible to the mind if it were independent of its interpretive activity?” This is a question which Lewis would not solve to his satisfaction until much later when he read Peirce. There is no doubt, however, that Lewis saw that a realist of Perry’s sort had no answer to it. At this point Lewis clearly had neither proof nor account of the relation of knowledge to independent reality. The synthesis of his dissertation had raised deep problems which were only to be answered by the mature system in MWO . “How can the given be intelligible if it is independent of the mind?” “If the mind does not shape or condition what is given to it how could valid knowledge be possible?” It seemed clear to Lewis that if justified knowledge were possible at all, then realism must be wrong. But idealism, as Lewis understood it, appealed to a necessary agreement between human will and the absolute in knowledge which was also unjustifiable.

3. Logical Investigations

Lewis received his PhD in 1910 but there were no jobs. This was a bitter disappointment for Lewis, who with a wife and small child, had hoped the financial difficulties of the past two years would be over. After a summer at his uncle’s farm the Lewises returned to Cambridge where Lewis spent the year tutoring and serving as an assistant in Royce’s logic class. Royce was one of America’s premier logicians during the time that Lewis was studying at Harvard and he introduced Lewis to Volume 1 of Russell and Whitehead’s Principia Mathematica which had just been published.

In the fall of 1911, Lewis went to the University of California at Berkeley as an instructor where, except for a stint in the army during World War I, he was to stay until his return to Harvard in 1920. During this period, Lewis worked primarily on epistemology and logic and, finding no logic texts available, was soon at work on a text on symbolic logic. This work would appear at the end of the war in 1918 as A Survey of Symbolic Logic the first history of the subject in English — and would form the basis of his better known Symbolic Logic , written together with C. H. Langford and published in 1932. Lewis’s work on logic was dictated in part by the need for a good text book and in part by objections to the paradoxes of material implication in Principia Mathematica and his desire to develop an account of inference more reflective of human reasoning. However, Lewis was still exercised by the problem of knowledge from his dissertation and was increasingly unhappy with the quasi-idealist solution he had explored there. In fact, Lewis’s study of logic during this period was at least in part directed towards examining important idealist assumptions about logic, which he would come to reject.

To solve the problem of knowledge the idealist needed logical truth to be absolute, for if the categorial form of our constructive will could vary then we would have no reason to take our interpretations to be true of the world. Lewis would attack the idealist assumptions in four related ways. First, he would argue that the coherence of a system of propositions depends upon the consistency of the propositions with each other and not on their dependence upon a set of absolute or self-evident truths. Secondly, he argued that a system rich enough to capture the notion of a world, or system of facts, is necessarily pluralistic in the sense that it must contain elements which are logically independent of each other. Thirdly, he argued that the existence of alternative deductive systems completely undermines the rationalistic view that metaphysical first principles can be shown to be logically necessary through the argument of ‘reaffirmation through denial’ (where in the attempt to deny a logical principle we necessarily presuppose its truth). Finally, he concluded that given the existence of alternative systems of logic, the choice of first principles and of deductive systems must be grounded in extra-logical, pragmatic considerations.

Lewis’s work in logic was also guided in part by concerns about Russell’s choice of material implication as a paradigm of logical deduction. Lewis constructed his own logical calculus based on relations in intention and strict implication, which he saw as a more adequate model of actual inference. Material implication has the property that a false proposition implies everything and so argued Lewis it is useless as a model of real inference. What we want to know is what would follow from a proposition if it were true and for Lewis this amounts to saying that the real basis of the inference is the strict implication where ‘A strictly implies B’ means that ‘The truth of A is inconsistent with the falsity of B.’ Lewis saw his account of strict implication to have important consequences for metaphysics and for the normative in general. He argued that the line dividing propositions corroborated or refuted by logic alone (necessary or logically impossible propositions) from the class of empirical truths or falsehood was of first importance of the theory of knowledge. The categories of possible and impossible, contingent and necessary, consistent and inconsistent are all independent of material truth and are founded on logic itself.

In 1920 Lewis was invited to return to Harvard to take up a one year position as Lecturer in Philosophy and was to remain for over 30 years until his retirement in 1953. There Lewis was reintroduced to Peirce and the last piece of his account of knowledge would fall into place, THE PRAGMATIC a priori.

After Peirce’s death Royce had arranged for the Peirce manuscripts to be brought to Harvard, and at the time of Lewis’s appointment the department was concerned that the manuscript remains, consisting of thousands of pages of apparently unorganized material, be catalogued. Lewis was given the job and although the task of arranging and cataloguing the papers ultimately passed to others, the two years he spent on that task gave Lewis the final building blocks for his mature epistemological position which he would call conceptualistic pragmatism. Lewis would find in Peirce’s “conceptual pragmatism,” with its emphasis upon the instrumental and empirical significance of concepts rather than upon any non-absolute character of truth, a resonance with his logical investigations.

Lewis in effect would turn the idealist thesis that mind determined the structure of reality on its head without giving up the idealist view of the legislative power of the mind. The mind interprets the given by way of concepts: the real, ultimately, becomes a matter of criterial commitment. The mind does not thereby manufacture what is given to it, but meets the independent given with interpretive structures which it brings to the encounter. In his dissertation Lewis had argued that the possibility of valid, justified, knowledge requires both givenness and the mind’s legislative or constructive activity. The epistemological view Lewis would now develop retained this basic structure but embedded it in a richer, psycho-biological model of inquiry and a more adequate account of the role of a priori concepts in knowledge. In the early 20’s Lewis would publish two seminal articles, “A Pragmatic Conception of The a priori,” and “The Pragmatic Element in Knowledge.” These two papers laid out the core of Lewis’s pragmatic theory of knowledge, which would be developed more richly in Mind and the World Order (MWO).

In “A Pragmatic Conception of the a priori,” Lewis rejected traditional concepts of the a priori arguing that, “The thought which both rationalism and empiricism have missed is that there are principles, representing the initiative of mind, which impose upon experience no limitations whatever, but that such conceptions are still subject to alternation on pragmatic grounds when the expanding boundaries of experience reveal their felicity as intellectual instruments.” What is important about an hypothesis is that it is a “concept” — a purely logical meaning — which can be brought to bear on experience. The concepts we formulate are in part determined by our pragmatic interests and in part by the nature of experience. Fundamental scientific laws are a priori because they order experience so that it can be investigated. The same is true of our more fundamental categorial notions. The given contains both the real and illusion, dream and fantasy. Our categorial concepts allow us to sort experience so that it can be interrogated. Thus the fact that we must fix our meanings before we can apply them productively in experience, is entirely compatible with their historical alteration or even abandonment.

In “The Pragmatic Element in Knowledge”, Lewis extended his pragmatism about the a priori to the theory of knowledge. Here, following Peirce and Royce, he identifies three elements in knowledge which are separable only by analysis: the element of experience which is given to an agent, the structure of concepts with which the agent interprets what is given, and the agent’s act of interpreting what is given by means of those concepts. The distinctively pragmatic character of this theory lies both in the fact that knowledge is activity or interpretation and that the concepts with which the mind interrogates experience reflect fallible and revisable commitments to future experiential consequences. Knowledge is an interpretation of the experiential significance for an agent with certain interests of what is given in experience; a significance testable by its consequences for action.

A priori truth is independent of experience because it is purely analytic of our concepts and can dictate nothing to the given. The formal sciences depend on nothing which is empirically given, depending purely on logical analysis for their content. So a priori truth is not assertive of fact but is instead definitive. There is logical order arising from our definitions in all knowledge. Ordinarily we do not separate out this logical order, but it is always possible to do so, and it is this element which minds must have in common if they are to understand each other. As Lewis puts it, “At the end of an hour which feels very long to you and short to me, we can meet by agreement, because our common understanding of that hour is not a feeling of tedium or vivacity, but means sixty minutes, one round of the clock…”. In short, shared concepts do not depend upon the identity of sense feeling, but in their objective significance for action.

The concept, the purely logical pattern of meaning, is an abstraction from the richness of actual experience. It represents what the mind brings to experience in the act of interpretation. The other element, that which the mind finds , or what is independent of thought, is the given. The given is also an abstraction, but it cannot be expressed in language because language implies concepts and because the given is that aspect of experience which concepts do not convey. Knowledge is the significance which experience has for possible action and the further experience to which such action would lead.

4. Mind and the World Order

Lewis first major book, Mind and the World Order (MWO) develops these results in three principal theses: first, a priori truth is definitive and offers criteria by means of which experience can be discriminated; second, the application of concepts to any particular experience is hypothetical and the choice of conceptual system meets pragmatic needs; and third, the susceptibility of experience to conceptual interpretation requires no particular metaphysical assumption about the conformity of experience to the mind or its categories. These principles allow Lewis to present the traditional problem of knowledge as resting on a mistake. There is no contradiction between the relativity of knowledge to the knowing mind and the independence of its object. The assumption that there is, is the product of Cartesian representationalism, the ‘copy theory’ of thought, in which knowledge of an object is taken to be qualitative coincidence between the idea in the mind and the external real object. For Lewis knowledge does not copy anything but concerns the relation between this experience and other possible experiences of which this experience is a sign. Knowledge is expressible not because we share the same data of sense but because we share concepts and categorial commitments.

All knowledge is conceptual; the given, having no conceptual structure of its own, is not even a possible object of knowledge. Foundationalism of the classical empiricist sort is thus directly precluded. Lewis’s task for MWO is in effect a pragmatic solution to Hume’s problem of induction: an account of the order we bring to experience which renders knowledge possible but makes no appeal to anything lying outside of experience. Prefiguring contemporary externalist accounts of representation, Lewis argues that both representative realism and phenomenalism are incoherent. Knowledge as correct interpretation is independent of whether the phenomenal character of experience is a “likeness” of the real object known, because the phenomenal character of experience only receives its function as a sign from its conceptual interpretation, that is, from its significance for future experience and action. The question of the validity of knowledge claims is thus for Lewis fundamentally the question of the normative significance of our empirical assessments for action.

Lewis argued that our spontaneous interpretation of experience by way of concepts that have objective significance for future experience constitutes a kind of diagnosis of appearance . If we could not recognize a sensuous content in our classification of it with qualitatively similar ones which have acquired predictive significance in the past, interpretation would be impossible. Despite the fact that such recognition is spontaneous and unconsidered it has the logical character of a generalization. To recognize an object — “this is a round penny” — is to make a fallible empirical claim, but to recognize the appearance is to classify it with other qualitatively similar appearances. The basis of the empirical judgment lies in the fact that past instances of such classification have been successful. Our empirical knowledge claims are dependent for their justification upon this body of conceptual interpretations in two ways. First, the world, in the form of future events implicitly predicted (or not) by our empirical judgments, will confirm or disconfirm those judgments: all empirical knowledge is thus merely probable. But secondly, the classification of immediate apprehensions by way of concepts justifying particular empirical judgments is itself generalization even when those concepts have come to function as a criterion of sense meaning. Concepts become criteria of classification because they allow us to make empirically valid judgments, and because they fit usefully in the larger structure of our concepts.

This structure, looked at apart from experience is an a priori system of concepts. The application of one of its constituent concepts to any particular is a matter of probability but the question of applying the system in general is a matter of the choice of an abstract system and can only be determined by pragmatic considerations. The implications of a concept within a system become criteria of its applicability in that system. If later experience does not accord with the logical implications of our application of a concept to a particular, we will withdraw the application of the concept. Persistent failure of individual concepts to apply fruitfully to experience will lead us to readjust the system as a whole. Our conceptual interpretations form a hierarchy in which some are more fundamental than others; abandoning them will have more radical consequences than abandoning others. Lewis’s account of inquiry offers both a non-metaphysical account of induction and an early version of the so called ‘theory-ladenness of observation terms’. There is no need for synthetic a priori or metaphysical truths to bridge the gap between abstract concepts in the mind and the reality presented in experience. Lewis offers a kind of ‘Kantian deduction of the categories’ providing a pragmatic vindication of induction but without Kant’s assumption that experience is limited by the modes of intuition and fixed forms of thought. Without a system of conceptual interpretation, no experience is possible, but which system of interpretation we use is a matter of choice and what we experience is given to us by reality. The importance of the given in this story is its independence . Our conceptual system can at best specify a system of possible worlds; within it the actual is not to be deduced but acknowledged. In short, Lewis’s theory of knowledge in MWO is a pragmatic theory of inquiry which combines rationalist and naturalistic elements to make knowledge of the real both fallible and progressive without recourse to transcendental guarantees.

5. The Conversation with Positivism

MWO was published in 1929 during a time of tragedy for Lewis and his family. MWO was very well received and Lewis’s career was now secure; he was elected to the American Academy of Arts and Sciences in May of 1929 and made a full professor at Harvard in 1930. But his daughter died that year after two years of a mysterious ailment and a few years later Lewis suffered a heart attack due to overwork. Despite life’s trials, the period between MWO and AKV was a period of intellectual expansion for Lewis. Lewis began to explore the consequences of his views for value theory and ethics. At the same time his logical interests shifted. While technical issues continued to occupy his attention for the next few years, largely in the form of replies to responses to his work in Symbolic Logic , his thinking shifted decisively away from technicalities and towards the experiential structure of meaning and its relation to value and knowledge. There were several reasons for this.

The period was a time of decisive change in philosophy in America generally. The influx of British and German philosophy into the United States during the thirties and the increasing professionalization of the universities, posed deep and ambiguous problems for American philosophers with a naturalistic or pragmatic orientation, and for Lewis in particular. Logical empiricism, with its emphasis on scientific models of knowledge and on the logical analysis of meaning claims was emerging as the most pervasive tendency in American philosophy in the thirties and forties, and Lewis was strongly identified with that movement. But Lewis was never completely comfortable in this company. For Lewis, experience was always at the center of the cognitive enterprise. The rapid abandonment of experiential analysis in favor of physicalism by the major positivists and their rejection of value as lacking cognitive significance both struck him as particularly unfortunate. Indeed his own deepening conversation with the pragmatic tradition led him in the opposite direction. It is only within experience that anything could have significance for anything, and Lewis came to see that rather than lacking cognitive significance, value is one way of representing the significance which knowledge has for future conduct. Attempting to work out these convictions led him to reflect on the differences between pragmatism and positivism, and to begin to investigate the cognitive structure of value experiences.

The pragmatist, Lewis holds, is committed to the Peircean pragmatic test of significance. But, as he notes in his 1930 essay, “Pragmatism and Current Thought,” this dictum can be taken in either of two directions. On the one hand, its emphasis on experience could be developed in a psychologistic direction and promote a form of subjectivism. On the other, the fact that the Peircean test limits meaning to that which makes a verifiable difference in experience takes it in the direction which he developed in MWO, to a view of concepts as abstractions in which “the immediate is precisely that element which must be left out.” But this claim must be correctly understood. An operational account of concepts empties them only of what is ineffable in experience. “If your hours are felt as twice as long as mine, your pounds twice as heavy, that makes no difference, which can be tested, in our assignment of physical properties to things.” A concept is thus merely a relational pattern. But it does not follow from this that the world as it is experienced is thrown out the window. “In one sense that of connotation a concept strictly comprises nothing but an abstract configuration of relations. In another sense its denotation or empirical application this meaning is vested in a process which characteristically begins with something given and ends with something done in the operation which translates a presented datum into an instrument of prediction and control.” Knowledge is a matter of two moments, beginning and ending in experience although it does not end in the same experience in which it begins. Knowledge of something requires that the experience which is anticipated or envisaged as verifying it is actually met with. Thus, the appeal to an operational definition or test of verifiability as the empirical meaning of a statement is, for the pragmatist, the requirement that the speaker know how to apply or refuse to apply the statement in question and to trace its consequences in the case of presented or imagined situations.

In his 1933 presidential address to the American Philosophical Association, “Experience and Meaning”, Lewis dealt with the question of verifiable significance in a very general way emphasizing both the points of agreement and difference between pragmatism and logical positivism. Lewis framed the discussion of meaning in terms of the distinction between immediacy and transcendence, sketching arguments against both phenomenalism and representational realism. What remains, the third way, is a view of meaning common to absolute idealism, logical positivism and pragmatism. Meaning is a relation of verifiability or signification between present and possible future experience.

In “Logical Positivism and Pragmatism”, Lewis compared his pragmatic conception of empirical meaning with the verificationism of logical positivism in a sharply critical way. Both movements, he argued, are forms of empiricism and hold conceptions of empirical meaning as verifiable ultimately by reference to empirical eventualities. The pragmatic conception of meaning looks superficially very much like the logical- positivist theory of verification despite its different formulation and its focus on action. But, argues Lewis, there is a deep difference. Whereas the pragmatic account rests meaning ultimately upon conceivable experience, the positivist account logicizes the relation. Lewis’s complaint is that this results in a conception of meaning which omits precisely what a pragmatist would count as the empirical meaning. Specifying which observation sentences are consequences of a given sentence helps us know the empirical meaning of a sentences only if the observation sentences themselves have an already understood empirical meaning in terms of the specific qualities of experience to which the observations predicates of the statement apply. Thus for Lewis the logical positivist fails to distinguish between linguistic meaning, which concerns logical relations with other terms, and empirical meaning, which concerns the relation expressions have to what may be given in experience, and as a result, leaves out precisely the thing which actually confirms a statement, namely the content of experience.

The emphasis on the experience of the knower points to a yet larger contrast between positivism and pragmatism regarding the difference between judgments of value and judgments of fact. Lewis was entirely opposed to the positivist conception of value statements as devoid of cognitive content, as merely expressive. For the pragmatist all judgments are, implicitly, judgments of value. Lewis would develop both the conception of sense meaning and the thesis that valuation is a form of empirical cognition in AKV .

6. Analysis of Knowledge and Valuation

In 1946 The Analysis of Knowledge and Valuation (AKV) was published, and Lewis was awarded the Edgar Pierce Professorship at Harvard, the chair which had been held by Perry and would be held by Quine after Lewis. AKV was the most widely discussed book of its day.

The pragmatic psycho-biological model of inquiry which Lewis adopted from Peirce and James is even more visibly a part of AKV than it was in MWO. Knowledge, action and evaluation are essentially connected animal adaptive responses. Cognition, as a vital function, is a response to the significance which items in an organism’s experiential environment have for that organism. Any psychological attitude which carries cognitive significance as a response will exhibit some value character of utility or disutility which can ground the correctness or incorrectness of that response as knowledge. Cognitively guided behavior is a kind of adaptive response, and the correctness of behavior guiding experience, to the extent that it carries cognitive significance, depends simply on whether the expectations lodged in it come about as the result of action. Meaning, in this sense is anticipation of further experience associated with present content and the truth of it concerns the verifiability of expected consequences of action. It is because of this that sense-apprehension is basic and underlies other forms of empirical cognition. Perceptual cognition involves a sign-function connecting present experience and possible future eventualities grounded in some mode of action which, pervading the experience in its immediacy, gives it its cognitive content.

The signifying character of the expectancies lodged in immediate experience is enormously expanded by the web of concepts we inherit as language users. Lewis did not, however, identify meaning with linguistic signs. Linguistic signs are secondary to something more basic in our experience which we share with animals generally and which occurs when something within our experience stands for something else as a sign. When the cat comes running because she hears you opening a can and takes it as a sign of dinner, she is responding to the meaning of her experience. While this meaning is independent of whether or not you are opening a can of cat food her expectation will be confirmed if the can contains cat food and disconfirmed if it doesn’t.

Meaning in this sense of empirical significance could only be available to a creature who can act in anticipation of events to be realized or avoided. Accordingly, the possible is epistemologically prior to the actual. Only an agent, for whom experience could have anticipatory significance, could have a concept of objective reality as that which is possible to verify or change. In addition to meaning as empirical significance Lewis distinguished the kind of meaning involved in the apprehension of our concepts. A definition represents a mode of classification, and although alternative modes of classification can be more or less useful, classification cannot be determined by that which is to be classified. Knowledge of meanings in this sense is analytic.

In AKV, Lewis distinguishes between four modes of meaning: (1) the denotation or extension of a term is the class of actual things to which the terms applies; (2) the comprehension of a term is the class of all possible things to which the term would correctly apply; (3) the signification of a term is the character of things the presence or absence of which determines the comprehension of the term; (4) the intension of a term is the conjunction of all the other terms which must be correctly applicable to anything to which the term correctly applies. A proposition is a term capable of signifying a state of affairs; it comprehends any possible world which would contain the state of affairs it signifies. The intension of a proposition includes whatever the proposition entails and thus comprises whatever must be true of any possible world for that proposition to be true of it.

Intentional and denotational modes of meaning are two aspects of cognitive apprehension in general, the denotational being that aspect of apprehension which, given our classifications, is dependent upon how experience turns out, and the intentional being that aspect of apprehension which reflects the classifications or definitions we have made and is thus independent of experience. Our choice of classification is essentially pragmatic, however, so what may count as an empirical matter in one context may count legislatively in another, generalizations may be corrected by future experience and our definitions replaced on the grounds of inadequacy. The analytic element in knowledge is indispensable because unless our intensions are fixed our terms have no denotation, but nothing determines how we shall fix our intensions save the superior utility of one set of terms over others.

While intensional meaning is primary for him, Lewis distinguishes between two different ways in which we can think of it. First, linguistic meaning is intension as constituted by the pattern of definitions of our terms. Secondly, sense meaning is intension as the criterion in terms of sense by which the application of terms to experience is determined. Sense meaning is more fundamental. Learning involves the extension of generalizations to unobserved cases and correlatively recognizing in new experiences the correct applicability of our terms. The sense meaning of a term is our criterion for applying the term correctly. In a thought experiment anticipating Searle’s “Chinese Room,” Lewis imagines a person who somehow learns Arabic using only an Arabic dictionary thus learning all the linguistic patterns in the language. This person would grasp the linguistic meanings of all the terms in Arabic but might nonetheless not know the meaning of any of the terms in the sense of knowing their application to the world. The language would remain a meaningless and arbitrary system of syntactic relationships. Linguistic meaning is nonetheless central in communication because what can be shared is conceptual structure. Understanding between two minds depends not on postulated identity of imagery or sensation but on shared definitions and concepts.

The validation of empirical knowledge has two dimensions, its verification and its justification. Verification is predictive and formulates our expectations for verification or falsification. Justification looks to the rational credibility of those expectations prior to their verification. In the acquisition of knowledge these dimensions support each other. The warrant which our present beliefs have is shaped by the history of past verifications of similar beliefs. Reflection on the warranted expectancies in our present beliefs leads us to formulate new generalizations and normative principles we can subject to tests. The common stock of concepts in our language embeds such principles and empirical generalizations in the intensions of terms. As a result our use of terms decisively shapes what is warranted and verifiable for us.

Lewis distinguishes between three classes of empirical statements. First, there are what he calls expressive statements which attempt to express what is presently given in experience. An ordinary perceptual judgment, say seeing my cat by the fridge, outstrips what is presently evident. This added content is carried by the intensions of the concepts in the judgment insofar as they convey the expectancies found in the experience. These expectancies, although partly a function of past learning and knowledge of the intension of terms, are simply given in the experience, they are the part we do not invent and cannot change but merely find. Lewis suggests that we can use language expressively to capture this presentational content by stripping our meaning of its ordinary implication of objective content. Secondly, there are statements which formulate predictions. The judgment that if I do action A the outcome will include E, where E indicates an aspect of experience expressively characterized, is one which can be completely verified by putting it to the test. Upon acting the content E will either be given or it will not. Lewis calls empirical judgments of this sort terminating judgments. Finally, there are judgments which assert the actuality of some state of affairs. Although they can be rendered increasingly probable by tests, no set of eventualities envisioned can exhaust their significance. Lewis calls these judgments non-terminating because there are indefinitely many further tests which could, theoretically speaking, falsify the prediction and any actual verification can be no more than partial.

The ground of empirical judgments is past experience of like cases. At bottom those experiences have a warrant-producing character for a particular response because of the directly apprehended qualitative character of the signal combined with the expectations due to similar experiences in the past. In short, an empirical judgment is justified by its relation to past experiences of like cases. The warrant producing character of those experiences for a particular judgment depends upon the recognition of the presentation as classifiable with other qualitatively similar appearances as significant of future experience, and the character of the passages of experience attending past instances of the judgment. Epistemic warrant at its bottom level is the animal’s recognition of future objectivity lodged in present experience; present experience is a sign of experience to come. A multi-storied interpretive structure of concepts is built upon this adaptive responsiveness. Concepts become criteria of classification because they allow us to make empirically valid judgments, and because they fit usefully in the larger structure of our concepts. The structure, viewed apart from experience, is an a priori system of concepts, but looked at in terms of experience it is a network of sense meanings. The concept of probability plays a more prominent role in AKV than it does in MWO, but it is not a role of a different kind.

Perceptual knowledge has two aspects: the givenness of the experience and the objective interpretation which, in light of past experience, we put on it. But these are both abstractions and only distinguishable by analysis. What is given in experience as spontaneously arising expectancies is already conceptually structured, to recognize the given is to classify it with qualitatively similar cases and that recognition, although spontaneous, has the logical character as a generalization. The system of concepts within which our judgments are formulated and the pyramidal structure of empirical beliefs which intend a set of possible worlds of which ours is but one, by themselves suggest a coherence theory of justification. But here, as in MWO, Lewis resists this idealist alternative. Lewis takes the given to be essential for a series of interrelated reasons. Mere coherence of a system of statements does not even give meaning; the student of Arabic mentioned earlier does not know what any of the terms mean and cannot even use a statement to express a judgment. The given thus plays the role of fixing what beliefs mean because it lodges the actual world among the various possible worlds which are compatible with my knowledge: whichever world I am in it is this one. A merely hypothetical system of congruent and consistent statements could be fabricated out of whole cloth, as a novelist does, but however richly developed, the congruence and coherence of the system would be no evidence of fact at all. Independently given facts are indispensable and they are the actually given expectancies whose objective intent we then can evaluate for their mutual congruence and coherence.

Lewis’s emphasis on the given has been taken by many contemporary philosophers to be an instance of classical foundationalism. As we saw in the discussion of MWO Lewis considered the very idea of sense data to be incoherent. There is, however, a debate about whether his views changed between that book and AKV. Christopher Gowans (in “Two Concepts of the Given in C.I. Lewis, Realism and Foundationalism”) has argued that Lewis had two different conceptions of the given but failed to recognize the difference between them. On this view, while Lewis was an anti-foundationalist in MWO he embraced foundationalism in AKV and his later thinking. Determining Lewis’s position is, of course, a matter of interpretation. I think that a non-foundationalist position is dictated by the larger structure of his thought. He was certainly not a foundationalist in the British empiricist sense of the word.

7. Valuation and Rightness

Lewis rejected the “scandal” of emotivism and noncognitivism and directed much of his late thinking to two tasks: demonstrating that valuation is a species of empirical knowledge and establishing that there are valid nonrepudiable imperatives or principles of rightness. Lewis’s acceptance of the psycho-biological model of inquiry and it’s emphasis on the evolutionary and biological ground of cognition in animal adaptive response, committed him to the ineliminability of value in knowledge. Inquiry directed towards epistemic goals is, he argued, no less a species of conduct than practical and moral inquiry. Conduct of any sort will be directed towards ends appropriate to it and in light of which both its success can be measured and its aim be critiqued as reasonable or unreasonable. Lewis argued that evaluations are a form of empirical knowledge no different fundamentally from other forms of empirical knowledge regarding the determination of their truth or falsity, or of their validity or justification.

Much of Lewis’s discussion takes the form of an analysis of the concepts surrounding rational agency. Purposeful activity intrinsically involves rational cognitive appraisal. Action is behavior which is deliberate in the sense of being subject to critique and alterable upon reflection. It is behavior for the sake of realizing something to which a positive value is ascribed. He characterizes an action as sensible just in case the result or its intent, is ascribed comparative value. The purpose of an act, by which he means that part of the intent of an act for the sake of which it is adopted, can also be said to be sensible because what is purposed is something to which comparative value is ascribed. An act is successful in the circumstance that it is adopted for a sensible purpose which is realized in the result.

The verification of success will depend upon the purpose for which the act is done. The success of an action aimed at an enjoyable experience can be decisively verified if that experience is attained, but typically the purpose of an act will be to bring about a state of affairs whose value-consequences extend into the future and will thus be affected by other states of affairs, and so the success of the act may never be fully verified. In addition, an act may fail of its purpose in two ways: the expected result may fail to follow or it may be realized but fail to have the value ascribed to it.

Just as there are two aspects to the validation of empirical belief, verification and justification, Lewis distinguishes the success (or verification) of an action from its practical justification, which is the character belonging to a belief just in case its intent is an expectation which is a warranted empirical belief. Given these distinctions, Lewis argues that unless values were truth-apt in the sense of being genuine empirical cognitions capable of confirmation or disconfirmation, no intention or purpose could be serious and hence no action could be justified or attain success. The enterprise of human life can only prosper, he says, if there are value judgments which are true. Those who deny it fall into a kind of practical contradiction similar to that of Epimenides the Cretan who said that all Cretans are liars. Making a judgment, framing an argument, and deciding to take an action, are all activities which involve bringing to bear cognitive criteria of classification, inference and cogency on the matter at hand. Thinking is an activity which presupposes selective and intelligent choice concerning the path of thought. Repudiation of the rational imperativeness of so selectively choosing is thus nothing less than a repudiation of the cognitive aim of thinking. All the different forms of imperatives, the epistemic and logical imperatives, the technical, prudential and moral imperatives, are of a piece: they are principles of right intellectual conduct, in short, principles of intelligent practice. The notions of correctness, conduct, objectivity and reality are all forged within the system of communal practices which give these concepts ground. Our conceptual framework is not merely a set of common concepts but also a set of communal norms regulating our conduct. We can reject these norms only by repudiating our conceptual framework, but there is no other ground of rational choice which could provide a warrant for an act of repudiation, so that the act of repudiating norms tacitly presupposes the warrant which norms provide. The skeptic’s own claims constitute a reductio ad absurdum against his position.

As we saw, Lewis distinguished between three classes of empirical statement, expressive, terminating and non-terminating statements. Since valuation is a species of empirical knowledge Lewis distinguishes between three kinds of value-predications. First, there are expressive statements of found value quality as directly experienced. Such predications require no verification as they make no claim which could be subjected to test. Secondly, there are terminating evaluations which predict the success of an action aimed at some value experience as result. These can be put to test by so acting and thus are directly verifiable. Finally there are non-terminating evaluations which ascribe an objective value property to an object or state of affairs. Like any other judgment of objective empirical fact such claims are always fallible though some may attain practical certainty.

Since the aim of sensible action is the realization of some positive value in experience, only what is immediately valuable can be valuable for its own sake or intrinsically valuable. Extrinsic values divide into values which are instrumental for some thing else and values found to be inherent in objects, situations or states of affairs. Value, Lewis argues, is not a kind of quality but a dimension-like orientational mode pervading all experience. To live and to act is necessarily to be subject to imperatives, to recognize the validity of norms. The good which we seek in action is not this or that presently given value experience but a life which is good on the whole. That is something which cannot be immediately disclosed in present experience but can only be comprehended by some imaginative or synthetic envisagement of its on- the-whole quality. We are subject to imperatives because future possibilities are present in our experience only as signs of the significance which that experience has for the future if we decide to act one way rather than another. Since we are free to act or not we must move ourselves in accordance with the directive import of our experience to realize future goods. Life is not an aggregate of separate moments but a synthetic whole in which no single experience momentarily given says the last word about itself: each moment has its own fixed and unalterable character but the significance of that character for the whole, like the significance of a note within a piece of music, depends upon the character of other experiences to which it stands in relation. The value assessment of experiential wholes can never be directly certain nor decisively verified in any experience because what is to be assessed is a whole of experiences as it is experienced, and there is no moment in which this experiential whole is present. The value of experiential wholes thus essentially involves memory and narrative interpretation.

8. The Late Ethics

A discussion of Lewis’s philosophy would not be complete without a discussion of his late work in ethics. Lewis’s ethics, toward which the whole of his mature philosophical work aimed, is a richly developed foundation for a common sense reflective morality, broadly within the American pragmatic naturalistic tradition. No one can cogently repudiate the ethical task and it is not the special mission of any discipline. At the center of Lewis’s theory of practical reason is the rational imperative. While a naturalist with respect to values, he held practical thinking in all its forms to rest for its cogency on categorically valid principles of right. Ethics, epistemology and logic are all inquiries into species of right conduct. They are kinds of thinking, subject to our deliberate self-government and thus to normative critique, and as a consequence they are all forms of practical reason.

Under the influence of Kant, he held that imperatives are rational constraints put on our thinking by our nature as rational beings. He offered several arguments including a pragmatic ‘Kantian deduction’ of the principles of practice, arguing that without universally valid principles of practice, our experience of ourselves as agents would be impossible. He also offered a reductio ad absurdum against the skeptic. The denial of moral imperatives is pragmatically incoherent because it in effect attempts to mount a valid argument to the conclusion that there is no such thing as validity in argument; the skeptic’s attempt to deny the universal validity of such imperatives involves him in what Lewis called a pragmatic contradiction and leads by a reductio ad absurdum to the confirmation of their validity. By implicitly asking us to weigh and consider his reasons, the skeptic appeals to reasons and argument as things which should constrain us in our beliefs and decisions, whether we like it or not and thus acknowledges their force in his practice. Imperatives are not arbitrary commands or recommendations to the self; they are directly and cognitively present in the agent’s experience.

Rational imperatives must underlie all forms of rational self-regulation, of which ethics proper is only one department. Arguing, concluding, believing are also forms of self-governed conduct and it is to these forms that his argument first turns. Experience itself is for Lewis dynamically shaped by our classifications and judgments; as a temporal process its present moments are pervaded by implicit judgments, expectations and valuations, grounded in past expectations and confirmations. Permeated with value and active assessment, experience is a weave of givenness and conduct, of doing and suffering. Value qualities are verifiably found in experience; objective valuations are both fallible and corrigible. They are judgments which reflect the justified expectation of good (or unfavorable) consequences on the assumption of actions envisaged. Accordingly, the evaluative ought the rational imperative is at the heart of human experience. At the beginning his 1954 Woodbridge Lectures, as The Ground and Nature of the Right , he argues “To say that a thing is right is simply to characterize it as representing the desiderated commitment of choice in any situation calling for deliberate decision. What is right is thus the question of all questions; and the distinction of right and wrong extends to every topic or reflection and to all that human self-determination of act or attitude may affect.”

Despite the critical priority of the right it is in the service of the good; and Lewis’s account of both reflects a single commitment to the pragmatic structure of inquiry. Ethics grows out of the fact that human beings are active creatures who enter into the process of reality in order to change it. We are also social creatures whose experience and needs are taken up thematically in the categories and organized practices which make up our social inheritance. For Lewis both what is judged justifiably to be good and what ways of achieving it are validly imperative are fallibly grounded in human experience; skepticism about either the right or the good is ultimately a failure to acknowledge that fact. Since we are endowed with the capacity to do by choosing we are obligated to exercise it. We must decide even if we choose to do nothing, and the world will be different depending on how we decide

To say that human beings are self-conscious and self-governing creatures means, for Lewis, that they perceive their environment in terms of predictively hypothetical imperatives between which they are able to choose. Beliefs and imperatives are thus only modally distinct; they contain the same information. What Lewis calls the “Law of Objectivity” is governing oneself by the advice of cognition, in contravention if necessary to our impulses and inclination. Directives of doing, determined by the good or bad results of conforming to them, fall into various modes, principally the technical, the prudential and the moral and the logical. The imperative force of technical rules presumes as antecedently determined some class of ends; they justify actions only on the assumption of the justification of those ends. The rules of technique are thus hypothetical imperatives. By contrast, the rules of the critique of consistence and cogency, of prudence and of the moral are non-repudiable; they are categorical.

In his final years Lewis worked on a book on the foundations of ethics. It is clear from his manuscripts and letters that the ethics book occupied Lewis’s attention in the early forties and for the rest of his life. While it is difficult to understand why Lewis was unable to work the material into a form which satisfied him, I think that it had come to have an importance in his mind, a finality, which combined with his declining health, prevented a final satisfactory version being written for he continued to work on his ethics book writing almost daily until his death in February of 1964.

9. References and Further Reading

a. Major Works by Lewis

  • Lewis, C.I., 1929. Mind and The World Order: an Outline of a Theory of Knowledge . Charles Scribner’s Sons, New York, 1929, reprinted in paperback by Dover Publications, Inc. New York, 1956.
  • Lewis, C.I., 1932a. Symbolic Logic (with C.H. Langford). New York: The Appleton-Century Company, 1932 pp. xii +506, reprinted in paperback by New York: Dover Publications, 1951.
  • Lewis,C. I., 1946. An Analysis of Knowledge and Valuation , (The Paul Carus Lectures, Series 8, 1946) Open Court, La Salle, 1946.
  • Lewis, C.I., 1955a. The Ground and Nature of the Right , The Woodbridge Lectures, V, delivered at Columbia University in November 1954, New York, Columbia University Press, 1955.
  • Lewis, C.I., 1957a. Our Social Inheritance , Mahlon Powell Lectures at University of Indiana, 1956, Bloomington, Indiana, Indiana University Press, 1957.
  • Collected Papers of Clarence Irving Lewis , ed. John D. Goheen and John L. Mothershead, Jr., Stanford University Press, Stanford, 1970.
    • Includes most of Lewis’s most important articles.
  • Values and Imperatives, Studies in Ethics , ed. John Lange, Stanford University Press, Stanford, California, 1969.
    • Includes a number of Lewis’s late, unpublished talks on ethics.

b. Secondary Sources

  • Dayton, Eric. AC I Lewis And The Given@, Transactions of the Charles S . Peirce Society , 31(2), Spr 1995, pp. 254-284.
  • Flower, Elizabeth and Murphey, Murray G. A History of Philosophy in America , New York, G.P. Putnam’s Sons, 1977, Chapter 15. pp.892-958.
  • Gowans, Christopher W. ATwo Concepts Of The Given In C I Lewis: Realism And Foundationalism@. The Journal of the History of Philosophy , 27(4), 1989, pp. 573-590.
  • Haack, Susan. “C I Lewis” In American Philosophy , Singer, Marcus G (Ed), Cambridge, Cambridge University Press, 1986, pp. 215-238.
  • Hill, Thomas English. Contemporary Theories of Knowledge , The Ronald Press Co., New York, 1961, chapter 12, pp. 362-387.
  • Kuklick, Bruce. The Rise of American Philosophy, New Haven, Yale University Press, 1977, chapter 28, pp. 533-562.
  • Reck, Andrew J. The New American Philosophers , Louisiana State University Press, Baton Rouge, 1968, pp. 3-43.
  • Rosenthal, Sandra B. The Pragmatic a priori: Study In The Epistemology Of C I Lewis . St Louis, Green, 1976.
  • Schilpp, Paul Arthur (Ed). The Philosophy Of C I Lewis . La Salle Il Open Court, 1968.
  • Thayer, H S. Meaning And Action: A Critical History Of Pragmatism. Indianapolis Bobbs-Merrill, 1968, chapter 4, pp.205-231.

 

Author Information

Eric Dayton
Email: eric.dayton@usask.ca
University of Saskatchewan
Canada

Gottfried Leibniz: Metaphysics

leibnizThe German rationalist philosopher, Gottfried Wilhelm Leibniz (1646-1716), is one of the great renaissance men of Western thought. He has made significant contributions in several fields spanning the intellectual landscape, including mathematics, physics, logic, ethics, and theology. Unlike many of his contemporaries of the modern period, Leibniz does not have a canonical work that stands as his single, comprehensive piece of philosophy. Instead, in order to understand Leibniz’s entire philosophical system, one must piece it together from his various essays, books, and correspondences. As a result, there are several ways to explicate Leibniz’s philosophy. This article begins with his theory of truth, according to which the nature of truth consists in the connection or inclusion of a predicate in a subject.

Together with several apparently self-evident principles (such as the principle of sufficient reason, the law of contradiction, and the identity of indiscernibles), Leibniz uses his predicate-in-subject theory of truth to develop a remarkable philosophical system that provides an intricate and thorough account of reality. Ultimately, Leibniz’s universe contains only God and non-composite, immaterial, soul-like entities called “monads.” Strictly speaking, space, time, causation, material objects, among other things, are all illusions (at least as normally conceived). However, these illusions are well-founded on and explained by the true nature of the universe at its fundamental level. For example, Leibniz argues that things seem to cause one another because God ordained a pre-established harmony among everything in the universe. Furthermore, as consequences of his metaphysics, Leibniz proposes solutions to several deep philosophical problems, such as the problem of free will, the problem of evil, and the nature of space and time. One thus finds Leibniz developing intriguing arguments for several philosophical positions—including theism, compatibilism, and idealism.

This article is predominately concerned with this broad view of Leibniz’s philosophical system and does not deal with Leibniz’s work on, for example, aesthetics, political philosophy, or (except incidentally) physics. Leibniz’s “mature metaphysical career” spanned over thirty years. During this period, it would be surprising if some of his basic ideas did not change, but, remarkably, the broad outline of his philosophy does remain constant.

Table of Contents

  1. Life
  2. The Idea of Truth
  3. Sufficient Reason
  4. Substance, Briefly
  5. Necessary Being
  6. Problems of Freedom, Sin, and Evil
    1. Freedom and Sin
    2. Problem of Evil
  7. Space, Time, and Indiscernibles
    1. Against the Absolute Theory
    2. The Relational Theory
    3. Objections and Replies
  8. Substance as Monad
    1. Monads and Complete Concepts
    2. Pre-established Harmony, Windowlessness, and Mirroring
  9. Implications of Substances as Monads
    1. Levels of Reality
    2. Little Perceptions
    3. Composites and Substantial Forms
    4. Innate Ideas
  10. Monadic Activity and Time
  11. Influence
  12. Editions of Leibniz

1. Life

Gottfried Wilhelm Leibniz was born in Leipzig, Germany, on July 1, 1646. He was the son of a professor of moral philosophy. After university study in Leipzig and elsewhere, it would have been natural for him to go into academia. Instead, he began a life of professional service to noblemen, primarily the dukes of Hanover (Georg Ludwig became George I of England in 1714, two years before Leibniz’s death). His professional duties were various, such as official historian and legal advisor. Above all, he was required to travel widely, meeting many of the foremost intellectuals in Europe—of particularly formative importance were the astronomer, mathematician, and physicist Huygens, and the philosopher Spinoza.

Leibniz was one of the great polymaths of the modern world. Moreover, a list of his significant contributions is almost as long as the list of his activities. As an engineer, he worked on calculating machines, clocks, and even mining machinery. As a librarian, he more or less invented the modern idea of cataloguing. As a mathematician, he not only produced ground-breaking work in what is now called topology, but came up with the calculus independently of (though a few years later than) Newton, and his notation has become the standard. In logic, he worked on binary systems, among numerous other areas. As a physicist, he made advances in mechanics, specifically the theory of momentum. He also made contributions to linguistics, history, aesthetics, and political theory.

Leibniz’s curiosity and genius ranged widely, but one of the most constant of his concerns was to bring about reconciliation by emphasizing the truths on each side of even the most seemingly contradictory positions. Throughout his life, he hoped that his work on philosophy, as well as his work as a diplomat, would form the basis of a theology capable of reuniting the Church, which had been divided since the Reformation in the 16th Century. Similarly, he was willing to engage with, and borrow ideas from, the materialists as well as the Cartesians, the Aristotelians as well as the most modern scientists. It is quite ironic, then, that he was a partial cause of a dispute between British and Continental mathematicians concerning who was first to develop the calculus (and who might have plagiarized who), a dispute which slowed the advance of mathematics in Europe for over a century.

However, the great variety of Leibniz’s work meant that he completed few of his ambitious projects. For present purposes, this means above all that Leibniz’s rich and complex philosophy has to be gathered primarily from a large set of quite short manuscripts, many fragmentary and unpublished, as well has his various correspondences. (The last section of this article provides bibliographical details of several editions of Leibniz’s work.) As a result, a major controversy in Leibniz scholarship is the question of where to begin. Insofar as Leibniz is a logician, it is tempting to begin with his conception of truth (and, indeed, this will be the starting point of this article). But insofar as Leibniz is a metaphysician, it is equally tempting to begin with his account of the nature of reality, in particular his notion of substance as monads. Less common, but perhaps equally likely, starting points might reside in Leibniz the mathematician, the theologian, or the physicist. These controversies, however, already contain a lesson: to an important degree it doesn’t matter. So integrated were his various philosophical interests—so tightly laced together into a system—that one ought to be able to begin anywhere and reconstruct the whole. Or at least Leibniz evidently thought so, since often he uses an idea from one part of his philosophy to concisely prove something in an apparently quite distant philosophical region. However, due to this systematic nature of his philosophy, in which every idea seems to rely upon others, engaging Leibniz’s ideas often proves to be challenging.

2. The Idea of Truth

According to Leibniz, a conception of truth has important consequences for a conception of reality and how it is to be understood at its most profound level. Intuitively, a proposition is true when its content is adequate to the situation in the world to which it refers. For example, “the sky is gray” is true if and only if the thing out there in the world called “the sky” is actually the color called “gray” at the time the proposition is stated. This, however, raises issues about the relationship of language to the world and what “adequacy” consists in.

Leibniz claims that one can bypass problems with the intuitive notion of truth, at least for the moment. Truth, according to Leibniz, is simply a proposition in which the predicate is contained in the subject. The predicate is what is asserted; the subject is what the assertion is about. All true propositions, then, can be expressed by the following general form: “subject is predicate.” This is not, by any means, an idea unique to Leibniz. What is unique, however, is the single-mindedness with which he pursues the consequences of such an idea of truth. (See, for example, “Correspondence with Arnauld,” 14 July 1686.)

This notion of truth seems straight-forward enough for what are commonly called analytic propositions, such as “Blue is a color,” which has more to do with the definition of blue than it does with the world. The notion of color is part of the notion of blue. Similarly, in the basic logical truth “A is A,” the predicate is not just contained in the subject, it is the subject. But, Leibniz states that this “being contained” is implicitly or virtually the case with other truths (see “Primary Truths” and “The Nature of Truth”). Take, for example, the statement “Peter is ill.” Intuitively, this proposition is true only if it refers to a real world in which Peter is, in fact, ill. Leibniz, however, analyzes this as follows: if one knew everything there is to know about Peter, that is, if one had a complete concept of Peter, one would also know (among many other things) that he is ill at the moment. Therefore, the statement “Peter is ill” is true not primarilybecause of some reference to the world, but in the first instance because someone has the concept of Peter, which is the subject of the proposition, and that concept contains (as a predicate) his being ill. Of course, it may be the case that one happens to know that Peter was ill because one refers to the world (perhaps sees him cough repeatedly). But the fact that one finds out about Peter in this way does not make the statement that “Peter is ill” true and thus a piece of knowledge because of that reference. One must distinguish the concept of truth from pragmatic or methodological issues of how one happens to find out about that truth, or what one can do with the truth. The latter, according to Leibniz, are completely irrelevant to the question “What is truth?” in itself.

Leibniz also claims that a statement is true for all time—that is, whenever the statement is made. So, for example, the statement “Peter is ill (on January 1st, 1999)” was true in the year 1998 (even though no one knew it yet) as well as in the year 2000 (even though everyone may have forgotten about the illness by then). It was also true a million years ago, and will be true a million years from now, although it is very unlikely that anyone will actually know this truth at those times.

Leibniz’s own example is of Julius Caesar. He writes:

For if some person were capable of completing the whole demonstration by means of which he could prove this connection of the subject (which is Caesar) with the predicate (which is his successful enterprise [winning the battle of Pharsalus, etc.]), he would then show that the future dictatorship of Caesar had its foundation in his notion or nature, that a reason can be found there why he resolved to cross the Rubicon rather than stop, and why he won rather than lost the day at Pharsalus… (Discourse on Metaphysics, §13).

However, there are several ideas Leibniz introduces in this passage that require further investigation. What is meant by “completing the whole demonstration,” by something having a “foundation,” or by “a reason can be found?”

3. Sufficient Reason

As previously stated, for any proposition, truth is defined by Leibniz in the same way: the predicate is contained in the subject. It only takes a little thought to realize that for any one subject (like Peter or Caesar), the number of predicates which are true of it will be infinite (or at least very large), for they must include every last thing Peter or Caesar did or will do, as well as everything that did or will ever happen to them. But now it is natural to ask: Why do all these predicates come together in the one subject? It could be that the predicates are a quite arbitrary or random collection—although Leibniz does not believe this, and it is certainly not intuitive. Rather, one predicate or set of predicates explains another. For example, Peter’s coming into contact with a virus explains his illness. Or, Caesar’s ambition and boldness explains why he decided to cross the Rubicon. So, many (at least) of the predicates that are true of a subject “hang together” as a network of explanations.

Leibniz goes further still by claiming that for every predicate that is true of a subject, there must be a set of other true predicates which constitute a sufficient reason for its being true. This he calls the principle of sufficient reason—that there must be a sufficient reason for why things are as they are and not otherwise. This is why he uses words like “foundation” and “reason” in the quotation above. Unless this were true, Leibniz argues, the universe would not make any sense, and science and philosophy both would be impossible (see, for example, New Essays on Human Understanding, preface, p. 66). Moreover, it would be impossible to account for a basic notion like identity unless there was a sufficient reason why Caesar, for example, with his particular properties at a given time, is identical with the Caesar who existed a week prior with such different properties (see “Remarks on Arnauld’s Letter,” May 1686).

The principle of sufficient reason also accounts for why Leibniz uses the phrase “completing the whole demonstration” in the above quote. If the complete concept of the subject (that is, all of its true predicates) together constitutes a complete network of explanation, then these explanations can be followed forward and backward, so to speak, at least in principle. That is, working forward, one coulddeduce that Caesar will cross the Rubicon from a all the predicates that have been true of him; or, working backward, one can deduce from all those predicates true of Caesar at his death the reasons why he won the battle of Pharsalus. The “whole demonstration,” then, is the revelation of the logical structure of the network of explanations that make Caesar who he is.

However, this is clearly not something the average person can do. Human minds are not subtle and capacious enough for a task which may be infinite. Still, in a more limited way, one can certainly talk about personalities, characters, and causes or reasons for things. The quotation from Leibniz given above continues:

… [he who completed the whole demonstration would then show] that it was rational and therefore definite that this would happen, but not that it is necessary in itself, or that the contrary implies a contradiction (Discourse on Metaphysics, §13).

These qualifications are quite important for Leibniz. It was often suggested by Leibniz’s contemporaries (and is still being suggested) that his idea of the sufficient reason of all the predicates of a subject meant that everything true of a subject is necessarily true. This might entail that Caesar did not choose to cross the Rubicon, but that he was acting in a determined manner, like a machine. In other words, Leibniz seems to be denying any sort of free will. The free will problem will be discussed in more detail below, but for the moment, a few observations can be made.

First, Leibniz claims that Caesar’s crossing of the Rubicon is not necessary in the sense that “A is A” is necessary. Because while “A is not A” is a contradiction, Caesar’s deciding not to cross the Rubicon does not imply a contradiction. To be sure, history would have been different—even Caesar would have been different—but there is no contradiction in that strong sense. Caesar’s properties are not logically necessary.

Second, any truth about Caesar–indeed, the whole complete concept of Caesar–is not “necessary in itself.” Caesar is Caesar, but nothing about Caesar in himself proves that Caesar has to be. By contrast, “A is A” doesn’t need any other explanation for its truth. So, while every property of Caesar is explained by some other property of Caesar, no property explains why it is true that Caesar existed. Caesar is not anecessary being.

What the precise details are of Leibniz’s account of free will remain a strenuously debated issue in Leibniz scholarship (especially what the exact nature is of these distinctions, whether he is justified in making them, and even if justified whether they yield the results he claims in the area of free will). More detail will be added to this account below, but the existence of this debate should be kept in mind throughout.

4. Substance, Briefly

At this point, it is useful to turn from a conception of truth to a conception of substance. Leibniz’s philosophy of substance will be explicated in more detail in section 8 (Substance as Monad). For the moment, simply observe that for humans (though not for God), complete concepts are always concepts of existing substances–that is, of really existing things. Leibniz writes:

Now it is obvious that all true predication has some foundation in the nature of things, and when a proposition is not identical, that is to say when the predicate is not expressly included in the subject, it must be virtually included in it.[…] This being so, we can say that the nature of an individual substance or of a complete being is to have a notion so complete that it is sufficient to include, and to allow the deduction of, all the predicates of the subject to which that notion is attributed (Discourse on Metaphysics, §8, emphasis added).

To be the individual substance, Caesar, then, is to be such as to have a notion which includes everything that can truthfully be predicated of the subject Caesar. Thus, one might say that, for Leibniz, a substance is a complete concept made real, and a complete concept is a real substance expressed or “perceived” in thought. Moreover, just as for any one predicate, the complete concept contains other predicates which explain that predicate, for any given property of a substance, the complete individual substance will itself be the explanation for that property. Caesar chose to cross the Rubicon for many complex reasons, but they all boil down to this: that was the kind of individual Caesar was.

Leibniz has much more to say about substance, but he claims that it all follows from this insight. However, the exact relationship Leibniz intended between the logical idea of a complete concept and the metaphysical idea of a substance is still debated in Leibniz scholarship.

5. Necessary Being

The complete concept of Caesar, according to Leibniz, cannot explain itself in its entirety. Expressed ontologically, this means that Caesar himself provides no explanation of why Caesar should have existed at all–Caesar is a contingent being. “Contingent” here simply means something that could have been otherwise; in the case of Caesar as a being, then, it means something that could have not existed at all. The principle of sufficient reason must not only apply to each predicate in the complete concept of a subject, but also it must apply to the concept itself in its entirety as the concept of an existing thing. Thus, there must be a sufficient reason for why this particular substance, Caesar, exists, rather than some other substance, or nothing at all.

What, then, sufficiently explains a contingent being such as Caesar? Possibly other substances, such as his parents, and they in turn are explained by still others? But the entire course of the universe, the total aggregate of substances across space and time, are one and all contingent. There are other possible things, to be sure; but there are also other possible universes that could have existed but did not. The totality of contingent things themselves do not sufficiently explain themselves. Here again, the principle of sufficient reason applies. There must be, Leibniz insists, something beyond the totality of contingent things which explains them, something which is itself necessary and therefore requires no explanation other than itself. (Note, however, that this does not assume an origin or beginning in any sense. Even if time stretched infinitely into the past, there would still be no explanation for the total course of things.)

suffrea

God, according to Leibniz, is the necessary being which constitutes the sufficient explanation of the totality of contingent things–why the universe is this way rather than any other. Thus far, God’s necessity is the only thing mentioned about such a being (there is not much religious or theological about this initially bare metaphysical concept). God as a being may be necessary, but if the contingent universe were simply a random or arbitrary act of God, then God would not constitute the required explanation of all things. In other words, God must not only be necessary, but also the source of the intelligibility of all things. It must be possible, therefore, to inquire into the reasons God had for authorizing or allowing this, rather than any other, universe to be the one that actually exists. And if God is to be the explanation of the intelligibility of the universe, then God must have access to that intelligibility, such that God could be said to know what it is that is being allowed to exist–that is, God must have the ability to grasp complete concepts, and to see at once the “whole demonstration” discussed above. God so far is therefore (i) a necessary being, (ii) the explanation of the universe, and (iii) the infinite intelligence.

Here Leibniz famously brings in the notion of perfection (see, for example, “A Specimen of Discoveries”). One has to try to imagine God, outside of time, contemplating the infinite universe that “he” is going to, not create, but allow to be actual and sustain in existence. In the mind of God are an infinite number of infinitely complex and complete concepts, all considered as possibly existent substances, none having any particular “right” to exist. There is just one constraint on this decision: it must not violate the other basic principle of Leibniz’s, the law of non-contradiction (also known as “the law of contradiction”). In other words, each substance may individually be possible, but they must all be possible together–the universe forming a vast, consistent, non-contradictory system. For example, God could not create a universe in which there are both more sheep than cows and more cows than sheep. God could choose a universe in which there is the greatest possible quantity of pizza, or in which everything is purple, and so on. However, according to Leibniz, God chooses the universe that is the most perfect. This principle of perfection is not surprising since it is most consummate with the idea of God as an infinite being; to choose any other less perfect universe would be to choose a lesser universe. Thus, according to Leibniz, the actual world is the best of all possible worlds. (This claim, and its apparent implications, were very effectively and famously satirized by Voltaire in his Candide. Note also that Leibniz is often taken as an ancestor of modern possible worlds semantics; however, it is undeniable that at least the context and purpose of Leibniz’s notion of a possible universe was quite different.) Leibniz explores the theological consequences of this at, for example, the end of Discourse on Metaphysics. (There may be a difficult theological implication here: must God be thought of as constrained, first by the concept of perfection, and then by the systemic nature of his creation? Leibniz attempts, for example, in the “Correspondence with Arnauld” to escape this conclusion.)

To try to understand further this notion of perfection, Leibniz explores several concepts in various writings: notions of the best, the beautiful, the simply compossible, greatest variety or the greatest quantity of essence. The last of these is the explanation he continually comes back to: perfection simply means the greatest quantity of essence, which is to say the greatest richness and variety in each substance, compatible with the least number of basic laws, so as to exhibit an intelligible order that is “distinctly thinkable” in the variety (see “A Resume of Metaphysics;” there is a relationship to the Medieval, and particularly Augustine, notion of plenitude). Leibniz seems to understand this principle as simply self-evident. It certainly seems to be a big jump to the aesthetic, moral, and wise God from the ontological conception of God deduced above. However, Leibniz may have a point in arguing that it would be absurd in some sense for an infinite being to choose anything other than an infinitely rich and thus perfect universe. He also finds this aesthetic observed throughout nature: natural forms tend towards a maximum of variety compatible with orderliness. Nevertheless, contemporary philosophers generally find Leibniz’s conclusion here to not strictly follow from the previous considerations.

For Leibniz, this forms a proof for the existence of God (see Monadology §§37-39 and “A Specimen of Discoveries”). In fact, it is a version of the third of the cosmological arguments given by St. Thomas Aquinas, and subject to many of the same difficulties. One might, for example, object in a Kantian vein that the concept of explanation, rightly demanded of all individual contingent beings, is applied beyond its proper sphere in demanding an explanation of the totality of contingent beings. But Leibniz might well counter that this objection assumes a whole theory of the “proper spheres” of concepts.

6. Problems of Freedom, Sin, and Evil

a. Freedom and Sin

Leibniz’s conception of God, however, may seem to cause more problems than it solves. For example, if the complete concept of any being, such as a human being, is known for all time, and was chosen by God for existence, then is such a being free? It seems that what one means by “freedom” is that the outcome is not predictable, as opposed to, for example, the way in which the operation of a washing machine or the addition of two numbers is predictable. Further, what must one make of morality and sin? Why, for example, should God punish Adam and Eve for sinning when they seemed to have no free choice, since God knew in advance (predicted and, indeed, made it the case) that they were going to sin?

While Leibniz’s philosophical system demands a certain sense of determinism about the universe, he does not want to deny the existence of free will. Leibniz thus seeks to substantiate a form or compatibilism(that is, a view which takes determinism to be compatible with free will). In order to accomplish this, Leibniz distinguishes between several ways in which things might be determined in advance. Whatever is determined is clearly true. Truth, however, comes in several varieties. (Much of the following is taken from the set of distinctions Leibniz makes in “Necessary and Contingent Truths;” Leibniz makes similar but rarely identical sets of distinctions in a variety of texts.)

  1. Truths of Essence
    These come in two varieties:

    1. Primary/original truth: the law of non-contradiction, for example.
    2. Eternal, metaphysical, or geometrical truths: the laws of arithmetic or geometry, for example, which Leibniz claims can be reduced by a finite process of argumentation and substitution of definitions to primary truth. These are valid in all possible universes.
  2. Truths of Existence, of Fact, or of Hypothesis
    Here, arguably, Leibniz sees four varieties:

    1. Absolutely universal truths: those truths definitive of this universe as being the most perfect universe. Leibniz writes: “Indeed, I think that in this series of things there are certain propositions which are true with absolute universality, and which cannot be violated even by a miracle” (“Necessary and Contingent Truths”).
    2. Universal-physical truths: the laws of physics and other such efficient causes, for example; truths which hold universally of all substances in this, but not in all possible, universes, but which also could, in principle, be violated by a miracle, in accordance with overall divine providence.
    3. Individual metaphysical truths: truths about the properties of individual substances, where those properties follow from the complete concept–and thus are apparent to God, but do not follow any “subordinate universal laws.” Deduction of such truths is available to no being, no matter how perfect or perceptive, other than God.
    4. Hypothetical truths: only truths of essence can be necessary, absolutely and strictly speaking. All other truths, such as the actions of Caesar, are only “hypothetically” necessary–that is, only on the hypothesis that a universe exists as it is, with beings such as these in it (see Discourse on Metaphysics, §13 and “Correspondence with Arnauld,” April 12th, 1686).

A person’s actions are, therefore, not necessary by definition (regardless, at this point, of which type of “truth of existence” they fall under). Thus, the concept of an individual “inclines without necessitating” (seeDiscourse on Metaphysics, §30). Leibniz further writes:

For speaking absolutely, our will is in a state of indifference, in so far as indifference is opposed to necessity, and it has the power to do otherwise, or to suspend its action altogether, both alternatives being and remaining possible. […] It is true, however, and indeed it is certain from all eternity, that a particular soul will not make use of this power on such and such an occasion. But whose fault is that? Does it have anyone to blame but itself? (Discourse on Metaphysics, §30, emphasis added)

By “indifference,” Leibniz means a physical indifference–that is to say, there is no universal-physical truth, as defined above, which governs human action. For Leibniz, this means that human action is further freed: the will has the power to suspend its action with respect to the physical sequence of efficient causes, but also even with respect to what would otherwise be seen as a decisive final cause. Leibniz states: “For they [free or intelligent substances] are not bound by any certain subordinate laws of the universe, but act as it were by a private miracle” (“Necessary and Contingent Truths”).

Minds, then, are different from mechanical causes. (As it will be shown below, Leibniz goes against the trend of 17th and 18th century thought by reintroducing the Aristotelian and Scholastic notion of a final cause and, indeed, substantial forms.) Although Leibniz occasionally uses the analogy of a machine to describe the soul, the kinds of forces and causes operative in the former are simply inapplicable to the latter. Thus, if by individual free choice one means an individual action that cannot be known in advance by even an infinitely subtle application of the laws of physics, chemistry, or biology, then humans have free choice in that sense as well.

Leibniz also offers the following additional arguments for his particular conception of human free will:

(i) Freedom as “unpredictability” might be taken to mean freedom as an act uncaused. But this makes no sense, for free choice is not randomness. Caesar’s free act, for example, has a cause–namely, Caesar. Why should one complain when the individual concept of Caesar intrinsically determines what Caesar does? Isn’t Caesar free if he is the source of his action, and not anyone or anything else?

(ii) A necessary ignorance of the future is practically, perhaps even logically, equivalent to freedom. Again, grasping the full explanation of any predicate that lies in the complete concept is an infinite task. To help illustrate the distinction between contingent and necessary truths, Leibniz makes a famous analogy with the incommensurability of any whole number or fraction with a “surd” (for example, the square root of two, the value of which cannot be represented numerically by any finite series of numbers.) For finite human minds, that incommensurability is a positive fact, just like contingency–no matter that for God neither calculation is impossible, or even more difficult. Thus contingent truths can in principle be known from all time, but necessarily not by a human being (see, for example, “On Freedom”). Leibniz writes: “Instead of wondering about what you cannot know and what can tell you nothing, act according to your duty, which you do know” (Discourse on Metaphysics, §30). (It should be pointed out that this is somewhat more than an analogy, since it is closely related to the kinds of problems infinitesimal calculus was designed to deal with–and Leibniz takes the possibility of a calculus as having real metaphysical implications.)

(iii) A famous scholastic debate concerned the so-called “Sloth Syllogism.” If everything is fated, the argument goes, then whatever action one “does” will or will not happen whether or not one wills it, therefore one need not will anything at all. One can just be a sloth, and let the universe happen. Leibniz thinks this is absurd–indeed, immoral. The will of an individual matters. If John Doe is the kind of person who is a sloth, then (everything else being the same) the course of his life will indeed be quite different than if he is the kind of person (like Caesar) who takes events by the scruff of the neck.

(iv) What many philosophers mean by “contingent” is that an individual predicate “could have been different,” and everything else the same. For Leibniz, this is impossible. To change one predicate means to alter the whole complete concept, the substance, and with it the whole universe. Leibniz thus claims that philosophers of a more radical sense of freedom do not take seriously the extent to which the universe is an integrated network of explanations, and that this in turn has implications for the idea of contingency (see the discussion of Adam in Leibniz’s letter to Landgraf Ernst von Hessen-Rheinfels, April 12, 1686). Thus, contingent events, even one’s free acts, must be part of the perfection of the universe. Although, that does not mean that all contingent events are so in the same way.

According to Leibniz, any remaining objections to this idea of free will only result from a metaphysically incoherent idea of what freedom means. There is no question that Leibniz introduced a spirited and powerful position into the age-old philosophical debate concerning free will. Which position is “metaphysically incoherent,” however, remains under debate. (For more on the philosophical debate of free will, see “Free Will“.)

b. Problem of Evil

Leibniz’s approach to the classic problem of evil is similar. The problem of evil, for Leibniz, can be put in the following way: If God is supremely good, and the creator (or author) of the best possible universe, then why is there so much pain and sin in the world? Leibniz claims that this apparent paradox is not a real problem. Leibniz coined the term “theodicy” to refer to an attempt to reconcile God’s supremely benevolent and all-good nature with the evil in the world. Thus, Leibniz’s Theodicy is largely a proposed solution to the problem of evil. However, his thoughts on the issue are to be found spread over many texts. (For more on the problem of evil, see the entries “The Evidential Problem of Evil” and “The Logical Problem of Evil.”)

Here, very briefly, are three of Leibniz’s main replies to the problem of evil:

(i) Human minds are only only aware of a small fraction of the universe. To judge it full of misery on this small fraction is presumptuous. Just as the true design–or, indeed, any design–of a painting is not visible from viewing a small corner of it, so the proper order of the universe exceeds one’s ability to judge it.

(ii) The best possible universe does not mean no evil, but that less overall evil is impossible.

(iii) Similarly to the previous argument, and in the best Neo-Platonist tradition, Leibniz claims that evil and sin are negations of positive reality. All created beings are limitations and imperfect; therefore evil and sin are necessary for created beings (see Discourse on Metaphysics, §30).

7. Space, Time, and Indiscernibles

a. Against the Absolute Theory

Between 1715 and 1716, at the request of Caroline, Princess of Wales, a series of long letters passed between Leibniz and the English physicist, theologian, and friend of Newton, Samuel Clarke. It is generally assumed that Newton had a hand in Clarke’s end of the correspondence. They were published in Germany and in England soon after the correspondence ceased and became one of the most widely read philosophical books of the 18th Century. Leibniz and Clarke had several topics of debate: the nature of God’s interaction with the created world, the nature of miracles, vacua, gravity, and the nature of space and time. Although Leibniz had written about space and time previously, this correspondence is unique for its sustained and detailed account of this aspect of his philosophy. It is also worth pointing out that Leibniz (and after him Kant) continues a long tradition of philosophizing about space and time from the point of view of space, as if the two were always in a strict analogy. It is only rarely that Leibniz deals in any interesting way with time on its own (we shall return to this in section 10).

Newton, and after him Clarke, argued that space and time must be absolute (that is, fixed background constants) and in some sense really existent substances in their own right (at least, this was Leibniz’s reading of Newton). The key argument is often called the “bucket argument.” When an object moves, there must be some way of deciding upon a frame of reference for that motion. With linear motion, the frame does not matter (as far as the mathematics are concerned, it does not matter if the boat is moving away from the shore, or the shore is moving away from the boat); even linear acceleration (changing velocity but not direction) can be accounted for from various frames of reference. However, acceleration in a curve (to take Newton’s example, water forced by the sides of a bucket to swirl in a circle, and thus to rise up the sides of the bucket), could only have one frame of reference. For the water rising against the sides of the bucket can be understood if the water is moving within a stationary universe, but makes no sense if the water is stationary and the universe is spinning. Such curved acceleration requires the postulation of absolute space which makes possible fixed and unique frames of reference. (Similar problems made Einstein’s General Theory of Relativity so much more mathematically complicated than the Special Theory.)

Leibniz, however, has a completely different understanding of space and time. First of all, Leibniz finds the idea that space and time might be substances or substance-like absurd (see, for example, “Correspondence with Clarke,” Leibniz’s Fourth Paper, §8ff). In short, an empty space would be a substance with no properties; it will be a substance that even God cannot modify or destroy.

But Leibniz’s most famous arguments for his theory of space and time stem from the principle of sufficient reason (the principle that everything which happens has, at least in principle, an explanation of why it happened as it did and not otherwise). From this principle, together with the law of non-contradiction, Leibniz believes that there follows a third: the principle of the identity of indiscernibles, which states that any entities which are indiscernible with respect to their properties are identical. Leibniz is fond of using leaves as an example. Two leaves often look absolutely identical. But, Leibniz argues, if “two” things are alike in every respect, then they are the same object, and not two things at all. So, it must be the case that no two leaves are ever exactly alike.

Leibniz’s support for the principles of the identity of indiscernibles primarily derives from his commitment to the principle of sufficient reason in the following way. If any objects are in every way the same, but actually distinct, then there would be no sufficient reason (that is, no possible explanation) for why the first is where (and when) it is, and the second is where (and when) it is, and not the other way around. If, then, one posits the possible existence of two identical things (things that differ in number only–that is, one can count them, but that is all), then one also posits the existence of an absurd universe, one in which the principle of sufficient reason is not universally true. Leibniz often expresses this in terms of God: if two things were identical, there would be no sufficient reason for God to choose to put one in the first place and the other in the second place. (Note that Leibniz’s argument relates to a scholastic debate centered on the notion of “Buridan’s Ass.”)

Similar considerations apply to Newtonian absolute space. Leibniz’s argument against the Newton-Clarke position can be understood here as two related reductio ad absurdum arguments. The first concerns the violation of the principle of the identity of indiscernibles. Suppose that space is absolute. Since every region of space would be indiscernible from any other and spatial relations would be construed as extrinsic, it would be possible for two substances to be indiscernible yet distinct in virtue of being in different locations. But this is absurd, Leibniz argues, because it violates the principle of the identity of indiscernibles. Therefore, space must not be absolute (see “Correspondence with Clarke,” Leibniz’s Third Paper). The second reductio concerns the violation of the principle of sufficient reason. Suppose that space is absolute. Leibniz argues that there would then be no sufficient reason for why the whole universe was created here instead of two meters to the left (because no region of space is discernible from any other). Thus, absolute space is absurd, because it violates the principle of sufficient reason (see “Correspondence with Clarke,” Leibniz’s Fourth Paper). (Analogous problems are thought to result from a conception of absolute time.)

space

b. The Relational Theory

That is the negative portion of Leibniz’s argument. But what does all this say about space? For Leibniz, the location of an object is not a property of an independent space, but a property of the located object itself (and also of every other object relative to it). This means that an object here can indeed be different from an object located elsewhere simply by virtue of its different location, because that location is a real property of it. That is, space and time are internal or intrinsic features of the complete concepts of things, not extrinsic. Let us return to the two identical leaves. All of their properties are the same, except that they are in different locations. But that fact alone makes them completely different substances. To swap them would not just involve moving things in an indifferent space, but would involve changing the things themselves. That is, if the leaf were located elsewhere, it would be a different leaf. A change of location is a change in the object itself, since spatial properties are intrinsic (similarly with location in time).

Leibniz’s view has two major implications. First, there is no absolute location in either space or time; location is always the situation of an object or event relative to other objects and events. Second, space and time are not in themselves real (that is, not substances). Space and time are, rather, ideal. Space and time are just metaphysically illegitimate ways of perceiving certain virtual relations between substances. They are phenomena or, strictly speaking, illusions (although they are illusions that are well-founded upon the internal properties of substances). Thus, illusion and science are fully compatible. For God, who can grasp all at once complete concepts, there is not only no space but also no temptation of an illusion of space. Leibniz uses the analogy of the experience of a building as opposed to its blueprint, its overall design (see, for example, “Correspondence with Arnauld” 12 April 1686 and Monadology §57). It is sometimes convenient to think of space and time as something “out there,” over and above the entities and their relations to each other, but this convenience must not be confused with reality. Space is nothing but the order of co-existent objects; time nothing but the order of successive events. This is usually called a relational theory of space and time. (For more information, see §6 on relative vs. absolute theories of time).

Space and time, according to Leibniz, are thus the hypostatizations of ideal relations, which are real insofar as they symbolize real differences in substances, but illusions to the extent that (i) space or time are taken as a thing in itself, or (ii) spatial/temporal relations are taken to be irreducibly exterior to substances, or (iii) extension or duration are taken to be a real or even fundamental property of substances. Take the analogy of a virtual reality computer program. What one sees on the screen (or in a specially designed virtual reality headset) is the illusion of space and time. Within the computer’s memory are just numbers (and ultimately mere binary information) linked together. These numbers describe in an essentially non-spatial and temporal way a virtual space and time, within which things can “exist,” “move” and “do things.” For example, in the computer’s memory might be stored the number seven, corresponding to a bird. This, in turn, is linked to four further numbers representing three dimensions of space and one of time–that is, the bird’s position. Suppose further the computer contains also the number one, corresponding to the viewer and again linked to four further numbers for the viewer’s position, plus another three giving the direction in which the viewer’s virtual eyes are looking. The bird appears in the viewer’s headset, then, when the fourth number associated with the bird is the same as the viewer’s fourth number (they are together in time), and when the first three numbers of the bird (its position in virtual space) are in a certain algebraic relation to the number representing the viewer’s position and point of view. Space and time are reduced to non-spatial and non-temporal numbers. For Leibniz, God in this analogy apprehends these numbers as numbers, rather than through their translation into space and time.

c. Objections and Replies

This, however, raises a serious logical problem for Leibniz. Recall Leibniz’s theory of truth as the containedness of a predicate in a subject. This seemed acceptable, perhaps, for propositions such as “Caesar crossed the Rubicon” or “Peter is ill.” But what about “This leaf is to the left of that leaf?” The latter proposition involves not one subject, but three (the two leaves, and whatever is occupying the point-of-view from which the one is “to the left”). Leibniz has to argue that all relational predicates are in fact reducible to internal properties of each of the three substances. This includes time, as well as relations such as “the sister of” or “is angry at.” But can all relations be so reduced, at least without radically deforming their sense? Modern logicians often see this as the major flaw in Leibniz’s logic and, by extension, in his metaphysics.

Furthermore, Leibniz must provide a response to the Newtonian bucket argument. Indeed, Leibniz thinks that one simply needs to provide a rule for the reduction of relations. For linear motion the virtual relation is reducible to either or both the object and the universe around it. For non-linear motion, one must posit a rule such that the relation is not symmetrically reducible to either of the subjects (bucket, or universe around it). Rather, non-linear motion is assigned only when, and precisely to the extent that, the one subject shows the effects of the motion. That is, the motion is a property of the water, if the water shows the effects (see “Correspondence with Clarke,” Leibniz’s Fifth Paper, §53). Perhaps it seems strange that the laws of nature should be different for linear as opposed to non-linear motion. It sounds like anarbitrary new law of nature, but Leibniz might respond that it is no more arbitrary that any other law of nature; people have just become used to the illusion of space and time as extrinsic relations of entities that they are not used to thinking in these terms.

8. Substance as Monad

We are now, finally, ready to get a picture of what Leibniz thinks the universe is really like. It is a strange, and strangely compelling, place. Around the end of the Seventeenth Century, Leibniz famously began to use the word “monad” as his name for substance. “Monad” means that which is one, has no parts and is therefore indivisible. These are the fundamental existing things, according to Leibniz. His theory of monads is meant to be a superior alternative to the theory of atoms that was becoming popular in natural philosophy at the time. Leibniz has many reasons for distinguishing monads from atoms. The easiest to understand is perhaps that while atoms are meant to be the smallest unit of extension out of which all larger extended things are built, monads are non-extended (recall that space is an illusion on Leibniz’s view).

a. Monads and Complete Concepts

We must begin to understand what a monad is by beginning from the idea of a complete concept. As previously stated, a substance (that is, monad) is that reality which the complete concept represents. Acomplete concept contains within itself all the predicates of the subject of which it is the concept, and these predicates are related by sufficient reasons into a vast single network of explanation. So, relatedly, the monad must not only exhibit properties, but contain within itself “virtually” or “potentially” all the properties it will exhibit in the future, as well as contain the “trace” of all the properties it did exhibit in the past. In Leibniz’s extraordinary phrase, found frequently in his later work, the monad is “pregnant” with the future and “laden” with the past (see, for example, Monadology §22). All these properties are “folded up” within the monad; they unfold when they have sufficient reason to do so (see, for example,Monadology §61). Furthermore, the network of explanation is indivisible; to divide it would either leave some predicates without a sufficient reason or merely separate two substances that never belonged together in the first place. Correspondingly, the monad is one, simple and indivisible.

Just as in the analysis of space and time Leibniz argues that all relational predicates are actually interior predicates of some complete concept, so the monad’s properties include all of its relations to every other monad in the universe. A monad, then, is self-sufficient. Having all these properties within itself, it doesn’t need to be actually related to or influenced by another other monad. Leibniz writes:

So if I were capable of considering distinctly everything which is happening or appearing to me now, I would be able to see in it everything which will ever happen or appear to me for all time. And it would not be prevented, and would still happen to me, even if everything outside me were destroyed, so long as there remained only God and me (Discourse on Metaphysics, §14).

Thus, just like space and time, cause and effect is a “well-founded” illusion. According to Leibniz, causation is to be account for by saying that one thing, A, causes another, B, when the virtual relation between them is more clearly and simply expressed in A than in B. But metaphysically, Leibniz argues, it makes no difference which way around the relation is understood, because the relation itself is not real. Leibniz writes:

Thus, in strict metaphysical precision, we have no more reason to say that the ship pushes the water to produce this large number of circles…than to say that the water is caused to produce all these circles and that it causes the ship to move accordingly (“Draft letter to Arnauld,” 8 December 1686).

Leibniz goes on to insist that the first direction of explanation is much simpler, since the second would involve leaping directly to the action of God to explain the extraordinary action of so many individual bits of water. But that simplicity is hardly the same as truth.

b. Pre-established Harmony, Windowlessness, and Mirroring

So, instead of cause and effect being the basic agency of change, Leibniz is offering a theory of pre-established harmony (sometimes referred to as the hypothesis of concomitance) to understand the apparently inter-related behavior of things. Consider the common analogy of two clocks. The two clocks are on different sides of a room and both keep good time (that is, they tell the same time). Now, someone who didn’t know how clocks work might suspect that one was the master clock and it caused the other clock to always follow it. When two things behave in corresponding ways, then it is often assumed (without any real evidence) that there is causation occurring. But another person who knew about clocks would explain that the two clocks have no influence one on the other, but rather they have a common cause (for example, in the last person to set and wind them). Since then, they have been independently running in sync with one another, not causing each other. On Leibniz’s view, every monad is like a clock, behaving independently of other monads. Nevertheless, every monad is synchronized with one another by God, according to his vast conception of the perfect universe. (We must be careful, however, not to take this mechanical image of a clock too literally. Not all monads are explicable in terms of physical, efficient causes.)

In accordance with his theory of pre-established harmony, Leibniz argues that monads do not affect one another and that each monad expresses the entire universe. He has rather unique and extraordinary set of phrases for this; Leibniz states that every monad mirrors the whole of the universe in that it expresses every other monad, but no monad has a window through which it could actually receive or supply causal influences (see Monadology, §7 & §56). Furthermore, since a monad cannot be influenced, there is no way for a monad to be born or destroyed (except by God through a miracle–defined as something outside the natural course of events). All monads are thus eternal. (It is fair to say that Leibniz’s attempt to account for what happens to “souls” before the birth of body, and after its death, lead him to some colorful, but rather strained, speculations.)

9. Implications of Conceiving Substances as Monads

We will examine briefly four important implications of Leibniz’s account of substance: first, the distinction between metaphysical truth and phenomenal description; second, the idea of little perceptions; third, the infinitely composite nature of all body; and fourth, innate ideas.

a. Levels of Reality

Leibniz posits a distinction between levels or “spheres” in his account of reality (“Discourse on Metaphysics,” §10). The primary, most fundamental level of reality is the metaphysical level, which includes only monads, their perceptions, and their appetitions (no causality, no space, no time–at least as ordinarily understood–each monad spontaneously unfolding according to the kind of thing that it is). Thephenomenal or descriptive level involves what appears to be happening from the finite, imperfect perspective of human minds (things cause one another in space and time). Science’s object is the latter, which is an illusion, but in which nothing happens that is not based upon what really happens in the metaphysical level (that is, the illusion is “well-founded”). Therefore, the laws of physics are perfectly correct, as a description. (Berkeley borrows this idea, see especially his “De Motu,” and Kant produces a highly original version of it.) Indeed, Leibniz believes, following Descartes and many other materialists, that all such laws are mechanical in nature, exclusively involving the interaction of momenta and masses–hence his accusation that Newton’s idea of gravity is merely “occult.” However, at the metaphysical level, no account of reality could be less mechanical. Not surprisingly, then, Leibniz’s own contributions to physical science were in the fields of the theory of momentum and engineering.

A serious error would arise only if one took the “objects” of science (matter, motion, space, time, etc.) as if they were real in themselves. Consider the following analogy: in monitoring a nation’s economy, it is sometimes convenient to speak of a retail price index, which is a way of keeping track of the average change in the prices of millions of items. But there is nothing for sale anywhere which costs just that amount. As a measure it works well, provided one does not take it literally. Science, in order to be possible for finite minds, involves that kind of simplification or “abbreviation” (see, for example, “Letter to Arnauld,” 30 April 1687).

b. Little Perceptions

Leibniz is one of the first philosophers to have analyzed the importance of that which is “unconscious” in one’s mental life. That a monad is a “mirror” of the whole universe entails that one’s soul will actually have an infinite number and complexity of perceptions. Obviously, however, one does not apperceive (that is, one is not conscious of) all these little perceptions, as Leibniz calls them. Thus, perception for Leibniz does not mean apperception. (Leibniz argues that this is a major error on Descartes’ part.) Further, where one is conscious of some perception, it will be of a blurred composite perception. Leibniz’s analogy is of the roar of the waves of the beach: the seemingly singular sound of which one is conscious is in fact made up of a vast number of individual sounds of which one is not conscious–droplets of water smacking into one another.

For Leibniz, little perceptions are an important philosophical insight. First and foremost, this relates to one of Leibniz’s main general principles, the principle of continuity. Nature, Leibniz claims, “never makes leaps” (New Essays on Human Understanding, 56). This follows, Leibniz believes, from the principle of sufficient reason together with the idea of the perfection of the universe (consisting of something like plenitude). But the idea of little perceptions allows Leibniz to account for how such continuity actually happens even in everyday circumstances. The principle of continuity is very important for Leibniz’s physics (see “Specimen Dynamicum”) and turns up in Leibniz’s account of change in the monad (see below).

Second, little perceptions explain the acquisition of innumerable minor habits and customs, which make up a huge part of one’s distinctiveness as an individual personality. Such habits accumulate continuously and gradually, rather than all at once like decisions, and thus completely bypass the conscious will. Further, these little perceptions account for one’s pre-conscious connection with the world. For Leibniz, one’s relation with the world is not one just of knowledge, or of apperceived sensation. An individual’s relation with the world is richer than either of these, a kind of background feeling of being-a-part-of. (Thus, a thorough-going skepticism, however plausible at a logical level, is ultimately absurd.)

Finally, Leibniz’s idea of little perceptions gives a phenomenal (rather than metaphysical) account for the impossibility of real indiscernibles: there will always be differences in the petite perceptions of otherwise very similar monads. The differences may not be observable at the moment, but will “unfold in the fullness of time” into a discernible difference (New Essays on Human Understanding, 245-6).

c. Composites and Substantial Forms

According to Leibniz, everything one perceives which is a unified being must be a single monad. Everything else is a composite of many monads. A coffee cup, for example, is made of many monads (an infinite number, actually). In everyday life, one tends to call it a single thing only because the monads all act together. One’s soul, however, and the soul of every other living thing, is a single monad which “controls” a composite body. Leibniz thus says that, at least for living things, one must posit substantial forms, as the principle of the unity of certain living composites. (See, for example, “A New System of Nature.” The term is derived from Aristotle: that which structures and governs the changes of mere matter in order to make a thing what it is.) One’s soul, a monad otherwise like any other monad, thus becomes the substantial form of one’s otherwise merely aggregate body.

Furthermore, according to Leibniz, such composite bodies must be made of an infinite number of other inanimate as well as animated monads. This follows from the universe being the most perfect possible, which, again, seems to mean the richest in controlled complexity, in “plenitude.” Leibniz argues that it would be a great waste of possible perfection to only allow living beings to have bodies at that particular level of aggregation with which one is phenomenally familiar. (Perhaps Leibniz was understandably impressed by the different levels of magnitude being revealed by relatively recently invented instruments like the microscope and telescope.) Leibniz writes:

Every portion of matter can be thought of as a garden full of plants, or as a pond full of fish. But every branch of the plant, every part of the animal, and every drop of its vital fluids, is another such garden, or another such pool. […] Thus there is no uncultivated ground in the universe; nothing barren, nothing dead. (Monadology, §§67 & 69)

(Note: Although there is an extraordinary sublimity of such an image, Leibniz is often accused of making rather too much of an inadequate conception of the infinite.)

Further, the particular monads making up one’s body are constantly changing as one breaths in and out, sheds skin, etc., although not all at once. The substantial form is thus a unified explanation of bodily form and function. A mere chunk of stuff has, of course, an explanation, but not a unified one–not in one monad, the soul. Leibniz thus distinguishes four types of monads: humans, animals, plants, and matter. All have perceptions, in the sense that they have internal properties that “express” external relations; the first three have substantial forms, and thus appetition; the first two have memory; but only the first has reason (see Monadology §§18-19 & 29).

d. Innate Ideas

An innate idea is any idea which is intrinsic to the mind rather than arriving in some way from outside it. During this period in philosophy, innate ideas tended to be opposed to the thorough-going empiricism of Locke. Like Descartes before him–and for many of the same reasons–Leibniz found it necessary to posit the existence of innate ideas. At the metaphysical level, since monads have no “windows,” it must be the case that all ideas are innate. That is to say, an idea in one’s monad/soul is just another property of that monad, which happens according to an entirely internal explanation represented by the complete concept. But at the phenomenal level, it is certainly the case that many ideas are represented as arriving through one’s senses. In general, at least any relation in space or time will appear in this way.

Thus, one could imagine Leibniz being a thorough-going empiricist at the phenomenal level of description. This would amount to the claim that the metaphysically true innateness of all ideas is epistemologically useless information. Leibniz finds it necessary, therefore, to advance the following arguments in favor of phenomenally innate ideas:

(i) Some ideas are characterized by universal necessity, such as ideas in geometry, logic, metaphysics, morality, and theology. But it is impossible to derive universal necessity from experience. (Note that this argument is hardly new to Leibniz.)

(ii) An innate idea need not be an idea consciously possessed (because of “little perceptions,” for example). An innate idea can be potential, as an inclination of reason, as a rigid distortion in Locke’stabula rasa. (Here, Leibniz provides the famous analogy of the veins in the marble prior to the sculptor’s work.) It requires “attention” (especially in the form of philosophical thinking) to bring to explicit consciousness the operation, and to clarify the content, of these innate ideas.

(iii) Consider the possibility of foreseeing an event that is not similar to (and thus merely an associated repetition of) a past event. By using rational principles of physics, for example, one can analyze a situation and predict the outcome of all the masses and forces, even without ever having experienced a similar situation or outcome. This, Leibniz says, is the privilege of humans over animals (“brutes”), who only have the “shadow” of reason, because they can only move from one idea to another by association of similars (see Leibniz’s joke about empiricists in Monadology, §28).

monad

Thus, at the phenomenal level, Leibniz can distinguish between innate and empirical ideas. An empirical idea is a property of a monad which itself expresses a relation to some other substance or which arises from another internal property that is the expression of an external substance. Although the difference between empirical and innate is in fact an illusion, it does make a difference, for example, to the methodology of the sciences. This is similar to the distinction made above between the idea of truth (as the containedness of the predicate in the subject), and the pragmatic/methodological issue of how one comes to know that truth. The latter is not irrelevant, except to the foundation and definition of truth. (Leibniz’s most extensive discussion of innate ideas, not surprisingly, is in the New Essays on Human Understanding.)

10. Monadic Activity and Time

Correlate to the inter-connectedness of predicates in the complete concept is an active power in the monad, which thus always acts out its predicates spontaneously. Predicates are, to use a fascinating metaphor of Leibniz’s, “folded up” within the monad. In later writings such as the Monadology, Leibniz describes this using the Aristotelian/Medieval idea of entelechy: the becoming actual or achievement of a potential. This word is derived from the idea of perfections. What becomes actual strives to finish or perfect the potential, to realize the complete concept, to unfold itself perfectly as what it is in its entirety. This active power is the essence of the monad. Leibniz has several different names for this property (or closely related properties) of monads: entelechy, active power, conatus or nisus (effort/striving, or urge/desire), primary force, internal principle of change, and even light (in “On the Principle of Indiscernibles”).

This activity is not just a property of human souls, but of all types of monads. This inner activity must mean not only being the source of action, but also being affected (passivity), and of resisting (inertia). Again, what one calls “passivity” is just a more complex and subtle form of activity. Both a monad’s activity and resistance, of course, follow from its complete concept, and are expressed in phenomena as causes and as effects. Change in a monad is the intelligible, constantly, and continuously (recalling here the principle of continuity discussed above) unfolding being of a thing, from itself, to itself. “Intelligible” here means: (i) according to sufficient reason, not random or chaotic; and (ii) acting as if designed or purposed, as if alive–hence Leibniz’s contribution to the philosophical tradition of “vitalism.”

It is important to understand that this is not just a power to act, conceived as separable from the action and its result. Rather, Leibniz insists that one must understand that power together with (i) the sufficient reason of that power; (ii) the determination of the action at a certain time and in a certain way; (iii) together with all the results of the action, first as the merely potential and then as the actual. (See “On the Principle of Indiscernibles,” and Monadology §§11-15.) One is not, therefore, to understand it as a sequence of states, the individual bits of which are even ideally separable (except as an object of mere description for science), nor a sequence of causes and effects, again understood to be ideally separable (as if there could have been the cause without the effect). All this follows from the complete concept, the predicates of which are connected in one concept. Each state therefore contains the definite trace of all the past, and is (in Leibniz’s famous phrase) “pregnant” with the future.

But time, like space, is an illusion. How then is one to understand change without time? The important question is: what conception of time is being discussed? Just like space, Leibniz is objecting to any conception of time which is exterior to the objects that are normally said to be “in” time (time as an exterior framework, a dimension). Also, he objects to time as mere chronology, a conception of time as a sequence of “now points” that are ideally separable from one another (that is, not essentially continuous) and are countable and orderable separately from any thing being “in” them (that is, abstract).

However, in discussing relational properties above (and, in particular, Leibniz’s response to the Newton-Clarke argument about non-linear motion), “space” was in a sense preserved as a set of rules about the representative properties of monads. Here, too, but in a more profound way, “time” is preserved immanently to the monad. The active principle of change discussed above is immanent to monads, and no one state can be separated from all the others, without completely altering the thing in question into a thing that never changes (that has only the one state for all eternity). For Leibniz, the past and future are no more disconnected, in fact less, from the present than “here” is from “there.” Both distinctions are illusions, but temporal relations in a substance form an explanatory, intelligible sequence of a self-same thing. The principle of change becomes an original, internal and active power of the thing constantly becoming the thing that it is, as the spontaneous happening and internal principle of the particular order of things which make up that substance. In other words, substances unfold, become the things God always knew them to be, in a time that is nothing other than precisely that becoming.

Time, then, has three levels, according to Leibniz

  1. the atemporality or eternality of God;
  2. the continuous immanent becoming-itself of the monad as entelechy;
  3. time as the external framework of a chronology of “nows.”

The difference between (ii) and (iii) is made clear by the account of the internal principle of change. The real difference between the necessary being of God and the contingent, created finitude of a human being is the difference between (i) and (ii).

11. Influence

Leibniz’s mathematics, in parallel to Newton’s, made a significant difference in European science of the 18th century. Other than that, however, his contributions as engineer or logician were relatively quickly forgotten and had to later be re-invented elsewhere.

However, Leibniz’s metaphysics was highly influential, renewing the Cartesian project of rational metaphysics, and bequeathing a set of problems and approaches that had a huge impact on much of 18th century philosophy. Kant above all would have been unthinkable without Leibniz’s philosophy, especially the accounts of space and time, of sufficient reason, of the distinction between phenomenal and metaphysical reality, and his approach to the problem of freedom. Rarely did Kant agree with his great predecessor–indeed, rendering the whole Cartesian/Leibnizian approach conceptually impossible–but the influence was nevertheless necessary. After Kant, Leibniz was more often than not a mine of individual fascinating ideas, rather than a systematic philosopher, ideas appearing (in greatly modified forms) in for example Hegelian idealism, romanticism, and Bergson.

In the 20th century, Leibniz has been widely studied by Anglo-American “analytic” philosophy as a great logician who made significant contributions to, for example, the theory of identity and modal logic. In Continental European philosophy, Leibniz has perhaps been less commonly treated as a great predecessor, although fascinating texts by Heidegger and, much later, by Deleuze, show the continuing fertility of his philosophical ideas.

12. Editions of Leibniz

As noted above, Leibniz did not publish much in his lifetime which fits the familiar description of a philosophy book. Much was published, however, shortly after his death. But there remained for the dedication of future editors a huge estate of short papers, letters, drafts of letters, and notes. The standard edition of the works of Leibniz is the Akademie-Verlag of Berlin. The most comprehensive collection of these in English, together with some published material, is in Leibniz, Philosophical Papers and Letters, translated and edited by L. E. Loemker, 2 volumes, University of Chicago Press, 1956.

Several good, inexpensive and shorter anthologies of key texts:

  • Philosophical Essays. Edited and translated by Ariew and Garber. Hackett, 1989.
  • Philosophical Texts. Translated by Francks and Woolhouse. Oxford University Press, 1998.
  • Philosophical Writings. Edited by Parkinson, translated by Morris and Parkinson. Everyman, 1973.

Finally, editions in English of more specialized selections, the longer texts, and correspondences of Leibniz:

  • The Correspondence with Clarke. Edited by Alexander. Manchester University Press, 1956.
  • The Leibniz-Arnauld Correspondence. Edited and translated by Mason. Manchester University Press, 1967.
  • Logical Papers. Edited and translated by Parkinson. Oxford University Press, 1966.
  • The Political Writings of Leibniz. Edited and translated by Riley. Cambridge University Press, 1972.
  • New Essays on Human Understanding. Edited and translated by Remnant and Bennett. Cambridge University Press, 1996.
  • Theodicy. Edited by Farrer, translated by Huggard. Routledge and Kegan Paul, 1951.

Author Information

Douglas Burnham
Email: H.D.Burnham@staffs.ac.uk
Staffordshire University
United Kingdom

Legal Pragmatism

Legal pragmatism is a theory critical of more traditional pictures of law and, more specifically, judicial decision-making. The classical view of law offers a case-based theory of law that emphasizes the universal and foundational quality of specifically legal facts, the meticulous analysis of precedent and argument from analogy. Legal pragmatism, on the other hand, emphasizes the need to include a more diverse set of data and claims that law is best thought of as a practice that is rooted in the specific context at hand, without secure foundations, instrumental, and always attached to a perspective. A pragmatic stance towards jurisprudence offers many philosophical challenges to more traditional descriptions of the legal domain.

Table of Contents

  1. The Classical Picture of Judicial Decision-Making
  2. The Pragmatist’s Picture of Judicial Decision-Making
    1. Contextual
    2. Antifoundational
    3. Instrumental
    4. Perspectival
  3. Legal Pragmatism as a Descriptive Theory
  4. Legal Pragmatism as a Normative Theory
  5. Selected Bibliography

1. The Classical Picture of Judicial Decision-Making

The “classical picture” of legal argumentation and analysis dominates theoretical descriptions of judicial decision-making. It also is the dominant picture among legal practitioners. The classical model of legal argumentation is based upon the casebook method, the use of precedent and rigorous arguments from analogy. The casebook method assumes that the essential and exhaustive materials for a legal decision are summed up in the published opinions that accompany the conclusion of controversies in court. What an attorney or, more importantly, a judge is supposed to look to so as to render the proper verdict are reasons offered and situations analyzed in previous decisions that seem relevantly similar. The data for the decision is therefore the casebook. From a set of precedents, of written court opinions, is distilled a general set of rules and a specific verdict in the controversy before the court. Given a legal controversy, the practitioner (judge, attorney or the like) looks at previous cases for similar situations and then tries to distill the reasons that have been accepted as legally relevant for his or her client’s position. From these sources a legal conclusion should be drawn.

This classical picture of legal argumentation is historically attributed to former Harvard Law School Dean Christopher Columbus Langdell. Langdell put the first case book together as a educational tool, and bundled this type of book with a Socratic style of teaching that reigns supreme in legal practice and education today. Both the use of the casebook and the Socratic method presuppose a somewhat insular and rationalistic view of legal institutions. One of the most influential sources of the classical model of more recent vintage is offered by Edward Levi in An Introduction to Legal Reasoning. As he describes it legal reasoning is a “three-step process” where a “similarity is seen between cases; next the rule of law inherent in the first case is announced; then the rule of law is made applicable to the second case (Levi 1949, p. 2).” The implicit assumption is that once the similarity between cases is recognized, legal reasoning is simply a matter of making a logically valid deduction of a holding from a statement of the law (major premise) and a statement of the facts (minor premise). But by far the most influential current advocate of the main elements of the classical view is Ronald Dworkin.

Dworkin’s theory functions as a normative theory as well as a descriptive one. Taken as a descriptive claim the theory offers a portrait of what judges actually do when arriving at a legal conclusion. Dworkin’s own version of legal decision-making is entitled “law as integrity” (Dworkin, 1986). According to this theory, consistency with past judicial decisions should be emphasized as one of the most important legal virtues. He offers the picture of an imaginary creation, the “chain-novel,” to argue for the centrality of precedent in law. A chain-novel is a novel that is written one chapter at a time. After the creation of each new chapter, the novel is passed to a new author for further elaboration. Dworkin argues that in this enterprise we surely would want the new author to find as supremely important the need to cohere with and respect the content of the chapters already completed. An author that didn’t follow this rule would be not properly fulfilling his or her role. Dworkin then argues the same assumptions should rule the legal world and, therefore, the judge’s activity. That is, each case is directly analogous to a new chapter in the chain-novel. If one accepts the analogy, and there seems to be much too little analysis critiquing the acceptability of such an analogy, one gets a picture of a somewhat insulated legal system running upon a deep need for internal coherence. While Dworkin disavows the deductivist picture offered by Langdell, and allows in a moral dimension, in his attachment to traditional legal materials and practices he is clearly a proponent of the classical view. The legal pragmatist finds much to argue with in this picture of jurisprudence.

2. The Pragmatist’s Picture of Judicial Decision-Making

Legal pragmatists such as Daniel Farber, Thomas Grey, Margaret Radin and Richard Posner think that such a picture of jurisprudence is severely flawed. The legal pragmatist thinks that the classical view is overly legalistic, naively rationalistic and based upon misunderstandings of legal institutions. As opposed to the self-imposed limitations entailed by the classical view of judicial decision-making, legal pragmatists emphasize the eclectic nature and the diverse aims of the law. More specifically, legal pragmatists largely agree upon four main aspects of a pragmatist version of jurisprudence: (1) the important of context; (2) the lack of foundations; (3) the instrumental nature of law; and (4) the unavoidable presence of alternate perspectives.

a. Contextual

For the legal pragmatist all legal controversies are essentially attached to a specific and unique context. As Posner describes it, emphasizing the unavoidable presence of a specific context “disconnects the whirring machinery of philosophical abstraction from the practical business of governing our lives and our societies (Posner 1995, p. 463).” While there is some irony in a foremost proponent of neo-classical economics critiquing “philosophical abstractions,” Posner here correctly highlights the contextualist’s slogan of “return from abstractions to the concrete.” Certainly Dworkin and Langdell can be seen as overly fond of abstractions. In this case they mirror the actual practitioners. Tamanaha argues that the contrasting contextualism of legal pragmatism is best shown in Justice Holmes’ strategy whereby he used historical analysis to expose such seemingly timeless abstract legal concepts as being actually derived from contingent and context-specific needs (Tamanaha 1996, p. 315). Through this strategy the illusion of an eternal set of essential legal concepts is exposed as actually being a contingent creation of specific conflicts. While even legal formalists expect to apply concepts to a context, the legal pragmatist differs in seeing the concepts themselves as products of context. Because of this, the assumption that the legal concepts are applicable beyond their originating controversy is questioned.

The basic claim offered by the contextualist critique is that all legal decision-making, as well as any legal controversy, takes place in a specific and unique context that is so constitutive of the issues and the ultimate decision that the decision is distorted if seen from a non-contextual perspective. More importantly, the concepts used are questionable when applied between different controversies. Because of this, the abstractionist tendencies of the classical view of legal decision-making is thought undesirable and a view that emphasizes context, such as the legal pragmatist’s, to be superior.

b. Antifoundational

In addition to the need to emphasize context, the legal pragmatist also argues that the lack of foundations in legal decision-making must be recognized. Foundationalists hold that there is some core principle or principles that all legal decisions can be deduced from. While today very few will admit to an extreme view of such foundationalism, most legal theorizing assumes a more moderate foundationalist view. This moderate view argues that the judge has a sufficient set of tools from within the traditional materials of the classical view of legal decision-making (the case method) to make properly informed decisions in present cases. In other words, the moderate view sees cases as the necessary and sufficient foundation from which to deduce sufficiently analyzed legal conclusions.

A legal pragmatist sees this as descriptively wrong. First, “the idea that correct outcomes can be deduced from some overarching principle – or set of principles” is rejected (Cotter 1996, p. 2085). In place of deductive certainty is offered a picture of induction and an emphasis upon the creative problem-solving act of jurisprudence. Second, pragmatism in general stands for a rejection of the metaphysical picture of knowledge or decision-making that sees either as needing (or indeed having) a foundation. Knowledge and reason in law, as in any other domain, are seen as essentially open-ended concepts in need of continual testing and revision, and therefore law is an activity that would outgrow any purported foundations. So, if cases are thought to provide a foundation to legal decisions the legal pragmatist argues that they will not be inevitably up to the challenge of the next case, and therefore the foundationalist picture is at the very least incomplete.

c. Instrumental

While the classical view of legal decision-making emphasizes consistency with past decisions (the high value of respect for precedent), the instrumentalist advocates an investigation of the effects a decision might have and the capabilities of the legal institution. An instrumental view is therefore less interested in precedent and more based upon a “orientation towards the future (Rosenfeld 1996, p. 98).” That is, instead of an emphasis upon consistency with the essence of past decisions the pragmatist judge looks to the worldly implications of his or her decision. For instance, in a contract dispute a judge following the classical model of legal reasoning would look to antecedently held rights and obligations as shown in earlier cases in order to decide. A pragmatist judge, on the other hand, would see those issues as important but would also look at the greater implications for contract disputes in the future. This prospective attitude would bring in data as to the effects of the contract decision upon third parties, how a ruling would affect daily life, etc.

This orientation towards the future, and towards the empirical, means that for the legal pragmatist judge a whole new set of reasons become applicable and legally relevant when making a decision. While the advocate of the classical view can limit the reasons and facts to those allowed in the analogous cases, the cases accepted as precedents, the pragmatist judge must allow in other sorts of data, for instance sociological or economic data, in order to properly access the individual case at hand. Therefore, instead of emphasizing the primacy of consistency with precedent, a pragmatist of a legal bent emphasizes “the primacy of consequences in interpretation (Posner 1995, p. 252).”

d. Perspectival

Finally, the legal pragmatist adopts a stance that embraces the problem of perspective. Perspectivalism entails a suspicion of broad generalities and an acknowledgment of eclectic manners of description. As opposed to legal formalism, which “holds that determinate meanings exist in legal texts which may be discerned by reason and that objective, immutable principles simultaneously inform and transcend the practice of applying rules,” perspectivism emphasizes that all is messy, open-ended, and subject to revision in light of another perspective or further information (Shutkin 1993, p. 66). The acknowledgment of perspective entails that an overly deferential stance towards precedent and previously endorsed analogies could be unfairly restrictive towards new and possibly more inclusive descriptions.

As can be seen from the above, legal pragmatism offers a significant alternative to more traditional views of the legal domain. In fact, Stuart Scheingold argues that this lack of awareness of conflicting perspectives is a pervasive quality of traditional legal thought. As he puts it “Law professors and lawyers do not believe that they are either encumbered or enlightened by a special view of the world. They simply feel that their legal training has taught them to think logically. In a complex world, they have the intellectual tools to strip a problem, any problem, down to its essentials (Scheingold 1974, p. 161).” But if such an assumption is itself just one perspective, and one that obviously would distort any appreciation of other alternative perspectives, such ignorance of their own perspective would be an important vice to identify.

But important issues remain even if one finds such a description of legal pragmatism attractive. First, is legal pragmatism offered as a descriptive or a normative picture of jurisprudence? Second, does such a stance really offer any desirable features that the more classical picture of law cannot deliver or does it suffer from more intractable flaws?

3. Legal Pragmatism as a Descriptive Theory

Legal pragmatism can be characterized as a theory with descriptive pretensions. That is, as a theory as to what really happens in law, despite the ideological prevalence of the classical model. The descriptive legal pragmatist thinks that the classical picture of jurisprudence does not fit the facts of law, and that a pragmatist picture offers a better alternative. A legal pragmatism of this type looks to the legal realists as historical precursor. The legal realists claimed that law was a much sloppier and more political, as well as less reasonable, institution than those following the Langdell model admitted. In other words, that the reasons and data offered by the classical model of legal decision-making do not properly explain the actions of legal institutions. The legal pragmatist, therefore, looks for empirical evidence that argues against such a constrained view of decision-making.

Such evidence is not too hard to come by. First, it is clear that political actors do not treat the court system as neutral and functioning only upon respect for precedent. The full-blown fights over judicial appointments shows that actors outside of the court system view judges as politically important. Second, there is much empirical research that questions the assumption that precedent actually has the authority claimed for it. Some studies have claimed that decisions are more influenced by the political beliefs of the judge than by precedent (Goldman 1979, p. 208). Another study claimed an 85% success rate in prediction of future case decisions based upon a study of the judge’s “values” (Rohde/Spaeth 1976, p. 157). A further study concluded, “Supreme Court justices are not influenced by landmark precedents with which they disagree (Segal/Spaeth 1996, p. 971).” What the empirical data tends to show, then, is that the classical model does not explain the way actual judges decide cases very well.

On the other hand, the legal pragmatist model has difficulties as a descriptive theory as well. First, judges for the most part certainly act and write as if they are following precedent and the traditional legal materials. Second, it seems as if judges that were really pragmatic would have to be more rigorous in the following out of empirical implications of their decisions.

But this possibility raises many questions. For instance, would the current fear of statistics and sociological data that lawyers have as an rule have to be overcome in order for law to be actually and accurately described as pragmatic? Furthermore, there is the question of institutional competence. Does the legal system really have the resources to gather and digest all the data necessary to make an informed pragmatic decision? Does a judge have the capacity to digest all the relevant material in order to have any competent idea as to the real-world ramifications of any non-clerical decision? Would not a judge that described him or herself as a pragmatist judge be just as deluded as the judge that adopts the more traditional description?

4. Legal Pragmatism as a Normative Theory

Because neither option seems to accurately fit what really goes on in the jurisprudential domain, perhaps legal pragmatism should be better thought of as a normative theory. That is, perhaps it is a conceptual stance offered as a picture of what judicial decision-making should be.

In its normative mode legal pragmatism treats law and the legal realm as a tool useful for social purposes. The legal pragmatist opposes the a priori and rationalistic style of argumentation traditionally applied in legal argumentation by arguing that such methods have no valid claim to authority and, indeed, lack the tools necessary to justify their own adoption. The more traditional style of legal reasoning, that which keeps its attention upon cases, excludes broader and more scientifically warranted data. Therefore the user of the classical theory can offer not much more than a heart-felt and resounding exclamation – “it works” – when confronted with the question of the empirical effectiveness of a decision. All pragmatist thought brings with it a suspicion of unquestioned and non-experimental pictures of reason. Indeed the pragmatist is liable to see in such a claim something akin to the statement “because God commanded it.” This “it works” exclamation is an example of just such an a priori, rationalist and non-experimental claim. What exactly does it work in comparison to? For the pragmatist such statements only have meaning if they can be tested, and the classical picture of jurisprudence doesn’t have the tools with which to test such claims in each case or on a more global level.

On the other hand, adoption of a pragmatist theory offers the ideal of a system rooted in experience and the experimental method. As opposed to the overly rationalistic and insular picture of legal decision-making offered by the classical legal theorist, the legal pragmatist argues for a more empirical jurisprudence. The normative argument, in outline, is that a jurisprudential theory rooted in sensitivity to context, a theory that functions without a belief in false foundations, one that is judged along explicitly instrumental criteria and that also acknowledges the inevitability of perspective, is better suited to bring about justice in a complex and unpredictable world than a theory that rests upon untested essentialistic assumptions and a non-experimental and universalistic view of reason.

5. References and Further Reading

Brint, Micheal and William Weaver, Pragmatism in Law and Society (Boulder: Westview Press, 1991)

Cotter, Thomas F., “Legal Pragmatism and the Law and Economics Movement,” 84 Georgetown Law Journal 2071 (1996)

Dickstein, Morris, The Revival of Pragmatism: New Essays on Social Thought, Law, and Culture (Durham: Duke University Press, 1998).

Dworkin, Ronald M., Law’s Empire (Cambridge: Harvard University Press, 1986)

Farber, Daniel, “Reinventing Brandeis: Legal Pragmatism for the Twenty-First Century,” 1995 University of Illinois Law Review 163 (1995)

Goldman, Sheldon, “The Effect of Past Judicial Behavior on Subsequent Decision-Making,” 19 Jurimetrics Journal 208 (1979)

Grey, Thomas G., “Freestanding Legal Pragmatism,” 18 Cardozo Law Review 21 (1996)

Grey, Thomas G., “Holmes and Legal Pragmatism,” 41 Stanford Law Review 787 (1989)

Levi, Edward, H., An Introduction To Legal Reasoning (Chicago: University of Chicago Press, 1949)

MacCormick, Neil, Legal Reasoning and Legal Theory (Oxford: Clarendon Press, 1978)

Posner, Richard, Overcoming Law (Cambridge: Harvard University Press, 1995)

Posner, Richard, “Pragmatic Adjudication,” 18 Cardozo Law Review 1 (1996)

Radin, Margaret Jane, “The Pragmatist and the Feminist,” 63 Southern California Law Review 1699 (1990)

Rohde, David W., and Harold J. Spaeth, Supreme Court Decision Making (San Francisco: W.H. Freeman, 1976)

Rorty, Richard, “The Banality of Pragmatism and the Poetry of Justice,” in Pragmatism in Law and Society

Rosenberg, Gerald D., The Hollow Hope: Can Courts Bring About Social Change? (Chicago: The University of Chicago Press, 1991).

Rosenfeld, Michel, “Pragmatism, Pulralism and Legal Interpretation: Posner’s and Rorty’s Justice Without Metaphysics Metts Hate Speech,” 18 Cardozo Law Review 97 (1996)

Segal, Jeffrey A., and Horold J. Spaeth, “The Influence of Stare Decisis on the Votes of Supreme Court Justices,” 40 American Journal of Political Science 971 (1996)

Scheingold, Stuart A., The Politics of Rights (New Haven: Yale University Press, 1974)

Shutkin, William Andrew, “Pragmatism and the Promise of Adjudication,” 18 Vermont Law Review 57 (1993)

Smith, Steven D., “The Pursuit of Pragmatism,” 100 Yale Law Journal 409 (1990)

Tamanaha, Brian Z., “Pragmatism in U.S. Legal Theory: Its Application to Normative Jurisprudence, Sociolegal Studies, and the Fact-Value Distinction, 41 American Journal of Jurisprudence 315 (1996)

Wells, Catharine P. “Improving One’s Situation: Some Pragmatic Reflections on the Art of Judging,” 49 Washington and Lee Law Review 323 (1992)

 

Author Information

Brian Edgar Butler
Email: bbutler@unca.edu
University of North Carolina at Asheville
U. S. A.

Middle Platonism

The period designated by historians of philosophy as the “Middle Platonic” begins with Antiochus of Ascalon (ca. 130-68 B.C.E.) and ends with Plotinus (204-70 C.E.), who is considered the founder of Neoplatonism. The Middle Platonic philosophers inherited the exegetical and speculative problems of the Old Academy, established by Plato and continued by his successors Speusippus (ca. 407-339 B.C.E.), Xenocrates (ca. 396-314 B.C.E.) , and Polemo (ca. 350-267 B.C.E.). Many of these problems centered about the interpretation of Plato’s so-called Unwritten Doctrines, inspired by Pythagorean philosophy and involving a primordial, generative pair of first principles—the One and the Dyad—and how to square this doctrine with the account of creation given in the Timaeus dialogue. This was also the main concern of the Neopythagorean philosophy that emerged with the work of Ocellus Lucanus in the second century B.C., whose treatise On the Nature of the Universe shows the influence of both Platonic and Aristotelian conceptions.

The Academy took a new turn after the founding of the Stoic school by Zeno of Citium (334-262 B.C.), a pupil of Polemo. Arcesilaus (ca. 315-241 B.C.E.) is regarded as the founder of the New Academy, known for its skepticism. Later, Antiochus asserted the fundamental harmony of the Platonic, Peripatetic (Aristotelian), and Stoic philosophies, and Eudorus of Alexandria (fl. ca. 25 B.C.E.) elucidated the highly influential teleological dogma of Platonism: “likeness to god as far as possible” (Plato, Theaetetus 176b). Other important Middle Platonists were Philo of Alexandria (ca. 30 B.C.E.—45 C.E.), who interpreted Hebrew Scripture along Platonic lines, exercising an immense influence on developing Christianity; Plutarch of Chaeronea (ca. 45-125 A.D.) whose treatise De Iside et Osiride (“On Isis and Osiris”), with its Greco-Egyptian syncretism, is an important example of the religious tendencies of later Middle Platonic philosophy; and Numenius of Apamea (fl. 150-176 C.E.) whose highly syncretic philosophy exercised a profound influence on Plotinus, who was accused of plagiarizing Numenius.

In addition to these “mainstream” philosophers, the Middle Platonic period includes the more esoteric systems of the Gnostics, the Corpus Hermeticum and the Chaldaean Oracles. All of these involved an “astral piety” with a notion of planetary powers and intra-cosmic daemons mediating between humanity and the highest cosmic deities.

Table of Contents

  1. Plato’s “Unwritten Doctrines”
  2. The Old Academy
    1. Speusippus
    2. Xenocrates
    3. Polemo
    4. Other Important Members of the Old Academy
  3. Skepticism and the New Academy
    1. Arcesilaus
    2. Carneades
  4. The Beginning of Middle Platonism
    1. Philo of Larissa
    2. Antiochus of Ascalon
    3. Posidonius
  5. Neopythagorean Philosophy
    1. Ocellus Lucanus
    2. Timaeus Locrus
    3. Archytas
    4. Eudorus of Alexandria
  6. Later Middle Platonism
    1. Philo of Alexandria
    2. Plutarch of Chaeronea
    3. Numenius of Apamea
    4. Albinus
  7. “Esoteric” Platonism
    1. Hermeticism
    2. Gnosticism
    3. The Chaldaean Oracles
  8. Conclusion
  9. References and Further Reading
    1. Primary Sources
    2. Secondary Sources

1. Plato’s “Unwritten Doctrines”

Platonic philosophy did not originate solely with the Dialogues of Plato. There is ample evidence from antiquity that Plato taught certain doctrines within the Academy that he did not write down; moreover, these doctrines were sufficiently vague as to cause divergent interpretations even among the first three successors of Plato in the Academy. It is these doctrines — perhaps even moreso than the Dialogues (excepting the Timaeus) – from which are derived the problems and approaches characteristic of Middle Platonic thought. A basic outline of these doctrines follows.

Drawing upon Pythagorean mathematical theory, Plato began his metaphysical schema with a pair of opposed first principles, the One and the Indefinite Dyad. The One is the active principle which imposes limit on the indefinite or unlimited Dyad, thereby laying the ground for the orderly construction of the cosmos. Through this influence of the One upon the Dyad numbers are generated, that is, the Decad, which in turn generates all other numbers. The most important of these primordial numbers is the tetraktys, numbers one through four, the sum total of which is ten, the Decad. The tetraktys also was interpreted by Plato as generating the four mathematical dimensions, with the number one corresponding to the point, two to the line, three to the plane, and four to the solid. Between the Ideal-Numbers or Decad Plato places the World-Soul, corresponding roughly to the Demiurge of the Timaeus. The World-Soul mediates between the Ideal realm and matter, projecting the four dimensions on base matter in order to form the four elements, Fire, Air, Water, and Earth. This basic schema of a first and second principle, and third intellectual and craftsmanly principle responsible for forming the cosmos, was to have an immense influence on the history of Greek philosophy, especially the period reviewed in this article. The following cryptic passage from the Platonic Second Letter (generally accepted as from Plato’s hand in antiquity) had a profound effect on the imagination of Platonic and Pythagorean philosophers of the Middle and Neoplatonic periods. This passage, though more than likely written by a student of Plato, nevertheless provides a hint of what the teacher’s more esoteric teachings may have been like.

Upon the king of all do all things turn; he is the end of all things and the cause of all good. Things of the second order turn upon the second principle, and those of the third order upon the third (312e, tr. G.R. Morrow, in J.M. Cooper, ed., 1997).

Among the many problems inherited by Plato’s successors and their students and colleagues are included the questions of whether the creation of the cosmos, as described in the Timaeus, took place in time or is atemporal, and the manner in which Demiurge of that dialogue relates to the World-Soul of the unwritten doctrines.

2. The Old Academy

The term “Old Academy” is used to refer to the educational institution established by Plato in Athens, and run by his three immediate successors. This is to differentiate it from the “New Academy,” so-called because of its turn toward a more sceptical mode of philosophizing.

a. Speusippus

After the death of Plato the headship of the Academy passed to his nephew Speusippus (ca. 407-339 B.C.), according to Plato’s wishes. Speusippus seems to have revised Plato’s doctrine of the One and the Dyad by placing the One above Intellect, declaring that it is superior to Being and “free[ing] it even from the status of a principle” (fragment in Klibansky 1953, tr. Dillon 1977, p. 12). In this he differed, as Dillon observes, “with all official Platonism up to Plotinus” (p. 18). The result of this difference is that the Dyad is now considered the sole productive source of multiplicity, from which all other levels of reality derive. Speusippus elaborated a multi-layered cosmic schema in ten stages or “grades” (Zeller 1955, p. 169) of Being: 1.) the supreme One beyond Being, 2.) the Indefinite Dyad or the Many (producer of multiplicity), 3.) Number (beginning with three, the first stage of multiplicity), 4.) the Soul, source of all geometrical extension, 5.) the celestial bodies, 6.) all ensouled beings, including irrational animals and plants, 7.) Thought, and the seven planets and the seven Greek vowels, 8.) instinct and the passions, 9.) motion, 10.) the Good, and repose. By locating the Good at the end of this emanative process – which is properly understood, as Zeller (1955, p. 169) writes, as “eternal principles of things and their stages of development” – Speusippus is not denying the ontological supremacy of the One, rather he is recognizing the One as the most simplex and primordial of all realities, and as “the cause of goodness and being for all other things” (Dillon 1977, p. 12). According to Speusippus the cosmos is eternally generated; therefore, he interpreted the creation account in the Timaeus as intended for purposes of instruction, and not to be taken literally. In the sphere of ethics Speusippus seems to have taught that happiness is leading a moral life, which likely meant for him a median between pleasure and pain, both of which, according to Aulus Gellius (Noctes Atticae IX, 5.4), Speusippus considered to be evils.

b. Xenocrates

Xenocrates (ca. 396-314 B.C.) succeeded Speusippus as headmaster of the Academy, and held that post for a quarter of a century (339-314 B.C.), until his death. He departed from Speusippus in identifying the One as Intellect or Nous, which he also named “Father”; the Dyad he called “Mother.” There is evidence that Xenocrates identified the Dyad with primordial Matter (fragment 28; Dillon 1977, p. 24), and considered it an “evil and disorderly principle” (Dillon, p. 26). Xenocrates divided the sensible universe into the realm above the moon (the supra-lunar) and the realm below the moon (the sub-lunar). It is unclear whether he added a further division to include a purely intelligible realm, or considered the One and the Dyad as occupying the highest sphere above the stars. Above the moon there exists the seven planets, which Xenocrates considered to be divine, along with the stars and the pure fire that is the base element of the universe. The realm below the moon he believed to be occupied by daemons. He held a theory that there are two types of gods, Olympians and Titans, the former born of heaven and the latter of earth (fragments 18 and 20; Dillon, pp. 26-27, also see Zeller 1955, p. 170). Theophrastus, the pupil of Aristotle, gave credit to Xenocrates for his exhaustive account of the cosmos, distinguishing him from Speusippus and others who only provided an account of the One and the Dyad, barely touching upon anything else besides numbers and geometrical shapes. Xenocrates, he says, discoursed not only on divine things and mathematicals, but on objects of sense-perception as well (Theophrastus, Metaphysics 6a.23-6b.9). Perhaps the most important contribution of Xenocrates to the history of Platonism (and all of philosophy as well) is the doctrine that the Ideas are thoughts in the mind of the One (Dillon, p. 29). Xenocrates made a distinction between practical and scientific wisdom, and taught that happiness is to be found in virtue and the means conducive to it (Zeller, p. 170).

c. Polemo

Xenocrates was succeeded by Polemo (ca. 350-267 B.C.), who became headmaster of the Academy upon the latter’s death in 314. Eduard Zeller, in his seminal work on the history of Greek philosophy, remarks that there is a scarcity of original thinking in the work of Polemo (Zeller 1955, p. 170). This is unfair, not only because we do not possess any works of Polemo by which to accurately judge him, but because if one looks carefully at the surviving evidence, Polemo’s importance for the emergence and development of Stoic philosophy will be seen. While it is true that Polemo’s metaphysical schema was likely dependent upon his predecessors, with little or no development, he did make at least two important contributions to ethics, both of which influenced emerging Stoicism. The first is the concept of self-sufficiency (autarkheia), which Polemo identified as the key to happiness. He understood self-sufficiency in respect of virtue, and not in terms of material wealth or bodily pleasure, teaching that one could be happy even in the absence of all physical comfort, provided that one had achieved virtue. The second is the concept of conciliation or appropriation (oikeiôsis), which was of immense importance for later Stoic philosophers. The basic presumption of this doctrine is that all living beings strive for conciliation with their environment, and that this necessarily involves an existence in accordance with nature which, for human beings, is a virtuous existence. There is evidence in Cicero that Polemo taught such a doctrine, but we have no way of knowing whether he actually used the term oikeiôsis.

d. Other Important Members of the Old Academy

Besides the headmasters of the Old Academy discussed above, other pupils of Plato made contributions to Platonic philosophy. The astronomer and mathematician Philip of Opus, believed by most scholars to be the author of the pseudo-Platonic dialogue Epinomis, taught that the greatest wisdom is to be attained through contemplation of the divine celestial bodies. However, he placed importance as well on the intermediary capacity of the daemons in this endeavor. Following Plato in the Laws (896e-898d) he taught a doctrine of an evil World-Soul. Eudoxus of Cnidus was a pupil of Plato as well as of the Pythagorean Archytas. He believed that the Forms reside in material mixtures, and that pleasure is the highest good. It is likely that Plato wrote his Philebus in response to Eudoxus’ theory of pleasure. Heraclides of Pontus was an astronomer who borrowed the Pythagorean theory of the diurnal revolution of the earth, and revised it with his own theory that Mercury and Venus revolve around the sun. He held a materialistic view of the soul, believing it to be composed of aether, the purest element. Finally, Crantor of Soloe (ca. 330-270 B.C.) achieved fame as author of the first commentary on Plato’s Timaeus, and for his widely read treatise On Grief, an early example of the consolation genre of writing found much later in Boethius. Against the Stoics he argued that all pain, including grief, is a necessity, and is to be controlled rather than eradicated (Dillon, p. 42, Zeller pp. 171-172). He followed Plato and the Pythagoreans in regarding life as a punishment, and philosophy as practice for death.

3. Skepticism and the New Academy

The designation “New Academy” is intended to represent the shift away from exegesis of Plato’s doctrines and metaphysical speculation, toward a more sceptical mode of philosophizing. The following two philosophers are its major representatives.

a. Arcesilaus

Scholars generally consider the “New Academy” to have begun with Arcesilaus (ca. 315-240 B.C.) who, under the influence of Pyrrhonian skepticism called into question the idea that knowledge and certainty is obtainable through sense-perception, denying that even reason or understanding is capable of arriving at uncontestable truth. In this he was attacking Stoic cosmology and theology, with its belief in an eternally ordered universe pervaded by reason. His skepticism was so thorough that he refused even to declare the validity of his own sceptical stance. He did not, however, do away with all criteria for living a proper life, considering perception as linked to the will, and rational activity as following a judgment based on probability of desired effect.

b. Carneades

Carneades (214-129 B.C.) followed Arcesilaus in his sceptical approach, and honed the latter’s notion of probability, recognizing three “grades” of probability involving increasing levels of validation based on mutual confirmation of related representations (Zeller, p. 264). Carneades, like Arcesilaus, attacked Stoic doctrine, especially the idea of “conceptual representations” (phantasia katalêptikê), arguing that there exists no representation that cannot be convincingly reproduced by artificials means; therefore, we can never be certain that the representation we are experiencing is true or authentic. He likely followed Arcesilaus in the realm of ethics, adopting judgment based on probability as the guide for practical life.

4. The Beginning of Middle Platonism

Scholars generally consider the Middle Platonic period to have begun with the work of Antiochus of Ascalon (d. 68 B.C.), who was responsible for overhauling the increasingly stifling skepticism of the New Academy. His teacher was Philo of Larissa (fl. 88-79 B.C.), who also taught Cicero. We will examine briefly the teachings of Philo, before moving on to Antiochus. We will then discuss Posidonius who, though a Stoic rather than a Platonist, contributed much to the development of Middle Platonic philosophy.

a. Philo of Larissa

Unlike his predecessors in the New Academy, Philo of Larissa did not consider knowledge an impossibility, although he did follow them in criticizing the Stoic doctrine of “conceptual representations” as the key to knowledge. However, he sought not to deny all possibility of knowledge, but rather to establish a middle course between mere probability, and knowledge. He believed that there is a level of obviousness where skepticism must give way to conviction, although this conviction must not be regarded as absolute knowledge. Philo’s main concern was with ethics, and he used his middle ground approach to formulate a detailed ethical theory in a manner never attempted by Arcesilaus or Carneades.

b. Antiochus of Ascalon

The fundamental agreement of Platonic, Stoic, and Peripatetic philosophy was asserted by Antiochus of Ascalon, who returned to the basic approach, if not the actual doctrines, of the Old Academy. This notion of agreement of the earlier philosophers on matters of doctrine served as a way for Antiochus to get past the skepticism of his teacher, in order to establish his own philosophical stance. What we know of Antiochus’ doctrines is contained in various writings of Cicero, usually placed in the mouth of Antiochus’ influential pupil Varro. No writings of Antiochus survive; therefore, as with all of the philosophers discussed so far – with the exception of Plato – we must rely solely on reports by contemporaries, near contemporaries, and later writers. Nevertheless, it is possible to reconstruct with some confidence the doctrines put forth by Antiochus.

Antiochus, likely for the first time since the advent of academic skepticism, busied himself with the interpretation of Plato’s dialogues, notably the Timaeus, as the Old Academics had done, thereby providing us with the first example of what would later become a full-fledged systematic approach in the later Middle Platonists. Antiochus rejected the Aristotelian “fifth element” and returned to the four basic elements – Fire, Air, Water, and Earth – as the primary material principles of the cosmos. Matter (hulê) is the substrate of these elements. Following Stoic philosophy, Antiochus taught that the stars and planets, as well as minds, are composed of the purest fire. Even god is composed of this fire and does not transcend the cosmos, but occupies its highest reaches. He combined the Demiurge of the Timaeus and the World-Soul of the Unwritten Doctrines into an intra-cosmic, unitive, rational force which he termed Logos. Antiochus denied that the Platonic Ideas or Forms transcend the cosmos, asserting instead that they are conceptions common to all humanity, constructed by way of analogies (similitudines, analogiai), and existing only within the mind of each rational being, including god (Cicero, De oratore 8 ff.). Like Xenocrates earlier, Antiochus understood the Ideas as thoughts in the mind of god (Dillon, pp. 94-95).

With the rise of Stoicism as the most influential dogmatic philosophy of the Hellenistic era, the problem of fate versus free will came to the fore, and Antiochus responded by rejecting fate (heimarmenê) as an efficient cause, relegating it to the class of “material cause” (aition prokatarktikon), along with time, matter, and other things that are necessary, but not sufficient, to produce an effect. This allowed for efficient causes to arise from human initiative, and preserved the freedom of human activity, or at least response, within an ordered cosmos.

Again following Xenocrates, Antiochus expressed a belief in daemons, who inhabit the sub-lunar realm (the supra-lunar realm being reserved for the divine celestial bodies). He also appears to have believed in divination, not only through the motion of the celestial bodies, but by way of dreams, oracles, beasts, and even inanimate objects (Cicero, De divinatione I.12 ff.; Dillon, p. 89).

While not a strikingly brilliant philosopher – at least as far as we can tell from surviving accounts of his doctrines – Antiochus is responsible for articulating themes that would later become prominent in Platonic philosophy. His notion of the Ideas as thoughts in the mind of god was accepted as authentic Platonic doctrine by Philo of Alexandria, who gave it his own unique spin, as we shall see; the problem of the Demiurge and the World-Soul was taken up by Numenius in rather gnosticizing fashion, as we will discuss; and Antiochus’ teaching regarding divination and daemons is a precedent of the Neoplatonic system of Iamblichus (who, due to his later date, will not be discussed in this article).

c. Posidonius

Although not a Platonist, strictly speaking, but a Stoic, Posidonius (135-51 B.C.) nevertheless exercised an immense influence on the development of Middle Platonic thought. Among his many works, all unfortunately lost except for a few scant fragments, is a commentary on the Timaeus, which was likely the main source of his influence on Platonism. Posidonius recognized two principles in the cosmos, one active and one passive: god and matter, respectively. In this he was following Plato’s doctrine of the mixing bowl, as put forth in the Timaeus. In his cosmology, Posidonius posited, as did Platonists like Xenocrates and Antiochus, a bipartite cosmos consisting of a supra- and a sub-lunar realm. He considered the supra-lunar realm to be imperishable, and the sub-lunar perishable, dissolving into the void (kenon) outside the cosmos during the conflagration (ekpurôsis), after which it is reconstituted anew (this being a variation of standard Stoic doctrine going back to Chrysippus). Posidonius understood human beings as forming a bridge between these two realms, and theorized that souls originate in the sun and travel to earth by way of the moon (Zeller, pp. 269-270). Some of these souls become humans while others become daemons or heroes, a doctrine developed in his treatise On Heroes and Demons, which had an immense influence on later Platonists, especially Plutarch.

Posidonius believed that the cosmos is held together by cosmic sympathy (sumpatheia), and this formed the basis for his ideas concerning fate and divination (cf. Cicero, De divinatione I, and De fato). He believed the cosmos to be controlled by three forces, Zeus, Nature, and Fate, and that human beings cannot escape the causality that is the source of cosmic unity. This led Posidonius naturally to a belief in astrology, and there is ample evidence that he practiced it as well (fragments 111, 112, Edelstein-Kidd). He also theorized regarding other forms of divination, and from his doctrine of cosmic sympathy arrived at the conclusion that all life and events in the cosmos are connected, making divination from an animal’s liver, for example, possible. Posidonius asserted the immortality of the soul and its ability to exist apart from the body. In ethics he largely followed Plato, teaching that the passions are not to be eradicated but controlled (Zeller, p. 270, Dillon, pp. 109-112).

5. Neopythagorean Philosophy

During the late second century and early first century B.C. a number of writings began to appear that were attributed to various historical followers of Pythagoras. This renewed interest in Pythagorean philosophy likely grew out of the desire to find harmony between the three major philosophical schools of the era. The writings compromising the Pseudo-Pythagorica, as the collection of about ninety treatises by fifty authors is often called, contain elements of Platonism, Stoicism, and Peripatetic philosophy, as well as typical Pythagorean number theory and cosmological motifs, such as the eternity of the world. There is little, in fact, to differentiate Neopythagoreanism from Middle Platonism, as one can easily find Pythagorean elements in the work of thinkers commonly designated as Platonists, and vice-versa. Following John Dillon in his definitive study of Middle Platonism, however, I am making the distinction for the sake of scholarly rigor.

a. Ocellus Lucanus

Of the writings of Ocellus Lucanus (second century B.C.) we possess a treatise On the Nature of the Universe and a fragment of a lost treatise On Laws. Ocellus was concerned with maintaining the doctrine of the eternity of the world against the Stoic doctrine of periodic conflagration and reconstitution of the universe. Since there are only two types of generation – from a lesser to a greater state and vice-versa – Ocellus argued that it is just as absurd to state that the universe began in a lesser state and progressed to a greater, as it is to state the opposite, for both statements imply either a growth or a diminution, and since the cosmos is whole and self-contained (so he insisted) there is no place into which it can either grow or diminish. Posidonius’ doctrine of a void into which the cosmos periodically dissolves held no place in Ocellus’ philosophy.

Although positing the eternity of the cosmos, Ocellus nevertheless admitted the obvious, that generation and dissolution occurs here on earth. Like Xenocrates and other Platonists, Ocellus understood the cosmos as divided in two parts, the supra-lunar and the sub-lunar, the gods existing in the former and daemons and humans in the latter. It is only in the sub-lunar regions, he argued, that generation and decay occurs, for it is in this region that “nonessential” beings undergo alteration according to nature. The generation that occurs in the sub-lunar realm is produced by the supra-lunar realm, the primary cause being the sun, and the secondary causes the planets. He apparently did not believe in a transcendent realm beyond the material cosmos.

Ocellus’ work is one of the earliest examples of Hellenistic-era astrological doctrine. At the end of his On the Nature of the Universe he entreats prospective parents to be attentive in choosing times of conception, so that their children may be born noble and graceful; and in the fragment On Laws he declares that the active supra-lunar realm governs the passive sub-lunar realm. In his ethical doctrine Ocellus adhered to strict Pythagorean asceticism, holding that sexual intercourse is to be reserved for reproductive purposes only, and that alchoholic beverages are to be avoided.

b. Timaeus Locrus

Scholars are not certain whether the eponymous Timaeus Locrus of Plato’s dialogue ever really existed. In any case, the treatise On the World and the Soul attributed to this person is an early to mid-first century B.C. work containing an epitome of the Timaeus dialogue, though with some omissions. Given the renewed interest in Pythagorean philosophy in this period, it is likely that the work was widely read. Though containing clear Pythagorean motifs, such as a table of musical tones and their respective numbers, and a section elaborating the geometrical construction of the cosmos, the treatise is, as Thomas Tobin (1985) has demonstrated, a Middle Platonic interpretation of the highly Pythagorean-influenced Timaeus dialogue.

According to “Timaeus” the universe has two causes: Mind, which governs rational beings, and Necessity, which governs bodies and all irrational beings. Interpreting Plato literally, “Timaeus” affirmed the temporal creation of the cosmos, and while stating that the cosmos is capable of being destroyed by the one who created it (the Demiurge), he denied that it would ever actually be destroyed, since it is divine and the Demiurge, being good and divine himself, would never destroy divinity. In what is possibly a later addition to the text, “Timaeus” assigns numerical values to the various proportions produced by the mixture of the Same and the Different (these being the two opposing forces, productive of all motion, growth, and change in the cosmos, as discussed in the Timaeus dialogue). The substratum of all generated things is matter, and their reason-principle or logos is ideal-form. “Timaeus” then proceeds with an account of the geometrical proportions of the cosmos, finally declaring that the image of the cosmos is the dodecahedron, since that is the closest approximation to the perfect sphere, which is the image of purely intellectual reality.

According to “Timaeus,” the Demiurge initiated the creation of souls, but then handed over completion of the task to Nature (hypostatized in the feminine) who completed their creation and introduced them into into the cosmos, some by way of the sun, others the moon, and yet more from the planets that wander according to the principle of the Different (the source of the irrational part of the soul). Each soul, however, received a portion of the principle of Sameness, which became the rational part of the soul. A soul who received more of this principle would have a happier fate than one receiving less. Here again, as in Ocellus, we have a relatively early witness of astrological doctrine within Hellenistic philosophy. The ethical doctrine of “Timaeus” involved a taming of the passions and the moderation of bodily pleasures, the final goal being a state of repose conducive to the contemplation of divine things.

c. Archytas

Several fragments purporting to be from the hand of Plato’s contemporary, the Pythagorean Archytas of Tarentum (though in fact composed some time during the late second or early first century B.C.) are of importance for Middle Platonic philosophy, notably the fragments of a treatise On First Principles where a principle is posited above the One and the Dyad, out of which the primordial pair is said to have emerged. “Archytas” places mind above soul as the most divine part in man, though he departs from standard Pythagoreanism by assigning the circle rather than the tetragon as the representation of the soul, since the soul is self-moved (the circle, with no definite beginning or end point, symbolized endless movement). He believed that there is a space outside of the material cosmos in which the cosmos is contained. Time, according to “Archytas” is continuous, not a series of units or parts as in number, speech, and music, and he apparently made some distinction between psychic time (pertaining to the soul) and natural time, though what this distinction entailed is not clear. In ethics he is no innovator, simply stating the standard notion that happiness depends on virtue, but virtue is independent of all other things.

d. Eudorus of Alexandria

Eudorus of Alexandria (fl. ca. 50-25 B.C.) was much concerned with ethics, which he considered the first subject of philosophy to be studied. He defined ethics not in terms of existence in accordance with nature, but rather in terms of striving for a proper end (telos), which he considered to be “likeness to god as far as possible” (homoiôsis theô kata to dunaton). This phrase is from Plato’s Theaetetus (176b) where the qualification “as far as possible” simply means to the extent that a mortal can achieve a divine state. Eudorus, however, interpreted it as referring to the intellect, that part of the soul most closely akin to the divine (cf. Dillon, pp. 122-123). This conception of ethics led Eudorus to depart from earlier Platonists like Antiochus who considered physical pleasures as contributing to, or at least enhancing, the happiness that depends on virtue, and declare that true happiness is of the intellect alone, although he does seem to have allowed a preliminary role for physical pleasure in achieving happiness (Dillon, p. 124).

In metaphysics and cosmology Eudorus follows largely Pythagorean lines, though some Stoic conceptions are present in his thought. He departed from earlier Pythagorean philosophy and, in a move likely inspired by “Archytas,” posited a supreme principle above the One and the Dyad, even positing this principle as the producer of matter. Traditional Pythagorean philosophy posited a primordial pair of principles, Limit and Unlimited, with no supreme One above this pair. The monism of Eudorus’ doctrine was particularly attractive to the Jewish Platonist Philo of Alexandria in his quest to square Old Testament theology with Platonic philosophy.

Eudorus rejected the Aristotelian “fifth element” and followed Stoic cosmology in positing pure fire as the base element of he heavens. He considered the stars and planets to be divine, and insisted that the world is eternal. Eudorus brought together the apparently opposing views of Xenocrates and Crantor regarding the origin of numbers; the former stating that they are produced by the One and the Dyad, the latter that they are produced in the mind of the World-Soul as he contemplates the Forms. Eudorus taught that number was generated simultaneously with the World-Soul, who was responsible for translating the smallest multiplicity (the number three) into solid bodies (the number four).

Finally, we must note Eudorus’ revision of Aristotle’s Categories, which was to exercise an immense influence on later Platonists, especially Porphyry, who endeavored to find a harmony of doctrine in Plato and Aristotle. Eudorus interpreted substance (ousia) as strictly material substance, and concluded that Aristotle’s categories only apply to the physical world, not to the purely intellectual realm, where Platonists have always located supreme reality.

6. Later Middle Platonism

Notable Middle Platonists after Eudorus include Moderatus of Gades (first century A.D.), a self-conscious Pythagorean who considered Plato a mere student of Pythagoras. During the same period Thrasyllus, Nero’s court astrologer, prepared a new edition of Plato’s Dialogues, arranged in tetralogies, as well as an edition of the collected works of Democritus. Interesting in a different manner is Apollonius of Tyana, who had the reptuation of a magician and wonder-worker, and is a prime example of the prophet-figures influenced by Platonism, Pythagoreanism, and sundry other intellectual streams. Another example of such a figure is Simon Magus (mid-first century A.D.) who wandered about working miracles with a prostitute claiming to be Divine Wisdom Herself. Simon was considered the first Gnostic by the early Christian heresiologists.

The most important Middle Platonists after Eudorus are Philo of Alexandria (ca. 30 B.C. – 45 A.D.) and Plutarch of Chaeronea (ca. 45-125 A.D.). Numenius of Apamea (fl. ca. 150-176 A.D.), though more of a Neopythagorean than a Platonist (to the extent that such a distinction can be made in this period), had a profound influence on the emergence of Neoplatonism, not least in the deep and abiding influence his thought had on the philosophical development of Plotinus, who was actually accused of plagiarizing Numenius. Finally, we will discuss Albinus (fl. ca. 149-157) whose handbook of Platonic philosophy is an interesting example of Middle Platonic eclecticism (in the best sense of that term).

a. Philo of Alexandria

The work of Philo of Alexandria (also called Philo Judaeus) is the most prominent and philosophically accomplished example of the Jewish-Hellenistic syncretism that flourished at Alexandria beginning at least as early as the translation of the Hebrew Scriptures into Greek (the Septuagint), during the reign of Ptolemy II Philedelphus (285-247 B.C.). We already detect the influence of Hellenistic philosophy on Jewish thought in the biblical book of Ecclesiastes, and the later apocryphal work Wisdom of Sirach (ca. 30 B.C.) displays Platonic and Pythagorean affinities. So it is clear that by Philo’s time Jewish thinkers of the Diaspora were quite comfortable with Greek philosophy. In the work of Philo himself there is an attempt to square Old Testament theology with the Greek philosophical tradition, leading Philo to posit Moses as the first sage and teacher of the venerable ancients of the Greek tradition. The work of Philo was to have an immense influence on emerging Christian philosophy, especially in the work of Origen.

According to Philo, God transcends all first principles, including the Monad, is incorporeal and cannot even be said to occupy a space or place; He is eternal, changeless, self-sufficient and free from all constraint or necessity (cf. Tripolitis 1978, pp. 5-6 ff.). God freely willed the creation of the cosmos, first in a purely intellectual manner, and then, through the agency of His Logos (Philo’s philosophical term for the Wisdom figure of Proverbs 8:22) He brought forth the physical cosmos. Philo describes the Logos in a two-fold manner, first as the sum total of the thoughts of God, and then as a hypostatization of those thoughts for the purpose of physical creation. Thus we see Philo linking the cosmos to the intellectual realm by way of a mediating figure rather like the Platonic World-Soul. Borrowing a term from Stoic philosophy, Philo calls the thoughts of the Logos “rational seeds” (logoi spermatikoi), and describes them as having a role in the production of the cosmos which, he insists, was brought into being out of non-being by the agency of God.

Philo adhered to standard Platonism when he declared that the cosmos is a copy of the purely intellectual realm. However, he taught, following biblical doctrine, that the cosmos was created in time, but went on to state that, although having a temporal creation, the cosmos will exist eternally, since it is the result of God’s outpouring of love. The rational beings dwelling in the cosmos are divided by Philo into three types: the purely intellectual souls (created first by God), all animals (created second), and finally man (last of all rational creation, combining the attributes of the first two). Of the purely intellectual and incorporeal souls, Philo recognized varying degrees of perfection; some of the souls aid humanity, for example, providing guidance and giving signs, while other fell into vice themselves, and aim to lead man astray. These are the beings called angels by the Jews and daemons by the Greeks.

Philo’s ethical doctrine emphasized the free will of human beings. According to Philo, the meaning of the biblical statement that humanity is created in the image and likeness of God is that although sometimes constrained by external forces, all human souls are capable of overcoming these constraints and attaining freedom. He further adds, in a formulation that was to have a profound influence on Origen, that God aids souls in their quest for freedom in proportion to their love and devotion for Him and for their fellows.

b. Plutarch of Chaeronea

Plutarch was intensely interested in religion, and his philosophy bears the stamp of a profound religious piety. Like Eudorus, Plutarch understood the highest goal of existence as achieving likeness to god, yet he had little confidence in the ability of human reason to adequately contemplate and understand divinity, believing instead in the possibility of divine revelations. Plutarch considered all the religions of his time as bearing witness to one eternal truth, though expressed in different ways. His ability to use allegory in order to prove this assertion is most evident in his treatise On Isis and Osiris.

Plutarch did not, like Archytas and Eudorus, posit a principle higher than the Pythagorean One, which Plutarch also called, in Platonic fashion, the Good. The Dyad was considered by Plutarch as a disruptive or even downright evil principle, which the One or Monad had to struggle to control. This tension at the highest ontological level translates into a dualistic cosmology where the principle of reason is described as being in constant strife with unreason. The rational principle, Logos, is both transcendent and immanent. In its former aspect the Logos is understood by Plutarch as the sum-total of thoughts in the mind of god; in its latter aspect, Logos is understood allegorically as Osiris, whose body is routinely torn apart by Typhon, only to be reassembled ever again by Isis. Osiris’ body parts are interpreted as the Ideas dispersed throughout the material realm, and rationally maintained by Isis in her demiurgic role as cosmic steward.

Plutarch departed from standard Pythagorean doctrine in declaring the creation of the cosmos in time. In keeping with his Zoroastrian-style dualism, Plutarch posited a simultaneous intellectual conception of the created cosmos in the minds of both the One and its evil counterpart, the Dyad. Thus we see a dualism at the highest level of his thought; however, a dualism that is not akin to Gnosticism, for Plutarch’s opposing principles are equi-primordial, unlike the subversive Sophia in Gnostic mythology, who introduces a disruptive element into the intellectual realm.

Plutarch accepted the immortality of the soul, excepting only the notion of transmigration or reincarnation, and made the distinction, found again later in Origen, between mind (nous) and soul (psukhê). In the realm of ethics, Plutarch defended free will against fatalism, understanding divine providence (pronoia) as involving a co-operation between human will and divine agency (cf. Dillon, pp. 199-203 ff.; also Zeller, pp. 306-308), another notion later adopted by Origen.

c. Numenius of Apamea

Numenius has been called both a pythagorizing Platonist and a platonizing Pythagorean. However, the key to his attitude toward philosophy is summed up in his own statement that “Plato pythagorizes” (P. Henry 1991, p. lxx). He took the mysterious passage about the three kings in the Platonic Second Letter as coming from Socrates, and he likely used this passage as support for the triad of gods which he posited as first principles. Plato and Pythagoras were considered by him as the twin sources of philosophical truth, with which the traditions of the Hebrews, Egyptians, the Zoroastrian Magi, and even the Brahmins were all in agreement.

Numenius’ triad of gods begins with the First God, called also the Good, who is eternal, immutable, and at rest, concerned only with the intellectual realm. He is likened by Numenius to the owner of a farm who, after having sown the fields, leaves it up to his farmhands to cultivate the crops. The Second God, called Mind and Demiurge is responsible for translating the things of the intellectual realm to the realm of matter, thereby establishing a cosmos. In this capacity the Second God is called World-Soul. However, once this Soul comes into contact with matter, the source of all evil according to Numenius, it becomes divided into a rational and an irrational part, the former remaining in contemplation of the divine realm, and the latter immersing itself in the material realm. It is not clear whether Numenius intended to posit two World-Souls (one good, one evil) or if he had in mind simply a division within that Soul of an irrational and a rational part. If Numenius’ triad involves a strict separation of three distinct divinities (and this is a matter of interpretation) then we should speak of a separate World-Soul that is evil. If the triad is intended to imply a three-fold series of activities emanating from the divine realm, then we are correct in assuming that Numenius posited a single World-Soul with two warring parts. Due to the fragmentary nature of his surviving writings, however, it is impossible to know for sure what he intended.

Human souls were described by Numenius as divine fragments of the Demiurge, each one a microcosm of both the intellectual and the physical realm (Tripolitis, pp. 26-30). He taught that all souls contain both a rational and an irrational element, the former derived from the Second God, the latter from association with the material realm. Numenius taught that souls enter the cosmos by way of the Tropic of Cancer, acquiring various characteristics as they pass through the seven planetary spheres. The soul that leads a virtuous life – which for Numenius meant living a contemplative life detached from bodily things – will re-ascend to heaven (the sphere of the fixed stars) by way of the Tropic of Capricorn. The soul that fails to lead a correct life will enter Hades (located by Numenius in the mists above the world) where it will undergo chastisement until reincarnated in another body suitable to its nature. Numenius taught that certain souls may become so corrupted that they will enter the bodies of animals. In a doctrine that likely influenced Origen (in his doctrine of multiple ages), Numenius taught that the series of reincarnations are finite, and will eventually lead the soul back to the divine realm, though how this is accomplished for a soul existing in animal bodies is not entirely clear, since such a soul is presumably not susceptible to any rational exhortations to virtue.

No overtly ethical fragments of Numenius’ works survive, but we do know that he considered existence in this realm a struggle, with the irrational part of the soul in constant strife with the rational. Salvation from this state only takes place when the soul leaves the material realm for the divine. One is reminded of St. Paul’s lament in Romans 7:18-23 where he describes the war taking place between his flesh (body, matter) and his mind. His mind knows the good, he says, but his flesh continually prevents him from achieving this good. It is possible that Numenius read St. Paul, but more likely that the two thinkers simply were responding to a shared intellectual milieu consisting not only of Platonic philosophy, but Gnostic and Hermetic doctrines as well.

The influence of Numenius extended well beyond his life-time; his doctrines are recorded in the writings of later Neoplatonists like Porphyry and Proclus, and Plotinus himself was at one point accused of plagiarizing Numenius (Porphyry, Life of Plotinus 17). In the case of Plotinus, we see a clear Numenian influence regarding the triadic arrangement of principles, although Plotinus developed this basic notion in a quite original way. Plotinus also responded to Numenius’ doctrine of an evil World-Soul, developing in the process a quite sophisticated doctrine concerning matter and the nature of evil.

d. Albinus

Albinus (fl. ca. 149-157) left behind two complete works, excellent sources of first-century A.D. Platonism, the Isagogê (an introduction to Platonic philosophy) and the Didaskalikos (a summary of Plato’s philosophy). As an interpreter of Plato, Albinus relied heavily on Aristotle and, to a lesser extent, Stoicism. Like Numenius, Albinus posited a triadic set of principles: First God (also Mind and Good), Second God or Universal Intellect, and World-Soul. The First God is not described as creating the others, but rather as generating them from his mind as he thinks upon his own thoughts (cf. Tripolitis, pp. 31-36). This conception of divine emanation is present later in the philosophy of Plotinus and, in a more developed fashion, in Proclus. The First God is described along the lines of Aristotle’s Unmoved Mover, and is said to produce motion through the desire he inspires in the second and third gods. Albinus employs negative or apophatic language when describing the First God, a method of theologizing that would become of immense importance for later Christian Neoplatonists, especially Pseudo-Dionysius.

Individual human souls, according to Albinus, were created in the same manner as the second and third gods, that is, by a hypostatization of thoughts in the divine mind. Once generated, the souls enter the sphere of the fixed stars, where each soul is allotted its own star and set in a chariot or vehicle (okhêma). Following the myth of the soul in the Phaedrus, Albinus states that the duty of the soul in the material realm is to place unreason in subjection to reason, and to steer one’s chariot to the rim of heaven where one’s allotted star is waiting to receive the perfected soul.

Although Albinus describes the life of the soul as one of constant strife between the rational and the irrational parts, he does not posit, as did Numenius, an evil World-Soul, nor does he totally degrade all material embodiment as the source of evil. Albinus described the union of body and soul as akin to that of fire and asphalt, meaning that the one is the vehicle of the other. In the realm of ethics Albinus held the by-now-standard Platonic line of “likeness to god” as the highest goal of existence. He taught a doctrine of reincarnation including the entrance of the soul into animal bodies. As in Numenius, it is unclear how souls, once so incarnated, will ever attain to the reason requisite for salvation (cf. R.E. Witt 1937, p. 139).

Albinus anticipated Plotinus in the prime role he allotted to contemplation in the ideal existence of the soul, and Origen in his doctrine of the intellectual generation of souls by the godhead.

7. “Esoteric” Platonism

This final section will be devoted to a brief discussion of a branch or offshoot of Middle Platonic thought that I hesitantly labelled “esoteric,” in spite of the fact that these schools of thought or sects (or whatever one should call them) were quite widespread during this period, Gnosticism especially. However, though widespread, they were veiled in mystery and secrecy, leading John Dillon to refer to them in the perhaps more apt phrase “the Platonic Underworld.” We will be discussing three examples of this “underworld”: Hermeticism, Gnosticism, and the Chaldaean Oracles. The writings comprising the Corpus Hermeticum, so-called because of its supposed derivation from the teachings of the legendary sage Hermes Trismegistus, bear the marks of a variety of philosophies, Platonism and Neopythagoreanism being the most prominent. Hermetic ideas are found in Christianity as early as the writings of St. Paul, and Gnostic elements are to be discerned in John’s Gospel as well as in Paul. The earliest Christian theologians were Gnostics, and the most prominent among them, Valentinus, nearly became pope. The systems of the Gnostics, especially Valentinus, attempted (among other things) to solve certain problems of Platonic and related philosophies by employing mythological language, astrological symbolism, and elements of alchemy and ritual magic. Finally, the Chaldaean Oracles, a mysterious composition melding Platonic and Neopythagorean philosophy with a revelatory religiosity, was a major source of inspiration for later Neoplatonists.

a. Hermeticism

Hermeticism is a loose label for collections of texts on various subjects bearing the name Hermes Trismegistus, “Thrice-great Hermes,” who was believed to have been a sage of remote antiquity. According to the third-century B.C. historian Manetho of Sebennytos, a tradition existed in which Thoth-Hermes was said to have written down his teachings on tablets before the Flood. These tablets were said to be kept by the Egyptian priests, who later translated them into Greek. The earliest Hermetic writings are called the “technical Hermetica” and can be dated back to the early- to mid-second century B.C. These texts contain astrological material and information on the magical properties of gems. The co-called “philosophical Hermetica,” that is, the treatises comprising what today is called the Corpus Hermeticum, began to be written down a bit later, the earliest probably in the mid-first century B.C.

The most important treatise in this collection (at least for the history of Platonism) is the Poimandres. This text begins with the appearance of Poimandres (a name suggesting “Shepherd of Men” in Greek), the Divine Intellect, who reveals to the unknown author of the text a vision displaying the generation of the cosmos. The cosmos is described as beginning with a darkness coiling downward from the light (the intellectual realm) like a snake. It is at first indiscernible and disturbing, but then divine reason descends upon it and imposes order, and the earth comes into being. This account is dependent on both Plato’s Timaeus and the book of Genesis (especially as these two works were interpreted by Philo, whom our author likely read). The image of the descending darkness implies an evil or irrational principle, or World-Soul, as in Numenius, that must be brought under control by reason. Other affinities with Numenius, as well as Albinus, include the direct generation of souls by the Demiurge, and the descent and ascent of souls through the planetary spheres. One important difference is that both Numenius and Albinus considered the highest attainment of the soul as “likeness to god.” The Poimandres, however, declares that the souls who make the ascent to the divine realm actually become gods themselves, an idea that was to become central in the Eastern Orthodox Christian tradition, with its concept of deification or theôsis. It is highly likely that Numenius was acquainted with, if not the Poimandres itself, another text or texts similar in content. He was also most certainly familiar with Gnosticism, to a discussion of which we now turn.

b. Gnosticism

The writings called “gnostic” vary in content, style, date, and region of origin, to such a degree that certain modern scholars have called for a moratorium on the term (cf. M.A. Williams 1996). Yet there are certain basic elements common to most so-called Gnostic systems, as opposed to stray texts the provenance of which is unknown or dubious. The most important of these systems is that of Basilides and Valentinus, two early Christian theologians who are influenced heavily by Middle Platonic thought. (For a more in-depth discussion, see Gnosticism.)

The system of Basilides (fl. ca. 132-135 A.D.) begins with the engendering of Intellect (Nous) by the First (unengendered) Parent. From this Intellect, Logos is generated, and Logos in turn generates Prudence (phronêsis) who then generates Wisdom (Sophia) and Power (dunamis). This is a mythological elaboration of the standard Middle Platonic emanation schemas that we have encountered in Eudorus and later philosophers, like Numenius, who have posited a supreme principle above Intellect. Basilides apparently attempted to “flesh out” the standard triadic schemas of the more mainstream Middle Platonists by adding certain anthropomorphic attributes like “prudence” to the mix. Basilides was among the first Christian thinkers besides John the Evangelist to explicitly identify Jesus as the earthly manifestation of the Divine Intellect. He also dabbled in astrology, revising practices current in his time to suit his own peculiar cosmology. Using numerology, he identified the ruler of the celestial realm as “Abrasax” or “Abraxas,” a name used in the practice of ritual magic, the numerical value of which is (according to Greek numerology) 365, corresponding to the number of “heavens” believed by Gnostics and other to exist above the familiar spheres of the seven planets.

Valentinus (ca. 100-175 A.D.) begins his system, in Pythagorean fashion, not with a unity but a primal duality, the members of which he calls the Ineffable and Silence. The primal duality produces a second duality called the Parent and Truth, from which spring a quartet consisting of Logos, Life, Primal Man, and the Church. As a Christian, Valentinus held a rather peculiar notion of the nature and role of Christ in the cosmos, considering Him to have been engendered along with a “shadow” (matter) that it was His responsibility to control. Here again we see an elaboration on a particular aspect of Middle Platonism, namely the manner in which unwieldy matter is brought under control by a rationalizing force. Valentinus was apparently the first Christian theologian to refer to the Trinity in terms of persons, and he affirmed the eternity and immortality of souls, implying a notion of pre-existence of souls such as we find later in Origen.

Gnosticism had an immense influence not only on the development of Christianity but on emerging Neoplatonism as well. Plotinus, for example, was forced to respond to the increasingly vocal, it seems, Gnostics attending his lectures. Later, Iamblichus posited a One even higher than the Plotinian One, in a manner similar to Gnostics like Basilides and Valentinus who, as we have seen, separated their highest principles from all others by positing an unengendered parent, and a primal duality productive of a second duality, respectively.

c. The Chaldaean Oracles

The writings known as the Chaldaean Oracles were very likely composed by a certain Julian the Theurgist, who served in the Roman army during Marcus Aurelius’ campaign against the Quadi, and claimed to have saved the Roman camp from fiery destruction by causing a rainstorm (Dillon, pp. 392-393). The circumstances surrounding the writing of the Oracles is mysterious, the most likely explanation being that Julian uttered them after inducing a sort of trance akin to that of the classical oracles of Greece (E.R. Dodds 1973, p. 284). There is much Platonic content in the Oracles, resembling very closely the philosophy of Numenius, which is why they are of interest in this survey of Middle Platonism.

The metaphysical schema of the Chaldaean Oracles begins with an absolutely transcendent deity called Father, with whom resides Power, a productive principle, it seems, whence proceeds Intellect. This Intellect has a two-fold function, to contemplate the Forms of the purely intellectual realm of the Father, and to craft and govern the material realm. In this latter capacity the Intellect is Demiurge. The Oracles further posits a barrier between the intellectual and the material realm, personified as Hecate. In the capacity of barrier, or more properly “membrane” (hupezôkôs humên), Hecate separates the two “fires,” that is, the purely intellectual fire of the Father, and the material fire from which the cosmos is created, and mediates all divine influence upon the lower realm. From Hecate is derived the World-Soul, which in turn emanates Nature, the governor of the sub-lunar realm (Dillon, p. 394-395). From Nature is derived Fate, which is capable of enslaving the lower part of the human soul. The goal of existence then is to purify the lower soul of all contact with Nature and Fate, by living a life of austerity and contemplation. Salvation is achieved by an ascent through the planetary spheres, during which the soul casts off the various aspects of its lower soul, and becomes pure intellect.

8. Conclusion

It is evident, even from a brief survey such as this one, that the thinkers comprising the philosophy generally referred to as Middle Platonism held widely varying and sometimes even divergent ideas, not only on relatively minor points like the role of physical pleasure in happiness, but on major points like the eternity of the world or the number of first principles. A student encountering Middle Platonism for the first time, armed only with a knowledge of Plato’s Dialogues, will likely wonder why we even call some of these thinkers Platonists at all. That is understandable. However, it must be remembered that Plato did not bequeath a set of doctrines on his students and successors; his legacy is rather a series of problems that have exercised the minds of philosophers for over two millennia. Platonism, therefore, should not be thought of a simple elucidation of Plato’s doctrines, but rather as a creative engagement with Plato’s texts and with certain doctrines handed down by the Academy as belonging to Plato. Middle Platonism ends with Origen of Alexandria and his younger contemporary Plotinus, both of whom were deeply indebted to many of the philosophers discussed in this article, yet moved in directions uniquely their own. It is with them that Neoplatonism begins.

9. References and Further Reading

a. Primary Sources

  • Albinus, Didaskalikos, ed. P. Louis, in Albinos. Épitomé (Paris: Les Belles Lettres 1945).
  • Antiochus of Ascalon, Fragmenta, in Der Akademiker Antiochus, ed. G. Luck (Bern: Haupt 1953).
  • Arcesilaus, Fragmenta, in Supplementum Hellenisticum, ed. H. Lloyd-Jones, P. Parsons (Berlin: De Gruyter 1983).
  • Archytas (pseudo-), Fragmenta, in The Pythagorean Texts of the Hellenistic Period, ed. H. Thesleff (Abo: Abo Akademi 1965).
  • Cicero, The Nature of the Gods and On Divination, tr. C.D. Yonge (New York: Prometheus Books 1997).
  • The Chaldean Oracles, tr. G.R.S. Mead (Montana: Kessinger Publishing, no date).
  • Numenius, Numénius. Fragments, ed. É. des Places (Paris: Les Belles Lettres 1974).
  • Ocellus Lucanus, De universi natura and Fragmenta, in Neue philologische Untersuchungen, vol. 1, ed. R. Harder (Berlin: Weidmann 1926).
  • Philo of Alexandria, On the Creation of the World (De opificio Mundi), Allegorical Interpretation (Legum allegoria), tr. F.H. Colson, G.H. Whitaker, in Philo, vol. 1, Loeb Classical Library (Cambridge: Harvard University Press 1929).
  • Plato, Plato: Complete Works, ed. J.M. Cooper (Indianapolis: Hackett 1997).
  • Plutarch, De Iside et Osiride, in Plutarchi moralia, vol. 2.3, ed. W. Sieveking (Leipzing: Teubner 1935).
  • Posidonius, Posidonius. Die Fragmente, vol. 1, ed. W. Theiler (Berlin: De Gruyter 1982).
  • Speusippus, Fragmenta, in Speusippus of Athens, ed. L. Tarán (Philosophia Antiqua 39; Leiden: Brill 1981).
  • Timaeus Locrus, Fragmenta et titulus, in The Pythagorean Texts of the Hellenistic Period, ed. H. Thesleff (Abo: Abo Akademi 1965).
  • Xenocrates, Testimonia, doctrina et fragmenta, in Senocrate-Ermodoro. Frammenti, ed. M.I. Parente (Naples: Bibliopolis 1982).

b. Secondary Sources

  • Billings, T.H., The Platonism of Philo Judaeus (Chicago: University of Chicago Press 1919).
  • Brehier, E., The History of Philosophy, vol. 2: The Hellenistic and Roman Age, tr. W. Baskin (Chicago: University of Chicago Press 1958).
  • Copenhaver, B.P., tr., Hermetica (New York: Cambridge University Press 1992).
  • Copleston, F., A History of Philosophy, vol. 1, part 2: Greece and Rome (New York: Image Books 1962).
  • Dillon, J.M., The Middle Platonists (Ithaca: Cornell University Press 1977).
  • Dillon, J.M., Long, A.A., ed., The Question of “Eclecticism”: Studies in Later Greek Philosophy (Los Angeles: University of California Press 1988).
  • Dodds, E.R., The Greeks and the Irrational (Los Angeles: University of California Press 1973).
  • Festugiere, A-J, Personal Religion Among the Greeks (Los Angeles: University of California Press 1954).
  • Fowden, G., The Egyptian Hermes: A Historical Approach to the Late Pagan Mind (New York: Cambridge University Press 1986).
  • Guthrie, K.S., The Pythagorean Sourcebook and Library (Grand Rapids: Phanes Press 1988).
  • Henry, P., “The Place of Plotinus in the History of Thought” in Plotinus, The Enneads, tr. S. MacKenna (New York: Penguin Books 1991).
  • Jonas, H., The Gnostic Religion, third edition (Boston: Beacon Press 2001).
  • Layton, B., The Gnostic Scriptures (New York: Doubleday 1987).
  • Lovejoy, A.O., The Great Chain of Being (New York: Harper and Row 1965).
  • Tripolitis, A., The Doctrine of the Soul in the Thought of Plotinus and Origen (New York: Libra 1978).
  • Williams, M.A., Rethinking “Gnosticism”: An Argument for Dismantling a Dubious Category (Princeton: Princeton University Press 1996).
  • Witt, R.E., Albinus and the History of Middle Platonism (Cambridge: Cambridge University Press 1937).
  • Zeller, E., Outlines of the History of Greek Philosophy, tr. L.R. Palmer (New York: Meridian Books 1955).

Author Information

Edward Moore
Email: emoore@theandros.com
St. Elias School of Orthodox Theology
U. S. A.

Mencius (c. 372—289 B.C.E.)

menciusBetter known in China as “Master Meng” (Chinese: Mengzi), Mencius was a fourth-century BCE Chinese thinker whose importance in the Confucian tradition is second only to that of Confucius himself. In many ways, he played the role of St. Paul to Confucius’ Jesus, interpreting the thought of the master for subsequent ages while simultaneously impressing Confucius’ ideas with his own philosophical stamp. He is most famous for his theory of human nature, according to which all human beings share an innate goodness that either can be cultivated through education and self-discipline or squandered through neglect and negative influences, but never lost altogether. While it is not clear that Mencius’ views prevailed in early Chinese philosophical circles, they eventually won out after gaining the support of influential medieval commentators and thinkers such as Zhu Xi (Chu Hsi, 1130-1200 CE) and Wang Yangming (1472-1529 CE). (See Romanization systems for Chinese terms.) Today contemporary philosophical interest in evolutionary psychology and sociobiology has inspired fresh appraisals of Mencius, while recent philological studies question the coherence and authenticity of the text that bears his name. Mencius remains a perennially attractive figure for those intrigued by moral psychology, of which he was the foremost practitioner in early China.

Table of Contents

  1. The Mencius of History
  2. The Mencius of the Text
  3. Theodicy
  4. Government
  5. Human Nature
  6. Teleology
  7. Virtue Theory
  8. Moral Psychology
  9. Key Interpreters of Mencius
  10. References and Further Reading

1. The Mencius of History

Like the historical Confucius, the historical Mencius is available only through a text that, in its complete form at least, postdates his traditional lifetime (372-289 BCE). The philological controversy surrounding the date and composition of the text that bears his name is far less intense than that which surrounds the Confucian Analects, however. Most scholars agree that the entire Mencius was assembled by Mencius himself and his immediate disciples, perhaps shortly after his death. The text records several encounters with various rulers during Mencius’ old age, which can be dated between 323 and 314 BCE, making Mencius an active figure no later than the late fourth century BCE.

The other major source of information about Mencius’ life is the biography found in the Shiji (Records of the Grand Historian) of Sima Qian (c. 145-90 BCE), which states that he was a native of Zou (Tsou), a small state near Confucius’ home state of Lu in the Shandong peninsula of northeastern China. He is said to have studied with Confucius’ grandson, Zisi (Tzu-ssu), although most modern scholars doubt this. He also is thought to have become a minister of the state of Qi (Ch’i), which also was famous as the home of the Jixia (Chi-hsia) Academy. The Jixia Academy was a kind of early Chinese “think tank” sponsored the ruler of Qi that produced, among other thinkers, Mencius’ later opponent Xunzi (Hsun-tzu, 310-220 BCE).

Mencius was born in a period of Chinese history known as the Warring States (403-221 BCE), during which various states competed violently against one another for mastery of all of China, which once was unified under the Zhou dynasty until its collapse, for all intents and purposes, in 771 BCE. It was a brutal and turbulent era, which nonetheless gave rise to many brilliant philosophical movements, including the Confucian tradition of which Mencius was a foremost representative. The common intellectual and political problem that Warring States thinkers hoped to solve was the problem of China’s unification. While no early Chinese thinker questioned the need for autocratic rule as an instrument of unification, philosophers differed on whether and how the ruler ought to consider moral limitations on power, traditional religious ceremonies and obligations, and the welfare of his subjects.

Into the philosophical gap created by a lack of political unity and increasing social mobility stepped members of the shi (“retainer” or “knight”) class, from which both Confucius and Mencius arose. As feudal lords were defeated and disenfranchised in battle and the kings of the various warring states began to rely on appointed administrators rather than vassals to govern their territories, these shi became lordless anachronisms and fell into genteel poverty and itinerancy. Their knowledge of aristocratic traditions, however, helped them remain valuable to competing kings, who wished to learn how to regain the unity imposed by the Zhou and who sought to emulate the Zhou by patterning court rituals and other institutions after those of the fallen dynasty.

Thus, a new role for shi as itinerant antiquarians emerged. In such roles, shi found themselves in and out of office as the fortunes of various patron states ebbed and flowed. Mencius’ office in the state of Qi probably was no more than an honorary title. While out of office, veteran shi might gather small circles of disciples – young men from shi backgrounds who wished to succeed in public life – and seek audiences with rulers who might give them an opportunity to put their ideas into practice. The text of the Mencius claims to record Mencius’ teachings to his disciples as well as his dialogues with the philosophers and rulers of his day.

2. The Mencius of the Text

Mencius inherits from Confucius a set of terms and a series of problems. In general, one can say that where Confucius saw a unity of inner and outer – in terms of li (ritual propriety), ren (co-humanity), and the junzi (profound person)-xiaoren (small person) distinction – Mencius tends to privilege the inner aspects of concepts, practices, and identities. For Mencius, the locus of philosophical activity and self-cultivation is the xin (hsin), a term that denotes both the chief organ of the circulatory system and the organ of thought, and hence is translated here and in many other sources as “heart-mind.” Mencius’ views of the divine, political organization, human nature, and the path toward personal development all start and end in the heart-mind.

Mencius’ philosophical concerns, while scattered across the seven books of the text that bears his name, demonstrate a high degree of consistency unusual in early Chinese philosophical writing. They can be categorized into four groups:

  • Theodicy
  • Government
  • Human Nature
  • Self-Cultivation

3. Theodicy

Again, as with Confucius, so too with Mencius. From late Zhou tradition, Mencius inherited a great many religious sensibilities, including theistic ones. For the early Chinese (c. 16th century BCE), the world was controlled by an all-powerful deity, “The Lord on High” (Shangdi), to whom entreaties were made in the first known Chinese texts, inscriptions found on animal bones offered in divinatory sacrifice. As the Zhou polity emerged and triumphed over the previous Shang tribal rule, Zhou apologists began to regard their deity, Tian (“Sky” or “Heaven”) as synonymous with Shangdi, the deity of the deposed Shang kings, and explained the decline of Shang and the rise of Zhou as a consequence of a change in Tianming (“the mandate of Heaven”). Thus, theistic justifications for conquest and rulership were present very early in Chinese history.

By the time of Mencius, the concept of Tian appears to have changed slightly, taking on aspects of “fate” and “nature” as well as “deity.” For Confucius, Tian provided personal support and sanction for his sense of historical mission, while at the same time prompting Job-like anxiety during moments of ill fortune in which Tian seemed to have abandoned him. Mencius’ faith in Tian as the ultimate source of legitimate moral and political authority is unshakeable. Like Confucius, he says that “Tian does not speak – it simply reveals through deeds and affairs” (5A5). He ascribes the virtues of ren (co-humanity), yi (rightness), li (ritual propriety), zhi (wisdom), and sheng (sagehood) to Tian (7B24) and explicitly compares the rule of the moral king to the rule of Tian (5A4).

Mencius thus shares with Confucius three assumptions about Tian as an extrahuman, absolute power in the universe: (1) its alignment with moral goodness, (2) its dependence on human agents to actualize its will, and (3) the variable, unpredictable nature of its associations with mortal actors. To the extent that Mencius is concerned with justifying the ways of Tian to humanity, he tends to do so without questioning these three assumptions about the nature of Tian, which are rooted deep in the Chinese past, as his views on government, human nature, and self-cultivation will show.

4. Government

The dependence of Tian upon human agents to put its will into practice helps account for the emphasis Mencius places on the satisfaction of the people as an indicator of the ruler’s moral right to power, and on the responsibility of morally-minded ministers to depose an unworthy ruler. In a dialogue with King Xuan of Qi (r. 319-301 BCE), Mencius says:

The people are to be valued most, the altars of the grain and the land [traditional symbols of the vitality of the state] next, the ruler least. Hence winning the favor of the common people you become Emperor…. (7B14)

When the ruler makes a serious mistake they admonish. If after repeated admonishments he still will not listen, they depose him…. Do not think it strange, Your Majesty. Your Majesty asked his servant a question, and his servant dares not fail to answer it directly. (5B9)

Mencius’ replies to King Xuan are bracingly direct, in fact, but he can be coy. When the king asks whether it is true that various sage kings (Tang and Wu) rebelled against and murdered their predecessors (Jie and Zhou), Mencius answers that it is true. The king then asks:

“Is it permissible for a vassal to murder his lord?”

Mencius replied, “One who robs co-humanity [ren] you call a `robber’; one who robs the right [yi] you call a `wrecker’; and one who robs and wrecks you call an `outlaw.’ I have heard that [Wu] punished the outlaw Zhou – I have not heard that he murdered his lord. (1B8)

In other words, Wu was morally justified in executing Zhou, because Zhou had proven himself to be unworthy of the throne through his offenses against ren and yi – the very qualities associated with the Confucian exemplar (junzi) and his actions. This is an example of Mencius engaging in the “rectification of names” (zhengming), an exercise that Confucius considered to be prior to all other philosophical activity (Analects 13.3).

While Mencius endorses a “right of revolution,” he is no democrat. His ideal ruler is the sage-king, such as the legendary Shun, on whose reign both divine sanction and popular approval conferred legitimacy:

When he was put in charge of sacrifices, the hundred gods delighted in them which is Heaven accepting him. When he was put in charge of affairs, the affairs were in order and the people satisfied with him, which is the people accepting him. Heaven gave it [the state] to him; human beings gave it to him. (5A5)

Mencius proposes various economic plans to his monarchical audiences, but while he insists on particular strategies (such as dividing the land into five-acre settlements planted with mulberry trees), he rejects the notion that one should commit to an action primarily on the grounds that it will benefit one, the state, or anything else. What matters about actions is whether they are moral or not; the question of their benefit or cost is beside the point. Here, Mencius reveals his antipathy for – and competition with – philosophers who followed Mozi, a fifth-century BCE contemporary of Confucius who propounded a utilitarian theory of value based on li (benefit):

Why must Your Majesty say “benefit” [li]? I have only the co-humane [ren] and the right [yi]. (1A1)

In the end, Mencius is committed to a type of benevolent dictatorship, which puts moral value before pragmatic value and in this way seeks to benefit both ruler and subjects. The sage-kings of antiquity are a model, but one cannot simply adopt their customs and institutions and expect to govern effectively (4A1). Instead, one must emulate the sage-kings both in terms of outer structures (good laws, wise policies, correct rituals) and in terms of inner motivations (placing ren and yi first). Like Confucius, Mencius places an enormous amount of confidence in the capacity of the ordinary person to respond to an extraordinary ruler, so as to put the world in order. The question is, how does Mencius account for this optimism in light of human nature?

5. Human Nature

Mencius is famous for claiming that human nature (renxing) is good. As with most reductions of philosophical positions to bumper-sticker slogans, this statement oversimplifies Mencius’ position. In the text, Mencius takes a more careful route in order to arrive at this view. Following A. C. Graham, one can see his argument as having three elements: (1) a teleology, (2) a virtue theory, and (3) a moral psychology.

6. Teleology

Mencius’ basic assertion is that “everyone has a heart-mind which feels for others.” (2A6) As evidence, he makes two appeals: to experience, and to reason. Appealing to experience, he says:

Supposing people see a child fall into a well – they all have a heart-mind that is shocked and sympathetic. It is not for the sake of being on good terms with the child’s parents, and it is not for the sake of winning praise for neighbors and friends, nor is it because they dislike the child’s noisy cry. (2A6)

It is important to point out here that Mencius says nothing about acting on this automatic affective-cognitive response to suffering that he ascribes to the bystanders at the well tragedy. It is merely the feeling that counts. Going further and appealing to reason, Mencius argues:

Judging by this, without a heart-mind that sympathizes one is not human; without a heart-mind aware of shame, one is not human; without a heart-mind that defers to others, one is not human; and without a heart-mind that approves and condemns, one is not human. (2A6)

Thus, Mencius makes an assertion about human beings – all have a heart-mind that feels for others – and qualifies his assertion with appeals to common experience and logical argument. This does little to distinguish him from other early Chinese thinkers, who also noticed that human beings were capable of altruism as well as selfishness. What remains is for him to explain why other thinkers are incorrect when they ascribe positive evil to human nature – that human beings are such that they actively seek to do wrong.

7. Virtue Theory

Mencius goes further and identifies the four basic qualities of the heart-mind (sympathy, shame, deference, judgment) not only as distinguishing characteristics of human beings – what makes the human being qua human being really human – but also as the “sprouts” (duan) of the four cardinal virtues:

A heart-mind that sympathizes is the sprout of co-humanity [ren]; a heart-mind that is aware of shame is the sprout of rightness [yi]; a heart-mind that defers to others is the sprout of ritual propriety [li]; a heart-mind that approves and condemns is the sprout of wisdom [zhi]…. If anyone having the four sprouts within himself knows how to develop them to the full, it is like fire catching alight, or a spring as it first bursts through. If able to develop them, he is able to protect the entire world; if unable, he is unable to serve even his parents. (2A6)

Now the complexity of Mencius’ seemingly simplistic position becomes clearer. What makes us human is our feelings of commiseration for others’ suffering; what makes us virtuous – or, in Confucian parlance, junzi – is our development of this inner potential. To paraphrase Irene Bloom on this point, there is no sharp conflict between “nature” and “nurture” in Mencius; biology and culture are co-dependent upon one another in the development of the virtues. If our sprouts are left untended, we can be no more than merely human – feeling sorrow at the suffering of another, but unable or unwilling to do anything about it. If we tend our sprouts assiduously — through education in the classical texts, formation by ritual propriety, fulfillment of social norms, etc. – we can not only avert the suffering of a few children in some wells, but also bring about peace and justice in the entire world. This is the basis of Mencius’ appeal to King Hui of Liang (r. 370-319 BCE):

[The king] asked abruptly, “How shall the world be settled?”

“It will be settled by unification,” I [Mencius] answered.

“Who will be able to unify it?”

“Someone without a taste for killing will be able to unify it…. Has Your Majesty noticed rice shoots? If there is drought during the seventh and eighth months, the shoots wither, but if dense clouds gather in the sky and a torrent of rain falls, the shoots suddenly revive. When that happens, who could stop it? … Should there be one without a taste for killing, the people will crane their necks looking out for him. If that does happen, the people will go over to him as water tends downwards, in a torrent – who could stop it? (1A6)

Mencius devotes some energy to arguing that “rightness” (yi) is internal, rather than external, to human beings. He does so using examples taken from that quintessentially Confucian arena of human relations, filial piety (xiao). Comparing the rightness that manifests itself in filial piety to such visceral activities as eating, drinking, and sexual intercourse, Mencius points out that, just as one’s attraction or repulsion regarding these activities is determined by one’s internal orientation (hunger, thirst, lust), one’s filial behavior is determined by one’s inner attitudes, as the following imaginary dialogue with one of his opponents shows:

[Ask the opponent] “Which do you respect, your uncle or your younger brother?” He will say, “My uncle.” “When your younger brother is impersonating an ancestor at a sacrifice, then which do you respect?” He will say, “My younger brother.” You ask him, “What has happened to your respect for your uncle?” He will say, “It is because of the position my younger brother occupies.” (6A5)

In other words, the rightness that one manifests in filial piety is not dependent on fixed, external categories, such as the status of one’s younger brother qua younger brother or one’s uncle qua one’s uncle. If it were, one always would show respect to one’s uncle and never to one’s younger brother or anyone else junior to oneself. But as it happens, shifts in external circumstances can effect changes in status; one’s younger brother can temporarily assume the status of a very senior ancestor in the proper ritual context, thus earning the respect ordinarily given to seniors and never shown to juniors. For Mencius, this demonstrates that the internal orientation of the agent (e.g., rightness) determines the moral value of given behaviors (e.g., filial piety).

Having made a teleological argument from the inborn potential of human beings to the presumption of virtues that can be developed, Mencius then offers his sketch of moral psychology – the structures within the human person that make such potential identifiable and such development possible.

8. Moral Psychology

The primary function of Mencius’ moral psychology is to explain how moral failure is possible and how it can be avoided. As Antonio S. Cua has noted, for Mencius, moral failure is the failure to develop one’s xin (heart-mind). In order to account for the moral mechanics of the xin, Mencius offers a quasi-physiological theory involving qi (vital energy) – “a hard thing to speak about” (2A2), part vapor, part fluid, found in the atmosphere and in the human body, that regulates affective-cognitive processes as well as one’s general well-being. It is especially abundant outdoors at night and in the early morning, which is why taking fresh air at these times can act as a physical and spiritual tonic (6A8). When Mencius is asked about his personal strengths, he says:

I know how to speak, and I am good at nourishing my flood-like qi. (2A2)

It is interesting to note the apparent link between powers of suasion – essential for any itinerant Warring States shi, whether official or teacher – and “flood-like qi.” The goal of Mencian self-cultivation is to bring one’s qi, xin, and yan (words) together in a seamless blend of rightness (yi) and ritual propriety (li). Mencius goes on to describe what he means by “flood-like qi“:

It is the sort of qi that is utmost in vastness, utmost in firmness. If, by uprightness, you nourish it and do not interfere with it, it fills the space between Heaven and Earth. It is the sort of qi that matches the right [yi] with the Way [Dao]; without these, it starves. It is generated by the accumulation of right [yi] – one cannot attain it by sporadic righteousness. If anything one does fails to meet the standards of one’s heart-mind, it starves. (2A2)

It is here that Mencius is at his most mystical, and recent scholarship has suggested that he and his disciples may have practiced a form of meditative discipline akin to yoga. Certainly, similar-sounding spiritual exercises are described in other early Chinese texts, such as the Neiye (“Inner Training”) chapter of the Guanzi (Kuan-tzu, c. 4th-2nd centuries BCE). It also is at this point that Mencius seems to depart most radically from what is known about the historical Confucius’ teachings. While faint glimpses of what may be ascetic and meditative disciplines sometimes appear in the Analects, nowhere in the text are there detailed discussions of nurturing one’s qi such as can be found in Mencius 2A2.

In spite of the mystical tone of this passage, however, all that the text really says is that qi can be nurtured through regular acts of “rightness” (yi). It goes on to say that qi flows from one’s xin (2A2), that one’s xin must undergo great discipline in order to produce “flood-like qi” (6B15), and that a well-developed xin will manifest itself in radiance that shines from one’s qi into one’s face and general appearance (7A21). In short, here is where Mencius’ case for human nature seems to leave philosophy and reasoned argumentation behind and step into the world of ineffability and religious experience. There is no reason, of course, why Mencius shouldn’t take this step; as Alan K. L. Chan has pointed out, ethics and spirituality are not mutually exclusive, either in the Mencius or elsewhere.

To sum up, both biology and culture are important for Mencian self-cultivation, and so is Tian. “By fully developing one’s heart-mind, one knows one’s nature, and by knowing one’s nature, one knows Heaven.” (7A1) One cannot help but begin with “a heart-mind that feels for others,” but the journey toward full humanity is hardly complete without having taken any steps beyond one’s birth. Guided by the examples of ancient sages and the ritual forms and texts they have left behind, one starts to develop one’s heart-mind further by nurturing its qi through habitually doing what is right, cultivating its “sprouts” into virtues, and bringing oneself up and out from the merely human to that which Tian intends for one, which is to become a sage. Nature is crucial, but so is nurture. Mencius’ model of moral psychology is both a “discovery” model (human nature is good) and a “development” model (human nature can be made even better):

A person’s surroundings transform his qi just as the food he eats changes his body. (7A36)

9. Key Interpreters of Mencius

Detailed discussion of Mencius’ key interpreters is best reserved for an article on Confucian philosophy. Nonetheless, an outline of the most important commentators and their philosophical trajectories is worth including here.

The two best known early interpreters of Mencius’ thought – besides the compilers of the Mencius themselves – are the Warring States philosophers Gaozi (Kao-tzu, 300s BCE) and Xunzi (Hsun-tzu, 310-220 BCE). Gaozi, who is known only from the Mencius, evidently knew Mencius personally, but Xunzi knew him only retrospectively. Both disagreed with Mencius’ views on human nature.

Gaozi’s dialogue with Mencius on human nature can be found in book six of the Mencius, in which both Mencius’ disciples and Gaozi himself question him on his points of disagreement with Gaozi. Gaozi – whom later Confucians identified, probably anachronistically, as a Daoist — offers multiple hypotheses about human nature, each of which Mencius refutes in Socratic fashion. Gaozi first argues that human nature is neither bad nor good, and presents two organic metaphors for its moral neutrality: wood (which can be carved into any object) and water (which can be made to flow east or west).

Challenging the carved wood metaphor, Mencius points out that in carving wood into a cup or bowl, one violates the wood’s nature, which is to become a tree. Does one then violate a human being’s nature by training him to be good? No, he says, it is possible to violate a human being’s nature by making him bad, but his nature is to become good. As for the water metaphor, Mencius rejects it by remarking that human nature flows to the good, just as water’s nature flows down. It is possible to make people bad, just as it is possible to make water flow up – but neither is a natural process or end. “Although man can be made to become bad, his nature remains as it was.” (6A2)

Like Mencius, Xunzi claims to interpret Confucius’ thought authentically, but leavens it with his own contributions. While neither Gaozi nor Mencius is willing to entertain the notion that human beings might originally be evil, this is the cornerstone of Xunzi’s position on human nature. Against Mencius, Xunzi defines human nature as what is inborn and unlearned, and then asks why education and ritual are necessary for Mencius if people really are good by nature. Whereas Mencius claims that human beings are originally good but argues for the necessity of self-cultivation, Xunzi claims that human beings are originally bad but argues that they can be reformed, even perfected, through self-cultivation. Also like Mencius, Xunzi sees li as the key to the cultivation of renxing.

Although Xunzi condemns Mencius’ arguments in no uncertain terms, when one has risen above the smoke and din of the fray, one may see that the two thinkers share many assumptions, including one that links each to Confucius: the assumption that human beings can be transformed by participation in traditional aesthetic, moral, and social disciplines. (Gaozi’s metaphor of carved wood, incidentally, is one of Xunzi’s favorites.) Through an accident of history, Mencius had no occasion to meet Xunzi and thus no opportunity to refute his arguments, but if he had, he might have replied that Xunzi cannot truly believe in the original depravity of human beings, or else he could not place such great faith in the morally-transformative power of culture.

Later interpreters of Mencius’ thought between the Tang and Ming dynasties are often grouped together under the label of “Neo-Confucianism.” This term has no cognate in classical Chinese, but is useful insofar as it unites several thinkers from disparate eras who share common themes and concerns. Thinkers such as Zhang Zai (Chang Tsai, 1020-1077 CE), Zhu Xi (Chu Hsi, 1130-1200 CE) and Wang Yangming (1472-1529 CE), while distinct from one another, agree on the primacy of Confucius as the fountainhead of the Confucian tradition, share Mencius’ understanding of human beings as innately good, and revere the Mencius as one of the “Four Books” — authoritative textual sources for standards of ritual, moral, and social propriety. Zhang Zai’s interest in qi as the unifier of all things surely must have been stimulated by Mencius’ theories, while Wang Yangming’s search for li (cosmic order or principle) in the heart-mind evokes Mencius 6A7: “What do all heart-minds have in common? Li [cosmic order] and yi [rightness].” Both thinkers also display a bent toward the cosmological and metaphysical which disposes them toward the mysticism of Mencius 2A2, and betrays the influence of Buddhism (of which Mencius knew nothing) and Daoism (of which Mencius indicates little knowledge) on their thought.

During the Qing (Ch’ing) dynasty (1644-1911 CE), late Confucian thinkers such as Dai Zhen (Tai Chen, 1724-1777 CE) developed critiques of Xunzi that aimed at the vindication of Mencius’ position on human nature. Kwong-loi Shun has pointed out that Dai Zhen’s defense of Mencius actually owes more to Xunzi than to Mencius, particularly in regard to how Dai Zhen sees one’s heart-mind as learning to appreciate li (cosmic order) and yi (rightness), rather than naturally taking pleasure in such things, as Mencius would have it. Although Dai Zhen shares Mencius’ view of the centrality of the heart-mind in moral development, in the end, he does not ascribe to the heart-mind the same kind of ethical directionality that Mencius finds there.

More recently, the philosophers Roger Ames and Donald Munro have developed postmodern readings of Mencius that involve contemporary developments such as process thought and evolutionary psychology. Although their philosophical points of departure differ, both Ames and Munro share a distaste for the prominence of Tian in Mencius’ thought, and each seeks in his own way to separate the “essence” of Mencian thought from the “dross.” For Ames, the “essence” – although, as a postmodern thinker, he rejects any notion of “essentialism” – is Mencius’ “process” model of human nature and the cosmos, while the “dross” is Mencius’ understanding of Tian as transcendent, which (in Ames’ reading) undermines human agency. For Munro, the “essence” is Mencius’ grounding ethics in inborn nature, while the “dross” is Mencius’ appeals to Tian as the author of that inborn nature. Their work is an attempt to make Mencius not only intelligible, but also valuable, to contemporary Westerners. At the same time, critics have noted that much of the authentic Mencius may be discarded on the cutting room floor in this process of reclaiming him for contemporary minds. One thinks of David Nivison’s warning to philosophers, past and present, not to indulge in “wishful thinking” and excise or explain away what one does not wish to see in the Mencius.

This cursory review of some important interpreters of Mencius’ thought illustrates a principle that ought to be followed by all who seek to understanding Mencius’ philosophical views: suspicion of the sources. Almost all of our sources for reconstructing Mencius’ views postdate him or come from a hand other than his own, and thus all should be used with caution and with an eye toward possible influences from outside of fourth century BCE China.

10. References and Further Reading

  • Allan, Sarah. The Way of Water and Sprouts of Virtue. Albany: State University of New York Press, 1997.
  • Ames, Roger T. “Mencius and a Process Notion of Human Nature,” in Mencius: Contexts and Interpretations, ed. Alan K. L. Chan (Honolulu: University of Hawai’i Press, 2002), 72-90.
  • Ames, Roger T. “The Mencian Conception of ren xing: Does It Mean `Human Nature’?” in Chinese Texts and Philosophical Contexts: Essays Dedicated to Angus C. Graham, ed. Henry Rosemont, Jr. (La Salle, IL: Open Court, 1991), 143-175.
  • Berthrong, John. “Trends in the Interpretation of Confucian Religiosity,” in The Confucian-Christian Encounter in Historical and Contemporary Perspective, ed. Peter K. H. Lee (Lewiston, ME: Edwin Mellen Press, 1991), 226-254.
  • Bloom, Irene. “Biology and Culture in the Mencian View of Human Nature,” in Mencius: Contexts and Interpretations, ed. Alan K. L. Chan (Honolulu: University of Hawai’i Press, 2002), 91-102.
  • Bloom, Irene. “Mencian Arguments on Human Nature (jen-hsing).” Philosophy East and West 44/1 (1994): 19-53.
  • Boodberg, Peter A. “The Semasiology of Some Primary Confucian Concepts,” in Selected Works of Peter A. Boodberg, ed. Alvin P. Cohen (Berkeley: University of California Press, 1979), 26-40.
  • Bosley, Richard. “Do Mencius and Hume Make the Same Ethical Mistake?” Philosophy East and West 38/1 (1988): 3-18.
  • Brooks, Bruce, and E. Taeko Brooks. “The Nature and Historical Context of the Mencius,” in Mencius: Contexts and Interpretations, ed. Alan K. L. Chan (Honolulu: University of Hawai’i Press, 2002), 242-281.
  • Chan, Wing-tsit, ed. A Sourcebook in Chinese Philosophy. Princeton: Princeton University Press, 1963.
  • Cua, Antonio S. “Xin and Moral Failure: Notes on an Aspect of Mencius’ Moral Psychology,” in Mencius: Contexts and Interpretations, ed. Alan K. L. Chan (Honolulu: University of Hawai’i Press, 2002), 126-150.
  • Dobson, W. A. C. H., trans. Mencius. Toronto and Buffalo: University of Toronto Press, 1963.
  • Eno, Robert. The Confucian Creation of Heaven. Albany: State University of New York Press, 1990.
  • Graham, A. C. Disputers of the Tao: Philosophical Argument in Ancient China. La Salle, IL: Open Court, 1989.
  • Ivanhoe, Philip J. Ethics in the Confucian Tradition: The Thought of Mencius and Wang Yang-ming. Atlanta: Scholars Press, 1990.
  • Lau, D. C. “Meng tzu (Mencius),” in Early Chinese Texts: A Bibliographical Guide, ed. Michael Loewe (Berkeley: Society for the Study of Early China and the Institute of East Asian Studies, University of California, Berkeley, 1993), 331-335.
  • Lau, D. C. trans. Mencius. 2 vols. Hong Kong: Chinese University Press, 1984.
  • Lau, D. C. “On Mencius’ Use of the Method of Analogy in Argument.” In Lau, trans., Mencius (London: Penguin Books, 1970), 235-263.
  • Legge, James, trans. The Works of Mencius. New York: Dover Publications, 1970.
  • Munro, Donald J. “Mencius and an Ethics of the New Century,” in Mencius: Contexts and Interpretations, ed. Alan K. L. Chan (Honolulu: University of Hawai’i Press, 2002), 305-316.
  • Munro, Donald J. The Concept of Man In Early China. Stanford, CA: Stanford University Press, 1969.
  • Nivison, David S. “The Classical Philosophical Writings,” in The Cambridge History of Ancient China: From the Origins of Civilization to 221 B.C., ed. Michael Loewe and Edward L. Shaughnessy (Cambridge: Cambridge University Press, 1999), 745-812.
  • Nivison, David S. The Ways of Confucianism: Investigations in Chinese Philosophy. Ed. Bryan W. Van Norden. Chicago and La Salle, IL: Open Court, 1996.
  • Schwartz, Benjamin I. The World of Thought in Ancient China. Cambridge, MA: The Belknap Press of Harvard University Press, 1985.
  • Shun, Kwong-loi. “Mencius, Xunzi, and Dai Zhen: A Study of the Mengzi ziyi shuzheng,” in Mencius: Contexts and Interpretations, ed. Alan K. L. Chan (Honolulu: University of Hawai’i Press, 2002), 216-241.
  • Shun, Kwong-loi. Mencius and Early Chinese Thought. Stanford, CA: Stanford University Press, 1997.
  • Taylor, Rodney L. “The Religious Character of the Confucian Tradition.” Philosophy East and West 48/1 (January 1998): 80-107.
  • Yearley, Lee H. Mencius and Aquinas: Theories of Virtue and Conceptions of Courage. Albany: State University of New York Press, 1990.

Author Information

Jeffrey Richey
Email: Jeffrey_Richey@berea.edu
Berea College
U. S. A.

Legal Positivism

Legal positivism is a philosophy of law that emphasizes the conventional nature of law—that it is socially constructed. According to legal positivism, law is synonymous with positive norms, that is, norms made by the legislator or considered as common law or case law. Formal criteria of law’s origin, law enforcement and legal effectiveness are all sufficient for social norms to be considered law.  Legal positivism does not base law on divine commandments, reason, or human rights.  As an historical matter, positivism arose in opposition to classical natural law theory, according to which there are necessary moral constraints on the content of law.

Legal positivism does not imply an ethical justification for the content of the law, nor a decision for or against the obedience to law. Positivists do not judge laws by questions of justice or humanity, but merely by the ways in which the laws have been created. This includes the view that judges make new law in deciding cases not falling clearly under a legal rule. Practicing, deciding or tolerating certain practices of law can each be considered a way of creating law.

Within legal doctrine, legal positivism would be opposed to sociological jurisprudence and hermeneutics of law, which study the concrete prevailing circumstances of statutory interpretation in society.

The word “positivism” was probably first used to draw attention to the idea that law is “positive” or “posited,” as opposed to being “natural” in the sense of being derived from natural law or morality.

Table of Contents

  1. The Pedigree Thesis
  2. The Separability Thesis
    1. Inclusive vs. Exclusive Positivism
  3. The Discretion Thesis
  4. Classic Criticisms of Positivism
    1. Fuller’s Internal Morality of Law
    2. Positivism and Legal Principles
    3. The Semantic Sting
  5. References and Further Reading

1. The Pedigree Thesis

The pedigree thesis asserts that legal validity is a function of certain social facts. Borrowing heavily from Jeremy Bentham, John Austin argues that the principal distinguishing feature of a legal system is the presence of a sovereign who is habitually obeyed by most people in the society, but not in the habit of obeying any determinate human superior (Austin 1995, p. 166). On Austin’s view, a rule R is legally valid (that is, is a law) in a society S if and only if R is commanded by the sovereign in S and is backed up with the threat of a sanction. The severity of the threatened sanction is irrelevant; any general sovereign imperative supported by a threat of even the smallest harm is a law.

Austin’s command theory of law is vulnerable to a number of criticisms. One problem is that there appears to be no identifiable sovereign in democratic societies. In the United States, for example, the ultimate political power seems to belong to the people, who elect lawmakers to represent their interests. Elected lawmakers have the power to coerce behavior but are regarded as servants of the people and not as repositories of sovereign power. The voting population, on the other hand, seems to be the repository of ultimate political authority yet lacks the immediate power to coerce behavior. Thus, in democracies like that of the United States, the ultimate political authority and the power to coerce behavior seem to reside in different entities.

A second problem has to do with Austin’s view that the sovereign lawmaking authority is incapable of legal limitation. On Austin’s view, a sovereign cannot be legally constrained because no person (or body of persons) can coerce herself (or itself). Since constitutional provisions limit the authority of the legislative body to make laws, Austin is forced to argue that what we refer to as constitutional law is really not law at all; rather, it is principally a matter of “positive morality” (Austin 1977, p. 107).

Austin’s view is difficult to reconcile with constitutional law in the United States. Courts regard the procedural and substantive provisions of the constitution as constraints on legal validity. The Supreme Court has held, for example, that “an unconstitutional act is not a law; it confers no rights; it imposes no duties; it is, in legal contemplation, as inoperative as though it had never been passed.” (Norton v. Shelby County, 118 U.S. 425 (1886)). Moreover, these constraints purport to be legal constraints: the Supremacy Clause of Article VI of the Constitution states that “[t]his Constitution … shall be the supreme Law of the Land; and the Judges in every State shall be bound thereby.”

The most influential criticisms of Austin’s version of the pedigree thesis, however, owe to H. L. A. Hart’s seminal work, The Concept of Law. Hart points out that Austin’s theory provides, at best, a partial account of legal validity because it focuses on one kind of rule, namely that which requires citizens “to do or abstain from certain actions, whether they wish to or not” (Hart 1994, p. 81). While every legal system must contain so-called primary rules that regulate citizen behavior, Hart believes a system consisting entirely of the kind of liberty restrictions found in the criminal law is, at best, a rudimentary or primitive legal system.

On Hart’s view, Austin’s emphasis on coercive force leads him to overlook the presence of a second kind of primary rule that confers upon citizens the power to create, modify, and extinguish rights and obligations in other persons. As Hart points out, the rules governing the creation of contracts and wills cannot plausibly be characterized as restrictions on freedom that are backed by the threat of a sanction. These rules empower persons to structure their legal relations within the coercive framework of the law-a feature that Hart correctly regards as one of “law’s greatest contributions to social life.” The operation of power-conferring primary rules, according to Hart, indicates the presence of a more sophisticated system for regulating behavior.

But what ultimately distinguishes societies with full-blown systems of law from those with only rudimentary or primitive forms of law is that the former have, in addition to first-order primary rules, secondary meta-rules that have as their subject matter the primary rules themselves:

[Secondary rules] may all be said to be on a different level from the primary rules, for they are all about such rules; in the sense that while primary rules are concerned with the actions that individuals must or must not do, these secondary rules are all concerned with the primary rules themselves. They specify the way in which the primary rules may be conclusively ascertained, introduced, eliminated, varied, and the fact of their violation conclusively determined (Hart 1994, p. 92).

Hart distinguishes three types of secondary rules that mark the transition from primitive forms of law to full-blown legal systems: (1) the rule of recognition, which “specif[ies] some feature or features possession of which by a suggested rule is taken as a conclusive affirmative indication that it is a rule of the group to be supported by the social pressure it exerts” (Hart 1994, p. 92); (2) the rule of change, which enables a society to add, remove, and modify valid rules; and (3) the rule of adjudication, which provides a mechanism for determining whether a valid rule has been violated. On Hart’s view, then, every society with a full-blown legal system necessarily has a rule of recognition that articulates criteria for legal validity that include provisions for making, changing and adjudicating law. Law is, to use Hart’s famous phrase, “the union of primary and secondary rules” (Hart 1994, p. 107). Austin theory fails, on Hart’s view, because it fails to acknowledge the importance of secondary rules in manufacturing legal validity.

Hart also finds fault with Austin’s view that legal obligation is essentially coercive. According to Hart, there is no difference between the Austinian sovereign who governs by coercing behavior and the gunman who orders someone to hand over her money. In both cases, the subject can plausibly be characterized as being “obliged” to comply with the commands, but not as being “duty-bound” or “obligated” to do so (Hart 1994, p. 80). On Hart’s view, the application of coercive force alone can never give rise to an obligation-legal or otherwise.

Legal rules are obligatory, according to Hart, because people accept them as standards that justify criticism and, in extreme cases, punishment of deviations:

What is necessary is that there should be a critical reflective attitude to certain patterns of behavior as a common standard, and that this should display itself in criticism (including self-criticism), demands for conformity, and in acknowledgements that such criticism and demands are justified, all of which find their characteristic expression in the normative terminology of ‘ought’, ‘must’, and ‘should’, and ‘right’ and ‘wrong’ (Hart 1994, p. 56).

The subject who reflectively accepts the rule as providing a standard that justifies criticism of deviations is said to take “the internal point of view” towards it.

On Hart’s view, it would be too much to require that the bulk of the population accept the rule of recognition as the ultimate criteria for legal validity: “the reality of the situation is that a great proportion of ordinary citizens-perhaps a majority-have no general conception of the legal structure or its criteria of validity” (Hart 1994, p. 111). Instead, Hart argues that what is necessary to the existence of a legal system is that the majority of officials take the internal point of view towards the rule of recognition and its criteria of validity. All that is required of citizens is that they generally obey the primary rules that are legally valid according to the rule of recognition.

Thus, on Hart’s view, there are two minimum conditions sufficient and necessary for the existence of a legal system: “On the one hand those rules of behavior which are valid according to the system’s ultimate criteria of validity must be generally obeyed, and, on the other hand, its rules of recognition specifying the criteria of legal validity and its rules of change and adjudication must be effectively accepted as common public standards of official behavior by its officials” (Hart 1994, p. 113).

Hart’s view is vulnerable to the same criticism that he levels against Austin. Hart rejects Austin’s view because the institutional application of coercive force can no more give rise to an obligation than can the application of coercive force by a gunman. But the situation is no different if the gunman takes the internal point of view towards his authority to make such a threat. Despite the gunman’s belief that he is entitled to make the threat, the victim is obliged, but not obligated, to comply with the gunman’s orders. The gunman’s behavior is no less coercive because he believes he is entitled to make the threat.

Similarly, in the minimal legal system, only the officials of the legal system take the internal point of view towards the rule of recognition that endows them with authority to make, execute, adjudicate, and enforce the rules. The mere presence of a belief in the officials that they are entitled to make law cannot give rise to an obligation in other people to comply with their enactments any more than the presence of a belief on the part of a gunman that he is entitled to issue orders gives rise to an obligation in the victim to comply with those orders. Hart’s minimal legal system is no less coercive than Austin’s legal system.

2. The Separability Thesis

The second thesis comprising the foundation of legal positivism is the separability thesis. In its most general form, the separability thesis asserts that law and morality are conceptually distinct. This abstract formulation can be interpreted in a number of ways. For example, Klaus Faber (1996) interprets it as making a meta-level claim that the definition of law must be entirely free of moral notions. This interpretation implies that any reference to moral considerations in defining the related notions of law, legal validity, and legal system is inconsistent with the separability thesis.

More commonly, the separability thesis is interpreted as making only an object-level claim about the existence conditions for legal validity. As H.L.A. Hart describes it, the separability thesis is no more than the “simple contention that it is in no sense a necessary truth that laws reproduce or satisfy certain demands of morality, though in fact they have often done so” (Hart 1994, pp. 181-82). Insofar as the object-level interpretation of the separability thesis denies it is a necessary truth that there are moral constraints on legal validity, it implies the existence of a possible legal system in which there are no moral constraints on legal validity.

a. Inclusive vs. Exclusive Positivism

Though all positivists agree there are possible legal systems without moral constraints on legal validity, there are conflicting views on whether there are possible legal systems with such constraints. According to inclusive positivism (also known as incorporationism and soft positivism), it is possible for a society’s rule of recognition to incorporate moral constraints on the content of law. Prominent inclusive positivists include Jules Coleman and H.L.A. Hart, who maintains that “the rule of recognition may incorporate as criteria of legal validity conformity with moral principles or substantive values … such as the Sixteenth or Nineteenth Amendments to the United States Constitution respecting the establishment of religion or abridgements of the right to vote” (Hart 1994, p. 250).

In contrast, exclusive positivism (also called hard positivism) denies that a legal system can incorporate moral constraints on legal validity. Exclusive positivists like Joseph Raz (1979, p. 47) subscribe to the source thesis, according to which the existence and content of law can always be determined by reference to its sources without recourse to moral argument. On this view, the sources of law include both the circumstances of its promulgation and relevant interpretative materials, such as court cases involving its application.

At first glance, exclusive positivism may seem difficult to reconcile with what appear to be moral criteria of legal validity in legal systems like that of the United States. For example, the Fourth Amendment provides that “[t]he right of the people to be secure in their persons, houses, papers, and effects against unreasonable searches and seizures, shall not be violated.” Likewise, the First Amendment prohibits laws abridging the right of free speech. Taken at face value, these amendments seem to make moral standards part of the conditions for legal validity.

Exclusive positivists argue that such amendments can require judges to consider moral standards in certain circumstances, but cannot incorporate those standards into the law. When a judge makes reference to moral considerations in deciding a case, she necessarily creates new law on an issue-and this is so even when the law directs her to consider moral considerations, as the Bill of Rights does in certain circumstances. On this view, all law is settled law and questions of settled law can be resolved without recourse to moral arguments:

The law on a question is settled when legally binding sources provide its solution. In such cases judges are typically said to apply the law, and since it is source-based, its application involves technical, legal skills in reasoning from those sources and does not call for moral acumen. If a legal question is not answered by standards deriving from legal sources then it lacks a legal answer-the law on such questions is unsettled. In deciding such cases courts inevitably break new (legal) ground and their decision develops the law…. Naturally, their decisions in such cases rely at least partly on moral and other extra-legal considerations (Raz 1979, pp. 49-50).

If the judge can resolve an issue involving the First Amendment merely by applying past court decisions, then the issue is settled by the law; if not, then the issue is unsettled. Insofar as the judge looks to controversial moral standards to resolve the issue, she is going beyond the law because the mere presence of controversy about the law implies that it is indeterminate. Thus, on Raz’s view, references to moral language in the law, at most, direct judges to consider moral requirements in resolving certain unsettled questions of law. They cannot incorporate moral requirements into the law.

3. The Discretion Thesis

Third thesis commonly associated with positivism is the discretion thesis, according to which judges decide difficult cases by making new law in the exercise of discretion. Ronald Dworkin describes this thesis as follows:

The set of these valid legal rules is exhaustive of ‘the law’, so that if someone’s case is not clearly covered by such a rule . . . then that case cannot be decided by ‘applying the law.’ It must be decided by some official, like a judge, ‘exercising his discretion,’ which means reaching beyond the law for some other sort of standard to guide him in manufacturing a fresh legal rule or supplementing an old one (Dworkin 1977, p. 17).

On this view, a judge cannot decide a case that does not fall clearly under a valid rule by interpreting or applying the law; she must decide the case by creating or promulgating a law that did not exist prior to the adjudication. Thus, the discretion thesis implies that judges are empowered with a quasi-legislative lawmaking authority in cases that cannot be decided merely by applying law.

Though often associated with positivism, the discretion thesis does not belong to positivism’s theoretical core. The pedigree and separability theses purport to be conceptual claims that are true of every possible legal system. These two claims jointly assert that, in every possible legal system, propositions of law are valid in virtue of having been manufactured according to some set of social conventions. On this view, there are no moral constraints on the content of law that hold in every possible legal system.

But many positivists regard the discretion thesis as a contingent claim that is true of some, but not all, possible legal systems. Hart, for example, believes there will inevitably arise cases that do not fall clearly under a rule, but concedes a rule of recognition could deny judges discretion to make law in such cases by requiring judges “to disclaim jurisdiction or to refer the points not regulated by the existing law to the legislature to decide” (Hart 1994, p. 272). Indeed, Hart’s inclusive positivism allows him to hold that a rule of recognition could require judges to decide cases in precisely the manner that Dworkin advocates (Hart 1994, p. 263; and see Section IV-2, infra). Thus, at least for inclusive positivists like Hart, the discretion thesis makes a different kind of claim than the conceptual claims that form positivism’s theoretical core (Himma 1999).

Moreover, the discretion thesis is consistent with some forms of natural law theory. According to Blackstone’s classical naturalism, conformity with the natural law is a necessary condition for legal validity in every possible legal system. But insofar as the natural law is incomplete, there will inevitably arise issues that have multiple outcomes consistent with the natural law. Since none of the relevant outcomes in such cases offend the natural law, there is nothing in the assumption of necessary moral constraints on the content of law, in and of itself, that precludes Blackstone from endorsing the discretion thesis in such cases. Of course, if Blackstone believes the natural law contains a principle denying discretion to judges, then that commitment is inconsistent with the discretion thesis. But the assertion there are necessary constraints on the content of law, in and of itself, is consistent with the discretion thesis, even construed as a conceptual claim, as long as there are cases to which the natural law is indifferent.

In any event, Dworkin distinguishes three different senses in which a judge might be said to have discretion: (1) a judge has discretion when she exercises judgment in applying a legal standard to a particular case; (2) a judge has discretion when her decision is not subject to reversal by any other authority; and (3) a judge has discretion when her decision is not bound by any legal standards.

According to Dworkin, positivism’s discretion thesis is committed to the third sense of discretion, which he refers to as strong discretion. On Dworkin’s view, the thesis that judges have discretion only in the sense that they exercise judgment is trivially true, while the thesis that judges have discretion in the sense that their decisions are not subject to being reversed by a higher authority is false. Even the Supreme Court can be reversed by Congress or by constitutional amendment. Thus, on Dworkin’s view, the discretion thesis implies that judges have discretion to decide hard cases by what amounts to an act of legislation because the judge is not bound by any legal standards.

Thus construed, the discretion thesis is inconsistent with ordinary legal practice. Even in the most difficult of cases where there is no clearly applicable law, lawyers do not ask that the judge decide the relevant issue by making new law. Each lawyer cites cases favorable to her client’s position and argues that the judge is bound by those cases to decide in her client’s favor. As a practical matter, lawyers rarely, if ever, concede there are no legal standards governing a case and ask the judge to legislate in the exercise of discretion.

Nevertheless, the problem with Dworkin’s analysis is that it falsely presupposes an official cannot make new law unless there are no legal standards constraining the official’s decision. Indeed, lawmaking authorities in legal systems like the U.S. never have what Dworkin describes as strong discretion. Even the legislative decisions of Congress, the highest legislative authority in the nation, are always constrained by constitutional standards. For example, under the Fourteenth Amendment, Congress cannot enact a law that sets one speed limit for male drivers on interstate highways and another for female drivers.

For his part, Hart concedes that judicial lawmaking authority is limited in two respects: “not only are the judge’s powers subject to many constraints narrowing his choice from which a legislature may be quite free, but since the judge’s powers are exercised only to dispose of particular instant cases he cannot use these to introduce large-scale reforms or new codes” (Hart 1994, p. 273). What explains the judge’s discretion to make new law in a given case, on Hart’s view, is not the absence of legal standards constraining her decision; rather it is the absence of legal standards that dictate a uniquely correct answer to the case. The judge cannot decide such a case merely by applying existing law because there is more than one available outcome that coheres with existing law. In such instances, it is impossible to render a substantive decision (as opposed to simply referring the matter back to the legislature) without creating new law.

The discretion thesis is vulnerable to one powerful objection. Insofar as a judge decides a difficult case by making new law in the exercise of discretion, the case is being decided on the basis of a law that did not exist at the time the dispute arose. If, for example, a judge awards damages to a plaintiff by making new law in the exercise of discretion, it follows that she has held the defendant liable under a law that did not exist at the time the dispute arose. And, as Dworkin points out, it seems patently unfair to deprive a defendant of property for behavior that did not give rise to liability at the time the behavior occurred.

Nevertheless, Dworkin’s view fares no better on this count. While Dworkin acknowledges the existence of difficult cases that do not fall clearly under a rule, he believes they are not resolved by an exercise of judicial discretion. On Dworkin’s view, there is always a right answer to such cases implicit in the pre-existing law. Of course, it sometimes takes a judge of Herculean intellectual ability to discern what the right answer is, but it is always there to be found in pre-existing law. Since the right answer to even hard legal disputes is always part of pre-existing law, Dworkin believes that a judge can take property from a defendant in a hard case without unfairness (Dworkin 1977, pp. 87-130).

But if fairness precludes taking property from a defendant under a law that did not exist at the time of the relevant behavior, it also precludes taking property from a defendant under a law that did not give reasonable notice that the relevant behavior gives rise to liability. Due process and fundamental fairness require reasonable notice of which behaviors give rise to liability. As long as Dworkin acknowledges the existence of cases so difficult that only the best of judges can solve them, his theory is vulnerable to the same charge of unfairness that he levels at the discretion thesis.

4. Classic Criticisms of Positivism

a. Fuller’s Internal Morality of Law

In The Morality of Law, Lon L. Fuller argues that law is subject to an internal morality consisting of eight principles: (P1) the rules must be expressed in general terms; (P2) the rules must be publicly promulgated; (P3) the rules must be (for the most part) prospective in effect; (P4) the rules must be expressed in understandable terms; (P5) the rules must be consistent with one another; (P6) the rules must not require conduct beyond the powers of the affected parties; (P7) the rules must not be changed so frequently that the subject cannot rely on them; and (P8) the rules must be administered in a manner consistent with their wording (Fuller 1964, p. 39).

On Fuller’s view, no system of rules that fails minimally to satisfy these principles of legality can achieve law’s essential purpose of achieving social order through the use of rules that guide behavior. A system of rules that fails to satisfy (P2) or (P4), for example, cannot guide behavior because people will not be able to determine what the rules require. Accordingly, Fuller concludes that his eight principles are “internal” to law in the sense that they are built into the existence conditions for law: “A total failure in any one of these eight directions does not simply result in a bad system of law; it results in something that is not properly called a legal system at all” (Fuller 1964, p. 39).

These internal principles constitute a morality, according to Fuller, because law necessarily has positive moral value in two respects: (1) law conduces to a state of social order and (2) does so by respecting human autonomy because rules guide behavior. Since no system of rules can achieve these morally valuable objectives without minimally complying with the principles of legality, it follows, on Fuller’s view, that they constitute a morality. Since these moral principles are built into the existence conditions for law, they are internal and hence represent a conceptual connection between law and morality that is inconsistent with the separability thesis.

Hart responds by denying Fuller’s claim that the principles of legality constitute an internal morality; on Hart’s view, Fuller confuses the notions of morality and efficacy:

[T]he author’s insistence on classifying these principles of legality as a “morality” is a source of confusion both for him and his readers…. [T]he crucial objection to the designation of these principles of good legal craftsmanship as morality, in spite of the qualification “inner,” is that it perpetrates a confusion between two notions that it is vital to hold apart: the notions of purposive activity and morality. Poisoning is no doubt a purposive activity, and reflections on its purpose may show that it has its internal principles. (“Avoid poisons however lethal if they cause the victim to vomit”….) But to call these principles of the poisoner’s art “the morality of poisoning” would simply blur the distinction between the notion of efficiency for a purpose and those final judgments about activities and purposes with which morality in its various forms is concerned (Hart 1965, pp. 1285-86).

On Hart’s view, all actions, including virtuous acts like lawmaking and impermissible acts like poisoning, have their own internal standards of efficacy. But insofar as such standards of efficacy conflict with morality, as they do in the case of poisoning, it follows that they are distinct from moral standards. Thus, while Hart concedes that something like Fuller’s eight principles are built into the existence conditions for law, he concludes that they do not constitute a conceptual connection between law and morality.

Unfortunately, Hart’s response overlooks the fact that most of Fuller’s eight principles double as moral ideals of fairness. For example, public promulgation in understandable terms may be a necessary condition for efficacy, but it is also a moral ideal; it is morally objectionable for a state to enforce rules that have not been publicly promulgated in terms reasonably calculated to give notice of what is required. Similarly, we take it for granted that it is wrong for a state to enact retroactive rules, inconsistent rules, and rules that require what is impossible. Poisoning may have its internal standards of efficacy, but such standards are distinguishable from the principles of legality in that they conflict with moral ideals.

Nevertheless, Fuller’s principles operate internally, not as moral ideals, but merely as principles of efficacy. As Fuller would likely acknowledge, the existence of a legal system is consistent with considerable divergence from the principles of legality. Legal standards, for example, are necessarily promulgated in general terms that inevitably give rise to problems of vagueness. And officials all too often fail to administer the laws in a fair and even-handed manner-even in the best of legal systems. These divergences may always be prima facie objectionable, but they are inconsistent with a legal system only when they render a legal system incapable of performing its essential function of guiding behavior. Insofar as these principles are built into the existence conditions for law, it is because they operate as efficacy conditions-and not because they function as moral ideals.

Fuller’s jurisprudential legacy, however, should not be underestimated. While positivists have long acknowledged that law’s essential purpose is to guide behavior through rules (e.g., John Austin writes that “[a] law .. may be defined as a rule laid down for the guidance of an intelligent being by an intelligent being having power over him” Austin 1977, p. 5), they have not always appreciated the implications of this purpose. Fuller’s lasting contribution to the theory of law was to flesh out these implications in the form of his principles of legality.

b. Positivism and Legal Principles

Dworkin argues that, in deciding hard cases, judges often invoke legal principles that do not derive their authority from an official act of promulgation (Dworkin 1977, p. 40). These principles, Dworkin believes, must be characterized as law because judges are bound to consider them when relevant. But if unpromulgated legal principles constitute law, then it is false, contra the pedigree thesis, that a proposition of law is valid only in virtue of having been formally promulgated.

According to Dworkin, principles and rules differ in the kind of guidance they provide to judges:

Rules are applicable in an all-or-nothing fashion. If the facts a rule stipulates are given, then either the rule is valid, in which case the answer it supplies must be accepted, or it is not, in which case it contributes nothing to the decision…. But this is not the way principles operate…. [A principle] states a reason that argues in one direction, but does not necessitate a particular decision (Dworkin 1977, pp. 24-25).

On Dworkin’s view, conflicting principles provide competing reasons that must be weighed according to the importance of the respective values they express. Thus, rules are distinguishable from principles in two related respects: (1) rules necessitate, where principles only suggest, a particular outcome; and (2) principles have, where rules lack, the dimension of weight.

Dworkin cites the case of Riggs v. Palmer as representative of how judges use principles to decide hard cases. In Riggs, the court considered the question of whether a murderer could take under the will of his victim. At the time the case was decided, neither the statutes nor the case law governing wills expressly prohibited a murderer from taking under his victim’s will. Despite this, the court declined to award the defendant his gift under the will on the ground that it would be wrong to allow him to profit from such a grievous wrong. On Dworkin’s view, the court decided the case by citing “the principle that no man may profit from his own wrong as a background standard against which to read the statute of wills and in this way justified a new interpretation of that statute” (Dworkin 1977, p. 29).

The positivist might respond that when the Riggs court considered this principle, it was reaching beyond the law to extralegal standards in the exercise of judicial discretion. But Dworkin points out that the Riggs judges would “rightfully” have been criticized had they failed to consider this principle; if it were merely an extralegal standard, there would be no rightful grounds to criticize a failure to consider it (Dworkin 1977, p. 35). Accordingly, Dworkin concludes that the best explanation for the propriety of such criticism is that principles are part of the law.

Further, Dworkin maintains that the legal authority of standards like the Riggs principle cannot derive from promulgation in accordance with purely formal requirements: “[e]ven though principles draw support from the official acts of legal institutions, they do not have a simple or direct enough connection with these acts to frame that connection in terms of criteria specified by some ultimate master rule of recognition” (Dworkin 1977, p. 41). Unlike legal rules, legal principles lack a canonical form and hence cannot be explained by formal promulgation.

On Dworkin’s view, the legal authority of a binding principle derives from the contribution it makes to the best moral justification for a society’s legal practices considered as a whole. According to Dworkin, a legal principle maximally contributes to such a justification if and only if it satisfies two conditions: (1) the principle coheres with existing legal materials; and (2) the principle is the most morally attractive standard that satisfies (1). The correct legal principle is the one that makes the law the moral best it can be. Thus, Dworkin concludes, “if we treat principles as law we must reject the positivists’ first tenet, that the law of a community is distinguished from other social standards by some test in the form of a master rule” (Dworkin 1977, p. 44).

In response, positivists concede that there are legal principles, but argue that their authority as law can be explained in terms of the conventions contained in the rule of recognition:

Legal principles, like other laws, can be enacted or repealed by legislatures and administrative authorities. They can also become legally binding through establishment by the courts. Many legal systems recognize that both rules and principles can be made into law or lose their status as law through precedent (Raz 1972, p. 848).

According to this view, legal principles are like legal rules in that both derive their authority under the rule of recognition from the official acts of courts and legislatures. If the Riggs principle that no person shall profit from her own wrong has legal authority, it is because that principle was either declared by a court in the course of adjudicating a dispute or formally promulgated by the appropriate legislative body.

Further, inclusive positivists argue that Dworkin’s account of principles is itself consistent with the pedigree thesis. As Hart puts it, “this interpretative test seems not to be an alternative to a criterion provided by a rule of recognition, but … only a complex ‘soft-positivist’ form of such a criterion identifying principles by their content not by their pedigree” (Hart 1994, p. 263). The idea, familiar from Section II, is that a rule of recognition can incorporate content-based constraints on legal validity, even those rooted ultimately in morality.

c. The Semantic Sting

In Law’s Empire, Dworkin distinguishes two kinds of disagreement legal practitioners can have about the law. Lawyers can agree on the criteria a rule must satisfy to be legally valid, but disagree on whether those criteria are satisfied by a particular rule. For example, two lawyers might agree that a rule is valid if enacted by the state legislature, but disagree on whether the rule at issue was actually enacted by the state legislature. Such disagreements are empirical in nature and hence pose no theoretical difficulties for positivism.

There is, however, a second kind of disagreement that Dworkin believes is inconsistent with positivism. Lawyers often agree on the facts about a rule’s creation, but disagree on whether those facts are sufficient to endow the rule with legal authority. Such disagreement is considerably deeper than empirical disagreement as it concerns the criteria for legal validity-which, according to positivism, are exhausted by the rule of recognition. Dworkin calls this second kind of disagreement theoretical disagreement about the law.

Theoretical disagreement, on Dworkin’s view, is inconsistent with the pedigree thesis because the pedigree thesis explains the concept of law in terms of shared criteria for creating, changing and adjudicating law:

If legal argument is mainly or even partly about [the properties that make a proposition legally valid], then lawyers cannot all be using the same factual criteria for deciding when propositions of law are true and false. Their arguments would be mainly or partly about which criteria they should use. So the project of the semantic theories, the project of digging out shared rules from a careful study of what lawyers say and do, would be doomed to fail (Dworkin 1986, p. 43).

If lawyers disagree about the criteria of legal validity, then the grounds of legal validity cannot be exhausted by the shared criteria contained in a rule of recognition. The semantic sting, then, implies that there must be more to the concept of legal validity than can be explained by promulgation in accordance with shared criteria embodied in a rule of recognition.

The semantic sting resembles one of Dworkin’s earlier criticisms of Hart’s pedigree thesis. Hart believes that the rule of recognition is a social rule and is hence constituted by the conforming behavior of people who also accept the rule as a ground for criticizing deviations. Like all social rules, then, the rule of recognition has an external and internal aspect. The external aspect of the rule of recognition consists in general obedience to those rules satisfying its criteria of validity; the internal aspect is constituted by its acceptance as a public standard of official behavior. Hart believes it is this double aspect of the rule of recognition that accounts for its normativity and enables him to distinguish his theory from Austin’s view of law as a system of coercive commands. For, as Hart points out, a purely coercive command can oblige, but never obligate, a person to comply (see Section I, supra).

Dworkin argues that this feature of Hart’s theory commits him to the claim that there cannot be any disagreement about the content of rule of recognition:

Hart’s qualification … that the rule of recognition may be uncertain at particular points … undermines [his theory]…. If judges are in fact divided about what they must do if a subsequent Parliament tries to repeal an entrenched rule, then it is not uncertain whether any social rule [of recognition] governs that decision; on the contrary, it is certain that none does (Dworkin 1977, pp. 61-62).

On Dworkin’s view, the requirements of a social rule cannot be uncertain since a social rule is constituted by acceptance and conforming behavior by most people in the relevant group: “two people whose rules differ … cannot be appealing to the same social rule, and at least one of them cannot be appealing to any social rule at all” (Dworkin 1977, p. 55).

Jules Coleman responds that if the rule of recognition is a social rule, then Hart’s view implies there must be general agreement among the officials of a legal system about what standards constitute the rule of recognition, but it does not imply there cannot be disagreement as to what those standards require in any given instance:

The controversy among judges does not arise over the content of the rule of recognition itself. It arises over which norms satisfy the standards set forth in it. The divergence in behavior among officials as exemplified in their identifying different standards as legal ones does not establish their failure to accept the same rule of recognition. On the contrary, judges accept the same truth conditions for propositions of law…. They disagree about which propositions satisfy those conditions (Coleman 1982, p. 156).

Coleman, then, distinguishes two kinds of disagreement practitioners can have about the rule of recognition: (1) disagreement about what standards constitute the rule of recognition; and (2) disagreement about what propositions satisfy those standards. On Coleman’s view, Hart’s analysis of social rules implies only that (1) is impossible.

Under the U.S. rule of recognition, for example, a federal statute is legally valid if and only if it has been enacted in accordance with the procedural requirements described in the body of the Constitution and is consistent with the first fourteen amendments. Since, on Hart’s view, the U.S. rule of recognition is a social rule, U.S. officials must agree on the procedures the federal government must follow in enacting law, the set of sentences constituting the first fourteen amendments, and the requirement that federal enactments be consistent with those amendments.

But Hart’s view of social rules does not imply there cannot be any disagreement about whether a given enactment is consistent with the first fourteen amendments. Legal practitioners can and do disagree on what Hart calls penumbral (or borderline) issues regarding the various amendments. While every competent practitioner in the U.S. would agree, for example, that torturing a person to induce a confession violates the fifth amendment right against self-incrimination, there is considerable disagreement about whether compelling a defendant to undergo a psychiatric examination for the purpose of increasing her sentence also violates that right. On Coleman’s view, there is nothing in Hart’s analysis of social rules that precludes such borderline disagreements about whether a practice is consistent with the Fifth Amendment.

Despite its resemblance to this earlier criticism, Dworkin’s semantic sting argument takes aim at a deeper target. The semantic sting targets all so-called semantic theories of law that articulate the concept of law in terms of “shared rules … that set out criteria that supply the word’s meaning” (Dworkin 1986, p. 31). Thus, while the earlier criticism is directed at Hart’s extraneous account of social rules, the semantic sting is directed at what Dworkin takes to be the very heart of positivism’s theoretical core, namely, the claim that there are shared criteria that exhaust the conditions for the correct application of the concept of law.

At the root of the problem with semantic theories, on Dworkin’s view, is a flawed theory of what makes disagreement possible. According to Dworkin, semantic theories mistakenly assume that meaningful disagreement is impossible unless “we all accept and follow the same criteria for deciding when our claims are sound, even if we cannot state exactly, as a philosopher might hope to do, what these criteria are” (Dworkin 1986, p. 45). On this flawed assumption, two people whose concepts of law differ cannot be disagreeing about the same thing.

Perhaps with Coleman’s response to his earlier criticism in mind, Dworkin concedes that semantic theories are consistent with theoretical disagreements about borderline or penumbral cases: “people do sometimes speak at cross-purposes in the way the borderline defense describes” (Dworkin 1986, p. 41). But Dworkin denies semantic theories are consistent with theoretical disagreement about pivotal (or core) cases.  According to semantic theories, he says,

[Y]ou and I can sensibly discuss how many books I have on my shelf, for example, only if we both agree, at least roughly, about what a book is. We can disagree over borderline cases: I may call something a slim book that you would call a pamphlet. But we cannot disagree over what I called pivotal cases. If you do not count my copy of Moby-Dick as a book because in your view novels are not books, any disagreement is bound to be senseless (Dworkin 1986, p. 45).

The problem, on Dworkin’s view, is that many difficult appellate cases like Riggs involve theoretical disagreement about pivotal cases:

The various judges who argued about our sample cases did not think they were defending marginal or borderline claims. Their disagreements about legislation and precedent were fundamental; their arguments showed that they disagreed not only about whether Elmer should have his inheritance, but about why any legislative act, even traffic codes and rates of taxation, impose the rights and obligations everyone agrees they do…. They disagreed about what makes a proposition of law true not just at the margin but in the core as well (Dworkin 1986, pp. 42-43).

On Dworkin’s view, the judges in Riggs were not having a borderline dispute about some accepted criterion for the application of the concept of law. Rather, they were having a disagreement about the status of some putatively fundamental criterion itself: the majority believed, while the dissent denied, that courts have power to modify unambiguous legislative enactments.

Accordingly, theoretical disagreement about pivotal cases like Riggs is inconsistent with semantic theories of law, on Dworkin’s view, because it shows that shared criteria do not exhaust the proper conditions for the application of the concept of law. For the majority and dissenting judges in Riggs were having a sensible disagreement about law even though it centered on a pivotal case involving the criteria of legal validity. Thus, Dworkin concludes, the concept of law cannot be explained by so-called criterial semantics.

In response, Hart denies both that his theory is a semantic theory and that it assumes such an account of what makes disagreement possible:

[N]othing in my book or in anything else I have written supports [a semantic account] of my theory. Thus, my doctrine that developed municipal legal systems contain a rule of recognition specifying the criteria for the identification of the laws which courts have to apply may be mistaken, but I nowhere base this doctrine on the mistaken idea that it is part of the meaning of the word ‘law’ that there should be such a rule of recognition in all legal systems, or on the even more mistaken idea that if the criteria for the identification of the grounds of law were not uncontroversially fixed, ‘law’ would mean different things to different people (Hart 1994, p. 246).

Instead, Hart argues that his theory of law is “a descriptive account of the distinctive features of law in general as a complex social phenomenon” (Hart 1994, p. 246). Hart presents his theory, not as an account of how people apply the concept of law, but rather as an account of what distinguishes systems of law from other systems of social rules. On Hart’s view, it is the presence of a rule of recognition establishing criteria of validity that distinguishes law from other systems of social rules. Thus, according to Hart, Dworkin’s criticism fails because it mischaracterizes positivism as providing a criterial explanation of the concept of law.

5. References and Further Reading

  • Austin, John, Lectures on Jurisprudence and the Philosophy of Positive Law (St. Clair Shores, MI: Scholarly Press, 1977)
  • Austin, John, The Province of Jurisprudence Determined (Cambridge: Cambridge University Press, 1995)
  • Bentham, Jeremy, Of Laws In General (London: Athlone Press, 1970)
  • Blackstone, William, Commentaries on the Law of England (Chicago: The University of Chicago Press, 1979)
  • Coleman, Jules, “Negative and Positive Positivism,” 11 Journal of Legal Studies 139 (1982)
  • Dworkin, Ronald M., Law’s Empire (Cambridge: Harvard University Press, 1986)
  • Dworkin, Ronald M., Taking Rights Seriously (Cambridge: Harvard University Press, 1977)
  • Finnis, John, Natural Law and Natural Rights (Oxford: Clarendon Press, 1980)
  • Fuller, Lon L., The Morality of Law, Revised Edition (New Haven: Yale University Press, 1969)
  • Fuller, Lon L., “Positivism and Fidelity to Law–A Reply to Professor Hart,” 71 Harvard Law Review 630 (1958)
  • Faber, Klaus, “Farewell to ‘Legal Positivism’: The Separation Thesis Unraveling,” in George, Robert P., The Autonomy of Law: Essays on Legal Positivism (Oxford: Clarendon Press, 1996), 119-162
  • George, Robert P., “Natural Law and Positive Law,” in George, Robert P., The Autonomy of Law: Essays on Legal Positivism (Oxford: Clarendon Press, 1996), 321-334
  • Hart, H.L.A., The Concept of Law, Second Edition (Oxford: Clarendon Press, 1994)
  • Hart, H.L.A., “American Jurisprudence through English Eyes: The Nightmare and the Noble Dream,” reprinted in Hart, H.L.A., Essays in Jurisprudence and Philosophy (Oxford: Clarendon Press, 1983), 123-144.
  • Hart, H.L.A., “Book Review of The Morality of Law” 78 Harvard Law Review 1281 (1965)
  • Hart, H.L.A., Essays on Bentham (Oxford: Clarendon Press, 1982)
  • Hart, H.L.A., “Positivism and the Separation of Law and Morals,” 71 Harvard Law Review 593 (1958)
  • Himma, Kenneth E., “Judicial Discretion and the Concept of Law,” forthcoming in Oxford Journal of Legal Studies vol. 18, no. 1 (1999)
  • Mackie, J.L., “The Third Theory of Law,” Philosophy & Public Affairs, vol. 7, no. 1 (Fall 1977)
  • Moore, Michael, “Law as a Functional Kind,” in George, Robert P. (ed.), Natural Law Theory: Contemporary Essays (Oxford: Clarendon Press, 1992), 188-242
  • Raz, Joseph, The Authority of Law: Essays on Law and Morality (Oxford: Clarendon Press, 1979)
  • Raz, Joseph, “Authority, Law and Morality,” The Monist, vol. 68, 295-324
  • Raz, Joseph, “Legal Principles and the Limits of Law,” 81 Yale Law Review 823 (1972)
  • Raz, Joseph, “Two Views of the Nature of the Theory of Law: A Partial Comparison,” Legal Theory, vol. 4, no. 3 (September 1998), 249-282
  • Waluchow, W.J., Inclusive Legal Positivism (Oxford: Clarendon Press, 1994)

Author Information

Kenneth Einar Himma
Email: himma@spu.edu
Seattle Pacific University
U. S. A.

An encyclopedia of philosophy articles written by professional philosophers.