Existentialism

Existentialism is a catch-all term for those philosophers who consider the nature of the human condition as a key philosophical problem and who share the view that this problem is best addressed through ontology. This very broad definition will be clarified by discussing seven key themes that existentialist thinkers address. Those philosophers considered existentialists are mostly from the continent of Europe, and date from the 19th and 20th centuries. Outside philosophy, the existentialist movement is probably the most well-known philosophical movement, and at least two of its members are among the most famous philosophical personalities and widely read philosophical authors. It has certainly had considerable influence outside philosophy, for example on psychological theory and on the arts. Within philosophy, though, it is safe to say that this loose movement considered as a whole has not had a great impact, although individuals or ideas counted within it remain important. Moreover, most of the philosophers conventionally grouped under this heading either never used, or actively disavowed, the term ‘existentialist’. Even Sartre himself once said: “Existentialism? I don’t know what that is.” So, there is a case to be made that the term – insofar as it leads us to ignore what is distinctive about philosophical positions and to conflate together significantly different ideas – does more harm than good.

In this article, however, it is assumed that something sensible can be said about existentialism as a loosely defined movement. The article has three sections. First, we outline a set of themes that define, albeit very broadly, existentialist concerns. This is done with reference to the historical context of existentialism, which will help us to understand why certain philosophical problems and methods were considered so important. Second, we discuss individually six philosophers who are arguably its central figures, stressing in these discussions the ways in which these philosophers approached existentialist themes in distinctive ways. These figures, and many of the others we mention, have full length articles of their own within the Encyclopedia. Finally, we look very briefly at the influence of existentialism, especially outside philosophy.

Table of Contents

  1. Key Themes of Existentialism
    1. Philosophy as a Way of Life
    2. Anxiety and Authenticity
    3. Freedom
    4. Situatedness
    5. Existence
    6. Irrationality/Absurdity
    7. The Crowd
  2. Key Existentialist Philosophers
    1. Søren Kierkegaard (1813-1855) as an Existentialist Philosopher
    2. Friedrich Nietzsche (1844-1900) as an Existentialist Philosopher
    3. Martin Heidegger (1889-1976) as an Existentialist Philosopher
    4. Jean-Paul Sartre (1905-1980) as an Existentialist Philosopher
    5. Simone de Beauvoir (1908-1986) as an Existentialist Philosopher
    6. Albert Camus (1913-1960) as an Existentialist Philosopher
  3. The Influence of Existentialism
    1. The Arts and Psychology
    2. Philosophy
  4. References and Further Reading
    1. General Introductions
    2. Anthologies
    3. Primary Bibliography
    4. Secondary Bibliography
    5. Other Works Cited

1. Key Themes of Existentialism

Although a highly diverse tradition of thought, seven themes can be identified that provide some sense of overall unity. Here, these themes will be briefly introduced; they can then provide us with an intellectual framework within which to discuss exemplary figures within the history of existentialism.

a. Philosophy as a Way of Life

Philosophy should not be thought of primarily either as an attempt to investigate and understand the self or the world, or as a special occupation that concerns only a few. Rather, philosophy must be thought of as fully integrated within life. To be sure, there may need to be professional philosophers, who develop an elaborate set of methods and concepts (Sartre makes this point frequently) but life can be lived philosophically without a technical knowledge of philosophy.  Existentialist thinkers tended to identify two historical antecedents for this notion. First, the ancient Greeks, and particularly the figure of Socrates but also the Stoics and Epicureans. Socrates was not only non-professional, but in his pursuit of the good life he tended to eschew the formation of a ‘system’ or ‘theory’, and his teachings took place often in public spaces. In this, the existentialists were hardly unusual. In the 19th and 20th centuries, the rapid expansion of industrialisation and advance in technology were often seen in terms of an alienation of the human from nature or from a properly natural way of living (for example, thinkers of German and English romanticism).

The second influence on thinking of philosophy as a way of life was German Idealism after Kant. Partly as a response to the 18th century Enlightenment, and under the influence of the Neoplatonists, Schelling and Hegel both thought of philosophy as an activity that is an integral part of the history of human beings, rather than outside of life and the world, looking on. Later in the 19th century, Marx famously criticised previous philosophy by saying that the point of philosophy is not to know things – even to know things about activity – but to change them.  The concept of philosophy as a way of life manifests itself in existentialist thought in a number of ways. Let us give several examples, to which we will return in the sections that follow. First, the existentialists often undertook a critique of modern life in terms of the specialisation of both manual and intellectual labour. Specialisation included philosophy. One consequence of this is that many existentialist thinkers experimented with different styles or genres of writing in order to escape the effects of this specialisation. Second, a notion that we can call ‘immanence’: philosophy studies life from the inside. For Kierkegaard, for example, the fundamental truths of my existence are not representations – not, that is, ideas, propositions or symbols the meaning of which can be separated from their origin. Rather, the truths of existence are immediately lived, felt and acted. Likewise, for Nietzsche and Heidegger, it is essential to recognise that the philosopher investigating human existence is, him or herself, an existing human. Third, the nature of life itself is a perennial existentialist concern and, more famously (in Heidegger and in Camus), also the significance of death.

b. Anxiety and Authenticity

A key idea here is that human existence is in some way ‘on its own’; anxiety (or anguish) is the recognition of this fact. Anxiety here has two important implications. First, most generally, many existentialists tended to stress the significance of emotions or feelings, in so far as they were presumed to have a less culturally or intellectually mediated relation to one’s individual and separate existence. This idea is found in Kierkegaard, as we mentioned above, and in Heidegger’s discussion of ‘mood’; it is also one reason why existentialism had an influence on psychology. Second, anxiety also stands for a form of existence that is recognition of being on its own. What is meant by ‘being on its own’ varies among philosophers. For example, it might mean the irrelevance (or even negative influence) of rational thought, moral values, or empirical evidence, when it comes to making fundamental decisions concerning one’s existence. As we shall see, Kierkegaard sees Hegel’s account of religion in terms of the history of absolute spirit as an exemplary confusion of faith and reason. Alternatively, it might be a more specifically theological claim: the existence of a transcendent deity is not relevant to (or is positively detrimental to) such decisions (a view broadly shared by Nietzsche and Sartre). Finally, being on its own might signify the uniqueness of human existence, and thus the fact that it cannot understand itself in terms of other kinds of existence (Heidegger and Sartre).

Related to anxiety is the concept of authenticity, which is let us say the existentialist spin on the Greek notion of ‘the good life’. As we shall see, the authentic being would be able to recognise and affirm the nature of existence (we shall shortly specify some of the aspects of this, such as absurdity and freedom). Not, though, recognise the nature of existence as an intellectual fact, disengaged from life; but rather, the authentic being lives in accordance with this nature. The notion of authenticity is sometimes seen as connected to individualism. This is only reinforced by the contrast with a theme we will discuss below, that of the ‘crowd’. Certainly, if authenticity involves ‘being on one’s own’, then there would seem to be some kind of value in celebrating and sustaining one’s difference and independence from others. However, many existentialists see individualism as a historical and cultural trend (for example Nietzsche), or dubious political value (Camus), rather than a necessary component of authentic existence. Individualism tends to obscure the particular types of collectivity that various existentialists deem important.

For many existentialists, the conditions of the modern world make authenticity especially difficult. For example, many existentialists would join other philosophers (such as the Frankfurt School) in condemning an instrumentalist conception of reason and value. The utilitarianism of Mill measured moral value and justice also in terms of the consequences of actions. Later liberalism would seek to absorb nearly all functions of political and social life under the heading of economic performance. Evaluating solely in terms of the measurable outcomes of production was seen as reinforcing the secularisation of the institutions of political, social or economic life; and reinforcing also the abandonment of any broader sense of the spiritual dimension (such an idea is found acutely in Emerson, and is akin to the concerns of Kierkegaard). Existentialists such as Martin Heidegger, Hanna Arendt or Gabriel Marcel viewed these social movements in terms of a narrowing of the possibilities of human thought to the instrumental or technological. This narrowing involved thinking of the world in terms of resources, and thinking of all human action as a making, or indeed as a machine-like ‘function’.

c. Freedom

The next key theme is freedom. Freedom can usefully be linked to the concept of anguish, because my freedom is in part defined by the isolation of my decisions from any determination by a deity, or by previously existent values or knowledge. Many existentialists identified the 19th and 20th centuries as experiencing a crisis of values. This might be traced back to familiar reasons such as an increasingly secular society, or the rise of scientific or philosophical movements that questioned traditional accounts of value (for example Marxism or Darwinism), or the shattering experience of two world wars and the phenomenon of mass genocide. It is important to note, however, that for existentialism these historical conditions do not create the problem of anguish in the face of freedom, but merely cast it into higher relief. Likewise, freedom entails something like responsibility, for myself and for my actions. Given that my situation is one of being on its own – recognised in anxiety – then both my freedom and my responsibility are absolute. The isolation that we discussed above means that there is nothing else that acts through me, or that shoulders my responsibility. Likewise, unless human existence is to be understood as arbitrarily changing moment to moment, this freedom and responsibility must stretch across time. Thus, when I exist as an authentically free being, I assume responsibility for my whole life, for a ‘project’ or a ‘commitment’. We should note here that many of the existentialists take on a broadly Kantian notion of freedom: freedom as autonomy. This means that freedom, rather than being randomness or arbitrariness, consists in the binding of oneself to a law, but a law that is given by the self in recognition of its responsibilities. This borrowing from Kant, however, is heavily qualified by the next theme.

d. Situatedness

The next common theme we shall call ‘situatedness’. Although my freedom is absolute, it always takes place in a particular context. My body and its characteristics, my circumstances in a historical world, and my past, all weigh upon freedom. This is what makes freedom meaningful. Suppose I tried to exist as free, while pretending to be in abstraction from the situation. In that case I will have no idea what possibilities are open to me and what choices need to be made, here and now. In such a case, my freedom will be naïve or illusory. This concrete notion of freedom has its philosophical genesis in Hegel, and is generally contrasted to the pure rational freedom described by Kant. Situatedness is related to a notion we discussed above under the heading of philosophy as a way of life: the necessity of viewing or understanding life and existence from the ‘inside’.  For example, many 19th century intellectuals were interested in ancient Greece, Rome, the Medieval period, or the orient, as alternative models of a less spoiled, more integrated form of life. Nietzsche, to be sure, shared these interests, but he did so not uncritically: because the human condition is characterised by being historically situated, it cannot simply turn back the clock or decide all at once to be other than it is (Sartre especially shares this view). Heidegger expresses a related point in this way: human existence cannot be abstracted from its world because being-in-the-world is part of the ontological structure of that existence. Many existentialists take my concretely individual body, and the specific type of life that my body lives, as a primary fact about me (for example, Nietzsche, Scheler or Merleau-Ponty). I must also be situated socially: each of my acts says something about how I view others but, reciprocally, each of their acts is a view about what I am. My freedom is always situated with respect to the judgements of others. This particular notion comes from Hegel’s analysis of ‘recognition’, and is found especially in Sartre, de Beauvoir and Jaspers. Situatedness in general also has an important philosophical antecedent in Marx: economic and political conditions are not contingent features with respect to universal human nature, but condition that nature from the ground up.

e. Existence

Although, of course, existentialism takes its name from the philosophical theme of ‘existence’, this does not entail that there is homogeneity in the manner existence is to be understood. One point on which there is agreement, though, is that the existence with which we should be concerned here is not just any existent thing, but human existence. There is thus an important difference between distinctively human existence and anything else, and human existence is not to be understood on the model of things, that is, as objects of knowledge. One might think that this is an old idea, rooted in Plato’s distinction between matter and soul, or Descartes’ between extended and thinking things. But these distinctions appear to be just differences between two types of things. Descartes in particular, however, is often criticised by the existentialists for subsuming both under the heading ‘substance’, and thus treating what is distinctive in human existence as indeed a thing or object, albeit one with different properties. (Whether the existentialist characterisation of Plato or Descartes is accurate is a different question.) The existentialists thus countered the Platonic or Cartesian conception with a model that resembles more the Aristotelian as developed in the Nichomachean Ethics. The latter idea arrives in existentialist thought filtered through Leibniz and Spinoza and the notion of a striving for existence. Equally important is the elevation of the practical above the theoretical in German Idealists. Particularly in Kant, who stressed the primacy of the ‘practical’, and then in Fichte and early Schelling, we find the notion that human existence is action. Accordingly, in Nietzsche and Sartre we find the notion that the human being is all and only what that being does. My existence consists of forever bringing myself into being – and, correlatively, fleeing from the dead, inert thing that is the totality of my past actions. Although my acts are free, I am not free not to act; thus existence is characterised also by ‘exigency’ (Marcel). For many existentialists, authentic existence involves a certain tension be recognised and lived through, but not resolved: this tension might be between the animal and the rational (important in Nietzsche) or between facticity and transcendence (Sartre and de Beauvoir).

In the 19th and 20th centuries, the human sciences (such as psychology, sociology or economics) were coming to be recognised as powerful and legitimate sciences. To some extend at least their assumptions and methods seemed to be borrowed from the natural sciences. While philosophers such as Dilthey and later Gadamer were concerned to show that the human sciences had to have a distinctive method, the existentialists were inclined to go further. The free, situated human being is not an object of knowledge in the sense the human always exists as the possibility of transcending any knowledge of it. There is a clear relation between such an idea and the notion of the ‘transcendence of the other’ found in the ethical phenomenology of Emmanuel Levinas.

f. Irrationality/Absurdity

Among the most famous ideas associated with existentialism is that of ‘absurdity’. Human existence might be described as ‘absurd’ in one of the following senses. First, many existentialists argued that nature as a whole has no design, no reason for existing. Although the natural world can apparently be understood by physical science or metaphysics, this might be better thought of as ‘description’ than either understanding or explanation. Thus, the achievements of the natural sciences also empty nature of value and meaning. Unlike a created cosmos, for example, we cannot expect the scientifically described cosmos to answer our questions concerning value or meaning. Moreover, such description comes at the cost of a profound falsification of nature: namely, the positing of ideal entities such as ‘laws of nature’, or the conflation of all reality under a single model of being. Human beings can and should become profoundly aware of this lack of reason and the impossibility of an immanent understanding of it. Camus, for example, argues that the basic scene of human existence is its confrontation with this mute irrationality.  A second meaning of the absurd is this: my freedom will not only be undetermined by knowledge or reason, but from the point of view of the latter my freedom will even appear absurd. Absurdity is thus closely related to the theme of ‘being on its own’, which we discussed above under the heading of anxiety. Even if I choose to follow a law that I have given myself, my choice of law will appear absurd, and likewise will my continuously reaffirmed choice to follow it. Third, human existence as action is doomed to always destroy itself. A free action, once done, is no longer free; it has become an aspect of the world, a thing. The absurdity of human existence then seems to lie in the fact that in becoming myself (a free existence) I must be what I am not (a thing).  If I do not face up to this absurdity, and choose to be or pretend to be thing-like, I exist inauthentically (the terms in this formulation are Sartre’s).

g. The Crowd

Existentialism generally also carries a social or political dimension. Insofar as he or she is authentic, the freedom of the human being will show a certain ‘resolution’ or ‘commitment’, and this will involve also the being – and particularly the authentic being – of others. For example, Nietzsche thus speaks of his (or Zarathustra’s) work in aiding the transformation of the human, and there is also in Nietzsche a striking analysis of the concept of friendship; for Heidegger, there must be an authentic mode of being-with others, although he does not develop this idea at length; the social and political aspect of authentic commitment is much more clear in Sartre, de Beauvoir and Camus.

That is the positive side of the social or political dimension. However, leading up to this positive side, there is a description of the typical forms that inauthentic social or political existence takes. Many existentialists employ terms such as ‘crowd’, ‘horde’ (Scheler) or the ‘masses’ (José Ortega y Gasset). Nietzsche’s deliberately provocative expression, ‘the herd’, portrays the bulk of humanity not only as animal, but as docile and domesticated animals. Notice that these are all collective terms: inauthenticity manifests itself as de-individuated or faceless. Instead of being formed authentically in freedom and anxiety, values are just accepted from others because ‘that is what everybody does’. These terms often carry a definite historical resonance, embodying a critique of specifically modern modes of human existence. All of the following might be seen as either causes or symptoms of a world that is ‘fallen’ or ‘broken’ (Marcel): the technology of mass communication (Nietzsche is particularly scathing about newspapers and journalists; in Two Ages, Kierkegaard says something very similar), empty religious observances, the specialisation of labour and social roles, urbanisation and industrialisation. The theme of the crowd poses a question also to the positive social or political dimension of existentialism: how could a collective form of existence ever be anything other than inauthentic? The 19th and 20th century presented a number of mass political ideologies which might be seen as posing a particularly challenging environment for authentic and free existence. For example, nationalism came in for criticism particularly by Nietzsche. Socialism and communism: after WWII, Sartre was certainly a communist, but even then unafraid to criticise both the French communist party and the Soviet Union for rigid or inadequately revolutionary thinking. Democracy: Aristotle in book 5 of his Politics distinguishes between democracy and ochlocracy, which latter essentially means rule by those incapable of ruling even themselves. Many existentialists would identify the latter with the American and especially French concept of ‘democracy’. Nietzsche and Ortega y Gasset both espoused a broadly aristocratic criterion for social and political leadership.

2. Key Existentialist Philosophers

a. Søren Kierkegaard (1813-1855) as an Existentialist Philosopher

Kierkegaard was many things: philosopher, religious writer, satirist, psychologist, journalist, literary critic and generally considered the ‘father’ of existentialism. Being born (in Copenhagen) to a wealthy family enabled him to devote his life to the pursuits of his intellectual interests as well as to distancing himself from the ‘everyday man’ of his times.

Kierkegaard’s most important works are pseudonymous, written under fictional names, often very obviously fictional. The issue of pseudonymity has been variously interpreted as a literary device, a personal quirk or as an illustration of the constant tension between the philosophical truth and existential or personal truth. We have already seen that for the existentialists it is of equal importance what one says and the way in which something is said. This forms part of the attempt to return to a more authentic way of philosophising, firstly exemplified by the Greeks. In a work like Either/Or (primarily a treatise against the Hegelians) theoretical reflections are followed by reflections on how to seduce girls. The point is to stress the distance between the anonymously and logically produced truths of the logicians and the personal truths of existing individuals. Every pseudonymous author is a symbol for an existing individual and at times his very name is the key to the mysteries of his existence (like in the case of Johanes de Silentio, fictional author of Fear and Trembling, where the mystery of Abraham’s actions cannot be told, being a product of and belonging to silence).

Kierkegaard has been associated with a notion of truth as subjective (or personal); but what does this mean? The issue is linked with his notorious confrontation with the Danish Church and the academic environment of his days. Kierkegaard’s work takes place against the background of an academia dominated by Hegelian dialectics and a society which reduces the communication with the divine to the everyday observance of the ritualistic side of an institutionalized Christianity. Hegel is for Kierkegaard his arch-enemy not only because of what he writes but also what he represents. Hegel is guilty for Kierkegaard because he reduced the living truth of Christianity (the fact that God suffered and died on the Cross) to just another moment, which necessarily will be overcome, in the dialectical development of the Spirit. While Hegel treats “God” as a Begriff (a concept), for Kierkegaard the truth of Christianity signifies the very paradoxicality of faith: that is, that it is possible for the individual to go beyond the ‘ethical’ and nevertheless or rather because of this very act of disobedience to be loved by ‘God’. Famously, for Hegel ‘all that is real is rational’ – where rationality means the historically articulated, dialectical progression of Spirit – whereas for Kierkegaard the suspension of rationality is the very secret of Christianity. Against the cold logic of the Hegelian system Kierkegaard seeks “a truth which is truth for me” (Kierkegaard 1996:32). Christianity in particular represents the attempt to offer one’s life to the service of the divine. This cannot be argued, it can only be lived. While a theologian will try to argue for the validity of his positions by arguing and counter-arguing, a true Christian will try to live his life the way Jesus lived it. This evidently marks the continuation of the Hellenic idea of philosophy as a way of life, exemplified in the person of Socrates who did not write treatises, but who died for his ideas. Before the logical concepts of the theologians (in the words of Martin Heidegger who was hugely influenced by Kierkegaard) “man can neither fall to his knees in awe nor can he play music and dance before this god” (Heidegger 2002:42). The idea of ‘subjective truth’ will have serious consequences to the philosophical understanding of man. Traditionally defined as animale rationale (the rational animal) by Aristotle and for a long time worshiped as such by generations of philosophical minds, Kierkegaard comes now to redefine the human as the ‘passionate animal’. What counts in man is the intensity of his emotions and his willingness to believe (contra the once all powerful reason) in that which cannot be understood. The opening up by Kierkegaard of this terra incognita of man’s inner life will come to play a major role for later existentialists (most importantly for Nietzsche) and will bring to light the failings and the weaknesses of an over-optimistic (because modelled after the Natural sciences) model of philosophy which was taught to talk a lot concerning the ‘truth’ of the human, when all it understood about the human was a mutilated version.

In the Garden of Eden, Adam and Eve lived in a state of innocence in communication with God and in harmony with their physical environment. The expulsion from the Garden opened up a wide range of new possibilities for them and thus the problem of anxiety arose. Adam (the Hebrew word for man) is now free to determine through his actions the route of things. Naturally, there is a tension here. The human, created in God’s image, is an infinite being. Like God he also can choose and act according to his will. Simultaneously, though, he is a finite being since he is restricted by his body, particular socioeconomic conditions and so forth. This tension between the finite and infinite is the source of anxiety. But unlike a Hegelian analysis, Kierkegaard does not look for a way out from anxiety; on the contrary he stresses its positive role in the flourishing of the human. As he characteristically puts it: “Because he is a synthesis, he can be in anxiety; and the more profoundly he is in anxiety, the greater is the man” (Kierkegaard 1980:154). The prioritization of anxiety as a fundamental trait of the human being is a typical existentialist move, eager to assert the positive role of emotions for human life.

Perhaps the most famous work of Kierkegaard was Fear and Trembling, a short book which exhibits many of the issues raised by him throughout his career. Fear and Trembling retells the story of the attempted sacrifice of Isaac by his father Abraham. God tells Abraham that in order to prove his faith he has to sacrifice his only son. Abraham obeys, but at the last moment God intervenes and saves Isaac. What is the moral of the story? According to our moral beliefs, shouldn’t Abraham refuse to execute God’s vicious plan? Isn’t one of the fundamental beliefs of Christianity the respect to the life of other? The answer is naturally affirmative. Abraham should refuse God, and he should respect the ethical law. Then Abraham would be in a good relation with the Law itself as in the expression ‘a law abiding citizen’. On the contrary what Abraham tries to achieve is a personal relation with the author of the moral law. This author is neither a symbolic figure nor an abstract idea; he is someone with a name. The name of ’God’ is the unpronounceable Tetragrammaton (YHVE), the unpronounceability indicates the simultaneous closeness and distance of the great Other. The Christian God then, the author of the moral law at his will suspends the law and demands his unlawful wish be obeyed. Jacques Derrida notes that the temptation is now for Abraham the ethical law itself (Derrida 1998:162): he must resist ethics, this is the mad logic of God. The story naturally raises many problems. Is not such a subjectivist model of truth and religion plainly dangerous? What if someone was to support his acts of violence as a command of God? Kierkegaard’s response would be to suggest that it is only because Abraham loved Isaac with all his heart that the sacrifice could take place. “He must love Isaac with his whole soul….only then can he sacrifice him” (Kierkegaard 1983:74). Abraham’s faith is proved by the strength of his love for his son. However, this doesn’t fully answer the question of legitimacy, even if we agree that Abraham believed that God loved him so that he would somehow spare him. Kierkegaard also differentiates between the act of Abraham and the act of a tragic hero (like Agamemnon sacrificing his daughter Iphigenia). The tragic hero’s act is a product of calculation. What is better to do? What would be more beneficial? Abraham stands away from all sorts of calculations, he stands alone, that is, free in front of the horror religiosus, the price and the reward of faith.

b. Friedrich Nietzsche (1844-1900) as an Existentialist Philosopher

“I know my lot. Some day my name will be linked to the memory of something monstrous, of a crisis as yet unprecedented on earth…” (Nietzsche 2007:88).  Remarkably, what in 1888 sounded like megalomania came some years later to be realized. The name ‘Nietzsche’ has been linked with an array of historical events, philosophical concepts and widespread popular legends. Above all, Nietzsche has managed somehow to associate his name with the turmoil of a crisis. For a while this crisis was linked to the events of WWII. The exploitation of his teaching by the Nazi ideologues (notably Alfred Rosenberg and Alfred Baeumler), although utterly misdirected, arguably had its source in Nietzsche’s own “aristocratic radicalism”. More generally, the crisis refers to the prospect of a future lacking of any meaning. This is a common theme for all the existentialists to be sure. The prospect of millennia of nihilism (the devaluation of the highest values) inaugurates for Nietzsche the era in which the human itself, for the first time in its history, is called to give meaning both to its own existence and to the existence of the world. This is an event of a cataclysmic magnitude, from now on there are neither guidelines to be followed, lighthouses to direct us, and no right answers but only experiments to be conducted with unknown results.

Many existentialists, in their attempt to differentiate the value of individual existence from the alienating effects of the masses, formed an uneasy relation with the value of the ‘everyday man’. The ‘common’ man was thought to be lacking in will, taste in matter of aesthetics, and individuality in the sense that the assertion of his existence comes exclusively from his participation in larger groups and from the ‘herd’ mentality with which these groups infuse their members. Nietzsche believed that men in society are divided and ordered according to their willingness and capacity to participate in a life of spiritual and cultural transformation. Certainly not everyone wishes this participation and Nietzsche’s condemnation of those unwilling to challenge their fundamental beliefs is harsh; however it would be a mistake to suggest that Nietzsche thought their presence dispensable. In various aphorisms he stresses the importance of the ‘common’ as a necessary prerequisite for both the growth and the value of the ‘exceptional’. Such an idea clashes with our ‘modern’ sensitivities (themselves a product of a particular training). However, one has to recognize that there are no philosophers without presuppositions, and that Nietzsche’s insistence on the value of the exceptional marks his own beginning and his own understanding of the mission of thought.

Despite the dubious politics that the crisis of meaning gave rise to, the crisis itself is only an after-effect of a larger and deeper challenge that Nietzsche’s work identifies and poses. For Nietzsche the crisis of meaning is inextricably linked to the crisis of religious consciousness in the West. Whereas for Kierkegaard the problem of meaning was to be resolved through the individual’s relation to the Divine, for Nietzsche the militantly anti-Christian, the problem of meaning is rendered possible at all because of the demise of the Divine. As he explains in The Genealogy of Morality, it is only after the cultivation of truth as a value by the priest that truth comes to question its own value and function. What truth discovers is that at the ground of all truth lies an unquestionable faith in the value of truth. Christianity is destroyed when it is pushed to tell the truth about itself, when the illusions of the old ideals are revealed. What is called ‘The death of God’ is also then the death of truth (though not of the value of truthfulness); this is an event of immense consequences for the future.

But one has to be careful here. Generations of readers, by concentrating on the event of the actual announcement of the ‘death of God’, have completely missed madman’s woeful mourning which follows the announcement. “‘Where is God?’ he cried; ‘I‘ll tell you! We have killed him – you and I! We are all his murderers. But how did we do this? How were we able to drink up the sea? Who gave us the sponge to wipe away the entire horizon? What were we doing when we unchained this earth from its sun? Where is it moving? Where are we moving to? Away from all suns?” (Nietzsche 2001:125). The above sentences are very far from constituting a cheerful declaration: no one is happy here! Nietzsche’s atheism has nothing to do with the naive atheism of others (for example Sartre) who rush to affirm their freedom as if their petty individuality were able to fill the vast empty space left by the absence of God. Nietzsche is not naive and because he is not naive he is rather pessimistic. What the death of God really announces is the demise of the human as we know it. One has to think of this break in the history of the human in Kantian terms. Kant famously described Enlightenment as “man’s emergence from his self-incurred immaturity” (Kant 1991:54). Similarly Nietzsche believes that the demise of the divine could be the opportunity for the emergence of a being which derives the meaning of its existence from within itself and not from some authority external to it. If the meaning of the human derived from God then, with the universe empty, man cannot take the place of the absent God. This empty space can only be filled by something greater and fuller, which in the Nietzschean jargon means the greatest unity of contradictory forces. That is the Übermensch (Overhuman) which for Nietzsche signifies the attempt towards the cultural production of a human being which will be aware of his dual descent – from animality and from rationality – without prioritizing either one, but keeping them in an agonistic balance so that through struggle new and exciting forms of human existence can be born.

Nietzsche was by training a Klassische Philologe (the rough equivalent Anglosaxon would be an expert in classics – the texts of the ancient Greek and Roman authors). Perhaps because of his close acquaintance with the ancient writers, he became sensitive to a quite different understanding of philosophical thinking to that of his contemporaries. For the Greeks, philosophical questioning takes place within the perspective of a certain choice of life. There is no ‘life’ and then quite separately the theoretical (theoria: from thea – view, and horan – to see) or ‘from a distance’ contemplation of phenomena. Philosophical speculation is the result of a certain way of life and the attempted justification of this life. Interestingly Kant encapsulates this attitude in the following passage: “When will you finally begin to live virtuously?’ said Plato to an old man who told him he was attending classes on virtue. The point is not always to speculate, but also ultimately to think about applying our knowledge. Today, however, he who lives in conformity with what he teaches is taken for a dreamer” (Kant in Hadot 2002:xiii). We have to understand Nietzsche’s relation to philosophy within this context not only because it illustrates a stylistically different contemplation but because it demonstrates an altogether different way of philosophizing. Thus in Twilight of the Idols Nietzsche accuses philosophers for their ‘Egyptism’, the fact that they turn everything into a concept under evaluation. “All that philosophers have been handling for thousands of years is conceptual mummies; nothing real has ever left their hands alive” (Nietzsche 1998:16). Philosophical concepts are valuable insofar as they serve a flourishing life, not as academic exercises. Under the new model of philosophy the old metaphysical and moral questions are to be replaced by new questions concerning history, genealogy, environmental conditions and so forth. Let us take a characteristic passage from 1888: “I am interested in a question on which the ‘salvation of humanity’ depends more than on any curio of the theologians: the question of nutrition. For ease of use, one can put it in the following terms: ‘how do you personally have to nourish yourself in order to attain your maximum of strength, of virtù in the Renaissance style, of moraline-free virtue?” (Nietzsche 2007:19).

What is Nietzsche telling us here? Two things: firstly that, following the tradition of Spinoza, the movement from transcendence to immanence passes through the rehabilitation of the body. To say that, however, does not imply a simple-minded materialism. When Spinoza tells “nobody as yet has determined the limits of the body’s capabilities” (Spinoza 2002: 280) he is not writing about something like bodily strength but to the possibility of an emergence of a body liberated from the sedimentation of culture and memory. This archetypical body is indeed as yet unknown and we stand in ignorance of its abilities. The second thing that Nietzsche is telling us in the above passage is that this new immanent philosophy necessarily requires a new ethics. One has to be clear here because of the many misunderstandings of Nietzschean ethics. Nietzsche is primarily a philosopher of ethics but ethics here refers to the possible justification of a way of life, which way of life in turn justifies human existence on earth. For Nietzsche, ethics does not refer to moral codes and guidelines on how to live one’s life. Morality, which Nietzsche rejects, refers to the obsessive need (a need or an instinct can also be learned according to Nietzsche) of the human to preserve its own species and to regard its species as higher than the other animals. In short morality is arrogant. A Nietzschean ethics is an ethics of modesty. It places the human back where it belongs, among the other animals. However to say that is not to equate the human with the animal. Unlike non-human animals men are products of history that is to say products of memory. That is their burden and their responsibility.

In the Genealogy of Morality Nietzsche explains morality as a system aiming at the taming of the human animal. Morality’s aim is the elimination of the creative power of animal instincts and the establishment of a life protected within the cocoon of ascetic ideals. These ‘ideals’ are all those values and ideologies made to protect man against the danger of nihilism, the state in which man finds no answer to the question of his existence. Morality clings to the preservation of the species ‘man’; morality stubbornly denies the very possibility of an open-ended future for humans. If we could summarize Nietzsche’s philosophical anthropology in a few words, we would say that for Nietzsche it is necessary to attempt (there are no guarantees here) to think of the human not as an end-in-itself but only as a means to something “…perfect, completely finished, happy, powerful, triumphant, that still leaves something to fear!” (Nietzsche 2007:25).

c. Martin Heidegger (1889-1976) as an Existentialist Philosopher

Heidegger exercised an unparalleled influence on modern thought. Without knowledge of his work recent developments in modern European philosophy (Sartre, Gadamer, Arendt, Marcuse, Derrida, Foucault et al.) simply do not make sense. He remains notorious for his involvement with National Socialism in the 1930s. Outside European philosophy, Heidegger is only occasionally taken seriously, and is sometimes actually ridiculed (famously the Oxford philosopher A.J. Ayer called him a ‘charlatan’).

In 1945 in Paris Jean-Paul Sartre gave a public lecture with the title ‘Existentialism is a Humanism’ where he defended the priority of action and the position that it is a man’s actions which define his humanity. In 1946, Jean Beaufret in a letter to Heidegger poses a number of questions concerning the link between humanism and the recent developments of existentialist philosophy in France. Heidegger’s response is a letter to Beaufret which in 1947 is published in a book form with the title ‘Letter on Humanism’. There he repudiates any possible connection of his philosophy with the existentialism of Sartre. The question for us here is the following: Is it possible, given Heidegger’s own repudiation of existentialism, still to characterise Heidegger’s philosophy as ‘existentialist’? The answer here is that Heidegger can be classified as an existentialist thinker despite all his differences from Sartre. Our strategy is to stress Heidegger’s connection with some key existentialist concerns, which we introduced above under the labels ‘Existence’, ‘Anxiety’ and the ‘Crowd’.

We have seen above that a principle concern of all existentialists was to affirm the priority of individual existence and to stress that human existence is to be investigated with methods other than those of the natural sciences. This is also one of Heidegger’s principle concerns. His magnum opus Being and Time is an investigation into the meaning of Being as that manifests itself through the human being, Dasein. The sciences have repeatedly asked ‘What is a man?’ ‘What is a car?’ ‘What is an emotion?’ they have nevertheless failed – and because of the nature of science, had to fail – to ask the question which grounds all those other questions. This question is what is the meaning of (that) Being which is not an entity (like other beings, for example a chair, a car, a rock) and yet through it entities have meaning at all? Investigating the question of the meaning of Being we discover that it arises only because it is made possible by the human being which poses the question. Dasein has already a (pre-conceptual) understanding of Being because it is the place where Being manifests itself. Unlike the traditional understanding of the human as a hypokeimenon (Aristotle) – what through the filtering of Greek thought by the Romans becomes substantia, that which supports all entities and qualities as their base and their ground – Dasein refers to the way which human beings are. ”The essence of Dasein lies in its existence” (Heidegger 1962: 67) and the existence of Dasein is not fixed like the existence of a substance is. This is why human beings locate a place which nevertheless remains unstable and unfixed. The virtual place that Dasein occupies is not empty. It is filled with beings which ontologically structure the very possibility of Dasein. Dasein exists as in-the-world. World is not something separate from Dasein; rather, Dasein cannot be understood outside the referential totality which constitutes it. Heidegger repeats here a familiar existentialist pattern regarding the situatedness of experience.

Sartre, by contrast, comes from the tradition of Descartes and to this tradition remains faithful. From Heidegger’s perspective, Sartre’s strategy of affirming the priority of existence over essence is a by-product of the tradition of Renaissance humanism which wishes to assert the importance of man as the highest and most splendid of finite beings. Sartrean existence refers to the fact that a human is whereas Heidegger’s ek-sistence refers to the way with which Dasein is thrown into a world of referential relations and as such Dasein is claimed by Being to guard its truth. Sartre, following Descartes, thinks of the human as a substance producing or sustaining entities, Heidegger on the contrary thinks of the human as a passivity which accepts the call of Being. “Man is not the lord of beings. Man is the shepherd of Being” (Heidegger 1993:245). The Heideggerian priority then is Being, and Dasein’s importance lies in its receptiveness to the call of Being.

For Kierkegaard anxiety defines the possibility of responsibility, the exodus of man from the innocence of Eden and his participation to history. But the birthplace of anxiety is the experience of nothingness, the state in which every entity is experienced as withdrawn from its functionality. “Nothing … gives birth to anxiety” (Kierkegaard 1980:41). In anxiety we do not fear something in particular but we experience the terror of a vacuum in which is existence is thrown. Existentialist thinkers are interested in anxiety because anxiety individualizes one (it is when I feel Angst more than everything that I come face to face with my own individual existence as distinct from all other entities around me). Heidegger thinks that one of the fundamental ways with which Dasein understands itself in the world is through an array of ‘moods’. Dasein always ‘finds itself’ (befinden sich) in a certain mood. Man is not a thinking thing de-associated from the world, as in Cartesian metaphysics, but a being which finds itself in various moods such as anxiety or boredom. For the Existentialists, primarily and for the most part I don’t exist because I think (recall Descartes’ famous formula) but because my moods reveal to me fundamental truths of my existence. Like Kierkegaard, Heidegger also believes that anxiety is born out of the terror of nothingness. “The obstinacy of the ‘nothing and nowhere within-the-world’ means as a phenomenon that the world as such is that in the face of which one has anxiety” (Heidegger 1962:231). For Kierkegaard the possibility of anxiety reveals man’s dual nature and because of this duality man can be saved. “If a human being were a beast or an angel, he could not be in anxiety. Because he is a synthesis, he can be in anxiety; and the more profoundly he is in anxiety, the greater is the man” (Kierkegaard 1980:155). Equally for Heidegger anxiety manifests Dasein’s possibility to live an authentic existence since it realizes that the crowd of ‘others’ (what Heidegger calls the ‘They’) cannot offer any consolation to the drama of existence.

In this article we have discussed the ambiguous or at times downright critical attitude of many existentialists toward the uncritical and unreflecting masses of people who, in a wholly anti-Kantian and thus also anti-Enlightenment move, locate the meaning of their existence in an external authority. They thus give up their (purported) autonomy as rational beings. For Heidegger, Dasein for the most part lives inauthentically in that Dasein is absorbed in a way of life produced by others, not by Dasein itself. “We take pleasure and enjoy ourselves as they [man] take pleasure; we read, see and judge about literature and art as they see and judge…” (Heidegger 1962:164). To be sure this mode of existence, the ‘They’ (Das Man) is one of the existentialia, it is an a priori condition of possibility of the Dasein which means that inauthenticity is inscribed into the mode of being of Dasein, it does not come from the outside as a bad influence which could be erased. Heidegger’s language is ambiguous on the problem of inauthenticity and the reader has to make his mind on the status of the ‘They’. A lot has been said on the possible connections of Heidegger’s philosophy with his political engagements. Although it is always a risky business to read the works of great philosophers as political manifestos, it seems prima facie evident that Heidegger’s thought in this area deserves the close investigation it has received.

Heidegger was a highly original thinker. His project was nothing less than the overcoming of Western metaphysics through the positing of the forgotten question of being. He stands in a critical relation to past philosophers but simultaneously he is heavily indebted to them, much more than he would like to admit. This is not to question his originality, it is to recognize that thought is not an ex nihilo production; it comes as a response to things past, and aims towards what is made possible through that past.

d. Jean-Paul Sartre (1905-1980) as an Existentialist Philosopher

In the public consciousness, at least, Sartre must surely be the central figure of existentialism. All the themes that we introduced above come together in his work. With the possible exception of Nietzsche, his writings are the most widely anthologised (especially the lovely, if oversimplifying, lecture ‘Existentialism and Humanism’) and his literary works are widely read (especially the novel Nausea) or performed. Although uncomfortable in the limelight, he was nevertheless the very model of a public intellectual, writing hundreds of short pieces for public dissemination and taking resolutely independent and often controversial stands on major political events. His writings that are most clearly existentialist in character date from Sartre’s early and middle period, primarily the 1930s and 1940s. From the 1950s onwards, Sartre moved his existentialism towards a philosophy the purpose of which was to understand the possibility of a genuinely revolutionary politics.

Sartre was in his late 20s when he first encountered phenomenology, specifically the philosophical ideas of Edmund Husserl. (We should point out that Heidegger was also deeply influenced by Husserl, but it is less obvious in the language he employs because he drops the language of consciousness and acts.) Of particular importance, Sartre thought, was Husserl’s notion of intentionality. In Sartre’s interpretation of this idea, consciousness is not to be identified with a thing (for example a mind, soul or brain), that is to say some kind of a repository of ideas and images of things. Rather, consciousness is nothing but a directedness towards things. Sartre found a nice way to sum up the notion of the intentional object: If I love her, I love her because she is lovable (Sartre 1970:4-5).  Within my experience, her lovableness is not an aspect of my image of her, rather it is a feature of her (and ultimately a part of the world) towards which my consciousness directs itself. The things I notice about her (her smile, her laugh) are not originally neutral, and then I interpret the idea of them as ‘lovely’, they are aspects of her as lovable. The notion that consciousness is not a thing is vital to Sartre. Indeed, consciousness is primarily to be characterised as nothing: it is first and foremost not that which it is conscious of. (Sartre calls human existence the ‘for-itself’, and the being of things the ‘in-itself’.) Because it is not a thing, it is not subject to the laws of things; specifically, it is not part of a chain of causes and its identity is not akin to that of a substance. Above we suggested that a concern with the nature of existence, and more particularly a concern with the distinctive nature of human existence, are defining existentialist themes.

Moreover, qua consciousness, and not a thing that is part of the causal chain, I am free. From moment to moment, my every action is mine alone to choose. I will of course have a past ‘me’ that cannot be dispensed with; this is part of my ‘situation’. However, again, I am first and foremost not my situation. Thus, at every moment I choose whether to continue on that life path, or to be something else. Thus, my existence (the mere fact that I am) is prior to my essence (what I make of myself through my free choices). I am thus utterly responsible for myself. If my act is not simply whatever happens to come to mind, then my action may embody a more general principle of action. This principle too is one that I must have freely chosen and committed myself to. It is an image of the type of life that I believe has value. (In these ways, Sartre intersects with the broadly Kantian account of freedom which we introduced above in our thematic section.) As situated, I also find myself surrounded by such images – from religion, culture, politics or morality – but none compels my freedom. (All these forces that seek to appropriate my freedom by objectifying me form Sartre’s version of the crowd theme.) I exist as freedom, primarily characterised as not determined, so my continuing existence requires the ever renewed exercise of freedom (thus, in our thematic discussion above, the notion from Spinoza and Leibniz of existence as a striving-to-exist). Thus also, my non-existence, and the non-existence of everything I believe in, is only a free choice away. I (in the sense of an authentic human existence) am not what I ‘am’ (the past I have accumulated, the things that surround me, or the way that others view me). I am alone in my responsibility; my existence, relative to everything external that might give it meaning, is absurd. Face to face with such responsibility, I feel ‘anxiety’. Notice that although Sartre’s account of situatedness owes much to Nietzsche and Heidegger, he sees it primarily in terms of what gives human freedom its meaning and its burden. Nietzsche and Heidegger, in contrast, view such a conception of freedom as naively metaphysical.

Suppose, however, that at some point I am conscious of myself in a thing-like way. For example, I say ‘I am a student’ (treating myself as having a fixed, thing-like identity) or ‘I had no choice’ (treating myself as belonging to the causal chain). I am ascribing a fixed identity or set of qualities to myself, much as I would say ‘that is a piece of granite’. In that case I am existing in denial of my distinctively human mode of existence; I am fleeing from my freedom. This is inauthenticity or ‘bad faith’. As we shall see, inauthenticity is not just an occasional pitfall of human life, but essential to it. Human existence is a constant falling away from an authentic recognition of its freedom. Sartre here thus echoes the notion in Heidegger than inauthenticity is a condition of possibility of human existence.

Intentionality manifests itself in another important way. Rarely if ever am I simply observing the world; instead I am involved in wanting to do something, I have a goal or purpose. Here, intentional consciousness is not a static directedness towards things, but is rather an active projection towards the future. Suppose that I undertake as my project marrying my beloved. This is an intentional relation to a future state of affairs. As free, I commit myself to this project and must reaffirm that commitment at every moment. It is part of my life project, the image of human life that I offer to myself and to others as something of value. Notice, however, that my project involves inauthenticity. I project myself into the future where I will be married to her – that is, I define myself as ‘married’, as if I were a fixed being. Thus there is an essential tension to all projection. On the one hand, the mere fact that I project myself into the future is emblematic of my freedom; only a radically free consciousness can project itself. I exist as projecting towards the future which, again, I am not. Thus, I am (in the sense of an authentic self) what I am not (because my projecting is always underway towards the future). On the other hand, in projecting I am projecting myself as something, that is, as a thing that no longer projects, has no future, is not free. Every action, then, is both an expression of freedom and also a snare of freedom. Projection is absurd: I seek to become the impossible object, for-itself-in-itself, a thing that is both free and a mere thing. Born of this tension is a recognition of freedom, what it entails, and its essential fragility. Thus, once again, we encounter existential anxiety. (In this article, we have not stressed the importance of the concept of time for existentialism, but it should not be overlooked: witness one of Nietzsche’s most famous concepts (eternal recurrence) and the title of Heidegger’s major early work (Being and Time).)

In my intentional directedness towards my beloved I find her ‘loveable’. This too, though, is an objectification. Within my intentional gaze, she is loveable in much the same way that granite is hard or heavy. Insofar as I am in love, then, I seek to deny her freedom. Insofar, however, as I wish to be loved by her, then she must be free to choose me as her beloved. If she is free, she escapes my love; if not, she cannot love. It is in these terms that Sartre analyses love in Part Three of Being and Nothingness. Love here is a case study in the basic forms of social relation. Sartre is thus moving from an entirely individualistic frame of reference (my self, my freedom and my projects) towards a consideration of the self in concrete relations with others. Sartre is working through – in a way he would shortly see as being inadequate – the issues presented by the Hegelian dialectic of recognition, which we mentioned above. This ‘hell’ of endlessly circling acts of freedom and objectification is brilliantly dramatised in Sartre’s play No Exit.

A few years later at the end of the 1940s, Sartre wrote what has been published as Notebooks for an Ethics. Sartre (influenced in the meantime by the criticisms of Merleau-Ponty and de Beauvoir, and by his increasing commitment to collectivist politics) elaborated greatly his existentialist account of relations with others, taking the Hegelian idea more seriously. He no longer thinks of concrete relations so pessimistically. While Nietzsche and Heidegger both suggest the possibility of an authentic being with others, both leave it seriously under-developed. For our purposes, there are two key ideas in the Notebooks. The first is that my projects can be realised only with the cooperation of others; however, that cooperation presupposes their freedom (I cannot make her love me), and their judgements about me must concern me. Therefore permitting and nurturing the freedom of others must be a central part of all my projects. Sartre thus commits himself against any political, social or economic forms of subjugation. Second, there is the possibility of a form of social organisation and action in which each individual freely gives him or herself over to a joint project: a ‘city of ends’ (this is a reworking of Kant’s idea of the ‘kingdom of ends’, found in the Grounding for the Metaphysics of Morals). An authentic existence, for Sartre, therefore means two things. First, it is something like a ‘style’ of existing – one that at every moment is anxious, and that means fully aware of the absurdity and fragility of its freedom. Second, though, there is some minimal level of content to any authentic project: whatever else my project is, it must also be a project of freedom, for myself and for others.

e. Simone de Beauvoir (1908-1986) as an Existentialist Philosopher

Simone de Beauvoir was the youngest student ever to pass the demanding agrégation at the prestigious École Normale Supérieure. Subsequently a star Normalienne, she was a writer, philosopher, feminist, lifelong partner of Jean-Paul Sartre, notorious for her anti-bourgeois way of living and her free sexual relationships which included among others a passionate affair with the American writer Nelson Algren. Much ink has been spilled debating whether de Beauvoir’s work constitutes a body of independent philosophical work, or is a reformulation of Sartre’s work. The debate rests of course upon the fundamental misconception that wants a body of work to exist and develop independently of (or uninfluenced by) its intellectual environment. Such ‘objectivity’ is not only impossible but also undesirable: such a body of work would be ultimately irrelevant since it would be non-communicable. So the question of de Beauvoir’s ‘independence’ could be dismissed here as irrelevant to the philosophical questions that her work raises.

In 1943 Being and Nothingness, the groundwork of the Existentialist movement in France was published. There Sartre gave an account of freedom as ontological constitutive of the subject. One cannot but be free: this is the kernel of the Sartrean conception of freedom. In 1945 Merleau-Ponty’s Phenomenology of Perception is published. There, as well as in an essay from the same year titled ‘The war has taken place’, Merleau-Ponty heavily criticizes the Sartrean stand, criticising it as a reformulation of basic Stoic tenets. One cannot assume freedom in isolation from the freedom of others. Action is participatory: “…my freedom is interwoven with that of others by way of the world” (Merleau-Ponty in Stewart 1995:315).  Moreover action takes place within a certain historical context. For Merleau-Ponty the subjective free-will is always in a dialectical relationship with its historical context. In 1947 Simone de Beauvoir’s Ethics of Ambiguity is published. The book is an introduction to existentialism but also a subtle critique of Sartre’s position on freedom, and a partial extension of existentialism towards the social. Although de Beauvoir will echo Merleau-Ponty’s criticism regarding the essential interrelation of the subjects, nevertheless she will leave unstressed the importance that the social context plays in the explication of moral problems. Like Sartre it is only later in her life that this will be acknowledged. In any case, de Beauvoir’s book precipitates in turn a major rethink on Sartre’s part, and the result is the Notebooks for an Ethics.

In Ethics of Ambiguity de Beauvoir offers a picture of the human subject as constantly oscillating between facticity and transcendence. Whereas the human is always already restricted by the brute facts of his existence, nevertheless it always aspires to overcome its situation, to choose its freedom and thus to create itself. This tension must be considered positive, and not restrictive of action. It is exactly because the ontology of the human is a battleground of antithetical movements (a view consistent with de Beauvoir’s Hegelianism) that the subject must produce an ethics which will be continuous with its ontological core. The term for this tension is ambiguity. Ambiguity is not a quality of the human as substance, but a characterisation of human existence. We are ambiguous beings destined to throw ourselves into the future while simultaneously it is our very own existence that throws us back into facticity. That is to say, back to the brute fact that we are in a sense always already destined to fail –  not in this or that particular project but to fail as pure and sustained transcendence. It is exactly because of (and through) this fundamental failure that we realize that our ethical relation to the world cannot be self-referential but must pass through the realization of the common destiny of the human as a failed and interrelated being.

De Beauvoir, unlike Sartre, was a scholarly reader of Hegel. Her position on an existential ethics is thus more heavily influenced by Hegel’s view in the Phenomenology of Spirit concerning the moment of recognition (Hegel 1977:111). There Hegel describes the movement in which self-consciousness produces itself by positing another would be self-consciousness, not as a mute object (Gegen-stand) but as itself self-consciousness. The Hegelian movement remains one of the most fascinating moments in the history of philosophy since it is for the first time that the constitution of the self does not take place from within the self (as happens with Descartes, for whom the only truth is the truth of my existence; or Leibniz, for whom the monads are ‘windowless’; or Fichte, for whom the ‘I’ is absolutely self-constitutive) but from the outside. It is, Hegel tells us, only because someone else recognizes me as a subject that I can be constituted as such. Outside the moment of recognition there is no self-consciousness. De Beauvoir takes to heart the Hegelian lesson and tries to formulate an ethics from it.

What would this ethics be? As in Nietzsche, ethics refers to a way of life (a βίος), as opposed to morality which concerns approved or condemned behaviour. Thus there are no recipes for ethics. Drawn from Hegel’s moment of recognition, de Beauvoir acknowledges that the possibility of human flourishing is based firstly upon the recognition of the existence of the other (“Man can find a justification of his own existence only in the existence of the other men” (Beauvoir 1976:72) and secondly on the recognition that my own flourishing (or my ability to pose projects, in the language of existentialists) passes through the possibility of a common flourishing. “Only the freedom of others keeps each one of us from hardening in the absurdity of facticity,” (Beauvoir 1976:71) de Beauvoir writes; or again “To will oneself free is also to will others free” (Beauvoir 1976:73). The Ethics of Ambiguity ends by declaring the necessity of assuming one’s freedom and the assertion that it is only through action that freedom makes itself possible. This is not a point to be taken light-heartedly. It constitutes a movement of opposition against a long tradition of philosophy understanding itself as theoria: the disinterested contemplation on the nature of the human and the world. De Beauvoir, in common with most existentialists, understands philosophy as praxis: involved action in the world and participation in the course of history. It is out of this understanding that The Second Sex is born.

In 1949 Le Deuxième Sexe is published in France. In English in 1953 it appeared as The Second Sex in an abridged translation. The book immediately became a best seller and later a founding text of Second Wave Feminism (the feminist movement from the early 60’s to the 70’s inspired by the civil rights movement and focusing at the theoretical examination of the concepts of equality, inequality, the role of family, justice and so forth). More than anything, The Second Sex constitutes a study in applied existentialism where the abstract concept ‘Woman’ gives way to the examination of the lives of everyday persons struggling against oppression and humiliation. When de Beauvoir says that there is no such thing as a ‘Woman’ we have to hear the echo of the Kierkegaardian assertion of the single individual against the abstractions of Hegelian philosophy, or similarly Sartre’s insistence on the necessity of the prioritization of the personal lives of self-creating people (what Sartre calls ‘existence’) as opposed to a pre-established ideal of what humans should be like (what Sartre calls ‘essence’). The Second Sex is an exemplary text showing how a philosophical movement can have real, tangible effects on the lives of many people, and is a magnificent exercise in what philosophy could be.

“I hesitated a long time before writing a book on woman. The subject is irritating, especially for women…” (Beauvoir 2009:3). The Second Sex begins with the most obvious (but rarely posed) question: What is woman? De Beauvoir finds that at present there is no answer to that question. The reason is that tradition has always thought of woman as the other of man. It is only man that constitutes himself as a subject (as the Absolute de Beauvoir says), and woman defines herself only through him. “She determines and differentiates herself in relation to man, and he does not in relation to her; she is the inessential in front of the essential…” (Beauvoir 2009:6). But why is it that woman has initially accepted or tolerated this process whereby she becomes the other of man? De Beauvoir does not give a consoling answer; on the contrary, by turning to Sartre’s notion of bad faith (which refers to the human being’s anxiety in front of the responsibility entailed by the realization of its radical freedom) she thinks that women at times are complicit to their situation. It is indeed easier for one – anyone – to assume the role of an object (for example a housewife ‘kept’ by her husband) than to take responsibility for creating him or herself and creating the possibilities of freedom for others. Naturally the condition of bad faith is not always the case. Often women found themselves in a sociocultural environment which denied them the very possibility of personal flourishing (as happens with most of the major religious communities). A further problem that women face is that of understanding themselves as a unity which would enable them to assume the role of their choosing. “Proletarians say ‘we’. So do blacks” (Beauvoir 2009:8). By saying ‘we’ they assume the role of the subject and turn everyone else into ‘other’. Women are unable to utter this ‘we’. “They live dispersed among men, tied by homes, work, economic interests and social conditions to certain men – fathers or husbands – more closely than to other women. As bourgeois women, they are in solidarity with bourgeois men and not with women proletarians; as white women, they are in solidarity with white men and not with black women” (Beauvoir 2009:9). Women primarily align themselves to their class or race and not to other women. The female identity is “very much bound up with the identity of the men around them…” (Reynolds 2006:145).

One of the most celebrated moments in The Second Sex is the much quoted phrase: “One is not born, but rather becomes, woman” (Beauvoir 2009:293). She explains: “No biological, physical or economic destiny defines the figure that the human female takes on in society; it is civilization as a whole that elaborates this intermediary product between the male and the eunuch that is called feminine” (Beauvoir 2009:293). For some feminists this clearly inaugurates the problematic of the sex-gender distinction (where sex denotes the biological identity of the person and gender the cultural attribution of properties to the sexed body). Simply put, there is absolutely nothing that determines the ‘assumed’ femininity of the woman (how a woman acts, feels, behaves) – everything that we have come to think as ‘feminine’ is a social construction not a natural given. Later feminists like Monique Wittig and Judith Butler will argue that ‘sex’ is already ‘gender’ in the sense that a sexed body exists always already within a cultural nexus that defines it. Thus the sex assignment (a doctor pronouncing the sex of the baby) is a naturalized (but not at all natural) normative claim which delivers the human into a world of power relations.

f. Albert Camus (1913-1960) as an Existentialist Philosopher

Albert Camus was a French intellectual, writer and journalist. His multifaceted work as well as his ambivalent relation to both philosophy and existentialism makes every attempt to classify him a rather risky operation. A recipient of the 1957 Nobel Prize for Literature primarily for his novels, he is also known as a philosopher due to his non-literary work and his relation with Jean-Paul Sartre. And yet his response was clear: “I am not a philosopher, because I don’t believe in reason enough to believe in a system. What interests me is knowing how we must behave, and more precisely, how to behave when one does not believe in God or reason” (Camus in Sherman 2009: 1). The issue is not just about the label ‘existentialist’. It rather points to a deep tension within the current of thought of all thinkers associated with existentialism. The question is: With how many voices can thought speak? As we have already seen, the thinkers of existentialism often deployed more than one. Almost all of them share a deep suspicion to a philosophy operating within reason as conceived of by the Enlightenment. Camus shares this suspicion and his so called philosophy of the absurd intends to set limits to the overambitions of Western rationality. Reason is absurd in that it believes that it can explain the totality of the human experience whereas it is exactly its inability for explanation that, for example, a moment of fall designates. Thus in his novel “The Fall” the protagonist’s tumultuous narrative reveals the overtaking of a life of superficial regularity by the forces of darkness and irrationality. “A bourgeois hell, inhabited of course by bad dreams” (Camus 2006:10). In a similar fashion Camus has also repudiated his connection with existentialism. “Non, je ne suis pas existentialist” is the title of a famous interview that he gave for the magazine Les Nouvelles Littéraires on the 15 of November, 1945. The truth of the matter is that Camus’ rejection of existentialism is directed more toward Sartre’s version of it rather than toward a dismissal of the main problems that the existential thinkers faced. Particularly, Camus was worried that Sartre’s deification of history (Sartre’s proclaimed Marxism) would be incompatible with the affirmation of personal freedom. Camus accuses Hegel (subsequently Marx himself) of reducing man to history and thus denying man the possibility of creating his own history, that is, affirming his freedom.

Philosophically, Camus is known for his conception of the absurd. Perhaps we should clarify from the very beginning what the absurd is not. The absurd is not nihilism. For Camus the acceptance of the absurd does not lead to nihilism (according to Nietzsche nihilism denotes the state in which the highest values devalue themselves) or to inertia, but rather to their opposite: to action and participation. The notion of the absurd signifies the space which opens up between, on the one hand, man’s need for intelligibility and, on the other hand, ‘the unreasonable silence of the world’ as he beautifully puts it. In a world devoid of God, eternal truths or any other guiding principle, how could man bear the responsibility of a meaning-giving activity? The absurd man, like an astronaut looking at the earth from above, wonders whether a philosophical system, a religion or a political ideology is able to make the world respond to the questioning of man, or rather whether all human constructions are nothing but the excessive face-paint of a clown which is there to cover his sadness. This terrible suspicion haunts the absurd man. In one of the most memorable openings of a non-fictional book he states: “There is but one truly serious philosophical problem and that is suicide. Judging whether life is or is not worth living amounts to answering the fundamental question of philosophy. All the rest – whether or not the world has three dimensions, whether the mind has nine or twelve categories – comes afterwards. These are games; one must first answer” (Camus 2000:11). The problem of suicide (a deeply personal problem) manifests the exigency of a meaning-giving response. Indeed for Camus a suicidal response to the problem of meaning would be the confirmation that the absurd has taken over man’s inner life. It would mean that man is not any more an animal going after answers, in accordance with some inner drive that leads him to act in order to endow the world with meaning. The suicide has become but a passive recipient of the muteness of the world. “…The absurd … is simultaneously awareness and rejection of death” (Camus 2000:54). One has to be aware of death – because it is precisely the realization of man’s mortality that pushes someone to strive for answers – and one has ultimately to reject death – that is, reject suicide as well as the living death of inertia and inaction. At the end one has to keep the absurd alive, as Camus says. But what does it that mean?

In The Myth of Sisyphus Camus tells the story of the mythical Sisyphus who was condemned by the Gods to ceaselessly roll a rock to the top of a mountain and then have to let it fall back again of its own weight. “Sisyphus, proletarian of the gods, powerless and rebellious, knows the whole extent of his wretched condition: it is what he thinks of during his descent. The lucidity that was to constitute his torture at the same time crowns his victory. There is no fate that cannot be surmounted by scorn” (Camus 2000:109). One must imagine then Sisyphus victorious: fate and absurdity have been overcome by a joyful contempt. Scorn is the appropriate response in the face of the absurd; another name for this ‘scorn’ though would be artistic creation. When Camus says: “One does not discover the absurd without being tempted to write a manual of happiness” (Camus 2000:110) he writes about a moment of exhilarated madness, which is the moment of the genesis of the artistic work. Madness, but nevertheless profound – think of the function of the Fool in Shakespeare’s King Lear as the one who reveals to the king the most profound truths through play, mimicry and songs. Such madness can overcome the absurd without cancelling it altogether.

Almost ten years after the publication of The Myth of Sisyphus Camus publishes his second major philosophical work, The Rebel (1951). Camus continues the problematic which had begun with The Myth of Sisyphus. Previously, revolt or creation had been considered the necessary response to the absurdity of existence. Here, Camus goes on to examine the nature of rebellion and its multiple manifestations in history. In The Myth of Sisyphus, in truly Nietzschean fashion, Camus had said: “There is but one useful action, that of remaking man and the earth” (Camus 2000:31). However, in The Rebel, reminiscent of Orwell’s Animal Farm, one of the first points he makes is the following: “The slave starts by begging for justice and ends by wanting to wear a crown. He too wants to dominate” (Camus 2000b:31). The problem is that while man genuinely rebels against both unfair social conditions and, as Camus says, against the whole of creation, nevertheless in the practical administration of such revolution, man comes to deny the humanity of the other in an attempt to impose his own individuality. Take for example the case of the infamous Marquis de Sade which Camus explores. In Sade, contradictory forces are at work (see The 120 Days of Sodom). On the one hand, Sade wishes the establishment of a (certainly mad) community with desire as the ultimate master, and on the other hand this very desire consumes itself and all the subjects who stand in its way.

Camus goes on to examine historical manifestations of rebellion, the most prominent case being that of the French Revolution. Camus argues that the revolution ended up taking the place of the transcendent values which it sought to abolish. An all-powerful notion of justice now takes the place formerly inhabited by God. Rousseau’s infamous suggestion that under the rule of ‘general will’ everyone would be ‘forced to be free’ (Rousseau in Foley 2008:61) opens the way to the crimes committed after the revolution. Camus fears that all revolutions end with the re-establishment of the State. “…Seventeen eighty-nine brings Napoleon; 1848 Napoleon III; 1917 Stalin; the Italian disturbances of the twenties, Mussolini; the Weimar Republic, Hitler” (Camus 2000b:146). Camus is led to examine the Marxist view of history as a possible response to the failed attempts at the establishment of a true revolutionary regime. Camus examines the similarities between the Christian and the Marxist conception of history. They both exhibit a bourgeois preoccupation with progress. In the name of the future everything can be justified: “the future is the only kind of property that the masters willingly concede to the slaves” (Camus 2000b:162). History according to both views is the linear progress from a set beginning to a definite end (the metaphysical salvation of man or the materialistic salvation of him in the future Communist society). Influenced by Kojève’s reading of Hegel, Camus interprets this future, classless society as the ‘end of history’. The ‘end of history’ suggests that when all contradictions cease then history itself will come to an end. This is, Camus argues, essentially nihilistic: history, in effect, accepts that meaning creation is no longer possible and commits suicide. Because historical revolutions are for the most part nihilistic movements, Camus suggests that it is the making-absolute of the values of the revolution that necessarily lead to their negation. On the contrary a relative conception of these values will be able to sustain a community of free individuals who have not forgotten that every historical rebellion has begun by affirming a proto-value (that of human solidarity) upon which every other value can be based.

3. The Influence of Existentialism

a. The Arts and Psychology

In the field of visual arts existentialism exercised an enormous influence, most obviously on the movement of Expressionism. Expressionism began in Germany at the beginning of the 20th century. With its emphasis on subjective experience, Angst and intense emotionality, German expressionism sought to go beyond the naiveté of realist representation and to deal with the anguish of the modern man (exemplified in the terrible experiences of WWI). Many of the artists of Expressionism read Nietzsche intensively and following Nietzsche’s suggestion for a transvaluation of values experimented with alternative lifestyles. Erich Heckel’s woodcut “Friedrich Nietzsche” from 1905 is a powerful reminder of the movement’s connection to Existentialist thought. Abstract expressionism (which included artists such as de Kooning and Pollock, and theorists such as Rosenberg) continued with some of the same themes in the United States from the 1940s and tended to embrace existentialism as one of its intellectual guides, especially after Sartre’s US lecture tour in 1946 and a production of No Exit in New York.

German Expressionism was particularly important during the birth of the new art of cinema. Perhaps the closest cinematic work to Existentialist concerns remains F.W. Murnau’s The Last Laugh (1924) in which the constantly moving camera (which prefigures the ‘rule’ of the hand-held camera of the Danish Dogma 95) attempts to arrest the spiritual anguish of a man who suddenly finds himself in a meaningless world. Expressionism became a world-wide style within cinema, especially as film directors like Lang fled Germany and ended up in Hollywood. Jean Genet’s Un chant d’amour (1950) is a moving poetic exploration of desire. In the sordid, claustrophobic cells of a prison the inmates’ craving for intimacy takes place against the background of an unavoidable despair for existence itself. European directors such as Bergman and Godard are often associated with existentialist themes. Godard’s Vivre sa vie (My Life to Live, 1962) is explicit in its exploration of the nature of freedom under conditions of extreme social and personal pressure. In the late 20th and early 21st centuries existentialist ideas became common in mainstream cinema, pervading the work of writers and directors such as Woody Allen, Richard Linklater, Charlie Kaufman and Christopher Nolan.

Given that Sartre and Camus were both prominent novelists and playwrights, the influence of existentialism on literature is not surprising. However, the influence was also the other way. Novelists such as Dostoevsky or Kafka, and the dramatist Ibsen, were often cited by mid-century existentialists as important precedents, right along with Kierkegaard and Nietzsche. Dostoevsky creates a character Ivan Karamazov (in The Brothers Karamazov, 1880) who holds the view that if God is dead, then everything is permitted; both Nietzsche and Sartre discuss Dostoevsky with enthusiasm. Within drama, the theatre of the absurd and most obviously Beckett were influenced by existentialist ideas; later playwrights such as Albee, Pinter and Stoppard continue this tradition.

One of the key figures of 20th century psychology, Sigmund Freud, was indebted to Nietzsche especially for his analysis of the role of psychology within culture and history, and for his view of cultural artefacts such as drama or music as ‘unconscious’ documentations of psychological tensions. But a more explicit taking up of existentialist themes is found in the broad ‘existentialist psychotherapy’ movement. A common theme within this otherwise very diverse group is that previous psychology misunderstood the fundamental nature of the human and especially its relation to others and to acts of meaning-giving; thus also, previous psychology had misunderstood what a ‘healthy’ attitude to self, others and meaning might be.  Key figures here include Swiss psychologists Ludwig Binswanger and later Menard Boss, both of who were enthusiastic readers of Heidegger; the Austrian Frankl, who invented the method of logotherapy; in England, Laing and Cooper, who were explicitly influenced by Sartre; and in the United States, Rollo May, who stresses the ineradicable importance of anxiety.

b. Philosophy

As a whole, existentialism has had relatively little direct influence within philosophy. In Germany, existentialism (and especially Heidegger) was criticised for being obscure, abstract or even mystical in nature. This criticism was made especially by Adorno in The Jargon of Authenticity, and in Dog Years, novelist Gunter Grass gives a Voltaire-like, savage satire of Heidegger. The criticism was echoed by many in the analytic tradition. Heidegger and the existentialist were also taken to task for paying insufficient attention to social and political structures or values, with dangerous results. In France, philosophers like Sartre were criticised by those newly under the influence of structuralism for paying insufficient attention to the nature of language and to impersonal structures of meaning. In short, philosophy moved on, and in different directions. Individual philosophers remain influential, however: Nietzsche and Heidegger in particular are very much ‘live’ topics in philosophy, even in the 21st century.

However, there are some less direct influences that remain important. Let us raise three examples. Both the issue of freedom in relation to situation, and that of the philosophical significance of what otherwise might appear to be extraneous contextual factors, remain key, albeit in dramatically altered formulation, within the work of Michel Foucault or Alain Badiou, two figures central to late 20th century European thought. Likewise, the philosophical importance that the existentialists placed upon emotion has been influential, legitimising a whole domain of philosophical research even by philosophers who have no interest in existentialism. Similarly, existentialism was a philosophy that insisted philosophy could and should deal very directly with ‘real world’ topics such as sex, death or crime, topics that had most frequently been approached abstractly within the philosophical tradition. Mary Warnock wrote on existentialism and especially Sartre, for example, while also having an incredibly important and public role within recent applied ethics.

4. References and Further Reading

a. General Introductions

  • Warnock Mary. Existentialism (Oxford: Oxford University Press, 1970)
  • Barrett William. Irrational Man: A Study in Existential Philosophy (New York: Anchor House, 1990)
  • Cooper E. David. Existentialism (Oxford: Wiley-Blackwell, 1999)
  • Reynolds Jack. Understanding Existentialism (Stocksfield: Acumen, 2006)
  • Earnshaw Steven. Existentialism: A Guide for the Perplexed (London: Continuum, 2006)

b. Anthologies

  • Kauffman Walter.  Existentialism from Dostoevsky to Sartre (New York: Penguin, 1975)
  • Paul S. MacDonald. The Existentialist Reader An Anthology of Key Texts (Edinburgh: Edinburg University Press, 2000)
  • Solomon C. Robert. Existentialism (USA: Oxford University Press, 2004)

c. Primary Bibliography

  • Beauvoir de Simone. The Ethics of Ambiguity (New York: Citadel Press, 1976)
  • Beauvoir de Simone. The Second Sex (London: Jonathan Cape, 2009)
  • Camus Albert. The Myth of Sisyphus (London: Penguin, 2000)
  • Camus Albert. The Rebel (London: Penguin, 2000b)
  • Camus Albert.  The Fall, (London: Penguin, 2006)
  • Heidegger Martin, Introduction to Metaphysics (New Heaven & London: Yale University Press,2000)
  • Heidegger Martin. Letter on Humanism: in Heidegger Martin. Basic Writings, (London: Routledge, 1993)
  • Heidegger Martin. Being and Time (Oxford: Blackwell, 1962)
  • Heidegger Martin. Identity and Difference (Chicago: Chicago University Press, 2002)
  • Kierkegaard Søren. The Concept of Anxiety (Princeton: Princeton University Press, 1980)
  • Kierkegaard Søren. Fear and Trembling (New Jersey: Princeton University Press, 1983)
  • Kierkegaard Søren. Papers and Journals: A Selection, (London: Penguin Book, 1996)
  • Nietzsche Friedrich. Ecce Homo (Oxford: Oxford University Press, 2007)
  • Nietzsche Friedrich. The Gay Science (Cambridge: Cambridge University Press, 2001)
  • Nietzsche Friedrich. Twilight of the Idols (Oxford: Oxford University Press, 2008)
  • Nietzsche Friedrich. On the Genealogy of Morality (Cambridge: Cambridge University Press, 2007)
  • Sartre Jean-Paul. Being and Nothingness (London and New York: Routledge, 2003)
  • Sartre Jean-Paul, “Intentionality: A fundamental idea of Husserl’s Phenomenology.” Trans. by Joseph P. Fell, Journal of the British Society for Phenomenology, 1970, Vol. 1, No. 2

d. Secondary Bibliography

  • Camus
  • Todd Oliver. Albert Camus A Life (London: Vintage, 1998)
  • Sherman David. Camus (Oxford: Blackwell, 2009)
  • Foley John. Albert Camus From the Absurd to Revolt (Stocksfield: Accumen, 2009)
  • Sartre
  • Cox Gary. Sartre A guide for the Perplexed (London: Continuum, 2006)
  • Gardner Sebastian. Sartres Being and Nothingness (London: Continuum, 2009)
  • Stewart John, “Merleau-Ponty’s criticisms of Sartre’s theory of freedom. Philosophy Today, 39:3 (1995:Fall)
  • Heidegger
  • Beistegui de Miguel. The New Heidegger (London & New York: Continuum, 2005)
  • Marx Werner. Heidegger and the Tradition (Evanson: Northwestern University Press, 1971)
  • Polt Richard. Heidegger An Inroduction (London: UCL Press, 1999)
  • Safranski Rüdiger. Martin Heidegger: Between Good and Evil (Cambridge, Massachusetts: Harvard University Press, 1999)
  • Watts Michael. The philosophy of Heidegger (Durham: Acumen, 2011)
  • Nietzsche
  • Ansell-Pearson Keith. An Introduction to Nietzsche as Political Thinker (Cambridge: Cambridge University Press, 1994)
  • Burnham Douglas. Reading Nietzsche An Analysis of Beyond Good an Evil (Stocksfield: Accumen, 2007)
  • Burnham Douglas and Jesinghausen Martin. Nietzsches Thus Spoke Zarathustra (Edinburgh: Edinburgh University Press, 2010)
  • Burnham Douglas and Jesinghausen Martin. Nietzsches The Birth of Tragedy (London: Continuum, 2010)
  • Safranski Rüdiger. Nietzsche – A Philosophical Biography (London: Granta Books, 2002)
  • Kierkegaard
  • Pattison George. The Philosophy of Kierkegaard (Chesham: Acumen, 2005)
  • Weston Michael. Kierkegaard and Modern Continental Philosophy: An Introduction (London: Routledge, 1994)

e. Other Works Cited

  • Hegel G.W.F. Phenomenology of Spirit, (Oxford and New York, Oxford University Press, 1977)
  • Spinoza Baruch Ethics in: Spinoza Baruch Complete Works, (Indianapolis, Hackett Publishing, 2002)
  • Kant Immanuel An Answer to the Question: What is Enlightenment? in Kant Immanuel Political Writings, (Cambridge: Cambridge University Press, 1991)

 

Author Information

Douglas Burnham
Email: h.d.burnham@staffs.ac.uk
Staffordshire University
United Kingdom

and

George Papandreopoulos
Email: g.papandreopoulos@staffs.ac.uk
Staffordshire University
United Kingdom

Epistemic Luck

Epistemic luck is a generic notion used to describe any of a number of ways in which it can be accidental, coincidental, or fortuitous that a person has a true belief. For example, one can form a true belief as a result of a lucky guess, as when one believes through guesswork that “C” is the right answer to a multiple-choice question and one’s belief just happens to be correct. One can form a true belief via wishful thinking; for example, an optimist’s belief that it will not rain may luckily turn out to be correct, despite forecasts for heavy rain all day. One can reason from false premises to a belief that coincidentally happens to be true. One can accidentally arrive at a true belief through invalid or fallacious reasoning. And one can fortuitously arrive at a true belief from testimony that was intended to mislead but unwittingly reported the truth. In all of these cases, it is just a matter of luck that the person has a true belief.

Until the twenty-first century, there was nearly universal agreement among epistemologists that epistemic luck is incompatible with knowledge. Call this view “the incompatibility thesis.” In light of the incompatibility thesis, epistemic luck presents epistemologists with three distinct but related challenges. The first is that of providing an accurate analysis of knowledge (in terms of individually necessary and jointly sufficient conditions for “S knows that p,” where ‘S’ represents the knower and ‘p’ represents the proposition known). An adequate analysis of knowledge must succeed in specifying conditions that rule out all instances of knowledge-destroying epistemic luck. The second challenge is to resolve the skeptical paradox that the ubiquity of epistemic luck generates: As will become clear in section 2c, epistemic luck is an all-pervasive phenomenon. Coupling this fact with the incompatibility thesis entails that we have no propositional knowledge. The non-skeptical epistemologist must somehow reconcile the strong intuition that epistemic luck is not compatible with knowledge with the equally evident observation that it must be. The third challenge concerns the special skeptical threat that epistemic luck seems to pose for more reflective forms of knowledge, such as knowing that one knows. Each of these challenges will be explored in the present article.

Table of Contents

  1. Epistemic Luck and the Analysis of Knowledge
    1. The Incompatibility Thesis
    2. The Justified-True-Belief Analysis of Knowledge
    3. The Gettier Problem
    4. Purported Solutions to the Gettier Problem
      1. No False Grounds
      2. No Essential False Grounds
      3. Defeasibility Approaches
      4. The Externalist Turn
      5. The Causal Theory of Knowing
    5. Controversial Cases
      1. Beneficial Falsehoods
      2. Misleading Evidence One Does Not Possess
      3. Impact of These Cases
    6. Where Things Stand
  2. The Paradox of Epistemic Luck
    1. The Knowledge Thesis
    2. The Incompatibility Thesis (Again)
    3. The Ubiquity Thesis
    4. The Skeptical Challenge
    5. Rejecting the Incompatibility Thesis
    6. Knowledge-Destroying Epistemic Luck
      1. Evidential vs. Veritic Luck
      2. Justification-Oriented Luck
      3. Modal Veritic Luck
    7. Second-Wave Anti-Luck Epistemologies
      1. Sensitivity
      2. Safety
    8. Paradox Lost
  3. Epistemic Luck and Knowing that One Knows
    1. Internalism, Epistemic Luck, and the Problem of Knowing that One Knows
    2. Epistemic Luck and Reflective Knowledge
  4. Conclusion
  5. References and Further Reading

1. Epistemic Luck and the Analysis of Knowledge

There is no settled agreement as to how best to characterize the accidentality or fortuitousness of an epistemically lucky true belief. Some have attempted to cash out the accidentality of epistemically lucky beliefs modally. For example, Mark Heller (1999) contends that person S’s belief that p is epistemically lucky (and hence not knowledge) if p is true in the actual world, but there is at least one world, in a contextually-determined set of possible worlds, where S’s belief that p is false. On Duncan Pritchard’s modal characterization (2005), S’s belief is epistemically lucky if it is true in the actual world, but false in a majority of nearby possible worlds where S forms the belief in the same way. Others (Riggs 2007; Coffman 2007) insist that epistemic luck be cashed out in terms of a lack of control condition. Each of these proposals has been criticized in the literature. Despite the lack of agreement concerning the exact nature of epistemic luck, there is widespread agreement that epistemic luck is incompatible with knowledge.

One of the earliest recorded illustrations of knowledge-destroying luck can be found in Plato’s Theaetetus. In this dialogue, Socrates inquires as to what knowledge is. When Theaetetus suggests that knowledge is true belief, Socrates quickly convinces him that he is mistaken by noting that a jury may luckily arrive at a true belief either as a result of the rhetorical skill of a jurist intent on getting a certain verdict or on the basis of unsubstantiated hearsay, and in either case, the lucky true belief would fall short of knowledge. The Socratic challenge posed in the Theaetetus is to specify what must be added to true belief to get knowledge. To meet that challenge, one must provide an analysis of knowledge that correctly identifies the conditions that are individually necessary and jointly sufficient for S to know that p (where ‘S’ represents the knower and ‘p’ represents the proposition known). As will become readily apparent in what follows, the possibility of epistemic luck makes the already difficult task of meeting the Socratic challenge all the more difficult.

a. The Incompatibility Thesis

Epistemologists have long agreed with Plato that epistemic luck is incompatible with knowledge. To see just how widespread commitment to the incompatibility thesis is, consider the remarks of just few representative epistemologists. In The Problems of Philosophy (1912, p. 131), Bertrand Russell asks the question: “Can we ever know anything at all, or do we merely sometimes by good luck believe what is true?”—the implication being that lucky true belief is not knowledge. In Theory of Knowledge (1990, p. 12), Keith Lehrer stresses that knowledge requires more than lucky true acceptance: “If I accept something without evidence or justification . . . and, as luck would have it, this turns out to be right, I fall short of knowing that what I have accepted is true.” In Reasons and Knowledge (1981, p. 31), Marshall Swain maintains that: “lucky guesses do not constitute factual knowledge.” In his Stanford Encyclopedia of Philosophy article on the analysis of knowledge (2006), Matthias Steup expressly endorses the incompatibility thesis: “Let us refer to a belief’s turning out to be true because of mere luck as epistemic luck. It is uncontroversial that knowledge is incompatible with epistemic luck.” In his Internet Encyclopedia of Philosophy article on epistemology (2007), David Truncellito concurs: “a lucky guess cannot constitute knowledge. Similarly, misinformation and faulty reasoning do not seem like a recipe for knowledge, even if they happen to lead to a true belief.” In Epistemic Luck (2005, p. 1), Duncan Pritchard calls attention to “the seemingly universal intuition that knowledge excludes luck, or, to put it another way, that the epistemic luck that sometimes enables one to have true beliefs . . . is incompatible with knowledge.”

The nearly universal intuition that epistemic luck is incompatible with knowledge is rooted in compelling examples like the following:

Jack of Hearts

Dylan is an avid euchre player. One night between hands, the dealer asks Dylan which card he believes to be on the top of the freshly shuffled euchre deck. Dylan thinks for a moment and, recalling his fondness of bowers, comes to believe that the top card is the jack of hearts. After Dylan reports his belief, the dealer turns over the top card, which just so happens to be the jack of hearts.

Since the probability of the jack of hearts being the top card of a randomly shuffled euchre deck is 1/32, it is just a matter of luck that Dylan’s belief was true. Dylan certainly didn’t know that the jack of hearts was the top card. He just happened to guess correctly, and knowledge requires more than lucky guesswork.

b. The Justified-True-Belief Analysis of Knowledge

Examples like Jack of Hearts clearly show that true belief is not sufficient for knowledge. What, then, must be added to true belief in order to get knowledge? Prior to 1963, most epistemologists maintained that justification is what is required to convert true belief to knowledge, and as a result, they endorsed the justified-true-belief analysis of knowledge:

(JTB)   For any subject S and any proposition p, S knows that p if and only if:

(i) p is true,

(ii) S believes that p [Bp], and

(iii) S is justified in believing that p [Jp].

Fallibilists and infallibilists disagree about the kind of justification required by (iii). Infallibilists maintain that knowledge requires infallible justification. Infallible justification entails the truth of the proposition for which it is justification. Fallibilists, on the other hand, endorse a weaker justification requirement. They contend that the kind of justification requisite for knowledge need only render probable, but need not entail, that for which it is justification.

At first blush, it might look as if infallible justification holds the key to eliminating epistemic luck and is, thus, the kind of justification needed for knowledge. After all, if S believes that p on the basis of infallible truth-entailing justification for p, it is impossible for S to be mistaken with respect to p. Unfortunately, the legacy of infallibilism is nearly wholesale skepticism. The point can be demonstrated as follows: Since our evidence for the non-cogito contingent empirical propositions we believe never entails the truth of those propositions, it follows that if the kind of justification required for knowledge is infallible truth-entailing justification, then we are never justified in believing, and hence never know, that such propositions are true. An infallibilist justification requirement would go a long way toward eliminating epistemic luck, but it would do so at the cost of making empirical knowledge impossible—hardly an adequate non-skeptical solution to the problem of epistemic luck.

Recognizing the skeptical implications of infallibilism, most contemporary epistemologists have embraced fallibilism so that empirical knowledge remains at least in principle possible. Fallibilistic justification is thought to rule out epistemic luck by making one’s belief extremely probable. When one’s belief that p is extremely probable, it is not just a matter of luck that one’s belief is true. Recall Jack of Hearts. Prior to the dealer’s turning over the top card, Dylan has no evidence as to what the top card is. As such, it is extremely improbable that the top card is the jack of hearts. Consider how Dylan’s epistemic situation changes after the dealer turns over the top card, and Dylan sees the jack of hearts. Now Dylan has good perceptual evidence that the card is the jack of hearts. Given his newly-acquired perceptual evidence, it is now extremely probable that the card is the jack of hearts, and as a result, it is no longer just a matter of luck that his belief that it is the jack of hearts is true. Granted, it is possible that a Cartesian evil demon could have caused Dylan to hallucinate the jack of hearts right as the dealer flipped over some other card (which illustrates that Dylan’s perceptual evidence doesn’t entail that the card is the jack of hearts), and so, his evidence doesn’t eliminate all chance of error; but it does make the chance of error extremely low, and when error is extremely improbable, it is not simply a matter of luck that one’s belief is true.

c. The Gettier Problem

Although the role of the justification condition in the JTB-analysis is to rule out lucky guesses as instances of knowledge, it remains possible, given any fallibilistic account of justification, to have a justified belief that is only luckily true, a fact that went largely unnoticed until the publication of Edmund Gettier’s seminal article “Is Justified True Belief Knowledge?” (1963). Therein, Gettier provides two compelling counterexamples to the JTB-analysis of knowledge. He dubs these examples “Case I” and “Case II.” Both cases involve a person who justifiably comes to believe a true proposition by validly deducing it from a justified-but-false belief. Consider first Gettier’s Case II.

Case II: Smith has good evidence for believing that Jones owns a Ford [J]. Indeed, Smith’s evidence for thinking that Jones owns a Ford is at least as strong as the evidence that we typically have for thinking that our friends and family members own the cars they do. Smith’s evidence consists of the following: As far back as Smith can remember, Jones has always owned a Ford; just that morning, Jones gave Smith a ride while driving a Ford; and Smith was with Jones a few months back when Jones purchased a Ford exactly like the one she was driving when she offered Smith the ride earlier that morning. Based on this evidence, Smith justifiedly believes that Jones owns a Ford [J]. On the basis of her justified belief that J, Smith justifiedly deduces and comes to believe the disjunction that either Jones owns a Ford or Brown is in Barcelona [J or B], despite having no idea of Brown’s whereabouts. As it turns out, Jones no longer owns a Ford. She recently sold her Ford and is driving a rental. However, purely by coincidence, Brown happens to be in Barcelona. Obviously, it is just a matter of luck that Smith’s justified belief that J or B is true. Nearly every epistemologist who has considered this case agrees that Smith’s luck-infused justified-true-belief that J or B falls short of knowledge.

Here is a slightly modified version Gettier’s other example. Case I: While waiting for a job interview, Smith sees Nelson take the coins out of her pocket, count them (ten coins in all), and then put them back in her pocket. Smith also overhears the boss on the phone telling someone that Nelson is the person who will get the job. On the basis of this evidence, Smith justifiedly believes the conjunction:

(N)  Nelson will get the job, and Nelson has ten coins in her pocket.

On basis of her justified belief that N, Smith deduces and justifiedly comes to believe:

(P)  The person who will get the job has ten coins in her pocket.

Despite Smith’s evidence, N is false. The boss misspoke on the phone. Actually, it is Smith, not Nelson, who will get the job, and purely by chance, Smith happens to have exactly ten coins in her pocket. Once again, it is just a matter of luck that Smith’s belief—this time her belief that P—is true. With these two examples, Gettier showed that fallibilistic justification is incapable of eliminating all forms of knowledge-destroying epistemic luck and that, as a result, justified true belief is not sufficient for knowledge.

d. Purported Solutions to the Gettier Problem

Gettier’s paper gave rise to a plethora of articles attempting to solve the problem that now bears his name. Many of these purported solutions sought to resolve the problem by supplementing the JTB-analysis with a fourth condition, while others abandoned the JTB-analysis in favor of non-traditional ajustificational accounts of knowledge. Consider first some of the prominent fourth condition responses.

i. No False Grounds

In both of Gettier’s examples, Smith justifiably infers a true belief from a justified-but-false belief, and it has seemed to many that a true belief is not knowledge when it is deduced from a false belief. As a result, a number of epistemologists sought to resolve the Gettier problem by supplementing JTB with a “No False Grounds” clause along the following lines:

(NFG) S knows that p if and only if (i) p is true, (ii) S believes that p, (iii) S is justified in believing that p, and (iv) S’s justification for p does not rest on any false beliefs.

An analysis of knowledge can be too strong or too weak: It is too strong if it is possible for a person to know that p without satisfying all of the conditions spelled out in the analysis. It is too weak if one can fail to know that p when all the conditions in the analysis have been met. To see that NFG is too strong, we need only modify Gettier’s Case II as follows:

Café

Smith is sitting in a café in Barcelona with Brown having a cup of espresso. While there, with Brown, Smith justifiably infers and comes to believe that J or B on the basis of her justified-but-false belief that Jones owns a Ford [J] and on the basis of her justified-true-belief that Brown is Barcelona [B].

In this scenario, Smith has excellent evidence for B along with her misleading evidence for J. Since Smith knows B is true and validly deduces J or B from her knowledge that B, it is not at all lucky that her justified belief that J or B is true; and so, Smith knows that J or B, despite the fact that part of her evidence, namely, J, is false. Hence, NFG is false, for it entails that a person fails to know that p whenever any part (even a dispensable and thus superfluous part) of her justification is false, when, intuitively, a person with some false evidence for p can still know that p provided she has at least one independent chain of all-true-evidence justification for p.

ii. No Essential False Grounds

In Gettier’s Case II (where Smith clearly fails to know that J or B), Smith’s justification for J or B essentially depends on Smith’s justified-but-false belief that J. In Café (where intuitively Smith does know that J or B), Smith has two independent strands of justification for J or B. The first strand is her justified-but-false belief that J. The second strand is her justified-true-belief that B. As a result, Smith could dispense with the first strand entirely and still be justified in believing the disjunction J or B. Our markedly different appraisals of Smith’s epistemic status vis-à-vis J or B in these two cases suggest that the presence of false grounds precludes the knowledge that p only when those grounds play an indispensable role in a person’s justification for p. Given this insight, it might seem that the no false grounds condition in NFG should be replaced with a no essential false grounds condition as follows:

(NEFG)  S knows that p if and only if (i) p is true, (ii) S believes that p, (iii) S is justified in believing that p, and (iv) S’s justification for p does not essentially depend on any false beliefs.

Unfortunately, NEFG is too weak because there can be all-true-evidence Gettier cases—cases where the person’s justification for her lucky true belief does not depend on any false beliefs. An example of an all-true-evidence Gettier case is provided by Brian Skyrms’s (1967) case involving Sure-Fire matches:

Pyromaniac

Pete knows that Sure-Fire matches have always lit in the past when struck. Pete also knows that the match he is holding is a Sure-Fire match. Based on this evidence, which he knows to be true, Pete justifiably believes that the match he is holding will light when struck [L]. Unbeknownst to Pete, the match he is holding is a defective Sure-Fire match (the first ever!) with impurities that raise its combustion temperature above that which can be produced by striking friction. As luck would have it, just as Pete strikes the match, a sudden burst of Q-radiation ignites the match.

In Pyromaniac, Pete has a justified true belief that L, which is based entirely on true evidence that Pete knows, and yet, it is still just a matter of luck that his belief is true. This example shows that one can have a lucky true belief that p that falls short of knowledge, even when all of one’s evidence for p is true. Thus, NEFG is too weak. There are genuine Gettier cases that it fails to rule out.

iii. Defeasibility Approaches

In each of Gettier’s original cases, there is a true proposition unbeknownst to Smith such that were that proposition added to the rest of Smith’s evidence, Smith would no longer be justified in believing the Gettiered belief. Call such a proposition a defeater. In Case I, the defeater is the true proposition that Nelson will not get the job [~N]. If ~N were added to Smith’s evidence, Smith would not be justified in believing that the person who will get the job has ten coins in her pocket, for she would no longer have any idea who will get the job. In Case II, the defeater is the true proposition that Jones does not own a Ford [~J]. Since Smith has no knowledge as to Brown’s whereabouts, if ~J were added to Smith’s evidence, she would no longer be justified in believing that J or B. Notice, however, that in the case of Café (where Smith is with Brown in Barcelona), the true proposition ~J is not a defeater, because adding ~J to Smith’s evidence in Café would not prevent Smith from being justified in believing that J or B. Smith would still be justified in believing J or B on the basis of her justified true belief that B.

Defeasibility theorists contend that a person fails to know that p whenever there is a defeater for her justification for p. Their proposal for solving the Gettier problem is to supplement the JTB-analysis with a No Defeaters condition as follows:

(ND)    S knows that p if and only if (i) p is true, (ii) S believes that p, (iii) S is justified in believing that p, and (iv) there are no defeaters for S’s justification for p.

The biggest problem facing the No Defeaters approach is that there is no agreement among defeasibility theorists themselves as to the correct account of defeaters. For example, Roderick Chisholm (1964) and Peter Klein (1971) have characterized defeaters as follows:

(D1)     When evidence e justifies S in believing that p, then proposition d is a defeater for S’s justification if and only if (i) d is true and (ii) the conjunction of d and e does not justify S in believing that p.

Keith Lehrer and Thomas Paxson Jr. (1969) contend that D1 is too weak, as a definition of defeaters, because it counts as defeaters certain statements that intuitively are not defeaters. They offer the following case in point:

Grabit

While at the library, I see a student of mine, Tom Grabit, take a book from the shelf, conceal it under his coat, and leave the library without checking it out. I know Tom Grabit well, and I am sure that he stole the book. I justifiedly believe that Tom Grabit stole the book, and he did.

Intuitively, I know that Tom Grabit stole the book. But here’s the rub: Unbeknownst to me, Tom Grabit’s mother said that on the day in question Tom was not in the library, indeed, he was thousands of miles away, and that Tom’s identical twin, John Grabit, was in the library. On the D1 account of defeaters, the following true proposition is a defeater for my justification for thinking that Tom stole the book:

(M) Tom’s mother said that Tom was not in the library at the time of the theft, but his identical twin John was.

If M were added to my evidence, I would no longer be justified in believing that Tom stole the book. This result might seem like the right result until we discover that Tom’s mother is both delusional and a pathological liar, that she said these things to herself in her padded cell, that John Grabit is a figment of her demented mind, and that Tom stole the book just as I thought. Lehrer and Paxson argue that the fact that it is true that a delusional mental patient uttered false statements about Tom Grabit’s location on the day of the theft should not defeat my knowledge that Tom Stole the book. They conclude that the Chisholm/Klein account of defeaters should be replaced with the follow account:

(D2)     When evidence e justifies S in believing that p, then proposition d is a defeater for S’s justification if and only if (i) d is true, (ii) S is completely justified in believing that d is false, and (iii) the conjunction of d and e does not justify S in believing that p.

In Grabit, I do not have any evidence concerning what Tom’s mother said or didn’t say, and so, I am not completely justified in believing that it is false that she said those things. As a result, condition (ii) of D2 is not satisfied, and so, on the Lehrer/Paxson account of defeaters that fact that Ms. Grabit said those things is not a defeater for my evidence that Tom stole the book. Consequently, on the D2-account of defeaters, I have an undefeated justified true belief that Tom stole the book and thus know that Tom stole the book, which is the intuitively correct result.

The Chisholm/Klein D1-account of defeaters gets the Grabit case wrong, for it entails that the true statement M defeats my justification for thinking that Tom stole the book. Since M is a defeater on the Chisholm/Klein account, their account entails that I do not know that Tom stole the book; when, intuitively, I do know that Tom stole the book. I saw him steal it, and the insane ramblings of his demented mother do nothing to undermine my knowledge.

Now consider a different case:

Locked

John Lock is compulsive when it comes to locking his doors. This morning when he left for work, he locked the front door and tripled checked that the door was locked. It is now 11:00 a.m., and John is sitting in his office recalling his morning ritual. He clearly and distinctly remembers locking his front door and triple checking to make sure that it was in fact locked. On the basis of this vivid memorial evidence e, he comes to believe that his front door is locked. Lucy Lock, John’s wife, is notoriously unreliable about locking the doors when she leaves home, which is why John always insists on leaving home after Lucy leaves for work. Unbeknownst to John, Lucy forgot her workout clothes and returned home at 10:30 a.m. to retrieve them, and she just happened to lock the door when she left five minutes later.

So, at 11:00 a.m., John’s belief that the front door is locked is true. Presumably, John does not know that the front door is locked. He thinks the door is locked because he remembers locking it, but that is not why it is locked. It is locked because Lucy absentmindedly and uncharacteristically happened to lock it on her way out. Intuitively, John’s knowledge is defeated by the following true proposition:

(U) The door was unlocked by Lucy at 10:30 a.m.

If U were added to John’s memorial evidence e, John would no longer be justified in believing that his front door is locked. On the Chisholm/Klein D1-account of defeaters, U is a defeater because U is true and the conjunction of U and e would not justify John in believing that his front door is locked. However, on the Lehrer/Paxson D2-account of defeaters, U would not count as a defeater because sitting in his office at 11:00 a.m., John has no evidence concerning whether or not his wife returned home to retrieve her gym clothes, and so, he is not completely justified in believing it false that the door was unlocked by Lucy Lock at 10:30 a.m. Since U is not a defeater on the Lehrer/Paxson account, their account entails that John knows that his front door is locked; when, intuitively, he fails to know that his door is locked, because it is just a matter of luck that Lucy absentmindedly locked it when she left.

The problem for the No Defeaters approach, then, is this: D1 is too weak of an account of defeaters, and as a result, employing a D1-account of defeaters in ND would make ND too strong an account of knowledge; whereas D2 is too strong an account of defeaters, and so, employing it in ND would make ND too weak. Absent an adequate account of defeaters, the No Defeaters approach fails to provide a solution to the problem of epistemic luck.

iv. The Externalist Turn

Externalist theories of justification maintain that epistemic justification is (at least) partly a function of features external to the cognizer, that is, features outside the cognizer’s ken. For example, one prominent externalist theory, process reliabilism, makes a belief’s justificatory status a function of the actual reliability (rather than the perceived reliability) of the process producing that belief. One motivation behind externalism with respect to justification is its unique ability to provide a truth connection that conceptually links justification with truth. To appreciate the importance of this motivation, recall that the role of the justification condition in the JTB-analysis is to rule out lucky guesses as instances of knowledge. In order for justification to be able to properly play that role, there must be some sort of internal connection between justification and truth that makes the former objectively indicative of the latter. Indeed, many epistemologists insist that it is precisely its internal connection to truth that distinguishes epistemic justification from moral and pragmatic justification. To be objectively indicative of truth, justification must be conceptually connected (not merely coincidentally or contingently connected) with truth. In order for there to be a conceptual connection between justification and truth, the following condition must hold: In every possible world W, if conditions C make belief b justified in W, then conditions C also make b objectively probable in W. The rationale for requiring such a truth connection is this: If there were no conceptual connection between justification and truth, it would be just as much a matter of luck when a justified belief turned out to be true as when an unjustified belief turned out to be true. Moreover, a better justified belief would be no more likely to be true than a much less well justified belief, for without a truth connection, no amount of justification is an objective indication of truth.

Unlike externalist theories of justification, no internalist theory of justification can provide the desired conceptual connection between justification and truth. Internalist theories maintain that epistemic justification is solely a function of states internal to the cognizer, such as perceptual states, belief states, memorial seemings, and introspective states. Examples of such theories include classical foundationalism, coherentism, and evidentialism. That no internalist theory of justification can provide a truth connection can be demonstrated as follows: Every internalist theory of justification maintains that the conditions that make a belief justified are entirely specifiable in terms of states internal to the cognizer, and for any set of entirely internally specifiable justification-conferring conditions, CI, there will always be a possible world, WD, where a Cartesian evil demon has seen to it that S possesses the requisite internal states and, hence, satisfies CI, even though all of S’s contingent empirical beliefs are false. Since the conditions CI that make S’s belief b internalistically justified in WD­­ do not make b objectively probable in WD, no internalist theory is capable of providing a truth connection. Because internalistic justification is not conceptually connected to truth, one can easily be internalistically justified in holding a false belief, which can in turn be used to justifiably infer some other belief that may coincidentally turn out to be true. Consequently, employing an internalistic justification condition in the JTB-analysis makes JTB particularly susceptible to Gettier cases.

At first glance, externalistic justification looks more promising as a means of preventing luck from playing a role in the acquisition of true belief, for some externalist theories of justification do provide a conceptual connection between justification and truth. Consider, for example, the following simplified version of process reliabilism:

(PR)     S’s belief b is justified in world W if and only if S’s belief b is produced by a belief-forming cognitive process [BCP] that is W-reliable (where a -reliable BCP is a process that tends to produce beliefs in W that are true in W).

Since PR asserts that a belief is justified in W if and only if it is produced by a W-reliable BCP, and since, by definition, the beliefs produced by a W-reliable process tend to be true in W, PR-justified beliefs have a high objective probability of being true. Because reliably-produced, externalistically-justified beliefs are objectively likely to be true, one might think that replacing the internalistic justification condition in the traditional JTB-analysis with an externalistic justification condition would render JTB immune to Gettier-style counterexamples. William Harper (1996) quickly dispels any such notion, with the following counterexample:

Falcon

Smith believes that Jones owns a Ford. Smith was with Jones when Jones purchased her Pinto; Smith has seen the official title to the car in Jones’s name; Jones is a reliable informant that has never deceived anyone; and just this morning, Jones gave Smith a ride to work in her Pinto. Smith has a reliably-produced and reliably-sustained belief that Jones owns a Ford. It is now 1:00 p.m. Unbeknownst to Smith, at noon Jones’s Pinto was vaporized by a terrorist bomb; but, also unbeknownst to Smith, exactly at noon Jones won a Ford Falcon in the State Lottery. Hence, Smith has a reliably-formed true belief that Jones owns a Ford, but her belief is not knowledge.

While internalistic justification may be particularly susceptible to being undermined by Gettier-style knowledge-destroying luck, Harper’s counterexample shows that the Gettier problem plagues all fallibilistic theories of justification, both internalistic and externalistic alike. Whatever virtues externalistic justification might have, solving the Gettier problem is not among them.

v. The Causal Theory of Knowing

In his early work, Alvin Goldman (1967) offers a different diagnosis of what has gone wrong in Gettier cases. In Case II, what makes J or B true is the fact that Brown is in Barcelona, but this fact plays no causal role in Smith’s coming to believe that J or B. In Case I, what makes P (P = the person who will get the job has ten coins in her pocket) true is the number of coins in Smith’s pocket, but this fact plays no role in Smith’s coming to believe P. What causes Smith to believe P is the fact that Nelson has ten coins in her pocket, and this latter fact is not what makes P true. Goldman observes that in these cases there is no causal connection between the Gettiered belief and the fact that makes it true. It is the absence of such a connection that allows for the possibility of belief’s being true purely by luck. Goldman concludes that the traditional JTB-analysis should be replaced with the following causal theory of knowledge:

(CTOK)  S knows that p if and only if the fact that p is causally connected in an appropriate way with S’s believing that p.

The appropriate knowledge-producing causal processes that Goldman identifies include: (i) perception, (ii) memory, (iii) inferentially reconstructed causal chains, each inference of which is warranted, and (iv) combinations of (i)-(iii). The causal theory correctly handles all of the cases we have considered so far. We have already seen how it handles Gettier’s original cases. In Café, what makes J or B true is the fact that Brown is in Barcelona, and that fact is appropriately causally connected with Smith’s believing that J or B, because Smith is having an espresso with Brown in Barcelona. Accordingly, CTOK correctly entails that, in Café, Smith knows that J or B. In Grabit, I see Tom steal the book. Tom’s stealing the book in plain eyesight perceptually causes me to believe that he did, and so, once again, CTOK yields the right result: I know that Tom Grabit stole the book. In Locked, what makes it true that the front door is locked is the fact that Lucy locked it, and this fact plays no causal role in John’s believing his front door is locked. In this case, CTOK correctly entails that John does not know that his front door is locked. He’s just lucky that Lucy happened to lock it. Finally, in Falcon, what makes it true that Jones owns a Ford is the fact that she just won a Ford Falcon in the state lottery, and that fact plays no causal role in Smith’s believing that Jones owns a Ford, and so, Smith fails to know that Jones owns a Ford.

Despite its success in handling these cases, the causal theory falls prey to the following counterexample:

Fake Barn County

An eccentric farmer in Minnesota owns all of the land in Fake Barn County. Wanting to appear much richer than he is, this farmer has erected fake barns all throughout the county. From the road, these fake barns look exactly like real barns, when, in reality, they are just two dimensional barn façades. While nearly every barn-looking structure in the county is a fake, there are a few real barns interspersed among the myriad fakes. Henry, who is driving through Fake Barn County, has no idea that there are any fake barns in the county. Looking out the window of his car, Henry sees what looks to be a barn on the hill just up the road and comes to believe that there is a barn on the hill. Purely by chance, Henry happens to be looking at one of the few real barns in the county.

Intuitively, Henry does not know that there is a barn on the hill. He is just lucky to be looking at one of the few real barns in the county. The lucky nature of his present belief becomes even more obvious once we discover that Henry has been forming barn beliefs ever since entering Fake Barn County, and all of these other barn beliefs have been false. Henry has consistently been duped by the façades.

The causal theory fails because it cannot account for Henry’s lack of knowledge in this case. Henry is now looking at one of the few real barns in the county, and this real barn is what is causing him to believe that there is a barn on the hill. Since Henry’s true belief that there is a barn on that hill is appropriately caused via perception by that very barn on that hill, the causal theory mistakenly entails that Henry knows there is a barn on that hill, when clearly he does not.

e. Controversial Cases

As analyses of knowledge aimed as at solving the Gettier problem have grown in sophistication and complexity, so have the purported counterexamples aimed at refuting these analyses. Some of these purported counterexamples are sufficiently complex and controversial that there is no consensus among epistemologists as to whether or not the person in the example knows the proposition in question. Two such cases are discussed below.

i. Beneficial Falsehoods

Although the no essential false grounds approach was largely abandoned once it was shown that there can be all-true-evidence Gettier cases—cases where S’s justification for her lucky belief p does not depend on any false beliefs—there has remained nearly universal agreement among epistemologists that a person fails to know that p if her justification for p essentially depends on a false belief. Peter Klein (2008) is a noteworthy exception. He contends that there can be beneficial falsehoods—falsehoods essential to one’s justification that nevertheless give one knowledge. Here is one example that Klein offers in support of his controversial view:

Appointment

Based on memory, I believe that my secretary told me on Friday that I have an appointment with a student on Monday. Based on that justified-but-false memorial belief, I come to justifiedly believe that I have an appointment on Monday. As it turns out, I do have that appointment on Monday, and my secretary did tell me of the appointment. However, he didn’t tell me on Friday. He told me on Thursday.

Klein contends that I know that I have an appointment on Monday [A], even though my belief that A essentially depends on my false belief that my secretary told me on Friday about my Monday appointment. It might seem that the false belief that my secretary told me on Friday that I have a Monday appointment is not essential to my justification for A, because if I “remember” that my secretary told me on Friday of my Monday appointment, then presumably I also actually remember that my secretary told me I have an appointment on Monday, and this latter belief is true. But suppose that the secretary was out with the flu the first three days of the week, and also suppose that I do not remember being on campus on Thursday. In fact, I’m confident that I wasn’t on campus on Thursday, having totally forgotten that I briefly stopped in on Thursday to get my mail. Klein contends that in such a situation I would not believe that my secretary told me of the appointment at all, unless I believed that he told me this on Friday. Klein contends that I know that A, even though that belief essentially depends on my false belief that my secretary told me on Friday about my Monday appointment.

What distinguishes beneficial falsehoods from knowledge-destroying falsehoods? Under what circumstances does a false belief f allow S to acquire knowledge that p? Klein’s answers to these questions are rooted in and flow out of his preferred theory of knowledge. Klein contends that knowledge requires both propositional and doxastic justification. Proposition p is propositionally justified for S if and only if S has an epistemically adequate basis for p. S’s belief that p is doxastically justified for S if and only if S’s belief that p has an appropriate causal basis. The basic idea is that in order for S to know that p, S’s belief that p must be epistemically justified and appropriately caused. Armed with the distinction between propositional and doxastic justification, Klein argues that a false belief f is a beneficial falsehood just in case the following seven conditions are met: (i) f is false, (ii) S’s belief that f is doxastically justified (that is, S’s belief that f has an appropriate causal pedigree), (iii) the belief that f is essential in the causal production of the belief that p, (iv) f propositionally justifies p, (v) f entails a true proposition t, (vi) t propositionally justifies p, and (vii) whatever doxastically justifies S in believing that f also propositionally justifies t for S. When these conditions are met, Klein contends that the false belief f is “close enough to the truth” to give one knowledge that p. Applied to the case at hand, F is the false proposition that my secretary told me on Friday that I have an appointment on Monday, and T is the true proposition that my secretary told me that I have an appointment on Monday. Klein contends that my belief that F meets all the conditions for being a beneficial falsehood: (i) F is false, (ii) my belief that F is doxastically justified (appropriately caused) by the fact that he did tell me, (iii) my belief that F is essential to my believing that A (for if I didn’t believe F, I would not believe he had told me about an appointment at all and so I would not believe A), (iv) F propositionally justifies A, (v) F entails the true proposition T, (vi) T propositionally justifies A, and (vii) the fact that my secretary told me on Thursday about my Monday appointment propositionally justifies T for me.

Klein contends that, in the case at hand, it doesn’t really matter what day my secretary told me that I have an appointment on Monday. What matters is the fact that he told me. The false belief that he told me on Friday is close enough to the true proposition that he told me as to give me knowledge that I have an appointment on Monday.

While interesting and provocative, Klein’s case is difficult to assess because it depends on controversial assumptions about belief individuation. Is it possible, for example, to believe that my secretary told me on Friday that I have an appointment on Monday [F], without also believing that my secretary told me that I have an appointment on Monday [T]? If not, then rather than providing us with a case of an indispensable knowledge-generating false belief, Klein may have simply given us another case of justificatory over-determination; for if it is impossible to believe F without also believing T, then there seem to be two independent strands of justification only one of which depends on a false belief, in which case Appointment is simply an analogue of Café above.

ii. Misleading Evidence One Does Not Possess

Gilbert Harman (1973) contends that S’s knowledge that p can be undermined by readily available misleading evidence that S does not possess. In Harman cases, despite the fact that the undermining evidence is misleading, if S were to possess it, S would no longer be justified in believing that p. The idea behind Harman cases seems to be this: Since the misleading evidence is readily available, it is just a matter of luck that S does not possess that evidence, and since luck is incompatible with knowledge, S fails to know that p. Here is one of Harman’s cases:

Assassination

A political leader is assassinated. His associates, fearing a coup, decide to pretend that the bullet hit someone else. On Nationwide television they announce that an assassination attempt has failed to kill the leader but has killed a secret service man by mistake. However, before the announcement is made, an enterprising reporter on the scene telephones the real story to his newspaper, which has included the story in its final edition. Jill buys a copy of that paper, reads the story of the assassination, and believes that the President has been assassinated based on the story. What she reads is true, and so are her assumptions about how the story came to be in the paper. (1973, 143)

Harman insists that Jill does not know that the President has been assassinated. He finds it highly implausible that Jill should know simply because she lacks evidence that everyone else possesses. Harman’s diagnosis is that Jill’s knowledge is undermined by readily available evidence – the misleading televised retraction – that she does not possess.

Epistemologists who have reflected on Harman’s Assassination case remain divided over whether or not Jill knows that the President has been assassinated. Those who think that she does know that the President has been assassinated tend to focus on the facts that (i) all of her evidence is true, (ii) she knows her evidence is true, and (iii) the evidence she has is an accurate indicator of the President’s assassination.

Those epistemologists who think that Jill does not know that the President has been assassinated do not focus on the quality of Jill’s evidence, which is impeccable. Rather, they focus on the lucky nature of her evidence. If Jill had turned on the TV when she got home, like she usually does, she would have seen the televised retraction, and she would have found herself in the same epistemic predicament as everyone else. Given the conflicting reports, she would not have known what to believe. Clearly, Jill is lucky to be in the evidential situation she is in. Since luck is generally thought to be incompatible with knowledge, these epistemologists conclude that Jill fails to know that the President has been assassinated.

iii. Impact of These Cases

Controversial cases like these make the challenge of providing an accurate analysis of knowledge even more difficult. If Jill does know that the President has been assassinated, then all those theories of knowledge that imply that she lacks such knowledge (including Harman’s own theory) are mistaken. On the other hand, if Jill does not know that the President has been assassinated, then all those theories that imply she does are mistaken. Similarly, if I do not know that I have an appointment on Monday, then all those theories that imply I do (including Klein’s theory) are mistaken. If I do know that I have an appointment on Monday, then all those theories that imply I lack such knowledge are mistaken. The competing intuitions these cases engender make the already difficult task of arriving at a mutually agreed upon account of knowledge even more formidable.

f. Where Things Stand

While various proponents of the above proposals might still embrace them, the general consensus is that none of the above attempts at eliminating epistemic luck succeeds. One problem with these first attempts at resolving the Gettier problem is that they tended to emerge in a piecemeal fashion as responses to specific counterexamples, only to fall prey to more elaborate counterexamples themselves. What seems to be needed is a better understanding of epistemic luck itself. If we can get clear on the exact nature of knowledge-destroying luck, we might be in a better position to formulate a condition that can eliminate it. The next section will examine a number of attempts at clarifying the nature of knowledge-destroying luck and will assess several modal conditions that have been proposed to eliminate such luck.

2. The Paradox of Epistemic Luck

In addition to generating problems for those epistemologists seeking an analysis of knowledge, the phenomenon of epistemic luck gives rise to an epistemological paradox in its own right. The paradox is generated by the following three theses: the knowledge thesis, the incompatibility thesis, and the ubiquity thesis. The paradox arises because each of these theses is antecedently plausible, but together they form an inconsistent triad. Each thesis is discussed below.

a. The Knowledge Thesis

According to the knowledge thesis, we possess a great deal of knowledge about the world around us. Commonsense tells us that the knowledge thesis is true. For example, I know that I am in a coffee shop. I know that I am drinking a cup of coffee. I know that I am wearing a blue shirt. I know that I am typing on a laptop computer. And I know that the person sitting next to me is talking on his cell phone at an inappropriate volume. You know that you have eyes. You know that you are reading an IEP article on epistemic luck. You know that the article you are reading is written in English. Together, we know a lot. At least, we think we do, until we encounter a skeptical paradox like the paradox of epistemic luck.

b. The Incompatibility Thesis (Again)

The incompatibility thesis is the thesis that epistemic luck simpliciter is incompatible with knowledge. As noted above, there has been nearly universal agreement among epistemologists that knowledge is incompatible with epistemic luck. The post-Gettier literature is replete with evermore-sophisticated counterexamples to the array of purported accounts of knowledge proffered in an effort to resolve the Gettier problem. The standard formula for generating a counterexample to a purported analysis of knowledge is to conjure up a case where, despite satisfying all the conditions in the analysis, it is still just a matter of luck that the person’s belief that p is true. The element of luck involved is ipso facto thought to prevent the belief from being an instance of knowledge. The nearly unanimous acceptance of such examples illustrates just how widespread commitment to the incompatibility thesis is.

c. The Ubiquity Thesis

Epistemic luck is an all-pervasive phenomenon that infects every fallibilistic epistemology in one form or other. Its inescapability can be demonstrated as follows: To convert true belief to knowledge, every viable fallibilistic epistemology requires satisfying either some internalistic justification condition or some externalistic condition (that may or may not be a justification condition). But neither an internalistic nor an externalistic condition can completely succeed in eliminating epistemic luck. A little recognized consequence of the new evil demon problem is that internalistic justification is not conceptually connected to truth in any robust way, for demon-world victims have internalistically justified beliefs almost all of which are false. Given the absence of a robust truth connection, it is always in some sense a matter of luck when a merely internalistically justified belief turns out to be true. To see why, consider my twin in an evil demon world WD. By hypothesis, he has the same beliefs that I have, he has the same memorial seemings that I have, he possesses the same experiential evidence that I possess, and he goes through exactly the same internal reflections that I do. In short, our internal cognitive lives are phenomenologically, doxastically, and reflectively indistinguishable. Consequently, if I satisfy the internal conditions for justifiedness (whatever they may be), then my demon-world twin satisfies them as well, and so, we are both internalistically justified in our beliefs. If, on the other hand, I fail to satisfy those conditions, then my twin also fails to satisfy them, and so neither of us is internalistically justified in our beliefs. Now assume the former scenario where both of us are internalistically justified in our beliefs. The only relevant difference between my twin and me is that he is being systematically deceived, whereas, as epistemic good fortune would have it, I am not. If he and I were to change places, there would be no introspectable difference, and each of us would continue to believe as we do, only now I would be the hapless victim of demon deception. Clearly, I am epistemically fortunate to be in the world I am in (assuming I am in the world I take myself to be in) and not in WD. Since I am lucky to be in the world I am in, there is a clear sense in which it is epistemically lucky that my internalistically justified beliefs are true. My twin is not nearly so lucky, for, thanks to the demon, all of his internalistically justified beliefs are false. Since these results can be generated no matter which internalistic theory of justification one employs, it is always a matter of luck when a merely internalistically justified belief happens to be true.

Truth-connected externalist approaches (for example, reliabilist, truth-tracking, and safety-based accounts) avoid this kind of epistemic luck. However, they are subject to another kind of ineliminable epistemic luck. Recall, from section 1, the externalistic process-reliabilist account of epistemic justification:

(PR)     S’s belief b is justified in world W if and only if S’s belief b is produced by a belief-forming cognitive process that is W-reliable.

Call a belief that is justified in virtue of being reliably produced a PR-justified belief. Although it is not typically a matter of luck when a PR-justified belief turns out to be true (since PR-justification is conceptually connected to truth), it is a matter of luck when a belief turns out to be PR-justified. To see why, consider, once again, my twin in the demon world WD. By hypothesis, he and I share the same beliefs, possess the same evidence, go through the same internal reflections, and have phenomenologically, doxastically, and reflectively indistinguishable cognitive lives. Even so, our beliefs do not have the same PR-justificatory status. His beliefs are not PR-justified, because they are produced by processes that the demon has rendered unreliable in WD, whereas my beliefs are PR-justified because they are produced by processes that are reliable in the actual world (Again, I’m assuming, for the sake of the example, that the actual world is the world we think it is.). According to PR, it is not a matter of luck that my beliefs are true and his beliefs are false, because my beliefs are PR-justified and his are not, and PR-justified beliefs have a high objective probability of being true. What is a matter of luck is the fact that my beliefs are PR-justified and his are not. After all, we both take ourselves to be in non-demon-manipulated worlds, and we both take ourselves to have reliably-produced PR-justified beliefs. As luck and ill luck, respectively, would have it, I am correct and he is incorrect. Since there is no introspectively discernible difference between our worlds, given what each of us has to go on, there is a clear sense in which I could have just as easily been mistaken and been the one with demon-rendered-unreliable processes. Compared to my twin, I am epistemically fortunate to be in a non-demon world where my cognitive faculties are reliable. Since I am epistemically lucky (compared with my twin) to be in a world where I have reliable cognitive processes, there is clearly a sense in which it is just a matter of luck that I have PR-justified beliefs.

Analogous considerations can be applied to any externalistic constraint on knowledge. Consider the externalistic condition of being a safe belief (to be explained below). While a safe belief’s being true is not epistemically lucky, having safe beliefs is epistemically lucky, for in a demon world none of one’s beliefs are safe. Since every fallibilistic epistemology incorporates either an internalistic justification condition or an externalistic condition, no fallibilistic epistemology can rid us of epistemic luck’s intractable presence.

d. The Skeptical Challenge

Epistemic luck, then, is ubiquitous and unavoidable. If all forms of epistemic luck are incompatible with knowledge, as the incompatibility thesis maintains, skepticism is correct and the knowledge thesis is false. And yet, we remain convinced that we possess lots of knowledge. The task facing the anti-skeptical epistemologist is to reconcile the rather strong intuition that epistemic luck is not compatible with knowledge with the equally evident observation that it must be. Since the ubiquity thesis is unassailable, the anti-skeptical epistemologist must reject the incompatibility thesis.

e. Rejecting the Incompatibility Thesis

Peter Unger (1968) was the first epistemologist to note that not all forms of epistemic luck are incompatible with knowledge. He identified the following three types of benign epistemic luck: (1) Propositional luck: It can be entirely accidental that p is true, and S can still know that p. For example, a person who witnesses an automobile accident can certainly know that the accident occurred. (2) Existential luck: For S to know that p, S must exist, and it might be extraordinarily lucky that S exists. If S is the lone survivor of a fiery plane crash, S is lucky to be alive, but S’s existential luck does not preclude her from knowing that she survived the crash. (3) Facultative luck: To know that p, S must possess the cognitive skills requisite for knowledge. Suppose S is shot in the head but the bullet narrowly misses all vital regions of the brain required for conceptual thought and knowledge. S is overwhelmingly lucky that she still possesses the cognitive capacities needed for knowledge, but since she does possess them, she is still capable of knowing many things, including that she was shot in the head.

f. Knowledge-Destroying Epistemic Luck

Unger has successfully identified three types of harmless epistemic luck, but not all forms of epistemic luck are benign. What is needed is an account of knowledge-undermining luck.

i. Evidential vs. Veritic Luck

Mylan Engel Jr. (1992) distinguishes two kinds of epistemic luck, evidential luck and veritic luck, and argues that only the latter is incompatible with knowledge. Engel characterizes these two types of luck as follows:

(EL)     A person S is evidentially lucky in believing that p in circumstances C if and only if it is just a matter of luck that S has the evidence e for p that she does, but given her evidence e, it is not a matter of luck that her belief that p is true in C.

(VL)    A person S is veritically lucky in believing that p in circumstances C if and only if, given S’s evidence for p, it is just a matter of luck that S’s belief that p is true in C.

To see that evidential luck is compatible with knowledge, suppose that a bank robber’s mask slips momentarily during a holdup and the startled teller sees clearly that the robber is the bank president. In such a situation, the teller would clearly be lucky to have the evidence she does, but she would nevertheless know that the bank president is the villain.

Engel argues that all genuine Gettier cases involve veritic luck. In Gettier’s Case II presented above, Smith’s belief that J or B is veritically lucky: Given Smith’s misleading evidence of Jones’s Ford-ownership status and her total lack of evidence concerning Brown’s whereabouts, it is just a matter of luck that Smith’s belief that J or B is true. Veritic luck with respect to p is incompatible with knowing that p, because it undercuts the connection between S’s evidence for p and the truth of p in a way that makes it entirely coincidental from S’s point of view that p is true.

Engel then uses the distinction between evidential and veritic luck to assess Harman cases. Jill is not veritically lucky in believing that the President has been assassinated, for she has accurate, reliable evidence concerning the assassination in the form of a reputable newspaper’s column, and given this evidence, it is not a matter of luck that her belief is true. However, Jill is evidentially lucky—she is lucky to be in the evidential situation that she is in, for had she turned on the TV and seen the fabricated retraction, she would have been in a much worse evidential situation vis-à-vis the President’s assassination. Lucky for her, she did not turn on the TV Like the bank teller above, Jill is lucky to have the evidence she does, but given her evidence, she is not lucky that her belief is true. Having argued that only veritic luck is incompatible with knowledge, Engel concludes that Jill does know the President has been assassinated. If Engel is right, then Harman cases do not provide examples of knowledge-undermining luck.

ii. Justification-Oriented Luck

Hamid Vahid (2001) maintains that there are two types of knowledge-destroying epistemic luck. He agrees with Engel that veritic luck as characterized by VL is incompatible with knowledge, but he argues, contra Engel, that there is a kind of evidential luck (which he dubs ‘justification-oriented luck’) that is also incompatible with knowledge. Vahid contends that knowledge-precluding justification-oriented luck is a function of how easily a person’s belief could have been unjustified:

(JL)      A person suffers from knowledge-precluding justification-oriented luck, when she is justified in believing that p, but given her epistemic circumstances, she could have easily been unjustified in holding that very belief.

Vahid contends that Harman’s Assassination case provides an example of knowledge-precluding justification-oriented luck. Jill could have easily been unjustified in believing that the President was assassinated. Had she turned on the TV like she usually does, she would not have been justified in holding that belief. Vahid concludes that Jill does not know that the President was assassinated—her knowledge is destroyed by justification-oriented luck.

While JL might yield the right result in Harman’s Assassination case, it seems to yield the wrong result with respect to the bank teller case. The teller is justified in believing that the bank president is the robber because she just happened to look up during the brief moment when his mask had slipped and clearly saw the robber’s face, but she could have easily been unjustified in this belief. Had she continued to look in her cash drawer while nervously collecting the cash for the robber, she would not have seen the robber’s face. Clearly, the teller knows that the bank president is the robber, and yet, JL implies that she lacks such knowledge.

iii. Modal Veritic Luck

Duncan Pritchard (2003) agrees that, of these types of luck, only veritic luck is incompatible with knowledge, but he replaces Engel’s evidence-based characterization of veritic luck with the following modal analysis:

(MVL)  For all agents S and propositions p, the truth of S’s belief that p is veritically lucky if and only if S’s belief that p is true in the actual world a but false in nearly all nearby possible worlds in which S forms the belief in the same manner as in a.

MVL differs from VL in the following way: it concerns the connection between the method of belief formation and proposition believed, rather than the connection between S’s evidence and the proposition for which it is evidence. Pritchard argues that a safety-based neo-Moorean account, according to which knowledge is safe true belief, is capable of eliminating veritic luck. In a moment, we will see, contra Pritchard, that safe true belief is incapable of ruling out certain paradigm cases of veritic luck.

g. Second-Wave Anti-Luck Epistemologies

The post-Gettier literature is rife with attempts at supplementing or amending the traditional JTB-analysis with a satisfactory anti-luck constraint on knowledge. As surveyed in Section 1, the first wave of proposals included adding a no-false-grounds or no-essential-false-grounds condition to JTB, supplementing JTB with a defeasibility condition, incorporating an externalistic justification condition in JTB, and replacing JTB with a causal theory of knowing. These and similar proposals have fallen prey to ever-more-complicated Gettier-style examples. The general consensus is that none of these proposals succeeds. Second-wave luck-eliminating proposals invoke counterfactual or subjunctive constraints on knowing, principal among them: sensitivity and safety. Let us consider each of these proposals in turn.

i. Sensitivity

S’s belief that p is sensitive to p’s truth-value if and only if S would not believe that p if p were false (that is, if and only if S does not believe p in any of the closest ~p-worlds). To be sure, sensitive belief does preclude veritic luck, but it does so at a steep price. First, the sensitive-true-belief account of knowledge results in closure failure. Second, there are compelling reasons to think that knowledge does not require sensitivity. Let’s examine each cost in turn.

Most epistemologists regard it as all but axiomatic that we can expand our knowledge by competently deducing some currently unknown proposition u from some other known proposition k whenever we know that k entails u. This widely-embraced idea is codified in the principle of epistemic closure which has been formulated in each of the following ways:

(PEC1)   If S knows that p and S knows that p entails q, then S knows (or is in a position to know) that q.

(PEC2)   If (i) S knows that p, (ii) S knows that p entails q, (iii) S competently deduces q from her knowledge that p and that p entails q, and (iv) S comes to believe q as a result of that competent deduction, then S knows that q.

One reason the principle of epistemic closure has enjoyed such widespread endorsement is this: If I know that p and know that p entails q and I deduce and come to believe q from that knowledge, my belief that q could not be false (because knowledge is factive and the truth of p entails q, together with the truth of p, guarantees the truth of q).

The following example illustrates why sensitive-true-belief accounts of knowledge result in closure failure. I currently believe that I am in a coffee shop [C]. My belief that C is sensitive. If I were not in a coffee shop, I would not believe that I was, for if I were not in a coffee shop, I would be somewhere else, for example, the grocery store or my office, and would not mistakenly think that I was at a coffee shop. Since my belief that C is sensitive (that is, I would not believe it if it were false), the sensitive-true-belief account of knowledge entails that I know that C. I also currently believe that I am not at home in bed having a lifelike dream of being in a coffee shop [~H], but my belief that ~H is not sensitive, for if I were at home in bed having a lifelike dream of being in a coffee shop (that is, if ~H were false), I would still believe that ~H. So, according to the sensitive-true-belief account, I do not know that ~H. Of course, my being at the coffee shop entails that I am not at home in bed dreaming that I am in a coffee shop (that is, C ==> ~H), and I know that C ==> ~H. The sensitive-true-belief account results in closure failure because it entails that I know that C and know that C ==> ~H, but I do not know (and cannot come to know) that ~H on that basis.

Most epistemologists regard the principle of epistemic closure to be so plausible that they find any theory of knowledge that results in closure failure deeply problematic if not outright absurd. In fairness to sensitivity theorists, they recognize that their theories entail closure failure and acknowledge the antecedent implausibility of closure failure, but they argue that, despite its counterintuitiveness, there are principled reasons for thinking that knowledge is not closed under known implication. They grant that we have all sorts of ordinary knowledge, but insist that we do not know and cannot know that the skeptic’s hypotheses are false. Thus, they embrace closure failure because they think that it accurately captures our actual epistemic situation. Perhaps sensitivity theorists are right, but given how widely accepted the principle of epistemic closure is, it would be preferable to identify an anti-luck constraint that avoids closure failure.

The second major problem facing the sensitivity proposal, as Jonathan Vogel (1999) shows with Hole-In-One, is that knowledge does not require sensitivity. The fourth hole at Augusta National Golf Course where The Masters is played is a tricky 240-yard par 3, euphemistically called “Flowering Crabapple.” In 2007, not one player shot a hole-in-one on this diabolical hole, and there were only eleven birdies throughout four rounds of play. Right now, I know that not all seventy-two players in this year’s Masters will shoot a hole-in-one on Flowering Crabapple in the first round of play, but my belief to this effect is not sensitive. Were every golfer to shoot a hole-in-one on Flowering Crabapple in Round One of the Masters in defiance of the astronomical odds against it, I would still believe that they were not going to do so. So, sensitivity is not necessary for knowledge.

ii. Safety

Considerations such as these have led a number of epistemologists (Sosa 1999 & 2000, Williamson 2000a & 2000b, Pritchard 2005) to replace the sensitivity condition with some sort of safety condition. Safety comes in different strengths: S’s true belief that p is strongly safe if and only if were S to believe that p, p would be true (that is, in all the closest worlds where S believes p, p is true). S’s true belief that p is weakly safe if and only if S would not easily be mistaken with respect to p (that is, in the overwhelming majority of nearby worlds where S believes that p, p is true).

Peter Murphy (2005) employs Saul Kripke’s famous counterexample to sensitivity to show that strong safety results in closure failure. Suppose the following is true of Fake Barn County: The landscape is peppered with barn façades, there are a few real barns in the county, some of the real barns are red and some are blue, but all of the façades are red. Driving through Fake Barn County, Mary is unaware that the most of the barn-looking structures are façades. She looks at a blue barn and comes to believe that she is looking at a blue barn. Her belief is safe. In all nearby worlds where she believes she is looking at a blue barn, she is looking at a blue barn, for there are no blue façades. However, her belief that she is looking at a barn is not safe. There are many nearby worlds where she believes she’s looking at a barn, but is really just looking at a façade. So, strong safety entails that Mary knows she’s looking at a blue barn, but does not know she’s looking at a barn.

Weak safety is open to a different worry. If knowledge only requires weakly-safe justified true belief, then a person who justifiably believes her lottery ticket will lose knows that her ticket will lose (unless, of course, it happens to win), because in the overwhelming majority of nearby worlds, her ticket is a loser. Many epistemologists (though not all) insist that people do not know their lottery tickets will lose, prior to hearing the announced results. Anyone convinced that people do not know their tickets will lose, before learning of the results, will think that weak safety is too weak of an anti-luck constraint on knowledge.

Avram Hiller and Ram Neta (2007) convincingly argue that no safe belief condition can eliminate all cases of veritic luck as follows: Start with a justified-but-false-and-unsafe belief like Smith’s belief that Jones owns a Ford. Next, have Smith justifiably infer a disjunction of the form J or ~G, where Smith has no evidence whatsoever that ~G is true and where unbeknownst to Smith, ~G is true in all nearby worlds. Let ~G = Brown will not win a Grammy. Suppose that, unbeknownst to Smith, Brown is totally devoid of musical talent and there is no remotely close world where Brown wins a Grammy. Then, Smith’s true belief that J or ~G will be safe, but veritically lucky nonetheless, because given Smith’s evidence, it is just a matter of luck that J or ~G is true. Since the safe-true-belief account cannot rule out all cases of veritic luck, safe-true-belief is not sufficient for knowledge.

Hiller and Neta’s example also shows that Pritchard’s modal account of veritic luck [MVL] is not the correct analysis of veritic luck. Smith’s belief that J or ~G is clearly veritically lucky: Smith bases her belief that J or ~G on her justified-but-false-and-unsafe-belief that Jones owns a Ford. Since Smith has absolutely no knowledge or evidence of Brown’s total lack of musical talent, given Smith’s evidence, it is just a matter of luck that her belief that J or ~G is true. But MVL entails that Smith’s belief is not veritically lucky. According to MVL, a belief is veritically lucky if it is true in the actual world but false in nearly all nearby worlds where Smith forms the belief in the same manner. In Hiller and Neta’s case, Smith’s belief that J or ~G is true in the actual world, but it is also true in all nearby worlds where it is formed in the same way (because ~G is true in all nearby worlds). Thus, according to MVL, Smith’s belief that J or ~G is not veritically lucky. Since Smith’s belief is veritically lucky, the MVL analysis of veritic luck is mistaken.

h. Paradox Lost

The paradox of epistemic luck dissolves once we recognize that the incompatibility thesis is false. Not all forms of epistemic luck are incompatible with knowledge. Certainly propositional, existential, and facultative luck are compatible with knowledge, and at least some forms of evidential luck, like the evidential luck had by the bank teller above, are also compatible with knowledge. There is growing consensus that veritic luck is the principal form of knowledge-destroying luck. Since veritic luck is far from ubiquitous, the incompatibility of veritic luck with knowledge poses no general threat to the possibility of knowledge. One can know that p whenever it is not a matter of veritic luck that one’s justified belief that p is true.

3. Epistemic Luck and Knowing that One Knows

Although there remains broad disagreement over how exactly to formulate the condition needed to rule out knowledge-destroying epistemic luck in a theory of knowledge, there is widespread consensus that whatever the correct condition is, S does not need to know that that condition has been met in order to know that p. The point can be illustrated as follows: Let the expression “S is not Gettiered with respect to p” serve as a placeholder for whatever the correct substantive luck-eliminating condition is. If we add this condition to the traditional justified-true-belief analysis, we get the following schema for analyzing knowledge:

(K) S knows that p if and only if:

(i) p is true,

(ii) S believes that p,

(iii) S is justified in believing that p, and

(iv) S is not Gettiered with respect to p.

According to (K), S does not need to know that conditions (i)-(iv) are met in order to know that p. All that (K) requires for S to know that p is that conditions (i)-(iv) be met. Since S need not know or even believe that she is not Gettiered with respect to p in order to know that p, the possibility of Gettier-style, knowledge-destroying, veritic luck poses no special obstacle to first-order knowledge (where ‘first-order knowledge’ refers to knowing that p and ‘second-order knowledge’ refers to knowing that one knows that p). As long as S is not veritically lucky with respect to p, she will know that p, according to schema (K), provided she has a justified true belief that p.

The situation seems to be quite different when it comes to knowing that one knows, for one of the most natural ways of coming to know that one knows that p is by knowing that one has met the conditions required for knowing that p, and knowing the latter requires knowing that one is not Gettiered with respect to p. The burden of the present section is to examine whether the phenomenon of knowledge-destroying epistemic luck undermines more reflective forms of knowledge, such as, knowing that one knows.

a. Internalism, Epistemic Luck, and the Problem of Knowing that One Knows

Arch internalist H.A. Pritchard (1950) famously remarked: “We must recognize that whenever we know something we either do, or at least can, by reflecting, directly know that we are knowing it.” Other internalists have been less sanguine about the prospects of second-order knowledge. For example, Roderick Chisholm (1986) argues that one cannot generally know that one knows on the grounds that one cannot generally know whether or not one’s evidence for p is defeated by Gettier considerations. Is Chisholm right? Does the Gettier problem pose special—indeed, generally insurmountable—obstacles to internalistically knowing that one internalistically knows that p? Richard Feldman (1981) does not think so. He thinks that the Gettier problem poses a minor obstacle to second-order knowledge, but one that can be easily overcome with minimal intellectual effort. Mylan Engel Jr. (2000) disagrees. Siding with Chisholm, Engel argues that the Gettier problem poses three distinct challenges to second-order knowledge, which, when taken together, threaten to undermine the possibility of knowing that one knows. Michael Roth (1990) contends that the Gettier problem poses no threat to second-order knowledge whatsoever. To assess these competing views, it will be helpful to have a clearer idea of just what is required for internalistic knowledge.

Post-Gettier internalists with respect to knowledge tend to work within the JTB+ tradition in that they maintain that, in addition to true belief, knowledge requires internalistic justification as well as some fourth externalistic anti-luck condition to rule out Gettier cases. Accordingly, by replacing condition (iii) in schema (K) above with an explicitly internalistic justification condition, we arrive at a schema for internalistic knowledge that most internalists would readily embrace:

(Ki)      S internalistically knows (knowsi) that p if and only if:

(k1) p is true,

(k2) S believes that p,

(k3) S is internalistically justified (justifiedi) in believing that p, and

(k4) S is not Gettiered with respect to p.

Since (Ki) provides a perfectly general account of knowledgei, we can arrive at the conditions for second-order knowledgei simply by substituting S knowsi that p for p in schema (Ki):

(KiKi) S knowsi that S knowsi that p if and only if:

(kk1) S knowsi that p is true,

(kk2) S believes that S knowsi that p,

(kk3) S is justifiedi in believing that S knowsi that p, and

(kk4) S is not Gettiered with respect to S knowsi that p.

Chisholm doubts that (kk3) can be satisfied. To appreciate Chisholm’s worry, consider one of the most natural and straightforward ways of satisfying condition (kk3), namely, being justifiedi in believing that one has met all of the conditions required for knowingi that p:

(JiKip)    S is justifiedi­­ in believing that S knowsi that p if and only if:

(jk1) S is justifiedi­­ in believing that p,

(jk2) S is justifiedi­­ in believing that S believes that p,

(jk3) S is justifiedi­­ in believing that S is justifiedi in believing that p, and

(jk4) S is justifiedi­­ in believing that S is not Gettiered with respect to p.

Since (jk1) is identical to (k3), (jk1) is satisfied whenever S knowsi that p; and if we assume both doxastic and justificationali transparency (that is, if we assume that we have introspective access to what we believe and to our justificationi for what we believe), as do many internalists, then (jk2) and (jk3) also pose no special problems for the would-be second-order knower.

Chisholm’s concern is with (jk4). He contends that we cannot typically tell whether or not our evidence for p is defeated by Gettier considerations. Based on this contention, Chisholm argues as follows: Let S be any one of us and let p be a proposition that S knowsi. Since S cannot tell whether S’s evidence for p is defeated by Gettier considerations, S is not justifiedi in believing that S is not Gettiered with respect to p. Hence, (jk4) is not satisfied. Since (jk4) is not satisfied, S is not justifiedi­ in believing that S knowsi that p, that is, (kk3) is not satisfied. Since (kk3) is a necessary condition for knowingi that one knowsi that p, S does not knowi­ that S knowsi that p. The gist of Chisholm’s argument is this: Since we are not justifiedi in believing that we are not Gettiered with respect to p, we do not knowi that we knowi that p.

Feldman disagrees. He thinks that, with minimal effort, a person who knowsi that p can be justifiedi in believing that she is not Gettiered with respect to p. Feldman offers two reasons for thinking that it is relatively easy to be justifiedi in believing that one’s evidence for p is not defective and thus that one is not Gettiered with respect to p. Since Feldman is primarily concerned with determining when a person who knowsi that p knowsi that she knowsi that p, he assumes that S has first-order knowledgei that p when presenting his reasons.

No False Evidence

Assume that S knowsi that p. S’s knowingi that p entails that S is justifiedi in believing that p. Since S is justifiedi in believing that p, S is also justified­i­ in believing that all of her evidence for p is true. Since false evidence is usually what makes one’s evidence defective, S is justified in believing that her justificationi for p is not defective and thus that she is not Gettiered with respect to p.

Induction

Since S has very rarely found herself to be the victim of Gettier-type situations, she is justifiedi in believing that such situations are very rare and atypical. Given their rarity and atypicality, S is justifiedi in believing that she is not is such a situation with respect to p.

Feldman contends that No False Evidence and Induction provide S with good internalistic reasons for believing that she is not Gettiered with respect to p. Since internalistic justification is a function of having good internalistic reasons and S has good internalistic reasons for believing that she is not Gettiered with respect to p, S is justifiedi in believing that she is not Gettiered with respect to p, that is, (jk4) is satisfied. Since (jk1)-(jk3) are also easily satisfiable, with minimal intellectual effort, S can be justifiedi in believing that she knowsi that p. Feldman concludes that satisfying (kk3) poses no special obstacle to knowingi that one knowsi.

Engel contends that the Gettier problem generates three distinct challenges for the would-be second-order knower—challenges that threaten to undermine the satisfaction of (kk1), (kk3), and (kk4), respectively:

(1) First-order actual Gettierization: One way the Gettier problem can preclude S from knowingi that she knowsi that p is by preventing S from knowingi that p. If S is Gettiered with respect to p, then S fails to knowi that p, and thus, she fails to knowi that she knows that p, since (kk1) is unsatisfied. However, since first-order actual Gettierization precludes second-order knowledgei only when it obtains, it poses no greater threat to second-order knowledgei than it poses to first-order knowledgei.

(2) First-order possible (but non-actual) Gettierization: While actual first-order Gettierization, when it obtains, undermines second-order knowledgei by falsifying (kk1), possible (but non-actual) first-order Gettierization threatens to thwart one of the most natural ways of satisfying (kk3), namely, satisfying conditions (jk1)-(jk4) of JiKip. Like Chisholm, Engel’s concern here is with (jk4). He argues that the reasons Feldman offersNo False Evidence and Inductiondo not provide adequate reasons for thinking that one is not Gettiered with respect to p. No False Evidence is not a good reason to think that one has not been Gettiered with respect to p because, as noted in Section 1, there can be all-true-evidence Gettier cases, a point that Feldman himself demonstrated in an earlier article (Feldman, 1974). While No False Evidence may provide S with a reason for thinking that she is not the victim of a Gettier case involving a justified-false-belief, it provides her with no reason to think that she is not the victim of an all-true-evidence Gettier case. The problem with Induction is that many of the Gettier cases described in the literature are what we might call “invisible” Gettier cases, that is, they are cases such that, were they to obtain, the Gettier victim would never find out. They are cases that look and feel like knowledge and pass away unnoticed. Unless Pyromaniac Pete is wearing a Geiger counter, he will never discover that it was Q-radiation and not striking friction that caused his defective Sure Fire match to light. Unless John Lock interrogates Lucy Lock about her morning routine, he will likely never discover that she unlocked the doors to their house at 10:30 a.m. Unless Henry leaves the highway and investigates, he will likely never discover that most of the barn-looking structures are façades. Considerations such as these make it plausible to think that invisible Gettier cases are more likely to be the norm than visible Gettier cases. The fact that S has rarely found herself to be Gettiered in the past may provider her with a reason for thinking that visible Gettier cases are rare, but it provides her no reason to think that invisible Gettier cases are rare, and without such a reason, she is not justified­i in believing that she is not being (invisibly) Gettiered with respect to p. Engel concludes that it is much more difficult to be justifiedi in believing that one is not Gettiered with respect to p than Feldman alleges.

(3) Meta-Gettierization: Engel dubs second-order Gettierization “meta-Gettierization.” Just as first-order Gettierization occurs when S’s justification for p is defective in a way that makes S veritically lucky with respect to p, meta-Gettierization occurs when S’s justification for believing that S knowsi that p is defective in a way that makes S veritically lucky with respect to S knowsi that p. By way of illustration, Engel asks us to consider Professor Cleaver, a fictitious philosophy professor from the 1950s, who, as a pre-Gettier epistemologist, justifiably accepts the JTB-analysis of knowledge. Since Cleaver is justifiedi in believing that knowledgei is justifiedi true belief, he is justifiedi in believing that (jk1)-(jk3) are jointly sufficient for being justifiedi in believing that one knowsi that p. Since he is justifiedi in believing that (jk1)-(jk3) are jointly sufficient for being justifiedi in believing that one knowsi that p, he is justifiedi in believing that he knowsi that p provided that he is justifiedi in believing that he has justifiedi-true-belief that p. Let p be a proposition that Cleaver knowsi. If he believes that he knowsi that p, and if he is justifiedi in believing that he knowsi that p on the basis of his justifiedi-but-false-belief that knowledgei is justifiedi true belief, together with his justifiedi-true-belief that he has a justifiedi-true-belief that p, then Cleaver will have a justifiedi-true-belief that he knowsi that p, which falls short of knowledgei because his justificationi essentially depends on his justifiedi-but-false-belief that knowledgei is justifiedi true belief. The point generalizes. Anytime that Cleaver comes to believe that he knowsi that p on the basis of his justifiedi-but-false-belief that the JTB-analysis is correct, he will automatically be meta-Gettierized and will, thus, fail to knowi that he knowsi that p. Engel then argues that whether those of us who have grown up in the post-Gettier enlightenment can avoid Cleaver’s fate depends on whether any of us justifiedly­i believes a true epistemology. Since no epistemology to date is immune to objection, Engel thinks it doubtful that any of us holds a true epistemology (no matter how well justifiedi we might be in our preferred epistemology). Given how likely it is that we are operating with a false epistemology, Engel contends that whenever we come to believe that we knowi that p on the basis of our preferred epistemology, we are almost certain to become yet another meta-Gettier casualty, for we are almost certain to have based our belief that we ­­knowi that p on a justifiedi false belief about the requirements for knowledgei.

Roth contends that the debate over whether the Gettier problem poses a major or minor obstacle to second-order knowledgei is entirely misguided. He argues that Gettier considerations pose no obstacle to second-order knowledgei whatsoever. His argument is rooted in what he calls the Fallibilist Assumption Governing Empirical Knowledge:

(FA)    For every proposition of the form Kp (where p is empirical and K is the knowledge operator), there are certain contingencies such that: (i) their obtaining is physically possible, (ii) were they to obtain, Kp would be false, and (iii) S is completely justified in disregarding any of these contingencies in considering whether she has adequate justification for p.

Roth contends that there are two types of Kp-falsifying contingencies. “Type I contingencies” satisfy conditions (i), (ii), and (iii) of (FA). “Type II contingencies” satisfy conditions (i) and (ii), but not (iii). Roth asks us to imagine a great dividing wall – The Wall of Fallibilism – that separates the Type I contingencies from the Type II contingencies. As Roth envisions it, the Wall of Fallibilism plays an important role in protecting us from knowledge-destroying epistemic luck. If, given S’s evidence for p in circumstances C, it is simply a matter of luck that p is true in C, then S does not know that p in C. To ensure that it is not just a matter of veritic luck that S’s belief that p is true (in C), S must be suitably protected from error with respect to p (in C). According to Roth, the Wall protects us from the slings and arrows of outrageous Type I error possibilities by cordoning us off from these remote properly ignorable Kp-falsifying contingencies. We do not need evidence that these contingencies do not obtain in order to knowi that p. Being safely outside the Wall, we do not need to take them into account in our epistemic reflections at all. Their sheer remoteness and improbability protects us from having to worry about them. As long as they do not actually obtain, these contingencies provide no obstacle to knowledgei whatsoever. But the Wall does not provide us with all the protection from luck and error that we need in order to possess knowledgei. We must also be protected from error with respect to those Type II contingencies that are inside the Wall. These p-falsifying contingencies are genuinely in doubt. Were any of these contingencies to obtain, p would be false, and as a result, so too would Kp. To protect us from these realistic non-ignorable ~p-possibilities, we need justification that precludes them. The picture of fallible knowledgei that emerges is this:

S knowsi that p only if (i) S’s justificationi is strong enough to rule out all of the relevant Type II ~p-possibilities inside the Wall and (ii) none of the Type I contingencies outside the Wall obtain.

Roth thinks that the Wall metaphor explains why Gettier considerations pose no obstacle to second-order knowledgei­­. Gettier considerations are paradigm cases of Type I contingencies. We do not need to knowi or even believe that Type I contingencies do not obtain in order to knowi that p. As long as no Type I contingencies obtain, S will knowi that p provided she satisfies the other conditions required for knowingi that p. Like Type I contingencies generally, Gettier considerations only undermine knowledgei when they obtain. We do not need to knowi or even believe that no Gettier circumstances obtain in order to knowi that p. As long as they do not obtain, we will knowi that p provided we have met the other conditions required for knowingi that p. Since Gettier contingencies are outside the Wall, Roth contends that it is perfectly proper to ignore them when trying to determine whether one knowsi that p.

Roth’s reason for thinking that Gettier contingencies pose no obstacle to knowingi that one knowsi is that he thinks that Gettier possibilities are properly ignorable Type I contingencies that lie safely outside the Wall. The problem with Roth’s argument is that the Wall’s location is not fixed. As Roth himself admits, where the Wall is situated is relativized to a particular attempt to acquire knowledgei of a particular proposition. Which contingencies are outside the Wall and which are not, that is, which contingencies are properly ignorable and which are not, is a function of the proposition one is attempting to come to knowi and the circumstances under which one is trying to come to knowi it. While Gettier contingencies vis-à-vis p are clearly properly ignorable where coming to knowi that p is concerned, they are not properly-ignorable when it comes to knowingi that Kp. To the contrary, it seems that Kp-destroying Gettier contingencies are precisely the kind of contingencies that one needs to be able to rule out in order to know that one knows that p. Gettier contingencies are not p-falsifying contingencies (for p is true in Gettier situations), but they are Kp-falsifying contingencies. As such, they are Type I contingencies when it comes to knowingi that p, but Type II contingencies when it comes to knowingi that one knowsi that p. In effect, the Wall moves outward where second-order knowledgei is concerned. The very same Gettier contingencies that are outside the p-Wall are inside the Kp-Wall. Being inside the Kp-Wall, they are not properly ignorable when it comes to knowingi that Kp. To knowi that one knowsi that p, one must knowi that no Gettier Kp-falsifying contingencies obtain. It is precisely because we cannot generally knowi that no Gettier contingencies obtain that Chisholm and Engel contend that second-order knowledgei is difficult to attain.

b. Epistemic Luck and Reflective Knowledge

Even if veritic luck poses no special problem for reflectively knowing that one knows, Duncan Pritchard contends that another more worrisome kind of epistemic luck does preclude such knowledge. Reflective epistemic luck arises when, from the agent’s reflective position, it is just a matter of luck that her belief is true. More precisely:

MRL   For all S and p, the truth of S’s belief that p is reflectively lucky if and only if S’s belief that p is true in the actual world but, in nearly all nearby possible worlds consistent with what S is able to know by reflection alone, were S to believe p, p would be false.

When it comes to modal reflective luck, the epistemically relevant possible worlds are ordered in a non-standard way solely in terms of what the agent is able to know on the basis of her subjective internal reflections alone. Accordingly, any possible world consistent with S’s having that same internally accessible evidence that she has in the actual world will be reflectively equally close to the actual world. Since, by hypothesis, S would have exactly the same internally accessible evidence in a demon world or a BIV-world that she has in the actual world, these worlds are just as close, reflectively, to the actual world as is the world where everything is just as it seems. Since our ordinary commonsense perceptual beliefs are false in a wide variety of these reflectively equally close skeptical-scenario possible worlds, Pritchard maintains that MRL entails that our ordinary commonsense perceptual beliefsif true in the actual worldare reflectively lucky. [Whether MRL actually entails that all of our true commonsense perceptual beliefs are reflectively lucky is by no means obvious. The fact that our commonsense beliefs are false in malevolent demon and BIV worlds does not show that these beliefs are false in nearly all reflectively equally close possible worlds. After all, for every malevolent demon world where we are systematically deceived, there is a corresponding benevolent demon world that is just as close, reflectively, in which the benevolent demon sees to it that all of our commonsense beliefs are true.]

Pritchard thinks that reflective luck is not incompatible with ordinary knowledge (he thinks only veritic luck is), but he insists that reflective luck is incompatible with a much-desired internalistic kind of robust reflective knowledge. Pritchard contends that skeptical challenges force us to confront the fundamental human epistemic predicament, to wit, that we cannot know, on the basis of reflection alone, that the skeptic’s radical hypotheses are false. For example, he thinks that we cannot know, by reflection alone, that we are not bodiless brains being kept alive in vats of nutrient being deceived into thinking we have hands.

If Pritchard is right that we lack reflective knowledge that the skeptic’s hypotheses are false, then those who think that reflective knowledge is closed under known entailment face an even greater skeptical threat. According to the principle of epistemic closure (PEC1): If S knows that p and also knows that p entails q, then S either knows or is in a position to know that q. Since we know that having hands entails not being a deceived bodiless brain in a vat, if we cannot have reflective knowledge that we are not deceived bodiless brains in vats, then given PEC1, we cannot have reflective knowledge that we have hands. The point can, of course, be generalized. Since radical skeptical hypotheses are incompatible with virtually all of the ordinary propositions we routinely take ourselves to know, if we lack reflective knowledge that radical skeptical hypotheses are false, then we lack reflective knowledge of the most mundane of ordinary propositions.

Pritchard contends that skeptical challenges force us to recognize the reflectively lucky nature of our anti-skeptical beliefs and that this, in turn, explains the enduring epistemic angst that skeptical hypotheses engender. Pritchard argues that the ineliminability of reflective luck shows that we not only lack reflective knowledge that the skeptic’s hypotheses are false, we also lack reflective knowledge that our ordinary commonsense beliefs are true. If Pritchard is right, we may, indeed, possess a great deal of ordinary knowledge, but the ineliminability of reflective luck will forever preclude us from reflectively being able to tell that we do.

4. Conclusion

Reflecting on the nature and scope of epistemic luck gives us deeper insight into the nature and scope of knowledge. Gettier cases demonstrate that fallible justification is not capable of ruling out all forms of knowledge-destroying epistemic luck and that thus knowledge requires more than justified true belief. Just what anti-luck condition must be added justified true belief to arrive at an adequate analysis of knowledge remains an open question.

Recognizing which forms of epistemic luck are incompatible with knowledge and which are not puts us one step closer to identifying the correct luck-eliminating condition. It is now generally acknowledged that veritic luck is incompatible with knowledge. Whether other forms of epistemic luck, such as, justification-oriented luck, are incompatible with knowledge is a question that deserves more attention. At a minimum, any adequate theory of knowledge must be capable of ruling out all cases of veritic luck and to date no theory has been able to do so.

The possibility of knowledge-destroying veritic luck poses no special skeptical threat where first-order knowledge is concerned. As long as a person is not veritically lucky with respect to p, she will know that p, provided she has met the other conditions required for knowledge. The situation appears to be different where second-order knowledge is concerned. While there is no consensus to date as to how serious an obstacle the Gettier problem poses for second-order knowledge, it poses enough of an obstacle to such knowledge to render implausible the once widely held KK-thesis according to which knowing entails knowing that one knows.

Veritic luck is not the only form of epistemic luck that threatens more reflective forms of knowledge. Reflective luck also threatens to undermine the possibility of reflectively knowing that one knows. Our apparent inability to know, on the basis of reflection alone, that the skeptic’s radical hypotheses are false, together with the principle of  epistemic closure, threatens to undermine the possibility of reflective knowledge altogether.

5. References and Further Reading

  • Chisholm, Roderick. 1986. “The Place of Epistemic Justification.” Philosophical Topics 14: 85-92.
    • Argues that one cannot generally know that one knows on the grounds that one cannot generally know whether or not one’s evidence for p is defeated by Gettier considerations.
  • Chisholm, Roderick. 1964. “The Ethics of Requirement.” American Philosophical Quarterly 1: 147-153.
    • Provides a No Defeaters response to the Gettier problem.
  • Clarke, Michael. 1963. “Knowledge and Grounds: A Comment on Mr. Gettier’s Paper.” Analysis XXIV: 46-48.
    • Argues that No False Grounds is mistaken since S can derive a justified true belief that p from a justified true belief that q and still fail to know that p because S’s grounds for q are false. Contends that knowledge is “fully grounded” justified true belief, where in order to be fully grounded, the chain of reasons leading up to S’s proximate grounds for p must itself contain no false grounds at any point in the chain.
  • Coffman, E.J. 2007. “Thinking about Luck.” Synthese 158: 385-398.
    • Defends a lack of control account of luck.
  • Dretske, Fred. 1971. “Conclusive Reasons.” Australasian Journal of Philosophy 49: 1-22.
    • Argues that in order to rule out knowledge-destroying luck, one’s reasons for p must be conclusive in the sense that one would not have had those reasons if p were false.
  • Engel Jr., Mylan. 2000. “Internalism, the Gettier Problem, and Metaepistemological Skepticism.” Grazer Philosophische Studien 60: 99-117.
    • Argues that the Gettier problem poses three distinct challenges to second-order knowledge which, when taken together, threaten to undermine the possibility of knowing that one knows.
  • Engel Jr., Mylan. 1992. “Is Epistemic Luck Compatible with Knowledge?” Southern Journal of Philosophy 30: 59-75.
    • Identifies veritic luck as the principal form of knowledge-destroying luck. Distinguishes veritic luck from evidential luck. Argues that, of these two types of luck, only veritic luck is incompatible with knowledge. Further argues that only externalist epistemologies are capable of ruling out veritic luck.
  • Feldman, Richard. 1981. “Fallibilism and Knowing that One Knows.” Philosophical Review XC: 266-282.
    • Defends the iterative KK-thesis. Argues that the Gettier problem poses a minor, but hardly insurmountable, obstacle to second-order knowledge.
  • Feldman, Richard. 1974. “An Alleged Defect in Gettier Counterexamples.” Australasian Journal of Philosophy 52: 68-69.
    • Provides a decisive example of an all-true-evidence Gettier case that shows that No False Grounds is too weak.
  • Fumerton, Richard. 1995. Metaepistemology and Skepticism. Lanham, MA: Rowman and Littlefield Publishers.
    • Argues that second-order knowledge is too easy on externalistic accounts of knowledge and that, therefore, such accounts fail to capture the kind of knowledge that interests us.
  • Gettier, Edmund. 1963. “Is Justified True Belief Knowledge?” Analysis 23: 121-3.
    • Demonstrates that justified true belief is not sufficient for knowledge. Highlights two paradigm examples of knowledge-destroying epistemic luck.
  • Greco, John. 2004. “A Different Sort of Contextualism. Erkenntnis 61: 383-400.
    • Defends a contextualist virtue epistemology. Offers a situationalist account of veritic luck, that is, an account tied to one’s epistemic situation rather than to one’s evidence: S is veritically lucky in believing that p if and only if, given S’s epistemic situation, it is just a matter of luck that S’s belief that p is true.
  • Greco, John. 2003. “Virtue and Luck, Epistemic and Otherwise.” Metaphilosophy 34: 353-366.
    • Defends a virtue theoretic solution to the Gettier problem. Argues that when S has a true belief that p because S believes out of intellectual virtue (that is, when S’s believing out of intellectual virtue is what accounts for her have a true belief that p rather than a false belief or no belief), then S’s true belief that p is not veritically lucky.
  • Goldman, Alvin. 1979. “What Is Justified Belief?” In Justification and Knowledge. Ed. George Pappas. Dordrecht, Holland: D. Reidel Publishing Company.
    • Develops and defends an externalistic, process reliabilist account of justified belief.
  • Goldman, Alvin. 1967. “A Causal Theory of Knowing.” The Journal of Philosophy 64: 355-372.
    • Attempts to solve the Gettier problem by replacing the traditional justification condition with a causal constraint requiring that one’s belief that p be caused by the fact that makes p true.
  • Harman, Gilbert. 1973. Thought. Princeton, NJ: Princeton University Press.
    • Presents three much-discussed examples intended to show that knowledge can be undermined by readily available misleading evidence that one does not possess. Defends a No Essential False Grounds response to the Gettier problem.
  • Harper, William. 1996. “Knowledge and Luck.” Southern Journal of Philosophy 34: 273-283.
    • Demonstrates that the Gettier problem plagues all fallibilistic theories of justification, both internalistic and externalistic alike. Argues that in addition to the traditional justification, truth, and belief conditions, an adequate analysis of knowledge must incorporate a “no luck” condition.
  • Heller, Mark. 1999. “The Proper Role for Contextualism in an Anti-Luck Epistemology.” Philosophical Perspectives, 13, Epistemology: 115-29.
    • Proposes a context-sensitive modal account of epistemic luck according to which ‘S’s belief that p is epistemically lucky’ is true if and only if there is at least one world (in a contextually-determined set of epistemically relevant worlds) where S’s belief that p is false. Defends a contextualist anti-luck epistemology which maintains that ‘S knows that p’ is true if and only if there is no world (in a contextually-determined set of epistemically relevant worlds) where S’s belief that p is false.
  • Hetherington, Stephen. 2005. “Gettier Problems.” Internet Encyclopedia of Philosophy.
    • Canvasses various purported solutions to the Gettier problem. Concludes with the contentious suggestion that justified true belief is sufficient for knowledge and that veritically-lucky justified true beliefs, like those in Gettier’s original examples, are actually cases of knowledge.
  • Hiller, Avram and Ram Neta. 2007. “Safety and Epistemic Luck.” Synthese 158: 303-13.
    • Demonstrates that safe true belief is not sufficient for knowledge by providing an example of a veritically lucky safe true belief that clearly falls short of knowledge.
  • Klein, Peter. 2008. “Useful False Beliefs.” In Epistemology: New Essays. Ed. Quentin Smith. Oxford: Oxford University Press.
    • Argues that there can be beneficial falsehoods—falsehoods essential to one’s justification—that nevertheless give one knowledge.
  • Klein, Peter. 1971. “A Proposed Definition of Propositional Knowledge.” Journal of Philosophy 68: 471-482.
    • Defends a No Defeaters solution to the Gettier problem.
  • Lackey, Jennifer. 2007. “Why We Don’t Deserve Credit for Everything We Know.” Synthese 158: 345-361.
    • Argues that knowledge is not credit-worthy true belief and that thus virtue theoretic accounts of knowledge are mistaken.
  • Lackey, Jennifer. 2006. “Pritchard’s Epistemic Luck.” The Philosophical Quarterly 56: 284-289.
    • Provides a counterexample to Pritchard’s modal account of luck.
  • Lehrer, Keith. 1990. Theory of Knowledge. Boulder, CO: Westview Press.
    • Defends a coherence theory of justification. Argues that knowledge is undefeated justified true acceptance.
  • Lehrer, Keith and Thomas Paxson Jr. 1969. “Knowledge: Undefeated Justified True Belief.” The Journal of Philosophy 66: 225-237.
    • Defends an alternative No Defeaters response to the Gettier problem. Introduces the Tom Grabit counterexample to the Chisholm/Klein account of defeaters.
  • Lycan, William. 1977. “Evidence One Does Not Possess.” Australasian Journal of Philosophy 55: 114-26.
    • Argues that misleading evidence one does not possess does not undermine one’s knowledge and that, thus, Harman Cases are actually cases of knowledge.
  • Murphy, Peter. 2005. “Closure Failure for Safety.” Philosophia 33: 331-34.
    • Adapts Kripke’s famous “blue barn” counterexample to Nozick’s analysis of knowledge to show that safe true belief accounts of knowledge also result in closure failure.
  • Myers, Robert G. and Kenneth Stern. 1973. “Knowledge without Paradox.” Journal of Philosophy 70: 147-160.
    • Argues that p can justify S in believing some other proposition q only if p is true and that, thus, Gettier’s original cases do not provide examples of justified true beliefs that fall short of knowledge.
  • Nozick, Robert. 1981. “Knowledge and Skepticism.” In Philosophical Explanations. Cambridge, MA: Harvard University Press.
    • Develops and defends a sensitivity-based subjunctive conditionals analysis of knowledge.
  • Plato: Theaetetus in Plato: Collected Dialogues. Eds. E. Hamilton and H. Cairns. Princeton, NJ: Princeton University Press, 1961.
  • Pritchard, Duncan. 2005. Epistemic Luck. Oxford: Oxford University Press.
    • Proposes a modal account of veritic luck. Argues that safety precludes veritic luck. Defends an externalist neo-Moorean safe true belief account of ordinary knowledge. Concedes to the skeptic that reflective luck is ineliminable and that such luck is incompatible with reflective knowledge.
  • Pritchard, Duncan. 2003. “Virtue Epistemology and Epistemic Luck.” Metaphilosophy 34: 106-30.
  • Pritchard, H.A. 1950. Knowledge and Perception. Oxford: The Clarendon Press.
  • Riggs, Wayne. 2007. “Why Epistemologists Are So Down on their Luck.” Synthese 158: 329-344.
    • Defends a lack of control account of luck. Argues that knowledge is credit-worthy true belief.
  • Roth, Michael. 1990. “The Wall and the Shield: K-K Reconsidered.” Philosophical Studies 59: 147-157.
    • Argues that the Gettier problem poses no obstacle to second-order knowledge on the grounds that Gettier-type contingencies lie safely outside the wall of fallibilism and can simply be ignored (unless they actually obtain).
  • Russell, Bertrand. 1912. The Problems of Philosophy. Oxford, England: Oxford University Press.
  • Skyrms, Brian. 1967. “The Explication of ‘X Knows that p’.” Journal of Philosophy 64: 373-389.
    • Provides one of the first cases of an all true evidence Gettier case.
  • Sosa, Ernest. 2007. A Virtue Epistemology: Apt Belief and Reflective Knowledge, Volume 1. Oxford, England: Oxford University Press.
    • Develops and defends a virtue epistemology which maintains that knowledge is reliably-produced safe true belief, the correctness of which is attributable to one’s epistemic competence. Argues that when the correctness of a reliably-produced safe belief is attributable to the proper exercise of an epistemic competence, then the resultant belief is not epistemically lucky.
  • Sosa, Ernest. 2000. “Skepticism and Contextualism.” Philosophical Issues, 10, Skepticism: 1-18.
    • Defends a non-contextualist safety-based Moorean response to the skeptical paradox.
  • Sosa, Ernest. 1999. “How to Defeat Opposition to Moore.” Philosophical Perspectives, 13, Epistemology: 141-54.
    • Develops a safety-based Moorean response to the skeptical paradox. Argues that such a response is preferable to skeptical, tracking, relevant-alternative, and contextualist accounts.
  • Steup, Matthias. 2006. “The Analysis of Knowledge.” Stanford Encyclopedia of Philosophy.
    • Discusses the necessary conditions for knowledge. Examines internalistic and externalistic analyses of knowledge.
  • Swain, Marshall. 1981. Reasons and Knowledge. Ithaca, NY: Cornell University Press.
  • Truncellito, David A. 2007. “Epistemology,” Internet Encyclopedia of Philosophy.
  • Unger, Peter. 1968. “An Analysis of Factual Knowledge.” Journal of Philosophy 65: 157-70.
    • Shows that various forms of epistemic luck – including propositional luck, existential luck, and facultative luck – are compatible with knowledge. Argues that knowledge is non-accidentally true belief.
  • Vahid, Hamid. 2001. “Knowledge and Varieties of Epistemic Luck.” Dialectica 55: 351-362.
    • Argues that truth-oriented veritic luck and justification-oriented luck are both incompatible with knowledge.
  • Vogel, Jonathan. 1999. “The New Relevant Alternatives Theory.” Philosophical Perspectives, 13, Epistemology: 155-80.
    • Uses Hole-in-One to argue that sensitivity is not necessary for knowledge.
  • Williamson, Timothy. 2000a. “Scepticism and Evidence.” Philosophy and Phenomenal Research 60: 613-28.
  • Williamson, Timothy. 2000b. Knowledge and Its Limits. Oxford, England: Oxford University Press.
  • Zagzebski, Linda. 1996. Virtues of the Mind. Cambridge, England: Cambridge University Press.
    • Characterizes Gettier-style knowledge-destroying luck as cases of “double luck” where epistemic bad luck is cancelled out by epistemic good luck. Argues that no fallibilist epistemology can rule out knowledge-destroying luck. Defends a virtue-based epistemology according to which knowledge is a state of cognitive contact with reality arising out of acts of intellectual virtue, and argues that this definition of knowledge is immune to the Gettier problem because truth is entailed by the other components of the definition.

 

Author Information

Mylan Engel Jr.
Email: mylan-engel@niu.edu
Northern Illinois University
U. S. A.

Edmund Husserl: Phenomenology of Embodiment

HusserlFor Husserl, the body is not an extended physical substance in contrast to a non-extended mind, but a lived “here” from which all “there’s” are “there”; a locus of distinctive sorts of sensations that can only be felt firsthand by the embodied experiencer concerned; and a coherent system of movement possibilities allowing us to experience every moment of our situated, practical-perceptual life as pointing to “more” than our current perspective affords. To identify such experiential structures of embodiment, however, Husserl must clarify and set aside not only the ways in which the natural sciences approach the body, but also the ways in which we have tacitly taken over natural-scientific assumptions into our everyday understanding of embodiment. Husserl’s phenomenological investigations eventually lead to the notion of kinaesthetic consciousness, which is not a consciousness “of” movement, but a consciousness or subjectivity that is itself characterized in terms of motility, that is, the very ability to move freely and responsively. In Husserl’s phenomenology of embodiment, then, the lived body is a lived center of experience, and both its movement capabilities and its distinctive register of sensations play a key role in his account of how we encounter other embodied agents in the shared space of a coherent and ever-explorable world. Many of Husserl’s findings were taken up by such later figures in the phenomenological tradition as Heidegger and Merleau-Ponty, who gave these findings an ontological interpretation. However, Husserl’s main focus is epistemological, and for him, lived embodiment is not only a means of practical action, but an essential part of the deep structure of all knowing.

Table of Contents

  1. Introduction
    1. Sources and Themes
    2. Terms and Concepts
  2. Naturalistic Presuppositions about the Body
  3. Embodied Personhood
  4. The Structure of Embodied Experience
    1. The Body as a Center of Orientation
    2. Distinctive Bodily Sensations
    3. Movement and the “I Can”
  5. Kinaesthetic Consciousness
    1. Systems of Kinaesthetic Capabilities
    2. Kinaesthetic Capabilities and Perceptual Appearances
    3. Kinaesthetic Experience and the Experience of Others
    4. Further Philosophical Issues
  6. Conclusion
  7. References and Further Reading
    1. Primary Sources
    2. Secondary Sources

1. Introduction

a. Sources and Themes

Edmund Husserl (1859–1938), the founder of phenomenology, addressed the body throughout his philosophical life, with much of the relevant material to be found in lecture courses, research manuscripts, and book-length texts not published during his lifetime. One of the most important texts—the second volume of his Ideas Pertaining to a Pure Phenomenology and to a Phenomenological Philosophy, subtitled Studies in the Phenomenology of Constitution and usually referred to as Ideas 2—was particularly influential. Heidegger, for example, had access to it in manuscript before writing his own major work, Being and Time (1927), and Merleau-Ponty consulted it while working on his Phenomenology of Perception (1945); indeed, Ideas 2 first became generally known on the basis of Merleau-Ponty’s references to it in Phenomenology of Perception. It has long been known that the text posthumously published in 1952 as Ideas 2 had been shaped by not one, but two editors, Edith Stein and Ludwig Landgrebe (each of whom worked on the text while serving as Husserl’s assistant). But more recent scholarship by Sawicki (1997) suggests that Edith Stein (1891–1942) should be seen as the guiding architect of the work, which she attempted to recast in terms of her own philosophical commitments so as to “correct” what she saw as problems and shortcomings in Husserl’s original 1912 draft. This may be why the text as we currently have it is marked by certain gaps and tensions. In fact, no faithful account of this seminal work will be possible until a new edition is published, fully disentangling Husserl’s own train of thought from Stein’s argument. The present article is therefore based on texts from all periods, and the copious amount of relevant material has been organized in terms of four main tasks of a Husserlian phenomenology of embodiment: bringing naturalistic presuppositions about the body to light; setting aside the naturalized body in favor of embodied personhood; offering phenomenological descriptions of the structure of embodied experience; and demonstrating that transcendental (inter)subjectivity itself must be thought as kinaesthetic consciousness. Before turning to these themes, however, let us pause for a brief overview of some of the key Husserlian terms and concepts used in this article.

b. Terms and Concepts

Husserlian phenomenology stands in opposition to naturalism, for which material nature is simply a given and conscious life itself is part of nature, to be approached with natural-scientific methods oriented toward empirical facts and causal explanations. In contrast, phenomenology turns directly to the evidence of lived experience—of first-person subjective life—in order to provide descriptions of experiencing and of objects as experienced, rather than causal explanations. For Husserl, these descriptions are to be eidetic (or “essential”) insofar as what is being described is not a specific set of empirical facts considered for their own sake, but invariants governing a certain range of facts—for instance, structural patterns that must obtain for something to be an object of a certain type at all, or eidetically necessary laws such as “any conceivable color has some extension.” Husserl’s investigations of essential structures of conscious life and experience further focus on consciousness as transcendental rather than mundane, which means that consciousness is taken not as a part of the world, but as the constitutive presupposition for experiencing any world whatsoever.

Husserl’s technical term constitution takes on many nuances as his work develops. But all constitutive phenomenology is concerned with the correlation between “experiencing” and “that which is experienced”—for example, between perceiving and the perceived, remembering and the remembered, and so on. This “universal a priori of correlation” (Husserliana 6, §46) encompasses not only conscious performances actively carried out by the I (for instance, a judging whose correlate is the corresponding judgment), but also deeper strata of subjective experience that often remain unnoticed in everyday life. They can, however, be brought to light by reflecting on the structure of the type of experience concerned. For example, that only one side of the perceptual object actually appears to me at any given moment has its subjective correlate in the situatedness of embodied experience, so that any spatial thing is always seen from a particular standpoint; at the same time, that I am currently seeing “this side” of the object has its subjective correlate in my capability for movement, since I am able in principle to move in such a way as to bring “other sides” into view. In short, Husserl does not presuppose a subject-object split, but operates with a subject-object correlation—a correlation he works out in detail for almost every sphere and stratum of experience.

Moreover, as the examples indicate, a Husserlian approach to consciousness or subjectivity is not restricted to the realm of the mental as traditionally understood; instead, the phenomenological notion of embodied experience offers an alternative to mind-body dualism. And Husserl’s investigations ultimately embrace not only the achievements and correlates of constituting subjectivity, but also those of intersubjectivity, that is, of the “we” rather than solely the “I.”

Finally, a general feature of Husserl’s terminology must also be mentioned: he frequently takes over words used differently in other contexts and expects the reader to understand these words not in terms of linguistic definitions set forth in advance, but in light of their referents—the experiential features or nuances that he is describing. Thus the Husserlian tradition is not merely a tradition of texts to comment upon or argue against, but a permanent possibility of checking descriptive claims against the touchstone of the appropriate experiential evidence so as to confirm or correct such claims. Bearing this in mind, let us now return to the four main moves accomplished by a Husserlian phenomenology of embodiment: criticizing naturalistic presuppositions about the body; thematizing embodied personhood; describing the structure of embodied experience; and investigating kinaesthetic consciousness.

2. Naturalistic Presuppositions about the Body

Summary: Husserl criticizes the assumption that the body is a psychophysical entity, in order to make “the body as directly experienced by the embodied experiencer concerned” a theme for phenomenological investigation.

Let us begin by sketching Husserl’s response to the philosophical and scientific tradition in which he found himself—and in particular to the naturalism of the positivistic natural sciences, which he addresses through a critique of its presuppositions. He is specifically concerned to demonstrate how a natural-scientific tradition that has inherited a Cartesian dualism of substances (res extensa/res cogitans), and is committed to mathematization as the measure of truth, deals with the “mental” by approaching the “psyche” in terms of the “psychophysical”: it is only by taking intangible minds as localizable in tangible living bodies that natural science can bring the “mental” side of the inherited dualism into the realm of real-spatial causality, and thus into the domain of calculability, prediction, and control. Rather than automatically accepting these assumptions, Husserl brings them to light; traces their historical development; establishes the limits of their legitimacy; and offers an alternative account of “consciousness” or “subjectivity,” an account that relies on rigorous philosophical methods and on a radical turn to the evidence of lived experience, rather than on the assumptions and methods of natural-scientific cognition.

But in the course of carrying out these larger tasks, Husserl highlights a major presupposition concerning embodiment. The received tradition, with its tendency to think in terms of the “psychophysical” (even when one is not actively carrying out psychophysical investigations or making specifically psychophysical claims), not only attempts to tie the “mind” to a material “body,” but is already operating under a more basic assumption—namely, that this body can itself be taken as a physical body (Körper) like any other spatial thing, albeit a thing with certain distinctive sorts of characteristics. For even if “organisms” are the province of special natural sciences (for example, anatomy and physiology) having to do with living rather than non-living things, it is still taken for granted that like “inanimate” objects, the “animate” ones too belong to the realm of real, spatially extended entities to be explained in terms of causal laws. Yet such a presupposition completely ignores what is essential to the body as a lived body (Leib)—as my body, someone’s body, experienced in a unique way by the embodied experiencer concerned. In other words, what is missing in naturalism is the body of embodiment, which must not be taken physically, but as directly experienced from within.

Here Husserl is not challenging the right of scientific practice to approach living bodies in causal terms; in Ideas 3 (originally written in 1912, but not published until 1952), he even proposes a new science—somatology—that would incorporate both physiological investigation of the material properties of the body as a living organism and experiential investigation of firsthand, first-person somatic perception (for example, of sensing tactile contact). But he does indeed insist on clarifying the presuppositions governing natural-scientific cognition, recognizing them for what they are and acknowledging their limits, so that, as he puts it in 1936 in §9h of The Crisis of European Sciences and Transcendental Phenomenology, we do not “take for true being what is actually a method.” Thus the historical fact that living bodies can be, and have been, approached with natural-scientific methods does not automatically allow us to relegate the body of embodiment to the res extensa side of Cartesian dualism. Instead, appropriate modes of inquiry must be developed to do justice to the body of direct experience.

Accordingly, Husserl not only provides a critique of the presupposition of the “psychophysical” (and of the lived body as a physical body), but opens up several further ways in which a phenomenology of embodiment can be pursued. In Ideas 1 (first published in 1913), he sets the body aside in order to reach the realm of “pure consciousness.” Commentators sometimes mistake this strategic move for Husserl’s “position,” and accuse him of postulating a disembodied, desituated consciousness. But the body that is set out of play here is merely the body that is assumed to be the “physical” half of the inherited dualism. Moreover, this is only the first step in the critique: Husserl is effectively suspending the tacit hegemony of the prevailing presupposition whereby it is automatically accepted, as a matter of course, that the body is a physical reality that is a part of nature—and setting this assumption out of play frees us to address the body and embodiment phenomenologically rather than naturalistically. After suspending the unquestioned validity of naturalistic presuppositions concerning the body, then, the next step is to retrieve the body of experience, and Husserl employs various pivotal distinctions in order to open up the experience of embodiment for phenomenological investigation.

3. Embodied Personhood

Summary: Husserl shows that embodied experience is geared into the world as a communal nexus of meaningful situations, expressive gestures, and practical activities.

One key distinction emphasized in Ideas 2 contrasts the “naturalistic” attitude, as the theoretical attitude within which natural science is practiced, with the “personalistic” attitude that characterizes personal and social experience in the world of everyday life—the “lifeworld” (Lebenswelt), the cultural world that is the province of the cultural or human sciences. Within the personalistic attitude, our intersubjective encounters are always experienced as embodied encounters, and our ongoing practical life is already an embodied one. Thus, for example, we greet one another with culturally specific gestures such as shaking hands; we communicate with others, responding to their facial expressions, gestures, and tones of voice; we use tools in practical, goal-directed actions; we rely on bodily capabilities and develop new skills that improve with practice or grow rusty with disuse; and so on. In other words, what we come upon are others embodying themselves in particular ways (serenely or impatiently, adroitly or clumsily, buoyantly or dragged down by pain or fatigue, and so forth): we immediately see embodied persons, not material objects animated by immaterial minds, and the immediacy of this carnal intersubjectivity is the foundation of community and sociality (with culturally specific “normal” embodiment playing a privileged role as the measure from which the “anomalous” and the “abnormal” diverge). Similarly, we make immediate use of the bodily possibilities at our disposal, which serve as the “means whereby” we carry out our everyday activities, without having to appeal to psychophysical explanations: I simply reach for my cup, pick it up, and drink from it, without ever giving a thought to the neurophysiological processes that allow me to keep my balance as I reach, move the cup without spilling the liquid, and swallow without choking. And even if my abilities are compromised by illness or injury, the lived experience of “I can no longer do it” is qualitatively different from the physician’s causal explanations for my condition.

For the most part, Husserl himself provides passing examples, rather than extended analyses, of embodied experience in the personalistic attitude. Yet if we recall that his aim is not to carry out concrete cultural-scientific investigations but to clarify the philosophical bases of the cultural or human sciences, we can see that his critique of naturalistic presuppositions about the body both secures a theoretical foundation for work in such areas as nonverbal communication (as well as other sorts of studies of embodiment carried out within phenomenological psychology, phenomenological sociology, and so on), and anticipates more recent concerns with socially shaped patterns of embodiment (including, for example, issues of gendered embodiment, as contrasted with the biological “sex” of an individual—although even the medical assignment of sex at birth may display, in certain problematic cases, social/cultural assumptions and priorities).

Husserl’s discussions of the “personalistic” attitude in Ideas 2 are echoed in his extensive discussions of the lifeworld in the Crisis, and several further points concerning embodiment can be made in this connection. First of all, for Husserl, the “prescientific” world of experience is more basic than the “objective” world constituted as a correlate to scientific practice in the naturalistic attitude; for example, natural-scientific investigation of the body as an object presupposes a functioning bodily subjectivity on the part of each of the scientists concerned, for whom their own lived bodies tacitly serve as organs of perception, communication, and action, even while they are engaged in carrying out detailed research into, say, the neurophysiology of motor behavior. At the same time, however, scientific assumptions and constructs “flow back into” everyday lifeworldly language and experience, so that, for instance, I may refer to my own body in anatomical terms as a matter of course, or offer causal explanations (rather than experiential descriptions) of my own bodily condition, even in a casual conversation with a friend. Thus although there is a functional priority of the personalistic over the naturalistic attitude, the former is ongoingly shaped and reshaped by the historical acquisitions of the latter—as well as by its unnoticed philosophical presuppositions and its habitual abstractions. Moreover, despite their important differences, both the naturalistic attitude and the personalistic attitude fall within a more general attitude that Husserl terms the “natural attitude.” In the natural attitude, not only are we typically straightforwardly directed toward objects rather than reflecting on the structures of our own subjective experience, but entities such as “bodies” (whether these are taken as “psychophysical realities” or “embodied persons”) are given as ready-made realities within a pregiven world; even the experiencer for whom such entities are given is him/herself taken as one entity among others in the world. And the natural attitude is both all-pervasive and anonymous—it is so taken for granted that we are not even cognizant of it as an “attitude” at all. But when we do become aware of it, still further insights into embodied experience become possible.

4. The Structure of Embodied Experience

Summary: Husserl’s phenomenological investigations emphasize that the lived body functions as the central “here” from which spatial directions and distances are gauged; that it is the locus of distinctive sorts of directly felt sensations such as the experience of tactile contact; and that it is capable of self-movement, opening a rich range of practical possibilities.

Husserl’s approach to disclosing the natural attitude for what it is and suspending its wholesale, automatic efficacy is termed the phenomenological reduction, which leads us from the natural attitude of everyday life to the phenomenological attitude. Within the phenomenological attitude, we set aside questions framed in terms of an ultimate “being” or “reality” existing utterly in itself; instead, we make experiencing—and correlatively, phenomena, which means whatever is experienced, exactly as experienced—the focus of our investigations. For a phenomenology of embodiment, this means turning to the body of direct experience in a way that is even more radical than acknowledging everyday encounters with embodied persons in the personalistic attitude. Why is it more radical? It is because in everyday practical life, we are typically occupied with things and tasks, and ignore the bodily “means whereby” we perceive things and carry out our activities. Although the “anonymity” of this tacitly functioning, everyday body becomes a key theme in existential phenomenology of the body, Husserl too was well aware of it, and it was his groundbreaking research that initially retrieved this lived body and bodily experience from its anonymity.

a. The Body as a Center of Orientation

One mode of inquiry that Husserl uses in his descriptive investigations of the body of lived experience is eidetic phenomenology. The eidetic reduction—which is the shift whereby we enter the eidetic attitude—takes whatever it is that we are experiencing as but one “example of” a particular structure or possibility. (Although Husserl speaks of “essences” in this connection, his use of the term must be distinguished both from Platonic essences and from more recent concerns with “essentialism.”) Thus, for example, as I write these words, the carrots growing in my garden are to my left; later in the day, when I have moved to a different spot in the sun, the carrots will be to my right. But in each case, what I experience is not an empty, homogeneous, mathematical space; instead, I experience lived space as an oriented space whose directional axes—left/right, above/below, in front/behind—are gauged from my own lived body as the central “here” from which all there’s are “there” and from which things are relatively “near” or “far” (right now, the lettuce is “closer to me” than the carrots). One way to refer to this invariant structural feature of embodied experiencing (of which my current relation to the plants in my garden is but one of innumerable possible variations—each of us could contribute a different example, but they would all be examples “of” the same experiential structure) is to speak of the lived body as the “nullpoint of orientation.” But Husserl’s account is more nuanced than this. Although visual experience does indeed seem to proceed from a “point” somewhere in the head, behind the eyes (so that, for example, what you can see of your nose is “in front,” what you can see of your lower lid is “below,” and so on), Husserl also refers to the bodily “here” as a whole with such expressions as “null-position” and “null-posture,” so that the structure of the experiential “center” need not be point-like. And in exploring, say, the underside of my own chin and jaw with my hand, I may find that I am living in the touching hand as the functional center of orientation and experiencing what I am touching as being “above” this hand. Like all descriptive phenomenological claims, the latter claim—namely, that the functional center of orientation can vary from the central “point” from which vision proceeds—is an invitation to consult the relevant experiential evidence for yourself; is the example I have mentioned a possibility that you can actualize? Can you “find” it, experientially, immediately … or does this structure of experience only emerge after a while, or in a different way? Spiegelberg (1966/1986/2004), for example, explores further experiential variations concerning the lived location of the embodied “center” of experiencing, and more descriptive work on this theme (especially work carried out by phenomenologists of diverse cultural backgrounds) would be welcome.

b. Distinctive Bodily Sensations

However, the lived body is more than a tacit “zero,” an abiding “here” from which spatial dimensions of perception and action unfold; it is not an abstract or empty center, but a filled one, with its own familiar feel, for to be embodied is to experience certain sorts of sensations as “mine” in a unique way. In some passages, Husserl replaces the usual German term for sensations—Empfindungen—with a new term, “Empfindnisse” (translated as “feelings” or “sensings”). Such distinctive sensitivities may be collectively referred to as the “somaesthetic” dimension of experience, including, for example, sensations felt in our bodily depths as well as on our bodily surfaces, and encompassing many nuances beyond “pleasure” and “pain.” But one of Husserl’s most important examples is tactile contact: when you touch my body, you are touching me, and I feel it. Sometimes one and the same episode of touching can be experienced in a double way: I might, for example, be exploring a small sculpture with my fingers, intent on its contours, textures, variations in temperature, and so on, or I can continue to palpate the object, but shift to an experiential attitude focused on sensing myself in contact with it precisely here, in exactly this way—retrieving the “bodily” side of my embodied commerce with the thing. Or I myself can furnish both sides of the example, touching one hand with the other (an example of Husserl’s to which Merleau-Ponty accords great importance). Here it becomes clear not only that my own body can be given to me both as the organ and as the object of touch—both as the means whereby the activity of touching is carried out, and as the phenomenon I experience through this activity (for example, the contours and textures I can feel on the surface of my touched hand)—but also that the same touched hand that is the object explored by the touching hand is itself alive to this contact, feeling it subjectively, so that I am living in this hand too as “mine.” In this connection the term “lived” body may connote a certain “undergoing,” emphasizing “affectivity” (being affected) rather than “activity” (although both are important for Husserl, who routinely mentions them together in his later research manuscripts).

c. Movement and the “I Can”

Nevertheless, actions too are “mine” (albeit in a qualitatively different way than the immediate bodily feelings of contact, pleasure, pain, warmth, cold, and so forth, are). And along with the body’s role as the center of orientation and its unique somaesthetic sensations, Husserl also emphasizes bodily motility—the capability for self-movement per se—as an essential feature of embodiment. “Being able to move” is the foundation for any specific bodily “I do” and for what he typically terms the bodily “I can” (which can be experienced as such even without actually performing the movement concerned—for instance, one can find the lived consciousness, “I can nod my head,” without actually doing it, experiencing it instead as a practical possibility given in the sheer “I could”). The range of the “I can” is enriched when I cultivate my capabilities or learn new skills, although as I have already indicated, it may also be temporarily eroded or permanently truncated in cases of illness or injury, so that the “I can” becomes an “I cannot.” For Husserl, however, the lived experience of embodied motility goes far beyond movement that is actively initiated by the I: there are also movements such as breathing, which normally goes on without my active intervention, yet can indeed be deliberately altered to some extent. Husserl therefore speaks of all such bodily movement as pertaining to the I in a broad sense that encompasses, but also includes more than, the active, awake I. For example, habitual movement patterns such as playing a familiar piece on the piano can indeed proceed without my explicit, moment by moment direction, yet are still lived as “mine,” and although they may often remain marginal, they can also be informed with awareness—or with a kind of “active allowing,” as when I lend them my “fiat” and am consciously letting the movement unfold. Thus here motility is a broader concept than agency in the strict sense whereby an “agent” would be actively, explicitly involved in initiating and directing the action throughout.

5. Kinaesthetic Consciousness

 

Summary: Husserl describes the articulation of kinaesthetic capabilities into coordinated systems of specific movement possibilities; outlines the “if-then” structure through which actualizing certain kinaesthetic possibilities brings coherent fields of appearances to givenness; suggests how a different “if-then” structure—one linking the deployment of my own kinaesthetic capability with the bodily feel of the movement concerned—is implicated in coming to experience other moving bodies as other sentient beings “like me”; and addresses the tension between “embodiment” as an ongoing dynamic, subjective process and the “body” as one object among others in the world.

Husserl devotes considerable attention to the theme of motility, and sketching out some of this work in more detail will allow us to see how his descriptive phenomenological work on embodiment fits into the larger philosophical context of his constitutive phenomenology (recalling that here “constitution” ultimately refers to the correlations between that which is experienced and the relevant performances and achievements of “consciousness” or “subjectivity”). Here a distinction given terminological form by one of Husserl’s assistants, Ludwig Landgrebe (1902–1991), is particularly helpful: that between the body-as-constituted and the body-as-constituting. The body-as-constituted is the body as experienced, that is, it is “that which” is experienced in the experience; the body-as-constituting is the experiencing body “by means of which” something is experienced. And for Husserl, this embodied, experiencing subjectivity (the body-as-constituting) is above all a kinaesthetic consciousness (Claesges 1964)—not as a consciousness “of” movement, but as a consciousness or subjectivity capable of movement.

a. Systems of Kinaesthetic Capabilities

Thus Husserl’s recourse to “kinaestheses” does not refer to “sense data” (for example, sensations pertaining to muscles or joints) postulated as “ingredients” of perceptual experience (for instance, of my own limbs given to me as material objects moving in space), but to the sheer experience of the subjective capability for movement per se (including the “I could” already mentioned) and to its organization into kinaesthetic systems, each with its own (multidimensional) leeway of movement possibilities. For example, in visual perception, the movement of the eyes alone forms one system; head movement affords a second system; the possibilities of rotating one’s entire body on the spot counts as a third system; and locomotor movement (for instance, walking) adds yet another system. When I turn to the left to look for the bird in the birdbath, my eye, head, and torso movements are typically vectorially combined into one integrated gesture. Turning my head allows me to see farther to my left than if eye movements alone were involved, and turning my torso expands my view beyond what eye and head movements can offer together—but whatever combination of eye, head, and torso movements is swung into play when I hear the splash in the birdbath, “turning to the left” will eventually allow me to bring what initially appeared only at (or beyond) the left-hand periphery of the visual field into the center of the field. Kinaesthetic systems can also stand in for one another—if my arms are full, I may hold the door open with my hip or acknowledge a friend’s wave with a gesture of my head rather than my hand. In this way the interarticulated kinaesthetic systems work together as one total kinaesthetic system whose multifarious possibilities of coordination typically take on the more circumscribed form of a habitual repertoire of familiar movement possibilities and customary ways to move.

Even within this more limited leeway, however, motility is characterized by a certain essential freedom that can be contrasted with the physical motion of spatial objects. This by no means implies complete freedom in every case—once I jump off the diving board, it is too late to change my mind, and I am headed for the water, since—unlike a bird—I have no way to fly back up into the sky. But the lived motility in which kinaesthetic consciousness holds sway is more typically experienced as reversible: having turned my head to the left, I can turn it back to the right; having extended my hand, I can withdraw it; having gone in one direction, I can retrace my steps. Moreover, the lived movement can be not only reversed, but repeated, interrupted, and inhibited; for Husserl, even “holding still” is a dynamic event, since it involves ongoingly maintaining a certain kinaesthetic “constellation” or “situation.”

b. Kinaesthetic Capabilities and Perceptual Appearances

Such descriptions retrieve kinaesthetic functioning from its anonymity, but remain abstract as long as its constitutive role is not specified more precisely. For example, enacting certain kinaesthetic possibilities brings certain correlative perceptual appearances to givenness in a concordant, regulated, non-arbitrary manner. “From here” I can see “this side” of the house, but this side already promises more, a situation for which Husserl uses the technical terms “inner horizon” and “outer horizon.” The current appearance of this side points to an inner horizon of possible future perceptions in which this very same side would itself be more fully given—for instance, if I were to move closer, then it could be touched as well as seen, or what is currently seen indistinctly could be seen in more detail, and so on. But “this” side of the building also points to an outer horizon of possible future perceptions of other “sides,” as well as further features of the surroundings, including currently unseen sides of other objects in the background, and so on.

Here what is important is not merely that Husserl’s account of perception emphasizes a correlation between, on the one hand, an embodied perceiver functioning as a center of orientation and, on the other hand, the perspectivity that is the invariable mode of givenness of perceived things in space; rather, what is at stake is a coherent, explorable, transcendent, open world. In other words, it is not merely that I see things from my own standpoint: it is that my own motility is the subjective correlate both of the world’s open explorability—its transcendence “beyond” the aspect of it given at any moment—and of its concordant coherence, since if I enact the appropriate kinaesthetic sequences, then what is currently “emptily” predelineated can be “fulfilled” in itself-givenness of the anticipated side or feature concerned (or can be “disappointed” and corrected instead). For example, I see a “corner” of the house; inseparable from the experience of this as a “corner” is that there is “more” to the house to be seen “around the corner” (even if this “more” is as yet indeterminate), and “if” I move there, “then” I will see precisely this “more,” determine its features in more detail, and so forth (or perhaps discover that all that is left of the building is a facade). However, the “if-then” relation that is at stake here is not a causal one, since the correlations in question pertain to the ordered structure of experience purely as experienced, not to real relations between physical entities considered in the naturalistic attitude (an attitude that we are, of course, free to take up if we wish).

Within the phenomenological attitude, in other words, the point is not to establish causal relations between “turning my head to the left” and “seeing a birdbath”; instead, the horizon of freedom pertaining to kinaesthetic consciousness opens ordered fields of display that can be seamlessly expanded as I move, so that turning my head to the left allows the corresponding further stretch of the visible world to come into view, whatever there may in fact be for me to see in any given situation or on any given occasion. And the same fundamental correlation between kinaesthetic capabilities and coherent fields of spatial display holds good for movement in any direction, as well as for the intersensorial world. Thus the description identifies an essential structure of experience per se, rather than offering a causal explanation of a particular empirical/factual event. Moreover, it turns out that Husserl’s analyses are not confined to the kinaesthetic circumstances swung into play in experiencing individual transcendent things “in” space, but demonstrate that kinaesthetic consciousness is itself space-constituting. (Early extensive analyses are found in the 1907 lectures published in Thing and Space, but Husserl refined his account throughout his life.) This, then, is another example of a Husserlian critique of presuppositions: he does not naively assume “space” as a pregiven framework for embodied perception and action (for example, as some kind of ready-made “container”), but devotes many pages to the experiential evidence that is at stake in the givenness of various types or levels of “space,” including not only the most immediate, “preobjective” space, but the infinite and homogeneous space of the natural sciences.

c. Kinaesthetic Experience and the Experience of Others

At this point, a second set of analyses come into play, for it is a feature of lived embodiment that I cannot jump out of my own skin and walk around myself in order to survey myself from all sides: the seeing consciousness is always at the center of orientation, and although I may be able to see parts of myself from various angles, I cannot see myself as a whole, as a figure on a ground or as an object “over there” from which I could definitively move away. Instead, I function as the uncancellably abiding “here” from which space-perception invariably proceeds. This means, however, that my “solo” experience of situated motility leaves me with a “hole” in space wherever I go—a mobile but non-surveyable center around which the rest of the panorama unfolds. The constitution of a genuinely homogeneous, objective, three-dimensional space requires the contribution of others, for whom I myself am indeed “over there,” inhabiting one among many possible “there’s.”  Thus space-constitution is tied up with the Husserlian theme of intersubjectivity, which is also a key motif in his phenomenology of embodiment. Although Husserl gives various accounts of intersubjectivity, the present article pulls together some pieces of the puzzle that depend directly on his work with kinaesthetic consciousness. Note, however, that this account is not a linear account of discrete “steps” to be carried out, as though we began our existence utterly alone and only gradually discovered fellow living beings; rather, the explication furnishes a kind of “exploded diagram” of certain structural moments involved in the lived experience of recognizing embodied others—here construed broadly enough to encompass non-human as well as human cases. (Thus in the technical language of phenomenology, the “exploded diagram” account is “static,” rather than providing a “genetic” description of the origin and development of a certain type of experience as an abiding acquisition.)

For Husserl, a double dimension of “localization” of kinaestheses comes into play in this regard (always recalling that here the term “kinaesthetic” refers to motility per se, to the sheer “I can”/“I could” rather than to specific sorts of “sense data”). First of all, it is possible for at least some enactments of my own directly experienced motility to be co-given to me in the form of something perceivable in the same way as things of the world are. Thus, for example, not only can I move my own limbs, but—within limits—I can see them as moving objects in the same field of vision where other spatial things are given: subjective motility is “localized” in objectively appearing movements displaying certain distinctive styles of movement and modes of relating to the surrounding world (think, for example, of my own active/responsive hands being visible to me as I reach for an object and grasp it). Similarly, the kinaesthetic experience of speaking, singing, or crying out is paired with sounds appearing in the same audial field in which other sounds are given. (Although what I am providing here is, as I have indicated, a structural explication of intersubjectivity rather than a genetic-developmental account, it should be pointed out that in one brief passage on the mother-child relationship, Husserl emphasizes the child linking his/her own kinaesthetic capability for vocalization with certain heard sounds in the audial sphere in general—and then hearing sounds resembling these in certain respects, but without simultaneously experiencing the relevant kinaestheses, so that this contrast mediates the emerging own/other distinction.) But at the same time, enacting this or that kinaesthetic possibility (or constellation of possibilities) from the total kinaesthetic horizon yields yet another “if-then” order, above and beyond the coherent correlations discussed above whereby kinaesthetic circumstances motivate corresponding perceptual appearances. For “if” I move in a certain way, “then” even without touching myself, I can experience correlative somaesthetic sensations or sensings (Empfindnisse): kinaesthetic enactments are “localized” in corresponding patterns of felt embodiment (for instance, experiences of straining or releasing) through which my own lived body is concretely, sensuously present for me (whether marginally, as when I am immersed in something I am reading, or thematically—think, for example, of how it feels to stretch luxuriously).

To put it another way, the “mineness” of my own act of moving is linked with the “mineness” of the accompanying somaesthetic sensations, as well as with recognizable styles of externally perceivable movement. When I perceive movement in such a style, then, but without the correlative kinaesthetic consciousness (the tacit or explicit “I move”) and its accompanying patterns of felt somaesthetic localization, I experience (via what Husserl terms “passive syntheses of association”—which, however, must not be confused with “associationistic” psychological theories) another subjectivity who, like me, is a subject of both action and affection, both agency and ownership, both doing and undergoing. Thus it is not necessary to see another body that looks the same as mine in the sense of being roughly the same size, shape, and color in order to motivate the experience of recognizing the other’s subjectivity—in fact, the view of myself “from the outside” that this would require is precisely something that I can never fully have: it is a possibility that is itself motivated from the experience of the other as having his/her own point of view on me, and thus cannot serve to motivate my recognition of the other as another subjectivity in the first place. Instead, what I experience when I see the other stands at a higher degree of universality: I see a style of movement associated with certain eidetic features proper to sentient/sensitive motility per se, namely, kinaesthetic capability and somaesthetic sensibility. But exactly because these invariants are open to exemplification in so many ways, they provide the foundation for the lived experience of difference-from-the-other as well as that of similarity-with-the-other, since they are the very identity that permits the experience of “difference” here at all.

I do not, in other words, recognize others because I see them as reiterations of myself in my concrete embodiment; the I-other pairing does not consist of a model and a replica, but of two mutually contrasting variations of “embodiment per se,” only one of which I have genuinely original access to (when I pick up the heavy stone, I experience both my own effort and the stone’s resistance firsthand; seeing the other struggle with the stone, I may understand the degree of effort involved and realize that I am the stronger of the two of us, but I do not experience the other’s effort in the same direct way I experience my own, nor do I directly experience the other’s pain if the stone slips and lands on the other’s toe). Thus the other active, sentient, sensitive, relational body is not some sort of duplicate of my own body, but precisely “a” lived body lived from an experiential standpoint I myself can never inhabit, a “here” that is truly transcendent to my own precisely because I inevitably experience it as a “there” in paired contrast to my own “here.”

d. Further Philosophical Issues

So far, I have sketched out how embodiment understood as kinaesthetic consciousness functions in Husserl’s philosophical accounts of the transcendent spatial world and transcendent fellow subjectivities. Here it is not possible to flesh out these accounts in any more detail, although it can be said that for Husserl, our very openness to the world essentially involves a kinaesthetic engagement with what is most immediately, sensuously given in such a way that the genetic origins of transcendental logic itself can be traced back to these kinaesthetic capabilities and performances and their correlative sensuous “givens” (see Husserliana 11), matters he thematizes under the title of “transcendental aesthetics” (although he takes this term in a different sense than Kant’s). However, a further step must at least be touched on, one that draws upon yet another important distinction—that between the transcendental and the mundane. Husserl’s analyses of kinaesthetic consciousness assume a transcendental attitude, yet in the natural attitude, the body is—as we have seen—“obviously” a mundane reality, a part of the world. Although his earlier efforts were geared toward clarifying the philosophical foundations of the sciences that study such a reality, some of his later writings (see, for example, Husserliana 15, 282–328, 648–57) are framed as an inquiry into the experiential achievements whereby transcendental consciousness “mundanizes” itself in the first place (that is, takes itself as part of the world), even prior to “naturalizing” itself in psychophysical terms. Without going into detail about his approach to the problem (which is also known as the paradox of subjectivity—how can the very consciousness that constitutes the world simultaneously be a part of this world?), it should be emphasized that for Husserl, what is at stake is ultimately not at all how a “disembodied” consciousness could somehow acquire a “body.” Instead, after demonstrating that kinaesthetic capability is an essential structural moment of transcendental subjectivity itself, he asks how kinaesthetic consciousness as an ongoing flow of purely experiential potentialities (the possibilities of primal motility per se), and of ever-changing actualizations of these possibilities, can come to count as a mundane entity apprehended as one thing among others in the world (and here Husserl’s descriptions of the lived experience of “resistance” offer important clues). In any case, however, the Husserlian critique of presuppositions concerning the body leads to something like the possibility of transcendental corporeality—a notion that places many aspects of the Western philosophical tradition itself into question.

6. Conclusion

Recognizing the tension between the transcendental experience of embodiment as kinaesthetic consciousness (or indeed, of “consciousness” or “subjectivity” as kinaesthetic) on the one hand and the mundane experience of the body as a material, psychophysical reality on the other can now allow us to summarize two of Husserl’s most important contributions to a phenomenology of embodiment (above and beyond his pioneering descriptions of essential features of bodily subjectivity). First, taken transcendentally, embodiment is not something accomplished once and for all, but is—to borrow a telling phrase from Zaner’s The Problem of Embodiment (1964)—“a continuously on-going act”: at every moment (even during periods of relative quiescence), I am involved in a dynamic process of “embodying” that is carried out through the current actualization of my own kinaesthetic capabilities, with certain possibilities rather than others being actualized in this or that way. This is the case whether the particular kinaestheses swung into play at any given moment arise from instinctual strivings, involuntary adjustments, acquired habits, or volitionally directed free movement, and whether these patterns of kinaesthetic actualization are going completely unnoticed; are marginally present for me; are experientially prominent due to difficulty or discomfort; or are consciously appreciated in lucid awareness “from within” (note that these two sets of possibilities—one having to do with volition, the other with awareness—can intersect in a number of ways). Second, that I can apprehend myself as a “psychophysical unity” is not simply something to be naively accepted, but something to be clarified as a historical achievement whereby embodied experience is localized in a mundane object, “the body,” as one item among others in a world of material, natural realities. Thus within the natural attitude, “embodiment” winds up signifying the external expression of the inwardness always already essentially pertaining to “bodies” insofar as they are lived bodies (as opposed to mere physical things). And indeed, experiential evidence for the latter sense of “embodiment” is readily available in everyday life in our times—we immediately encounter one another as embodied persons, not as machines that we suspect or conclude must harbor minds. However, experiencing myself as an ongoing realization of my own kinaesthetic capabilities taken not “psychophysically,” but sheerly experientially (which deactivates, rather than presupposing, mind-body dualism) requires shifting from the mundane to the transcendental attitude (in Husserl’s sense of the term “transcendental”), although once this insight has been historically achieved, it too, like the achievements of naturalism, can “flow back into” everyday experience.

In the end, then, whether he is providing a phenomenological genealogy of the “psychophysical” or offering an alternative account of embodiment in terms of “kinaesthetic consciousness,” Husserl provides a powerful critique of Cartesian dualism. Nevertheless, his own interests are basically epistemological in character: the accent lies not on claims about what the body really “is,” but on the epistemic contribution of embodiment itself to our knowledge of the world in the first place (as well as on the legitimacy of the foundations of our knowledge of such matters). And as in all of his phenomenological work, Husserl does not merely “hold a position” and offer arguments to support it, but consistently and rigorously takes the ultimate court of appeal to be the experiential evidence pertaining to the phenomena themselves—evidence that continually outruns our inherited terms and concepts and requires us to place them in question.

7. References and Further Reading

a. Primary Sources

  • Husserl, Edmund. Ding und Raum. Vorlesungen 1907. Ed. Ulrich Claesges. Husserliana 16. Den Haag: Martinus Nijhoff, 1973; Thing and Space: Lectures of 1907. Trans. Richard Rojcewicz. Dordrecht: Kluwer Academic Publishers, 1997, Sections IV–VI (129–245).
  • Husserl, Edmund. Ideen zu einer reinen Phänomenologie und phänomenologischen Philosophie. Zweites Buch. Phänomenologische Untersuchungen zur Konstitution [1912]. Ed. Marly Biemel. Husserliana 4. Den Haag: Martinus Nijhoff, 1952; Ideas Pertaining to a Pure Phenomenology and to a Phenomenological Philosophy. Second Book. Studies in the Phenomenology of Constitution. Trans. Richard Rojcewicz and André Schuwer. Dordrecht: Kluwer Academic Publishers, 1989, especially §§18a–b (60–70), §§36–42 (152–69), §§59–60a (266–77), et passim.
  • Husserl, Edmund. Ideen zu einer reinen Phänomenologie und phänomenologischen Philosophie. Drittes Buch. Die Phänomenologie und die Fundamente der Wissenschaften [1912]. Ed. Marly Biemel. Husserliana 5. Den Haag: Martinus Nijhoff, 1952; Ideas Pertaining to a Pure Phenomenology and to a Phenomenological Philosophy. Third Book. Phenomenology and the Foundations of the Sciences. Trans. Ted E. Klein and William E. Pohl. The Hague: Martinus Nijhoff, 1980, §2 (4–9); Supplement I, §4 (103–12).
  • Husserl, Edmund. Analysen zur passiven Synthesis. Aus Vorlesungs- und Forschungsmanuskripten 1918–1926. Ed. Margot Fleischer. Husserliana 11. Den Haag: Martinus Nijhoff, 1966; Analyses Concerning Passive and Active Synthesis: Lectures on Transcendental Logic. Trans. Anthony J. Steinbock. Dordrecht: Kluwer Academic Publishers, 2001, §3 (47–53); “Perception and its Process of Self-Giving,” §2 (581–88); Appendix 25 (534–36).
  • Husserl, Edmund. Phänomenologische Psychologie. Vorlesungen Sommersemester 1925. Ed. Walter Biemel. Husserliana 9. Den Haag: Martinus Nijhoff, 1962, Beilage VIII (“Die somatologische Struktur der objektiven Welt,” 390–95, not translated);Phenomenological Psychology: Lectures, Summer Semester, 1925. Trans. John Scanlon. The Hague: Martinus Nijhoff, 1977, §15 (79–83), §21 (99–101), §39 (150–53).
  • Husserl, Edmund. Cartesianische Meditationen [1931] und Pariser Vorträge. Ed. Stephan Strasser. Husserliana 1. Den Haag: Martinus Nijhoff, 1950; Cartesian Meditations: An Introduction to Phenomenology. Trans. Dorion Cairns. The Hague: Martinus Nijhoff, 1960, Fifth Meditation, especially §44 (92–99), §§51–56 (112–31).
  • Husserl, Edmund. Die Krisis der europäischen Wissenschaften und die transzendentale Phänomenologie. Eine Einleitung in die phänomenologische Philosophie [1936]. Ed. Walter Biemel. Husserliana 6. Den Haag: Martinus Nijhoff, 1954; The Crisis of European Sciences and Transcendental Phenomenology: An Introduction to Phenomenological Philosophy. Trans. David Carr. Evanston, IL: Northwestern University Press, 1970, §28 (103–11), §47 (161–64), §62 (215–19).
  • Husserl, Edmund. Zur Phänomenologie der Intersubjektivität. Texte aus dem Nachlass. Erster Teil: 1905–1920; Zweiter Teil: 1921–1928; Dritter Teil: 1929–1935. Ed. Iso Kern. Husserliana 13, 14, 15. Den Haag: Martinus Nijhoff, 1973, passim.
  • Husserl, Edmund. Transzendentaler Idealismus. Texte aus dem Nachlass (1908–1921). Ed. Robin D. Rollinger with Rochus Sowa. Husserliana 36. Dordrecht: Kluwer Academic Publishers, 2003, Text Nr. 7 (132–45), Text Nr. 9 (151–66).
  • Husserl, Edmund. Die Lebenswelt. Auslegungen der vorgegebenen Welt und ihrer Konstitution. Texte aus dem Nachlass (1916–1937). Ed. Rochus Sowa. Husserliana 39. Dordrecht: Springer, 2008, especially Part IX (603–672).

b. Secondary Sources

  • Behnke, Elizabeth A. “Edmund Husserl’s Contribution to Phenomenology of the Body in Ideas II” [1989]. Rpt. in Issues in Husserl’s “Ideas II.” Ed. Thomas Nenon and Lester Embree. Dordrecht: Kluwer Academic Publishers, 1996, 135–60; rev. in Phenomenology: Critical Concepts in Philosophy. Ed. Dermot Moran and Lester E. Embree with Tanja Staehler and Elizabeth A. Behnke. Volume 2. Phenomenology: Themes and Issues. London: Routledge, 2004, 235–64 [includes further references to work in this area].
  • Bernet, Rudolf, Iso Kern, and Eduard Marbach. Edmund Husserl. Darstellung seines Denkens [1989]. 2nd rev. ed. Hamburg: Felix Meiner, 1996; An Introduction to Husserlian Phenomenology. Evanston, IL: Northwestern University Press, 1993, Chapter 4, §3 (“The Kinaesthetic Motivation in the Constitution of Thing and Space,” 130–40, 259–60); Chapter 5, §2 (“Our Experience of the Other,” 154–65, 261–62).
  • Claesges, Ulrich. Edmund Husserls Theorie der Raumkonstitution. Den Haag: Martinus Nijhoff, 1964, Parts II and III (55–144) [includes significant citations on lived body and kinaesthetic consciousness from Husserl’s D manuscripts].
  • Depraz, Natalie. Lucidité du corps. De l’empirisme transcendantal en phénoménologie. Dordrecht: Kluwer Academic Publishers, 2001.
  • Dodd, James. Idealism and Corporeity: An Essay on the Problem of the Body in Husserl’s Phenomenology. Dordrecht: Kluwer Academic Publishers, 1997.
  • Mohanty, J. N. “Intentionality and the Mind/Body Problem.” In Organism, Medicine, and Metaphysics: Essays in Honor of Hans Jonas. Ed. Stuart F. Spicker. Dordrecht: D. Reidel, 1978, 283–300; rpt. in his The Possibility of Transcendental Philosophy. Dordrecht: Martinus Nijhoff, 1985, 121–38; rpt. in Phenomenology: Critical Concepts in Philosophy. Ed. Dermot Moran and Lester E. Embree with Tanja Staehler and Elizabeth A. Behnke. Volume 2. Phenomenology: Themes and Issues. London: Routledge, 2004, 316–32.
  • Sawicki, Marianne. Body, Text, and Science: The Literacy of Investigative Practices and the Phenomenology of Edith Stein. Dordrecht: Kluwer Academic Publishers, 1997, Chapter 2, D (“Nature and Intellect in Ideen II,” 73–89); Chapter 4, B, 1 (“Stein’s work for Husserl,” 153–65).
  • Seebohm, Thomas M. Hermeneutics: Method and Methodology. Dordrecht: Kluwer Academic Publishers, 2004, §12 (“The givenness of the other living body and animal understanding,” 98–105).
  • Spiegelberg, Herbert. “On the Motility of the Ego.” In Conditio Humana: Erwin W. Straus on his 75th birthday. Ed. Walter von Baeyer and Richard M. Griffith. Berlin: Springer-Verlag, 1966, 289–306; rpt., with Postscript 1978, in Spiegelberg, Steppingstones toward an Ethics for Fellow Existers: Essays 1944–1983. Dordrecht: Martinus Nijhoff, 1986, 65–86; rpt. in Phenomenology: Critical Concepts in Philosophy. Ed. Dermot Moran and Lester E. Embree with Tanja Staehler and Elizabeth A. Behnke. Volume 2. Phenomenology: Themes and Issues. London: Routledge, 2004, 217–34.
  • Ströker, Elisabeth. Philosophische Untersuchungen zum Raum. Frankfurt am Main: Vittorio Klostermann, 1965; Investigations in Philosophy of Space. Trans. Algis Mickunas. Athens, OH: Ohio University Press, 1987, Part One (“Lived Space,” 13–172).
  • Zahavi, Dan. “Husserl’s Phenomenology of the Body.” Études Phénoménologiques No. 19 (1994), 63–84.
  • Zaner, Richard M. The Problem of Embodiment: Some Contributions to a Phenomenology of the Body. The Hague: Martinus Nijhoff, 1964, 249–61, 287–89.

 

Author Information

Elizabeth A. Behnke
Email: sppb@openaccess.org
U. S. A.

American Transcendentalism

American transcendentalism is essentially a kind of practice by which the world of facts and the categories of common sense are temporarily exchanged for the world of ideas and the categories of imagination. The point of this exchange is to make life better by lifting us above the conflicts and struggles that weigh on our souls. As these chains fall away, our souls rise to heightened experiences of freedom and union with the good. Emerson and Thoreau are the two most significant nineteenth century proponents of American transcendentalism.

Looking at the world through common sense categories, such as time, space, and causation, yields hard and fast limits that can hurt us. Causation seems to make certain outcomes unavoidable whether we like them or not. Space separates us from the ones we love and the places we would rather be. Not to be outdone, time brings all good things to an end and converts the living into the dead. The categories of imagination free us from these detestable limits. We can imagine a world in which physical space is no more than an idea, enabling us to move from place to place at the speed of our thoughts. Emanation and fulguration make congenial substitutes for causation, because they generate only what is true, beautiful, and good. Not even time presents a problem for imagination, since we can readily view all things from the standpoint of eternity.

Most philosophers start with theories, searching for ways in which to practice what they preach only if they are serious about their philosophies. The transcendentalists reversed this procedure. They began with practices and then attempted to establish them on solid theoretical foundations. Yet these practices all involved spurning certain facts in favor of ideas, leading them invariably to theories that are inconsistent and vague. Their honesty would not allow them to spurn all facts, so they were ever at work reshaping intractable facts to fit their theories or stretching the fabric of their views to cover uncooperative facts. Unwitting victims of their own scruples, they found themselves hating facts that did not fit the mold and being frustrated with theories they knew failed to capture all the facts.

The final victim was transcendentalism itself. Critics, eager to wield the sword of criticism, overlooked the life-enhancing practices at the core of transcendentalism, concentrating their efforts on the many chinks and thin plates in its theoretical armor. Their blades penetrated easily, and they quickly pronounced their victim hopelessly baffling. Even friendly critics felt obliged to begin their articles with the proviso that transcendentalism is not easily articulated.

The transcendentalists were suspended between imagination and common sense. If they had been consistent empiricists or materialists, their theories might have been securely founded on facts. Had they been fully fledged idealists or rationalists, their theories might have been firmly fixed on logical relations. In reality, they were neither consistent nor fully fledged theorists. Emerson complained of a see-saw in his voice. Yet what is most valuable in the legacy of transcendentalism is not theoretical and is not in need of theoretical backing. It is the practices by which the transcendentalists managed, at least occasionally, to re-make the world in the image of what they loved.

Table of Contents

  1. Emerson and His Practices
  2. German Influence
  3. British Influence
  4. Idealism
  5. Morality
  6. Beauty
  7. Contemporary Relevance
  8. References and Further Reading

1. Emerson and His Practices

Although he denied he was a transcendentalist, Ralph Waldo Emerson (1803-1882) was rightly viewed by his peers and is rightly viewed by contemporary scholars as the primary philosophical exponent of American transcendentalism, followed by Henry David Thoreau (1817-1862). Emerson distinguished at least three practices by which facts may be exchanged for ideas. The first enacts a form of idealism. Instead of seeing the world as an independent power that may lay waste to our purposes and plans, we can view it as a display of images or pictures created by us, rendering it harmless and even benevolent. Secondly, we can focus on moral actions and rejoice in their goodness. The third practice distinguished by Emerson is perhaps the one for which transcendentalism is best known. It is that of contemplating beauty.

These practices come naturally to many of us. We many not connect them to Emerson, his contemporaries, or the period in American intellectual history─roughly between the publication of Emerson’s “Nature” in 1836 and Thoreau’s death in 1862─when transcendentalism flourished as a movement; but inasmuch as we seek to improve our lives by turning away from facts and embracing ideas, we are transcendentalists.

2. German Influence

Word of Kant’s transcendental idealism may have reached Emerson through Frederick Henry Hedge (1805-1890), a Unitarian minister who had studied in Germany and knew German philosophy in its native tongue. In 1836, Hedge, Emerson, and George Ripley (1802-1880) founded an informal group they called Hedge’s Club for the purpose of stimulating discussion of current topics in philosophy and theology. The group convened irregularly for about seven years and grew to include at least a dozen members. It became known as the Transcendental Club. These meetings provided ample opportunity for Hedge to share his knowledge of Kant’s transcendental philosophy with Emerson.

The heart of Kant’s Critique of Pure Reason is the distinction he draws between the transcendental and the transcendent. The transcendental refers to the necessary conditions of the possibility of experience. Input from perception is grasped through twelve categories, space, time, and what he calls the transcendental unity of apperception. These are structures of mind, so without them experience would be impossible. In contrast, the transcendent is that which lies beyond the scope of experience and is therefore inaccessible to conceptual knowledge. In his Critique of Pure Reason, Kant rejects the classic arguments for the existence of God on the grounds that knowledge of the transcendent is impossible. Ironically, the transcendental structure of mind is necessarily incapable of providing knowledge of the transcendent.

But Kant followed the Critique of Pure Reason with the Critique of Practical Reason, in which he adds an important qualification to his original view that God is unknowable. The transcendent, he argues, is the very foundation of morality. If not for God, the wicked might go unpunished and the righteous unrewarded. We may not be able to have conceptual knowledge of the transcendent, but we are morally obligated to act as the transcendent in the form of the free and rational moral will.

Kant’s view of our relation to the transcendent underwent a second transformation in his third major work, the Critique of Judgment. There he reconsiders some of the classic arguments for God’s existence and opts for the teleological. Organic beings are purposive, he tells us, though we cannot glean what their purposes are. Their purposiveness consists in the fact that they and their parts are mutually supporting. The parts are means to the wholes, and the wholes are means to the parts. Two important conclusions fall out of this analysis. The first is that everything can be seen as beautiful, because all objects have form, which hints at a kind of purposiveness based on the internal complexity of objects. This purposiveness of form creates a universal disinterested satisfaction that is experienced as beauty. The second conclusion is that conceptual understanding of beauty is impossible for human beings.

Although they are very different philosophies, the influence of Kant’s transcendental idealism on American transcendentalism is easy to see. The philosophies are markedly different because the Americans did not preserve Kant’s distinction between the transcendent and the transcendental. This distinction provides the framework for Kant’s entire system as presented in the Critique of Pure Reason, but the transcendentalists were not interested in systems or the frameworks of systems. What attracted them to Kant’s philosophy was the sentiment behind his words. One look at the prevailing sentiment of American transcendentalism and Kant’s influence is unmistakable. The Americans eagerly joined him in celebrating the rightness of moral action, the beauty of the world, and the majesty of God. Kant’s influence also shows itself in the view of the nature of beauty embraced by the transcendentalists, namely that it surpasses all understanding.

3. British Influence

The word “transcendental” brings to mind Kant and the German philosophers he influenced, but German thinkers were not the only ones to leave their mark on American transcendentalism. Emerson was a great admirer of William Wordsworth and Samuel Taylor Coleridge, both of whom he met when he traveled to Europe in 1832. Their romanticism was intoxicating to him, and he seems to have passed some of that intoxication to his friend Thoreau. The British romantics shared the same love of beauty, morality, and God that animated both Kant and the American transcendentalists, but the romantics had developed a unique perspective on our relation to those realities. This perspective provided one of the central features of American transcendentalism.

The British romantics saw tremendous beauty and goodness in the world. At the same time, they saw that all of that goodness and beauty is flawed. Human beings often embody great virtues, but this is not always so. Sometimes their behavior is monstrous in its selfishness and cruelty. The sky, the meadow, and the rose are breathtakingly beautiful, but as time passes their beauty fades. This double vision of the romantics, although it did not betray any facts, nevertheless placed them in the uncomfortable position of both hating and loving the world.

To get out of their predicament, the romantics made a bold move. They set their sights on the perfect, which for them could exist only beyond the awful limits of the world. Yet they could not forget that they were, as flesh and blood, inextricably tied to those very limits. Nor could they forget that it was those very limits that provided the precious glimpses of beauty and goodness, however degraded, that they cherished. This was a great tragedy, they decided, and it was made even greater by the fact that it was inevitable. If you live in this world and you have enough humanity to love the good and the beautiful, you will be constantly assailed by the pain of falling short of those ideals. Yet, they suggested, the more the tragedy of life appeared to us in all of its inevitability and pain, the more beautiful life would be in our eyes.

Had they embraced this perspective at face value, the transcendentalists might have been cheered. As a sequence of random events, some good and others bad, life is arguably meaningless and not worth living. But view it as a tragedy and life takes on a marvelous aesthetic unity. Anguish and tears become literary realities, beautiful in their significance and in the timeless moral lessons they convey. But the transcendentalists were too pragmatic to embrace such an intellectual view of life. The world was too much with them, and although they never tired of translating facts into ideas, they could not shake the sense that facts were somehow more real. Instead of cheering them up, their contact with romanticism ultimately saddened them. They fell in love with the perfect like good romantics, but they could find little beauty in the countless misfortunes that befell them. They felt betrayed by life. In Emerson, this feeling expressed itself in the form of sheer disbelief at the terrible things that happened to him. In Thoreau, it created a thin layer of bitterness and resentment that never dissipated.

The influence of the British romantics shaped American transcendentalism at least as much and probably more than that of Kant’s transcendental philosophy. Their influence contributed a longing for the perfect, one of the central features of transcendentalism in America. The other side of this contribution, equally central, was a treacherous undercurrent of disappointment and sadness.

4. Idealism

The idealism of the American transcendentalists, like their morality and their love of beauty, took the form of practices before it became, as an afterthought, a sort of theory. Emerson stood with his head between his legs and took note of the fact that this opened a very different reality. His long country rambles produced in him a profound feeling of the lawfulness and rationality of nature. The passions that stirred in his breast often burst forth in the form of an essay or a poem. Looking at the world from different angles, delighting in the patterns nature manifests, and writing poetry or prose are idealistic practices in the sense that they give consciousness a kind of priority. In creating new experiences or ideas, we seem to create new worlds, and mind takes on a status close to that of a divinity. The old familiar facts, when filtered through the categories of imagination, are given an almost miraculous appearance. We are free from their tedious and often sorrowful limits to roam at liberty in thought.

For the transcendentalists, this freedom was almost always short-lived. This is because they felt they should ground their idealistic practices in a consistent theory. The freedom and satisfaction their practices provided, they reasoned, would be more secure if given an adequate theoretical backing. Their reasoning may have been sound, but their attempt to ground their practices in an adequate theory had the opposite effect. They became tangled in theoretical problems that proved intractable, and this made engaging with ideas for the sake of the activity itself more difficult. An ulterior motive for this engagement was always pressing on them. The ideas could not be fully trusted unless somehow it could be shown that they captured the inner nature of things.

But the main idea implied by the transcendentalists’ practices, that all existence must conform to consciousness, was verified by their experience only occasionally. There were flashes of verification, as when Emerson dreamed he ate the world, but all too often it was consciousness that had to give way to the cold facts of existence. The loss of his beloved wife Ellen Tucker, his cherished son Waldo, and his dear friend Thoreau signaled to Emerson that something alien to mind was at work in the world.

What this alien something might be, the transcendentalists had no clear idea. Their idealism gave consciousness, rational principles, and human values the status of omnipotent governing powers. Evil was that for which there would be compensation, or it was an instrument necessary for the creation of a good far greater than any that would have been possible without its use. It was, in short, thoroughly intelligible, just, and even benevolent: a low mood or a contradictory thought passing through the Oversoul for the sake of its ultimate enrichment. Evil is good, or rather it has to be good if one takes the idealism of the transcendentalists at face value. Not even they were capable of doing this all the time, yet they had no means of understanding evil except through the lens of their idealism, nor would they have been comfortable viewing it through a different lens as a brute fact or an irrational power.

The only option for the transcendentalists was to live with one more evil, namely the fact that evil positively confounded their attempts to explain it. This made the evils they suffered that much worse, adding avoidable surprise and puzzlement to unavoidable pain. Had they been content to practice their idealism without attempting to expound it, they might have saved themselves considerable grief. Their idealistic practices alone do not give mind the kind of priority proper to a power, but they do give it a sort of valuational priority. In lavishly applying the categories of imagination to the world and marveling at the results, we affirm the priority of consciousness and its products in the sense of loving them the most. As long as this does not lead us to the tenuous theory that all existence must submit to mind, we are free to understand evil through the categories of common sense, as a rogue power that goes against our purposes and often overwhelms them. This does not translate evil into good, but it at least removes the sting of surprise and confusion when evil strikes.

5. Morality

Emerson was often accused of being a reluctant reformer, and behind those accusations there was a kernel of truth. His contemporaries in the United States and Europe were hungry for moral progress, and they wasted no time putting shoulder to wheel for their favorite causes. Emerson was different in that his temperament inclined him to be first a scholar. The purpose of scholars, he said, is to nurture the good in others; but what makes a scholar is the ability to see the larger picture of things, and for this a certain distance from the heat of action is required. Emerson was by nature a visionary and a poet, not a man of action. No matter how much he sympathized with the ideals to which reformers aspired, he could not engage in actual reform without some initial discomfort. It was not what he was born to do.

Yet Emerson’s accusers failed to see the full reality of the accused. He may have hesitated before throwing his weight behind a cause for fear of losing the reflective distance he naturally valued; but if the cause was just he almost always ended up fighting for it. When the government of the United States announced plans to force Cherokees from their native lands, Emerson wrote President Van Buren in protest. The reflective but deeply moral Emerson opposed slavery openly both in writing and in public lectures. Inspired by his friend Margaret Fuller, Emerson supported the burgeoning women’s rights movement, becoming Vice President of the New England Women’s Suffrage Association in 1869. It is ironic that for all the accusations of reluctance made against him by his peers, today’s scholars tend to view Emerson as one of the nineteenth century’s most important reformers.

Emerson’s reflective temperament made him skeptical of reform movements, which seemed to him to be driven by narrow and hasty views. At the same time, he was convinced that it is not the group but the individual that is the ultimate agent of moral progress. He admired and celebrated the moral actions of principled individuals and was often spurred by them to act in spite of his temperament. This is transcendentalism at its moral best. Moral action is not governed or called forth by a theory; it is a spontaneous response to the feeling of one’s individual potentiality combined with one’s natural love of the good. Assuming the good is not conceived narrowly, as it almost never was by Emerson, this kind of morality is less problematic than many others. Emerson was lead by it to some of his most important contributions to the reform movements of his day.

Again, the problem came when the transcendentalists attempted to ground their spontaneous practices in a theory that lodged their values in the nature of things. Emerson said many times that nature itself is the supreme model and ultimate ground of morality, since it is a manifestation of timeless moral laws. All evil is a form of instruction, pointing the way to better lives and a better world. No doubt Emerson embraced this view wholeheartedly, but its implications for specific evils that visited him created turmoil in his soul. He could not hide from himself the fact that some evils seemed to lack pedagogical value. It was hardly possible for him to view the death of his five year old son as instruction for the improvement of life. Nor was he easily able to see the fire that consumed his home as a lesson in how to make the world a better place. Their view of the world as inherently moral, like the idealism they attempted to develop, created for the transcendentalists a painful rift between the theory they thought they should live by and the actual events, actions, and emotions that filled their lives.

This was less so, at least as regards morality, for Thoreau. Emerson’s greatest student was less committed than his mentor to the view that nature is intrinsically moral. Thoreau, moreover, was more practical than Emerson in moral matters. He perceived with greater clarity the fact that institutions exist because of the free actions of individuals. If individual slaveholders decided to stop holding slaves, slavery would gradually be destroyed. Nor did Thoreau confuse this practical insight with Emerson’s more theoretical position that individuals are the ultimate moral agents because they have access, through reason and contemplation, to the moral laws inherent in nature.

While Emerson’s love of reflection often slowed his response time, Thoreau tended to react to moral problems quickly and decisively. He immediately urged those around him to examine their consciences and change their behaviors accordingly. Thoreau’s letter in support of John Brown, a controversial abolitionist convicted of treason, captures this pragmatic approach to morality. The letter is full of motivational rhetoric. Advancing the cause of abolition, Thoreau seems to have reasoned, was simply a matter of convincing enough individuals to oppose Brown’s conviction.

In their morality, the transcendentalists did not abandon their practice of exchanging facts and the categories of common sense for ideas and the categories of imagination. Nor did they lose all awareness of common sense or all contact with facts. This combination of circumstances might have propelled them to dizzying heights of theorizing in an attempt to reconcile the two poles of their vision, as it often did in relation to their idealism. But on the whole, not counting the theoretical flights in which Emerson argued for a moral structure of the world, the transcendentalists were more down to earth about morality than they were about metaphysics. Not only did they tend to avoid too much theorizing, they took their practice a step further. Having seen and appreciated a vision of how things ought to be, they went to work to improve the relevant facts in light of their ideas. In a word, the transcendentalists were meliorists.

6. Beauty

If there is a single practice with which American transcendentalism can be identified, it is contemplation of beauty. Emerson responded to Plato’s theory that beauty, truth, and goodness are one by saying that even so beauty is the best of the three. Children seem to see it radiating from the most ordinary objects to their exquisite delight. Adults sometimes find themselves feeling like children again in its presence. The transcendentalists thought of beauty as eternal, because a mere glimpse of it was enough to make them drop everything and simply take in what they heard or saw with neither motive nor intention. This activity satisfied them so deeply that while they were thus engaged it was as if time stood still.

Perhaps the most famous experience of beauty described by Emerson is the one in which he became a “transparent eyeball.”  Alone in the woods, “head bathed by the blithe air, and uplifted into infinite space”, he found that he had vanished from his own experience. It was as if he consisted of nothing but impersonal vision, the object of which was “unconstrained and immortal beauty.”  Thoreau was equally susceptible to such enthralling experiences of beauty.  Snow drifts, the shapes and colors of leaves, and the way light falls revealed to him so much beauty that he thought of them as imprints of the divine.

Of the three practices distinguished by Emerson, contemplation of beauty is perhaps the one that came most naturally to the transcendentalists. They were ready, like Socrates, for beauty to give them wings on which they could ascend to heaven and see reality from the standpoint of the gods. Facts, with all of their hard edges, quickly melted into images or ideas under beauty’s divine influence. The contemplative absorptions this created were immensely satisfying, but they were also deceptive. Had the facts actually melted or did it merely seem as if they had?

Motivated by the tremendous appeal of the former hypothesis, Emerson attempted to extend the influence of beauty far beyond momentary absorptions. He reached the conclusion that everything is beautiful by arguing that beauty derives from purposiveness. Emerson thought of nature as a single, all-embracing system governed by immutable moral laws. In such a system, everything has a purpose in relation to the whole and is rendered beautiful by that relation. Marcus Aurelius proposed something similar in his Meditations. He said that even the foam at the mouths of ravening beasts takes on a certain beauty once its purpose is known.

The argument that purposiveness confers beauty is plausible when purposes are present and when they are at least arguably benevolent. A towering concrete dam may spoil the beauty of a river, until we understand that it was built to provide water to surrounding communities. Yet even though we see a benevolent purpose behind it, the ugliness of the dam may not be diminished. The argument is much weaker when applied to objects or events the purposes of which are evil, and it fails altogether in the case of realities that are non-purposive. Weapons of mass destruction are not rendered beautiful in light of their purpose. Earthquakes that strike major cities may occur for the sake of maintaining the structural equilibrium of the planet, but this adds little beauty to them. Moreover, nothing prevents us from understanding natural disasters as mechanical rather than purposive processes.

The transcendentalists were eminently capable of stretching their imaginations. When they exercised this capacity to the fullest, they saw around them an abstract world of interrelated ideas. The beauty of that world was so captivating that it tended to blind them to all external realities. Emerson went as far as to say there is a certain beauty in a corpse. We can hardly fault the transcendentalists for wishing to live always in the presence of the beautiful; yet the feats of imagination by which they conjured an ideal world could not be sustained forever. The transcendentalists were drawn to the beauty of ideas, but they knew they had to make their way in a world that also included stubborn facts. Once more we can see that it was not their practices, but their efforts to ground them in theories that created problems for the transcendentalists. Emerson’s attempt to demonstrate that all things are beautiful made ugliness a little uglier and a little sadder. Even worse, it made it confounding. When a homely or a grotesque fact intruded into the beautiful world of his ideas, he loathed it all the more because he could not make sense of the intrusion.

7. Contemporary Relevance

Theories that attempt to establish the inner nature of the world will always be interesting and instructive. They capture the imagination and, if we care to see, show us the limits of what we know. The transcendentalists never produced a complete theory. Producing one that placed their values at the core of reality would have required them to set aside their interest in action and their instinctive loyalty to facts. Instead, they theorized spontaneously in an attempt to shore up the practices that brought them closer to the good. The bits and pieces of theory they devised are of continuing relevance not because they capture corresponding portions of ultimate reality but because they show us the extent to which human beings are capable of loving all that is good in the world.

Although the transcendentalists did not succeed in grounding their practices in a fully developed theory of absolute reality, they did not need to succeed in this. Appreciating the marvelous creativity of consciousness, affirming moral action, and contemplating beauty are self-standing practices. One might explain them and the values they uphold equally well through a variety of theories, and many philosophers have developed complete systems that assign to beauty, morality, or consciousness the status of the real. The transcendentalists sought to secure their practices to theoretical foundations, but their practices are independent of all attempts to develop a satisfactory account of them. They do not need the support of theories as an airplane does not need wires to hold it in the air. Not only that, but the effort to ground practices in theories is fraught with frustration, since there are many plausible theories, and those that one dislikes must constantly be defended against. The life-long devotion of the transcendentalists to their practices, even as their theories changed and vexed them anew, suggests they sensed their practices would and perhaps should stand on their own.

It is hard to overstate the value of the practices that form the heart of transcendentalism. We tend to lose sight of the sheer improbability of awareness and the wonder of its products. The rightness of moral action does not always impress us, and we often fail to see beauty in ordinary objects and events. The focus on improvement instilled in many of us from childhood bogs us down in facts that demand accounting and predicting. We grow increasingly blind to the realm of imagination and possibilities. Consider what the world would be like if we practiced transcendentalism all the time. We would view consciousness as a wonder that is without equal in the universe. A single moral action either performed or witnessed would be cause for feeling good. Perhaps best of all, we would not miss even the smallest particle of beauty. A pattern, a color, a sparkle seen out of the corner of the eye would lift us above all dreary facts to the heights of contemplative joy. None of this would establish the good or the beautiful or even the true in the inner sanctum of reality, but it would enrich the reality that is our experience, and that is what the transcendentalists sought above all.

8. References and Further Reading

  • Emerson, R. W. Emerson’s Complete Works. 12 Vols. Boston: Houghton, Mifflin, and Company, 1883-93.
  • Emerson, R. W. Emerson in His Journals. Joel Porte, Ed. Cambridge, MA: Harvard University Press, 1982.
  • Gougeon, L. Virtue’s Hero: Emerson, Antislavery, and Reform. Athens, GA: University of Georgia Press, 1990.
  • Gray, H. D. Emerson: A Statement of New England Transcendentalism as Expressed in the Philosophy of its Chief Exponent. New York: Frederick Ungar, 1970.
  • McAleer, J. J. Ralph Waldo Emerson: Days of Encounter. Boston: Little, Brown, and Company, 1984.
  • Packer, B. L. The Transcendentalists. Athens, GA: University of Georgia Press, 2007.
  • Richardson, R. D. Henry Thoreau: A Life of the Mind. Berkeley: University of California Press, 1986.
  • Thoreau, H. D. The Best of Thoreau’s Journals. Carl Bode, Ed. Carbondale, IL: Southern Illinois University Press, 1971.
  • Thoreau, H. D. Walden. Michael Meyer, Ed. London: Macmillan, 1995.
  • Thoreau, H. D. A Week on the Concord and Merrimack Rivers. Carl Hovde, Ed. Princeton, NJ: Princeton University Press, 1980.
  • Myerson, Joel, Ed. Transcendentalism: A Reader. New York: Oxford University Press, 2000.

 

Author Information

Michael Brodrick
Email: michael.brodrick@gmail.com
Indiana University Purdue University Indianapolis
U. S. A.

Edmund Husserl (1859—1938)

HusserlAlthough not the first to coin the term, it is uncontroversial to suggest that the German philosopher, Edmund Husserl (1859-1938), is the “father” of the philosophical movement known as phenomenology.  Phenomenology can be roughly described as the sustained attempt to describe experiences (and the “things themselves”) without metaphysical and theoretical speculations. Husserl suggested that only by suspending or bracketing away the “natural attitude” could philosophy becomes its own distinctive and rigorous science, and he insisted that phenomenology is a science of consciousness rather than of empirical things. Indeed, in Husserl’s hands phenomenology began as a critique of both psychologism and naturalism.  Naturalism is the thesis that everything belongs to the world of nature and can be studied by the methods appropriate to studying that world (that is, the methods of the hard sciences). Husserl argued that the study of consciousness must actually be very different from the study of nature. For him, phenomenology does not proceed from the collection of large amounts of data and to a general theory beyond the data itself, as in the scientific method of induction. Rather, it aims to look at particular examples without theoretical presuppositions (such as the phenomena of intentionality, of love, of two hands touching each other, and so forth), before then discerning what is essential and necessary to these experiences. Although all of the key, subsequent phenomenologists (Heidegger, Sartre, Merleau-Ponty, Gadamer, Levinas, Derrida) have contested aspects of Husserl’s characterization of phenomenology, they have nonetheless been heavily indebted to him. As such, he is arguably one of the most important and influential philosophers of the twentieth century. The key features of his work, and his understanding of the phenomenological method, are considered in what follows.

Table of Contents

  1. Biography
  2. How to Interpret Husserl’s Texts
  3. On the Concept of Number (Übert den Begriff der Zahl, 1887)
  4. Logical Investigations (Logische Untersuchungen, 1900-01)
  5. Ideas I (Ideen I, 1913)
  6. Ideas II (Ideen II)
  7. References and Further Reading

1. Biography

Edmund Husserl was born April 8, 1859, into a Jewish family in the town of Prossnitz in Moravia, then a part of the Austrian Empire. Although there was a Jewish technical school in the town, Edmund’s father, a clothing merchant, had the means and the inclination to send the boy away to Vienna at the age of 10 to begin his German classical education in the Realgymnasium of the capital. A year later, in 1870, Edmund transferred to the Staatsgymnasium in Olmütz, closer to home. He was remembered there as a mediocre student who nevertheless loved mathematics and science, “of blond and pale complexion, but of good appetite.” He graduated in 1876 and went to Leipzig for university studies.

At Leipzig Husserl studied mathematics, physics, and philosophy, and he was particularly intrigued with astronomy and optics. After two years he went to Berlin in 1878 for further studies in mathematics. He completed that work in Vienna, 1881-83, and received the doctorate with a dissertation on the theory of the calculus of variations. He was 24. Husserl briefly held an academic post in Berlin, then returned again to Vienna in 1884 and was able to attend Franz Brentano’s lectures in philosophy.

In 1886 he went to Halle, where he studied psychology and wrote his Habilitationsschrift on the concept of number. He also was baptized. The next year he became Privatdozent at Halle and married a woman from the Prossnitz Jewish community, Malvine Charlotte Steinschneider, who was baptized before the wedding. The couple had three children. They remained at Halle until 1901, and Husserl wrote his important early books there. The Habilitationsschrift was reworked into the first part of Philosophie der Arithmetik, published in 1891. The two volumes of Logische Untersuchungen came out in 1900 and 1901.

In 1901 Husserl joined the faculty at Göttingen, where he taught for 16 years and where he worked out the definitive formulations of his phenomenology that are presented in Ideen zu einer reinen Ph‰nomenologie und phänomenologischen Philosophie (Ideas Pertaining to a Pure Phenomenology and to a Phenomenological Philosophy). The first volume of Ideen appeared in the first volume of Husserl’s Jahrbuch für Philosophie und phänomenologische Forschung in 1913. Then the world war disrupted the circle of Husserl’s younger colleagues, and Wolfgang Husserl, his son, died at Verdun. Husserl observed a year of mourning and kept silence professionally during that time.

However Husserl accepted appointment in 1916 to a professorship at Freiburg im Breisgau, a position from which he would retire in 1928. At Freiburg Husserl continued to work on manuscripts that would be published after his death as volumes two and three of the Ideen, as well as on many other projects. His retirement from teaching in 1928 did not slow the pace of his phenomenological research. But his last years were saddened by the escalation of National Socialism’s racist policies against Jews. He died of pleurisy in 1938, on Good Friday, reportedly as a Christian.

Most commentators, therefore, recognize three periods in Husserl’s career: the work at Halle, Göttingen, and Freiburg, respectively. Some argue that one or another of these periods ought to be taken as definitive and used as the interpretive key to unlock the others. But such an approach highlights disjunctions in Husserl’s thought while neglecting the significant continuities. Important strands of Husserl’s philosophy have their beginning long before his academic career commenced.

The community into which Husserl was born, Prossnitz, was a center of talmudic learning whose yeshiva had produced or welcomed a number of famous rabbis during the two centuries before Husserl’s birth. This scholarly activity was supported by the industries of textile and clothing manufacture, through which Prossnitz’s Jews had enhanced the prosperity of the region. Jews and Germans were minorities in the town and appear to have comprised its middle class. Their interests were naturally allied against those of the Slavic majority. (For example, the census of 1900 counted 1,680 Jews among the town’s 24,000 inhabitants, according to The Jewish Encyclopedia.) In the ethnically diverse town, several dialects were spoken, and the language of the Husserl home probably was Yiddish.

The Jewish community of Prossnitz had established a technical school in 1843, and it became a public school for all the town’s children in 1869–one year before young Edmund Husserl was sent off to Vienna’s Realgymnasium. 1868 was also a year when civic authorities called for reform of Jewish education at all levels throughout Moravia. These developments reflect a movement toward modernization and integration after centuries of enforced segregation and legal restriction of Jewish life.

Prossnitz was the second-largest Jewish community in Moravia, with 328 families. Exactly 328 families; it could have no more, because of the quota established by the Bohemian Familianten Gesetz in 1787. The Jewish population was controlled through marriage licenses. Civil law set specific economic, age, and educational requirements; but in addition, the license could be granted only after a death freed up one of the allotted 328 slots. In effect, only first sons could hope to marry. Others had to emigrate if they wanted to have families of their own. This population-control policy was enforced until 1849, ten years before Edmund Husserl’s birth. The requirement that Jews obtain special marriage licenses remained in effect until late in 1859, some months after Edmund’s birth.

But Edmund Husserl’s childhood was spent during an era of liberalization for Prossnitz’s Jews. He received an elite secular education and probably made his father quite proud. At that period, gymnasia provided separate religious instruction for Christian boys and Jewish boys. Edmund’s Jewish education would have continued in that context and in the language of secular culture, High German. He could hear and read the Bible in that modern language as well, for in the nineteenth century a wave of new translations into the language of German culture was spawned by Moses Mendelssohn’s groundbreaking work. (Mendelssohn’s 1783 translation into High German was printed in Hebrew characters, phonetically, to make it easy to read.) Some of these editions were lavishly illustrated for display in bourgeois homes like Edmund’s, and most took into account the findings of recent historical and philological science. But during Edmund’s childhood, translating the Hebrew Bible was still a controversial issue. Some educational leaders in the Jewish community warned that it would undermine Hebrew learning among the young. Hebrew learning was evidently not prized by a father who would send his son to the capital to study Greek and Latin at the age when boys traditionally were sent down the street to learn Hebrew and Torah. To complicate the picture, in 1870 when Edmund was eleven, a new rabbi came to serve the Prossnitz community.

One may surmise, then, that Edmund Husserl came by his knowledge of the Bible through his classical secular education, not his religious tradition. It was of a piece with the German cultural heritage for him. It was a source of literary allusions, and in later life he could compare himself to Moses and to Sisyphus with equal ease.

Literary allusions, along with fragments of correspondence, are all that remain to us for the reconstruction of what Husserl may have felt about himself and his work. There is no autobiography per se. But there are retrospective texts. One of the most illuminating is the brief introduction that Husserl prepared for the 1931 publication in English of the first book of Ideen, originally brought out in 1913.

Now in his seventies, Husserl complains that most readers have misunderstood his life’s work. When he undertakes to reformulate what phenomenology is and what he has accomplished, however, he writes from a vantage point that he did not have some two decades earlier. Husserl becomes, in effect, a critic and interpreter of his own work, which he describes with a sustained metaphor. He portrays himself as an explorer who has opened the way into new territory so that others may conquer, map, and farm it. Of himself, Husserl writes:

“(H)e who for decades did not speculate about a new Atlantis but instead actually journeyed in the trackless wilderness of a new continent and undertook the virgin cultivation of some of its areas will not allow himself to be deterred in any way by the rejection of geographers who judge his reports according to their habitual ways of experiencing and thinking and thereby excuse themselves from the pain of undertaking travels in the new land” (422)

Here is another example of this characterization:

I can see spread out before me the endlessly open plains of true philosophy, the ‘promised land’, though its thorough cultivation will come after me” (429)

By means of this spatial, geographical metaphor of crossing over into the “new land,” Husserl conveys something of the adventure and pioneer courage that should accompany phenomenological work. This science is related to “a new field of experience, exclusively its own, the field of ‘transcendental subjectivity’,” and it offers “a method of access to the transcendental-phenomenological sphere” (408). Husserl is the “first explorer” (419) of this marvelous place.

2. How to Interpret Husserl’s Texts

Husserl had already employed the spatial metaphor in the 1913 text, although without explicit reference to himself as explorer. In chapter I-1 of Ideen I he had distinguished states of affairs (Sachverhaltnis) from essences (Wesen) by assigning them to two “spheres”: the factual or material, and the formal or eidetic, respectively. These spheres are connected only by the mind’s ability to pass between them as easily as moving around within either of them; they do not connect on their own, as it were. That is, no causality obtains between them. “Movement between” and “movement within” are of course further elaborations upon the spatial metaphor, and serve to designate the ability of consciousness to flow along, concentrate itself, linger, combine, focus, or disperse as it will. Such acts of consciousness belong to these spheres. They are worldly. They are “psychological.”

Husserl’s task is to get from those spheres into another “field” that is quite unlike them. It will be the sphere of absolute consciousness, consciousness when it isn’t going anywhere. As the title of chapter II-3 puts it, this will be “The Region of Pure Consciousness.” You can’t “go there” with consciousness; instead you have to let the worldly go away and then inhabit what’s left. This is the import of the infamous fantasy that opens paragraph 33: “(W)as kann als Sein noch setzbar sein, wenn das Weltall, das All der Realit‰t eingeklammert bleibt?” (In Kersten’s paraphrase: “What can remain, if the whole world, including ourselves with all our cogitare, is excluded?” [63])

Now, it’s quite curious that Husserl should choose the spatial metaphor to introduce and induce his phenomenological reduction. This metaphor invites confusion for anyone familiar with Descartes– who after all named spatial extension as the substantial attribute of material being. None of Husserl’s “spheres” is literally extended, in the Cartesian sense; yet all are coextensive (coincident) with material being–inasmuch as there’s literally nowhere else besides the material universe where they could be. Why then should Husserl choose such an incongruous and counterproductive metaphor? A different metaphor (such as “fabric” or “organism,” for example) could have conveyed the notions of coherence, separation, and access that Husserl intended. What is distinctive about the spatial metaphor, however, is that it connotes exploration and conquest. If transcendental consciousness is a promised land, then you need a Moses to lead you toward it. You need Husserl. When Husserl remarks, in the 1931 Introduction, that he can look down across that land that he has discovered, but that others will enter, this is a literary allusion to the figure of Moses, who led his people to Canaan, “the promised land,” but did not lead them into it (Deuteronomy 34).

If these allusions from 1931 can be taken as a thumbnail self- portrait, still one must remember that it was sketched during Husserl’s retirement. But Husserl’s thought grew and changed throughout his long career. In his maturity, the philosopher joined his readers in producing commentary upon his youthful work. The three phases of Husserl’s career–Halle, Göttingen, and Freiburg–invite facile divisions, and decisive turning points have been suggested within each of those periods. (The survival of nearly 45,000 pages of stenographic notes from Husserl’s teaching and his private researches has fueled disputes about when he might have had the first glimmer of a thought that led to a lecture comment that led to a paragraph that found its way into a book published long after the man’s papers and ashes were shelved in Louvain!)

Husserl himself insisted that the threads of continuity throughout the evolution of his thought were more significant than any false starts that later had to be repudiated. It seems well to grant him this point. Yet on two issues one must take seriously the critical discussion arising from disjunctions in Husserl’s thought: (a) the question whether to characterize Husserl as realist or idealist, and (b) the question of which stage of Husserl’s evolution–if any–should be taken as the definitive version through which all other versions are to be read. Husserl himself, writing as his own critic later in life, took a position on each of those issues. On (a), he insisted that he was and always had meant to be a transcendental idealist. On (b), he claimed competence to correct the insights of 1887, 1900, and 1913 with the insights of the 1920’s and 1930’s. Thus the mature Husserl would wish to erase the impression that his early work resolved the realism-idealism conundrum in favor of realism, and that it did so in fidelity to an insight already expressed in his earliest work on number.

Various punctuations of Husserl’s career by time, place, and predominant question have been suggested by commentators (for example, Kockelmans 1967: 17-23; Ricoeur 1967: 3-12; Biemel 1970; and Bell 1990). Husserl’s phenomenology developed gradually, but there were several relatively sudden turns and several stalls. Two examples suffice to illustrate. While at Halle shortly after the publication of Philosophie der Arithmetik, Husserl distanced himself from his recent efforts to establish mathematical and logical principles upon the psychological operations of the mind—a project that he later termed “psychologism.” Many commentators have characterized this as an abrupt turn made in response to Frege’s effective criticism of Philosophie der Arithmetik. However Mohanty (1982: 13), who examines the Frege-Husserl correspondence along with other documentary evidence, concludes to the contrary, that:

the seeds of development of Husserl’s philosophy from the Philosophie der Arithmetik to the Prolegomena [i.e., the first volume of the Logische Untersuchungen, 1900] were immanent to his own thinking, so that the hypothesis of a traumatic effect of Frege’s 1894 review of his book and a consequent reversal of his mode of thinking is not only uncalled for but also unsubstantiated by the available evidence.

Mohanty, then, provides ample warrant for a reading of Husserl that pursues threads of continuity between his early mathematical work and the breakthrough to phenomenology while at Halle.

In a second example of a supposed disjuncture in Husserl’s development, there has been discussion of whether he changed his stance from realism to idealism between Göttingen and Freiburg. On the one hand, Eugen Fink (1933) and many others see a consistent evolution of transcendental idealism from the work published in Ideen I onward. They tend either to dismiss the earlier works as if they were merely youthful failures, or forcibly to harmonize the realist passages with Husserl’s later positions. Husserl himself endorsed such a reading. On the other hand, those who studied with Husserl at Göttingen insist that his work at that time had validity and integrity in its own right. His former student Edith Stein (1932: 44-45) remarks that Husserl’s disciples were surprised at the idealistic passages in Ideen, and she calls Fink a latecomer to Husserl’s phenomenology. One of Stein’s contemporaries among Husserl students, Roman Ingarden (1962: 159), says that:

the idealistic tendencies apparent in volume I of the Ideen had been opposed by his disciples when the work was being studied during the seminars at Göttingen and . . . his disciples pointed out many passages in the Ideen which seemed to contain direct arguments against his idealism.

Subsequently Ingarden presented arguments, based on both the text of Logische Untersuchungen and his conversations with Husserl, in support of the view that Husserl originally espoused a realist standpoint but later abandoned it (Ingarden 1975: 4-8). Further discussion of the issue is to be found in Kockelmans (1967: 418-449) and in Van de Pitte (1981: 36-42)–who suggests that the discrepancy will vanish if one reads Husserl’s idealism as an epistemological or methodological approach to a metaphysically real world.

For his own part, Husserl (1931: 418-9) claimed that his transcendental idealism had advanced altogether beyond ordinary idealism, beyond realism, and beyond the very distinction between them. He denied that he ever had held a realist position:

. . . I still consider, as I did before, every form of the usual philosophical realism nonsensical in principle, no less so than that idealism which it sets itself up against in its arguments and which it “refutes.” [Phenomenological reduction] is a piece of pure self- reflection, exhibiting the most original evident facts; moreover, if it brings into view in them the outlines of idealism . .. it is still anything but a party to the usual debates bewteen idealism and realism. . . .

Husserl argued that transcendental-phenomenological idealism did not deny the actual existence of the real world, but sought instead to clarify the sense of this world (which everyone accepts) as actually existing.

Thus Husserl joins the company of those who read his work “backwards,” from the standpoint of Freiburg, interpreting the earliest work in light of the transcendental idealism of the latest. This reading grants no validity to the earlier work in its own right. It sets Husserl against Kant, and phenomenology’s thoroughgoing idealism against Kantian critical idealism. Fink, in his detailed response to neo- Kantians’ readings of Husserl’s phenomenology (1932), scolds them for even addressing arguments made in Husserl’s 1900-1 and 1913 publications–for Fink contends that those positions now must be assimilated to Husserl’s later formulations. The extreme hermeneutical implications of this stance come clear in Fink’s delineation of the threefold paradox entailed in reading Husserl’s phenomenology: (1) It is inevitably misunderstood if the reader has not first cultivated the transcendental attitude; yet that attitude arises from the reading. (2) The words necessarily miss their meaning, and fail to refer effectively to the pre-worldly realm of transcendental subjectivity, since all available words are worldly. (3) Phenomenology goes to a realm beyond logic, individuation, and determination, which ordinarily structure understanding. In this extreme form, then, the Freiburg reading of Husserl’s work is a locked door for the newcomer who is trying to get acquainted with Husserl’s phenomenology.

Fortunately, there are other hermeneutical options. A second group of commentators read Husserl “forward” from his intellectual beginnings at Vienna and Halle. The early work in mathematics and logic continues to attract the interest of Analytic philosophers. They are among those who argue that Husserl’s concern with numbers and logical reasoning, stimulated by the Kantian challenge, fructified in the prescription of eidetic and, eventually, phenomenological reductions.

Besides reading Husserl from Halle “forward” or from Freiburg “backward,” there is yet a third option. One may base one’s reading upon the Göttingen period and upon questions involving the genesis of the Ideen, as the keystone in the arch of Husserl’s development. This is the stance suggested by Ingarden, who considered Husserl’s later transcendentalism a big mistake, and by Stein, whose own subsequent works unfold the implications of the realism and personalism embraced by Husserl at that period. On this view the world, lost by Kant, is won back for science.

The problems of oneness and unity occupied Husserl throughout all the phases of his philosophical development: his earliest work on number and logic, his pre-war realist descriptive phenomenology, and his idealist transcendental phenomenology. His philosophy in some respects parallels the emergence of modern psychology, with whose tenets it should not be confused. The following are his major works.

3. On the Concept of Number (Übert den Begriff der Zahl, 1887)

Husserl’s Habilitationsschrift is subtitled “psychological analyses,” and it addresses the question how we recognize manyness within a group. Husserl remarks that the common definition of number–that number is a multiplicity of units–leaves two key questions unanswered: “What is ‘multiplicity’? And what is ‘unity’?” It is the former question, multiplicity, that occupies his attention throughout the essay. However the latter question, unity, haunts the discussion and refuses to be ignored.

Husserl locates the origin of multiplicity in the activity of combining, which he takes to be a psychological process. After much consideration he identifies this activity as synthesis, or the gathering of items into a set. He notices then that synthetic unities are of two kinds. Either the relationship through which the multiple items belong to the one set is a content of the mental representation of those items (right in there alongside them as another item that can be attended to and counted), or it is not there. In the former case, the unity is physical. Otherwise it is psychical, stemming from the unifying mental act that sets the contents into the relationship.

Having made that distinction between natural or physical unity, and arbitrary or imposed unity, Husserl then goes on to contrast these varieties of synthetic oneness with something else entirely: unsynthesized unity. His example is a rose, whose so-called parts are continuous and come apart only for the examining mind.

“In order to note the uniting relations in such a whole, analysis is necessary. If, for example, we are dealing with the representational whole which we call ‘a rose,’ we get at its various parts successively, by means of analysis: the leaves, the stem…. Each part is thrown into relief by a distinct act of noticing, and is steadily held together with those parts already segregated” (114).

Ironically, Husserl has struck gold while mining coal, and doesn’t quite recognize what he’s got hold of. His description of nonsynthesized unity comes almost as a byproduct of his attempt to differentiate physical or real collective combination from psychic combination. He writes:

“… these combining relations present themselves as, so to speak, a certain ‘more,’ in contrast to the mere totality, which appears merely to seize upon its parts, but not really to unite them [because they’re already united, independently of the mind!]…. In the totality there is a lack of any intuitive unification, as that sort of unification so clearly manifests itself in the metaphysical or continuous whole” (114).

Husserl has succeeded in distinguishing between natural and artificially synthesized wholes, on the one hand, and, on the other hand, those totalities that are known as having been accomplished neither by natural aggregation nor by mental combination. The unity of such wholes is known to be real, even though it admits of subsequent mental analysis or physical dissection.

Again ironically, in his concluding discussion of “number” Husserl neglects to notice the number one even as he employs it to illustrate how combination works. Substituting the term “and” for the term “collective combination,” Husserl remarks:

“(T)otality or multiplicity in abstracto is nothing other than ‘something or other’, and ‘something or other’, and ‘something or other’, etc.; or, more briefly, one thing, and one thing, and one thing, etc. Thus we see that the concept of the multiplicity contains, besides the concept of collective combination, only the concept something. Now this most general of all concepts is, as to its origin and content, easily analyzed” (116).

Husserl terms the concept something the most general concept. It stands for any object–real or unreal, physical or psychical–upon which we reflect. Thus he says that multiplicity as a concept arises out of the indetermination of the et-cetera that allows the series of “one and one and one and …” to go however far you like.

Yet an objection must be registered concerning what Husserl has found but not noticed. Multiplicity is but relatively undetermined; ultimately, multiplicity is in fact determined, or reined in, by one itself. This happens at three points. (a) One is the starting point of the counting series. Every number except the first number is a multiplicity; therefore the set of natural numbers is greater (by one!) than the set of multiplicities. (b) One determines the unit of counting. Only one something at a time gets counted. The and‘s must be put in between one‘s. (c) Although the series can stop anywhere, nevertheless it has to stop at one single place, not at several places. Every number is one distinct number.

Husserl, however, tries to produce the concept number by suppressing what he has taken to be the absolute indetermination of the something-series. This is how he gets determinate multiplicity, which he equates with number. In other words, the and‘s are the main ingredient for making numbers Husserl-style. This is incorrect, of course, but it is incorrect in an interesting way. For example, to make the number five, you would need four and‘s. To come up with those four and‘s, you would have to count them out; but before you could count to four, you would need three and‘s with which to make that four. But… there’s a regression back to one. The number five is four and’s, and five one’s.

The maddening difficulty of focusing upon combination eventually will have a happy outcome, which Husserl did not see in 1887. The truly interesting problem is one, the prime ingredient in numbers and the determiner whose own determination was to become Husserl’s guiding quest.

4. Logical Investigations (Logische Untersuchungen, 1900-01)

With the turn of the century, Husserl’s attention turned from and to one; that is, away from the mental activity of combining, and toward that which is reliably there to be combined. He wanted to show that mental activity is not the source of the latter. Chapter 8 of LU I exposes and refutes the three premises or “prejudices” of psychologism. In short, “psychologism” for Husserl is the error of collapsing the normative or regulative discipline of logic down onto the merely descriptive discipline of psychology. It would make mental operations (such as combination) the source of their own regulation. The “should” of logic, that utter necessity inhering in logical inference, would become no more than the “is” or facticity of our customary thinking processes, empirically described.

Husserl’s formulation and refutation of the three psychologistic premises is wickedly clever, but cannot be treated in detail here. (See # 43-49 of LU I.) One example must suffice. Psychologism, Husserl charges, would place logical inferences on the same plane with mental operations (# 44), and this would make even mathematics into a branch of psychology (# 45). Indeed, math and logic do have structures that are isomorphic to those of mental operations, such as combination and distinction. But given that similarity, how then would one distinguish the regulation of any of these processes from the description of it? Under psychologism, there’s no way. But Husserl makes the distinction in a way that also shows how regulation (that is, the laws of logic) comes from elsewhere than the plane of mental activity.

And he does this by virtue of one. In # 46 Husserl agrees with his opponents that arithmetical operations occur in patterns that refer back to mental acts for their origin and also for their meaning. However, there’s a difference between them as well. Mental acts transpire in time: they begin and end, and they can be repeated and individually counted. Numbers, in contrast, are timeless. While they can be represented in mental acts, this representation is not a fresh production of the number but rather an instantiation of its form. There is only one five. Any time we count five things, it isn’t a production of a new five but merely a deja vu for the same old five, eternal five. We can’t count numbers themselves, for there’s only one of each. (A similar argument is made in #22 of Ideas I.)

The same goes for logic, Husserl says. Concepts comprising the laws of pure logic can have no empirical range. Their range or sphere is ideal singulars, not mental generalizations from multiple instantiations. The operators of logic are other than those mental acts that happen to share the same names: “and,” “not,” “is,” “or,” “implies,” “may,” “must,” “should.” Psychologically, there can be many factual acts of combining, negating, etc. Logically, there is only one “and,” one “not,” etc. Husserl concedes here, as he did for arithmetic, that the logical operators take their origin and meaning from the mental acts. This accounts for the equivocal character of logical terms, which refer both to ideal singulars, and to mental states and acts. But if you fail to notice this equivocation, you become ensnared in psychologism, losing the possibility of pure logic and unified science.

The danger of equivocation extends over judgments as well. On the one hand, we can count multiple apperceptive events of affirmation, occurring psychologically, which proceed in time, begin and end, and recur as often as we like, in happenings that can be distinguished one from another. On the other hand, the judgment thus reached remains the same throughout each act accessing it. It seems to persist and to be called back for encore appearances; it seems even to have pre-existed its first appearance to me (# 47). In this latter sense, the judgment is not the same as the mental act that reaches it. Moreover, the truth of the judgment is neither equivalent to nor dependent upon the psychological experience of clear evidence that accompanies the mental act embracing it. Husserl easily shows this by recalling that in both logic and arithmetic, there are truths that have never been entertained in any human consciousness, and indeed could never be humanly conceived (# 50). (Cases of truth without the possibility of psychological evidence would include the computation of very large numbers, and decisions about membership in sets that are uncountably large. The arithmetical and logical operations connected with such determinations could never be “done” by a human mind or a computer. Their truth cannot be “factual.”)

The number one, then, has become Husserl’s touchstone for discriminating between psychological processes and logical laws. It is his reality detector. What is psychological (or empirical) comes on in discrete individual instances–ones–and you can examine their edges. What is logical (or ideal) comes on as a seamless oceanic unity without temporal edges, reliably persisting even when not attended to. Husserl’s sensitivity to the modes of unity, first expressed in the Habilitationsschrift and developed in LU, provides the launching pad for transcendental phenomenology.

5. Ideas I (Ideen I, 1913)

What launches transcendental phenomenology is the recognition that those modes of unity correlate with each other and with a third mode of unity, in ways that are tantalizingly asymmetrical. These three onenesses are: the factual unity of things and states of affairs, the eidetic unity of essences, and the living unity of consciousness as it flows along in a stream of experiences. Each has, and exhibits, its own distinctive kind of identity and persistence. Factual and essential unities give objects to the straightforward regard of consciousness, entering it as items of experience, each in its distinctive way; but consciousness can also deflect its regard back onto these enterings and discover its own unity, which is unlike either of theirs.

The possibility of this complex correlation is provided by the “principle of principles”: that intuitions come on to us with distinctive boundary-conditions that we can accept as sources insuring the correctness of our knowledge of them. Or in Husserl’s formulation:

“… that every originary presentive intuition is a legitimizing source of cognition, that everything originarily (so to speak, in its “personal” actuality) offered to us in ‘intuition’ is to be accepted simply as what it is presented as being, but also only within the limits in which it is presented there” (44).

The different kinds of unities have different kinds of edges, and these give away what kind of a unity each of them is going to be. But it’s easy to miss the differences. That happens in the natural attitude, Husserl says, when all the objects of consciousness are taken as if they were factual items. Husserl complains that even his Logische Untersuchungen have been misunderstood as advocating just this error of “Platonic realism,” by those who read into his use of the term “object” the implication that, through a perverse hypostatization, every thought turns into a thing (# 22). On the contrary, he says, the eidetic reduction, operative already in LU, empowers him to differentiate between how essences appear, and how cases appear.

Now with Ideen I, this distinction is sketched in beautiful detail. You can tell when the object occupying your consciousness is a physical thing, because things don’t give themselves to you all at once. What you get instead is a perspective inviting you to move around to the other side to perceive some more of the thing. All the while the thing keeps its unity to itself, as the reference point of all the angles it gives to you, and out of which you must reproduce or copy or simulate the unified thing as you conceive it. But in conceiving, you don’t have to put an “and” between two separate perceptions, the north face of a building and the south face, in order to yield the perception of the building as if it were a sum. These different views are given to you as continuous, as views of one thing.

Husserl terms this “shading off” or adumbration. (The notion of off-shading is reminiscent of a multiple-exposure photograph that captures successive phases of a movement in a single frame. Such photos were being seen for the first time at the turn of the century. Husserl also mentions new media such as the stereoscope and the cinema.) In contrast, essences give themselves to you all at once. Their boundaries are not sides, but rather laws entailing the characteristic necessities and possibilities of kinds of things (more about which below). The unity of any particular essence coheres within that determinate outermost boundary which free imaginative variations of possible cases must not exceed if they are to remain cases of this particular kind. Essential unity is centripetal, so to speak.

Then are those other unities–the ones presenting themselves as extended or factual–to be termed centrifugal, inasmuch as each spins off appearances in all directions from an inaccessible center? No, for their off-shading appears contextualized, as a foreground; and even as we focus upon the foreground it pulls its background into readiness for perception as soon as attention may shift to it. Every one is surrounded by a halo of and‘s, and beyond that are other somethings, seemingly without end. Whatever is extended is inexorably connected to whatever else is extended. (This last formulation, by the way, is an instance of an eidetic law. But the shift of attention that brings this essential rule into view is an eidetic reduction, and it wrenches us away from our naive attention to instances of things naturally appearing, under consideration here.) Every perception “motivates” another, stretching on toward expanding horizons.

The shift to the transcendental attitude–that is, the phenomenological or transcendental reduction–brings to Husserl’s notice a third kind of unity, which discloses the off-shading of things in a startling new way. We notice now that what is adumbrated is spatial, but the adumbration itself is not spatial. It arises in consciousness. “Abschattung ist Erlebnis” (95), while what is adumbrated, das Abgeschattete, has to be something spatial. The off-shading of things is at the same time the streaming of conscious life. Peculiarly, the giving off of partial perceptibilities (by the thing) coincides with the taking up of partial perceptions (by streaming consciousness). Which one is doing the shading? Agency cannot be imputed absolutely to either side.

But on the “side” of consciousness, as it were, we now recognize that we are dealing with more than a progression of life-bites strung together in series with and‘s. The stream of conscious life is not a sum or aggregate; nor is it a generalization. That is, it exhibits a unity unlike either the sachverhaltig unity of a factual case or the eidetisch unity of an essence. Husserl must account for that unity, which he calls an ego, Ich.

Moreover, and of paramount significance, with the benefit of the transcendental reduction it can now be told that these three kinds of unities themselves are not connected merely in series, with and‘s combining them, as if they were three discrete somethings. Their relationship is vastly more subtle. In order to understand it, through reduction we try to isolate unity from what accounts for unity. (We are not looking for something “prior to” unity — such as some “cause” of unity –, because we can’t have priority without having the number one, and oneness is just what is in question.)

Isolating oneness from the live experience-stream means removing the individual subject (you or me or Napoleon or whomever) from consideration. What is left, says Husserl, is transcendental subjectivity, “the pure act-process with its own essence” (“das reine Akterlebnis mit seinem eigenen Wesen“). (Paradoxically, we can see, right here in this formulation, that the reduction has not at all done away with essence, with states of affairs, or even with identity. We still have Eigenheit and Wesen, set in relation within a sentence. But these are now supposedly purified.) Husserl likens this de-individualized ego to a ray (# 92) or glance (# 101). Characteristically (or essentially) it has two poles or directions: the noematic and the noetic (from Greek terms noema and noesis, indicating what is thought and the act of thinking, respectively).

Husserl’s discussion of “noetic-noematic structures” fails in its attempt to show how the ego reaches and secures both the unity of the known object, and the unity of the knowing subject. But it fails in a spectacular starburst of insight. Husserl notices that the mental stream has its own distinctive kind of adumbrations or continuities, which are more complex than those discussed above, the relatively simple off-shaded appearings of spatial objects in perception. Beyond that simple sort of off-shading, consciousness can also turn back on itself and reflect upon its own intending acts, or on any component thereof. The stream meanders among spatial objects, but can also at whim objectify aspects of its own acts of intending, and consider them. This yields a thick layering of possible objects (# 97). For example, here are some noemata that might enter the live experience stream: pencils … writing … German verbs … the frustration of strong verbs … Ulrike … memories in general … the unreliability of memory … components of perceptions … the advisability of analyzing perceptions into their components … the smell of popcorn wafting into the study … the effort to resist distractions … and so forth.

Some of these arise directly from things, while others arise as objectifications of what was inherent a moment ago in the very act of knowing, the noesis. How can we tell the difference? Husserl answers that you can tell when the ego-beam has penetrated through to the bottom of the stack of noemata, so to speak, and has gotten ahold of a thing itself, because at that point, all the aspects of the thing are known immanently–really–in the act of perceiving as being contained in the sense of the thing (# 98). For example, you know popcorn itself when you are perceiving the taste of butter and salt. (You do not know popcorn when you read this sentence; instead, you are reflecting on what it is to know popcorn, and popcorn’s qualities are not given immanently within your object. But then while tasting popcorn, saltiness was given immanently but not objectified.)

Husserl rightly points out that we are able to slide up and down the pole of the ego-beam at will, moving now toward the thing, now away from it to consider the act of knowing and its modalities. For example, noematically I can consider a certain cat who probably exists, but then I can turn back noetically to assess the degree of certitude that characterizes my consideration of that selfsame cat as existing (# 105). Now if we were to slide down to the point where all modalities are behind us on the noetic side of the pole, and if there we were to face the object, we would get the pure sense of the object in which its unity is given.

In # 102 Husserl claims that this can happen, and that we can indeed slide far enough toward the object that the unity of the noema will be known as not having been imposed by the act of knowing. At that point, all of its qualities supposedly will be given immanently, really, contained in the perception rather than in the secondary conscious act that may grasp it a split-second later. Its sense will have been captured as something known with certainty to comprise its qualities, without the interference of a synthetic conscious act. (If this worked, it would effectively ensure the objectivity of knowledge, and would win the day for realism against idealism.) Husserl writes:

“The noematic objects … are unities transcendent to, but evidentially intended to in, the mental process. But if that is the case, then characteristics, which arise in [those unities] for consciousness and which are seized upon as their properties in focusing the regard on them, cannot possibly be regarded as really inherent moments of the mental process” (248-249).

Rather, they inhere in the object’s sense, and subsequently are lifted out for analysis in the mental process.

The ambitiousness of this claim is matched by that of another, which has to do with the opposite end of the ego-pole. In # 108 Husserl says that we can also shinny far enough up the ego-pole that we can capture the affirming noesis in its purity. All the modalities will have been loaded over onto the side of the noema, and the no_sis will be a believing affirmation, pure and simple: an unqualified yes. Thus Husserl insists that there is a crucial difference between (a) being validly negated and (b) not-being. For example, he would distinguish (a) denying correctly that my spayed cat has a kitten, from (b) affirming that the kitten of my spayed cat is a non-entity. With (a), the negativity inheres in the noesis, which has not yet been purified of all modality; but with (b), the noesis would be pure affirmation (# 104).

How correct is Husserl’s argument? We must grant that whatever makes this particular kitten impossible inheres elsewhere than in my knowing about it, for my denying something can’t make it go away. Furthermore, there’s nothing to prevent my forcing myself to think positively the thought of the kitten that my cat never had. Such a noetic posture is at least conceivable. However, its mere possibility is not enough to accomplish Husserl’s purpose. Husserl needs to show that this pure affirming belief really is done, somewhere somehow, in the toughest case, the case of an intrinsically impossible entity such as the kitten of a spayed cat. (That is, has anyone succeeded in recapturing that magic moment of purely affirming noesis with regard to an intrinsically impossible object? And if so, how would one go about certifying the accomplishment?)

Unfortunately, neither end of the ego-ray connects as Husserl had hoped. At the noetic pole, the purely affirming ego eludes the grasp of consciousness; so does the pure sense of the thing itself, at the noematic pole. These terms may remain as ideal asymptotes toward which the ego-ray continually points while continually falling short. The successful recovery of the connection between knowing and reality awaits another strategy, to be mounted by Husserl in the posthumously published second volume of Ideen.

6. Ideas II (Ideen II)

The second volume of Husserl’s Ideen (publication withheld until 1952) is the work of many hands. Husserl was dissatisfied with it and did not publish it. The first draft was written very rapidly in 1912, immediately after the manuscript of the first volume was completed. Husserl added material in 1915, and turned it over for editing to his assistant Edith Stein, who had come with him to Freiburg from Gottingen. Stein transcribed the work from Husserl’s shorthand in 1916. He gave her further material, and in 1918 she produced a collation arranged and titled as at present: the constitution of material nature, of animal nature, and of the cultural world. But Husserl’s phenomenology was evolving, and the manuscript did not suit him. Another assistant, Ludwig Landgrebe, worked on it 1923-25, and Husserl himself edited it in again 1928. It finally came out posthumously.

If the pursuit of unity had guided Husserl like a north star from his earliest writing on through the discovery and first articulation of phenomenology, then in Ideen II that star becomes obscured by “light pollution” from numerous more recent and competing insights. Without access to the manuscripts, it is impossible to know with precision how that came about. In portions of the text as we have it, the concern with unity remains a significant factor.

However, other portions seem to go against the grain of key insights from the first volume and the earlier works. For example, in LU and Ideen I, the material sphere had comprised states of affairs; that is, facts or cases such as could be expressed in logical propositions. There were indeed “things” in there, such as roses, yet the emphasis was upon the factual scenarios into which these things figured. By contrast, in Ideen II “material nature” is populated with substantial items, and the fact they are embedded in circumstances has to be additionally stipulated, almost as an afterthought (# 15c). By the same token, in the earlier work the eidetic sphere had comprised the forms of logical propositions and the rules of inference. While there were indeed “essences” entailed there, nevertheless the emphasis fell upon the lawful patterns of thinking about being. By contrast, in Ideen II “animal nature” is populated by psychic items whose unity is analogous to that of physical things yet whose active engagement with the latter can hardly be explained.

This shift matters, because judgments and perceptions reach unity in quite different ways. To certify that one selfsame proposition (e.g., that the cat is on the mat) returns to our consciousness on several occasions is quite a different task than to certify that one selfsame substantial entity (e.g., this mat-loving cat) returns to our sight every afternoon. Husserl’s early discoveries about unity had to do with judgment, and they were based upon the lived difference between synthetic judgments and analytic judgments. His ambitions then were not primarily metaphysical or epistemological. Moreover, it is relatively easy to “feel” the difference among three sorts of judgment: (a) a synthetic judgment that arbitrarily groups several items together, (b) a synthetic judgment that groups things in recognition of some characteristic that all share independently of the judgment, and (c) a judgment that the unity imputed to a thing is not owing to judgment at all. The distinction among these judgment-forms was already established in the Habilitationsschrift. However the task undertaken in Ideen II is forcibly to transpose that distinction onto perception, and so to come up with a general test for certifying when knowledge is genuinely in touch with reality.

This project is set in motion in # 9, where new terminology is introduced for the threefold distinction first made in “Begriff der Zahl.” (However, now that the transcendental reduction is presupposed, the arrow of causality should be removed. There can be only correlation or its absence.) BZ’s “psychic relation” now becomes “categorial synthesis,” in which perception serendipitously collects disparate items into one group, for no special reason intrinsic to the items. BZ’s “content relation” (or “physical relation”) becomes “aesthetic synthesis” (or “sensuous synthesis”), in which perception recognizes some intrinsic reason for grouping these items and finds itself constrained to do so by something other than mere whim. And BZ’s uncomposed unity (e.g., “that rose there”) becomes the “pure sense-object.”

In BZ, “synthesis” meant a combining judgment: a judgment that erected a set of things with many members. A set with one member–that is, a unified thing–obviously needed no synthesizing judgment to set it up. In Ideen II, however, “synthesis” means a perception that, while receiving multiple impressions (the off-shadings or Abschattungen), composes an object out of them. But this object is a unity, not a group; in fact, it is what Husserl would earlier have called an uncomposed unity. In other words aesthetic synthesis–operating now over partial views, not discrete items–finds that it has a reason for referring those multiple impressions to one object, even though the unity of the thing never gives itself directly to consciousness. What is that reason? This question is enticing, because Husserl is tantalizingly close here to describing a way in which the real unity of things is available for knowledge.

Husserl works on this question in # 15b, where “the spatial body is a synthetic unity of a manifold of strata of ‘sensuous appearances’ of different senses” (42-43). The spatially extended thing is a unity drawing together all the experiences we have had of it, and summoning us toward further experiences of it through sight and touch and our other senses. It achieves its unity as a spatial location, which seems not to depend upon whether or not it is actually perceived.

However, Husserl cautions, this unity alone is insufficient to validate itself. He writes:

“(W)e have first taken the body as independent of all causal conditioning, i.e., merely as a unity which presents itself visually or tactually, through multiplicities of sensations, as endowed with an inner content of characteristic features…. But in what we have said, it is also implied that under the presupposition referred to (namely, that we take the thing outside of the nexuses in which it is a thing) we do not find, as we carry out experiences, any possibility for deciding, in a way that exhibits, whether the experienced material thing is actual or whether we are subject to mere illusion and are experiencing a mere phantom” (43).

Thus, reality is not guaranteed for an isolated item, even when it seems to be giving us a reason to take it as the unified core attracting its manifold appearings to one hub of reference. The central location of the thing is dependent upon its real circumstances, as Husserl goes on to say in # 15c. The reality of “one” depends on “others”; i.e., on thing-connection. The thing is what it is in relation to its surroundings. This becomes apparent when things move and change, for their changes must correlate coherently with reciprocal changes in the things next to them.

Such co-variance is what certifies reality–or materiality, which Husserl seems to equate with it. In # 15c, reality means substantial causality. Within the webwork of material things, everything affects everything else. The real is the causal. Co-variance across the material realm, then, is what certifies the oneness and reality of that realm (# 15e).

Animated bodies also connect in the webwork of material things (# 13). Each of them is a center of appearings, a one, just as every other thing is. However, unlike soulless bodies, each animal is also a zero. It lives at a point of origination. The animal body bears the zero-point of orientation for the pure ego (61), as its absolute “here” (135, 166). Arithmetically, this is a stunning contrast. Every “something” whatsoever is either a one or a one-and-one-and-etc. But the animated body, in addition to being just one of those somethings, is also the one who is zero: the one from whom the counting starts, the one who chooses whether and where it is appropriate to insert the and‘s.

But any series that is initiated by/at/in the living body is counted off nonarbitrarily. Such series go in order; they are “motivated.” This is owing to the movement of the body itself within the material web. The body’s own kinesthetic sense will coordinate with the corresponding changes in sensory perceptions as it navigates among things. Thus, the zero shifts position in relation to the other unified centers to which perceptions accrue; but as it does so, the series of their appearings change in a regular way (63).

What about counting zero’s? Are they multiple; are there many human bodies? Husserl declines to pursue this avenue of approach into the problem of other minds and human community. Intersubjectivity will treated instead as an implication of the reality of the material world, not a precondition for it. The multiplicity of bodies is taken up only on page 83, where it is admitted that the foregoing analysis has been framed on the assumption that there would be only one, “solipsistic,” point-zero in reality. Belatedly, other bodies now are brought into the picture–but not because they are necessary for its unity, or because they have been apprehended among the realities presenting to consciousness. The others are brought in because they are required for the full unification of the thing in reality, whether that thing is one of the physical bodies or my very own live body. To be is to be describable (87). Reality for the thing entails a possibility of appearing to anyone at all. Being counted from one zero-point is not enough for the real thing. To count, it needs the possibility of being counted from multiple directions.

The thing is a rule of appearances. That means that the thing is a reality as a unity of a manifold of appearances connected according to rules. Moreover, this unity is an intersubjective one…. The physicalistic thing is intersubjectively common in that it has validity for all individuals who stand in possible communion with us (91-92).

To be real, the thing must count as a place or location, a center, independently of any particular point of origin. Yet what grants reality to the thing is not some consensus reached by observers. Indeed, the thing may look entirely different to different observers; however, its reality constrains all to agree that, at least, “it is there.” Oddly, then, the real thing is another kind of zero, for its barest reality consists in its being an empty place-holder (91-93).

Finally, Husserl makes unity a synonym for the philosophical term “substance” as traditionally meant. For example, he says that both the soul and the body are unities, so that an analogy obtains between psychic unity and material unity (129, 131). Oneness becomes the ontological form that determines substantial reality (133). The pure ego is one with respect to an individual stream of consciousness, that is, before the transcendental reduction has de-individuated the latter (117); however the pure ego is insubstantial and not one whenever the reduction is in effect (128).

And so Husserl’s quest for unity splinters and spends itself out by diverting into many contradictory projects pursued by the many unharmonized voices of Ideen II. Although the manuscript remained unpublished, it was made available for consultation by a number of Husserl’s younger colleagues. Among the last publication of Husserl’s lifetime was the Cartesian Meditations of 1931, in which he addressed the apparent solipsism of his transcendental phenomenology. That work itself was undergoing a comprehensive reworking in partnership with Husserl’s assistant Eugen Fink during the years before Husserl’s death in 1938.

7. References and Further Reading

Husserl’s publications and his extensive Nachlass are being brought out in a multi-volume critical edition entitled Husserliana – Edmund Husserl, Gesammelte Werke, from Nijhoff in The Hague. The major works published during Husserl’s lifetime are the following:

  • Über den Begriff der Zahl. Psychologische Analysen, 1887.
  • Philosophie der Arithmetik. Psychologische und logische Untersuchungen, 1891.
  • Logische Untersuchungen. Erste Teil: Prolegomena zur reinen Logik, 1900; reprinted 1913.
  • Logische Untersuchungen. Zweite Teil: Untersuchungen zur Phänomenologie und Theorie der Erkenntnis, 1901; second edition 1913 (for part one); second edition 1921 (for part two).
  • “Philosophie als strenge Wissenschaft,” Logos 1 (1911) 289-341.
  • Ideen zu einer reinen Ph‰nomenologie und phänomenologischen Philosophie. Erstes Buch: Allgemeine Einführung in die reine Phänomenologie, 1913.
  • “Vorlesungen zur Phänomenologie des inneren Zeitbewusstseins,” Jahrbuch für Philosophie und phänomenologische Forschung 9 (1928), 367-498.
  • “Formale und transzendentale Logik. Versuch einer Kritik der logischen Vernunft,” Jahrbuch für Philosophie und phänomenologische Forschung 10 (1929) 1-298.
  • Mèditations cartèsiennes, 1931.
  • “Die Krisis der europäischen Wissenschaften und die transzentale Phänomenologie: Eine Einleitung in die phänomenologische Philosophie,” Philosophia 1 (1936) 77-176.

Works translated into English by Husserl (In chronological order.  Publication dates of the German originals are in brackets.)

  • “Philosophy as Rigorous Science,” trans. in Q. Lauer (ed.), Phenomenology and the Crisis of Philosophy, New York: Harper [1910], 1965.
  • Formal and Transcendental Logic, trans. D. Cairns. The Hague: Nijhoff [1929], 1969.
  • The Crisis of European Sciences and Transcendental Philosophy, trans. D. Carr. Evanston: Northwestern University Press
    [1936/54], 1970.
  • Logical Investigations, trans. J. N. Findlay, London: Routledge [1900/01; 2nd, revised edition 1913], 1973.
  • Experience and Judgement, trans. J. S. Churchill and K. Ameriks, London: Routledge [1939], 1973.
  • Ideas Pertaining to a Pure Phenomenology and to a Phenomenological Philosophy – Third Book: Phenomenology and the Foundations of the Sciences, trans. T. E. Klein and W. E. Pohl, Dordrecht: Kluwer, 1980.
  • Ideas Pertaining to a Pure Phenomenology and to a Phenomenological Philosophy — First Book: General Introduction to a Pure Phenomenology, trans. F. Kersten. The Hague: Nijhoff (= Ideas) [1913], 1982.
  • Cartesian Meditations, trans. D. Cairns, Dordrecht: Kluwer [1931], 1988.
  • Ideas Pertaining to a Pure Phenomenology and to a Phenomenological Philosophy – Second Book: Studies in the Phenomenology of Constitution, trans. R. Rojcewicz and A. Schuwer, Dordrecht: Kluwer, 1989.
  • On the Phenomenology of the Consciousness of Internal Time (1893-1917), trans. J. B. Brough, Dordrecht: Kluwer [1928], 1990.
  • Early Writings in the Philosophy of Logic and Mathematics, trans. D. Willard, Dordrecht: Kluwer, 1994.
  • The Essential Husserl, ed. D. Welton, Bloomington: Indiana University Press, 1999.

Further Reading:

  • Bell, David (1990) Husserl, London: Routledge.
  • Bernet, Rudolf and Kern, Iso and Marbach, Eduard (1993) An Introduction to Husserlian Phenomenology, Evanston: Northwestern University Press.
  • Carr, David (1987) Interpreting Husserl, Dordrecht: Nijhoff.
  • Derrida, Jacques (1978) Edmund Husserl’s ‘Origin of Geometry’, trans. J.P. Leavy, New York: Harvester Press, 1978.
  • Dreyfus, Hubert (ed.) (1982) Husserl, Intentionality, and Cognitive Science, Cambridge/Mass.: MIT Press.
  • Drummond, John (1990) Husserlian Intentionality and Non-Foundational Realism, Dordrecht: Kluwer.
  • Levinas, E., (1973) The Theory of Intuition in Husserl’s Phenomenology, Evanston: Northwestern University Press.
  • Mohanty, J. N. and McKenna, William (eds.) (1989) Husserl’s Phenomenology: A Textbook, Lanham: University Press of America.
  • Smith, Barry and Smith, David Woodruff (eds.) (1995) The Cambridge Companion to Husserl, Cambridge: Cambridge University Press.
  • Sokolowski, Robert (ed.) (1988) Edmund Husserl and the Phenomenological Tradition, Washington: Catholic University of America Press.
  • Zahavi, Dan (2003) Husserl’s Phenomenology, Stanford: Stanford University Press.

 

Author Information

Marianne Sawicki
Email: law.sawicki@gmail.com
Huntingdon, Pennsylvania
U. S. A.

Hedonism

The term “hedonism,” from the Greek word ἡδονή (hēdonē) for pleasure, refers to several related theories about what is good for us, how we should behave, and what motivates us to behave in the way that we do. All hedonistic theories identify pleasure and pain as the only important elements of whatever phenomena they are designed to describe.  If hedonistic theories identified pleasure and pain as merely two important elements, instead of the only important elements of what they are describing, then they would not be nearly as unpopular as they all are. However, the claim that pleasure and pain are the only things of ultimate importance is what makes hedonism distinctive and philosophically interesting.

Philosophical hedonists tend to focus on hedonistic theories of value, and especially of well-being (the good life for the one living it). As a theory of value, hedonism states that all and only pleasure is intrinsically valuable and all and only pain is intrinsically not valuable. Hedonists usually define pleasure and pain broadly, such that both physical and mental phenomena are included. Thus, a gentle massage and recalling a fond memory are both considered to cause pleasure and stubbing a toe and hearing about the death of a loved one are both considered to cause pain. With pleasure and pain so defined, hedonism as a theory about what is valuable for us is intuitively appealing. Indeed, its appeal is evidenced by the fact that nearly all historical and contemporary treatments of well-being allocate at least some space for discussion of hedonism.  Unfortunately for hedonism, the discussions rarely endorse it and some even deplore its focus on pleasure.

This article begins by clarifying the different types of hedonistic theories and the labels they are often given. Then, hedonism’s ancient origins and its subsequent development are reviewed. The majority of this article is concerned with describing the important theoretical divisions within Prudential Hedonism and discussing the major criticisms of these approaches.

Table of Contents

  1. Types of Hedonism
    1. Folk Hedonism
    2. Value Hedonism and Prudential Hedonism
    3. Motivational Hedonism
    4. Normative Hedonism
    5. Hedonistic Egoism
    6. Hedonistic Utilitarianism
  2. The Origins of Hedonism
    1. Cārvāka
    2. Aritippus and the Cyrenaics
    3. Epicurus
    4. The Oyster Example
  3. The Development of Hedonism
    1. Bentham
    2. Mill
    3. Moore
  4. Contemporary Varieties of Hedonism
    1. The Main Divisions
    2. Pleasure as Sensation
    3. Pleasure as Intrinsically Valuable Experience
    4. Pleasure as Pro-Attitude
  5. Contemporary Objections
    1. Pleasure is Not the Only Source of Intrinsic Value
    2. Some Pleasure is Not Valuable
    3. There is No Coherent and Unifying Definition of Pleasure
  6. The Future of Hedonism
  7. References and Further Reading
    1. Primary Sources
    2. Secondary and Mixed Sources

1. Types of Hedonism

a. Folk Hedonism

When the term “hedonism” is used in modern literature, or by non-philosophers in their everyday talk, its meaning is quite different from the meaning it takes when used in the discussions of philosophers. Non-philosophers tend to think of a hedonist as a person who seeks out pleasure for themselves without any particular regard for their own future well-being or for the well-being of others. According to non-philosophers, then, a stereotypical hedonist is someone who never misses an opportunity to indulge of the pleasures of sex, drugs, and rock ‘n’ roll, even if the indulgences are likely to lead to relationship problems, health problems, regrets, or sadness for themselves or others. Philosophers commonly refer to this everyday understanding of hedonism as “Folk Hedonism.” Folk Hedonism is a rough combination of Motivational Hedonism, Hedonistic Egoism, and a reckless lack of foresight.

b. Value Hedonism and Prudential Hedonism

When philosophers discuss hedonism, they are most likely to be referring to hedonism about value, and especially the slightly more specific theory, hedonism about well-being. Hedonism as a theory about value (best referred to as Value Hedonism) holds that all and only pleasure is intrinsically valuable and all and only pain is intrinsically disvaluable. The term “intrinsically” is an important part of the definition and is best understood in contrast to the term “instrumentally.” Something is intrinsically valuable if it is valuable for its own sake. Pleasure is thought to be intrinsically valuable because, even if it did not lead to any other benefit, it would still be good to experience. Money is an example of an instrumental good; its value for us comes from what we can do with it (what we can buy with it). The fact that a copious amount of money has no value if no one ever sells anything reveals that money lacks intrinsic value. Value Hedonism reduces everything of value to pleasure. For example, a Value Hedonist would explain the instrumental value of money by describing how the things we can buy with money, such as food, shelter, and status-signifying goods, bring us pleasure or help us to avoid pain.

Hedonism as a theory about well-being (best referred to as Prudential Hedonism) is more specific than Value Hedonism because it stipulates what the value is for. Prudential Hedonism holds that all and only pleasure intrinsically makes people’s lives go better for them and all and only pain intrinsically makes their lives go worse for them. Some philosophers replace “people” with “animals” or “sentient creatures,” so as to apply Prudential Hedonism more widely. A good example of this comes from Peter Singer’s work on animals and ethics. Singer questions why some humans can see the intrinsic disvalue in human pain, but do not also accept that it is bad for sentient non-human animals to experience pain.

When Prudential Hedonists claim that happiness is what they value most, they intend happiness to be understood as a preponderance of pleasure over pain. An important distinction between Prudential Hedonism and Folk Hedonism is that Prudential Hedonists usually understand that pursuing pleasure and avoiding pain in the very short-term is not always the best strategy for achieving the best long-term balance of pleasure over pain.

Prudential Hedonism is an integral part of several derivative types of hedonistic theory, all of which have featured prominently in philosophical debates of the past. Since Prudential Hedonism plays this important role, the majority of this article is dedicated to Prudential Hedonism. First, however, the main derivative types of hedonism are briefly discussed.

c. Motivational Hedonism

Motivational Hedonism (more commonly referred to by the less descriptive label, “Psychological Hedonism“) is the theory that the desires to encounter pleasure and to avoid pain guide all of our behavior. Most accounts of Motivational Hedonism include both conscious and unconscious desires for pleasure, but emphasize the latter. Epicurus, William James, Sigmund Freud, Jeremy Bentham, John Stuart Mill, and (on one interpretation) even Charles Darwin have all argued for varieties of Motivational Hedonism. Bentham used the idea to support his theory of Hedonistic Utilitarianism (discussed below). Weak versions of Motivational Hedonism hold that the desires to seek pleasure and avoid pain often or always have some influence on our behavior. Weak versions are generally considered to be uncontroversially true and not especially useful for philosophy.

Philosophers have been more interested in strong accounts of Motivational Hedonism, which hold that all behavior is governed by the desires to encounter pleasure and to avoid pain (and only those desires). Strong accounts of Motivational Hedonism have been used to support some of the normative types of hedonism and to argue against non-hedonistic normative theories. One of the most notable mentions of Motivational Hedonism is Plato’s Ring of Gyges example in The Republic. Plato’s Socrates is discussing with Glaucon how men would react if they were to possess a ring that gives its wearer immense powers, including invisibility. Glaucon believes that a strong version of Motivational Hedonism is true, but Socrates does not. Glaucon asserts that, emboldened with the power provided by the Ring of Gyges, everyone would succumb to the inherent and ubiquitous desire to pursue their own ends at the expense of others. Socrates disagrees, arguing that good people would be able to overcome this desire because of their strong love of justice, fostered through philosophising.

Strong accounts of Motivational Hedonism currently garner very little support for similar reasons. Many examples of seemingly-pain-seeking acts performed out of a sense of duty are well-known – from the soldier who jumps on a grenade to save his comrades to that time you rescued a trapped dog only to be (predictably) bitten in the process. Introspective evidence also weighs against strong accounts of Motivational Hedonism; many of the decisions we make seem to be based on motives other than seeking pleasure and avoiding pain. Given these reasons, the burden of proof is considered to be squarely on the shoulders of anyone wishing to argue for a strong account of Motivational Hedonism.

d. Normative Hedonism

Value Hedonism, occasionally with assistance from Motivational Hedonism, has been used to argue for specific theories of right action (theories that explain which actions are morally permissible or impermissible and why). The theory that happiness should be pursued (that pleasure should be pursued and pain should be avoided) is referred to as Normative Hedonism and sometimes Ethical Hedonism.  There are two major types of Normative Hedonism, Hedonistic Egoism and Hedonistic Utilitarianism. Both types commonly use happiness (defined as pleasure minus pain) as the sole criterion for determining the moral rightness or wrongness of an action. Important variations within each of these two main types specify either the actual resulting happiness (after the act) or the predicted resulting happiness (before the act) as the moral criterion. Although both major types of Normative Hedonism have been accused of being repugnant, Hedonistic Egoism is considered the most offensive.

e. Hedonistic Egoism

Hedonistic Egoism is a hedonistic version of egoism, the theory that we should, morally speaking, do whatever is most in our own interests. Hedonistic Egoism is the theory that we ought, morally speaking, to do whatever makes us happiest – that is whatever provides us with the most net pleasure after pain is subtracted. The most repugnant feature of this theory is that one never has to ascribe any value whatsoever to the consequences for anyone other than oneself. For example, a Hedonistic Egoist who did not feel saddened by theft would be morally required to steal, even from needy orphans (if he thought he could get away with it). Would-be defenders of Hedonistic Egoism often point out that performing acts of theft, murder, treachery and the like would not make them happier overall because of the guilt, the fear of being caught, and the chance of being caught and punished. The would-be defenders tend to surrender, however, when it is pointed out that a Hedonistic Egoist is morally obliged by their own theory to pursue an unusual kind of practical education; a brief and possibly painful training period that reduces their moral emotions of sympathy and guilt. Such an education might be achieved by desensitising over-exposure to, and performance of, torture on innocents. If Hedonistic Egoists underwent such an education, their reduced capacity for sympathy and guilt would allow them to take advantage of any opportunities to perform pleasurable, but normally-guilt-inducing, actions, such as stealing from the poor.

Hedonistic Egoism is very unpopular amongst philosophers, not just for this reason, but also because it suffers from all of the objections that apply to Prudential Hedonism.

f. Hedonistic Utilitarianism

Hedonistic Utilitarianism is the theory that the right action is the one that produces (or is most likely to produce) the greatest net happiness for all concerned. Hedonistic Utilitarianism is often considered fairer than Hedonistic Egoism because the happiness of everyone involved (everyone who is affected or likely to be affected) is taken into account and given equal weight. Hedonistic Utilitarians, then, tend to advocate not stealing from needy orphans because to do so would usually leave the orphan far less happy and the (probably better-off) thief only slightly happier (assuming he felt no guilt). Despite treating all individuals equally, Hedonistic Utilitarianism is still seen as objectionable by some because it assigns no intrinsic moral value to justice, friendship, truth, or any of the many other goods that are thought by some to be irreducibly valuable. For example, a Hedonistic Utilitarian would be morally obliged to publicly execute an innocent friend of theirs if doing so was the only way to promote the greatest happiness overall. Although unlikely, such a situation might arise if a child was murdered in a small town and the lack of suspects was causing large-scale inter-ethnic violence. Some philosophers argue that executing an innocent friend is immoral precisely because it ignores the intrinsic values of justice, friendship, and possibly truth.

Hedonistic Utilitarianism is rarely endorsed by philosophers, but mainly because of its reliance on Prudential Hedonism as opposed to its utilitarian element. Non-hedonistic versions of utilitarianism are about as popular as the other leading theories of right action, especially when it is the actions of institutions that are being considered.

2. The Origins of Hedonism

a. Cārvāka

Perhaps the earliest written record of hedonism comes from the Cārvāka, an Indian philosophical tradition based on the Barhaspatya sutras. The Cārvāka persisted for two thousand years (from about 600 B.C.E.). Most notably, the Cārvāka advocated scepticism and Hedonistic Egoism – that the right action is the one that brings the actor the most net pleasure. The Cārvāka acknowledged that some pain often accompanied, or was later caused by, sensual pleasure, but that pleasure was worth it.

b. Aritippus and the Cyrenaics

The Cyrenaics, founded by Aristippus (c. 435-356 B.C.E.), were also sceptics and Hedonistic Egoists. Although the paucity of original texts makes it difficult to confidently state all of the justifications for the Cyrenaics’ positions, their overall stance is clear enough. The Cyrenaics believed pleasure was the ultimate good and everyone should pursue all immediate pleasures for themselves. They considered bodily pleasures better than mental pleasures, presumably because they were more vivid or trustworthy. The Cyrenaics also recommended pursuing immediate pleasures and avoiding immediate pains with scant or no regard for future consequences. Their reasoning for this is even less clear, but is most plausibly linked to their sceptical views – perhaps that what we can be most sure of in this uncertain existence is our current bodily pleasures.

c. Epicurus

Epicurus (c. 341-271 B.C.E.), founder of Epicureanism, developed a Normative Hedonism in stark contrast to that of Aristippus. The Epicureanism of Epicurus is also quite the opposite to the common usage of Epicureanism; while we might like to go on a luxurious “Epicurean” holiday packed with fine dining and moderately excessive wining, Epicurus would warn us that we are only setting ourselves up for future pain. For Epicurus, happiness was the complete absence of bodily and especially mental pains, including fear of the Gods and desires for anything other than the bare necessities of life. Even with only the limited excesses of ancient Greece on offer, Epicurus advised his followers to avoid towns, and especially marketplaces, in order to limit the resulting desires for unnecessary things. Once we experience unnecessary pleasures, such as those from sex and rich food, we will then suffer from painful and hard to satisfy desires for more and better of the same. No matter how wealthy we might be, Epicurus would argue, our desires will eventually outstrip our means and interfere with our ability to live tranquil, happy lives. Epicureanism is generally egoistic, in that it encourages everyone to pursue happiness for themselves. However, Epicureans would be unlikely to commit any of the selfish acts we might expect from other egoists because Epicureans train themselves to desire only the very basics, which gives them very little reason to do anything to interfere with the affairs of others.

d. The Oyster Example

With the exception of a brief period discussed below, Hedonism has been generally unpopular ever since its ancient beginnings. Although criticisms of the ancient forms of hedonism were many and varied, one in particular was heavily cited. In Philebus, Plato’s Socrates and one of his many foils, Protarchus in this instance, are discussing the role of pleasure in the good life. Socrates asks Protarchus to imagine a life without much pleasure but full of the higher cognitive processes, such as knowledge, forethought and consciousness and to compare it with a life that is the opposite. Socrates describes this opposite life as having perfect pleasure but the mental life of an oyster, pointing out that the subject of such a life would not be able to appreciate any of the pleasure within it. The harrowing thought of living the pleasurable but unthinking life of an oyster causes Protarchus to abandon his hedonistic argument. The oyster example is now easily avoided by clarifying that pleasure is best understood as being a conscious experience, so any sensation that we are not consciously aware of cannot be pleasure.

3. The Development of Hedonism

a. Bentham

Normative and Motivational Hedonism were both at their most popular during the heyday of Empiricism in the 18th and 19th Centuries. Indeed, this is the only period during which any kind of hedonism could be considered popular at all. During this period, two Hedonistic Utilitarians, Jeremy Bentham (1748-1832) and his protégé John Stuart Mill (1806-1873), were particularly influential. Their theories are similar in many ways, but are notably distinct on the nature of pleasure.

Bentham argued for several types of hedonism, including those now referred to as Prudential Hedonism, Hedonistic Utilitarianism, and Motivational Hedonism (although his commitment to strong Motivational Hedonism eventually began to wane). Bentham argued that happiness was the ultimate good and that happiness was pleasure and the absence of pain. He acknowledged the egoistic and hedonistic nature of peoples’ motivation, but argued that the maximization of collective happiness was the correct criterion for moral behavior. Bentham’s greatest happiness principle states that actions are immoral if they are not the action that appears to maximise the happiness of all the people likely to be affected; only the action that appears to maximise the happiness of all the people likely to be affected is the morally right action.

Bentham devised the greatest happiness principle to justify the legal reforms he also argued for. He understood that he could not conclusively prove that the principle was the correct criterion for morally right action, but also thought that it should be accepted because it was fair and better than existing criteria for evaluating actions and legislation. Bentham thought that his Hedonic Calculus could be applied to situations to see what should, morally speaking, be done in a situation. The Hedonic Calculus is a method of counting the amount of pleasure and pain that would likely be caused by different actions. The Hedonic Calculus required a methodology for measuring pleasure, which in turn required an understanding of the nature of pleasure and specifically what aspects of pleasure were valuable for us.

Bentham’s Hedonic Calculus identifies several aspects of pleasure that contribute to its value, including certainty, propinquity, extent, intensity, and duration. The Hedonic Calculus also makes use of two future-pleasure-or-pain-related aspects of actions – fecundity and purity. Certainty refers to the likelihood that the pleasure or pain will occur. Propinquity refers to how long away (in terms of time) the pleasure or pain is. Fecundity refers to the likelihood of the pleasure or pain leading to more of the same sensation. Purity refers to the likelihood of the pleasure or pain leading to some of the opposite sensation. Extent refers to the number of people the pleasure or pain is likely to affect. Intensity refers to the felt strength of the pleasure or pain. Duration refers to how long the pleasure or pain are felt for. It should be noted that only intensity and duration have intrinsic value for an individual. Certainty, propinquity, fecundity, and purity are all instrumentally valuable for an individual because they affect the likelihood of an individual feeling future pleasure and pain. Extent is not directly valuable for an individual’s well-being because it refers to the likelihood of other people experiencing pleasure or pain.

Bentham’s inclusion of certainty, propinquity, fecundity, and purity in the Hedonic Calculus helps to differentiate his hedonism from Folk Hedonism. Folk Hedonists rarely consider how likely their actions are to lead to future pleasure or pain, focussing instead on the pursuit of immediate pleasure and the avoidance of immediate pain. So while Folk Hedonists would be unlikely to study for an exam, anyone using Bentham’s Hedonic Calculus would consider the future happiness benefits to themselves (and possibly others) of passing the exam and then promptly begin studying.

Most importantly for Bentham’s Hedonic Calculus, the pleasure from different sources is always measured against these criteria in the same way, that is to say that no additional value is afforded to pleasures from particularly moral, clean, or culturally-sophisticated sources. For example, Bentham held that pleasure from the parlor game push-pin was just as valuable for us as pleasure from music and poetry. Since Bentham’s theory of Prudential Hedonism focuses on the quantity of the pleasure, rather than the source-derived quality of it, it is best described as a type of Quantitative Hedonism.

b. Mill

Bentham’s indifferent stance on the source of pleasures led to others disparaging his hedonism as the philosophy of swine. Even his student, John Stuart Mill, questioned whether we should believe that a satisfied pig leads a better life than a dissatisfied human or that a satisfied fool leads a better life than a dissatisfied Socrates – results that Bentham’s Quantitative Hedonism seems to endorse.

Like Bentham, Mill endorsed the varieties of hedonism now referred to as Prudential Hedonism, Hedonistic Utilitarianism, and Motivational Hedonism. Mill also thought happiness, defined as pleasure and the avoidance of pain, was the highest good. Where Mill’s hedonism differs from Bentham’s is in his understanding of the nature of pleasure. Mill argued that pleasures could vary in quality, being either higher or lower pleasures. Mill employed the distinction between higher and lower pleasures in an attempt to avoid the criticism that his hedonism was just another philosophy of swine. Lower pleasures are those associated with the body, which we share with other animals, such as pleasure from quenching thirst or having sex. Higher pleasures are those associated with the mind, which were thought to be unique to humans, such as pleasure from listening to opera, acting virtuously, and philosophising. Mill justified this distinction by arguing that those who have experienced both types of pleasure realise that higher pleasures are much more valuable. He dismissed challenges to this claim by asserting that those who disagreed lacked either the experience of higher pleasures or the capacity for such experiences. For Mill, higher pleasures were not different from lower pleasures by mere degree; they were different in kind. Since Mill’s theory of Prudential Hedonism focuses on the quality of the pleasure, rather than the amount of it, it is best described as a type of Qualitative Hedonism.

c. Moore

George Edward Moore (1873-1958) was instrumental in bringing hedonism’s brief heyday to an end. Moore’s criticisms of hedonism in general, and Mill’s hedonism in particular, were frequently cited as good reasons to reject hedonism even decades after his death. Indeed, since G. E. Moore, hedonism has been viewed by most philosophers as being an initially intuitive and interesting family of theories, but also one that is flawed on closer inspection. Moore was a pluralist about value and argued persuasively against the Value Hedonists’ central claim – that all and only pleasure is the bearer of intrinsic value. Moore’s most damaging objection against Hedonism was his heap of filth example. Moore himself thought the heap of filth example thoroughly refuted what he saw as the only potentially viable form of Prudential Hedonism – that conscious pleasure is the only thing that positively contributes to well-being. Moore used the heap of filth example to argue that Prudential Hedonism is false because pleasure is not the only thing of value.

In the heap of filth example, Moore asks the reader to imagine two worlds, one of which is exceedingly beautiful and the other a disgusting heap of filth. Moore then instructs the reader to imagine that no one would ever experience either world and asks if it is better for the beautiful world to exist than the filthy one. As Moore expected, his contemporaries tended to agree that it would be better if the beautiful world existed.  Relying on this agreement, Moore infers that the beautiful world is more valuable than the heap of filth and, therefore, that beauty must be valuable. Moore then concluded that all of the potentially viable theories of Prudential Hedonism (those that value only conscious pleasures) must be false because something, namely beauty, is valuable even when no conscious pleasure can be derived from it.

Moore’s heap of filth example has rarely been used to object to Prudential Hedonism since the 1970’s because it is not directly relevant to Prudential Hedonism (it evaluates worlds and not lives). Moore’s other objections to Prudential Hedonism also went out of favor around the same time. The demise of these arguments was partly due to mounting objections against them, but mainly because arguments more suited to the task of refuting Prudential Hedonism were developed. These arguments are discussed after the contemporary varieties of hedonism are introduced below.

4. Contemporary Varieties of Hedonism

a. The Main Divisions

Several contemporary varieties of hedonism have been defended, although usually by just a handful of philosophers or less at any one time. Other varieties of hedonism are also theoretically available but have received little or no discussion. Contemporary varieties of Prudential Hedonism can be grouped based on how they define pleasure and pain, as is done below. In addition to providing different notions of what pleasure and pain are, contemporary varieties of Prudential Hedonism also disagree about what aspect or aspects of pleasure are valuable for well-being (and the opposite for pain).

The most well-known disagreement about what aspects of pleasure are valuable occurs between Quantitative and Qualitative Hedonists. Quantitative Hedonists argue that how valuable pleasure is for well-being depends on only the amount of pleasure, and so they are only concerned with dimensions of pleasure such as duration and intensity. Quantitative Hedonism is often accused of over-valuing animalistic, simple, and debauched pleasures.

Qualitative Hedonists argue that, in addition to the dimensions related to the amount of pleasure, one or more dimensions of quality can have an impact on how pleasure affects well-being. The quality dimensions might be based on how cognitive or bodily the pleasure is (as it was for Mill), the moral status of the source of the pleasure, or some other non-amount-related dimension. Qualitative Hedonism is criticised by some for smuggling values other than pleasure into well-being by misleadingly labelling them as dimensions of pleasure. How these qualities are chosen for inclusion is also criticised for being arbitrary or ad hoc by some because inclusion of these dimensions of pleasure is often in direct response to objections that Quantitative Hedonism cannot easily deal with. That is to say, the inclusion of these dimensions is often accused of being an exercise in plastering over holes, rather than deducing corollary conclusions from existing theoretical premises. Others have argued that any dimensions of quality can be better explained in terms of dimensions of quantity. For example, they might claim that moral pleasures are no higher in quality than immoral pleasures, but that moral pleasures are instrumentally more valuable because they are likely to lead to more moments of pleasure or less moments of pain in the future.

Hedonists also have differing views about how the value of pleasure compares with the value of pain. This is not a practical disagreement about how best to measure pleasure and pain, but rather a theoretical disagreement about comparative value, such as whether pain is worse for us than an equivalent amount of pleasure is good for us. The default position is that one unit of pleasure (sometimes referred to as a Hedon) is equivalent but opposite in value to one unit of pain (sometimes referred to as a Dolor). Several Hedonistic Utilitarians have argued that reduction of pain should be seen as more important than increasing pleasure, sometimes for the Epicurean reason that pain seems worse for us than an equivalent amount of pleasure is good for us. Imagine that a magical genie offered for you to play a game with him. The game consists of you flipping a fair coin. If the coin lands on heads, then you immediately feel a burst of very intense pleasure and if it lands on tails, then you immediately feel a burst of very intense pain. Is it in your best interests to play the game?

Another area of disagreement between some Hedonists is whether pleasure is entirely internal to a person or if it includes external elements. Internalism about pleasure is the thesis that, whatever pleasure is, it is always and only inside a person. Externalism about pleasure, on the other hand, is the thesis that, pleasure is more than just a state of an individual (that is, that a necessary component of pleasure lies outside of the individual). Externalists about pleasure might, for example, describe pleasure as a function that mediates between our minds and the environment, such that every instance of pleasure has one or more integral environmental components. The vast majority of historic and contemporary versions of Prudential Hedonism consider pleasure to be an internal mental state.

Perhaps the least known disagreement about what aspects of pleasure make it valuable is the debate about whether we have to be conscious of pleasure for it to be valuable. The standard position is that pleasure is a conscious mental state, or at least that any pleasure a person is not conscious of does not intrinsically improve their well-being.

b. Pleasure as Sensation

The most common definition of pleasure is that it is a sensation, something that we identify through our senses or that we feel. Psychologists claim that we have at least ten senses, including the familiar, sight, hearing, smell, taste, and touch, but also, movement, balance, and several sub-senses of touch, including heat, cold, pressure, and pain. New senses get added to the list when it is understood that some independent physical process underpins their functioning. The most widely-used examples of pleasurable sensations are the pleasures of eating, drinking, listening to music, and having sex. Use of these examples has done little to help Hedonism avoid its debauched reputation.

It is also commonly recognised that our senses are physical processes that usually involve a mental component, such as the tickling feeling when someone blows gently on the back of your neck. If a sensation is something we identify through our sense organs, however, it is not entirely clear how to account for abstract pleasures. This is because abstract pleasures, such as a feeling of accomplishment for a job well done, do not seem to be experienced through any of the senses in the standard lists. Some Hedonists have attempted to resolve this problem by arguing for the existence of an independent pleasure sense and by defining sensation as something that we feel (regardless of whether it has been mediated by sense organs).

Most Hedonists who describe pleasure as a sensation will be Quantitative Hedonists and will argue that the pleasure from the different senses is the same. Qualitative Hedonists, in comparison, can use the framework of the senses to help differentiate between qualities of pleasure. For example, a Qualitative Hedonist might argue that pleasurable sensations from touch and movement are always lower quality than the others.

c. Pleasure as Intrinsically Valuable Experience

Hedonists have also defined pleasure as intrinsically valuable experience, that is to say any experiences that we find intrinsically valuable either are, or include, instances of pleasure. According to this definition, the reason that listening to music and eating a fine meal are both intrinsically pleasurable is because those experiences include an element of pleasure (along with the other elements specific to each activity, such as the experience of the texture of the food and the melody of the music). By itself, this definition enables Hedonists to make an argument that is close to perfectly circular. Defining pleasure as intrinsically valuable experience and well-being as all and only experiences that are intrinsically valuable allows a Hedonist to all but stipulate that Prudential Hedonism is the correct theory of well-being. Where defining pleasure as intrinsically valuable experience is not circular is in its stipulation that only experiences matter for well-being. Some well-known objections to this idea are discussed below.

Another problem with defining pleasure as intrinsically valuable experience is that the definition does not tell us very much about what pleasure is or how it can be identified. For example, knowing that pleasure is intrinsically valuable experience would not help someone to work out if a particular experience was intrinsically or just instrumentally valuable. Hedonists have attempted to respond to this problem by explaining how to find out whether an experience is intrinsically valuable.

One method is to ask yourself if you would like the experience to continue for its own sake (rather than because of what it might lead to). Wanting an experience to continue for its own sake reveals that you find it to be intrinsically valuable. While still making a coherent theory of well-being, defining intrinsically valuable experiences as those you want to perpetuate makes the theory much less hedonistic. The fact that what a person wants is the main criterion for something having intrinsic value, makes this kind of theory more in line with preference satisfaction theories of well-being. The central claim of preference satisfaction theories of well-being is that some variant of getting what one wants, or should want, under certain conditions is the only thing that intrinsically improves one’s well-being.

Another method of fleshing out the definition of pleasure as intrinsically valuable experience is to describe how intrinsically valuable experiences feel. This method remains a hedonistic one, but seems to fall back into defining pleasure as a sensation.

It has also been argued that what makes an experience intrinsically valuable is that you like or enjoy it for its own sake. Hedonists arguing for this definition of pleasure usually take pains to position their definition in between the realms of sensation and preference satisfaction. They argue that since we can like or enjoy some experiences without concurrently wanting them or feeling any particular sensation, then liking is distinct from both sensation and preference satisfaction. Liking and enjoyment are also difficult terms to define in more detail, but they are certainly easier to recognise than the rather opaque “intrinsically valuable experience.”

Merely defining pleasure as intrinsically valuable experience and intrinsically valuable experiences as those that we like or enjoy still lacks enough detail to be very useful for contemplating well-being. A potential method for making this theory more useful would be to draw on the cognitive sciences to investigate if there is a specific neurological function for liking or enjoying. Cognitive science has not reached the point where anything definitive can be said about this, but a few neuroscientists have experimental evidence that liking and wanting (at least in regards to food) are neurologically distinct processes in rats and have argued that it should be the same for humans. The same scientists have wondered if the same processes govern all of our liking and wanting, but this question remains unresolved.

Most Hedonists who describe pleasure as intrinsically valuable experience believe that pleasure is internal and conscious. Hedonists who define pleasure in this way may be either Quantitative or Qualitative Hedonists, depending on whether they think that quality is a relevant dimension of how intrinsically valuable we find certain experiences.

d. Pleasure as Pro-Attitude

One of the most recent developments in modern hedonism is the rise of defining pleasure as a pro-attitude – a positive psychological stance toward some object. Any account of Prudential Hedonism that defines pleasure as a pro-attitude is referred to as Attitudinal Hedonism because it is a person’s attitude that dictates whether anything has intrinsic value. Positive psychological stances include approving of something, thinking it is good, and being pleased about it. The object of the positive psychological stance could be a physical object, such as a painting one is observing, but it could also be a thought, such as “my country is not at war,” or even a sensation. An example of a pro-attitude towards a sensation could be being pleased about the fact that an ice cream tastes so delicious.

Fred Feldman, the leading proponent of Attitudinal Hedonism, argues that the sensation of pleasure only has instrumental value – it only brings about value if you also have a positive psychological stance toward that sensation. In addition to his basic Intrinsic Attitudinal Hedonism, which is a form of Quantitative Hedonism, Feldman has also developed many variants that are types of Qualitative Hedonism. For example, Desert-Adjusted Intrinsic Attitudinal Hedonism, which reduces the intrinsic value a pro-attitude has for our well-being based on the quality of deservedness (that is, on the extent to which the particular object deserves a pro-attitude or not). For example, Desert-Adjusted Intrinsic Attitudinal Hedonism might stipulate that sensations of pleasure arising from adulterous behavior do not deserve approval, and so assign them no value.

Defining pleasure as a pro-attitude, while maintaining that all sensations of pleasure have no intrinsic value, makes Attitudinal Hedonism less obviously hedonistic as the versions that define pleasure as a sensation. Indeed, defining pleasure as a pro-attitude runs the risk of creating a preference satisfaction account of well-being because being pleased about something without feeling any pleasure seems hard to distinguish from having a preference for that thing.

5. Contemporary Objections

a. Pleasure is Not the Only Source of Intrinsic Value

The most common argument against Prudential Hedonism is that pleasure is not the only thing that intrinsically contributes to well-being. Living in reality, finding meaning in life, producing noteworthy achievements, building and maintaining friendships, achieving perfection in certain domains, and living in accordance with religious or moral laws are just some of the other things thought to intrinsically add value to our lives. When presented with these apparently valuable aspects of life, Hedonists usually attempt to explain their apparent value in terms of pleasure. A Hedonist would argue, for example, that friendship is not valuable in and of itself, rather it is valuable to the extent that it brings us pleasure. Furthermore, to answer why we might help a friend even when it harms us, a Hedonist will argue that the prospect of future pleasure from receiving reciprocal favors from our friend, rather than the value of friendship itself, should motivate us to help in this way.

Those who object to Prudential Hedonism on the grounds that pleasure is not the only source of intrinsic value use two main strategies. In the first strategy, objectors make arguments that some specific value cannot be reduced to pleasure. In the second strategy, objectors cite very long lists of apparently intrinsically valuable aspects of life and then challenge hedonists with the prolonged and arduous task of trying to explain how the value of all of them can be explained solely by reference to pleasure and the avoidance of pain. This second strategy gives good reason to be a pluralist about value because the odds seem to be against any monistic theory of value, such as Prudential Hedonism. The first strategy, however, has the ability to show that Prudential Hedonism is false, rather than being just unlikely to be the best theory of well-being.

The most widely cited argument for pleasure not being the only source of intrinsic value is based on Robert Nozick’s experience machine thought-experiment. Nozick’s experience machine thought-experiment was designed to show that more than just our experiences matter to us because living in reality also matters to us. This argument has proven to be so convincing that nearly every single book on ethics that discusses hedonism rejects it using only this argument or this one and one other.

In the thought experiment, Nozick asks us to imagine that we have the choice of plugging in to a fantastic machine that flawlessly provides an amazing mix of experiences. Importantly, this machine can provide these experiences in a way that, once plugged in to the machine, no one can tell that their experiences are not real. Disregarding considerations about responsibilities to others and the problems that would arise if everyone plugged in, would you plug in to the machine for life? The vast majority of people reject the choice to live a much more pleasurable life in the machine, mostly because they agree with Nozick that living in reality seems to be important for our well-being. Opinions differ on what exactly about living in reality is so much better for us than the additional pleasure of living in the experience machine, but the most common response is that a life that is not lived in reality is pointless or meaningless.

Since this argument has been used so extensively (from the mid 1970’s onwards) to dismiss Prudential Hedonism, several attempts have been made to refute it. Most commonly, Hedonists argue that living an experience machine life would be better than living a real life and that most people are simply mistaken to not want to plug in. Some go further and try to explain why so many people choose not to plug in. Such explanations often point out that the most obvious reasons for not wanting to plug in can be explained in terms of expected pleasure and avoidance of pain. For example, it might be argued that we expect to get pleasure from spending time with our real friends and family, but we do not expect to get as much pleasure from the fake friends or family we might have in the experience machine. These kinds of attempts to refute the experience machine objection do little to persuade non-Hedonists that they have made the wrong choice.

A more promising line of defence for the Prudential Hedonists is to provide evidence that there is a particular psychological bias that affects most people’s choice in the experience machine thought experiment. A reversal of Nozick’s thought experiment has been argued to reveal just such a bias. Imagine that a credible source tells you that you are actually in an experience machine right now. You have no idea what reality would be like. Given the choice between having your memory of this conversation wiped and going to reality, what would be best for you to choose? Empirical evidence on this choice shows that most people would choose to stay in the experience machine. Comparing this result with how people respond to Nozick’s experience machine thought experiment reveals the following: In Nozick’s experience machine thought experiment people tend to choose a real and familiar life over a more pleasurable life and in the reversed experience machine thought experiment people tend to choose a familiar life over a real life. Familiarity seems to matter more than reality, undermining the strength of Nozick’s original argument. The bias thought to be responsible for this difference is the status quo bias – an irrational preference for the familiar or for things to stay as they are.

Regardless of whether Nozick’s experience machine thought experiment is as decisive a refutation of Prudential Hedonism as it is often thought to be, the wider argument (that living in reality is valuable for our well-being) is still a problem for Prudential Hedonists. That our actions have real consequences, that our friends are real, and that our experiences are genuine seem to matter for most of us regardless of considerations of pleasure. Unfortunately, we lack a trusted methodology for discerning if these things should matter to us. Perhaps the best method for identifying intrinsically valuable aspects of lives is to compare lives that are equal in pleasure and all other important ways, except that one aspect of one of the lives is increased. Using this methodology, however, seems certain to lead to an artificial pluralist conclusion about what has value. This is because any increase in a potentially valuable aspect of our lives will be viewed as a free bonus. And, most people will choose the life with the free bonus just in case it has intrinsic value, not necessarily because they think it does have intrinsic value.

b. Some Pleasure is Not Valuable

The main traditional line of criticism against Prudential Hedonism is that not all pleasure is valuable for well-being, or at least that some pleasures are less valuable than others because of non-amount-related factors. Some versions of this criticism are much easier for Prudential Hedonists to deal with than others depending on where the allegedly disvaluable aspect of the pleasure resides. If the disvaluable aspect is experienced with the pleasure itself, then both Qualitative and Quantitative varieties of Prudential Hedonism have sufficient answers to these problems. If, however, the disvaluable aspect of the pleasure is never experienced, then all types of Prudential Hedonism struggle to explain why the allegedly disvaluable aspect is irrelevant.

Examples of the easier criticisms to deal with are that Prudential Hedonism values, or at least overvalues, perverse and base pleasures. These kinds of criticisms tend to have had more sway in the past and doubtless encouraged Mill to develop his Qualitative Hedonism. In response to the charge that Prudential Hedonism mistakenly values pleasure from sadistic torture, sating hunger, copulating, listening to opera, and philosophising all equally, Qualitative Hedonists can simply deny that it does. Since pleasure from sadistic torture will normally be experienced as containing the quality of sadism (just as the pleasure from listening to good opera is experienced as containing the quality of acoustic excellence), the Qualitative Hedonist can plausibly claim to be aware of the difference in quality and allocate less value to perverse or base pleasures accordingly.

Prudential Hedonists need not relinquish the Quantitative aspect of their theory in order to deal with these criticisms, however. Quantitative Hedonists, can simply point out that moral or cultural values are not necessarily relevant to well-being because the investigation of well-being aims to understand what the good life for the one living it is and what intrinsically makes their life go better for them. A Quantitative Hedonist can simply respond that a sadist that gets sadistic pleasure from torturing someone does improve their own well-being (assuming that the sadist never feels any negative emotions or gets into any other trouble as a result). Similarly, a Quantitative Hedonist can argue that if someone genuinely gets a lot of pleasure from porcine company and wallowing in the mud, but finds opera thoroughly dull, then we have good reason to think that having to live in a pig sty would be better for her well-being than forcing her to listen to opera.

Much more problematic for both Quantitative and Qualitative Hedonists, however, are the more modern versions of the criticism that not all pleasure is valuable. The modern versions of this criticism tend to use examples in which the disvaluable aspect of the pleasure is never experienced by the person whose well-being is being evaluated. The best example of these modern criticisms is a thought experiment devised by Shelly Kagan. Kagan’s deceived businessman thought experiment is widely thought to show that pleasures of a certain kind, namely false pleasures, are worth much less than true pleasures.

Kagan asks us to imagine the life of a very successful businessman who takes great pleasure in being respected by his colleagues, well-liked by his friends, and loved by his wife and children until the day he died. Then Kagan asks us to compare this life with one of equal length and the same amount of pleasure (experienced as coming from exactly the same sources), except that in each case the businessman is mistaken about how those around him really feel. This second (deceived) businessman experiences just as much pleasure from the respect of his colleagues and the love of his family as the first businessman. The only difference is that the second businessman has many false beliefs. Specifically, the deceived businessman’s colleagues actually think he is useless, his wife doesn’t really love him, and his children are only nice to him so that he will keep giving them money. Given that the deceived businessman never knew of any of these deceptions and his experiences were never negatively impacted by the deceptions indirectly, which life do you think is better?

Nearly everyone thinks that the deceived businessman has a worse life. This is a problem for Prudential Hedonists because the pleasure is quantitatively equal in each life, so they should be equally good for the one living it. Qualitative Hedonism does not seem to be able to avoid this criticism either because the falsity of the pleasures experienced by the deceived businessman is a dimension of the pleasure that he never becomes aware of. Theoretically, an externalist and qualitative version of Attitudinal Hedonism could include the falsity dimension of an instance of pleasure even if the falsity dimension never impacts the consciousness of the person. However, the resulting definition of pleasure bears little resemblance to what we commonly understand pleasure to be and also seems to be ad hoc in its inclusion of the truth dimension but not others. A dedicated Prudential Hedonist of any variety can always stubbornly stick to the claim that the lives of the two businessmen are of equal value, but that will do little to convince the vast majority to take Prudential Hedonism more seriously.

c. There is No Coherent and Unifying Definition of Pleasure

Another major line of criticism used against Prudential Hedonists is that they have yet to come up with a meaningful definition of pleasure that unifies the seemingly disparate array of pleasures while remaining recognisable as pleasure. Some definitions lack sufficient detail to be informative about what pleasure actually is, or why it is valuable, and those that do offer enough detail to be meaningful are faced with two difficult tasks.

The first obstacle for a useful definition of pleasure for hedonism is to unify all of the diverse pleasures in a reasonable way. Phenomenologically, the pleasure from reading a good book is very different to the pleasure from bungee jumping, and both of these pleasures are very different to the pleasure of having sex. This obstacle is unsurpassable for most versions of Quantitative Hedonism because it makes the value gained from different pleasures impossible to compare. Not being able to compare different types of pleasure results in being unable to say if a life is better than another in most even vaguely realistic cases. Furthermore, not being able to compare lives means that Quantitative Hedonism could not be usefully used to guide behavior since it cannot instruct us on which life to aim for.

Attempts to resolve the problem of unifying the different pleasures while remaining within a framework of Quantitative Hedonism, usually involve pointing out something that is constant in all of the disparate pleasures and defining that particular thing as pleasure. When pleasure is defined as a strict sensation, this strategy fails because introspection reveals that no such sensation exists. Pleasure defined as the experience of liking or as a pro-attitude does much better at unifying all of the diverse pleasures. However, defining pleasure in these ways makes the task of filling in the details of the theory a fine balancing act. Liking or pro-attitudes must be described in such a way that they are not solely a sensation or best described as a preference satisfaction theory. And they must perform this balancing act while still describing a scientifically plausible and conceptually coherent account of pleasure. Most attempts to define pleasure as liking or pro-attitudes seem to disagree with either the folk conception of what pleasure is or any of the plausible scientific conceptions of how pleasure functions.

Most varieties of Qualitative Hedonism do better at dealing with the problem of diverse pleasures because they can evaluate different pleasures according to their distinct qualities. Qualitative Hedonists still need a coherent method for comparing the different pleasures with each other in order to be more than just an abstract theory of well-being, however. And, it is difficult to construct such a methodology in a way that avoids counter examples, while still describing a scientifically plausible and conceptually coherent account of pleasure.

The second obstacle is creating a definition of pleasure that retains at least some of the core properties of the common understanding of the term ‘pleasure’. As mentioned, many of the potential adjustments to the main definitions of pleasure are useful for avoiding one or more of the many objections against Prudential Hedonism. The problem with this strategy is that the more adjustments that are made, the more apparent it becomes that the definition of pleasure is not recognisable as the pleasure that gave Hedonism its distinctive intuitive plausibility in the first place. When an instance of pleasure is defined simply as when someone feels good, its intrinsic value for well-being is intuitively obvious. However, when the definition of pleasure is stretched, so as to more effectively argue that all valuable experiences are pleasurable, it becomes much less recognisable as the concept of pleasure we use in day-to-day life and its intrinsic value becomes much less intuitive.

6. The Future of Hedonism

The future of hedonism seems bleak. The considerable number and strength of the arguments against Prudential Hedonism’s central principle (that pleasure and only pleasure intrinsically contributes positively to well-being and the opposite for pain) seem insurmountable. Hedonists have been creative in their definitions of pleasure so as to avoid these objections, but more often than not find themselves defending a theory that is not particularly hedonistic, realistic or both.

Perhaps the only hope that Hedonists of all types can have for the future is that advances in cognitive science will lead to a better understanding of how pleasure works in the brain and how biases affect our judgements about thought experiments. If our improved understanding in these areas confirms a particular theory about what pleasure is and also provides reasons to doubt some of the widespread judgements about the thought experiments that make the vast majority of philosophers reject hedonism, then hedonism might experience at least a partial revival. The good news for Hedonists is that at least some emerging theories and results from cognitive science do appear to support some aspects of hedonism.

7. References and Further Reading

a. Primary Sources

  • Bentham, Jeremy (1789). An Introduction to the Principles of Morals and Legislation, First printed in 1780 and first published in 1789. A corrected edition with extra footnotes and paragraphs at the end was published in 1823. Recent edition: Adamant Media Corporation, 2005.
    • Bentham’s main discussion of his Quantitative Hedonistic Utilitarianism.
  • Blake, R. M. (1926). Why Not Hedonism? A Protest, International Journal of Ethics, 37(1): 1-18.
    • An excellent refutation of G. E. Moore’s main arguments against hedonism.
  • Crisp, Roger (2006). Reasons and the Good, Oxford: Clarendon Press.
    • Discusses the importance of ultimate reasons and argues that the best of these do not use moral concepts. The volume also defends Prudential Hedonism, especially Chapter 4.
  • Crisp, R. (2006). Hedonism Reconsidered, Philosophy and Phenomenological Research, LXXIII(3): 619-645.
    • Essentially the same as Chapter 4 from his Reasons and the Good.
  • De Brigard, F. (2010). If You Like it, Does it Matter if it’s Real?, Philosophical Psychology, 23(1): 43-57.
    • Presents empirical evidence that the experience machine thought experiment is heavily affected by a psychological bias.
  • Feldman, Fred (1997). Utilitarianism, Hedonism, and Desert: Essays in Moral Philosophy, Cambridge: Cambridge University Press.
    • Contains a mixture of topics relevant to hedonism, including modern and ancient theories and objections. There is a detailed section on adjusting pleasure to take deservedness into account (Part III).
  • Feldman, Fred (2004). Pleasure and the Good Life, Oxford: Clarendon Press.
    • The best and most detailed account of Attitudinal Hedonism. This volume also includes a very detailed account of how Prudential Hedonism should be defined.
  • Kagan, Shelly (1998). Chapter 2: The Good, in his Normative Ethics, Oxford: Westview Press, pp. 25-69.
    • See especially pp. 34-36 for the first discussion of the Deceived Businessman thought experiment.
  • Kawall, J. (1999). The Experience Machine and Mental State Theories of Well-being, The Journal of Value Inquiry, 33: 381-387.
    • An excellent article about the strengths and weaknesses of the experience machine thought experiment as it is used against hedonism.
  • Kringelbach, Morten L. & Berridge, Kent B. (eds.) (2010). Pleasures of the Brain, Oxford University Press.
    • This edited volume collects papers from the leading experts on pleasure from the disciplines of psychology and neuroscience. Perhaps of most value is a chapter at the front of the book in which the experts all answer a standard set of questions posed by the editors. The questions include: “Is pleasure necessarily a conscious feeling?”, “Is pleasure simply a sensation, like sweetness?”, and “Is there a common currency for all sensory pleasures… [o]r are different sensory pleasures mediated by different neural circuits?” The experts answers to these questions is the perfect starting point for a philosopher looking to find out more about pleasure from cognitive science.
  • Mill, John Stuart (1861). Utilitarianism, Indianapolis: Bobbs-Merrill, 1957.
    • Mill’s main discussion of his Qualitative Hedonistic Utilitarianism.
  • Moore, George E. (1903). Principia Ethica, Cambridge: Cambridge University Press.
    • See especially Chapter 3 on hedonism, which contains Moore’s influential arguments against hedonism.
  • Nozick, R. (1974). Anarchy, State, and Utopia, Oxford: Blackwell, 1991.
    • See especially pp. 42-45 for the first discussion of the experience machine thought experiment.
  • Plato (1937). Philebus, in The Dialogues of Plato, trans. by B. Jowett, New York: Random House.
    • See especially Part ii, p. 353 for the oyster example.
  • Plato (1974). Book II: Preliminaries, in The Republic, trans. by Desmond Lee, second edition (revised), Penguin Books, 1983.
    • Original discussion of the Ring of Gyges example.
  • Sobel, D. (1999). Pleasure as a Mental State, Utilitas, 11(2): 230-234.
    • Argues against the viability of defining pleasure as intrinsically valuable experience.
  • Sobel, D. (2002). Varieties of Hedonism, Journal of Social Philosophy, 33(2): 240-256.
    • Describes some of the main types of Prudential Hedonism and the problems with them.
  • Tännsjö, T. (1998). Hedonistic Utilitarianism, Edinburgh University Press.
    • Tännsjö endorses unconscious pleasures as being valuable, an unusual contemporary position. This book can be difficult to acquire.
  • Tännsjö, T. (2007). Narrow Hedonism, Journal of Happiness Studies, 8:79-98.
    • Worth a look if you do not have access to his Hedonistic Utilitarianism.
  • Weijers, D. (2011). Intuitive Biases in Judgements about Thought Experiments: The Experience Machine Revisited, Philosophical Writings, 50 & 51.
    • An explanation of how psychological biases can affect our judgements about thought experiments using the experience machine thought experiment as an example.

b. Secondary and Mixed Sources

  • Gunaratna. Tarkarahasyadīpika. Cārvāka/Lokāyata: an Anthology of Source Materials and Some Recent Studies. Ed. Debiprasad Chattopadhyaya. New Delhi: Indian Council of Philosophical Research in association with Rddhi-India Calcutta, 1990.
    • An anthology of primary source materials on the Cārvāka with some more recent analysis.
  • Inwood, B., & Gerson, L. P. (eds.) (1994). The Epicurus Reader: Selected Writings and Testimonia. Indianapolis, Hackett Publishing.
    • An inexpensive collection that has most of the major extant writings of Epicurus, in addition to other ancient sources such as Cicero and Plutarch who wrote about Epicureanism. However, there is little commentary or explication of the material, and some of the primary sources are fairly opaque. Unusually, the writing of Lucretius does not feature much in this collection.
  • Laertius, Diogenes (1925) Aristippus, in Volume I, Book II of his Lives of the Philosophers, trans. by R. D. Hicks, The Loeb Classical Library, Harvard University Press.
  • Laertius, Diogenes (1925) Epicurus, in Volume II, Book X of Lives of Eminent Philosophers, trans. by R. D. Hicks, The Loeb Classical Library, Harvard University Press.
  • Mitsis, Phillip (1988). Epicurus’ Ethical Theory: The Pleasures of Invulnerability, Ithaca: Cornell University Press.
    • Covers all of the major areas of Epicurean ethics, from pleasure, to friendship, justice, and human freedom. Mitsis is especially good at showing how Epicurus’ conception of pleasure differs from that of the more modern Utilitarians.
  • O’Keefe, Tim (2002). The Cyrenaics on Pleasure, Happiness, and Future-Concern, Phronesis, 47(4), pp. 395-416.
    • Explores the question of why the Cyrenaics, alone among ancient Greek ethical theorists, claim that happiness is not the highest good, but particular pleasures are instead, and that one should not worry about the long-term consequences of one’s actions but instead concentrate on obtaining pleasures that are near at hand.
  • Smith, James and Sosa, Ernest (eds.) (1969). Mill’s Utilitarianism: Text and Criticism, Belmont CA: Wadsworth.
    • A great collection of Mill’s writing and commentaries on it by Mill scholars. The volume also includes an extensive section on suggested further reading. The Further reading section is helpfully broken down by which of Mill’s essays they are most relevant to.
  • West, Henry R. (ed.) (2006). The Blackwell Guide to Mill’s Utilitarianism, Oxford: Blackwell.
    • A collection of essays on different aspects of Mill’s Hedonistic Utilitarianism and the relevant original passages from Mill. See especially Chapter 4, Wendy Donner’s article, Mill’s Theory of Value.

 

Author Information

Dan Weijers
Email: danweijers@gmail.com
Victoria University of Wellington
New Zealand

The Safety Condition for Knowledge

A number of epistemologists have defended a necessary condition for knowledge that has come to be labeled as the “safety” condition. Timothy Williamson, Duncan Pritchard, and Ernest Sosa are the foremost defenders of safety. According to these authors an agent S knows a true proposition P only if S could not easily have falsely believed P. Disagreement arises, however, with respect to how they capture the notion of a safe belief.

Unlike Pritchard and Sosa, who have gone on to incorporate the safety condition into a virtue account of knowledge, Williamson distances himself from the project of offering reductive analyses of knowledge. Williamson’s project can best be thought of as an illumination of the structural features of knowledge by way of safety. The maneuvers of Pritchard and Sosa into the domain of virtue epistemology are not discussed here.

This article is a treatment of the different presentations and defenses of the safety condition for knowledge. Special attention is first paid to an elucidation of the various aspects or features of the safety condition. Following a short demonstration, of the manner in which the safety condition handles some rather tough Gettier-like cases in the literature, some problems facing safety conclude this article.

Table of Contents

  1. Historical Background
  2. The Safety Condition as a Necessary Condition for Knowledge
    1. Timothy Williamson
    2. Duncan Pritchard
    3. Ernest Sosa
  3. Elucidating the Safety Condition
    1. What Counts as a Close World?
      1. The Time Factor
      2. What Type of Reliability does Safety Require?
      3. Methods
      4. Skepticism
    2. How do the Safety and Sensitivity Conditions Differ?
    3. The Semantics of Safety
  4. Safety in Action
    1. Gettier and Chisholm
    2. Fake Barns
    3. Matches
  5. Problems for Safety
    1. Knowledge of Necessarily True Propositions
    2. Knowledge of the Future
      1. Williamson’s Response
      2. Pritchard’s Response
    3. Safety and Determinism
  6. References and Further Reading

1. Historical Background

Knowledge is incompatible with accidentally true belief. That is to say, if an agent S is lucky that her belief P is true, S does not know P. This feature of knowledge was made explicit by Bertrand Russell (1948: 170) and, more famously, by Edmund Gettier (1963) who demonstrated that a justified true belief (JTB) is insufficient for knowledge. Gettier provided us with cases in which there is strong intuitive pull towards the judgment that S can have a justified true belief P yet not know P because S is lucky that S’s belief P is true. To use Russell’s case, suppose S truly believes it’s noon as a result of looking at a clock that correctly reads noon. However, unbeknownst to S this clock broke exactly twelve hours prior. Even though S has good reasons to believe it’s noon and S’s belief is true, S does not know it’s noon since S is lucky that her belief is true.

Several notable attempts were made to improve the JTB analysis of knowledge; in particular, some were attracted to the idea that a stronger justification condition would resolve Gettier problems (Shope 1983: 45-108). Thus began the vast literature on the nature of epistemic justification. Others, though disagreeing among themselves about the place of justification in an account of knowledge, sought a solution to the Gettier problem in a new anti-luck condition for knowledge. (The majority of these accounts dropped the justification requirement.) One of these attempts is particularly relevant here. Fred Dretske (1970) and Robert Nozick (1981) proposed accounts of knowledge central to which were a  counterfactual condition, Nozick’s being the more popular of the two. Nozick proposed the following counterfactual as a necessary condition for knowledge (1981: 179): S knows P via a method M only if, were P false, S would not believe P via MP ☐→ ¬B(P)]. This came to be termed the sensitivity condition for knowledge. To satisfy this condition it must be the case that in the closest world in which P is false S does not believe P. That is, S must track the truth of P to know P (where possible worlds are ordered as per their similarity to the actual world).

Nozick’s account enjoyed widespread popularity because of its anti-skeptical capabilities. Following Nozick, I count as knowing that there is tree in my garden since I would not believe that if none were planted there, that is, in the closest world in which there is no tree in my garden (for example, when none is planted there), I do not believe that there is a tree in my garden. Worlds where radically skeptical scenarios are true count as further off since those worlds are more dissimilar to the actual world than the world in which no tree is planted in my garden. That I would believe falsely in those worlds is thus irrelevant. In other words, that I would falsely believe in such a far off world is inconsequential to whether I believe truly in the actual world.

Nozick’s account came with two significant costs, however. Firstly, it cannot accommodate the very intuitive principle that knowledge is closed under known entailment. Roughly, this principle states that if S knows P and S knows that P entails Q then S knows Q. It follows, then, that if I know that I have hands, and I know that if I have hands entails that I am not a handless brain in the vat, then I know that I am not a handless brain in the vat. However, I fail to know that I am not a handless brain in the vat since I would falsely believe I was not a handless brain in the vat in the closest world in which the proposition “I am not a handless brain in the vat” is false (that is, the world in which I am a handless brain in the vat). In other words, the sensitivity condition for knowledge cannot be satisfied when it comes to the denial of radically skeptical hypotheses. Seeing no way to redeem his account from this problem, Nozick (1981: 198ff) was forced into the rather unorthodox position of having to deny the universal applicability of closure as a feature of knowledge.

Secondly, Nozick admits that the sensitivity condition cannot feature as a condition for knowledge of necessarily true propositions as there is no world in which such propositions are false since, by definition, necessarily true propositions are true in every possible world. The scope of the sensitivity condition is thus limited to knowledge of contingently true propositions. That the sensitivity condition cannot, for example, illuminate the nature of our mathematical or logical knowledge makes it less preferable, ceteris paribus, than a condition that can.

At the end of the twentieth century and the beginning of the twenty-first, several authors proposed a novel and relatively similar condition for knowledge that has come to be known as the safety condition, the elucidation of which being the objective here. As the relevant features of the safety condition are presented and explained, the following salient points will emerge. The safety condition is similar to the sensitivity condition in that it too is a modal condition for knowledge. That’s where any significant similarity ends. As shall be demonstrated at length, safety differs from sensitivity in the following ways. Firstly, and most importantly, safety permits knowing the denial of a radically skeptical hypothesis in a manner that maintains the closure principle. This advantage by itself acts as a strong point in favor of the safety condition. Secondly, most formulations of the safety condition are not in the form of a counterfactual. Thirdly, the safety condition is more expansive than the sensitivity condition in that its scope includes knowledge of both necessarily true and contingently true propositions. Lastly, epistemologists since then generally believe the safety condition opens the way to a more enlightened response to skepticism.

2. The Safety Condition as a Necessary Condition for Knowledge

The literature on the safety condition is challenging for even the seasoned philosopher. Seeing that Williamson, Pritchard, and Sosa have developed their thoughts over a lengthy period of time and in a large number of publications, it has become quite a task to keep track of the epicycles in the conceptual development and defense of the safety condition. Additionally, each of its advocates is motivated to formulate the safety condition in a distinct way, where even slight differences in formulation make for significant conceptual divergence. In light of these considerations, it is best to begin with a separate treatment of each author’s presentation of the safety condition before proceeding to an overall elucidation of the safety condition.

a. Timothy Williamson

Williamson (2000) is involved in the project of illuminating several structural features of knowledge. His safety condition is both the result of this project and an integral part of it.

Williamson, in stark opposition to the standard practice in the post-Gettier period, resists being drawn into offering a conceptual analysis of knowledge in terms of non-trivial and non-circular necessary and jointly sufficient conditions for knowledge, a project he thinks is futile given its history of repetitive failures. Knowledge, for Williamson, requires avoidance of error in similar cases. This is to be taken as a schema by which to elucidate the structural features of knowledge. The basic idea, then, is that S knows P only if S is safe from error; that is, there must be no risk or danger that S believes falsely in a similar case. The relevant modal notions of safety, risk, and danger are cashed out in terms of possible worlds such that there is no close world surrounding the actual world in which S falls into error: “safety is a sort of local necessity” (Williamson 2009d: 14). These possible worlds in which S truly believes P act as a kind of “buffer zone” from error and thereby prevent the type of epistemic luck that characterize Gettier cases. In Russell’s case, for example, S does not know that it’s noon since there is a close world in which S falsely believes that it’s noon, for instance, one in which S looks at the clock slightly before or after noon.

Despite Williamson’s opposition to the project of analyzing knowledge into non-trivial necessary and sufficient conditions, it seems clear enough that Williamson should be read as putting forward safety as a necessary condition for knowledge given that he presents the safety condition as a conditional using the appropriate locution only if. And this is how his critics typically interpret him. As Williamson formulates the safety condition in a number of different ways on different occasions, it is impossible to pin down one formulation as representing Williamson’s view. Here is one formulation that will suffice for the time being:

(SF)     If one knows, one could not easily have been wrong in a similar case (2000: 147)

Nevertheless, the manner in which Williamson expresses the safety condition still separates him from those who offer necessary conditions for knowledge. As Williamson emphasizes time and again, the safety condition, as he states it, is to be taken as a circular necessary condition in that whether or not a case β counts as a relevantly similar case to α is in part determined by whether we are inclined to attribute knowledge to the agent in α; safety (reliability in similar enough cases) cannot be defined without reference to knowledge, and knowledge without reference to safety (2000: 100; 2009d: 9-10). Safety, by Williamson’s lights is thus not to be taken as a recipe by which to determine whether or not the agent is to be attributed knowledge in each and every case. Rather, it is a model by which to illuminate the structural features of knowledge and by which we can begin talking about knowledge in individual cases. As Williamson observes, “the point of the safety conception of knowing is not to enable one to determine whether a knowledge attribution is true in a given case by applying one’s general understanding of safety, without reference to knowing itself. If one tried to do that, one would very likely get it wrong” (2009d: ibid.).

b. Duncan Pritchard

Pritchard is attracted to the idea that a conceptual analysis of our concept “luck” will yield sufficient insight from which to build a satisfactory anti-luck condition for knowledge. In other words, with a conceptual mastery of the nature of luck in hand, the problem of epistemic luck can, in theory, be overcome. Pritchard’s safety condition is thus formulated in a manner which reflects his work on the conceptual analysis of luck.

For Pritchard (2005: 128) an event E counts as lucky for an agent S if and only if E is significant to S and E obtains in the actual world but does not obtain in a wide class of nearby possible worlds in which the relevant initial conditions for that event are the same as in the actual world. For example, one is lucky not to be killed by a sniper’s bullet since in the actual world the bullet misses but it does not miss in the close worlds. Importantly, luck comes in degrees. One naturally counts as lucky if the sniper missed by a meter. But one counts as luckier if the sniper missed by a centimeter. The agent counts as luckier in the latter case, claims Pritchard (2009b: 39), because the world in which one is killed in that case is much closer to the actual world than the world in which one is killed in the former case (since the second sniper is better). Close worlds, then, can be roughly divided into two classes—near and non-near, and are to be weighted accordingly—the near close worlds count for more than the non-near close worlds.

The foregoing analysis of luck motivates the following analysis of epistemic luck: S is very lucky that her belief P is true in the actual world if P is false in at least one of the near close worlds. And S is lucky, but not as lucky, that her belief P is true if P is true in the actual world but false in at least one of the non-near close worlds. Stated otherwise, false belief in a very close world is incompatible with knowledge while false belief in a non-near close world is compatible with knowledge. Here is a formulation of the safety condition by Pritchard (2007, 2008, 2009a) as a non-circular necessary condition in a standard account of knowledge:

(SF*)     S’s belief is safe if and only if in most nearby possible worlds in which S continues to form her belief about the target proposition in the same way as in the actual world, and in all very close nearby possible worlds in which S continues to form her belief about the target proposition in the same way as in the actual world, the belief continues to be true.

That is to say, as long as S truly believes P using the same method in all of the very close worlds and in nearly all of the non-near close worlds, S’s belief P is safe. Accordingly, the agent in Russell’s case fails to know that it’s noon since there is a very close world in which she falsely believes that it’s noon, for example, the very close world in which that clock stops slightly before or after noon.

Pritchard’s focus is knowledge of contingently true propositions. However, Pritchard (2009a: 34) claims that to extend the safety condition to handle knowledge of a necessarily true proposition P, which is true in all possible worlds, his safety condition can easily be augmented in such a way as to require that S not falsely believe a different proposition Q in a very close world and nearly all non-near close worlds using the same method that S used in the actual world.

Of the many ways in which Pritchard’s condition differs from Williamson’s, it is evident that they differ in one very important respect: Pritchard’s condition permits relatively few cases of falsely believing P in non-near close worlds. Pritchard is motivated to make this concession since his safety condition is informed by his account of luck, which counts S lucky vis-à-vis an event E even though E occurs in some non-near close worlds. Williamson’s condition, by contrast, has a zero tolerance policy for false belief in any close world.

c. Ernest Sosa

Sosa arrives at his formulation of the safety condition as a necessary condition for knowledge by way of working through some of the fundamental shortcomings he identifies in Goldman’s relevant alternatives condition and Nozick’s sensitivity condition. As Sosa puts it, both Goldman and Nozick failed to adequately capture the way in which the proposition believed must be modally related to the truth of that proposition. For Sosa, an agent S counts as knowing P only if S believes P by way of a safe method or, in Sosa’s words, a safe “indication” or “deliverance.”

Sosa’s formulation of the safety condition differs from both Williamson’s and Pritchard’s in that it is stipulated in the form of the following counterfactual (1999a: 146):

(SF**)       If S were to believe P, P would be true [B(P) ☐→ P]

The following short note on counterfactuals helps explain the logic of (SF**). Firstly, according to Lewis’s semantics for counterfactuals (1973), a counterfactual of the form P ☐→ Q is true at a world W only if some world in which P and Q are true is closer to W than any world in which P is true but Q false. Since Lewis thinks that the closest world to W is W itself, the counterfactual PQ is trivially true at W if P and Q are both true at W. Accordingly, when the antecedent of a counterfactual is true, it follows that P & Q entails P ☐→ Q. Nozick (1981: 176, 680 n.8) finds this result untenable and rejects Lewisian semantics for counterfactuals with true antecedents. Sosa concurs with Nozick. On their semantics for counterfactuals with true antecedents, the counterfactual B(P) ☐→ P is true at a world W if and only if S truly believes P by method M in W and in all close worlds in which S believes P by method M, P is true. It follows, then, that like (SF) and unlike (SF*), Sosa’s condition entails zero tolerance for false belief in any close world.

Secondly, though remarkably similar to the sensitivity condition, (SF**) is not logically equivalent to the sensitivity condition [¬P ☐→ ¬B(P)] since contraposition is invalid for counterfactuals. The following example, from Lewis (1973: 35) demonstrates that contraposition (A→B) ↔ (¬B → ¬ A) is invalid for counterfactuals. Consider the following counterfactual: (A) If Boris had gone to the party, then Olga would still have gone. It should be clear that (A) is not equivalent to its contraposition (B) If Olga had not gone to the party then Boris would still not have gone, because althoughwhile (A) is true (B) is false since Boris would have gone had Olga been absent from the party.

In light of these considerations about counterfactuals, Sosa’s formulation of safety explains why the agent in Russell’s case lacks knowledge. Since there is a close world in which he uses the same method as he does in the actual world at a slightly earlier or later time, namely consulting a broken clock, and thereby comes to falsely believe it to be noon, his belief in the actual world is not safe.

Sosa has since moved on from defending a “brute” safety condition as a necessary condition for knowledge. Sosa (2007, 2009) argues that an agent’s belief must be apt and adroit to count as knowledge, where such virtues differ from safety considerations.

3. Elucidating the Safety Condition

The presentation of the safety condition thus far has been intentionally bare-boned for introductory purposes. This section is devoted to spelling out the finer details or characteristics of the condition, which is a rather challenging task given the presence of some vague patches in the safety literature.

It goes without saying that for epistemic purposes possible worlds W1, …, WN count as relevantly closer to or further from the world W in which S believes P at time T on a case by case basis relative to most or all of the following factors: the belief P, time T, the agent S, and the method M by which S formed the belief P at T in W. In other words, the conditions of belief formation, represented by the set {S, P, T, M}, play a constitutive, though not exclusive, role in a determination of closeness. (With respect to safety, one can either think of these possible worlds as branching possibilities à la Hawthorne and Lasonen-Aarnio (2009) or as concentric circles surrounding a subject-centered world, as Lewis (1973: 149) does in his semantics for counterfactuals.) It follows, then, that the adequacy of the safety condition will turn on, among other things, how close worlds are to be specified, the time of the belief formation, what type of reliability is at play, and how safety theorists understand methods. A foray into these important questions follows.

a. What Counts as a Close World?

Before attempting to answer this important question, four points must be made. Firstly, to satisfy the safety condition it is not the case that in every close world the agent must truly believe the relevant proposition; that is to say, S can safely believe P in W even though there are close worlds in which the agent does not form the belief P, for example, where S does not believe the target proposition in several of the close worlds because S is distracted or preoccupied. For instance, in world W S comes to believe that a car is approaching when S sees a car coming down the road. There may very well be a close world in which S is standing in exactly that same position at that very time but does not form the belief that a car is approaching because S turns her head in the opposite direction to look at a squirrel in a tree. The lack of belief about the approaching car in these close worlds does not prevent S from safely believing in W that a car is approaching. In light of such considerations, it is useful to consider close worlds as divided into two broad categories—relevant and irrelevant—a distinction which will prove important in the discussion on skepticism. below.

Secondly, as Williamson points out (2000: 100), the safety condition is notoriously vague owing to knowledge and reliability being vague concepts. As such it is unlikely that we will arrive at a very determinate answer as to exactly which worlds count as close; our expectations must be lowered. The problems this vagueness generates will become evident in section 4.

Thirdly, as Hawthorne (2004: 56) notes, closeness, as it pertains to safety, cannot be cashed out in terms of the notion of similarity found in counterfactuals. A counterfactual of the form P ☐→ Q is non-vacuously true at a world W only if some world in which P and Q are true is closer to W than any world in which P is true but Q false. When determining the truth conditions for counterfactuals, the history of both the actual world and the close world in which the antecedent is true are held fixed. When it comes to safety, possible worlds with a different history to W can nevertheless count as close, as will become evident below. Additionally, unlike the similarity of two worlds for epistemic purposes, the similarity of worlds for the purposes of counterfactuals need not be agent-relative.

Lastly, it is unclear whether believing a truthvalueless proposition (for example, one that fails to refer) in a close world should count as a knowledge-denying error possibility. Hawthorne (2004: 56) thinks it should since these count as “failed attempts at a true belief.” None of the safety theorists discuss this type of case.

Now on to the four main determinants of closeness:

i. The Time Factor

As far as safety goes, two worlds W and W* may count as similar at a time T with respect to the set {S, P, M} yet count as distant from each another, with respect to that same set, at a time prior to or following T. Consequently, if S falsely believes P in W* at T, then S’s belief P in W at T is unsafe. The following two cases illustrate that for the purposes of safe belief closeness must be understood as indexed to a point in time.

Cases concerning knowledge of the future demonstrate that similarity between two worlds at the time of belief formation trumps dissimilarity at a later time. Suppose, for the sake of illustration, that in a world W at time T (sometime in May 2009) an agent S truly believes that London will host the 2012 Olympics as a result of reading so in a local newspaper. In many possible worlds S similarly believes as much from reading the paper.  Yet in some of these worlds things in 2012 may be radically different from the way things will be in W in 2012 when the Olympics indeed take place in London. For example, in one of these worlds W* the British economy collapses and no Olympics take place in London in 2012. In W* S’s belief at T is thus false. Nonetheless, W and W* may count as close at T despite these significant differences between these two worlds in 2012 given how similar these two worlds are with respect to the set {S, M, P, T} i.e. the details of S’s belief episode at T in which she comes to believe that London will host the 2012 Olympics as a result of reading so in the newspaper.

It is not the case, however, that for a world W* to be close to world W at T it must share a complete history with W up to and including time T. The following case elucidates this point. It is taken for granted that if in W Sally walks into a showroom displaying red shoes under red overhead lights she does not know that there are red shoes on display if there is a close world W* in which there are white shoes on display but which look red under red lights. Notice that W* counts as close to W at T even though they do not share an identical history: at T-N, where N is some duration, the factory owner in W* is placing white shoes on the display shelves and turning red lights on.

Additionally, insisting on shared histories would make safety trivially true in some cases where the target proposition believed is true and concerns the present or the past; namely, were close worlds only those worlds which share complete histories with the actual world until the moment at which the belief is formed, then it would follow that in some cases the proposition believed would be trivially safe, which is an unacceptable consequence. Accordingly, if I recall going to the gym yesterday then I know I went to the gym yesterday only if there is no close world which differs from the actual world with respect to my going to the gym yesterday and in which I falsely believe I went to the gym yesterday.

There is room to think that the conceptual content of “could easily have falsely believed” permits playing around with the time of the belief formation itself. It stands to reason, then, cases of belief formation in a possible world W* which occur shortly before or after the belief formation in W should be factored into knowledge determinations as well. If S forms a false belief in those cases then S’s belief in W is unsafe. The motivation for permitting this flexibility with the time factor is that it allows safety to handle a wide variety of cases in which time is part of the content of the proposition believed, as exemplified by the Russell case. For example, S looks at two people kissing at a new year’s party and forms the true belief that it’s the new year. S does not count as knowing that it’s the new year if there is a close world in which just prior to midnight these two people begin kissing slightly before midnight, as a result of which S falsely believes it’s the new year.

ii. What Type of Reliability does Safety Require?

Reliability, as a property of a belief-forming method, comes in different kinds, two of which are important for the purpose at hand—local and global. The latter refers to a method M’s reliability in producing a range of token output beliefs in different propositions P, Q, R, …, and so forth. A method M is globally reliable if and only if it produces sufficiently more true beliefs than false beliefs in a range of different propositions. For example, M could be the visual process and P the proposition that there is a pencil on the desk, Q the proposition that there are clouds in the sky, and R the proposition that the bin is black. If a sufficiently high number of P, Q, R, … are true then method M is globally reliable. A method M is locally reliable with respect to an individual target belief P if and only if M produces a sufficient ratio of more true beliefs than false beliefs in that very proposition P.

Accounts of knowledge in the post-Gettier period differ with regards to which type of reliability is necessary for knowledge. Nozick and Dretske think only local reliability is needed, McGinn (1999) requires global reliability to the exclusion of local reliability, and Goldman (1986: 47) requires both. Where do Williamson, Pritchard, and Sosa fall on this spectrum? With respect to (SF**), it is evident from the manner in which Sosa formulates his safety condition that he thinks that only local reliability is necessary for knowledge in so far as (SF**) concerns truly believing a specific proposition P; that is, no mention is made of not falsely believing a different proposition Q. Notice, however, that as far as safety goes, Sosa requires that the agent exhibit perfect local reliability; that is, there can be no close world in which S falsely believes P.

Unlike Sosa, Pritchard, in order to handle knowledge of necessarily true propositions, requires global reliability, but a nuanced version thereof. Recall that Pritchard permits some false beliefs in non-near close worlds but has a zero tolerance for false beliefs in the nearer close worlds. Therefore, Pritchard can be classified as requiring perfect global reliability in the near close worlds and regular global reliability in the non-near close worlds. Additionally, both Pritchard and Sosa permit falsely believing P in a close world via a different method than the one used in the actual world.

With regards to Williamson, it is much harder to pin down the type of reliability at work in (SF). As mentioned, Williamson formulates the safety condition in different ways in different places. Some of these formulations clearly advocate for local reliability only, while others incorporate global reliability. And, further still, others push for subtler versions of both. Starting with local reliability, consider this formulation:

(SF1)     “[I]n a case α one is safe from error in believing that [a condition] C obtains if and only if there is no case close to α in which one falsely believes that C obtains” (2000: 126-7).

A condition, for Williamson (ibid.: 52), is specified by a “that” clause relative to an agent and a time. Thus, “S believes that the tree is i inches tall” counts as S believing that a certain condition obtains. According to Williamson (ibid.: 114ff), a typical agent who looks at a tree and believes “that the tree is i inches tall” does not know “that the tree is i inches tall” because there is a close world in which the agent uses that same method and comes to falsely believe “that the tree is i inches tall” when in fact it is i+1 inches tall. Most people are unreliable about the height of trees to the nearest inch because our eyesight is not that powerful; we cannot tell the height of a tree to the nearest inch just by looking at it. This case demonstrates that Williamson requires local reliability since this is a case in which the agent lacks knowledge because there is a close world in which he falsely believes the same proposition using the same method as that used in the actual world. Given that for Williamson safe belief entails a zero tolerance for false belief in a close world, Williamson requires perfect local reliability.

Here is another formulation of the safety condition by Williamson (2000: 124):

(SF2)     “One avoids false belief reliably in α if and only if one avoids false belief in every  case similar enough to α.”

This formulation seems to rule out knowledge in the following case. Pat is pulling cards out of a hat on which sentences are written. Pat pulls the first out and upon reading it truly believes that oranges are fruits. Pat then pulls a second card out and upon reading the sentence written on it falsely believes that America is a province of Australia. Pat’s true belief that oranges are fruits is unsafe because Pat does not avoid false belief in a similar case; that is, Pat could easily have falsely believed a different proposition using the same method in a close world. Because Pat uses a globally unreliable method she lacks knowledge. Given that for Williamson, safe belief entails a zero tolerance for false belief in a close world, Williamson therefore also requires perfect global reliability.

Yet further formulations of safety by Williamson advocate for subtler versions of local and global reliability. Recall that as Pritchard and Sosa present the safety condition, knowing P is compatible with falsely believing P via a different method in a close world. Williamson agrees, but with a caveat:

(SF3)    “P is required to be true only in similar cases in which it is believed on a similar basis” (2009: 364-5).

So for S to safely believe P via M not only must S not falsely believe P in any close world via M, S must also not falsely believe P using a relevantly similar method to M. Williamson extends this principle in a way that results in a non-standard version of global reliability:

(SF4)   If in a case α one knows P on a basis B, then in any case close to α in which one believes a proposition P* close to P on a basis [B*] close to B, then P* is true (2009: 325).

In other words, to safely believe P via M in α it must also be the case that one does not falsely believe P* via M* in a close case. For ease of reference, here is a gloss in the vicinity of Williamson’s conception of a safe belief:

(SF!)  S safely believes P via a method M in world W if and only if there is no close world to W in which:

(i)    S falsely believes P via M or a relevantly similar method M*; or

(ii)   S falsely believes any proposition via M; or

(iii)  S falsely believes a relevantly similar proposition P* using a relevantly similar method M*.

Williamson is thus committed to S knowing P in W at T only if S (SF!)-safely believes P. Since Williamson’s picture of “could easily have falsely believed” is richer than Pritchard’s or Sosa’s, more is needed to be safe from error for Williamson than for the latter two.

There are reasons independent of any of these three authors that suggest that knowledge should require both global and a local reliability. Firstly, the problem of vagueness supports a global reliability formulation of safety as follows. Some vague concepts may have different meanings in different worlds. It follows, then, that sentences with the same words can express different propositions in different worlds even when these worlds are very close (Williamson 1994: 230-4). For example, the property expressed by bald in the actual world might be having less than twenty hairs on one’s head while the property expressed by bald in a close world W might be having less than eighteen hairs on one’s head. If this is the case, then the sentence Pollock is bald expresses different propositions in these two worlds. Hence if Jackson, in the actual world, believes of Pollock that he is bald (Pollock having nineteen hairs on his head) then his belief will turn out to be unsafe as there is a close world, namely W, in which Jackson falsely believes of Pollock that he is bald. In cases such as these, for an agent to know P via M it must be the case that the agent could not easily have falsely believed P* via M (where P* counts as a different proposition in that close world).

Knowledge of propositions with singular content requires safety to be formulated in a globally reliable way. Consider the case in which Jones, looking at a real barn surrounded by fake barns, forms the true belief that “that is a barn.” The intuition is to deny Jones knowledge despite the fact that there is no close world in which that very barn is not a barn (assuming that a barn is essentially a barn). Since Jones could easily have falsely believed of a fake barn that “that is a barn,” which expresses a different and false proposition, Jones is denied knowledge.

iii. Methods

Methods can be individuated in a variety of ways: internally or externally, and in a coarse-grained or fine-grained way.

A way of individuating methods is internal if it respects the constraint that agents who form a belief P and who are internal duplicates share the same method; and external if it does not respect that constraint. Alternatively, if method individuation supervenes solely on brain states, then methods are internally individuated; if two agents can be in the same brain state yet be using different methods, then methods are individuated externally.

A way of individuating methods is coarse-grained if methods are described broadly or generally for example, the visual method. On the other hand, a way of individuating methods is fine-grained if methods are described in detail for example, the visual method for large objects at close range under favorable lighting conditions. As the degree of detail to which a method can be described is a parameter along a continuous spectrum, fine-grained and coarse-grained individuation permit of a wide range of generality or detail. Specifying the relevant detail for each method is known as the generality problem for reliabilism. Given that reliably believing is part of safety, safety faces the generality problem, something Williamson acknowledges (2009: 308).

Nozick (1981: 233) argues for an accessibility constraint on method individuation; that is, regardless of how methods are individuated, a difference in methods must always be accessible to the agent. It is evident, then, that an accessibility constraint is in tension with both external and fine-grained individuation since, ex hypothesi, neither the difference between seeing and hallucinating nor the difference between two finely-grained methods would be detectable by the typical agent.

Williamson and Pritchard deny such an accessibility constraint, thereby opening the way for external, fine-grained individuation of methods. For Williamson the accessibility constraint assumes that methods are a luminous condition, where a luminous condition is defined as a condition such that whenever it obtains the agent is in a position to know that it obtains (Williamson 2000: 95). But, as Williamson (ibid.: 96-8) argues, no non-trivial condition is luminous. Therefore the accessibility condition should be disregarded.

Pritchard (2005: 153) argues that safety will get the wrong result in some cases unless the accessibility condition is dropped because agents are fallible when it comes to determining which methods they use. For example, S might incorrectly think that she believes P via method M when in fact she believes it via M*. In some cases M delivers safe belief while M* doesn’t. Were the relevant method for a determination of safety the method the agent considers to be the one by which she believes, safety would get the wrong result in such cases.

One further argument against the accessibility condition is that it generates an infinite regress: S must be aware of which method she uses to believe P, the method she used to determine that, the method she used to determine that, and so on. Although these three arguments do not entail that internal and coarse-grained individuation are unsustainable, they do show that one reason in favor of such positions is unpromising.

We typically talk about methods or bases of belief in a coarse-grained way.  Williamson, however, adopts a fine-grained, external individuation of bases. For example, Williamson (2009b: 307, 325 n.13) thinks that, other things being equal, seeing a daschund and seeing a wolf count as different bases; believing that one is drinking pure, unadulterated water on the basis of drinking pure, unadulterated water from a glass is not the same basis as believing as much when drinking water from a glass that has been doctored with undetectable toxins by conniving agents; believing that one was shown x number of flashes after drinking regular orange juice does not count as the same basis as believing that one was shown x number of flashes after drinking a glass of orange juice with a tasteless mind-altering drug; and, finally, believing that S1 is married by looking at S1’s wedding ring and believing that S2 is married by looking at S2’s wedding ring count as different methods if S1 reliably wears her ring and S2 does not.

Williamson is inclined towards external, (super) fine-grained individuation of methods owing to his position vis-à-vis luminosity and skepticism. Regarding the former, in some cases the circumstances of a case can change in very gradual ways that the agent fails to detect such that at the start of the case the basis of belief is reliable while unreliable at the end of the case. Consider, for example, a case in which I see a pencil on a desk in front of me under favorable conditions. Assumedly I know that there is a pencil on the desk. I then begin to gradually walk backwards from the desk all the while keeping my eyes on the pencil until I reach a point at which it appears as a mere blur in the distance. At that point beliefs I form based on vision are no more than guesses. At each point in my growing distance from the desk my visual abilities start deteriorating slowly such that at some indiscernible point my eyesight no longer counts as reliable with respect to the pencil. Were bases of belief individuated in an internal, coarse-grained manner such that my looking at the pencil close-up and my looking at the pencil at a distance count as the same method, then I would fail to know that there is a pencil on the desk when close to the table since there is a close world in which I look at it from a distance and form a false belief that there is pen on the desk, which is intuitively the incorrect result. Consequently, minimal changes in the external environment result in a difference in the basis of belief formation.

iv. Skepticism

One of the selling points of safety is that it, unlike the relevant alternatives and sensitivity conditions, permits one to know the denial of skeptical hypotheses, thereby maintaining closure. Here is the skeptical argument from closure:

(1)   I know I have hands.

(2)   If I know I have hands then I know I am not a brain in the vat.

(3)   I don’t know that I am not a brain in the vat.

This triad is inconsistent because, claims the skeptic, one cannot know the denials of skeptical hypotheses; that is, one cannot know that one is not in the bad case (the denial of (3)). In other words, if I know I have hands, then by closure I should know I am not a handless brain in the vat. But, claims the skeptic, one is never in a position to know that one is not a handless brain in the vat. It follows, then, that I do not know that I have hands.

Pritchard (2005: 159) claims that if one is in the good case then one sees that one has hands based on perception. In the bad case one does not see that one has hands; rather, one is fed images of hands. As a result of this difference in method, the bad case automatically counts as irrelevant since only those cases in which one forms beliefs based on veridical perception count as relevant: “only those nearby possible worlds in which the agent formed her belief in the same way as in the actual world are at issue” (ibid. 161). Since, by definition of the cases, the brain in the vat is not using the same method as the agent in the good case, one can consequently know the denial of the skeptical hypothesis entailed by one’s knowledge of everyday propositions since there is no close world to the good case in which one falsely believes the denial of the skeptical hypothesis.

Williamson resists skepticism by exposing and undermining those claims that tempt us towards (3); namely the idea that a brain in the vat and the agent in the good case have exactly the same evidence. According to Williamson (2000: 9) “one’s total evidence is simply one’s total knowledge.” Since the agent in the good case has good evidence and the brain the vat has bad evidence, this constitutes a sufficient dissimilarity between the cases. Therefore, the false belief in the bad case counts as irrelevant to true belief in the good case. Alternatively, Williamson can be read as saying that individuating methods externally and in a fine-grained manner leads to the conclusion that believing truly on the basis of good evidence is sufficiently dissimilar to believing falsely on the basis of bad evidence (ibid.: 169). The epistemic impoverishment of the brain in the vat is thus irrelevant. Williamson (2009d: 21) has made the following further claim:

The idea is that when one knows p “on basis b,” worlds in which one does not believe p “on basis b” do not count as close; but knowing “on basis b” requires p to be true in all close worlds in which one believes p “on basis b;” thus p is true in all close worlds. In this sense, the danger from which one is safe is p’s being false, not only one’s believing p when it is false.

Thus the bad case counts as far off because in the bad case P is false. This difference between the good and bad cases constitutes a sufficient dissimilarity to permit one to know in the good case.

Since Sosa is not as explicit about how he builds methods into his safety condition, all three strategies are compatible with what he says. For example, he sometimes talks as if the bad case is far off (1999a: 147; 2000: 15), while at other times (1999b: 379) he can be read as thinking that even if it were close it would be irrelevant because the agent is using a different method in that case.

There are thus three different strategies a safety theorist can employ to oppose skepticism:

(i)  Since the agent in the bad case uses a different method from the agent in the good case, the bad case is sufficiently dissimilar from the good case and thus does not count as close;

(ii) The bad case counts as close to the good case yet is irrelevant given that the agent in the bad case uses a different method from the agent in the good case;

(iii) While the agents in the good and bad cases use the same method, the bad case counts as far off given the overall dissimilarities between it and the good case;

The safety condition is therefore a powerful tool against skepticism. For skepticism to be an appealing theory the skeptic would have to provide some reason for thinking that in  every case α involving an agent S, method M, time T, and proposition P, there is a close  and relevant case β in which a skeptical hypothesis is true such that S could easily have failed to be locally or globally reliable in α with respect to P at T (where the definitions of local and global reliability differ depending on which safety theorist is in question).

b. How do the Safety and Sensitivity Conditions Differ?

Given that the sensitivity condition for knowledge enjoyed such prominence, it is important to determine how the safety condition differs from it. Such a comparison will shed light on some virtues of the safety condition relative to the sensitivity condition.

In some cases sensitivity is the more stringent condition, while in others safety is. The following two points of logic elicit the difference between the safety and sensitivity conditions. When it comes to cases concerning knowledge of the denial of skeptical hypotheses, the safety principle is less demanding than the sensitivity principle. The latter principle requires that the agent not believe P in the nearest possible world in which P is false. As such no agent can know the denial of skeptical hypotheses by the simple sensitivity test, for example, I am not a brain in the vat, because in the nearest possible world in which the agent is a brain in the vat the agent continues to believe (falsely) that he is not a brain in the vat. So while agents typically satisfy the sensitivity condition with respect to everyday propositions and thus count as knowing many everyday propositions, they cannot satisfy the sensitivity condition with respect to the denial of skeptical hypothesis. Hence the incompatibility of the sensitivity condition and single-premise closure, for knowledge of everyday propositions entails knowledge of the denial of skeptical hypotheses incompatible with those propositions.

The safety principle, however, is compatible with single-premise closure for it permits knowing the denial of skeptical hypotheses. By the safety principle I count as knowing the everyday proposition P “that I have hands” by method M only if I safely believe P. It follows, then, that if I safely believe P then there is no close world in which I am a brain in the vat and am led to falsely believe that I have hands by M (as explained in the previous section). Consequently, if I know that I have hands and I know that that entails that I am not a brain in the vat, then I know that I am not a brain in the vat.

On the other hand, cases can be constructed in which safety is more demanding than sensitivity. Consider the following case: S truly believes P via M in the actual world but (i) in the closest world in which P is false S does not believe P, and (ii) there is a close world in which S falsely believes P via M. In this case S satisfies the sensitivity condition but fails to satisfy the safety condition. A case by Goldman (1986: 45) can be used to illustrate this point. Mary has an unreliable thermometer in her medicine cabinet which she uses to measure her temperature. It just so happens to correctly read her temperature of 38°C in this case. However, in the nearest world in which her temperature is not 38°C and she uses this thermometer to take her temperature, she is distracted by her son and she doesn’t form any belief about her temperature. She accordingly satisfies the sensitivity condition for knowledge since she does not believe P in the nearest world in which P is false. However, there is some other close world in which she uses this thermometer to take her temperature and forms a false belief thereby. Mary thus fails to satisfy the safety condition. It follows, then, that the following pair of conditionals are false:

If S safely believes P then S sensitively believes P.

If S sensitively believes P then S safely believes P.

The logic of these conditionals makes explicit the respects in which safety is similar to and different from the sensitivity condition.

c. The Semantics of Safety

In a non-epistemic context it is easy to see that “safe” can function as a gradable adjective. For instance, if S has three paths to choose from to get to her destination, it is perfectly acceptable to say that although path X is safe, Y is safer, and that path Z the safest of the three paths. “Similarity” also comes in degrees: London is more similar to Manchester than to Kabul. Possible worlds can thus be closer to or further from the actual world on a sliding scale of similarity. S’s belief P, therefore, can be safer than S’s belief Q. Although “safe” is a gradable adjective, the safety condition is not presented within the framework of a contextualist semantics for “knowledge,” where, roughly speaking, contextualism about ”knowledge” is the claim that the truth conditions of the proposition “S knows P” depend on the context of the attributer. In other words, “knowledge” picks out different relations in the different contexts of attribution where said contexts are a function of the varying interests of the attributer, not the possessor, of knowledge. Contextualism has gained its popularity through, among other things, its proposed solution to the skeptical challenge from closure. Sosa, Williamson, and Pritchard are all standard invariantists about the semantics of knowledge, invariantism being the denial of contextualism. (See Williamson (2009d: 18) for two different ways in which the gradability of safety can be accommodated without adopting a contextualist semantics for “know.”) If one’s main concern is skepticism, then the safety theorist has no need for a contextualist semantics for “knowledge” given the three strategies available to them for opposing skepticism (listed above). Nonetheless, it is easy enough to see how one could model the safety condition along contextualist lines if one had independent reasons for adopting a contextualist semantics for “knowledge”—those factors that weigh in on the similarity measure of close worlds will be those salient to the attributer, not the agent.

4. Safety in Action

To get a better feel for how the safety condition works, it proves beneficial to undertake an exercise in seeing how safety handles some of the troubling cases in the literature. Obviously each case can be modified in such a way as to make things harder or easier for the safety theorist. For present purposes such cases will be ignored.

a. Gettier and Chisholm

Jones is told by his boss that Smith will get the promotion. Jones then sees Smith putting ten  coins in his pocket. Jones accordingly infers that the man who will get the promotion has ten coins in his pocket. However, Jones (not Smith) gets the job and Jones just so happens to have ten coins in his pocket. According to Gettier (1963) Jones’s belief does not amount to knowledge. How does the safety condition handle this case? Jones’s belief is unsafe because there are close worlds in which (a) in which Carter gets the job but has no coins in his pocket, or (b) in which Jones get the job but has nine coins in his pocket.

The same reasoning applies to Chisholm’s (1977) case in which Jones believes that there is a sheep in the field upon seeing a fluffy white animal in the distance. However, while what Jones sees is a white dog there is indeed a sheep in the field lying behind a rock hidden from Jones’s sight. According to Chisholm, Jones doesn’t count as knowing that there is a sheep in the field. The safety condition captures this intuition. Jones’s belief is unsafe because there is a close world in which there is no sheep behind the rock and Jones falsely believes that there is a sheep in the field; that is, the method of inferring the presence of sheep by seeing dogs is unreliable.

b. Fake Barns

Jones is in an area with many fake barns. Jones sees a real barn in the field and forms the belief that there is a barn in the field. Does Jones know that there is a barn in the field? Prima facie, Jones’s belief counts as unsafe as there is a close world in which he looks at a fake barn and falsely believes that it is a (real) barn.

However, this case turns out to be a little harder to explain because the details of the case can be manipulated into yielding bizarre intuitions in similarly structured cases (Hawthorne and Gendler 2005). What if, for example, Jones would not have come across such a fake barn because he wasn’t in walking distance of it? The permutations of the standard setup of this case abound (see for example, Peacocke (1999: 324), Neta and Rohrbaugh (2004: 399), Comesaña (2005: 396), and Lackey (2006)). Similar permutations can be made for the Gettier and Chisholm cases, for example, where circumstances are such that the person who gets the job in all close worlds has ten coins in his or her pocket or that in all close worlds there is a sheep behind the rock.

This is one of those cases that manifests the vagueness present in the safety condition. As Williamson (2000: 100; 2009b: 305) indicates, there will be cases in which whether or not one thinks that there is a close world in which the agent falsely believes depends on whether or not one is inclined to attribute knowledge to that agent in that case; the vagueness in “relevantly similar,” “reliable,” and “knowledge” knowledge determinations in some cases notoriously difficult. Accordingly, the direction of one’s intuitions about whether or not Jones knows in each permutation of these cases will influence whether or not one thinks Jones has a false belief in a close world, and vice versa.

There is one significant permutation of this case that requires attention. Suppose the details of the case remain identical except that instead of forming the belief P that there is a barn in the field, Jones forms the belief Q that that is a barn (Hawthorne 2004: 56). Recall that Q is a singular proposition but P is not, where, roughly, a singular proposition is one that is constitutively about some particular. Sosa would have to find other reasons to deny Jones knowledge in this case, if he thinks he lacks it, given that his safety condition requires local reliability and true singular propositions are true in all close worlds. According to Williamson and Pritchard, Jones lacks knowledge in this case because there is a close world in which Jones looks at a fake barn and his belief that that is a barn expresses a different and false proposition.

c. Matches

Jones is about to light a match and forms the belief that the match will light once struck since all dry matches of this brand that he has struck have lit after being struck. However, the match doesn’t light because it was struck but rather does so because of some rare burst of radiation (adapted from Skyrms 1967: 383). Stipulate further that in all close worlds the match lights by friction. Is Jones’s belief safe?

The safety theorist seems drawn into denying knowledge in this case because there is a sense in which Jones is still lucky, in an epistemically malignant way, that his belief is true. When described in this way, this case is a stronger version of many of the Gettier cases mentioned so far because Jones’s belief is true by luck in the actual world but not so in any close world. Such cases would demonstrate that safety is not necessary for knowledge.

One way around this difficulty would be via Williamson’s claim that worlds which differ as far as trends go count as far off (see below B(i)). Hence, only worlds in which the match lights by a freak burst of radiation count as close. If worlds are ordered in this way, the example is presented in a flawed way that incorrectly indicates a problem for safety. Since the match lights in all those close worlds via radiation, Jones knows that his match will light.

5. Problems for Safety

As epistemologists ponder the details of the safety condition, it is to be expected that some will identify what they perceive to be its weaknesses or its failures. This section is devoted to three problem areas for safety.

a. Knowledge of Necessarily True Propositions

A necessarily true proposition is one which is true in all possible worlds. One might think, therefore, that knowledge of such propositions presents a problem for safety since there can be no close world in which S falsely believes such propositions. It should be clear at this point that this is a problem primarily for Sosa since his condition requires local reliability only; that is, not falsely believing P in close worlds. In other words, the counterfactual B(P) ☐→ P will be trivially true for any proposition P which is necessarily true. So knowledge of necessarily true propositions is going to be a problem for any account of knowledge that requires local reliability only.

Williamson and Pritchard have no such problems with knowledge of necessary truths since both require global reliability. There are cases that demonstrate that the method used to believe a necessarily true proposition can be globally unreliable. For example, suppose I use a coin to decide whether to believe 42 x 17 = 714 or to believe 32 ÷ 0.67 = 40, where I have no idea which is true without the use of a calculator. If the coin lands in such a way indicating that I should believe the first, which is necessarily true, then I am lucky to believe the necessary truth and not the necessary falsehood. I consequently do not know that 42 x 17 = 714 as I could just have easily have falsely believed the different proposition expressed by  32 ÷ 0.67 = 40.

b. Knowledge of the Future

The following lottery puzzle is particularly troublesome for safety. On the assumption that a proposition about a future state of affairs is either true or false, we take ourselves to know many things about the future, for example, that the Lakers game is next Tuesday, or that the elections will be held next month. This being the case, intuitively at least, Suzy knows that she won’t be able to afford to buy a new house this year. On the other hand, we deny that Suzy knows that her lottery ticket will lose (even if the draw has already taken place and Suzy has not yet learnt of the draw result). This state of affairs, however, presents the following puzzle: assuming single-premise closure true, if Suzy knows that she won’t be able to afford to buy a new house this year, and knows that this entails that her ticket is a loser, then Suzy should be in a position to know that her ticket will lose (by deduction). But it is commonly held that agents do not know that their lottery tickets will lose. (The aptness of this intuition is often demonstrated by the impropriety of flatly asserting that one knows that one’s ticket will lose, or selling one’s ticket for a penny before learning of the draw results.) The intuitive pull of single-premise closure is in tension with intuitions about what can be known about the future and about lottery tickets.

Problems involving lotteries generalize (Hawthorne 2004: 3). For instance, we are willing to say that Peter knows that (P) he will be living in Sydney this coming year. Yet we are hesitant to say that Peter knows that (Q) he won’t be one of those unfortunate few to be involved in a fatal car accident in the coming months. Assuming single premise closure true, if we are willing to attribute to Peter knowledge of P, and Peter knows that P entails Q, we should then be willing to attribute Peter knowledge of Q.

One way of explaining why agents do not know that their lottery tickets will lose or that they won’t die in unexpected accidents is that both events have a non-zero objective probability of occurring. That is, events with a non-zero probability of occurring can occur in close worlds. Naturally, then, one might think that the world in which one’s lottery ticket wins or in which one dies from an unexpected motor accident is close and that therefore one’s beliefs that one will lose the lottery or not die in an accident are unsafe.

This line of thinking is devastating for safety, however, as it would effectively rule out knowledge of any propositions the content of which regards the future since, assuming indeterminism true, there is a non-zero probability that any proposition about the future will be false; that is, for any true proposition P about the future there will be a close world in which P is false and one believes P. If safety leads directly to skepticism about knowledge of the future this would be a good reason to give up safety.

One line of thought for a safety theorist to pursue in response to this problem is to support the following high-chance-close-world principle (HCCW): if there is a high objective chance at T1 that the proposition P believed by S at T1 will be false at T2 given the state of the world at T1 and the laws of nature, then S does not know P at T1 as P is unsafe (even if P is true). The thinking behind this response is that if there is a high chance of some event occurring then that event could easily have occurred, which indicates that there is a natural connection between high chance and danger. For instance, if there is a high objective chance that the tornado will move in the direction of Kentucky, then it seems natural to say that Kentucky’s inhabitants are in danger.

Hawthorne and Lasonen-Aarnio (2009) demonstrate that HCCW presents some rather unwelcomed problems for the safety theorist. Firstly, HCCW is in tension with knowledge by multi-premise closure. Suppose, by way of example, that at T1 a subject S knows a range of chancy propositions P, Q, R, … about the future; that is, there is no close world in which any of those propositions are false. That said, while there may be a low probability for each proposition in that set that it will be false, for a sufficiently high number of propositions the probability at T1 that the conjunction of {P, Q, R, …} will be true at T2 will be very low . Accordingly, the probability of the negation of {P, Q, R, …} is very high at T1. By the lights of HCCW there will then be a close world in which that conjunction is false. Therefore, while an agent may know each conjunct in a set of chancy propositions about the future, the safety theorist who is committed to HCCW must deny that the agent knows the conjunction of those propositions. HCCW is therefore incompatible with multi-premise closure.

HCCW also creates problems for single-premise closure. Consider Plumpton who is about to begin a significantly long series of deductions from a true premise P1 towards a true conclusion PN. Suppose that at every step there is a significantly low objective probability that Plumpton’s deductive faculty will misfire leading him towards a false belief. If the chain is sufficiently long then there will be a high enough probability that the belief at the end of Plumpton’s deductive chain will be false, in which case, by HCCW, such a possibility counts as close. If closeness of worlds is cashed out in terms of HCCW, then Plumpton does not know PN if he deduced it from PN-1, which is effectively the denial of single-premise closure for whenever the chance that the next step will be false is high enough (for example, the step leading from PN-1 to PN in Plumpton’s case) the deduction from that previous step will be ruled out as unsafe. The same problem arises for knowing a proposition at the end of a very long testimony or memory chain when there is a non-zero objective probability that the process will go astray at any given link of the chain.

Moreover, HCCW also struggles to explain the inconsistency of why, in some cases, we do attribute knowledge to agents concerning events with substantially low probabilities of occurring while in some case we do not. For instance, we are happy to say, following Greco (2007) and Vogel (1999), that a veteran cop knows that his rookie partner will fail to disarm the mugger by shooting a bullet down the barrel of the mugger’s gun, or that not all sixty golfers will score a hole-in-one on the par three hole, or that this monkey will not type out a copy of War and Peace if placed in front of a computer. Yet it is common to deny knowledge in the lottery case where the chances are substantially lower.

The safety theorist, therefore, owes us some story about how close worlds calibrate in cases involving objective chance.

i. Williamson’s Response

Williamson denies that there is a straightforward correlation between safety and objective probability. When it comes to knowledge there are two conceptions of safety that one can have—a no risk conception or a small risk conception. Williamson (2009d) rejects the latter owing to the way we use the concepts of safety and danger in ordinary, non-epistemic contexts. By way of argument, Williamson (ibid.: 11) asks us to consider the following two valid arguments that involve the use of our ordinary, non-epistemic concept “safe,” where the context is held fixed between premises and conclusion:

Argument Asafety S was shot
───────────────────────
S was not safe from being shot

 

Argument Bsafety S was safe from being shot by X
S was safe from being shot by Y
S was safe from being shot by Z
S was safe from being shot other than by X, Y, or Z
───────────────────────────
S was safe from being shot

Williamson then asks us to consider which of the two competing conception of safety (the “small risk” or “no risk”) secures the validity of these arguments by plugging in these conceptions in the relevant premises and conclusions:

Argument Asmall risk S was shot
─────────────────────────
S’s risk of being shot was not small  

 

Argument Bsmall risk S’s risk of being shot by X was small
S’s risk of being shot by Y was small
S’s risk of being shot by Z was small
S’s risk of being shot other than by X, Y, or Z was small
───────────────────────────────────────
S’s risk of being shot was small

 

Argument Ano risk S was shot
───────────────────────────
S was at some risk of being shot

 

Argument Bno risk S was at no risk of being shot by X
S was at no risk of being shot by Y
S was at no risk of being shot by Z
S was at no risk of being shot by other than X, Y, or Z
──────────────────────────────────────
S was at no risk of being shot

With regards to the “small risk” conception of safety, the argument A small risk is invalid since even events with a small risk of occurring in a world W do occur in W, for example, lottery wins. Argument B small risk is invalid because small risks add up to large ones. On the other hand, the “no risk” conception of safety fairs much better for these reasons. Since S was shot in some world close to W, and W being the closest world to itself, S was at some risk of being shot, which demonstrates the validity of Argument A no risk. This explains why S is not safe from being shot in W at a time T. Similarly, Argument B no risk is valid since if S was not shot by X in any close world to W at T, and so on with respect to Y, or Z or anyone else, then there is no close world in which S was shot. This exercise with the ordinary conception of safety demonstrates that the ordinary conception thereof is not in terms of small risk or probability. (Peacocke (1999: 310-11) likewise understands the concept of safety in this way: “The relevant kind of possibility is one under which something’s not being possible means that in a certain way one can rely on its not obtaining” (original emphasis).) Therefore, argues Williamson, the notion of a safe belief is not one correlated with probability.

In light of this divergence between safety and probability, one counts as safely believing a conjunction, by Williamson’s lights,  if and only if one safely believes the conjunction on a basis that includes safely believing each conjunct. Similarly, if one safely believes P and safely believes PQ, then one safely believes Q if and only if the basis on which one believes Q includes the basis on which one believes P and PQ, for in that case there will be no close world in which one believes Q and Q is false. It stands to reason then, that there will be cases in which S safely believes P and safely believes PQ, yet does not safely believe Q since the basis on which S believes the latter does not include the basis of the former two beliefs. One must safely derive that which is entailed by what one already safely believes before one counts as safely believing the entailment: “We might say that safe derivation means that one makes a ‘knowledgeable’ connection from premises to conclusion, rather than that one knows the connection” (Williamson 2009d: 27).

Given these arguments, Williamson (ibid.: 19), demonstrates that in some cases knowing and objective probability dramatically diverge. For example, suppose I designate the winning lottery ticket “Lucky” and then believe that Lucky will win the lottery (where “Lucky” is a rigid designator). Nonetheless, I count as knowing in advance that Lucky will win despite each ticket having the very same low probability of winning.

For these reasons the cases involving knowledge of risky propositions do not bother a no risk conception of safety so long as one safely believes the conjunction on a basis that includes the bases on which one safely believes each conjunct. The same applies to very lengthy derivations. It stands to reason, then, that Plumpton knows PN despite there being a very high objective probability that PN is false. And so long as there is no close world in which one falsely believes a proposition P about the future, then one safely believes P despite there being a non-zero-probability that P is false, for example, that no monkey will type out War and Peace, that not all sixty golfers will score a hole-in-one, or that the rookie will not disarm the mugger. With respect to knowledge of the future, Williamson (2009c: 327) writes that “the occurrence of an event in β that bucks a relevant trend in α may be a relevant lack of closeness between α and β, even though the trend falls well short of a being a strict law.” Trends are further indicators of closeness between cases. So in a case α an agent S can be in a position to know a proposition P about the future even though there is a non-zero probability that P will be false since the case β in which it is false is sufficiently distant from α owing to P’s being false in β bucking a trend in α.

Matters involving lottery puzzles remain troublesome for Williamson, however. In the cases where the known proposition entails a risky proposition about the future for example, that one will be healthy for the rest of the year, Williamson is happy to admit that one does safely believe that risky proposition given the divergence between safety and small risk (as explained above). However, this seems to indicate that Williamson is happy to permit that one can safely infer that one’s lottery ticket will lose, which is problematic since it contradicts a widely-held intuition and goes against Williamson’s prior commitment in print that one does not know that one’s ticket will lose (2000: 117, 247). In conversation Williamson has made two salient remarks in response to these points. First, he still maintains that one does not know one’s ticket will lose when this belief is formed on the basis of reflecting on the low odds of it winning. He is open to one’s knowing that one’s ticket will lose by other bases of belief, for example, safe derivation from known propositions about the future. So in some lottery puzzles Williamson will concede that one can know that one’s ticket will lose. Second, Williamson has emphasized that lottery puzzles are unstable since one readily attributes knowledge about the future only to retract it when the lottery entailment becomes salient. Since Williamson’s concerns are the structural features of knowledge, he is not overly perturbed by problems generated by specific cases which rest on very unstable intuitions.

ii. Pritchard’s Response

Pritchard (2008: 41; 2009: 29), like Williamson, argues that the relationship between objective probability and safety is not one of direct correlation but motivates his claim using intuitions from a different lottery case. We say that S does not know by reflecting on the extremely long odds of winning a lottery that her ticket will lose (even if the draw has already occurred and S is unaware of the results) but that S does know that it lost from reading the result in the newspaper. This is a somewhat surprising result given that the objective probability of being wrong in the former case is lower than in the latter case since newspapers do sometimes print errors. Were closeness determined according to the HCCW principle, the the intuitions should be the converse. Safety, argues Pritchard, captures the intuitions in this case since the world in which one wins a lottery is very much like the actual world since all that differentiates the two worlds in this context is a bunch of balls falling differently. That is why one cannot know that one’s ticket will lose. However, given the copious editing processes at newspapers, quite a bit would have to go wrong for there to be a printing error.

Using this understanding of closeness Pritchard believes he can answer the lottery puzzles Hawthorne raises. Pritchard contends that we are mistaken in thinking that these are puzzles because our intuitions are being misled by a lack of detail in the presentation of the cases (ibid. 43-8). If S has a lottery ticket in his pocket for a draw taking place tomorrow, Pritchard claims that we ought to resist attributing knowledge to S that he won’t have enough money to go on a safari this year since the world in which he wins a major prize in the forthcoming lottery is close. In such a case, argues Pritchard, the agent also does not know that his ticket will lose. Conversely, if S does not have a lottery ticket in hand, then S knows he won’t go on safari and knows that he won’t win the lottery. Either way closure is preserved.

In a similar fashion Pritchard argues away some of the other lottery-like puzzles. If we are told that S is a healthy person then we are prone to affirm that S knows that S will be living in Wyoming this coming year and that S knows that S won’t have a heart attack since the world in which S, a healthy person drops dead from a totally unexpected heart attack, is far off. Likewise, if we are told that S has very high cholesterol, then we will deny that S knows that S will be living in Wyoming this year and that S won’t have a heart attack. Closure is maintained in both cases.

Some might have reservations about the adequacy of Pritchard’s response, however. It is a matter of differing intuitions whether or not there is a relevant difference between the actual world and worlds in which perfectly healthy people die from a sudden and unexpected heart attack or are involved in a fatal car accident. If such worlds are relevantly similar to the actual world, then such worlds should accordingly count as close on Pritchard’s line of thought. Therefore, contrary to Pritchard, such agents should be denied knowledge of their future whereabouts. The same line of reasoning can be applied to the lottery and newspaper case; the world in which the type setting machine prints an error owing to a technical glitch is much closer to the actual world than the world in which the seven balls corresponding to the numbers on one’s lottery ticket fall into the dedicated slots because much less has to change in the former case than in the latter case. If closeness of worlds is determined by how much the two worlds actually differ on the details of the case, then one ought to be unable to know stuff by reading the newspaper, which is an untenable result. Finally, it is also unclear how Pritchard’ strategy can handle the troublesome cases involving multi-premise closure that Hawthorne and Lasonen-Aarnio describe. The world in which Plumpton makes a mistake in the very long deductive chain he is about to embark upon seems very similar to the actual world. A natural reading of “at each step the chance that he will make a mistake is exceedingly low but that he will make a mistake overall exceedingly high” is that the two worlds are very similar; not much change is required for Plumpton to make a mistake somewhere along the way.

Despite these concerns, the disparity between closeness and objective probability that Pritchard is urging does seem to handle the Vogel and Greco cases quite well. Events in the actual would have to change significantly for sixty golfers all to score holes-in-one, or for the rookie to disarm the mugger, or for the monkey to type War and Peace.  The angles of the club face, timing, ball spin, wind speeds, strength of swing, and so forth. would all have to somehow fall together in such a way on sixty different occasions for all sixty golfers to succeed in scoring holes-in-one. Similar thoughts apply to the rookie and monkey cases.

c. Safety and Determinism

The safety theorist argues that if S knows P then S could not have easily been wrong. Suppose, for the sake of argument, that our world is a deterministic world  in the sense that the state of the world at TN is determined by the state of the world at TN—1 plus the laws of nature.. In what sense, then, could S have easily gone wrong since, if determinism is true, S could not but have believed P truly? Williamson (2000: 124, 2009: 325) argues that “determinism does not trivialize safety.” Williamson demonstrates this point by way of an example of a ball balanced on the tip of a cone. Such a ball, even in a deterministic world, is not safe from falling because, argues Williamson, the initial conditions could easily have been different such that the ball falls. By the “initial conditions” he means “the time of the case, not to the beginning of the universe” (2000: 124).

The suggestion, then, seems to be that in a case α in a deterministic world W, S safely believes P if and only if had the initial conditions of the case been slightly different S would still have truly believed P. What remains unclear, however, is why Williamson says that only the initial conditions of the case need to be changed and not the initial conditions of the universe, for, after all, altering the initial conditions of the case in a deterministic world can only be achieved if one alters the initial conditions of the universe itself. So altering the initial conditions of the case necessitates altering the initial conditions of the universe. In addition, on some conceptions of determinism, small scale changes at the beginning of the universe generate large-scale changes further down the chain of events. Consequently, it is unclear whether altering the initial conditions of the universe will generate sufficiently similar cases in which S falsely believes P. Finally, it seems somewhat odd to say that even if the actual world is a deterministic world, then even though I am currently typing in Oxford I could just have easily been hunting bears in Mongolia since a slight alteration in the initial conditions of the universe would have resulted in my being a bear hunter.

One maneuver a safety theorist can make in response to the foregoing difficulties is to adopt a move Lewis makes in his work on the semantics of counterfactuals. Suppose a world W is a deterministic world and in a case α in W S truly believes P at T. The safety theorist could argue that S safely believes P at T in W if and only if had there been a small miracle at T or some time shortly before T such that different conditions prevailed in a case β very similar to α, S truly believes P. (Williamson has raised such an option in conversation.)

Some might be wary of such a metaphysics since they would assume that miracles are not the kind of things we want in our ontology or epistemology. So it appears that unless the safety theorist wishes to adopt a somewhat unorthodox metaphysics, safety, despite Williamson’s insistence to the contrary, is hostage to determinism. But in the safety theorist’s defense, our best physics seems to provide a better case for indeterminism than determinism. It remains the case, nevertheless, that the safety theorist needs be more forthcoming about the relationship between the physical conditions of the world and the modality of the safety condition.

6. References and Further Reading

  • Armstrong, D. 1973. Belief, Truth, and Knowledge. Cambridge: Cambridge University Press.
    • An important early contribution to the study of knowledge in the post-Gettier period, particularly the idea that knowledge requires reliability.
  • Chisholm, R. 1977. Theory of Knowledge. 2nd ed. NJ: Prentice Hall.
    • One of the notable works in the early period of contemporary analytic epistemology.
  • Comesaña, J. 2005. “Unsafe Knowledge.” Synthese 146: 395-404.
    • This author argues that safe belief is not necessary for knowledge.
  • Dretske, F. 1971. “Conclusive Reasons.” Australasian Journal of Philosophy 49: 1-22.
    • In this paper Dretske argues for a sensitivity condition for knowledge.
  • Gettier, E. 1963. “Is Justified True Belief Knowledge?” Analysis 23: 121-3.
    • Here the famous counterexamples to the justified true belief account of knowledge are presented.
  • Goldman, A. 1967. “A Causal Theory of Knowing.” The Journal of Philosophy, 64 (12): 357-372.
    • Goldman presents a causal account of knowledge, which was an early attempt at solving the Gettier problem. Goldman later abounded this account in favor of his relevant alternatives account, a position he still maintains today.
  • Goldman, A.  1976. “Discrimination and Perceptual Knowledge.” The Journal of Philosophy, 73: 771-91.
    • The relevant alternatives condition for knowledge is explicated and defended.
  • Goldman, A. 1986. Epistemology and Cognition. USA: Harvard University Press.
    • A contemporary classic that presents Goldman’s epistemics—his multidisciplinary project of bringing the developments in cognitive psychology to bear on questions in individual and social epistemology.
  • Goldman, A. 2007. “Philosophical Intuitions: Their Target, Their Source, and Their Epistemic Status.”  Grazer Philosophische Studien 74: 1-26.
    • In this paper Goldman discusses the place of intuition in philosophy and the epistemic status of such intuitions, which is currently a hot topic in epistemology.
  • Greco, J. 2007. “Worries about Pritchard’s Safety.” Synthese 158: 299-302.
    • Problems for Pritchard’s safety condition are raised.
  • Hawthorne, J. 2004. Knowledge and Lotteries. Oxford: Oxford University Press.
    • This is a masterful treatment of the lottery problem and includes a helpful comparative assessment of the various semantic solutions proposed to this problem.
  • Hawthorne, J. & Gendler, T. 2005. “The Real Guide to Fake Barns.” Philosophical Studies 124: 331-352.
    • A humorous and pointed display of how some Gettier cases can be manipulated into yield even tougher cases for accounts of knowledge to handle.
  • Hawthorne, J. & Lasonen-Aarnio, M. 2009. “Knowledge and Objective Chance”. In: Greenough, P. & Pritchard, D. (eds.). Williamson on Knowledge. Oxford: Oxford University Press, pp. 92-108.
    • In this piece the problematic relationship between safety and probability is identified.
  • Lackey, J. 2006. “Pritchard’s Epistemic Luck.” Philosophical Quarterly 56: 284-9.
    • This author argues for inadequacies in Pritchard’s work on safety.
  • Lewis, D. 1973. Counterfactuals. Oxford: Blackwell.
    • Here Lewis presents his modal semantics for counterfactuals.
  • McGinn, C. 1999. “The Concept of Knowledge.” In: McGinn, C. Knowledge and Reality: Selected Essays. Oxford: Oxford University Press, pp. 7-35.
    • In this collection of his essays, McGinn defends his favored account of knowledge.
  • Neta, R. & Rohrbaugh, G. “Luminosity and the Safety of Knowledge.” Pacific Philosophical Quarterly 85 (2004) 396–406.
    • Arguments against safety are presented.
  • Nozick, R. 1981. Philosophical Explanations. Oxford: Oxford University Press.
    • Chapter 3 contains Nozick’s defense of his sensitivity condition for knowledge.
  • Peacocke, C. 1999. Being Known. Oxford: Oxford University Press.
    • In §7.5 Peacocke presents a useful elaboration of the notion of “could easily have.”
  • Pritchard, D. 2005. Epistemic Luck. Oxford: Oxford University Press.
    • A masterful exposition of the place luck plays in epistemology.
  • Pritchard, D. 2007. “Anti-Luck Epistemology.” Synthese 158: 277-98.
    • An argument for a refined safety condition for knowledge.
  • Pritchard, D. 2008. “Knowledge, Luck, and Lotteries.” In: Hendricks, V. & Pritchard, D. (eds.). New Waves in Epistemology. London: Palgrave Macmillan, pp. 28-51.
    • Here Pritchard discusses the lottery problem for safety at length.
  • Pritchard, D. 2009a. “Safety-Based Epistemology: Whither Now?” Journal of Philosophical Research 34: 33-45.
    • Further refinements of the safety condition.
  • Pritchard, D. 2009b. Knowledge. London: Palgrave Macmillan
    • A general and accessible introduction to knowledge.
  • Russell, B. 1948. Human Knowledge: Its Scope and its Limits. London: Allen & Unwin.
    • Here Russell lays out a general treatment of human knowledge part of which discusses his famous clock case.
  • Sainsbury, R.M. 1997. “Easy Possibilities.” Philosophy and Phenomenological Research 57(4): 907-919.
    • A discussion of easy possibility with respect to S not easily falsely believing P.
  • Shope, R. 1983. An Analysis of Knowing: A Decade of Research. Princeton: Princeton University Press.
    • A helpful overview of the early post-Gettier literature.
  • Skyrms, F.B. 1967. “The Explication of ‘X knows that P’.” Journal of Philosophy 64: 373-89.
    • An early work in epistemology in the post-Gettier period.
  • Sosa, E. 1999a. “How to Defeat Opposition to Moore.” Philosophical Perspectives 13: 141-54.
    • A discussion of safety in the context of skepticism.
  • Sosa, E. 1999b. “How must knowledge be modally related to what is known?” Philosophical Topics 26 (1&2): 373-384.
    • Sosa argues that a contraposition of Nozick’s sensitivity condition of knowledge is a superior condition for knowledge.
  • Sosa, E. 2007. A Virtue Epistemology: Apt Belief and Reflective Knowledge Volume I. Oxford: Oxford University Press.
  • Sosa, E. 2009. Reflective Knowledge: Apt Belief and Reflective Knowledge Volume II. Oxford: Oxford University Press.
    • In these two publications Sosa develops and refines his work on safety into a virtue account of knowledge that does not lean so heavily on a “brute” safety condition for knowledge. As a result, Sosa (and Pritchard) can no longer be strictly labeled as a safety theorist. Indeed, Sosa is open to there being cases of lucky knowledge.
  • Vogel, J. 1999. “The New Relative Alternative Theory”. Philosophical Perspectives 13: 155-80.
    • Here one finds, among other things, some interesting cases involving luck.
  • Williamson, T. 1994. Vagueness. London: Routledge.
    • Here Williamson presents a  case for his epistemic conception of vagueness.
  • Williamson, T. 2000. Knowledge and its Limits. Oxford: Oxford University Press.
    • A contemporary classic in epistemology in which Williamson argues for some rather iconoclastic positions about knowledge and evidence, among other important questions.
  • Williamson, T. 2009a. “Reply to Cassam.” In: Greenough, P. & Pritchard, D. (eds.). Williamson on Knowledge. Oxford: Oxford University Press, pp. 285-292.
    • This is a collection of concerns several authors raise about various aspects of Williamson’s work in epistemology. The book concludes with Williamson’s replies.
  • Williamson, T. 2009b. “Reply to Goldman.” In: Greenough, P. & Pritchard, D. (eds.). Williamson on Knowledge. Oxford: Oxford University Press, pp. 305-312.
  • Williamson, T. 2009c. “Reply to John Hawthorne and Maria Lasonen-Aarnio.” In: Greenough, P. & Pritchard, D. (eds.). Williamson on Knowledge. Oxford: Oxford University Press, pp. 313-29.
  • Williamson, T. 2009d. “Probability and Danger.” The Amherst Lecture in Philosophy.
    • A clarification of the relationship between safe belief and probability.

 

Author Information

Dani Rabinowitz
Email: dani.rabinowitz@philosophy.ox.ac.uk
Oxford University
United Kingdom

Surveillance Ethics

Surveillance involves paying close and sustained attention to another person. It is distinct from casual yet focused people-watching, such as might occur at a pavement cafe, to the extent that it is sustained over time. Furthermore the design is not to pay attention to just anyone, but to pay attention to some entity (a person or group) in particular and for a particular reason. Nor does surveillance have to involve watching. It may also involve listening, as when a telephone conversation is bugged, or even smelling, as in the case of dogs trained to discover drugs, or hardware which is able to discover explosives at a distance.

The ethics of surveillance considers the moral aspects of how surveillance is employed. Is it a value-neutral activity which may be used for good or ill, or is it always problematic and if so why? What are the benefits and harms of surveillance? Who is entitled to carry out surveillance, when and under what circumstances? Are there any circumstances under which someone should never be under surveillance?

This article provides a brief overview of the history of surveillance ethics, beginning with Jeremy Bentham and George Orwell. It then looks at the development of surveillance studies in the light of Michel Foucault and the challenges posed by new techniques of surveillance which allow unprecedented collection and retention of information. The bulk of this article focuses on considering the ethical challenges posed by surveillance. These include why surveillance is undertaken and by whom, as well as when and how it may be employed. This is followed by an examination of a number of concerns regarding the impact of surveillance such as social sorting, distance and chilling effects.

Table of Contents

  1. Origins
  2. Recent History
  3. Privacy
  4. Trust and Autonomy
  5. Cause
  6. Authority
  7. Necessity
  8. Means
  9. Social Sorting
  10. Function Creep
  11. Distance
  12. Chilling Effects
  13. Power
  14. References and Further Reading

1. Origins

Jeremy Bentham’s idea of the Panopticon is arguably the first significant reference to surveillance ethics in the modern period (Bentham 1995). The Panopticon was to be a prison, comprising a circular building with the cells adjacent to the outside walls. In the centre was a tower in which the prison supervisor would live and monitor the inmates. Large external windows and smaller internal windows in each cell would allow the supervisor to monitor the activities of the inmates, while a system of louvres in the central tower would prevent the inmates from seeing the supervisor. A rudimentary form of directed loudspeaker would enable the supervisor to communicate with the prisoners. Through not knowing when they were under surveillance, Bentham argued, the inmates would come to assume that they were always under surveillance. This would encourage them to be self-disciplined and well-behaved during their incarceration. The prospect of living in this way would also deter those who visited the prison from wanting to commit crimes. Hence the Panopticon would serve as a deterrent to the inmates from misbehaving or committing future crimes and to general society from committing crimes and finding themselves so incarcerated.

George Orwell’s 1984 extended the Panopticon to encompass the whole of society, or at least the middle classes (Orwell 2004). In this novel the Panopticon became electrical with the invention of the telescreen, a two-way television which allowed the state almost total visual and auditory access to the homes, streets and workplaces of the citizens. As the inmates of the Panopticon were reminded of the supervisor’s presence by the loudspeaker, so citizens in Orwell’s vision were told repeatedly that “Big Brother is watching you”. Orwell used the novel to discuss, among other things, both the reasons of the state for wanting ubiquitous surveillance and the impact that this has on the individual and the nature of a society under ubiquitous surveillance.

The theme of the Panopticon was revisited by Michel Foucault in Discipline and Punish, an overview of the history of prisons and the value they serve (Foucault 1991). Foucault’s particular concern was with the use of power and its increasing bureaucratization in the modern period. His study began with torture and the emphasis on the sovereignty and power of the king. With the Enlightenment the prison was introduced as a more efficient means of punishment, supported by society’s increasing acceptance of the value of discipline beyond merely the military or religious arenas. Oversight became a fundamental tool in enforcing discipline, and so the Panopticon served as both a means of punishment and a form of discipline of the inmates, owing to the seemingly persistent gaze of the supervisor. With time, Foucault argued, the prison was combined with the workhouse and the hospital to simultaneously deprive inmates of their freedom whilst attempting to discipline and reform them.

Aside from Foucault’s comments on the nature of prisons and their value in society, his reference to the Panopticon introduced the concept to a new generation of scholars unfamiliar with Bentham’s penal theories. As such it is the Panopticon read through the lens of Foucault, along with Orwell’s dystopian vision, that came to dominate early discussions of surveillance and its impact on society and the individual.

2. Recent History

While Bentham/Foucault and Orwell successfully raised questions about the value and harms of surveillance, these had limited impact in many philosophy departments. As such there is little written on the ethics of surveillance from a strictly philosophical perspective. However, the interest sparked by Foucault, coupled with recent advances in technology, has led to a questioning of the role surveillance plays in contemporary society. This questioning has been increasingly reflected in academia with the creation of the inter-disciplinary field of Surveillance Studies, bringing together sociologists, jurists, political scientists, geographers, and increasingly philosophers, to consider issues connected with and arising from surveillance.

Although there may be a degree of continuity with earlier forms of surveillance, the “new surveillance” (Marx 1998) introduced by technological advances adds degrees of complexity and mobility with which society has not been faced before. Closed Circuit Television (CCTV) cameras offer a potentially ubiquitous gaze and a hidden, anonymous watcher akin to the Panopticon on a hitherto unimaginable scale. Wireless networks transmit vast quantities of information on systems vulnerable to intercept. The Internet has seen the creation of virtual identities, “data doubles” – reflections of the “real” person in cyberspace – which are vulnerable to abuse and theft; online storing of medical, banking and other personal data which may be hacked or simply lost by the institution responsible; and the increased commodification of personal information by web sites which sell that data to advertising companies or governments. In each of these cases not only are the technology and services offered new but the nature of the technology means that almost limitless information can be collected, stored indefinitely and returned to or searched at will.

Taken together these issues have moved on the discussion from analogies drawn with the Panopticon or 1984. While each maintains rhetorical force, the computerization of society has rendered them limited in their ability to capture the complexities of contemporary surveillance, still less the benefits and harms that it can bring. Furthermore, both the Panopticon and Big Brother are authoritarian and negative images which lend weight to the suggestion that surveillance is always unethical or problematic. However, this is not the case. There are instances in which surveillance can be not only justified but even embraced by the surveilled. Surveillance is itself an ethically neutral concept. What determines the ethical nature of a particular instance of surveillance will be the considerations which follow, such as justified cause, the means employed, and questions of proportionality. Before turning to these, however, we will discuss the areas of life most impacted by surveillance: Privacy, trust and autonomy.

3. Privacy

One of the core arguments against surveillance is that it poses a threat to privacy, which is of value to the individual and to society. This raises a number of questions about privacy, what it is and to what extent and why it is valuable.

Much of the early work on privacy was carried out in the legal realm, particularly in the United States. Warren and Brandeis’ The Right to Privacy (Warren and Brandeis 1890) is generally taken as the first attempt to define the concept of privacy. Here the authors claim that the right to privacy is an instance of the “right to be let alone” and establish limits to that right, arguing that it is not absolute. Developments in technology then gave rise to defining legal cases, such as Katz v. United States (1967) which related privacy and surveillance to the Fourth Amendment of the US Constitution (forbidding unreasonable search and seizure by the state). Eisenstadt v. Baird (1972) then established that the right to privacy involved the right to make important choices without government intervention, drawing a connection between privacy and autonomy. This was drawn upon in Roe vs. Wade (1972) to argue for a woman’s rights in abortion.

In the aftermath of these legal decisions the concept of privacy was increasingly debated by philosophers. Judith Jarvis Thomson (Thomson 1975) argued that the right to privacy consists of a cluster of rights which overlap with both property rights and rights of the person. She held that there are no privacy rights which do not overlap with clusters of other rights, and so there is no distinct right to privacy. A violation of someone’s right to privacy only occurs when one of these other rights has also been violated in a relevant manner. Hence the illicit viewing of another’s diary involves a breach of his right to dispose of his property as he sees fit; extracting information through torture involves a violation of someone’s right not to be injured. In both cases there is a violation of a person’s privacy, but this is only because other, more fundamental rights have been violated.

Thomas Scanlon (Scanlon 1975) responded by arguing that Thomson’s analysis was convoluted and counter-intuitive. Instead he proposed that we have socially-defined zones of privacy which enable us to act with the assumption that we are not being monitored. These zones are motivated by our interest in not having to be alert to specific observation at all times. James Rachels (Rachels 1975), responding to both Thomson and Scanlon, argued that privacy was rather a matter of relationships. In defining our relationships with others, we use varying degrees of privacy to establish intimacy. With a stranger we uphold a high degree of privacy, whilst with a close family member we may have and expect much less privacy. Indeed, he argued, what it means to be a friend is for the relationship to involve less privacy than would otherwise be the case.

More recently W.A. Parent (Parent 1983) argued that privacy involved the control of undocumented information about oneself. This has been contested by Jeffrey Reimann (Reiman 2004) and Tony Doyle (Doyle 2009), who hold that privacy is not restricted to information. A porn star whose body is freely available for all to see may still have his or her privacy violated if spied upon in his or her own home. Daniel Nathan (Nathan 1990) and Danah Boyd (Boyd 2010) agree with Parent that control is an important issue, while Herman Tavani and James Moor (Tavani and Moor 2001) hold that privacy relates more accurately to the access another has to me than to who controls the information about me.

Despite the disagreements, most would agree that on an individual level, privacy affords us the space to be ourselves and to define ourselves through giving us a degree of autonomy and protecting our dignity. In our interactions with others we may define the intimacy of our relationships with them through the amount of privacy we relinquish in that relationship. As we engage with society at large we gain confidence and security from our privacy, safe that those we do not know do not in turn know all about us. We fear the stranger and what they might do if they knew our vulnerabilities. Through keeping those vulnerabilities private, we maintain a level of personal safety.

Privacy is also of value to society at large. As noted, we may appear in public safe in the knowledge that our weaknesses are not on display for all to see, allowing for confident personal interaction. When we vote we do so in the belief that no-one can see our decision and treat us well or poorly in the light of how we voted. Privacy is thus important in the social context of democracy. In many cases we do not want to know everything about everyone around us and so privacy can protect the rest of us from being exposed to too much information. Thanks to a level of anonymity I may also feel emboldened to speak out publicly against corruption or injustice, or simply to be more creative in self-expression. Many of these benefits can be seen through the contrast with states employing high levels of surveillance, such as the former German Democratic Republic. Here the surveillance carried out by the Ministerium für Staatssicherheit (Stasi) was instrumental in quashing open dissent and enforcing the behavioural uniformity foreseen by Orwell (Funder 2004).

There is also of a tension between the safety of the individual as granted by his or her privacy, and the safety of the community which comes from denying the individual his or her privacy. On the most basic level, I feel safest if you know nothing about me but I know everything about you. This is reversed from your perspective, leading to the tension of balancing privacy against security. This balance suggests that it may be morally justifiable to deny one person’s privacy in the interests of the security of the community, although it is by no means always clear when these situations might arise.

4. Trust and Autonomy

Linked closely to the issue of privacy is that of trust. As highlighted by Rachels (Rachels 1975), privacy is often held in an inverse relationship to trust such that the more trust exists between two people, the less need there is for privacy. This is not universally true as intimate partners may still lock the bathroom door. Nonetheless committed relationships are often marked by a higher degree of trust and a reduced level of privacy. When one of those elements is breached, either through intruding on (limited) privacy or through a breaking of trust, the relationship is damaged. One reflection of diminished trust in a relationship is increased surveillance, as when suspicious partners hire private investigators to determine an infidelity. Conversely, the discovery of increased surveillance, especially when the surveilled party is innocent, may also lead to decreased levels of trust. At a personal level trust is often reciprocal: Why should I trust you if you don’t trust me? The discovery of surveillance could well therefore damage personal relationships.

Surveillance also limits the opportunity to present oneself in the manner of one’s own choosing. It is hence limiting on the individual’s autonomy, impacting how that individual interacts with the world. While Bentham believed the Panopticon would encourage inmates to self-discipline, this would only occur through fear of repercussions. The inmates would be denied the opportunity to demonstrate willingness to reform without the surveillance. There would therefore be no opportunities for the supervisor of the prison to place his trust in the prisoner, nor for that prisoner to demonstrate his trustworthiness other than in the presence of surveillance. Any traits displayed would then arguably not be genuine reflections of the character of the inmate. The same is true of surveillance in the workplace, schools and society at large. If the surveilled is suspicious of or conscious of the surveillance then they might conform to the expected norm, but this will not necessarily reflect their character.

Surveillance therefore diminishes the need to trust the surveilled person. Its presence will pressure that person to conform and so render his actions more predictable. Furthermore, as in the Panopticon, if he does not conform there is the chance he will be subject to sanctions. Surveilled people therefore can become more predictable if they fear reprisals for acting in ways that merit the disapproval of the surveillant. In that sense they are therefore more trustworthy (an authority can trust that they will act in such a manner). If the purpose of surveillance is to control or deter people, then surveillance of which the subject is aware could be effective. If, on the other hand, the purpose is to assess the character of people as that character is expressed in integrity, then surveillance of which the subject is aware will be of little help.

We have seen the impacts of surveillance on privacy, trust and autonomy. We are now in a position to consider when surveillance may be justified in spite of (or because of) those impacts.

5. Cause

The purpose of surveillance, or one particular instantiation of surveillance, is probably the most fundamental ethical question that can be asked on this subject. We may think of security as an obvious response, especially as it concerns state surveillance in the form of espionage, or in the form of security cameras surrounding particular buildings. In a sense this throws the question back one degree to ask whether security, or this degree of security, is justified under the circumstances. This will then hinge in part on who it is that is carrying out the surveillance and in part on whom they are monitoring. Is state surveillance of political dissidents, for instance, really necessary for state security?

Security isn’t the only use of surveillance, however. Many retail establishments use surveillance for the mutual benefit of themselves and their customers. Loyalty cards in supermarkets enable the stores to see who is buying which goods and build up detailed pictures of their customer base. Customers participate in this surveillance in return for exclusive deals. Frequent users receive special offers either to widen their shopping experience or to encourage loyalty to the particular store.

Efficiency savings such as these are not limited to retail. They also apply on public and private transport when smart cards enable a person to use the subway or a toll road without using cash. They can aid the rapid transfer of information regarding a person’s health if they fall ill or have an accident when away from home. Security and customer benefit may also come together as, for example, when credit cards are suspended following atypical spending habits of the user. This might also be the case if law enforcement agencies find that they need to establish an alibi or build a profile of a suspect.

Finally there is the possibility of using surveillance for personal gain. This might be financial or emotional, but can extend to other reasons. An unethical computer hacker might break into a website to steal credit card numbers which she can then use for her own ends. Alternatively a Peeping Tom might steal up to someone’s window with voyeuristic intent, or an ex-spouse might seek to gain incriminating information in order to secure custody of their child (Allen 2008).

While issues of simple personal gain which involve violating the privacy of another seem to be unacceptable, although there might be exceptions, questions of security and efficiency are less clear-cut. Many choose to opt-in to supermarket or transport surveillance precisely for the benefits that these systems offer, despite the intensely personal knowledge of the customer that the store can gain from these interactions. In the case of state security the questions often fall along the lines of how far should the privacy of the few be sacrificed for the security of the many. As shall be seen when we come to consider social sorting, however, questions of distribution also arise: Who stands to gain and who to lose from a particular instance of surveillance?

Consent is a major consideration in the justification of surveillance, and particularly the cause of surveillance. If I invite you into my home I am consenting for you to see me in a context which would hitherto have been private and secluded from you. In the popular game show Big Brother contestants consented to being monitored round the clock for up to three months. This does not appear to be problematic from the perspective of the surveillance. As noted above, we do not feel an imposition has occurred if we choose to take up a supermarket’s offer of a loyalty card and the convenience that it brings. We might, however, object strongly if it transpires that the store has been monitoring the spending of individuals without such cards by recording their credit card usage and correlating this with itemised receipts.

While consent can justify surveillance, however, the lack of consent does not automatically thereby invalidate it. Law enforcement does not need to seek the consent of the criminals it wishes to monitor to accumulate evidence against them, nor does the state need to gain consent of those who are genuine threats to its security. As such, we must look to justifiable causes for non-consensual surveillance.

One justification often given for large-scale surveillance is the consequentialist appeal to the greater good. This might apply when the security of the community is best served by monitoring some or all in that community. If the community in question is a state then the numbers involved will be too great to realistically gain complete acceptance of the surveillance by every citizen. As such the state may then appeal to the benefits that will come to more people as a result of the surveillance to justify the imposition.

Deontologists are likely to resist this justification as it implies that the rights of the few may be overridden by the interests of the many. A deontological justification will look rather to the entity to be surveilled and ask what it is about that entity that means it deserves or is in some way liable to be monitored in this way. Given the aforementioned harms of surveillance, there must be a good reason as to why this person or group should be exposed to those harms.

In practice, justifications for surveillance often include both consequentialist and deontological considerations. Hence state security is justified in both protecting the majority and focusing its attention on particular wrongdoers who pose a threat to that majority. Similarly, CCTV in the public square is justified in providing peace of mind to the general public by monitoring all, but targeting only particular individuals or groups who are believed to pose a threat.

The type of surveillance might also have a bearing on whether a consequentialist or  deontological justification is sought. CCTV, which is indiscriminate in whom it monitors, lends itself to a consequentialist perspective. In shopping malls the majority of people surveilled by CCTV have done nothing wrong and have no intention of wrongdoing. Nonetheless, the benefits which CCTV brings in detecting the minority of wrongdoers and punishing them may be taken to justify the surveillance of all. By contrast, a more discriminating form of surveillance, such as tapping a suspect’s telephone, lends itself more to a deontological approach. If a person has given an authority such as the police reasonable suspicion to believe he has committed a crime, so he has rendered himself liable to be monitored in this way.

Finally, differences between deontologists and consequentialists emerge in opposition to surveillance. Deontologists will typically find surveillance less acceptable when it violates certain rights of individuals such as the right to privacy. By contrast, consequentialists will tend to be more sanguine about concerns with individual rights in favour of overall costs and benefits to society. If a particular instance of surveillance can be shown to improve the wellbeing of society, albeit at the cost of the privacy of a few individuals, then consequentialists are less likely to see this as problematic than deontologists.

6. Authority

Much of the justification of surveillance, and particularly the cause of that surveillance, will depend on who it is that is carrying out the surveillance. State security can and should be carried out by state intelligence agencies. By contrast it should not be carried out by journalists or foreign aid workers, who need to maintain a level of neutrality in order to carry out their work effectively. If this is the role of state intelligence agencies then those agencies would not be justified in the surveillance of domestic employers to ensure that they are not abusing their workforce. This should rather be the domain of domestic law enforcement.

State surveillance of genuine enemies of the state is one of the less controversial elements of surveillance. Even here, though, it is important to be clear as to precisely whose security is being guarded: That of the state or of those currently empowered to run the state? When the protests occurred in Tiananmen Square in 1989, were the protestors challenging 1) the security of China, 2) the security of the Communist Party running China or 3) the security of those individuals leading the Communist Part of China? To what extent was China the Communist Party and how much of the identity of the state is tied up with those who run the state?

The decision to employ surveillance does not lie entirely with the state, although the state may chose to regulate the use of surveillance. Employers sometimes monitor their employees, either to prevent theft or whistle-blowing or to ensure that they are working to their maximum ability. Retailers, as noted above, monitor customer spending habits to improve efficiency and sales. Parents monitor sleeping infants so as to respond should the child wake in the night. Private investigators might engage in surveillance to establish infidelity, while Peeping Toms might do so for kicks. While it might be felt that the investigator is justified and the Peeping Tom is not, what of the case when the private investigator is attempting to establish infidelity and simultaneously enjoying his work a little too much?

In each case the ethical authority to carry out surveillance is intimately linked to the justifying cause of that surveillance. Hence an individual is justified in carrying out surveillance of his property if it is to secure the property from theft, but not if it is to spy on his tenants. Parents are justified (indeed, often expected) to monitor their infant children as they sleep, but whether they are also justified in monitoring the babysitter watching over their children is far more controversial. Groups of people are justified in watching their street, particularly if it has been subject to a recent spate of theft, through Neighbourhood Watch schemes, but not in intimidating an unpopular neighbour through persistent overt surveillance. This is not to suggest that intention alone can justify surveillance. A landlord might wish to secure his property by placing a camera in the bathroom (lest a burglar enter through the window). While his intention might not be to spy on his tenants the effect will be precisely that. Similarly, baby monitors left in areas where they are likely to record intimate phone conversations of a babysitter are still an invasion of the babysitter’s privacy, irrespective of the parents’ intentions.

7. Necessity

Necessity is often cited as an important condition for justified surveillance. Article 8 of the European Union Convention on Human Rights, for example, states that “there shall be no interference by a public authority with the exercise of this right [to privacy] except such as is in accordance with the law and is necessary in a democratic society in the interests of national security, public safety or the economic well-being of the country, for the prevention of disorder or crime, for the protection of health or morals, or for the protection of the rights and freedoms of others” (Council of Europe 1950). We shall discuss proportionality and discrimination below. Here we shall focus on what is meant by necessity in the context of surveillance.

When is surveillance necessary, though? Should surveillance, like war, be a matter of last resort? If so, when is that moment of last resort reached?

The concept of necessity can limit surveillance from being undertaken arbitrarily or prematurely. An authority may not monitor anyone at any time. Surveillance must rather be required by the circumstances of the case. However, this is simply to replace “necessary” with “required” and so does not help. We are still left with the question as to when surveillance is required by the circumstances of the case.

John Lango (Lango 2006) has suggested two criteria for necessity: The feasibility standard and the awfulness standard. The first occurs when there is sufficient evidence to suggest that there is no feasible alternative, the second when the alternatives are worse than the proposed course of action. When one of these criteria is met the action may be deemed necessary. Given the harms of surveillance, it should therefore be avoided if there are less harmful alternatives. However, surveillance becomes necessary when either there is no alternative, or when the alternatives such as physical intrusion or arrest are more harmful than the surveillance itself.

8. Means

How surveillance is carried out is a further consideration which should be taken into account. Is the surveillance proportionate to its aim and is it discriminate in whom it targets? Proportionality of action is a familiar concept in legal and military ethics, but it has application to surveillance as well. We might return to the images of Big Brother or the Panopticon to picture scenarios in which surveillance is total and unending, and the horror which this often arouses in our minds. In these cases it is hard to imagine the occasioning justification which would see such surveillance as a proportionate response. Even major wars do not justify the perpetual monitoring of all citizens around the clock.

More recent, non-fictional cases exist in the surveillance of school children through using fingerprinting technology either to grant entrance and egress from the school, or to pay for school lunches. While the former case might be seen as providing protection to the children from those who should not be in the school, the latter seems highly invasive and an extreme manner to respond to playground bullies stealing lunch money or parents’ desire to know what food their children are eating. Similar questions have been raised about the full-body screening of airline passengers which was introduced in 2009 in the US and the UK, leading to monochrome “nude” images of all those who went through the scanners. Irrespective of health concerns associated with the scanners, they were seen by many to be extremely invasive of privacy without offering a concomitant level of security to those flying on the airline.

If proportionality questions the depth, or intrusiveness of surveillance, discrimination considers its breadth. It asks how many people are likely to be monitored as a result of the particular form of surveillance. Some aspects of surveillance, such as wire tapping, are highly discriminating and target only those using the particular phone under observation. Others, such as CCTV in public places, are broadly indiscriminate and collect information about a great number of people, only some of which will be of interest to the surveillant. We may ask if there is an onus on the surveillant to be as discriminating as possible and only collect information or invade the privacy of as few people as absolutely necessary, given the confines of what is reasonably possible.

A related question is whether any form of surveillance should be absolutely prohibited. Possible candidates for impermissible surveillance would be that of public toilets or private bedrooms. However even here it would appear as if there are cases when these might become of critical importance to justifying causes, such as state security. This might occur if a civil servant with access to state secrets is believed to be involved in a sexual liaison with a member of a foreign intelligence agency. Less exotically, an organised crime syndicate might use a public toilet as a dead letter drop for passing drugs, guns or money. In each of these cases it might be felt that the perhaps obvious places for banning surveillance could in fact become legitimate contexts. In these cases, however, it would be important to protect the innocent as much as possible by limiting the intrusion. Film which is not useful as evidence should be promptly deleted; the monitoring of toilets should be carried out by a member of the same sex; and if possible software should be used which grants anonymity to all captured on film by default and can only reveal individual details upon request.

9. Social Sorting

Much of the discussion surrounding the ethics of surveillance concerns threats to individual or group privacy, and the balance of power between the individual and the state or the individual’s employers. There is a further potential harm of surveillance in the form of social sorting (Lyon 2002). The purpose of surveillance, it is argued, is to sort people into categories for ends which are either good or ill. The danger, however, is that social stereotypes are carried over into these categories and may even be enshrined and institutionalized in them. As a result particular forms of surveillance might serve to have real impact on people’s life chances owing to such institutionalized prejudice. For example, a recent study found that CCTV operators were disproportionately monitoring the young, the male and ethnic minorities “for no obvious reason” (Norris & Armstrong 1999). That is, in the absence of suspicious behaviour they were choosing to focus their attention on these categories of people. The result is that anyone falling into these categories is more likely to be caught if doing something wrong than someone else, thus perpetuating the stereotype. Furthermore, as these groups were being watched more frequently than others, they were more likely to be seen as doing something suspicious. This in turn could lead to disproportionate response rates by security forces on the ground, contributing to a sense of alienation and rejection by society.

10. Function Creep

Function creep (Winner 1977) involves extending the use of a technology from the cause for which it was initially intended to a different cause. This is readily seen in the use of identity cards in the UK, introduced in the 1939 National Registration Act for the purposes of security, national service and rationing. By 1950 the same cards were being used by 39 government agencies for reasons as diverse as collecting parcels from the post office to routine police enquiries. While any or even all of these were arguably justified, few could be justified under the terms of the initial Act. It was a combination of protest and the eventual recognition of this extension of use which led to the abolition of the Act that same year.

It is not just an extension of surveillance technology which can count as function creep but also an extension of the information retrieved by that technology. CCTV may be installed in a public transport hub in order to better ease traffic flows and predict suicides in order to facilitate timely intervention. In the event of a terrorist atrocity occurring in that hub, though, the same images can be used to identify the terrorist and the means of carrying out the atrocity. In this way function creep can be seen to be complex in its application: Its new use might be fully justifiable. What is problematic is the application of the technology in a new area in one instance leading to its regular and repeated use in that area, especially when this extension has not been subject to ethical scrutiny.

11. Distance

Surveillance typically puts a distance between the surveillant and the person or group surveilled. This can be of benefit to both as it removes the surveillant from the immediacy of the situation and may provide her with time and space to deliberate before reacting to a situation. It might also mean that she does not feel personally threatened in a situation and so react more calmly than would otherwise be the case. However it also simplifies everyday levels of human interaction such as negotiation, discretion, and the use of subtlety: From the surveillant’s perspective someone is either a target or not, and the surveilled subject is not given a platform to respond. This is a concern which is exacerbated by the automation of surveillance and threat detection as the software operating the surveillance can only see people in these terms. There is a further concern that the distance between operator and subject means that the two might never meet. Yet without personal confrontation an operator with social prejudices may never be challenged in her views. She might never meet a person from an ethnic minority (or not one from the minority of which she is suspicious) and so fail to be challenged in her view that all members of that social group are, by virtue of their membership, inherently worthy of suspicion.

12. Chilling Effects

International law states that people have certain human rights, such as the right to free speech, the right to association and the right to protest (United Nations 1948). Suspicion as to a state’s motives, however, may lead to cynicism as to how the state will employ its surveillance technology in self-protection. Even if there is no evidence of wrong-doing the state may nonetheless choose to keep records on those who publicly confess to a certain belief, or who choose to associate with those whom the state believes pose a threat. These records may then be used against citizens at a later date by the state, or by a future iteration of the state if the individuals running the executive change. The knowledge of the accumulation and possession of these records by the state may disincline some citizens from engaging in these legitimate activities, preferring to keep their heads down and avoid notice by the state. These so-called “chilling effects” are at odds with human rights and democratic practice and can lead to behavioural uniformity and a stifling of creativity. In certain dictatorial regimes this may be seen as advantageous. Again one can return to Orwell’s 1984 for a dystopian vision of chilling effects (see also Funder 2004).

13. Power

Throughout this article there has been a recurring theme of power. Through the act of surveillance the surveillant gains power over the surveilled, either through the gathering of information regarding that person which they would rather keep secret (or, at least, keep control over its distribution), or through distancing the person and treating them as acceptable or unacceptable for whatever is the purpose of that surveillance. The balance of power between individuals, or between individuals and groups such as employers or the state, is therefore an important consideration in assessing what it is that is wrong or dangerous about many forms of surveillance.

If we return to the parental monitoring of infants, the context is one of the empowered over the powerless and the cause of the monitoring is paternal care. As noted, this is often seen as a duty of the parent and so one which is justified. As children grow and become more independent, however, they require less care and gain an increasingly strong claim to their own privacy. This is true of surveillance in general as it transfers power from the surveilled to the surveillant. When consent is given then this is more, although possibly not always, justifiable. In the absence of consent, however, this disempowerment of the individual is highly problematic, threatening their dignity and ultimately their responsibility for their own lives.

14. References and Further Reading

  • Allen, A.L. (2008). “The Virtuous Spy: Privacy as an Ethical Limit”, The Monist 91.1: 3-22.
  • Bentham, J. (1995). The Panopticon Writings. Verso Books.
  • Boyd, D. (2010). “Making Sense of Privacy and Publicity”. SXSW. Austin, Texas, March 13.
  • Council of Europe (1950). Convention for the Protection of Human Rights and Fundamental Freedoms as amended by Protocols No. 11 and No. 14.
  • Doyle, T. (2009). “Privacy and Perfect Voyeurism”. Ethics and Information Technology 11: 181-189.
  • Foucault, M. (1991). Discipline and Punish: The Birth of the Prison new edn., Penguin.
  • Funder, A. (2004). Stasiland: Stories from Behind the Berlin Wall new edn., Granta Books.
  • Lango, J. (2006). “Last Resort and Coercive Threats: Relating a Just War Principle to a Military Practice”. Joint Services Conference on Professional Ethics.
  • Lyon, D. (2002). Surveillance as Social Sorting: Privacy, Risk and Automated Discrimination, Routledge.
  • Marx, G.T. (1998). “Ethics for the New Surveillance”. The Information Society 14: 171-185.
  • Nathan, D. (1990). “Just Looking: Voyeurism and the Grounds of Privacy”. Public Affairs Quarterly 4.4: 365-386.
  • Norris, C. and Armstrong, G. (1999). “CCTV and the Social Structuring of Surveillance”. In K. Painter and N. Tilley, eds. Surveillance of Public Space: CCTV, Street Lighting and Crime Prevention, Crime Prevention Studies, Monsey, New York, Criminal Justice Press: 157-78.
  • Orwell, G. (2004). 1984 Nineteen Eighty-Four new edn., Penguin Classics.
  • Parent, W.A. (1983). “Privacy, Morality and the Law”. Philosophy and Public Affairs 12.4: 269-288.
  • Rachels, J. (1975). “Why Privacy is Important”. Philosophy and Public Affairs 4.4: 323-333.
  • Reiman, J. (2004). “Driving to the Panopticon: A Philosophical Exploration of the Risks to Privacy Posted by the Information Technology of the Future”. In Privacies: Philosophical Evaluations, Stanford, Stanford University Press.
  • Scanlon, T. (1975). “Thomson on Privacy”. Philosophy and Public Affairs 4.4: 315-322.
  • Tavani, H.T. and Moor, J.H. (2001). “Privacy Protection, Control of Information, and Privacy-Enhancing Technologies”. Computers and Society 31.1: 6-11.
  • Thomson, J.J. (1975). “The Right to Privacy”. Philosophy and Public Affairs 4.4: 295-314.
  • United Nations (1948). “The Universal Declaration of Human Rights”.
  • Warren, S.D. and Brandeis, L.D. (1890). “The Right to Privacy”. Harvard Law Review: 1-19.
  • Winner, L. (1977). Autonomous Technology: Technics-out-of-control as a Theme for Political Thought, The MIT Press.

 

Author Information

Kevin Macnish
Email: Kevin.Macnish@gmail.com
University of Leeds
United Kingdom

Thrasymachus (fl. 427 B.C.E.)

Thrasymachus of ChalcedonThrasymachus of Chalcedon is one of several “older sophists” (including Antiphon, Critias, Hippias, Gorgias, and Protagoras) who became famous in Athens during the fifth century B.C.E. We know that Thrasymachus was born in Chalcedon, a colony of Megara in Bithynia, and that he had distinguished himself as a teacher of rhetoric and speechwriter in Athens by the year 427. Beyond this, relatively little is known about his life and works. Thrasymachus’ lasting importance is due to his memorable place in the first book of Plato‘s Republic. Although it is not quite clear whether the views Plato attributes to Thrasymachus are indeed the views the historical person held, Thrasymachus’ critique of justice has been of considerable importance, and seems to represent moral and political views that are representative of the Sophistic Enlightenment in late fifth century Athens.

In ethics, Thrasymachus’ ideas have often been seen as the first fundamental critique of moral values. Thrasymachus’ insistence that justice is nothing but the advantage of the stronger seems to support the view that moral values are socially constructed and are nothing but the reflection of the interests of particular political communities. Thrasymachus can thus be read as a foreshadowing of Nietzsche, who argues as well that moral values need to be understood as socially constructed entities. In political theory, Thrasymachus has often been seen as a spokesperson for a cynical realism that contends that might makes right.

Table of Contents

  1. Life and Sources
  2. Doctrines
  3. Influence
  4. References and Further Reading
    1. Primary Sources
    2. Secondary Sources

1. Life and Sources

The precise years of Thrasymachus’ birth and death are hard to determine. According to Dionysius, he is younger than Lysias, who Dionysius falsely believed to be born in 459 B.C.E. Aristotle places him between Tisias and Theodorus, but he does not list any precise dates. Cicero mentions Thrasymachus several times in connection with Gorgias and seems to imply that Gorgias and Thrasymachus were contemporaries. A precise reference date for Thrasymachus’ life is provided by Aristophanes, who makes fun of him in his first play Banqueters. That play was performed in 427, and we can conclude therefore that he must have been teaching in Athens for several years before that. One of the surviving fragments of Thrasymachus’ writing (Diels-Kranz Numbering System 85b2) contains a reference to Archelaos, who was King of Macedonia from 413-399 B.C.E. We thus can conclude that Thrasymachus was most active during the last three decades of the fifth century.

2. Doctrines

The remaining fragments of Thrasymachus’ writings provide few clues about his philosophical ideas. They either deal with rhetorical issues or they are excerpts from speeches (DK 85b1 and b2) that were (probably) written for others and thus can hardly be seen as the expression of Thrasymachus’ own thoughts. The most interesting fragment is DK 85b8. It contains the claim that the gods do not care about human affairs since they do not seem to enforce justice. Scholars have, however, been divided whether this claim is compatible with the position Plato attributes to Thrasymachus in the first book of the Republic. Plato’s account there is by far the most detailed, though perhaps historically suspect, evidence for Thrasymachus’ philosophical ideas.

In the first book of the Republic, Thrasymachus attacks Socrates’ position that justice is an important good. He claims that ‘injustice, if it is on a large enough scale, is stronger, freer, and more masterly than justice’ (344c). In the course of arguing for this conclusion, Thrasymachus makes three central claims about justice.

  1. Justice is nothing but the advantage of the stronger (338c)
  2. Justice is obedience to laws (339b)
  3. Justice is nothing but the advantage of another (343c).

There is an obvious tension among these three claims. It is far from clear why somebody who follows legal regulations must always do what is in the interest of the (politically) stronger, or why these actions must serve the interests of others. Scholars have tried to resolve these tensions by emphasizing one of the three claims at the expense of the other two.

First, there are those scholars (Wilamowitz 1920, Zeller 1889, and Strauss 1952) who take (1) as the central element of Thrasymachus’ thinking about justice. According to this view, Thrasymachus is an advocate of natural right who claims that it is just (by nature) that the strong rule over the weak. This interpretation stresses the similarities between Thrasymachus’ arguments and the position Plato attributes to Callicles in the Gorgias.

A second group of scholars (Hourani 1962, and Grote 1850) emphasizes the importance of (2) and contends that Thrasymachus advocates a form of legalism. According to this interpretation, Thrasymachus is a relativist who denies that justice is anything beyond obedience to existing laws.

A third group (Kerferd 1947, Nicholson 1972) argues that (3) is the central element in Thrasymachus’ thinking about justice. Thrasymachus therefore turns out to be an ethical egoist who stresses that justice is the good of another and thus incompatible with the pursuit of one’s self-interest. Scholars in this group view Thrasymachus primarily as an ethical thinker and not as a political theorist.

In addition, there is a group of scholars (A.E. Taylor 1960, and Burnet 1964) who read Thrasymachus as an ethical nihilist. According to this view, Thrasymachus’ project is to show that justice does not exist. Burnet writes in this context: ‘[Thrasymachus] is the real ethical counterpart to the cosmological nihilism of Gorgias.’

Others (Barney 2004 and Johnson 2005) have stressed that Thrasymachus should not be read as a philosopher who offers precise definitions of justice, but rather as a sociologist or political scientist who offers empirical observations that amount to a cynical commentary on those who follow a traditional, Hesiodic conception of justice.

Finally, there are a number of scholars who claim that Thrasymachus is a confused thinker. Cross and Woozley (1964) contend, for example, that Thrasymachus advances different criteria for justice ‘without appreciating that they do not necessarily coincide.’ This claim has been renewed by Everson (1998). J.P. Maguire (1971) argues that only some of the arguments in book I of the Republic are Thrasymachus’ own, while other ideas are falsely attributed to Thrasymachus by Plato in order to prepare the ground for his own arguments.

3. Influence

In spite of the disagreement about how to interpret Thrasymachus’ arguments in book I of the Republic, his ideas have been influential in ethical and political theory. In ethics, Thrasymachus’ ideas have often been seen as the first fundamental critique of moral values. Thrasymachus’ insistence that justice is nothing but the advantage of the stronger seems to support the view that moral values are socially constructed and are nothing but the reflection of the interests of particular political communities. Thrasymachus can thus be read as a foreshadowing of Nietzsche, who argues as well that moral values need to be understood as socially constructed entities. In political theory, Thrasymachus has often been seen as a spokesperson for a cynical realism that contends that might makes right. This view frequently associates Thrasymachus with the arguments Thucydides attributes to the Athenians in their negotiations with the island of Melos (History of the Peloponnesian War, Chapter XVII). Thrasymachus is therefore frequently portrayed as an early version of Machiavelli who argues in The Prince that the true statesman does not recognize any moral constrains in his pursuit of power. In the scholarship on Socrates, Thrasymachus is sometimes seen as an interlocutor who shows the limits of the Socratic elenchus. C.D.C. Reeve (1988) argues, for instance, that the conversation between Socrates and Thrasymachus illustrates that Socratic questioning cannot benefit a person like Thrasymachus, who categorically denies that justice is a virtue. Reeve contends that this limit of the elenctic method provided the impetus why Plato proceeded to modify Socrates’ ethical principles in the remaining books of the Republic.

4. References and Further Reading

a. Primary Sources

  • Diels, Hermann. Die Fragmente der Vorsokratiker. Rev. Walther Kranz. Berlin: Weidmann, 1972-1973.
  • Plato. Republic. Trans. G.M.A. Grube (rev. C.D.C. Reeve). Indianapolis: Hackett, 1992.
  • <Sprague, Rosamund Kent, ed. The Older Sophists: A Complete Translation by Several Hands. Columbia SC: University of South Carolina Press, 1972.

b. Secondary Sources

  • Adkins, A.W.H.Merit and Responsibility: A Study in Greek Values Oxford: Clarendon Press, 1960.
  • Balot, R.K.Greed and Injustice in Classical Athens Princeton: Princeton University Press, 2001.
  • Barney, R.”Callicles and Thrasymachus”Stanford Encyclopedia of Philosophy
  • Burnet, J. Greek Philosophy. London: Macmillian, 1964.
  • Chapell,T.D.J. “The Virtues of Thrasymachus” Phronesis 38 (1993): 1-17
  • Chapell,T.D.J. “Thrasymachus and Definition” Oxford Studies in Ancient Philosophy 18 (2000): 101-107
  • Everson, S. “The Incoherence of Thrasymachus” Oxford Studies in Ancient Philosophy 16 (1998): 99-131.
  • Cross, R.C. and Woozley, A.D. Plato’s Republic. A Philosophical Commentary. London: Macmillian, 1964.
  • Grote, G. A History of Greece. London: J. Murnay, 1888.
  • Hourani, C.F. “Thrasymachus’ Definition of Justice in Plato’s RepublicPhronesis 7 (1962): 110-120.
  • Johnson, C.Socrates and the Immoralists Lanham: Lexington Books, 2005
  • Kerferd, G.B. “The doctrine of Thrasymachus in Plato’s RepublicDurham Univ. Journal 40 (1947): 19-27.
  • Kerferd, G. B. The Sophistic Movement. Cambridge: Cambridge University Press, 1981.
  • Maguire, J.P. “Thrasymachus…or Plato?” Phronesis 16 (1971):142-163.
  • Nicholson, P.P. “Unravelling Thrasymachus’ Argument in the RepublicPhronesis 19 (1974): 210-232.
  • O’Neill, B. “The Struggle for the Soul of Thrasymachus”Ancient Philosophy 8 (1988):167-85.
  • Reeve, C.D.C. “Socrates meets Thrasymachus” Archiv für Geschichte der Philosophie 67 (1985): 246-265.
  • Reeve, C.D.C. Philosopher-Kings. Princeton: Princeton University Press, 1988.
  • Strauss, L. Natural Right and History. Chicago: University of Chicago Press, 1952.
  • Taylor, A.E. Plato, the Man and his Work. London: Methenn, 1960.
  • White, S.A.”Thrasymachus the Diplomat”Classical Philology 90 (1995): 307-27.
  • Willamowitz-Moellendorff, U.v. Platon I. Berlin: Weidmann, 1920.
  • Zeller, E. Outlines of the History of Greek Philosophy. New York: H.Holt, 1889.

Author Information

Nils Rauhut
Email: nrauhut@coastal.edu
Coastal Carolina University
U. S. A.

Johann Wolfgang von Goethe (1749—1832)

GoetheGoethe defies most labels, and in the case of the label ‘philosopher’ he did so intentionally. “The scholastic philosophy,” in his opinion, “had, by the frequent darkness and apparent uselessness of its subject- matter, by its unseasonable application of a method in itself respectable, and by its too great extension over so many subjects, made itself foreign to the mass, unpalatable, and at last superfluous” (Goethe 1902, 1: 294). But it is nothing exceptional for a philosopher to disdain the character of what is passed along under the name philosophy by professional academics. If Diogenes, Montaigne, Nietzsche, Wittgenstein, Sartre, or Rorty, can be considered philosophers, then it may even be a rule that to reject the appellation is a condition of having earned it. That said, Goethe is certainly not a philosopher in the sense made popular in his day: a builder of self-grounding systems of thought. Neither is he a philosopher by today’s most common definitions: either a professional analyzer of arguments or a critic of contemporary cultural practices. The paradigm under which Goethe might be classified a philosopher is much older, recalling the ancient and then renaissance conception of the polymath, the man of great learning and wisdom, whose active life serves as the outward expression of his thinking.

In terms of influence, Goethe’s upon Germany is second only to Martin Luther’s. The periods of his dramatic and poetic writing –Sturm und Drang, romanticism, and classicism— simply are the history of the high-culture in Germany from the late eighteenth to the early nineteenth century. Philosophically, his influence is indelible, though not as wide-reaching. His formulation of an organic ontology left its mark on thinkers from Hegel to Wittgenstein; his theory of colors challenged the reigning paradigm of Newton’s optics; and his theory of morphology, that of Linnaeus’ biology.

Table of Contents

  1. Life and Works
  2. Philosophical Background
  3. Scientific Background and Influence
  4. Morphology, Compensation, and Polarity
  5. Theory of Colors
  6. Philosophical Influence
  7. References and Further Reading
    1. Primary Sources
      1. German Editions of Goethe’s Works
      2. Letters and Conversations
      3. English Translations of Goethe’s Works
    2. Selected Secondary Scholarship
      1. Historical and Philosophical Context
      2. Science and Methodology
      3. Aesthetics, Politics, and Theology

1. Life and Works

Historical studies should generally avoid the error of thinking that the circumstances of a philosopher’s life necessitate their theoretical conclusions. With Goethe, however, his poetry, scientific investigations, and philosophical worldview are manifestly informed by his life, and are indeed intimately connected with his lived experiences. In the words of Georg Simmel, “…Goethe’s individual works gradually appear to take on less significance than his life as a whole. His life does not acquire the sense of a biography that strings together external phenomena, but is rather like the portrait of a singular vastness, depth and dynamism of existence, the pure expression of an internal vigor in its relation to the world, the spiritualization of an extraordinary sphere of reality,” (Simmel 2007, 85f).

Johann Wolfgang von Goethe was born August 28, 1749 in Frankfurt, Germany. His father was the Imperial Councillor Johann Kaspar Goethe (1710-1782) and his mother Katharina Elisabeth (Textor) Goethe (1731-1808). Goethe had four siblings, only one of whom, Cornelia, survived early childhood.

Goethe’s early education was inconsistently directed by his father and sporadic tutors. He did, however, learn Greek, Latin, French, and Italian relatively well by the age of eight. In part to satisfy his father’s hope for material success, Goethe enrolled in law at Leipzig in 1765. There he gained a reputation within theatrical circles while attending the courses of C.F. Gellert. And there he gained notoriety for his extracurricular activities at what would become Faust’s haunt, Auerbach’s Keller. In 1766 he fell in love with Anne Catharina Schoenkopf (1746-1810) and wrote his joyfully exuberant collection of nineteen anonymous poems, dedicated to her simply with the title Annette.

After a case of tuberculosis and two years convalescence, Goethe moved to Strassburg in 1770 to finish his legal degree. There he met Johann Gottfried Herder (1744-1803), unofficial leader of the Sturm und Drang movement. Herder encouraged Goethe to read Homer, Ossian, and Shakespeare, whom the poet credits above all with his first literary awakening. Inspired by a new flame, this time Friederike Brion, he published the Neue Lieder (1770) and his Sesenheimer Lieder (1770-1771). Though set firmly on the path to poetry, he was promoted Licentitatus Juris in 1771 and returned to Frankfurt where with mixed success he opened a small law practice. Seeking greener pastures, he soon after moved to the more liberal city of Darmstadt. Along the road, so the story goes, Goethe obtained a copy of the biography of a noble highwayman from the German Peasants’ War. Within the astounding span of six weeks, he had reworked it into the popular anti-establishment protest, Götz von Berlichingen (1773).

His next composition, Die Leiden des jungen Werther (1774), brought Goethe nearly instant worldwide acclaim. The plot of the book is mostly a synthesis of his friendships with Charlotte Buff (1753-1828) and her fiancé Johann Christian Kestner (1741-1800), and the suicide of Goethe’s friend Karl Wilhelm Jerusalem (1747-1772). It remains the archetype of the Sturm und Drang’s elevation of emotion over reason, disdain for social proprieties, and exhortation for action in place of reflection. Besides Werther, Goethe composed Die Hymnen (among them Ganymed, Prometheus and Mahomets Gesang), and several shorter dramas, among them Götter, Helden und Wieland (1774), and Clavigo (1774).

On the strength of his reputation, Goethe was invited in 1775 to the court of then eighteen-year-old Duke Carl August (1757-1828), who would later become Grand Duke of Saxe-Weimar-Eisenach. Although Weimar was then a village of only six thousand residents, it was in the process of a cultural revolution thanks to the foresight and aesthetic vision of Duchess Anna Amalia (1739-1807), mother of the Duke and matron of the “Court of the Muses.” Goethe became enveloped in court life, where he could turn his limitless curiosity to an astonishing range of civic activities. As court-advisor and special counsel to the Duke, he took directorship of the mining concern, the finance ministry, the war  and roads commission, the local theater, not to mention construction of the beautiful Park-am-Ilm. He was eventually granted nobility by Emperor Joseph II, and became Geheimrat of Weimar in 1782.

From 1786 to 1788 Goethe took his Italienische Reise, in part out of his growing enthusiasm for the Winckelmannian rebirth of classicism. There he met the artists Kaufmann and Tischbein, and also Christiane Vulpius (1765–1816), with whom he held a rather scandalous love affair until their eventual marriage in 1806.

Although Goethe had first met Friedrich Schiller (1759-1805) in 1779, when the latter was a medical student in Karlsruhe, there was hardly an immediate friendship between them. When Schiller came to Weimar in 1787, Goethe dismissively considered Schiller an impetuous though undeniably talented upstart. As Goethe wrote to his friend Körner in 1788, “His entire being is just set up differently than mine; our intellectual capacities appear essentially at odds.” After some years of maturation on Schiller’s part and of mellowing on Goethe’s, the two found their creative spirits in harmony. In 1794, the pair became intimate friends and collaborators, and began nothing less than the most extraordinary period of literary production in German history. Working alongside Schiller, Goethe finally completed his Bildungsroman, the great Wilhelm Meisters Lehrjahre (1795-6), as well as his epic Hermann und Dorothea (1796-7) and several balladic pieces. Schiller, for his part, completed the Wallenstein trilogy (1799), Maria Stuart (1800), Die Jungfrau von Orleans (1801), Die Braut von Messina (1803) and Wilhelm Tell (1804). To Goethe’s great sorrow and regret, Schiller died at the height of his powers on April 29, 1805. Of their collaboration’s historical importance, Alfred Bates commemorates, “Schiller and Goethe have ever been inseparable in the minds of their countrymen, and have reigned as twin stars in the literary firmament. If Schiller does not hold the first place he is more beloved, though Goethe is more admired,” (Bates 1906, 11: 75).

Johann Wolfgang von Goethe died on March 22, 1832 in Weimar, having finally finished Faust the previous year. His famous last words were a request that his servant let in “more light.” The prince of poets, Goethe was laid to rest in the Fürstengruft of the Historischer Friedhof in Weimar, side by side with his friend Schiller.

2. Philosophical Background

The Kultfigur of Goethe as the unspoiled and uninfluenced genius is doubtless over-romanticized. Goethe himself gave rise to this myth, both in his conversations with others and in his own quasi-biographical work, Dichtung und Wahrheit (1811-1833). About his study of the history of philosophy, he writes, “one doctrine or opinion seemed to me as good as another, so far, at least, as I was capable of penetrating into it,” (Goethe 1902, 182). Albert Schweitzer, usually even-handed in his attributions, writes, “Goethe borrows nothing from any of the philosophies with which he is in contact. Thanks, however, to his conscientious study of the thought of others, he attains an ever clearer grasp of his own ideas,” (Schweitzer 1949, 70).

Goethe’s way of reading was neither that of the scholar seeking out arguments to analyze nor that of the historian curious about the ideas of the great minds. No disciple of any particular philosopher or system, he instead borrows in a syncretic way from a number of different and even opposing thought systems in the construction of his Weltanschauung. And whenever particular subjects could not be put to practical use, Goethe’s attention quickly moved on. In a rather telling recollection, Goethe characterizes his philosophy lectures thusly, “At first I attended my lectures assiduously and faithfully, but the philosophy would not enlighten me at all. In logic it seemed strange to me that I had so to tear asunder, isolate, and, as it were, destroy, those operations of the mind which I had performed with the greatest ease from my youth upwards, and this in order to see into the right use of them. Of the thing itself, of the world, and of God, I thought I knew about as much as the professor himself; and, in more places than one, the affair seemed to me to come into a tremendous strait. Yet all went on in tolerable order till towards Shrovetide, when, in the neighborhood of Professor Winkler’s house on the Thomas Place, the most delicious fritters came hot out of the pan just at the hour of lecture,” (Goethe 1902, 205). Philosophy apparently held just slightly less interest than good pastry. Notwithstanding this estimation, indelible philosophical influences are nevertheless discernible.

For many intellectuals in Goethe’s generation, Rousseau (1712-78) represented the struggle against the Cartesian mechanistic world view. Rousseau’s elevation of the emotional and instinctual aspects of human subjectivity galvanized the traditional German Wanderlust into a far reaching cry to ‘return to nature’ in terms of a longing for pre-civilized society and pre-Enlightenment efforts to harmonize with rather than conquer nature. Goethe felt this unity with nature keenly in his Sturm und Drang period, something equally evident in Werther’s desire for aesthetic wholeness and in his emotional outbursts. From 1784 to 1804, there is a notable decline in enthusiasm for Rousseau’s privileging emotion over reason, though never an explicit rejection. Some scholars attribute this to Goethe’s participation in the sorts of civic bureaucracies that Rousseau so lamented in modern life. But it is clear that there are philosophical reasons besides these practical ones. Goethe’s classical turn in these years is marked by his view that the fullest life was one that balanced passion and duty, creativity and regulation. Only through the interplay of these oppositions, which Rousseau never came to recognize, could one attain classical perfection.

Although educated in a basically Leibnizian-Wolffian worldview, it was Spinoza (1632-77) from whom Goethe adopted the view that God is both immanent with the world and identical with it. While there is little to suggest direct influence on other aspects of his thought, there are certain curious similarities. Both think that ethics should consist in advice for influencing our characters and eventually to making us more perfect individuals. And both hold that happiness means an inner, almost stoically tranquil superiority over the ephemeral troubles of the world.

Kant (1724-1804) was doubtless the most famous living philosopher of Goethe’s youth. Yet Goethe only came to read him seriously in the late 1780s, and even then only with the help of Karl Reinhold (1757-1823). While he shared with Kant the rejection of externally imposed norms of ethical behavior, his reception was highly ambivalent. In a commemoration for Wieland (1773-1813) he asserts that the Kritik der reinen Vernunft (1781/7) is “a dungeon which restrains our free and joyous excursions into the field of experience.” Like Aristotle before him, Goethe felt the only proper starting point for philosophy was the direct experience of natural objects. Kant’s foray into the transcendental conditions of the possibility of such an experience seemed to him an unnecessary circumvention of precisely that which we are by nature equipped to undertake. The critique of reason was like a literary critique: both could only pale in value to the original creative activity. Concerning Kant’s Kritik der praktischen Vernunft (1788), Goethe was convinced that dicta of pure practical reason, no matter how convincing theoretically, had little power to transform character. Perhaps with Kant’s ethics in mind, he wrote, “Knowing is not enough; we must apply. Willing is not enough; we must do” On the other hand, a letter to Eckermann of April 11, 1827, indicates that he considers Kant to be the most eminent of modern philosophers. And he certainly appreciated Kant’s Kritik der Urteilskraft (1790) for having shown that nature and art each have their ends within themselves purposively rather than as final causes imposed from without.

Influenced in part by Herder’s conception of Einfühlen, Goethe formulated his own morphological method (see below). More the Kantian than Goethe, Herder’s belief in Über den Ursprung der Sprache (1772) that language could be explained naturalistically as a creative impulse within human development rather than a divine gift influenced Goethe’s theoretical work on poetry. And the trace of Herder’s claims about the equal worth of historical epochs and cultures can still be seen in the eclectic art collection in Goethe’s house on Weimar’s Frauenplan.

3. Scientific Background and Influence

 

Goethe considered his scientific contributions as important as his literary achievements. While few scholars since have shared that contention, there is no doubting the sheer range of Goethe’s scientific curiosity. In his youth, Goethe’s poetry and dramatic works featured the romantic belief in the ‘creative energy of nature’ and evidenced a certain fascination with alchemy. But court life in Weimar brought Goethe for the first time in contact with experts outside his literary comfort zone. His directorship of the silver-mine at nearby Ilmenau introduced him to a group of mineralogists from the Freiburg Mining Academy, led by Johann Carl Voigt (1752-1821). His 1784 discovery of the intermaxillary bone was a result of his study with Jena anatomist Justus Christian Loder (1753-1832). Increasingly fascinated by botany, he studied the pharmacological uses of plants under August Karl Batsch (1761-1802) at the University of Jena, and began an extensive collection of his own. He grew dissatisfied with the system of Linnaeus as an artificial taxonomy of plants, considering it “a shade of a great harmony, which one must study as a whole, otherwise each individual is a dead letter,” (Letter to Knebel, 17 November, 1784).

There is a passionate ambivalence about Goethe’s scientific reputation. He has alternately been received as a universal man of learning whose methods and intuitions have contributed positively to many aspects of scientific discourse, or else denounced as a dilettante incapable of understanding the figures— Linnaeus and Isaac Newton—against whom his work is a feeble attempt to revolt. Goethe’s scientific treatises were neglected by many in the nineteenth century as the amateurish efforts of an otherwise great poet, one who should have stayed within the arena that best suited him. Positivists of the early twentieth century virtually ignored him. Erich Heller claims Goethe “made no contribution to scientific progress or technique,” (Heller 1952, 7). On the other hand, some of the great scientific minds have expressed enthusiastic respect and even approval of Goethe’s contributions, among them Helmholtz, Einstein, and Planck (Cf. Stephenson 1995).

4. Morphology, Compensation, and Polarity

In Goethe’s day, the reigning systematic botanical theory in Europe was that of Carl Linnaeus (1707-1778). Plants were classified according to their relation to each other into species, genera, and kingdom. As an empirical method, Linnaeus’s taxonomy ordered external characteristics — size, number, and location of individual organs — as generic traits. The problem for Goethe was two-fold. Although effective as an organizational schema, it failed to distinguish organic from inorganic natural objects. And by concentrating only on the external characteristics of the plant, it ignored the inner development and transformation characteristic of living things generally. Goethe felt that the exposition of living objects required the same account of inner nature as it did for the account of the inner unity of a person.

Goethe believed that all living organisms bore an inner physiognomic ‘drive to formation’ or Bildungstrieb. In his “First Sketch of a General Introduction into Comparative Anatomy, Starting from Osteology” (1795), Goethe discussed a law binding the action of the Bildungstrieb, that “nothing can be added to one part without subtracting from another, and conversely,” (Goethe 1961-3, 17: 237). This notion of ‘compensation’ bears a likeness to the laws of vital force put forward by Johann Friedrich Blumenbach (1752-1840) and Carl Friedrich Kielmeyer (1765-1844) in the early 1790s. But whereas their versions dealt with the generation and corruption of living beings, Goethe sought the common limitations imposed on organic beings by external nature.

 

Whereas his earlier romanticism considered nature the raw material on which human emotions could be imparted, Goethe’s studies in botany, mineralogy, and anatomy revealed to him certain common patterns in the development and modifications of natural forms. The name he gave to this new manner of inquiry was ‘morphology’. No static concept, morphology underwent its own metamorphosis throughout Goethe’s career. Morphology is first named as such in Goethe’s notes of 1796. But he only fully lays out the position as an account of the form and transformation of organisms in the 1817 Zur Morphologie. He continued to publish articles in his journal “On Science in General, On Morphology in Particular” from 1817 to 1824. Goethe’s key contention here is that every living being undergoes change according to a compensatory dynamic between the successive stages of its development. In the plant, for example, this determination of each individual member by the whole arises insofar as every organ is built according to the same basic form. As he wrote to Herder on May 17, 1787:

It has become apparent to me that within the organ that we usually address as ‘leaf’ there lies hidden the true Proteus that can conceal and manifest itself in every shape. Any way you look at it, the plant is always only leaf, so inseparably joined with the future germ that one cannot think the one without the other. […]With this model and the key to it, one can then go on inventing plants forever that must follow lawfully; which, even if they don’t exist, still could exist…

Goethe’s morphology, in opposition to the static taxonomy of Linnaeus, studied these perceptible limitations not merely in order to classify plants in a tidy fashion, but as instances of natural generation for the sake of intuiting the inner working of nature itself, whole and entire. Since all organisms undergo a common succession of internal forms, we can intuitively uncover within these changes an imminent ideal of development, which Goethe names the ‘originary phenomenon’ or Urphänomen. These pure exemplars of the object in question are not some abstracted Platonic Idea of the timeless and unchanging essence of the thing, but “the final precipitate of all experiences and experiments, from which it can ever be isolated. Rather it reveals itself in a constant succession of manifestations,” (Goethe 1981, 13: 25). The Urphänomen thus offer a sort of “guiding thread through the labyrinth of diverse living forms,” (Goethe 1961-3, 17: 58), which thereby reveals the true unity of the forms of nature in contrast to the artificially static and lifeless images of Linneaus’ system. Through the careful study of natural objects in terms of their development, and in fact only in virtue of it, we are able to intuit morphologically the underlying pattern of what the organic object is and must become. “When, having something before me that has grown, I inquire after its genesis and measure the process as far back as I can, I become aware of a series of stages, which, though I cannot actually see them in succession, I can present to myself in memory as a kind of ideal whole,” (Goethe 1947ff, I/10: 131).

The morphological method is thus a combination of careful empirical observation and a deeper intuition into the idea that guides the pattern of changes over time as an organism interacts with its environment. Natural observation is the necessary first step of science; but because the senses can only attend to outer forms, a full account of the object also requires an intuition that apprehends an object with the ‘eyes of the mind’. Morphology reveals, “the laws of transformation according to which nature produces one part through another and achieves the most diversified forms through the modification of a single organ,” (Goethe 1961-3, 17: 22). While the visible transformations are apparent naturalistically, the inner laws by which they are necessary are not. They are, in Goethe’s word, dämonisch, apparent intuitively but unable to be explicated more concretely by means of the understanding.

Whereas Linneaus’ taxonomy only considered the sensible qualities of the object, Goethe believed a sufficient explanation must address that object in terms of organic wholeness and development. To do that, the scientist needs to describe the progressive modification of a single part of an object as its modification over time relates to the whole of which it is the part. Considering the leaf as an example of this Urphänomen, Goethe traced its metamorphosis from a seed into the stem, then leaves, then flowers, and finally its stamen or pistil. This continuous development was described by Goethe as an ‘intensification’ or Steigerung of the original form.

The oppositional tension between the creative force and the compensatory limitations within all living things exemplifies the notion of ‘polarity’ or Polarität. In his 1790 essay, “The Metamorphosis of Plants,” Goethe represented the intensification of a plant as the result of the interaction between the nutritive forces of the plant and the organic form of the primal leaf. Polarity between a freely creative impulse and an objectively structuring law is what allows the productive restraint of pure creativity and at the same time the playfulness and innovation of formal rules. Polarity also plays a marked role in Goethe’s Farbenlehre (see below), as the principle of interplay between light and darkness out of which the Urphänomen of color is exhibited. “With light poise and counterpoise, nature oscillates within her prescribed limits, yet thus arise all the varieties and conditions of the phenomena which are presented to us in space and time,” (Goethe 1970, xxxix).

Goethe’s theories of morphology, polarity, and compensation each have their roots in his dramatic and poetic writings. But rather than a fanciful application of an aesthetic doctrine to the nature, Goethe believed that the creativity great artists, insofar as they are great, was a reflection of the purposiveness of nature. After all, “masterpieces were produced by man in accordance with the same true and natural laws as the masterpieces of nature,” (Goethe 1961-3, 11: 435–6). Goethe’s classicism features a similarly polarized intertwining of the unbridled creativity of the artistic drives and the formal rules of technique. As with a plant, the creative forces of life must be guided, trained, and restricted, so that in place of something wild and ungainly can stand a balanced structure which achieves, in both organic nature and in the work of art, its full intensification in beauty. As the work of the botanist is to trace the morphology of an individual according to an ideal Urphänomen, so does it fall to the classical author to intensify his characters within the contextualized polarity of the plot in a way simultaneously unique and yet typical. The early drafts of Torquato Tasso (begun in the 1780s), for example, reveal its protagonist as a veritable force of nature, pouring out torrential feelings upon a conservative and repressed external world. By the time of the published version in 1790, the Sturm und Drang character of Tasso is polarized against the aristocratically reposed and reasonable character of Antonio. Only in conjunction with Antonio can Tasso come into classical fullness and perfection. As the interplay of polarities in nature is the principle of natural wholeness, so is it the principle of equipoise in the classical drama. Polarities are also visible in Wilhelm Meister’s Lehrjahr (1795-6). Again in marked contrast to an earlier version of the text, in the final version Wilhelm’s romantic love of art and theatre is now just one piece of his coming-into-himself, which requires its polar opposite: the restraint inculcated within a conservatively aristocratic society. Only from the polarized tension does his drive to self-formation achieve intensification and eventually classical perfection.

5. Theory of Colors

“As to what I have done as a poet… I take no pride in it… but that in my century I am the only person who knows the truth in the difficult science of colors – of that, I say, I am not a little proud, and here I have a consciousness of a superiority to many,” (Goethe 1930, 302). Coming from the preeminent literary figure of his age, Goethe’s remarkable statement reveals to what extent he considered the Farbenlehre (1810) his life’s true work. At the same time, it was the source of perhaps his greatest disappointment. Like his work on morphology, his theory of colors fell on mostly deaf ears.

As his morphology targeted the system of Linnaeus, Goethe’s Farbenlehre challenged what was then and among the general public still remains the leading view of optics, that of Isaac Newton (1642-1727). However, most of Goethe’s vitriol was not directed at Newton himself, but the dismissive attitudes of his adherents, who would not so much as entertain the possibility that their conceptual framework was inadequate. He compares Newton’s optics, “to an old castle, which was at first constructed by its architect with youthful precipitation […] The same system was pursued by his successors and heirs: their increased wants within, and harassing vigilance of their opponents without, and various accidents compelled them in some place to build nearby, in others in connection with the fabric, and thus to extend the original plan,” (Goethe 1970, xlii). Thus, while Goethe esteems Newton as a redoubtable genius, his issue is with those half-witted apologists who effectively corrupted that very same edifice they fought to defend. His aim is accordingly to, “dismantle it from gable and roof downwards; so that the sun may at last shine into the old nest of rats and owls…” (Goethe 1970, xliii).

As was the case with Linnaeus, Goethe’s guiding criticism of Newton concerned his ostensibly artificial method. Through Newton’s famous experiments with prismatic phenomenon, he discovered that pure light already contained within itself all the colors available to the human visual spectrum. The refraction of pure white light projected at a prism produces the seven individual colors. Pragmatically, this allowed Newton to quantify the angular bending of light beams and to predict which colors would be produced at a given frequency. That frequency could be calculated simply by accounting for the distance between the light source and the prism and again the distance from the prism to the surface upon which the color was projected.

But by reducing the thing itself to its perceptible qualities, the Newtonians had made a grave methodological mistake. The derivative colors produced by the prismatic experiments are identified with the spectrum that appears in the natural world. But since the light has been artificially manipulated to fit the constraints of the experiment, there is no prima facie reason to think that natural light would feature the same qualities. Sending a beam of light through a turbid prismatic medium ─ one among a nearly infinite variety of media ─ produced a reliably quantifiable set of results, but by no means either the only or even an obviously preferable set. In Goethe’s words, “[Newton] commits the error of taking as his premise a single phenomenon, artificial at that, building a hypothesis on it, and attempting to explain with it the most numerous and unlimited phenomena,” (Goethe 1981, 13: 50).

Goethe’s alternative relies upon his ideas of morphology and polarity. Just as the study of a plant had to proceed from the empirical observation of a great variety of particulars in order to intuit the Urphänomen that was common to all of them, so too should a Farbenlehre proceed from as great a variety of natural observations as possible. Whereas Newton universalizes from a controlled and artificial experiment, Goethe thinks “[i]t is useless to attempt to express the nature of a thing abstractedly. Effects we can perceive, and a complete history of those effects would, in fact, sufficiently define the nature of the thing itself. We should try in vain to describe a man’s character, but let his acts be collected and an idea of the character will be presented to us. The colors are acts of lights; its active and passive modifications: thus considered we may expect from them some explanation respecting life itself,” (Goethe 1970, xxxvii). These ‘acts’ of light reveal the same coordinate tension found in the rest of polarized nature. A light beam is no static thing with a substantial ontological status, but an oppositional tension that we perceive only relationally. Through careful observation of their interplay alone do we apprehend color. As defined by Goethe, “color is an elementary phenomenon in nature adapted to the sense of vision; a phenomenon which, like all others, exhibits itself by separation and contrast, by commixture and union, by augmentation and neutralization, by communication and dissolution: under these general terms its nature may be best comprehended,” (Goethe 1970, liv). Color arises from the polarity of light and darkness. Darkness is not the absence of light, as both Newton and most contemporary theorists believe, but its essential antipode, and thereby an integral part of color.

Through a series of experiments on his thesis that color is really the interplay of light and dark, Goethe discovered a peculiarity that seemed to confute the Newtonian system. If Newton is right that color is the result of dividing pure light, then there should be only one possible order to the spectrum, according to the frequency of the divided light. But there are clearly two ways to produce a color spectrum: with a light beam projected in a dark room, and with a shadow projected within a lighted room. Something bright, seen through something turbid, appears yellow. If the turbidity of the medium gradually increases, then what had appeared as yellow passes over into yellowish-red and eventually into bright-red as its frequency proportionally decreases. Something dark, seen through something turbid, appears blue; with a decreasing turbity, it appears violet. The color produced also depends upon the color of the material on which the light or shadow is cast. If a white light is projected above a dark boundary, the light extends a blue-violet edge into the dark area. A shadow projected above a light boundary, on the other hand, yields a red-yellow edge. When the distances between the projection and the surface are increased, the boundaries will eventually overlap. Done in a lighted room, the result of the overlap is green. The same procedure conducted in a dark room, however, produces magenta. If Newton was correct that only the bending of the light beam affects the given color, then neither the relative brightness of the room, the color of the background, nor the introduction of shadow should have altered the resultant color.

Reversing the artificial conditions of Newton’s original experiment, Goethe reformulated the problem of color to account for the role of both the observer and his or her context. Alongside the physical issues involved with optics, Goethe thus also realized the aesthetic conditions in the human experience of color. The perceptual capacities of the brain and eye, and their situatedness in a real world of real experience must be considered essential conditions of how colors could be seen. But while his observations of the double color-spectrum are intriguing, Goethe’s physiognomic speculations as to how the subject renders perceptual experience are, even by his contemporary standards, quite amateur. His reification of darkness, moreover, remains difficult to conceptualize coherently, much less to accept.

Although almost entirely ignored in his own time, and even undermined by his once and former collaborator, Schopenhauer, Goethe’s theory did win some later acclaim. His call to recognize the role of the subject in the perception of color does have positive echoes in the neo-Kantian theories of perception of Lange, Helmholtz, and Boscovich. Traces can also be found in twentieth century thinkers as divergent as Wittgenstein and Merleau-Ponty. Despite the fact that almost no serious thinker has ever counted themselves a strict adherent of Goethe’s Farbenlehre, the theory has had a remarkable persistence. Part of the explanation for this may be the obvious superiority of Goethe’s prose; his text is one of very few scientific treatises that can be read by amateurs with pleasure. Part is also due to decline of Newtonian physics generally.

6. Philosophical Influence

 

 

Goethe’s general influence on European culture is gargantuan. In 19th century Germany alone, authors like Heine, Novalis, Jean Paul, Tieck, Hoffman, and Eichendorff all owe tremendous debts to Götz and Werther. Thomas Carlyle, Ralph Waldo Emerson, Mark Twain, Kurt Tucholsky, Thomas Mann, James Joyce and too many others to name have since paid tribute to the master from Weimar. Composers like Mozart, Liszt, and Mahler dedicated works to Goethe’s drama, while Beethoven himself mused that the greatest musical accomplishment possible would be a perfect musical expression Faust. Goethe’s ideas have truly launched a thousand ships upon their cultural and intellectual expeditions. Philosophically, the lineage is comparatively more defined.

In his mature years, Goethe was to witness the philosophical focus in Germany shift from Kant to the Idealists. But by the early 1800s, Goethe was too convinced of the worth of his own ideas to be much influenced by what he considered philosophical fashions. Despite his proximity to and considerable influence at the University of Jena, Goethe had little positive contact with Fichte (1762-1814), who arrived there in 1794. Neither Fichte’s Pecksniffian sermonizing nor nearly illegible compositional style would have endeared him personally to the poet. Goethe’s more ambivalent attitude toward Schelling (1775-1854) vacillated between an approval of his appreciation for the deep mysteriousness of nature and an aversion to his futile attempt to solve it by means of an abstracted and artificial system. Schelling’s Naturphilosophie, like Goethe’s morphology, views nature as a constant organic development. But where Goethe saw polarity as an essential part of growth, Schelling understood dualities generally as something to be overcome in the intuition of the ‘absolute’.

Goethe’s relationship with Hegel (1770-1831) was both more direct and more influential. Most overtly, Hegel’s logic draws upon Goethe’s conception of metamorphosis. A letter from Hegel to Goethe on February 20, 1821 reads:

The simple and abstract, what you quite aptly call the archetypal phenomenon, this you put first, and then show the concrete phenomena as arising through the participation of still other influences and circumstances, and you direct the whole process in such a way that the sequence proceeds from the simple determining factors to the composite ones, and, thus arranged, something complex appears in all its clarity through this decomposition. To seek out the archetypal phenomenon, to free it from other extraneous chance surroundings — to grasp it abstractly, as we call it — this I consider to be a task for a great spiritual sense for nature, just as I consider that procedure altogether to be what is truly scientific in gaining knowledge in this field.

For Hegel, famously, a natural object has achieved its greatest perfection when it brings forth its full implicit content in explicit conceptual representation. Because the intellectual world ranks higher than the material, a phenomenology of the whole must observe the gradual unfolding of all possible logical forms from mere sense certainty through the self-recognition of consciousness to absolute knowing. To no small degree, Hegel’s criticism of Kant’s lifeless schematism of the understanding was foreshadowed by Goethe, who wrote, “Reason has to do with becoming, understanding with what has become. The former does not bother with the question, ‘what use?’; the latter does not ask ‘whence?’. Reason takes pleasure in development; understanding wishes to hold everything fixed so that it can exploit it,” (Goethe 1907, 555). Hegel’s formulation of Begriff, which designates the inner plan of the development of an object, was not wholly unlike Goethe’s Urphänomen (see below). The Hegelian dialectic, as an unveiling the movement of the concept would then correspond to the morphology. The problem, for Goethe, was that Hegel’s attempt to articulate wholeness began by the analysis of the logical concept of Being in the Logik and by the sublimation of the sense-certain observation of natural objects in the Phänomenologie, which for Goethe unjustifiably overlooks precisely that which it was the task of science to understand: the development of the natural forms of life, of which the mind is certainly a central one, but indeed only one example. As Goethe writes in a letter to Soret on February 13, 1829, “Nature is always true, always serious, always severe; it is always right, and mistakes and errors are always the work of men.” Similar to his critique of Kant, then, Goethe accused Hegel of creating a grand and abstract system to explain a phenomenon which in both ordinary life and in scientific observation could simply be assumed. Nature presents itself to the epistemologically reflective and to the naïve equally and without preference.

Arthur Schopenhauer’s (1788-1860) mother Johanna became fast friends with Goethe and his lover Christiane Vulpius when she moved to Weimra in 1804. His sister Adele was the lifelong confident of Ottile Pogwisch, who married Goethe’s and Christiane’s son Auguste. But for the young Arthur, due in part to an unavoidable clash of personalities, the established Goethe had little patience. Goethe recognized his intelligence early on, but declined to provide him a letter of recommendation to the university at Göttingen and offered him only a tepid letter of introduction to the classicist Friedrich August Wolf in Berlin. Schopenhauer’s dissertation, however, interested Goethe very much. In the winter of 1813-4, Goethe and Schopenhauer were engaged in extensive philosophical conversation concerning the former’s anti-Newtonian Farbenlehre (see below), out of which grew the latter’s Über das Sehen und die Farben in 1815. When Schopenhauer sent him the manuscript in the hopes of a recommendation, he grew impatient with the elder’s reticence to take his efforts sufficiently seriously. In truth, Schopenhauer’s work largely revealed Goethe’s as a failed attempt to overcome Newtonian visual theory, a fact which wounded Goethe deeply. Goethe followed Schopenhauer’s career with interest, however, and generally praised Die Welt als Wille und Vorstellung. It remains a question, though, whether Goethe ever read the book carefully since scant reference to its ideas can be found.

Like that of his Erzieher Schopenhauer, Nietzsche’s (1844-1900) relationship with Goethe’s thought was deeply ambivalent. Nietzsche often admired Goethe as emblematic of a healthy, fully-formed individual. Goethe is said to be “the last German for whom I feel reverence,” (Nietzsche, Twilight of the Idols, “Skirmishes of an Untimely Man,” section 51). Nietzsche’s early contention that the tragic age of culture began only with the fortuitous interaction of the Apollonian and Dionysian drives bears a similarity to Goethe’s classical understanding of art as a tensional polarity between the blindly creative will and the constraint of formal rules. Yet Nietzsche takes Goethe to task for having invested too much in Winckelmann’s attribution of ‘Heiterkeit’ to classical antiquity and thereby for having ignored its deeply irrational underside. Moreover, Nietzsche’s ontology, if indeed he had one, is like Goethe’s in its rejection of static atomic substances and in its attempt to conceive an intrinsically agonistic process of becoming as the true character of the world. Similar, too, to Goethe’s ‘intensification’ principle, Nietzsche’s notoriously ambiguous ‘Will to Power’ characterizes the dynamic process by which entities ‘become what they are’ by struggling against oppositional limitations that are at the same time the necessary condition for growth. Due to this shared ontological outlook, Goethe and Nietzsche both thought contemporary science was constricted by an outdated conception of substance and, as a result, mechanistic modes of explanation should be reformulated to account for the dynamic character of nature. Despite these commonalities, Nietzsche jettisoned Goethe’s Bildungstrieb for an overarching drive–not to expression or growth within formal constraint—but for overcoming, for power.

 

Finally, Wittgenstein’s (1889-1951) claim that things which cannot be put into propositional form might nevertheless be shown bears a family resemblance to Goethe’s formulation of the daimonisch. But where Wittgenstein removes the proverbial ladder on which he ascends to his intuitions about the relation between logic and the world, thereby reducing what cannot be bound by the rules of logic as nonsensical, Goethe believed he could communicate what were admittedly ineffable Urphänomene in a non-propositional way, through the feelings evoked by drama. There is, moreover, a distinct similarity in Goethe’s and Wittgenstein’s views on the proper task of philosophy. Its aim, for both, can never be accomplished, once and for all, by means of ‘the right argument’. Argumentation, explanation, and demonstration only go so far in their attempt to unravel the mysteries of the world. “Philosophy simply puts everything before us; it fails to deduce anything,” (Wittgenstein, Philosophical Investigations, 126).

Philosophy’s role in our life should guide us to be reflective people, ever ready to critique inherited dogmas, and always ready to revise our hypotheses in light of new observations. Goethe, through his ceaseless energy, limitless fascination with the world as it was presented to him, and his perpetual willingness to test his convictions against new evidence, carries a timeless appeal to philosophers, not because he demonstrated or explained what it meant to live philosophically, but because, through the example of the course of his life, he showed it.

7. References and Further Reading

a. Primary Sources

i. German Editions of Goethe’s Works

  • Akademie-Ausgabe: Werke, edited under the Institut für Deutsche Sprache und Literatur der Deutschen Akademie der Wissenschaften zu Berlin (Berlin: Akademie-Verlag, 1952ff).
  • Berliner Ausgabe: Poetische Werke. Kunsttheoretische Schriften und Übersetzungen, edited by the Bearbeiter-Kollektiv unter Leitung von Siegfried Seidel et al., 22 Volumes (Berlin/Weimar: Aufbau-Verlag, 1965-78).
  • Die Schriften zur Naturwissenschaft, edited by Kuhn et al. (Weimar: Deutschen Akademie der Naturforscher, 1947ff).
  • DTV-Gesamtausgabe: Sämtliche Werke: Nach den Texten der Gedenkausgabe des Artemis-Verlages, edited by Peter Boerner, 45 Volumes (München: Deutscher Taschenbuch Verlag, 1961-63).
  • Frankfurter Ausgabe: Sämtliche Werke. Briefe, Tagebücher und Gespräche, edited by Dieter Borchmeyer et al., 40 volumes in 2 divisions (Frankfurt a. M.: Deutscher Klassiker Verlag, 1985ff.).
  • Hamburger Ausgabe: Werke Hamburger Ausgabe in 14 Bänden, edited by Erich Trunz (Hamburg: Chr. Wegner, 1948-60; Reprinted, C. H. Beck, 1981).
  • Maximen und Reflexionen, edited by Max Hecker (Weimar: Schriften der Goethe Gesellschaft, 1907).
  • Münchner Ausgabe: Sämtliche Werke nach Epochen seines Schaffens, edited by Karl Richter et al., 20 volumes (München: C. Hanser, 1985-1998).
  • Weimarer Ausgabe (Sophienausgabe): Goethes Werke, edited under the sponsorship of Großherzogin Sophie von Sachsen, 143 Volumes in 4 divisions (Weimar: H. Böhlau, 1887-1919; Reprinted München: Deutscher Taschenbuch Verlag, 1987).
  • ii. Letters and Conversations

ii. Letters and Conversations

  • Eckermann, J.P., Gespräche mit Goethe in den letzten Jahren seines Lebens: 1823-1832, 3 Volumes (Leipzig: Geiger, 1836-1848).
  • Goethes Briefe: Hamburger Ausgabe, edited by Karl Robert Mandelkow, 4 Volumes (Hamburg, 1962-67 [Post-1972 Publication Site: München: Beck, 1972ff.).
  • Goethe: Begegnungen und Gespräche, edited by Ernst und Renate Grumach, 14 Volumes (Berlin: De Gruyter, 1965-2011).

iii. English Translations of Goethe’s Works

  • Conversations of Goethe with Johann Peter Eckermann, translated by John Oxenford (London: J.M. Dent & Sons, 1930).
  • Theory of Colors, translated by C.L. Eastlake (Boston: MIT Press, 1970).
  • Truth and Fiction Relating to my Life, translated by John Oxenford (Boston: Simonds & Co., 1902).

b. Selected Secondary Scholarship

i. Historical and Philosophical Context

  • Bates, A. (ed.), The Drama: Its History, Literature and Influence on Civilization, 20 vols. (London: Historical Publishing Company, 1906).
  • Borchmeyer, D., Goethe: Der Zeitbürger (München/Wien: Hanser, 1999).
  • Boyle, N., Goethe: The Poet and the Age (Oxford: Oxford University Press, 1991).
  • Breithaupt, F., Jenseits der Bilder: Goethes Politik der Wahrnehmung (Freiburg im Breisgau: Rombach, 2000).
  • Breithaupt, F., et al. (eds.), Goethe and Wittgenstein: Seeing the World’s Unity in its Variety (Frankfurt a.M.: Peter Lang, 2003).
  • Bruford, W.H., Culture and Society in Classical Weimar: 1775-1806 (Cambridge: Cambridge University Press, 1962).
  • Cassirer, E., Goethe und die geschichtliche Welt (Repr. Hamburg: Meiner, 1932).
  • Hildebrandt, G., Goethes Naturerkenntnis (Hamburg: Stromverlag, 1949).
  • Heller, E., The Disinherited Mind: Essays in Modern German Literature and Thought (Harmondsworth: Penguin Books, 1952).
  • Hinderer, W., Goethe und das Zeitalter der Romantik (Würzburg: Königshausen & Neumann, 2002).
  • Hofman, P., Goethes Theologie (Paderborn: Schöningh, 2001).
  • Lauxtermann, P., Schopenhauer’s Broken World-View: Colours and Ethics between Kant and Goethe (Dordrecht: Kluwer, 2000).
  • Möckel, C., Anschaulichkeit des Wissens und kulturelle Sinnstiftung: Beiträge aus Lebensphilosophie, Phänomenologie und symbolischem Idealismus zu einer Goetheschen Fragestellung (Berlin: Logos, 2003).
  • Nicholls, A.J., Goethe’s Concept of the Daemonic: After the Ancients (Rochester, NY: Camden House, 2006).
  • Reed, T.J., Goethe (Oxford: Oxford University Press, 1984).
  • Richards, R.J., The Romantic Conception of Life: Science and Philosophy in the Age of Goethe (Chicago: University of Chicago Press, 2002).
  • Schweitzer, A., Goethe: Four Studies, edited and translated by Charles R. Joy (Boston: Beacon Press, 1949).
  • Simmel, G., “Goethe und die Jugend,” in Der Tag 395 [6] (August, 1914), translated by Ulrich Teucher and Thomas M. Kemple in Theory, Culture, Society 24 (2007): 85-90.
  • Stephenson, R.H., Studies in Weimar Classicism: Writing as Symbolic Form (Oxford: Peter Lang, 2010).
  • Tantillo, A.O., The Will to Create: Goethe’s Philosophy of Nature (Pittsburgh: University of Pittsburgh Press, 2002).
  • Weier, W., Idee und Wirklichkeit: Philosophie deutscher Dichtung (Paderborn: Schöningh, 2005).

ii. Science and Methodology

  • Breidbach, O., Goethes Metamorphosenlehre (München: Fink, 2006).
  • Burwick, F., The Damnation of Newton: Goethe’s Color Theory and Romantic Perception (Berlin, Walter de Gruyter, 1986).
  • Ciamarra, L.P., Goethe e la storia: studi sulla “Geschichte der Farbenlehre” (Napoli: Liguori, 2001).
  • Holland, J., German Romanticism and Science: The Procreative Poetics of Goethe, Novalis, and Ritter (New York: Routledge, 2009).
  • Jardine, N., Scenes of Inquiry: On the Reality of Questions in the Sciences (Oxford: Clarendon Press, 2000).
  • Jürgen, T., Hoffnung und Gefahr (Frankfurt a.M.: Suhrkamp, 2001).
  • Krätz, O., Goethe und die Naturwissenschaften (München: Callwey, 1992).
  • Moiso, F., Goethe: La Natura e le sue Forme (Milano: Mimesis, 2002).
  • Nisbet, H.B., Goethe and the Scientific Tradition (London: Institute of Germanic Studies, 1972).
  • Nussbaumer, I., Zur Farbenlehre: Entdeckung der unordentlichen Spektren (Wien: Ed. Splitter, 2008).
  • Richards, R.J., The Tragic Sense of Life: Ernst Haeckel and the Struggle over Evolutionary Thought (Chicago: University of Chicago Press, 2008).
  • Seamon, D., & Zajonic, A., Goethe’s Way of Science (Albany: SUNY Press, 1998).
  • Sepper, D.L., Goethe contra Newton: Polemics and the Project for a New Science of Color (Cambridge: Cambridge University Press, 2007).
  • Sherrington, C., Goethe on Nature and Science (Cambridge: Cambridge University Press, 1942).
  • Steigerwald, J., “Goethe’s Morphology: Ürphänomene and Aesthetic Appraisal,” Journal of the History of Biology 35 (2002): 291-328.
  • Stephenson, R.H., Goethe’s Conception of Knowledge and Science (Edinburgh: Edinburgh University Press, 1995).
  • Wells, G.A., Goethe and the Development of Science: 1750-1900 (Alphen aan den Rijn: Sijthoff & Noordhoff, 1978).

iii. Aesthetics, Politics, and Theology

  • Bell, M., The German Tradition of Psychology in Literature and Thought, 1700-1840 (Cambridge: Cambridge University Press, 2009).
  • Dönike, M., Pathos, Ausdruck und Bewegung: zur Ästhetik des Weimarer Klassizismus 1796 – 1806 (Berlin: Walter de Gruyter, 2005).
  • Fröschle, H., Goethes Verhältnis zur Romantik (Würzburg: Königshausen & Neumann, 2002).
  • Hibbitt, R., Dilettantism and its Values: from Weimar Classicism to the fin de siècle
  • (London: Legenda, 2006).
  • Kuhn, B.H., Autobiography and Natural Science in the Age of Romanticism: Rousseau, Goethe, Thoreau (Farnham/Surrey: Ashgate, 2009).
  • Oergel, M., Culture and Identity: Historicity in German Literature and Thought 1770 – 1815 (Berlin: Walter de Gruyter, 2006).

 

Author Information

Anthony K. Jensen
Email: anthony.jensen@lehman.cuny.edu
City University of New York / Lehman College
U. S. A.

Causal Role Theories of Functional Explanation

Functional explanations are a type of explanation offered in the natural and social sciences. In giving these explanations, researchers appeal to the functions that a structure or system has. For instance, a biologist might say, “The kidney has the function of eliminating waste products from the bloodstream.” Or a sociologist might say, “The purpose of monogamy is to preserve the family structure.” Each of these is concerned with a function that a structure or system is believed to possess. Philosophical interest in this issue concerns understanding what exactly these statements amount to, and whether they are explanatory. Of particular concern is whether such statements commit us to problematic views about the existence of teleology, or purposes, in nature, and whether this is legitimate in the sciences.

This article considers the debate over functional explanations in the philosophical literature from the 1950’s to the early 21st century. It begins by considering the background to philosophical interest in this subject. Then it looks at two prominent early approaches to functional explanation: Ernest Nagel’s deductive-nomological approach from the 1950’s, and Robert Cummins’ causal account from the 1970’s, as well as objections to both. Throughout there is consideration of illustrative examples of functional explanations from different sciences. Although there are other accounts of functional explanation in the literature, such as the evolutionary and design-oriented accounts, they will be mentioned only in relation to the other two. The broad history of the concept of teleology and the details of the other accounts will not be developed here.

Table of Contents

  1. Background
  2. Nagel’s Early Account
  3. Difficulties
  4. Cummins’ View
  5. An Example
  6. Objections and Replies
  7. Other Developments
  8. References and Further Reading
    1. References
    2. Suggested Reading

1. Background

Prior to the 1950’s, philosophers and scientists were concerned with appeals to teleological concepts in the sciences. These concepts have a history going back to Aristotle, who claimed that understanding such things as physical objects or an animal’s physiological structures involves knowing what they are for. In his view, a complete explanation of these structures requires understanding the purposes towards which they are directed. For instance, he thought that one does not understand what a kidney is merely by knowing the material it is made from; one also has to understand that it has the purpose of filtering blood (what it’s for). In Aristotle’s view purposes were conceived as unusual properties like “ends” or “final causes”. But after the scientific revolution these properties were hard to understand in a manner consistent with the sciences, and were considered to be obscure. At a later point this way of thinking was also believed by many to have become questionable because of Darwin’s theory of evolution, which held that talk of Aristotelian purposes was problematic. The idea was to replace talk of such purposes with reference to the notion of natural selection and the environment. The theory of evolution was taken by many scientists to show that appeals to purposive concepts in biology and the social sciences were merely misguided appeals to an outmoded form of explanation. However, the difficulty remained and scientists often continued through the 1900’s to use purposive language in explaining the phenomena that interested them. If appeals to purposive concepts were misguided, then one would have expected these appeals to wither away. So a problem remained in the scientific community over what this situation implied. There was concern about whether this continued talk of purposes was illegitimate in the sciences, or whether there was an acceptable way of understanding such language.

2. Nagel’s Early Account

Modern discussions of functional explanation usually begin with the views developed by Ernest Nagel and others around the 1950’s in the context of scientific explanation (Nagel 1961; Hempel 1959). In his book The Structure of Science, one of Nagel’s concerns was describing the various forms of explanation that occur in the sciences. When we consider the biological and social sciences, he observes, we often find researchers describing the structures or systems that concern them in terms of the functions they have. For instance, a biologist might say, “The heart has the function of pumping blood through the circulatory system.” Or a sociologist might say, “The purpose of the religious ritual is to increase cohesion in society.” Put generally, Nagel says these statements can be understood as stating that “the function of system X is to do Y.” He claims that there are several issues that such statements raise for further examination. We want to understand the detailed structure of such statements and how they work. We also want to understand how these statements are related to other kinds of explanations offered in the natural and social sciences.

Nagel claims that functional statements can be understood in the following manner. These statements are explanation sketches that when fully spelled out reduce to another form of explanation common in the physical sciences called the Deductive-Nomological model. Nagel’s account focuses on the attributions of functions to the components of a structure (or system). Consider the statement, “The heart has the function of pumping blood through the circulatory system.” When understood, this statement serves to explain why hearts are present in vertebrates. To see this, we have to note that hearts occur only together with a certain physical organization of the vertebrate body and in a certain external environment that the organism lives in. So, what the statement really says is that, in vertebrate bodies with an organization of blood and blood vessels and in a certain external environment, circulation occurs only if an organism has a heart. If we focus on this latter statement, Nagel says that the information in it can be expanded into a D-N explanation. When this is done, we then have the following explanation: Every vertebrate body with the appropriate organization and in a certain environment engages in circulation. If the vertebrate body does not have a heart, then it does not engage in circulation. Hence, the vertebrate body must have a heart.

This means functional statements in general have the following form for Nagel: The function of A in a system S with organization C is to enable S in environment E to engage in process P. And this can be expanded into an explicit explanation in this way: Every system S with organization C and in environment E engages in process P. If S with organization C and in environment E does not have A, then S does not engage in P. Hence, S with organization C must have A (1961, 403).

Understood this way, functional statements can be seen as explaining why components like hearts are present in certain organisms in which circulation occurs. For what such statements say is that for organisms having a heart is a necessary condition for pumping blood in the circulatory system. Nagel says this means the statement “The heart has the function of pumping blood through the circulatory system” says the same thing as the statement “Organisms in which circulation occurs pump blood only if they have a heart.” Because of the equivalence, he writes that it appears that “when a function is ascribed to a constituent element in an organism, the content of the teleological [functional] statement is fully conveyed by another statement that is not explicitly teleological and that simply asserts a necessary . . . condition for the occurrence of a certain trait or activity of the organism” (1961, 405).

There are two features of Nagel’s account worth noting at this point. First, he holds that when we attribute a function to a component, this is always relative to a goal state of the larger system. Researchers are not interested in merely any effects of the components a system has. Rather, they are interested in certain effects that are important to the maintenance of the organism. Biologists, for example, are interested in understanding the heart’s pumping blood because this effect contributes to the activity of circulation, which is important for the organism’s survival. It is important to note that for Nagel talk of goals in this context is not intended to make reference to conscious entities of some kind. It only concerns the characteristic activities that researchers believe to be present in the systems that concern them (for example, survival and reproduction).

Second, we saw that functional statements that mention the functions of components can be translated into equivalent statements that lack these notions. Nagel claims this shows there’s nothing scientifically suspect about these statements. We shouldn’t think that such statements commit us to problematic entities like “purposes” in nature that somehow attach to components, or to “ends” or “final causes” towards which components are believed to strive (as in Aristotle’s view of teleology). Instead, what the account makes clear is that, in attributing a function to a component, researchers are merely concerned with explaining why the component is present in the larger system. In light of this, Nagel suggests that functional statements are unproblematic despite their earlier associations in history with unacceptable teleological notions. Understood properly he thinks these statements should be seen as belonging with other legitimate forms of explanation in the sciences (on this point compare Ayala 1970).

3. Difficulties

There are two difficulties commonly noted with the account presented. The first concerns the reference to goals in identifying the effects of the components that are the targets of explanation. As we have seen for Nagel, the function of a component is identified in relation to the effects that contribute towards the important activities of an organism, like survival and reproduction. But this is problematic for some (Cummins 1975) since it suggests there are components that researchers might want to say have functions, but with this account do not. For instance, let us suppose that the wings of a particular species of bird stopped contributing to the capacity for survival and reproduction (maybe this is due to a specific type of airborne predator). In this situation, it seems researchers would not say that the wings did not have any function to perform; they would still want to understand how the wings contributed to the birds’ capacity for flight. The present account would classify these structures as lacking functions when some would say they have functions.

The second problem concerns the sense in which functional statements explain the presence of the components involved. In Nagel’s view, we explain the presence of a component by inferring that it is necessary for the performance of a capacity of a system. In the example considered, the heart is said to be a necessary condition for the occurrence of blood pumping in vertebrates. But it has been held false by some (Wright 1973; Cummins 1975) that having a heart is a necessary condition for pumping blood. For surely there are functionally equivalent structures like artificial hearts that are capable of pumping blood through the circulatory system, or circulation might even be achieved in other ways. If this is the case, however, then functional statements cannot be interpreted as explaining why certain structures must be present in an organism (because they needn’t be). Nagel knew of this concern and said his interest was with actual living systems and what they include (as opposed to logical possibilities). But it is not clear this resolves the worry since we find components like artificial hearts in actual living systems (Buller 1999). Aside from this issue, there are believed to be general problems with the framework of Deductive-Nomological explanation that lies behind Nagel’s account of functional explanation, which have raised concerns with his approach. It is no longer accepted by everyone, for example, that scientific explanations have to be seen as involving deductions from universal, law like statements as required with the Deductive-Nomological model of explanation. The result of this was that people became less inclined to think that an account of functional explanation had to be incorporated into the particular explanatory framework Nagel employed. This was important in making people receptive to the alternative accounts that were being developed.

4. Cummins’ View

After Nagel the most prominent view developed along these lines was by Robert Cummins (1975, 1983). Cummins agrees that the previous account suffers from the problems that were described. The point of functional statements is not to explain why certain components are present in a structure in relation to some goal state the structure has. Instead, Cummins suggests that functional statements are merely used to explain the contributions made by components of a structure to a capacity of a containing system. The performance of a capacity of a system is explained in terms of the capacities of the components it contains, and how they are organized. Consider researchers’ interest in explaining what it is for a system S to have property P. Cummins writes that “the natural strategy for answering such a question is to construct an analysis of S that explains S’s possession of P by appeal to the properties of S’s components and their mode of organization” (1983, 15). For example, let us suppose that researchers are interested in understanding how circulation occurs in vertebrates. To explain this capacity, they search for the structure in the body that contributes to this capacity by moving the blood around. They observe that blood is moved through the arteries by some sort of pumping motion. When they learn that the heart serves as the pumping mechanism in the body, they identify this with its function (they report “the heart has the function of pumping blood”). The capacity for circulation is thus explained in terms of the capacities of the components of the system that enable it to perform the task.

There are different stages that are involved in giving the explanation. Researchers begin with a specification of the larger function of the system they want to explain. The explanation then consists in showing how the function depends on the capacities of the components of the system, and their organization. There are two stages that are involved in this process. The first stage is to analyze the function in question in terms of the capacities involved in bringing about the function, which Cummins calls the analytical strategy. He says “The Analytical Strategy proceeds by analyzing a disposition [function] into a number of other relatively less problematic dispositions such that [the] organized manifestation of these analyzing dispositions amounts to a manifestation of the analyzed disposition” (1977, 272). The second stage is to show that there is a physical structure present that realizes the various capacities. This is needed to show that the function is, in fact, realized by an actual structure or mechanism. As Cummins explains, “Ultimately . . . a complete property theory for a dispositional property must exhibit the details of the target property’s instantiation in the system (or system type) that has it. Analysis of the disposition . . . is only the first step; instantiation is the second” (1983, 31). While the two stages are independent of one another, it is important to note that both stages are needed for the explanation to be complete. This is worth keeping in mind because one sometimes finds discussions in the literature that focus merely on one stage of the explanation and not the other.

Cummins notes that the explanation can be iterated by applying it to the functions of the components cited in the earlier explanation. This process can be repeated until researchers are satisfied with the level reached, or when they reach a level of physical components where no further explanation can be given. In practice, where this line is drawn is relative to the particular interests the researcher has.

Understood this way, functional statements are not used, as Nagel says, in explaining why certain components are present in a system, but to explain how a component contributes to the capacity of the system that contains it. This concern with the causal organization of components is why Cummins’ account is referred to as the “causal role” account of functional explanation. Furthermore, there is no requirement with the account that the larger function being explained be related to the organism’s survival and reproduction (or similar activity). This means the goal state requirement of the previous account has been dropped. In this respect, Cummins’ account can be seen as broadening the number of capacities of systems that are the appropriate subjects of explanation. The broader applicability of the account is seen by many to be a benefit of the view, but we will see later that for some it is thought to raise problems.

5. An Example

With Cummins’ account, the same form of explanation is said to be applicable to a range of different structures or systems in the sciences. The account applies to the functions of structures like the heart in physiology, but it can also be applied to other kinds of systems, including chemical systems, psychological systems, social systems, and others. We can illustrate this with an example from psychology. Consider the explanation of color vision in the human system (Dawson 1998, 163). The function of color vision consists in the capacity (F) to perceive information about the colors of objects in the environment. The trichromatic theory of color vision provides an explanation of how this works. It holds that the function is performed in virtue of the capacities (C1a, C2a, . . . Cna) of the components of the visual system (S1a, S2a, . . . Sna), and the way they are organized. In particular, the function depends on the capacities of the parts of the eye to produce differential responses to wavelengths of light. Researchers have learned that when light falls on the retina there are three kinds of cone cells there containing photopigments (red, green, and blue) that respond differently to the different wavelengths of light present. These responses are combined together in the retina through cellular connections and produce a distinctive response signal in the nervous system. The signal is sent to the visual cortex in the brain, and leads to the color perceptions we have. In turn, the subcapacities of the individual pigments in the cones (say C1a) have themselves been explained in terms of the capacities (C1b, C2b, . . . Cnb) of the molecular components (S1b, S2b, . . . Snb) of the photopigments and how they respond to light (for example, the capacities of vitamin A and various proteins to change when exposed to light). In this way, the function of color vision has been decomposed into the various capacities of the anatomical components within the eye and nervous system that underlie the function.

In this respect functional explanation is intended to be a general strategy of explanation that can be applied in different sciences. What is needed is for researchers to identify a capacity of the system they want to explain, and then describe how this occurs as a result of the organized behavior of the components which make up the system. In pursuing this strategy, researchers can be seen as undertaking a kind of mechanical analysis in attempting to explain the behaviors of the system that interest them. It is, in part, the broad applicability of the account to different sciences that has made philosophers interested in understanding its details and implications. One will find further illustrations of the account by looking at discussions of functionalism in the philosophy of mind. This will reveal how the account has been used in theories in other areas.

6. Objections and Replies

There are several objections that have been made to the causal role account. Here we will consider four of the common ones. The first is the concern that too many kinds of components can be ascribed functions that on their face don’t seem to have functions. Consider a bit of dirt that has become lodged in a pipe and which operates as a one-way valve (Griffiths 1993). This material can be seen as contributing to the capacity of the pipe to control the flow of liquid, and so can be part of a functional explanation in line with the account. It seems odd to attribute a function to the dirt in this situation, but it becomes a possibility once we have dropped the notion of a goal state from the account. Once this occurs, it seems any effect of a structure that contributes to a capacity can be used in an explanation.

A second objection concerns the various kinds of things that we know objects like hearts are capable of doing. Millikan (1989a) claims that objects like hearts not only move blood through the circulatory system, but also make a thumping noise that doctors can listen to. Making a noise is an effect of the structure that can be explained in terms of the account presented before. But while biologists take the function of the heart to be the circulation of blood, they do not say that making thumping noises is. So the account seems too liberal since it fails to distinguish between genuine functions and mere side effects of the systems.

In reply to these sorts of concerns, Cummins (1975) argues that there is no objective way of making the distinction between genuine functions and other effects. The effects of a component may be relevant to the explanation of different overall capacities. The limits on what capacities should be explained depend on the particular explanatory interests researchers have. Relative to the capacity for blood circulation in the body, the heart can be said to function as a pumping mechanism; but relative to the capacity for making sounds, the heart can be said to function as a noise maker. There is no saying which of these counts as a genuine function in an absolute sense since researchers’ interests are what matter. This response raises issues about the nature of scientific explanations and whether these should be seen as objective, or whether it is sufficient merely to appeal to the interests researchers have. The answer to this depends on what one sees as the appropriate characteristics of a scientific explanation. In addition to making this point, Cummins also added a general restriction on the kinds of explanations that should be given. He said that the appropriateness of a functional explanation is related to the “interestingness” of the explanation being offered. An explanation counts as interesting when the component capacities appealed to in the explanation are less complex and different in kind from the larger capacity being explained. We can illustrate this with the piece of dirt which is lodged in the pipe. For example, the capacity of the dirt to obstruct the flow of liquid is neither less complex nor really different from the overall capacity of the pipe being explained, and so a functional explanation has no interest in this case. This response is intended to place limits on the functional explanations that are appropriate to make in these sorts of cases. But it has been perceived as being somewhat vague in describing the systems to which it applies.

This sort of concern has also been considered by Davies (2001), who argues that there are specific constraints on the effects that are appropriate to use in an explanation. He suggests we can supplement the account offered by recognizing that functions are appropriately attributed, not just to any components, but to those in a hierarchically organized system. A system is hierarchically organized just when the function is performed in virtue of the lower-level organization of the system in question. In this view, the effects of a component are not functional unless they are due to the specific hierarchical organization of the structure. For example, aside from pumping blood the heart beat has the effect of vibrating the sternum in the chest. But this effect does not contribute anything to a larger capacity of the circulatory system, or to other related capacities of the organism. Therefore, there is no reason to accept this as a genuine function of the component in question.

A third problem concerns the character of some of the components to which researchers ascribe functions. Neander (1991; see Millikan 1989b) claims that researchers in sciences like biology commonly refer to components that may be diseased and malformed as having functions. Due to congenital disease, for example, a heart may lack the parts necessary to pump blood around the circulatory system in an organism and thus not work. This presents a problem because here the component will be unable to perform its causal role, and lack the function as a result. Despite this fact, Neander claims that, in this situation, researchers still classify the components functionally as being hearts. She claims this shows we need another notion of function independent of the causal role notion. The evolutionary account she prefers is presently the main rival to the causal role account. It holds that functional explanations are a type of evolutionary explanation. Roughly, to say that a component has a function F is to say that F is an effect of the component that was selected for by natural selection in the past. So the heart has the function of pumping blood because hearts were selected for pumping blood in our ancestors, and this led to the present existence of hearts. With this view the idea is that natural selection confers on components functional roles they are supposed to perform despite their inability to perform their causal role.

Different replies have been offered to this objection. Cummins (1975) accepts that malfunctioning components do not have their causal role functions to perform. If a component is unable to perform its causal role, then this implies the loss of the relevant capacity. A similar sort of view has also been supported by Davies (2001, 176). He maintains that we should not classify components as being malfunctional in these cases. He says that components are wrongly classified this way on the basis of our prior experience of physically similar structures, which leads us to expect the structures will function as the other structures do. But, properly speaking, the structures should be classified as nonfunctional “because natural traits cannot malfunction.” This point is made in relation to a larger complaint he makes that the notion of function appealed to in evolutionary theories does not fit well with the ontologies of the natural sciences.

Another reply has also been offered by Amundson and Lauder (1994). They argue it is false that malfunctioning components can only be classified in terms of the evolutionary functions they perform. They claim that researchers in physiology commonly classify structures in terms of their homologous relationships to other structures, and independently of their functions. Homologies are defined as traits in different species that are similar due to common ancestry (a standard example is the forelimbs of humans and bats that are structurally similar). These traits can be identified in terms of such features as the physical similarity present, correspondence of parts, and other features. These criteria enable researchers to classify structures on the basis of their anatomical features alone. Since researchers can classify a malformed heart as a heart by appeal to its structural features, it is claimed that there is no problem for the account with structures that malfunction. Whether this response is correct depends on how classification works in the practice of researchers in physiology. This has been a subject of early 21st century controversy among philosophers working in this area.

The final problem concerns the role of functional statements in relation to what they explain. As was noted before, on the evolutionary account a functional statement is said to explain the presence of a component of a system. For instance, the statement “the heart functions to pump blood” serves to explain why hearts exist in certain organisms. The idea is that in the past, hearts that pumped blood contributed to the survival of an organism, which explains the present existence of hearts. So, the functional statement points in the direction of an evolutionary account for the rise of the traits which are subject to explanation. In contrast, the causal role account does not provide an explanation in this sense. Functional statements do not serve to explain the presence of the component, but explain the contribution of the component to a capacity of a particular system. This is not a matter of why the component exists in the system, but the task the component performs. The different accounts raise issues about the aims theories of functional explanation are believed to have and what needs explaining. The different perspectives people have taken on this topic are related to how the explanations are used by researchers in different areas of science, and the particular roles the explanations have in these areas.

7. Other Developments

So far, we have seen that the causal role account is applicable to different fields of science. Another issue is discussion over the exact fields it applies to, and how it relates to the other accounts mentioned. In this respect, it will help to describe the relation between the causal role and other accounts as they have been discussed in the biological sciences.

According to Neander (1991), there is a single notion of function used across different areas of biological research. The basic notion is the evolutionary notion we described before that is explained in terms of natural selection. The idea is that the heart functions to pump blood because pumping blood was the effect that hearts were selected for in the past, and which led to the present existence of hearts. Neander claims this is the basic notion at work when researchers appeal to the functions of a component and not the causal notion. In another vein, Kitcher (1993) argues that the concept of design can be seen to underlie all functional explanations. Roughly, to say that a component has a function F is to say that F is what the component was designed to do (the account allows there are different sources of design). He believes the notion applies in evolutionary contexts that involve a past selection process, as well as in physiological investigations that concern the current causal contributions of a component to a system’s capacity (in the latter case, Kitcher claims that a selection process is involved in an indirect way). In both accounts, appeals to functions can be unified under some general concept that applies across different areas of research.

Not everyone agrees, though, that the notion of function can be treated in this way. Godfrey-Smith (1993) argues that it is wrong to think there is a unified concept of function at work in different areas. He suggests there are distinct notions of function that are appropriate to different fields. The causal role notion is appropriate in physiological investigations where researchers are concerned with understanding how the capacities of a system depend on the capacities of its components. These investigations can be undertaken independently from historical considerations about selection. Alternatively, the evolutionary notion is appropriate in areas like evolution and behavioral ecology where researchers are interested in explaining why organisms have the structures and behaviors they have. In this context, the focus is on past selection pressures in the environment, and a historical approach is appropriate. So, in Godfrey-Smith’s view it is a mistake to think that we should be attempting to unify the various uses of functional language under a single account; we should rather accept pluralism. The different notions should merely be seen as reflecting the different kinds of information researchers are concerned with in different areas of investigation. This point can be related to the previous issue concerning the interest-relative character of the explanations researchers offer.

Not only are there concerns about the areas of research where the causal role notion is applicable, but there have been questions raised about similar notions in the vicinity. The causal notion is often thought to be the basic notion at work in physiology (this is controversial for some who hold the evolutionary or other account). Wouters (1995) argues, though, that there is another notion that has been neglected by philosophers in this area. He says that when researchers talk of functional explanations they are often referring to what he calls “viability explanations.” These have a different focus than explaining how a function depends on the capacities of the components of a system. In these cases, researchers are interested in explaining the traits that individuals need to survive and reproduce in their environments. For example, given the distance between the central organs and the outer periphery of the human body, the function of circulating oxygen cannot be performed by simple diffusion. This is why a circulatory system is needed for the performance of the function. This explanation contributes to our understanding by showing why the circulatory system has to be present for maintaining the viability of the organism. Wouters claims that this form of explanation is distinct from the previous accounts we described, and involves its own explanatory structure. Moreover, this explanation is said to share features with the earlier notion of functional explanation described by those like Nagel. In this respect, it is suggested that philosophers may have overlooked an important notion of functional explanation worthy of further examination.

There is a lot more that could be said about the subject of causal role functional explanation, and the debate about functions. We have not covered everything that might be considered on this issue. One thing that is clear from our discussion, though, is that a proper understanding of functional explanations cannot be achieved independently from considering how they are used in the sciences. Whether, and to what extent, the causal role notion is applicable in a particular area needs to be determined by examining the science in question. In this respect, those interested in furthering our understanding of the notion will have to familiarize themselves with the details of the examples being considered. An understanding of how functional explanations are used is an important part of helping us improve our understanding of this concept.

8. References and Further Reading

a. References

  • Amundson, R., & Lauder, G. (1994). “Function without purpose: the uses of causal role function in evolutionary biology.” Biology and Philosophy 9: 443-469.
  • Ariew, A., Cummins, R., & Perlman, M. (eds.) (2002). Functions: New Essays in the Philosophy of Psychology and Biology. Oxford: Oxford University Press.
  • Ayala, F. (1970). “Teleological explanations in evolutionary biology.” Philosophy of  Science 37: 1-15.
  • Block, N. (1980). “Introduction: what is functionalism?” In Block, N. (ed.) Readings in Philosophy of Psychology, vol. 1. Cambridge, MA: Harvard University Press.
  • Cummins, R. (1977). “Programs in the explanation of behavior.” Philosophy of Science 44: 269-287.
  • Cummins, R. (1983). The Nature of Psychological Explanation. Cambridge, MA: MIT Press.
  • Davies, P. (2001). Norms of Nature: Naturalism and the Nature of Functions. Cambridge, MA: MIT Press.
  • Dawson, M. (1998). Understanding Cognitive Science. Malden, MA: Blackwell.
  • Enç, B. & Adams, F. (1992). “Functions and goal directedness.” Philosophy of Science 59: 635-654.
  • Godfrey-Smith, P. (1993). “Functions: consensus without unity.” Pacific Philosophical Quarterly 74: 196-208.
  • Griffiths, P. E. (1993). “Functional analysis and proper functions.” British Journal for the Philosophy of Science 44: 409-422.
  • Hempel, C. G. (1959). “The logic of functional analysis.” In Gross, L. (ed.) Symposium on Sociological Theory. Evanston, IL: Harper and Row Publishers.
  • Hempel, C. G. & Oppenheim, P. (1948). “Studies in the logic of explanation.” Philosophy of Science 15: 135-175.
  • Kitcher, P. (1993). “Function and design.” Midwest Studies in Philosophy 18: 379-397.
  • McLaughlin, P. (2001). What Functions Explain: Functional Explanation and Self-Reproducing Systems. Cambridge: Cambridge University Press.
  • Millikan, R. G. (1989a). “An ambiguity in the notion of function.” Biology and Philosophy 4: 172-176.
  • Millikan, R. G. (1989b). “In defense of proper functions.” Philosophy of Science 56: 288- 302.
  • Nagel, E. (1961). The Structure of Science. Indianapolis, IN: Hackett.
  • Nagel, E. (1977). “Teleology revisited: goal-directed processes in biology.” Journal of Philosophy 74: 261-301.
  • Neander, K. (1991). “The teleological notion of ‘function’.” Australasian Journal of Philosophy 69 (4): 454-468.
  • Neander, K. (2002). “Types of traits: the importance of functional homologies.” In Ariew, A., Cummins, R., & Perlman, M. (eds.) (2002).
  • Polger, T. (2004). Natural Minds. Cambridge, MA: MIT Press.
  • Wouters, A. (1995). “Viability explanation.” Biology and Philosophy 10: 435-457.
  • Wright, L. (1973). “Functions.” Philosophical Review 82: 139-168.

b. Suggested Reading

  • Allen, C., Bekoff, M., & Lauder, G. (eds.) (1998). Nature’s Purposes: Analyses of Function and Design in Biology. Cambridge, MA: MIT Press.
  • Buller, D. (ed.) (1999). Function, Selection, and Design. Albany, NY: State University of New York Press.
  • Cummins, R. (1975). “Functional Analysis.” Journal of Philosophy 72: 741-765. Reprinted with minor alterations in Allen, Bekoff, & Lauder (1998) and Buller (1999).
  • Wouters, A. (2005). “The function debate in philosophy.” Acta Biotheoretica 53: 123-151.

 

Author Information

Mark B. Couch
Email: mark.couch@shu.edu
Seton Hall University
U. S. A.

The Third Earl of Shaftesbury (1671—1713)

ShaftesburyAnthony Ashley Cooper, the Third Earl of Shaftesbury (1671-1713) was an English philosopher who profoundly influenced 18th century thought in Britain, France, and Germany.  As a part of an important social circle of English Freethinkers along with early deists such as John Toland, Matthew Tindal, and Anthony Collins, Shaftesbury’s work had a significant influence on French deists such as Voltaire and Rousseau.  He also corresponded with some of the most important thinkers of his day, including Locke, Leibniz, and Bayle.  Shaftesbury was most influential in the history of English language philosophy through his concept of the moral sense which heavily influenced Hutcheson, Butler, Hume, and Adam Smith; and Shaftesbury was influential in Germany through his concept of enthusiasm which recovered (intuitive) reason from mere (discursive) reasoning and influenced the Romantic idea of the creative imagination as found in German thinkers such as Lessing, Mendelssohn, Goethe, Herder, and Schiller.

Although Shaftesbury was enormously influential in the 18th century, his prestige declined in the 20th century, primarily due to the rise of analytic philosophy which defined philosophy such that Shaftesbury’s work seemed more like literature or rhetoric than proper philosophy. Those trained in analytic philosophy continue to have trouble reading Shaftesbury, largely because he self-consciously rejects systematic philosophy and focuses more on rhetoric and literary persuasion than providing numbered premises.  Shaftesbury is interested as much in moral formation as he is in moral theorizing, though his work does contain some, albeit intentionally veiled, discussion of theoretical concerns.

As Shaftesbury saw it, Hobbes had set the agenda of British moral philosophy (a search for the grounding of universal moral principles), and Locke had established its method (empiricism).  Shaftesbury’s important contribution was to focus that agenda by showing what a satisfactory response to Hobbes might look like but without giving up too much of Locke’s method.  Shaftesbury showed the British moralists that if we think of moral goodness as analogous to beauty, then (even within a broadly empiricist framework) it is still possible for moral goodness to be non-arbitrarily grounded in objective features of the world and for the moral agent to be attracted to virtue for its own sake, not merely out of self-interest.

In his most influential works, Shaftesbury thinks of moral judgment as self-reflection. First we have motives, and then we reflect on those motives resulting in a feeling of moral approval or condemnation. The process is the same when evaluating other agents:  we reflect on their motives and feel approval or condemnation.  In Shaftesbury’s aesthetic language, the state of having the morally correct motives is the state of being “morally beautiful,” and the state of approving the morally correct motives upon reflection is the state of having “good moral taste.” Shaftesbury argues that the morally correct motives which constitute moral beauty turn out to be those motives which are aimed at the good of one’s society as a whole.  This good is understood teleologically.  Furthermore Shaftesbury argues that both the ability to know the good of one’s society and the reflective approval of the motivation toward this good are innate capacities which must nevertheless be developed by proper socialization.

Table of Contents

  1. Life
  2. Works
  3. Aims and Methods
  4. Major Themes
    1. Moral Realism
    2. Moral Beauty
    3. Moral Sense
    4. Personal Identity
  5. References and Further Reading
    1. Primary Texts
    2. Secondary Texts

1. Life

Shaftesbury was part of an important political family.  Shaftesbury’s grandfather, the First Earl of Shaftesbury (1621-1683), was an influential and controversial Whig politician.  The Whigs were the party in favor of the supremacy of parliament over the monarchy in England.  The opposing party, the Tories, supported the monarchy and also tended to support a hierarchical state-sponsored religion, either Anglicanism or Roman Catholicism.  The Whigs favored freedom of religion, supporting religious “Dissenters,” at first Puritans, Calvinists, Quakers, etc., and later Deists.  In 1679 the First Earl of Shaftesbury introduced a bill into Parliament attempting to exclude King Charles II’s brother James (later King James II) from the throne because James was Roman Catholic.  The bill failed, and the First Earl was eventually charged with treason and fled to Holland where he died in exile.

Shaftesbury was very close to his grandfather and revered the memory of the First Earl.  Because Shaftesbury’s father had an unidentified degenerative illness, Shaftesbury was raised in the household of the First Earl from the age of four.  After the First Earl’s disgrace, Shaftesbury made it one of his life’s goals to rehabilitate his family’s reputation. Though his votes in Parliament occasionally sided with the Tories, Shaftesbury always stayed true to the political principles of his grandfather, consistently fighting for religious tolerance and a balance of powers at both the national and international level (see Voitle, p. 73; cf. p. 414).

John Locke (1632-1704) was a close friend of the First Earl and an advisor to the family for years to come after the First Earl’s death.  Locke was the personal physician and general advisor to the First Earl.  He supervised the childhood medical care of Shaftesbury’s father, the Second Earl (1652-1699). He also helped find a wife for the Second Earl and he cared for her during  her pregnancy  with the Third Earl.  Most significantly for our purposes, Locke supervised the Third Earl’s education.  He personally chose Shaftesbury’s governess Elizabeth Birch and designed a curriculum for her to follow in her instruction of the child.  This experience was, presumably, the basis for Locke’s later work Thoughts Concerning Education.  Under Birch’s tutelage, Shaftesbury received a strong education in the Classics and became fluent in Greek and Latin by the age of eleven.  Locke continued to check on Shaftesbury’s progress over the years.  After the First Earl’s death when Shaftesbury was twelve years old, he attended Winchester College, a secondary school which at the time was dominated by Tory sentiment. Shaftesbury felt persecuted by his peers on account of the First Earl’s political reputation.

In 1687, at the age of 16, Shaftesbury began his two-year “Grand Tour” of Europe, a customary part of a British nobleman’s education during the period.  After an extended stay in Paris, Shaftesbury spent most of his tour in Italy where, based on his diary, he focused his attention on art and architecture.  He was especially interested in ruins from the classical Roman period.  The first stop on his tour, however, was Holland where Shaftesbury spent several months visiting John Locke.

After returning from his Grand Tour in 1689, Shaftesbury took over managing much of the family estates and interests from his bedridden father.  Shaftesbury also had to supervise the education of his brothers and the marriages of his sisters; oversee the family finances and investments; govern the family lands, a job which included adjudicating disputes among the tenants; and, in 1695, take his place as a member of Parliament in the House of Commons.  Locke served as a primary advisor to the young Shaftesbury as he found his footing in these new duties, though Shaftesbury did not always follow Locke’s advice.  Shaftesbury had many philosophical conversations with Locke, some of which are preserved in correspondence. During this time, Shaftesbury wrote his first philosophical works, An Inquiry Concerning Virtue or Merit and the “Preface” to his edition of Whichcote’s sermons.

The smoky, polluted air of London did not agree with Shaftesbury, and he developed what would become a life-long and eventually fatal respiratory disease. Shaftesbury was diagnosed with a form of asthma, though Voitle suggests that the evidence points toward tuberculosis (Voitle, p. 226).  In 1698, due to his health, he retired from public life and spent a year in Holland where he met important thinkers of the day including Pierre Bayle (1647-1706), who became a close friend, despite their philosophical disagreements.

While in retirement he also began keeping his philosophical journal, which he labeled Askemata (Greek: exercises), posthumously published under the title of Philosophical Regimen.  The Askemata reveals Shaftesbury as a disciple of the ancient Stoics, especially Epictetus and Marcus Aurelius.

After the death of his father, Shaftesbury inherited the title of Earl and felt obligated to return to Parliament in 1700 (this time in the House of Lords).  Yet his health continued to worsen, and he gradually spent more and more time in retirement, during which time he prepared his philosophical works for publication.  By the time his collected works appeared in 1711 as Characteristics of Men, Manners, Opinions, Times, Shaftesbury’s health was so poor he decided to move to Italy in search of a more hospitable climate. There he continued to write. He prepared a revised second edition of the Characteristics, reading the work aloud to see how it sounded and making changes mostly in style and grammar.  He also added important illustrations, an allegorical headpiece for each treatise.  Finally, he began a sequel to the Characteristics to be titled Second Characters. The warm, sulphurous air off the Bay of Naples did help Shaftesbury’s health, but not enough.  He died in 1714 without finishing his work.

2. Works

 

Shaftesbury’s first publication was a collection of sermons written by Cambridge Platonist Benjamin Whichcote.  His introduction to that volume praised Whichcote for maintaining the goodness of human nature and the existence of a natural impulse toward benevolence. This was in contrast to most other 17th century divines who followed secular thinkers in holding that self-interest is the only motive to action and were therefore required to ground moral motivation in the rewards and punishments of the afterlife.

Shaftesbury’s major work Characteristics of Men, Manners, Opinions, Times (first edition published 1711) is an anthology of five previously published essays, sometimes with substantial revisions: An Inquiry Concerning Virtue or Merit (1699); A Letter Concerning Enthusiasm (1708); Sensus Communis, An Essay on the Freedom of Wit and Humor (1709); The Moralists, A Philosophical Rhapsody (1709); and Soliloquy, or Advice to an Author (1710). Along with these earlier works, Shaftesbury appended five new chapters of Miscellaneous Reflections roughly corresponding to the five essays which attempt to bring some coherence to the collection by commenting on and qualifying Shaftesbury’s earlier views.

His other published philosophical works include A Notion of the Historical Draught or Tablature of the Judgment Hercules and A Letter Concerning Design, originally a set of instructions for a painting Shaftesbury had commissioned and a letter commenting on those instructions, both written in 1712.  Shaftesbury planned to include these works in a projected sequel to the Characteristics called Second Characters, but he died before the project could be completed.  The Notion was subsequently included in the posthumous 1714 edition of the Characteristics, while the Letter Concerning Design was also added in the 1732 edition.  These two late works were included, along with some of Shaftesbury’s unfinished works (including a dialogue called The Picture of Cebes and a treatise to be entitled Plastics, an Epistolary Excursion in the Original Progress and Power of Designatory Art), in Benjamin Rand’s attempted reconstruction of Second Characters, published in 1914.

Finally, we have some of Shaftesbury’s correspondence and private journals (the Askemata), albeit in an unreliable transcription reordered and edited by Rand, published in 1900 under the title The Life, Unpublished Letters, and Philosophical Regimen of Anthony, Earl of Shaftesbury

3. Aims and Methods

 

 

Shaftesbury is primarily known in the history of philosophy for two things.  To moral philosophers he is known as the father of moral sense theory in British moralism; and to philosophers of art, he is known as “the first great aesthetician that England produced” (Cassirer 1953, 166) whose work was seminal for the German Romantics. But neither of these is what Shaftesbury himself thought was most important about his work.  Shaftesbury was not simply working out an epistemology of ethics or an account of aesthetic experience.  He did examine both of these issues, but his more direct interest was in transforming British moral inquiry by synthesizing ethics and aesthetics.

To use Shaftesbury’s own terms, the chief aim of his work is to introduce the concepts of “moral beauty” and “moral taste” to eighteenth  century British society.  In the final chapter of the Characteristics Shaftesbury sums up his overall project: “It has been the main scope and principal end of these volumes to assert the reality of a beauty and charm in moral as well as natural subjects, and to demonstrate the reasonableness of a proportionate taste and determinate choice in life and manners” (Miscellaneous Reflections V.iii, 466). Earlier he claims he is “intent chiefly on this single point, to discover how we may to best advantage form within ourselves what in the polite world is called a relish or good taste” (Miscellaneous Reflections III.i, 404).  Therefore Shaftesbury’s goal is to show that not every preference is equally appropriate to human nature and that there is such a thing as “good taste” in art and morality.

Shaftesbury’s writing style is intrinsically related to his philosophical goals.  Since Shaftesbury’s major audience was genteel or “polite” society, he often writes in a playfully oblique and ironic rhetorical style.  He uses elaborate analogies and metaphors to entertain and disarm his readers but not necessarily to carry any great philosophical weight (the most significant and easily misunderstood being the famous analogy of moral judgment with sensation). Therefore modern readers must always keep in mind the looseness and lack of analytic rigor with which Shaftesbury approaches his material.  He is trying to provide powerful suggestions aimed at forming his readers’ moral sentiments rather than to give detailed arguments to establish a theoretical system.  In other words, Shaftesbury wants to help his readers actually develop good moral taste, not merely to theorize about it.

4. Major Themes

a. Moral Realism

Shaftesbury did not see himself as inventing a new synthesis of aesthetics and ethics.  Rather, he thought he was protecting a classical synthesis.  And Shaftesbury thought the primary threat to the classical notions of moral beauty and moral taste came from the empiricist philosophy of Hobbes and Locke.

The four elements of Hobbes’s view to which Shaftesbury objected were empiricism, mechanism, voluntarism, and egoism.  Empiricism rejected innate ideas of morality.  Instead, moral principles were understood as a result of subjective emotions whereas classical moral philosophy saw moral principles as deriving from the human being’s objective teleology.  The modern mechanistic physics, however, rejected natural teleology.  If there was to be a set of universal moral principles, it could not be grounded on universal human nature, but it must be, as voluntarism asserts, grounded on a sovereign will (either God’s or the human government’s) expressed in positive law.

Shaftesbury believed Hobbes had reduced morality to self-interest, and it is to this Hobbesian “skepticism” that Shaftesbury is responding.  But Shaftesbury also had another, more personal target.  As a child Shaftesbury had been tutored by John Locke, a personal advisor of  his grandfather.  But as  Shaftesbury grew up, he came to reject the moral skepticism he thought followed from Locke’s empiricism and voluntaristic divine command theory. According to Shaftesbury, many of those who rejected Hobbes’s political philosophy, including the “free writers” (read: deists) and Locke, nevertheless went “in the self-same tract” as Hobbes’s philosophy.  Shaftesbury goes on to argue that Locke was in fact more dangerous than Hobbes or the Deists because, unlike them, Locke had the reputation of “sincerity as a most zealous ‘Christian’ and believer” and was thus able to make the nominalist position attractive to a wide audience.  And it was Locke who had succeeded in convincing many of the British moralists to give up the idea of goodness as natural rather than as socially constructed.

It was Mr. Locke that struck the home blow: for Mr. Hobbes’s character and base slavish principles in government took off the poison of his philosophy. ’Twas Mr. Locke that struck at all fundamentals, threw all order and virtue out of the world, and made the very ideas of these (which are the same as those of God) “unnatural,” and without foundation in our minds. (Rand 1900, p. 403)

Hence it was against Locke that Shaftesbury thought morality most needed to be defended.

What bothered Shaftesbury and many of his contemporaries about Hobbes and Locke were the empiricists’ apparent rejection of the “reality” of virtue. In his dialogue The Moralists, Shaftesbury has the character Philocles distinguish two ways modern religious philosophers defended the link between religion and morality: “Some of them hold zealously for virtue, and are realists in the point. Others, one may say, are only nominal moralists by making virtue nothing in itself, a creature of will only or a mere name of fashion” (The Moralists, II.2, p. 262, my emphasis).  Later in the dialogue, the character Theocles says that the author of “a certain fair Inquiry” (i.e., Shaftesbury’s own Inquiry Concerning Virtue and Merit) argued against a specifically religious basis for ethics:

For being, in respect of virtue, what you lately called a realist, he endeavours to show that it is really something in itself and in the nature of things, not arbitrary or factitious (if I may so speak), not constituted from without or dependent on custom, fancy or will, not even on the supreme will itself, which can no way govern it but, being necessarily good, is governed by it and ever uniform with it. (The Moralists, II.3, p. 266-7)

Here a “realist” about virtue is someone who holds the view that morality is “in the nature of things.”  On this scheme, then, moral realism is opposed not only to relativism (the view that morality is constituted by “custom”) and subjectivism (the view that morality is constituted by an individual’s “fancy”) but also to voluntarism (the view that morality is constituted by the “will” of a sovereign, whether Locke’s God or Hobbes’s Leviathan).  So, according to Shaftesbury, to give morality a subjective basis in individual self-interest (even if one then attempted to construct on this subjective basis a set of objective and universal moral laws), rather than an objective basis in some intrinsic feature of character or action, is to deny the reality of moral distinctions, a position synonymous in the early modern mind with moral skepticism. Conversely, to call morality “real” was to commit oneself to what Shaftesbury’s predecessor Ralph Cudworth called “eternal and immutable” principles of morality.

b. Moral Beauty

 

Throughout the Characteristics, Shaftesbury argues that moral beauty is a “beauty of the sentiments, the grace of actions, the turn of characters, and the proportions of a human mind” (Sensus Communis IV.ii, p. 62).  In general, for Shaftesbury, beauty is a matter of harmonious proportion or “numbers.” The “beauties of the human soul,” then, are “the harmony and numbers of an inward kind” (Sensus Communis IV.ii, p. 63).  They are an “inward anatomy” of soul which, like the outward anatomy of the body, must be brought into the “order or symmetry” that is constitutive of beauty and health (Inquiry 2.I.ii, p. 194).

For Shaftesbury, the concept of moral beauty is not merely a metaphorical comparison between ethics and aesthetics.  Rather beauty and goodness are “one and the same” (The Moralists III.ii, p. 320) such that moral or mental beauty turns out to be more fundamental than physical beauty.  Beauty, Shaftesbury argues, is primarily a property of souls or minds, not of bodies at all: “the beautifying not the beautified, is the really beautiful” (Moralists III.ii, p. 322).   When we judge a body to be beautiful we are really judging the act of designing and creating the body to be beautiful.  Shaftesbury argues for this conclusion by pointing out that when we say a statue is beautiful, we aren’t admiring the “matter” (the marble or bronze or whatever) but the “art and design” which Shaftesbury calls “the form or forming power.”

Yet terms like “design” and “form” can be either nouns or verbs.  That is, we can speak either of the form of an object or the act of forming the object.  Shaftesbury concludes, “Here therefore is double beauty. For here is both the form, the effect of mind, and mind itself.” (Moralists III.ii, p. 323)  He calls the passive objects “dead forms … which bear a fashion and are formed, whether by man or nature, but have no forming power, no action or intelligence,” and he calls the active subjects variously “living forms,” “forming forms,” or “the forms which form, that is, which have intelligence, action and operation.”

Thus Shaftesbury distinguishes two distinct “degrees or orders of beauty” before going on to argue for a third order of beauty “which forms not only such as we call mere forms but even the forms which form” (Moralists III.ii, p. 323-4). Hence we have these three orders of beauty:  first the dead forms, second the forming forms, and third the “supreme and sovereign beauty.” If the first order of beauty is the form of the object in the sense of the object’s design, then the second order of beauty is the active mental subject capable of creating this sort of intelligent design. In other words, the second order of beauty is the human mind itself which, through its intelligent creativity, imposes ordered design on the matter.  But, while the mind is a forming power which gives form to the body, at the same time the mind has its own form which is given to it by its participation in the divine Mind, “the principle, source, and fountain of all beauty” (Moralists III.ii, p. 324). Thus the three forms or orders of beauty seem to be natural beauty, moral beauty, and the beauty of God.

Following ancient Stoicism, Shaftesbury thinks of the world as a unified organism infused by the immanent living “soul” or “mind” of God without which even the natural world would be dead and hence could not be beautiful.  God (“the universal and sovereign Genius”) is a “uniting principle” which makes individual parts of nature into a system, a living organism directed to a teleological end (Moralists III.i, p. 301). Nature, then, is not simply a “mere body, a mass of modified matter,” but is a rationally structured “whole” which constitutes a “self” or mind whose body is the world (Moralists III.i, p. 302-3).

The teleological element of this view is emphasized when Shaftesbury describes his vision of an ever-widening system of interconnectedness.  Shaftesbury starts with the concept of an ordered system:  “Whatever things have order, the same have unity of design and concur in one, are parts constituent of one whole or are, in themselves, entire systems” (The Moralists II.iv, p. 274).  In other words, the concept of order is teleological: a system has order insofar as its parts are aimed at a single end.  Organisms and artifacts are examples of this sort of system: “Such is a tree with all its branches, an animal with all its members, and edifice with all its exterior and interior ornaments” (ibid.).  Just as the parts of a human artifact, such as a piece of architecture, are designed so as to form a unified whole, the parts of a plant or animal are also interconnected in such a way as to form a complete system.   Thus something forms a system if its parts are not “independent but all apparently united … according to one simple, consistent and uniform design” (ibid.).  For example, this sort of “mutual dependency of things” can be seen “in any dissected animal, plant or flower where [even] he who is no anatomist nor versed in natural history sees that the many parts have a relation to the whole, for thus much even a slight view affords” (The Moralists II.iv, p. 275).

But individual organisms are only relatively self-sufficient.  Their parts are internally united, but at the same time organisms are externally united to other organisms.  As Shaftesbury puts it in the Inquiry:

If therefore, in the structure of this or any other animal, there be anything which points beyond himself and by which he is plainly discovered to have relation to some other being or nature besides his own, then will this animal undoubtedly be esteemed a part of some other system. (Inquiry I.ii.2, p. 168).

So, for example, human organisms, especially as infants, are “helpless, weak, and infirm” and are thus inherently (“purposely, and not by accident”) “rational and sociable” such that humanity “can no otherwise increase or subsist than in that social intercourse and community which is his natural state” (The Moralists II.iv, p. 283). Likewise the human species is dependent on other species of plants and animals for their survival just as, for example, “to the existence of the spider that of the fly is absolutely necessary,” showing that each individual species is “in general, a part only of some other system,” namely “the system of all animals” (Inquiry I.ii.2, p. 168).

Thus human organisms form a system called a community.  Likewise, all human communities form the human species (“the system of his kind”), which fits into a certain ecosystem of the planet Earth, which has its ordered place in the universe as a whole.  Thus Shaftesbury concludes, “all things in this world are united” (The Moralists II.iv, p. 274).

For Shaftesbury, this cosmic order has moral implications.  If man is an ordered system of parts, then “there must be somewhere a last or ultimate end in man” (Regimen, p. 48).  Because human beings have instinctive “dispositions of mind such as plainly refer to a species and society, and to the enjoyment of converse, mutual alliance, and friendship, then is the end of man society” (Regimen, p. 48-9; cf. the similar argument at Inquiry II.i, p. 167).

The virtues (things such as “integrity, justice, faith” etc.) are those character traits which allow us to live in society with other humans thereby fulfilling our “end.”  Therefore the virtues are “part of a man, as he is a man” (Regimen, p. 50) – they allow us to be fully human and to “live according to nature” (Regimen p. 52).

In this way, the natural order grounds the normativity of individual moral beauty.  Following classical Platonic and Stoic sources Shaftesbury holds that everything else is beautiful to the degree that it works in harmony with the supreme beauty of providential design (The Moralists II.iii, p. 277).  Thus there is a “nature upon which the world depends, and … every genius else must be subordinate to that one good genius” (Moralists III.i, p. 300).

While the supreme beauty can serve as a standard to ground the particular choices of particular minds, it is not a subjectively chosen standard.  It is the objective ordering of the universe. For Shaftesbury beauty in general is a proper ordering between the parts of something according to the universal natural rules of harmony and proportion.  Therefore a viewer’s subjective failure to judge the proper objective value of this ordering does not diminish its value.  This value is a “symmetry and proportion founded still in nature” (Soliloquy III.iii, p. 157-8).  And when “we are reconciled to the goodly order of the universe” by developing beautiful souls “we harmonize with nature and live in friendship with God and man” (Moralists III.iii, p. 334). It is this harmonious relationship to the natural order that Shaftesbury calls moral beauty.

In summary, beauty, whether of a body or a soul, is grounded in an objective standard of order – namely the mind of God as expressed in the natural order.  At the highest level this order is the “supreme beauty” of God which guides God’s own creation of the natural order and which we in turn imitate when we bring order to our bodies, souls, or artworks.

c. Moral Sense

 

In his essay Sensus Communis, Shaftesbury argues for an understanding of “common sense” as a sense of the common good (Sensus Communis III.1-2, 48-53). Shaftesbury finds a predecessor in the Roman tradition which followed Marcus Aurelius’s coining of the term koinonoemosune to describe the same sort of sense of the common good (Sensus Communis III.1, 48n19).  This notion of the common good recalls the distinction between one’s “private good” and one’s “real good” which Shaftesbury draws in his essay An Inquiry Concerning Virtue or Merit. The private good or “self-interest” is the “end” or “interest” which is “right” for an individual of one’s species and toward which the natural affections point when they are not “ill.”  And the real good or “virtue” is the end in which one’s private good harmonizes with the common good of one’s species as a whole (Inquiry I.II.1, p. 167).  Note that pursuing one’s private good is not necessarily selfish.  In fact, for Shaftesbury, pursuing one’s private good is necessary, natural, and therefore good (insofar as it does not conflict with the public good).  “Selfishness” is not just any regard for one’s private good, but an “immoderate” one which is “inconsistent with the interest of the species or public” (Inquiry I.II.2, p. 170).

Shaftesbury emphasizes the importance of one’s relation to society when he says that a creature is “nowise” good (that is, neither “privately” nor “really” good) if it is naturally part of a “system” but is either detached from the system or harms that system (Inquiry I.II.1, p. 168).  Recall, as discussed above, that a human’s most immediate “system” is society. In this way the sensus communis becomes a necessary component in Shaftesbury’s ethics.  On Shaftesbury’s view, for any action to be considered good, the agent must be moved to action by an affection for the good of the system: one can only be “supposed good when the good or ill of the system to which he has relation is the immediate object of some passion or affection moving him” (Inquiry I.II.1, p. 169).  According to Shaftesbury, then, we could not have an affection toward the common good if we didn’t somehow represent the common good to ourselves.  And it is the sensus communis which allows us to do that. Shaftesbury is clear that it is not enough that our actions be in fact aimed at the common good though still inwardly motivated by self-interest: “as soon as he has come to have any affection towards what is morally good and can like or affect such good for its own sake, as good and amiable in itself, then is he in some degree good and virtuous, and not till then” (Inquiry I.iii.3, p. 188).  To be virtuous, an action must be aimed at the common good because we recognize that it is the common good and have an affection toward it as such.  Thus a truly virtuous and good creature is “one as by the natural temper or bent of his affections is carried primarily and immediately, and not secondarily and accidentally, to good and against ill” (Inquiry I.ii.2, 171).  Shaftesbury thinks this affection toward the good of one’s species is natural and common to every member of the species.  Thus a virtuous action “ought by right” to have as its “real motive” the natural affection for one’s species.

Being motivated by an affection toward the common good is, however, only a necessary, not a sufficient, condition for being virtuous. While anything can be good under Shaftesbury’s definition, only a human being can be virtuous.  This is because virtue requires a “reflected sense” (that is, the ability to reflect on what is good and right) which requires a high degree of reason.  Shaftesbury says:

But to proceed from what is esteemed mere goodness and lies within the reach and capacity of all sensible creatures, to that which is called virtue or merit and is allowed to man only: In a creature capable of forming general notions of things, not only the outward beings which offer themselves to the sense are the objects of affection, but the very actions themselves and the affections of pity, kindness, gratitude and their contraries, being brought into the mind by reflection, become objects. So that, by means of this reflected sense, there arises another kind of affection towards those very affections themselves, which have been already felt and have now become the subject of a new liking or dislike. (Inquiry I.ii.3, p. 172)

The view seems to be that the sensus communis shows us what is good for our species and we naturally “approve” of that good and have an “affection” towards it, thereby motivating us to act.  Those actions are individually good which are motivated by an affection toward the good of the whole. Then our “reflected sense” gives us a “new affection” towards the motives which result in good actions.   On the next page Shaftesbury refers to this “reflected sense” as a “sense of right and wrong” which he defines as “a sentiment or judgment of what is done through just, equal and good affection or the contrary” (Inquiry I.ii.3, p. 173).  The notion of the moral sentiments, as Shaftesbury employs it, presupposes the existence of the sensus communis.  A properly functioning person is already motivated by the right affections as represented by the sensus communis, and then our moral sentiment (our “sense of right and wrong”) confirms that those are in fact the right motivations by giving us a higher-order “feeling,” “affection,” or “sentiment” of which actions are done by the right affections. In other words, moral sentiment is a second-order affection toward the “right” first-order affections. Note that, while Shaftesbury also talks as if not only first-order affections but also actions, tempers, etc., can be the objects of the moral affection, it must be remembered that for Shaftesbury no action or temper is truly good or virtuous unless it is motivated by affection for the common good.  In sum, after the sensus communis determines the moral action and motivates us to pursue it as good, then moral sentiment approves of what the common sense tells us via a feeling of affection and thereby motivates us to pursue it as virtuous.

It is important to notice here that, while Shaftesbury refers to our moral sentiments as our “conscience” and even as our “sense of right and wrong,” he is not trying to establish a “moral sense” as a distinct mental “faculty” for receiving moral ideas.  As D.D. Raphael notes, “the casual application of the word ‘sense’ to the moral faculty is hardly more significant in Shaftesbury than it is in Samuel Clarke, who was a severe rationalist” (The Moral Sense, p. 16). We talk of a “sense of purpose,” a “sense of urgency,” a “sense of adventure,” a “sense of humor,” etc.   Sometimes we even speak of morally relevant “senses” such as a “sense of decency,” a “sense of shame,” a “sense of duty,” etc.  But we don’t mean to suggest that any of these “senses” ought to be thought of as analogous to the physical senses or that they are special mental faculties metaphysically distinct from our ordinary mental faculties.  Likewise, Shaftesbury’s use of the phrase “sense of right and wrong” is simply a figure of speech.  He thought we used our ordinary faculties of thinking, feeling, and desiring to make moral judgments.

Shaftesbury sometimes seems to suggest that moral judgment is instinctive, yet this is not his considered view. For example, in the dialogue titled The Moralists, A Philosophical Rhapsody, Shaftesbury seems to advance the claim that our sense of beauty is innate: “Nothing surely is more strongly imprinted on our minds or more closely interwoven with our souls than the idea or sense of order and proportion” (The Moralists II.4, p. 273-4).   In this context, Shaftesbury is specifically talking about natural beauty, but, as we have seen above, moral beauty is a function of one’s relationship to the natural order.  Shaftesbury notes that we can easily tell the difference between a structure created by an architect and a mere “heap of sand and stones” and claims that “this difference is immediately perceived by a plain internal sensation.”  The source of this sensation seems to be the common sense.  In the Sensus Communis essay, Shaftesbury argues that true beauty in art requires the artist to submit the “particulars” of the artwork “to the general design” and make “all things subservient to that which is principal” (Sensus Communis IV.3, p. 66), adding that “common sense (according to just philosophy) judges of those works which want the justness of a whole and show their author, however curious and exact in particulars, to be in the main a very bungler” (Sensus Communis IV.3, p. 67). Hence it is the common sense (or “sense of beauty” as he calls it in The Moralists) which discerns “order and proportion” so that taste can approve or disapprove of them.

Now, Shaftesbury seems to think this ability of common sense to detect beauty is innate.  When we perceive an object or action we immediately (“straight”) distinguish the beautiful from the ugly (The Moralists III.2, p. 326),  Similarly, he says in the Inquiry that the mind “cannot be without .. nor can it withhold” judgments of moral taste, and he compares the functioning of the moral faculty to the functioning of a bodily organ:  “this affection of a creature towards the good of the species or common nature is as proper and natural to him as it is to any organ, part or member of an animal body, or mere vegetable, to work in its known course and regular way of growth” (Inquiry II.I.1, 192). But these statements are misleading in isolation.

By this point in The Moralists, Shaftesbury has already observed that taste requires cultivation: “How long before a true taste is gained! How many things shocking, how many offensive at first, which afterwards are known and acknowledge the highest beauties! For it is not instantly we acquire the sense by which these beauties are discoverable” (The Moralists III. 2, p. 320).  Shaftesbury also says (following the Cambridge Platonists) that the affection for and knowledge of the good can be lost by vice: “contrary habit and custom (a second nature) is able to displace” even the most natural instincts (Inquiry I.III.1, 179).  Likewise in the Miscellaneous Reflections, Shaftesbury writes that “a legitimate and just taste can neither be begotten, made, conceived or produced without the antecedent labour and pains of criticism” (Miscellany III.2, p. 408).

If anything about the sensus communis or moral taste is innate, it is the potential to develop good taste.  Everyone is born with these faculties.  But everyone must be educated in how to use them.  Moral taste is a natural faculty but it is also a cultivated faculty.  Elsewhere Shafesbury argues that though “good rustics who have been bred remote from the formed societies of men” might have been “so happily formed by nature herself that, with the greatest simplicity or rudeness of education, they have still something of a natural grace and comliness in their action,” it is nevertheless “undeniable, however, that the perfection of grace and comliness of action and behavior can be found only among the people of a liberal education” since such perfection requires knowledge of “those particular rules of art which philosophy alone exhibits” (Soliloquy I.3 p. 85-7). So virtue must be cultivated like good taste in art or wine.  Only then can one act “from his nature, in a manner necessarily and without reflection” (Sensus Communis IV.1, p. 60).

In summary, our moral sense is a not a special instinctive faculty, but an innate potential to approve of certain actions that must be activated by good education in society. Once we have been trained in the art of sociable conversation, our moral sense will inevitably approve of those actions which are motivated by the teleological good of society as a whole.

d. Personal Identity

 

In his essay Soliloquy Shaftesbury describes moral reasoning through the mechanism of conscience as requiring the agent to partition herself into multiple voices (or “selves”) in order to engage in fruitful internal discussion on the model of a Socratic dialogue.  Soliloquy, he says, is a kind of “self-dissection” in which an individual “becomes two distinct persons” in order to “be his own subject” of advice and edification (Soliloquy I.i, p. 72).  He calls “this method of soliloquy” an “art” or “regimen” which is “practiced” by “all great wits,” especially by “the poet and philosopher” and even “the orator” (Soliloquy I.i, p. 73).

Here soliloquy means something like the examination of conscience.  The point of dividing oneself into two dialogue partners is to achieve the kind of consensus that results from rational discussion (Soliloquy I.ii, p. 77).  We reflect within ourselves and notice that we are of two minds about something (we “discover a certain duplicity of soul”).  Then we discuss the issue with ourselves until we bring the two views into dialectical agreement.  In this way we achieve integrity and self-unity within our mind (we “make us agree with ourselves and be of a piece within”).  In aesthetic terms, we are trying to bring our soul into harmony with itself.

For Shaftesbury, the purpose of soliloquy is not only self-creation, but also preparation for public discourse.  It is significant that the full title of the essay is Soliloquy, or Advice to an Author, an “author” being one who “publishes” (makes public) his “meditations” (private thoughts). Shaftesbury’s advice is that we test our thoughts by the method of soliloquy before we presume to share them: “so that, unless the party has been used to play the critic thoroughly upon himself, he will hardly be found proof against the criticisms of others” (Soliloquy I.i, p. 76).  What is significant here is that public edification is the assumed goal of philosophical thinking.

However, prior to public discourse, we must endeavor to construct a coherent self through the method of soliloquy.  “Our thoughts,” says Shaftesbury, “have generally such an obscure implicit language that it is the hardest thing in the world to make them speak out distinctly. For this reason, the right method is to give them voice and accent” (Soliloquy I.ii, p. 78).  By the method of giving voice to our thoughts here, Shaftesbury has in mind the “meditations, occasional reflections, solitary thoughts or other such exercises as come under the notion of this self-discoursing practice” (Soliloquy I.i, p. 74) that make up his own private notebooks which he labeled Askemata (exercises or “regimen”). In his Regimen Shaftesbury applies the practice of dialectical reasoning to his own inner life via the method of soliloquy.

For Shaftesbury, the result of achieving harmony of soul is the construction of a unified “self.”  “It is the known province of philosophy,” Shaftesbury writes, “to teach us ourselves, keep us the self-same person and to regulate our governing fancies, passions and humors as to make us comprehensible to ourselves” (Soliloquy III.i, p. 127).   With regard to the question of the self, Shaftesbury ridicules that “which stands for philosophy in some famous schools,” saying it cannot generate “manners or understanding” because “It pretends indeed some relation to manners as being definitive of the natures, essences and properties of spirits” (Soliloquy III.i, p. 128).   In other words, scholastic philosophy confuses an accidental property (a “relationship”) of the self with an essential one.  It does this by “defining ‘material’ and ‘immaterial substances’ and distinguishing their properties and modes” as if this were “the right manner of proceeding in the discovery of our own natures” (Soliloquy III.i, p. 129-130).  This focus on the metaphysics of “modes and substances” is, however, “beside the mark and reaches nothing we can truly call our interest or concern.”  It does not tell us who we really are (Soliloquy III.i, p. 130).  The scholastics make the same mistake as a person who attempts to understand the nature of a watch by asking “of what metal or what matter each part was composed” rather than “what the real use was of such an instrument or by what movements its end was best attained and its perfection acquired” (Soliloquy III.i, p. 131).  Likewise, the philosopher engaged in metaphysical speculation has “considered not the real operation or energy of his subject, nor contemplated the man as real man and as a human agent, but as a watch or common machine” (ibid).

Shaftesbury argues that true self-knowledge comes from the study of the passions. This is because I am my passions:  “These passions, according as they have the ascendancy in me and differ in proportion with one another, affect my character and make me different with respect to myself and others” (Soliloquy III.i, p. 132).  My character or self is a function of my passions, and that character is the real me, not the material (or immaterial) substance that my passions are made out of (or inhere in).  Were my “passions, affections, and opinions” to change radically enough, then I would become a different self, even if, contra Locke, I retained in my “memory the faint marks or tokens of former transactions” (Soliloquy III.i, p. 127).  This is because my passions are aimed at what I take to be my happiness (Soliloquy III.i, p 132).  And, as the teleological end of my life, my happiness is what makes me who I am.

True philosophy, then, is the kind of self-reflection which makes known to an agent what her passions are aimed at so that she can bring them into harmony with one another and thereby construct a coherent self.  Until I can understand my own passions (“ascertain my ideas”) and control them (“keep my opinion, liking and esteem of things the same” from moment to moment), I will remain “the same mystery to myself as ever” (Soliloquy III.i, p. 134), regardless of the truth of my metaphysical arguments.

The problem of an incoherent self arises primarily, in Shaftesbury’s view, when we define our lives in pursuit of pleasure.  Pleasure is not stable, since pleasure comes from a variety of sources and the pursuit of pleasure demands a constant search for new sources, for “when we follow pleasure merely, we are disgusted and change from one sort to another, condemning that at one time which at another we earnestly approve, and never judging equally of happiness while we follow passion and mere humor” (Soliloquy III.ii, p. 138).  What we need is a “rule of good” which can “control my fancy and fix it, if possible, on something which may hold good” (ibid).  This is what Shaftesbury finds in the “honest pleasure” of moral beauty, the honestum which is both attractive and right.

Only the “pleasure of society” is “constant” enough to ground a coherent self, enabling me to “bring my other pleasures to correspond and be friends with it” (Soliloquy III.ii, p. 139).  Thus, “when I employ my affection in friendly and social actions, I find I can sincerely enjoy myself” without risking the kind of self-dissolution which comes from pursuit of self-interested pleasures (Soliloquy III.ii, p. 138).  Shaftesbury argues the pleasure I receive from moral affection is derived via sympathy (“by communication, a receiving it, as it were, by reflection”) from your enjoyment of my actions (Inquiry II.ii.1, p. 204).  In the Inquiry Shaftesbury argues that human beings have natural affections for the good of society, and we cannot flourish as human beings unless we live in society and develop the virtues which allow us to live according to our nature.  Thus we are not fully human if we are cut off from a sympathetic relationship with a community. What Shaftesbury adds in Soliloquy is to draw out the implication that we can only construct a self-identity on the basis of sympathetic pleasure.  In this way we construct our identities out of our social interactions with others.

Thus Shaftesbury’s account of self-construction is essentially intersubjective and dialectical.  It involves at least soliloquy if not actual interpersonal dialogue.  Since it is often difficult to know what our true passions and opinions are, we need talk therapy (a “vocal looking-glass”) in the form of dialogue or soliloquy: “Our thoughts have generally such an obscure implicit language that it is the hardest thing in the world to make them speak out distinctly. For this reason, the right method is to give them voice and accent” (Soliloquy I.ii, p. 78).

Shaftesbury’s own dialogue The Moralists seems to be an example of this process since both characters in the dialogue present Shaftesburian viewpoints and help each other come to the truth (see Prince, p. 69). Indeed the assembled text of the Characteristics itself expresses this view through its literary structure.   The Moralists is explicitly a dialogue.  The treatise Soliloquy, or Advice to an Author, as its title implies, is an internal dialogue in which Shaftesbury is addressing himself.  The Miscellaneous Reflections on the Preceding Treatises and Other Critical Subjects, Shaftesbury’s self-commentary, written in the third-person, are in effect an extended soliloquy that can be read as a dialogue between Shaftesbury the literary critic and Shaftesbury the philosopher.  In this light we can see that even the (seemingly) ordinary philosophical treatise An Inquiry Concerning Virtue or Merit takes on a dialogical character.  Not only does the author of the Inquiry show up as a character in The Moralists (see II.3, p. 265), the Inquiry itself, when read in the overall context of the other more obviously dialogical works of the Characteristics and thus located amidst a series of overheard conversations, begins to read like an overheard scholastic-style philosophy lecture.

None of the individual treatises of the Characteristics is written unambiguously in Shaftesbury’s own voice. The character of “Shaftesbury,” the author of Characteristics of Men, Manners, Opinions, Times, emerges only as the harmonious unity of the other voices. In this way, the structure of Shaftesbury’s work as a whole is an embodiment of intersubjective reasoning.  The truth of Shaftesbury’s philosophy is the product of a (metaphorical) community of persons reasoning together through (simulated) interpersonal dialogue.  He is trying to achieve what he says Plato achieved in his Socratic dialogues: “they exhibited [real characters and manners] alive and set the countenances and complexions of men plainly in view. And by this means they not only taught us to know others, but, what was principal and of highest virtue in them, they taught us to know ourselves” (Soliloquy I.iii, p. 87).

5. References and Further Reading

a. Primary Texts

  • Shaftesbury, Anthony Ashley Cooper, Third Earl of. Characteristics of Men, Manners, Opinions, Times, ed. Lawrence E. Klein (Cambridge: Cambridge University Press, 1999) [Cited by treatise title, part, section, and page number.]
  • Shaftesbury, Anthony Ashley Cooper, Third Earl of. The Life, Unpublished Letters, and Philosophical Regimen of Anthony, Earl of Shaftesbury, ed. Benjamin Rand (Macmillan, 1900)
  • Shaftesbury, Anthony Ashley Cooper, Third Earl of. Second Characters or the Language of Forms, ed. Benjamin Rand (Cambridge University Press, 1914)

b. Secondary Texts

  • Secondary Texts
  • Aldridge, A.O. “Shaftesbury and the Deist Manifesto” Transactions of the American Philosophical Society, new series 41:2 (June, 1951)
  • Bernstein, John Andrew. Shaftesbury, Rousseau and Kant: An Introduction to the Conflict between Aesthetic and Moral Values in Modern Thought (Rutherford: Fairlegh Dickinson, 1980)
  • Cassirer, Ernst. The Platonic Renaissance in England, trans. James P. Pettegrove. (Austin: University of Texas Press, 1953)
  • Darwall, Stephen. The British Moralists and the Internal ‘Ought’ (Cambridge University Press, 1995)
  • Den Uyl, Douglas J. “Shaftesbury and the Modern Problem of Virtue” Social Philosophy and Policy 15:1 (Winter 1998)
  • Gill, Michael. The British Moralists on Human Nature and the Birth of Secular Ethics (Cambridge, 2006)
  • Grean, Stanley. Shaftesbury’s Philosophy of Religion and Ethics (Ohio University Press, 1967)
  • Klein, Lawrence E. Shaftesbury and the Culture of Politeness: Moral Discourse and Cultural Politics in Early Eighteenth-Century England (Cambridge: Cambridge University Press, 1994)
  • Lawrence E. Klein, ‘Cooper, Anthony Ashley, third earl of Shaftesbury (1671–1713)’, Oxford Dictionary of National Biography, Oxford University Press, 2004; online edn, Jan 2008, http://www.oxforddnb.com/view/article/6209
  • Prince, Michael. Philosophical Dialogue in the British Enlightenment (Cambridge, 1996)
  • Raphael, D.D. The Moral Sense (Oxford University Press, 1947)
  • Rivers, Isabel. Reason, Grace, and Sentiment: A Study of the Language of Religion and Ethics in England, 1660-1780, Vol. II: Shaftesbury to Hume. (Cambridge University Press, 2000)
  • Schneewind, J.B. The Invention of Autonomy (Cambridge, 1998)
  • Voitle, Robert. The Third Earl of Shaftesbury, 1671-1713 (Louisiana State University Press, 1984)
  • Winkler, Kenneth P. “‘All is Revolution in Us’: Personal Identity in Shaftesbury and Hume” Hume Studies 26:1 (April, 2000)

Author Information

John McAteer
Email: jmcateer@hbu.edu
Houston Baptist University
U. S. A.

Personalism

Personalism is any philosophy that considers personality the supreme value and the key to the measuring of reality. Its American form took root in the late nineteenth century, flowered in the twentieth century, and continues its life in the twenty-first century. Yet, those roots can be traced to Europe and back through Western philosophy to the Mediterranean basin. However, Personalism did not originate exclusively in America, Europe, the Mediterranean basin, or in the West. Personalism thrived in India through the six orthodox schools of Indian philosophy scattered along the Indus River Valley of the Indian subcontinent, and developed parallel to Personalism in the West.

Personalists claim that the person is the key in the search for self-knowledge, for correct insight into reality, and for the place of persons in it. Other than giving centrality to the person, Personalism has no other set of principles or unified doctrine. Although many prominent personalists have been theists, this doctrine is not a requirement.  There is also not a common set of methods or definitions, including the definition of person. Respecting that caution, personalists defend the primacy and importance of persons against any attempt to reduce persons either to the Impersonalism of an infrastructure, such as scientific naturalism, or suprastructure, such as metaphysical absolutism. Personalists focus on the concerns of persons living in a personal world. Between the Scylla and Charybdis of either type of Impersonalism, personalists trace the origin of the concept of person and the development of metaphysical personalism from the ancient world to its flowering in Europe and America. Open to the richness of their philosophical tradition, personalists trace their origin and development both on the Indian subcontinent and in the West.

Table of Contents

  1. South and East Asian Personalism
    1. India
    2. China and Japan
  2. Historical Roots of Personalism in the Mediterranean Basin
  3. European Development and Flowering
  4. North American Personalism
    1. Branches of Personalism
    2. Major figures
      1. Harvard University
      2. Boston University
      3. California
    3. African Personalism
    4. American Indian Personalism
  5. Latin American Personalism
  6. Current Trends
  7. References and Further Reading

1. South and East Asian Personalism

a. India

The religious practice and thought spawned by the Vedic (from Veda, “knowledge,” “wisdom”) texts from at least 1500 BCE, form the back drop of Personalism. The term “Hindu” is derived from the river Sindhu (the Indus) where various schools of practice and thought had formed.

Broadly conceived, Personalism in India originates within the main goal of Hindu philosophical inquiry, which is the freedom from misery. Each system of Hindu philosophy seeks to help persons to that end by giving them insight into the nature of ultimate reality and their place in it. These systems advocate self-knowledge, atmavidya, without which the desired freedom is impossible. The nature and destiny of individual persons is the common theme of the six orthodox Hindu philosophical systems: Nyaya, Vaisesika, Sankhya, Yoga, Purva-mimamsa, and Vedanta. The Vaisesika school is frequently lumped together with the Nyaya (“Logic”) school, and Yoga is classically grouped with Sankhya. Each system promises self-knowledge, atmavidya that bonds the systems into a single philosophical tradition.

Seeking freedom from misery through self-knowledge, Hindu personalist schools of thought center on four questions. What is the self? How is it related to the material world? What is the relation of the self to ultimate reality? And, what is the path from pain and misery to liberation?

First, according to each orthodox school, persons are marked by various characteristics, including a permanent and eternal soul (atman) that exists behind the veil of empirical consciousness, and that possesses a physical body (jiva) that exists as part of a changing material world. While it is agreed that the Atman is eternal, unchanging, independent essence, the six orthodox schools differ whether the transcendent I is conscious or unconscious, active or passive. Each school also recognizes that by being connected to the material world persons possess other characteristics, including agency, will, thought, desire, free will, intention, and identity.

Second, personalists focus on the indissoluble reality of the individual soul and on its relation to the empirical consciousness. That soul, the basic reality in humans and all living things, is a transcendent “I,” (atman) and is veiled by a person’s empirical consciousness. It cannot be the object of experience. The empirical consciousness, the experience of objects sensed or being sensed, comes to be interpreted as alien, attributive, essential, adventitious, permanent, or temporary. Schools differ on how to correlate the transcendent “I” and the empirical consciousness. Hindu Personalism views the empirical consciousness as either attributive or alien, monist or dualist.

At one extreme, Sankhya, the oldest school, interprets the dualism of the transcendent “I” and the empirical consciousness as a dualism of spiritual consciousness (purusa) and material nature (prakrti). Purusa is sentient and passive; prakrti is insentient and active. As sentient, purusa experiences products of prakrti and desires emancipation. As passive, purusa can be understood as unaffected and secluded. In Samkhya, the material world is not an illusion; it is real and stands over against the spiritual person. This dualism is motivated by final beatitude. However, achieving it requires, in theistic versions of Sankhya, moral support, compassionate companionship and guidance from a Supreme Being who possesses perfect knowledge and is capable of perfect action.  The Yoga thinker Pantanjali (ca. 300 CE) introduced God into atheistic Samkhyan dualism to satisfy the moral demand of the spiritual aspirant. With that addition, the Samkhyan school becomes a theistic Personalism.

At the other extreme, monism is represented as Advaita Vedanta (non-dualistic Vedanta). Sankara (c. 788-820), the leading expounder of Vedanta (literally, “the end of the Veda”) philosophy, states its principle insight that the self is One with Ultimate Reality (Brahman).  Thus, the transcendent I includes the empirical many and the Divine One.  The triune of Divine One (Brahman), the transcendent I (atman), and the empirical many is a grand unity. Sankara was the most influential Vedanta thinker who espoused the monistic framework as defined in the Upanishadstat vam asi )”Thou art that” or (“Atman is Brahman”).

Self-knowledge is necessary for the soul to enter beatitude, to be one with Brahman. During a person’s life cycle, the caste system prescribed in the Vedas, eliminated the possibility of social movement, from lower to higher caste. Self-knowledge focused on one’s place in the caste system and its accompanying duties. The soul through its life cycle within a caste system sought virtue, allowing reincarnation in a higher caste. In this way, the virtuous soul achieves release from pain and suffering to Nirvana.

Within the unorthodox systems, such as Ajivikas (fatalists) Charvakas (materialists), Jains (who accept the existence of eternal selves but reject the existence of a supreme God) and Buddhism, Personalism does not develop. Ajivikas adopted materialism on the ground that sense-perception was the only valid means of knowledge. The Ajivika materialists questioned the validity of theological and metaphysical theories that do not come within the ambit of sense-experience. This explains why they rejected the religious version of atmavada, the belief in a metaphysical self.

Buddhism was founded by Siddhārtha Gautama (c. 563 BCE to 483 BCE), rejected the doctrines of atman or purusa and accepted instead a causal account (paticcasumuppada) of the human personalituy. The person, for Siddhartha, is a causally connected bundle of psychophysical aggregates (namarupa). Kasulis says, “According to Buddhism, therefore, I am not a self-existent being who chooses with what or how I wish to relate to external circumstances.” (Kasulis, 62-63) A Buddhist Personalism, the Pudgalavada arose two centuries after the lifetime of Siddhartha. Smet says, “The pudgalavadins thought that behind the aggregates or groups (skandha) of mental and physical conditions observed by introspection, there must a substrate. Had not the Budda, in a well-known passage, spoken of the ‘bearer of the burden’ – an expression seeming to indicate some sort of ego underlying the aggregates? At the same time the pudgala could by no means be identified with the Atman, because it was merely an integrating function demanded by the aggregates and expressed by them.” (De Smet. 38) Its teaching came too close to the heresy of the eternalism of the Atman and soon died out.

b. China and Japan

Regarding China and Japan, in neither country’s faiths are persons understood as an eternal essence. The Chinese, deeply influenced by Confucianism (551-479 BCE), believed that humans elevated themselves to a position through examinations and service. The Japanese, deeply influenced by Buddhism, understood persons in terms of their relationships with other humans, nature, and the totality of things. They held to a hierarchical dyadic view of persons.

Parallel to the origin and formation of Personalism in India, China, and Japan, Personalism in the West began in the Mediterranean basin, and through Christianity it spread north to the Atlantic Rim, northern Europe and the British Isles, and America.

2. Historical Roots of Personalism in the Mediterranean Basin

Personalists trace the origin of the ontological nature of persons and their supreme importance as key to understanding of Reality to the confluence of Greco-Roman philosophy and Christian experience and theology. Both made significant contributions to the formation of the concept of person.

In its early uses, the individual person had no ontological import. Person is first found in Greek and Roman culture. Its roots lie in the Greek word prosopon that refers to the face consisting of that around and near the eyes (pros + accusative of ops). Soon it designated the masks or faces used in the Greek theatre. Its Latin cognate is persona, probably of Etruscan derivation, phersu.   Persona referred to a mask functioning as loudspeaker (persono, per, “through” + sona, “resound,”  “resound thoroughly”). The mask was worn by actors on stage aiding them by “sounding through” to be heard by an audience. In the Roman theatre persona meant a character and role in tragedy or comedy. In Roman society, the personae of individuals gain their identity, status, and responsibilities from their roles in a hierarchical, honor-shame society. For example, persona came to be used in reference to the king as king, implying a difference between the important social man and the relatively unimportant singular empirical man. By the end of the second century CE persona became a judicial term referring to a Roman citizen as possessor of legal rights, in contrast to a slave who possesses no legal rights, a non-persona

Meanwhile, as Pythagorean, Platonic and Aristotelian philosophical influences continued, individual persons had little philosophical importance. Plato distinguishes between the individual and the universal, and thereby understands the individual through the universal. The individual Socrates participates in the universal, “human being.” To understand the particular “Socrates,” first know the universal, then one can understand and account for the particular. Pythagoras and Plato used the term soma, body, and played on the similarity between soma and sema, body and tomb. Aristotle, against Plato, calls the individual primary substance and the universal conceptual. First substance is that which stands under (hypostasis) a general term referring to whatever stands under something else. But prosopon gained no ontological meaning through that understanding of substance.

Later, persona took on scant ontological meaning among the Stoics and the Neoplatonists. Stoics thought that God forms an ordered universe, a stage on which each human being as rational plays an assigned part. Each prosopon or persona is not only a social role but also the essence of a human being as constituted by God.  Possessing no ontological significance in and of themselves, persons are microcosms of the Macrocosm.

In the eastern Mediterranean among the Jews, the Hebrew word, nephesh is sometimes translated as “person.” However, in ancient Hebrew life and culture no word analogous to prosopon or persona appears. Nephesh is more often translated as soul, life, creature, or self. Nephesh can refer to the animating principle of a physical entity or the existential quality or state of life. Usually referring to a human being as a unified entity, no distinction is made between immaterial and material aspects. Nephesh as a whole is created by God; nephesh is not an attribute of a substance. The form/matter and substance attribute distinctions are foreign to ancient Hebrew thinking. However, beginning with Alexander the Great in the 4th century BCE and continuing through the Roman period, the Eastern Mediterranean was Hellenized. New Testament writers, St. Paul for example, would have known nephesh, prosopon, and persona, likely aware of semantic tensions that later found their way in theological debates within the Christian church.

The different Greek and Hebrew-Christian understandings of person moved into focus in the 4th and 5th centuries CE as the Christian church attempted to work out a satisfactory understanding of the Trinity and the individual personhood of Jesus the Christ. The details of the controversies that arose are beyond the limits here. However, central to the controversy was the understanding of the individual person. During the time of Origen (185-254 CE) under the influence of Plotinus (204-269 CE), personae lacked ontological content. Is an individual person an attribute of being; or is an individual person being who, having been created by the free and independent God and who bears God’s image, is free and dependent? If the former, the Greek metaphysical word, ousia, expresses the Trinity, as in una substantia (God) and tres personae, where Father, Son, and the Holy Spirit are understood as three independent Gods. If the latter, person is not an attribute of ousia, but upostasis. Earlier both ousia and upostasis meant substance. Eventually, they were used separately, ousia referring to substance and upostasis referring to individual person. This means that persona is no longer a kind of mask worn by an ontological substrate, ousia. The Greek Fathers, particularly the Cappadocians, led by Gregory of Nazianus (c. 329 – 389 or 390) understood that individual persons are ontologically ultimate, the central thesis of Personalism. However, an understanding of the interior life of persons lay beyond their metaphysical interests.

The analysis of the interior life of persons fell to Augustine (354 – 430) who continued the substance view by defining  person as “a rational soul using a mortal and earthly body,” (substantia quaedam rationis particeps, regendo corpora accomodata). Nevertheless, a person is one; “a soul in possession of a body does not constitute two persons but one man.” His contribution to Personalism lies elsewhere. His investigation of the inner experience of persons set a new course in philosophy that would not be developed until the modern period. In doing so he developed a key insight of Personalism, an understanding of reality as Person through an understanding of the interior life of persons. He wrote that knowledge of God moves “from the exterior to the interior and from the inferior to the superior.” Further, he argues that in persons, free will is superior to rationality. Yet Augustine continued the metaphysical principle that the higher can affect the lower, but the lower cannot affect the higher.

In the late Roman era or early Middle Ages, the Roman Catholic Church adopted Boethius’ (ca. 480–524 or 525) definition of person, as the Naturæ rationalis individua substantia (an individual substance of a rational nature). In the meantime, the Greek Orthodox Church continued the doctrines of the Cappadocians, particularly Gregory of Nazianus. Though medieval philosophers, such as Thomas Aquinas modified the definition and some such as Scotus and William of Occam were critical, the substance view of person became firmly entrenched in Western philosophy. During the modern period and the nineteenth and twentieth century personalists continued to debate the substance view of persons.

3. European Development and Flowering

As the grand systems of Christendom in the high Middle Ages cracked under the weight of heavy criticisms from Scotus, Occam, Montaigne, and the new science, a new vision arose in the Renaissance, the emerging self. Though the substance view of persons continued, aided by its institutionalization in the Western church, it was soon challenged. Under Pico della Mirandola, Luther, Descartes, and Locke, Augustine’s careful descriptions and insights into the interior life of persons contributed to emerging selves.

Influenced of pyrrhonistic skepticism, Descartes (1596 – 1650) searched for a new foundation for society, culture, and knowledge, one that measures up to the ideal of certainty as in mathematics. He argues that we can know for certain that we exist and that we can doubt every other knowledge claim, including those of the external world. In this way, inspired by new scientific investigations, he raises problems, particularly the mind-body problem. Rather than the view that God created ex nihilo substances with a rational nature, or persons are substances using a body, Descartes contends that God created two substances, res cogitans and res extensa, mind and body. Central to his view of mind is freedom of the will and rationality.  In freedom of the will we find a characteristic of God that is exactly the same as we find in ourselves. God’s reason is so far beyond our finite minds that we cannot with confidence claim that God is rationality. Descartes’ discussion of the interior life of persons in the early 17th century CE continues Augustine’s insights in the late 4th century CE.  However, Descartes’ dualism of two kinds of created substances significantly differs from Boethius’ view that a person is an individual substance with a rational nature. Though modified, the substance of persons persisted.

The dichotomy of thought (res cogitans) and extension (res extensa) raises formidable problems regarding the relation between them, encouraging some philosophers to emphasize one or the other. For example, Hobbes’(1588 – 1679) materialism on the side of extended things, and Berkeley’s (1685-1753 CE) spiritualism on the side of thinking things. Hobbes’ view is a good example of appealing to impersonal principles to account for persons, a view personalists uniformly reject as a form of Impersonalism. Berkeley’s view moves Personalism into modern Idealism (mentalism), in contrast to classical Idealism (Ideaism) such as Plato’s, and against the Impersonalism of materialism. Under the influence of Locke and against any form of materialism, Berkeley took what he believed a common sense approach. Holding to a substance view of person, he claimed that “to be is to be perceived or perceive,” esse est percipi aut percipere. The key lies in the word “exist.” Descartes had said that things exist either as thought or extension. However, thought Berkeley, whenever we use the word “exist” we must assume that the mind is involved perceiving something. And, when we or no other person perceives an object, that object exists due to its being perceived by a cosmic mind, God. The “absolute existence of unthinking things [matter] are words without meaning.” Neither do eternal ideas, Plato’s forms, exist apart from perceiving minds. They are abstractions only.

Descartes’ position helped create a modern version of the One and Many problem, first spawned by the Pre-Socratics. Descartes held that God can be grasped through ideas in the mind. Spinoza (1632 –1677) responded that God as substance is not dependent on any other than itself. If God could be grasped through a dependent substance, such as thought, God would be dependent on something other than God’s self for God’s existence. Here, two fundamental characteristics of Personalism were threatened, the rational relation of finite persons to God, and persons as free. Since knowledge of the Cosmic Person through a finite person challenges God as Substance, the relation of the Cosmic Person to finite person becomes problematic. Furthermore, persons as free can be held only so long as Cosmic Person and finite persons are in some way independent. Spinoza claimed that substance can be known only through itself, implying that persons are modifications of being, that they are not distinct as required for individual freedom.

In reaction to Descartes’ dualism and Spinoza’s monism, Leibniz’s (1646 –1716) doctrine of monads offers a pluralism. Both Descartes and Spinoza assumed that extension implies actual size and shape. Leibniz wondered why we cannot assume that all things are compounds or aggregates of simple substances. These substances are not the extended atoms of Democritus or Epicurus. Each simple substance is a monad that is unextended and has no size or shape. Each is a metaphysical point that Leibniz calls souls to emphasize their nonmaterial nature. And, each possesses its own principle of action within itself and behaves according its own created purpose, yet they work together according to God’s preestablished harmony. A person’s identity centers on a dominant monad, his soul, whose life is an “unfolding,” set from the beginning. Since the basic nature of persons is thought, development through life means moving from murky confused ideas to true ones, the way things really are. In this way, Leibniz was the first philosopher to reject the substance of persons and to adopt an agency view, even if deterministic. Some philosophers, such as Leroy E. Leomker (1770-1830), consider Leibniz the first modern personalist, where persons are agents, not substances with an attribute.

Immanuel Kant (1724-1804) and Hegel (1770-1830) exerted major influences on the formation and development of Personalism in the West.  Kant’s distinction between the phenomenal and noumenal world reinforced Berkeley’s view of “material” substance and emphasized that the only path to reality is through the practical reason of persons. With his claim that compliance with moral law is the essence of human dignity and personhood, Kant exerted the single most influence on ethical personalists. That influence was transmitted into American philosophy largely through the work of Hermann Lotze (1817-1881).

Within that background, the modern Personalist vision was first stated in the context of a philosophical issue lying at the core of 18th and 19th century German philosophy. The issue was raised in the debate between the followers of Spinoza and those of Leibniz. For Spinozists, Being is one and independent. If Being were more than one, it would not be independent; and, if Being is independent it cannot be many. For Leibnizians, reality is a pluralism of monads. The debate between the two systems sharpened to the debate between monism and pluralism. The problem of the One and the Many resurfaced. How can they be reconciled? What philosophical view can do so? At least two alternatives surfaced at the end of the 18th century and the beginning of the 19th.

The first alternative was Hegel’s. According to a recent interpreter, he sought a “‘unification philosophy’ – the need to unify not only life’s various and conflicting powers, but especially the opposing human craving – for individuality and finitude, on the one hand, and for the absolute and the infinite, on the other.” (Hegel, 48) Hegel sought it by brilliantly blending the organic growth of an Aristotle and the Absolutism of a Spinoza into a dynamic monistic system marked by idealism, pantheism, and rationalism.

In reaction to the rationalism of Absolute Idealism and to individual realism, a poet and philosopher and older contemporary of Hegel, Friedrich Heinrich Jacobi (1743-1819) offered the second alternative, Personalism. He thought that Personalism is “‘that form of idealism which gives equal recognition to both the pluralistic and monistic aspects of experience and which finds in the conscious unity, identity, and free activity of personality the key to the nature of reality and the solution of the ultimate problems of philosophy.’” (Bengtsson, 53) The insight of Jacobi continued in Schelling, in the Speculative theists (Immanuel Hermann Fichte, Weisse, Ulrici), to Hermann Lotze, the preeminent philosopher in mid-19th century Germany. It was primarily through Lotze that Personalism arrived in Boston. When Personalism arrives in Boston the distinctive modern characteristics of persons are in place: numerically distinct; individual interiority; freedom of choice among genuine options; autonomy; dignity; and agency. As Personalism makes its way to Boston and the West Coast, Personalism on the European continent and in England responds to German Idealism, specifically to Hegel.

In the debate between monism and pluralism, the traditional problem of the One and the Many resurfaces. But more, metaphysical monism and pluralism lie beneath many modern philosophies that Personalism rejects as impersonal, dehumanizing. Absolute idealism leaves little room for free will and self-determination and cannot be reconciled with the worth of the singular person. Totalitarianisms see persons as means to an end that exist for the interests of the state. Personalists respond insisting on self-determination, responsibility, inherent dignity of all persons. Individualism champions the autonomous self as its own law giver, making all other persons a means to one’s own ends. Personalism contends persons are communal, open to, cooperating with, and respecting the viewpoints of others. Personalism also rejects any reduction of persons to impersonal, deterministic laws, whether those of society, for example Auguste Comte (1798-1857), or of nature, Darwinism evolution. Personalists challenge Comtean philosophical positivism and Naturalism whether that of Darwinism or Samuel Alexander (1859-1938) by appealing to the dignity of individual persons, their free will, and values.

On the European continent, responses to Hegel led to the development of three schools: Paris, Gottingen/Freiburg, and Lublin. In Paris and under the influence of Existentialism, Mounier (1905-1950), Marcel (1889-1973), Paul Ricoeur (1913-2005), and Jacques Maritain (1882-1973) developed distinctive types of Personalism. In Gottingen and Feiburg Phenomenology developed under Husserl (1859-1938) who addressed the question of the relation of objective reality and philosophical reflection.  Later he returned to Idealism, precipitating a break with his former students, who, when on their own, made significant contributions to Personalism. These included Max Scheler (1874 – 1928), Dietrich von Hildebrand (1889-1977), Roman Ingarden (1893-1970), and Edith Stein (1891-1942). In Poland, the Catholic University of Lublin is a center of personalist thought whose foremost representative is Karol Wojtyla (1920-2005).  Personalism also developed in Prague at Charles University and under the leadership of Jan Patočka (1907 – 1977), one of Husserl’s last students. In Spain José Ortega y Gasset (1883 – 1955) espoused personalist themes. In Italy, Antonio Pavan of University of Padua ably represents Personalism. Personalism also has its adherents in Scandanavia, such as Jan Olof Bengtsson in Sweden. In Scotland the most notable 19th century personalist was Seth Andrew Pringle-Pattison (1856 – 1931), whose thought influenced William James (1842-1910), George Santayana (1863-1952), and George Herbert Mead (1863-1931). In England, Austin Farrer (1904-1968) argued for a personalist theism. John MacMurray (1891 – 1976) was the most notable Scottish Personalist in the first half of the 20th century. In Russia we find a version of Personalism in Berdyaev’s (1874-1948) thought.

4. North American Personalism

Before the mid-19th century few in North America discussed the thought of German philosophers such as Kant, Johann Gottlieb Fichte (1762-1814), Schelling (1775-1854), Hegel, and Lotze. As demands of academic life increased, young American philosophers completed their philosophical preparation with a year or two of study in Germany. When they returned, no longer fully embracing the old Scottish orthodoxy, their thinking was framed by the Spinoza-Leibniz debate, sometimes understood by theologians and many philosophers as the pantheism-individual freedom debate, and by reductions of persons to deterministic laws of society and nature. Deliberating within that framework, they thoroughly discussed the Personalism they learned while studying with Hermann Lotze at Gottingen. Among that group were George Santayana (who wrote his doctoral dissertation at Harvard University on Lotze), William James (1863-1952), Josiah Royce (1855-1916), and Borden Parker Bowne (1847-1910). That conversation grew into what Werkmeister in the mid-20th century calls the “first complete and comprehensive system of philosophy developed in America which has had lasting influence and which still counts some of our outstanding thinkers among its adherents.” (Werkmeister, 103.)

a. Branches of Personalism

Those historical roots found their way into American philosophy and formed at least four distinctive branches of Personalism. These four branches are idealistic (against threat of naturalism and realism), realistic (against reducing all to mind or person), organismic (against idealism, personalistic realism), and ethical (against collectivism and reduction of personality to mechanism and the body-brain).

Idealistic Personalism – The most distinctive type of Personalism in America, excluding Platonism or Kantianism, this idealism is expressed in three different forms: absolutistic, panpsychistic, and personalistic. Influenced by Jamesian pragmatism, Absolutistic idealists contend that reality is quantitatively and qualitatively one absolute mind, spirit, or person. All other beings, including physical and human ones, are ontologically manifestations of the absolute mind. Josiah Royce, William Ernest Hocking (1873-1966), and Mary Whiton Calkins (1863-1929) represented this view. Panpsychists are deeply influenced by Leibniz, who held that God, the supreme monad, creates all other monads and places them in a preestablished harmony. Rejecting absolute idealism, they hold that reality is composed of psychic entities of varying degrees of consciousness. Both A. N. Whitehead (1861-1947) and Charles Hartshorne (1897-2000) can be called, with qualification, panpsychists. Finally, for personalistic idealists, quantitatively, reality is pluralistic, a society of persons; and qualitatively, reality is monistic; it is person. The Infinite Person or God is the ground of all beings and the creator and sustainer of finite persons. In that sense, personalistic idealists are theistic. Representatives of this branch of Personalism include Borden Parker Bowne, Edgar Sheffield Brightman (1884-1953), Peter A. Bertocci (1910-1989), and Leroy Loemker (1900-1985).

Realistic Personalism – These personalists agree with idealistic personalists that Reality is spiritual, mental, and personal. They disagree about the ontological status of the natural order. Nature is neither intrinsically mental nor personal. It is a natural order created by God.  Realistic personalism is most notably expressed by Neo-Scholastics in Europe such as Jacques Maritain (1882-1973), Emmanuel Mounier (1905-1950), and Pope John Paul II (1920-2005), and in America W. Norris Clarke (1915- 2008) and John F. Crosby (1944 -). In America some realistic personalists stand outside the scholastic tradition, notably Georgia Harkness (1891-1974).

Personalistic Organicism – A recent form of Personalism was developed by Frederick Ferre (1933-2013). Rejecting panpsychism and personalistic idealism and influenced by Whitehead’s philosophy of organism, Ferre argues for a personalistic organicism. He claims in Living and Values that persons are “organisms with especially well-developed mental capacities leading to special needs and powers.” (Ferre, Living, 140) By these powers they can “perceive and manipulate the world, can vocalize and socialize, can create language, can imagine and plan by use of symbols freed from the immediate environment, and can guide behavior by ideal norms.” (Ferre, Living, 140)

Ethical Personalism – These personalists stress the crucial role of values in ontology and the moral life. Ethical Personalism is well represented by George Holmes Howison (1834-1916) who focuses on the Ideal or God toward which all uncreated persons move and the standard by which they measure the degree of their individual self-definition. Practically, ethical personalists concentrate on the dignity and value of persons in moral decision making.

b. Major figures

i. Harvard University

Josiah Royce’s thought was motivated by a religious view of life and reality, with an emphasis on the self and community. He sought to realize his philosophical goals through a synthesis of two traditions: the rationalistic system building of philosophers in the West, and the pragmatic emphasis on experience and practice, distinctive of American philosophical activity since the late nineteenth century. Royce also had a long and abiding interest in science and scientific inquiry and was deeply influenced by Charles Sanders Peirce (1839-1914). He wove these strands during his long and productive career.

At the root of Royce’s system is a concept of the self. Early in his career, the Absolute appears as the Self who knows all in one synoptic vision. Rejecting realism, mysticism, and critical rationalism, his central thesis is that to be real is to be a determinate, individual fulfillment of a purpose. Later he focused more on mediation and the idea of system. Toward the end of his career, the self appears as social. He developed a social theory of reality, a community of interpretation. He called this community “the Beloved Community” whose goal was to possess the truth in its totality.

William James, a leading pragmatist, shared with personalists a key insight. James said, “The more perfect and more eternal aspect of the universe is represented in our religions as having personal form. The universe is no longer a mere It to us, but a Thou, if we are religious; and any relation that may be possible from person to person might be possible here.” (William James, 27-28)

Mary Whiton Calkins, a student of Josiah Royce and William James at Harvard, taught psychology, classics, and philosophy at Smith College. Influenced by Royce and James, she adopted a monistic personal idealism and a personalistic psychology. In 1905, Calkins was elected president of the American Psychological Association and in 1918, she was elected president of the American Philosophical Association.

William Ernest Hocking believed the universe is independent of human minds but is discoverable through phenomenology. Sometimes thought of as the American Husserl, he studied for three months with Husserl at Gottingen. Scolded by the Harvard Philosophy Department for wasting his time with an unknown philosopher, he went to Berlin to complete his year abroad.  Yet, Husserl’s phenomenology made a deep impression on Hocking. Through careful phenomenological analysis, Hocking unpacked everyday phenomena and found that “nature can no longer be fully understood from the atoms upward but only from consciousness or selfhood outwardly.” (Howie, “Hocking’s ‘Transfigured Naturalism,’” 219)  Philosophy must be idealistic. Further, values keep emerging as we learn more about the world and ourselves. The mental life has unity, is deep and mysterious, and finally coheres in a single will. The finite person is an imperfect image of the universe. Influenced by personal idealism and Royce’s absolute idealism, the final unity is a self infinite in depth and mystery. The self or person is also a natural thing that is completely determined. More than a natural thing, the self is free to determine what will be fact in the next moment. Though nature and mind are in opposition to each other, one subject to the laws of nature, the other transcending them, through the ceremonies of religion that opposition is overcome.

ii. Boston University

Borden Parker Bowne claimed to be the first personalist in any thoroughgoing sense, having developed a systematic metaphysics, epistemology, ethics, and psychology. He taught at Boston University from 1876 until his death in 1910. Metaphysically Bowne was a pluralistic idealist like Howison. But his theism distinguishes his Personalism from Howison’s. God, the Divine Person, is both creator and world ground. Finite selves are created, and nature is the energizing of the Cosmic Mind. As world ground, the Divine Mind is the “self-directing intelligent agency” that accounts for the order and continuity of the phenomenal world.

Bowne was not only a systematic philosopher but also a caustic critic of Hegel’s absolutism, Spencer’s evolutionism, and all forms of materialism. These criticisms were expressed in his famous chapter in Personalism, “The Failure of Impersonalism.” In addition, any form of dogmatism or fundamentalism was the target of his searing attacks, especially when held by religious leaders in the Methodist Church.

Bowne’s teaching at Boston University attracted many young talented philosophers, some of whom formed the second generation of personalists in America. The most important among them were Albert Knudson (1873-1953), who continued the personalist tradition in the Divinity School of Boston University; Ralph Tyler Flewelling (1871-1960), who developed the School of Philosophy at the University of Southern California; and Edgar Sheffield Brightman, who led the Philosophy Department at Boston University from 1919 until his death in 1953.

Brightman (1884-1953) was the leading exponent in America of Personalist Idealism during the first half of the 20th century CE. Educated at Brown, Brightman earned his doctorate in Philosophy at Boston University under the founder of Personalism in America, Borden Parker Bowne.

Though an ordained Methodist minister and an authority on the Old Testament, for most of his scholarly life his central interests lay in the fields of Ethics, Philosophy of Religion, and Metaphysics.

A creative, brilliant, original philosopher, Brightman, in agreement with other Boston University personalists, sought truth to guide creative living in the most empirically coherent interpretation of experience. Rejecting the skepticism of Descartes, beginning the search for truth within experience, and advancing and testing hypotheses, Brightman developed the distinction between the shining present and the illuminating absent. Pointing beyond itself, the shining present is unintelligible without reference to an illuminating absent. Though the shining present does not prejudice the nature of the illuminating absent, Person is the hypothesis that most coherently illumines the shining present.

Brightman contended that everything that exists [or subsists] is in, of, or for a mind on some level. He defined Personalism as the hypothesis that all being is either a personal experience (a complex unity of consciousness) or some phase or aspect of one or more such experience. Nature is an order generated by the mind of Cosmic Person. Finite persons are created and grounded by the uncreated God, and as such possess free will. Reality is a society of persons.

Brightman’s most impressive work is his Moral Laws, in which he works out, along lines heavily indebted to Hegel, a thoroughgoing ethical theory. In ethics Brightman adopted perfectionism that moved from the abstract universal to the concrete universal. In its moral development, personality should be guided by moral laws, a kind of “regulatory system.” Understood dialectically, “the moral life is a special instance of the relation of the universal to the particular,” wherein the personality can achieve both the best possible and rational freedom. (Deats and Robb, 111) In this way, personality under the guidance of reason, seeks to become the best it can be, a fully integrated personality.

Central to his philosophy of religion is his well-known revision of the traditional view of God. If personality is the basic explanatory model, God must be seen as temporal. As temporal, God is not timeless but omnitemporal. Brightman agrees with the traditional view of God as infinite in goodness, but he disagrees that God is infinite in power. To maintain that God’s power is infinite seriously compromises the goodness of God. If evil is to be taken seriously, the will of God must be understood as limited by the nonrational Given within God’s nature. This nonrational condition in God is neither created nor approved by God, but God maintains constant and growing control of it. This controversial view was debated within personalist circles. For example, L. Harold DeWolf (1905-1906) followed Bowne’s traditional theism rather than Brightman’s, and Peter A. Bertocci (1910-1989) found in Brightman’s revisions a cogent and intelligible theism.

Bertocci, following Brightman as the leading personalist at Boston University, enriched the understanding of person through his work in psychology. Bertocci claims in “Why Personalistic Idealism?” that the person “is a self-identifying, being-becoming agent who maturing and learning as he interacts with the environment, develops a more or less systematic, learned unity of expression and adaptation that we may call his personality.” Bertocci is well known for his view that the essence of person is time. He is best known in the field of philosophy of religion for his wider teleological argument that provided increased evidence for God’s existence.

iii. California

Personalism simultaneously developed in Boston and California. One of the first American philosophers to employ the term Personalism was Howison at the University of California. After graduating from Harvard and early in his career, he was one of the St. Louis Hegelians. A thorough discussion of Hegel, however, led Howison to champion the finite individual and reject the absorption of the individual in the Absolute. In this way, Howison opposes Royce’s absolutism.

Howison succinctly stated his position, quoted by Buckman and Stratton in George Holmes Howison, “All existence is either (1) the existence of minds, or (2) the existence of the items and order of their experience; all the existences known as ‘material’ consisting in certain of these experiences, with an order organized by the self-active forms of consciousness that in their unity constitute the substantial being of a mind, in distinction from its phenomenal life.” Devoted to empiricism, Howison rejected creation. “These many minds . . . have no origin at all – no source in time whatever. There is nothing at all, prior to them, out of which their being arises. . . . They simply are, and together constitute the eternal order.” Collectively they move toward their own fulfillment as measured by the eternal standard to God.

Later at the Univesity of Southern California, Ralph Tyler Flewelling, a student of Bowne’s Personalism, taught Boston University Personalism as enriched into Democratic Personalism. He defined person as “A self-conscious unique unity capable of reflection upon its conscious states, of self-direction and transcending time. The self-identifying subject of experience, possessor of intrinsic values and creative powers. A continuum in a time-space world.” (quoted in Gacka, “American Personalism,” 160; also see R. T. Flewelling, The Person; or The Significance of Man,” 332). Emphasizing personal freedom, dignity, human potential, creativity, intrinsic moral worth, and community, Flewelling promoted Personalistic Democracy. He said, “The only abiding basis for democracy is respect for the sanctity of the person.” This includes respect for those “possibilities which reside in varying degree in every person. . . This means that personality is recognized as an intrinsic value, the most precious possession of society and the greatest source of social ‘advance and welfare.’” (Gacka, 161)

c. African Personalism

African personalists focus on total liberation and empowerment of African Americans, releasing them from oppression, and on the dignity and self-worth, especially lacking among young African American males. The conception of persons and God and the primacy of person are two appealing features of Personalism. Among American African personalists, the best known was Martin Luther King (1929-1968). King translated Personalism into social action by applying it to racism, economic exploitation, and militarism. Following closely its major themes, he emphasized the existence of a personal God, the dignity and sacredness of persons, the existence of an objective moral order and corresponding moral laws, freedom, and moral agency. ” (Burrows, Personalism, 77-78) However, the precedent for King’s social Personalism was set by John Wesley Edward Bowen (1855-1923), a student of Bowne’s, whom Burrows cites in Personalism as the “first African American academic personalist.” Rufus Burrow, Jr. (1951-) argues for a militant Personalism that considers the African American experience. Holding firmly to central personalist themes, he argues in Personalism for the sanctity of the body, the dignity of women, “we-centeredness plus I-centeredness,” preference for the poor and oppressed, immediate and radical social change, and respect for non-human life forms.

Earlier, John Wesley Edward Bowen (1855-1933) became the first African American to earn a Ph.D. degree at Boston University. Devoted to the centrality of person, Bowen argued that the importance of higher education for blacks lay in developing persons into men and women who would occupy important positions in society. Bowen also argued that progress must be intentional and accepted as slow, that social problems must be solved individualistically, and for a “concrete, not abstract ‘brotherhood’ of all persons.” (Burrows, Personalism, 80)

Recently, J. Deotis Roberts (1927-  ), a student of the British personalist Herbert H. Farmer and a black liberation theologian, emphasized that the conscious person is both “the highest intrinsic value and is personal, emphasis on freedom and self-determination, and focus on persons-in-community.” (Burrows, Personalism, 80). His thought can be organized around four personalist principles: the dignity of the whole person, God as personal, freedom and moral agency, and persons-in-community. Blacks are not subhuman, as they have often been treated; they possess intrinsic worth as whole persons and not as minds and bodies. The God of the Bible is personal and loves each of the creatures. Though God is responsible for all that happens in the world, humans are responsible for all moral wrongdoing. Finally, humans can fulfill, develop themselves only in community, which implies “sharing and caring based upon fellow feeling and deep fellowship. Ujamaa, ‘togetherness,’ ‘familyhood,’ is descriptive of community.” (Roberts, quoted by Burrows, Personalism, 85)

Feminists agree with the Personalism of Roberts and other black liberation theologians, but they emphasize the dignity and worth of black women to a degree unknown before. They do so in the context of the four personalist principles mentioned above.

d. American Indian Personalism

Vine Deloria, Jr. (1933 – 2005), a Standing Rock Sioux, exploring the metaphysics of American Indians finds personalistic themes, including a personal universe, dignity and sacredness of life, the existence of a moral order, and moral responsibility.

Every individual, whether a person, a tree, a bison, an alligator, or the sun are fundamental to the world we live in. Never isolated, what they are is constituted by their relationships. Each individual has a personality distinguished by its power and place. Power is the living energy that inhabits the universe, and place is the relationship an individual has to other individuals. More broadly, place is the relationships of things to each other.  In this way, an ear of corn is distinguished from the person who picks it and from the buffalo that provides meat for nourishment and hides for shelter and clothing. Power and place contribute to one’s habitude, “an attitude or awareness of a deep system of experiential relations on which the world is building or living. The key here is recognizing that experience is the undeveloped and untheorized site where the divisions between subjective and objective, material and spiritual, and an entire series of dichotomies disappear.” (Wildcat, “Understanding the Crisis,” in Power and Place, 34). One acquires this through the clan system developed in geographical and ecological environments. For example, the Seminole, living in the wetlands of Florida establish an important relationship the alligator, while the Cherokee, living in the mountains of North Carolina do not. One’s habitude contributes to the richness of one’s personality, to one’s interrelationships with other individuals, and to an understanding of the path one is to walk. Power, place, and habitude suggest that the interrelated universe is alive and personal and must be approached with respect. Living and its quality depend on it.

All relationships have a moral content. Contemplating an action, a person must consider whether the proposed action is appropriate. Harvesting plants involves respecting the plants, their power and place in relationship to other individuals. Considering the impact on others and the consequences of one’s action, one must never intrude into the lives of other. The universe is built upon constructive and cooperative relationships that must be maintained.

Not only must one’s actions correlate with other personalities but also with the larger movements of the universe. They followed the principle that whatever is above must be reflected below. In their villages most tribes constructed their dwellings after some model of the universe. They reproduced the cosmos in miniature and believed that spiritual change would be followed by physical change. In this way, they participated in cosmic rhythms.

Through these relationships humans understand what they are, what they are to be, and what they are to do. Deloria points out that “everything that humans experience has value and instructs in some aspect of life. . . . The real interest of the old Indians was not to discover the abstract structure of physical reality but rather to find the proper road along which for the duration of a person’s life, individuals were supposed to walk. . . . The universe is a moral universe.” (Mankiller, “Foreward,” in Spirit and Reason, vii)

5. Latin American Personalism

Personalism in Latin America developed in the 20th century against the historical background of scholasticism (16th – 19th centuries) and naturalism and positivism (19th century). The contemporary period (late 20th into 21st centuries) continued idealistic and personalistic discussions of axiology and social philosophy, manifested a shyness regarding metaphysics, emphasized existentialist themes, and witnessed the rise of dialectical materialism. The discussion of Personalism chiefly took place in three centers of philosophical activity, Argentina, Mexico, and Puerto Rico.

In Argentina, two philosophers distinguished themselves. Alejandro Korn (1860-1936), in reaction to positivism, introduced German philosophy to his countrymen and was known as “the teacher of knowledge and virtue” and “The Philosopher of Liberty.” Emphasizing the role of intuition as the basis of knowledge and the social idealism, his views were not fully developed.  Franscisco Romero (1891-1962), a younger contemporary of Korn, arriving at his philosophy by way of psychology, was a sworn foe of mechanistic and behaviorist view of persons. Persons are whole, a structure not determined by its parts. They are both of the psyche, the lower, subjective, egotistic aspect of the self; and spirit, the objective and altruistic tendencies of the self. He thought that person is the key to reality. Persons have the ability to transcend subjectivity and grasp a superindividual order, and order that transcends him. Romero thought Josiah Royce’s The World and the Individual as the greatest contribution America has made to systematic and speculative philosophy.

Mexico is the second outstanding center of philosophical activity. Chief among its eminent thinkers is José Vasconcelos (1882-1959). Though critical of idealism and leaning decidedly toward Thomistic realism, he is deeply theistic and personalistic. Holding to a theistic monism, the universe is a living whole that finds its unity in God. His chief themes are individuality, freedom, purposeful creativity, cosmic reality in process, personality, and God. Vasconcelos, unlike most philosophers in North America, was a man of action, standing for the president of Mexico, though never elected. Bergson’s call to philosophers to “think like men of action and act like men of thought” was often heeded in Latin America.

The third center is Puerto Rico, whose most distinguished thinker was Eugenio Maria de Hostos (1839-1903). An ethical and social idealist, Hostos stressed the supreme worth personality, the dignity of persons. That is the foundation stone of civilization itself. No mere ivory tower, idealistic thinker, Hostos gave a realistic analysis of the danger inherent in what he called our “machine civilization.”

In addition to those centers of philosophical activity and their signal figures, other philosophers who discussed personalist themes were Antonio Caso in Mexico (1883-1946), Alejandro Deustua (1849-1945) and Victor Andres Belaunde (1883-1966) in Peru, Enrique Molina in Chile, and Carlos Vas Ferreira (1872-1959) in Uruguay.  Recently, Ignacio Ellacuria (1930-1989) of San Salvador developed Liberation Philosophy that focused on the social and personal imperative to overcome dependency as the path toward the fullness of one’s humanity. Common themes among them include dynamic stress on action, the philosophy of persons, freedom of persons, and the law.

Regarding direct of influence of North American Personalism on Latin American philosophers, when Latin American philosophers became aware of North American philosophy, it was Personalism, especially that of Edgar Sheffield Brightman that attracted them most among then-living philosophers. Josiah Royce also attracted them, especially The World and the Individual, as we have seen. But, it was Brightman’s Personalism that exerted the greatest influence. The interest was reciprocal. Brightman established the first graduate course in Latin American philosophy in the United States. (Cornelius Kruse, 149)

6. Current Trends

Personalists in North America carry on a vibrant philosophical discussion. They are developing, modifying, and challenging concepts and themes central to 20th century Personalism. They include Richard C. Bayer at Fordham University, drawing on Catholic social thought and affirming the dignity of persons, focuses on personal development and a modified market economy. Patrick Grant at University of Victoria, British Columbia, outlines a Personalism approach appropriate for a post-modern and post-Marxist cultural phase. Thomas R. Rourke at Clarion University and Rosita A. Chazarreta Rourke at Duquesne University, heavily influenced by Mounier, Dorothy Day and the Catholic Worker movement,  defend Personalism as an alternative to liberal capitalist and socialism. Erazim Kohak (1933-), drawing on the early work of Edmund Husserl (1859-1938) and Max Scheler (1874-1928), developed a personalistic view of nature. John Howie (1929-2000) developed an environmental ethics along personalist lines. Randall Auxier (1961-), editor of Library of Living Philosophers, writes on Brightman, Hartshorne, Royce, and Whitehead and is rethinking time, a category central to Brightman’s thought. Doug Anderson writes on Pierce and American philosophy. Currently, the center of studies in Personalism is in the department of philosophy of South Illinois University, where it is taught in the American philosophy program. Rufus Burrow, Jr. writes on African Personalism, particularly Martin Luther King, Jr.  Thomas O. Buford founded and edited The Personalist Forum, now The Pluralist, and writes in education, epistemology, American Personalism, and philosophy of culture. Also, he and Charles Conti of University of Sussex Brighton England formed The International Forum on Persons in 1988.  It holds biennial meetings alternating between North American and Europe. Alternating with the international meetings, Jim McLachlin holds a week-long summer conference on Personalism at Western Carolina University in the mountains of North Carolina.

7. References and Further Reading

  • Auxier, Randall E.  Hartshorne and Brightman on God, Process, and Persons: TheCorrespondence, 1922-1945. Nashville, TN: Vanderbilt University Press, 2001.
  • Bengtsson, Jan Olof. The Worldview of Personalism, Origins and Early Development. Oxford: Oxford University Press, 2006.
  • Bertocci, Peter A.  Introduction to the Philosophy of Religion. New York, NY: Prentice-Hall, Inc., 1951.
  • Bertocci, Peter A, and Richard M. Millard.  Personality and the Good. New York, NY: David McKay, Co., 1963.
  • Blau, Joseph L. Men and Movements in American Philosophy. New York, NY: Prentice-Hall, Inc., 1955.
  • Bertocci, Peter A. “Why Personalistic Idealism?” Idealistic Studies. 10, no. 3 (1980): 181-198.
  • Boethius, Liber de Persona et Duabus Naturis. Rome: Tuguri, 1571.
  • Bowne, Borden Parker.  Metaphysics. New York, NY: Harper and Brothers, 1882.
  • Bowne, Borden Parker.  A Theory of Thought and Knowledge. New York, NY: Harper and Brothers, 1987.
  • Bowne, Borden Parker.  Principles of Ethics.  New York, NY: Harper and Brothers, 1892
  • Bowne, Borden Parker. Personalism. Boston, MA: Houghton Mifflin Company, 1908.
  • Bowne, Borden Parker.  Kant and Spencer, a Critical Exposition. Boston, MA: Houghton Mifflin, 1912.
  • Brightman, Edgar Sheffield. An Introduction to Philosophy. New York, NY: Henry Holt and Co., Inc, 1925.
  • Brightman, Edgar Sheffield.  Moral Laws. New York, NY: Abingdon Press, 1933.
  • Brightman, Edgar Sheffield.  A Philosophy of Religion. Englewood Cliffs, NJ: Prentice-Hall, Inc., 1940.
  • Brightman, Edgar Sheffield.  Person and Reality. Edited by Peter A. Bertocci with Janette E. Newhall and Robert S. Brightman. New York, NY: Ronald Press, 1958.
  • Buford, Thomas O. Trust, Our Second Nature; Crisis, Reconciliation, and the Personal. Lanham, Maryland: Lexington Books, 2008.
  • Buford, Thomas O. Self-Knowledge, An Essay in Social Personalism. Lanham, Maryland: Lexington Books, 2011.
  • Buford, Thomas O., and Harold H. Oliver, eds. Personalism Revisited, Its Proponentsand Critics. New York, NY: Rodopi, 2002.
  • Buford, Thomas O. In Search of a Calling, The College’s Role in Shaping Identity. Macon, GA: Mercer University Press, 1995.
  • Burrows, Rufus R. Personalism, a Critical Introduction. St. Louis, MO: The Chalice Press, 1999.
  • Buckham, J. W., and G. M. Stratton, eds.  George Holmes Howison, Philosopher andTeacher. Berkeley, CA: University of California Press, 1934.
  • Calkins, Mary W.  The Persistent Problems of Philosophy. New York, NY: The Macmillan Company, 1910.
  • Conti, Charles. Metaphysical Personalism, An Analysis of Austin Farrer’s Metaphysicsof Theism. Oxford: Oxford University Press, 1995.
  • Deats, Paul, and Carol Robb, eds. The Boston Personalist Tradition in Philosophy, Social Ethics, and Theology. Macon, GA: Mercer University Press, 1986.
  • Deloria, Barbara, Kristen Foehner, and Sam Scinta, eds. Spirit and Reason, the Vine
  • Deloria, Jr., Reader.  Foreward by Wilma P. Mankiller. Golden, Colorado: Fulcrum Publishing, 1999.
  • Deloria, Jr., Vine. The Metaphysics of Modern Existence. New York, NY: Harper and Row Publishers  1979.
  • Deloria, Vine, Jr., and Daniel Wildcat, eds. Power and Place: Indian Education inAmerica. Golden, Colorado: American Indian Graduate Center, 2001.
  • De Smet, Richard. Brahman and Person, Essays by Richard De Smet. Edited by Ivo Coelho. Delhi: Motilal Banarsidass Publishers, 2010.
  • DeWolf, L. H.  The Religious Revolt Against Reason. New York, NY: Harper, 1949.
  • Flewelling, Ralph Tyler. Creative Personality. New York, NY: The Macmillan Co., 1926.
  • Flewelling, Ralph Tyler. The Person, or the Significance of Man. Los Angeles, CA The Ward Ritchie Press, 1952.
  • Flewelling, Ralph Tyler. “Personalism.” In Dictionary of Philosophy, pp. 229-230. Edited by Dagobert D. Runes. New York, NY: Philosophical Library, 1943.
  • Ferre, Frederick.  Being and Value: Toward a Constructive Postmodern Metaphysics. Albany, NY: The State University of New York Press, 1996.
  • Ferre, Frederick. Knowing and Value: Toward a Constructive Postmodern Epistemology. Albany, NY: The State University of New York Press, 1998.
  • Ferre, Frederick.  Living and Value. Toward a Constructive Postmodern Ethics. Albany, NY: The State University of New York Press, 2001.
  • Fixico, Donald. The American Indian Mind in a Linear World.  New York: Routledge, 2003.
  • Franquiz, Jose A. “Personalism in Latin American Philosophy,” Philosophical Forum. XII (1954): 68-81.
  • Gacka, Bogumil.  Bibliography of American Personalism. Oficyna Wydawbucza Czas: Lublin, 1994.
  • Gallagher, Shaun. Personalism – A Brief Account.
  • Harkness, Georgia.  The Providence of God. New York, NY: Abingdon Press, 1960.
  • Hartshorne, Charles. The Divine Relativity, A Social Conception of God. New Haven, CT: Yale University Press, 1948.
  • Hartshorne, Charles. Reality as Social Process; Studies in Metaphysics and Religion. Glencoe, IL: Free Press, 1953.
  • Garcia, Jorge J.E. and Elizabeth Millan-Zeibert,eds.  Latin American Philosophy for the21st Century: the Human Condition, Values, and the Search for Values. Amherst, New York: Prometheus Books, 2004.
  • Georg W. F. Hegel. Hegel’s Preface to the Phenomenology of Spirit. Translated with running commentary by Yovel Yirmiyahu. Princeton, NJ: Princeton University Press,    2005.
  • Hocking, William Ernest.  The Meaning of God in Human Experience.  New Haven, CT: Yale University Press, 1912.
  • Howie, John. “W. E. Hocking’s ‘Transfigured Naturalism.’” In A William Ernest
  • Hocking Reader with Commentary, 217-230. Edited by John Lachs and D. Micah
  • Hester. Nashville, TN: Vanderbilt University Press, 2004.
  • Howison, George Holmes. (ed.)  The Conception of God: A Philosophical DiscussionConcerning the Nature of the Divine Idea as a Demonstrable Reality. New York,                                 NY: Macmillan, 1898.
  • Howison, George Holmes. (ed.) The Limits of Evolution and Other Essays Illustrating the MetaphysicalTheory of Personal Idealism.  New York, NY: Macmillan Co., 1901.
  • Howison, George Holmes. (ed.)  The Origin of Evolution.  New York, NY:  Macmillan Co., 1905.
  • James, William. The Will to Believe and Other Essays in Popular Philosophy. New York, NY: Dover, 1956.
  • Kasulis, Thomas P. Intimacy or Integrity, Philosophy and Cultural Difference. Honolulu, HI: The University of Hawai’i Press, 2002.
  • Kohak, Erazim  The Embers and the Stars. Chicago, IL: University of Chicago Press, 1984.
  • Knudson, Albert C.  The Philosophy of Personalism.  New York, NY: Abingdon Press, 1927.
  • Kruse, Cornelius. “Personalism in Latin America.”  In Personalism Revisited, ItsProponents and Critics, 147-156. Edited by Thomas O. Buford, and Harold H. Oliver. New York, NY: Rodopi, 2002.
  • Lachs, John and D. Micah Hester, eds.  A William Ernest Hocking Reader with -Commentary. Nashville, TN: Vanderbilt University Press, 2004.
  • Loemker, Leroy E. “Some Problems in Personalism.” In Personalism Revisited, ItsProponents and Critics, 171-185. Edited by Thomas O. Buford and Harold H. Oliver. New York, NY: Rodopi, 2002,
  • Mankiller, Wilma P. “Foreward.”  In Spirit and Reason, the Vine Deloria, Jr., Reader, pp. vii-ix. Edited by Barbara Deloria, Kristen Foehner, and Sam Scinta. Golden, Colorado: Fulcrum Publishing, 1999.
  • Mendieta, Eduarto, ed. Latin American Philosophy: Currents, Issues, and Debates. Bloomington, Indiana: Indiana University Press, 2003.
  • Muelder, Walter G.  Foundations of the Responsible Society. New York, NY: Abingdon   Press, 1960.
  • Monk, Arthur W. “The Spirit of Latin American Philosophy.” Ethics 72, 3 (April, 1962): 197-201.
  • Norton-Smith, Thomas. The Dance of the Person and Place. Albany: State University of New York Press, 2010.
  • Pavan, Antonio. Enciclopedia della persona nel xx secolo. Napoli: Edizioni Scientifiche Italiane, 2008.
  • Plotinus. The Six Enneads. Translated by Stephen MacKenna and B. S. Page. Chicago, IL: Encyclopedia Britannica. 1952.
  • Royce, Josiah.  The Philosophy of Loyalty.  New York, NY: The Macmillan Company, 1908.
  • Royce, Josiah. The Problem of Christianity. Chicago, IL: University of Chicago Press, 1968.
  • Royce, Josiah.  The World and the Individual. New York, NY: Macmillan Company, 1899.
  • Srinivason, Gummaraju. Personalism, An Evaluation of Hundu and Western Types. Delhi: Research Publication in Social Sciences, 1972.
  • Steinkraus, Warren E, ed. Representative Essays of Borden Parker Bowne. Utica, NY: Meridian Publishing Company, n.d.
  • Steinkraus, Warren E. and Robert N. Beck, eds. Studies in Personalism. Selected Writings of Edgar Sheffield Brightman. Utica, New York: Meridian Publishing Company, n.d.
  • Werkmeister, W.H. A History of Philosophical Ideas in America. New York, NY: Ronald, 1949. (Westport, Connecticut: Greenwood, 1981.)
  • Wildcat, Daniel. “Understanding the Crisis in American Education.” In Power and Place, Indian Education in America, pp. 29-39. Edited by Vine Deloria, Jr., and Daniel Wildcat.  Golden, Colorado: American Indian Graduate Center, 2001
  • Zizioulas, John D. Being as Communion. Studies in Personhood and the Church. Forward by John Meyendorff. Crestwood, NY: St Valdimir’s Seminary Press, 1985.

 

Author Information

Thomas O. Buford
Email: tom.buford@furman.edu
Furman University
U. S. A.

Individualism in Classical Chinese Thought

“Individualism” is used here to denote inborn and inalienable prerogatives, powers, or values associated with the self and person as found throughout much of the Chinese philosophical tradition. Unlike individualism in modern European and American contexts, Chinese manifestations of “individualism” do not stress an individual’s separation, total independence, and uniqueness from external authorities of power. Rather, individualism in the Chinese tradition emphasizes one’s power from within the context of one’s connection and unity (or harmony) with external authorities of power.  So while both the modern Western and Chinese contexts share a belief that individuals are morally valuable and may attain an outstanding status as such, the Western tradition tends to view the individual in an atomized, disconnected manner, whereas the Chinese tradition focuses on the individual as a vitally integrated element within a larger familial, social, political, and cosmic whole.  Chinese thinkers frequently address issues related to individual value, empowerment, authority, control, creativity, and self-determination, yet they package these crucial aspects of individualism in ways that are generally different from the way individualism has been packaged in the West.

Since the term is not indigenous to China, there is a general scholarly dispute about the relevance and appropriateness of applying the term “individualism” to Chinese philosophy. The inability of mainstream scholarship and discourse to locate and come to terms with native forms of individualism in China has had important ramifications for scholarship, politics, and international relations as well. For example, the current debate about universal human rights is founded on beliefs that individuals can lay claim to certain prerogatives simply by virtue of their existence as individuals. Some Asian polities have used the argument that Asian traditions are not individualistic in order to claim that human rights discourse is not only not universal in scope, but that it is also incompatible with traditional Asian values.

Table of Contents

  1. Dispute over “Individualism” in Chinese Philosophy
  2. The Concept of Autonomy in Chinese Individualism
  3. The Self as Individual
  4. Individualism in Classical Confucian Thought
  5. Moral Autonomy in the Mohist Writings
  6. Individualism in the Zhuangzi
  7. Individualism in the Thought of Yang Zhu
  8. References and Further Reading

1. Dispute over “Individualism” in Chinese Philosophy

Scholars of early Chinese thought such as Chad Hansen, Henry Rosemont, and Michael Nylan have often considered the term “individualism” to be irrelevant or inappropriate for studying Chinese culture and history. Popular perceptions also tend to view Chinese culture as characterized by obligation and duty rather than by individual freedoms. This characterization of Chinese culture as group-oriented rather than individual-oriented helps promote the notion that individualism, especially as it is perceived – as a doctrine that protects individual autonomy against obligations stemming from external, familial or social institutions – is inappropriate for the Chinese context.

Other scholars such as Yu Ying-shih, Donald Munro, Erica Brindley, and Irene Bloom accept the concept of individualism as relevant for the Chinese tradition, at least as a point of discussion. Brindley goes the farthest to contend that by denying individualism in Chinese thought, one effectively ignores the multiple ways in which goals and values for the individual are in fact underscored in the tradition. While Brindley, Yu, and perhaps Bloom readily concede that the term “individualism” stems historically from European and American contexts, they generally agree that this need not limit the term’s usefulness as a tool for understanding concepts relating to the value and powers of the individual in China. For, even in the West, there is no single definition of the term “individual” that has escaped scholarly and public challenge and contestation. Nor does “individualism” always strictly connote one’s uniqueness, separation, and distinction, even in Western usages. Furthermore, the lack of a term or even explicit debate over doctrines of the individual, free will, or autonomy does not mean that Chinese thinkers or even ordinary Chinese people did not imply such things in their writings, or experience them in their lives. Making use of such arguments, scholars of this persuasion therefore assert that one can apply “individualism” to Chinese philosophy to gain rich comparative insights and shed light upon the importance of the integrated individual in Chinese philosophy.

The following analysis of texts and their embedded assumptions and claims serves to draw out possible Chinese forms of individualism that appear to differ considerably from Western forms of possessive individualism, which arose specifically in seventeenth-century English contexts. The latter forms focus on an individual’s possessive claims to uniqueness, and autonomy from surroundings. Chinese forms of individualism, on the contrary, tend to stress an individual’s achievement or fulfillment of some potential from within and in terms of a larger familial, social, and cosmic whole. This concept of individualism does not support a strong sense of autonomy and independence as defined through separation or freedom from others, but rather it reveals the autonomy and independence of the individual as a fully attained and integrated being within a larger web of relationships and authorities.

2. The Concept of Autonomy in Chinese Individualism

The notion of autonomy arguably serves as a distinguishing aspect of any form of individualism. The autonomous agent in many Western discursive models is free from certain external influences. This can be seen in the fact that various individualisms of today generally recast the individual as someone with the potential to be separate and different from his environment and conventional norms. They empower individuals by emphasizing their ability to make decisions and judgments independent of mundane influences and norms in the world.

Early Chinese forms of individualism, on the contrary, do not generally focus on the radical autonomy of the individual; but rather on the holistic integration of the individual with forces and authorities in his or her surroundings (family, society, and cosmos). For early Chinese thinkers, there is no such thing as unfettered autonomy or freedom of will. Rather, early Chinese thinkers posit the existence of a relative and relational sort of autonomy; or, a type of autonomy that grants individuals the freedom to make decisions for themselves, and to shape the course of their own lives to the fullest degree that they can—all from within an intricate system of interrelationships. This type of autonomy grants authority to the individual to fulfill his or her potentials as an integrated individual. The goal of such an individual is to achieve authoritativeness as a person while at the same time duly negotiating influences, commands, and responsibilities that stem from his or her larger environment. Therefore, a crucial back-and-forth tug between the self and the various authorities surrounding it is woven into the very fabric of what it means to be a fully attained, authoritative, empowered, and integrated individual.

3. The Self as Individual

Free from the radical dichotomy between truth/essence and appearance that is characterized by Descartes, the early Chinese “self” is not encumbered by a gross split between mind and body, or between true nature and experience. Rather, the early Chinese “self” is more akin to an organism, which both consists in and emerges out of complex processes occurring inside and outside of it as it interacts with and relates to his or her environment. In such a way, the concepts of self and person are much more integrated than in certain, extreme dualistic Western traditions, as stand in constant and ever-changing relationship to what occurs both within and without.

To the extent that the self is conceived as physical, embodied, and dynamic, the early Chinese “self” necessarily entails a different definition of  “individual.” While there is no clear term in Classical Chinese that might translate consistently into “individual,” this latter term facilitates discussion of those aspects of the self that emphasize its particularity within a whole. We use the term “individual” here to refer to early Chinese notions of self that concern not so much the subjective, psychological sense of “self,” but the qualities of a person that mark him or her as a single, particular entity capable of exerting agency from within a web of relationships. In other words, we refer to the individual not as an atomistic, isolated, and undifferentiated part of a whole, but as a distinct organism that must serve particular functions and fulfill a unique set of relationships in the worlds of which he or she is a part. The individual is thus a unique participant in a larger whole—integral to both, the processes that define the whole, as well as to the change and transformation that stems from itself.

4. Individualism in Classical Confucian Thought

One of the abiding concepts in Chinese philosophy, irrespective of the school of thought, is that of self-cultivation. The Ru, or Confucian lineage, places a premium on the moral cultivation of the individual using a variety of tools and resources, both internal and external. In the Analects of Confucius, the junzi (gentleman, or  nobleman) constitutes the most important ideal for the individual, and any person who strives for such an ideal is to do so by a complicated moral regimen of intense involvement with the rites of the Zhou (dynastic house) and its music; moral education through a morally achieved ruler, master or moral exemplar; and training—involving texts and histories as well as personal resources such as will-power, moral desire, inward reflection and thought, and the active appraisal of how one’s own thoughts and actions compare to those of others.

While one may not wish to call anything mentioned in the Analects “individualism,” it is clear that the individual holds the most valuable key insofar as he or she serves as the locus for self-cultivation and, hence, for the transformation of himself or herself to contribute to a moral society and cosmos. The individual forms the basis upon which authoritative, moral meaning and behavior is to be constructed. Insofar as the individual is considered to be the fundamental site of moral transformation, it is an absolutely crucial element of Confucian thought. So, while the philosophy represented by the Analects does not promote individualism as a moral stance that stresses individual autonomy and freedom from social constraints, it does establish the individual as inherently valuable in the process of moral cultivation, with the potential to be authoritative and fully integrated as a junzi figure within a web of intricate social, political, and cosmic forces. Thus, a type of integrated individualism seems to exist even in the most basic of early Chinese Confucian texts.

The figure whose writings provide us with one of the earliest, and perhaps clearest, representations of early Chinese individualism is Mencius. In the literature prior to Mencius, the individual represents a foundational site for moral cultivation, but the source of one’s moral motivation and insight may stem largely from external authorities. Mencius changes this by appealing to the innate moral agencies of the individual through the concept of xing, (human nature). By naturalizing moral motivation through the concept of xing, Mencius reveals what appears to be a new orientation towards human agency: one that sees the individual body as a universal source of cosmic authority and natural patterns.

Mencius defines sources of moral agency and authority by outlining an internal-external dichotomy and emphasizing the internal resources of the individual in moral cultivation. This is best demonstrated in Mencius 2A2 and the entire Chapter Six, Part A of the text, where Mencius debates with an opponent, Gaozi, over the idea that xing is a source of moral agency and insight. Unlike Mencius, Gaozi advocates the total subordination of the human heart-mind, the seat of a person’s controlling mechanism, the will (zhi), to yan, or what might be translated in the passage as “sayings,” or “teachings.” In such a way, Gaozi declares the absolute necessity of study and discipline through tradition, culture, and other external inputs. Mencius counters this by showing the necessity of stilling one’s heart-mind so that it will allow for its natural, innate moral tendencies to guide the body in correct thinking and behavior.

In another famous debate, Gaozi compares moral refinement to cups and saucers, which have been constructed by man through hard work and external imprinting. His view of moral cultivation strongly denies that an individual’s internal xing could have any moral quality or potential. Mencius responds with an analogy of equal force, describing human xing in terms of water. Just as the flow of water naturally tends downward, he claims, so does human xing naturally move toward goodness. Denouncing Gaozi’s views on the external origins of morality, Mencius insists that only when internal resources such as xing are obstructed, violated, and destroyed through external forces, does immoral behavior arise.

Mencius’ claims integrate the moral motivation of xing with life processes associated with the human body. Taking advantage of a linguistic connection between the terms for “life” and “human nature” in classical Chinese, Mencius argues that the moral agency of xing is intrinsic to basic life processes. To him, moral motivation, rooted in human nature, is inextricably tied to the agency that fills our very lives with health and vitality.

In sum, to Mencius, each individual person is his or her own moral agent by virtue of living properly and healthfully as a human being. By locating the seeds of morality in xing, one’s Heaven-endowed agency for human life, Mencius demonstrates that cultivating oneself morally is tantamount to attaining the proper measures for the basic vital functions of human beings. Mencius therefore not only naturalizes moral agency by making it a universally inherent trait in every individual, he also proposes a radical, physiological claim for a type of individualism that connects proper moral cultivation to the natural growth of one’s inherent xing and life forces.

Mencius is important in the history of Chinese individualism because he grounds ultimate moral authority in the inner, innate resources of the individual. What characterizes Mencius’ form of individualism as a stronger form of individualism than that outlined in the Analects is its emphasis on the human body not merely as a medium of authority or primary locus for the attainment of idealized authority (as exemplified through self-cultivation), but as an individualized source of it as well.

It is noteworthy that all Confucians who postdate Mencius seem to understand xing in terms of powerful, innate tendencies of individuals, but some, like Xunzi, fought vigorously to deny that such tendencies were morally positive. While Xunzi may not be called an individualist in the sense that Mencius may, his thought nonetheless supports a strong notion of individual moral autonomy as represented in the Analects.

5. Moral Autonomy in the Mohist Writings

The early Mohists were famous for their views in social conformity and obedience to political authorities, such as rulers and the Son of Heaven, who abided by the authority of Heaven. There is little that is individualistic about such conformist ideals in a Western sense. However, when one considers that the basis of their views on moral meritocracy and Heaven’s Will are grounded on a fundamental belief in an individual’s rational capacity to know and learn about morality, then the Mohist individual starts to appear much more individualistic than he would at first glance. Indeed, in early Mohist writings, individuals are required to know and choose the morally correct path – that which conforms with Heaven’s Will – on their own. They are thus morally autonomous in two senses: (1) They have the ability to use their rational minds to decipher, come to know, or (in the case of unexceptional commoners and people) at least be tacitly familiar with Heaven’s Will, and (2) They have the ability to choose to conform with what is right.

The early Mohists, who argue explicitly against contemporary beliefs in ming (fate, destiny, derived from Heaven), grant the individual a high degree of control over outcomes in this world. So while the early Mohists do not place extra value or emphasis on the individual or its powers and prerogatives, much less on its self-cultivation, they implicitly grant the individual much agency and control over the course of its life and the type of moral path it wishes to follow. Through their writings one gains insights into the ways in which concepts like conformity may actually go hand-in-hand with beliefs in autonomy and free will.

6. Individualism in the Zhuangzi

The Inner Chapters of the Zhuangzi, generally considered by scholars to have been written by Zhuangzi (or Zhuang Zhou), promote a vision of the individual’s unity with the Dao of Heaven. Whether such a vision is individualistic or not is open to debate. On the one hand Zhuangzi does not explicitly attribute the processes of the Dao to powers inherent in an individual’s body or spirit. Therefore, his writings do not technically fall under the definition of “individualism,” used above when discussing Mencius, which locates the primary source of idealized agency within the mundane individual. In fact, Zhuangzi openly advocates the notion of losing one’s self-identity and sense of self or body in order to fully embrace the agency of Dao. This appears to go against any kind of individualism that might place value on the self.

On the other hand, however, Zhuangzi hopes that every individual might achieve a transcendent self, along with a freedom associated with the transcendent individual. Such freedom – spiritual in nature – is not freedom from a higher source of power, but freedom through it. Insofar as Zhuangzi promotes an ideal of spiritual freedom through individual self-cultivation, his thought is characteristic of the holistic individualism described previously. Individuals are not valued in and of themselves but through their connection with a higher authority or power.  Realized individuals – the goal in Zhuangzian thought – are not unique, autonomous individuals who stand apart from external powers, but unique manifestations of the workings of a shared Dao.

The so-called “Primitivist,” whose writings in the Outer Chapters of the Zhuangzi seem to represent a coherent voice in that text, presents a form of individualism more akin to that described in the Mencius above. Whereas the Inner Chapters expound on a philosophy whose goals appear compatible with individualistic goals, this strand of the Outer Chapters goes further to locate value inside the individual from the beginning, even in an individual’s mundane state.

The primitivist writings uniquely emphasize the idealized powers of xing in every individual, which ultimately link a person with the Dao. Using a strong language of internal-external, the Primitivist denounces morality as an external overlay and unnecessary pollution of internal xing. By recommending that each individual place all of his or her faith in the natural, innate powers of xing, the Primitivist suggests that one can rid oneself of impulses responsible for the creation of cultural and social norms. This results in the reversion of the individual not just back to his or her most basic nature – one that is not coincidentally in accordance with the Dao of the natural world – but a reversion of society to an era of primitive political structures and human interactions as well.

By rejecting the necessity of social structures, institutions, knowledge, technologies, and cultural practices in favor of a cosmic or natural law and power that is accessible through the individual, human body, the proponents of the primitivist ideology share a basic individualistic point of view. Such a view assumes that ultimate value lies in what humans possess innately and in what is naturally accessible to every individual. For the Primitivist, this internal, innate, and universal human agency to interact ideally in the world derives from xing, which is ultimately a part of the natural cycles of the cosmic Dao.

The Primitivist illuminates polarities between what is external and alien or internal and inalienable to a given object. In such a manner he pits knowledge and culture in society against an individual’s personal vitality and innate powers. This naturalizes what is ideal by locating it in the cosmic capacity and authority of an individual’s xing. In the Laozi, a text upon which the Primitivist writing heavily relies, the ruler serves as the main conduit that enables everyone’s individual access to the Dao. Unlike the Laozi, the Primitivist presents a utopian vision that speaks to every individual’s direct, bodily relationship to cosmic power. This difference points to a noteworthy distinction between theocratic conceptualizations of cosmic authority and power as expressed in the Laozi; and biocratic, individualized ones as expressed in the Primitivist ideal.

7. Individualism in the Thought of Yang Zhu

One cannot speak of individualistic movements in early China without at least coming to terms with what we know about Yang Zhu, or Yangzi (c. 4th century B.C.E.), and his legacy. Mencius claimed that Yang Zhu promoted a doctrine of egoism, which the former deemed tantamount to anarchism. Though there is no solid evidence that anything Yangzi may have authored has been transmitted through the ages, we can still gain insight into his views from descriptions and condemnations of his teachings by Mencius and other writers of the slightly later Han period. It is possible that what we have described as primitivist above is nothing more than a strain of thought influenced by Yangist tenets and beliefs.

Yang Zhu, like Mencius, appears to have viewed the self and human body as an important resource for universal, objective forms of authority through xing. We see this through the following quote from Mencius, which states: “Even if he were to benefit the world by pulling out a single hair, he would not do it.” It appears that Yangzi’s so-called egoism is founded on a principle of preserving some aspect of one’s self or body over and above anything else. A later author claims that what Yangzi valued was self in and of itself, while others described his thinking in the following way: “Keeping one’s nature whole, preserving one’s genuineness, and not letting things tire one’s form (body) – these Yangzi advocated but Mencius denounced.” In this example, the self to be valued consists in xing, the body, and in “genuineness” – a vague concept that seems to refer to a spiritual ideal – inherent or original to the individual. Based on such a description, Yang Zhu appears to have idealized certain aspects of the self that help define its essence, whether material, spiritual, or both. By insisting on a sharp separation between that which is internal or associated with the person on the one hand, and external things that might tire it on the other, Yang Zhu joins Mencius in basing his ideals on a fundamental inner/outer distinction. However, his recommendation that one keep the self and its aspects free of outside contamination, if accurate, would constitute an even more extreme form of individualism than what we have encountered with Mencius.

Like Zhuangzi, Yang Zhu (as characterized by later texts that attribute a certain, relatively consistent perspective to his beliefs) seems to have supported the preservation of some essential and vital spirit that is ultimately related to the human body and its wholeness. Unlike Zhuangzi, who wishes for individuals to transcend their own awareness of the boundaries of the self and its materiality, Yang Zhu appears to glorify the existence of these, and to call for the preservation of a strict separation between what is inside and belonging to the sphere of the self, and what is outside and belonging to the sphere of things. Thus, the main distinction between Zhuangzi and Yang Zhu lies in the fact that Yang Zhu appears to value the self as a material body that is sacred precisely because of its essential materiality and life-producing qualities. Zhuangzi, on the other hand, does not directly embrace the cult of bodily vitality. He calls for individuals to transcend their bodies and their materiality so as to embrace what he sometimes refers to as the spirit of the Dao, which should be understood as an ethereal type of vitality.

Given these descriptions of Yang Zhu’s thought, it seems fair to call him an individualist rather than an apologist for selfish egoism. After all, there is no convincing evidence that Yang Zhu promoted selfishness in the sense that he inspired individuals to seek self-profit through the exploitation of public resources or goods. Moreover, there is no clear indication that Yang Zhu tacitly condoned harming or destroying society through his ideals. Rather, most of the reliable evidence points to the fact that Yang Zhu redefined what it meant to value the self in terms of one’s personal, material-spiritual salvation. Indeed, Yang Zhu was perhaps one of the first thinkers, like Mencius, to see xing and the self as a primary source of idealized individual agency and meaning.

Individualism, as has been introduced here, was a broad orientation in early Chinese thought that posited the value and autonomy of the individual and, in some instances, located sources of idealized cosmic power and authority within the individual body. Widespread notions of self-cultivation viewed the individual as the key site of moral or spiritual transformation and, hence, the individual was the primary medium for assimilating social and cosmic authority and order. Early Chinese thinkers also presumed the moral or spiritual autonomy of the individual, granting individuals the power to effect changes in their lives and make important choices concerning morality, self-cultivation, and conformity to external sources of authority. Individualistic authors like Mencius, the Primitivist, and possibly Yang Zhu, went so far as to naturalize cosmic or divine sources of authority in the world by locating them within the human body itself. They thereby made the individual body the primary source for idealized agencies, and valued one’s cultivation of such innate agencies as the highest good.

8. References and Further Reading

  • Ames, Roger, and David L. Hall. “The Problematic of Self in Western Thought,” Thinking from the Han: Self, Truth, and Transcendence in Chinese and Western Culture. Albany: State University of New York Press, 1998.
  • Ames, Roger, Wimal Dissanayake, and Thomas Kasulis, ed. Self as Person in Asian Theory and Practice. Albany: State University of New York Press, 1994.
  • Ames, Roger, Wimal Dissanayake, and Thomas Kasulis, ed. Self as Image in Asian Theory and Practice. Albany: State University of New York Press, 1998.
  • Angle, Stephen. Human Rights in Chinese Thought: A Cross-Cultural Inquiry. Cambridge: Cambridge University Press, 2002.
  • Bauer, Joanne and Daniel Bell, eds. The East Asian Challenge for Human Rights. Cambridge: Cambridge University Press, 1999.
  • Bloom, Irene eds. “Confucian Perspectives on the Individual and the Collective,” Religious Diversity and Human Rights, Irene Bloom, J. Paul Martin, and Wayne L. Proudfoot, 114-151. New York: Columbia University Press, 1996.
  • Brindley, Erica. Individualism in Early China: Human Agency and the Self in Thought and Politics. University of Hawaii Press, 2010.
  • Chan, Joseph. “Moral Autonomy, Civil Liberties, and Confucianism,” Philosophy East and West, 52.3 (2002): 281-310.
  • Chen, Albert H. Y. “Is Confucianism Compatible with Liberal Constitutional Democracy?” Journal of Chinese Philosophy 35.2 (June 2007): 195-216.
  • Csikszentmihalyi, Mark. Material Virtue: Ethics and the Body in Early China. Leiden: Brill, 2004.
  • Emerson, John. “Yang Chu’s Discovery of the Body,” Philosophy East and West. 46.4 (1996): 533-566.
  • Graham, Angus C. Disputers of the Tao: Philosophical Argument in Ancient China. La Salle, IL: Open Court, 1989.
  • Graham, Angus C. “The Background of the Mencian Theory of Human Nature,” Tsing Hua Journal of Chinese Studies, vol. 6 (1957): pp. 215-271.
  • Greenwood, John. “Individualism and Collectivism in Moral and Social Thought,” The Moral Circle and the Self: Chinese and Western Approaches, eds., Kim-chong Chong, Sor-hoon Tan, and C. L. Ten, 163-73. La Salle, IL: Open Court, 2003.
  • Hansen, Chad. “Individualism in Chinese Thought,” Individualism and Holism: Studies in Confucian and Taoist Values, ed. Donald Munro, 35-55. Ann Arbor: Center for Chinese Studies, University of Michigan, 1985.
  • Ivanhoe, P. J. Confucian Moral Self Cultivation. New York: Peter Lang, 1993.
  • Kline, T. C. III and P. J. Ivanhoe, eds. Virtue, Nature, and Moral Agency in the Xunzi. Indianapolis: Hackett Publishing Company, Inc., 2000.
  • Kline, T. C. III. “Moral Agency and Motivation in the Xunzi,” Virtue, Nature, and Moral Agency in the Xunzi, eds. T. C. Kline III and P. J. Ivanhoe, 155-75. Indianapolis: Hackett Publishing Company, Inc., 2000.
  • Karyn Lai. “Understanding Change: The Interdependent Self in Its Environment,” New Interdisciplinary Perspectives in Chinese Philosophy, ed. Karyn Lai. Journal Supplement Series to the Journal of Chinese Philosophy (2007): 81-99.
  • Lin, Yüsheng. “The Evolution of the Pre-Confucian Meaning of Jen and the Confucian Concept of Moral Autonomy,” Monumenta Serica 31 (1974-75): 172-83.
  • Liu, Lydia H. “Translingual Practice: The Discourse of Individualism between China and the West,” Narratives of Agency: Self-making in China, India, and Japan, ed. Wimal Dissanayake. Minneapolis: University of Minnesota Press, 1996.
  • Liu, Qingping. “Filiality versus Sociality and Individuality: On Confucianism as ‘Consanguinitism,” Philosophy East and West 53.2 (2003): 234-250.
  • Liu, Xiusheng. “Mengzian Internalism,” Essays on the Moral Philosophy of Mengzi, ed. Xiusheng Liu and Philip Ivanhoe, 101-31. Indianapolis: Hackett Publishing Co., Inc., 2002.
  • Munro, Donald. The Concept of Man in Early China, reprint. Ann Arbor: Center for Chinese Studies, University of Michigan, 2001.
  • Munro, Donald, ed. Individualism and Holism: Studies in Confucian and Taoist Values. Ann Arbor: Center for Chinese Studies, University of Michigan, 1985.
  • Nylan, Michael. “Confucian Piety and Individualism in Han China,” Journal of the American Oriental Society 116.1 (1996): 1-27.
  • Rosemont, Henry, Jr. “Rights-Bearing Individuals and Role-Bearing Persons,” Rules, Rituals, and Responsibility: Essays Dedicated to Herbert Fingarette, ed., Mary Bockover, 71-101. Chicago: Open Court, 1991.
  • Rosemont, Henry, Jr. “Who Chooses?” Chinese Texts and Philosophical Contexts, ed. Henry Rosemont, Jr., 227-263. LaSalle: Open Court, 1991.
  • “Two Loci of Authority: Autonomous Individuals and Related Persons,” Confucian Cultures of Authority, ed. Peter D. Hershock and Roger T. Ames, 1-20. Albany: State University of New York Press, 2006.
  • Roth, Harold. “Psychology and Self-Cultivation in Early Taoistic Thought,” Harvard Journal of Asiatic Studies 51.2 (1991): 599-650.
  • Rubin, Vitaly A. Individual and State in Ancient China: Essays on Four Chinese Philosophers. Trans. Steven I. Levine. New York: Columbia University Press, 1976.
  • Shun, Kwong-loi. “Conception of the Person in Early Confucian Thought,” Confucian Ethics: A Comparative Study of Self, Autonomy, and Community, eds., Kwong-loi Shun and David Wong, 183-99. Cambridge: Cambridge University Press, 2004.
  • Shun, Kwong-loi and David Wong, ed. Confucian Ethics: A Comparative Study of Self, Autonomy, and Community. Cambridge: Cambridge University Press, 2004.
  • Vinograd, Richard. Boundaries of the Self: Chinese Portraits, 1600-1900. Cambridge: Cambridge University Press, 1992.
  • Wilson, Stephen A. “Conformity, Individuality, and the Nature of Virtue: A Classical Confucian Contribution to Contemporary Ethical Reflection,” Confucius and the Analects: New Essays, ed. Bryan W. Van Norden, 94-115. Oxford: Oxford University Press, 2002.
  • Yearley, Lee. “Chuang Tzu’s Cosmic Identification,” Taoist Spirituality, Vol. 10 of World Spirituality: An Encyclopedic History of the Religious Quest, edited by Tu Wei-ming, New York: Crossroads, forthcoming.
  • Yu, Ying-shih. “Individualism and the Neo-Taoist Movement in Wei-Chin China,” Individualism and Holism: Studies in Confucian and Taoist Values, ed. Donald Munro. Ann Arbor: Center for Chinese Studies, University of Michigan, 1985.
  • Zhang Qianfang. “Human Dignity in Classical Chinese Philosophy: Reinterpreting Mohism,” Journal of Chinese Philosophy 34.2 (June 2007): 239-255.

 

Author Information

Erica Brindley
Email: efb12@psu.edu
The Pennsylvania State University
U. S. A.

Supervenience and Mind

This article is an informal introduction to the concept of supervenience and the role it plays in the philosophy of mind.  It surveys some of the many ways the concept has been used to reveal the manner and degree to which mental phenomena depend on facts about our bodies and their physical features. Philosophers usually construe the supervenience relation as a relation between classes of properties, where a class of properties, F, supervenes on a class of properties, G, just in case there is no difference in F-properties without some difference in G-properties.  As David Lewis puts it, “no difference of one sort without differences of another sort” (1986, p. 14).  It is in the philosophy of mind that we find the term’s most frequent contemporary occurrence.

The goal of asking whether one set of properties supervenes on another is to better understand the ontological relation between the two sets — especially, whether the one set of properties depends entirely on the other.  Suppose, for example, that two individuals can have different moral properties while being exactly alike in terms of their actual and potential behavior; that is, suppose that one’s moral features do not supervene on one’s behavioral features.  Then we can conclude that the former depend on something more than the latter.  And if we accept this conclusion, we are then led to search for a set of properties on which our moral features do supervene, a set of properties in terms of which any two individuals must differ with any moral difference.  The goal is to isolate just that set of features on which our moral properties do wholly rely.

Suppose we succeed in identifying a set of features on which F-properties supervene (where F-properties might be moral, mental, aesthetic, economic, or any other higher-level properties).  Then we can try to discover the nature of the dependence of F-properties on the underlying G-properties (for example, behavioral, physical, neurological, or intrinsic) by asking about the manner in which the former supervene on the latter.  Is it a logical truth that a difference in F-properties requires a difference in G-properties?  Is this covariance due to the causal laws that actually obtain?  Is it a matter of metaphysical necessity?  Asking these questions about the way in which F-properties supervene may help us decide whether the dependence is, for example, a wholly analytic affair, a type of causal dependence, a matter of constitution, or a matter of genuine identity.

Table of Contents

  1. Supervenience: The Basic Idea
  2. Supervenience and Non-Reductive Physicalism
  3. Varieties of Supervenience
    1. Global and Local Versions
    2. Varieties of Necessitation
  4. Two Common Complaints with Supervenience Theses
    1. Explanatory Failure
    2. Ontological Failure
  5. Concluding Remarks
  6. References and Further Readings

1. Supervenience: The Basic Idea

Nora’s latest sculpture has many intrinsic features, including its shape, density, texture, and constituent matter.  It also has various aesthetic properties — beauty, grace, elegance, and expressive power.  No doubt, the aesthetic properties of the sculpture are in some way and to some degree a result of its intrinsic features.  But in what way exactly, and to what degree?  Thinking in terms of supervenience is a good start to finding the answer.  Imagine an artwork, x*, that is intrinsically indistinguishable from Nora’s sculpture, x — a perfect duplicate of x.  Is it possible that despite the indiscernability, x* might differ aesthetically from x?  If it is not possible for x and the intrinsically indiscernable x* to differ aesthetically, then we say that the object’s aesthetic features supervene on its intrinsic features, where a class of properties, F, supervenes on a class of properties, G, just in case a difference in F-properties requires a difference in G-properties; in other words, all the same G-properties guarantee all the same F-properties.

If we decide that the object’s aesthetic properties do supervene on its intrinsic features, then we are led to inquire whether the former are identical with the latter or whether the dependence relation is of some weaker sort — for example, causal dependence or constitution.  On the other hand, if we conclude that the object’s aesthetic properties do not supervene on its intrinsic features, that is, if x* might differ aesthetically from x despite their intrinsic similarity, then we can conclude that those aesthetic properties are at least partly a function of certain relations the object bears to external items.  We are then led to ask what the relevant external relations are.  We think in terms of supervenience again, imagining various changes in x’s environment (different origins, differences in historical context, different standards of the qualified judges, differences in popular opinion), and for each of those changes we decide whether x’s aesthetic features might also differ, until we isolate just those features of the environment on which the aesthetic properties do rely.  The conclusion would then be that the object’s aesthetic features supervene on its intrinsic properties together with those external features.

As this line of inquiry shows, the concept of supervenience is an invaluable tool for deciding whether and how one set of properties depends on another.  An analogous line of inquiry is found in discussions of mental content.  The content of one’s mental states depends largely on what the individual is like internally — on the state of the brain and the brain’s causal relations to other parts of the body, including sense organs and limbs.  But does the content of one’s mental states depend entirely on these intrinsic features?  To decide this issue, we consider whether it is possible for an indistinguishable individual, a molecule-for-molecule duplicate, to differ in terms of the content of her mental states.  And thanks to the thought experiments of Hilary Putnam (1973, 1975) and Tyler Burge (1979), it is widely thought that intrinsic duplicates can indeed differ in the content of their mental states.  Putnam has us imagine a twin-earth that is exactly like earth except that what they call “water” on twin-earth is comprised of something other than H2O molecules.  The content of your water-thoughts, it seems, differs from the content of your doppelganger’s “water”-thoughts on twin-earth — simply because of the difference in the liquid toward which those thoughts are directed.  (But note that Putnam’s example is actually designed to show a difference in linguistic content, meaning, which does not in itself entail a difference in mental content.)  Likewise, Burge shows that given suitable differences in surrounding linguistic practice, the thoughts one expresses with the word ‘arthritis’ might differ in content from those that one’s doppelganger expresses.  For example, if your duplicate inhabits a possible world in which ‘arthritis’ is regularly used to describe various conditions in addition to inflammation of the joints, then it seems that the content of the duplicate’s ‘arthritis’ thoughts will differ from yours.  Examples such as these seem to show that the content of one’s mental states does not supervene on one’s intrinsic features alone, but only on a set of features that includes features of one’s environment.

While Putnam and Burge do not use the term ‘supervenience’ in the essays mentioned above (though Burge does use it in his 1986 discussion of externalism), it is clear that the concept of “no difference of one sort without differences of another sort” is being utilized.  It was Davidson’s use of ‘supervenience’ in “Mental Events” (1970) that made the term popular in the philosophy of mind.

2. Supervenience and Non-Reductive Physicalism

While Davidson denies that there are psychophysical laws, he acknowledges (in a widely cited passage) that

mental characteristics are in some sense dependent, or supervenient, on physical characteristics.  Such supervenience might be taken to mean that there cannot be two events alike in all physical respects but differing in some mental respect, or that an object cannot alter in some mental respect without altering in some physical respect” (1970, p. 88).

Not long after the appearance of Davidson’s essay, discussions of Non-Reductive Physicalism became the major locus of supervenience-talk in the philosophy of mind and the philosophical literature in general.  Due to the popularity of functionalist accounts of mentality and the wide degree of multiple realizability they entail (that is, that the same mental property can be realized by events of different physical, chemical, and neural types), a dominant view in the philosophy of mind for the past few decades is the belief that mental properties are not identical with neural properties or any other properties of the natural sciences.  Yet, many of those who reject psychophysical property-identities also claim to support Physicalism regarding mentality, the view that all mental phenomena obtain solely by virtue of physical phenomena.  One wonders: how can mentality obtain solely by virtue of physical phenomena, as Physicalism maintains, if mental properties are not physical?  The most popular answer is that while mental properties are not identical with physical properties, they nonetheless depend on nothing other than physical properties — that is, they supervene on physical properties.

Whether a mental-physical supervenience thesis captures the content of Physicalism regarding mentality depends on how exactly the supervenience relation is to be understood.  To say that there is no mental difference without some physical difference leaves the matter rather unclear.  Much of the literature on supervenience in the philosophy of mind is devoted to adding the needed precision.  Let us consider some of the many issues that arise.

3. Varieties of Supervenience

The philosophical literature is replete with all manner of ways to describe the supervenience relation — “an unlovely proliferation,” as Lewis puts it (1986, p. 14).  Here are some of the most popular brands.

a. Global and Local Versions

John Haugeland expresses the idea that all properties supervene on physical properties as follows: “[t]he world could not have been different in any respect, without having been different in some strictly physical respect” (1984, p. 1).  This is a global supervenience thesis, claiming that nonphysical difference entails physical difference at the level of possible worlds as a whole (where a possible world is a way the world could have been, which includes the actual world, the way the world actually is).  Applied to mental and physical properties, Haugeland’s global supervenience claim is that the total state of any possible world could not have been any different mentally without differing physically.  That is,

(GS)  for any possible worlds, w1 and w2, if w1 and w2 differ mentally, then w1 and w2 differ physically.

Equivalently, if w1 and w2 are exactly the same physically, then they are exactly the same mentally.

While GS goes a long way toward capturing the physicalist belief that the mental depends entirely on the physical, it is arguable that it does not go far enough.  Kim (1987, p. 321) has us imagine that some possible world b differs physically from the actual world in only the following respect: in b one of Saturn’s rings contains an additional ammonia molecule.  GS requires mental similarity only between worlds that are physically indistinguishable as a whole.  So despite the slight and innocuous physical difference between b and the actual world, GS allows that b mentally differs from the actual world as greatly as you please, perhaps being completely devoid of mentality.  This result clearly goes against the physicalist intuition that mental phenomena obtain solely by virtue of physical phenomena.  For in the case Kim describes, all of the physical features that are relevant to mentality remain constant.  So it seems that if b differs mentally from the actual world, as GS allows, then whatever mental differences there are would have to be due to something other than physical differences.  (Note that several authors identify and formulate different types of global supervenience; for a thorough description of varieties of global supervenience, see McLaughlin and Bennett, 2006.  These complexities are ignored in this introductory survey; here the concern is with global supervenience in general.)

To come closer to capturing the content of Physicalism regarding the mind, a supervenience thesis needs to require not only that worlds differing mentally differ physically, but also that individual objects and events cannot differ mentally without differing physically.  Consider, then, the following local supervenience thesis:

(LS)  for any worlds, w1 and w2, and any individuals, x in w1 and y in w2, if x in w1 differs mentally from y in w2, then x in w1 differs physically from y in w2.

Or one might choose what Kim (1984) calls “strong” supervenience.  To say that mental properties strongly supervene on physical properties is to say that

(SS) necessarily, for any mental property M, and any individual x that has M, there is a physical property (or set of physical properties) P, such that x has P, and necessarily, for any individual y, if y has P, then y has M.

The difference between LS and SS is that LS allows the possibility that some entities have mental properties without having any physical properties.  (Yet, SS and LS are equivalent when the base set of physical properties is closed under negation, that is, when for every physical property P in the base set, its negation, ~P, is also included.)

Now suppose that Carla and Marla inhabit worlds that are physically indistinguishable in every respect except for the extra ammonia molecule in a ring of Saturn.  While the worlds themselves differ physically, Carla and Marla do not.  So unlike GS, LS and SS entail that Carla and Marla are mentally indistinguishable.  So it seems that LS and SS better capture the physicalist intuition that mental facts obtain solely in virtue of physical facts.  (However, there is some debate over whether the global thesis might actually be equivalent to SS given the right closure principles.  See, for example, Petrie, 1987, Kim, 1987, Paull & Sider, 1992, and Stalnaker, 1996.)

Note that SS differs from Kim’s formulation of what he calls “weak” supervenience (WS), which lacks the second occurrence of the word ‘necessarily.’

(WS) necessarily, for any mental property M, and any individual x that has M, there is a physical property (or set of physical properties) P, such that x has P, and for any individual y, if y has P, then y has M.

Unlike SS and LS, WS requires only that physical duplicates are mental duplicates within possible worlds, thereby allowing that physical duplicates differ mentally across possible worlds.  So unlike SS and LS, WS allows that you could have had different mental properties from those you actually have without differing in any physical way.  But if you could have had different mental properties without differing physically in any way, then the mental facts about you do not depend solely on the way you are physically, which seems to be contrary to Physicalism.  Thus, SS and LS are preferable to WS.

As indicated above, by not allowing that Carla and Marla differ mentally, LS and SS might seem preferable to GS.  However, one might think that given externalist intuitions, GS is more desirable.  If externalism regarding mental content is correct, then individuals that are the same in terms of their intrinsic physical properties are not guaranteed to be mentally the same.  GS honors externalist intuitions by requiring sameness only at the level of whole worlds, thereby allowing individuals with all the same intrinsic physical properties to differ mentally.  However, Kim (1987, pp. 322-4) points out that the strong supervenience thesis, SS, can also allow the truth of externalism simply by including extrinsic properties of individuals in the supervenience base, properties such as being causally related to stuff comprised of H2O.  If the underlying physical properties, the subvening properties, include relations to external items, then LS can also accommodate externalist intuitions.  (Although, we cannot let just any extrinsic property into the supervenience base.  If we allow the extrinsic feature of occupying a world with such-and-such complete physical profile, then the strong and local supervenience theses automatically collapse into the global version.  For in that case, having all the same mental properties is not required unless the individuals occupy worlds that are physically indistinguishable in every respect.  Vera Hoffmann and Albert Newen, 2007, develop the idea of property-dependent supervenience as a way to include only those extrinsic properties that are relevant to the instantiation of a higher-level property.)

We saw that one worry about GS is that it allows great mental differences to be accompanied by what seem to be wholly irrelevant physical differences — allowing, for example, that Carla and Marla differ mentally despite their physical similarity, simply because in Marla’s world there is one more ammonia molecule in one of the rings of Saturn.  LS and SS prevent this possibility but they still fall short of fully capturing physicalist intuitions since differences irrelevant to one’s mentality are to be found not only in distant regions of space.  Suppose that Carla and Marla live in physically indistinguishable environments and their bodies are physically indistinguishable in all but the following respect: Marla has an additional electron in one of her toenails.  Since this cuticle difference seems to be completely irrelevant to mentality, we would expect that Carla and Marla are mentally indistinguishable.  However, since they are not physically indistinguishable, LS and SS allow vast mental differences between the two.  This result seems contrary to the spirit of Physicalism since Carla and Marla are physically indistinguishable in all the ways that are relevant to mentality.

The move from GS to LS and SS is an attempt to isolate those physical properties that are relevant to the exemplification of a mental property.  For a more successful attempt, one might appeal to Terence Horgan’s regional supervenience thesis.  With his notion of a P-region, a spatio-temporal region of a physically possible world, Horgan expresses his regional supervenience thesis as: “There are no two P-regions that are exactly alike in all qualitative intrinsic physical features but different in some other qualitative intrinsic feature” (1993a, p. 571).  (Also see Horgan 1982, p. 37).  Consider the region of space occupied by Marla’s whole body minus the toenail with the extra electron and the region of space occupied by the corresponding part of Carla’s body.  Since these two regions are physically indistinguishable, the regional supervenience thesis yields the correct result that Carla-minus (Carla minus the toenail) does not differ mentally from Marla-minus.  This result together with the fact that there is no mentality supervening on the physical features of the toenail itself yields that further desired conclusion that Marla as a whole is mentally the same as Carla as a whole.  (By the way, those mental and other higher-order states that require an externalist individuation will supervene on the intrinsic features of an expanded region of space, which Horgan points out, is “a region large enough to encompass, as intrinsic features, all the contextually relevant facts”: 1982, p. 39.)

Another way to isolate just those physical features that are relevant to mentality is to utilize Kim’s notion of a B-minimal property.  A property is B-minimal when “any property weaker than it is not a supervenience base” (1984, p. 165).  If a physical property, P, is B-minimal with respect to mental property M, then P is a least sufficient condition for M — that is, P’s instantiation guarantees M’s instantiation, and there is no proper constituent of P an instance of which guarantees an instance of M.  The properties that are B-minimal with respect to M do not include the number of electrons in one’s toenails.  Thus, with only B-minimal properties in the supervenience base, Carla and Marla have all the same subvening properties despite their cuticle difference, which guarantees that they have all the same mental properties.  Suppose, then, that we strengthen LS and SS to read:

(LS*) for any mental property M, there is a physical property P that is B-minimal with respect to M, such that for any possible worlds, w1 and w2, and any individuals x in w1 and y in w2, if x in w1 differs from y in w2 in terms of M, then x in w1 differs from y in w2 in terms of P

and

(SS*) necessarily, for any mental property M, and any individual x that has M, there is a physical property (or set of physical properties) P, such that (i) x has P, (ii) P is B-minimal with respect to M, and (ii) necessarily, for any individual y, if y has P, then y has M,

respectively.  Since no physical property that is B-minimal property with respect to M includes having an extra electron in a toenail, neither LS* nor SS* allows that Carla and Marla differ mentally, which is just as Physicalism seems to require given that they are physically indistinguishable in all respects relevant to mentality.

Jeffrey Poland’s (1994) formulation of Physicalism further narrows the class of relevant base properties by adding the notion of a B-minimal property to a regional supervenience thesis.  Poland worries that a physical property might qualify as a B-minimal supervenience base with respect to some higher-level (for example, mental) property without being the property by virtue of which the higher-level property is instantiated.  To remedy this defect Poland proposes that “[f]or each non-physical attribute, N, and for each region of space-time, R, if N is actually (or possibly) instantiated in R, then there exists a minimal class of physically-based attributes, P, such that the instantiation of the members of P does (or would) provide a realization of N on that occasion” (p. 191).

Apart from deciding which of these supervenience theses to endorse, there is another issue that needs to be addressed.  In formulating a supervenience thesis adequate for understanding the ontological relation between two sets of properties, we need to decide which brand of modality is to figure in our formulation?  How should the necessity operators in WS, SS, and SS* be interpreted, and what is the range of the possible worlds mentioned in GS, LS, and LS*?

b. Varieties of Necessitation

Suppose it is true that whenever there is a mental difference there is a physical difference.  It would seem that this is not merely an accidental fact, not merely a matter of the way things happen to be.  Rather it seems that the physical facts that obtain in some way necessitate the mental facts that obtain:

(N)  facts about the distribution of physical properties necessitate all the facts about the distribution of mental properties.

But what brand of necessity is at issue here?  Answering this modal question is essential to understanding how the mental depends on the physical.  The answer is also crucial to deciding whether a supervenience thesis can adequately capture all that Physicalism regarding mentality entails.

The physical facts clearly do not logically necessitate the mental facts.  The laws of logic alone do not allow us to derive the latter from the former.  However, one might wonder whether the laws of logic together with the meanings of physical and mental terms allow us to derive all true mentalistic sentences from sentences expressed in physical vocabulary.  That is, one might wonder whether

(NL-C)  facts about the distribution of physical properties logically/conceptually necessitate all the facts about the distribution of mental properties.

Yet, NL-C seems implausible on any reasonable interpretation of ‘physical.’  If we restrict the term ‘physical’ to properties described by the science of physics itself, then the obvious difference in meaning between mental vocabulary and the vocabulary of physics seems to show that NL-C is false.  Even if the word ‘physical’ were used loosely enough to include brain properties, and even if mental properties were identical with brain properties, NL-C would still be highly dubious.  Since mental talk is not synonymous with neural talk, mind-brain identity theorists rightly held their view, not as an analytic truth, but as a significant empirical hypothesis.  (See, for example, J.J.C. Smart, 1959 and U.T. Place, 1956.)  Suppose ‘physical’ is used liberally enough to include overt behavioral responses to environmental input — for example, bringing an umbrella when confronted with rain, grimacing and groaning as the punching continues, and responding appropriately in French to questions posed in French.  With this liberal use of ‘physical’ NL-C would be accepted by logical behaviorists.  Of course, this is no credit to NL-C given the well-known problems with logical behaviorism.  Although, one might think that with such a generous use of ‘physical,’ analytic functionalism (which is not as implausible as logical behaviorism) is committed to NL-C as well.  But that is not so.  Functionalists characterize mental properties in terms of their relations to environmental input, behavioral output, and other mental properties.  For instance, the belief that it is raining will be characterized in terms of an inclination to bring an umbrella if one wishes to keep dry.  Given that mental predicates are to be interdefined, even with a liberal definition of ‘physical’ the analytic functionalist would seem compelled to deny that a purely physical description logically/conceptually guarantees the truth of any mental description.

Not only does NL-C seem to be false, it also appears to be much more than what Physicalism requires.  It might be that mental phenomena depend entirely on physical phenomena (as physicalists regarding mentality contend) without there being a logical or conceptual tie between the two.  Since predicates that differ in meaning might denote the very same property, it seems that even if mental properties were identical with physical properties, it would not follow that facts about the latter logically or conceptually guarantee facts about the former.  (However, Robert Kirk claims otherwise.  Imagine the complete set, P, of facts about the world expressed in the vocabulary of an ideal physics, and suppose that this set includes all of the physical laws that obtain.  According to Kirk, Physicalism requires that the relation between P and the set of mental facts is one of strict implication, where “statement A strictly implies a statement B just in case ‘A and not-B’ involves inconsistency of a broadly logical or conceptual kind,” 2006, p. 525.  There are also the arguments of Jackson (1994 & 1998) and Chalmers’ (1996) to consider, arguments purporting to show that much more follows a priori from the physical facts than what we might be inclined to think.  Although, for Chalmers and Jackson, these do not include facts about the qualitative character of conscious experience.)

The fact that mental differences require physical differences would seem to have something, perhaps everything, to do with the laws of nature that actually obtain.  It is arguable that the laws of nature that actually obtain are what make it the case that mental sameness guarantees physical sameness.  Consider, then, the proposal that:

(NN)  facts about the distribution of physical properties nomologically necessitate all the facts about the distribution of mental properties.

However, while it is reasonable to think that the supervenience of the mental on the physical is a matter of the laws of nature that actually obtain, NN is too weak to do the work a physicalist would want a supervenience thesis to do.  The physicalist who appeals to supervenience is trying to honor the fact that the mental depends entirely on the physical without being committed to the view that mental properties are identical with physical properties.  Suppose, as is widely believed, that mental properties are not identical with physical properties.  Then the psychophysical laws by virtue of which the physical facts fix the mental facts are not purely physical laws.  In that case, NN allows that there is a possible world that is physically indistinguishable from the actual world, including all the same purely physical laws, but with a very different distribution of mental properties.  As Crane puts it, “if fixing the mental facts requires psychophysical laws, then fixing the physical facts alone is not sufficient to fix the mental facts” (1991, pp. 237-8).

To support our physicalist intuitions, it seems that we need a supervenience relation that is committed to the following necessitation claim:

(NP)  facts about the distribution of physical properties physically necessitate all the facts about the distribution of mental properties,

where p physically necessitates q just in case ‘p and not q’ is not true in any possible world with all the same physical laws as those that actually obtain.  If the physical laws did allow worlds with the same distribution of physical properties but a different distribution of mental properties, then there would be a clear sense in which the mental facts are at least partly due to something other than physical facts.  So it is arguable that our physicalist intuitions do entail just what NP states. Indeed, Np does capture a popular way that the necessity involved in supervenience theses applied to mentality are in fact understood.  (See, for example, Papineau’s formulation of Physicalism: 1993, p. 21.)

Yet, there are some worries about NP.  Lewis, Horgan, and Jackson address what is called the Problem of Extras.  Lewis claims that “Materialism is meant to be a contingent thesis, a merit of our world that not all other worlds share” (1983, p. 362).  Even if all concrete phenomena obtain solely by virtue of physical phenomena, as Physicalism maintains, it seems that things did not have to be that way.  The world could have been such that angels, ghosts, or other immaterial beings reside alongside the physical items that actually exist.  It is also arguable that Physicalism allows non-actual possible worlds where all the actual physical laws obtain but with immaterial extras — provided that these extras do not causally interfere with the physical world.  Horgan imagines that “[i]n such worlds the spirits would not interfere with the ordinary operations of physical laws upon physical substances; they would simply co-exist with the physical” (1982, p. 35).  The concern, then, is that a possible world with the same distribution of physical properties that actually obtains and all the same physical laws might nonetheless differ from the actual world by having immaterial mental extras.  If such a possible world is consistent with the truth of Physicalism, as Lewis and Horgan think, then the truth of Physicalism does not require that physical sameness (including physical laws) guarantees mental sameness.  And if so, then NP is not necessary for the truth of Physicalism.

One way to solve the Problem of Extras is to identify some restricted class of physically possible worlds and then characterize Physicalism as the view that physical sameness entails mental (and other higher-order) sameness in that restricted class of worlds.  For example, Frank Jackson (1998) appeals to the notion of a minimal physical duplicate, where “a minimal physical duplicate of our world is a world that (a) is exactly like our world in every physical respect (instantiated property for instantiated property, law for law, relation for relation), and (b) contains nothing else in the sense of nothing more by way of kinds or particulars than it must to satisfy (a)” (p. 13).  With this notion of a minimal physical duplicate, Jackson proposes that we characterize Physicalism in general as the view that “[a]ny world which is a minimal physical duplicate of our world is a duplicate simpliciter of our world” (p. 12).  That is, if you physically duplicate the actual world and stop right there, then what results does not differ mentally or in any other way from the actual world.  To restrict the class of physically possible worlds where physical sameness entails mental sameness, Lewis (1983) relies on the notion of a property’s being “alien” to a world, where “a property is alien to a world iff (1) it is not instantiated by any inhabitant of that world, and (2) it is not analysable as a conjunction of, or as a structural property constructed out of, natural properties all of which are instantiated by inhabitants of that world” (p. 364).  With the notion of an alien property, Lewis offers the following supervenience claim: “[a]mong worlds where no natural properties alien to our world are instantiated, no two differ without differing physically; any two such worlds that are exactly alike physically are duplicates” (p. 364).  If we think that a physically possible world with the distribution of physical properties that actually obtains might nonetheless contain immaterial mental extras, then we might restrict NP with the help of either Lewis’ proposal or Jackson’s (or Horgan’s proposal, which appeals to the notion of a physically accessible world, a P-world, 1982, p. 36-7).

Another potential concern about NP is expressed by Francescotti (2000, 1998).  Suppose, as many believe, that mental properties are not identical with physical properties.  Then the purely physical laws do not range over mental properties.  In that case, Francescotti argues, the physical properties instantiated would determine which mental properties are instantiated only given the truth of irreducibly psychophysical laws.  But if so, it is arguable that the truth of Np requires that mental properties are identical with physical properties (which seems to be a threat to Non-Reductive Physicalism assuming that Physicalism requires the physical necessitation of mental facts).

The search for the exact way in which the physical facts necessitate the mental facts continues.  There are also the remaining issues described in section 3-a — whether to prefer local to global supervenience, and whether and how to make the thesis super-local to capture only those physical properties that are relevant to mentality.  There is also the issue of whether the supervenience base should be restricted to qualitative properties or whether non-qualitative, impure properties should also be included — properties such as containing a neural event that is numerically identical with neural event x.  (Recall that Horgan’s regional supervenience thesis is restricted to qualitative properties.)  There is also the question of whether it is acceptable for a supervenience thesis to be restricted to same-subject necessitation, where the base properties are restricted to properties of the very same object that bears the supervening properties.  Should we prefer multiple-domain supervenience (where the supervening properties are exemplified by a domain of items that differs from the domain of the subvening properties) — allowing, for example, that the base properties are properties of constituents of the individuals with the supervening properties?  While supervenience theorists have been dealing for years with these technical matters and many others regarding the details of the supervenience relation, some general objections have been raised to the whole project of trying to describe the dependence of the mental on the physical in terms of supervenience.  Let’s consider two common complaints.

4. Two Common Complaints with Supervenience Theses

Kim (1993) tells us that a supervenience thesis “itself says nothing about the nature of the dependence involved; it tells us neither what kind of dependency it is, nor how the dependency grounds or explains the property covariation” (pp. 165-6).  So a supervenience thesis leaves us wondering:

Is it a matter of causal dependence?  Is it in some way analogous to mereological supervenience?  Is it after all a matter of meaning dependence, as logical behaviorists and some functionalists claim?  Perhaps, a matter of divine intervention or plan as Malebranche and Leibniz thought?   Or a brute and in principle unexplainable relationship which we must accept “with natural piety,” as some emergentists used to insist?  (p. 167)

There are two distinct worries expressed here — the failure of a supervenience thesis to explain mental phenomena and its failure to ontologically ground mental phenomena.

a. Explanatory Failure

If the mental supervenes on the physical as a matter of logical-conceptual necessity, then there would be an easy explanation (in terms of the laws of logic and meanings of our mental and physical terms) for why physical sameness guarantees mental sameness.  Yet, if NL-C is to be rejected, as it seems it should, then the worry is that the dependence of the mental on the physical is left unexplained.  One might appeal to psychophysical laws as the explanation — P guarantees M because it is a law that P → N.  But this does not explain why the psychophysical law P → N obtains.  The complaint that an account expressed solely in terms of supervenience commits us to unexplained psychophysical laws has been raised by many.  See, for example, Gardner (2005, p. 201-2), Kim (1989, sec. IV, 1993, pp. 165-9), Heil, (1998), Horgan (1993a, sec. 8, 1993b, sec. 5), and Moreland (1998, pp. 50-1).  A physicalist might allow that there are brute laws, provided that these are purely physical.  But if a law is not purely physical, if it is irreducibly psychophysical, then as a physicalist one would expect an explanation in terms of purely physical laws of why that psychophysical law obtains, and this explanation is what a mere supervenience claim does not provide.  As a result, a physicalist might be compelled to find some suitable brand of what Horgan calls “superdupervenience” — “ontological supervenience that is robustly explainable in a materialistically acceptable way” (1993, p. 577).

Regarding the explanatory concern, two points are worth mentioning.  First, if we are offering an ontological account of mentality, then it is not clear that a “materialistically acceptable” explanation is required.  According to Physicalism as an ontological doctrine, mental (and other higher-level) phenomena obtain solely by virtue of physical phenomena.  However, it might be that mental phenomena are entirely due to physical phenomena in a way that is inexplicable to us.  So if our concern is with the ontological status of mentality, then it is arguably not a fault of a supervenience-based account that it falls short of superdupervenience.  Secondly, it seems that a supervenience thesis by itself can take us some way, perhaps a long way, toward understanding the nature of the psychophysical dependence.  Recall that an adequate formulation of a supervenience thesis will need to make clear what brand of necessity is involved.  Suppose that mental properties supervene on physical properties with physical necessity.  Then the purely physical laws that actually obtain provide the explanation of why changes in mental properties require changes in physical properties.  Suppose that mental properties supervene on physical properties with nomological necessity only.  Then the explanation of why the mental supervenes on the physical would appeal to irreducibly psychophysical laws.  This explanation might not (and should not) satisfy a physicalist, but it is an explanation nonetheless, and perhaps even the correct explanation.  Or maybe the physical facts fix the mental facts with a type of necessity that lies between the logical/conceptual variety and the physical brand — with the metaphysical necessity that some philosophers discuss.  Then we might decide that the mental supervenes on the physical, not because of the physical laws that actually obtain, but due to some other type of relation, perhaps identity or maybe some mereological (part-whole) relation.  It seems, then, that thinking about the modality involved in a supervenience thesis can profitably guide our thoughts about why mental properties covary with physical properties in the way that they do.

There is another reason, and perhaps a better reason than explanatory failure, to be dissatisfied with a mere supervenience thesis.

b. Ontological Failure

Since a mental-physical supervenience thesis tells us only how mental properties covary with physical properties, it is compatible not only with property dualism (as the non-reductive physicalist would hope) but also with substance dualism.  Even if substance dualism were true, it might be that immaterial minds are causally connected or otherwise related to physical bodies in such a way that any variation in the properties of these immaterial minds occurs only with a variation in physical properties of the body.  Physicalism certainly does not allow that substance dualism is true.  So, since a supervenience thesis is compatible with substance dualism, it does not fully capture our physicalist intuitions.

However, there is still hope for an adequate formulation of Physicalism regarding mentality that is largely supervenience-based.  To avoid substance dualism, we simply need to conjoin a supervenience thesis with some constraint on the composition of mental items.  For example, as Geoffrey Hellman and Frank Thompson (1975) state with their Principle of Physical Exhaustion, “everything concrete is exhausted by basic physical objects” (p. 555).  Likewise, Phillip Pettit (1993) insists that “[e]verything in the empirical world is composed in some way — composed without remainder — out of (subatomic) entities of the kind that microphysics posits, or it is itself uncomposed and microphysical” (p. 215).  Applied to mentality, the claim is: all mental particulars (all instances and bearers of mental properties) are ultimately comprised entirely of physical particulars.

Or we might wish to combine a supervenience thesis with a realization claim.  The claim that mental properties are realized physically is popular among those who support Physicalism while denying psychophysical identities.  The notion of physical realization adds to the Principle of Physical Exhaustion the idea that mental properties are functional properties that are exemplified by virtue of instances of physical properties playing the definitive functional roles.  So the idea that mental properties are realized physically would also seem to rule out immaterial mental items.  (Indeed, Andrew Melynk, for example, 1996, 2003, and 2006, argues that a realization thesis itself, suitably refined, is enough to capture the content of Physicalism even without an independent supervenience claim.)

5. Concluding Remarks

It is widely agreed that the notion of supervenience cannot by itself fully explain the dependence of the mental on the physical.  Yet, we must not underestimate the value of the concept to understanding how mental properties relate to the physical properties on which they depend.  Whether or not we actually use the term ‘supervenience,’ our first step in deciding whether F-properties depend solely on G-properties is to decide whether a difference in the former requires a difference in the latter.  On the basis of various thought-experiments, we might conclude that (i) F-properties do not supervene on G-properties, that is, that the former are not entirely a function of the latter (as many conclude regarding the relation between mental content and intrinsic bodily features).  While (i) is a negative result, it is quite useful insofar as it directs our attention to features, perhaps previously overlooked, on which F-properties do depend (for example, surrounding linguistic practice or the internal structure of the external objects of our thought).  Suppose, on the other hand, we conclude that F-properties do supervene on G-properties.  Establishing this conclusion is an essential step to concluding further that (ii) F-properties are identical with G-properties (since any set of properties trivially supervenes on itself).  Or instead of (ii), we might conclude that (iii) F-properties depend entirely on but are not identical with G-properties.  Whichever of (i)-(iii) we choose, that choice inevitably involves deciding whether supervenience obtains.

Regarding the mental and the physical, we might decide either that (ii*) mental properties are physical properties or that (iii*) while non-physical, mental properties nonetheless depend entirely on physical properties.  Being a physicalist regarding mentality seems to require accepting at least (iii*).  And since both (ii*) and (iii*) entail that the mental supervenes on the physical, it seems Kim is right to note that “mind-body supervenience represents the minimal physicalist commitment” (1993, p. 168).  A supervenience thesis, however, is not merely a minimal physicalist commitment, for it need not leave the mental-physical covariance wholly unexplained.  Suppose we accept (ii*).  Then we have a simple explanation of the covariance: changes in the mental require changes in the physical simply because the mental is the physical.  On the other hand, if we accept (iii*), then as noted in section 3-a, thinking about the brand of necessity involved in our supervenience thesis (conceptual, metaphysical, physical, or nomological) can take us some way, perhaps a long way, toward figuring out what type of dependence obtains.  And since even making a decision regarding (i)-(iii), or (ii*)-(iii*), requires engaging in thoughts about supervenience, the concept is not only helpful, but indispensible to understanding the relation between the two sets of properties.  So whether or not the term ‘supervenience’ remains in vogue, the very notion will remain a crucial and potentially highly useful part of our inquiry into the relation between the mental and the physical.  As Dean Rickles, another writer for this encyclopedia puts it, “supervenience deserves the central place that is has found in the philosophers’ toolbox” (2006).

6. References and Further Readings

  • Beckermann, A., H. Flohr, and J. Kim (1992).  Emergence or Reduction? Essays on the Prospects of Nonreductive Physicalism (Berlin: Walter de Gruyter).
  • Bennett, K. (2004). “Global Supervenience and Dependence,” Philosophy and Phenomenological Research 68: 510-529.
  • Burge, T. (1979).  “Individualism and the Mental,” Midwest Studies in Philosophy 4: 73-121.
  • Burge, T. (1986).  “Individualism and Psychology,” The Philosophical Review 95: 3-45.
  • Chalmers, D. (1996).  The Conscious Mind (New York: Oxford University Press).
  • Chalmers, D. and F. Jackson (2001).  “Conceptual Analysis and Reductive Explanation,” Philosophical Review 110: 15-60.
  • Crane, T. (1991).  “All God Has To Do,” Analysis 51: 235-244.
  • Davidson, D. (1970).  “Mental Events,” in L. Foster and J. W. Swanson (eds.), Experience and Theory (Amherst, MA: University of Massachusetts Press): 79-101.
  • Francescotti, R. (2000).  “Ontological Physicalism and Property Pluralism: Why They are Incompatible,” Pacific Philosophical Quarterly 81: 349-362.
  • Francescotti (1998).  “The Non-Reductionist’s Troubles with Supervenience,” Philosophical Studies 89: 105-124.
  • Gardner, T. (2005).  “Supervenience Physicalism: Meeting the Demands of Determination and Explanation,” Philosophical Papers 34: 189-208.
  • Hare, R.M. (1952).  The Language of Morals (Oxford: Oxford University Press).
  • Haugeland, J. (1984). “Ontological Supervenience,” The Southern Journal of Philosophy, Supplement 22: 1-12.
  • Heil, J. (1998).  “Supervenience Deconstructed,” European Journal of Philosophy 6: 146-155.
  • Hellman, G. and F. W. Thompson, (1975). “Physicalism: Ontology, Determination, and Reduction,” The Journal of Philosophy 72: 551-564.
  • Hoffmann-Kolss, V. (2010).  The Metaphysics of Extrinsic Properties (Frankfurt: Ontos Verlag).
  • Hoffmann, V. and A. Newen (2007).  “Supervenience of Extrinsic Properties,” Erkenntnis 67: 305-319.
  • Hofweber, T. (2005).  “Supervenience and Object-Dependent Properties,” The Journal of Philosophy 102: 5-32.
  • Horgan, T. (1993a).  “From Supervenience to Superdupervenience: Meeting the Demands of a Material World,” Mind 102: 555-586.
  • Horgan, T. (1993b).  “Nonreductive Materialism and the Explanatory Autonomy of Psychology,” in S. J. Wagner and R. Warner (eds.), Naturalism (Notre Dame: University of Notre Dame Press): 295-320.
  • Horgan, T. (ed.) (1984). Southern Journal of Philosophy 22: The Spindel Conference 1983 Supplement, Supervenience.
  • Horgan, T. (1982).  “Supervenience and Microphysics,” Pacific Philosophical Quarterly 63: 29-43.
  • Howell, R. J. (2009).  “Emergentism and Supervenience Physicalism,” Australasian Journal of Philosophy 87: 83-98.
  • Jackson, F. (1998).  From Metaphysics to Ethics: A Defence of Conceptual Analysis (Oxford: Clarendon Press).
  • Jackson, F. (1994).  “Armchair Metaphysics,” in M. Michael and J. O’Leary-Hawthorne (eds.), Philosophy in Mind, (Dordrecht: Kluwer): 23-42.
  • Kim, J. (1993).  Supervenience and Mind: Selected Philosophical Essays (Cambridge: Cambridge University Press).
  • Kim, J. (1990).  “Supervenience as a Philosophical Concept,” Metaphilosophy 21: 1-27.
  • Kim, J. (1989).  “The Myth of Nonreductive Physicalism,” Proceedings and Addresses of the American Philosophical Association 63: 31-47.
  • Kim, J. (1987).  “‘Strong’ and ‘Global’ Supervenience Revisited,” Philosophy and Phenomenological Research 48: 315-326.
  • Kim, J. (1984).  “Concepts of Supervenience,” Philosophy and Phenomenological Research 45: 153-176.
  • Kirk, R. (2006).  “Physicalism and Strict Implication,” Synthese 151: 523-536.
  • Kirk, R. (1996).  “Strict Implication, Supervenience, and Physicalism,” Australasian Journal of Philosophy 74: 244-256.
  • Lewis, D. (1986).  The Plurality of Worlds (Oxford: Blackwell).
  • Lewis, D. (1983).  “New Work for a Theory of Universals,” Australasian Journal of Philosophy 61: 343-377.
  • McLaughlin, B. and K. Bennett (2005).  “Supervenience,” in E. N. Zalta (ed.), The Stanford Encyclopedia of Philosophy.
  • McLaughlin, B. (1995).  “Varieties of Supervenience,” in E. Savellos and U. D. Yakrin (eds.), Supervenience: New Essays (Cambridge: Cambridge University Press): 16-59.
  • McLaughlin, B. (1992).  “The Rise and Fall of British Emergentism,” in A. Beckermann, H. Flohr, and J. Kim (eds.), Emergence or Reduction? Essays on the Prospects of Nonreductive Physicalism (Berlin: De Gruyter): 49-93.
  • Melnyk, A. (2006).  “Realization and the Formulation of Physicalism,” Philosophical Studies 131: 127-155.
  • Melnyk, A. (2003).  A Physicalist Manifesto (Cambridge: Cambridge University Press).
  • Melnyk, A. (1996).  “Formulating Physicalism: Two Suggestions,” Synthese 105: 381-407.
  • Moore, G. E. (1922).  Philosophical Studies (London: Routledge).
  • Moreland, J. P. (1998). “Should a Naturalist be a Supervenient Physicalist?,” Metaphilosophy 29: 35-57.
  • Moser, P. (1992).  “Physicalism and Global Supervenience,” The Southern Journal of Philosophy 30: 71-82.
  • Papineau, D. (1993).  Philosophical Naturalism (Oxford: Blackwell).
  • Paull, C. and T. Sider (1992).  “In Defense of Global Supervenience,” Philosophy and Phenomenological Research 32: 830-45.
  • Petrie, B. (1987).  “Global Supervenience and Reduction,” Philosophy and Phenomenological Research 48:119-30.
  • Pettit, P. (1993).  “A Definition of Physicalism,” Analysis 53: 213-223.
  • Place, U. T. (1956).  “Is Consciousness a Brain Process?,” The British Journal of Psychology 47: 44-50.
  • Poland, J. (1994).  Physicalism: The Philosophical Foundations (Oxford: Oxford University Press).
  • Putnam, H. (1975). “The Meaning of Meaning,’ in K. Gunderson (ed.), Language, Mind, and Knowledge, Minnesota Studies in the Philosophy of Science 7:131-193.
  • Putnam, H. (1973).  “Meaning and Reference,” The Journal of Philosophy 70: 699-711.
  • Rickles, D. (2006).  “Supervenience and Determination,” in J. Fieser and B. Dowden (eds.), Internet Encyclopedia of Philosophy.
  • Shoemaker, S.  (1998).  “Causal and Metaphysical Necessity,” Pacific Philosophical Quarterly 79: 59-77.
  • Shoemaker, S.  (1980).  “Causality and Properties,” in P. van Inwagen (ed.), Time and Cause (Dordrecht: D. Reidel): 109-135.
  • Shrader, W. (2008).  “On the Relevance of Supervenience Theses to Physicalism,” Acta Analytica 23: 257-271.
  • Smart, J. J. C. (1959).  “Sensation and Brain Processes,” The Philosophical Review 68: 141-156.
  • Stalnaker, R. (1996). “Varieties of Supervenience,” Philosophical Perspectives 10: 221-41.
  • Steward, H.  (1996).  “Papineau’s Physicalism,” Philosophy and Phenomenological Research 56: 667-672.
  • Van Cleve, J. (1990).  “Mind-Dust or Magic?: Panpsychism versus Emergence,” Philosophical Perspectives 4: 215-226.
  • Witmer, D. G. (1999).  “Supervenience Physicalism and the Problem of Extras,” The Southern Journal of Philosophy 37: 315-331.
  • Wilson, J.  (2005).  “Supervenience-based Formulations of Physicalism,” Nous 39: 426-459.
  • Yoshimi, J. (2007).  “Supervenience, Determination, and Dependence,” Pacific Philosophical Quarterly 88: 114-133.

 

Author Information

Robert Francescotti
Email: rfrances@mail.sdsu.edu
San Diego State University
U. S. A.

Aquinas: Metaphysics

aquinasMetaphysics is taken by Thomas Aquinas to be the study of being qua being, that is, a study of the most fundamental aspects of being that constitute a being and without which it could not be. Aquinas’s metaphysical thought follows a modified but general Aristotelian view. Primarily, for Aquinas, a thing cannot be unless it possesses an act of being, and the thing that possesses an act of being is thereby rendered an essence/existence composite. If an essence has an act of being, the act of being is limited by that essence whose act it is. The essence in itself is the definition of a thing; and the paradigm instances of essence/existence composites are material substances (though not all substances are material for Aquinas; for example, God is not). A material substance (say, a cat or a tree) is a composite of matter and form, and it is this composite of matter and form that is primarily said to exist. In other words, the matter/form composite is predicated neither of, nor in, anything else and is the primary referent of being; all other things are said of it. The details of this very rich metaphysical landscape are described below.

Table of Contents

  1. The Nature of Metaphysics
  2. Essence and Existence
  3. Participation
  4. Substance and Accident
  5. Matter and Form
  6. References and Further Reading
    1. Primary Sources
    2. General Studies

1. The Nature of Metaphysics

Saint Thomas, that is, Aquinas, clarifies the nature of metaphysics through ascertaining its particular subject-matter, its field of investigation. In order to ascertain the subject-matter of any particular science, Thomas distinguishes between the different intellectual operations that we use when engaged in some particular scientific endeavor. Broadly speaking, these fall into two categories: the speculative and the practical. Concerning some sciences, the intellect is merely speculative  by contemplating the truth of some particular subject-matter; while concerning other sciences, the intellect is practical, by  ascertaining the truth and seeking to apply. There are thus correspondingly two distinct classes of science: speculative science and practical science. Speculative sciences are those that contemplate truth whereas practical sciences are those that apply truth for some practical purpose. The sciences are then further distinguished through differentiating their various subject-matters.

Insofar as the speculative sciences merely contemplate truth but do not apply it for some practical purpose, the subject-matter of the speculative sciences is that which can be understood to some extent. Working within the Aristotelian tradition, Thomas holds that something is understood when it is separated from matter and is necessary to thing in some respect. For instance, when we understand the nature of a tree, what we understand is not primarily the matter that goes to constitute the tree in question, but what it is to be a tree, or the structuring principle of the matter that so organizes it and specifies it as a tree rather than a plant. Furthermore, assuming our understanding is correct, when we understand a thing to be a tree, we do not understand it to be a dog, or a horse, or a cat. Thus, in our understanding of a tree, we understand that which is necessary for the tree to be a tree, and not of anything that is not a tree. Hence, our understanding of a thing is separated from its matter and is necessary to it in some respect. Now, what is in motion is not necessary, since what is in motion can change. Thus, the degree to which we have understood something is conditional upon the degree to which it is separated from matter and motion. It follows then that speculative objects, the subject-matter of the speculative-sciences, insofar as they are what are understood, will be separated from matter and motion to some degree. Any distinctions that obtain amongst speculative objects will in turn signify distinctions amongst the sciences that consider those objects; and we can find distinctions amongst speculative objects based upon their disposition towards matter and motion.

There are three divisions that can apply to speculative objects, thereby permitting us to differentiate the sciences that consider such objects: (i) there is a class of speculative objects that are dependent on matter and motion both for their being and for their being understood, for instance, human beings cannot be without matter, and they cannot be understood without their constituent matter (flesh and bones); (ii) there is a class of speculative objects that depend on matter and motion for their being, but not for their being understood, for instance, we can understand lines, numbers, and points without thereby understanding the matter in which they are found, yet such things cannot be without matter; (iii) there is a class of speculative objects that depend on matter and motion neither for their being nor for their being understood.

Given these three classes of speculative objects, the speculative sciences that consider them can be enumerated accordingly: (i) physical science considers those things that depend on matter and motion both for their being and for their being understood; (ii) mathematics considers those things that depend on matter and motion for their being but not for their being understood; (iii) metaphysics or theology deals with those things that depend on matter and motion neither for their being nor for their being understood. Before going on to consider the subject-matter of metaphysics in a little more detail, it is important to point out that Thomas takes this division of the speculative sciences as exhaustive. For Thomas, there could be no fourth speculative science; the reason for this is that the subject-matter of such a science would have to be those things that depend on matter and motion for their being understood but not for their being, for all other combinations have been exhausted. Now, if a thing depends on matter and motion for its being understood but not for its being, then matter and motion would be put into its definition, which defines a thing as it exists. But if a thing’s existence is so defined as to include matter and motion, then it follows that it depends on matter and motion for its being; for it cannot be understood to be without matter and motion. Hence, all things that include matter and motion in their definitions are dependent on matter and motion for their being, but not all things that depend on matter and motion for their being depend on matter and motion for their being understood. There could be no fourth speculative science since there is no fourth class of speculative objects depending on matter and motion for their being understood but not for their being. Thomas thus sees this threefold division of the speculative sciences as an exhaustive one.

The third class of speculative objects comprises the objects of metaphysics or theology. Now Thomas does not equate these two disciplines, but goes on to distinguish between the proper subject-matter of metaphysics and the proper subject-matter of theology. Recall that this third class of speculative objects comprises those things depending on matter and motion neither for their being nor for their being understood. Such things are thus immaterial things; however, Thomas here draws a distinction. There are things that are immaterial insofar as they are in themselves complete immaterial substances; God and the angels would be examples of such things. To give the latter a title, let them be called positively immaterial. On the other hand there are things that are immaterial insofar as they simply do not depend on matter and motion, but can nevertheless be sometimes said to be found therein. In other words, things of the latter category are neutral with respect to being found in matter and motion, and hence they are neutrally immaterial. St Thomas’s examples of the latter are: being, substance, potency, form, act, one and many; such things can apply equally to material things (such as humans, dogs, cats, mice) and, to some extent, to positively immaterial things. Thus, the neutrally immaterial seem to signify certain aspects or modes of being that can apply equally to material and to immaterial things. The question then arises: what is the proper subject-matter of metaphysics: the positively immaterial or the neutrally immaterial?

According to Thomas, unaided human reason cannot have direct knowledge of the positively immaterial; this is because such things (God and angels) outstrip the human intellect’s capacity to know. Nevertheless, direct knowledge of the positively immaterial is possible, but this will not be on the basis of unaided human reason; it will require that the positively immaterial reveal themselves to us in some way, in which case direct knowledge of the positively immaterial will be dependent on some sort of revelation. As it is a purely rational science, not dependent on or presupposing the truths of revelation, metaphysics will be a study of the neutrally immaterial aspects of things, that is, a study of those modes of being that apply to all beings, whether they are material or immaterial. Such a study will be in accord with the Aristotelian conception of metaphysics as a study of being qua being, insofar as the neutrally immaterial apply to all beings and are not restricted to a certain class of beings. However, Thomas does not adopt the Aristotelian phrase (being qua being) as the subject-matter of metaphysics, he offers his own term. According to Thomas, ens commune (common being) is the proper subject-matter of metaphysics. Through an investigation of ens commune, an investigation into the aspects of being common to all beings, the metaphysician may indeed come to a knowledge of the causes of being and might thereby be led to the affirmation of divine being, but this is only at the end of the metaphysical inquiry, not at the beginning. Thus, metaphysics for Aquinas is a study of ens commune where this is understood as the common aspects of being without which a thing could not be; it does not presuppose the existence of divine being, and may not even be led to an affirmation of divine being (though Thomas of course offers several highly complex metaphysical arguments for the existence of divine being, but this should not be taken to be essential to the starting point of Thomistic metaphysics).

Metaphysics then is a study of the certain aspects common to all beings; and it is the task of the metaphysician to uncover the aspects of being that are indeed common and without which a thing could not be. There are certain aspects of being that are common insofar as they are generally applicable to all beings, and these are essence and existence; all beings exist and have an essence, hence metaphysics will be primarily concerned with the nature of essence and existence and their relationship to each other. Having completed an investigation into essence and existence, the metaphysician must investigate the aspects of being that are common to particular instances of being; and this will be a study of (i) the composition of substance and accident, and (ii) the composition of matter and form. The format of Thomistic metaphysics then takes a somewhat dyadic structure of descending generality: (i) essence and existence, (ii) substance and accident, (iii) matter and form. The format of the remainder of this article will be an investigation into these dyadic structures.

2. Essence and Existence

 

A general notion of essence is the following: essence is the definable nature of the thing that exists. Quite generally then, the essence of a thing is signified by its definition. The immediate question then is how the essence of a thing relates to its existence. In finite entities, essence is that which has existence, but it is not existence; this is a crude articulation of Thomas’s most fundamental metaphysical teaching: that essence and existence are distinct in finite entities. A consideration then of essence and existence in Thomas’s metaphysical thought will thus be a consideration of his fundamental teaching that essence and existence are distinct.

The most famous, and to a certain degree the most controversial, instance wherein Thomas argues for a distinction between a thing’s essence and its existence is in De Ente et Essentia [On Being and Essence] Chapter Four. The context is a discussion of immaterial substances and whether or not they are composed of matter. In that passage, Thomas is concerned with a popular medieval discussion known as universal hylemorphism. St Bonaventure, Thomas’s contemporary, had held that insofar as creatures are in potency in some respect, they must be material in some respect, since on the Aristotelian account, matter is the principle of potency in a thing. Thus, creatures, even immaterial creatures, must be material in some respect, even if this materiality is nothing like our corporeal materiality. Thomas takes up this issue in De Ente Chapter 4, pointing out that the Jewish thinker Avicebron seems to have been the author of this position.

Thomas takes the notion of universal hylemorphism to be absurd. Not only does it conflict with the common sayings of the philosophers, but also it is precisely as separated from matter and all material conditions that we deem separate (immaterial) substances separate, in which case they cannot be composed of matter. But if such substances cannot be composed of matter, what accounts for their potentiality? Such substances are not God, they are not pure act, they are in potentiality in some respect. So, if they are not material, then how are they in potency? Thomas is thus led to hold that they have an element of potentiality, but this is not the potency supplied by matter; rather, immaterial substances are composed of essence and existence, and it is the essence of the thing, standing in potency to a distinct act of existence, that accounts for the potentiality of creatures and thereby distinguishes them from God, who is not so composed. Thomas’s argumentation for the distinction between essence and existence unfolds on three stages and for each stage there has been at least one commentator who has held that Thomas both intended and established the real distinction therein.

In the first stage Thomas argues as follows. Whatever does not enter into the understanding of any essence is composed with that essence from without; for we cannot understand an essence without understanding the parts of that essence. But we can understand the essence of something without knowing anything about its existence; for instance, one can understand the essence of a man or a phoenix without thereby understanding the existence of either. Hence, essence and existence are distinct.

This little paragraph has generated considerable controversy, insofar as it is unclear what sort of distinction Thomas intends to establish at this stage. Is it merely a logical distinction whereby it is one thing to understand the essence of a thing and another to understand its existence? On this account, essence and existence could well be identical in the thing yet distinct in our understanding thereof (just as ‘Morning star’ and ‘Evening star’ are distinct in our conceptual expressions of the planet Venus, yet both are identical with Venus). On the other hand, does Thomas attempt to establish a real distinction whereby essence and existence are not only distinct in our understanding, but also in the thing itself? Commentators who hold that this stage only establishes a logical distinction focus on the fact that Thomas is here concerned only with our understanding of essence and not with actual (real) things; such commentators include Joseph Owens and John Wippel. Commentators who hold that this stage establishes a real distinction focus on the distinction between the act of understanding a thing’s essence and the act of knowing its existence, and they argue that a distinction in cognitional acts points to a distinction in reality; such commentators include Walter Patt, Anthony Kenny, and Steven Long.

In the second stage of argumentation, Thomas claims that if there were a being whose essence is its existence, there could only be one such being, in all else essence and existence would differ. This is clear when we consider how things can be multiplied. A thing can be multiplied in one of three ways: (i) as a genus is multiplied into its species through the addition of some difference, for instance the genus ‘animal’ is multiplied into the species ‘human’ through the addition of ‘rational’; (ii) as a species is multiplied into its individuals through being composed with matter, for instance the species ‘human’ is multiplied into various humans through being received in diverse clumps of matter; (iii) as a thing is absolute and shared in by many particular things, for instance if there were some absolute fire from which all other fires were derived. Thomas claims that a being whose essence is its existence could not be multiplied in either of the first two ways (he does not consider the third way, presumably because in that case the thing that is received or participated in is not itself multiplied; the individuals are multiplied and they simply share in some single absolute reality). A being whose essence is its existence could not be multiplied (i) through the addition of some difference, for then its essence would not be its existence but its existence plus some difference, nor could it be multiplied (ii) through being received in matter, for then it would not be subsistent, but it must be subsistent if it exists in virtue of what it is. Overall then, if there were a being whose essence is its existence, it would be unique, there could only be one such being, in all else essence and existence are distinct.

Notice that Thomas has once again concluded that essence and existence are distinct. John Wippel takes this to be the decisive stage in establishing that essence and existence are really distinct. He argues that insofar as it is impossible for there to be more than one being whose essence is its existence, there could not be in reality many such beings, in which case if we grant that there are multiple beings in reality, such beings are composed of essence and existence. On the other hand, Joseph Owens has charged Wippel with an ontological move and claims that Wippel is arguing from some positive conceptual content, to the actuality of that content in reality. Owens argues that we cannot establish the real distinction until we have established that there is something whose essence is its existence. Given the existence of a being whose essence is its existence, we can contrast its existence with the existence of finite things, and conclude that the latter are composites of essence and existence; and so Owens sees the real distinction as established at stage three of Thomas’s argumentation: the proof that there actually is a being whose essence is its existence.

Thomas begins stage three with the premise that whatever belongs to a thing belongs to it either through its intrinsic principles,  its essence, or from some extrinsic principle. A thing cannot be the cause of its own existence, for then it would have to precede itself in existence, which is absurd. Everything then whose essence is distinct from its existence must be caused to be by another. Now, what is caused to be by another is led back to what exists in itself (per se). There must be a cause then for the existence of things, and this because it is pure existence (esse tantum); otherwise an infinite regress of causes would ensue.

It is here that Owens believes that Thomas establishes the real distinction; since Thomas establishes (to his own satisfaction) that there exists a being whose essence is its existence. Consequently, we can contrast the existence of such a being with the existence of finite entities and observe that in the latter existence is received as from an efficient cause whereas in the former it is not. Thus, essence and existence are really distinct. However, it is important to note that on this interpretation, the real distinction could not enter into the argument for the existence of a being whose essence is its existence; for, on Owens’s account, such argumentation is taken to establish the real distinction. If it can be shown then that Thomas’s argumentation for the existence of a being whose essence is its existence does presuppose the real distinction, then Owens’s views as to the stage at which the real distinction is established would be considerably undermined.

Having established (at some stage) that essence and existence are distinct and that there exists a being whose essence is its existence, Thomas goes on to conclude that in immaterial substances, essence is related to existence as potency to act. The latter follows insofar as what receives existence stands in potency to the existence that it receives. But all things receive existence from the being whose essence is its existence, in which case the existence that any one finite thing possesses is an act of existence that actuates a corresponding potency: the essence. Thomas has thus shown that immaterial substances do indeed have an element of potency, but this need not be a material potency.

Notice that here Thomas correlates essence and existence as potency and act only after he has concluded to the existence of a being whose essence is its existence (God). One wonders then whether or not essence and existence can be related as potency and act only on the presupposition of the existence of God. Regardless of his preferred method in the De Ente Chapter 4, Thomas could very well have focussed on the efficiently caused character of existence in finite entities (as he does in the opening lines of the argument for the existence of God), and argued that insofar as existence is efficiently caused (whether or not this is from God), existence stands to that in which it inheres as act to potency, in which case the essence that possesses existence stands in potency to that act of existence. Therefore, Thomas need not presuppose the existence of God in order to hold that essence and existence are related as potency and act; all he need presuppose is (i) that essence and existence are distinct and (ii) that existence is efficiently caused in the essence/existence composite.

3. Participation

 

Essence/existence composites merely have existence; whatever an essence/existence composite is, it is not its existence. Insofar as essence/existence composites merely have, but are not, existence, they participate in existence in order to exist. This is a second of Thomas’s fundamental metaphysical teachings: whatever does not essentially exist, merely participates in existence. Insofar as no essence/existence composite essentially exists, all essence/existence composites merely participate in existence. More specifically, the act of existence that each and every essence/existence composite possesses is participated in by the essence that exists.

As a definition of participation, Thomas claims that to participate is to take a part (in) (partem capere) something. Following this definition, Thomas goes on to explain how one thing can be said to take a part in and thereby participate in another; this can happen in three ways.

Firstly, when something receives in a particular fashion what pertains universally to another, it is said to participate in that other; for example, a species (‘man’) is said to participate in its genus (‘animal’) and an individual (Socrates) is said to participate in its species (‘man’) because they (the species and the individual) do not possess the intelligible structure of that in which they participate according to its full universality.

Secondly, a subject is said to participate in the accidents that it has (for instance, a man is a certain colour, and thereby participates in the colour of which he is), and matter is said to participate in the formal structure that it has (for instance, the matter of a statue participates in the shape of that statue in order to be the statue in question).

Thirdly, an effect can be said to participate in its cause, especially when the effect is not equal to the power of that cause. The effect particularises and determines the scope of the cause; for the effect acts as the determinate recipient of the power of the cause. The effect receives from its cause only that which is necessary for the production of the effect. It is in this way that a cause is participated in by its effect.

In all of the foregoing modes of participation, to participate is to limit that which is participated in some respect. This follows from the original etymological definition of participation, that to participate is to take a part (in); for if to participate is merely to take a part in something, the participant will not possess the nature of the thing in which it participates in any total fashion, but only in partial fashion. What then can we conclude about the participation framework that governs essence and existence?

Essences exist, but they do not exist essentially, they participate in their acts of existence. Insofar as an essence participates in its act of existence, the essence limits that act of existence to the nature of the essence whose act it is; for the essence merely has existence, it is not existence, in which case its possession of existence will be in accord with the nature of the essence. The act of existence is thus limited and thereby individuated to the essence whose act it is. As a concrete application of this, consider the following. George Bush’s existence is not Tony Blair’s existence; when George Bush came into existence, Tony Blair did not come into existence, and when George Bush ceases to exist, Tony Blair will, in all likelihood, not cease to exist. George Bush’s existence is not indexed to the existence of Tony Blair, in which case the existence of either George Bush or Tony Blair is not identical to the other. The act of existence then is individuated to the essence whose act it is, and this because the essence merely participates in, and thereby limits, the act of existence that it possesses.

4. Substance and Accident

 

The next fundamental metaphysical category is that of substance. According to Aquinas, substances are what are primarily said to exist, and so substances are what have existence but yet are not identical with existence. Aquinas’s ontology then is comprised primarily of substances, and all change is either a change of one substance into another substance, or a modification of an already existing substance. Given that essence is that which is said to possess existence, but is not identical to existence, substances are essence/existence composites; their existence is not guaranteed by what they are. They simply have existence as limited by their essence.

Let us begin with a logical definition of substance, as this will give us an indication of its metaphysical nature. Logically speaking, a substance is what is predicated neither of nor in anything else. This captures the fundamental notion that substances are basic, and everything else is predicated either of or in them. Now, if we transpose this logical definition of substance to the realm of metaphysics, where existence is taken into consideration, we can say that a substance is that whose nature it is to exist not in some subject or as a part of anything else, but what exists in itself. Thus, a substance is a properly basic entity, existing per se (though of course depending on an external cause for its existence), and the paradigm instances of which are the medium sized objects that we see around us: horses, cats, trees and humans.

On the other hand there are accidents. Accidents are what accrue to substances and modify substances in some way. Logically speaking, accidents are predicated of or in some substance; metaphysically speaking, accidents cannot exist in themselves but only as part of some substance. As their name suggests, accidents are incidental to the thing, and they can come and go without the thing losing its identity; whereas a thing cannot cease to be the substance that it is without losing its identity.

Accidents only exist as part of some substance. It follows then that we cannot have un-exemplified properties as if they were substances in themselves. Properties are always exemplified by some substance, whereas substance itself is un-exemplifiable. For example, brown is always predicated of something, we say that x is brown, in which case brown is an accident. However, brown is never found to be in itself, it is always exemplified by something of which it is said.

Within Aquinas’s metaphysical framework, substances can be both material (cats, dogs, humans) and immaterial (angels), but as noted above, the paradigm instances of substances are material substances, and the latter are composites of matter and form; a material substance is neither its matter alone nor its form alone, since matter and form are always said to be of some individual and never in themselves. It follows then that material substances have parts, and the immediate question arises as to whether or not the parts of substances are themselves substances. In order to address this issue, we must ask two questions: (i) while they are parts of a substance, are such parts themselves substances? and (ii) are the parts of a substance themselves things that can exist without the substance of which they are parts?

Concerning (i) we must say that whilst they are parts of a substance such parts cannot be substances; this is so given the definition of substance outlined above: that whose nature it is to exist not in some subject. Given that the parts of a substance are in fact parts of a substance, it is their nature to exist in some subject  of which they are a part. Consequently, the parts of a substance cannot themselves be substances.

Concerning (ii) the case is somewhat different, now we must consider whether or not the parts of a substance can exist without the substance of which they are parts, that is, after the dissolution of the substance of which they are parts do the parts become substances in themselves? The parts of a substance receive their identity through being the parts of the substance whose parts they are. Thus, the flesh and bone of a human are flesh and bone precisely because they are parts of a human. When the human dies, the flesh and bone are no longer flesh and bone (except equivocally speaking) because they are no longer parts of a human substance; rather, the flesh and bone cease to function as flesh and bone and begin to decompose, in which case they are not themselves substances. However, on Aquinas’s view, the elements out of which a substance is made can indeed subsist beyond the dissolution of the substance. Thus, whilst the elements are parts of the substance, they are not, as parts of a substance, substances in themselves, but when the substance dissolves, the elements will remain as independent substances in their own right. Thus, in the case of the dissolution of the human being, whilst the flesh and bone no longer remain but decompose, the elements that played a role in the formation of the substance remain. In more contemporary terms we could say that before they go to make up the bodily substances we see in the world, atoms are substances in themselves, but when united in a certain form they go to make cats, dogs, humans, and cease to be independent substances in themselves. When the cat or dog or human perishes, its flesh and bones perish with it, but its atoms regain their substantial nature and they remain as substances in themselves. So, a substance can have its parts, and for as long as those parts are parts of a substance, those parts are not substances in themselves, but when the substance decomposes, those parts can be considered as substances in themselves so long as they are capable of subsisting in themselves.

5. Matter and Form

 

A very crude definition of matter would be that it is the ‘stuff’ out of which a thing is made, whereas form is signified by the organisation that the matter takes. A common example used by Aquinas and his contemporaries for explaining matter and form was that of a statue. Consider a marble statue. The marble is the matter of the statue whereas the shape signifies the form of the statue. The marble is the ‘stuff’ out of which the statue is made whereas the shape signifies the form that the artist decided to give to the statue. On a more metaphysical level, form is the principle whereby the matter has the particular structure that it has, and matter is simply that which stands to be structured in a certain way. It follows from this initial account that matter is a principle of potency in a thing; since if the matter is that which stands to be structured in a certain way, matter can be potentially an indefinite number of forms. Form on the other hand is not potentially one thing or another; form as form is the kind of thing that it is and no other.

On Aquinas’s account, there are certain levels of matter/form composition. On one level we can think of the matter of a statue as being the marble whereas we can think of the shape of the statue as signifying the form. But on a different level with can think of the marble as signifying the form and something more fundamental being the matter. For instance, before the marble was formed into the statue by the sculptor, it was a block of marble, already with a certain form that made it ‘marble’. At this level, the marble cannot be the matter of the thing, since its being marble and not, say, granite, is its form. Thus, there is a more fundamental level of materiality that admits of being formed in such a way that the end product is marble or granite, and at a higher level, this formed matter stands as matter for the artist when constructing the statue.

If we think of matter as without any form, we come to the notion of prime matter, and this is a type of matter that is totally unformed, pure materiality itself. Prime matter is the ultimate subject of form, and in itself indefinable; we can only understand prime matter through thinking of matter as wholly devoid of form. As wholly devoid of form prime matter is neither a substance nor any of the other categories of being; prime matter, as pure potency, cannot in fact express any concrete mode of being, since as pure potency is does not exist except as potency. Thus, prime matter is not a thing actually existing, since it has no principle of act rendering it actually existing.

Matter can be considered in two senses: (i) as designated and (ii) as undesignated. Designated matter is the type of matter to which one can point and of which one can make use. It is the matter that we see around us. Undesignated matter is a type of matter that we simply consider through the use of our reason; it is the abstracted notion of matter. For instance, the actual flesh and bones that make up an individual man are instances of designated matter, whereas the notions of ‘flesh’ and ‘bones’ are abstracted notions of certain types of matter and these are taken to enter into the definition of ‘man’ as such. Designated matter is what individuates some form. As noted, the form of a thing is the principle of its material organisation. A thing’s form then can apply to many different things insofar as those things are all organised in the same way. The form then can be said to be universal, since it remains the same but is predicated over different things. As signifying the actual matter that is organised in the thing, designated matter individuates the form to ‘this’ or ‘that’ particular thing, thereby ensuring individuals (Socrates, Plato, Aristotle) of the same form (man).

Given that form is the principle of organisation of a thing’s matter, or the thing’s intelligible nature, form can be of two kinds. On the one hand, form can be substantial, organising the matter into the kind of thing that the substance is. On the other hand, form can be accidental, organising some part of an already constituted substance. We can come to a greater understanding of substantial and accidental form if we consider their relation to matter. Substantial form always informs prime matter and in doing so it brings a new substance into existence; accidental form simply informs an already existing substance (an already existing composite of substantial form and prime matter), and in doing so it simply modifies some substance. Given that substantial form always informs prime matter, there can be only one substantial form of a thing; for if substantial form informs prime matter, any other form that may accrue to a thing is posterior to it and simply informs an already constituted substance, which is the role of accidental form. Thus, there can only be one substantial form of a thing.

As stated above, essence is signified by the definition of a thing; essence is the definable nature of the thing that exists. A thing’s essence then is its definition. It follows that on Thomas’s account the essence of a thing is the composition of its matter and form, where matter here is taken as undesignated matter. Contrary to contemporary theories of essence, Aquinas does not, strictly speaking, take essence to be what is essential to the thing in question, where the latter is determined by a thing’s possessing some property or set of properties in all possible worlds. In the latter context, the essence of a thing comprises its essential properties, properties that are true of it in all possible worlds; but this is surely not Aquinas’s view. For Aquinas, the essence of a thing is not the conglomeration of those properties that it would possess in all possible worlds, but the composition of matter and form. On a possible-worlds view of essence, the essence of a thing could not signify the matter/form composite as it is in this actual world, since such a composite could be different in some possible world and therefore not uniform across all possible worlds. Thus, Aquinas does not adopt a possible-worlds view of essence; he envisages the essence of a thing as the definition or quiddity of the thing existing in this world, not as it would exist in all possible worlds.

6. References and Further Reading

a. Primary Sources

  • St Thomas Aquinas, Summa Theologiae, trans. Fathers of the English Dominican Province (1948), U.S.A: Christian Classics.
    • Aquinas’s philosophical and theological masterpiece; Part I (Prima Pars) is the most important for Thomas’s metaphysical thought.
  • St Thomas Aquinas, The Divisions and Methods of the Sciences: Questions V and VI of his Commentary on the De Trinitate of Boethius, trans. Armand Maurer (1953) Toronto: Pontifical Institute of Medieval Studies.
    • A detailed consideration by Thomas of the divisions and methods of the sciences.
  • St Thomas Aquinas, On Being and Essence, trans. Armand Maurer (1968), Toronto: Pontifical Institute Medieval Studies.
    • An excellent summary from Aquinas himself of his metaphysical views.
  • St Thomas Aquinas, Summa Contra Gentiles, trans. A.C Pegis (1975), Notre Dame: University of Notre Dame Press.
    • One of Aquinas’s great Summae; books I and II are most important for Thomas’s metaphysical thought.
  • St Thomas Aquinas, Commentary on Aristotle’s Metaphysics, trans. John P. Rowan (1995), U.S.A: Dumb Ox Books.
    • A direct commentary on the Metaphysics of Aristotle.
  • St Thomas Aquinas, Aquinas on matter and form and the elements: a translation and interpretation of the De principiis naturae and the De mixtione elementorum of St. Thomas Aquinas, trans. Joseph Bobik (1998), Notre Dame: University of Notre Dame Press.
    • A translation of and commentary on Aqiunas’s works on ‘matter and form and the elements’.

b. General Studies

  • Clarke, W.N., Explorations in Metaphysics — Being, God, Person (1994), Notre Dame: University of Notre Dame Press.
  • Clarke, W.N., The One and the Many — A Contemporary Thomistic Metaphysics (2001), Notre Dame: University of Notre Dame Press.
  • Copleston, F., Aquinas (1955), London: Penguin.
  • Davies, B., The Thought of Thomas Aquinas (1993), Oxford: Clarendon Paperbacks.
  • Fabro, C., La Nozione Metafisica di Partecipazione secondo S. Tommaso d’Aquino (1950), Turin: Società Editrice Internazionale.
  • Fabro, C., Participation et causalité selon S. Thomas d’Aquin (1961), Louvain: Publications Universaitaires.
    • Both works by Fabro were groundbreaking for their uncovering the Platonic influences in Aquinas’s thought.
  • de Finance, J., Être et agir dans la philosophie de Saint Thomas (1960), Rome: Librairie Éditrice de l’Université Grégorienne.
  • Geiger, L-B., La participation dans la philosophie de s. Thomas d’Aquin (1953), Paris: Librairie Philosophique J. Vrin.
    • As with Fabro’s work, Geiger’s too uncovers the Platonic influences in Aquinas’s thought.
  • Kenny, A., Aquinas on Being (2003), Oxford: Oxford University Press.
  • Klima, G., ‘Contemporary “Essentialism” vs. Aristotelian Essentialism’, in John Haldane, ed., Mind Metaphysics, and Value in the Thomistic and Analytical Traditions (2002), Notre Dame: University of Notre Dame Press.
  • Kretzman, N., & Stump, E. eds. The Cambridge Companion to Aquinas (1993), Cambridge: Cambridge University Press.
  • Long, S., ‘On the Natural Knowledge of the Real Distinction’, Nova et Vetera (2003), 1:1.
  • Long, S., ‘Aquinas on Being and Logicism’, New Blackfriars (2005), 86.
  • Owens, J., ‘A Note on the Approach to Thomistic Metaphysics’, The New Scholasticism (1954), 28:4.
  • Owens, J., ‘Quiddity and the Real Distinction in St Thomas Aquinas’, Mediaeval Studies (1965), 27.
  • Owens, J., The Doctrine of Being in the Aristotelian Metaphysics (1978), Toronto: Pontifical Institute of Medieval Studies.
  • Owens, J., St Thomas Aquinas on the Existence of God — The Collected Papers of Joseph Owens ed. Catan, John. R. (1980), Albany: State University of New York Press.
  • Owens, J., ‘Stages and Distinction in De Ente’, The Thomist, (1981), 45.
  • Owens, J., An Elementary Christian Metaphysics (1985), Houston, Texas: Bruce Publishing Company.
  • Owens, J., An Interpretation of Existence (1985), Texas: Center for Thomistic Studies — University of St Thomas.
  • Owens, J., ‘Aquinas’s Distinction at De Ente et Essentia 4.119 — 123’, Mediaeval Studies (1986), 48.
  • Patt, W. ‘Aquinas’s Real Distinction and some Interpretations’, The New Scholasticism (1988), 62:1.
  • Stump, E., Aquinas (2003), London-New York: Routledge.
    • Part I, Chapter I, offers a very good survey of Aquinas’s metaphysics, but leaves out details of his theory of essence/existence composition. Pp. 36 – 44 are particularly illuminating on matter and form, substance and accident.
  • Torrell, J.-P., Saint Thomas Aquinas: The Person and His Work, trans. Robert Royal (1996), Washington: The Catholic University of America Press.
    • This is the most up-to-date publication on Aquinas’s life and work.
  • te Velde R. A., Participation and Substantiality in Thomas Aquinas (1995), Leiden-New York-Cologne: E.J. Brill.
  • Wippel, J., 1979: ‘Aquinas’s Route to the Real Distinction’, The Thomist, 43.
  • Wippel, J.,  Metaphysical Themes in Thomas Aquinas (1984), USA: The Catholic University of America Press.
  • Wippel, J.,  The Metaphysical Thought of Thomas Aquinas (2000), USA: The Catholic University of America Press.
  • Wippel, J.,  Metaphysical Themes in Thomas Aquinas II (2007), USA: The Catholic University of America Press.
    • Wippel’s work is generally taken to be essential for scholarly work on Aquinas’s metaphysical thought.

 

Author Information

Gaven Kerr
Email: gkerr07@qub.ac.uk
Queen’s University Belfast
Northern Ireland

David Hume (1711—1776)

“Hume is our Politics, Hume is our Trade, Hume is our Philosophy, Hume is our Religion.” This statement by nineteenth century philosopher James Hutchison Stirling reflects the unique position in intellectual thought held by Scottish philosopher David Hume. Part of Hume’s fame and importance owes to his boldly skeptical approach to a range of philosophical subjects. In epistemology, he questioned common notions of personal identity, and argued that there is no permanent “self” that continues over time. He dismissed standard accounts of causality and argued that our conceptions of cause-effect relations are grounded in habits of thinking, rather than in the perception of causal forces in the external world itself. He defended the skeptical position that human reason is inherently contradictory, and it is only through naturally-instilled beliefs that we can navigate our way through common life. In the philosophy of religion, he argued that it is unreasonable to believe testimonies of alleged miraculous events, and he hints, accordingly, that we should reject religions that are founded on miracle testimonies. Against the common belief of the time that God’s existence could be proven through a design or causal argument, Hume offered compelling criticisms of standard theistic proofs. He also advanced theories on the origin of popular religious beliefs, grounding such notions in human psychology rather than in rational argument or divine revelation. The larger aim of his critique was to disentangle philosophy from religion and thus allow philosophy to pursue its own ends without rational over-extension or psychological corruption.  In moral theory, against the common view that God plays an important role in the creation and reinforcement of moral values, he offered one of the first purely secular moral theories, which grounded morality in the pleasing and useful consequences that result from our actions. He introduced the term “utility” into our moral vocabulary, and his theory is the immediate forerunner to the classic utilitarian views of Jeremy Bentham and John Stuart Mill. He is famous for the position that we cannot derive ought from is, the view that statements of moral obligation cannot simply be deduced from statements of fact. Some see Hume as an early proponent of the emotivist metaethical view that moral judgments principally express our feelings. He also made important contributions to aesthetic theory with his view that there is a uniform standard of taste within human nature, in political theory with his critique of social contractarianism, and economic theory with his anti-mercantilist views. As a philosophical historian, he defended the conservative view that British governments are best run through a strong monarchy.

Table of Contents

  1. Life
  2. Origin and Association of Ideas
  3. Epistemological Issues
    1. Space
    2. Time
    3. Necessary Connection between Causes and Effects
    4. External Objects
    5. Personal Identity
    6. Free Will
  4. Skepticism
  5. Theory of the Passions
  6. Religious Belief
    1. Miracles
    2. Psychology of Religious Belief
    3. Arguments for God’s Existence
  7. Moral Theory
  8. Aesthetic, Political, and Economic Theory
  9. History and Philosophy
  10. References and Further Reading
    1. Recent Editions of Hume’s Writings
    2. Chronological List of Hume’s Publications
    3. Biographies, Letters, Manuscripts
    4. Bibliographies
    5. Works on Hume

1. Life

 

David Hume was born in 1711 to a moderately wealthy family from Berwickshire Scotland, near Edinburgh. His background was politically Whiggish and religiously Calvinistic. As a child he faithfully attended the local Church of Scotland, pastored by his uncle. Hume was educated by his widowed mother until he left for the University of Edinburgh at the age of eleven. His letters describe how as a young student he took religion seriously and obediently followed a list of moral guidelines taken from The Whole Duty of Man, a popular Calvinistic devotional.

Leaving the University of Edinburgh around the age of fifteen to pursue his education privately, he was encouraged to consider a career in law, but his interests soon turned to philosophy. During these years of private study he began raising serious questions about religion, as he recounts in the following letter:

Tis not long ago that I burn’d an old Manuscript Book, wrote before I was twenty; which contain’d, Page after Page, the gradual Progress of my Thoughts on that head [i.e. religious belief]. It begun with an anxious Search after Arguments, to confirm the common Opinion: Doubts stole in, dissipated, return’d, were again dissipated, return’d again [To Gilbert Elliot of Minto, March 10, 1751].

Although his manuscript book was destroyed, several pages of his study notes survive from his early twenties. These show a preoccupation with proofs for God’s existence as well as atheism, particularly as he read on these topics in classical Greek and Latin texts and in Pierre Bayle’s skeptical Historical and Critical Dictionary. During these years of private study, some of which were in France, he composed his three-volume Treatise of Human Nature, which was published anonymously in two installments before he was thirty (1739, 1740). The Treatise explores several philosophical topics such as space, time, causality, external objects, the passions, free will, and morality, offering original and often skeptical appraisals of these notions. Book I of the Treatise was unfavorably reviewed in the History of the Works of the Learned with a succession of sarcastic comments. Although scholars today recognized it as a philosophical masterpiece, Hume was disappointed with the minimal interest his book spawned and said that “It fell dead-born from the press, without reaching such distinctions even to excite a murmur among the zealots” (My Own Life).

In 1741 and 1742 Hume published his two-volume Essays, Moral and Political, which were written in a popular style and were more successful than the Treatise. In 1744-1745 he was a candidate for the Chair of Moral Philosophy at the University of Edinburgh. The Edinburgh Town Council was responsible for electing a replacement, and critics opposed Hume by condemning his anti-religious writings. Chief among the critics was clergyman William Wishart (d. 1752), the Principal of the University of Edinburgh. Lists of allegedly dangerous propositions from Hume’s Treatise circulated, presumably penned by Wishart himself. In the face of such strong opposition, the Edinburgh Town Council consulted the Edinburgh ministers. Hoping to win over the clergy, Hume composed a point by point reply to the circulating lists of dangerous propositions, which was published as A Letter from a Gentleman to his Friend in Edinburgh. The clergy were not swayed, 12 of the 15 ministers voted against Hume, and he quickly withdrew his candidacy. In 1745 Hume accepted an invitation from General St Clair to attend him as secretary. He wore the uniform of an officer, and accompanied the general on an expedition against Canada (which ended in an incursion on the coast of France) and to an embassy post in the courts of Vienna and Turin.

Because of the success of his Essays, Hume was convinced that the poor reception of his Treatise was caused by its style rather than by its content. In 1748 he published his Enquiry Concerning Human Understanding, a more popular rendition of portions of Book I of the Treatise. The Enquiry also includes two sections not found in the Treatise: “Of Miracles” and a dialogue titled “Of a Particular Providence and of a Future State.” Each section contains direct attacks on religious belief. In 1751 he published his Enquiry Concerning the Principles of Morals, which recasts parts of Book III of the Treatise in a very different form. The work establishes a system of morality upon utility and human sentiments alone, and without appeal to divine moral commands. By the end of the century Hume was recognized as the founder of the moral theory of utility, and utilitarian political theorist Jeremy Bentham acknowledged Hume’s direct influence upon him. The same year Hume also published his Political Discourses, which drew immediate praise and influenced economic thinkers such as Adam Smith, William Godwin, and Thomas Malthus.

In 1751-1752 Hume sought a philosophy chair at the University of Glasgow, and was again unsuccessful. In 1752 his new employment as librarian of the Advocate’s Library in Edinburgh provided him with the resources to pursue his interest in history. There, he wrote much of his highly successful six-volume History of England (published from 1754 to 1762). The first volume was unfavorably received, partially for its defense of Charles I, and partially for two sections which attack Christianity. In one passage Hume notes that the first Protestant reformers were fanatical or “inflamed with the highest enthusiasm” in their opposition to Roman Catholic domination. In the second passage he labels Roman Catholicism a superstition which “like all other species of superstition. . .  rouses the vain fears of unhappy mortals.” The most vocal attack against Hume’s History came from Daniel MacQueen in his 300 page Letters on Mr. Hume’s History. MacQueen scrutinizes the first volume of Hume’s work, exposing all the allegedly “loose and irreligious sneers” Hume makes against Christianity. Ultimately, this negative response led Hume to delete the two controversial passages from succeeding editions of the History.

Around this time Hume also wrote his two most substantial works on religion: The Dialogues Concerning Natural Religion and The Natural History of Religion. The Natural History appeared in 1757, but, on the advice of friends who wished to steer Hume away from religious controversy, the Dialogues remained unpublished until 1779, three years after his death. The Natural History aroused controversy even before it was made public. In 1756 a volume of Hume’s essays titled Five Dissertations was printed and ready for distribution. The essays included (1) “The Natural History of Religion;” (2) “Of the Passions;” (3) “Of Tragedy;” (4) “Of Suicide;” and (5) “Of the Immortality of the Soul.” The latter two essays made direct attacks on common religious doctrines by defending a person’s moral right to commit suicide and by criticizing the idea of life after death. Early copies were passed around, and Hume’s publisher was threatened with prosecution if the book was distributed as it was. The printed copies of Five Dissertations were then physically altered by removing the essays on suicide and immortality, and inserting a new essay “Of the Standard of Taste” in their place. Hume also took this opportunity to alter two particularly offending paragraphs in the Natural History. The essays were then bound with the new title Four Dissertations and distributed in January, 1757.

In the years following Four Dissertations, Hume completed his last major literary work, The History of England, which gave him a reputation as an historian that equaled, if not overshadowed, his reputation as a philosopher.  In 1763, at age 50, he was invited to accompany the Earl of Hertford to the embassy in Paris, with a near prospect of being his secretary. He eventually accepted, and remarks at the reception he received in Paris “from men and women of all ranks and stations.” He returned to Edinburgh in 1766, and continued developing relations with the greatest minds of the time. Among these was Jean Jacques Rousseau who in 1766 was ordered out of Switzerland by the government in Berne. Hume offered Rousseau refuge in England and secured him a government pension. In England, Rousseau became suspicious of plots, and publicly charged Hume with conspiring to ruin his character, under the appearance of helping him. Hume published a pamphlet defending his actions and was exonerated. Another secretary appointment took him away from 1767-1768. Returning again to Edinburgh, his remaining years were spent revising and refining his published works, and socializing with friends in Edinburgh’s intellectual circles. In 1770, fellow Scotsman James Beattie published one of the harshest attacks on Hume’s philosophy to ever appear in print, entitled An Essay on the Nature and Immutability of Truth in Opposition to Sophistry and Scepticism. Hume was upset by Beattie’s relentless verbal attacks against him in the work, but the book made Beattie famous and King George III, who admired it, awarded Beattie a pension of £200 per year.

In 1776, at age sixty-five, Hume died from an internal disorder which had plagued him for many months. After his death, his name took on new significance as several of his previously unpublished works appeared. The first was a brief autobiography, My Own Life, but even this unpretentious work aroused controversy. As his friends, Adam Smith and S.J. Pratt, published affectionate eulogies describing how he died with no concern for an afterlife, religious critics responded by condemning this unjustifiable admiration of Hume’s infidelity. Two years later, in 1779, Hume’s Dialogues Concerning Natural Religion appeared. Again, the response was mixed. Admirers of Hume considered it a masterfully written work, while religious critics branded it as dangerous to religion. Finally, in 1782, Hume’s two suppressed essays on suicide and immortality were published. Their reception was almost unanimously negative.

2. Origin and Association of Ideas

Drawing heavily on John Locke’s empiricism, the opening sections of both the Treatise and Enquiry discuss the origins of mental perceptions as laid out in the following categorical scheme:

Perceptions

A. Ideas

1. From memory

2. From imagination

a. From fancy

b. From understanding

(1) Involving relations of ideas

(2) Involving matters of fact

B. Impressions

1. Of sensation (external)

2. Of reflection (internal)

Hume begins by dividing all mental perceptions between ideas (thoughts) and impressions (sensations and feelings), and then makes two central claims about the relation between them. First, advancing what is commonly called Hume’s copy thesis, he argues that all ideas are ultimately copied from impressions. That is, for any idea we select, we can trace the component parts of that idea to some external sensation or internal feeling. This claim places Hume squarely in the empiricist tradition, and he regularly uses this principle as a test for determining the content of an idea under consideration. As proof of the copy thesis, Hume challenges anyone who denies it “to shew a simple impression, that has not a correspondent idea, or a simple idea, that has not a correspondent impression” (Treatise, 1.1.1). Second, advancing what we may call Hume’s liveliness thesis, he argues that ideas and impressions differ only in terms of liveliness. For example, my impression of a tree is simply more vivid than my idea of that tree. One of his early critics, Lord Monboddo (1714–1799) pointed out an important implication of the liveliness thesis, which Hume himself presumably hides. Most modern philosophers held that ideas reside in our spiritual minds, whereas impressions originate in our physical bodies. So, when Hume blurs the distinction between ideas and impressions, he is ultimately denying the spiritual nature of ideas and instead grounding them in our physical nature. In short, all of our mental operations—including our most rational ideas—are physical in nature. As Monboddo writes, “One consequence, which Mr Hume has drawn from this doctrine, is, that, as our Mind can only operate by the organs of the Body, it must perish with the Body” (Ancient Metaphysics, 1782, 2.2.2).

Hume goes on to explain that there are several mental faculties that are responsible for producing our various ideas. He initially divides ideas between those produced by the memory, and those produced by the imagination. The memory is a faculty that conjures up ideas based on experiences as they happened. For example, the memory I have of my drive to the store is a comparatively accurate copy of my previous sense impressions of that experience. The imagination, by contrast, is a faculty that breaks apart and combines ideas, thus forming new ones. Hume uses the familiar example of a golden mountain: this idea is a combination of an idea of gold and an idea of a mountain. As our imagination takes our most basic ideas and leads us to form new ones, it is directed by three principles of association, namely, resemblance, contiguity, and cause and effect. By virtue of resemblance, an illustration or sketch, of a person leads me to an idea of that actual person. The idea of one apartment in a building leads me to think of the apartment contiguous to—or next to—the first. The thought of a scar on my hand leads me to think of a broken piece of glass that caused the scar.

As indicated in the above chart, our more complex ideas of the imagination are further divided between two categories. Some imaginative ideas represent flights of the fancy, such as the idea of a golden mountain; however, other imaginative ideas represent solid reasoning, such as predicting the trajectory of a thrown ball. The fanciful ideas are derived from the faculty of the fancy, and are the source of fantasies, superstitions, and bad philosophy. By contrast, sound ideas are derived from the faculty of the understanding—or reason—and are of two types: (1) involving relations of ideas; or (2) involving matters of fact. A relation of ideas (or relation between ideas) is a mathematical relation that is “discoverable by the mere operation of thought, without dependence on what is anywhere existent in the universe,” such as the mathematical statement “the square of the hypotenuse is equal to the square of the two sides” (Enquiry, 4). By contrast, a matter of fact, for Hume, is any object or circumstance which has physical existence, such as “the sun will rise tomorrow”. This split between relations of ideas and matters of fact is commonly called “Hume’s Fork”, and Hume himself uses it as a radical tool for distinguishing between well-founded ideas of the understanding, and unfounded ideas of the fancy. He dramatically makes this point at the conclusion of his Enquiry:

When we run over libraries, persuaded of these principles, what havoc must we make? If we take in our hand any volume; of divinity or school metaphysics, for instance; let us ask, Does it contain any abstract reasoning concerning quantity or number? No. Does it contain any experimental reasoning concerning matter of fact and existence? No. Commit it then to the flames: For it can contain nothing but sophistry and illusion (Enquiry, 12).

For Hume, when we imaginatively exercise our understanding regarding relations of ideas and matters of fact, our minds are guided by seven philosophical or “reasoning” relations, which are as follows:

Principles of reasoning concerning relations of ideas (involving demonstration): (1) resemblance; (2) contrariety; (3) degrees in quality; and (4) proportions in quantity or number

Principles of reasoning concerning matters of fact (involving judgments of probability): (5) identity; (6) relations in time and place; and (7) causation

Human understanding and reasoning at its best, then, involves ideas that are grounded in the above seven principles.

3. Epistemological Issues

 

Much of Hume’s epistemology is driven by a consideration of philosophically important issues, such as space and time, cause-effect, external objects, personal identity, and free will. In his analysis of these issues in the Treatise, he repeatedly does three things. First, he skeptically argues that we are unable to gain complete knowledge of some important philosophical notion under consideration. Second, he shows how the understanding gives us a very limited idea of that notion. Third, he explains how some erroneous views of that notion are grounded in the fancy, and he accordingly recommends that we reject those erroneous ideas. We will follow this three-part scheme as we consider Hume’s discussions of various topics.

a. Space

On the topic of space, Hume argues that our proper notions of space are confined to our visual and tactile experiences of the three-dimensional world, and we err if we think of space more abstractly and independently of those visual and tactile experiences. In essence, our proper notion of space is like what Locke calls a “secondary quality” of an object, which is spectator dependent, meaning grounded in the physiology of our perceptual mental processes. Thus, our proper notion of space is not like a “primary quality” that refers to some external state of affairs independent of our perceptual mental process. Following the above three-part scheme, (1) Hume skeptically argues that we have no ideas of infinitely divisible space (Treatise, 1.2.2.2). (2) When accounting for the idea we do have of space, he argues that “the idea of space is convey’d to the mind by two senses, the sight and touch; nor does any thing ever appear extended, that is not either visible or tangible” (Treatise, 1.2.3.15). Further, he argues that these objects—which are either visible or tangible—are composed of finite atoms or corpuscles, which are themselves “endow’d with colour and solidity.” These impressions are then “comprehended” or conceived by the imagination; it is from the structuring of these impressions that we obtain a limited idea of space. (3) In contrast to this idea of space, Hume argues that we frequently presume to have an idea of space that lacks visibility or solidity. He accounts for this erroneous notion in terms of a mistaken association that people naturally make between visual and tactile space (Treatise, 1.2.5.21).

b. Time

Hume’s treatment of our idea of time is like his treatment of the idea of space, in that our proper idea of time is like a secondary quality, grounded in our mental operations, not a primary quality grounded in some external phenomenon beyond our experience. (1) He first maintains that we have no idea of infinitely divisible time (Treatise, 1.2.4.1). (2) He then notes Locke’s point that our minds operate at a range of speeds that are “fix’d by the original nature and constitution of the mind, and beyond which no influence of external objects on the senses is ever able to hasten or retard our thought” (Treatise, 1.2.3.7). The idea of time, then, is not a simple idea derived from a simple impression; instead, it is a copy of impressions as they are perceived by the mind at its fixed speed (Treatise, 1.2.3.10). (3) In contrast to this limited view of time, he argues that we frequently entertain a faulty notion of time that does not involve change or succession. The psychological account of this erroneous view is that we mistake time for the cause of succession instead of seeing it as the effect (Treatise, 1.2.5.29).

c. Necessary Connection between Causes and Effects

According to Hume, the notion of cause-effect is a complex idea that is made up of three more foundational ideas: priority in time, proximity in space, and necessary connection. Concerning priority in time, if I say that event A causes event B, one thing I mean is that A occurs prior to B. If B were to occur before A, then it would be absurd to say that A was the cause of B. Concerning the idea of proximity, if I say that A causes B, then I mean that B is in proximity to, or close to A. For example, if I throw a rock, and at that moment someone’s window in China breaks, I would not conclude that my rock broke a window on the other side of the world. The broken window and the rock must be in proximity with each other. Priority and proximity alone, however, do not make up our entire notion of causality. For example, if I sneeze and the lights go out, I would not conclude that my sneeze was the cause, even though the conditions of priority and proximity were fulfilled. We also believe that there is a necessary connection between cause A and effect B. During the modern period of philosophy, philosophers thought of necessary connection as a power or force connecting two events. When billiard ball A strikes billiard ball B, there is a power that the one event imparts to the other. In keeping with his empiricist copy thesis, that all ideas are copied from impressions, Hume tries to uncover the experiences which give rise to our notions of priority, proximity, and necessary connection. The first two are easy to explain. Priority traces back to our various experiences of time. Proximity traces back to our various experiences of space. But what is the experience which gives us the idea of necessary connection? This notion of necessary connection is the specific focus of Hume’s analysis of cause-effect.

Hume’s view is that our proper idea of necessary connection is like a secondary quality that is formed by the mind, and not, like a primary quality, a feature of the external world. (1) He skeptically argues that we cannot get an idea of necessary connection by observing it through sensory experiences (Treatise, 1.3.14.12 ff.). We have no external sensory impression of causal power when we observe cause-effect relationships; all that we ever see is cause A constantly conjoined with effect B. Neither does it arise from an internal impression, such as when we introspectively reflect on willed bodily motions or willing the creation of thoughts. These internal experiences are too elusive, and nothing in them can give content to our idea of necessary connection. (2) The idea we have of necessary connection arises as follows: we experience a constant conjunction of events A and B— repeated sense experiences where events resembling A are always followed by events resembling B. This produces a habit such that upon any further appearance of A, we expect B to follow. This, in turn, produces an internal feeling of expectation “to pass from an object to the idea of its usual attendant,” which is the impression from which the idea of necessary connection is copied (Treatise, 1.3.14.20). (3) A common but mistaken notion on this topic is that necessity resides within the objects themselves. He explains this mistaken belief by the natural tendency we have to impute subjectively perceived qualities to external things (Treatise, 1.3.14.24).

d. External Objects

 

Hume’s view on external objects is that the mind is programmed to form some concept of the external world, although this concept or idea is really just a fabrication. (1) Hume’s skeptical claim here is that we have no valid conception of the existence of external things (Treatise, 1.2.6.9). (2) Nevertheless, he argues that we have an unavoidable “vulgar” or common belief in the continued existence of objects, and this idea he accounts for. His explanation is lengthy, but involves the following features. Perceptions of objects are disjointed and have no unity in and of themselves (Treatise, 1.4.2.29). In an effort to organize our perceptions, we first naturally assume that there is no distinction between our perceptions and the objects that are perceived (this is the so-called “vulgar” view of perception). We then conflate all ideas (of perceptions), which put our minds in similar dispositions (Treatise, 1.4.2.33); that is, we associate resembling ideas and attribute identity to their causes. Consequently, we naturally invent the continued and external existence of the objects (or perceptions) that produced these ideas (Treatise, 1.4.2.35). Lastly, we go on to believe in the existence of these objects because of the force of the resemblance between ideas (Treatise, 1.4.2.36). Although this belief is philosophically unjustified, Hume feels he has given an accurate account of how we inevitably arrive at the idea of external existence. (3) In contrast to the previous explanation of this idea, he recommends that we doubt a more sophisticated but erroneous notion of existence—the so-called philosophical view—which distinguishes between perceptions and the external objects that cause perceptions. The psychological motivation for accepting this view is this: our imagination tells us that resembling perceptions have a continued existence, yet our reflection tells us that they are interrupted. Appealing to both forces, we ascribe interruption to perceptions and continuance to objects (Treatise, 1.4.2.52).

e. Personal Identity

Regarding the issue of personal identity, (1) Hume’s skeptical claim is that we have no experience of a simple, individual impression that we can call the self—where the “self” is the totality of a person’s conscious life. He writes, “For my part, when I enter most intimately into what I call myself, I always stumble on some particular perception or other, of heat or cold, light or shade, love or hatred, pain or pleasure. I never can catch myself at any time without a perception, and never can observe anything but the perception” (Treatise, 1.4.6.3). (2) Even though my perceptions are fleeting and I am a bundle of different perceptions, I nevertheless have some idea of personal identity, and that must be accounted for (Treatise, 1.4.6.4). Because of the associative principles, the resemblance or causal connection within the chain of my perceptions gives rise to an idea of myself, and memory extends this idea past my immediate perceptions (Treatise, 1.4.6.18 ff.). (3) A common abuse of the notion of personal identity occurs when the idea of a soul or unchanging substance is added to give us a stronger or more unified concept of the self (Treatise, 1.4.6.6).

f. Free Will

On the issue of free will and determinism—or “liberty” and “necessity” in Hume’s terminology—Hume defends necessity. (1) He first argues that “all actions of the will have particular causes” (Treatise, 2.3.2.8), and so there is no such thing as an uncaused willful action. (2) He then defends the notion of a will that consistently responds to prior motivational causes: “our actions have a constant union with our motives, tempers, and circumstances” (Treatise, 2.3.1.4). These motives produce actions that have the same causal necessity observed in cause-effect relations that we see in external objects, such as when billiard ball A strikes and moves billiard ball B. In the same way, we regularly observe the rock-solid connection between motive A and action B, and we rely on that predictable connection in our normal lives. Suppose that a traveler, in recounting his observation of the odd behavior of natives in a distant country, told us that identical motives led to entirely different actions among these natives.  We would not believe the traveler’s report. In business, politics, and military affairs, our leaders expect predicable behavior from us insofar as the same motives within us will always result in us performing the same action. A prisoner who is soon to be executed will assume that the motivations and actions of the prison guards and the executioner are so rigidly fixed that these people will mechanically carry out their duties and perform the execution, with no chance of a change of heart (Treatise, 2.3.1.5 ff.).  (3) Lastly, Hume explains why people commonly believe in an uncaused will (Treatise, 2.3.2.1 ff.). One explanation is that people erroneously believe they have a feeling of liberty when performing actions. The reason is that, when we perform actions, we feel a kind of “looseness or indifference” in how they come about, and some people wrongly see this as “an intuitive proof of human liberty” (Treatise, 2.3.2.2).

In the Treatise Hume rejects the notion of liberty completely. While he gives no definition of “liberty” in that work, he argues that the notion is incompatible with necessity, and, at best, “liberty” simply means chance. In the Enquiry, however, he takes a more compatiblist approach. All human actions are caused by specific prior motives, but liberty and necessity are reconcilable when we define liberty as “a power of acting or not acting, according to the determinations of the will” (Enquiry, 8). Nothing in this definition of liberty is in conflict with the notion of necessity.

4. Skepticism

In all of the above discussions on epistemological topics, Hume performs a balancing act between making skeptical attacks (step 1) and offering positive theories based on natural beliefs (step 2). In the conclusion to Book 1, though, he appears to elevate his skepticism to a higher level and exposes the inherent contradictions in even his best philosophical theories. He notes three such contradictions. One centers on what we call induction. Our judgments based on past experience all contain elements of doubt; we are then impelled to make a judgment about that doubt, and since this judgment is also based on past experience it will in turn produce a new doubt. Once again, though, we are impelled to make a judgment about this second doubt, and the cycle continues. He concludes that “no finite object can subsist under a decrease repeated in infinitum.” A second contradiction involves a conflict between two theories of external perception, each of which our natural reasoning process leads us to.  One is our natural inclination to believe that we are directly seeing objects as they really are, and the other is the more philosophical view that we only ever see mental images or copies of external objects. The third contradiction involves a conflict between causal reasoning and belief in the continued existence of matter. After listing these contradictions, Hume despairs over the failure of his metaphysical reasoning:

The intense view of these manifold contradictions and imperfections in human reason has so wrought upon me, and heated my brain, that I am ready to reject all belief and reasoning, and can look upon no opinion even as more probable or likely than another [Treatise, 1.4.7.8].

He then pacifies his despair by recognizing that nature forces him to set aside his philosophical speculations and return to the normal activities of common life. He sees, though, that in time he will be drawn back into philosophical speculation in order to attack superstition and educate the world.

Hume’s emphasis on these conceptual contradictions is a unique aspect of his skepticism, and if any part of his philosophy can be designated “Humean skepticism” it is this.  However, during the course of his writing the Treatise his view of the nature of these contradictions changed. At first he felt that these contradictions were restricted to theories about the external world, but theories about the mind itself would be free from them, as he explains here:

The essence and composition of external bodies are so obscure, that we must necessarily, in our reasonings, or rather conjectures concerning them, involve ourselves in contradictions and absurdities. But as the perceptions of the mind are perfectly known, and I have us’d all imaginable caution in forming conclusions concerning them, I have always hop’d to keep clear of those contradictions, which have attended every other system [Treatise, 2.2.6.2].

When composing the Appendix to the Treatise a year later, he changed his mind and felt that theories about the mind would also have contradictions:

I had entertained some hopes, that however deficient our theory of the intellectual world might be, it wou’d be free from those contradictions, and absurdities, which seem to attend every explication, that human reason can give of the material world. But upon a more strict review of the section concerning I find myself involv’d in such a labyrinth, that, I must confess, I neither know how to correct my former opinions, nor how to render them consistent. If this be not a good general reason for scepticism, ’tis at least a sufficient one (if I were not already abundantly supplied) for me to entertain a diffidence and modesty in all my decisions [Treatise, Appendix].

Thus, in the Treatise, the skeptical bottom line is that even our best theories about both physical and mental phenomena will be plagued with contradictions. In the concluding section of his Enquiry, Hume again addresses the topic of skepticism, but treats the matter somewhat differently: he rejects extreme skepticism but accepts skepticism in a more moderate form. He associates extreme Pyrrhonian skepticism with blanket attacks on all reasoning about the external world, abstract reasoning about space and time, or causal reasoning about matters of fact. He argues, though, that we must reject such skepticism since “no durable good can ever result from it.” Instead, he recommends a more moderate or Academic skepticism that tones down Pyrrhonism by, first, exercising caution and modesty in our judgments, and, second,  by restricting our speculations to abstract reasoning and matters of fact.

5. Theory of the Passions

 

 

Like many philosophers of his time, Hume developed a theory of the passions—that is, the emotions—categorizing them and explaining the psychological mechanisms by which they arise in the human mind. His most detailed account is in Book Two of the Treatise. Passions, according to Hume, fall under the category of impressions of reflection (as opposed to impressions of sensation). He opens his discussion with a taxonomy of types of passions, which are outlined here:

Reflective Impressions

1. Calm (reflective pleasures and pains)

2. Violent

a. Direct (desire, aversion, joy, grief, hope, fear)

b. Indirect (love, hate, pride, humility)

He initially divides passions between the calm and the violent. He concedes that this distinction is imprecise, but he explains that people commonly distinguish between types of passions in terms of their degrees of forcefulness. Adding more precision to this common distinction, he maintains that calm passions are emotional feelings of pleasure and pain associated with moral and aesthetic judgments. For example, when I see a person commit a horrible deed, I will experience a feeling of pain. When I view a good work of art, I will experience a feeling of pleasure. In contrast to the calm passions, violent ones constitute the bulk of our emotions, and these divide between direct and indirect passions. For Hume, the key direct passions are desire, aversion, joy, grief, hope, and fear.  They are called “direct” because they arise immediately—without complex reflection on our part—whenever we see something good or bad. For example, if I consider an unpleasant thing, such as being burglarized, then I will feel the passion of aversion. He suggests that sometimes these passions are sparked instinctively—for example, by  my desire for food when I am hungry. Others, though, are not connected with instinct and are more the result of social conditioning. There is an interesting logic to the six direct passions, which Hume borrowed from a tradition that can be traced to ancient Greek Stoicism. We can diagram the relation between the six with this chart:

When good/bad objects are considered abstractly

Desire (towards good objects)

Aversion (towards evil objects)

When good/bad objects are actually present

Joy (towards good objects)

Grief (towards evil objects)

When good/bad objects are only anticipated

Hope (towards good objects)

Fear (towards evil objects)

Compare, for example, the passions that I will experience regarding winning the lottery vs. having my house burglarized. Suppose that I consider them purely in the abstract—or “consider’d simply” as Hume says (Treatise, 2.3.9.6). I will then desire to win the lottery and have an aversion towards being burglarized. Suppose that both situations are actually before me; I will then experience joy over winning the lottery and grief over being burglarized. Suppose, finally, that I know that at some unknown time in the future I will win the lottery and be burglarized. I will then experience hope regarding the lottery and fear of being burglarized.

Hume devotes most of Book 2 to an analysis of the indirect passions, his unique contribution to theories of the passions. The four principal passions are love, hate, pride, and humility. They are called “indirect” since they are the secondary effects of a previous feeling of pleasure and pain. Suppose, for example, that I paint a picture, which gives me a feeling of pleasure. Since I am the artist, I will then experience an additional feeling of pride. He explains in detail the psychological process that triggers indirect passions such as pride. Specifically, he argues that these passions arise from a double relation between ideas and impressions, which we can illustrate here with the passion of pride:

1. I have an initial idea of some possession, or “subject”, such as my painting, and this idea gives me pleasure.

2. Through the associative principle of resemblance, I then immediately associate this feeling of pleasure with a resembling feeling of pride (this association constitutes the first relation in the double relation).

3. This feeling of pride then causes me to have an idea of myself, as the “object” of pride.

4. Through some associative principle such as causality, I then associate the idea of myself with the idea of my painting, which is the “subject” of my pride (this association constitutes the second relation in the double relation).

According to Hume, the three other principal indirect passions arise in parallel ways. For example, if my painting is ugly and causes me pain, then I will experience the secondary passion of humility—perhaps more accurately expressed as “humiliation”. By contrast, if someone else paints a pleasing picture, then this will trigger in me a feeling of love for that artist—perhaps more accurately expressed as “esteem”. If the artist paints a painfully ugly picture, then this will trigger in me a feeling of “hatred” towards the artist—perhaps more accurately expressed as “disesteem”.

One of the most lasting contributions of Hume’s discussion of the passions is his argument that human actions must be prompted by passion, and never can be motivated by reason. Reason, he argues, is completely inert when it comes to motivating conduct, and without some emotion we would not engage in any action. Thus, he writes, “Reason is, and ought only to be the slave of the passions, and can never pretend to any other office than to serve and obey them” (Treatise, 2.3.3.4).

6. Religious Belief

Like many of Hume’s philosophical views, his position on religious belief is also skeptical. Critics of religion during the eighteenth-century needed to express themselves cautiously to avoid being fined, imprisoned, or worse. Sometimes this involved placing controversial views in the mouth of a character in a dialogue. Other times it involved adopting the persona of a deist or fideist as a means of concealing a more extreme religious skepticism. Hume used all of the rhetorical devices at his disposal, and left it to his readers to decode his most controversial conclusions on religious subjects. During the Enlightenment, there were two pillars of traditional Christian belief: natural and revealed religion. Natural religion involves knowledge of God drawn from nature through the use of logic and reason, and typically involves logical proofs regarding the existence and nature of God, such as the causal and design arguments for God’s existence. Revealed religion involves knowledge of God contained in revelation, particularly the Bible, the quintessential examples of which are biblical prophesies and miracles where God intervenes in earthly affairs to confirm the Bible’s message of salvation. Hume attacks both natural and revealed religious beliefs in his various writings.

a. Miracles

 

In a 1737 letter to Henry Home, Hume states that he intended to include a discussion of miracles in his Treatise, but ultimately left it out for fear of offending readers. His analysis of the subject eventually appeared some ten years later in his essay “Of Miracles” from the Enquiry, and is his first sustained attack on revealed religion. It is probably this main argument to which Hume refers. The first of this two-part essay contains the argument for which Hume is most famous: uniform experience of natural law outweighs the testimony of any alleged miracle. Let us imagine a scale with two balancing pans. In the first pan we place the strongest evidence in support of the occurrence of a miracle. In the second we place our life-long experience of consistent laws of nature. According to Hume, the second pan will always outweigh the first. He writes:

It is experience only, which gives authority to human testimony [regarding miracles]; and it is the same experience, which assures us of the laws of nature. When, therefore, these two kinds of experience are contrary, we have nothing to do but subtract the one from the other, and embrace an opinion, either on one side or the other, with that assurance which arises from the remainder. But according to the principle here explained, this subtraction, with regard to all popular religions, amounts to an entire annihilation [Enquiry, 10.1].

Regardless of how strong the testimony is in favor of a given miracle, it can never come close to counterbalancing the overwhelming experience of unvaried laws of nature. Thus, proportioning one’s belief to the evidence, the wise person must reject the weaker evidence concerning the alleged miracle.

In the second part of “Of Miracles”, Hume discusses four factors that count against the credibility of most miracle testimonies: (1) witnesses of miracles typically lack integrity; (2) we are naturally inclined to enjoy sensational stories, and this has us uncritically perpetuate miracle accounts; (3) miracle testimonies occur most often in less civilized countries; and (4) miracles support rival religious systems and thus discredit each other. But even if a miracle testimony is not encumbered by these four factors, we should still not believe it since it would be contrary to our consistent experience of laws of nature. He concludes his essay with the following cryptic comment about Christian belief in biblical miracles:

upon the whole, we may conclude, that the Christian Religion not only was at first attended with miracles, but even at this day cannot be believed by any reasonable person without one. Mere reason is insufficient to convince us of its veracity: And whoever is moved by Faith to assent to it, is conscious of a continued miracle in his own person, which subverts all the principles of his understanding, and gives him a determination to believe what is most contrary to custom and experience [Enquiry, 10.2].

At face value, his comment suggests a fideist approach to religious belief such as what Pascal recommends. That is, reason is incapable of establishing religious belief, and God must perform a miracle in our lives to make us open to belief through faith. However, according to the eighteenth-century Hume critic John Briggs, Hume’s real point is that belief in Christianity requires “miraculous stupidity” (The Nature of Religious Zeal, 1775).

b. Psychology of Religious Belief

Another attack on revealed religion appears in Hume’s essay “The Natural History of Religion” (1757). It is one of the first systematic attempts to explain the causes of religious belief solely in terms of psychological and sociological factors. We might see the “Natural History” as an answer to a challenge, such as the sort that William Adams poses here in his attack on Hume’s “Of Miracles”:

Whence could the religion and laws of this people [i.e., the Jews] so far exceed those of the wisest Heathens, and come out at once, in their first infancy, thus perfect and entire; when all human systems are found to grow up by degrees, and to ripen, after many improvements; into perfection [An Essay, Part 2]?

According to Adams, only divine intervention can account for the sophistication of the ancient Jewish religion. In the “Natural History,” though, Hume offers an alternative explanation, and one that is grounded solely in human nature, without God’s direct involvement in human history.

The work may be divided into three parts. In the first (Sections 1 and 4), Hume argues that polytheism, and not monotheism, was the original religion of primitive humans. Monotheism, he believes, was only a later development that emerged with the progress of various societies. The standard theory in Judeo-Christian theology was that early humans first believed in a single God, but as religious corruption crept in, people lapsed into polytheism. Hume was the first writer to systematically defend the position of original polytheism. In the second part (Sections 2-3, 5-8), Hume establishes the psychological principles that give rise to popular religious belief. His thesis is that natural instincts—such as fear and the propensity to adulate—are the true causes of popular religious belief, and not divine intervention or rational argument. The third part of this work (Sections 9-15) compares various aspects of polytheism with monotheism, showing that one is no more superior than the other. Both contain points of absurdity. From this he concludes that we should suspend belief on the entire subject of religious truth.

c. Arguments for God’s Existence

Around the same time that Hume was composing his “Natural History of Religion” he was also working on his Dialogues Concerning Natural Religion, which appeared in print two decades later, after his death. As the title of the work implies, it is a critique of natural religion, in contrast with revealed religion. There are three principal characters in the Dialogues. A character named Cleanthes, who espouses religious empiricism, defends the design argument for God’s existence, but rejects the causal argument. Next, a character named Demea, who is a religious rationalist, defends the causal argument for God’s existence, but rejects the design argument. Finally, a character named Philo, who is a religious skeptic, argues against both the design and causal arguments. The main assaults on theistic proofs are conveyed by both Cleanthes and Philo, and, to that extent, both of their critiques likely represent Hume’s views.

The specific version of the causal argument that Hume examines is one by Samuel Clarke (and Leibniz before him). Simplistic versions of the causal argument maintain that when we trace back the causes of things in the universe, the chain of causes cannot go back in time to infinity past; there must be a first cause to the causal sequence, which is God. Clarke’s version differs in that it is theoretically possible for causal sequences of events to trace back through time to infinity past. Thus, we cannot argue that God’s existence is required to initiate a sequence of temporal causes. Nevertheless, Clarke argued, an important fact still needs to be explained: the fact that this infinite temporal sequence of causal events exists at all. Why does something exist rather than nothing? God, then, is the necessary cause of the whole series. In response, the character Cleanthes argues that the flaw in the cosmological argument consists in assuming that there is some larger fact about the universe that needs explaining beyond the particular items in the series itself. Once we have a sufficient explanation for each particular fact in the infinite sequence of events, it makes no sense to inquire about the origin of the collection of these facts. That is, once we adequately account for each individual fact, this constitutes a sufficient explanation of the whole collection. He writes, “Did I show you the particular causes of each individual in a collection of twenty particles of matter, I should think it very unreasonable, should you afterwards ask me, what was the cause of the whole twenty” (Dialogues, 9).

The design argument for God’s existence is that the appearance of design in the natural world is evidence for the existence of a divine designer. The specific version of the argument that Hume examines is one from analogy, as stated here by Cleanthes:

The curious adapting of means to ends, throughout all nature, resembles exactly, though it much exceeds, the productions of human contrivance; of human designs, thought, wisdom, and intelligence. Since, therefore, the effects resemble each other, we are led to infer, by all the rules of analogy, that the causes also resemble; and that the Author of Nature is somewhat similar to the mind of man (Dialogues, 2).

Philo presents several criticisms against the design argument, many of which are now standard in discussions of the issue. According to Philo, the design argument is based on a faulty analogy: we do not know whether the order in nature was the result of design, since, unlike our experience with the creation of machines, we did not witness the formation of the world. In Philo’s words, “will any man tell me with a serious countenance, that an orderly universe must arise from some thought and art like the human, because we have experience of it? To ascertain this reasoning, it were requisite that we had experience of the origin of worlds; and it is not sufficient, surely, that we have seen ships and cities arise from human art and contrivance” (ibid). Further, the vastness of the universe also weakens any comparison with human artifacts. Although the universe is orderly here, it may be chaotic elsewhere. Similarly, if intelligent design is exhibited only in a small fraction of the universe, then we cannot say that it is the productive force of the whole universe. Philo states that “A very small part of this great system, during a very short time, is very imperfectly discovered to us; and do we thence pronounce decisively concerning the origin of the whole?” (ibid). Philo also argues that natural design may be accounted for by nature alone, insofar as matter may contain within itself a principle of order, and “This at once solves all difficulties” (Dialogues, 6). And even if the design of the universe is of divine origin, we are not justified in concluding that this divine cause is a single, all powerful, or all good being. According to Philo, “Whether all these attributes are united in one subject, or dispersed among several independent beings, by what phenomena in nature can we pretend to decide the controversy?” (Dialogues 5).

7. Moral Theory

Hume’s moral theory appears in Book 3 of the Treatise and in An Enquiry Concerning the Principles of Morals (1751). He opens his discussion in the Treatise by telling us what moral approval is not: it is not a rational judgment about either conceptual relations or empirical facts. To make his case he criticizes Samuel Clarke’s rationalistic account of morality, which is that we rationally judge the fitness or unfitness of our actions in reference to eternal laws of righteousness, that are self-evidently known to all humans, just as is our knowledge of mathematical relations. Hume presents several arguments against Clarke’s view, one of which is an analogy from arboreal parricide: a young tree that overgrows and kills its parent exhibits the same alleged relations as a human child killing his parent. “Is not the one tree the cause of the other’s existence; and the latter the cause of the destruction of the former, in the same manner as when a child murders his parent?” (Treatise, 3.1.1.24). If morality is a question of relations, then the young tree is immoral, which is absurd. Hume also argues that moral assessments are not judgments about empirical facts. Take any immoral action, such as willful murder: “examine it in all lights, and see if you can find that matter of fact, or real existence, which you call vice” (Treatise, 3.1.1.25). You will not find any such fact, but only your own feelings of disapproval. In this context Hume makes his point that we cannot derive statements of obligation from statements of fact. When surveying various moral theories, Hume writes, “I am surpriz’d to find, that instead of the usual copulations of propositions, is, and is not, I meet with no proposition that is not connected with an ought or an ought not” (Treatise, 3.1.1.26). This move from is to ought is illegitimate, he argues, and is why people erroneously believe that morality is grounded in rational judgments.

Thus far Hume has only told us what moral approval is not, namely a judgment of reason. So what then does moral approval consist of? It is an emotional response, not a rational one. The details of this part of his theory rest on a distinction between three psychologically distinct players: the moral agent, the receiver, and the moral spectator. The moral agent is the person who performs an action, such as stealing a car; the receiver is the person impacted by the conduct, such as the owner of the stolen car; and the moral spectator is the person who observes and, in this case, disapproves of the agent’s action. This agent-receiver-spectator distinction is the product of earlier moral sense theories championed by the Earl of Shaftesbury (1671-1713), Joseph Butler (1692-1752), and Francis Hutcheson (1694-1747). Most generally, moral sense theories maintained that humans have a faculty of moral perception, similar to our faculties of sensory perception. Just as our external senses detect qualities in external objects, such as colors and shapes, so too does our moral faculty detect good and bad moral qualities in people and actions.

For Hume, all actions of a moral agent are motivated by character traits, specifically either virtuous or vicious character traits. For example, if you donate money to a charity, then your action is motivated by a virtuous character trait. Hume argues that some virtuous character traits are instinctive or natural, such as benevolence, and others are acquired or artificial, such as justice. As an agent, your action will have an effect on a receiver. For example, if you as the agent give food to a starving person, then the receiver will experience an immediately agreeable feeling from your act. Also, the receiver may see the usefulness of your food donation, insofar as eating food will improve his health. When considering the usefulness of your food donation, then, the receiver will receive another agreeable feeling from your act. Finally, I, as a spectator, observe these agreeable feelings that the receiver experiences. I, then, will sympathetically experience agreeable feelings along with the receiver. These sympathetic feelings of pleasure constitute my moral approval of the original act of charity that you, the agent, perform. By sympathetically experiencing this pleasure, I thereby pronounce your motivating character trait to be a virtue, as opposed to a vice. Suppose, on the other hand, that you as an agent did something to hurt the receiver, such as steal his car. I as the spectator would then sympathetically experience the receiver’s pain and thereby pronounce your motivating character trait to be a vice, as opposed to a virtue.

In short, that is Hume’s overall theory. There are, though, some important details that should also be mentioned. First, it is tricky to determine whether an agent’s motivating character trait is natural or artificial, and Hume decides this one virtue at a time. For Hume, the natural virtues include benevolence, meekness, charity, and generosity. By contrast, the artificial virtues include justice, keeping promises, allegiance and chastity. Contrary to what one might expect, Hume classifies the key virtues that are necessary for a well-ordered state as artificial, and he classifies only the more supererogatory virtues as natural. Hume’s critics were quick to point out this paradox. Second, to spark a feeling of moral approval, the spectator does not have to actually witness the effect of an agent’s action upon a receiver. The spectator might simply hear about it, or the spectator might even simply invent an entire scenario and think about the possible effects of hypothetical actions. This happens when we have moral reactions when reading works of fiction: “a very play or romance may afford us instances of this pleasure, which virtue conveys to us; and pain, which arises from vices” (Treatise, 3.1.2.2).

Third, although the agent, receiver, and spectator have psychologically distinct roles, in some situations a single person may perform more than one of these roles. For example, if I as an agent donate to charity, as a spectator to my own action I can also sympathize with the effect of my donation on the receiver. Finally, given various combinations of spectators and receivers, Hume concludes that there are four irreducible categories of qualities that exhaustively constitute moral virtue: (1) qualities useful to others, which include benevolence, meekness, charity, justice, fidelity and veracity; (2) qualities useful to oneself, which include industry, perseverance, and patience; (3) qualities immediately agreeable to others, which include wit, eloquence and cleanliness; and (4) qualities immediately agreeable to oneself, which include good humor, self-esteem and pride. For Hume, most morally significant qualities and actions seem to fall into more than one of these categories. When Hume spoke about an agent’s “useful” consequences, he often used the word “utility” as a synonym. This is particularly so in the Enquiry Concerning the Principles of Morals where the term “utility” appears over 50 times. Moral theorists after Hume thus depicted his moral theory as the “theory of utility”—namely, that morality involves assessing the pleasing and painful consequences of actions on the receiver. It is this concept and terminology that inspired classic utilitarian philosophers, such as Jeremy Bentham (1748–1832).

8. Aesthetic, Political, and Economic Theory

Hume wrote two influential essays on the subject of aesthetic theory. In “Of Tragedy” (1757) he discusses the psychological reasons why we enjoy observing depictions of tragic events in theatrical production. He argues that “the energy of expression, the power of numbers, and the charm of imitation” convey the sense of pleasure. He particularly stresses the technical artistry involved when an artistic work imitates the original. In “Of the Standard of Taste” (1757) he argues that there is a uniform sense of artistic judgment in human nature, similar to our uniform sense of moral judgment. Specific objects consistently trigger feelings of beauty within us, as our human nature dictates. Just as we can refine our external senses such as our palate, we can also refine our sense of artistic beauty and thus cultivate a delicacy of taste. In spite of this uniform standard of taste, two factors create some difference in our judgments: “the one is the different humours of particular men; the other, the particular manners and opinions of our age and country.”

In political theory, Hume has both theoretical discussions on the origins of government and more informal essays on popular political controversies of his day. In his theoretical discussions, he attacks two basic notions in eighteenth-century political philosophy: the social contract and the instinctive nature of justice regarding private property. In his 1748 essay “Of the Original Contract,” he argues that political allegiance is not grounded in any social contract, but instead on our general observation that society cannot be maintained without a governmental system. He concedes that in savage times there may have been an unwritten contract among tribe members for the sake of peace and order. However, he argues, this was no permanent basis of government as social contract theorists pretend. There is nothing to transmit that original contract onwards from generation to generation, and our experience of actual political events shows that governmental authority is founded on conquest, not elections or consent. We do not even tacitly consent to a contract since many of us have no real choice about remaining in our countries: “Can we seriously say that a poor peasant or artisan has a free choice to leave his country, when he knows no foreign language or manners, and lives from day to day by the small wages which he acquires?” Political allegiance, he concludes, is ultimately based on a primary instinct of selfishness, and only through reflection will we see how we benefit from an orderly society.

Concerning private property, in both the Treatise and the Enquiry Concerning the Principles of Morals (1751), Hume in essence argues against Locke’s notion of the natural right to private property. For Hume, we have no primary instinct to recognize private property, and all conceptions of justice regarding property are founded solely on how useful the convention of property is to us. We can see how property ownership is tied to usefulness when considering scenarios concerning the availability of necessities. When necessities are in overabundance, I can take what I want any time, and there is no usefulness in my claiming any property as my own. When the opposite happens and necessities are scarce, I do not acknowledge anyone’s claim to property and take what I want from others for my own survival. Thus, “the rules of equity or justice [regarding property] depend entirely on the particular state and condition in which men are placed, and owe their origin and existence to that utility, which results to the public from their strict and regular observance” (Enquiry Concerning the Principles of Morals, 3). Further, if we closely inspect human nature, we will never find a primary instinct that inclines us to acknowledge private property. It is nothing like the primary instinct of nest building in birds. While the sense of justice regarding private property is a firmly fixed habit, it is nevertheless its usefulness to society that gives it value.

As for Hume’s informal essays on popular political controversies, several of these involve party disputes between the politically conservative Tory party that supported a strong monarchy, and the politically liberal Whig party which supported a constitutional government. Two consistent themes emerge in these essays. First, in securing peace, a monarchy with strong authority is probably better than a pure republic. Hume sides with the Tories because of their traditional support of the monarchy. Except in extreme cases, he opposes the Lockean argument offered by Whigs that justifies overthrowing political authorities when those authorities fail to protect the rights of the people. Hume notes, though, that monarchies and republics each have their strong points. Monarchies encourage the arts, and republics encourage science and trade. Hume also appreciates the mixed form of government within Great Britain, which fosters liberty of the press. The second theme in Hume’s political essays is that revolutions and civil wars principally arise from zealousness within party factions. Political moderation, he argues, is the best antidote to potentially ruinous party conflict.

In economic theory, Hume wrote influential essays on money, interest, trade, credit, and taxes. Many of these target the mercantile system and its view that a country increases its wealth by increasing the quantity of gold and silver in that country. For mercantilists, three means were commonly employed to this end: (1) capture gold, silver and raw material from other countries through colonization; (2) discourage imports through tariffs and monopolies, which keeps acquired gold and silver within one’s country’s borders; and, (3) increase exports, which brings in money from outside countries. In Great Britain, mercantile policies were instituted through the Navigation Acts, which prohibited trade between British colonies and foreign countries. These protectionist laws ultimately led to the American Revolution. The most famous of Hume’s anti-mercantilist arguments is now called Hume’s gold-flow theory, and appears in his essays “Of Money” (1752) and “Of the Balance of Trade” (1752). Contrary to mercantilists who advocated locking up money in one’s home country, Hume argued that increased money in one country automatically disperses to other countries. Suppose, for example, that Great Britain receives an influx of new money. This new money will drive up prices of labor and domestic products in Great Britain. Products in foreign countries, then, will be cheaper than in Great Britain; Britain, then, will import these products, thereby sending new money to foreign countries. Hume compares this reshuffling of wealth to the level of fluids in interconnected chambers: if I add fluid to one chamber, then, under the weight of gravity, this will disperse to the others until the level is the same in all chambers. A similar phenomenon will occur if we lose money in our home country by purchasing imports from foreign countries. As the quantity of money decreases in our home country, this will drive down the prices of labor and domestic products. Our products, then, will be cheaper than foreign products, and we will gain money through exports. On the fluid analogy, by removing fluid from one chamber, more fluid is drawn in from surrounding chambers.

9. History and Philosophy

Although Hume is now remembered mainly as a philosopher, in his own day he had at least as much impact as a historian. His History of England appeared in four installments between 1754 and 1762 and covers the periods of British history from most ancient times through the seventeenth-century. To his 18th and 19th century readers, he was not just another historian, but a uniquely philosophical historian who had an ability to look into the minds of historical figures and uncover the motives behind their conduct. A political theme underlying the whole History is, once again, a conflict between Tory and Whig ideology. In the Britain of Hume’s day, a major point of contention between the two parties was whether the English government was historically an absolute or limited monarchy. Tories believed that it was traditionally absolute, with governmental authority being grounded in royal prerogative. Whigs, on the other hand, believed that it was traditionally limited, with the foundation of government resting in the individual liberty of the people, as expressed in the parliamentary voice of the commons. As a historian, Hume felt that he was politically moderate, tending to see both the strengths and weaknesses in opposing viewpoints:

With regard to politics and the character of princes and great men, I think I am very moderate. My views of things are more conformable to Whig principles; my representations of persons to Tory prejudices. Nothing can so much prove that men commonly regard more persons than things, as to find that I am commonly numbered among the Tories [Hume to John Clephane, 1756].

However, to radical Whig British readers, Hume was a conservative Tory who defended royal prerogative.

Hume takes two distinct positions on the prerogative issue. From a theoretical and idealistic perspective, he favored a mixed constitution, mediating between the authority of the monarch and that of the Parliament. Discussing this issue in his 1741 Essays, he holds that we should learn “the lesson of moderation in all our political controversies.” However, from the perspective of how British history actually unfolded, he emphasized royal prerogative. And, as a “philosophical historian,” he tried to show how human nature gave rise to the tendency towards royal prerogative. In his brief autobiography, “My Own Life,” he says that he rejected the “senseless clamour” of Whig ideology, and believed “It is ridiculous to consider the English constitution before that period [of the Stuart Monarchs] as a regular plan of liberty.” Gilbert Stuart best encapsulated Hume’s historical stance on the prerogative issue: “his history, from its beginning to its conclusion, is chiefly to be regarded as a plausible defence of prerogative” (A View of Society in Europe, 1778, 2.1.1). In short, Hume’s Tory narrative is this. As early as the Anglo Saxon period, the commons did not participate in the king’s advisory council. The Witenagemot, for example, was only a council of nobles and bishops, which the king could listen to or ignore as he saw fit. Throughout the succeeding centuries, England’s great kings were those who exercised absolute rule, and took advantage of prerogative courts such as the Star Chamber. Elizabeth—England’s most beloved monarch—was in fact a tyrant, and her reign was much like that of a Turkish sultan. Charles I—a largely virtuous man—tried to follow in her footsteps as a strong monarch. After a few minor lapses in judgment, and a few too many concessions to Catholics, Protestant zealots rose up against him, and he was ultimately executed. To avoid over-characterizing royal prerogative, Hume occasionally condemns arbitrary actions of monarchs and praises efforts for preserving liberty. Nevertheless, Whig critics like Gilbert Stuart argued that Hume’s emphasis was decisively in favor of prerogative.

There is an irony to Hume’s preference for prerogative over civil liberty. His philosophical writings were among the most controversial pieces of literature of the time, and would have been impossible to publish if Britain was not a friend to liberty. Although Hume was certainly no enemy to liberty, he believed that it was best achieved through moderation rather than Whig radicalism. He writes, “If any other rule than established practice be followed, factions and dissentions must multiply without end” (History, Appendix 3). To Hume’s way of thinking, the loudest voices favoring liberty were Calvinistic religious fanatics who accomplished little more than dissention. A strong, centralized and moderating force was the best way to avoid factious disruption from the start.

10. References and Further Reading

a. Recent Editions of Hume’s Writings

There are many published editions of Hume’s writings, the best of which are as follows (listed chronologically).

  • The Philosophical Works of David Hume (1874-1875), ed. T.H. Green and T.H. Grose.
    • This four-volume set was the definitive edition of the late nineteenth century, and is the text source of many individually published books on Hume. It does not include the History. Electronic versions of this edition are freely available on the internet.
  • Hume’s Dialogues Concerning Natural Religion (1935), ed., Norman Kemp Smith.
    • This is the definitive edition of this work, and contains a ground-breaking introductory essay.
  • Enquiries (Oxford, 1975) ed. L.A. Selby-Bigge and P.H. Nidditch.
    • This contains Hume’s two Enquiries, and was the definitive edition of these works in the late twentieth-century.
  • Treatise of Human Nature (Oxford, 1978), ed. L.A. Selby-Bigge and P.H. Nidditch.
    • This was the definitive edition of this work in the late twentieth-century.
  • History of England (Liberty Classics, 1983).
    • This is the definitive edition of this work.
  • Essays, Moral, Political and Literary (Liberty Classics, 1987), ed. E.F. Miller.
    • This is the definitive edition of this work.
  • The Clarendon Edition of the Works of David Hume (1998-ongoing), ed. Tom L. Beauchamp, Mark Box, David Fate Norton, Mary Norton, M.A. Stewart.
    • This is a carefully-researched critical edition of Hume’s philosophical works, and supersedes all previous editions. Volumes on Hume’s Essays and the Dialogues are forthcoming; it does not include Hume’s History.

b. Chronological List of Hume’s Publications

  • A Treatise of Human Nature: Being an Attempt to Introduce the Experimental Method of Reasoning into Moral Subjects (1739-40).
    • This was published anonymously in three volumes: Vol. I. Of the Understanding; Vol. II. Of the Passions. Vol. III. Of Morals. This is Hume’s principle philosophical work, the central notions of which were rewritten more popularly in Philosophical Essays Concerning Human Understanding (1748) and An Enquiry Concerning the Principles of Morals (1751).
  • An Abstract of a Book lately Published;  entituled, A Treatise of Human Nature, &c. Wherein the chief Argument  of that Book is farther Illustrated and Explained (1740).
    • This is a sixteen page pamphlet, published anonymously as an effort to bring attention to the Treatise.
  • Essays, Moral and Political (1741-1742).
    • This was published anonymously in two volumes in 1741 and 1742 respectively. In subsequent editions some essays were dropped and others added; the collection was eventually combined with his Political Discourses (1752) and retitled Essays, Moral, Political and Literary in Hume’s collection of philosophical works, Essays and Treatises on Several Subjects (1753).
  • A Letter from a Gentleman to his Friend in Edinburgh: Containing Some Observations on a Specimen of the Principles concerning Religion and Morality, said to be maintain’d in a Book lately publish’d, intituled, A Treatise of Human Nature, &c (1745).
    • This thirty-four page pamphlet was published anonymously and was prompted by Hume’s candidacy for the Chair of Moral Philosophy at the University of Edinburgh. The pamphlet responds to criticisms regarding the Treatise.
  • Philosophical Essays Concerning Human Understanding. By the Author of the Essays Moral and Political (1748).
    • This was published anonymously and later retitled Enquiry Concerning Human Understanding. It is a popularized version of key themes that appear mainly in the Treatise, Book 1.
  • A True Account of the Behaviour and conduct of Archibald Stewart, Esq; late Lord Provost of Edinburgh. In a letter to a Friend (1748).
    • This fifty-one page pamphlet was published anonymously as a defense of Archibald Stewart, Lord Provost of Edinburgh, surrounding a political controversy.
  • An Enquiry Concerning the Principles of Morals. By David Hume, Esq. (1751).
    • This is a popularized version of key themes that appear mainly in the Treatise, Book 3.
  • The Petition of the Grave and Venerable Bellmen (or Sextons) of the Church of Scotland (1751).
    • This was published anonymously and involved the Church of Scotland’s efforts to increase their stipends.
  • Political discourses. By David Hume Esq. (1752).
    • This is a collection of essays on economic and political subjects, which was eventually combined with his Essays Moral and Political (1741-1742) and retitled Essays, Moral, Political and Literary in Hume’s collection of philosophical works, Essays and Treatises on Several Subjects (1753).
  • Scotticisms (1752).
    • This six page pamphlet was published anonymously, and lists Scottish idioms.
  • The History of England, from the Invasion of Julius Cæsar to  the  Revolution in 1688 (1754-1762).
    • This was published in four installments: (a) The history of Great Britain. Vol. I.  Containing the reigns of James I. and Charles I. By David Hume, Esq. (1754); (b) The history of Great Britain. Vol. II.  Containing the Commonwealth, and the reigns of Charles II. and James  II. By David Hume, Esq. (1757); (c) The history of England, under the House of Tudor Comprehending the reigns of K. Henry VII. K. Henry VIII. K. Edward VI. Q. Mary, and Q. Elizabeth. . . .  By David Hume,  Esq (1759); (d) The history of England, from the invasion of Julius Cæsar to the  accession of Henry VII. . . .  By David Hume, Esq. (1762).
  • Essays and Treatises on Several Subjects. By David Hume, Esq; In four  volumes (1753).
    • This is Hume’s handpicked collection of philosophical works, which includes (a) Essays, Moral and Political, (b) Philosophical Essays concerning Human Understanding, (c) An Enquiry Concerning the Principles of Morals, and (d) Political Discourses. Essays from Four Dissertations (1757) were added to later editions. The collection does not include the Treatise.
  • Four Dissertations. I. The Natural History of Religion. II. Of the  Passions. III. Of Tragedy. IV. Of the Standard of Taste. By David Hume,  esq. (1757).
    • This volume was originally to include “Of Suicide” and “Of the Immortality of the Soul,” which were removed at the last minute and appeared separately in 1783 in an unauthorized posthumous edition. The four essays in Four Dissertations were later added to various sections of Essays and Treatises on Several Subjects.
  • Letter to Critical Review, April 1759, Vol. 7. pp. 323-334.
    • This is a defense of William Wilkie’s epic poem Epigoniad.
  • Exposé succinct de la contestation qui s’est élevée entre M. Hume et M. Rousseau, avec les pieces justificatives (1766).
    • This is a pamphlet containing letters between Hume and Rousseau, published anonymously, translated from English by J.B.A. Suard. The pamphlet was translated back to English in A Concise and Genuine account of the Dispute between Mr. Hume and Mr. Rousseau: with the Letters that Passed Between them during their Controversy (1766).
  • Advertisement to Baron Manstein’s Memoirs of Russia, Historical, Political and Military, from MDCXXVII, to MDCXLIV (1770).
    • This is an opening advertisement by Hume to Manstein’s work.
  • The Life of David Hume, Esq. Written by Himself (1777). This contains “My Own Life” and “Letter from Adam Smith, LL.D. to William Strahan, Esq.”
  • Dialogues Concerning Natural religion. By David Hume, Esq. (1779).
    • This is a posthumous edition from Hume’s unpublished manuscript, and contains his most detailed attack on natural religion.
  • Essays on Suicide, and the Immortality of the Soul,  ascribed to the late David Hume, Esq. Never before Published. With  Remarks, intended as an Antidote to the Poison contained in these  Performances, by the Editor. To which is added, Two Letters on Suicide,  from Rosseau’s [sic] Eloisa (1783).
    • This is an unauthorized publication of the two essays that were originally associated with Four Dissertations.

c. Biographies, Letters, Manuscripts

  • Greig, J.Y.T. Letters of David Hume (1932), two volumes.
    • This is the best collection of Hume’s letters (along with the supplementary volume by Klibansky below).
  • Greig, J.Y.T.; Beynon, Harold, eds. “Calendar of Hume MSS. in the possession of the Royal Society,” in Proceedings of the Royal Society of Edinburgh, 1931–1932, Vol. 52, pp. 1–138.
    • This is a detailed list of the Hume manuscripts with a contents summary of each item, and has been published separately in book.
  • Klibansky, Raymond; Mossner, Ernest. New Letters of David Hume (1954).
    • This volume of letters is a supplement to Greig’s two volume collection above.
  • Waldmann, Felix. Further Letters of David Hume (2014).
    • This volume contains newly discovered letters.
  • Mossner, E.C.  The Life of David Hume (Oxford, 1980).
    • This is the best biography of Hume.
  • National Library of Scotland (MS no. 23151–23163).
    • This manuscript deposit contains thirteen large volumes assembled from papers collected by Hume’s family after his death. It includes around 150 letters by Hume, 525 letters to him, and several manuscripts of published and unpublished works, most importantly the manuscript of his Dialogues Concerning Natural Religion.

d. Bibliographies

  • “The Hume Literature,” Hume Studies, 1977-present.
    • Each year Hume Studies publishes an annual bibliographical update; these bibliographies exclude articles that appear in Hume Studies itself.
  • “An Index of Hume Studies: 1975-1993.” Hume Studies 19.2 (1993).
    • This is a bibliographical index of articles that have appeared in Hume Studies since the journal’s inception until 1993.
  • Fieser, James. A Bibliography of Hume’s Writings and Early Responses (2005).
    • This is a bibliography of Hume’s writings and eighteenth and nineteenth-century responses. It is freely available on the internet.
  • Hall, Rolland. Fifty Years of Hume Scholarship: A Bibliographic Guide (1978).
    • This bibliography covers from 1925 through 1976, and is continued by the annual Hume bibliographies in Hume Studies.
  • Jessop, Thomas Edmund. A bibliography of David Hume and of Scottish philosophy from Francis Hutcheson to Lord Balfour (1938).
    • This is the first published scholarly bibliographical work on Hume, early responses to Hume, and other Scottish philosophers.
  • Tweyman, Stanley. Secondary Sources on the Philosophy of David Hume: A David Hume Bibliography in Two Volumes, 1741-2005 (2006).

e. Works on Hume

The secondary literature on Hume is voluminous. Below are a few works that cover all aspects of Hume’s philosophy. For works on specific aspects of Hume, such as his epistemology, see other IEP articles on Hume.

  • Ayer, A.J. Hume (1980).
    • This is a short but informative introduction by a great twentieth-century philosopher who sees himself as following in the Humean tradition.
  • Blackburn, Simon. How to Read Hume (2008).
    • This is a concise work on various aspects of Hume’s philosophy.
  • Fieser, James. Early Responses to Hume’s Writings (London: Continuum, 2005), 10 volumes.
    • This contains the principal eighteenth and nineteenth-century criticisms of Hume.
  • Kemp Smith, Norman. The Philosophy of David Hume (1941).
    • This groundbreaking book set a new direction for Hume scholarship.
  • Hume Studies, 1977-present.
    • This journal is devoted to Hume scholarship, and most of the volumes are freely accessible on the Hume Studies web site.
  • Jones, Peter, ed. The Reception of David Hume in Europe. London, New York: Thoemmes Continuum, 2005.
    • This work contains chapters by different writers on Hume’s impact in different European countries.
  • Norton, David Fate; Taylor, Jacqueline. The Cambridge Companion to Hume (2008).
    • This contains essays by different writers on various aspects of Hume’s philosophy.
  • Stroud, Barry. Hume (1981).
    • This is an influential analytic discussion of various problems that arise within Hume’s philosophy.

Author Information

James Fieser
Email: jfieser@utm.edu
University of Tennessee at Martin
U. S. A.

Lambda Calculi

Lambda calculi (λ-calculi) are formal systems describing functions and function application. One of them, the untyped version, is often referred to as the λ-calculus. This exposition will adopt this convention. At its core, the λ-calculus is a formal language with certain reduction rules intended to capture the notion of function application [Church, 1932, p. 352]. Through the primitive notion of function application, the λ-calculus also serves as a prototypical programming language. The philosophical significance of the calculus comes from the expressive power of a seemingly simple formal system. The λ-calculus is in fact a family of formal systems, all stemming from the same common root. The variety and expressiveness of these calculi yield results in formal logic, recursive function theory, the foundations of mathematics, and programming language theory. After a discussion of the history of the λ-calculus, we will explore the expressiveness of the untyped λ-calculus. Then we will investigate the role of the untyped lambda calculus in providing a negative answer to Hilbert’s Entscheidungsproblem and in forming the Church-Turing thesis. After this, we will take a brief foray into typed λ-calculi and discuss the Curry-Howard isomorphism, which provides a correspondence between these calculi (which again are prototypical programming languages) and systems of natural deduction.

Table of Contents

  1. History
    1. Main Developments
    2. Notes
  2. The Untyped Lambda Calculus
    1. Syntax
    2. Substitution
    3. α-equivalence
    4. β-equivalence
    5. η-equivalence
    6. Notes
  3. Expressiveness
    1. Booleans
    2. Church Numerals
    3. Notes
  4. Philosophical Importance
    1. The Church-Turing Thesis
    2. An Answer to Hilbert’s Entscheidungsproblem
    3. Notes
  5. Types and Programming Languages
    1. Overview
    2. Notes
  6. References and Further Reading

1. History

a. Main Developments

Alonzo Church first introduced the λ-calculus as “A set of postulates for the foundation of logic” in two papers of that title published in 1932 and 1933. Church believed that “the entities of formal logic are abstractions, invented because of their use in describing and systematizing facts of experience or observation, and their properties, determined in rough outline by this intended use, depend for their exact character on the arbitrary choice of the inventor” [Church, 1932, p. 348]. The intended use of the formal system Church developed was, as mentioned in the introduction, function application. Intuitively, the expression (later called a term) λx.x2 corresponds to an anonymous definition of the mathematical function f(x) = x2. An “anonymous definition” of a function refers to a function defined without a name; in the current case, instead of defining a function “f”, an anonymous definition corresponds to the mathematician’s style of defining a function as a mapping, such as “xx2”. Do note that the operation of squaring is not yet explicitly defined in the λ-calculus. Later, it will be shown how it can be. For our present purposes, the use of squaring is pedagogical. By limiting the use of free variables and the law of excluded middle in his system in certain ways, Church hoped to escape paradoxes of transfinite set theory [Church, 1932, p. 346–8]. The original formal system of 1932–1933 turned out, however, not to be consistent. In it, Church defined many symbols besides function definition and application: a two-place predicate for extensional equality, an existential quantifier, negation, conjunction, and the unique solution of a function. In 1935, Church’s students Kleene and Rosser, using this full range of symbols and Gödel’s method of representing the syntax of a language numerically so that a system can express statements about itself, proved that any formula is provable in Church’s original system. In 1935, Church isolated the portion of his formal system dealing solely with functions and proved the consistency of this system. One idiosyncratic feature of the system of 1935 was eliminated in Church’s 1936b paper, which introduced what is now known as the untyped λ-calculus. This 1936 paper also provides the first exposition of the Church-Turing thesis and a negative answer to Hilbert’s famous problem of determining whether a given formula in a formal system is provable. We will discuss these results later.

b. Notes

For a thorough history of the λ-calculus, including many modern developments, with a plethora of pointers to primary and secondary literature, see Cardone and Hindley [2006]. Church [1932] is very accessible and provides some insight into Church’s thinking about formal logic in general. Though the inconsistency proof of Kleene and Rosser [1935] is a formidable argument, the first page of their paper provides an overview of their proof strategy which should be comprehensible to anyone with some mathematical/logical background and rough understanding of the arithmetization of syntax á la Gödel. According to Barendregt [1997], Church originally intended to use the notation ˆx.x2. His original typesetter could not typeset this, and so moved the hat in front of the x, which another typesetter than read as λx.x2. In an unpublished letter, Church writes that he placed the hat in front, not a typesetter. According to Cardone and Hindley [2006, p. 7], Church later contradicted his earlier account, telling enquirers that he was in need of a symbol and just happened to pick λ.

2. The Untyped Lambda Calculus

At its most basic level, the λ-calculus is a formal system with a concrete syntax and distinct reduction rules. In order to proceed properly, we must define the alphabet and syntax of our language and then the rules for forming and manipulating well-formed formulas in this language. In the process of this exposition, formal definitions will be given along with informal description of the intuitive meaning behind the expressions.

a. Syntax

The language of the λ-calculus consists of the symbols (, ), λ and a countably infinite set of variables x, y, z, … We use upper-case variables M, N, … to refer to formulas of the λ-calculus. These are not symbols in the calculus itself, but rather convenient metalinguistic abbreviations. The terms (formulas, expressions; these three will be used interchangeably) of the calculus are defined inductively as follows:

  • x is a term for any variable.
  • If M, N are terms, then (MN) is a term.
  • If x is a variable and M a term, then (λx.M) is a term.

The latter two rules of term formation bear most of the meat. The second bullet corresponds to function application. The last bullet is called “abstraction” and corresponds to function definition. Intuitively, the notation λx.M corresponds to the mathematical notation of an anonymous function xM. As you can see from this language definition, everything is a function. The functions in this language can “return” other functions and accept other functions as their input. For this reason, they are often referred to as higher-order functions or first-class functions in programming language theory. We provide a few examples of λ terms before examining evaluation rules and the expressive power of this simple language.

  • λx.x: this represents the identity function, which just returns its argument.
  • λf.(λx.(fx)): this function takes in two arguments and applies the first to the second.
  • λf.(λx.(f(xx)))(λx.(f(xx))): this term is called a “fixed-point combinator” and usually denoted by Y. We will discuss its importance later, but introduce it here to show how more complex terms can be built.

Though I referred to the second term above as accepting two arguments, this is actually not the case. Every function in the λ-calculus takes only one argument as its input (as can be seen by a quick examination of the language definition). We will see later, however, that this does not in any way restrict us and that we can often think of nested abstractions as accepting multiple arguments. Before we proceed, a few notational conventions should be mentioned:

  • We always drop the outermost parentheses. This was already adopted in the three examples above.
  • Application associates to the left: (MNP) is shorthand for ((MN)P).
  • Conversely, abstraction associates to the right: (λx.λy.M) is short for (λx.(λy.M))
  • We can drop λs from repeated abstractions: λx.λy.M can be written λxy.M and similarly for any length sequence of abstractions.

b. Substitution

Because it will play a pivotal role in determining the equivalence between λ-terms, we must define a notion of variable substitution. This depends on the notion of free and bound variables, which we will define only informally. In a term λx.P, x is a bound variable, as are similarly bound variables in the subterm P; variables which are not bound are free. In other words, free variables are those that occur outside the scope of a λ abstraction on that variable. A term containing no free variables is called a combinator. We can then define the substitution of N for x in term M, denoted M[x := N], as follows:

  • x[x := N] = N
  • y[x := N] = y (provided that x is a variable different than y)
  • (PQ)[x := N] = P[x := N]Q[x := N]
  • (λx.P)[x := N] = λx.P
  • (λy.P)[x := N] = λy.P[x := N] (provided that x is different than y and y is not free in N or x is not free in P)

Intuitively, these substitution rules allow us to replace all the free occurrences of a variable (x) with any term (N). We cannot replace the bound occurrences of the variable (or allow for a variable to become bound by a substitution, as accounted for in the last rule) because that would change the “meaning” of the term. The scare quotes are included because the formal semantics of the λ-calculus falls beyond the scope of the present article.

c. α-equivalence

The α-equivalence relation, denoted ≡α, captures the idea that two terms are equivalent up to a change of bound variables. For instance, though I referred to λx.x as “the” identity function earlier, we also want to think of λy.y, λz.z, et cetera as identity functions. We can define α-equivalence formally as follows:

  • xα x
  • MNα PQ if Mα P and Nα Q
  • λx.Mα λx.N (provided that Mα N)
  • λx.Mα λy.M[x := y] (provided that y is not free in M)

It will then follow as a theorem that ≡α is an equivalence relation. It should therefore be noted that what was defined earlier as a λ-term is in fact properly called a pre-term. Equipped with the notion of α-equivalence, we can then define a λ-term as the α-equivalence class of a pre-term. This allows us to say that λx.x and λy.y are in fact the same term since they are α-equivalent.

d. β-equivalence

β-reduction, denoted →β, is the principle rewriting rule in the λ-calculus. It captures the intuitive notion of function application. So, for instance, (λx.x)Mβ M, which is what we would expect from an identity function. Formally, →β is the least relation of λ-terms satisfying:

  • If Mβ N then λx.Mβ λx.N
  • If Mβ N then MZβ NZ for any terms M, N, Z
  • If Mβ N then ZMβ ZN for any terms M, N, Z
  • (λx.M)Nβ M[x := N]

The first three conditions define what’s referred to as a compatible relation. →β is then the least compatible relation of terms satisfying the fourth condition, which encodes the notion of function application as variable substitution. We will use the symbol ↠β to denote the iterated (zero or more times) application of →β and =β to denote the smallest equivalence relation containing →β. Two terms M and N are said to be convertible when M =αβ N (=αβ refers to the union of =α and =β). A term of the form (λx.M)N is called a β-redux (or just a redux) because it can be reduced to M[x := N]. A term M is said to be in β-normal form (henceforth referred to simply as normal form) iff there is no term N such that Mβ N. That is to say that a term M is in normal form iff M contains no β-redexes. A term M is called normalizing (or weakly normalizing) iff there is an N in normal form such that Mβ N. A term M is strongly normalizing if every β-reduction sequence starting with M is finite. A term which is not normalizing is called divergent. Regarding β-reduction as the execution of a program, this definition says that divergent programs never terminate since normal form terms correspond to terminated programs. Although we do not prove the result here, it is worth nothing that the untyped λ-calculus is neither normalizing nor strongly normalizing. For instance, Y as defined earlier does not normalize. The interested reader may try an exercise: show that Yβ Y1β Y2β Y3β …, where YiYj for all ij. Iterated β reduction allows us to express functions of multiple variables in a language that only has one-argument functions. Consider the example before of the term λf.λx.(fx). Letting F, X denote arbitrary terms, we can evaluate:

(λf.(λx.(fx)))FX β (λx.(fx)[f := F])X
β (Fx)[x := X]
= FX

This result is exactly what we expected. The process of treating a function of multiple arguments as iterated single-argument functions is generally referred to as Currying. Moreover, ↠β satisfies an important property known either as confluence or as the Church-Rosser property. For any term M, if Mβ M1 and Mβ M2, then there exists a term M3 such that M1β M3 and M2β M3. Although a proof of this statement requires machinery that lies beyond the scope of the present exposition, it is both an important property and one that is weaker than either normalization or strong normalization. In other words, although the untyped λ-calculus is neither normalizing nor strongly normalizing, it does satisfy the Church-Rosser property.

e. η-equivalence

Another common notion of reduction is →η which is the least compatible relation (i.e. a relation satisfying the first three items in the definition of →β) such that

  • λx.Mxη M (provided that x is not free in M)

As before, =η is the least equivalence relation containing →η. The notion of η-equivalence captures the idea of extensionality, which can be broadly stated as the fact that two objects (of whatever kind) are equal if they share all their external properties, even if they may differ on internal properties. (The notion of external and internal properties used here is meant to be intuitive only.) η-equivalence is a kind of extensionality statement for the untyped λ-calculus because it states that any function is equal to the function which takes an argument and applies the original function to this argument. In other words, two functions are equivalent when they yield the same output on the same input. This assumption is an extensionality one because it ignores any differences in how the two functions compute that output.

f. Notes

In its original formulation, Church allowed abstraction to occur only over free variables in the body of the function. While this makes intuitive sense (i.e. the body of the function should depend on its input), the requirement does not change any properties of the calculus and has generally been dropped. Barendregt [1984] is the most thorough work on the untyped λ-calculus. It is also very formal and potentially impenetrable to a new reader. Introductions with learning curves that are not quite as steep include Hindley and Seldin [2008] and Hankin [2004]. The first chapter of Sørensen and Urzyczyn [2006] contains a quick, concise tour through the material, with some intuitive motivation. Other less formal expositions include the entry on Wikipedia and assorted unpublished lecture notes to be found on the web. Not every aspect of the untyped λ-calculus was covered in this section. Moreover, notions such as η-equivalence and the Church-Rosser property were introduced only briefly. These are important philosophically. For more discussion and elaboration on the important technical results, the interested reader can learn more from the above cited sources. We also omit any discussion of the formal semantics of the untyped λ-calculus, originally developed in the 1960s by Dana Scott and Christopher Strachey. Again, Barendregt’s book contains a formal exposition of this material and extensive pointers to further reading on it.

3. Expressiveness

Though the language and reduction rules of the untyped λ-calculus may at first glance appear somewhat limiting, we will see in this section how expressive the calculus can be. Because everything is a function, our general strategy will be to define some particular terms to represent common objects like booleans and numbers and then show that certain other terms properly codify functions of booleans and numbers in the sense that they β reduce as one would expect. Instead, however, of continuing these general remarks, the methodology will be used in the particular cases of booleans and natural numbers. Throughout this exposition, bold face will be used to name particular terms.

a. Booleans

We first introduce the basic definitions:

true = λxy.x
false = λxy.y

In other words, true is a function of two arguments which returns its first argument and false is one which returns the second argument. To see how these two basic definitions can be used to encode the behavior of the booleans, we first introduce the following definition:

ifthenelse = λxyz.xyz

The statement ifthenelsePQR should be read as “if P then Q else R”. It should also be noted that ifthenelsePQRβ PQR; often times, only this reduct is presented as standing for the linguistic expression quoted above. Note that ifthenelse actually captures the desired notion. When P = true, we have that

ifthenelsetrueQR β trueQR
= (λxy.x)QR
β (λy.Q)R
β Q

An identical argument will show that ifthenelse false QRβ R. The above considerations will also hold for any P such that Pβ true or Pβ false. We will now show how a particular boolean connective (and) can be encoded in the λ-calculus and then list some further examples, leaving it to the reader to verify their correctness. We define

and = λpq.pqp

To see this definition’s adequacy, we can manually check the four possible truth-values of the “propositional variables” p and q. Although any λ-term can serve as an argument for a boolean function, these functions encode the truth tables of the connectives in the sense that they behave as one would expect when acting on boolean expressions.

and true true = (λpq.pqp)(λxy.x)(λxy.x)
β (λq.pqp)[p := (λxy.x)](λxy.x)
= (λq.((λxy.x)q(λxy.x)))(λxy.x)
β ((λxy.x)q(λxy.x))[q := λxy.x]
= (λxy.x)(λxy.x)(λxy.x)
β λxy.x
= true

Similarly, we see that

and true false = (λpq.pqp)(λxy.x)(λxy.y)
β (λq.pqp)[p := (λxy.x)](λxy.y)
= (λq.((λxy.x)q(λxy.x)))(λxy.y)
β (λxy.x) (λxy.y)(λxy.x)
β λxy.y
= false

We leave it to the reader to verify that and false trueβ false and that and false falseβ false. Together, these results imply that our definition of and encodes the truth table for “and” just as we would expect it to do. The following list of connectives similarly encode their respective truth tables. Because we think it fruitful for the reader to work out the details to gain a fuller understanding, we simply list these without examples:

or = λpq.ppq
not = λpxy.pyx
xor = λpq.p(q false true)q

b. Church Numerals

At the very end of his 1933 paper, Church introduced an encoding for numbers in the untyped λ-calculus that allows arithmetic to be carried out in this purely functional setting. Because the ramifications were not fully explored until his 1936 paper, we hold off on discussion of the import of this encoding and focus here on an exposition of the ability of the λ-calculus to compute functions of natural numbers. The numerals are defined as follows:

1 = λab.ab
2 = λab.a(ab)
3 = λab.a(a(ab))

Successor, addition, multiplication, predecessor, and subtraction can also be defined as follows:

succ = λnab.a(nab)
plus = λmnab.ma(nab)
mult = λmna.n(ma)
pred = λnab.n(λgh.h(ga))(λu.b)(λu.u)
minus = λmn.(n pred)m

As in the case of the booleans, an example of using the successor and addition functions will help show how these definitions functionally capture arithmetic.

succ 1 = (λnab.a(nab))(λab.ab)
β (λab.a(nab))[n := (λab.ab)]
= λab.a((λab.ab)ab)
(λab.ab)ab β ((λb.ab)[a := a])b
= (λb.ab)b
β ab
λab.a((λab.ab)ab) β λab.a(ab)
= 2

Similarly, an example of addition:

plus 1 2 = (λmnab.ma(nab))(λab.ab)(λab.a(ab))
(λmnab.ma(nab)) 1 2 β ((λnab.ma(nab))[m := λab.ab])2
= (λnab.(λab.ab)a(nab))2
β (λab.(λab.ab)a(nab))[n := λab.a(ab)]
= λab.(λab.ab)a((λab.a(ab))ab)
(λab.ab)a β λb.ab
(λab.a(ab))ab β (λb.a(ab))b
β a(ab)
λab.1a(2ab) β λab.a(a(ab))
= 3

Seeing as how these examples capture the appropriate function on natural numbers in terms of Church numerals, we can introduce the notion of λ-definability in the way that Church did in 1936. A function f of one natural number is said to be λ-definable if it is possible to find a term f such that if f(m) = r, then fmβ r, where m and r are the Church numerals of m and r. The arithmetical functions defined above clearly fit this description. The astute reader may have anticipated the result, proved in Kleene [1936], that the λ-definable functions coincide exactly with partial recursive functions. This quite remarkable result encapsulates the arithmetical expressiveness of the untyped λ-calculus completely. While the import of the result will be discussed in section 4, I will briefly discuss how fixed-point combinators allow recursive functions to be encoded in the λ-calculus. Recall that a combinator is a term with no free variables. A combinator M is called a fixed-point combinator if for every term N, we have that MN =β N(MN). In particular, Y as defined before, is a fixed point combinator, since

YF ≡ (λf. (λx.f(xx))(λx.f(xx)))F
β (λx.F(xx))(λx.F(xx))
β F(xx)[x := λx.F(xx)]
F((λx.F(xx))(λx.F(xx)))
=β F(YF)

Although the technical development of the result lies outside the scope of the present exposition, one must note that fixed-point combinators allow the minimization operator to be defined in the λ-calculus. Therefore, all (partial) recursive functions on natural numbers—not just the primitive recursive ones (which are a proper subset of the partial recursive functions)—can be encoded in the λ-calculus.

c. Notes

In the exposition of booleans, connectives such as and, or, et cetera, were given in a very general formulation. They can also be defined more concretely in terms of true and false in a manner that may be closer to capturing their truth tables. For instance, one may define and = λpq.p(q true false) false and verify that it also captures the truth table properly even though it does not directly β-reduce to the definition presented above. In the 1933 paper, Church defined plus, mult, and minus somewhat differently than presented here. This original exposition used some of the idiosyncrasies of the early version of the λ-calculus that were dropped by the time of the 1936 paper. The predecessor function was originally defined by Kleene, in a story which will be discussed in section 4. In addition to the original Kleene paper, more modern expositions of the result that the partial recursive functions are co-extensive with the λ-definable functions can be found in Sørensen and Urzyczyn [2006, pp. 20–22] and Barendregt [1984, pp. 135–139], among others.

4. Philosophical Importance

The expressiveness of the λ-calculus has allowed it to play a critical role in debates at the foundations of mathematics, logic and programming languages. At the end of the introduction to his 1936 paper, Church writes that “The purpose of the present paper is to propose a definition of effective calculability which is thought to correspond satisfactorily to the somewhat vague intuitive notion in terms of which problems of this class are often stated, and to show, by means of an example, that not every problem of this class is solvable.” The first subsection below will explore the definition of effective calculability in terms of λ-definability. We will then explore the second purpose of the 1936 paper, namely that of proving a negative answer to Hilbert’s Entscheidungsproblem. In the course of both of these expositions, connections will be made with the work of other influential logicians (Gödel and Turing) of the time. Finally, we will conclude with a brief foray into more modern developments in typed λ-calculi and the correspondence between them and logical proofs, a correspondence often referred to as “proofs-as-programs” for reasons that will become clear.

a. The Church-Turing Thesis

In his 1936 paper, in a section entitled “The notion of effective calculability”, Church writes that “We now define the notion, already discussed, of an effectively calculable function of positive integers by identifying it with the notion of a recursive function of positive integers (or a λ-definable function of positive integers)” [Church, 1936b, p. 356, emphasis in original]. The most striking feature of this definition, which will be discussed in more depth in what follows, is that it defines an intuitive notion (effective calculability) with a formal notion. In this sense, some regard the thesis as more of an unverifiable hypothesis than a definition. In what follows, we will consider, in addition to the development of alternative formulations of the thesis, what qualifies as evidence in support of this thesis. In the preceding parts of Church’s paper, he claims to have shown that for any λ-definable function, there exists an algorithm for calculating its values. The idea of an algorithm existing to calculate the values of a function (F) plays a pivotal role in Church’s thesis. Church argues that an effectively calculable function of one natural number must have an algorithm, consisting in a series of expressions (“steps” in the algorithm) that lead to the calculated result. Then, because each of these expressions can be represented as a natural number using Gödel’s methodology, the functions that act on them to calculate F will be recursive and therefore F will be as well. In short, a function which has an algorithm that calculates its values will be effectively calculable and any effectively calculable function of one natural number has such an algorithm. Furthermore, if such an algorithm exists, it will be λ-definable (via the formal result that all recursive functions are λ-definable). Church offers a second notion of effective calculability in terms of provability within an arbitrary logical system. Because he can show that this notion also coincides with recursiveness, he concludes that “no more general definition of effective calculability than that proposed above can be obtained by either of the two methods which naturally suggest themselves” [Church, 1936b, p. 358]. The history of this particular version of the thesis has no clear story. Kleene allegedly realized how to define the predecessor function (as given in the previous section: Kleene had earlier defined the predecessor function using an encoding of numerals different from those of Church) while at the dentist for the removal of two wisdom teeth. For a first-hand account, see Crossley [1975, pp. 4–5]. According to Barendregt [1997, p. 186]:

After Kleene showed the solution to his teacher, Church remarked something like: “But then all intuitively computable functions must be lambda definable. In fact, lambda definability must coincide with intuitive computability.” Many years later—it was at the occasion of Robin Gandy’s 70th birthday, I believe—I heard Kleene say: “I would like to be able to say that, at the moment of discovering how to lambda define the predecessor function, I got the idea of Church’s Thesis. But I did not, Church did.”

Alan Turing, in his groundbreaking paper also published in 1936, developed the notion of computable numbers in terms of his computing machines. While Turing believes that any definition of an intuitive notion will be “rather unsatisfactory mathematically,” he frames the question in terms of computation as opposed to effective calculability: “The real question at issue is ‘what are the possible processes which can be carried out in computing a number?’ ” [Turing, 1937a, p. 249] By analyzing the way that a computer—in fact, an idealized human computer—would use a piece of paper to carry out this process of computing a number, which he reduces to observing a finite number of squares on a one-dimensional paper and changing one of the squares or another within a small distance of the currently observed squares, Turing shows how this intuitive notion of computing can be captured by one of his machines. Thus, he claims to have captured the intuitive notion of computability with computability by Turing machines. In the appendix of this paper added later, Turing also offers a sketch of a proof that λ-definability and his computability capture the same class of functions. In a subsequent paper, he also suggests that “The identification of ‘effectively calculable’ functions with computable functions is possibly more convincing than an identification with the λ-definable or general recursive functions” [Turing, 1937b, p. 153]. While Turing may believe that his machines are closer to the intuitive notion of calculating a number, the fact that the three classes of functions are identical means they can be used interchangeably. This fact has often been considered to count as evidence that these formalisms do in fact capture an intuitive notion of calculability. Because Turing’s analysis of the notion of computability often makes reference to a human computer, with squares on paper referred to as “states of mind”, the Church-Turing thesis, along with Turing’s result that there is a universal machine which can compute any function that any Turing machine can, has often been interpreted, in conjunction with the computational theory of mind, as having profound results on the limits of human thought. For a detailed analysis of many misinterpretations of the thesis and what it does imply in the context of the nature of mind, see Copeland [2010].

b. An Answer to Hilbert’s Entscheidungsproblem

In a famous lecture addressed at the meeting of the International Congress of Mathematicians in 1900, David Hilbert proposed 23 problems in need of solution. The second of these was to give a proof of the consistency of analysis without reducing it to a consistency proof of another system. Though much of Hilbert’s thought and the particular phrasing of this problem changed over the ensuing 30 years, the challenge to prove the consistency of certain formal systems remained. At a 1928 conference, Hilbert posed three problems that continued his general program. The third of these problems was the Entscheidungsproblem in the sense that Church used the term: “By the Entscheidungsproblem of a system of symbolic logic is here understood the problem to find an effective method by which, given any expression Q in the notation of the system, it can be determined whether or not Q is provable in the system” [Church, 1936a, p. 41]. In 1936b, Church used the λ-calculus to prove a negative solution to this problem for a wide/natural/interesting class of formal systems. The technical result proved in this paper is that it is undecidable, given two λ-terms A and B whether A is convertible into B. Church argues that any system of logic that is strong enough to express a certain amount of arithmetic will be able to express the formula ψ(a, b), which encodes that a and b are the Gödel numbers of A and B such that A is convertible into B. If the system of logic satisfied the Entscheidungsproblem, then it would be decidable for every a, b whether ψ(a, b) is provable; but this would provide a way to decide whether A is convertible into B for any terms A and B, contradicting Church’s already proven result that this problem is not decidable. Given that we have just seen the equivalence between Turing machine computability and λ-definability, it should come as no surprise that Turing also provided a negative solution to the Entscheidungsproblem. Turing proceeds by defining a formula of one variable which takes as input the complete description of a Turing machine and is provable if and only if the machine input ever halts. Therefore, if the Entscheidungsproblem can be solved, then it provides a method for proving whether or not a given machine halts. But Turing had shown earlier that this halting problem cannot be solved. These two results, in light of the Church-Turing Thesis, imply that there are functions of natural numbers that are not effectively calculable/computable. It remains open, however, whether there are computing machines or physical processes that elude the limitations of Turing computability (equivalently, λ-definability) and whether or not the human brain may embody one such process. For a thorough discussion of computation in physical systems and the resulting implications for philosophy of mind, see Piccinini [2007].

c. Notes

Kleene [1952] offers (in chapters 12 and 13) a detailed account of the development of the Church-Turing thesis from a figure who was central to the logical developments involved. Turing’s strategy of reducing the Entscheudingsproblem to the halting problem has been adopted by many researchers tackling other problems. Kleene [1952] also discusses Turing computability. The halting problem, as outlined above, however, takes an arbitrary machine and an arbitrary input. There are, of course, particular machines and particular inputs for which it can be decided whether the machine halts on that input.  For instance, every finite-state automaton halts on every input.

5. Types and Programming Languages

a. Overview

In 1940, Church gave a “formulation of the simple theory of types which incorporates certain features of λ-conversion” [Church, 1940, p. 56]. Though the history of simple types extends beyond the scope of this paper and Church’s original formulation extended beyond the syntax of the untyped λ-calculus (in a way somewhat similar to the original, inconsistent formulation), one can intuitively think of types as syntactic decorations that restrict the formation of terms so that functions may only be applied to appropriate arguments. In informal mathematics, a function f : ℕ → ℕ cannot be applied to a fraction. We now define a set of types and a new notion of term that incorporates this idea. We have a set of base types, say σ1, σ2. (In Church’s formulation, ι and o were the only base types. In many programming languages, base types include things such as nat, bool, etc. This set can be countably infinite for the purposes of proof although no programming language could implement such a type system.) For any types α, β, then αβ is also a type. (Church used the notation (βα), but the arrow is used in most modern expositions because it captures the intuition of a function type better.) In what follows, σ, τ and any other lower-case Greek letters will range over types. We will write either e : τ or eτ to indicate that term e is of type τ. The simply typed terms can then be defined as follows:

  • xβ is a term.
  • If xβ is a variable and Mα is a term, then (λxβ.Mα)βα is a term.
  • If Mαβ and Nα are terms, then (MN)β is a term.

While the first two rules can be seen as simply adding “syntactic decoration” to the original untyped terms, this third rule captures the idea of only applying functions to proper input. It can be read as saying: If M is a function from α to β and N is an α, then MN is a β. This formal system does not permit other application terms to be formed. One must note that this simply-typed λ-calculus is strongly normalizing, meaning that any well-typed term has a finite reduction sequence. It has been mentioned that λ-calculi are prototypical programming languages. We can then think of typed λ-calculi as being typed programming languages. Seen in this light, the strongly normalizing property states that one cannot write a non-terminating program. For instance, the term Y in the untyped λ-calculus has no simply-typed equivalent. Clearly, then, this is a somewhat restrictive language. Because of this property and the results in the preceding section, it follows that the simply-typed λ-calculus is not Turing complete, i.e. there are λ-definable functions which are not definable in the simply-typed version. Taking this analogy with programming languages literally, we can interpret a family of results collectively known as the Curry-Howard correspondence as providing a “proofs-as-programs” metaphor. (The correspondence is sometimes referred to as Curry-Howard-Lambek due to Lambek’s work in expressing both the λ-calculus and natural deduction in a category-theoretic setting.) In the case of the simply-typed λ-calculus, the correspondence is between the λ-calculus and the implicational fragment of intuitionistic natural deduction. One can readily see that simple types correspond directly to propositions in this fragment of propositional logic (modulo a translation of base types to propositional variables). The correspondence then states that a proposition is provable if and only if there is a λ-term with the type of the proposition. The term then corresponds to a proof of the proposition. In the simply-typed case, the correspondence can be fleshed out as:

λ-calculus propositional logic
type proposition
term proof
abstraction arrow introduction
application modus ponens
variables assumptions

Table 1. Curry-Howard in the simply-typed λ-calculus.

For example, consider the intuitionistic tautology p → ¬¬p, where ¬p =df p → ⊥. While a full exposition of natural deduction for intuitionistic propositional logic lies beyond the present scope, observe that we have the following proof of p → ¬¬p = p → ((p → ⊥) → ⊥):

  p, p → ⊥ ⊢ p → ⊥         p, p → ⊥ ⊢ p       p, p → ⊥ ⊢ ⊥         p ⊢ (p → ⊥) → ⊥     p → ((p → ⊥) → ⊥)

We can then follow this proof in order to construct a corresponding term in the simply-typed λ-calculus as follows:

  xp, yp→⊥yp→⊥         xp, yp→⊥xp       xp, yp→⊥yx         xp ⊢ (λyp→⊥.yx)(p→⊥)→⊥     ⊢ (λxp.λyp→⊥.yx)p→((p→⊥)→⊥)

As was mentioned, the Curry-Howard correspondence is not one correspondence, but rather a family. The main idea is that terms correspond to proofs of their type. In the way that we can extend logics beyond the implicational fragment of intuitionistic propositional logic to include things like connectives, negation, quantifiers, and higher-order logic, so too can we extend the syntax and type systems of λ-calculus. There are thus many variations of the proofs-as-programs correspondence. See the following notes section for many sources exploring the development of more complex correspondences. That programs represent proofs is more than an intellectual delight: it is the correspondence on which software verifiers and automated mathematical theorem provers (such as Automath, HOL, Isabelle, and Coq) are based. These tools both verify and help users generate proofs and have in fact been used to develop proofs of unproven conjectures.

b. Notes

For a thorough exposition of typed λ-calculi, starting with simple types and progressing through all faces of the “λ-cube”, see Barendregt [1992]. Sørensen and Urzyczyn [2006] explores the Curry-Howard correspondence between many different forms of logic and the corresponding type systems. Pierce [2002] focuses on the λ-calculus and its relation to programming languages, thoroughly developing calculi from the simple to the very complex, along with type checking and operational semantics for each.

6. References and Further Reading

  • Barendregt, Henk. 1984. The Lambda Calculus: Its Syntax and Semantics. Elsevier Science, Amsterdam.
  • Barendregt, Henk. 1992. Lambda Calculi with Types. Oxford University Press, Oxford.
  • Barendregt, Henk. 1997. “The Impact of the Lambda Calculus in Logic and Computer Science.” The Bulletin of Symbolic Logic 3(2):181–215.
  • Cardone, Felice and J. R. Hindley. 2006. Handbook of the History of Logic, eds., D. M. Gabbay and J. Woods, vol. 5., History of Lambda-Calculus and Combinatory Logic, Elsevier.
  • Church, Alonzo. 1932. “A Set of Postulates for the Foundation of Logic (Part I).” Annals of Mathematics, 33(2):346–366.
  • Church, Alonzo. 1933. “A Set of Postulates for the Foundation of Logic (Part II).” Annals of Mathematics, 34(4):839–864.
  • Church, Alonzo. 1935. “A Proof of Freedom from Contradiction.” Proceedings of the National Academy of Sciences of the United States of America, 21(5):275–281.
  • Church, Alonzo. 1936a. “A Note on the Entscheidungsproblem.” Journal of Symbolic Logic, 1(1):40–41.
  • Church, Alonzo. 1936b. “An Unsolvable Problem of Elementary Number Theory.” American Journal of Mathematics, 58(2):345–363, 1936b.
  • Church, Alonzo. 1940. “A Formulation of the Simple Theory of Types.” Journal of Symbolic Logic, 5(2):56–68.
  • Church, Alonzo. 1941. The Calculi of Lambda Conversion. Princeton University Press, Princeton, NJ.
  • Copeland, B. Jack. 2010. “The Church-Turing Thesis.” Stanford Encyclopedia of Philosophy.
  • Crossley, J. N. 1975. “Reminiscences of Logicians.” Algebra and Logic: Papers from the 1974 Summer Research Institute of the Australian Mathematical Society, Monash University, Australia, Lecture Notes in Mathematics, vol. 450, Springer, Berlin.
  • Hankin, Chris. 2004. An Introduction to Lambda Calculi for Computer Scientists. College Publications.
  • Hindley, J. Roger and Jonathan P. Seldin. 2008. Lambda-Calculus and Combinators: An Introduction. Cambridge University Press, Cambridge.
  • Kleene, S. C. 1936. “λ-definability and Recursiveness.” Duke Mathematical Journal, 2(2):340–353.
  • Kleene, S. C. 1952. Introduction to Metamathematics. D. van Norstrand, New York.
  • Kleene, S. C. and J. B. Rosser. 1935. “The Inconsistency of Certain Formal Logics.” Annals of Mathematics, 36(3):630–636.
  • Piccinini, Gualtiero. 2007. “Computational Modelling vs. Computational Explanation: Is Everything a Turing Machine, and Does It Matter to the Philosophy of Mind?” Australasian Journal of Philosophy, 85(1):93–115.
  • Pierce, Benjamin C. 2002. Types and Programming Languages. MIT Press, Cambridge.
  • Sørensen, M. H. and P. Urzyczyn. 2006. Lectures on the Curry-Howard Isomorphism. Elsevier Science, Oxford.
  • Turing, Alan M. 1937a. “On Computable Numbers, with an Application to the Entscheidungsproblem.” Proceedings of the London Mathematical Society, 42(2):230–265.
  • Turing, Alan M. 1937b. “Computability and λ-Definability.” Journal of Symbolic Logic, 2(4): 153–163.

Author Information

Shane Steinert-Threlkeld
Email: shanest@stanford.edu
Stanford University
U. S. A.

Plato: Phaedo

The Phaedo is one of the most widely read dialogues written by the ancient Greek philosopher Plato.  It claims to recount the events and conversations that occurred on the day that Plato’s teacher, Socrates (469-399 B.C.E.), was put to death by the state of Athens.  It is the final episode in the series of dialogues recounting Socrates’ trial and death.  The earlier Euthyphro dialogue portrayed Socrates in discussion outside the court where he was to be prosecuted on charges of impiety and corrupting the youth; the Apology described his defense before the Athenian jury; and the Crito described a conversation during his subsequent imprisonment.  The Phaedo now brings things to a close by describing the moments in the prison cell leading up to Socrates’ death from poisoning by use of hemlock.

Among these “trial and death” dialogues, the Phaedo is unique in that it presents Plato’s own metaphysical, psychological, and epistemological views; thus it belongs to Plato’s middle period rather than with his earlier works detailing Socrates’ conversations regarding ethics.  Known to ancient commentators by the title On the Soul, the dialogue presents no less than four arguments for the soul’s immortality.  It also contains discussions of Plato’s doctrine of knowledge as recollection, his account of the soul’s relationship to the body, and his views about causality and scientific explanation.  Most importantly of all, Plato sets forth his most distinctive philosophical theory—the theory of Forms—for what is arguably the first time. So, the Phaedo merges Plato’s own philosophical worldview with an enduring portrait of Socrates in the hours leading up to his death.

Table of Contents

  1. The Place of the Phaedo within Plato’s works
  2. Drama and Doctrine
  3. Outline of the Dialogue
    1. The Philosopher and Death (59c-69e)
    2. Three Arguments for the Soul’s Immortality (69e-84b)
      1. The Cyclical Argument (70c-72e)
      2. The Argument from Recollection (72e-78b)
      3. The Affinity Argument (78b-84b)
    3. Objections from Simmias and Cebes, and Socrates’ Response (84c-107b)
      1. The Objections (85c-88c)
      2. Interlude on Misology (89b-91c)
      3. Response to Simmias (91e-95a)
      4. Response to Cebes (95a-107b)
        1. Socrates’ Intellectual History (96a-102a)
        2. The Final Argument (102b-107b)
    4. The Myth about the Afterlife (107c-115a)
    5. Socrates’ Death (115a-118a)
  4. References and Further Reading
    1. General Commentaries
    2. The Philosopher and Death (59c-69e)
    3. Three Arguments for the Soul’s Immortality (69e-84b)
    4. Objections from Simmias and Cebes, and Socrates’ Response (84c-107b)
    5. The Myth about the Afterlife (107c-115a)
    6. Socrates’ Death (115a-118a)

1. The Place of the Phaedo within Plato’s works

Plato wrote approximately thirty dialogues.  The Phaedo is usually placed at the beginning of his “middle” period, which contains his own distinctive views about the nature of knowledge, reality, and the soul, as well as the implications of these views for human ethical and political life.  Its middle-period classification puts it after “early” dialogues such as the Apology, Euthyphro, Crito, Protagoras, and others which present Socrates’ search—usually inconclusive—for ethical definitions, and before “late” dialogues like the Parmenides, Theaetetus, Sophist, and Statesman.  Within the middle dialogues, it is uncontroversial that the Phaedo was written before the Republic, and most scholars think it belongs before the Symposium as well.  Thus, in addition to being an account of what Socrates said and did on the day he died, the Phaedo contains what is probably Plato’s first overall statement of his own philosophy.  His most famous theory, the theory of Forms, is presented in four different places in the dialogue.

2. Drama and Doctrine

In addition to its central role in conveying Plato’s philosophy, the Phaedo is widely agreed to be a masterpiece of ancient Greek literature. Besides philosophical argumentation, it contains a narrative framing device that resembles the chorus in Greek tragedy, references to the Greek myth of Theseus and the fables of Aesop, Plato’s own original myth about the afterlife, and in its opening and closing pages, a moving portrait of Socrates in the hours leading up to his death.  Plato draws attention (at 59b) to the fact that he himself was not present during the events retold, suggesting that he wants the dialogue to be seen as work of fiction.

Contemporary commentators have struggled to put together the dialogue’s dramatic components with its lengthy sections of philosophical argumentation—most importantly, with the four arguments for the soul’s immortality, which tend to strike even Plato’s charitable interpreters as being in need of further defense.  (Socrates himself challenges his listeners to provide such defense at 84c-d.)  How seriously does Plato take these arguments, and what does the surrounding context contribute to our understanding of them?  While this article will concentrate on the philosophical aspects of the Phaedo, readers are advised to pay close attention to the interwoven dramatic features as well.

3. Outline of the Dialogue

The dialogue revolves around the topic of death and immortality: how the philosopher is supposed to relate to death, and what we can expect to happen to our souls after we die.  The text can be divided, rather unevenly, into five sections:

(1) an initial discussion of the philosopher and death (59c-69e)

(2) three arguments for the soul’s immortality (69e-84b)

(3) some objections to these arguments from Socrates’ interlocutors and his response, which includes a fourth argument (84c-107b)

(4) a myth about the afterlife (107c-115a)

(5) a description of the final moments of Socrates’ life (115a-118a)

The dialogue commences with a conversation (57a-59c) between two characters, Echecrates and Phaedo, occurring sometime after Socrates’ death in the Greek city of Phlius.  The former asks the latter, who was present on that day, to recount what took place.  Phaedo begins by explaining why some time had elapsed between Socrates’ trial and his execution: the Athenians had sent their annual religious mission to Delos the day before the trial, and executions are forbidden until the mission returns.  He also lists the friends who were present and describes their mood as “an unaccustomed mixture of pleasure and pain,” since Socrates appeared happy and without fear but his friends knew that he was going to die.  He agrees to tell the whole story from the beginning; within this story the main interlocutors are Socrates, Simmias, and Cebes.  Some commentators on the dialogue have taken the latter two characters to be followers of the philosopher Pythagoras (570-490 B.C).

a. The Philosopher and Death (59c-69e)

Socrates’ friends learn that he will die on the present day, since the mission from Delos has returned.  They go in to the prison to find Socrates with his wife Xanthippe and their baby, who are then sent away.  Socrates, rubbing the place on his leg where his just removed bonds had been, remarks on how strange it is that a man cannot have both pleasure and pain at the same time, yet when he pursues and catches one, he is sure to meet with the other as well.  Cebes asks Socrates about the poetry he is said to have begun writing, since Evenus (a Sophist teacher, not present) was wondering about this.  Socrates relates how certain dreams have caused him to do so, and says that he is presently putting Aesop’s fables into verse.  He then asks Cebes to convey to Evenus his farewell, and to tell him that—even though it would be wrong to take his own life—he, like any philosopher, should be prepared to follow Socrates to his death.

Here the conversation turns toward an examination of the philosopher’s attitude toward death.  The discussion starts with the question of suicide.  If philosophers are so willing to die, asks Cebes, why is it wrong for them to kill themselves?  Socrates’ initial answer is that the gods are our guardians, and that they will be angry if one of their possessions kills itself without permission.  As Cebes and Simmias immediately point out, however, this appears to contradict his earlier claim that the philosopher should be willing to die: for what truly wise man would want to leave the service of the best of all masters, the gods?

In reply to their objection, Socrates offers to “make a defense” of his view, as if he were in court, and submits that he hopes this defense will be more convincing to them than it was to the jury.  (He is referring here, of course, to his defense at his trial, which is recounted in Plato’s Apology.) The thesis to be supported is a generalized version of his earlier advice to Evenus: that “the one aim of those who practice philosophy in the proper manner is to practice for dying and death” (64a3-4).

Socrates begins his defense of this thesis, which takes up the remainder of the present section, by defining death as the separation of body and soul.  This definition goes unchallenged by his interlocutors, as does its dualistic assumption that body and soul are two distinct entities.   (The Greek word psuchē is only roughly approximate to our word “soul”; the Greeks thought of psuchē as what makes something alive, and Aristotle talks about non-human animals and even plants as having souls in this sense.)  Granted that death is a soul/body separation, Socrates sets forth a number of reasons why philosophers are prepared for such an event.  First, the true philosopher despises bodily pleasures such as food, drink, and sex, so he more than anyone else wants to free himself from his body (64d-65a).  Additionally, since the bodily senses are inaccurate and deceptive, the philosopher’s search for knowledge is most successful when the soul is “most by itself.”

The latter point holds especially for the objects of philosophical knowledge that Plato later on in the dialogue (103e) refers to as “Forms.”  Here Forms are mentioned for what is perhaps the first time in Plato’s dialogues: the Just itself, the Beautiful, and the Good; Bigness, Health, and Strength; and “in a word, the reality of all other things, that which each of them essentially is” (65d).  They are best approached not by sense perception but by pure thought alone. These entities are granted again without argument by Simmias and Cebes, and are discussed in more detail later. .

All told, then, the body is a constant impediment to philosophers in their search for truth: “It fills us with wants, desires, fears, all sorts of illusions and much nonsense, so that, as it is said, in truth and in fact no thought of any kind ever comes to us from the body” (66c).  To have pure knowledge, therefore, philosophers must escape from the influence of the body as much as is possible in this life. Philosophy itself is, in fact, a kind of “training for dying” (67e), a purification of the philosopher’s soul from its bodily attachment.

Thus, Socrates concludes, it would be unreasonable for a philosopher to fear death, since upon dying he is most likely to obtain the wisdom which he has been seeking his whole life.  Both the philosopher’s courage in the face of death and his moderation with respect to bodily pleasures which result from the pursuit of wisdom stand in stark contrast to the courage and moderation practiced by ordinary people.  (Wisdom, courage, and moderation are key virtues in Plato’s writings, and are included in his definition of justice in the Republic.) Ordinary people are only brave in regard to some things because they fear even worse things happening, and only moderate in relation to some pleasures because they want to be immoderate with respect to others.  But this is only “an illusory appearance of virtue”—for as it happens, “moderation and courage and justice are a purging away of all such things, and wisdom itself is a kind of cleansing or purification” (69b-c).  Since Socrates counts himself among these philosophers, why wouldn’t he be prepared to meet death?  Thus ends his defense.

b. Three Arguments for the Soul’s Immortality (69e-84b)

But what about those, says Cebes, who believe that the soul is destroyed when a person dies?  To persuade them that it continues to exist on its own will require some compelling argument.  Readers should note several important features of Cebes’ brief objection (70a-b).  First, he presents the belief in the immortality of the soul as an uncommon belief (“men find it hard to believe . . .”).  Secondly, he identifies two things which need to be demonstrated in order to convince those who are skeptical: (a) that the soul continues to exist after a person’s death, and (b) that it still possesses intelligence.  The first argument that Socrates deploys appears to be intended to respond to (a), and the second to (b).

i. The Cyclical Argument (70c-72e)

Socrates mentions an ancient theory holding that just as the souls of the dead in the underworld come from those living in this world, the living souls come back from those of the dead (70c-d).  He uses this theory as the inspiration for his first argument, which may be reconstructed as follows:

1. All things come to be from their opposite states: for example, something that comes to be “larger” must necessarily have been “smaller” before (70e-71a).

2. Between every pair of opposite states there are two opposite processes: for example, between the pair “smaller” and “larger” there are the processes “increase” and “decrease” (71b).

3. If the two opposite processes did not balance each other out, everything would eventually be in the same state: for example, if increase did not balance out decrease, everything would keep becoming smaller and smaller (72b).

4.  Since “being alive” and “being dead” are opposite states, and “dying” and “coming-to-life” are the two opposite processes between these states, coming-to-life must balance out dying (71c-e).

5. Therefore, everything that dies must come back to life again (72a).

A main question that arises in regard to this argument is what Socrates means by “opposites.” We can see at least two different ways in which this term is used in reference to the opposed states he mentions.  In a first sense, it is used for “comparatives” such as larger and smaller (and also the pairs weaker/stronger and swifter/slower at 71a), opposites which admit of various degrees and which even may be present in the same object at once (on this latter point, see 102b-c).  However, Socrates also refers to “being alive” and “being dead” as opposites—but this pair is rather different from comparative states such as larger and smaller, since something can’t be deader, but only dead.  Being alive and being dead are what logicians call “contraries” (as opposed to “contradictories,” such as “alive” and “not-alive,” which exclude any third possibility).  With this terminology in mind, some contemporary commentators have maintained that the argument relies on covertly shifting between these different kinds of opposites.

Clever readers may notice other apparent difficulties as well.  Does the principle about balance in (3), for instance, necessarily apply to living things?  Couldn’t all life simply cease to exist at some point, without returning?  Moreover, how does Plato account for adding new living souls to the human population?  While these questions are perhaps not unanswerable from the point of view of the present argument, we should keep in mind that Socrates has several arguments remaining, and he later suggests that this first one should be seen as complementing the second (77c-d).

ii. The Argument from Recollection (72e-78b)

Cebes mentions that the soul’s immortality also is supported by Socrates’ theory that learning is “recollection” (a theory which is, by most accounts, distinctively Platonic, and one that plays a role in his dialogues Meno and Phaedrus as well).  As evidence of this theory he mentions instances in which people can “recollect” answers to questions they did not previously appear to possess when this knowledge is elicited from them using the proper methods. This is likely a reference to the Meno (82b ff.), where Socrates elicits knowledge about basic geometry from a slave-boy by asking the latter a series of questions to guide him in the right direction. Asked by Simmias to elaborate further upon this doctrine, Socrates explains that recollection occurs “when a man sees or hears or in some other way perceives one thing and not only knows that thing but also thinks of another thing of which the knowledge is not the same but different . . .” (73c).  For example, when a lover sees his beloved’s lyre, the image of his beloved comes into his mind as well, even though the lyre and the beloved are two distinct things.

Based on this theory, Socrates now commences a second proof for the soul’s immortality—one which is referred to with approval in later passages in the dialogue (77a-b, 87a, 91e-92a, and 92d-e). The argument may be reconstructed as follows:

1. Things in the world which appear to be equal in measurement are in fact deficient in the equality they possess (74b, d-e).

2. Therefore, they are not the same as true equality, that is, “the Equal itself” (74c).

3. When we see the deficiency of the examples of equality, it helps us to think of, or “recollect,” the Equal itself (74c-d).

4. In order to do this, we must have had some prior knowledge of the Equal itself (74d-e).

5. Since this knowledge does not come from sense-perception, we must have acquired it before we acquired sense-perception, that is, before we were born (75b ff.).

6. Therefore, our souls must have existed before we were born. (76d-e)

With regard to premise (1), in what respect are this-worldly instances of equality deficient?  Socrates mentions that two apparently equal sticks, for example, “fall short” of true equality and are thus “inferior” to it (74e).  Why?  His reasoning at 74b8-9—that the sticks “sometimes, while remaining the same, appear to one to be equal and another to be unequal”—is notoriously ambiguous, and has been the subject of much scrutiny.  He could mean that the sticks may appear as equal or unequal to different observers, or perhaps they appear as equal when measured against one thing but not another.  In any case, the notion that the sensible world is imperfect is a standard view of the middle dialogues (see Republic 479b-c for a similar example), and  is emphasized further in his next argument.

By “true equality” and “the Equal itself” in premises (2)-(4), Socrates is referring to the Form of Equality.  It is this entity with respect to which the sensible instances of equality fall short—and indeed, Socrates says that the Form is “something else beyond all these.”  His brief argument at 74a-c that true equality is something altogether distinct from any visible instances of equality is of considerable interest, since it is one of few places in the middle dialogues where he makes an explicit argument for why there must be Forms. The conclusion of the second argument for the soul’s immortality extends what has been said about equality to other Forms as well: “If those realities we are always talking about exist, the Beautiful and the Good and all that kind of reality, and we refer all the things we perceive to that reality, discovering that it existed before and is ours, and we compare these things with it, then, just as they exist, so our soul must exist before we are born” (76d-e).  The process of recollection is initiated not just when we see imperfectly equal things, then, but when we see things that appear to be beautiful or good as well; experience of all such things inspires us to recollect the relevant Forms.  Moreover, if these Forms are never available to us in our sensory experience, we must have learned them even before we were capable of having such experience.

Simmias agrees with the argument so far, but says that this still does not prove that our souls exist after death, but only before birth.  This difficulty, Socrates suggests, can be resolved by combining the present argument with the one from opposites: the soul comes to life from out of death, so it cannot avoid existing after death as well.  He does not elaborate on this suggestion, however, and instead proceeds to offer a third argument.

iii. The Affinity Argument (78b-84b)

The third argument for the soul’s immortality is referred to by commentators as the “affinity argument,” since it turns on the idea that the soul has a likeness to a higher level of reality:

1. There are two kinds of existences: (a) the visible world that we perceive with our senses, which is human, mortal, composite, unintelligible, and always changing, and (b) the invisible world of Forms that we can access solely with our minds, which is divine, deathless, intelligible, non-composite, and always the same (78c-79a, 80b).

2. The soul is more like world (b), whereas the body is more like world (a) (79b-e).

3. Therefore, supposing it has been freed of bodily influence through philosophical training, the soul is most likely to make its way to world (b) when the body dies (80d-81a).  (If, however, the soul is polluted by bodily influence, it likely will stay bound to world (a) upon death (81b-82b).)

Note that this argument is intended to establish only the probability of the soul’s continued existence after the death of the body—“what kind of thing,” Socrates asks at the outset, “is likely to be scattered [after the death of the body]?” (78b; my italics)  Further, premise (2) appears to rest on an analogy between the soul and body and the two kinds of realities mentioned in (1), a style of argument that Simmias will criticize later (85e ff.).  Indeed, since Plato himself appends several pages of objections by Socrates’ interlocutors to this argument, one might wonder how authoritative he takes it to be.

Yet Socrates’ reasoning about the soul at 78c-79a states an important feature of Plato’s middle period metaphysics, sometimes referred to as his “two-world theory.”  In this picture of reality, the world perceived by the senses is set against the world of Forms, with each world being populated by fundamentally different kinds of entities:

The World of the Senses The World of Forms
Composites (that is, things with parts) Non-composites
Things that never remain the same from one moment to the next Things that always remain the same and don’t tolerate any change
Any particular thing that is equal, beautiful, and so forth The Equal, the Beautiful, and what each thing is in itself
That which is visible That which is grasped by the mind and invisible

Since the body is like one world and the soul like the other, it would be strange to think that even though the body lasts for some time after a person’s death, the soul immediately dissolves and exists no further.  Given the respective affinities of the body and soul, Socrates spends the rest of the argument (roughly 80d-84b) expanding on the earlier point (from his “defense”) that philosophers should focus on the latter. This section has some similarities to the myth about the afterlife, which he narrates near the dialogue’s end; note that some of the details of the account here of what happens after death are characterized as merely “likely.” A soul which is purified of bodily things, Socrates says, will make its way to the divine when the body dies, whereas an impure soul retains its share in the visible after death, becoming a wandering phantom.  Of the impure souls, those who have been immoderate will later become donkeys or similar animals, the unjust will become wolves or hawks, those with only ordinary non-philosophical virtue will become social creatures such as bees or ants.

The philosopher, on the other hand, will join the company of the gods.  For philosophy brings deliverance from bodily imprisonment, persuading the soul “to trust only itself and whatever reality, existing by itself, the soul by itself understands, and not to consider as true whatever it examines by other means, for this is different in different circumstances and is sensible and visible, whereas what the soul itself sees is intelligible and indivisible” (83a6-b4).  The philosopher thus avoids the “greatest and most extreme evil” that comes from the senses: that of violent pleasures and pains which deceive one into thinking that what causes them is genuine.  Hence, after death, his soul will join with that to which it is akin, namely, the divine.

c. Objections from Simmias and Cebes, and Socrates’ Response (84c-107b)

After a long silence, Socrates tells Simmias and Cebes not to worry about objecting to any of what he has just said.  For he, like the swan that sings beautifully before it dies, is dedicated to the service of Apollo, and thus filled with a gift of prophecy that makes him hopeful for what death will bring.

i. The Objections (85c-88c)

Simmias prefaces his objection by making a remark about methodology.  While certainty, he says, is either impossible or difficult,  it would show a weak spirit not to make a complete investigation.  If at the end of this investigation one fails to find the truth, one should adopt the best theory and cling to it like a raft, either until one dies or comes upon something sturdier.

This being said, he proceeds to challenge Socrates’ third argument.  For one might put forth a similar argument which claims that the soul is like a harmony and the body is like a lyre and its strings.  In fact, Simmias claims that “we really do suppose the soul to be something of this kind,” that is, a harmony or proper mixture of bodily elements like the hot and cold or dry and moist (86b-c).  (Some commentators think  the “we” here refers to followers of Pythagoras.)  But even though a musical harmony is invisible and akin to the divine, it will cease to exist when the lyre is destroyed.  Following the soul-as-harmony thesis, the same would be true of the soul when the body dies.

Next Socrates asks if Cebes has any objections.  The latter says that he is convinced by Socrates’ argument that the soul exists before birth, but still doubts whether it continues to exist after death.   In support of his doubt, he invokes a metaphor of his own.  Suppose someone were to say that since a man lasts longer than his cloak, it follows that if the cloak is still there the man must be there too.  We would certainly think this statement was nonsense. (He appears to be refering to Socrates’ argument at 80c-e here.)  Just as a man might wear out many cloaks before he dies, the soul might use up many bodies before it dies.  So even supposing everything else is granted, if “one does not further agree that the soul is not damaged by its many births and is not, in the end, altogether destroyed in one of those deaths, he might say that no one knows which death and dissolution of the body brings about the destruction of the soul, since not one of us can be aware of this” (88a-b).  In light of this uncertainty, one should always face death with fear.

ii. Interlude on Misology (89b-91c)

After a short exchange in the meta-dialogue in which Phaedo and Echecrates praise Socrates’ pleasant attitude throughout this discussion, Socrates begins his response with a warning that they not become misologues.  Misology, he says, arises in much the same way that misanthropy does: when someone with little experience puts his trust in another person, but later finds him to be unreliable, his first reaction is to blame this on the depraved nature of people in general.  If he had more knowledge and experience, however, he would not be so quick to make this leap, for he would realize that most people fall somewhere in between the extremes of good and bad, and he merely happened to encounter someone at one end of the spectrum.  A similar caution applies to arguments.  If someone thinks a particular argument is sound, but later finds out that it is not, his first inclination will be to think that all arguments are unsound; yet instead of blaming arguments in general and coming to hate reasonable discussion, we should blame our own lack of skill and experience.

iii. Response to Simmias (91e-95a)

Socrates then puts forth three counter-arguments to Simmias’ objection.  To begin, he gets both Simmias and Cebes to agree that the theory of recollection is true.  But if this is so, then Simmias is not able to “harmonize” his view that the soul is a harmony dependent on the body with the recollection view that the soul exists before birth.  Simmias admits this inconsistency, and says that he in fact prefers the theory of recollection to the other view.  Nonetheless, Socrates proceeds to make two additional points.  First, if the soul is a harmony, he contends, it can have no share in the disharmony of wickedness.  But this implies that all souls are equally good.  Second, if the soul is never out of tune with its component parts (as shown at 93a), then it seems like it could never oppose these parts.  But in fact it does the opposite, “ruling over all the elements of which one says it is composed, opposing nearly all of them throughout life, directing all their ways, inflicting harsh and painful punishment on them, . . . holding converse with desires and passions and fears, as if it were one thing talking to a different one . . .” (94c9-d5).  A passage in Homer, wherein Odysseus beats his breast and orders his heart to endure, strengthens this picture of the opposition between soul and bodily emotions.  Given these counter-arguments, Simmias agrees that the soul-as-harmony thesis cannot be correct.

iv. Response to Cebes (95a-107b)

1. Socrates’ Intellectual History (96a-102a)

After summarizing Cebes’ objection that the soul may outlast the body yet not be immortal, Socrates says that this problem requires “a thorough investigation of the cause of generation and destruction” (96a; the Greek word aitia, translated as “cause,” has the more general meaning of “explanation”).  He now proceeds to relate his own examinations into this subject, recalling in turn his youthful puzzlement about the topic, his initial attraction to a solution given by the philosopher Anaxagoras (500-428 B.C.), and finally his development of his own method of explanation involving Forms.  It is debated whether this account is meant to describe Socrates’ intellectual autobiography or Plato’s own, since the theory of Forms generally is described as the latter’s distinctive contribution.  (Some commentators have suggested that it may be neither, but instead just good storytelling on Plato’s part.)

When Socrates was young, he says, he was excited by natural science, and wanted to know the explanation of everything from how living things are nourished to how things occur in the heavens and on earth.  But then he realized that he had no ability for such investigations, since they caused him to unlearn many of the things he thought he had previously known.   He used to think, for instance, that people grew larger by various kinds of external nourishment combining with the appropriate parts of our bodies, for example, by food adding flesh to flesh.  But what is it which makes one person larger than another?   Or for that matter, which makes one and one add up to two?  It seems like it can’t be simply the two things coming near one another.   Because of puzzles like these, Socrates is now forced to admit his ignorance: “I do not any longer persuade myself that I know why a unit or anything else comes to be, or perishes or exists by the old method of investigation, and I do not accept it, but I have a confused method of my own” (97b).

This method came about as follows.  One day after his initial setbacks Socrates happened to hear of Anaxagoras’ view that Mind directs and causes all things.  He took this to mean that everything was arranged for the best.  Therefore, if one wanted to know the explanation of something, one only had to know what was best for that thing.  Suppose, for instance, that Socrates wanted to know why the heavenly bodies move the way they do.  Anaxagoras would show him how this was the best possible way for each of them to be.  And once he had taught Socrates what the best was for each thing individually, he then would explain the overall good that they all share in common.  Yet upon studying Anaxagoras further, Socrates found these expectations disappointed.  It turned out that Anaxagoras did not talk about Mind as cause at all, but rather about air and ether and other mechanistic explanations.  For Socrates, however, this sort of explanation was simply unacceptable:

To call those things causes is too absurd.  If someone said that without bones and sinews and all such things, I should not be able to do what I decided, he would be right, but surely to say that they are the cause of what I do, and not that I have chosen the best course, even though I act with my mind, is to speak very lazily and carelessly.  Imagine not being able to distinguish the real cause from that without which the cause would not be able to act as a cause. (99a-b)

Frustrated at finding a teacher who would provide a teleological explanation of these phenomena, Socrates settled for what he refers to as his “second voyage” (99d).  This new method consists in taking what seems to him to be the most convincing theory—the theory of Forms—as his basic hypothesis, and judging everything else in accordance with it.  In other words, he assumes the existence of the Beautiful, the Good, and so on, and employs them as explanations for all the other things.  If something is beautiful, for instance, the “safe answer” he now offers for what makes it such is “the presence of,” or “sharing in,” the Beautiful (100d).  Socrates does not go into any detail here about the relationship between the Form and object that shares in it, but only claims that “all beautiful things are beautiful by the Beautiful” (100d).  In regard to the phenomena that puzzled him as a young man, he offers the same answer.  What makes a big thing big, or a bigger thing bigger, is the Form Bigness.  Similarly, if one and one are said to be two, it is because they share in Twoness, whereas previously each shared in Oneness.

2. The Final Argument (102b-107b)

When Socrates has finished describing this method, both Simmias and Cebes agree that what he has said is true.  Their accord with his view is echoed in another brief interlude by Echecrates and Phaedo, in which the former says that Socrates has “made these things wonderfully clear to anyone of even the smallest intelligence,” and Phaedo adds that all those present agreed with Socrates as well.  Returning again to the prison scene, Socrates now uses this as the basis of a fourth argument that the soul is immortal.  One may reconstruct this argument as follows:

1. Nothing can become its opposite while still being itself: it either flees away or is destroyed at the approach of its opposite.  (For example, “tallness” cannot become “shortness” while still being “tall.”) (102d-103a)

2. This is true not only of opposites, but in a similar way of things that contain opposites.  (For example, “fire” and “snow” are not themselves opposites, but “fire” always brings “hot” with it, and “snow” always brings “cold” with it.  So “fire” will not become “cold” without ceasing to be “fire,” nor will “snow” become “hot” without ceasing to be “snow.”) (103c-105b)

3. The “soul” always brings “life” with it. (105c-d)

4. Therefore “soul” will never admit the opposite of “life,” that is, “death,” without ceasing to be “soul.” (105d-e)

5. But what does not admit death is also indestructible. (105e-106d)

6. Therefore, the soul is indestructible. (106e-107a)

When someone objects that premise (1) contradicts his earlier statement (at 70d-71a) about opposites arising from one another, Socrates responds that then he was speaking of things with opposite properties, whereas here is talking about the opposites themselves.  Careful readers will distinguish three different ontological items at issue in this passage:

(a) the thing (for example, Simmias) that participates in a Form (for example, that of Tallness), but can come to participate in the opposite Form (of Shortness) without thereby changing that which it is (namely, Simmias)

(b) the Form (for example, of Tallness), which cannot admit its opposite (Shortness)

(c) the Form-in-the-thing (for example, the tallness in Simmias), which cannot admit its opposite (shortness) without fleeing away of being destroyed

Premise (2) introduces another item:

(d) a kind of entity (for example, fire) that, even though it does not share the same name as a Form, always participates in that Form (for example, Hotness), and therefore always excludes the opposite Form (Coldness) wherever it (fire) exists

This new kind of entity puts Socrates beyond the “safe answer” given before (at 100d) about how a thing participates in a Form.  His new, “more sophisticated answer” is to say that what makes a body hot is not heat—the safe answer—but rather an entity such as fire.  In like manner, what makes a body sick is not sickness but fever, and what makes a number odd is not oddness but oneness (105b-c).  Premise (3) then states that the soul is this sort of entity with respect to the Form of Life.  And just as fire always brings the Form of Hotness and excludes that of Coldness, the soul will always bring the Form of Life with it and exclude its opposite.

However, one might wonder about premise (5).  Even though fire, to return to Socrates’ example, does not admit Coldness, it still may be destroyed in the presence of something cold—indeed, this was one of the alternatives mentioned in premise (1).  Similarly, might not the soul, while not admitting death, nonetheless be destroyed by its presence?  Socrates tries to block this possibility by appealing to what he takes to be a widely shared assumption, namely, that what is deathless is also indestructible: “All would agree . . . that the god, and the Form of Life itself, and anything that is deathless, are never destroyed” (107d).  For readers who do not agree that such items are deathless in the first place, however, this sort of appeal is unlikely to be acceptable.

Simmias, for his part, says he agrees with Socrates’ line of reasoning, although he admits that he may have misgivings about it later on.  Socrates says that this is only because their hypotheses need clearer examination—but upon examination they will be found convincing.

d. The Myth about the Afterlife (107c-115a)

The issue of the immortality of the soul, Socrates says, has considerable implications for morality.  If the soul is immortal, then we must worry about our souls not just in this life but for all time; if it is not, then there are no lasting consequences for those who are wicked.  But in fact, the soul is immortal, as the previous arguments have shown, and Socrates now begins to describe what happens when it journeys to the underworld after the death of the body.  The ensuing tale tells us of

(1) the judgment of the dead souls and their subsequent journey to the underworld (107d-108c)

(2) the shape of the earth and its regions (108c-113c)

(3) the punishment of the wicked and the reward of the pious philosophers (113d-114c)

Commentators commonly refer to this story as a “myth,” and Socrates himself describes it this way (using the Greek word muthos at 110b, which earlier on in the dialogue (61b) he has contrasted with logos, or “argument.”).  Readers should be aware that for the Greeks myth did not have the negative connotations it often carries today, as when we say, for instance, that something is “just a myth” or when we distinguish myth from fact.  While Plato’s relation to traditional Greek mythology is a complex one—see his critique of Homer and Hesiod in Republic Book II, for instance—he himself uses myths to bolster his doctrines not only in the Phaedo, but in dialogues such as the Gorgias, Republic, and Phaedrus as well.

At the end of his tale, Socrates says that what is important about his story is not its literal details, but rather that we “risk the belief” that “this, or something like this, is true about our souls and their dwelling places,” and repeat such a tale to ourselves as though it were an “incantation” (114d).  Doing so will keep us in good spirits as we work to improve our souls in this life.  The myth thus reinforces the dialogue’s recommendation of the practice of philosophy as care for one’s soul.

e. Socrates’ Death (115a-118a)

The depiction of Socrates’ death that closes the Phaedo is rich in dramatic detail.  It also is complicated by a couple of difficult interpretative questions.

After Socrates has finished his tale about the afterlife, he says that it is time for him to prepare to take the hemlock poison required by his death sentence.  When Crito asks him what his final instructions are for his burial, Socrates reminds him that what will remain with them after death is not Socrates himself, but rather just his body, and tells him that they can bury it however they want.  Next he takes a bath—so that his corpse will not have to be cleaned post-mortem—and says farewell to his wife and three sons.  Even the officer sent to carry out Socrates’ punishment is moved to tears at this point, and describes Socrates as “the noblest, the gentlest and the best man” who has ever been at the prison.

Crito tells Socrates that some condemned men put off taking the poison for as long as possible, in order to enjoy their last moments in feasting or sex.  Socrates, however, asks for the poison to be brought immediately.  He drinks it calmly and in good cheer, and chastises his friends for their weeping.  When his legs begin to feel heavy, he lies down; the numbness in his body travels upward until eventually it reaches his heart.

Some contemporary scholars have challenged Plato’s description of hemlock-poisoning, arguing that in fact the symptoms would have been much more violent than the relatively gentle death he depicts.  If these scholars are right, why does Plato depict the death scene the way he does?  There is also a dispute about Socrates’ last words, which invoke a sacrificial offering made by the sick to the god of medicine: “Crito, we owe a cock to Asclepius; make this offering to him and do not forget.”  Did Socrates view life as a kind of sickness?

4. References and Further Reading

a. General Commentaries

  • Bostock, D. Plato’s Phaedo. Oxford, 1986.
    • In-depth yet accessible discussion of the dialogue’s arguments (does not include text of the Phaedo).  Includes a helpful chapter on the theory of Forms.
  • Dorter, K. Plato’s Phaedo: An Interpretation. University of Toronto Press, 1982.
    • Reading of the dialogue that combines both dramatic and doctrinal approaches (does not include text of the Phaedo).
  • Gallop, D. Plato: Phaedo. Oxford, 1975.
    • English translation with separate commentary that focuses on the dialogue’s argumentation.
  • Hackforth, R. Plato’s Phaedo: Translated with an Introduction and Commentary. Cambridge, 1955.
    • English translation with running commentary.
  • Rowe, C.J. Plato: Phaedo. Cambridge, 1993.
    • Original Greek text (no English) with introduction and detailed textual commentary.

b. The Philosopher and Death (59c-69e)

  • Pakaluk, M. “Degrees of Separation in the ‘Phaedo.’” Phronesis 48 (2003) 89-115.
    • Discusses Plato’s notion of the soul-body distinction at 63a-69e.
  • Warren, J. “Socratic Suicide.” The Journal of Hellenic Studies 121 (2001) 91-106.
    • On the Platonic philosopher’s attitude toward suicide in the 61e-69e passage.
  • Weiss, R. “The Right Exchange: Phaedo 69a6-c3″. Ancient Philosophy 7 (1987) 57-66.
    • Examines the notion that wisdom is the highest goal of the philosopher.

c. Three Arguments for the Soul’s Immortality (69e-84b)

  • Ackrill, J.L. “Anamnēsis in the Phaedo,” in E.N. Lee and A.P.D. Mourelatos (eds.) Exegesis and Argument: Studies in Greek Philosophy Presented to Gregory Vlastos. Assen, 1973. 177-95.
    • On the theory of recollection (73c-75).
  • Apolloni, D. “Plato’s Affinity Argument for the Immortality of the Soul.” Journal of the History of Philosophy 34 (1996) 5-32.
    • A study of the argument at 78b-80d.
  • Gallop, D. “Plato’s ‘Cyclical Argument’ Recycled.” Phronesis 27 (1982) 207-222.
    • On the first argument for the soul’s immortality (69e-72e) and its relation to the other arguments.
  • Matthen, M.  “Forms and Participants in Plato’s Phaedo.”  Noûs 18:2 (1984) 281-297.
    • Discusses Plato’s argument concerning equals at 74b7-c6.
  • Nehamas, A. “Plato on the Imperfection of the Sensible World,” in G. Fine, ed., Plato 1: Metaphysics and Epistemology. Oxford, 1999. 171-191.
    • On Plato’s view of sensible particulars, especially at 72e-78b.

d. Objections from Simmias and Cebes, and Socrates’ Response (84c-107b)

  • Frede, D.  “The Final Proof of the Immortality of the Soul in Plato’s Phaedo 102a-107a.”  Phronesis 23 (1978) 27-41.
    • A defense of Plato’s argument and examination of its underlying assumptions regarding the soul.
  • Gottschalk, H.D. “Soul as Harmonia.” Phronesis 16 (1971) 179-198.
    • Discusses Simmias’ account of the soul beginning at 85e.
  • Vlastos, G. “Reasons and Causes in the Phaedo,” in Plato: A Collection of Critical Essays, Vol. I: Metaphysics and Epistemology.  Garden City, NY: Anchor Books, 1971.
    • Are Forms causes? An examination of 95e-105c.
  • Wiggins, D. “Teleology and the Good in Plato’s Phaedo.”  Oxford Studies in Ancient Philosophy 4 (1986) 1-18.
    • On Socrates’ “second voyage” beginning at 99c2-d1.

e. The Myth about the Afterlife (107c-115a)

  • Annas, J. “Plato’s Myths of Judgment.” Phronesis 27 (1982) 119-43.
    • A study of Plato’s myths in the GorgiasPhaedo, and Republic.
  • Morgan, K.A. Myth and Philosophy from the pre-Socratics to Plato. Cambridge, 2000.
    • Includes extensive background on myth in Plato, as well as discussion of the Phaedo myth in particular.
  • Sedley, D. “Teleology and Myth in the Phaedo.” Proceedings of the Boston Area Colloquium in Ancient Philosophy 5 (1990) 359–83.

f. Socrates’ Death (115a-118a)

  • Crook, J. “Socrates’ Last Words: Another Look at an Ancient Riddle.” Classical Quarterly 48 (1998) 117-125.
    • The papers by Crook and Most (cited below) consider some puzzles regarding Socrates’ final words at the dialogue’s end.
  • Gill, C. “The Death of Socrates.” Classical Quarterly 23 (1973) 25-25.
    • On the finer details of hemlock-poisoning.
  • Most, G.W. “A Cock for Asclepius.” Classical Quarterly 43 (1993) 96-111.
  • Stewart, D. “Socrates’ Last Bath.” Journal of the History of Philosophy 10 (1972) 253-9.
    • Looks at the deeper meaning of Socrates’ bath at 116a.
  • Wilson, E. The Death of Socrates. Harvard University Press, 2007.
    • Includes discussion of the death scene in the Phaedo, as well as its subsequent reception in Western philosophy, art, and culture.

Author Information

Tim Connolly
Email: tconnolly@po-box.esu.edu
East Stroudsburg University
U. S. A.

Philosophy of Film: Continental Perspectives

This article introduces the most important perspectives on film (movies) from the continental philosophical perspective. “Continental” is not used as a geographical term, but as an abstract concept referring to nineteenth and twentieth century European philosophical traditions exemplified by German idealism, phenomenology, existentialism, hermeneutics, structuralism, post-structuralism, French feminism, and the Frankfurt School. The continental-friendly philosophy of film that has emerged in Anglophone countries since the 1980s also is taken into account in this article.

If one considers only contributions by well known philosophers, the philosophical output on film might appear relatively meager. Books that deal with the philosophy of film are equally rare. If, however, one considers the scholarly contributions from the entire field of humanities, specifically in the form of film aesthetics and film theory, the body of reflections on film inspired by philosophical ideas (in the most general sense) is impressive. Most of these works are linked to the European philosophical tradition of philosophy of film, which developed from the 1920s onward. Henri Bergson (1859-1941) was the first philosopher to show interest in film, though his influence on continental philosophy of film remained minor – though not inexistent – before the publication of Gilles Deleuze’s two volumes on cinema (1983 and 1985). In the 1980s, two French philosophers, Jean-Louis Schefer and Gilles Deleuze, decided to devote their attention to film studies. These studies began a continuous line of European philosophical works on film that stretched through to today’s writings by Jacques Rancière and Slavoj Žižek. In the English-speaking world, philosophical concepts entered the discourse on film at around the same time. Stanley Cavell’s work The World Viewed: Reflections on the Ontology of Film (1971) was a notable precursor of this tendency. In 1988, Noel Carroll published a critique of contemporary film theory (Mystifying Movies) which he criticized as being overly determined by Psycho-Semiotic Marxist paradigms. In the same year he published Philosophical Problems of Classical Film Theory that examined pre-semiotic theorists like Bazin and Arnheim in an analytical fashion.

Both representatives of the analytical and the continental tradition see thinkers that were active before the analytical-continental divide (for example, Münsterberg, Kracauer) as being central to their film studies; however, the interpretations of such thinkers differ considerably in both traditions.

A significant amount of continental work developed around the British journal Screen, which was very influential in the 1970s and has laid many of the foundations of Lacanian and neo-Marxist film theory.

Analytical philosophy of film has profited greatly from its rich tradition of analytical aesthetics. A significant part of this philosophy has attempted to push its studies in the direction of evidence-based scientific models. Continental thought has typically been inspired by the softer fields of humanities and has displayed a solid amount of political engagement. In the former Soviet Union, a complex discourse on the semiotics of film, inspired by a Russian formalist heritage that has a natural affinity with film, has made numerous philosophical statements.

Table of Contents

  1. What Is Philosophy Of Film?
    1. Continental vs. Analytical
    2. French Philosophy of Film
    3. Philosophy of Film in Other European Countries
  2. An Overview Of Theories
    1. Henri Bergson
    2. Hugo Münsterberg
    3. Formalist Approaches
      1. The Russian Formalism Tradition
        1. Sergei Eisenstein
        2. Vsevolod Pudovkin
        3. Lev Kuleshov
        4. Yuri Lotman
      2. Béla Balász: Film as Language
      3. Mikhail Bakhtin: Dialogicity and Reception
      4. Rationalism: Galvano Della Volpe
      5. Semiotics/Semiology
        1. Gilbert Cohen-Séat
        2. Roland Barthes
        3. Christian Metz
        4. Pier Paolo Pasolini
    4. Cinematographic Ontology and Phenomenology
      1. Phenomenology
        1. Maurice Merleau-Ponty
        2. Amédée Ayfre
      2. Realism:
        1. André Bazin
        2. Siegfried Kracauer
        3. Dziga Vertov
      3. Existentialism
        1. Alexandre Astruc’s Caméra-Stylo
      4. Film as Thought
        1. Jean Epstein: Film as Artificial Intelligence
        2. Jean Mitry
        3. Gilles Deleuze
        4. Filmosophy
      5. Film as Poetic Art: Susanne Langer
    5. Psychology/Psychoanalysis
      1. Rudolf Arnheim
        1. Psychoanalysis vs. Cognitive Science
      2. The Mindscreen Theory
      3. Slavoj Žižek
    6. Walter Benjamin and the Frankfurt School
    7. Hermeneutic Film Analysis
    8. Future Perspectives
  3. References and Further Reading

1. What Is Philosophy Of Film?

Many studies of film do not necessarily claim a specific philosophical heritage, but can be considered as philosophical inasmuch as their approaches are methodologically sophisticated and transgress empiricism. Though it is very difficult to establish positive standards for what is ‘’philosophical’’ and what is not, it can be concurred that an approach is “philosophical” as soon as it uses references to philosophers and some abstract concepts that are not merely technical. A degree of argumentation and claim-making supported by reasons, some sort of evidence, and analysis or interpretation, on the other hand, are not particular to philosophy but are followed by all (human) sciences.

Film Theory

Para-philosophical studies have also been conveniently labelled as ‘film theory’. Strictly speaking, film theory, which develops concepts like ‘narrativity’, ‘diegesis’, ‘genre’ or ‘authorship’ is not a unique philosophy, but is very often part of it. Although parts of film theory and philosophy of film do overlap in both traditions (and since Münsterberg and Arnheim many film theorists have maintained that what they do is also related to the philosophy of film), it is wrong to characterize every “theoretical” reflection on film as philosophical because film theory can also be limited to the analysis of mere technicalities or textual analysis. Here, film theory is not different from literary theory where reflections on authorship or analyses of narrative and genre occur most typically without necessarily receiving the label “philosophy” unless they are imbedded in more “philosophical” considerations. The contemporary Anglophone philosophy of film sees this perspective slightly differently since longstanding reflections on these subjects (by Cavell, Carroll, Bordwell, Gaut, and Branigan) are usually classified as philosophy.

While the degrees of “philosophicality” vary within different interpretations of films, it is safe to assume that any reflective study of the nature of film is philosophical. Whenever scholars attempt to spell out what film is (Is film an art? How is film different from other arts?), their discourse becomes necessarily philosophical. This reflective study is not limited to film, but is true for any academic field (it becomes most obvious in philosophy of science).

Film Aesthetics

Philosophy of film cannot be seen only as a subfield of aesthetics or of the philosophy of art. Philosophy of film also should not be labeled as ‘film aesthetics’. Philosophy of film is able to approach a spectrum of questions so broad that its link with aesthetics can sometimes be maintained only by purely formal reasons. Bergson’s philosophy of film, which has attained a central position in contemporary philosophy of film, was not developed out of an aesthetic interest at all; but film would have served Bergson as merely an illustration for his general philosophical ideas. It is even possible to say that the philosophy of film, just because of its readiness to undertake paradoxical fusions that contrast aesthetics against fields other than aesthetics, develops a discourse that stands out in the entire body of philosophy. Treating, for example, psychoanalytical or cognitive problems as aesthetical phenomena bears an immense critical potential that does not exist to the same extent in other sub-disciplines of philosophy.

Film and Philosophy

Philos-sophia, the ancient Greek term for ‘’love of wisdom’’, can be understood as an immense attempt at interpreting or questioning human existence and the world in its entirety. Logically, film can be one of its subjects. When this is the case, the approach towards film usually exceeds the label of mere interpretation and places film in a relationship with classical philosophical questions such as (its own) essence, truth, or beauty.

Philos-sophia can see film as one of its subjects, but film can also be its object, that is, film can be a philosophy through which a thinker attempts to see the world In other words, film can establish its own stances about essence, truth, or beauty. Theoretically, this can be done with any art form: it is also possible to see painting, literature, or dance as philosophical activities. However, film has been found much more apt for such models because its integrative mode of time, space, images, and movement brings it much closer to thinking (as will be developed below).

The idea of “philosophy of film” is a little like Kant’s “Critique of Pure Reason” because it implies two ideas at the same time. Kant’s title can be understood as (1) “pure reason being criticized” and as (2) “pure reason doing the criticizing.” In the first case, pure reason is a passive object of research undergoing the scrutiny of critical thought while in the second case pure reason is actively imposing its standards on this critical thought. The same is true for the phrase “philosophy of film,” which can mean (1) that film is undergoing a philosophical examination and (2) that film itself helps to develop a certain type of philosophical thinking that will subsequently be imposed upon various subjects of research. Strictly speaking, any outline of the philosophy of film should be divided into two parts: (1) a philosophy about film and (2) film as an philosophy. The latter occupies a prominent place in many recent continental and Anglo-American discourses, and has been defended by Wartenberg (2006), Mulhall (2002), Frampton (2006), and Smuts (2009), and has been criticized by Russell (2000), Murray Smith (2006), Livingston (2009), and Davies (2009). However, in the practice of much of philosophy of film, both approaches often intermingle. Film as a philosophic experience or philosophy as a filmic experience often appear as two sides of the same coin.

Films are similar to dreams and much of what the philosophical tradition has said about the “reality of the outer world” and its skeptical evaluation can be demonstrated through reflections on film. Here film theory maintains a strong contact with philosophy. The distinction between appearance (that is, dream and poiesis) and reality has been on the agenda of film theory for almost eighty years. The “reality and dream” problem is not limited to the subjectivist approach that perceives film as a manifestation of fantasies and hallucinations. However “realists” like André Bazin have classified film as real, because film is able to capture an authentic reality independent of human subjectivity (see below). Comparisons of films and dreams can reveal new concepts of space and time because dreams seem to take place in an intermediary domain of abstractness and concreteness. Film is not, according to Amédée Ayfre, a thesis about the world; rather it presents the world. Any abstractly existential stance contained in this presentation makes film a philosophical phenomenon. Psychoanalysis represents another approach towards dreams and has a status within the philosophy of film which will be explained below.

a. Continental vs. Analytical

Analytical philosophy of film has been unwilling to identify psychoanalytical elements and some sociological elements, in particular approaches of critical theory or ideology critique, as “philosophical” because it deems that this theory does not satisfy more scientific standards. While a continental film scholar like Christian Metz would find the results of the French school of filmology relevant for the most scientific forms of experimental psychology, analytical philosophy usually rejects continental models based on language. These models are usually derived from the Saussurean model of signification and are present in poetics, semiotics or Lacanian psychology. Instead analytical philosophy  favors either cognition-based models or more “classical” questions related to problems such as the ontology of film, analyses of movement, realism, the nature of filmic representation, and explanations of our emotional engagement with fiction. Further, there is in analytical philosophy a strong desire to limit work on the philosophy of film from critical theory and cultural studies practiced in English departments. This problem never occurred in the more eclectic and naturally open field of continental philosophy of film.

Still, in many cases, the distinction between continental and analytical is not easy and does not always pass smoothly along the lines of a European and an American tradition. Although David Bordwell is a cognitive philosopher,  initially he was inspired by Russian formalist terminology and was himself a subject of interest for European semioticians. Though he is one of the most scathing critics of the continental paradigm in film theory (together with Carroll he coined the term “SLAB theory” to refer to theories that use the ideas of Saussure, Lacan, Althusser, and Barthes) (Bordwell & Carroll 1996), his early work is “structuralist” or “Foucaultian” in spirit as it analyzes the rules governing the practice of institutional film criticism/theory.

b. French Philosophy of Film

French philosophy has played an outstanding role in the development of a philosophy of film. Henri Bergson was the first philosopher who adopted film as a conceptual model for philosophical thought. Cinema helped him to imagine the distinction between spatialized time and duration, an idea that would remain essential for his entire philosophy. Though Bergson’s ideas bear no relation with the more contemporary language-based models of reason (and his interpreter Gilles Deleuze never used them in that way), Bergson’s thought fused with the remaining field of French philosophy of cinema in an often paradoxical fashion. Though French philosophy of film is composed of diverse elements, French or even continental philosophy of film can appear as amazingly coherent. Deleuze’s Bergsonian concept of the “time-image,” for example, is very much compatible with ideas elaborated by the Russian director Andrei Tarkovsky who derived his insights not from Bergson, but from a critical evaluation of Russian formalist film theory.

In 1946, Gilbert Cohen Séat published the first monograph on the philosophy of film (Essai sur les principes d’une philosophie du film). Jean Epstein’s L’Intelligence d’une machine, a truly philosophical book on film, appeared in the same year. In 1948, the interdisciplinary Parisian École de Filmologie began approaching film from a sociological, psychological, and philosophical angle. Filmology can be considered as a precursor of the semiology of cinema.

In the 1960’s and 1970’s, many principal theoretical discussions (such as auteur theory and genre theory) were developed in the Parisian journal Les Cahiers du cinéma, co-founded in 1951 by André Bazin. It was in this journal that French philosophy of film began to produce its characteristic mixtures of structuralism, semiotics, psychoanalysis, and Marxism. Paradoxically, the affiliation of filmology with semiotics and psychology – provocative as it might have seemed – provided the philosophy of film access to academia in the 1960’s. For European state-financed universities, film studies could never have been the economic boon as they have been in the United States. French philosophy departments remain resistant in the 21st century and permit the teaching of the philosophy of film only in institutions that are considered off the mainstream or within the newly founded – relatively small – discipline of aesthetics. While the “sociological temptation” exists, overall, French philosophy of film has remained very distinct from the sociology of cinema.

c. Philosophy of Film in Other European Countries

French philosophy of film is a unique phenomenon; no other European country has produced a similarly philosophical output on film. In Italy, Umberto Eco and Pier Paolo Pasolini published writings on film in the 1960s that would quickly be integrated into the French discourse. In Germany, the journal Filmkritik (launched in 1957), was the German equivalent of the Cahiers du cinema, though earlier writings by Frankfurt School members Siegfried Kracauer and Walter Benjamin turned out to be most compatible with French philosophy of film. Different from French and American postwar developments, in Germany, film studies never became institutionalized nor have they developed a consistent link with the praxis of film. German Filmwissenschaft (filmology) – most prominently represented by Thomas Elsaesser – developed along the lines of comparative literature, theater science, and art history rather than plunge into an adventurous speculative discourse. It would be absorbed into academia through the disciplines of Media- and Communication Science. The activities of Filmwissenschaft are diverse though its dominant tendency may be characterized as sociological whereas systematical film analysis is highly empirical. Aestheticians working in German philosophy departments (where the main tendency is analytical) continue the philosophical considerations of film, though hermeneutic film analysis has had some impact.

2. An Overview Of Theories

a. Henri Bergson

In the early 1900’s, Henri Bergson (1859-1941) developed the concepts of “movement-image” and “time-image” (in Matière et memoire), both of which anticipated the development of film theory. Bergson declares the image to be superior to the concept because the image is able to evoke thought content in a more fluent and less abstract fashion. In lectures held at the College de France between 1902-03, Bergson briefly refers to the possibility of “comparing the mechanism of conceptual thought with that of the cinematograph” (now in L’Evolution créatrice, 1991, p. 725, note 1). Bergson’s main philosophical theme is that temporality should be thought of as  independent from concepts of spatiality. Bergson contrasts duration, as it is experienced by the human consciousness, with scientific definitions of time, the latter of which, in his view, tends to “spatialize” time. Ironically, Bergson would later reject any possibility of using film as an exemplification of his ideas, in an essay entitled “The Cinematographic Illusion” (also in L’Evolution créatrice). Subsequent developments of Bergson’s ideas on duration by Epstein, Sartre, or Deleuze go, strictly speaking, against the grain of his original thought on cinema.

b. Hugo Münsterberg

The German-American philosopher and psychologist Hugo Münsterberg (1863-1916) wrote the first book on the philosophy of film entitled Photoplay: A Psychological Study (1916, German: Das Lichtspiel: eine psychologische Studie, 1916). Active during the silent film era, Münsterberg attempted to establish cinema as an art form that is different from theater or photography. Coming to America at the age of thirty-four, Münsterberg had clearly been under Neo-Kantian influence in Germany. Photoplay is divided into two parts, the first of which is inspired by experimental psychology dealing with the mental functions of the spectator. This first part is a precursor to the cognitive theory of film. The second part bears clear traits of Neo-Kantian aesthetics as it analyzes, in a formalist fashion, film’s form and function. The Neo-Kantian input became obvious through Münsterberg’s conviction that film is the mirror of the mind and not that of the world; the goal of cinema is not to reproduce reality, but to materialize emotions. Münsterberg theorized about how close-ups and flashbacks parallel acts of consciousness (that is affect, memory) and formulated the ‘film/mind’ analogy that was much explored and criticized by other philosophers (for example Carroll, 1988b).

c. Formalist Approaches

i. The Russian Formalism Tradition

Russian Formalism designates a school of innovative linguists and literary critics. It developed out of modernist movements such as Russian symbolism and constructivism. From 1915 to 1930, both the Moscow group (led by Roman Jakobson) and the St. Petersburg Society for the Study of Poetical Language (OPOJAZ), which included notorious members like Viktor Shklovsky, Osip Brik, Boris Eikhenbaum, and Vladimir Propp, applied newly invented formalist linguistic methods to the study of literature and poetry. Rediscovered in the West in the 1960’s, the work of the Russian Formalists has had an important influence on structuralist theories of literature, and on some of the more recent varieties of Marxist literary criticism.

Several Russian Formalists wrote on film mainly by establishing analogies between film and language. The idea to interpret film as language goes back, among other things, to Shklovsky’s reaction to the writings of the Ukrainian linguist Alexander Potebnja (1836-1891) who thought of poetry as “thinking in images.” According to Shklovsky, this leads to the false assumption that the creation of symbols is the main cognitive occupation. Shklovsky suggested seeing the activity of thinking as a more abstract and relational way of thinking. This model of thinking comes close to a formalist idea of film. Finally, all artistic activ­ity should be seen as a creative reorganization of pre-aesthetic materials: art need not express new content but should make a strange habitualized form.

For formalism, cinematic time (or cinematic reality altogether) is incompatible with naturalist representations. It is not staged, nor does the director transfer reality on the screen by means of intuition (as does, for example, impressionist painting). For the formalists, time is created through montage. Technical terms such as ‘defamiliarization’ (ostranenie) or the ‘story’ (fabula) as opposed to the ‘plot’ (sjuzhet), have become important in Western European and American film studies. Both Christian Metz (1975) and David Bordwell (1985) borrowed heavily from formalism.

1. Sergei Eisenstein

The director Sergei Mikhailovich Eisenstein remains a central figure in formalist film theory and by helping to develop the above idea of time created through montage. One of Eisenstein’s aims was to overcome, in a formalist fashion, “intuitive creativity” through “rational constructive composi­tion of effective elements” (1988a, I, p. 175). Eisenstein designated artistic activity as the process of organizing raw material. A large part of this cinematic theory is based on a principle similar to what Russian Formalists called ostranenie (alien­ation, estrangement, German: Verfremdung). According to Eisen­stein, within every shot there is a conflict between an object and its spatial nature or between an event and its temporal nature. As a consequence of this conflict, cine­matic time does not exist as “real time,” but must be experienced as an artistic-technical device. For Eisenstein, montage will never produce a “rhythm” or regularly patterned series of shots because such series would still rely too much on “artistic feeling” or empathy.

Eisenstein was the first person who attempted to see cinema as a thinking process. Recognizing that film reconstructs the actions of the human mind, Eisenstein perceived montage form as “a reconstruction of the laws of the thought process” (1988c, p. 236). His theory of dialectical montage (directly referring to Hegel and Marx) suggests that a third idea can emerge from the presentation of two conflicting shots. Related to this is his concept of “shock to thought” which occurs when the conflict between two shots forces us to think its synthesis (1988b, p. 45). Deleuze takes this idea up in Image-Temps and explains that this shock forces thinking to think itself as well as to think the whole (p. 207). Another famous concept is Eisenstein’s idea of cinema as a revolutionary ‘kino-fist’ formulated in reaction against his rival Dziga Vertov’s Kino Eye group (see below). It is difficult to summarize the entire body of Eisenstein’s writings as they teem with unpredictable insights and can certainly not be reduced to formalist recipes towards technicality.

2. Vsevolod Pudovkin

Vsevolod Illarionovitch Pudovkin (1893-1953) elaborated in the late 1920s and early 1930s on a theory of film based on narrative and spatiotemporal continuity. In Kinorezhisser i kinomaterial (1926), Kinoszenarii: Teoria szenarija (1926), and Akter v fil’me (1934) (published in English as Film Technique and Film Acting), Pudovkin examines such devices as contrast, parallelism, and symbolism. Several of Pudovkin’s formulations influenced Rudolf Arnheim (see below).

3. Lev Kuleshov

The idea that editing constitutes the “essence” of film art originated with the Russian director and theoretician Lev Kuleshov (1899-1970) who experimented with montage in the 1920s in an almost scientific fashion and is also one of the key exponents of the ‘film as language’ idea. Thoughts on montage are expressed as early as 1920 in his study “The Banner of Cinematography” (Engl. in Selected Works, 1987, Moscow). Kuleshov was able to show with the help of experiments that an isolated shot has no meaning (the famous Kuleshov effect).

4. Yuri Lotman

The Russo-Estonian semiotician Yuri Mikhailovitch Lotman (1922-1993) set out to reformulate the Saussurean notion of the sign by establishing a relationship of necessity between the signified and the signifier (which Saussure believed to be arbitrary). Lotman was the main proponent of the Tartu-Moscow School and his work is directly linked to the Russian Formalist tradition. He is perhaps most noted for the coinage of the term “semiosphere.” In his Semiotika kino i problemy kinoestetiki (1973) (Semiotics of Cinema), Lotman distinguished between the different levels of illusion and reality in film by analyzing the cinematic shot, narration, as well as the ideological function of cinema as mass-media. His distinction between pictorial/iconic signs used in the visual arts and conventional signs (words, letters) as used in literature had a central position in his work. Lotman also insisted that the cinematic language of a film is always related to exterior cultural codes.

ii. Béla Balász: Film as Language

Béla Balász (1949-1984) was a Hungarian film aesthetician who wrote in Hungarian and German. His books, Der sichtbare Mensch (The Visible Man, 1924) and Der Geist des Films (The Spirit of Film, 1930) were the first theoretical books on film after Münsterberg’s Photoplay. They remain the founding stones of modern film theory though they have been translated into English only in 2010. Balász’s book Theory of the Film (Hungarian, 1948, English 1953) picked up threads from the earlier books and brought him posthumous international fame. In general, Balász strove to offer to modern man possibilities of overcoming his particular state of estrangement by designing a utopian visual culture in which film plays an essential role. Balász’s ambition to describe film as a language brought him close to the Russian Formalists; he was actually able to advance views on montage that would be too mechanistic even for Eisenstein’s standards. However, a genuinely philosophical component enters his work through complex reflections on cinematic “reality.” In Theory of the Film, Balász wrote: “Although objective reality is independent of the subject and his subjective consciousness, beauty is not merely objective reality, not an attribute of the object entirely independent of the spectator, not something that would be there objectively even without a corresponding subject even if there were no human beings on earth” (p. 33). Malcolm Turvey classifies Balász (together with Jean Epstein, Dziga Vertov, and Siegfried Kracauer) as a “revelationist” because for all these theorists, the cinema is a means of enlightenment: it escapes the limits of human sight and reveals the true nature of reality. To some extent, the particular way of tackling the reality problem in film was determined by Balász’s interest in dreams. Two of Balász’s prose works are entitled Youth of a Dreamer and Fairytale, Ritual, and Film, the latter of which testified to his interest in film as “otherworldliness.” Without giving in to mystification and decidedly refusing fades or dissolves, Balász remained fascinated by film’s “ability to transform all things in space into bearers of expressions” and interpreted some original scenes psychoanalytically, such as substantial dream images (cf. Loewi, p. 318).

iii. Mikhail Bakhtin: Dialogicity and Reception

The ‘Bakhtin School’ theorists Mikhail M. Bakhtin, Pavel N. Medvedev, and Valentin N. Voloshinov developed elements of formalism by emphasizing the dialogical and polyphonic character of texts and cultural phenomena. Though the Bakhtin School did not write explicitly on film, it should be mentioned here for two reasons. First, Mikhail Bakhtin’s (1895-1975) writings had indirect influence on the film semiotics of Julia Kristeva and Tzvetan Todorov. Bakhtinian concepts of dialogism and polyphony are crucial because they can help to address fundamental questions about film form and reception. Second, film is an extremely good example for polyphony. While in the novel we most typically encounter the monologic narrator, in the film, the narrative, the character’s appearance, the dialogues, and several other elements appear simultaneously within one time frame (cf. Vice, p. 142). Also, Bakhtin’s central notion of the chronotope exemplifies the fusion of time and space that is typical for film.

iv. Rationalism: Galvano Della Volpe

The Italian Marxist philosopher Galvano Della Volpe (1895-1968), who in 1954 published a book called Il verosimile filmico e altri scritti di estetica, designed a rational aesthetics of film emphasizing unity, coherence, and harmony. Della Volpe formulated an organic theory of literature affirming the rational instead of the sentimental value of the work of art. Art is not distinct from science because both are based on the unity of the concept (in Critica del gusto). Central is Della Volpe’s critique of Crocean idealism, but also of materialism. Della Volpe established an epistemology of art that voluntarily remains indifferent towards social contexts or contents and excels in the analysis of technical, structural or formal processes in art. Della Volpe drew greatly from Aristotle’s Poetics. Emphasizing the analysis of formal aspects, Della Volpe pointed out film’s capacity to present the world symbolically as well as its ability to express abstract concepts. The latter happened for him mainly through montage. Similar to Jean Mitry, Della Volpe criticized the reduction of film to language; furthermore he was inspired by Pudovkin’s distinction between ‘plasticity’ and ‘concreteness’ of filmic images. Finally, Della Volpe established cinematic verisimilitude as non-equivalent to reality.

v. Semiotics/Semiology

Semiotics is the study of signs and symbols and examines how meaning is created and communicated through signifying processes. In France, the term sémiologie is more often used than the term sémiotique, though there does not seem to be a clear distinction between the terms. The discipline of semiology goes back to Ferdinand de Saussure whereas semiotics was founded by Charles Sanders Peirce, who was later taken up by Deleuze (rather than Saussure).

1. Gilbert Cohen-Séat

In 1946, Gilbert Cohen-Séat published the first monograph on film which bears the word “philosophy” in its title (Essai sur les principes d’une philosophie du film). The Revue internationale de filmologie was founded soon afterwards and Cohen-Séat’s name remains linked to filmology. Arguably, his most important contribution to film studies was the distinction between “filmic facts” and “cinematic facts”: “The filmic fact consists of the expression of life (the life of the world, the spirit, the imagination, of beings and things) through a system of combined images (visual-natural or conventional- and auditory-sounds and words). The cinematic fact, instead, consists of social circulation of sensations, ideas, feelings, and materials that come from life itself and that cinema shapes according to its desires,” (Essai…, p. 57). Cohen-Séat’s scientific approach made him a precursor of the semiotics of cinema.

2. Roland Barthes

Roland Barthes’ influence on film theory is similar to that of the Bakhtin school. Though having written almost nothing on film, his narratology that strives to see theatre, photography, and music as texts has deeply influenced our ways of seeing film as the interplay of voices or codes. Paradoxically, this thinker who mused so much about “floating signifiers,” found very disturbing the fact that in films, signifiers are in motion: “The filmic, very paradoxically, cannot be grasped in the film ‘in situation’, ‘in movement’, in its natural state’, but only in that major artifact, the still” (1977, p. 65).

3. Christian Metz

Christian Metz (1931–1994) was the initiator of a film analysis that relies in the most outspoken way on semiotics and is, like Barthes, what the French call a “sémiologue.” In general, Metz’s approach towards cinema has become the prototypical example of a quasi-scientific form of film theory. Apart from linguistic structuralism, Metz borrowed from psychoanalysis, in particular from Jacques Lacan and his writings on the mechanisms of dreams and voyeurism. This allowed him to explain perceptual phenomena of the filmic narrative from the point of view of the perceiving subject and take into consideration the unconscious processes of desire that allegedly position the spectator ideologically. Metz is particularly renowned for his “grand synagmatics” (grande syntagmatique) through which he categorized the most frequent codes and signifying units (called ‘syntagmes’) in cinema. Metz established the single shot as the smallest unit and the entire film as the largest one. He classified all syntagmes in distinct categories, distinguishing, for example, between chronological and a-chronological syntagmes.

According to Metz, cinematic language (langage) is not constituted by all elements that appear in a film, but only by those things that can only appear in film. Film analysis should highlight only those signifying figures that are truly cinematographic. Metz questioned Eisenstein’s vision of cinema as a langue because a langue is a highly organized code, whereas language covers a much broader area. Film should not be seen as langue because cinematic signs are reinvented or are updated in every film (Essais sur la signification, p. 47). Some might find Metz’s approach anti-philosophical because it so vehemently denies the possibility of phenomenological considerations. On the other hand, Metz’s work can be considered as philosophical because it deals with ethical implications (employing Marxist themes) and it extensively discusses the matter of reality and of dreams through a Freudian perspective.

4. Pier Paolo Pasolini

Pier Paolo Pasolini (1922-1975) was, and remains, a relatively little known as a theorist in the United States, but he held a central position in European film theory. Under the influence of Barthes, Metz, and Gramsci, Pasolini’s essays on cinema collected in the book Empirismo Eretico (1972) are exercises in semiotics that raised, in the 1960’s, a debate involving Umberto Eco (1967) and Metz (1968). When Pasolini postulated that cinema is the “written language of reality,” he was not intending to establish cinematic reality as a sort of cinematic language, nor is he preaching realism. Pasolini wants to conceive the real as cinematic. Reality is the discourse of things that cinema re-narrates. This project is highly philosophical and, famously, in 1967 Pasolini remarked, in the Journal of the Communist Party, that “semiotics has not taken the step which would lead it to become a Philosophy.”

d. Cinematographic Ontology and Phenomenology

i. Phenomenology

Phenomenology is a philosophy that goes back to Edmund Husserl and takes as a point of departure a sort of “experience” that is strictly designed as the sensible intuition of phenomena. On the basis of this prescription, phenomenology attempts to understand the essence of what is experienced. Martin Heidegger, though influenced by Husserl, interpreted phenomenology as an ontology, that is, as a discipline attempting to understand the Being of ourselves as Dasein (existence) and preparing for an understanding of the meaning of Being as such. His version is called “existential phenomenology.” Both Husserl’s and Heidegger’s phenomenology remain critical towards metaphysics. In the philosophy of film, the phenomenological, “synthetic” approach has often been opposed to the “analytical,” semiotic one.

1. Maurice Merleau-Ponty

Like Bergson, Merleau-Ponty (1908-1961) criticized all attempts of representing the world in a purely scientific fashion as reductive. In film, the meaning of an image depends on the preceding image and their succession creates a new reality. Merleau-Ponty has had a considerable influence on Cohen-Séat (see Andrew 1978: p. 46). In 1945, Merleau-Ponty devoted a lecture to the possibilities of phenomenological interpretations of cinema (“Le cinéma et la nouvelle psychologie” in Sens et non-sens) where he depicted film not as a sum of images, but as a temporal phenomenon. The central term in Merleau-Ponty’s musings on film is the idea of immersion. For the phenomenologist, humans are thrown into a life world to which they remain attached in the most natural and inconspicuous way. Film provides a phenomenological experience par excellence because in the cinema the human consciousness is consistently immersed in a world. Merleau-Ponty’s ideas never developed into a phenomenological theory of film, but have inspired theorists of Neo-Realism, such as Ayfre and Bazin and have been developed by Vivian Sobchack (1992).

2. Amédée Ayfre

Amédée Ayfre (1895-1963), a student of Merleau-Ponty, rejected formalist components such as style as essential characteristics of cinema and attempted to establish a “phenomenological realism” (Ayfre, 1964: 214) in film studies. In his Conversion aux images (written together with Henri Agel) Ayfre explicitly referred to the phenomenological ambition of strictly adhering to mere descriptions of experience without being influenced by either scientific or psychological considerations of causes (pp. 212-13). Ayfre’s notion of écriture as it appears in the cinéma d’auteur, designated “neither a content nor a style” (1969, p. 162), but attempted to go beyond the division of film into form and content. Phenomenologi­cal existence transgresses constructive devices as well as stylization and Ayfre’s sympathies clearly went towards Neo-Realist cinema (see below). As a Jesuit priest, Ayfre strove to reconcile his existentially-minded phenomenology with his Catholicism. Ayfre’s phenomenological realism is meant to depict a “spiritual” reality and not just a “real” one; it creates an illusion due to a “prodigious asceticism of means.” Ayfre’s concept of realism is diametrically opposed to formalist ideas about “realistic” narrative modes. David Bordwell, for example, believed that realistic expressions can be grasped best through norms and codes which vary according to different criteria, but remain formalizable in the last instance. “Realistic” motivations will be applied according to what the given narrative mode defines as realistic. For Bordwell, “verisimilitude in a classical narrative film is quite different from verisimilitude in the art cinema” (Narration in the Fiction Film, 1985, pp. 153-54). Ayfre, on the other hand, saw cinematic reality as the internal logic of a universe created by the director. Ayfre’s realism was also opposed to psychological interpretations because the reality we encounter in a film is more than merely the subconscious emanation or the reverie of a director. Ayfre’s younger collaborator Henri Agel (1911-2008) pursued a long publishing career and never abandoned the initial tendency, defining himself until the end as a “humanist and a spiritualist”.

ii. Realism:

Realism attempts to recreate life in art by relying on realistic presentation while minimizing controlling devices in the process of artistic production. In literature, realism has been developed by Balzac, Flaubert, and Zola. In film, Italian Neo-Realism is the most important realist movement.

1. André Bazin

The establishment of neo-realist film theory was mainly been the task of André Bazin (1918-1958) and Ayfre. Bazin’s realism was similar to Ayfre’s phenomenological realism, more so since he accepted Merleau-Ponty’s idea of reality as “the pure appearance of everything that is in the world” (“la pure apparence des êtres au monde” (Questions IV, p. 62). In 1957, Bazin openly adopted Ayfre’s expression “phenomenological realism” in order to label the kind of films in which “reality is not modified according to psychological functions or dramatic requirements” (Questions IV, p. 138). Dudley Andrew writes that “Bazin saw in realism a kind of style which reduced signification to a minimum. In other words, he saw the rejection of style as a potential stylistic option” (Andrew, 1976, p. 143).

Together with Alexandre Astruc, Bazin was also responsible for the so-called “auteur theory,” which grew mainly out of the two writers’ works and since has been central for French New Wave cinema. François Truffaut and other Cahiers authors joined later on, as well as Andrew Sarris in the USA. Bazin’s realism had a metaphysically significant backdrop. Like Ayfre’s phenomenological realism, Bazin’s phenomenological ontology was strongly marked by religious convictions. Sometimes terms like “real presence” or “revelation of reality” appear to have almost religious connotations. In this sense, Bazin’s philosophy was essentialist, in spite of the existentialist stances to which he was submitted. Bazin remains one of the most anti-formalist theoreticians. In his opinion, neo-realist films “aim to reduce montage to zero and to transport to the screen reality in its continuity” (p. 146).

2. Siegfried Kracauer

Siegfried Kracauer’s (1889-1966) writings on film were based on sociology and psychology. His conviction that film has a realist character made him a main proponent of cinematic realism and brought him close to Bazin (whom he never mentions). Kracauer based his realism on the resemblance that he perceived between film and photography. For post-surrealists critics such as Bazin and Kracauer, cinema should look for its genuine expression within a kind of realist expression that can be described as the exact opposite of the surrealist cinema of dreams and symbolisms. Interestingly, in the end, Kracauer defined an alternative form of dreamlike realism able to transmit reality “as if it were a dream”. In his Theory of Film he wrote: “Perhaps films look most like dreams when they overwhelm us with the crude and unnegotiated presence of natural objects—as if the camera had just now extricated them from the womb of physical existence and as if the umbilical cord between image and actuality had not yet been severed. There is something in the abrupt immediacy and shocking veracity of such pictures that justifies their identification as dream images” (p. 224).

Like Ayfre, Kracauer was opposed to any stylization of reality: the realistic character of dream elements is compared to those objects that we can find in nature. Kracauer’s realism was not based on calculations with possibilities of literal reproductions, but on the director’s ability to capture a certain spiritual quantity that is supposed to be enclosed to reality. Ayfre’s phenomenological realism, which strove to depict a “spiritual” reality, and not just a “real” one is echoed by Kracauer’s distinction between “photographic reality” and “camera reality” (Theory of Film, p. 150). For Kracauer, it was important that the “the form giving tendency does not rise above the realistic one” (p. 67). Ayfre, Bazin, and Kracauer engaged in paradoxical projects: Ayfre attempted to apply an aesthetic asceti­cism on reality without applying stylization; Bazin attempted to retrieve style by rejecting style; Kracauer attempted to establish a paradoxical balance between realistic and formalizing tendencies.

3. Dziga Vertov

Dziga Vertov’s theory of the “cinema of fact” can be considered a form of cinematic realism. In 1919, Vertov founded the group Kino-oki (“cinema-eyes”) insisting, in various manifestos, that the cinema of the future will not be that of stars and of fiction, but a cinema of facts. Vertov developed a concept of cinema as “life caught in awareness,” in which the camera eye innocently captures reality “as it is” without stylizing it. This clearly joins the ambitions of the neo-realists. The Russian director Andrei Tarkovsky, who was strongly influenced by Vertov and by Russian “Documentary Aesthetics” in the 1960’s, has indeed been likened to Bazin and neo-realism. Jon Beasley-Murray points out that Tarkovsky restored the “actuality of time” by constructing a subjectivity by which this reality is inhabited (1997, p. 47).

iii. Existentialism

1. Alexandre Astruc’s Caméra-Stylo

A disciple of Bazin, Alexandre Astruc (1923-) established in his essay “La Caméra-stylo” (1948) an aesthetics of cinema that based its expressions on the idea of writing rather than on conventional conceptions of the image. The language of film can be shaped until it becomes as subtle as the language of literature. Cinema is not a consecution of images. Instead, it adopts more abstract characteristics because it is able to integrate abstraction in itself. Abstraction is no longer present as an underlying structure of the film (as is the case with montage), but it expresses itself directly: “By language I mean a form in which and by which an artist can express his thoughts, however abstract they may be, or translate his obsessions exactly as he does in the contemporary essay or novel. That is what I would like to call this new age of cinema, the age of caméra-stylo” (Astruc, 1968, p. 18).

Like Bazin’s and Kracauer’s, Astruc’s strategy is directed against surrealist cinema. But it is also directed against conventional documentaries because the caméra-stylo intends to grasp “any kind of reality.” In a film “recorded” by a caméra-stylo there is no evocation of subjective, intimate symbols; nothing has been produced by the artist through the direct transposition of an inner reality; nor is there is any objective recording of reality. Film is not a documentation undertaken from a detached point of view located outside the things filmed. This is why the camera works like a pen: it records concrete reality, but then transforms it instantly into an abstraction. A caméra-stylo can produce an “écriture,” but it would be too simple to say that a film is transformed into literature only because the camera is transformed, in a metaphori­cal way, into a pen. The use of the camera as a pen has more to do with the interplay of realization and abstraction or, to use another term, of stylization. In other words, the writing camera produces style. This makes Astruc’s theory different from “anti-stylization” movements such as realism and (to some extent) phenomenology. However, it is important to understand that the caméra-stylo philosophy does not preach formalist stylization, but attempts to capture style, which makes it, paradoxically, compatible with some realist and phenomenological stances. Marcel Martin goes as far as saying that the “new language” which cinema has discovered makes sense as long as style turns out to be the main protagonist of this medium. “The real main character of this cinema is thus the style (…). The ‘poetic cinema’ (cinéma de poésie) is thus essentially founded on the exercise of style as inspiration” (Martin, 1967, p. 68).

iv. Film as Thought

Jacques Aumont, in A quoi pensent les films?, insists that “film has the power of thinking”. In the introduction of this article, the idea to see film as a form of thinking has been addressed as an important part of the philosophy of film that begins with Eisenstein.

1. Jean Epstein: Film as Artificial Intelligence

The avant-garde director Jean Epstein (1897-1953) remains better known for his theoretical writings than for his films, many of which are no longer extant. Together with Eisenstein, Epstein is a precursor of “cinema as thought” theories. Mixing Eisenstein’s thoughts with Marxism and the intellectual potential flowing out of the French Impressionist School of Film active during the 1920’s, Epstein designs a philosophy that is more reminiscent of Bergson than of formalism and can even be called anti-semiotic. Film is much more than a text or a writing; it is a machine able to produce a dreamlike reality by unhinging the most basic rules of logic and time, and by overcoming human reason.

In his major work L’Intelligence d’une machine (1921), Epstein discovered that cinema as a thinking machine is able to liberate us from the constraints of logic in order to produce a poetic and dreamlike reality. Cinema manipulates space and time. Comparing film with the microscope, it seems that Epstein anticipated contemporary computer reality or virtual reality. Cinema is a “robot brain” (p. 71) able to transcend the physical and mental limits of humans. The cinematograph, like the calculator, is the first materialization of “machines for thinking” (p. 48) and more complex ones will follow. Obviously inspired by Bergson, Epstein observes how cinema stretches and condenses duration and lets us feel the variable and relative nature of time. Cinema thinks time  is a “partial mechanical brain” that develops a “rich philosophy full of surprises” (p. 71). The reality that cinema thinks is “the sum of many irrealities”. For some, Epstein’s theory might smack a little too much of pseudo-science but he plausibly defines cinema as a “machine for producing dreams” (p. 55).

2. Jean Mitry

Some might remember Jean Mitry (1907-1988) as an “anti-semiologist” film thinker because of his harsh criticism of Christian Metz in La Sémiologie en question (1987). Still this theorist and film maker has his own semiotic past. In 1963, the publication of the two volumes of Esthétique et psychologie du cinéma was seen as a major event in the world of French cinema theory. The highly synthetic book deals with all thinkable subjects and accepts some of Bazin’s realism as well as several formalist patterns, but does not – though the title might suggest this – engage in Freudian psychoanalytical elaborations. Mitry’s method was that of perceptual psychology and his refusal of Bazin’s ambition to discover in cinema a “world beyond the world” is absolute. A philosophical input comes from another side. Mitry was asking himself if “filmic language does not reflect thought in the way in which it produces itself [‘en train de se faire’] much better than could do words (…) which only crystallize thoughts in the form of more or less independent ideas by translating a thought that is already achieved” (p. 90). It becomes clear here that Mitry working within the thread of those “Cinema as Thought” theorists presented in the present sub-chapter. Mitry found cinema to be unique in that it signifies only while functioning (p. 63). It is thus consistent that he reproaches, in La Sémiologie en Question, Christian Metz (his admirer) his reductive application of linguistics to film because the language of film is not based on words. Mitry’s position can be summarized as that of a phenomenology of perception and his work often proceeds synoptically with the precision of a historian.

3. Gilles Deleuze

Deleuze’s (1925-1995) analysis of cinema was founded on Bergson’s work Matière et memoire and C. S. Peirce’s taxonomy of signs. Deleuze’s engagement with both represents a rupture with the Saussureian linguistic semiotic tradition. Deleuze rejected “linguistic” as well as psychoanalytic models of film theorisation. Like Mitry, Deleuze believed that film, as it represents and reflects on time, is incompatible with language; for him, the pre-signifying ‘signaletic material’ that films are made of was not assimilable to models of semiotics. In this regard, Deleuze anticipated a number of analytic-cognitivist film theorists.

Similar to Epstein, Deleuze believed that film can alter our modes of thinking about movement and time. In a Bergsonian way, Deleuze put forward that cinematic experience was as a means to perceive time and movement as a whole. The starting point of Bergson’s philosophy was the distinction between temporal and spatial reality. Bergson modified the traditional relationship between unity and repetition by attributing singularity to time and discontinuity to space. Lived time and measured time should not be confused. Bergson’s thesis is that the human who looks at the clock perceives merely juxtapositions of different positions of the hand, which bear no link among themselves. Only the ‘I’ (consciousness) experiences a time whose essence is duration. Correspondingly, Deleuze held that in cinema, our mind does not need to put together the successive percepts or sensations it perceives; rather it receives them as a whole.

Deleuze seemed to be going back in time as he attempted to base his theory of cinema on a point before the semiotic tradition and even before Shklovsky’s formalism which strove to overcome Potebnja’s model of “thinking in images.” More precisely, Deleuze held that films do not think with simple images, but with movement-images and time-images. In his two monumental books on cinema, “Image-Mouvement” (Cinéma I) and “Image-Temps” (Cinéma II), Deleuze engaged the entire history of cinema in order to show a difference between these two types of images. Frequently, we encounter the movement-image, which is based on a sensory-motor scheme (it shows an action, which produces a reaction). In the movement-image, the action imposes itself upon time, it is the action which determines the duration of a scene and the next scene is a reaction to this action (see Deleuze 1984). The time-image, on the other hand, is based on pure thinking. The time-image emerged in cinema after WWII mainly with Italian Neo-Realism and French New Wave cinema. It does not follow the scheme of action-reaction, but it can evoke a time that is prior to movement. Time-images do not simply show us actions and movements, but different layers of time, all of which converge within single points of present. These images can express a present that constantly reaches for the past and for the future.

Deleuze brought film studies closer to philosophy. For Deleuze, film was superior to other arts because it combines time and movement in such a necessary fashion. More than that, cinema must be considered as a philosophy because it constructs its own “concepts.” Cinema is not an applied philosophy submitted to traditional philosophical concepts, but it develops “cinematic” concepts. It does not simply represent reality, but is in itself an ontological practice. Therefore cinema is more than simply an art. In this sense, Deleuze was doing neither film theory nor aesthetics, but he thinks of film as a philosophy. In Deleuze’s earlier writings, art is able to draw on concepts and therefore not confined to percepts or ‘blocs of sensation.’ However, even here Deleuze did not recommend extracting as many concepts as possible from cinema: “A theory of cinema is not about cinema but about the concepts sparked by cinema” (p. 365). Later, in What is Philosophy? (1991), Deleuze (and his co-writer Felix Guattari) reserved “concepts” for philosophy only and declared that cinema thinks with affects and precepts.

Cinema’s highest objective is nothing other than promoting thought and the functioning that comes from thinking. This is what makes cinema different from mere dreams which, in Deleuze’s view, are not thinking. Bad films do nothing more than induce a dream in the spectators (Image-temps, p. 219), they simply reiterate sensory-motor clichés that provoke neither thought or affect.

4. Filmosophy

More recently, Daniel Frampton has added a brick to the wall called “Film as Thought” by insisting that film does not narrate or show things, characters, or actions. Instead, it thinks them. When watching a film we observe a thinking process. In his book Filmosophy (2006) Frampton attempts to grasp this cinematic thinking process with the help of newly coined concepts such as ‘film-thinking’ and ‘filmind’ and assigns to ‘filmosophy’ the task of “conceptualizing all film as an organic intelligence” (p. 7). Film-thinking is not a metaphorical way of arranging reality, but “the filmind has its own particular film-phenomenology, its own way of attending to its world” (p. 91). There is a film-like way of thinking. Philosophers like Bachelard, for example, were able to produce “a flow that weaves discourses together, yet still with rigor and meaning” (p. 179). Although philosophers can learn from film-thinking, Frampton insists that “film has its own kind of thinking” (p. 23), that it “cannot show us human thinking, [but that] it shows us ‘film-thinking’” (p. 47). Thinking is radically removed from the activity of merely processing data.

v. Film as Poetic Art: Susanne Langer

Susanne Langer’s philosophy of film drew very much upon the analogy of film and dream. Langer was an American pragmatist philosopher close to the continental tradition. She was also a frequent reference point in analytic philosophy of film (for example, in Carroll). For Langer, cinema is an art that has introduced the “dream mode” as its main artistic expression, a mode that finally enables cinema to estab­lish itself as an independent medium. Still, film is not a daydream – which is but a wilful imitation of a dream. In her essay “A Note on Film,” Langer wrote: “[Film] is not any poetic art we have known before; it makes the primary illusion—virtual history—in its own mode. This is, essentially, the dream mode. I do not mean that it copies dream, or puts one into a daydream. Not at all, no more than literature invokes memory, or makes us believe that we are remember­ing’’ (p. 200).

e. Psychology/Psychoanalysis

Psychological film theory began with the publication of Münsterberg’s Photoplay, in which the first part is inspired by experimental psychology. In remainder of the twentieth century, more philosophical approaches would use psychoanalysis for the decoding of unconscious elements that were supposed to contain truth by reading films through schemes of symbolization and representation. These approaches are current in the continental tradition.

i. Rudolf Arnheim

Rudolf Arnheim (1904-2007) published Film als Kunst (Film as Art) in 1932, defending film as a form of art distinct from the productions of the entertainment industry. Historically, Arnheim’s main writings on film can be situated between Münsterberg and Balász. Later Arnheim founded a full-fledged aesthetics on the theory of perception and gestalt theory (especially in Art and Visual Perception, 1954). Comments on film are sparse in Arnheim’s later work, but the following quotation, which links film interpretation to Freudian psychology, is remarkable because it addresses a possible extension of the Freudian vision of dream into Arnheim’s own domain. In Visual Thinking (1969), Arnheim pointed to a fundamental link between film and dream that psychology should follow: “Freud raises the question of how the important logical links of reasoning can be repre­sented in images. An analogous problem, he says, exists for the visual arts. There are indeed parallels between dream images and those created in art on the one hand, and the mental images serving as the vehicle of thought on the other; but by noting the resemblance one also becomes aware of the differences, and these can help to characterize thought imagery more precisely” (p. 241).

1. Psychoanalysis vs. Cognitive Science

Reflections on the relationship between the spectator and the film have become particularly central in French film theory since the publication of Christian Metz’s Le signifiant imaginaire: psychoanalyse et cinéma (1977) and Metz’s Lacanian developments as well as the writings of Jean Louis Baudry. For psychoanalysis, the proximity between film and dream is essential because film interpretation is seen as a sort of Freudian dream interpretation. Almost half a century after Arnheim’s above remark, psychoanalytical film theory is still struggling to explain the ways in which unconscious elements obstruct the reception of films and to understand how films spark unconscious or irrational processes. This approach can be understood as diametrically opposed to that of cognitive science which is also interested in how spectators make sense of and respond to films, but which observes the viewer’s conscious processing of films.

ii. The Mindscreen Theory

Bruce F. Kawin began his book Mindscreen: Bergman and First-Person Film (1978) with the question: “Film is a dream—but whose?” (p. 3). Kawin claimed that dreamlike films are self-conscious artworks in which narrative voices have been “generalized” up to a point that we perceive the artworks themselves as if they appear on a “dream screen.” Kawin adopted the idea of the dream screen from the American Psycholo­gist Bertram D. Lewin (1950) whose ideas had, in their time, contributed in an outstanding way to an increasing interest in dreams in post-Freudian America. In an article entitled “Interferences from the Dream Screen,” Lewin declared: “In a previous communication, a special structure, the dream screen was distin­guished from the rest of the dream and defined as the blank background upon which the dream picture appears to be projected. The term was suggested by the action pictures because, like the analogue in the cinema, the dream screen is either not noted by the dreaming spectator, or it is ignored due to the interest in the pictures and actions that appear on it’’ (p. 104).

Kawin’s contribution to film theory consists of examining the validity of the dream screen (or mindscreen) in cinema providing clues to particularities of the narrative structure of film. In Bergman’s Persona, for example, there is “an impression of the mindscreen’s being generalized,” so that the film’s self-consciousness appears to originate from within. Being identified neither with a specific character nor with the filmmaker, the “potential linguistic focus” takes on the characteristics of a mindscreen: “The film becomes first-person, it speaks itself” (Kawin, pp. 113-14). Kawin anticipates Frampton’s ‘filmind’.

iii. Slavoj Žižek

Given his use of psychoanalysis and Marxism, the extremely influential Slovenian philosopher Slavoj Žižek (1949-) is a typical representative of continental film theory, the more so since he has explicitly opposed Anglo-American, empirical approaches (see “Introduction to 2001”). Žižek uses Lacanian concepts and insights (most importantly his notion of the Other) and works with concepts derived from the philosophies of Derrida and Deleuze, such as a decentralized notion of the subject. The value of his philosophy of film, developed in numerous books, seems to reside mainly in his consistent and original application of abstract theories to films though he has also submitted various concepts to original developments. His writings do not represent philosophy of film as such; rather they are attempts to read ideology by analyzing contemporary popular films.

f. Walter Benjamin and the Frankfurt School

Like Siegfried Kracauer, Walter Benjamin (1892-1940) belongs to the Frankfurt School of philosophy. Benjamin believed that the techniques of reproduction can be used for the reproduction of traditional works only, and also as new modes of representation. Benjamin’s theory of film is sometimes associated with that of cinematographic realism. Most famously, in his essay Das Kunstwerk im Zeitalter seiner technischen Reproduzierbarkeit [The Work of Art in the Age of Mechanical Reproduction] (1935), Benjamin interpreted modifications of the original through the process of reproduction, as a loss of the work’s aura.

Benjamin favored an allegorical kind of photography (that is definitely opposed to the art of montage), examples of which he found in early photography because it can be perceived something like an aura: “In the fleeting expression of a human face seen in early photographs, the aura shows itself for the last time. This is what makes their melan­cholic and thus incomparable beauty” (I, 2, p. 445). Like Epstein, Benjamin believed that cinema can provide us with extraordinary access to reality: the details that are produced by the close-up or the forms that are revealed by slow motion can “give us access to the experience of an optical unconscious just like psychoanalysis offers us the experience of the instinctive unconscious” (p. 460). In this sense, knowledge about the world is a matter of a sudden “awakening” to the world and this world can be a cinematic world.

Theodor W. Adorno (1903–1969), another member of the Frankfurt School, would transform Benjamin’s notion of “mass culture” into that of the “Kulturindustrie,” in which he saw as a systematized commodification encompassing ‘high’ and ‘mass’ art. This stands in contrast to Benjamin’s view of popular or mass culture as concealing a certain emancipatory potential to be unleashed by dialectical criticism. In their book Dialectic of Enlightenment (1972), Adorno and Max Horkheimer (1895–1973), developed a cultural critique that contains some dispersed but pertinent references to the culture industry of film excelling in standardized production techniques and catering for commercial and not for cultural purposes.

g. Hermeneutic Film Analysis

The hermeneutic approach is based on the interpretation of texts and detects the semantic potential of a text as well as its different levels of meanings. Hermeneutics can be seen as a branch of phenomenology and many of Ayfre’s and Agel’s texts bear traces of hermeneutic thinking. Still, in film analysis its method has mainly been developed in proximity with the interpretation of literature. Though hermeneutics of film explains the semiotic status of iconic signs, it never engages in the construction of semiotic systems because it is constantly aware of the historicity of any analysis. Today, hermeneutic film analysis is mainly practiced in German academic departments of Media and Communication Science. Moreover, such analysis has been brought to a point by Anke-Marie Lohmeier.

h. Future Perspectives

The future of continental philosophy of film will probably be developed in several areas. Žižek’s Lacanian approach, as long as it does not entirely enter the stream of cultural studies, will attract philosophers who deem that ideology should be the primary focus of cinema studies. Another area that might develop is the one pioneered by Bruce Kawin and Daniel Frampton, who provide a vocabulary for describing our aesthetic experience of film that transcends both Deleuze’s conceptual analyses of movement and time and phenomenological approaches. It clearly opens up philosophy to new ways of thinking. Equally promising is a whole range of “crossovers,” such as those that combine semiotic and cognitive paradigms of which Warren Buckland’s The Cognitive Semiotics of Film (2000) is an example. Other crossovers synthesize Deleuzian film theory and phenomenology (Shaw 2008), or the Cavellian-Wittgensteinian approach with continental philosophy (of which Stephen Mulhall is an example).

3. References and Further Reading

  • Agel, Henri. Esthétique du cinéma. Paris: Presses Universitaires de France, 1957.
  • Aitken, Ian. European Film Theory and Cinema: A Critical Introduction. Bloomington: Indiana University Press, 2001.
  • Allen, Richard and Murray Smith. Film Theory and Philosophy. Oxford: Oxford University Press, 1997.
  • Andrew, Dudley J. Major Film Theories: An Introduction. Oxford: Oxford University Press, 1976.
  • Arnheim, Rudolf. Visual Thinking. Berkeley: University of California Press, 1969.
  • Astruc, Alexandre. “The Birth of a New Avant-Garde: La Caméra-Stylo,” in Graham, Peter, ed., The New Wave. London: Secker and Warburg, 1968.
  • Aumont, Jacques. A quoi pensent les films? Paris: Séguier, 1996.
  • Ayfre, Amédé. Conversion aux images? Paris: Cerf, 1964.
  • Ayfre, Amédé. Le Cinéma et sa vérité. Paris: Cerf, 1969.
  • Bakhtin, Mikhail. The Dialogic Imagina­tion. Austin: University of Texas Press, 1981.
  • Balász, Béla. Theory of the Film. New York: Arno Press, 1972.
  • Balázs, Béla. Early Film Theory: Visible Man and The Spirit of Film (trans. Rodney Livingstone, ed. by Erica Carter). New York: Berghahn, 2010.
  • Barthes, Roland. Image-Music-Text. New York: Hill and Wang, 1977.
  • Bazin, André. Qu’est-ce que le cinéma? Vol. I-IV. Paris: Cerf, 1985 [1958].
  • Beasley-Murray, Jon. “Whatever Happened to Neorealism? Bazin, Deleuze, and Tarkovsky’s Long Take,” in D.N. Rodowick, ed., Gilles Deleuze, philosophe du cinéma/Gilles Deleuze, philosopher of cinema, special issue of Iris 23 (Spring 1997), 37-52
  • Benjamin, Walter. Das Kunstwerk im Zeitalter seiner technischen Reproduzierbarkeit [1935] in Gesammelte Werke 1:2, ed. by R. Tiedemann and H. Schweppenhäuser. Frankfurt: Suhrkamp, 1980., p. 431-469. Engl.: The Work of Art in the Age of its Technological Reproducibility, and other Writings on Media. Boston: Belknap Press of Harvard University Press, 2008.
  • Bergson, Henri. L’Evolution créatrice Paris: Alcan,1991 [1907].
  • Bergson, Henri. Matière et memoire. Paris: Presses universitaires françaises, 1939.
  • Bordwell, David and Noël Carroll, (ed.). Post-Theory: Reconstructing Film Studies. Madison: University of Wisconsin Press, 1996.
  • Bordwell, David. Narration in the Fiction Film. London: Methuen, 1985.
  • Branigan, Edward. Projecting a Camera: Language-Games in Film Theory. New York: Routledge, 2006.
  • Buckland, Warren. The Cognitive Semiotics of Film. Cambridge University Press, 2000.
  • Carroll, Noël. “Film/Mind Analogies: The Case of Hugo Munsterberg“ in Journal of Aesthetics and Art Criticism 46:4, 188, pp. 489-499.
  • Carroll, Noël. Mystifying Movies: Fads and Fallacies in Contemporary Film Theory. New York: Columbia University Press, 1988.
  • Carroll, Noel. Philosophical Problems of Classical Film Theory. Princeton: Princeton University Press, 1988.
  • Carroll, Noël. Philosophical Problems of Classical Film Theory. Princeton University Press.
  • Casetti, Francesco. Theories of Cinema 1945-1995. Austin: Texas University Press, 1999.
  • Cohen-Séat, Gilbert. Essai sur les principes d’une philosophie de cinéma. Paris: Presses Universitaires de France, 1958.
  • Curran, Angela & Thomas Wartenberg (eds.). The Philosophy of Film. London: Blackwell, 2005.
  • Davies, David. The Thin Red Line. London: Routledge, 2009.
  • Deleuze, Gilles & Felix Guattari. Qu’est-ce que la philosophie? Paris: Minuit, 1991. Engl.: What Is Philosophy? (trans. Tomlinson and G. Burchell) Columbia University Press, 1994.
  • Deleuze, Gilles. Cours de G. Deleuze du 31/01/84, voir www.webdeleuze.com
  • Deleuze, Gilles. L’Image-mouvement. Cinéma 1. Paris: Minuit Paris, 1983.
  • Deleuze, Gilles. L’Image-temps. Cinéma 2. Paris: Minuit, 1985. Engl. transl. of both volumes: Cinema (trans. H. Tomlinson and B. Habberjam). Minneapolis : University of Minnesota Press, 1986-1989.
  • Della Volpe, Galvano. Il verosimile filmico e altri scritti di estetica. Rome: Filmcritica, 1954.
  • Eagle, Herbert. Russian Formalist Film Theory. Ann Arbor: Michigan Slavic Publica­tions, 1981.
  • Eco, Umberto. “Sulle articolazioni del codice cinematogra­fico” [1967] in Per una nuova critica. Venezia: Marsilio Editori, 1989, p.389-396.
  • Eisenstein, Sergei. 1988c. Selected Works 1: Writings 1922-34. London: BFI.
  • Eisenstein, Sergei. On Disney (trans. A. Upchurch). New York: Methuen, 1988b.
  • Eisenstein, Sergei. Writings. Bloomington and Indianapolis: Indiana University Press, 1988a.
  • Epstein, Jean. L’Intelligence d’une machine. Paris: Jacques Melot, 1946.
  • Frampton, Daniel. Filmosophy. London, New York: Walflower, 2006.
  • Gaut, Berys. A Philosophy of Cinematic Art. Cambridge University Press, 2010.
  • Goodnow, Katherine J. Kristeva in Focus: From Theory to Film Analysis. Fertility, Reproduction & Sexuality. New York: Berghahn, 2010.
  • Kawin, Bruce F. Mindscreen: Bergman, Godard and First-Person Film. Princeton: Princeton University Press, 1975.
  • Kracauer, Siegfried. Theorie des Films. Frankfurt: Suhrkamp, 1973. Engl.: Theory of Film: The Redemption of Physical Reality. Oxford: Oxford University Press, 1960.
  • Langer, Susanne. “A Note on Film” in R. D. Maccann (ed.), Film: A Montage of Theories. New York: Dutton, 1966.
  • Lemon, L. and Reis, M. Russian Formalist Criticism: Four Essays. Lincoln: University of Nebraska Press, 1965.
  • Livingston, Paisley. Cinema, Philosophy, Bergman: On Film as Philosophy. Oxford University Press, 2009.
  • Loewy, Hanno. Béla Balázs – Märchen, Ritual und Film. Berlin: Vorwerk 8, 2003.
  • Lohmeier, Anke-Marie. Hermeneutische Theorie des Films. Tübingen: Niemeyer, 1996.
  • Lotman, Yuri. Semiotika kino i problemy kinoestetiki. Tallinn: Eesti Raamat 1973. Engl.: Semiotics of Cinema (Michigan Slavic Contributions 5.) Ann Arbor: University of Michigan Press, 1976.
  • Martin, Marcel. “Ingmar Bergman’s Persona” in Cinema, 119, 1967.
  • Merleau-Ponty, Maurice. Sens et non-sens. Paris: Nagel, 1948.
  • Metz, Christian. “Le signifiant imaginaire” in Communications 23, pp. 3-55, 1975.
  • Metz, Christian. “Fiction Film and its Spectator: A Metaphysical Study” in New Literary History 8:1, 1976.
  • Metz, Christian. Essais sur la Signification au Cinéma, Vol. I. Paris: Klincksieck, 1968.
  • Metz, Christian. Psychoanalysis and Cinema: The Imaginary Signifier. London: Macmillian, 1983.
  • Mitry, Jean. Esthétique et psychologie du cinéma, 2 Vol. Paris: Éditions Universitaires, 1963. Engl.: The Aesthetics and Psychology of the Cinema. Bloomington: Indiana University Press, 2000.
  • Mitry, Jean. La Sémiologie en Question. Paris : Cerf, 1987.
  • Mulhall, Stephen. On Film: Thinking in Action. London: Routledge, 2002.
  • Peirce, C. S. Peirce on Signs: Writings on Semiotic (ed. James Hoopes). Chapel Hill: University of North Carolina, 1994.
  • Polan, Dana B. “Roland Barthes and the Moving Image” in October 18, 1981, 41-46.
  • Pudovkin, Vsevolod. Film Technique and Film Acting. New York: Bonanza Books, 1949.
  • Rancière, Jacques, La Fable cinématographique. Paris: Le Seuil, 2001.
  • Rancière, Jacques, Le Destin des images. Paris: La Fabrique, 2003. Engl.: The Future of the Image. London: Verso, 2007.
  • Rancière, Jacques, Le Spectateur émancipé. Paris: La Fabrique, 2008. Engl.: The Emancipated Spectator. London: Verso, 2010.
  • Read, Rupert and Jerry Goodenough (eds.). Film as Philosophy: Essays on Cinema after Wittgenstein and Cavell. New York: Palgrave Macmillan, 2005.
  • Rushton, Richard & Gary Bettinson. 2010. What is Film Theory? London: Open University Press.
  • Russell, Bruce. “The Philosophical Limits of Film” in Film and Philosophy, Special Edition, 2000, pp. 163-167.
  • Shaw, Daniel. Film and Philosophy: Taking Movies Seriously. London: Wallflower Press, 2008.
  • Shaw, Spencer. Film Consciousness: From Phenomenology to Deleuze. Jefferson, NC: McFarland, 2008.
  • Smith, Murray & Thomas Wartenberg (eds.). Thinking through Cinema: Film as Philosophy. Malden, MA: Blackwell, 2006.
  • Smuts, Aaron. “Film as Philosophy: In Defense of a Bold Thesis” in Journal of Aesthetics and Art Criticism 67:3, 2009, pp. 409-420.
  • Sobchack, Vivian. The Address of the Eye: A Phenomenology of Film Experience. Princeton University Press, 1992.
  • Stam, Robert. Film Theory: An Introduction. Oxford: Blackwell, 1999.
  • Todorov, Tzvetan. Poétique. Paris: Seuil, 1973. Engl.: Introduction to Poetics. Minneapolis: University of Minnesota Press, 1981.
  • Turvey, Malcolm. Doubting vision: Film and the Revelationist Tradition. Oxford: Oxford University Press, 2008.
  • Vice, Sue. Introducing Bakhtin. Manchester: Manchester University Press, 1997.
  • Wartenberg, Thomas E. Thinking on Screen: Film as Philosophy. London: Routledge, 2007.
  • Žižek, Slavoj. Everything You Always Wanted to Know about Lacan (But were Afraid to ask Hitchcock). London: Verso, 1992.
  • Žižek, Slavoj. Gaze and Voice as Love Objects. Durham: Duke University Press, 1996.
  • Žižek, Slavoj. The Fright of Real Tears: Krszystof Kieślowski Between Theory and Post-Theory. London: British Film Institute, 2001.

Author Information

Thorsten Botz-Bornstein
Email: thorstenbotz@hotmail.com
Gulf University
Kuwait

Josiah Royce (1855—1916): Overview

Josiah Royce was one of the most influential philosophers of the period of classical American philosophy, the late nineteenth century through the early twentieth century. Although often identified as an exponent of Absolute Idealism, which is a philosophical view explored by Royce particularly in his Gifford Lectures, Royce was a thinker of widely diverse interests and talents. He made major contributions to psychology including a textbook in the field and was, in fact, elected President of the American Psychological Association. He wrote a history of California, considered today a forerunner of contemporary historiography in its attention to the role of women and minority groups. He explored social ethics, developing many ideas on the social grounding of the self, some of which were later expanded upon by George Herbert Mead. Furthermore, Royce wrote on pressing social and political problems of the day including race relations. W.E.B. Dubois was one of his students. He wrote literary criticism and can be considered the founder of the Harvard school of logic, Boolean algebra, and the foundation of mathematics. Among his students at Harvard were Clarence Irving Lewis who went on to pioneer modal logic, Edward V. Huntington, the first to axiomatize Boolean algebra, and Henry M. Sheffer, known for the eponymous Sheffer stroke. Royce has recently been cited as a proto-cybernetic thinker; another of his students was Norbert Wiener, the father of cybernetics. Royce also wrote on issues in the philosophy of science.

In addition to these many philosophical achievements, Royce made major contributions to the Philosophy of Religion, writing on the problem of evil, and Christian community and presenting a phenomenological study of the religious experience of ordinary people. Royce also wrote throughout his career on ethical theory and on the conditions for creating both genuine and supportive communities as well as creative, unique, ethical individuality. At the end of his life he turned to the development of a world community through a process of mediation and interpretation.

Table of Contents

  1. Life
  2. Thought and Works
    1. Royce’s Philosophy of Community
    2. Royce’s Ethics: The Philosophy of Loyalty
    3. Religion
    4. The Problem of Evil
    5. Logic
  3. References and Further Reading
    1. Primary Sources
    2. Published Editions
    3. Secondary Sources

1. Life

Josiah Royce was a Californian by birth, born on 20 November, 1855, in Grass Valley, the son of Josiah (1812-1888) and Sarah Eleanor (Bayliss) Royce (1819-1891), whose families were recent emigrants from England, and who sought their fortune in moving west in 1849. His pioneer mother Sarah was a central figure in forging a new social and political community in Grass Valley. She was the center of much musical activity with her melodeon, the first brought to California. She also helped found a church and served as a teacher of the young, including young Josiah. Under his mother’s tutelage, Josiah developed his love of literature, reading Milton and other literary works; made his acquaintance with the Bible and religious experience; was given an introduction to music and its beauty; and experienced the joys of a warm, loving community, his family, and particularly his mother and sisters. Young Josiah began his literary career with a delightful story of the travels of Pussy Blackie, a “Huckleberry Finn cat,” who runs away from home; gets bitten by a dog; is captured by an eagle; travels on a railroad car; lives in the house of a rich family; finds a cat companion with whom Pussy exchanges stories; discusses social issues such as the contrast between the rich and the poor, as well as the treatment of the less fortunate and moral questions such as honesty, shame, killing, and war. In 1866, the Royce family moved to San Francisco where Royce first attended the Lincoln School. Royce also attended San Francisco Boys’ High School where he had as a classmate the (later famous) physicist, Albert Michelson. Continuing his pioneer trek, Royce entered, at age fourteen, an infant University of California, later becoming one of its first graduates, thus participating in the beginnings of higher education in the state. After receiving his degree in Classics in 1875, Royce traveled to Germany to study philosophy for one year. He mastered the language while attending lectures in Heidelberg, Leipzig, and Göttingen. On his return, he entered the Johns Hopkins University in Baltimore and in 1878, was awarded one of its first four doctorates.

Josiah Royce returned to the University of California, Berkley from 1878-1882. He taught composition, and literature, published numerous philosophical articles and his Primer of Logical Analysis. He married Katherine Head in 1880; the couple had three sons (Christopher, 1882; Edward, 1886; Stephen, 1889) and remained married until Josiah’s death. Royce felt isolated in California, so far from the intellectual life of the East Coast, and he sought an academic post elsewhere. William James, Royce’s friend and philosophical antagonist, secured the opportunity for Royce to replace James at Harvard when James took a one-year sabbatical. In 1882, Royce accepted the position at half of James’ salary and brought his wife and newborn son across the continent to Cambridge, Massachusetts.

At Harvard, Royce began to develop his interest in a number of different areas. In 1885, he published his first major philosophical work, The Religious Aspect of Philosophy, proposing the idea that in order for our ordinary concepts of truth and error to have meaning, there must be an actual infinite mind, an “Absolute Knower,” who encompasses the totality of actual truths and possible errors. That same year, Royce received a permanent appointment as assistant professor at Harvard, where he continued to teach for thirty years. Among his students were such notables as T.S. Eliot, George Santayana, W.E.B. Dubois, Norbert Wiener, and C.I. Lewis, the logician.

Royce was a prolific writer as well as much demanded on the public lecture circuit. In 1886, he published his History of California; he followed this with a published novel in 1887. In 1892, Royce was appointed Professor of the History of Philosophy at Harvard, and he served as Chair of the Department of Philosophy from 1894-1898. During these years Royce established himself as a leading figure in American academic philosophy with his many reviews, lectures and books, including The Spirit of Modern Philosophy, (1892), The Conception of God (1897), and Studies in Good and Evil (1898).

In 1899, and 1900, Royce delivered the prestigious Gifford Lectures at the University of Aberdeen, taking this opportunity to consolidate his thoughts on metaphysics. The result was his two-volume opus, The World and the Individual (1899-1901). This point appeared to be the culmination of Royce’s work. His public reputation was at a high and at 45 years old, Royce was elected President of the American Psychological Association in 1902, and President of the American Philosophical Association in 1903. Reviewers of The World and the Individual praised Royce’s philosophical acumen but there were serious objections to his conclusions. Sparked by these and the criticisms of his logic by Charles Sanders Peirce, Royce began reconsideration of his arguments as well as extensive study in mathematical logic. He also returned to his earlier concerns with philosophy as a means to understand human life, the nature of human society, of religious experience, ethical action, suffering and the problem of evil. His major work on ethics, The Philosophy of Loyalty appeared in 1908, but his concern with ethical issues and the “art” of loyalty continued until his death. He published a collection of essays, Race Questions, Provincialisms, and Other American Problems in 1908 and another, William James and Other Essays on the Philosophy of Life, appeared in 1911.

In 1912, while recovering from a stroke, Royce published The Sources of Religious Insight, in which he sought an explanation for the phenomena of ordinary religious faith as experienced by ordinary religious communities and individuals. He considered this a correction to James’ The Varieties of Religious Experience (The Gifford Lectures, 1901-1902), which, in Royce’s judgment, put too much emphasis on extraordinary and individualistic religious experiences. Drawing on the semiotic of Peirce and other sources, in 1913, Royce published his opus on religious community, The Problem of Christianity, a work which Yale philosopher, John E. Smith identified as “one of the finest works in the philosophy of religion ever to appear on the American scene” (Smith 1982 &1992, 122). In place of the earlier “Absolute Knower,” Royce presents the concept of an infinite community of interpretation, guided by a shared spirit of truth-seeking and community building. The Universal Community constitutes reality, and its understanding increases over time, through its members’ continual development of meaning. Royce’s concern for building a universal or “Beloved Community” is exemplified in two later works focused on building peace and a world community: War and Insurance (1914) and The Hope of the Great Community (1916). In the 1914 book, Royce made a daring political and economic proposal to use the economic power of insurance to mediate hostilities among nations, and reduce the attraction of war in the future.

Royce died September 14, 1916, before he could develop fully his new philosophical insights. Unfortunately, Royce’s works were ignored until recently and now are being revisited by theologians and philosophers interested in not only metaphysics, but also practical and theoretical ethics, philosophy of religion, philosophy of self and of community.

2. Thought and Works

Royce’s major works include The Religious Aspect of Philosophy (1885), The World and the Individual (1899-1901), The Spirit of Modern Philosophy (1892), The Philosophy of Loyalty (1908), and The Problem of Christianity (1913). In his early works, Royce presented a novel defense of idealism, the “argument from error,” and arrived at the concept of an actual infinite mind, an “Absolute Knower,” that encompasses all truths and possible errors. The correspondence theory of truth requires that if an idea or judgment is true it must correctly represent its object; when an idea does not correctly represent its object, it is an error. Royce begins with the fact that the human mind often makes such errors and yet, argued Royce, the mind “intends” or “points to” the idea’s true object. Thus, the occurrence of these errors indicates that the true object of any idea must exist, in a fully determinate state, in some actual infinite mind; this actual infinite “Mind” is the “Absolute Knower.” Royce continued to pursue this move toward Idealism in The World and the Individual. Here we find Royce agreeing with Kantian critical rationalism that a true idea is one that may be fulfilled or validated by a possible experience, but he argued that such a possibility of experience required the existence of an actual being, whose nature was to be the true object of the experience. In this work, Royce engages in criticism of three major views of metaphysics, Realism, Mysticism, and Kant’s Critical Rationalism. He finds valuable points in all three views, but also critical weaknesses. He advocates a “Fourth Conception of Being,” a view of the totality of Being as an actual Infinite, timeless, and encompassing all valid past, present, and future possible experience of fact including the facts of all finite beings.

Royce continued to develop his thoughts spurred by various criticisms and particularly the friendly but longstanding dispute with William James known as “The Battle of the Absolute.” In addition, he struggled with the problem of community and the relations of the individual and community both philosophically and practically, the latter concern evidenced in his writings on the history of California, the land disputes in California, various public problems and his deep concern for a solid theory of ethics. In answer to Charles Sanders Peirce, he turned to a deep exploration of logic, mathematics, and semiotics. In his later works, Royce dispenses with the “Absolute Mind/Knower” of previous works and develops a metaphysical view which characterizes reality as a universe of ideas or signs which occur in the process of being interpreted by an ongoing, infinite community of minds. These minds and the community they constitute also become signs for further interpretation. In The Problem of Christianity he sees representation no longer as a static, one-time experience, but as having creative, synthetic and selective aspects and knowledge is now more than accurate and completed perception of an object, or complete conception of an idea, it is a process of interpretation. This new metaphysical view is reflected in Royce’s work on ethics, philosophy of religion, philosophy of community, social philosophy and logic.

a. Royce’s Philosophy of Community

Royce was concerned about the impact of an extreme individualism and particularly with the “heroic individualism” associated with Walt Whitman, Ralph Waldo Emerson, and William James. Though inspiring as ethical visions, Royce believed these views eventually proved unsatisfactory for ethical life. While affirming the “individual” as a “most fundamental phenomenon,” he sought to address the problem of forming satisfactory communities and dealing with competing forms of community. In his later metaphysics Royce affirms the full-fledged reality of both individuals and community. In the practical social and political realm, Royce tackled the dual issues of “enlightened/genuine individualism,” and “true/genuine community.” He recognized that the task of building authentic individuality and encouraging the growth of fulfilling, moral communities were inextricably bound together—worthwhile individuality and community arising out of their mutual interaction in a creative, ongoing, infinite process.

Thus, Royce develops the following claims: (1) individuals are inextricably rooted in the social context and true individuality is forged out of that context. An individual is both self-made and a social product and the worthiness of the end result, the individual self, is the responsibility of both individual and community; (2) community is a social product, but true community is created by the hard work of free, self-conscious, self-committed, self-creative moral individuals; (3) the task of the individual is both to fashion a “beautiful life” and to build a “beautiful” community, while the obligation of community is to build a harmony of wills while also fostering the development of true individuals; (4) individuals are finite, sinful and fallible and need to extend self to develop morality and overcome error. Furthermore, individuals crave harmony and relationship. Individuals keep communities alive, moral, and sane by keeping them from stagnating into inveterate habit, moving toward exclusivity and intolerance, or degenerating into mob madness. In other words, stress must be equally on the individual and the community. For Royce, “individuals without community are without substance, while communities without individuals are blind.”

In light of the strong individualistic emphasis in philosophy today, it is striking that for Royce communities are logically prior to individuals. For Royce, there is no personal identity unless there are communities of persons that provide causes and social roles for individuals to embrace. Royce declares: “My life means nothing, either theoretically or practically, unless I am a member of a community” (Royce 2001 [1913], 357). And community is more than any association or collection of individuals; communities can only exist, in Royce’s view, where individual members are in communication with one another and there exists to some relevant way a congruence of feeling, thought and will among them. A community is for Royce, as is the human self, a temporal being. He speaks of a community of memory and ac community of hope. Each of the community’s members accepts, as part of his/her own individual life and self, some past events and the same expected future events. And, like human selves, communities are “plans of action” engaged in purposive activity in the world and constituting a will and a spirit and, for Royce, also, like human selves, morally responsible for its communal actions.

b. Royce’s Ethics: The Philosophy of Loyalty

At the center of Royce’s ethics is his contention that to lead a morally significant life, one’s actions must express a self-consciously asserted will; the self is a plan of action, a plan of life created by an individual out of the chaos of many conflicting personal desires and impulses. Such a plan is forged when one finds a cause, or causes which require a program or programs of implementation that extend through time and requires the contributions of many individuals for their advancement and fulfillment. When an individual finds a cause judged worthwhile, his/her will becomes focused and defined in terms of that cause and furthermore, the individual becomes allied with a community of others who are also committed to that same cause. Royce calls this commitment “loyalty” and thus the moral life of an individual is understood in terms of the multiple loyalties that a person embraces. “There is only one way to be an ethical individual. That is to choose your cause, and then to serve it” (Royce 1995 [l908], 47).

According to Royce’s careful definition of loyalty, “genuine” loyalty is intended to rule out loyalty to morally evil causes and communities that serve them. Royce fully recognized that many of the worst deed in human history have involved a high degree of loyalty, but, argued Royce, these loyalties were directed exclusively to a particular group and expressed in the destruction of the conditions for others’ loyal actions, Royce describes the difference between true loyalty and vicious or “predatory” loyalty as follows:

A cause is good, not only for me, but for mankind, in so far, as it is essentially a “loyalty to loyalty,” that is, an aid, and a furtherance of loyalty in my fellows. It is an evil cause in so far as, despite the loyalty that it arouses in me, it is destructive of loyalty in the world of my fellows (Royce 1995 {1908}, 56).

Communities defined by true loyalty, or adherence to a cause that exemplifies the universal ideal of “loyalty to loyalty” are referred to by Royce as “genuine communities.” Again, the degenerate communities are those that tend toward the destruction of other’s causes and possibilities of loyalty. While every community works for the accomplishment of its central cause, Royce places significant emphasis on the idea of loyalty to a “lost cause.” A lost cause is not a hopeless cause but rather one that cannot be fulfilled within the actual lifetime of any of its members. These causes are those that are “lost” simply in virtue of their scope and magnitude and these are precisely the causes that establish ideals capable of evoking our highest hope and moral commitment. “Lost causes” are indispensable, in Royce’s view, as the source of absolute norms for any community and its members.

For Royce, chief among such causes is the full attainment of truth, the complete determination of the nature of reality through inquiry and interpretation. Thus, the formula of “loyalty to loyalty” demands that one’s moral and intellectual sphere become ever broader and remain critical at all levels. In this connection, Royce reframed two central ideas or principles of pragmatism. Following James in his essay, “The Will to Believe,” Royce agreed that any philosophical view is essentially an expression of individual volition. We must first decide how we are to approach the world and then develop our philosophical theories accordingly. For Royce, the essential attitude of will that one must adopt toward the world is “loyalty to the ideal of an ultimate truth.” Secondly, Royce adopted the pragmatist view of truth, that is, truth is the property possessed by those ideas that succeed in the long run. And, following Peirce, Royce argues that to define truth using any conception of “the long run, short of the ideal end of inquiry, is self-refuting.” Having adopted these pragmatic principles, Royce referred to his own position as “Absolute Pragmatism.”

Returning to the principles of “loyalty to loyalty,” it becomes clear that, for Royce, all communities we actually know, inhabit, or identify with, are finite and to varying degrees “predatory.” Roycean loyalty requires one to scrutinize the aims and actions of our communities and those of others and to work to reform the disloyal aspects. The philosophy of loyalty calls upon us to create and embrace more cosmopolitan and inclusive communities. However, any human community, however, inclusive and committed to loyalty to loyalty, will fall short of perfect loyalty. Each community must constantly engage in critical scrutiny and actions of reform. For Royce, there is no expectation that the high ideals of perfect loyalty, truth, and reality will ever be fully realized. The process of building community is an ongoing, infinite process.

Finally, for Royce, beyond the actual communities that we directly encounter in life there is the ideal “Beloved Community” of all those who would be fully dedicated to the cause of loyalty, truth, and reality itself. This is the ultimate goal and cause. Furthermore, the sharing of individuals’ feeling, thoughts and wills in a community (including the Beloved Community) is never, for Royce, a mystical blurring or annihilation of personal identities. Individuals remain individuals, but in forming a community they build or attain a second order life that extends beyond any of their individuals lives. This life is a life coordinated through a cause and extended over time, a super-human personality at work, a community united by an “interpreting spirit.” Again, communities are more than their individual members and individuals are always unique and not lost in their communities; individuals and communities need each other.

c. Religion

Through the strong influence of his mother, Sarah Royce, and the early education she provided him and his sisters, Royce was well acquainted with the Christian Protestant world view and his writings exhibited a consistent familiarity with Scripture and with religious themes. Royce, in his own life did not embrace organized Christianity, and he was critical of many historical churches, believing that they had lost sight of the “spirit” of community that ought to guide them. He identified many “communities of grace,” genuine communities of loyalty, that were non-Christian or not self-consciously religious. Royce had a great respect for Buddhism and he even learned Sanskrit to study religious texts in this language. Although Royce maintained a strong interest in logic, science, evolutionary theory, and natural philosophy throughout his career, religious concerns did figure prominently in a number of his works beginning with his first major publication, The Religious Aspect of Philosophy as well as in his last two major works, The Sources of Religious Insight and The Problem of Christianity. The Sources was Royce’s response to the Gifford Lectures delivered by William James in 1901-1902 and published as The Varieties of Religious experience. James’ lectures were a popular and academic success and remain so even today. Royce, however, believed that James placed too much emphasis on the extraordinary religious experience of extraordinary individuals and thus failed to capture the religious experiences of the ordinary individual. Furthermore, Royce believed that religion was a central phenomenon of human experience that could not be ignored by philosophers. In The Problem of Christianity Royce works out his own religious thought and his full-blown theory of community. He maintains that the Christian model of the “loyal community,” properly understood, successfully combined the true spirit of universal interpretation with an appreciation of the “infinite worth” of the individual as a unique member of the ideal Beloved Community, the Kingdom of Heaven (Royce 2001[1913], 193). It is in this book that Royce also brings to fruition his thought and concern with the problem of evil.

Royce’s approach to religion was empirical, historical and phenomenological as well as concerned with philosophical reflection. Thus, in a 1909 essay, “What is Vital in Christianity?,” Royce approaches religion with a historic-empirical approach akin to modern anthropology. He views the religion as it has developed within human history. He writes: “any religion presents itself as a more or less connected group: (1) of religious practices, such as prayers, ceremonies, festivals, rituals and other observance, and (2) of religious ideas, the ideas taking the form of traditions, legends, and beliefs about the gods or spirits” (Royce 1911 [1909], 101). The term “vital,” says Royce, is metaphoric, such as the “vital” features of an organism. If these changed the organism would not necessarily be destroyed but would be an essentially different type of organism. Royce illustrates with the features, gill breathing” and “lung breathing.” But “vital,” also connotes “alive” for the persons who are followers of the religion and this usage is similar to Paul Tillich’s understanding of religious symbols as alive or dying. Furthermore, this term means also “primary” in practice. Thus, Royce says his first question is: “What is more vital about a religion: its religious practices, or its religious ideas, beliefs, and spiritual attitudes?” In this essay, in a humorous passage about pigeons in the Harvard Yard observing the habits of humans who feed them and concluding to the existence of a benevolent creator, Royce critiques the causal approach to God, represented in many of the proofs for God’s existence.

Royce again approaches religion through a historical-philosophical analysis in an encyclopedia article entitled “Monotheism.” After briefly reviewing monotheism as a doctrine in contrast to polytheism and pantheism, Royce makes the following claim: “…from the historical point of view, three different ways of viewing the divine being have been of great importance both for religious life and for philosophical doctrine” (Royce 1916, 818). The three ways to which Royce refers are three forms of monotheism, which were established in India, Greece, and Israel.

Royce then discusses, what in his view, are the essential features of monotheism as developed in these three cultural contexts. From Israel, we have “the ethical monotheism of the Prophets of Israel,” and “God,” in this religious context is defined as “the righteous Ruler of the world,” as the “Doer of justice,” or as the one “whose law is holy” or “who secures the triumph of the right”(Royce, 1916). Turning to Greece and to “Hellenistic monotheism,” God is defined as the “source, of the explanation, or the correlate, or the order, or the reasonableness of the world” (Royce, 1916). The third type of monotheism is labeled “Hindu pantheism.” Royce notes that this understanding had many different historical origins and appeared, in fact, as part of the Neo-Platonic philosophy and the philosophy of Spinoza. This type of monotheism insists not only upon the “sole” reality of God, but also asserts the “unreality of the world” (Royce, 1916).

In an interesting move, Royce argues that the “whole history of Christian monotheism depends upon an explicit effort to make a synthesis of the ethical monotheism of Israel and the Hellenic form of monotheism (Royce, 1916) This effort, however, says Royce, has proven especially difficult. The Hellenic tradition with its intellectualistic emphasis on the Logos was in favor of defining the unity of the divine being and the world as the essential feature of monotheism, whereas ethical monotheism dwells upon the contrast between the righteous Ruler and the sinful world, and between divine grace and fallen man. Royce, then, concludes:

Therefore, behind many of the conflicts between the so-called pantheism in Christian tradition and the doctrines of “divine transcendence” and “divine personality,” there has lain the conflict between intellectualism and voluntarism, between an interpretation of the world in terms of order and an interpretation of the world in terms of the conflict between good and evil, righteousness and unrighteousness (Royce, 1916, 819).

This history is made even more complex, says Royce, due to the influence of the Indic type of God. This concept influenced mysticism and, of course, Neo-platonic philosophy which, in turn, influenced Christian philosophy and theology. Augustine is a prime example of this influence. Furthermore, within Christianity, the mystics have often pointed to the failure to resolve the conflict with the moral and intellectual interests. Royce writes:

The mystics…have always held that the results of the intellect are negative and lead to no definite idea of God which can be defended against the skeptics, while…to follow the law of righteousness, whether or not with the aid of divine grace, does not lead, at least in the present life, to the highest type of knowledge of God (Royce, 1916).

Then, Royce, with his respect for the experiential in religion, writes: “Without this third type of monotheism, and without this negative criticism of the work of the intellect, and this direct appeal to immediate experience, Christian doctrine, in fact, would not have reached some of its most characteristic forms and expressions, and the philosophy of Christendom would have failed to put on record some of its most fascinating speculations” (Royce, 1916).

Royce then reviews the history of the so-called “proofs for God’s existence,” generally an expression of the Hellenistic influence in Christianity, and says that there is some basis in the claim that these efforts to grasp the divine nature via the intellect leads to results remote “from the vital experience upon which religious monotheism, and in particular, Christian monotheism must rest, if such…is permanently to retain the confidence of a man who is at once critical and religious (Royce, 1916).

Finally, it is not surprising that Royce argues that whatever answers to the questions about the nature of the world – is it real, rational, ethical? – that are developed must not put exclusive emphasis on any one characteristic. For, “as we have seen, the problem of monotheism requires a synthesis of all the three ideas of God,” thus any attempt to address these three questions must be an answer that does adequate justice to the three ideas and the three problems (Royce, 1916). Royce’s own efforts were aimed at achieving this synthesis and providing an answer to these three questions. Thus, he sought in his conceptions of God to give an account of the nature of reality that would satisfy the “moral insight,” “the theoretic insight,” and the “religious insight.” And the three Conceptions of Being addressed in The World and the Individual embody aspects of the three ideas of God and address the three questions.

Turning now to The Sources of Religious Insight, as indicated earlier, it is a parallel to The Varieties of Religious Experience by William James, yet it transcends that work in a number of ways. First, it explores religious experience common to much of mankind rather than the special, genius cases that are the subject of James’ work. Second, it studies both individual and communal religious experience whereas James’ work focuses on the experiences of extraordinary individuals. Third, The Sources is solidly based in the empirical and experiential. Royce writes: “The issue will be one regarding the facts of living experience” (Royce, 1912). James, in his work seeks to explain man’s religious need in terms of an experience that wells up from the subliminal self, from the soundless depths of our own subconscious. Royce, on the other hand, asserts that, “the principal religious motives are indeed perfectly natural human motives” (Royce, 1912,41-42). Man’s religious experience is, as a natural process, an incident in the history of his sensitive life, of his personal interpretation of the world, and of his more or less creative effort to fulfill his needs, and to respond in his own way to the universe. Religious experience, then, for Royce, is natural and profoundly personal and social – it is the experience of a human self in the history of his own life, set in the material, historical, social, and cultural context of that individual history and life.

Beginning then with human experience and needs, Royce seeks to get at the heart of “religious experience.” He states that the “central and essential postulate” of every religion is that “man needs to be saved” (Royce 2001[1912], 8-9). Salvation is necessary, for Royce, for two reasons: (1) There is some aim or end of human life which is more important than all other aims; and (2) Man, as he now is, or is naturally, is always in great danger of missing this highest aim and thus to render his whole life “a senseless failure” (Royce 2001 [1912], 12). This salvation must come in the form of guidance about understanding and achieving the highest aim of life so far as one is able. But given the limitations and fallibility of the human perspective, which hardly extends “beyond one’s nose,” Royce contends that this needed guidance must come from some super-human source. Religion, then, is the sphere of life in which finite human beings are able to get in touch with this source. It is interesting that in seeking to demonstrate the validity of his postulate he reviews the essentials of Buddhism, the writings of Plato in The Republic, and various literary sources. He believes that these examples show that the search for salvation belongs to no “one type of piety or of poetry or of philosophy” (Royce, 1912, 15).

The next major question is “what are the sources of insight?” Royce will consider seven sources, beginning with the human self, viewed first as uniquely individual and then as social. These are the basic and most elementary. However, Royce will find individual life and social life insufficient to meet the religious need for salvation. He then turns to five other sources. These are all dependent on the first two sources but will develop, strengthen, correct and transform in some way these. The five are: reason as a synthesizing power; the will or the volitional; loyalty; the religious mission of sorrow, and the unified community of the spirit.

In turning to the individual alone with his problem of salvation and with his efforts to know the divine that can save, Royce asserts that the individual can be in touch with a genuine source of insight, one of value, although also a source with limits. There are, says Royce, three objects which individual experience, as a source of religious insight, can reveal: The Ideal, the Need, and the Deliverer. The Ideal is, of course, the standard in terms of which the individual estimates the sense and value of his personal life. The Need for salvation is that degree to which he falls short of attaining his ideal and is sundered from this Ideal by evil fortune, his own paralysis of will, or by his inward baseness. The Deliverer is that presence, that power, that light, that truth, that great companion who helps the individual and saves him from his need.

We as humans are creatures of wavering and conflicting motives and, although we strongly desire a unity that makes life meaningful, we, on our own, cannot find this unity; we always miss our mark. Furthermore, we have glimpses of what would fulfill us, what would meet our need: a life “infinitely richer than our own.” As James, argues, we want to get in touch with something that will give us a “new dimension” to our lives. We want something more. Furthermore, for Royce, our need and desire are crucial. Royce writes: “Unless you have inwardly felt the need of salvation and learned to hunger and thirst after spiritual unity and self-possession, all the rest of religious insight is to you a sealed book” (Royce 1912, 33).

However, the individual alone cannot achieve what is needed. Thus we must turn to the social and to shared human life to attain a broader religious insight. Royce writes: “…no one who remains content with his merely individual experience of the presence of the divine and of his deliverer, has won the whole of any true insight. For as a fact, we are all members one of another; and I can have no insight into the way of my salvation unless I thereby learn of the way of salvation for all my brethren. And there is no unity of the spirit unless all men are privileged to enter it whenever they see it and know it and love it” (Royce, 1912, 34). Here is Royce’s emphasis on the need for social expansion of the self, as loyalty to loyalty demands expansion of the perspective of the community. Indeed, Royce argues that one of the principal sources of our need for salvation is our narrowness of view and especially of the meaning of our own purposes and motives. The social world, as Royce has constantly argued, broadens our outlook; an individual corrects his own narrowness by trying to share his fellow’s point of view. Social responsibilities can set limits to our fickleness; social discipline can keep us from indulging all our caprices; human companionship may steady our vision. The social world may bring us in touch with our public, great self, wherein we may find our “soul and its interests writ large” (Royce, 1912, 55). Social life is a source of religious insight and the insight it can bring is knowledge of “salvation through the fostering of human brotherhood” (Royce, 1912, 58).

Another avenue to religious insight, for Royce, is the experience of moral suffering, of the deep sense of guilt accompanied by the belief that one is an outcast from human sympathy and is hopelessly alone. He illustrates this with two literary examples: “The Ancient Mariner” and Raskolnikov in Crime and Punishment. The central conception in these two literary pieces is of salvation as reconciliation both with the social and the divine order, an escape from the wilderness of lonely guilt to the realm where men can understand one another.

This brings us directly to the “religious mission of sorrow.” Sorrows are defined by Royce as evils that we can assimilate. These become part of a constructive process, which involves growth rather than destruction, a passage to a new life. We takes these sorrows up into our plan of life, give them new meaning as they become part of a new whole. This was Royce’s emphasis in his essays on the problem of evil as well as in Religious Aspect and The Spirit. This will be discussed more in the section on the Problem of Evil.

The stage is now set for The Problem of Christianity. Christianity is viewed as a “philosophy of life,” and the question Royce asks is “In what sense, if any, can the modern man consistently be, in creed, a Christian?” (Royce, 2001 [1913], 62). In focusing on “creed,” however, Royce is clear that he is not concerned here with dogma or with particular theological beliefs. Rather he seeks the essentially “vital” and living ideas that will find expression in communal practices and religious-moral living and that will speak to all humanity. Royce will focus on living religious experience as was expressed in the early Pauline Christian communities. Royce’s focus is on the incarnation of the Spirit in the living Church, it is the Church, rather than the person of the founder that is the central idea of Christianity. For Royce, the essence of Christianity is its stress on the “saving community.” Royce writes: “The thesis of this book is that the essence of Christianity, as the Apostle Paul stated that essence, depends upon regarding the being which the early Christian Church believed itself to represent, and the being which I call…the ‘Beloved Community,’ as the true source, through loyalty, of the salvation of man” (Royce, 2001 [1913], 45).

It is indeed the “community” which allows us to understand more fully the teachings of the Master. Of Jesus’ teaching, Royce found two ideas especially crucial: his preaching of love and the “Kingdom of Heaven.” Both were mysterious and in need of interpretation. Thus, “love” is a mystery for although we know we are to love God and our neighbor, the question is how. How can I be practically useful in meeting my neighbor’s needs? Anyone who has tried to be benevolent or to meet the needs of others knows that there can be a huge crevice between your interpretation of what that person needs and their belief in that matter. It is the interpretation of Jesus’ teachings in the letters of Paul that make the difference for Royce. That which can make the loving of our neighbor less mysterious and difficult is community, for a community “when united by an active developing purpose is an entity more concrete, and, in fact, less mysterious than any individual man (Royce, 2001 [1913] 94). In community, we can come to know each other, to see what each other’s needs are. I need not ask “who is my neighbor?” for my neighbor and I are both members of one and the same community.

The essence of Christianity, for Royce, is contained in three ideas. The first of these is that the source and means of salvation is the community of believers. Community is also the basis of the ethic of love taught by Jesus. The other two essential ideas are: “the moral burden of the individual and “atonement.” These are addressed in the section on evil.

d. The Problem of Evil

Josiah Royce struggled with the problem of evil throughout his life, exploring it from various approaches, and asserting that it could neither be avoided nor dismissed by either the philosopher or the ordinary person. Thus, unlike some philosophers, he did not believe the problem of evil could be solved as a practical problem that only required improving social conditions. As a native Californian, a historian, and a social observer of the development of early California, Royce explored ways in which evil manifested itself in social relations among persons, in social bodies infected with racism, greed, and in a variety of harmful prejudices, in expressions of hate and in mob violence. Ultimately, Royce embraced a theistic process metaphysics that recognizes evil as a real force and suffering as an irreducible fact of experience. In his 1897 essay, “The Problem of Job,” Royce presents a fairly succinct overview of the problem of evil, various solutions and his own view on a possible solution. In the Job story, we have a traditional view of God as wise, omnipotent, all powerful, and all good and the situation of Job, namely, a universal situation of unearned ill-fortune, a seeming persecution of a righteous person and a reigning down of evils on a good man. For Royce, Job represents the fundamental psychological fact about the problem of evil, namely, the universal experience of unearned ill-fortune. This, asserts Royce, is the experience of every person, the kind of evil that all persons can see for themselves every day if they choose. This is the fundamental experiential and psychological fact that grounds Royce’s own answers to the problem of evil and also his dismissal of the various traditional answers to the problem of evil. Thus, for example there is the view that the purpose of the world is “soul making,” that pain teaches us the ways of the world and helps us develop our higher potentialities. Royce believes this answer is inadequate because it presupposes a greater evil, namely a world which allows evils as the only way to reach given goals. Such an answer Royce believes is unacceptable to a sufferer of evil and undeserved ills.

Another answer to the problem of evil is the infinite worth of agents with free will. Royce finds value in this view in that it acknowledges evil as a logically necessary part of a perfect moral order, but he believes this answer ultimately fails. A major problem is Job’s situation, namely, the innocent sufferer. Such unearned ills may be partly due to the free will that partly caused them, but, asserts Royce, the unearned ills are also due to God who declines to protect the innocent.

Royce believes that as long as one views God as an external power, as Job did, the problem of evil cannot be solved. Rather, one must recognize God as internally present to us and as suffering with us to produce the higher good. When we suffer, our sufferings are God’s sufferings and this is the case because without suffering, evil, and tragedy, God’s life could not be perfected. Furthermore, asserts Royce, personal overcoming evil is the essence of the moral life. Persons are instruments of God’s triumph. Thus, in The Sources of Religious Insight, Royce presents man as a destroyer of evil, a being who uses every effort to get rid of evil. Conquering evils and oppressions provide man’s greatest opportunities for loyalty and here is the source of religious insight and spiritual triumph. The encounter of human selves with the problem of evil is, for Royce, the most important moral aspect of the world. One must see the problematic situation into which human selves are immersed as part of the atoning process which tends toward an ultimate reconciliation of finite conflicts. Confronted with evils, one needs to trust within one’s limited view that the Spirit of the Universal Community reconciles.

Ultimately, Royce sees evil as an eternal part of both human and divine consciousness and the most important moral fact of the universe the human conquering of evil step by step. In addressing the doctrine of atonement in The Problem, Royce sets out in detail how the loyal community can best respond to human evil. The highest transgression in an ethics of loyalty is treason, or the willful betrayal of one’s own cause and the community of people who serve it. Such a betrayal is moral suicide for it threatens to destroy the network of purposes and social relationships that define the traitor’s self. The traitor is in what Royce calls “the hell of the irrevocable” (Royce, 2001 [1913], 162). Royce seeks an explanation of atonement which acknowledges this irrevocable nature of the deed that has been done, and which changes everything for the sinner and the community that has been harmed. None of the tradition Christian accounts of atonement are satisfactory. The answer, for Royce, is that the act of atonement can only be accomplished by the community, or on the behalf of the community, through the steadfast “loyal servant who acts, so to speak, as the incarnation of the spirit of the community itself” (Royce, 2011 [1913], 180). This person serves as a mediating party between the traitor and the betrayed community and through the atoning act genuine community is restored and all the individuals may emerge as wiser, more commitment servants of their common cause. Things are not the same as before the treason but, in fact, transformed and better. Royce then states what he believes to be the central postulate of the highest form of human spirituality, namely, that “No baseness or cruelty of treason so deep or so tragic shall enter our human world, but that loyal love shall be able in due time to oppose to just that deed of treason its fitting deed of atonement” (Royce 2001 [1913], 186). In the spirit of James, Royce asserts that this postulate cannot be proven true, but human communities can assert it and act upon it as it is were true.

e. Logic

Royce pursued his interest in logic, mathematics and science throughout his career. His first published book was a Primer of Logical Analysis for the Use of Composition Students, written for his students in California in 1881. His own proposal for a system of formal logic was published as “The Relation of the Principles of Logic to the Foundations of Geometry” in 1905, a work later extended in 1914. Among his last writings were a series of encyclopedia articles on logical topics: Axiom, Error and Truth, Mind, Negation, and Order (all reprinted in Robinson, 1951).

In addition to his discussion of science in The Religious Aspect of Philosophy, The World and the Individual, and in The Problem of Christianity, Royce published a series of articles on scientific method:” The Mechanical, Historical and the Statistical,” “The Social Character of Scientific Inquiry,” (in Josiah Royce’s Late Writings); Hypotheses and Leading Ideas,” and Introduction to H. Poincaré, The Foundations of Science (reprinted in Robinson). Like Peirce, Royce argued for the self-corrective nature of the scientific method; the necessity of experience as the starting point of inquiry; an emphasis on scientific instinct and imaginative judgment in forming hypotheses; the requirement of a proper motive – the search for truth and not fame or profit – as a necessary condition, along with the correct method, for the success of science and essence of science, the notion of science as a communal endeavor, dependent on the contributions of many others, past, present and future, and finally for the thoroughly human and fallible nature of science.

3. References and Further Reading

a. Primary Sources

  • Royce, Josiah, Primer of Logical Analysis for the Use of Composition Students, San Francisco, California: A.L. Bancroft and Co., 1881.
  • Royce, Josiah, The Religious Aspect of Philosophy: A critique of the Bases of Conduct and Faith, Boston, New York: Houghton Mifflin & Co., 1885.
  • Royce, Josiah, California from the Conquest in 1846 to the Second Vigilance Committee in San Francisco [1856]: A Study of American Character, Boston and New York: Houghton, Mifflin and Co. 1886.
  • Royce, Josiah, The Spirit of Modern Philosophy: An Essay in the Form of Lectures, Boston: Houghton Mifflin, 1892.
  • Royce, Josiah, The Conception of God: A Philosophical Discussion Concerning the Nature of the Divine Idea as a Demonstrable Reality, New York: The Macmillan Co, 1897. This includes commentary by Joseph LeConte, George Holmes Howison, and Sidney Edward Mezes.
  • Royce, Josiah, Studies in Good and Evil, New York, Appleton, 1898.
  • Royce, Josiah, The World and the Individual, 2 vols., New York: Macmillan, 1899.
  • Royce, Josiah, Outlines of Psychology: An Elementary Treatise with Some Practical Applications, New York: Macmillan, 1903.
  • Royce, Josiah, The Philosophy of Loyalty, New York: Macmillan, 1908.
  • Royce, Josiah, Race Questions, Provincialism, and Other American Problems, New York: Macmillan, 1908.
  • Royce, Josiah, William James and Other Essays on the Philosophy of Life, New York: Macmillan, 1911.
  • Royce, Josiah, The Sources of Religious Insight, Washington, D.C.: Catholic University of America Press, 2001 [1913]. Also at http://www.iupui.edu/~iat/royce/.
  • Royce, Josiah, The Problem of Christianity, Washington, D.C.: Catholic University of America Press, 2001 [1913].
  • Royce, Josiah, War and Insurance, New York: Macmillan, 1914.
  • Royce, Josiah, The Hope of the Great Community, New York: Macmillan, 1916.

b. Published Editions

  • Robinson, D.S., ed. Royce’s Logical Essays: Collected Logical Essays of Josiah Royce, Dubuque, Iowa: W. C. Brown, 1951.
  • McDermott, J.J., ed. The Basic Writings of Josiah Royce, New York: Fordham University Press, 2 vols., 2005 [1969].
  • Clendenning, J., Ed. The Letters of Josiah Royce, Chicago: University of Chicago Press, 1970.
  • Oppenheim, F., ed. Josiah Royce’s Late Writings: A Collection of Unpublished and Scattered Works, Bristol: Thoemmes Press, 2 vols.

c. Secondary Sources

  • Auxier, R, ed., Critical Responses to Josiah Royce, 1885-1916, Bristol: Thoemmes Press, 3 vols., 2000.
  • Auxier, R, Time, Will, and Purpose: Living Ideas From the Philosophy of Josiah Royce, Open Court, 2011.
  • Clendenning, The Life and Thought of Josiah Royce, revised and expanded ed., Nashville, Tennessee: Vanderbilt University Press, 1999.
  • Kegley, J., Genuine Individuals and Genuine Communities: A Roycean Public Philosophy, Nashville, Tennessee, 1997.
  • Kegley, J. Josiah Royce in Focus, Bloomington, Indiana: Indiana University Press, 2008.
  • Kiklick, B., Josiah Royce: An Intellectual Biography, Indianapolis, Indiana: Hackett Publishing Company, Inc., 1985.
  • Marcel, G., Royce’s Metaphysics, trans. V. and G. Ringer, Chicago: Henry Regnery Company, 1956. This was originally published as La Mėtaphysique de Royce, Paris, 1945.
  • Oppenheim, F.M. Royce’s Voyage Down Under: A Journey of the Mind, Lexington: University of Kentucky Press, 1980.
  • Oppenheim, F.M., Royce’s Mature Philosophy of Religion, Notre Dame, Indiana: University of Notre Dame Press, 1987.
  • Oppenheim, F.M., Royce’s Mature Ethics, Notre Dame, Indiana: University of Notre Dame Press, 1993.
  • Oppenheim, F.M., Reverence for the Relations of Life: Re-examining Pragmatism via Josiah Royce’s Interactions with Peirce, James, and Dewey, Notre Dame, Indiana: University of Notre Dame Press, 2005.
  • Smith, J.E. Royce South Infinite: The Community of Interpretation, Hamden, Conn.: Archon Books, 1969.
  • Tunstall, Dwayne, Yes, but Not Quite: Encountering Josiah Royce’s Ethico-Religious Insight, New York: Fordham University Press, 2009.
  • Trotter, G., On Royce, Belmont, California: Wadsworth, 2001.

Author Information

Jacquelyn Ann K. Kegley
Email: jkegley@csub.edu
California State University Bakersfield
U. S. A.

Arthur Schopenhauer (1788—1860)

Arthur Schopenhauer has been dubbed the artist’s philosopher on account of the inspiration his aesthetics has provided to artists of all stripes. He is also known as the philosopher of pessimism, as he articulated a worldview that challenges the value of existence. His elegant and muscular prose earns him a reputation as one of the greatest German stylists. Although he never achieved the fame of such post-Kantian philosophers as Johann Gottlieb Fichte and G.W.F. Hegel in his lifetime, his thought informed the work of such luminaries as Sigmund Freud, Ludwig Wittgenstein and, most famously, Friedrich Nietzsche. He is also known as the first German philosopher to incorporate Eastern thought into his writings.

Schopenhauer’s thought is iconoclastic for a number of reasons. Although he considered himself Kant’s only true philosophical heir, he argued that the world was essentially irrational. Writing in the era of German Romanticism, he developed an aesthetics that was classicist in its emphasis on the eternal. When German philosophers were entrenched in the universities and immersed in the theological concerns of the time, Schopenhauer was an atheist who stayed outside the academic profession.

Schopenhauer’s lack of recognition during most of his lifetime may have been due to the iconoclasm of his thought, but it was probably also partly due to his irascible and stubborn temperament. The diatribes against Hegel and Fichte peppered throughout his works provide evidence of his state of mind. Regardless of the reason Schopenhauer’s philosophy was overlooked for so long, he fully deserves the prestige he enjoyed altogether too late in his life.

Table of Contents

  1. Schopenhauer’s Life
  2. Schopenhauer’s Thought
    1. The World as Will and Representation
      1. Schopenhauer’s Metaphysics and Epistemology
      2. The Ideas and Schopenhauer’s Aesthetics
    2. The Human Will
      1. Agency and Freedom
      2. Ethics
  3. Schopenhauer’s Pessimism
  4. References and Further Reading
    1. Primary Sources Available in English
    2. Secondary Sources

1. Schopenhauer’s Life

Arthur Schopenhauer was born on February 22, 1788 in Danzig (now Gdansk, Poland) to a prosperous merchant, Heinrich Floris Schopenhauer, and his much younger wife, Johanna. The family moved to Hamburg when Schopenhauer was five, because his father, a proponent of enlightenment and republican ideals, found Danzig unsuitable after the Prussian annexation. His father wanted Arthur to become a cosmopolitan merchant like himself and hence traveled with Arthur extensively in his youth. His father also arranged for Arthur to live with a French family for two years when he was nine, which allowed Arthur to become fluent in French. From an early age, Arthur wanted to pursue the life of a scholar. Rather than force him into his own career, Heinrich offered a proposition to Arthur: the boy could either accompany his parents on a tour of Europe, after which time he would apprentice with a merchant, or he could attend a gymnasium in preparation for attending university. Arthur chose the former option, and his witnessing firsthand on this trip the profound suffering of the poor helped shape his pessimistic philosophical worldview.

After returning from his travels, Arthur began apprenticing with a merchant in preparation for his career. When Arthur was 17 years old, his father died, most likely as a result of suicide. Upon his death, Arthur, his sister Adele, and his mother were each left a sizable inheritance. Two years following his father’s death, with the encouragement of his mother, Schopenhauer freed himself of his obligation to honor the wishes of his father, and he began attending a gymnasium in Gotha. He was an extraordinary pupil: he mastered Greek and Latin while there, but was dismissed from the school for lampooning a teacher.

In the meantime his mother, who was by all accounts not happy in the marriage, used her newfound freedom to move to Weimar and become engaged in the social and intellectual life of the city. She met with great success there, both as a writer and as a hostess, and her salon became the center of the intellectual life of the city with such luminaries as Johann Wolfgang von Goethe, the Schlegel brothers (Karl Wilhelm Friedrich and August Wilhelm), and Christoph Martin Wieland regularly in attendance. Johanna’s success had a bearing on Arthur’s future, for she introduced him to Goethe, which eventually led to their collaboration on a theory of colors. At one of his mother’s gatherings, Schopenhauer also met the Orientalist scholar Friedrich Majer, who stimulated in Arthur a lifelong interest in Eastern thought. At the same time, Johanna and Arthur never got along well: she found him morose and overly critical and he regarded her as a superficial social climber. The tensions between them reached its peak when Arthur was 30 years old, at which time she requested that he never contact her again.

Before his break with his mother, Arthur matriculated to the University of Göttingen in 1809, where he enrolled in the study of medicine. In his third semester at Göttingen, Arthur decided to dedicate himself to the study of philosophy, for in his words: “Life is an unpleasant business… I have resolved to spend mine reflecting on it.” Schopenhauer studied philosophy under the tutelage of Gottlieb Ernst Schultz, whose major work was a critical commentary of Kant’s system of transcendental idealism. Schultz insisted that Schopenhauer begin his study of philosophy by reading the works of Immanuel Kant and Plato, the two thinkers who became the most influential philosophers in the development of his own mature thought. Schopenhauer also began a study of the works of Friedrich Wilhelm Joseph von Schelling, of whose thought he became deeply critical.

Schopenhauer transferred to Berlin University in 1811 for the purpose of attending the lectures of Johann Gottlieb Fichte, who at the time was considered the most exciting and important German philosopher of his day. Schopenhauer also attended Friedrich Schleiermacher’s lectures, for Schleiermacher was regarded as a highly competent translator and commentator of Plato. Schopenhauer became disillusioned with both thinkers, and with university intellectual life in general, which he regarded as unnecessarily abstruse, removed from genuine philosophical concerns, and compromised by theological agendas.

Napoleon’s Grande Armee arrived in Berlin in 1813, and soon after Schopenhauer moved to Rudolstat, a small town near Weimar, in order to escape the political turmoil. There Schopenhauer wrote his doctoral dissertation, The Fourfold Root of the Principle of Sufficient Reason, in which he provided a systematic investigation of the principle of sufficient reason. He regarded his project as a response to Kant who, in delineating the categories, neglected to attend to the forms that ground them. The following year Schopenhauer settled in Dresden, hoping that the quiet bucolic surroundings and rich intellectual resources found there would foster the development of his philosophical system. Schopenhauer also began an intense study of Baruch Spinoza, whose notion of natura naturans, a notion that characterized nature as self-activity, became key to the formulation of his account of the will in his mature system.

During his time in Dresden, he wrote On Vision and Colors, the product of his collaboration with Goethe. In this work, he used Goethe’s theory as a starting point in order to provide a theory superior to that of his mentor. Schopenhauer’s relationship with Goethe became strained after Goethe became aware of the publication. During his time in Dresden, Schopenhauer dedicated himself to completing his philosophical system, a system that combined Kant’s transcendental idealism with Schopenhauer’s original insight that the will is the thing-in-itself. He published his major work that expounded this system, The World as Will and Representation, in December of 1818 (with a publication date of 1819). To Schopenhauer’s chagrin, the book made no impression on the public.

In 1820, Schopenhauer was awarded permission to lecture at the University of Berlin. He deliberately, and impudently, scheduled his lectures during the same hour as those of G.W.F. Hegel, who was the most distinguished member of the faculty. Only a handful of students attended Schopenhauer’s lectures while over 200 students attended the lectures of Hegel. Although he remained on the list of lecturers for many years in Berlin, no one showed any further interest in attending his lectures, which only fueled his contempt for academic philosophy.

The following decade was perhaps Schopenhauer’s darkest and least productive. Not only did he suffer from the lack of recognition that his groundbreaking philosophy received, but he also suffered from a variety illnesses. He attempted to make a career as a translator from French and English prose, but these attempts also met with little interest from the outside world. During this time Schopenhauer also lost a lawsuit to the seamstress Caroline Luise Marguet that began in 1821 and was settled five years later. Marguet accused Schopenhauer of beating and kicking her when she refused to leave the antechamber to his apartment. As a result of the suit, Schopenhauer had to pay her 60 thalers annually for the rest of her life.

In 1831, Schopenhauer fled Berlin because of a cholera epidemic (an epidemic that later took the life of Hegel) and settled in Frankfurt am Main, where he remained for the rest of his life. In Frankfurt, he again became productive, publishing a number of works that expounded various points in his philosophical system. He published On the Will in Nature in 1836, which explained how new developments in the physical sciences served as confirmation of his theory of the will. In 1839, he received public recognition for the first time, a prize awarded by the Norwegian Academy, on his essay, On the Freedom of the Human Will. In 1840 he submitted an essay entitled On the Basis of Morality to the Danish Academy, but was awarded no prize even though his essay was the only submission. In 1841, he published both essays under the title, The Fundamental Problems of Morality, and included an introduction that was little more than a scathing indictment of Danish Academy for failing to recognize the value of his insights.

Schopenhauer was able to publish an enlarged second edition to his major work in 1843, which more than doubled the size of the original edition. The new expanded edition earned Schopenhauer no more acclaim than the original work. He published a work of popular philosophical essays and aphorisms aimed at the general public in 1851 under the title, Parerga and Paralipomena (Secondary Works and Belated Observations). This work, the most unlikely of his books, earned him his fame, and from the most unlikely of places: a review written by the English scholar John Oxenford, entitled “Iconoclasm in German Philosophy,” which was translated into German. The review excited an interest in German readers, and Schopenhauer became famous virtually overnight. Schopenhauer spent the rest of his life reveling in his hard won and belated fame, and died in 1860.

2. Schopenhauer’s Thought

Schopenhauer’s philosophy stands apart from other German idealist philosophers in many respects. Perhaps most surprising for the first time reader of Schopenhauer familiar with the writings of other German idealists would be the clarity and elegance of his prose. Schopenhauer was an avid reader of the great stylists in England and France, and he tried to emulate their style in his own writings. Schopenhauer often charged more abstruse writers such as Fichte and Hegel with deliberate obfuscation, describing the latter as a scribbler of nonsense in his second edition of The Fourfold Root of the Principle of Sufficient Reason.

Schopenhauer’s philosophy also stands in contrast with his contemporaries insofar as his system remains virtually unchanged from its first articulation in the first edition of The World as Will and Representation. Even his dissertation, which he wrote before he recognized the role of the will in metaphysics, was incorporated into his mature system. For this reason, his thought has been arranged thematically rather than chronologically below.

a. The World as Will and Representation

i. Schopenhauer’s Metaphysics and Epistemology

The starting point for Schopenhauer’s metaphysics is Immanuel Kant’s system of transcendental idealism as explained in The Critique of Pure Reason. Although Schopenhauer is quite critical of much of the content of Kant’s Transcendental Analytic, he endorses Kant’s approach to metaphysics in Kant’s limiting the sphere of metaphysics to articulating the conditions of experience rather than transcending the bounds of experience. In addition, he accepts the results of the Transcendental Aesthetic, which demonstrate the truth of transcendental idealism. Like Kant, Schopenhauer argues that the phenomenal world is a representation, i.e., an object for the subject conditioned by the forms of our cognition. At the same time, Schopenhauer simplifies the activity of the Kantian cognitive apparatus by holding that all cognitive activity occurs according to the principle of sufficient reason, that is, that nothing is without a reason for being.

In Schopenhauer’s dissertation, which was published under the title The Fourfold Root of Sufficient Reason, he argues that all of our representations are connected according to one of the four manifestations of the principle of sufficient reason, each of which concerns a different class of objects. The principle of sufficient reason of becoming, which regards empirical objects, provides an explanation in terms of causal necessity: any material state presupposes a prior state from which it regularly follows. The principle of sufficient reason of knowing, which regards concepts or judgments, provides an explanation in terms of logical necessity: if a judgment is to be true, it must have a sufficient ground. Regarding the third branch of the principle, that of space and time, the ground for being is mathematical: space and time are so constituted that all their parts mutually determine one another. Finally, for the principle regarding willing, we require as a ground a motive, which is an inner cause for that which it was done. Every action presupposes a motive from which it follows by necessity.

Schopenhauer argues that prior philosophers, including Kant, have failed to recognize that the first manifestation and second manifestations are distinct, and subsequently tend to conflate logical grounds and causes. Moreover, philosophers have not heretofore recognized the principle’s operation in the realms of mathematics and human action. Thus Schopenhauer was confident that his dissertation not only would provide an invaluable corrective to prior accounts of the principle of sufficient reason, but would also allow every brand of explanation to acquire greater certainty and precision.

It should be noted that while Schopenhauer’s account of the principle of sufficient reason owes much to Kant’s account of the faculties, his account is significantly at odds with Kant’s in several ways. For Kant, the understanding always operates by means of concepts and judgments, and the faculties of understanding and reason are distinctly human (at least regarding those animate creatures with which we are familiar). Schopenhauer, however, asserts that the understanding is not conceptual and is a faculty that both animals and humans possess. In addition, Schopenhauer’s account of the fourth root of the principle of sufficient reason is at odds with Kant’s account of human freedom, for Schopenhauer argues that actions follow necessarily from their motives.

Schopenhauer incorporates his account of the principle of sufficient reason into the metaphysical system of his chief work, The World as Will and Representation. As we have seen, Schopenhauer, like Kant, holds that representations are always constituted by the forms of our cognition. However, Schopenhauer points out that there is an inner nature to phenomena that eludes the principle of sufficient reason. For example, etiology (the science of physical causes) describes the manner in which causality operates according to the principle of sufficient reason, but it cannot explain the natural forces that underlie and determine physical causality. All such forces remain, to use Schopenhauer’s term, “occult qualities.”

At the same time, there is one aspect of the world that is not given to us merely as representation, and that is our own bodies. We are aware of our bodies as objects in space and time, as a representation among other representations, but we also experience our bodies in quite a different way, as the felt experiences of our own intentional bodily motions (that is, kinesthesis). This felt awareness is distinct from the body’s spatio-temporal representation. Since we have insight into what we ourselves are aside from representation, we can extend this insight to every other representation as well. Thus, Schopenhauer concludes, the innermost nature [Innerste], the underlying force, of every representation and also of the world as a whole is the will, and every representation is an objectification of the will. In short, the will is the thing in itself. Thus Schopenhauer can assert that he has completed Kant’s project because he has successfully identified the thing in itself.

Although every representation is an expression of will, Schopenhauer denies that every item in the world acts intentionally or has consciousness of its own movements. The will is a blind, unconscious force that is present in all of nature. Only in its highest objectifications, that is, only in animals, does this blind force become conscious of its own activity. Although the conscious purposive striving that the term ‘will’ implies is not a fundamental feature of the will, conscious purposive striving is the manner in which we experience it and Schopenhauer chooses the term with this fact in mind.

Hence, the title of Schopenhauer’s major work, The World as Will and Representation, aptly summarizes his metaphysical system. The world is the world of representation, as a spatio-temporal universal of individuated objects, a world constituted by our own cognitive apparatus. At the same time, the inner being of this world, what is outside of our cognitive apparatus or what Kant calls the thing-in-itself, is the will; the original force manifested in every representation.

ii. The Ideas and Schopenhauer’s Aesthetics

Schopenhauer argues that space and time, which are the principles of individuation, are foreign to the thing-in-itself, for they are the modes of our cognition. For us, the will expresses itself in a variety of individuated beings, but the will in itself is an undivided unity. It is the same force at work in our own willing, in the movements of animals, of plants and of inorganic bodies.

Yet, if the world is composed of undifferentiated willing, why does this force manifest itself in such a vast variety of ways? Schopenhauer’s reply is that the will is objectified in a hierarchy of beings. At its lowest grade, we see the will objectified in natural forces, and at its highest grade the will is objectified in the species of human being. The phenomena of higher grades of the will are produced by conflicts occurring between different phenomena of the lower grades of the will, and in the phenomenon of the higher Idea, the lower grades are subsumed. For instance, the laws of chemistry and gravity continue to operate in animals, although such lower grades cannot explain fully their movements. Although Schopenhauer explains the grades of the will in terms of development, he insists that the gradations did not develop over time, for such an understanding would assume that time exists independently of our cognitive faculties. Thus in all natural beings we see the will expressing itself in its various objectifications. Schopenhauer identifies these objectifications with the Platonic Ideas for a number of reasons. They are outside of space and time, related to individual beings as their prototypes, and ontologically prior to the individual beings that correspond to them.

Although the laws of nature presuppose the Ideas, we cannot intuit the Ideas simply by observing the activities of nature, and this is due to the relation of the will to our representations. The will is the thing in itself, but our experience of the will, our representations, are constituted by our form of cognition, the principle of sufficient reason. The principle of sufficient reason produces the world of representation as a nexus of spatio-temporal, causally related entities. Therefore, Schopenhauer’s metaphysical system seems to preclude our having access to the Ideas as they are in themselves, or in a way that transcends this spatio-temporal causally related framework.

However, Schopenhauer asserts that there is a kind of knowing that is free from the principle of sufficient reason. To have knowledge that is not conditioned by our forms of cognition would be an impossibility for Kant. Schopenhauer makes such knowledge possible by distinguishing the conditions of knowing, namely, the principle of sufficient reason, from the condition for objectivity in general. To be an object for a subject is a condition of objects that is more basic than the principle of sufficient reason for Schopenhauer. Since the principle of sufficient reason allows us to experience objects as particulars existing in space and time with a causal relation to other things, to have an experience of an object solely insofar as it presents itself to a subject, apart from the principle of sufficient reason, is to experience an object that is neither spatio-temporal nor in a causal relation to other objects. Such objects are the Ideas, and the kind of cognition involved in perceiving them is aesthetic contemplation, for perception of the Ideas is the experience of the beautiful.

Schopenhauer argues that the ability to transcend the everyday point of view and regard objects of nature aesthetically is not available to most human beings. Rather, the ability to regard nature aesthetically is the hallmark of the genius, and Schopenhauer describes the content of art through an examination of genius. The genius, claims Schopenhauer, is one who has been given by nature a superfluity of intellect over will. For Schopenhauer, the intellect is designed to serve the will. Since in living organisms, the will manifests itself as the drive for self-preservation, the intellect serves individual organisms by regulating their relations with the external world in order to secure their self-preservation. Because the intellect is designed to be entirely in service of the will, it slumbers, to use Schopenhauer’s colorful metaphor, unless the will awakens it and sets it in motion. Therefore ordinary knowledge always concerns the relations, laid down by the principle of sufficient reason, of objects in terms of the demands of the will.

Although the intellect exists only to serve the will, in certain humans the intellect accorded by nature is so disproportionately large, it far exceeds the amount needed to serve the will. In such individuals, the intellect can break free of the will and act independently. A person with such an intellect is a genius (only men can have such a capability according to Schopenhauer), and this will-free activity is aesthetic contemplation or creation. The genius is thus distinguished by his ability to engage in will-less contemplation of the Ideas for a sustained period of time, which allows him to repeat what he has apprehended by creating a work of art. In producing a work of art, the genius makes the beautiful accessible for the non-genius as well. Whereas non-geniuses cannot intuit the Ideas in nature, they can intuit them in a work of art, for the artist replicates nature in the artwork in such a manner that the viewer is capable of viewing it disinterestedly, that is, freed from her own willing, as an Idea.

Schopenhauer states that aesthetic contemplation is characterized by objectivity. The intellect in its normal functioning is in the service of the will. As such, our normal perception is always tainted by our subjective strivings. The aesthetic point of view, since it is freed from such strivings, is more objective than any other ways of regarding an object. Art does not transport the viewer to an imaginary or even ideal realm. Rather it affords the opportunity to view life without the distorting influence of his own will.

b. The Human Will: Agency, Freedom, and Ethical Action

i. Agency and Freedom

Any account of human agency in Schopenhauer must be given in terms of his account of the will. For Schopenhauer, all acts of will are bodily movements, and thus are not the internal cause of bodily movements. What distinguishes an act of will from other events, which are also expressions of the will, is that it meets two criteria: it is a bodily movement caused by a motive, and it is accompanied by a direct awareness of this movement. Schopenhauer provides both a psychological and physiological account of motives. In his psychological account, motives are causes that occur in the medium of cognition, or internal causes. Motives are mental events that arise in response to an awareness of some motivating object. Schopenhauer argues that these mental events can never be desires or emotions: desires and emotions are expressions of the will and thus are not included under the class of representations. Rather, a motive is the awareness of some object of representation. These representations can be abstract; thinking the concept of an object, or intuitive; perceiving an object. Thus Schopenhauer provides a causal picture of action, and it is one in which mental events cause physical events.

In Schopenhauer’s physiological account of motives, motives are brain processes that cause certain neural activities and these translate into bodily motion. The psychological and physical accounts are consistent insofar as Schopenhauer has a dual-aspect view of the mental and physical. The mental and the physical are not two causally linked realms, but two aspects of the same nature, where one cannot be reduced to or explained by the other. It is important to underscore the fact that in the physiological account, the will is not a function of the brain. Rather it is present as irritability in the muscular fibers of the whole body.

According to Schopenhauer, the will, as muscular irritability, is a continual striving for activity in general. Because this striving has no direction, it aims at all directions at once and thus produces no physical movement. However, when the nervous system provides the direction for this movement (that is, when motives act on the will), the movement is given direction and bodily movement occurs. The nerves do not move the muscles, rather they provide the occasion for the muscles’ movements.

The causal mechanism in acts of will is necessary and lawful, as are all causal relations in Schopenhauer’s view. Acts of will follow from motives with the same necessity that the motion of a billiard ball follows from its being struck. Yet this account leads to a problem concerning the unpredictability of acts: if the causal process is law governed, and if acts of will are causally determined, Schopenhauer must account for the fact that human actions are unpredictable. This unpredictability of human action, he argues, is due to the impossibility of knowing comprehensively the character of an individual. Each character is unique, and thus it is impossible to predict fully how a motive or set of motives will effect bodily motion. In addition, we usually do not know what a person’s beliefs are concerning the motive, and these beliefs influence how she will respond to it. However, if we had a full account of a person’s character as well as her beliefs, we could with scientific accuracy predict what bodily motion would result from a particular motive.

Schopenhauer distinguishes between causation that occurs through stimuli, which is mechanistic, and that which occurs through motives. Each kind of causality occurs with necessity and lawfulness. The difference between these different classifications of causes regards the commensurability and proximity of cause and the effect, not their degree of lawfulness. In mechanical causation, the cause is contiguous and commensurate to the effect, both cause and effect are easily perceived, and therefore their causal lawfulness is clear. For instance, a billiard ball must be struck in order to move, and the force in which one ball hits will be equal to the force in which the other ball moves. In stimuli, causes are proximate: there is no separation between receiving the impression and being determined by it. At the same time, cause and effect are not always commensurate: for instance, when a plant reaches up to the sun, the sun as cause makes no motion to produce the effect of the plant’s movement. In motive causality, the cause is neither proximate nor commensurate: the memory of Helen can cause whole armies to run to battle, for instance. Consequently the lawfulness in motive causality is difficult, if not impossible, to perceive.

Because human action is causally determined, Schopenhauer denies that humans can freely choose how they respond to motives. In any course of events, one and only one course of action is available to the agent, and the agent performs that action with necessity. Schopenhauer must, then, account for the fact that agents experience their own actions as contingent. Moreover, he must account for the active nature of agency, the fact that agents experience their actions as things they do and not things that happen to them.

Schopenhauer gives an explanation of the active nature of agency, but not in terms of the causal efficacy of agents. Instead, the key to accounting for human agency lies in the distinction between one’s intelligible and empirical character. Our intelligible character is our character outside of space and time, and is the original force of the will. We cannot have access to our intelligible character, as it exists outside our forms of knowing. Like all forces in nature, it is original, inalterable and inexplicable. Our empirical character is our character insofar as it manifests itself in individual acts of will: it is, in short, the phenomenon of the intelligible character. The empirical character is an object of experience and thus tied to the forms of experience, namely space, time and causality.

However, the intelligible character is not determined by these forms, and thus is free. Schopenhauer calls this freedom transcendental, as it is outside the realm of experience. Although we can have no experience of our intelligible character, we do have some awareness of the fact that our actions issue from it and thus are very much our own. This awareness accounts for our experiencing our deeds as both original and spontaneous. Thus our deeds are both events linked with other events in a lawfully determined causal chain and acts that issue directly from our own characters. Our actions can embody both these otherwise contradictory characterizations because these characterizations refer to the deeds from two different aspects of our characters, the empirical and the intelligible.

Our characters also explain why we attribute moral responsibility to agents even though acts are causally necessitated. Characters determine the consequences that motives effect on our bodies. Yet, states Schopenhauer, our characters are entirely our own: our characters are fundamentally what we are. This is why we assign praise or blame not to acts but to the agents who commit them. And this is why we hold ourselves responsible: not because we could have acted differently given who we are, but that we could have been different from who we are. Although there is not freedom in our action, there is freedom in our essence, our intelligible character, insofar as our essence lies outside the forms of our cognition, that is to say, space, time and causality.

ii. Ethics

Like Kant, Schopenhauer reconciles freedom and necessity in human action through the distinction between the phenomenal and noumenal realms. However, he was sharply critical of Kant’s deontological framework. Schopenhauer charged Kant with committing a petitio principii, for he assumed at the outset of his ethics that purely moral laws and then constructed an ethics to account for such laws. Schopenhauer argues, however, that Kant provides no proof for the existence of such laws. Indeed, Schopenhauer avers that no such laws, which have their basis in theological assumptions, exist. Likewise, Schopenhauer attacks Kant’s account of morality as characterized by an unconditioned ought. The notion of ‘ought’ only carries motivational force when accompanied by the threat of sanctions. Because no ought can be unconditioned insofar as its motivational force stems from its implicit threat of punishment, all imperatives are in fact, according to Schopenhauer, hypothetical.

Nor does Schopenhauer accept Kant’s claim that morality derives from reason: like David Hume, Schopenhauer regards reason as instrumental. The origins of morality are not found in reason, but rather in the feeling of compassion that allows one to transcend the standpoint of egoism. The dictum of morality is “Harm no one and help others as much as you can.” Most persons operate exclusively from egoistic motives, for, as Schopenhauer explains, our knowledge of our own weal and woe is direct, while our knowledge of the weal and woe of others is always only representation and thus does not affect us.

Although most persons are motivated primarily by egoistic concerns, certain rare persons can act from compassion, and it is compassion that forms the basis of Schopenhauer’s ethics. Compassion is prompted by the awareness of the suffering of another person, and Schopenhauer characterizes it as a kind of felt knowledge. Compassion is born of the awareness that individuation is merely phenomenal. Consequently the ethical point of view expresses a deeper knowledge than what is found in the ordinary manner of viewing the world. Indeed, the feeling of compassion is nothing other than the felt knowledge that the suffering of another has a reality equal to one’s own suffering insofar as the world in itself is an undifferentiated unity. Schopenhauer asserts that this knowledge cannot be taught or even communicated, but can only be brought about by experience.

Since compassion is the basis of Schopenhauer’s ethics, the ethical significance of conduct is found in the motive alone, an aspect of his ethics that finds affinity with Kant. Thus Schopenhauer distinguishes the just person from the good person not by the nature of their actions, but by their level of compassion: the just person sees through the principle of individuation enough to avoid causing harm to another, whereas the good person sees through it even further, to the point that the suffering he sees in others touches him almost as closely as does his own. Such a person not only avoids harming others, but actively tries to alleviate the suffering of others. At its highest point, someone may recognize the suffering of others with such clarity that he is willing to sacrifice his own well-being for the sake of others, if by doing so the suffering he will alleviate outweighs the suffering he must endure. This, says Schopenhauer, is the highest point in ethical conduct.

3. Schopenhauer’s Pessimism

Schopenhauer’s pessimism is the most well known feature of his philosophy, and he is often referred to as the philosopher of pessimism. Schopenhauer’s pessimistic vision follows from his account of the inner nature of the world as aimless blind striving.

Because the will has no goal or purpose, the will’s satisfaction is impossible. The will objectifies itself in a hierarchy of gradations from inorganic to organic life, and every grade of objectification of the will, from gravity to animal motion, is marked by insatiable striving. In addition, every force of nature and every organic form of nature participates in a struggle to seize matter from other forces or organisms. Thus existence is marked by conflict, struggle and dissatisfaction.

The attainment of a goal or desire, Schopenhauer continues, results in satisfaction, whereas the frustration of such attainment results in suffering. Since existence is marked by want or deficiency, and since satisfaction of this want is unsustainable, existence is characterized by suffering. This conclusion holds for all of nature, including inanimate natures, insofar as they are at essence will. However, suffering is more conspicuous in the life of human beings because of their intellectual capacities. Rather than serving as a relief from suffering, the intellect of human beings brings home their suffering with greater clarity and consciousness. Even with the use of reason, human beings can in no way alter the degree of misery we experience; indeed, reason only magnifies the degree to which we suffer. Thus all the ordinary pursuits of mankind are not only fruitless but also illusory insofar as they are oriented toward satisfying an insatiable, blind will.

Since the essence of existence is insatiable striving, and insatiable striving is suffering, Schopenhauer concludes that nonexistence is preferable to existence. However, suicide is not the answer. One cannot resolve the problem of existence through suicide, for since all existence is suffering, death does not end one’s suffering but only terminates the form that one’s suffering takes. The proper response to recognizing that all existence is suffering is to turn away from or renounce one’s own desiring. In this respect, Schopenhauer’s thought finds confirmation in the Eastern texts he read and admired: the goal of human life is to turn away from desire. Salvation can only be found in resignation.

4. References and Further Reading

a. Primary Sources Available in English

  • Manuscript Remains in Four Volumes. Edited by Arthur Hübscher, Translated by E.F.J. Payne. Oxford: Berg Publishers, 1988.
  • On the Fourfold Root of the Principle of Sufficient Reason. Translated by E.F.J. Payne. LaSalle: Open Court Press, 1997.
  • On the Basis of Morality. Translated by E.F.J. Payne. Indianapolis: The Bobbs Merrill Company, 1965.
  • On the Will in Nature. Translated by E.F.J. Payne, Edited by David Cartwright. New York: Berg Publishers, 1992.
  • Parerga and Paralipomena Volumes 1 and II. Translated by E.F.J. Payne. Oxford: Oxford University Press, 2000.
  • Prize Essay on the Freedom of the Will. Edited by Gunther Zoller, Translated by E.F. J. Payne. Cambridge: Cambridge University Press, 1999.
  • The World as Will and Representation. Translated by E.F.J. Payne, 2 vols. New York: Dover, 1969.

b. Secondary Sources

  • Atwell, John E. Schopenhauer: The Human Character . Philadelphia: Temple University Press, 1990.
    • Provides a lucid account of Schopenhauer’s ethics and pessimism.
  • Atwell, John E. Schopenhauer on the Character of the World: The Metaphysics of Will. Berkeley: University of California Press, 1995.
    • An excellent and comprehensive account of Schopenhauer’s metaphysics and epistemology that brings new insight into Schopenhauer’s methodology.
  • Cartwright, David E. Schopenhauer: A Biography. Cambridge: Cambridge University Press, 2010.
    • The most comprehensive biography of Schopenhauer available in English.
  • Copleston, Frederick. Arthur Schopenhauer, Philosopher of Pessimism. London: Barnes and Noble, 1975.
    • The first book length monograph on Schopenhauer written in English.
  • Hamlyn, D.W. Schopenhauer. London: Routledge & Kegan Paul, 1980.
    • A brief but substantive critical analysis of his thought that includes a strong summary of his dissertation as well as his relationship to Kant.
  • Hübscher, Arthur, The Philosophy of Schopenhauer in Its Intellectual Context: Thinker Against the Tide. Translated by Joachim T. Baer and David E. Cartwright. Lewiston, N.Y : Edwin Mellon Press, 1989.
    • An excellent intellectual biography, extensively covers his earliest (pre-dissertation) thought and the influences of German romanticism and idealism.
  • Jacquette, Dale, ed. Schopenhauer, Philosophy, and the Arts. Cambridge: Cambridge University Press, 1996.
    • A collection of essays on both Schopenhauer’s aesthetics and the influence his aesthetics had on later artists.
  • Janaway, Christopher, ed. Willing and Nothingness: Schopenhauer as Nietzsche’s Educator. Oxford; Clarendon Press, 1998.
    • These essays explore Schopenhauer’s influence on Nietzsche. The book includes a complete list of textual references to Schopenhauer in Nietzsche’s writings.
  • Magee, Bryan. The Philosophy of Schopenhauer. Oxford: Carendon Press, 1983.
    • Covers the whole of Schopenhauer’s thought, as well as an extensive account on his influence on later thinkers and artists such as Wagner and Wittgenstein.
  • Safranski, Ruediger, Schopenhauer and the Wild Years of Philosophy. Translated by Ewald Osers, London: Weidenfeld and Nicolson, 1989.
    • An entertaining biography that provides insight into the political and cultural milieu in which Schopenhauer developed his thought.
  • Young, Julian, Willing and Unwilling: A Study in the Philosophy of Arthur Schopenhauer. Dordrecht: Martinus Nijhoff, 1987.
    • An influential reading of Schopenhauer’s work, which argues that Schopenhauer’s account of the thing-in-itself cannot be wholly identified with the will.

Author Information

Mary Troxell
Email: mary.troxell.1@bc.edu
Boston College
U. S. A.

Epistemic Entitlement

In the early 1990s there emerged a growing interest with the concept of epistemic entitlement. Philosophers who acknowledge the existence of entitlements maintain that there are beliefs or judgments unsupported by evidence available to the subject, but which the subject nonetheless has the epistemic right to hold. Some of these may include beliefs non-inferentially sourced in perception, memory, introspection, testimony, and the a priori.  Unlike the traditional notion of justification, entitlement is often characterized as an externalist type of epistemic warrant, in the sense that a subject’s being entitled is determined by facts and circumstances that are independent of any reasoning capacities she may or may not have, and which the subject herself need not understand or be able to recognize. One key motivation for this view is that the inclusion of entitlement in epistemology can account for the commonly held intuition that largely unreflective individuals, such as children and non-human animals, possess warrant and basic knowledge about the world.  It also paves the way for a tenable foundationalist epistemology, according to which there exist warranted beliefs which are not themselves warranted or justified by any further beliefs.  This article explores theories of entitlement as presented by four prominent philosophers: Fred Dretske, Tyler Burge, Crispin Wright, and Christopher Peacocke.

Table of Contents

  1. Dretske on Entitlement
  2. Burge on Entitlement
  3. Wright on Entitlement
  4. Peacocke on Entitlement
  5. Conclusion
  6. References and Further Reading

1. Dretske on Entitlement

Fred Dretske has argued for the existence of epistemic entitlements.  Like justification, entitlement is an epistemic property of belief which, when the belief is true (and the subject is not in a Gettier-style situation), constitutes knowledge.  Much of Dretske’s work on this topic rises out of his (2000) article: “Entitlements: Epistemic Rights without Epistemic Duties?”  Part of understanding his view involves having a grasp of the question being posed in the article’s title.  Unlike justifications, entitlements are epistemic rights we have to believe various propositions without having any added requirements to engage, or be able to engage, in some sophisticated mental exercise.  Considering what is included in the conception of justification, the traditional answer suggests that being justified is a matter of fulfilling various epistemic duties or obligations.  Some of these duties may include: gathering ample evidence, arriving at beliefs carefully and methodically, deliberating over the strength of one’s supportive reasons, and so forth.  For instance, if it is true that Detective Johnson is justified in believing

GUILTY: Smith is guilty of the murder,

then it must be that she has good reasons for thinking GUILTY true, reasons of which she herself is aware and could appeal to if she needed to defend her belief.  Johnson’s reasons may be, for example, that Smith’s fingerprints were on the murder weapon and Smith had strong motive to kill.  Though these sorts of epistemic requirements obtain for beliefs like GUILTY (as we shall assume), Dretske thinks there are other sorts of beliefs you and I have which: (1) amount to genuine knowledge, but that also (2) are not justified in the sense just described.  Consider your perceptual belief that

BRIGHT: There is a bright screen ahead of me,

or a memorial belief that

ATE: I ate breakfast this morning.

Given that you know these propositions, it seems too strong a condition that you must recognize that which provides the epistemic support for these beliefs, or be in a position to defend them by way of some argument.  Additionally, many philosophers think that children and lower animals have perceptual and memorial knowledge, yet they lack the conceptual skills needed to be able to think about what it is that gives them this knowledge.  The point is that the conditions for acquiring a justification for a belief like GUILTY are harder to satisfy than they are for acquiring an entitlement for a belief like BRIGHT.  Thus, Dretske holds that there exists a kind of epistemic good that attaches to our various perceptual beliefs and memorial beliefs (and perhaps others) but which is distinct from the traditional notion of justification.  This epistemic good is what he calls entitlement.  To have an entitlement is to have the epistemic right to believe a proposition in which the possessor need not be consciously reflective of that which awards the entitlement.  (Notice that the relevant kinds of beliefs Dretske has in mind, like BRIGHT or ATE, are about things in the outer world, not our own conscious experiences or memories.)

In this way epistemic entitlements are analogous to various legal or constitutional rights we have.  There is no added physical effort or labor you had to exert to be awarded the right to vote in the U.S. presidential election; you have this right solely in virtue of having the status of being a U.S. citizen and at least eighteen years old.  Indeed, you have this right whether or not you know that you do.  The same goes for one’s right to the property bequeathed by a passing family member.  The recipient did not have to do anything to “earn” her entitlement to that property; that she is simply listed in the will suffices.  Similarly, what gives people the epistemic right, or entitlement, to hold various beliefs is determined by circumstances independent of any reasoning capacities they may or may not have.  A subject may have an entitlement to her belief even if she is unaware that she has it.

Before proceeding, note that Dretske is operating with a narrow characterization of justification.  He thinks that justification is an epistemically internalist notion in the sense that to be justified in holding a belief, there must be some proposition (or propositions) one already believes and which one has reflective access to and could be used to justify that belief.  On the other hand, entitlement receives an externalist characterization in the sense that one can have an entitlement without having access to that which makes one entitled.  It is worth mentioning, however, that not all epistemologists understand justification in the way Dretske does.  Whereas for Dretske, the things that do the justifying are other beliefs one has, some internalists allow other sorts of mental states, like perceptual experiences and memories, to count as reasons or justifications (Audi 1993 and Pryor 2000).  Also, some philosophers prefer to use ‘internalism’ simply as the claim that the things that justify a belief are internal to one’s mental life.  One could be an internalist about justification in this weaker sense, which is commonly referred to as mentalism (see Conee and Feldman 2003), without requiring that one be able to access or become aware of one’s justifying mental states or to believe that these states are reliable.  Thus, according to a mentalist, a child may be justified in believing BRIGHT, provided that it is properly grounded in her experience of a bright screen ahead, even if the child does not herself recognize her experience as that which makes her justified.  The point is that Dretskean entitlement, if it is a distinctive epistemic concept, differs from justification only as understood in Dretske’s restricted sense of the term as that of being backed by other beliefs to which one could appeal in support of one’s belief.

The main question Dretske wishes to answer is: if there are things we are entitled to accept as true, what grounds this entitlement?  In other words, what entitles us to believe various propositions?  It is important to emphasize that these are questions posed specifically to philosophers; entitled subjects themselves need not know, and quite often do not know, the answer to them.  Dretske begins his answer by distancing his view of entitlement from a pure reliabilist view that one has the epistemic right to the belief that p if and only if the belief that p was produced by a reliable process.  For a pure reliabilist, given that your perceptual system produced your belief that BRIGHT, and assuming that perception yields true beliefs on most occasions, you are epistemically permitted to hold this belief.  But according to Dretske, there are (at least) two reasons why entitlement must not be associated with this form of reliabilism.  First, one can surely hold what is in fact a reliably-produced belief while at the same time having independent reasons for doubting its truth (for example, you have evidence that you’ve taken a mind-altering drug, yet you continue to trust your senses as they indicate a three-foot bumble bee).  To maintain the belief in such a situation would, intuitively, be irrational on the subject’s part, which, it seems, would thereby strip the subject of her epistemic right to that belief.  However, by concentrating entirely on the reliability of a belief-forming process, a pure reliabilist cannot accommodate this intuition.  Entitlement must therefore be understood as a defeasible epistemic right, one that can be defeated by countervailing evidence.  Secondly, Dretske holds that entitlements supervene on the subjective resources of the subject.  This implies that two subjects with identical psychological make-ups cannot differ with respect to the entitlements they possess (For this reason, it is plausible that Dretskean entitlement has an element of internalism, but only of the weaker, mentalist sort described above).  For example, suppose there is some being who has the exact same experiences, beliefs, methods of reasoning, and so forth, as you do.  Suppose further that this “mental twin” of yours is, unbeknownst to her, nothing but a bodiless brain in a vat whose experiences are systematically being fed by a powerful supercomputer.  While your perceptual system is highly reliable (as we shall assume), your twin’s system is massively unreliable, because all of her experiences and corresponding beliefs are erroneous.  According to a pure reliabilist, then, whereas you have the epistemic right to believe BRIGHT when you have an experience of a bright screen before you, your envatted twin lacks the right to BRIGHT when she has the exact same experience.  Dretske, however, thinks that this is the wrong result. (The sort of case described here is commonly referred to as the New Evil Demon Problem, first introduced by Lehrer & Cohen 1983.)  He thinks it is intuitively correct that your twin has the right to believe whatever you have a right to believe, even if your beliefs are true and her’s false.  You have knowledge and your twin lacks it because knowledge requires truth and your twin’s beliefs are false; however, it seems that the unfortunate twin still retains some epistemic good.  The correct account of entitlement should allow for this intuition and therefore be suited to award entitlements to subjects who are in unfavorable, illusory environments. (Although the New Evil Demon Problem does pose a challenge to the pure version of reliabilism Dretske considers, it bears mentioning that several philosophers have advanced other versions of reliabilism which purport to avoid this problem.  See, for instance, Bach 1985, Goldman 1988, Comesaña 2002, Majors and Sawyer 2005.)

If the concept of entitlement is not grounded in the reliability of a belief-forming process, what is it grounded in?  Dretske argues that we have entitlements to beliefs which are psychologically immediate and irresistible to us.  There are certain sorts of beliefs over which we have no choice whether we hold them or not.  Perceptual beliefs are a prime example.  As you gaze your eyes on a computer screen, your visual experience of a bright screen causes you to believe BRIGHT.  The causal sequence between experience and belief is simply out of your control, and it occurs much too quickly to avoid it.  This fact about our psychologies, that we cannot resist some of our beliefs, is what accounts for our entitlements.  As Dretske says, “We have a right to accept what we are powerless to reject” (2000, p. 598).  In saying this, Dretske appeals to the well-known principle that ‘Ought implies Can’.  Given that you are unable to refrain from believing BRIGHT, it would be unreasonable to suppose that you epistemically ought not to believe it.  If this is correct, then we can see how your brain-in-a-vat twin is entitled to believe BRIGHT.  Although her experience of the screen is inaccurate, and her perceptual system is unreliable, she enjoys an entitlement to her belief because she is equally as compelled to form the belief as you are.

Entitlements are defeated when the formation of a belief is avoidable, that is, when there are things the subject could have done, or more precisely, should have done that would have prevented her from being caused to believe.  Suppose, for example, that you have independent evidence from a credible source that you are now looking at a well-crafted poster of a computer screen instead of a real one.  Suppose further that you disregard this evidence, so that when you have the experience of a bright screen you are caused to believe BRIGHT.  In this sort of case you lack an entitlement to this belief, because had you been properly responsive to the evidence in your possession, your experience would have failed to cause you to believe BRIGHT.  One might interject here that this suggestion runs contrary to Dretske’s original proposal, because even though you have ignored the countervailing evidence, you still cannot avoid being compelled to believe BRIGHT when you had the experience.  In response, Dretske maintains that entitlements should be reserved only for those beliefs that an epistemically responsible agent could not avoid being caused to form in similar circumstances.  Because of this, given that you have this evidence, you epistemically ought to refrain from believing BRIGHT.  Analogously, when a drunk driver accidentally runs over a child in the street, it may be that the driver is, at that time, in a situation where hitting the child was unavoidable.  We do not say, though, that the driver therefore had the right to run over the child.  The reason the driver is culpable for the accident is that were she more responsible, she would have taken measures to ensure that she did not get into the situation in the first place (for example, by taking a taxi home rather than driving).  Similarly, even if you cannot avoid being caused by your experience to believe BRIGHT, your entitlement to that belief is stripped because a responsible agent in that same circumstance, recognizing that she has evidence that she is looking at a computer-screen poster, would have avoided being so caused.

We will now consider two objections to Dretske’s view.  First, Michael Williams (2000) rejects Dretske’s contention that epistemic rights can exist in the complete absence of any epistemic duties.  Given that Dretske takes entitlements to be defeasible rights, it is plausible that were one’s belief challenged by another person, that would cancel the entitlement one had to that belief.  The only way to re-establish the entitlement, it seems, would be by providing reasons that defeat the challenge.  If this is right, it suggests that Dretske overstates his position that entitlements are completely free of justificational obligations.  While it may be correct that we need not actively provide positive evidential backing for every belief to which we are entitled, having an entitlement requires that one at least be able to justify or defend one’s belief in situations when one is presented with an appropriate challenge.  As Williams notes, defeasibility and justificational commitments go together.  The result, presumably, is that most young children and unreflective adults would lack entitlements to many of their perceptual and memorial beliefs, for they may lack the cognitive resources needed to be able to defend their own beliefs. However, this is a result some philosophers are willing to accept.

As for the second objection, many philosophers think that insofar as entitlements are epistemic goods that contribute to knowledge, they must be suitably connected to truth; they must be truth-conducive, or have a sufficiently high likelihood of being true.  But this suggestion appears to be at odds with Dretske’s claim that your brain-in-a-vat mental twin, whose beliefs are almost all false, is entitled to the same beliefs that you are.  One way epistemologists sometimes react to the New Evil Demon Problem is to draw a distinction between two kinds of epistemic standings: being epistemically blameless for holding a belief, on the one hand, and being fully entitled (or justified or warranted) in holding a belief, such that having this property makes an essential contribution to knowledge, on the other (see Pryor 2001).  According to this distinction, whereas you are entitled to the belief (and also know) that BRIGHT, your envatted twin is merely blameless when she forms the same belief.  It is through no fault of her own that she arrives at a false, unreliably-produced belief; nonetheless, she lacks an epistemic right which you possess.  She is simply unlucky that her experiences and beliefs are not connected to the external world in the proper way.

In response to this charge, one could claim that the envatted subject’s beliefs are conducive to truth, but only relative to more favorable environments (see Goldman 1986 and Henderson & Horgan 2006).  Given that the brain in a vat is a responsible agent, her beliefs would be mostly true if she were in an environment similar to the actual one.  Alternatively, one could deny that a distinction between being blameless and being entitled (or justified or warranted) exists at all.  The suggestion would be that to be justly entitled is for you (or an epistemically responsible agent) to be blameless in holding the belief (for defenders of this move see Ginet 1975, Chisholm 1977, and Bonjour 1985; for objections see Alston 1989a).  Whether either of these options have promise, however, is something that Dretske would need to argue for.

2. Burge on Entitlement

Tyler Burge has argued that “[w]e are entitled to rely, other things equal, on perception, memory, deductive and inductive reasoning, and on…the word of others” (1993, p. 458).  Because his work on perceptual entitlement is arguably the most developed, we shall focus our attention on perceptual beliefs: beliefs non-inferentially based on a perceptual experience.

Burge begins by introducing the concept of warrant, which he says is the most fundamental type of epistemic good.  What makes it specifically an epistemic good is that a belief’s being warranted depends first and foremost on its being a good route to truth.  Yet, it also depends on the subject’s own limitations: what information the subject has; what information is available; and how well that information is used.  Thus, when he says warrant must be a good route to truth, he does not mean that warranted beliefs necessarily are true; one can be warranted in a belief which happens to be false.  Nonetheless, the warranted subject must, at a minimum, be well-positioned to achieve truth.  Being warranted entails being reliable, that is, the productive warranted belief has a sufficiently high likelihood of being true (at least under normal circumstances).

Warrant is divided into two sub-species.  The first is justification.  Justification is epistemically internalist in the sense that it is warrant by reason that is conceptually accessible upon reflection to the subject.  Beliefs we are justified in holding are those that result from having engaged in reasoning or from having drawn an inference from other beliefs (Consider again the example from the previous section in which Johnson infers that Smith is guilty on the basis of her reasons that Smith’s fingerprints were on the murder weapon and he had motive to kill).  But when we turn to the epistemology of perceptual belief, Burge thinks the transition from a perceptual state (for example, seeing a round and red apple on the table) to a belief (for example, APPLE: There is round and red apple there) is nothing at all like drawing an inference.  In most typical situations, we do not reason our way from experience to belief; hence the warrant for our perceptual beliefs must not come in the form of justification.  There are two primary reasons why Burge thinks this is so.  First, reasoning is a sophisticated, intellectual mental exercise.  The capacities necessary for reasoning are too complex for us to plausibly attribute them to certain groups of individuals, like children and higher non-human animals.  If justification were the only form of warrant that there is, these individuals’ perceptual beliefs would lack warrant; yet it is commonly thought that these individuals are no less warranted in their perceptual beliefs than mature adults are.  The second reason why it’s wrong to think we reason from experience to belief is that reasoning essentially involves progressing from one or more propositional attitudes one has, like a belief, to another as one would draw a conclusion from the premises of an argument.  But Burge argues that perceptual states are not propositional attitudes.  In other words, perception has nonpropositional content (see also Evans 1982, Peacock 2001, Heck 2007).  Whereas a belief’s content (or what that belief concerns or is about) has predicational (or sentence-like) structure, perceptual content does not.  The content of a perceptual state is more akin to a topographical map rather than a sentence. (For more on nonpropositional perceptual content, see Peacocke 1992.) Therefore, unlike beliefs, experiences are not the kinds of mental states from which one can draw inferences.  Nonetheless, what is common among beliefs and perceptual states, and unlike other kinds of mental states like wishes and imaginings, is that they are both capable of being veridical: they have accuracy conditions that can be either satisfied or unsatisfied.

Given that perceptual beliefs are not justified in the sense just described, the type of warrant they receive must come under a different form.  This brings us to the second sub-species of warrant: entitlement.  Burgean entitlement is epistemically externalist in the sense that it is warrant that need not be fully conceptually accessible to the subject.  Like Dretskean entitlement, one can have a Burgean entitlement to a perceptual belief, such as APPLE, without knowing or justifiably believing that one does.  Also similar to Dretske’s view is the claim that entitlement is a defeasible kind of epistemic good: it can be defeated by independent reasons one may have for doubting that one’s perceptual faculties are properly functioning or that the believed proposition in question is true.  Thus, absent reasons for doubt, one has an entitlement to conceptualize one’s perceptual experience in the form of a belief (for example, to believe APPLE when one has an experience of a round, red apple).

As one clarification, Burge’s claim is not that entitlement is the only kind of warrant that can attach to perceptual beliefs.  People certainly can, and do, reason and deliberate about these sorts of beliefs.  One can think, for example: “There appears to be a round and red apple, and when things appear this way they generally turn out to be accurate.  Therefore, there is a round and red apple there.”  This is a perfectly permissible way to reason, and doing so can result in acquiring a justification in addition to the perceptual entitlement one already possesses.  The crucial point is that securing a warrant for this belief does not require that one reasons, or be able to reason, in this way.  To impose such a requirement on all forms of warrant would result, as Burge says, in hyper-intellectualizing perceptual belief.

The key question Burge wants to answer is: What is the contribution of perceptual states per se to entitlements to perceptual beliefs?  He thinks that two conditions must be met in order for a perceptual state to award the subject the epistemic right to believe as that state represents the world as being.  To understand what these conditions are, it is important first to appreciate Burge’s claim that the perceptual system has a characteristic function.  Some philosophers have argued that the function of any natural system can only be understood in terms of how it contributes to meeting the basic biological needs of the organism (see Millikan 1984).  For example, on this view the perceptual system functions well only insofar as it enables the perceiver to identify mates, avoid predators, find food, and so on.  Burge, however, distances himself from this sort of explanation.  He holds instead that it is known on a priori grounds that the perceptual system is a representational system that has the function to represent the subject’s environment veridically and reliably (2003, p. 508).  Although he agrees that reliable perception can, and oftentimes does, contribute to satisfying one’s biological needs (for example, being able to perceptually discriminate different colors can help one spot red berries to eat in a green forest), it is a mistake for philosophers to attempt to reduce the perceptual system’s function or explain it solely in terms of biological fitness.  The sense in which the perceptual system “has done a good job” in a given situation should be evaluated in terms of its capacity for producing veridical perceptual states.

Provided that this is the perceptual system’s function, there is the further question of how the system is capable of producing states that accurately represent the environment.  Burge’s answer centers on the thesis of perceptual anti-individualism (sometimes called perceptual content externalism).  According to this thesis, the nature of a perceptual state (what that state represents) is partially determined by a history of causal interactions that have occurred between objects and properties in the environment, on the one hand, and the perceiver, or the perceiver’s evolutionary ancestors, on the other (see Burge 2003; 2005, Section I; 2007, Introduction; 2007a).  What explains the fact that you are capable of (accurately or inaccurately) perceptually representing, say, shapes like roundness and colors like redness in the environment is that you, within your own lifetime, or your ancestors throughout the evolution of the species, regularly came into contact with various features in the environment, such as instances of these very shapes and colors.  According to the view, these causal interactions help to establish the principles that govern the formation of the perceptual system’s states and inform the perceptual system of what it ought to represent when it is presented, in a given situation, with some array of stimuli (for example, light waves impacting the retina in the eyes or sound waves impacting the ear drums).

Thus one of the conditions that must be met for a perceptual state to confer an entitlement to a corresponding perceptual belief is that the state is anti-individualistically individuated.  The other condition is as follows: one has an entitlement to believe as one’s perceptual state represents the environment as being, only if that type of perceptual state is reliably veridical in the subject’s normal environment.  Burge defines the normal environment as the one in which the contents of the subject’s perceptual states are established.  What this means is that for those perceptual states that do deliver entitlements, the existence of those types of perceptual states (for example, a state indicating roundness or redness) is explained by a history of causal interactions that were highly successful.  That is, one has the general ability to perceptually represent property F because the perceptual system, in its evolutionary development, regularly confronted instances of F-ness in the environment.  Thus, the normal environment might be where one currently lives, but it need not be.  It is possible that the environment a subject currently lives in is different from the one in which her perceptual system evolved, where she, or her ancestors, first acquired the abilities to represent various objects and properties.

The reason why this second condition is important is that entitlement, insofar as it is a type of epistemic warrant, must be a good route to truth.  Provided that perceptual states are established by high degrees of successful interactions with features in the environment, and hence are reliably veridical in the normal environment, “[v]eridicality enters into the very nature of perceptual states and abilities.” (Burge 2003, p. 532) To rely on a particular perceptual state the general type of which was established in this fashion in the formation of a belief makes it likely that that belief is true in the normal environment.

Burge goes on to argue that when these two conditions are met (that is, the perceptual state is anti-individualistically individuated and it is reliably veridical in the normal environment) one has a defeasible entitlement to rely on that state in any environment.  This further point provides a novel solution to the New Evil Demon Problem, which was discussed in the previous section.  Recall that this problem draws on a commonly held intuition that subjects in skeptical situations can hold warranted beliefs.  Suppose you are a regular embodied person, and an apple on the table causes you to have a perceptual experience of a round and red apple, which subsequently causes you to believe APPLE.  At the very next instant you are unknowingly transported to a different world where your brain is placed in a vat, and a computer produces the same experiences you were just having in your regular embodied state.  You have the exact same experience of a round and red apple, except that now the experience is inaccurate (there are no apples nearby, just other brains and computer equipment).  Not only is your belief false, it was produced by an unreliable process relative to this new environment.  Nevertheless, you still possess an entitlement to this belief, because the perceptual state that produced it (that is, your experience of the round and red apple) is reliably veridical, relative to the normal environment.  Because it was interactions with genuine apples (in the actual world) and instances of roundness and redness that helped to establish this type of perceptual state, you retain the epistemic right to believe as your experience represents, even though it is currently not at all indicative of what is going on in this new hostile environment.

We will now consider two criticisms of Burgean entitlement.  As for the first criticism, Burge states explicitly that his view of entitlement is not meant to speak to the problem of skepticism.  His objective is not to prove that we have perceptual entitlements, but rather to explain on the assumption that we do have them what they are.  But some epistemologists might urge that skepticism needs to be addressed in order to get his view off the ground.  Skeptical issues have typically been raised in connection with epistemic closure, which states roughly:

(Closure)    If subject S is warranted in believing that p, and S knows that p logically entails q, and S deduces q on the basis of p, then S is warranted in believing that q.

Suppose I have a warrant in the form of a Burgean perceptual entitlement to believe HAND (that I have hands).  HAND logically entails ~BIV (that I am not a handless brain in a vat).  According to Closure, I would be warranted in believing ~BIV were I to deduce it from HAND.  But many think this procedure makes it much too easy to secure warrant for beliefs like ~BIV.  It is unclear what Burge’s position on this matter is, but it is something he may need to speak to if his view is to be tenable.  Note finally that Dretske is not faced with the same problem, because he rejects closure altogether (Dretske 2005).

A different sort of criticism has to do with whether there is any real conceptual difference between how Burge characterizes entitlement and how other epistemologists have characterized justification.  Albert Casullo (2009) argues that there is not.  As we have seen, the chief distinction between justification and entitlement, according to Burge, is:

(ACC) Justification is warrant that is conceptually accessible to the subject, and entitlement is warrant that need not be conceptually accessible to the subject.

It is not immediately clear, says Casullo, what is meant by the term ‘access’ or ‘accessible’, and he suggests three possible interpretations:

A1) Subject S has access to the warrant for S’s belief that p if S has access to the epistemic ground that warrants S’s belief, that is, to the content of S’s warranting experience or belief.

A2) S has access to the warrant for S’s belief that p if S has access to the adequacy of that ground, that is, to the justification for the belief that S’s ground is adequate to warrant the belief that p.

A3) S has access to the warrant for S’s belief that p if S has access to the epistemic principle governing the ground of S’s belief that p, that is, to the conditions under which the ground warrants the belief that p and the conditions under which that warrant is undermined.

Bearing these separate interpretations in mind, Casullo provides textual evidence from Burge’s works that ACC should be read as claiming that justification satisfies A1 and A3 but not A2, whereas entitlement satisfies A1 but not A2 or A3.  But if this is the sense in which entitlements need not be accessible to the subject, this is just the view William Alston (1989b) had already proposed, which he calls an internalist externalism account of justification.  Thus, Casullo contends that any differences that exist between Burgean entitlement and Alstonian justification are terminological at best.

3. Wright on Entitlement

Crispin Wright’s approach to entitlement is motivated primarily by an attempt to ward off skeptical paradoxes.  In his (2004) paper, “Warrant for Nothing (and Foundations for Free)?” he introduces the notion of a “cornerstone proposition” for a given region of thought, according to which “it would follow from a lack of warrant for it that one could not rationally claim warrant for any belief in that region” (p. 167-68).  The skeptical project involves two steps: (1) make a case that a certain proposition, which we characteristically accept, is a cornerstone for a much wider class of belief, and (2) argue that we lack warrant for that proposition.  Wright then identifies two general patterns of skeptical reasoning that deploy these two steps.  The first comes from the Cartesian skeptic, who argues that the proposition that we are not cognitively detached from reality (instances of which come in the more common skeptical-scenario varieties like ‘I am not dreaming’ or ‘I am not a brain in a vat’, and so forth) is a cornerstone proposition for our ordinary perceptual beliefs. As for this latter claim, the skeptic reasons that for any procedure one might utilize to try to determine, say, that one is not dreaming (for example, pinching oneself or trying to read the lines of a newspaper), one must already have an independent warrant for thinking that the procedure was executed properly, and moreover that one did not simply dream carrying it out.  But since utilizing any such procedure would require that one already have warrant for believing that one is not dreaming, there is no procedure one could effectively utilize to gain a warrant for this proposition.  Thus, our perceptual beliefs are unwarranted.

The second kind of skeptic, the Humean skeptic, reveals an implicit circularity embedded in our practices of inductive reasoning, where the observance of facts about a sample population causes us to make a claim about all of the population’s members (for example, the inference from ‘All observed Fs are G’ to ‘All Fs are G’).  The inductive inference relies on one’s being warranted in accepting a cornerstone proposition, which Wright calls the Uniformity Thesis: that nature abounds in regularities.  However, there is no way to gain warrant for the Uniformity Thesis, because this could only be accomplished by relying on induction itself (For an overview of the Humean Problem of Induction, see Stroud 1977).

According to Wright, the skeptical problem can be generalized in the following way:  Let P be a proposition representing some feature of observable reality (for example, ‘I have hands’), where the best and as the skeptic will argue only evidence in favor of P is rooted in perceptual experience.  To arrive at an anti-skeptical conclusion, one will typically deploy what Wright calls the I-II-III argument:

I: My current experience is in all respects as if P

II: P

III: There is a material world

The same argumentative structure can be utilized to refute skepticisms of a variety of subject matters.  For instance, where I stands for ‘X’s behavior and physical condition are in all respects as if X is in mental state M’, III stands for ‘There are minds besides my own’.  Or where I stands for ‘I seem to remember it being the case that P yesterday’, III stands for ‘The world did not come into being today replete with apparent traces of a more extended history’.  Although the I-II-III argument may appear to warrant the conclusion, Wright contests that it engenders a failure of warrant transmission, a notion he has developed (2003).  According to Wright, it is oftentimes the case that a body of evidence e is an information-dependent warrant for a proposition P, where e is an information-dependent warrant for P if whether e is correctly regarded as warranting P depends on what one has by way of collateral information C.  In cases where C is entailed by P, the warrant for P (which is sourced in e) would fail to transmit to C, were one to infer C from P.  Setting the skeptical problem aside for the moment, consider one of Wright’s own examples.  Suppose that e is your experience of seeing a ball kicked into a net, from which you infer P, the proposition that a goal has just been scored.  Let C be the proposition that a soccer game is in progress.  Now, although C is a logical consequence of P, were you to start with e and infer P and subsequently deduce C, this chain of reasoning could not provide you with any new warrant for believing C, because e’s warranting you in believing P presupposes that you already have a warrant for believing C.  That is, the warrant for P fails to transmit to C because P cannot be warranted (by e) unless C is something for which you already have warrant.

Returning to the anti-skeptical I-II-III argument, Wright claims that III (‘There is an external world’) is a cornerstone proposition for a range of observation-sourced propositions which may be expressed by II (for example, ‘I have hands’).  The point is that whether I successfully warrants II depends on one’s already having warrant for III.  But the warrant for III cannot come by way of the I-II-III argument.  One crucial difference between this situation and the soccer example is that whereas there are means by which you can secure independent warrant for the proposition that a soccer game is in progress (you can, for example, look up the start time of the game in the local newspaper), there appears to be no way to acquire warrant, independent of one’s experiential evidence, for the proposition that there is an external world.  The skeptic will therefore conclude that type-III propositions are unwarranted, and as they are cornerstones so are the associated type-II propositions.

Wright’s aim is not to defend but rather to alleviate skeptical worries.  His strategy is to argue that we do possess warrant for type-III cornerstone propositions, but that warrant does not come in the form of evidential justification, as the skeptic assumes it must.  He says:

Suppose there is a type of rational warrant which one does not have to do any specific evidential work to earn: better, a type of rational warrant whose possession does not require the existence of evidence in the broadest sense encompassing both a priori and empirical considerations for the truth of the warranted proposition.  Call it entitlement.  If I am entitled to accept P, then my doing so is beyond rational reproach even though I can point to no cognitive accomplishment in my life, whether empirical or a priori, inferential or non-inferential, whose upshot could reasonably be contended to be that I had come to know that P, or had succeeded in getting evidence justifying P (2004, p. 174-75).

According to Wright, we have entitlements to accept various cornerstone propositions, such as: ‘There is an external world’; ‘There are other minds’; ‘The world has an ancient past’; and ‘There are regularities present in nature’.  Entitlement is a kind of rational warrant that, unlike justification, does not depend on one’s having evidence one could point to that would speak to the truth of the proposition in question.  Whereas Detective Johnson’s believing that Smith is guilty of the murder is underwritten by her own cognitive accomplishment, by way of basing that belief on recognizable empirical evidence (see Section I above), and is therefore justified, entitlements are secured in a different way.  Similar to the views of both Dretske and Burge, having a Wrightean entitlement to accept a proposition does not require that one be able to think about what it is that gives one the entitlement (Wright 2007, Fn 6).  If Wright’s proposal is tenable, we find a way to circumvent the skeptical problem: we have unearned warrants (in the form of entitlements) to type-III propositions that are acquired independently of the I-II-III argument. Hence the skeptic has no grounds for accusing the anti-skeptic of viciously circular reasoning.

It is helpful to briefly compare and contrast Wrightean entitlement with the Burgean view discussed in the previous section.  One interesting similarity has to do with the connection between entitlement and justification as they relate to the structure of our warranted beliefs.  For both Burge and Wright, entitlements reside at the very foundation of the epistemic architecture of our beliefs.  All of our justified beliefs are ultimately based upon, or at least presuppose, entitlements.  For this reason, defenders of the entitlement-justification distinction are naturally affiliated with foundationalism, the view that there exist warranted beliefs which are not themselves warranted, or justified, by any further beliefs to which one could appeal.  In contrast, those who deny that such a distinction exists naturally steer toward coherentism or holism (see Sellars 1963, Davidson 1986, Cohen 2002).  However, Burge and Wright differ with respect to the propositions they claim reside at the foundation.  As we saw in the previous section, Burgean entitlements attach to beliefs concerning ordinary objects and properties in the world, which are non-inferentially sourced in perception, testimony, and memory.  The beliefs we rationally infer on the basis of them are justified.  In contrast, Wrightean entitlements attach to cornerstone propositions.  Although we do not typically infer further propositions on the basis of them, cornerstones provide the vital preconditions needed for our experiences or memory states to evidentially justify other (type-II) propositions.

Related to this last point, Burge can be seen as taking a liberal stance toward perceptual warrant, in that one’s experience alone provides one with immediate, non-inferential warrant for one’s perceptual beliefs.  In contrast, Wright adopts epistemic conservatism, according to which an experience provides warrant for a perceptual belief only on the condition that one has a prior warrant to accept the relevant type-III propositions (The terms ‘liberal’ and ‘conservative’ in epistemology are due to Pryor 2004.  See also Wright 2007).

One question that Wright raises is: What do our entitlements give us the right to do? Whereas for both Dretske and Burge our entitlements are rights to believe propositions, Wright has a different answer.  He suggests that part of what it is to hold a belief attitude toward a proposition is for that attitude to be controlled by reasoning and evidence.  Since entitlements are characteristically unaccompanied by evidence, he denies that they attach to belief.  Rather, where P is a proposition to which we have an entitlement, we are entitled to accept P.  Acceptance is a general kind of attitude of which belief is only one mode.  But, if belief is not the appropriate mode of acceptance for entitlement, what is it?  One suggestion Wright considers is ‘acting on the assumption that P’, but he immediately dismisses this because one can surely act on the assumption that a proposition is true while at the same time remaining agnostic or even skeptical of its truth.  Because entitlements provide essential preconditions for our having any warrant at all, whatever kind of attitude we have entitlements to must be something wherein one’s coming to doubt its truth would commit one to doubting the competence of the particular project in question.  Wright in the end settles on our having an entitlement to rationally trust P, where trust is a kind of acceptance weaker than belief (because it is not controlled by evidence), but stronger than acting on the assumption (because one is non-evidentially committed to the truth of P).

Wright’s position on what we have entitlements to raises a serious question regarding epistemic closure, according to which it is a necessary condition on being justified in believing P that one is also justified in believing those propositions one knows to be entailed by P.  Thus, given that my experience as of hands justifies me in believing HAND (the type-II proposition that I have hands), and I know that this entails ~BIV (the type-III proposition that I am not a handless brain in a vat) I must also be justified, according to closure, in believing ~BIV.  Yet, although Wright maintains that we have non-evidential entitlements to rationally trust type-III propositions, he denies that we are justified in believing them.  His proposal, therefore, appears to be at odds with closure.

In response to this alleged problem, Wright advises that we should qualify standard closure principles in such a way that we need not accept that evidentially justified belief is closed under known entailment.  Rather, if we let warrant range disjunctively over both entitlement and justification, we can allow that warranted acceptance (which ranges over belief; taking for granted; rational trust; acting on the assumption; and so forth) is so closed.  Thus, given that I am warranted in accepting HAND (in the form of a justified belief), and also that I know HAND entails ~BIV, I am warranted in accepting ~BIV (in the form of an entitled trust).  The upshot is that the warrant for a type-III proposition does not come by way inferring it from the related type-II proposition (for the type-II fails to transmit its warrant to the type-III), but this is nonetheless consistent with Wright’s qualified closure principle.

We will now consider two possible objections to Wright’s view.  As for the first objection, many will agree that:

(*) If it were antecedently reasonable to reject a type-III proposition, the warrant for the relevant type-II propositions would be removed.

The skeptic takes this thought one step further and concludes that:

(**) Type-II propositions can be warranted only if it is antecedently reasonable to accept type-III propositions.

Wright endorses (**), prompting him to characterize entitlements as positive warrants to trust.  However, Martin Davies (2004) has argued that by accepting (**), Wright concedes too much to the skeptic.  Instead, Davies claims that the inference from (*) to (**) is invalid: from the fact that my having reasonable doubt that ~BIV would defeat my warrant for believing HAND, it does not follow that I must have warrant for ~BIV in order to have warrant for HAND.  An alternative strategy would be to agree that doubt of ~BIV would defeat the warrant for HAND, while at the same time denying that one needs warranted trust in ~BIV in order for my experience as of hands to warrant HAND.  Davies thinks both of these claims can be met by attributing a different kind of entitlement to type-III propositions like ~BIV.  His suggestion is that we have entitlements not to doubt, not to call in question, or not to bother about, type-III propositions (2004, p. 226).  Thus, whereas a Wrightean entitlement is an entitlement to adopt a positive attitude of trust, Davies holds that it is an entitlement to adopt a negative attitude of not doubting; one would not need any positive warrant, earned or unearned, for type-III propositions to have warrant for type-II propositions.  One could take this approach one step even further and claim that type-II propositions do, contrary to what Wright says, transmit their warrants to type-III propositions.  In other words, the I-II-III argument is epistemically victorious: where I have no reason to doubt that I am a brain in a vat, my experience as of hands warrants me in believing HAND, which thereby warrants me in believing ~BIV when I draw the inference.  Whether or not this is a plausible anti-skeptical outcome is highly controversial (For a defense, see Pryor 2000 and 2004; for a rejection, see Cohen 2002).

A second objection to Wright’s view comes from Carrie Jenkins (2007), who has argued that even if we grant Wright that it is rational for us to accept, without evidence, cornerstone propositions, this cannot be an epistemic kind of rationality.  For any proposition P, one can be epistemically rational in accepting P only if one’s acceptance is brought about in a way that addresses the specific question of whether P is true.  Thus, in order for it to be epistemically rational of you to accept the proposition that China is the most populous country, your coming into this state of acceptance must be due, in part at least, to the fact that you take it to be true that China is the most populous country.  However, where P is a cornerstone proposition, like “My sensory apparatus is properly connected to the external world,” one’s acceptance of P is divorced from any considerations on the truth or falsity of P.  Instead, on the Wrightean view, accepting P is rational, roughly, because its acceptance is necessary in order to undertake the indispensible cognitive project of forming beliefs about the world based on perception.  This is a different kind of rationality, grounded not in terms of whether P is true, but in what fortunate consequences would result from one’s accepting P.  This is similar to the sense in which it would be rational (perhaps practically rational) for a runner in a race to believe she is not tired, since the act of forming this belief could actually prevent her from slowing down in the race.  It is doubtful that we could have epistemic entitlements to cornerstone propositions without it being epistemically rational to accept them.  This creates an obstacle for Wright given that his ultimate aim is to resolve the problem of skepticism in epistemology.

4. Peacocke on Entitlement

In his book, The Realm of Reason, Christopher Peacocke argues that we have entitlements to our perceptual beliefs, a priori beliefs, beliefs based on inductive inference, beliefs about our own actions, and moral beliefs.  Our entitlements to the latter four classes of belief are explained as extensions and consequences of the explanatory structure of our entitlement to perceptual beliefs.  Accordingly, this section will focus on Peacocke’s account of perceptual entitlement.

Peacocke does not distinguish entitlement from justification in the way that Dretske, Burge, and Wright do.  His notion of ‘entitlement’ is intended to range over both inferential and non-inferential transitions between mental states.  For Peacocke, entitlement plays a more central role in epistemology, for he claims that a judgment or belief is knowledge only if it is reached by a transition to which the subject is entitled.  Indeed, at times he seems to use the terms ‘entitlement’ and ‘justification’ interchangeably.  So, whereas for Dretske, Burge, and Wright, entitlements and justifications are distinct epistemic goods that make separable contributions to knowledge, Peacocke seems only to be operating with one epistemic concept, which he frequently refers to as entitlement.  As for another difference, we saw that entitlements for Burge and Dretske attach to beliefs, while Wrightean entitlements attach to attitudes of rational trust.  For Peacocke, we have entitlements not only to beliefs or judgments, but also to the transitions that link beliefs to antecedent mental states.  So, part of what it takes to have a perceptual entitlement to the belief, for example, that there is something round ahead, is that one is also entitled to the transition that brings one from having an experience of something round to the formation of that belief.  Nonetheless, a crucial element of Peacockean entitlement common to those of the other three authors discussed in this article is his position that “[a] thinker may be entitled to make a judgment without having the capacity to think about the states which entitle him to make the judgment” (2004, p. 7).

In order to understand Peacocke’s account of perceptual entitlement, it is important first to appreciate two conditions he thinks any transition must meet in order to award a subject an entitlement.  The first condition is conveyed in what he calls the Special Truth Conduciveness Thesis:

A fundamental and irreducible part of what makes a transition one to which one is entitled is that the transition tends to lead to true judgments…in a distinctive way characteristic of rational transitions (2004, p. 11).

Here we see that Peacocke agrees with Burge that entitlement (or for Burge, all forms of warrant) must be a good route to truth.  A necessary condition on entitlement for both Peacocke and Burge is that whichever state produces the entitled judgment or belief is a reliable indicator of the way things are in the world.  Note that this condition is not necessary for Dretskean entitlement, for it could be that a fully responsible agent cannot avoid believing there to be a bright screen ahead of her while her perceptual faculties are in fact highly unreliable (as would be the case were the agent a brain in a vat).  But the principle also implies that not just any transition leading to true judgments is sufficient for an entitlement.  In addition, the transition must be one that is characteristic of rational transitions.  Some processes encompass transitions between states which are highly reliable, yet these transitions are not what we would consider rational.  For example, suppose that due to some psychological defect, whenever I see a cardinal fly above me, this visual experience causes me to judge that the St. Louis Cardinals baseball team won today.  By sheer coincidence the only time that I see cardinals flying are on days when the Cardinals happen to win.  This transition between the state of seeing a cardinal and the state of judging that the Cardinals won tends to lead to true judgments, yet it is clear that it is quite irrational of me to form these judgments (for a similar sort of example, see Bonjour 1985).  What, then, is it that distinguishes specifically rational transitions?  The answer is given by what Peacocke calls the Rationalist Dependence Thesis, which is the second condition necessary for securing an entitlement:

The rational truth-conduciveness of any given transition to which a thinker is entitled is to be philosophically explained in terms of the nature of the intentional contents and states involved in the transition (2004, p. 52).

According to this second condition, whether or not a transition is rational depends on the specific explanation for why that transition is reliable.  The feature that is characteristic of rational transitions, according to Peacocke, is that they are reliable because of the very nature of the mental states involved, the contents that they have and how it is that those states are capable of representing the world in a certain way.  If the reliability of some transition is explained by something other than the very identity of the states involved in the transition and the individuation of their contents (such as if it were some mere accident that a type of experience reliably led to true judgments, as in the cardinal example above), then that transition, though reliable, would not be rational and would therefore fail to yield any entitlement.

With the two above theses introduced, we see that transitions between states yield entitlements only if they are conducive to truth, and furthermore their being reliable can be philosophically explained in terms of the states involved in the transition.  With regard to perceptual entitlement in particular, the kinds of experiences Peacocke thinks are relevant to meeting this criterion are those which have instance-individuated contents, such as the experience expressed by “That thing looks round.”  What makes this experience’s content instance-individuated is that when one’s perceptual faculties are properly functioning, and the environment is normal, one’s having this experience with this content is “caused by the holding of the condition which is in fact the correctness condition for that content” (2004, p. 67).  In simpler terms, the correct perceptual representation of an object as round does not causally depend on any other mental states or relations aside from the subject’s being properly connected to that round object in the world.  In contrast, an experience expressed by “There looks to be a soldier over there” does not have instance-individuated content because one’s correctly representing a soldier causally depends not only on there being a soldier where one represents it as being, but also on the background empirical knowledge one needs to have of what it is to be a soldier, how soldiers typically dress, and so on.

A Peacockean perceptual entitlement can thus be described as follows: “A subject enjoying an experience with a content which is instance-individuated is entitled to judge that content (that is, accept that experience at face value), in the absence of reasons for doubting that she is perceiving properly.”  Note that in the presence of reasons for doubt, one’s entitlement is removed.  Thus, Peacockean entitlement, like that of Dretske, Burge and Wright, has defeasibility conditions built into it.

The central task for Peacocke is to offer a philosophical explanation of the entitlement relation between experiences with instance-individuated contents and their corresponding perceptual judgments, which will consist of an a priori argument that demonstrates that the transitions between these states are conducive to truth.  His strategy is to argue that the existence of perceptual experiences with instance-individuated contents is best explained by the fact that those experiences are produced by a device that evolved by natural selection to represent the world to the subject, and that the perceptual experiences produced in the subject are predominantly correct.  Moreover, judgments based on these experiences are likely to be true.  To defend this claim, he relies on what he calls the Complexity Reduction Principle:

Other things equal, good explanations of complex phenomena explain the more complex in terms of the less complex; they reduce complexity (2004, p. 83).

This principle describes a key feature of what makes an explanation a good one.  When we are presented with some complex phenomenon, the explanation we should endorse, out of all the alternatives, is the one that explains the phenomenon in the least complex terms.  By accepting this principle, we are accepting that “it is…more rational to hold, other things equal, that [things] have come about in an easy, rather than in a highly improbable, way” (2004, p. 83).  Consider one of Peacocke’s own cases stemming from natural science.  Every snowflake contains six identical patterns separated by sixty-degree segments around its center.  This is an example of a complex phenomenon, one that immediately strikes us as highly improbable.  The best explanation for why snowflakes exhibit this pattern will inform us, by reducing complexity, that the alleged improbability is merely apparent: oxygen molecules in frozen water are roughly spherical and they are arranged on a plane.  The frozen crystals grow in a way that minimizes energy, and the most energy-efficient way of packing spheres on a plane results in a hexagonal arrangement (2004, p. 75-6).  One possible alternative explanation could be that snowflakes are built on skeletons that exhibit six-fold symmetry.  However, the latter explanation is less preferable to the former, because the latter is no less complex than the phenomenon itself: we would simply shift the question to why it is that the skeleton, rather than the snowflake, exhibits this pattern.

Like the example of snowflakes, our having experiences with instance-individuated contents is a highly complex phenomenon.  That the perceptual system produces states that are in large part correct, and was naturally selected to accurately represent the world to the subject, explains the phenomenon with the least amount of complexity.  A subject’s being in a state of perceptually representing an object o as having property F would not appear improbable if an Fo in the environment caused the state, and the subject had a properly functioning system that had adapted over the millennia, by way of interacting with objects like o’s and properties like F-ness, to accurately represent features in the environment.  It is a virtue of this explanation that it does not posit any further representational states that are equally as complex as the perceptual states under discussion and are, in turn, in need of explanation.

Peacocke argues further that the proposed explanation succeeds at reducing complexity better than alternative skeptical hypotheses.  There are two general types of such hypotheses he considers.  The first is one in which the subject’s illusory perceptual states are produced by some intentional agent, such as the Cartesian Demon.  In this scenario, it is typically postulated that the deceiver either has experiences herself, in which case our perceptual states get explained in terms of another’s perceptual states, or she lacks experiences but has other intentional states with comparable complexity.  Either way, the hypothesis purporting to explain the phenomenon under discussion would stand in need of further explanation.  In the second type of skeptical hypothesis, our illusory experiences are not intentionally produced, but are instead generated randomly or coincidentally, such as your being a member of a randomly generated universe containing nothing but a population of deluded brains in vats (see Putnam 2000).  However, this sort of hypothesis involves highly complex initial conditions, such as the original set up describing how the vats are capable of producing conscious, intentional states.  Thus, it fails to reduce the complexity of the phenomenon of our having perceptual states.

To summarize, Peacocke justifies the claim that we are entitled to form judgments about the world on the basis of our experiences with instance-individuated contents, on the grounds that our having these experiences with these contents is best explained by the evolution of the perceptual system and its selection for accurate representation of the environment.  We will consider a number of criticisms of Peacocke’s view.  First, in attempting to give a philosophical explanation of the truth-conduciveness of the transition between experience and judgment, we have seen that Peacocke invokes the theory of natural selection.  But because his aim is to prove that our experiences are by and large correct, his argument cannot depend on any empirically justified premises.  However, natural selection is an empirical scientific theory that is justified in part by our experiences, so we should not expect that it could be used to answer the skeptic.  Peacocke insists though that his argument “does not have the truth of the whole empirical biological theory of evolution by natural selection as one of its premises” (2004, p. 98).  But if Peacocke’s argument is sound, and he is right that each premise is justified a priori, the implication, as Ralph Wedgwood (2007) notes, is that no scientific investigation was required whatsoever for Darwin to justify the claim that he evolved through natural selection—all he needed to do to learn that the theory of evolution is true was to engage in philosophical contemplation and to reflect on his own experiences.  But the suggestion that a theory such as this one could be justified a priori seems highly implausible.

Another challenge to Peacocke’s project is to question whether the natural selection-based explanation reduces complexity any more than the alternative explanations.  The skeptic will argue that the existence of living creatures and the evolution of psychological systems within some of its members is itself a complex phenomenon standing in need of further explanation.  In addition, the skeptic could reject the Complexity Reduction Principle altogether: that an explanation is simpler, or succeeds at reducing complexity, does not speak to whether it is the one we ought to accept (see Lycan 1988; for Peacocke’s response see 2004, p. 94-7).

Finally, even if we grant that the Complexity Reduction Principle is true and that the natural selection-based explanation demonstrates that judgments sourced in perception tend toward truth, one might still question whether it adequately demonstrates how transitions between experience and judgment are characteristically rational (that is, whether it appropriately conforms to the Rationalist Dependence Thesis).  In order for us to rationally rely on our (reliably accurate) experiences, is it the case that we must also know, or justifiably believe, that the natural selection-based explanation most reduces complexity?  Peacocke thinks the answer must be negative; otherwise, anyone unaware of the Complexity Reduction Principle (which, presumably, is most people) would lack entitlements to their perceptual beliefs, and moreover would lack perceptual knowledge.  Instead, he holds that it is sufficient that the transition from experience to judgment be rational from the subject’s own point of view, that is, that the subject appreciates that her experiential grounds for the transition to the judgment that P suffice for the truth of P (2004, p. 176).  However, Miguel Fernández (2006) notes a problem with this move.  When one comes to “appreciate one’s grounds,” this involves, at least on a natural interpretation of appreciate, that one forms an additional judgment, specifically, a second-order judgment of the form:

SOJ: This experience representing P provides adequate grounds for the truth of P.

Of course, if one’s judgment that SOJ is partially underwriting one’s entitlement to judge that P, then it must be that the transition leading one to judge SOJ is also rational from the subject’s own point of view.  To meet this requirement, however, one would need to then hold a third-order judgment of the form:

TOJ: My grounds for judging SOJ are adequate for the truth of SOJ.

The problem is that because TOJ would also need to be rational from the subject’s own point of view, we are forced into an implausible infinite regress of increasingly higher-order judgments, each used to rationalize the transition attached to the lower-ordered judgment.  Demanding that one form all of these judgments would seem to hyper-intellectualize perceptual entitlement, and it would go against Peacocke’s position, mentioned at the beginning of the section, that a subject can be entitled to a judgment without having the capacity to think about the states that entitle that judgment.  In order to avoid this undesirable outcome, Peacocke could attempt to elucidate the notion of “appreciating one’s grounds” in a way that does not suggest forming a higher-order judgment.  Or, if that option is not feasible, he could, in following Dretske and Burge, relinquish the requirement altogether that a transition be rational from the subject’s own point of view.

5. Conclusion

This article has explored the views of epistemic entitlement as presented by four prominent philosophers: Dretske, Burge, Wright, and Peacocke.  As we have seen, there are important similarities and differences among these views.  Burge and Peacocke both hold that reliability is necessary for enjoying perceptual entitlements, but also that the reliability must be explained in terms of the nature and individuation of the perceptual states involved.  For both Wright and Peacocke, their accounts of entitlement purport to provide anti-skeptical outcomes about knowledge, while Dretskean and Burgean entitlement is explained independently of the skeptical problem.  According to Dretske and Burge we have entitlements to hold various non-inferential beliefs.  In contrast, Wright holds that what we are entitled to do is to rationally trust certain corner stone propositions, like that there is an external world and there is a past.  Peacocke differs from the other three authors in that he claims that all rational judgments, as well as the transitions leading to those judgments, are underwritten by entitlements.  Despite these differences, one element common to each of the views we have discussed is the idea that entitlements are defeasible epistemic rights that are not grounded in conceptually accessible reasons or evidence.  One can have an entitlement even if one is unable to think about the states that provide the entitlement.  In this regard, distinguishing entitlement from justification has the advantage of attributing a kind of positive epistemic status to individuals who lack the critical reasoning skills needed to justify their own beliefs.  The inclusion of the concept of entitlement therefore steers away from hyper-intellectualizing the warrant for our basic beliefs about the world.

6. References and Further Reading

  • Alston W. (1989). Epistemic Justification: Essays in the Theory of Knowledge. Ithaca: Cornell University Press.
    • A collection of essays on justification and knowledge written by William Alston.
  • Alston, W. (1989a). “The Deontological Conception of Epistemic Justification.” in Alston (1989).
    • Provides a series of arguments against deontological views of justification which center around the concepts of duty, praiseworthy, requirement, blameless by relying heavily on the impossibility of our having control over our beliefs.
  • Alston, W. (1989b). “An Internalist Externalism.” in Alston (1989).
    • Attempts to reconcile the internalism/externalism debate in epistemology by advancing a view of justification that incorporates both internalist and externalist elements.
  • Bach, K. (1985). “A Rationale for Reliabilism.” The Monist, 68: 246-265.
    • Defends reliabilism against its more well-known objections by drawing a distinction between a justified belief and a subject’s being justified in holding a belief.
  • Bonjour, L. (1985). The Structure of Empirical Knowledge. Cambridge, MA: Harvard University Press.
    • Develops an anti-foundationalist, coherentist theory of knowledge.
  • Burge, T. (1993). “Content Preservation.” The Philosophical Review, 102 (4): 457-488.
    • Argues that we have non-inferential entitlements to rely on the testimony of others.
  • Burge T. (1996). “Our Entitlement to Self-knowledge.” Proceedings of the Aristotelian Society, 96 (1): 91-116.
    • Argues that we have entitlements to beliefs about our own mental states.
  • Burge, T. (2003). “Perceptual Entitlement.” Philosophy and Phenomenological Research, 67 (3): 503-548.
    • Provides a teleological account of perceptual entitlements as grounded in the anti-individualistic nature of perception.
  • Burge, T. (2005). “Disjunctivism and Perceptual Psychology.” Philosophical Topics, 33 (1): 1-78.
    • Argues against the view of perception called Disjunctivism on the grounds that it conflicts with established psychological theories of vision.
  • Burge T. (2007). Foundations of Mind. Oxford: Oxford University Press.
    • A collection of Burge’s papers in philosophy of mind on anti-individualism.
  • Burge, T. (2007a). “Cartesian Error and the Objectivity of Perception.” in Burge (2007).
    • Defends the claim that our perceptual states are anti-individualistically individuated and introduces his famous Crack-Shadow thought experiment.
  • Casullo, A. (2009). “What is Entitlement?” Acta Analytica, 22: 267-279.
    • Provides a detailed overview and critique of Burge’s proposed distinction between entitlement and justification.
  • Chisholm, R. (1977). Theory of Knowledge. (2nd edition) Englewood Cliffs, NJ: Prentice-Hall.
    • Defends a deontological conception of knowledge and justification.
  • Cohen, S. (2002). “Basic Knowledge and the Problem of Easy Knowledge.” Philosophy and Phenomenological Research, 65 (2): 309-329.
    • Denies the existence of basic knowledge (knowledge obtained from a source prior to one’s knowing that that source is reliable) through considerations involving closure and bootstrapping.
  • Comesaña, J. (2002). “The Diagonal and the Demon.” Philosophical Studies, 110: 249-266.
    • Advances a new version of reliabilism called Indexical Reliabilism, then argues it circumvents the New Evil Demon Problem.
  • Conee, E and Feldman, R. (1985). “Evidentialism.” Philosophical Studies, 48: 15-34.
    • Defends evidentialism in epistemology, the view that the justification of one’s belief is determined by the strength of one’s evidence for that belief.
  • Davidson, D. (1986). “A Coherence Theory of Truth and Knowledge.” in E. Lepore (ed.) Truth and Interpretation: Perspectives on the Philosophy of Donald Davidson. Blackwell Publishing.
    • Argues for coherentism about truth and knowledge.
  • Davies, M. (2004). “Epistemic Entitlement, Warrant Transmission and Easy Knowledge.” Proceedings of the Aristotelian Society, Supplementary Issue 78 (1): 213-245.
    • Critically discusses Crispin Wright’s views on entitlement, and advances the idea of a negative entitlement not to doubt cornerstone propositions.
  • Dretske, F. (2000). “Entitlement: Epistemic Rights without Epistemic Duties?” Philosophy and Phenomenological Research, 60 (3): 591-606.
    • Argues for the existence of epistemic entitlements as grounded in the immediacy and irresistibility of various beliefs and motivated by the New Evil Demon Problem.
  • Dretske, F. (2005). “The Case Against Closure.” in M. Steup and E. Sosa (eds.) Contemporary Debates in Epistemology. Malden, MA: Blackwell Publishing.
    • Rejects closure by first assuming skepticism is false, and secondly by arguing for the disjunction that either closure is false or skepticism is true.
  • Evans, G. (1982). The Varieties of Reference. Oxford: Oxford University Press.
    • Proposes a theory of demonstrative thought in this seminal work, and introduces the notion of nonconceptual content.
  • Fernandez. M. (2006). “Troubles with Peacocke’s Rationalism.” Critica, 38 (112): 81-103.
    • Critically discusses Christopher Peacocke’s (2004) book, The Realm of Reason.
  • Ginet, C. (1975). Knowledge, Perception, and Memory. Dordrecht: D. Reidel.
    • Argues for a strong version of internalism, according to which all of the facts that make a subject’s beliefs justified are directly recognizable by the subject upon reflection.
  • Goldman, A. (1986). Epistemology and Cognition. Cambridge, MA: Harvard University Press.
    • Makes a case for epistemology to draw upon the findings in cognitive science.
  • Goldman. A. (1988). “Strong and Weak Justification.” Philosophical Perspectives, 2: 51-69.
    • Advances a modified version of reliabilism that places a distinction between the concepts of strong justification and weak justification.
  • Heck, R. (2007). “Are There Different Kinds of Content?” in B. McLaughlin and J. Cohen (eds.), Contemporary Debates in Philosophy of Mind. Malden, MA: Blackwell Publishing.
    • Argues for the existence of nonconceptual content based on the implausibility of perceptual content, satisfying Gareth Evans’ generality constraint on thought.
  • Henderson, D. and Horgan, T. (2007). “Some Ins and Outs of Transglobal Reliabilism.” in S. Goldberg (ed.), Internalism and Externalism in Semantics and Epistemology. New York: Oxford University Press.
    • Advances a new version of reliabilism called transglobal reliabilism, according to which the reliability of a belief process is evaluated relative to the set of experientially possible global environments.
  • Janvid, M. (2009). “The Value of Lesser Goods: The Epistemic Value of Entitlement.”  Acta Analytica, 24: 263-274.
    • Compares the epistemic value of entitlement with that of justification, and criticizes both Burgean and Dretskean entitlement.
  • Jenkins, C. (2007). “Entitlement and Rationality.” Synthese, 157: 25-45.
    • Challenges Crispin Wright’s entitlement of cognitive project, arguing that such entitlement to accept a proposition cannot be construed as epistemically rational.
  • Lehrer, K and Cohen, S. (1983). “Justification, Truth, and Coherence.” Synthese, 55: 191-207.
    • Challenges the time-honored connection between justification and truth by introducing what is nowadays commonly referred to as the New Evil Demon Problem.
  • Lycan, W. (1988). Judgement and Justification. Cambridge, MA: Cambridge University Press.
    • Defends a version of epistemological explanationism as a theory of justification.
  • Majors, B. (2005). “The New Rationalism.” Philosophical Papers, 34 (2): 289-305.
    • A critical discussion of Christopher Peacocke’s (2004) book, The Realm of Reason.
  • Majors, B and Sawyer, S. (2005). “The Epistemological Argument for Content Externalism.” Philosophical Perspectives, 19: 257-280.
    • Argues that that the only epistemological view that can adequately account for the constitutive relation between justification and truth is one that involves a commitment to content externalism.
  • Millikan, R. (1984). Language, Thought and Other Biological Categories. Cambridge, MA: MIT press.
    • Develops a naturalistic account of language and thought, wherein the meaning associated with sentences and the intentionality associated with thoughts are to be explicated in terms of how sentences and thoughts properly function.
  • Peacocke, C. (1992). A Study of Concepts. Cambridge, MA: MIT Press.
    • Proposes a theory of concepts in which concepts are individuated by their possession conditions, and also introduces the notions of ‘scenario content’ and ‘protoproposition’ as they apply to perceptual content.
  • Peacocke, C. (2001). “Does Perception have a Nonconceptual Content?” Journal of Philosophy, 98: 609-615.
    • Defends the claim that perception has nonconceptual content, and wards off criticisms from John McDowell and Bill Brewer.
  • Peacocke, C. (2004). The Realm of Reason. Oxford: Oxford University Press.
    • Develops a rationalist picture of epistemic entitlement according to which we have entitlements to our perceptual beliefs, a priori beliefs, beliefs based on inductive inference, beliefs about our own actions, and moral beliefs.
  • Pryor, J. (2000). “The Skeptic and the Dogmatist.” Nous, 34: 517-49.
    • Defends dogmatism, a modest foundationalist view according to which an experience as of p’s being the case provides one with a prima facie justification to believe that p, without requiring that one have independent reason to believe one is not in a skeptical scenario.
  • Pryor, J. (2001). “Recent Highlights of Epistemology.” The British Journal for the Philosophy of Science, 52: 95-124.
    • Canvases a host of popular topics in contemporary epistemology, including contextualism, modest foundationalism, the internalism/externalism debate, and the ethics of belief.
  • Pryor, J. (2004). “What’s wrong with Moore’s Argument?” Philosophical Issues, 14: 349-378.
    • Defends the Moorean response to skepticism.
  • Putnam, H. (2000). “Brains in a Vat.” in S. Bernecker and F. Dretske (eds.), Knowledge: Readings in Contemporary Epistemology. Oxford University Press.
    • Appeals to externalist accounts of meaning to argue that the statement, “I am a brain in a vat,” is self-defeating, and so cannot be true.
  • Sellars, W. (1963). Science, Perception and Reality. London: Routledge & Kegan Paul.
    • Famously argues against the “Myth of the Given” and develops a broadly coherentist epistemology.
  • Stroud, B. (1977). Hume. New York: Routledge & Kegan Paul.
    • Offers a detailed overview of Hume’s philosophy, including the Problem of Induction.
  • Wedgwood, R. (2007). “Christopher Peacocke’s The Realm of Reason.” Philosophy and Phenomenological Research, 74 (3): 776-791.
    • Critically discusses Christopher Peacocke’s book, The Realm of Reason.
  • Williams, M. (2000). “Dretske on Epistemic Entitlement.” Philosophy and Phenomenological Research, 60 (3): 607-612.
    • Critically discusses Fred Dretske’s (2000) paper, “Entitlement: Epistemic Rights without Epistemic Duties?”
  • Wright, C. (2003). “Some Reflections on the Acquisition of Warrant by Inference” in S. Nuccetelli (ed.), New Essays on Semantic Externalism, Skepticism and Self-Knowledge. Cambridge, MA: MIT Press.
    • Explains the difference between a cogent argument and an argument in which the premises fails to transmit their warrants to the conclusion.
  • Wright, C. (2004). “Warrant for Nothing (and Foundations for Free)?” Aristotelian Society Supplementary Volume, 78 (1):167–212.
    • Develops the notion of unearned entitlements for cornerstone propositions, and explores the prospects and limitations on strategic entitlement, entitlement of cognitive project, entitlement of rational deliberation, and entitlement of substance.
  • Wright, C. (2007). “The Perils of Dogmatism.” in S. Nuccetelli and G. Seay (eds.), Themes from G. E. Moore: New Essays in Epistemology and Ethics. Oxford: Oxford University Press.
    • Presents a series of problems for dogmatist accounts of justification.

 

Author Information

Jon Altschul
Email: jlaltsch@loyno.edu
Loyola University-New Orleans
U. S. A.

The Theory-Theory of Concepts

The Theory-Theory of concepts is a view of how concepts are structured, acquired, and deployed. Concepts, as they will be understood here, are mental representations that are implicated in many of our higher thought processes, including various forms of reasoning and inference, categorization, planning and decision making, and constructing and testing explanations. The view states that concepts are organized within and around theories, that acquiring a concept involves learning such a theory, and that deploying a concept in a cognitive task involves theoretical reasoning, especially of a causal-explanatory sort.

The term “Theory-Theory” derives from Adam Morton (1980), who proposed that our everyday understanding of human psychology constitutes a kind of theory by which we try to predict and explain behavior in terms of its causation by beliefs, intentions, emotions, traits of character, and so on. The idea that psychological knowledge and understanding might be explained as theory possession also derives from Premack & Woodruff’s famous 1978 article, “Does the Chimpanzee Have a Theory of Mind?” A Theory-Theory in general is thus a proposal to explain a certain psychological capacity in terms of a tacit or explicit internally represented theory of a domain. The Theory-Theory of concepts, however, goes beyond the mere claim that we possess such theories, saying in addition that some or all of our concepts are constituted by their essential connections with these theories.

The origins of the Theory-Theory involve several converging lines of investigation. First, it arose as part of a general critique of the formerly dominant prototype theory of concepts; second, it was an empirically-motivated response to the shortcomings of the developmental theories of Piaget and Vygotsky; and third, it involved applying ideas from Kuhn’s philosophy of science to explain phenomena having to do with the development of cognition in individuals. While the theory has often been vaguely formulated, due in large part to the open-endedness inherent in the central notion of a theory, there are substantial bodies of empirical evidence that underlie the main tenets of the view. In particular, the Theory-Theory has been responsible for largely displacing the notion that cognitive development starts from a simple base of perceptual primitives grouped together by similarity. Rather, it is guided by domain-specific explanatory expectations at many stages, and these expectations can be seen to function in adult reasoning and categorization as well. While strong versions of the Theory-Theory have been subject to numerous objections, these contributions endure and continue to shape what many scholars claim are the best existing models of higher cognition.

Table of Contents

  1. Background
  2. The Theory-Theory
    1. Origins of the View
    2. Theories Defined
    3. Concepts in Theories Versus Concepts as Theories
  3. Support for the Theory-Theory
    1. Cognitive Development
    2. Adult Categorization, Inference, and Learning
  4. Relations to Other Views
    1. Relations to Essentialism
    2. Relations to Causal Modeling Approaches
  5. Objections to the Theory-Theory
    1. Holism
    2. Compositionality
    3. Scope
  6. Disanalogies Between Development and Science
  7. Conclusions
  8. References and Further Reading

1. Background

The Theory-Theory emerged in part as a reaction to existing trends in the psychology of concepts and categorization, which during the late 1970’s was dominated by the prototype theory of concepts. Exemplar models were also being developed during this time, but the prototype theory encapsulated many of the views which were the foils against which the Theory-Theory developed its main assumptions.

Prototype theory derives in large part from the work of Eleanor Rosch and her collaborators (Rosch, 1977; Rosch & Mervis, 1975; see Smith & Medin, 1981 for historical perspective and Hampton, 1995 for a canonical statement of the view). These theories assume that concepts represent statistical information about the categories that they pick out. The concept tree represents the properties that people take to be typical of trees: they have bark, they can grow to be relatively tall, they have green leaves that may change color, they have a certain silhouette, birds often nest in them, they grow potentially edible fruits, and so on. These comprise the tree prototype (or stereotype).

This stereotype is acquired by a process of abstraction from examples: individual trees are perceptually salient parts of the environment, and by repeated perception of such category instances, one gradually forms a summary representation that ‘averages’ the qualities of the trees one has observed. This summary is often represented as a list of features that belong to category members. Properties that are more frequently perceived in the instances will be assigned a greater feature weight in the prototype. This process of concept acquisition is often portrayed as a passive one.

Finally, novel objects are categorized as falling under a prototype concept in virtue of their similarity to the prototype—that is, by how many features they share with it (weighted by those features’ importance). Similarity computations also explain other phenomena, such as the fact that some objects are better examples of a category than others (flamingos and penguins are atypical birds since they lack most of the prototypical bird features).

The prototype theory has several characteristics which made it a fitting target for Theory theorists. First, it suggests that concepts have a basically superficial nature. Often, though not invariably, features in prototypes were assumed to be readily perceivable. Prototype theory was thus affiliated with a certain empiricist bent. This was reinforced by the fact that prototypes are acquired by a simple statistical-associative process akin to that assumed by classical empiricists. Second, prototype theory involved a relatively impoverished account of conceptual development and deployment. Concepts passively adjust themselves to new stimuli, and these stimuli activate stored concepts in virtue of their resemblances, but there is little role for active revision or reflective deployment of these concepts. In the wake of the anti-empiricist backlash that gave rise to contemporary cognitive science, particularly in cognitive-developmental psychology, these assumptions were ripe for questioning.

2. The Theory-Theory

a. Origins of the View

The Theory-Theory itself has a somewhat complicated origin story, with roots in a number of philosophical and psychological doctrines. One is the reaction against stage theories of cognitive development, particularly Piagetian and Vygotskian theories. Stage theories propose that children’s cognitive development follows a rigid and universal script, with a fixed order of transitions from one qualitatively distinct form of thought to another taking place across all domains on the same schedule. Each stage is characterized by a distinctive set of representations and processes. In Piaget’s theory, children move through sensorimotor, preoperational, concrete operational, and formal operational stages from birth to roughly 11 or 12 years old. Similarly, Vygotsky held that children move from a stage of representing categories in terms of sensory images of individual objects, through a stage of creating representations of objectively unified categories, and finally a stage of categories arranged around abstract, logical relationships.

While Piaget and Vygotsky’s stage theories differ, both hold that early childhood thought is characterized by representation of categories in terms of their perceivable properties and the inability to reason abstractly (causally or logically) about these categories. Early childhood cognition, in short, involves being perceptually bound. While the empirical basis and explanatory structure of these theories had been challenged before (see R. Gelman & Baillargeon, 1983 and Wellman & Gelman, 1988 for review), Theory theorists such as Carey (1985), Gopnik (1988, 1996), Gopnik & Meltzoff (1997), and Keil (1989) went beyond providing disconfirming evidence and began to lay out an alternative positive vision of how cognitive development proceeds.

A second root of the Theory-Theory derives from philosophy of science, particularly from Kuhn’s account of theory change and scientific revolutions. Kuhn’s view is too complex to summarize here, but two aspects of it have been particularly influential in developmental psychology.

One is Kuhn’s notion that theory change in science involves periods of ‘normal science’, during which a mature theory is applied successfully to a range of phenomena, and periods of ‘paradigm shifting’. A paradigm shift occurs when counterevidence to a theory has built up beyond a certain threshold and it can no longer be adequately modified in response, consistent with its not becoming intolerably ad hoc. In paradigm shifts, new explanatory notions and models take center stage, and old ones may be pushed to the margins or adopt new roles. New practices and styles of experimentation become central. These changes are relatively discontinuous compared with the usual gradual accumulation of changes and modifications characteristic of science.

A second, connected with this notion of a paradigm shift, is the Kuhnian doctrine of incommensurability. This is the idea that when new theories are constructed, the central explanatory concepts of the old theory often change their meaning, so that a claim made before and after a paradigm shift, even if it uses the same words, may not express the same proposition, since those words now express different concepts. The concept of mass, as it existed in pre-relativistic physics, no longer means the same thing—indeed, we now need to distinguish between uses of ‘mass’ that pick out rest mass and those that pick out relativistic mass. Often this involves creating new concepts that cannot be captured in the conceptual vocabulary of the old theory, differentiating two concepts that were previously conflated, or coalescing two previously distinct concepts into one. In all of these cases, the expressive vocabulary of the new conceptual scheme is not equivalent to that of the old scheme. Theory theorists have often adopted both the Kuhnian claim about paradigm shifts as a model for understanding certain phenomena in development, and the associated claim of semantic or conceptual incommensurability (Carey, 1991).

A third root involves what Keil (1989) dubs the rejection of ‘Original Sim’. Original Sim is roughly the view of category structure and learning suggested by prototype theory, particular  an empiricist one. This view is most clearly defended by Quine (1977), who proposes that children begin life with an innate similarity space that is governed by perceptual information, and over time begin to develop theoretical structures that supersede these initial groupings. On this empiricist perspective, children’s first concepts should be bundles of perceptual features, typically consisting of intrinsic rather than relational properties, and categorization should be simply a matter of matching the perceived features of a novel object to those of the concept. Inductive inferences concerning a category are within reach so long as they are confined to these observable properties, and objects share inductive potential to the extent that they are similar to the perceptual prototype. Moreover, these perceptual prototypes are assumed to be acquired by statistical tabulation of observed co-occurrences in the world, in a relatively theory-free way; seeing that certain furry quadrupeds meow is sufficient for constructing a cat concept that encodes these properties. It is only at later stages of development that concepts reflect understanding of the hidden structure of categories, and come to enable inductions that go beyond such similarities.

Fourth, Theory-Theory is often motivated by the hypothesis that certain concepts (or categories) have a kind of coherence that makes them seem especially non-arbitrary (Murphy & Medin, 1985; Medin & Wattenmaker, 1987). The categories of diamonds, sports cars, or otters seem to be relatively ‘coherent’ in the sense that their members bear interesting and potentially explainable relations to one another: diamonds are made of carbon atoms whose organization explains their observable properties, otters share a common ancestry and genetic-developmental trajectory that explain their phenotype and behavior, and so forth. On the other hand, the category of things on the left side of my desk, or things within 100 feet of the Eiffel Tower, or things that are either electrons or clown wigs, do not. They are simply arbitrary collections.

Feature-based theories of concepts, such as prototype theory, seem to have particular difficulty explaining the phenomenon of coherence, since they are inherently unconstrained and allow any set of properties to be lumped together to form a category, whereas our concepts often appear to represent categories as involving more than merely sets of ad hoc co-instantiated properties. They include relations among these properties, as well as explanatory connections of various sorts. We don’t merely think of sports cars as expensive artifacts with four wheels, big engines, a sleek shape, and bright coloration, which make a loud noise as they roar past at high speed. These features are explanatorily connected in various ways: their shape and engine size contribute to their speed, their engines explain their noisiness, their speed and attractive appearance explain their costliness, and so on. Insofar as these explanatory relations among properties are represented, concepts themselves are more coherent, reflecting our implicit belief in the worldly coherence of their categories. Theories are the conceptual glue that makes many of our everyday and scientific concepts coherent, and models of concepts that fail to accord theories an important role are missing an account of a crucial phenomenon (however, see Margolis, 1999 for detailed criticism of this notion).

From this survey, it should be clear that the development of Theory theories of concepts has been driven by a host of different motivations and pressures. Hence there exist many flavors of the view, each with its own distinctive formulation, concerns, and central phenomena. However, despite the fact that the view lacks a canonical statement, it possesses a set of family resemblances that make it an interesting source of predictions and a robust framework for empirical research, as well as a unified target of criticism.

b. Theories Defined

The first essential posit of these views is the notion of a mentally represented theory. Theories are bodies of information (or, as psychologists and linguists sometimes say, bodies of knowledge) about a particular domain. Such theories have been posited to explain numerous psychological capacities: linguistic competence results from a theory of the grammar of English or Urdu; mental state attribution results from a theory of mind; even visual perception results from a theory of how 3-D objects in space behave in relation to the observer. But theories are not just any body of information held in memory. What makes theories distinctive or special? Keil (1989, p. 279) called this “the single most important problem for future research” in the Theory-Theory tradition.

Gopnik & Meltzoff (1997, pp. 32-41) give what is probably the most comprehensive set of conditions on theories. These conditions fall into three categories: structural, functional, and dynamic. Structurally, theories are abstract, coherent, causally organized, and ontologically committed bodies of information. They are abstract in that they posit entities and laws using a vocabulary that differs from the vocabulary used to state the evidence that supports them. They are coherent in that there are systematic relations between the entities posited by the theory and the evidence. Theories are causal insofar as the structure that they posit in the world to explain observable regularities is ordinarily a causal one. Finally, they are ontologically committed if the entities that they posit correspond to real kinds, and also support counterfactuals about how things would be under various non-actual circumstances. Some of these conditions are also advanced by Keil (1989, p. 280), who proposes that causal relations are central to theories, especially where they are homeostatic and hierarchically organized.

Functionally, theories must make predictions, interpret evidence in new ways, and provide explanations of phenomena in their domain. The predictions of theories go beyond simple generalizations of the evidence, and include ranges of phenomena that the theory was not initially developed to cover. Theories interpret evidence by providing new descriptions that influence what is seen as relevant or salient and what is not. And crucially, theories provide explanations of phenomena, understood as an abstract, coherent causal account of how the phenomena are produced and sustained. Theories are essentially related to the phenomena that make up their domain; hence in Keil’s developed view, there is a key role for associative relations in providing the raw data for theoretical development as well as a ‘fallback’ for when theories run out (Keil, 1989, p. 281).

Last, theories are not static representations, but have dynamic properties. This follows from the fact that they develop in response to, and may gain in credibility or be defeated by, the empirical evidence. The sorts of dynamic properties that characterize theories include: an initial period involving the accumulation of evidence via processes of experimentation and observation, the discovery of counterevidence, the possible discounting of such evidence as noise, the generation of ad hoc hypotheses to amend a theory, the production of a new theory when an old one has accumulated too much contrary evidence or too many ugly and complicated auxiliary amendments.

c. Concepts in Theories Versus Concepts as Theories

Once the central explanatory construct of a mental theory is clear, two varieties of the Theory-Theory need to be distinguished. These views differ on the nature of the relationship between concepts and theories.

On the concepts in theories view, concepts are the constituents of theories. Theories are understood as something like bodies of beliefs or other propositional representations, and these beliefs have concepts as their constituents. The belief that electrons are negatively charged is part of our theory of electrons, and that belief contains the concept electron as a part (as well as has negative charge). The set of electron, involving beliefs that meet the sorts of constraints laid out in section 2b, constitute one’s theory of electrons. These beliefs describe the sorts of things electrons are, how they can be expected to behave, how they are detected, how they relate to other fundamental physical entities, how they can be exploited for practical purposes, and so on. For the concepts in theories view, concepts function much like theoretical terms.

On the concepts as theories view, on the other hand, the constituency relations run the opposite direction. Concepts themselves are identified with miniature theories of a particular domain. For instance, Keil (1989, p. 281) proposes that “[m]ost concepts are partial theories themselves in that they embody explanations of the relations between their constituents, of their origins, and of their relations to other clusters of features.” So the concept electron would itself be made up of various theoretical postulates concerning electrons, their relationship to other particles, their causal propensities which explain phenomena in various domains of physics, and so on. Concepts are not terms in theories, they are themselves theories.

As stated, the concepts in theories view is scarcely controversial. If people possess mentally represented theories at all, then those theories are composed of beliefs and concepts, and so at least some of our concepts are embedded in theory-like knowledge structures. Call this the weak concepts in theories view. A strong concepts in theories view, on the other hand, says that not only are concepts embedded in theories, but they are also individuated by those theories. Carey (1985, p. 198) seems to hold this view: “Concepts must be identified by the roles they play in theories.” This is just to say that what makes them the very concepts that they are is their relationships (inferential, associative, causal, explanatory, and so forth.) with the other concepts and beliefs in the theory. There are many ways of carving out different notions of inferential or theoretically significant roles for concepts to play, but on all of them, concepts are constituted by their relations to other concepts and to the evidence that governs their conditions of application.

A consequence seems to be that if those relationships change, or if the theory itself changes in certain respects, then the concepts change as well. The change from a view on which atoms are the smallest, indivisible elements of matter to one on which atoms are made up of more fundamental particles might represent a sufficiently central and important change that the concept atom itself is no longer the same after such a transition takes place; similarly, perhaps the victory of anti-vitalism entailed a change in the concept life from being essentially linked with a particular irreducible vital force to being decoupled from such commitments. Notice that this consequence also applies to the concepts as theories view. If a concept is identified with a theory (rather than being merely embedded in it), it seems as if, prima facie, any change to the theory is a change to the concept.

The concepts as theories view poses separate difficulties of its own. On this view, concepts are extremely complex data structures composed of some sort of theoretical principles, laws, generalizations, explanatory connections, and so on. What status do these have? A natural suggestion is to regard all of these as being beliefs. But this view is straightforwardly incompatible with a view on which concepts are the constituents of beliefs and other higher thoughts. It is mereologically impossible both for concepts both be identified with terms in theories and with theories themselves. We would need some other way of talking about the representations that make up beliefs if we choose to regard concepts as simply being miniature theories.

Despite the differences between these two views, the empirical evidence taken to support the Theory-Theory does not generally discriminate between them, nor have psychologists always been careful to mark these distinctions. As with many debates over representational posits, the views in question generate differing predictions only in combination with supplementary assumptions about cognitive processing and resources. However, there may be theoretical reasons for preferring one view over the other; these will be discussed further in section 5.

3. Support for the Theory-Theory

a. Cognitive Development

Much of the support for the Theory-Theory comes from developmental studies. Carey (1985) largely initiated this line of research with her investigations of children’s concepts of animal, living thing, and kindred biological notions. For example, she found that major changes occur in children’s knowledge of bodies and their functioning from four to eleven years old. The youngest children understand eating, breathing, digesting, and so forth, mainly as human behaviors, and they explain them in terms of human needs, desires, plans, and conventions. Over time, children build various new accounts of bodies, initially treating them as simple containers and finally differentiating them into separate organs that have their own biological functions. In Carey’s terms, young children start out seeing behavior as governed by an intuitive psychological theory, out of which an intuitive biology develops (1985, p. 69).

The centrality of humans to young children’s understanding of living things can be seen in several studies. Four and five year olds are reluctant to attribute animal properties—even eating and breathing—to living beings other than humans. When asked to name things that have various properties of living things, children overwhelmingly pick ‘people’ first, followed by mammals, and then a few other scattered types of creatures. The primacy of people in biology carries over to judgments of similarity, with adults displaying a smooth gradient of similarity between people and other living things and six year olds seeing a sharp dividing line between people and the rest of the animal kingdom, including mammals. Finally, in inductive projection tasks people are clearly paradigmatic for four year olds: if told that a person has an organ called a spleen, they will project having a spleen to dogs and bees, but rarely the opposite. By age 10, people are seen as no longer unique in this respect. So young children’s theory of life is focused initially around humans as the paradigm exemplars, and only later becomes generalized as they discover commonalities among all animals and other living things. Indeed, the very concept living thing comes to be acquired as this knowledge develops.

Keil (1989) added to the evidence with many striking results concerning how children’s concepts of natural kinds, nominal kinds, and artifacts develop from kindergarten onwards. He finds compelling evidence for what he initially called a ‘characteristic-to-defining’ shift in conceptual structure. Characteristic features are akin to prototypes: compilations of statistically significant but possibly superficial properties found in categories. Defining features, on the other hand, are those that genuinely make something the kind of thing that it is, regardless of how well it corresponds to the observed characteristics.

In a series of discovery studies, Keil (1989) gave children descriptions of objects that have the characteristic features belonging to a natural kind, but which were later discovered to have the (plausible) defining features of a different kind; for example, an animal that looks and acts like a horse but which is discovered to have the inside parts of cows as well as cow parents and cow babies. While at age five, children thought these things were horses, by age seven they were more likely to think them cows, and adults were nearly certain these were cows. Defining features based on biology (internal structure, parentage) come to dominate characteristic features (appearance, behavior).

In a related series of transformation studies, children heard about a member of a natural kind which underwent some sort of artificial alterations to its appearance, behavior, and insides; for example, a raccoon that was dyed to look like a skunk and operated on so that it produces a foul, skunk-like odor. Five-year olds thought these transformations changed the raccoon into a skunk, while seven year olds were more resistant, and nine year olds were nearly sure that such changes in kind weren’t possible. This effect was notably stronger for biological kinds than mineral kinds; however, children at all ages strongly resisted the idea that a member of a biological kind could be turned into something from a different ontological category (for example, an animal cannot be turned into a plant). Finally, some kinds of transformations are more likely to change a thing’s kind: among five year olds, alterations to internal or developmental features along with permanent surface parts are more effective than temporary surface changes or costumes, and internal changes retain their influence until at least age nine.

In Keil’s view, this shows that children may start out with a comparatively impoverished theory of what makes something a member of a biological kind (or a mineral kind, social kind, or artifact kind), but this theory is enriched and deepened with more causal principles governing origins, growth, internal structure, reproduction, nutrition, and behavior. As this network of causal principles becomes more enriched they recognize that the category members are defined by the presence of these theoretically significant linkages rather than by the more superficial features that initially guided them. ‘Primal theories’ develop into more mature folk theories in different domains according to their own time course.

Finally, Gopnik & Meltzoff (1997) survey a range of domains to argue for the early emergence of theories. To take one example, they argue that children’s understanding of objects and object appearances starts off as highly theoretical and develops in response to new experience until they achieve adult form. Six-month olds, for instance, fail to search for objects that are hidden behind screens, and they show no surprise when an object moves behind a screen, fails to appear at a gap in the middle of the screen, but then appears whole from behind the other side of the screen. These behaviors only emerge at 9 months. Gopnik & Meltzoff explain this change by claiming that the infants come to understand that occlusion makes objects invisible. Until 12 months, however, they continue to make the ‘A-not-B’ error, which involves searching for an object under the first occluder it disappeared behind, rather than the last one. They ascribe this failure to children’s adherence to an auxiliary hypothesis of the form: objects will be where they appeared before. This rule is abandoned when it comes to conflict with the evidence and the child’s developing theory of object behavior. In addition, from ages 12 to 18 months, children begin to systematically play (‘experiment’) with hiding and invisible displacement, suggesting that they are interested in generating evidence about this developing cognitive domain. This in turn strengthens the analogy between cognitive development and active theorizing by adult scientists.

b. Adult Categorization, Inference, and Learning

Murphy & Medin (1985) argued in largely abstract fashion that categorization should be seen as a process of explaining why an exemplar belongs with the rest of a category. A man who jumps into a swimming pool while fully clothed at a party is plausibly drunk, even though these are not features of drunks in general—or they certainly are not stored as such in one’s default drunk concept. Theories and explanatory knowledge are required to focus on the relevant features of categories in a variety of tasks and contexts. Research with adults has tended to support this perspective.

One significant piece of evidence that comports with the general Theory-Theory perspective is the causal status effect (Ahn & Kim, 2000). The effect is the tendency of participants to privilege causally deeper or more central properties in a range of tasks including categorization and similarity judgment. For example, if people are taught about a person who has a cough caused by a certain kind of virus, and then given two other descriptions, one which matches in the cause (same virus) but not the effect (runny nose), and another that matches in the effect (cough) but not the cause (different virus), common causal features make exemplars more similar. Matching causal features can even override other shared features in categorization. If taught about an example with a cause that produces two effect features and two other examples, one of which shares the cause only and the other of which shares both effects, a majority of participants group the common cause exemplar with the original, even though they differ in most features.

Murphy (2002) reviews an extensive body of evidence showing that background knowledge has a pervasive effect on category learning, categorization, and induction. To take two examples, consider artificial category learning and category construction. In learning studies, participants are given two categories that are distinguished by different lists of features. The features that describe a more ‘coherent’ category in which the features are very plausibly related to each other (for example, ‘Made in Norway, heavily insulated, white, drives on glaciers, has treads’) were played against those that describe a more ‘neutral’ category. Participants found the coherent categories much easier to learn, and retained more information about them. Similarly, if given the ability to freely sort these items into categories they tended to group the coherent category members together even when they shared only a single feature. Background knowledge concerning the likely relationships among these features plays an essential role in learning and categorizing, even when it is not explicitly brought up in the experiment itself. This further undermines the prototype theory’s account of learning as a process of atheoretical tabulation of correlations.

4. Relations to Other Views

a. Relations to Essentialism

The Theory-Theory is closely related to psychological essentialism (henceforth just ‘essentialism’), the claim that people tend to represent categories as if they possessed hidden, non-obvious properties that make them the sorts of things that they are and that causally produce or constrain their observable properties (Medin & Ortony, 1989). These essences need not be actually known, but may be believed to exist even in the absence of detailed information about them. Concepts may include either conjectures as to what their essential properties might be, or else blank ‘essence placeholders’ that govern in the absence of these as-yet-unknown essential properties. Commitment to essences may be viewed as a kind of theoretical commitment, insofar as essences are causally potent but unobserved properties that structure and explain observable properties of categories. More generally, it is the commitment to there being a certain kind of causal structure underlying the categories we commonly represent.

There is a large body of evidence that supports the psychological essentialist hypothesis (Gelman, 2003, 2004; see Strevens, 2000 for criticism). For example, children’s inductions are governed by more than superficial resemblances among objects. In a standard inductive projection paradigm, participants are presented with a triad of pictures of objects only two of which perceptually resemble each other (for example, a leaf, a leaf-shaped insect, and a small black insect) and two of which share a verbal label (for example, both insects, while dissimilar, are called ‘bugs’, and the leaf is called a ‘leaf’). They are then told that one object of the resembling pair has a certain property and asked to project the property to the third object. By 30 months, children will project properties on the basis of labeled category membership rather than similarity. This effect does not depend on the precise repetition of the verbal label (that is, synonyms work just as well), and it tends to be more powerful in natural biological kinds than in artifacts. Even among 16- to 21-month olds one can find similar effects: behaviors displayed with one sort of toy animal (barking, chewing on a bone, and so forth.) will be imitated with a perceptually dissimilar animal if they are given a matching label. This suggests that induction is not entirely governed by superficial properties even among very young children.

Children may entertain more specific hypotheses about what the underlying category essences are as well. In Keil’s transformation studies, some participants, when debriefed, maintained that parentage was important to determining kind membership. In a number of studies, Gelman and her collaborators (see, for example, Gelman & Wellman, 1991) have shown that among four to five year olds, insides have a special theoretical role to play. Children can distinguish similar-appearing objects (pigs and piggy banks) from those that have similar insides, and they judge that removing a creature’s insides both removes its category-typical behaviors and also makes it no longer the same kind of thing. Removing outsides or changing a transitory property has little effect on membership or function.

These studies provide further evidence that the Original Sim has at best a weak grip on young children. Moreover, they reinforce the claim that categorization can sometimes be dominated by an early-emerging understanding of biology that treats stereotypical properties as non-dispositive. Gelman’s own robust psychological essentialism includes further claims such as that category boundaries are invariably taken to be sharp rather than fuzzy, and that essences invariably focus on purely internal properties. Whatever the status of these additional claims, the broader moral of the essentialism literature is in line with the proposals made by Theory theorists. Children come prepared to learn about deeper causal relations in many domains and they readily treat these relations as important in categorizing and making inductions.

b. Relations to Causal Modeling Approaches

In recent years much attention has focused on the role of causality in cognition, and consequently theories of cognitive performance that emphasize causal modeling have gained prominence. The idea that concepts might be identified (at least in part) with causal models has grown out of this tradition.

The theory of causal models is a formally well-developed and quantitatively precise way of describing probabilistic and causal dependency information, particularly in graphical form (for accessible introductions, see Gopnik & Shulz, 2007; Glymour, 2001; Sloman, 2005). Briefly, a causal model of a category depicts part of the relevant causal information about how things in the domain are produced, organized, and function. A causal model of a bird notes that it has wings, a body, and feathers, but also encodes the fact that those features causally contribute to its being able to fly; a causal model of a car depicts the fact that it is drivable in virtue of having wheels and an engine, that it can transport people because it is drivable, and that it makes noise because of its engine. These structures can be represented as sets of features connected by arrows, which indicate when the presence of one property causes or sustains (and therefore makes more probable) the presence of another. These directed causal graphs provide one possible representational format for concepts.

For example, Chaigneau, Barsalou, & Sloman (2004) have proposed the H I P E theory of artifact categorization, which states that artifacts are grouped according to their Historical role, the Intentions of the agents that use them, their Physical structure, and the Events in which they participate. On H I P E, artifact concepts are miniature causal models of the relations among these properties, all of which may potentially contribute to making something the kind of artifact that it is. Similar sorts of models could be developed for natural kind concepts. Indeed, essentialism itself is one form that a causal model can take: the essence is the ‘core’ of the concept, and it causally produces the more superficial features. Causal model theory is a generalization of this idea that allows these graphs to take many different forms.

Causal model theory is best seen as one form that the Theory-Theory can take (Gopnik & Schulz, 2004; Rehder, 2003). It shares that view’s commitment to causal-explanatory structure being central to concepts. While it is tied to a more specific hypothesis about representation than Theory-Theory in general (the formalism of directed causal graphs), this is also a strength, since these models are part of a well-developed framework for learning and processing. Causal model theory gives the Theory-Theory the resources to develop more wide-ranging and detailed empirical predictions concerning categorization, induction, and naming.

It is also worth noting that causal model theory may give the concepts as theories view the resources to answer the mereological objection it faces. The components of causal models can be seen as features representing properties, connected by links representing causal relations. Many models of concepts take them to be complex structures composed of features in this way. If we see causal models as miniature theories, then we can view concepts as theories if we identify them with such models. Adopting this approach eliminates any potential problems about concepts being both the constituents of beliefs and also being composed of beliefs.

5. Objections to the Theory-Theory

a. Holism

The holism objection focuses on the fact that the individuation conditions for concepts are closely tied to those for theories. They are holistic, meaning that a concept’s identity depends on its relations to a large set of other representational states. This position is suggested by Murphy & Medin’s comment that “[i]n order to characterize knowledge about and use of a concept, we must include all of the relations involving that concept and the other concepts that depend on it” (1985, p. 297). This gives rise to problems concerning the stability of concepts. The objection may be put as follows. Suppose concepts are identified by their relation to theories. Then changes in theories entail changes in concepts: if C1,…,Cn are constituted by their relation to T1, and T1 changes into T2, then at least some of C1,…,Cn will have to change as well, so long as the changes in the theories occurs in the parts that contribute to individuating those concepts. And it is part of the developmental and dynamical account of the Theory-Theory that such transitions in theories take place. So according to the Theory-Theory, concepts are unstable; they change over time, so that one does not have the same concepts before a revision in theory that one has afterwards.

The conclusion is particularly objectionable if one assumes that there will be many changes to theories, so that concepts also change frequently. But there are reasons to want concepts to be more stable than this. First, one wants to be able to compare concepts across individuals with different theories. A young child may not have the fully developed life concept, but she and I can still have many common beliefs about particular living things and their behavior, even if she does not represent them as being alive in the way that I do (that is, even if her understanding of life is impoverished relative to mine). Second, the rationality of theory change itself depends on some intertheoretic stability of concepts: Rejecting theory T1 may involve coming to believe that belief B formulated using T1’s concepts is false. So now that I believe T2, I reject B. But if changing from T1 to T2­ involves changing the concepts involved in B, I can no longer even formulate that belief, since I now lack the required conceptual resources. So we are at a loss to describe the rational nature of the transition between theories.

What this suggests is that belief attributions are often stable across theory changes; or at least, not every change in one’s background theory should change many or all of one’s concepts (and hence beliefs). Some sort of independence from belief is required. The problem is that concepts are individuated by their roles, which in turn are determined by the causal, inferential, and evidential roles of the propositions that contain them, and these are precisely what change as theories do (Fodor, 1994; Margolis, 1995).

This problem faces both the strong concepts in theories view and the concepts as theories view, but the weak concepts in theories view is immune to it, since it allows that concepts may participate in theories without being individuated by them. Two responses to the holism objection are typical. First, some Theory theorists (for example, Gopnik & Meltzoff, 1997) have embraced it. It is, they suggest, not implausible that young children are to a certain degree incomprehensible to adults, as would be predicted if their world view is incommensurable with ours (Carey, 2009, p. 378). Second, others have attempted to avoid this conclusion by distinguishing respects in which concepts may change (such as narrow content or internal conceptual role) and respects in which they may remain stable (such as wide content or reference). This dual-factor approach is also adopted by Carey (2009). The unstable respects are those that differ with background theories, while the stable respects provide continuity so that concepts can be identified across changes and differences in view. The success of this approach depends on whether the stable respects can do the relevant explanatory work needed in psychological explanation and communication.

b. Compositionality

A representational system is compositional if the properties of complex symbols are completely determined by the properties of the simpler symbols that make them up, plus the properties of their mode of combination. So predicate logic is compositional, since the semantic value of ‘Fa’ is determined by the semantic values of the predicate ‘F’ and the individual constant ‘a’. Similarly ‘Fa & Fb’ is semantically determined as a function of ‘Fa’, ‘Fb’, and the interpretation of conjunction. Many have argued that thought is compositional as well (Fodor, 1998), which entails that the properties of complex concepts derive wholly from the properties of their constituents.

If thought is compositional, and concepts are the constituents of thoughts, then whatever concepts are must also be compositional. So if concepts are (or are individuated by) theories, then theories must similarly be compositional. However, there are good reasons to think that theories are not compositional. A standard example is the concept of pet fish. The fish might come from the theory of folk biology, while pet might derive from a theory of human social behavior (since keeping pets is a sociocultural fact about humans). If the strong concepts in theories view is right, their content is determined by their inferential role in each of these theories. But pet fish has a novel inferential role that is not obviously predictable from those roles taken individually. Instances of pet fish, for example, are typically thought to live in bowls and feed on flakes, neither of which is true of pets or fish in general. This information is not derived from one’s ‘pet theory’ or ‘fish theory’. It is therefore not compositional. The same point can be made about the concepts as theories view. If one’s causal models of pets and fish do not somehow encode this information in the features that make them up, then it cannot be derived compositionally by putting them together. Since examples like this can be multiplied indefinitely, the Theory-Theory cannot account for the general compositionality of thought.

While many psychologists have simply ignored these concerns, several responses are possible. Here are two. First, one can divide concepts into two components, a stable compositional element and a non-compositional element (Rips, 1995). The compositional element might be thought of just as a simple label, while the non-compositional element includes theoretical and prototypical information. One part has the job of accounting for concept combination, the other has the job of accounting for categorization and inductive inference. Second, one can try to weaken the compositionality requirement. Perhaps concepts are required only to be compositional in principle, not in practice; or else compositionality might be viewed as a fallback strategy to be employed when there is no other information available about the extension of a complex concept (Prinz, 2002; Robbins, 2002). Whichever approach one takes, the compositionality objection highlights the fact that while the Theory-Theory has impressive resources for explaining facts about development and concept deployment, concept combination is more challenging to account for.

c. Scope

The scope objection is one that faces nearly every theory of concepts. In general, where such a theory proposes an identification of the form ‘concepts are K’, where ‘K’ is a kind of mental structure or capacity, the question can be raised: are all concepts like this? Or are there cases where someone might possess the relevant concept but not possess K? For instance, if concepts are prototypes, then there must be the right sort of prototype for every concept we can use in thought. A theory has satisfactory scope if there exists the right sort of K for every concept that we are capable of entertaining.

For the Theory-Theory, the problem seems to be that there are too few theories. We have concepts such as car, computer, gin, lemur, and nightstick. Perhaps for some of these we have theories, at least of a highly sketchy nature. But it is less clear that we have these for other concepts. One might have the concept higgs boson (from reading newspaper articles about the Large Hadron Collider) but have essentially no interesting knowledge of the Standard Model of particle physics. One might have the concept true but not have a theory of truth. One might have the concept billiards but know nothing of the game’s rules or conventions (‘that game they play in the UK that resembles pool’). If the Theory-Theory identifies each concept with a domain-specific theory, these scope challenges are serious. Denying that we have these concepts in virtue of lacking the relevant knowledge is unappealing.

One possible response is to restrict the scope of the Theory-Theory itself. Carey (1985) takes this tack. She does not think that every concept must be associated with a proprietary theory. Rather, concepts are embedded in relatively large scale theories of whole cognitive domains: “there are only a relatively few conceptual structures that embody deep explanatory notions—on the order of a dozen or so in the case of educated nonscientists. These conceptual structures correspond to domains that might be the disciplines in a university: psychology, mechanics, a theory of matter, economics, religion, government, biology, history, and so forth.” (Carey, 1985, p. 201). This approach, favored by other domain theorists, gives this version of the concepts in theories view a slight advantage over the concepts as theories view, since the latter is more clearly vulnerable to the scope objection. A defender of the concepts as theories view might fall back to the position that even very sketchy or minimal understanding of the causal principles at work in a category can count as a theory, as, even in these cases, we meet the minimal concept possession conditions, and our understanding is often equally superficial (Rozenblit & Keil, 2002).

d. Disanalogies Between Development and Science

The Theory-Theory relies heavily on the notion that what children do in constructing their knowledge of the world is quite literally like what scientists do in producing, testing, and revising the theories that constitute scientific knowledge. This implies that there is substantial cognitive continuity across development, so that infants and young children, along with older children and adults, employ the same theory-construction mechanisms that operate on prior theoretical representations plus new evidence to produce revised and, with luck, improved theories.

Many have challenged this picture on the grounds that what children do is not in fact sufficiently similar to what scientists do for them to be seen as instances of the same cognitive or epistemic process. These complaints are summarized by Faucher, Mallon, Nazer, Nichols, Ruby, Stich, & Weinberg (2002). They argue that scientific theory revision is a process that is inseparable from a host of cultural factors. For example, there are norms governing how one ought to gather and weigh evidence, as well as how one should revise one’s beliefs, and these govern the practice of science differently across times and cultures. Moreover, theories are usually socially transmitted (in the classroom, the lab, and in less formal contexts) along with these norms. So in science, society and mind interpenetrate in such a way that individual cognition needs to be receptive to external sources of authority, both with respect to theoretical knowledge and epistemic norms. The simple picture of theory revision as involving only initial theories and evidence should be rejected.

There are at least two possible responses to this anti-individualistic argument. One is to argue that while these social factors play a role in adult science, the essential core of scientific practice remains the adjustment of theories under the influence of evidence. Normative factors can eventually come to help us perform these tasks better or in ways that fit in more productively with the surrounding culture, but the basic mechanism of evidence-based revision must be present in any case. And the evidence suggests that it is operative even before these social factors have an effect. A second response would be to argue that this picture is in fact a correct and welcome revision to the overly simplistic view originally proposed by Theory theorists. We should expect there to be substantial cultural influences on children’s cognition, and some of the cross-cultural studies cited by Faucher et al. provide evidence in favor of this hypothesis. So we should enrich the Theory-Theory view of children’s early cognition, not abandon it entirely.

6. Conclusions

The Theory-Theory consists of many interrelated claims about concept individuation, structure, development, and processing. The claim that development of concepts and domain knowledge in children is driven by causal-explanatory expectations, perhaps of an essentialist sort, has been most extensively investigated. While there are some attempts to explain these data by appeal to empiricist principles (Smith, Jones, & Landau, 1996), the Theory-Theory has strong support here. Studies with adults also suggest that causal information is often important to categorization. The behavior of both adults and children has been characterized using the framework of causal models, enabling Theory theorists to frame their view in a formally precise way. Many of the assumptions that trouble the account, such as the strong concepts in theories view that generates the problems of holism and incommensurability, turn out not to be essential to its empirical success. The greatest problem the view faces may be one of scope, but this challenge is arguably faced by all other theories of concepts currently in contention (Machery, 2009; Weiskopf, 2009). Whether or not a thoroughgoing Theory-Theory perspective is ultimately vindicated, its key insights will have to be incorporated by any future comprehensive theory of concepts.

7. References and Further Reading

  • Ahn, W., & Kim, N. S. (2000). “The causal status effect in categorization: An overview.” In D. L. Medin (Ed.), Psychology of Learning and Motivation, Vol. 40 (pp. 23-65). New York: Academic Press.
  • Carey, S. (1985). Conceptual Change in Childhood. Cambridge: M I T Press.
  • Carey, S. (1991). “Knowledge acquisition: enrichment or conceptual change?” In S. Carey & R. Gelman (Eds.), The Epigenesis of Mind (pp. 257-291). Hillsdale, N J: Erlbaum.
  • Carey, S. (2009). The Origin of Concepts. Oxford: Oxford University Press.
  • Chaigneau, S.E., Barsalou, L.W., & Sloman, S. (2004). “Assessing the causal structure of function.” Journal of Experimental Psychology: General, 133, 601-625.
  • Faucher, L., Mallon, R., Nazer, D., Nichols, S., Ruby, A., Stich, S., & Weinberg, J. (2002). “The baby in the labcoat: Why child development is an inadequate model for understanding the development of science.” In P. Carruthers, S. Stich & M. Siegal (Eds.), The Cognitive Basis of Science (pp. 335-362). Cambridge: Cambridge University Press.
  • Fodor, J. (1994). “Concepts: A potboiler.” Cognition, 50, 95-113.
  • Fodor, J. (1998). Concepts. Oxford: Oxford University Press.
  • Gelman, R., & Baillargeon, R. (1983). “A review of some Piagetian concepts.” In J. H. Flavell and E. Markman (Eds.), Cognitive Development: Vol. 3 (pp. 167-230). New York: Wiley.
  • Gelman, S. (2003). The Essential Child. Oxford: Oxford University Press.
  • Gelman, S. (2004). “Psychological essentialism in children.” Trends in Cognitive Sciences, 8, 404-409.
  • Gelman, S., & Wellman, H. (1991). “Insides and essences: Early understandings of the nonobvious.” Cognition, 38, 213-244.
  • Glymour, C. (2001). The Mind’s Arrows. Cambridge: M I T Press.
  • Gopnik, A. (1988). “Conceptual and semantic development as theory change.” Mind and Language, 3, 197-217.
  • Gopnik, A. (1996). “The scientist as child.” Philosophy of Science, 63, 485-514.
  • Gopnik, A., & Meltzoff, A. (1997). Words, Thoughts, and Theories. Cambridge: M I T Press.
  • Gopnik, A., & Schulz, L. (2004). “Mechanisms of theory-formation in young children.” Trends in Cognitive Science, 8, 371-377.
  • Gopnik, A., & Schulz, L. (Eds.) (2007). Causal Learning. Oxford: Oxford University Press.
  • Hampton, J. A. (1995). “Similarity-based categorization: The development of prototype theory.” Psychologica Belgica, 35, 103-125.
  • Keil, F. C. (1989). Concepts, Kinds, and Cognitive Development. Cambridge: M I T Press.
  • Machery, E. (2009). Doing Without Concepts. Oxford: Oxford University Press.
  • Margolis, E. (1995). “What is conceptual glue?” Minds and Machines, 9, 241-255.
  • Margolis, E. (1999). “The significance of the theory analogy in the psychological study of concepts.” Mind and Language, 10, 45-71.
  • Medin, D., & Ortony, A. (1989). “Psychological essentialism.” In S. Vosniadou & A. Ortony (Eds.), Similarity and Analogical Reasoning (pp. 179-195). Cambridge: Cambridge University Press.
  • Morton, A. (1980). Frames of Mind. Oxford: Oxford University Press.
  • Murphy, G. (2002). The Big Book of Concepts. Cambridge: M I T Press.
  • Murphy, G., & Medin, D. (1985). “The role of theories in conceptual coherence.” Psychological Review, 92, 289-316
  • Medin, D., & Wattenmaker. (1987). “Category cohesiveness, theories, and cognitive archeology.” In U. Neisser (Ed.), Concepts and Conceptual Development (pp. 25-63). Cambridge: Cambridge University Press.
  • Premack, D., & Woodruff, G. (1978). “Does the chimpanzee have a theory of mind?” Behavioral and Brain Sciences, 1, 515-526.
  • Prinz, J. (2002). Furnishing the Mind. Cambridge: M I T Press.
  • Quine, W. V. (1977). “Natural kinds.” In S. Schwartz (Ed.), Naming, Necessity, and Natural Kinds (pp. 155-175). Ithaca: Cornell University Press.
  • Redher, B. (2003). “A causal-model theory of conceptual representation and categorization.” Journal of Experimental Psychology: Learning, Memory, and Cognition, 29, 1141-59.
  • Rips, L. (1995). “The current status of research on concept combination.” Mind and Language, 10, 72-104.
  • Robbins, P. (2002). “How to blunt the sword of compositionality.” Nous, 36, 313-334.
  • Rosch, E. (1977). “Human categorization.” In N. Warren (Ed.), Advances in Cross-Cultural Psychology: Vol. 1 (pp. 1-49). London: Academic Press.
  • Rosch, E., & Mervis, C. (1975). “Family resemblances: Studies in the internal structure of categories.” Cognitive Psychology, 7, 573-605.
  • Rozenblit, L., & Keil, F. (2002). “The misunderstood limits of folk science: An illusion of explanatory depth.” Cognitive Science, 26, 521-562.
  • Sloman, S. (2005). Causal Models. Oxford: Oxford University Press.
  • Smith, E. E., & Medin, D. (1981). Concepts and Categories. Cambridge: Harvard University Press.
  • Smith, L., Jones, S., & Landau, B. (1996). “Naming in young children: A dumb attentional mechanism?” Cognition, 60, 143-171.
  • Strevens, M. (2000). “The essentialist aspect of native theories.” Cognition, 74, 149-175.
  • Weiskopf, D. (2009). “The plurality of concepts.” Synthese, 169, 145-173.
  • Wellman, H., & Gelman, S. (1988). “Cognitive development: Foundational theories of core domains.” Annual Review of Psychology, 43, 337-375.

Author Information

Daniel A. Weiskopf
Email: dweiskopf@gsu.edu
Georgia State University
U. S. A.

Lucian Blaga (1895—1961)

Lucian BlagaLucian Blaga was a prominent philosopher in Eastern Europe during the period between the two world wars. Trained in both Eastern Orthodox theology and classical philosophy, he developed a “speculative” philosophy that includes books on epistemology, metaphysics, aesthetics, philosophy of culture, philosophical anthropology, philosophy of history, philosophy of science, and philosophy of religion. A chair in philosophy of culture was created for him at the University of Cluj, a leading Romanian university of the period, now Babes-Bolyai University.

Unfortunately, the height of Blaga’s career coincided with WWII, after which Romania was occupied by Soviet troops that installed a socialist government. The new government removed Blaga and many other professors from their university posts. Although Blaga was forbidden to teach and publish, he continued to study and write. Eventually thirty-four of his books of philosophy were published. At the heart of his philosophical publications are four trilogies that constitute a systematic philosophy, a feat that has rarely been attempted since Hegel. He also published books of poetry and theater, plus one novel.

Today Blaga is a national figure in Romania, but because of the unfortunate circumstances surrounding his career, he is barely known to the outside world. However, because of his creativity and systematic vision, his work is being actively studied in Europe in the 21st century. There are those who argue that this mid-20th century philosopher can make valuable contributions to issues that philosophers are still struggling with today.

This article begins by explaining Blaga’s intellectual formation, which provides a context for understanding his philosophy. It then introduces the main features of his philosophical system and provides a bibliography of primary and secondary sources for further study.

Table of Contents

  1. Biographical Sketch
  2. Philosophy of Philosophy
  3. Epistemology
  4. Metaphysics
  5. Other Philosophical Issues
  6. References and Further Reading
    1. Primary Sources
      1. English
      2. Romanian
      3. Other Languages
    2. Secondary Sources in English

1. Biographical Sketch

 

Lucian Blaga (1895-1961) was the son of a village priest. The village was Lancram, an ethnically Romanian village on the eastern edge of the Austro-Hungarian Empire. The priest was Isidor Blaga, an avid reader who seems to have enjoyed studying German philosophy as much as Orthodox theology. Isidor only reluctantly accepted the priestly vacancy in Lancram that was created by the premature death of his own father. A lack of finances to continue his education made this choice a necessity, but the priestly vocation was not Isidor’s original goal: he seems to have harbored more academic aspirations. His interest in higher learning, his interest in philosophy and his personal library would leave a profound imprint on his youngest son.

Life in the Romanian village too left a profound imprint on Blaga. A Romanian village of the late nineteenth and early twentieth century was essentially unaltered from medieval times. There was no industry and little mechanization to speak of. The economy was agricultural. The only schooling was a one-room primary school. Yet, a wealth of traditional wisdom preserved in legends, ballads, poems, and perhaps especially the multitude of proverbs and aphorisms that Lucian Blaga later collected and published in several volumes, provided its own kind of philosophical insight. The culture of the Romanian village would be the subject of Blaga’s acceptance speech given at his induction into the Romanian Academy in 1936.

Because of the dismal academics of the village elementary school and the complete lack of a high school, Isidor and his wife, Ana, sacrificed to send Lucian to private boarding schools. The urban centers in Transylvania were either Hungarian or German, and as a result the best educational opportunities were in Hungarian or German schools. The first school that Blaga attended was a German boarding school in the nearby town of Sebes. Here he studied in German and read important German thinkers. Blaga next enrolled in the prestigious Andrei Saguna High School in the city of Brasov where he studied German, Hungarian, Latin, and Greek. He was particularly interested in the natural sciences, the philosophy of science, and also world religions. A senior thesis was required for graduation from Andrei Saguna: Blaga’s was on Einstein’s relativity and Poincare’s non-Euclidean geometry.

Blaga intended to enroll in a German university upon graduation, but the onset of WWI prevented this. He took the only option open to him besides military service: he enrolled in the Romanian Orthodox seminary in Sibiu, another Transylvanian city. Although he neither enjoyed nor excelled at the more pastoral aspects of the seminary curriculum, he surpassed his classmates in the theoretical areas of study. During this period he especially increased his understanding of philosophy of religion and the history of art.

Blaga completed his undergraduate degree in 1917 and enrolled in the PhD program in philosophy at the University of Vienna. During this time he published his first two books, a book of poetry and a book of aphorisms, which sold well in Romania and helped him finance his studies. He successfully defended his dissertation in November of 1920. It was titled Kultur und Erkenntnis (Culture and Knowledge).

Blaga’s life experiences growing up in a multicultural region (Transylvania was populated by Romanians, Hungarians, Germans, Gypsies, and several smaller minority groups); experiencing the contrast between the peasant village and the modern city; and studying Eastern Orthodox theology and liturgy side by side with European philosophy, history, and science spawned in him a philosophical outlook that gives prominent place to philosophy of culture. Within the broad framework of culture Blaga finds places for science, art, history, and religion.

Blaga’s philosophical system interacts with the theories of classical and contemporary philosophers and intellectuals. These range from the pre-Socratics through early twentieth century philosophers like William James, Edmund Husserl, and Henri Bergson, and also intellectuals such as Albert Einstein, Sigmund Freud, and Paul Tillich. The influence of Plotinus, Kant, and the German Romantics is clearly evident. But Blaga’s philosophy is not merely an attempt to synthesize the insights of other thinkers. It is a systematic and integrated attempt to address the issues that occupied the leading minds of his day.

Blaga’s early philosophical career involved research and publication rather than teaching. He published many articles and a number of books, and served as an editor for several literary magazines. His publications range from fiction to philosophy. His first book of philosophy, Filosofia stilului (The Philosophy of Style), was published in 1924. He also served his country as a statesman. In 1936 he was inducted into the Romanian Academy (a prestigious research institution). Finally in 1938 a special chair of philosophy of culture was created in his honor at the University of Cluj, a leading Romanian university in Transylvania. His inaugural essay was titled Despre plenitudinea istorica (Concerning Historical Fullness). He taught at the University of Cluj until 1949, when he was removed from his post by the newly installed Socialist government of Romania. After this he was not permitted to teach and was only permitted to publish under strict supervision. After his death in 1961 a number of his works were permitted to be posthumously published, and since the fall of Romanian communism Blaga’s entire oeuvre has been republished.

2. Philosophy of Philosophy

 

Some philosophers philosophize about a range of topics without ever turning their analytic skills to an analysis of philosophy itself. In contrast to this, Blaga begins his systematic philosophy with an entire book dedicated to a critical discussion of philosophy. The title of this book is Despre constiinta filosofica (Concerning Philosophical Consciousness). It describes philosophy as a creative and constructive attempt to understand reality. Therefore Blaga views the construction of a systematic metaphysics as the pinnacle of the philosophical enterprise.

At the same time, Blaga rejects the notion that a philosophical system can ever provide an ultimate analysis of reality. All philosophical systems are creative interpretations that attempt to express reality symbolically through human language. But despite this apparent futility inherent in the philosophical task, Blaga views the attempt to “reveal” ultimate reality as the activity that brings humans closest to fulfillment.

The philosophical method is characterized by the utilization of logic and the attempt at objectivity, but subjectivity and style also have their place. Blaga elucidates the proper function of each. He argues that every philosophical system has a central thesis that orients the system. He names the orientation that this point provides the “transcendental accent.” This accent is an expression of the style of the philosopher who created it. For example, the knowledge of the Ideals, which serves as the transcendental accent in Plato’s philosophy, seems to reflect Plato’s own expansive personality and his drive to understand the many disparate aspects of his world.

It is not only the individual who is affected by controlling themes: such beliefs affect entire cultures. Sometimes these themes enter the public subconscious and are called “common sense.” Blaga recognizes that common sense beliefs often preserve wisdom based upon a wealth of experiences, but also warns that they sometimes preserve the prejudices and mistakes of previous generations. He then makes an important and controversial observation about the relationship of philosophy to common sense: whereas many Anglo-American philosophers have taken it upon themselves to defend common sense, Blaga argues that a philosopher must shun the conformist impulse underlying belief in common sense. A philosopher sometimes needs to attack common sense beliefs in order to open up a space for greater understanding.

This position vis-à-vis common sense could be taken as an indication that Blaga views philosophy as a realist rather than a constructivist attempt to understand reality. This would be at least partially mistaken, however. Blaga views philosophy as a creative activity that is justified even when its results are mistaken, since it deepens and enriches the understanding of the problematic of the human spirit. Thus philosophical speculation is justified quite apart from the truthfulness of its theories. Philosophy is justified by its fruitfulness, vision, suggestiveness, foresight, and how it stirs the soul, even if its theories never attain complete perfection.

This seems to suggest that philosophy is closer to art than to science, a view that would displease many analytic philosophers. Blaga finds both differences and similarities between philosophy and science. The primary difference between philosophy and science is methodological. Both begin with a set of objective data (the aria) and with certain presuppositions about how the data will be handled (the interior horizon of the problem). But while the data of science is very specific and the presuppositions are detailed and complex, the data of philosophy includes all of human experience and the presuppositions are very general. Because of this, a scientific investigation is very closely controlled by its initial data and its presuppositions; while philosophical investigation is open to many creative solutions that the scientist would not consider. Initially this might seem like a disadvantage, since it seems like an admission of the undisciplined nature of philosophical investigation in comparison to the fairly rigid discipline of the natural sciences. However, it also has a significant strength: Blaga points out that because of the very specific aria and interior horizon of a scientific investigation, science methodologically anticipates its solutions much more than philosophy does. Hence philosophy is more open to discovering the unexpected.

3. Epistemology

This discussion of the difference between philosophy and science draws on several ideas that Blaga elaborates in his epistemology. His primary works on this subject are the three books that comprise his “Trilogy of Knowledge”: Eonul dogmatic (The Dogmatic Age), Cunoasterea luciferica (Luciferic Cognition), and Cenzura transcendenta (Transcendent Censorship). These books analyze, with surprising lucidity, the entire range of modes of human cognition.

Blaga analyzes cognition into seven possible modes:

  1. Positive-adequate cognition
  2. Quasi-cognition
  3. Negative cognition
  4. Cognition that is part positive-adequate and part quasi-cognition
  5. Cognition that is part positive-adequate and part negative cognition
  6. Cognition that is part positive-adequate and part quasi-cognition and part negative cognition
  7. Cognition that is part quasi-cognition and part negative cognition

He discusses each of these modes of cognition, but only the second and the seventh are actually realized by humans. The second, quasi-cognition, can be further analyzed into concrete cognition, paradisaic cognition, mythic cognition, and occult cognition. The seventh, which Blaga names “luciferic cognition,” can be analyzed into three subdivisions: plus-cognition, zero-cognition, and minus-cognition. His discussion of the subdivisions of luciferic cognition, which is the subject of his book by that name, is perhaps the most original and most interesting part of his epistemology.

Blaga remarks that the possible forms of cognition can be analyzed into three types: (1) cognition that is positive-adequate and unlimited, (2) cognition that is censured but in principle unlimited, and (3) cognition that is positive-adequate but strictly limited. The first type would pertain only to a deity. The third type pertains to simple organisms and enables them to perform functions such as the replication of lost appendages. Human cognition falls within the second type, since human cognition is potentially unlimited in extent but strictly limited qualitatively, because it never completely captures its object and never perfectly corresponds to reality.

Mystery, and the cognitive limits that produce mystery, are central to Blaga’s epistemology (and perhaps to his whole philosophy). The reason for these limits is explained in his metaphysics, and the means are explained in his philosophy of culture.

There are at least five important features of Blaga’s epistemology that are innovative, to a greater or lesser degree, and that are significant epistemological contributions. The first of these is the placing of his epistemology within a complementary and explicatory metaphysical system, which will be discussed presently. This metaphysical speculation provides answers to such epistemologically relevant questions as: what are the material, efficient, and final causes of the human epistemological situation; why this situation pertains; how it was implemented; and how it is preserved. A second important feature is Blaga’s emphasis on the important role played in human cognition by culture. According to Blaga, even categories of understanding are culturally affected. He argues that there are both subjective and beautifully creative aspects of human cognition, and also that human cognition is not thwarted by its historicity but is rather empowered by it.

The third and fourth important contributions of Blaga’s epistemology are his analysis of the two types of cognition that he calls “luciferic cognition” and “minus-cognition.” Minus-cognition is a subset of luciferic cognition. Blaga devoted an entire book of his epistemological trilogy to minus-cognition, and another is largely devoted to luciferic cognition. In his elucidation of these two types of cognition, Blaga uncovers methods of problem solving that have heretofore been largely overlooked in Western epistemology.

A fifth important aspect of Blaga’s epistemology is its constructivism. Constructivism, the view that knowledge is a human construct, is a ubiquitous element of Blaga’s philosophy. It was seen in the previous elaboration of Blaga’s philosophy of philosophy, it is reflected in his epistemology, it will be seen in his freely creative metaphysics, and it can also be seen in his philosophy of culture and philosophy of religion.

There have been numerous other constructivist philosophers, ranging from Kant through Nietzsche to 20th century figures such as Jean Piaget and Ernst van Glasersfeld. Nonetheless, there are several important things about Blaga’s constructivism that make it particularly noteworthy. The first of these is how neatly and consistently constructivism fits within the larger philosophical picture that Blaga paints. His philosophical system gives constructivism a context, an explanation and a purpose that are sometimes lacking in other constructivist philosophies. A second noteworthy aspect of his constructivism is that it is argued for in a wide variety of cognitive contexts: Blaga shows that human thought is constructivist whether it occurs in math, the natural sciences, philosophy, theology, the arts, or in any other cognitive context. A third important aspect is how his constructivism is argued: he does not cease being a constructivist when he argues for his own philosophical system. He views his own system as merely a possible thesis supported (but not proved) by evidence and pragmatic utility.

More should be said about luciferic cognition, since this is one of the key insights of Blaga’s epistemology. Paradisaic cognition (what could be considered “ordinary cognition”) attempts to quantitatively reduce the mysteries of existence. Its progress is linear, adding new facts to the existent body of knowledge. Luciferic cognition, on the other hand, seeks to qualitatively reduce mystery through attenuation or, if that is not possible, through permanentization or intensification of the mysterious. Its progress is inward, deepening and intensifying knowledge of the hidden essence of the cognitive object. Every step of progress leads to another step, ad infinitum, and thus luciferic cognition is never totally successful in grasping its object, but it is successful in providing new understanding of the layers and aspects of the mystery of its being. Luciferic cognition does not exceed the inherent limits of human cognition, but it does explore those limits and press cognition to its fullest extent.

Luciferic cognition is dependent on paradisaic cognition for its starting point, the empirical, conceptual, or imaginary data that Blaga calls “phanic material.” It then “provokes a crisis” in the phanic material through bringing out the mysteries inherent in the object. Whereas paradisaic cognition views objects of cognition as “given,” luciferic cognition views them as partly given, but also partly hidden. Whereas paradisaic cognition is subject to the “illusion of adequacy”—the mistaken belief that the object is as it is perceived to be or, more precisely, the mistaken belief that paradisaic cognition is able to grasp the object as it really is—luciferic cognition begins with the removal of this illusion. An investigation that stops at the mere defining of an object as it is “given” overlooks potentially multitudinous other facets of knowledge about the object.

The distinction between the object of paradisaic cognition and the object of luciferic cognition bears a resemblance to Kant’s phenomena-noumena distinction, but has several important differences. One significant difference is that the Kantian noumenon is an object that is one single mystery; the luciferic object is a long series of latent mysteries. An even more important difference is that whereas the Kantian noumenon is not available to human cognition, Blaga’s luciferic objects are available to luciferic cognition (but are not cognized in the same way as objects are cognized in paradisaic cognition).

Blaga’s analysis of the process of luciferic cognition is fairly detailed and very interesting. It includes ideas that mirror more recent developments in the philosophy of science, such as his concepts of “theory idea,” “directed observation,” “categorical dislocation,” and “Copernican inversion of the object.” His concept of “theory capacity” resembles ideas suggested by American Pragmatists.

Blaga weighs in on the issues of truth and verification, which have largely dominated discussions in recent analytic epistemology. He accepts coherence as a necessary but not sufficient condition of truthfulness, and demonstration of correspondence as a sufficient but not necessary condition. He suggests that the best way to verify the truthfulness of a theory is pragmatic: put it into practice. His own philosophy is consistent with these views: it is a cohesive system, but he does not appeal to this cohesiveness as the grounds for believing it true. Rather he appeals to its ability to facilitate the resolution of philosophical problems.

4. Metaphysics

Blaga’s metaphysic is contained in the three books that form his “Cosmological Trilogy”: Diferentialele divine, (Divine Differentials), Aspecte anthropologice (Anthropological Aspects), and Fiinta istorica (The Historical Being). It is centered on a cosmology that is specifically intended to complement his epistemology. It is sometimes characterized as a “speculative” metaphysic rather than an empirical one. Blaga is firmly persuaded that metaphysical theories are not (and cannot be) proven empirically, although he accords a significant role to experience in the testing of such theories. His method resembles the hypothetico-deductive method wherein a theoretically possible solution to a problem is granted as a working hypothesis, and then the consequences of this hypothesis are deduced in order to determine if they are compatible with generally accepted theory. If they are, then the hypothesis stands as provisionally vindicated, despite the fact that the hypothesis itself has not been directly verified empirically.

Blaga states that metaphysical starting points are presupposed at the outset of the metaphysical investigation and are only subsequently justified by their ability to organize data and to “construct a world.” In elucidating his metaphysical vision, Blaga proposes a cluster of premises that are essential to the system he intends to promulgate. He then elaborates how these form the basis of a system that provides, or enables, the resolution of certain important problems heretofore not satisfactorily resolved by other metaphysical systems.

It is widely acknowledged that Blaga accepts and works within a sort of neo-Kantian Idealism, wherein the actual existence of an external world is accepted as a necessary metaphysical corollary even though an external world is not directly knowable epistemologically. If doing metaphysics would be defined along realist lines, as an accurate description of “noumenal” reality, then Blaga would not be able to do metaphysics, since according to his epistemology humanity cannot have perfect knowledge of objects of cognition. However, because Blaga views metaphysics as a creative and pragmatically justified endeavor that attempts to reach beyond the empirical and provide a theoretical explanation for all of existence, metaphysics is possible.

One of the first issues addressed in Blaga’s metaphysical writings is the question of the origin of the cosmos. It is conceivable that the cosmos has no origin, and that it has always existed. Alternatively, it is possible that the cosmos has a specific origin. Blaga opts for the latter view because it “enormously facilitates approaching cosmological problems” and is therefore to be preferred. Based on this pragmatic justification, he proceeds to construct his metaphysics around a postulated beginning and source of the world.

That both the origin and the source of the cosmos are unknown is admitted by Blaga. Therefore one of the ways that he refers to the source is “The Anonymous Fund.” Theoretically, the cosmos could be a result of one or more creative acts of this Fund acting upon external preexistent sources; it could be an emanation from the Fund; or it could even be a reproduction of the Fund. Blaga rejects the possibility of creation using sources outside of the Fund, presumably because this would entail the existence of a cosmos that precedes the creation of the present cosmos, introducing a regress that thwarts the solving of the problems that Blaga is addressing. He also rejects the possibility of creation ex nihilo. Blaga opts for a theory of emanation similar to that proposed by Plotinus, an emanation wherein the Fund reproduces itself endlessly without diminishing itself in any way.

Blaga proposes that the Fund be viewed as possessing, due to its own “fullness,” the capacity of infinite self-replication. It controls its reproduction so that it will not destabilize existence. (Blaga grants that this sort of talk is necessarily metaphorical.) But it is the nature of the Fund to create/reproduce (this is inherent in the meaning of “fund”); therefore it allows itself to reproduce, but only in a specific mode that assures the longevity and greatest success of its reproductive acts. This controlled reproduction is the best compromise between the Fund’s capacity for replication and the necessity of safeguarding the centrality of existence. Had such precautions not been taken, the result of the Fund’s creative capacity would be a series of competing funds rather than the present world. What is remarkable, according to Blaga, is not so much that the present world exists, but that a series of competing funds does not exist. The present world is a result of the Fund’s own self-limitation, of the partial thwarting of the Fund’s natural creativity.

The form that the controlled reproduction of the Anonymous Fund takes is that of creation through “differentials”. Differentials are minute particles emanated from the Fund that exactly replicate minute aspects of the Fund and are emanated in endless numbers. The differentials have a natural propensity to combine with each other, forming new subcreations. The most central differentials are withheld from emanation in order to prevent the recombination of differentials into a copy of the Anonymous Fund. The recombination of emitted differentials has created the present world in its ever-changing forms.

This schema depicts the origin of the world as taking place in three phases: (1) the operation of limiting the generative possibilities of the Anonymous Fund, (2) the emission of differentials, and (3) the combination of differentials, creating more complex beings through integration. The schema also depicts the creation of the world as being based upon two fundamental factors, the Anonymous Fund’s reproductive potential and its success in directing this potential into creating in a manner that preserves its own hegemony as metaphysical center of the universe.

Integration of the differentials is a natural result of the fact that the differentials are, in their structure, particles of one integrated whole. But integration does not occur on the basis of a perfect match between differentials. If it did, there would only be one line of integration that would result in only one type of created being. Integration takes place on the basis of a merely sufficient match between differentials. This allows for a vast number of different integrations, which explains such empirical phenomena as the existence of sometimes similar, sometimes identical or parallel features in entities that belong to different kingdoms, classes, phyla, and species.

Blaga offers the following empirical analysis in support of his theory that the world is composed of differentials. Upon close inspection, it can be observed that all empirical existents display at least three types of discontinuity: (1) Structural discontinuity: some existents are very simple structurally, others are very complex. (2) Intrinsic discontinuity: existents are at the same time independent and interdependent. (3) Discontinuity of repetition: groups of existents of the same type are composed of individuals. These phenomena are explained by the existence of discontinuity in the very heart of the empirical world. This fundamental discontinuity is a result of the empirical world being composed of a multitude of diverse differentials, variously integrated and organized. Furthermore, Blaga argues that two lines of empirical proof show that creation takes place through something akin to differentials: (1) The widespread consistency of certain structures and the equally widespread variability of others indicates that at the base of existence there is a discontinuity of elements that are capable of a variety of different combinations. (2) The presence of similar or identical features in entities that are otherwise very different from each other likewise indicates that existence is composed of a variety of elements capable of forming a variety of combinations.

Blaga discusses how the Anonymous Fund concept differs from and is similar to the theistic conception of God. Both are conceived as being the source of all else, the most central of all existents, and the greatest existent, to the extent that their own existence surpasses all others in both extent and in quality. However, Blaga states that he hesitates to use the term God to refer to his conception of the central metaphysical entity both because there are significant differences between his own conception and that of traditional theology, and because he believes it impossible to know whether the attributes usually ascribed to God apply to the Anonymous Fund. He grants that the term “God” could be used as a synonym for the Anonymous Fund, since according to his metaphysics there is nothing in existence that is more central than the Anonymous Fund. But Blaga will not even affirm that the Fund is a being in the usual sense, saying rather that conceiving it thus is merely a “crutch” used by the understanding.

Blaga sees the product of self-reproduction of the Anonymous Fund as necessarily differentiated from the Fund itself in order to preserve the order of the cosmos. This differentiation is accomplished by the Fund in order to achieve a specific goal. The goal and benefits of differentiated creation include: (1) facilitation of the fulfillment of the Fund’s generative nature, (2) the avoidance of genesis of innumerable identical “hypostases,” and (3) the avoidance of genesis of complex, indivisible, and indestructible existents that would have too great an autarchic potential, (4) the generation of complex existents that do not infringe upon numbers 2 and 3 above, (5) the genesis of an immense variety of existents and beings, (6) a proportioning of existents between those that are simple and those that are more complex, and (7) the generation of beings with cognitive capacity while at the same time censoring that capacity so as to protect both the beings and the order of the universe. Blaga believes that his proposal shows that the Anonymous Fund has employed a means of genesis that achieves a maximum number of advantages through the employment of a minimum number of measures.

He states that the existence of dis-analogy between Creator and creation is paradoxical. It is paradoxical because the expected result of an Anonymous Fund as postulated by Blaga would be the production of other entities like itself, the production of identical self-replications. Blaga finds it surprising but empirically evident that this self-replication does not take place. The explanation for this surprising nonoccurrence is the necessity of thwarting “theo-geneses” in order to preserve the necessary order of existence. Thus the apparent paradox is only an initial paradox, which is seen to be resolvable through the means of minus-cognition (as discussed in Blaga’s epistemology).

In Blaga’s metaphysics there are two important measures employed by the Anonymous Fund in preservation of cosmic equilibrium. One of these has already been discussed: differentiated creation. The other is transcendent censorship. While many metaphysicians have struggled with the question “What is the nature of existence?” and many epistemologists have struggled with “What are the methods of knowledge?” relatively few have sought to answer the question “What is it that impedes our answering of these fundamental questions?” Blaga insightfully observes that this “prohibitive factor” is one of the factors of existence that philosophy has yet to reckon with.

Blaga proposes that ultimate questions are difficult to answer, and in some sense unanswerable, because in addition to the ontological limit that the Anonymous Fund has imposed upon creation (through the means of differentiated creation), the Fund has also imposed a cognitive limit on creation. This was done at the time of the creation of the cosmos, and is now an inherent aspect, affecting all modes of cognition. Blaga refers to this limit as “transcendent censorship.” This censorship is accomplished via a network of factors, including obligatory epistemic reliance on the concrete, the intervention of cognitive structures, the resulting “dissimulation of the transcendent,” and “the illusion of adequacy.” Transcendent censorship not only prevents humans from having positive-adequate knowledge of the mysteries of existence, it prevents them from having “positive-adequate” knowledge of any object of cognition whatsoever. Blaga points out that this view has an interesting difference from the Kantian/neo-Kantian view. In Kant’s epistemology, existence is passive in the cognitive event; according to Blaga’s theory, existence is active in preventing itself from being known.

According to Blaga, the result of transcendent censorship is that all human knowledge is either dissimulation (in which objects of cognition are represented as being other than they really are), or negative cognition (in which antinomian elements of a cognitive problem are reconciled through the employment of a heuristic “theory idea,” which leads to a deepened understanding of the problem without resulting in its complete elimination), or a combination of these. This does not indicate that Blaga is a skeptic, however. Even the “mysteries” of existence are approachable through the strategy that Blaga names “luciferic cognition,” although they are not actually reachable.

The reasons that the Anonymous Fund would impose censorship upon its creation are similar to the reasons for its dissimulation of creation. Blaga lists the following four reasons for transcendent censorship: (1) Human possession of perfect knowledge would upset the equilibrium of existence by bestowing perfection on limited beings. (2) Human possession of perfect knowledge might threaten the benign governance of the universe by introducing the possibility of a human cognitive rival to the Anonymous Fund. (3) Possession of absolute knowledge would ossify the human spirit, quenching human creativity. (4) Censorship spurs human creativity and exertion, giving humanity its raison d’etre. To this list could be added the explanation that human creativity is one indirect outlet of the creativity of the Anonymous Fund, and anything that lessens human creativity is an attack on the creative intentions of the Fund.

The responsibility for the human inability to arrive at an absolute understanding of existence therefore rests squarely upon the Anonymous Fund, for benevolent reasons. This is in striking contrast to the philosophical system of Descartes, wherein God’s righteousness and benevolence are made the foundation of all sure knowledge. In Blaga’s system, the benevolence and wisdom of the Anonymous Fund result in the prevention of sure knowledge.

It is clear that Blaga’s metaphysical system can say relatively little about the actual structure of the universe, because according to this system such knowledge is structurally excluded from human cognition. Blaga’s system does allow for metaphysical postulation, however, and these postulates can be supported or substantiated by experience and by pragmatic arguments. Thus Blaga justifies himself in asserting that the cosmos has a center, and that this center is the Anonymous Fund.

Blaga does take stands on several of the standard issues of cosmology. He rejects both naive realism and subjective idealism (a la Berkeley), opting for a neo-Kantian critical realism. With regards to the monism-pluralism controversy, Blaga is clearly a pluralist. While the cosmos is a result of one single entity, and is composed of pieces emitted from that one entity, these pieces (the differentials) are separate, distinct entities in their own right. They are permanent and unchanging, and are the building blocks of all else that exists. Although Blaga is a realist, he is not a materialist. Differentials are not material, but rather are submaterial. They underlie all material existents, but underlie nonmaterial realities as well.

According to Blaga, humanity is, in a sense, the very pinnacle of the Anonymous Fund’s creation, because the human consciousness is the most complex organization of differentials it has permitted. There is also another sense in which humanity is the pinnacle of creation: more than any other complex created existent, humanity has the ability to further the Fund’s own creative activity. Humans are naturally creative, and their creations can be viewed as secondary creations of the Anonymous Fund.

Human existence is characterized by two modes of existence, the “paradisaic” mode, which is the normal state of life in the world, and the “luciferic” mode, which is life lived in the presence of mystery and for the purpose of “revealing” it (grappling with it; trying to make it understandable). The latter mode results in an “ontological mutation” that is unique in the universe and essential to full humanness. “Mystery” is a result of the protective limits imposed on creation by the Anonymous Fund (transcendent censorship and the discontinuity between creator and creation). Through these means the Fund bestows upon humanity a destiny and a purpose in life: humanity’s purpose is to create; its destiny is to strive (through creating) to reveal the mysteries of existence.

Since the mysteries of existence are not ultimately fathomable by humans, humanity is doomed to a continual striving to reveal them, sometimes experiencing partial successes, but never reaching the ultimate goal. In Blaga’s vision, human history becomes an endless, permanent creative state, never reaching its goal, but never exhausting its source of motivation and meaningfulness either. Through this artifice the Anonymous Fund gives to humanity a goal, a purpose, and gives it the unique historicity that makes it so culturally rich. Thus historicity is one of the greatest dimensions of human existence. It is seen to be a vital aspect of “luciferic” humanness. Likewise, the “principle of conservation of mystery,” which was made part of the very fiber of existence in order to preserve the centrality of the Anonymous Fund from the ambitions of created beings, is seen to be one of the chief metaphysical conditions of the historicity of humanity.

This description highlights the two opposing components that shape human history: the inner human desire to creatively reveal mystery, and the necessity of the Anonymous Fund to thwart this desire. This desire and its lack of fulfillment are here seen as essential both to the historicity of humanity and to full humanness, since they provide the peaks and valleys of failed attempts and renewed aspirations toward the absolute of which human history is composed.

The human inability to have absolute knowledge is often viewed as a failure, shortcoming, or handicap. Blaga reverses this evaluation, making human subjectivity and relativity essential to humanness and the glory of the human situation: according to Blaga, these factors give humanity its role and place in a great ontological scheme. Humans are not the deplorable victims of their own limits that they are sometimes supposed to be; rather, they are the servants of a system that is so great it surpasses them.

The final question of cosmology might be, “Is there anything beyond this cosmos?” Transcendent censorship does not prevent Blaga from having an answer to this question. All that exists is either one of two things: a structure of differentials emitted by the Anonymous Fund, or the Anonymous Fund itself. The cosmos is composed of differentials, as discussed above. The Fund, on the other hand, is not composed of differentials. Therefore the Anonymous Fund is not part of the cosmos. The answer to this question, then, is that there is something “beyond” the cosmos, but only one thing: the Anonymous Fund.

5. Other Philosophical Issues

From this sketch of Blaga’s epistemology and metaphysics one can infer the major themes and directions of the rest of his philosophy. His philosophy of culture elevates culture to the pinnacle of human existence. Culture is a product of the human drive to creatively “reveal” the mysteries of reality and the thwarting of this drive by the Anonymous Fund. Its creativity is an extension of the creativity of the Fund itself, and lends beauty and meaning to human existence. In his philosophy of history he describes humans as historical beings who derive their meaningfulness through living in the face of the unknown, wrestling with it, and conquering it, only to find that it rises up again, providing ever new mountains to climb. Human history is a record of a long series of only partially successful attempts to master our world. (It can be seen that history and culture are closely related, both as subjects and as themes in Blaga’s oeuvre.)

Blaga’s attitude towards science is fairly remarkable considering the positivistic philosophical currents of the early 20th century. He has a great appreciation for science as a cognitive approach that combines both paradisaic and luciferic cognition. As can be anticipated, his application of his epistemology to science leads to insights similar to those of Michael Polanyi and Thomas Kuhn. His is not a critique of science, per se, but rather an explanation of how science operates, locating it within the range of human creative endeavors that aim at revealing the mysteries of existence. As such, science, like art, religion, and all other human cognitive enterprises can be successful, but only within the boundaries postulated by Blaga’s metaphysics.

Blaga views religion as yet another attempt to penetrate the mysteries of existence. Religion is perhaps the grandest of all such attempts, since it aims the highest, but it is also the most doomed, since its reach far outstretches its grasp. He argues that religion is a cultural production, which is a position that caused great animosity toward him on the part of some contemporary Romanian theologians. However, an implication of his metaphysic is that religion is a response to the transcendent Anonymous Fund, a position that is fairly harmonious with Romanian Orthodox theology. And Blaga admits that his “Anonymous Fund” could also be termed “God,” though he explains that he avoids using this term because it carries with it so much baggage.

Blaga’s primary books on philosophy of culture are the three that comprise his “Trilogy of Culture”: Orizont si stil (Horizon and Style), Spatiul mioritic (The Ewe-Space), and Geneza metaforei si sensul culturii (The Genesis of Metaphor and the Meaning of Culture). His philosophy of history is exposited together with his philosophical anthropology in Aspecte antropologice (Anthropological Aspects) and Fiinta istorica (The Historical Being), two of the books making up his “Cosmological Trilogy.” He addresses issues in the philosophy of science in Experimentul si spiritual mathematic (Experiment and the Mathematical Spirit) and Stiinta si creatie (Science and Creation). His theory of aesthetics is outlined in Arta si valoare (Art and Value), and his philosophy of religion is explained in Gandire magica si religiei (Magical Thinking and Religion) and in a set of lecture notes that were posthumously published as Curs de filosofia religiei (Curse on the Philosophy of Religion). 

6. References and Further Reading

Blaga authored many books and articles. Unfortunately, while all of his poetry and some of his theater is available in English, his philosophy remains to be translated. An anthology of fragments, which have been translated into English, with some secondary sources in English as well, is available on CD from Richard T. Allen.

a. Primary Sources

i. English

  • Blaga, Lucian, and Andrei Codrescu (trans.). At the Court of Yearning. Columbus, OH: Ohio State University Press, 1989.
  • Blaga, Lucian, and Brenda Walker (trans.). Complete Poetical Works of Lucian Blaga. Iasi, RO, Oxford, GB, and Portland, USA: Center for Romanian Studies, 2001.
  • Blaga, Lucian, and Doris Planus-Runey (trans.). Zalmoxis: Obscure Pagan. Iasi, RO, Oxford, GB, and Portland, USA: Center for Romanian Studies, 2000.
  • Interestingly, some of Blaga’s poetry, accompanied by music and visuals, is also available on YouTube.

ii. Romanian

  • Eonul dogmatic (The Dogmatic Age). Bucharest: Cartea Romaneasca, 1931.
  • Cunoasterea luciferica (Luciferic Knowledge). Sibiu: Tiparul Institutului de arte grafice “Dacia Traiana,” 1933.
  • Cenzura transcendenta: Incercare metafizica. (Transcendent Censorship: A Metaphysical Attempt). Bucharest: Cartea Româneasca, 1934.
  • Orizont si stil (Horizon and Style). Bucharest: Fundatia pentru literatura si arta “Regele Carol II,” 1935.
  • Spatiul mioritic (The Ewe-Space). Bucharest: Cartea Româneasca, 1936.
  • Geneza metaforei si sensul culturii (The Genesis of Metaphor and the Meaning of Culture). Bucharest: Fundatia pentru literatura si arta “Regele Carol II,” 1937.
  • Arta Si valoare (Art and Value). Bucharest: Fundatia pentru literatura si arta “Regele Carol II,” 1939.
  • Diferentialele divine (The Divine Differentials). Bucharest: Fundatia pentru literatura si arta “Regele Carol II,” 1940.
  • Despre gandirea magica (Concerning Magical Thinking). Bucharest: Fundatia regala pentru literatura si arta, 1941.
  • Religie si spirit (Religion and Spirit). Sibiu: Editura “Dacia Traiana,” 1942.
  • Stiinta si creatie (Science and Creation). Sibiu: Editura “Dacia Traiana,” 1942.
  • Despre constiinta filosofica (Concerning Philosophical Consciousness). Cluj-Napoca: Lito-Schildkraut, 1947.
  • Aspecte antropologice (Anthropological Aspects). Cluj-Napoca: Uniunea nationala a studentilor din Romania, Centrul studentesc Cluj, 1948.
  • Experimentul si spiritul matematic (Experiment and the Mathematical Spirit). Bucharest: Editura Stiintifica, 1969.
  • Fiinta istorica (The Historical Being). Cluj-Napoca: Editura Dacia, 1977.

iii. Other Languages

  • Orizzonte e stile, ed. Antonio Banfi. Milano: Minuziano Editore, 1946.
  • Zum Wesen der rumanischen Volkseele, ed. Mircea Flonta. Bucharest: Editura Minerva, 1982.
  • L’Eon dogmatique, L’Age d’Homme, trans. Jessie Marin, Raoul Marin, Mariana Danesco, and Georges Danesco. Lausanne: Editions l’Age d’Home, 1988.
  • L’Eloge du village roumain, ed. Jessie Marin and Raoul Marin. Paris: Librairie du Savoir, 1989.
  • L’Etre historique, trans. Mariana Danesco. Paris: Librairie du Savoir, 1990.
  • Les Differentielles divines, trans. Thomas Bazin, Raoul Marin, and Georges Danesco. Paris: Librairie du Savoir, 1990.
  • La trilogie de la connaissance, trans. Raul Marin and Georges Piscoci-Danescu. Paris: Librairie du Savoir, 1992.
  • Trilogia della cultura: Lo spazio mioritico, trans. Ricardo Busetto and Marco Cugno. Alessandria, IT: Editionni dell’Orso, 1994.

b. Secondary Sources in English

  • Bejan, Cristina. “The Paradox of the Young Generation in Inter-war Romania,” Slovo, 18:2 (Autumn 2006): 115-128.
  • Botez, Angela. “Comparativist and Valuational Reflections on Blaga’s Philosophy,” Revue Roumaine de Philosophie et Logique 40 (1996): 153–62.
  • Botez, Angela. “Lucian Blaga and the Complementary Spiritual Paradigm of the 20th Century,” Revue Roumaine de Philosophie et Logique 37 (1993): 51–55.
  • Botez, Angela. “The Postmodern Antirepresentationalism (Polanyi, Blaga, Rorty),” Revue Roumaine de Philosophie et Logique 41 (1997): 59–70.
  • Eliade, Mircea. “Rumanian Philosophy,” in Encyclopedia of Philosophy, ed. Paul Edwards (New York: Macmillan and the Free Press, 1967).
  • Flonta, Mircea. “Blaga, Lucian.” In Routledge Encyclopedia of Philosophy Online, edited by E. Craig. London: Routledge, 2004, (accessed January 3, 2006).
  • Hitchins, Keith. “Introduction to Complete Poetical Works of Lucian Blaga, translated by Brenda Walker, 23–48. Iasi, RO, Oxford, Portland, USA: Center for Romanian Studies, 2001.
  • Isac, Ionut. “Considerations on some Historical and Contemporary Issues in Lucian Blagas Metaphysics,” Journal for the Study of Religions and Ideologies 7:19, spring 2008, 184-202.
  • Jones, Michael S. “Culture and Interreligious Understanding according to the Romanian Philosopher Lucian Blaga,” Journal of Ecumenical Studies 45: 1, winter 2010, 97-112.
  • Jones, Michael S. “Culture as Religion and Religion as Culture in the Philosophy of Lucian Blaga,” Journal for the Study of Religions and Ideologies no. 15, winter 2006, 66-87.
  • Jones, Michael S. “The Metaphysics of Religion: Lucian Blaga and contemporary philosophy.” Teaneck and Madison, NJ: Fairleigh Dickenson University Press, 2006.
  • Munteanu, Bazil. “Lucian Blaga, Metaphysician of Mystery and Philosopher of Culture,” Revue Roumaine de Philosophie et Logique 39 (1995): 43–46.
  • Nemoianu, Virgil. “Mihai Sora and the Traditions of Romanian Philosophy,” Review of Metaphysics 43 (March 1990): 591–605.
  • Nemoianu, Virgil. “A Theory of the Secondary: Literature, Progress, and Reaction.” Baltimore: Johns Hopkins University Press, 1989, 153 –170.

Author Information

Michael S. Jones
Email: msjones2@liberty.edu
Liberty University
U. S. A.

Manbendra Nath Roy (1887—1954)

M. N. RoyM. N. Roy was a twentieth century Indian philosopher. He began his career as a militant political activist and left India in 1915 in search of arms for organizing an insurrection against British rule in India. However, Roy’s attempts to secure arms ended in a failure, and finally in June 1916, he landed in San Francisco, California. It was there that Roy, who was then known as Narendra Nath Bhattacharya, changed his name to Manbendra Nath Roy. Roy developed friendships with several American radicals, and frequented the New York Public Library. He began a systematic study of socialism, originally with the intention of combating it, but he soon discovered that he had himself become a socialist! Roy met Lenin in Moscow in 1920, and went on to become an international ranking communist leader. Nevertheless, in September 1929 he was expelled from the Communist International for various reasons. He returned to India in December 1930 and was sentenced to six years imprisonment for his role in the Kanpur Communist Conspiracy Case.

Roy’s real philosophical quest began during his prison years which he decided to use for writing a systematic study of ‘the philosophical consequences of modern science’, which would be a re-examination and re-formulation of Marxism to which he subscribed since 1919. The reflections, which Roy wrote in jail over a period of five years, grew into nine rigorous volumes. The ‘Prison Manuscripts’ have not yet been published in their totality, and are currently preserved in the Nehru Memorial Museum and Library Archives in New Delhi. However, selected portions from the manuscript were published as separate books in the 1930s and the 1940s.

In his philosophical works, Roy has made a clear distinction between philosophy and religion. According to Roy, no philosophical advancement is possible unless we get rid of orthodox religious ideas and theological dogmas. On the other hand, Roy has envisaged a very close relationship between philosophy and science. Moreover, Roy has given a central place to intellectual and philosophical revolution in his philosophy. Roy maintained that a philosophical revolution must precede a social revolution. Besides, Roy has, in the tradition of eighteenth century French materialist Holbach, revised and restated materialism in the light of twentieth century scientific developments. In the context of Indian philosophy, Roy could be placed in the tradition of ancient Indian materialism—both Lokayata and Carvaka.

Table of Contents

  1. Life
    1. Militant Nationalist Phase: In Search of Arms
    2. Towards Communism
    3. Return to India: Prison Years
    4. Beyond Communism: Towards New Humanism
    5. Final years
  2. Writings
    1. Autobiographical
    2. Philosophical
  3. Roy’s Concept of Philosophy
    1. Philosophy and Metaphysics
    2. Philosophy and Religion
    3. Philosophy and Science
  4. Roy’s New Humanism: Twenty-Two Theses on Radical Democracy
    1. Basic Tenets of New Humanism
    2. Humanist Interpretation of History
    3. Inadequacies of Communism
    4. Shortcomings of Formal Parliamentary Democracy
    5. Radical Democracy
  5. Philosophical Revolution or Renaissance
  6. Roy’s Materialism or Physical Realism
    1. Roy’s Materialism
    2. Roy’s Materialism and Traditional Materialism
      1. Change in the Concept of “Matter”
      2. Revision of Physical Determinism
      3. Objective Reality of Ideas and the Autonomy of the Mental World
      4. Emphasis on Ethics
    3. Roy’s Materialism and Marxian Materialism
      1. Delinking of Dialectics and Materialism
      2. Rejection of Historical Materialism
      3. Emphasis on Ethics
    4. Roy and Lokayata
  7. Roy’s Intellectual Legacy
  8. References and Further Reading
    1. Primary Sources
    2. Secondary Sources

1. Life

 

M. N. Roy’s original name was Narendra Nath Bhattacharya. He was born on 21 March 1887, at Arbalia, a village in 24 Parganas district in Bengal. His father, Dinabandhu Bhattacharya, was head pandit of a local school. His mother’s name was Basanta Kumari.

a. Militant Nationalist Phase: In Search of Arms

 

Roy began his political career as a militant nationalist at the age of 14, when he was still a student. He joined an underground organization called Anushilan Samiti, and when it was banned, he helped in organizing Jugantar Group under the leadership of Jatin Mukherji. In 1915, after the beginning of the First World War, Roy left India for Java in search of arms for organizing an insurrection to overthrow the British rule in India. From then on, he moved from country to country, using fake passports and different names in his attempt to secure German arms. Finally, after wandering through Malay, Indonesia, Indo-China, Philippines, Japan, Korea and China, in June 1916, he landed at San Francisco in United States of America.

Roy’s attempts to secure arms ended in a failure. The Police repression had shattered the underground organization that Roy had left behind. He had also come to know about the death of his leader, Jatin Mukherji, in an encounter with police.

b. Towards Communism

 

The news of Roy’s arrival at San Francisco was somehow published in a local daily, forcing Roy to flee south to Palo Alto, California near Stanford University. It was here that Roy, until then known as Narendra Nath Bhattacharya or Naren, changed his name to Manbendra Nath Roy. This change of name on the campus of Stanford University enabled Roy to turn his back on a futile past and look forward to a new life of adventures and achievements.

Roy’s host at Palo Alto introduced him to Evelyn Trent, a graduate student at Stanford University. Evelyn Trent, who later married Roy, became his political collaborator. She accompanied him to Mexico and Russia and was of great help to him in his political and literary work. The collaboration continued until they separated in 1929.

At New York, where he went from Palo Alto, Roy met Lala Lajpat Rai, the well-known nationalist leader of India. He developed friendships with several American radicals, and frequented the New York Public Library. Roy also went to public meetings with Lajpat Rai. Questions asked by the working class audience in these meetings made Roy wonder whether exploitation and poverty would cease in India with the attainment of independence. Roy began a systematic study of socialism, originally with the intention of combating it, but he soon discovered that he had himself become a socialist! In the beginning, nurtured as he was on Bankimchandra, Vivekanand and orthodox Hindu philosophy, Roy accepted socialism except its materialist philosophy.

Later in Mexico in 1919, Roy met Michael Borodin, an emissary of the Communist International. Roy and Borodin quickly became friends, and it was because of long discussions with Borodin that Roy accepted the materialist philosophy and became a full-fledged communist.

In 1920, Roy was invited to Moscow to attend the second conference of the Communist International. Roy had several meetings with Lenin before the Conference. He differed with Lenin on the role of the local bourgeoisie in nationalist movements. On Lenin’s recommendation, the supplementary thesis on the subject prepared by Roy was adopted along with Lenin’s thesis by the second conference of the Communist International. The following years witnessed Roy’s rapid rise in the international communist hierarchy. By the end of 1926, Roy was elected as a member of all the four official policy making bodies of the Comintern – the presidium, the political secretariat, the executive committee and the world congress.

In 1927, Roy was sent to China as a representative of the Communist International. However, Roy’s mission in China ended in a failure. On his return to Moscow from China, Roy found himself in official disfavor. In September 1929, he was expelled from the Communist International for “contributing to the Brandler press and supporting the Brandler organizations.” Roy felt that he was expelled from the Comintern mainly because of his “claim to the right of independent thinking.” (Ray 1987)

c. Return to India: Prison Years

 

 

Roy returned to India in December 1930. He was arrested in July 1931 and tried for his role in the Kanpur Communist Conspiracy Case. He was sentenced to six years imprisonment.

When Roy returned to India, he was still a full-fledged communist, though he had broken from the Comintern. The forced confinement in jail gave him more time than before for systematic study and reflection. His friends in Germany, especially his future wife, Ellen Gottschalk, kept providing him books, which he wanted to read. Roy had planned to use his prison years for writing a systematic study of ‘the philosophical consequences of modern science’, which would be in a way a re-examination and re-formulation of Marxism to which he had been committed since 1919. The reflections, which Roy wrote down in jail, grew over a period of five years into nine thick volumes (approximately over 3000 lined foolscap-size pages). The ‘Prison Manuscripts’ have not so far been published in their totality, and are currently preserved in the Nehru Memorial Museum and Library Archives in New Delhi. However, selected portions from the manuscript were published as separate books in the 1930s and the 1940s. These writings show that Roy was not satisfied with a primarily economic explanation of historical processes. He studied and tried to assess the role of cultural and ideational factors in traditional and contemporary India, in the rise and expansion of Islam, and in the phenomenon of fascism. He was particularly severe on the obscurantist professions and practices of neo-Hindu nationalism. Roy tried to reformulate materialism in the light of latest developments in the physical and biological sciences. He was convinced that without the growth and development of a materialist and rationalist outlook in India, neither a renaissance nor a democratic revolution would be possible. In a way, seeds of the philosophy of new humanism, which was later developed fully by Roy, were already evident in his jail writings.

d. Beyond Communism: Towards New Humanism

 

 

Immediately after his release from jail on 20 November 1936, Roy joined Indian National Congress along with his followers. He organized his followers into a body called League of Radical Congressmen. However, in December 1940, Roy and his followers left Congress owing to differences with the Congress leadership on the role of India in the Second World War. Thereafter, Roy formed the Radical Democratic Party of his own. This signaled the beginning of the last phase of Roy’s life in which he developed his philosophy of new humanism.

After Roy’s release from jail in 1936, Ellen Gottschalk joined Roy in Bombay in March 1937. They were married in the same month. Subsequently, Ellen Roy played an important role in Roy’s life, and cooperated in all of his endeavors.

Roy prepared a draft of basic principles of “radical democracy” before the India conference of the Radical Democratic Party held in Bombay in December 1946. The draft, in which his basic ideas were put in the form of theses, was circulated among a small number of selected friends and associates of Roy. The “22 Theses” or “Principles of Radical Democracy”, which emerged as a result of intense discussions between Roy and his circle of friends, were adopted at the Bombay Conference of the Radical Democratic Party. Roy’s speeches at the conference in connection with the 22 Theses were published later under the title Beyond Communism.

In 1947, Roy published New Humanism A Manifesto, which offered an elaboration of the 22 Theses. The ideas expressed in the manifesto were, according to Roy, “developed over a period of number of years by a group of critical Marxists and former Communists.”

Further discussions on the 22 Theses and the manifesto led Roy to the conclusion that party-politics was inconsistent with his ideal of organized democracy. This resulted in the dissolution of the Radical Democratic Party in December 1948 and launching of a movement called the Radical Humanist Movement. At the Calcutta Conference, itself where the party was dissolved, theses 19 and 20 were amended to delete all references to party. The last three paragraphs of the manifesto were also modified accordingly. Thus, the revised versions of the 22 Theses and the manifesto constitute the essence of Roy’s New Humanism.

e. Final years

 

In 1946, Roy established the Indian Renaissance Institute at Dehradun. Roy was the founder-director of the Institute. Its main aim was to develop and organize a movement to be called the Indian Renaissance Movement.

In 1948, Roy started working on his last major intellectual project. Roy’s magnum opus Reason, Romanticism and Revolution is a monumental work (638 pages). The fully written, revised and typed press copy of the book was ready in April 1952. It attempted to combine a historical survey of western thought with an elaboration of his own system of ideas. While working on Reason, Romanticism and Revolution, Roy had established contacts with several humanist groups in Europe and America, which had views similar to his own. The idea gradually evolved of these groups coming together and constituting an international association with commonly shared aims and principles. The inaugural congress of the International Humanist and Ethical Union (IHEU) was planned to be organized in Amsterdam in 1952, and Roy was expected to play an influential role in the congress and in the development of the IHEU.

However, before going abroad, Roy needed some rest. He and Ellen Roy went up for a few days from Dehradun to the hill station of Mussoorie. On June 11 1952, Roy met a serious accident. He fell fifty feet down while walking along a hill track. He was moved to Dehradun for treatment. On the 25th of August, he had an attack of cerebral thrombosis resulting in a partial paralysis of the right side. The accident prevented the Roys from attending the inaugural congress of the IHEU, which was held in August 1952 at Amsterdam. The congress, however, elected M.N. Roy, in absentia, as one of its vice-presidents and made the Indian Radical Humanist Movement one of the founder members of the IHEU. On August 15 1953, Roy had the second attack of cerebral thrombosis, which paralyzed the left side of his body. Roy’s last article dictated to Ellen Roy for the periodical Radical Humanist was about the nature and organization of the Radical Humanist Movement. This article was published in the Radical Humanist on 24 January 1954. On January 25 1954, ten minutes before midnight, M.N. Roy died of a heart attack. He was nearly 67 at that time.

2. Writings

a. Autobiographical

 

Roy was a prolific writer. He wrote many books edited, and contributed to several journals. However, he was reluctant to write about himself.  M. N. Roy’ Memoirs (627 pages), which he wrote after initial reluctance, only covers a short period of six years from 1915. When Roy was in an Indian prison, his friends in Germany, especially his future wife, Ellen Gottschalk, kept providing him books, which he wanted to read. Roy’s letters to her from jail, published subsequently as Letters from Jail (1943), contains pointers to his reading and thinking during those years.

b. Philosophical

 

Four volumes of Selected Works of M. N. Roy, edited by Sibnarayan Ray, have been published by the Oxford University Press.  Many of the writings of M. N. Roy such as Revolution and Counter-Revolution in China belong to the period when he was a communist. We have already mentioned some of his works related to the final humanist phase of his life, such as, Beyond Communism, New Humanism – A Manifesto and Reason, Romanticism and Revolution. According to M. N. Roy, his books Scientific Politics (1942) along with New Orientation (1946) and Beyond Communism (1947) constitute the history of the development of radical humanism. The final ideas are, of course, contained in New Humanism.

As mentioned earlier, Roy’s real philosophical quest began during his prison years after returning to India in 1930.  He decided to use his prison years for writing a systematic study of ‘the philosophical consequences of modern science’, which was to be in a way a re-examination and re-formulation of Marxism to which he subscribed since 1919. The reflections, which Roy wrote down in jail, grew over a period of five years into nine thick volumes (approximately over 3000 lined foolscap-size pages). The ‘Prison Manuscripts’ have not so far been published in their totality, and are currently preserved in the Nehru Memorial Museum and Library Archives in New Delhi. However, selected portions from the manuscript were published as separate books in the 1930s and the 1940s. Materialism (1940), Science and Superstition (1940), Heresies of the 20th Century (1939), Fascism (1938), The Historical Role of Islam (1939), Ideal Of Indian Womanhood (1941), Science and Philosophy (1947) and India’s Message (1950) are among the books that were made from these handwritten note books. Of these Materialism and Science and Philosophy, are of special interest from the point of view of studying Roy’s concept of philosophy and his formulations on materialism.

Since 1937, Roy was editing a new weekly named Independent India. In 1949, Independent India weekly changed to The Radical Humanist weekly. The name of another quarterly journal The Marxian Way, which Roy had been publishing since 1945 in collaboration with Sudhindranath Datta, was changed to The Humanist Way in the same year. The Humanist Way has ceased publication, but The Radical Humanist is still being published by the Indian Renaissance Institute as a monthly.

3. Roy’s Concept of Philosophy

Philosophy, according to Roy, is contemplation, study and knowledge of nature. Its function is “to know things as they are, and to find the common origin of the diverse phenomena of nature, in nature itself”.

Philosophy, says Roy, begins when “spiritual needs” of human beings are no longer satisfied by primitive natural religion, which imagines and worships a variety of gods as personification of the diverse phenomena of nature. The grown-up human is no longer satisfied with the nursery-tales, with which “he was impressed in his spiritual childhood”. Intellectual growth emboldens him to seek the causes of all natural phenomena in nature itself and to “find in nature a unity behind its diversity.” (Roy 1951)

In his book Science and Philosophy, Roy defines philosophy as “the theory of life”. The function of philosophy, in words of Roy, “is to solve the riddle of the Universe”. According to Roy, philosophy is born out of the efforts of man to explain nature and to understand his own relationship with it. (5-6)

a. Philosophy and Metaphysics

Roy has made a distinction between philosophy and metaphysics. According to him, metaphysics also begins with the desire to discover the unity behind the diversity. However, it leaves the ground of philosophy in its quest for a “noumenon” beyond nature, something that is distinct from “phenomena”. Thus, it abandons the inquiry into what really exists, and “plunges into the wilderness of speculation”. It takes up the absurd task of knowing the intangible, as the condition for the knowing the tangible.

It is obvious that Roy was opposed to speculative philosophy, which set for itself the impossible task of prying into the transcendental being “above and behind” the physical universe – of acquiring the knowledge of the reality behind the appearance. According to Roy, an inquiry, which denies the very existence of the object to be enquired, is bound to end in idle dreams and hopeless confusion.

b. Philosophy and Religion

Roy was opposed not only to speculative philosophy but also to the identification of philosophy with poetic fancy or theology and religion. According to him, for the average educated human, the term philosophy has a very vague meaning. It stands not only for speculative thought, but also for poetic fancy. Not only that, in India, philosophy is often not distinguished from religion and theology. “Indeed,” according to Roy, “what is generally believed to be the distinctive feature of Indian philosophy is that it has not broken away from the medieval tradition, as modern Western philosophy did in the seventeenth century.”

According to Roy, faith in the supernatural does not allow the search for the causes of natural phenomena in nature itself. Therefore, rejection of orthodox religious ideas and theological dogmas is the precondition for philosophy.  Roy was of the view that, religion will certainly be liquidated by the rise of science, because scientific knowledge enables humankind to answer questions, confronted by which in its childhood, it was forced to assume super-natural forces or agencies. Therefore, according to Roy, in order to perform its function, “philosophy must break away from religion” and start from the reality of the physical universe.

c. Philosophy and Science

On the one hand, Roy regards rejection of orthodox religious ideas and theological dogmas as the essential precondition of philosophy, and on the other, he envisages a very intimate relationship between philosophy and science. In fact, according to Roy, the philosophical significance of modern scientific theory is to render untenable the old division of labor between science and philosophy. Science is, says Roy, stepping over the old boundary line, “Digging deeper and deeper into the secrets of nature, science has come up against problems, the solution of which was previously left to philosophy. Scientific inquiry has pushed into what is traditionally regarded as the ‘metaphysical’ realm.”

The problems of philosophy – cosmology, ontology and epistemology – can all be progressively solved, according to Roy, in the light of scientific knowledge. The function of philosophy is, points out Roy, to explain existence as a whole. An explanation of existence requires knowledge of existence. Knowledge about the different phases of existence is being gathered by the various branches of science. Therefore, says Roy, the function of philosophy is to coordinate an entire body of scientific knowledge into a comprehensive theory of nature and life.  Philosophy can now exist only as the science of sciences – a systematic coordination, a synthesis of all positive knowledge, continuously readjusting itself to progressive enlargement of the store of human knowledge. A mystic metaphysical conception of the world is no longer to be accorded the distinction of philosophy. Thus, according to Roy, philosophy is a logical coordination of all the branches of positive knowledge in a system of thought to explain the world rationally and to serve as a reliable guide for life.

4. Roy’s New Humanism: Twenty-Two Theses on Radical Democracy

 

“New Humanism” is the name given by Roy to the “new philosophy of revolution” which he developed in the later part of his life. This philosophy has been summarized by Roy in the “Twenty-Two Theses” and elaborated in his New Humanism A Manifesto.

New Humanism, as presented in the Twenty-Two Theses, has both a critical and a constructive aspect. The critical aspect consists of describing the inadequacies of communism (including the economic interpretation of history), and of formal parliamentary democracy. The constructive aspect, on the other hand, consists of giving highest value to the freedom of individual, presenting a humanist interpretation of history, and outlining a picture of radical or organized democracy along with the way for achieving the ideal of radical democracy.

a. Basic Tenets of New Humanism

 

Apart from Roy’s effort to trace the quest for freedom and search for truth to the biological struggle for existence. The basic idea of the first three theses of Roy is individualism. According to Roy, the central idea of the Twenty-Two Theses is that political philosophy must start from the basic idea that the individual is prior to society, and freedom can be enjoyed only by individuals.

Quest for freedom and search for truth, according to Roy, constitute the basic urge of human progress. The purpose of all-rational human endeavor, individual as well as collective, is attainment of freedom in ever-increasing measure. The amount of freedom available to the individuals is the measure of social progress. Roy refers back the quest for freedom to human being’s struggle for existence, and he regards search for truth as a corollary to this quest. Reason, according to Roy, is a biological property, and it is not opposed to human will. Morality, which originates from the rational desire for harmonious and mutually beneficial social relations, is rooted in the innate rationality of man.

b. Humanist Interpretation of History

 

In his humanist interpretation of history, presented in theses four, five and six, Roy gives an important place to human will as a determining factor in history, and emphasizes the role of ideas in the process of social evolution. Formation of ideas is, according to Roy, a physiological process but once formed, ideas exist by themselves and are governed by their own laws. The dynamics of ideas runs parallel to the process of social evolution and both of them influence each other. Cultural patterns and ethical values are not mere super structures of established economic relations. They have a history and logic of their own.

c. Inadequacies of Communism

Roy’s criticism of communism, contained in theses seven to eleven is based mainly on the experience of the former Soviet Union, particularly the “discrepancy between the ideal and the reality of the socialist order.”  According to Roy, freedom does not necessarily follow from the capture of political power in the name of the oppressed and the exploited classes and abolition of private property in the means of production. For creating a new world of freedom, revolution must go beyond an economic reorganization of society. A political system and an economic experiment which subordinate the man of flesh and blood to an imaginary collective ego, be it the nation or class, cannot possibly be, in Roy’s view, the suitable means for the attainment of the goal of freedom.

The Marxian doctrine of state, according to which the state is an instrument of exploitation of one class by another, is clearly rejected by Roy. According to Roy, the state is “the political organization of society” and “its withering away under communism is a utopia which has been exploded by experience”.

Similarly, Roy rejects the communist doctrine of the dictatorship of the proletariat. “Dictatorship of any form, however plausible may be the pretext for it, is,” asserts Roy, “excluded by the Radical-Humanist perspective of social revolution”.

d. Shortcomings of Formal Parliamentary Democracy

 

Roy has discussed the shortcomings of formal parliamentary democracy in his twelfth and thirteenth theses. These flaws, according to Roy, are outcome of the delegation of power. Atomized individual citizens are, in Roy’s view, powerless for all practical purposes, and for most of the time. They have no means to exercise their sovereignty and to wield a standing control of the state machinery.

“To make democracy effective,” says Roy, “power must always remain vested in the people and there must be ways and means for the people to wield sovereign power effectively, not periodically, but from day to day.”

e. Radical Democracy

 

Thus, Roy’s ideal of radical democracy, as outlined in theses fourteen to twenty-two consists of a highly decentralized democracy based on a network of people’s committee’s through which citizens wield a standing democratic control over the state.

Roy has not ignored the economic aspect of his ideal of radical democracy. He argued that progressive satisfaction of the material necessities is the pre-condition for the individual members of society unfolding their intellectual and other finer human potentialities. According to him, an economic reorganization, which will guarantee a progressively rising standard of living, is the foundation of the Radical Democratic State. “Economic liberation of the masses”, says Roy, “is an essential condition for their advancing towards the goal of freedom.”

The ideal of radical democracy will be attained, according to Roy, through the collective efforts of mentally free men united and determined for creating a world of freedom. They will function as the guides, friends and philosophers of the people rather than as their would-be rulers. Consistent with the goal of freedom, their political practice will be rational and, therefore, ethical.

Roy categorically asserts that a social renaissance can come only through determined and widespread endeavor to educate the people as regards the principles of freedom and rational co-operative living.  Social revolution, according to Roy, requires a rapidly increasing number of men of the new renaissance, and a rapidly expanding system of people’s committees and an organic combination of both. The program of revolution will similarly be based on the principles of freedom, reason and social harmony.

As pointed out by Roy himself in his preface to the second edition of the New Humanism: A Manifesto, though new humanism has been presented in the twenty-two theses and the Manifesto as a political philosophy, it is meant to be a complete system. Because of being based on the ever-expanding totality of scientific knowledge, new humanism cannot be a closed system. “It will not be”, says Roy, “a dogmatic system claiming finality and infallibility.”

f. Philosophical Revolution or Renaissance

It is obvious from the foregoing that Roy was a great supporter of philosophical revolution or renaissance, and he has given a central place to it in his radical humanism. Roy was an admirer of European renaissance and drew inspiration from it. For him, “the renaissance was the revolt of man against God and his agents on this earth”. According to Roy, the renaissance “heralded the modern civilization and the philosophy of freedom”. He strongly believed that India, too, needed a renaissance on rationalist and humanist lines. According to him, this was a necessary condition for democracy to function in a proper manner. He believed that “a new Renaissance based on rationalism and cosmopolitan humanism” was essential for democracy to be realized. (Roy has used the word “rationalist” not in the Cartesian sense but in the popular sense. In this sense, a “rationalist” regards reason including both perception and inference as a source of knowledge.)

According to Roy, a philosophical revolution must precede a social revolution. He was opposed to blind faith and superstitions of all kinds and supported rationalism. As a physical realist, he rejected all allegedly supernatural entities like god and soul. Similarly, he was opposed to fatalism and the doctrine of karma. He unequivocally rejected the religious mode of thinking and advocated a scientific outlook and a secular morality. As noted earlier, he was in favor of delinking philosophy with religion and associating it closely with science.  He believed that science would ultimately liquidate religion.

According to Roy, a revolutionary is one who has got the idea that the world can be remade, made better than it is to-day, that it was not created by a supernatural power, and therefore, could be remade by human efforts.

Further, according to Roy, “the idea of improving upon the creation of God can never occur to God-fearing. We can conceive of the idea only when we know that all gods are our own creation, and we can depose whomsoever we have enthroned.”

Roy’s critical approach towards religion comes out very clearly in the preface of his book, India’s Message, where he asserts that a criticism of religious thought and a searching analysis of traditional beliefs and the time-honored dogmas of religion is essential for the belated Renaissance of India. “The spirit of inquiry should overwhelm the respect for tradition.”

According to Roy, “a critical examination of what is cherished as India’s cultural heritage will enable the Indian people to cast off the chilly grip of a dead past. It will embolden them to face the ugly realities of a living present and look forward to a better, brighter and pleasanter future.” Thus, Roy was opposed to an uncritical and vain glorification of India’s so-called “spiritual” heritage. However, he did not stand for a wholesale rejection of ancient Indian thought either. He favored a rational and critical approach towards ancient traditions and thoughts. Roy believed that the object of European renaissance was to rescue the positive contributions of ancient European civilization, which were lying buried in the Middle Ages owing to the dominance of the Church. Roy had something similar in his mind about India. According to him, one of the tasks of the Renaissance movement should be to rescue the positive outcome and abiding contributions of ancient thought – contributions, which just like the contributions of Greek sages, are lying in ruins under the decayed structure of the Brahmanical Society – the tradition of which is erroneously celebrated as the Indian civilization.

5. Roy’s Materialism or Physical Realism

a. Roy’s Materialism

M .N. Roy was a strong supporter of materialist philosophy. According to Roy, strictly speaking, materialism is “the only philosophy possible”, because  it represents the knowledge of nature as it really exists—knowledge acquired through the contemplation, observation and investigation of nature itself.

Roy points out that materialism is not the “monstrosity” it is generally supposed to be. It is not the cult of “eat, drink and be merry”, as it has been depicted by its ignorant or malicious adversaries. It simply maintains that “the origin of everything that really exists is matter, that there does not exist anything but matter, all other appearances being transformations of matter, and these transformations are governed necessarily by laws inherent in nature.”  (Roy 1951)

Thus, broadly speaking, Roy’s philosophy is in the tradition of materialism. However, there are some important differences between Roy’s materialism and traditional materialism.  In fact, Roy’s “materialism” is a restatement of traditional materialism in the light of contemporary scientific knowledge.  According to Roy, the substratum of the universe is not matter as traditionally conceived, but it is “physical as against mental or spiritual”. It is, in other words, “a measurable entity”. Therefore, says Roy, to prevent prejudice, materialism could be renamed “physical realism”.

b. Roy’s Materialism and Traditional Materialism

Roy was of the view that materialism must be dissociated from certain notions, which have been rendered untenable by the discoveries of science. His revision and restatement of materialism embraces both the basic tenets of materialism: the concept of matter as well as the doctrine of physical determinism.

i. Change in the Concept of “Matter”

According to Roy, the discoveries of quantum physics have “made the classical notion of matter untenable”. Nevertheless, Roy insists that though the substratum of the universe is “not matter as traditionally conceived” it is “physical as against mental or spiritual. It is a measurable entity”.

The so-called “crisis” of materialism, according to Roy, involved the conception of matter, and not its existence.  The “crisis” simply exposed the inadequacy of the old atomist theory. The substance of the “crisis” was, “that it appeared to reduce matter from mass to energy and radiation”. However, there cannot be any doubt about the fact that “atomic physics deals with material realities which exist objectively, outside the mind of the physicist.” Thus, in Roy’s physical realism “matter” is not made up of hard and massy stone-like atoms as in traditional “mechanical materialism”. The whole concept of “matter” has been revised in the light of new physics. In fact, Roy was even ready to discard the term “matter” provided a more appropriate new term could be coined. In Science and Philosophy, Roy describes “matter” as the “sole-existence.” According to Roy, it is not very important what name is attached to the “substratum of existence” – matter, energy, action, vibratory motion or field. However, he insists that it is a physical reality. What Roy means by calling it physical is that it exists objectively and that it is measurable. As we have seen, Roy has even renamed his revised version of materialism as “physical realism”.

ii. Revision of Physical Determinism

 

Roy disagrees with the view of some thinkers that Heisenberg’s Principle of Uncertainty requires the rejection of the doctrine of determinism. According to Roy, only a modification in the traditional conception of causality is required. Causality, in Roy’s view, is not an a priori form of thought or an axiomatic law; it is physical relation inherent in the constitution of the universe.

Roy, in fact, tries to temper a rigidly mechanical view of determinism by interpreting it in terms of probability. He admits plurality of possibilities and the element of contingency in the world, and tries to show that determinism and probability are not mutually exclusive. However, Roy insists that statistical methods presuppose determinism. The universe is a law-governed system, and the existence of law presupposes causality. He is emphatic that the element of uncertainty in the sub-atomic world is not to be equated with indeterminacy. Rejection of the idea that there are invariant relations in nature, maintains Roy, will blast the very foundation of science.

Roy also tries to reconcile “freedom of will” with determinism. According to him, human beings possess free will and can choose out of various alternatives in front of them. Roy, however, is not unique among materialists in recognizing free will. Epicurus, among ancient Greek materialists, and Hobbes, among modern materialists, tried to accommodate free will in their philosophies. According to Roy, the vast world of biological evolution lies between the world of human beings and the world of inanimate matter, and, therefore, the world of human beings has its own specific laws, though these laws can be referred back to the general laws of the world of dead matter. Nevertheless, human will, says Roy, cannot be directly related to the laws of physical universe. Thus, Roy is not, to use the terminology of William James, a “hard” determinist like Holbach, but a “soft” determinist like Hobbes.

iii. Objective Reality of Ideas and the Autonomy of the Mental World

Though Roy traces the origin of mental activities to the physical background of the living world, yet he also grants them an objective existence of their own. Mind and matter, according to Roy, can be reduced to a common denominator; as such, they are two objective realities. In Roy’s view, once formed, ideas exist by themselves, governed by their own laws. Thus, Roy grants much more objectivity and autonomy to the mental world than has been traditionally granted by materialists. Roy’s materialism is not an “extreme” materialism like that of the eighteenth century French materialist, Julien de la Mettrie, who regarded man to be a self-moving machine. According to Roy, on the other hand, “Man is not a living machine, but a thinking animal”.

iv. Emphasis on Ethics

Roy has given a very important place to ethics in his philosophy. According to Roy, “the greatest defect of classical materialism was that its cosmology did not seem to have any connection with ethics”. Roy strongly asserts that if it is not shown that materialist philosophy can accommodate ethics, then, human spirit, thirsting for freedom, will spurn materialism. In Roy’ view materialist ethics is not only possible but also the noblest form of morality. Roy links morality with human being’s innate rationality. Human beings are moral, according to Roy, because they are rational. In Roy’s ethics freedom, which he links with the struggle of existence is the highest value. Search for truth is a corollary to the quest for freedom.

However, Roy is not unique among materialists in emphasizing the importance of ethics in his philosophy. Contrary to popular impression, ancient materialist Epicurus and modern materialist Holbach, for example, accorded an important place to ethics in their philosophies. However, the details of Roy’s ethics are somewhat different from these philosophers.

c. Roy’s Materialism and Marxian Materialism

 

Before he formulated and expounded his own philosophy of New Humanism, Roy was an orthodox Marxist. In fact, Roy’s revision of materialism was carried out in the context of Marxism. Thus, Roy’s revision of materialism in general is also applicable to Marxian materialism to the extent Marxian materialism resembles traditional materialism.

Roy’s Physical Realism is, however, different from Marxian materialism in three important ways:

i. Delinking of Dialectics and Materialism

Roy considers the Hegelian heritage a weak spot of Marxism. The simplicity and scientific soundness of materialism are marred by making its validity conditional upon dialectic. According to Roy, materialism pure, and simple, can stand on its own legs, and, therefore, he tries to de-link dialectics from materialism. The validity of materialism, maintains Roy, is in no way conditional on dialectics, as there is no logical connection between the two.

ii. Rejection of Historical Materialism

Roy rejects historical materialism and advocates a humanist interpretation of history in which he gives an important place to human will as a determining factor in history, and he recognizes the autonomy of the mental world. He argues that human will cannot be directly related to the laws of physical universe. Ideas, too, have an objective existence, and are governed by their own laws. The economic interpretation of history is deduced from a wrong interpretation of materialism.

iii. Emphasis on Ethics

 

Roy’s materialism is sharply different from Marxian materialism in so far it recognizes the importance of ethics and gives a prominent place to it. According to Roy, Marxian materialism wrongly disowns the humanist tradition and thereby divorces materialism from ethics. The contention that “from the scientific point of view this appeal to morality and justice does not help us an inch farther” was based, according to Roy, upon a false notion of science.

d. Roy and Lokayata

 

His materialism or physical realism could be placed in the tradition of ancient Indian materialism: Lokayata or Carvaka. In fact, in the third chapter of his book Materialism, titled “Materialism in Indian Philosophy”, Roy has approvingly referred to Lokayata. According to him, “the long process of the development of naturalist, rationalist, skeptic, agnostic and materialist thought in ancient India found culmination in the Charvak system of philosophy, which can be compared with Greek Epicureanism, and as such is to be appreciated as the positive outcome of the intellectual culture of India”. (94)

6. Roy’s Intellectual Legacy

Roy was a former Marxist and a hero of Indian communists for having rubbed shoulders with Lenin and Stalin. However, he later gave up Marxism and advocated his own “radical humanism”. Naturally, he has been criticized generally by Marxists and communists for renouncing Marxism as well as for finding fault with communist doctrines like “the dictatorship of the proletariat”. Nonetheless, there was a small group of intellectuals who collaborated closely with him in preparing the “Twenty-two Theses on Radical Democracy” and New Humanism: A Manifesto; namely, V. M. Tarkunde, Phillip Spratt, Laxman Shastri Joshi, Sibnarayan Ray, G. D. Parikh, G. R. Dalvi, Sikander Choudhary and Ellen Roy (Roy’s wife). Some of them like Tarkunde, Sibnarayan Ray and G. D. Parikh remained active in the radical humanist movement launched by Roy and also wrote and edited books on him. (See References and Further Reading)

Among the journals founded by M. N. Roy, The Humanist Way, has ceased publication, but The Radical Humanist is still being published as a monthly by the Indian Renaissance Institute. It was edited for a long time by Tarkunde. Since Tarkunde, it has been edited by R. M. Pal and R. A. Jahagirdar among others.

Outside the select group of Roy’s close friends and associates, A. B. Shah, founder of the Indian Secular Society and The Secularist journal was one of the important intellectuals influenced by Roy’s ideas.

Some of his final ideas are open to criticism even from a humanist perspective. For example, Roy’s use of the word “spiritual” in certain contexts is problematic. Roy talks of “spiritual needs” and “spiritual childhood” of human beings (Section 3), when, in fact, he was a materialist. He was at pains to emphasize that reality is “physical as against mental or spiritual” (Section 5 a ). As a materialist, he was also opposed to the vain glorification of the so-called “spiritual” heritage of India. (Section 4 f ) Apparently, Roy has used the word “spiritual” in phrases like “spiritual needs” and “spiritual childhood” in the sense of “intellectual.” Nonetheless, the use of the term “spiritual” by Roy even in these contexts could be misleading, considering the fact that he did not believe in the existence of soul and spirit. (Ramendra 2001)

Roy’s advocacy of party-less democracy, too, is open to criticism. Freedom of association is a fundamental democratic freedom. In any democracy worth the name, citizens with similar political ideas and programs are bound to come together and cooperate with one another by forming political parties and other non-party organizations. The only possible way to prevent them from doing so will be to deny the fundamental right to association, which will be an undemocratic act in itself. Therefore, the ideal of “party-less democracy” seems to be self-contradictory, impractical and unrealizable.

7. References and Further Reading

a. Primary Sources

  • Parikh, G. D. compiler.  Essence of Royism (Bombay: Nav Jagriti Samaj, 1987).
    • Anthology of  M. N. Roy’s writings by one of his close academician associates.
  • Ray, Sibnarayan. ed., Selected Works of M. N. Roy, vol. I, (Delhi: Oxford University Press, 1987).
    • Altogether four volumes have been published. The first volume contains an illuminating introduction written by the editor, another close academician associate of Roy.
  • Roy, M. N. and Spratt, Phillip.  Beyond Communism (Delhi: Ajanta Publications, 1981).
    • Shows Roy’s transition from Marxism to Humanism.
  • Roy, M. N.  India’s Message (Delhi: Ajanta Publications, 1982).
    • One of the books made out of the ‘Prison Manuscripts’. Published originally as the second volume of Fragments of a Prisoner’s Diary.  First volume was titled Memoirs of a Cat.
  • Roy, M. N.  Letters from Jail ( Calcutta: Renaissance Publishers Private Ltd., 1965).
    • Third volume of Fragments of a Prisoner’s Diary.  Letters written to his future wife and a close associate, Ellen Roy.
  • Roy, M. N.  Materialism (Calcutta: Renaissance Publishers Ltd., 1951).
    • From the ‘Prison Manuscripts’.
  • Roy, M. N.  M. N. Roy’s Memoirs (New Delhi: Ajanta Publications, 1983).
    • Covers a short period of six years from 1915.
  • Roy, M. N. New Humanism – A Manifesto (Delhi: Ajanta Publications, 1981).
    • Roy’s final ideas on humanism
  • Roy, M. N. New Orientation (Delhi: Ajanta Publications, 1982).
    • Transition from Marxism to Humanism.
  • Roy, M. N. Politics, Power and Parties (Delhi: Ajanta Publications, 1981).
    • Roy’s views on party-politics.
  • Roy, M. N. Reason, Romanticism and Revolution (Delhi: Ajanta Publications, 1989).
    • Last major work.
  • Roy, M. N. Science and Philosophy (Delhi: Ajanta Publications, 1984).
    • From the ‘Prison Manuscripts’.
  • Roy, M. N. Scientific Politics (Calcutta: Renaissance Publishers, 1947).
    • Transition from Marxism to Humanism

b. Secondary Sources

  • Karnik, V. B. M. N. Roy (New Delhi: National Book Trust, 1980).
    • A biography of Roy by one of his close associates.
  • Pal, R. M. ed., Selections from the Marxian Way and the Humanist Way (Delhi: Ajanta Publications, 2000).
    • Selections from the journal The Marxian Way and The Humanist Way (changed name), edited by Roy.
  • Ramendra. M. N. Roy’s New Humanism and Materialism (Patna: Buddhiwadi Foundation, 2001).
    • A critical study of Roy’s new humanism and materialism.
  • Ray, Sibnarayan, ed. M.N. Roy Philosopher-Revolutionary (New Delhi: Ajanta Publications, 1995).
    • Contains some writings of  M. N. Roy, but also writings of many others on Roy, including that of V. B. Karnik, Suddhindranath Datta, Ellen Roy, Laxman Shastri Joshi, Phillip Spratt, G. D. Parikh, Stanley Maron and H. J. Blackham.
  • Tarkunde, V. M.    Radical Humanism (New Delhi: Ajanta Publications, 1983).
    • An exposition of radical humanism by an intellectual-activist and a close associate of Roy.

Author Information

Ramendra Nath
Email: ramendra@sancharnet.in
Patna University
India

Angélique Arnauld (1591—1661)

The reforming abbess of the convent of Port-Royal, Mère Angélique Arnauld developed an Augustinian philosophy deeply influenced by the Jansenist movement.  Her philosophy of God follows the via negativa in its stress on God’s incomprehensibility.  Her theory of moral virtue is emphatically theocentric; even such natural virtues as patience are presented as emanations of grace.   The monastic virtues of poverty and humility receive pride of place in this account.  Reflecting her struggle with political and ecclesiastical authorities, Arnauld’s philosophy stresses the rights of women to education, to theological culture, to self-government, and to fidelity to personal conscience.  Although she affirms the existence of free will, Arnauld’s emphasis on the depth of human depravity and the dependence of the moral agent on divine grace for the execution of virtuous action bears the traces of theological determinism.  Long considered a minor voice in the Jansenist strand of Augustinian philosophy developed by Antoine Arnauld and Blaise Pascal, Angélique Arnauld’s original variations on Augustinianism have recently been underscored by feminist commentators.  The abbess’s gendered account of virtue is drawn from the experience of nuns committed to a strict contemplative life. Her defense of the rights of women to theological culture and opinion challenged the patriarchal commands issued by throne and altar that recommended religious silence for nuns.

Table of Contents

  1. Biography
  2. Works
  3. Philosophical Themes
    1. Apophatic Theology
    2. Virtue Theory
    3. Theological Determinism
    4. Gender and Right
  4. Interpretation and Relevance
  5. References and Further Reading
    1. Primary Sources
    2. Secondary Sources

1. Biography

Born on September 8, 1591, Jacqueline Marie Arnauld belonged to a prominent noblesse de robe family.  Both her father Antoine Arnauld and her mother Catherine Marion Arnauld descended from families renowned for their lawyers, parliamentarians, and diplomats.  Her grandfather Antoine Arnauld the Elder had battled the Jesuits in the parlement of Paris.  The family’s juridical culture and hostility to the Jesuits would durably mark the younger generation during the battle over Jansenism.

From her infancy, Jacqueline Marie Arnauld had been designated by the family as a future nun and superior.  In 1599 her maternal grandfather Simon Marion successfully obtained from King Henri IV the royal appointment of his niece as the abbess of Port-Royal, a Cistercian convent in the southwestern hinterlands of Paris. The family falsified the age of the abbatial nominee in documents sent to the Vatican in order to circumvent the requirements of canon law.  To receive the rudiments of religious education, Jacqueline Marie was sent to the decadent convent of Maubuisson, governed by the sister of the king’s mistress.  Despite the low level of education, the nascent abbess discovered Stoicism through her reading of Plutarch.  Now known by her religious name as Mère Marie-Angélique de Sainte-Magdalene [popularly as Mère Angélique], the young nun was formally installed as coadjutor abbess on April 10, 1602 and upon the death of Abbess Jeanne Boulehart on July 3, 1602, was declared the abbess of Port-Royal.  This childhood experience of forced vocations, trafficking in religious office, and perjury to attain social position would influence the mature Mère Angélique’s campaign to abolish just such abuses in the name of vocational freedom.

The eleven-year old abbess had inherited a lax convent containing scarcely a dozen nuns.  Liturgical offices had languished; sermons were rarely heard; the nuns entertained themselves by leisurely visits with the neighbors, masked balls, and elaborate decoration of their private apartments.  In 1608, Mère Angélique underwent a sudden conversion.  With the reluctant consent of her subjects, she launched the convent’s reform.  Her reforms included the abolition of private property and the institution of strictly communal property; the restoration of the rule of silence; the restoration of the night office; the institution of a vegetarian diet; the strict practice of cloister, with visitors relegated to a parlor divided by a grille.  The Angelican reform campaign reached a dramatic apogee on September 15, 1609, the journée du guichet, when Mère Angélique refused her own family admission to the convent.

Intellectual as well as moral, the Angelican reform also devoted itself to the development of a theological culture by the nuns.  Distinguished theologians, notably Archange de Pembroke, Benot de Canfield, and François de Sales, preached and lectured at the convent.  The philosophical and theological orientation of these preachers was clearly Augustinian rather than Thomistic.  This new Augustinian intellectual culture was reinforced by readings from Saint Augustine and Saint Bernard of Clairvaux at table and by the neo-Augustinian conferences on the spiritual life delivered by Mère Angélique herself.  During the Oratorian ascendancy (1626-33), when Oratorian priests served as the convent’s chaplains, Port-Royal fell under the influence of the mystical école française.  A disciple of the Pseudo-Dionysius, Pierre Bérulle defended apophatic theology, in which the negative attributes of the godhead are underscored; Charles Condren stressed the annihilation of self which is essential for union with God and authentic knowledge of the divine attributes.  The Oratorian themes of divine incomprehensibility and annihilation of the human subject clearly influenced Mère Angélique’s philosophy.

Under Mère Angélique’s direction, Port-Royal flourished as a model of reformed convent life.  In 1626, the convent opened a Parisian branch in the neighborhood adjacent to the Sorbonne.  In 1629, King Louis XIII approved one of the abbess’s most cherished reforms: the election of the abbess by the convent nuns themselves rather than by royal appointment; the term of office would now be fixed (three years) rather than lifetime.  The nuns elected Mère Geneviève Tardiff as Mère Angélique’s successor in 1630.  The new episcopal overseer of the convent, Bishop Sebastien Zamet, opposed Mère Angélique on the nature of conventual reform; the austere Angelican reform was soon replaced by more theatrical liturgies and greater intercourse with the Parisian lay elite.  The effort of Zamet and Mère Angélique to develop a new religious order, the ephemeral Institut du Saint-Sacrement (1633-36), ended in bitter failure.  The election of Mère Angélique’s sister Mère Agnès Arnauld as Port-Royal’s new abbess in 1636 permitted a swift return to the principles of the Angelican reform.  Appointed as novice mistress, Mère Angélique now focused on the spiritual and intellectual formation of younger community members.

The restoration of the Angelican reform also marked the transformation of Port-Royal into a bastion of Jansenism.  Jean de Hauranne, abbé de Saint-Cyran, became the convent chaplain.  A friend and disciple of Cornelius Jansen, Louvain theologian and bishop of Ypres, Saint-Cyran propagated the radical Augustinianism of Jansen through sermons, lectures, and spiritual direction.  Typical Jansenist themes included radical human depravity, the centrality of predestination, the small number of the elect, complete dependence on divine grace for salvation, and moral rigorism.  The imprisonment of Saint-Cyran (1638-43) initiated the French government’s campaign against the convent and its Jansenist theology, perceived as heretical and supportive of political dissidence.

In subsequent decades, the Jansenist orientation of Port-Royal intensified.  Devoted to meditation and scholarship, a community of laymen living in the environs of the convent (the solitaires) published the works of Augustine, Jansen, Saint-Cyran, and other authors favored by the Jansenists.   These works became the staples of the convent school curriculum and of the texts read aloud during convent meals.  The Arnauld family spearheaded the Jansenist movement.  Mère Angélique’s brother Antoine the Younger Arnauld emerged as the school’s leading theologian; her brother Henri Arnauld, bishop of Angers, defended Jansenism in the episcopate; her brother Robert Arnauld d’Andilly became the most prolific of the solitaire translators and editors.  Through correspondence Mère Angélique personally participated in the philosophical and theological disputes occasioned by Antoine Arnauld’s Frequent Communion (1643), Blaise Pascal’s Provincial Letters (1656), and Jean de Brisacier’s anti-Jansenist Jansenism Confounded (1651).

In 1661 Louis XIV intensified the campaign against Port-Royal.  The French government demanded that all clerics, teachers, and members of religious orders sign a statement in which they assented to the papacy’s condemnation of five heretical theses allegedly found in Jansen’s Augustinus.  To guide the perplexed conscience of Jansenists during “the crisis of the signature,” Antoine Arnauld developed the droit/fait distinction.  According to this distinction, the judgment of the church on matters of faith and morals (droit) binds the conscience of the faithful, since such judgments were essential to the church’s divine mission of guiding people to salvation.  On judgments of empirical fact (fait), however, the church’s judgment is fallible, open to subsequent alteration or even reversal.  Using the droit/fait distinction, the Jansenists could assent to the church’s condemnation of the five condemned theses concerning grace and freedom; however, they could not assent to what they considered the church’s erroneous judgment of fact that Jansen had actually defended these heretical theories.

Despite Mère Angélique’s efforts at intervention with civil and ecclesiastic authorities, the campaign of coercion against Port-Royal stiffened.  In April 1661 the throne closed the convent school and novitiate; the convent’s chaplain and confessors were exiled.  In June 1661 the nuns reluctantly signed the controversial statement concerning Jansen but they declared that they only offered a reserved assent, to be interpreted according to the droit/fait distinction.  At the urging of Louis XIV, the Vatican quickly annulled the reserved signature and demanded unconditional submission to all the church’s judgments in the controversy.

In the midst of the crisis Mère Angélique Arnauld died on August 6, 1661.

2. Works

From a philosophical perspective, two works written by Mère Angélique Arnauld merit particular attention.  They are her correspondence and a series of conferences she gave at Port-Royal.

Published in 1742-44 as the three-volume Letters of Reverend Mère Angélique Arnauld, Abbess and Reformer of Port-Royal [L], the massive correspondence of Mère Angélique includes more than thirteen hundred letters and more than one hundred correspondents.  Her letters to her scholarly brothers, Antoine Arnauld and Robert Arnauld d’Andilly, clearly express her philosophical Augustinianism.  She repeatedly declares herself a “disciple of Saint Augustine” and defends neo-Augustinian positions on the nature of virtue, the critique of casuistry, and the relationship between grace and freedom.  Her letters to her nephew Antoine Le Maître and her niece Angélique Arnauld d’Andilly present the abbess’s theories of the rights of the conscience and the limits of civil and ecclesiastical authority.  Numerous letters study the exercise of authority by women.  In her correspondence with fellow women superiors, Mère Angélique emphasizes the right of superiors to act as the convent’s principal spiritual director, including the right to appoint the convent’s chaplain and confessors and to instruct the nuns in theological matters germane to convent life.  Her extensive correspondence with Marie-Louise de Gonzague, queen of Poland, analyzes the virtues necessary for Christian women to govern others in the civic sphere and to endure reversals of fortune.

The correspondence also indicates Mère Angélique’s philosophical and theological culture.  Outside of the Bible, Saint Augustine is the author most cited in the letters; his Confessions are the work most frequently quoted.  The abbess often cites neo-Augustinian authors in the Jansenist circle: Saint-Cyran, Blaise Pascal, Antoine Arnauld, Pierre Nicole, Claude Lancelot, Antoine Singlin, and Antoine Le Maître.   Saint Teresa of Avila, whose works were translated into French by Robert Arnauld d’Andilly, is the most influential of the women authors cited by Mère Angélique, especially in her considerations on the exercise of authority by women.

Published in 1757, Speeches or Conferences of Reverend Mother Marie-Angélique Arnauld, Abbess and Reformer of Port-Royal [C] are transcriptions of the informal talks given by the abbess to the nuns at Port-Royal during the 1650s.  In these conferences the abbess often addresses moral issues pertinent to convent life.  She argues that it is intention rather than the quality of an act that determines the act’s moral worth.  Her severe judgments on minor ethical lapses by nuns and laity reflect the influence of Saint-Cyran’s moral rigorism.

Other major works by Mère Angélique include her autobiographical Report Written by Mère Angélique Arnauld, in which the abbess defends her policy of conventual reform, and her Commentary on the Rule of Saint Benedict, especially her discussion of the monastic virtue of humility.

3. Philosophical Themes

Angélique Arnauld develops philosophical arguments in three major areas: philosophy of God, especially the question of divine attributes; moral philosophy, in particular theory of the virtues; and gender theory, especially as it relates to the education and spiritual rights of women.  Arnauld’s philosophical reflection is subordinated to her larger theological project of Augustinian theology and reflects the distinctive culture of a convent devoted to the Jansenist cause.

a. Apophatic Theology

In her presentation of the divine attributes, Mère Angélique focuses on God’s incomprehensibility.  This apophatic theology stresses that what the human agent cannot grasp about the divine essence far surpasses what the human agent claims to know through positive images and concepts.

This incapacity to penetrate God’s essence stems from the stark alterity between divine and human natures.  “Everything God does is above human ability.  It is incomprehensible to our minds.  If we exist before God as if we were the smallest ants, is it so strange that we can neither know him nor grasp his intentions…. Between human beings and God there is no proportion.  They must acknowledge that God is a being infinitely superior to them.  As a result, it is impossible for them to know his ways” [C, 254].  Analogical affirmations of God’s attributes collapse in such an account of divine/human otherness.  The sinful nature of postlapsarian humanity deepens the chasm between humanity and God.  Only divine self-revelation can disclose a glimmer of God’s authentic nature to a human reason enticed by idols of its own imagination.

Arnauld’s apophatic theology underscores a particular dimension of divine incomprehensibility: inscrutability.  Although the Christian affirms the operation of divine providence in the ordering of the world, the accurate interpretation of God’s intention behind a particular historical act eludes human power.  “The judgments and ways of God are admirable.  Often God seems to want to destroy when he wants to edify.  What appears to be the result of his wrath and justice is actually the result of his mercy.  Blessed are the souls who perceive in every event only God’s holy will in order to submit to it” [L; to the Abbess of Gif; 23 December 1651].  Arnauld’s appeal to God’s inscrutability also expresses the voluntarist cast of her philosophy of God and of human nature.  It is divine sovereignty and self-abandonment to the divine will that are central in the convenant between God and the believer.

b. Virtue Theory

In her correspondence and conferences, Mère Angélique not only exhorts her public to the practice of virtue; she repeatedly defines the virtues in the context of monastic life.  Furthermore, she defends an Augustinian account of the moral virtues.  Authentic moral virtue can only spring from the operation of the theological virtues of faith, hope, and charity.  The theological virtues themselves are the gift of divine grace, not the creation of human endeavor.  Apparently natural virtues, such as the cardinal virtue of courage, are only the masks of pride.

Arnauld’s discussion of the virtues privileges those moral habits typical of conventual life.  Rooted in the monastic vow of poverty, the virtue of poverty not only renounces personal possessions and cultivates sympathy for the materially poor; it refuses all aesthetic pleasures, even those found in liturgical arts and ceremonies.  Contempt for the world inevitably provokes persecution by the world.  “Many graces of God are required in order to disdain the world and tolerate its disdain for us….It is impossible for us to hate the world without the world hating us and considering us fools” [L; To Mademoiselle de Luzanci; September 7, 1650].  The centerpiece of monastic virtue, piety demands complete oblation of oneself.  Authentic piety requires total abolition of human personal desire as one obeys the divine will.  “God has given everything to humanity so it will give everything to him.  By this sacrifice it obtains from God’s goodness the grace to possess him and to reign with him forever” [L; to Marie-Louise de Gonzague, queen of Poland; November 12, 1655].  The moral rigorism typical of the Jansenist movement marks this account of the nature of the prominent virtues in monastic life.  Militant opposition to the world accompanies every authentic expression of moral virtue.

The theocentric framework of Arnauld’s virtue theory also appears in her analysis of more secular virtues.  Patience is rooted in the recognition of our grave sinfulness before God.  Tolerating the imperfections of others prevents us from descending into moral complacency and the illusion of personal perfection.  “To avoid falling, we must humbly tolerate the faults of others and of ourselves.  We are born in sin and we will be totally liberated from it only at the moment of our death” [L; to Madame Allen; May 12, 1665].  Even the Socratic virtue of self-knowledge undergoes theological reconstruction.  The moral self-knowledge granted by divine grace surpasses any self-knowledge acquired by the natural means of reflection.  “If you had spent an entire year making an examination of conscience with the most meticulous care, it would have been inferior to the moment when God touched your heart.  You could not have been as well prepared and you could not have known your faults as well as you do right now without any personal examination of conscience.  It is grace alone that can grant us such knowledge” [L; to Anne de Rohan, Princess de Guéménée; November 15, 1629].  Authentic self-knowledge thus emerges as a product of divine illumination.

Arnauld’s analysis of the moral virtues affirms a traditional tenet of Augustinian virtue theory: that the allegedly natural virtues praised by pagan philosophers are in fact disguised vices and that authentic moral virtue is rooted in the theological virtues grafted into the human will by divine grace.   Following the Augustinian perspective, Mère Angélique dismisses the neoclassical virtues of magnanimity and courage as masks of self-satisfaction.  “There is nothing more pleasant for a generous person than the act of giving.  People so possess courage according to the world’s standard fear nothing so much as asking for something.  It is characteristic of the rich to give and the poor to receive” [C, 71].  It is selfless intention, rather than external act, that is the criterion for the existence of authentic virtue.  Only the presence of the divinely infused virtues of faith, hope, and charity can provide the proper theocentric motivation for the exercise of the moral virtues.  Arnauld’s analysis of these theological virtues expresses her typical rigorism.  Faith plunges the moral agent into combat against the world, the flesh, and the devil.  Authentic hope turns the believer against the sinful hope of domination that poisons the courtier.  Charity demands the annihilation of the self.  “Charity is nothing other than the love of God and the love of God is an ardent desire that God reign in everything and that all creatures annihilate themselves and recognize his supreme grandeur and infinite majesty” [C, 77].

Reflecting the Jansenist combat with the French throne and episcopate, Mère Angélique stresses the virtue of suffering for the truth.  Under persecution, this virtue of sacrificial witness to the truth has become the hallmark of the convent’s moral character.  “I am overwhelmed by the vision I have of the great and singular grace God has granted us to suffer for the truth, trying to serve the souls he has redeemed by his death” [L; to Antoine Arnauld; early 1644].  This suffering for the contested truths of salvation, however, must be animated by the theological virtue of charity.  Mère Angélique censures the sarcasm that has increasingly stained the polemical defenses of the convent by Jansenist sympathizers.  “I only desire that God may deign to fill you deeply, not only with the knowledge of the truth, but with a perfect love that you practice faithfully and that gives you a humble patience toward everything.  The current impassioned applause from so many people disturbs me” [C; to Antoine Arnauld; April 1644].  To be authentic, even the convent’s distinctive willingness to suffer persecution for the sake of doctrinal truth must remain rooted in the infused virtue of charity toward one’s opponents.

c. Theological Determinism

Angélique Arnauld’s heavy emphasis on divine grace as the cause of all virtuous action often appears close to a species of theological determinism.  So central is the initiative and sovereignty of God in all cosmic and moral activity that free will appears to disappear.  Human beings remain voluntaristic beings inasmuch as they can choose one course of action over another after deliberation, but the deliberative and elective activities appear destined by concupiscence to involve the selection of evil unless divine grace intervenes through an inscrutable decree of divine providence.

The abbess reinforces this theological determinism by her insistence on the Augustinian doctrine of predestination.  The salvation of sinners, illustrated through the conversion of the good thief by Jesus on the cross, is due entirely to divine initiative, not to the personal merit of the saved.  “God grants his grace to whomever he pleases.  This is seen in the prayer of the thief he touched and converted in the very act of sinning.  At first he blasphemed just like the other thief; nevertheless, one is taken and the other is left behind.  Jesus Christ gazes at the former with his great mercy and leaves the latter by an effect of justice that condemns him” [C, 108].  Moral conversion and the concomitant status of being saved are caused uniquely by divine election.

Reinforcing the Augustinian framework of predestination, Arnauld rejects the compromise solution of human cooperation with divine grace.  The abbess insists on the comprehensive nature of divine causation in the act of salvation.  “The creature could never cooperate with the grace of God.  It would lose infinitely more than it could keep for its use.  God’s grace always suffered some loss when it came to us….The creature’s incapacity stems from sin, which is infinitely opposed to this grace” [C, 365].  It is in her discussions of grace and salvation that the theological determinism of Mère Angélique becomes the most pronounced.

Arnauld’s emphasis on divine causation is not without paradox.  The abbess repeatedly exhorts her listeners and correspondents to act virtuously and to seek God’s favor, as if they exercised authentic free will and responsibility.  But these appeals to moral reform find scant justification in a metaphysics and a moral philosophy that make divine volition the unique cause of physical and ethical action.

d. Gender and Right

The convent is not simply the site for the elaboration of Mère Angélique’s Augustinian philosophy; her corpus defends the right of women, specifically of nuns, to exercise judgments of conscience on matters of theological dispute, even when this entails conflict with political and religious authorities.  It also defends the correlative rights of women to education, to theological culture, and to limited autonomy through the selection of superiors and of constitutional law.

In many writings, Arnauld analyzes how women should exercise their rightful authority in the religious and civic spheres.  Despite the rigorism of her moral views, Mère Angélique argues that flexibility is central in a religious superior’s exercise of authority.  Women superiors must tailor their judgments to the needs of the individual subject.  “Superiors must be extremely reserved in all their words.  They must  weigh all of them according to the golden measure of holy charity.  Whether they are exhorting, reproving, or consoling, they must proportion themselves to the capacity of each soul” [L; to the Anonciade superior in Boulogne; May 14, 1640].  Authentic conventual governance involves respect of each individual’s experience and of the superior’s personal judgment in a particular case.

Legalism constitutes a particular danger for this exercise of personalized spiritual governance.  Mère Angélique criticizes the tendency of overly detailed constitutions in religious orders to suffocate the right of women superiors to exercise substantial freedom in the governance of their subjects.  “In the area of penances, it seems to me that one should usually grant freedom to a superior to act according to her inclinations.  She must try to take into account the spirit of God and the different circumstances and not try to regulate all the nuns according to the same manner.  The letter kills and the spirit gives life” [L; to Monsieur Macquet; January 4, 1635].  These counsels on the proper exercise of authority religious superiors reinforce the principal provisions of the Angelican reform of the convent.  Nuns are to elect their own superiors and to draw up their constitutions and legislation in chapter.  Superiors are to act as the principal spiritual directors of their subjects with the authority to appoint chaplains and to direct schools and retreat programs for laywomen.

In her extensive correspondence with Marie-Louise de Gonzague, queen of Poland, Arnauld evokes the virtues central for the exercise of civil authority by women.  Like nuns, monarchs must practice the annihilation of self that unites the believer completely to God.  “God will treat you in his judgment just as you have treated him here.  He will honor those who have honored him as their sovereign lord, considering themselves nothing at all before his divine majesty” [L; to Marie-Louis de Gonzague, queen of Poland; January 20, 1655].  In particular, the queen must practice the love of enemies, a theological virtue that proves especially difficult in time of war.  “Your soul should often ponder these words of Our Lord Jesus Christ when he was nailed to the cross in the last extremity and by the most horrible malice and cruelty that could be imagined: ‘Father, forgive them; they did not know what they do.’  It was not only those who crucified the Son of God who did not know that he was the king of glory; it is those who make war on you” [L; to Marie-Louise de Gonzague de Poland; November 5, 1655].

As the persecution of Port-Royal intensified, Mère Angélique insisted on the right and duty of women to follow an informed conscience on the theological disputes of the moment.  Appeals to obedience cannot force a nun to assent to a judgment by a political or ecclesiastical authority she believes to be erroneous.  Citing the precedent of Teresa of Avila, Arnauld urges Queen Anne of Austria, the French queen-mother, to intervene on behalf of the Port-Royal nuns against the errors of the church on the question of Jansenism.  “I believe that God will use the wisdom of Your Majesty and the wisdom of the king, as he used Phillip II, the ancestor of Your Majesty, in the past, to save Saint Teresa from the greatest persecution she had ever known.  We see in her writings that even the pope had been wrongly informed about her and the nuns of her order and that his nuncio had been prejudiced against her, thus pushing this conflict, in her own words, to the last step of violence” [L; to Anne of Austria, queen-mother of France; May 25, 1661].  Against the fallible, prudential judgments of both church and state on matters of fact, both nuns and laywomen enjoyed the right to challenge judgments that appeared to wound truth and charity.

4. Interpretation and Relevance

Several factors have impeded a properly philosophical appreciation of the works of Angélique Arnauld.  First, apologists for Jansenism have long argued that Mère Angélique was simply ignorant of the theological controversies erupting during the crisis of the signature.  Gazier’s defense of the Port-Royal nuns (1929) is typical of such apologetic.  Furthermore, Mère Angélique’s spirituality has often been described as practical, more interested in moral than speculative issues.  Undoubtedly, Mère Angélique never read Jansen’s Augustinus.  But she had certainly read Jansen’s Reformation of the Interior Man, translated by her brother Robert Arnauld d’Andilly from the Latin original, and she had studied Saint-Cyran’s Popular Theology, a summary of Jansenist doctrine that served as the convent school’s major catechetical reference.  The misleading portrait of Mère Angélique as a theological innocent baffled by the intricacies of the ecclesiastical quarrel over grace has long closed her canon to philosophical commentary.

Second, the thought of Mère Angélique has often been simply assimilated to the Augustinianism of the male leaders of the Jansenist movement.  While the abbess was clearly influenced by the constellation of Augustinian thinkers who surrounded her, especially her brother Antoine Arnauld and her mentor Saint-Cyran, she maintains her own distinctive voice in this school of radical Augustinianism.  Her analysis of monastic virtue in a conventual context and her defense of the spiritual rights of women against political and ecclesiastical coercion reflect her own theoretical and gendered priorities.

Finally, the philosophy of Mère Angélique has been obscured by its forbidding theological architecture.  The abbess expresses her theories in monastic genres foreign to the philosophical treatise: letters of spiritual direction, abbatial conferences, and commentaries on patristic texts.  The substantive issues which preoccupied her appear similarly foreign to contemporary philosophy.  Her philosophy of divine attributes, moral virtue, and personal rights are embedded in quarrels over grace and monastic reform that can appear arcane.  Philosophical analysis of her works requires a grasp of the early modern theological controversies which provided the occasion for the abbess’s philosophical speculation.

The current revival of interest in the philosophy of Angélique Arnauld is fueled in large part by the feminist effort to expand the cannon of the humanities in the early modern period.  Mère Angélique’s defense of the rights of women to exercise religious and political authority has emerged as a central interest in contemporary commentary on her own writings.  Kostroun’s study of Port-Royal (2003) focuses on the gendered subversion of authority justified by Mère Angélique and her associates.

5. References and Further Reading

All French to English translations above are by the author of this article.

a. Primary Sources

  • Arnauld, Mère Angélique. Entretiens ou conférences de la Révérende Mère Marie-AngéliqueArnauld, abbesse et réformatrice de Port-Royal (Brussels: A. Boudet, 1757).
    • [Transcribed largely by her niece Soeur Angélique de Saint-Jean Arnauld d’Andilly, this series of conferences delivered in the 1650s at Port-Royal expresses the moral rigorism of the abbess.]
  • Arnauld, Mère Angélique. Lettres de la Révérende Mère Marie-Angélique Arnauld, abbesse et réformatrice de Port-Royal, 3 vols. (Utrechet: Aux dépens de la Compagnie, 1742-44).
    • [The facsimile edition of this collection of Mère Angélique’s letters produced by Phénix Éditions in 2003 contains an excellent introduction by Jean Lesaulnier, who identifies the various networks of correspondents with the abbess.]
  • Arnauld, Mère Angélique. Rélation écrite par la Mère Angélique Arnauld, ed. Louis Cognet (Paris: Grasset, 1949).
    • [This autobiographical text is an apologetic for the Angelican reform of Port-Royal and a description of the burgeoning persecution of the convent and the Jansenist movement.]

b. Secondary Sources

  • Bugnion-Secrétan, Perle. La Mère Angélique Arnauld, 1591-1661, après ses écrits (Paris: Cerf, 1991).
    • [This biography of the abbess focuses on her writings, with particular attention to their polemical context.]
  • Chédeozeau, Bernard. “Idéal intellectuel et vie monastique à Port-Royal,” Chroniques de Port-Royal 37 (1997), 57-74.
    • [This article examines the intellectual culture developed at Port-Royal under the Angelican reform and compares it favorably to the anti-intellectual piety dominating most of the convents of the period.]
  • Conley, John J. Adoration and Annihilation: The Convent Philosophy of Notre Dame (Notre Dame, IN: University of Notre Dame Press, 2009), 43-112.
    • [This chapter analyzes and critiques the neo-Augustinian philosophy developed by the abbess in her letters and conferences.]
  • Gastellier, Fabian, Angélique Arnauld (Paris: Fayard, 1998).
    • [This biography studies the politics of the abbess’s civil and ecclesiastical context.]
  • Gazier, Cécile. Histoire du monastére de Port-Royal (Paris: Perrin, 1929).
    • [This pro-Janenist apologetic history depicts a theologically uncultured Mère Angélique, baffled by the arcane doctrinal disputes by which the enemies of Port-Royal attempted to destroy the convent through charges of heresy.]
  • Kostroun, Daniella. “A Formula for Disobedience: Jansenism, Gender, and the Feminist Paradox,” Journal of Modern History 75 (2003), 483-522.
    • [This article analyzes the subversive nature of the gendered arguments employed by Mère Angélique and her associates against royal and episcopal demands to submission on matters of conscience.]
  • Rowan, Mary. “Angélique Arnauld’s Web of Feminine Friendships: Letters to Jeanne de Chantal and the Queen of Poland,” in Les femmes au grand siècle; le Baroque; Musique et littérature; Music et liturgie, ed. David Wetsel et al. (Tübingen: Narr, 2003), 53-59.
    • [This study of the abbess’s correspondence focuses on her relationship with two prominent women who exercised authority in the religious and civic spheres respectively.]

Author Information

John J. Conley
Email: jconley1@loyola.edu
Loyola University in Maryland
U. S. A.

Social and Political Recognition

Acts of recognition infuse many aspects of our lives such as receiving a round of applause from a rapt audience, being spotted in a crowded street by a long-forgotten friend, having an application for a job rejected because of your criminal record, enjoying some words of praise by a respected philosophy professor, getting pulled over by the police because you are a black man driving an expensive car, and fighting to have your same-sex marriage officially sanctioned in order to enjoy the same benefits as hetero-sexual marriages. Evidently the various ways we are recognised (and recognise others) play an important role in shaping our quality of life. Recognition theorists go further than this, arguing that recognition can help form, or even determine, our sense of who we are and the value accorded to us as individuals.

Political theories of recognition, which attempt to reconfigure the concept of justice in terms of due or withheld recognition, can be contrasted with (but set alongside) the rise of multiculturalism, which has produced an array of literature focused on recognising, accommodating and respecting difference. Although these two trajectories overlap, there are important differences between them. Multicultural politics is rooted in the identity politics underlying various social movements that gained prominence during the 1960s, such as the civil rights movement and radical/cultural feminism. These movements tend to emphasise the distinctness and value of their cultural identity and demand group-specific rights to protect this uniqueness. Without depreciating identity politics and multiculturalism, this article is primarily concerned with political theories of recognition, particularly those formulated by Charles Taylor (who is also a prominent figure in multicultural politics), Nancy Fraser and Axel Honneth. These focus on the role played by recognition in individual identity formation and the normative foundation this can provide to theories of justice.

Despite its brief history as an explicitly political concept, philosophical interest in the idea of recognition can be traced to the work of Hegel, who first coined the phrase ‘struggle for recognition’ (kampf um anerkennung). This article begins by clarifying the specific political and philosophical meaning of recognition. It will provides an overview of Hegel’s remarks on recognition before proceeding to identify the contemporary advocates of recognition. It presents the main similarities and differences between these authors before examining some important criticisms levelled at concept of recognition. The conclusion is a reflection upon the increasing influence of recognition and how it may develop in the future.

Table of Contents

  1. Defining Recognition
  2. The Hegelian Legacy
  3. Contemporary Theories of Recognition
    1. Charles Taylor
    2. Axel Honneth
    3. Nancy Fraser
  4. Redistribution or Recognition? The Fraser-Honneth Debate
  5. Criticisms of Recognition
    1. The Reification of Identity
    2. The Accusation of Essentialism
    3. The Danger of Subjectivism
    4. The Problem of the Other
    5. The Post-Structural Challenge
  6. The Future of Recognition
  7. References and Further Reading

1. Defining Recognition

The term ‘recognition’ has several distinct meanings: (1) an act of intellectual apprehension, such as when we ‘recognise’ we have made a mistake or we ‘recognise’ the influence of religion on American politics; (2) a form of identification, such as when we ‘recognise’ a friend in the street; and (3) the act of acknowledging or respecting another being, such as when we ‘recognise’ someone’s status, achievements or rights (upon the different meanings of recognition, see Inwood, 1992: 245-47; Margalit, 2001: 128-129). The philosophical and political notion of recognition predominantly refers to (3), and is often taken to mean that not only is recognition an important means of valuing or respecting another person, it is also fundamental to understanding ourselves.

Various attempts have been made to clarify precisely what is, and is not, to count as an act of recognition (perhaps most comprehensively by Ikäheimo and Laitinen, 2007). Ikäheimo (2002: 450) defines recognition as ‘always a case of A taking B as C in the dimension of D, and B taking A as a relevant judge’. Here A and B indicate two individual persons, specifically A is the recogniser and B the recognisee. C designates the attribute recognised in A, and D is the dimension of B’s personhood at stake. For example, I may recognise you as a person possessing certain rights and responsibilities in light of your being an autonomous, rational human being (for more on defining the structure of recognition, see Laitinen, 2002). A key feature of Ikäheimo’s definition is that it requires not only that someone be recognised by another, but that the person being recognised judges that the recogniser is capable of conferring recognition. This means that we must place sufficient value in the recogniser in order for their attitude towards us to count as recognitive. Brandom (2009) approaches this idea through the idea of authority, arguing that a genuine instance of recognition requires that we authorise someone to confer recognition. Similarly, one can gain authority and responsibility by petitioning others for recognition. Consequently, one has authority only insofar as one is recognised as authoritative.

We may not consider being valued by a wilful criminal as any sort of recognition in the sense being defined here. We do not judge them capable of conferring value on us, as we do not accord any value or respect to them. Similarly, someone who is coerced into recognising us may also fail to count as a relevant judge. A king who demands recognition of his superiority from all his subjects, simply in virtue of his being king, and threatens to punish them if they disobey, does not receive any meaningful kind of recognition for the subjects do not genuinely choose to confer value on him. Thus, in recognising another, we must also be recognised as a subject capable of giving recognition. This indicates that reciprocity or mutuality is likely to be a necessary condition of appropriate recognition (for a discussion of this point, see Laden, 2007).

A further issue in defining recognition is whether it is generative or responsive (Laitinen, 2002; Markell, 2007). A generation-model of recognition focuses on the ways in which recognition produces or generates reasons for actions or self-understandings. This is to say that someone ought to act in a certain way in virtue of being recognised as, for example, recognising someone as a rational being will generate certain duties and responsibilities for both the person being recognised and those who interact with him. A response-model of recognition focuses on the ways in which recognition acknowledges pre-existing features of a person. Here, to recognise someone is to acknowledge them as they already really are (Appiah, 1994: 149). This means that there are reasons why one ought to give recognition to someone prior to the act of recognition itself. Thus, for example, we ought to recognise someone’s ability to self-determination because they possess certain features, such as rational autonomy. The demand for recognition in a response-model is produced and justified through pre-existing characteristics of a person, whilst in the generation-model it is the act of recognition itself which confers those characteristics onto a person through their being recognised as such. The former is a case of person ‘knowing’, whilst the latter is a case of person ‘making’ (see Markell, 2002).

A third issue is whether groups or collectives can count as recognisers and recognisees. For example, when speaking of recognising a particular cultural group, do we mean we recognise that group qua a group, or as a collection of individuals? Similarly, does the granting of certain rights or respect apply to the group itself or the individual members belonging to that group? (For a detailed discussion and defence of group-differentiated minority rights, see Kymlicka, 1995). These questions revolve, at least in part, around the ontological status afforded to groups or collectives. Advocates of a politics of recognition are not always clear regarding whether or not groups can be granted recognition. Debates over the legitimacy or sovereignty of a state may depend upon the extent to which we recognise it as legitimate or sovereign. Important discussions of groups as entities include Tuomela (2007), Jones (2009) and List and Pettit (2011). However, as yet there has been little analysis of the connection between recognition and the ontology of groups. Charles Taylor (1994) argues for the importance of collective rights, but gives little consideration to whether collectives are genuine subjects over-and-above the individuals that constitute them. In his more recent work, Axel Honneth (Fraser and Honneth 2003: 159ff.) appears to give consideration to the possibility of groups as the object of recognition, but his general emphasis is on individual rights and recognition.

Common to all social and political notions of recognition is the shift from an atomistic to an intersubjective, dialogical understanding of the individual. Because our identity is shaped precisely through our relations to others, our being recognised by them, feelings of self-worth, self-respect and self-esteem are possible only if we are positively recognised for who we are. To this extent, theories of political recognition, which were first formulated in the 1990s, developed out of political movements centred upon such concepts as gender, sexuality, race, ethnicity and culture. Recognition, according to Taylor (1994), is an indispensible means of understanding and justifying the demands of these identity movements, which have had a major impact on society, particularly from the 1960s onwards. Consequently, for many political theorists, recognition is an integral component of any satisfactory modern theory of justice as well as the means by which both historical and contemporary political struggles can be understood and justified. In order to understand how such theories developed, it is necessary to examine their genesis within Hegel’s philosophy.

2. The Hegelian Legacy

Descartes’ dualistic philosophy of consciousness created an influential legacy in which the mind was characterised as a private theatre and knowledge of the self was achieved through introspection. This atomistic conception of self, encapsulated in Descartes’ cogito, filtered into the transcendental idealism of Kant (despite his objections to Descartes’ philosophy) and the transcendental phenomenology of Husserl, as well as being present in the contract theories of Hobbes and Locke. Against this trend there emerged a strongly intersubjective conception of selfhood that found expression through the concept of recognition, the founder of which is typically identified as Hegel. Although Hegel has undoubtedly influenced the contemporary understanding of recognition more than any other philosopher, Hegel was himself inspired by the work of Johann Fichte (see Williams, 1992). In his Foundations of Natural Right (1796/7), Fichte argues that the ‘I’ (the ego or pure consciousness) must posit itself as an individual to be able to understand itself as a free self. In order for such self-positing to occur, the individual must recognise itself as ‘summoned’ by another individual. This is to say, the individual must acknowledge the claims of other free individuals in order to understand itself as a being capable of action and possessing freedom. Hence, one’s freedom is both rendered possible and yet limited by the demands made on us by others. A key feature of this idea is that the same applies in reverse – the other can only comprehend itself as free by being recognised as such. Hence, mutual recognition is necessary for human beings to understand themselves as free individuals (as beings capable of ‘I-hood’). Through this analysis, Fichte produced a thoroughly intersubjective ontology of humans and demonstrated that freedom and self-understanding are dependent upon mutual recognition.

These ideas were developed in greater detail by Hegel. In his Phenomenology of Spirit Hegel (1807: 229) writes, ‘Self-consciousness exists in itself and for itself, in that, and by the fact that it exists for another self-consciousness; that is to say, it is only by being acknowledged or “recognized”’. Self-knowledge, including one’s sense of freedom and sense of self, is never a matter of simple introspection. Rather, understanding ourselves as an independent self-consciousness requires the recognition of another. One must recognise oneself as mediated through the other. As Sartre, who was heavily influenced by Hegel, wrote, ‘The road of interiority passes through the Other’ (Sartre, 1943: 236-7). The idea of recognition is developed further in Hegel’s mature works, particularly Elements of the Philosophy of Right (1821), where it becomes an essential factor in the development of ethical life (sittlichkeit). According to Hegel, it is through the intersubjective recognition of our freedom that right is actualised. Rights are not instrumental to freedom; rather they are the concrete expression of it. Without recognition we could not come to realise freedom, which in turn gives rise to right. The work of Hegel consciously echoes the Aristotelian conception of humans as essentially social beings. For Hegel, recognition is the mechanism by which our existence as social beings is generated. Therefore, our successful integration as ethical and political subjects within a particular community is dependent upon receiving (and conferring) appropriate forms of recognition.

The part of Hegel’s work to lay bare certain fundamental dynamics involved in recognition is the oft-discussed master-slave dialectic which appears in the Phenomenology (see Pinkard, 1996: 46ff; Stern, 2002: 83ff.). Hegel introduces the idea of a ‘struggle for recognition’, describing an encounter between two self-consciousnesses which both seek to affirm the certainty of their being for themselves (Hegel, 1807: 232ff.). Such a conflict is described as a life-and-death struggle, insofar as each consciousness desires to confirm its self-existence and independence through a negation or objectification of the other. That is, it seeks to incorporate the other within its field of consciousness as an object of negation, as something which this consciousness is not, thus affirming its own unfettered existence. Of course, the other also tries to negate this consciousness, thus generating the struggle which results in affirmation of one self-consciousness at the cost of the negation or annihilation of the other. Only in this way, Hegel observes, only by risking life, can freedom be obtained. However, there is a key moment with this struggle. Namely, consciousness realises that it cannot simply destroy the other through incorporating it within itself, for it requires the other as a definite other in order to gain recognition. Thus, it must resist collapsing the other into itself, for to do so would also be to annihilate itself. It would be starving itself of the recognition it requires in order to be a determinate self-consciousness.

Within Hegel’s radical reworking of how the individual subject is understood, autonomy becomes a contingent, social and practical accomplishment; it is an intersubjectively-mediated achievement which is never simply given or guaranteed but always dependent upon our relations with others. This co-dependency results in mutual relations of recognition which are the condition for understanding oneself as a genuinely free being, albeit a free being which acknowledges, and thus adjusts itself, to the freedom of others. Discussing the process of recognition, Hegel (1807: 230) notes that it ‘is absolutely the double process of both self-consciousnesses… Action from one side only would be useless, because what is to happen can only be brought about by means of both’. As a result, these two self-consciousnesses ‘recognize themselves as mutually recognizing one another’ (ibid: 231). Hegel characterises this mutuality, which cannot be coerced but be freely given and received, as being at home in the other. Such a relation with another is the condition for the phenomenological experience of freedom and right. Consequently, our interactions with others are not a limitation on freedom, but rather the ‘enhancement and concrete actualization of freedom’ (Williams, 1997: 59).

We see now how the master-slave dialectic of recognition is inherently unstable and unsatisfying. The master has dominion over the slave, reducing the latter to the status of a mere ‘thing’ through refusing to recognise it as a free and equal self-consciousness. The slave, realising that life as a slave is better than no life at all, accepts this relation of dominance and subservience. Whilst the slave receives no recognition from the master, the master has ‘earned’ the recognition of a slave which it considers as less-than-human. Such recognition is not ‘real’ recognition at all and yet, within this Hegel’s dialectic of recognition, the master requires the recognition of the slave in order to gain some modicum of self-understanding and freedom. The recognition of the slave is ultimately worthless, for it is not the recognition of a free self-consciousness, which alone can grant the recognition on another required for self-certainty of existence and freedom. Trapped in this fruitless relation, the slave becomes the ‘truth’ of the master, and so the master, paradoxically, becomes enslaved to the slave. For Hegel, relations of domination provide a vicious spiral of recognition. They lead nowhere but to their own destruction. Hence recognition must always take place between equals, mediated through social institutions which can guarantee that equality and thus produce the necessary mutual relations of recognition necessary for the attainment of freedom. It is precisely this last point that recent recognition theorists have seized upon and elaborated into comprehensive discussions of justice.

3. Contemporary Theories of Recognition

a. Charles Taylor

Much contemporary interest in recognition was undoubtedly fuelled by Charles Taylor’s essay ‘Multiculturalism and the Politics of Recognition’ (1994), first published in 1992. Taylor’s lucid and concise article is often treated as the classic expression of a theory of recognition. However, it would be more accurate to say that Taylor awoke a general interest in the idea of recognition. His short essay provides a series of reflections and conjectures which, whilst insightful, do not constitute a full-blown theory of recognition. However, its exploratory nature and non-technical language has helped install it as the common reference point for discussions of recognition.

Taylor begins with the assertion that ‘a number of strands in contemporary politics turn on the need, sometimes the demand, for recognition’ (Taylor, 1994: 25). He identifies such a demand as present in the political activities of feminism, race movements and multiculturalists (for a critical discussion of this point, see Nicholson, 1996). The specific importance of recognition lies in its relationship to identity, which he defines as ‘a person’s understanding of who they are, of their fundamental characteristics as a human being’ (Taylor, 1994: 25). Because identity is ‘partly shaped by recognition or its absence’, then ‘Nonrecognition or misrecognition can inflict harm, can be a form of oppression, imprisoning someone in a false, distorted, and reduced mode of being’ (ibid.). Underlying Taylor’s model is the Hegelian belief that individuals are formed intersubjectively (see Section II). Our individual identity is not constructed from within and generated by each of us alone. Rather, it is through dialogue with others that we negotiate our identity. Taylor refers to these others as ‘significant others’, meaning those people who have an important role in our lives (that is, family, friends, teachers, colleagues, and so forth.). The idea that our sense of who we are is determined through our interaction with others initiates a shift from a monologic to a dialogic model of the self.

Taylor is keen to stress just how important recognition is, referring to it as ‘a vital human need’ (ibid: 26) and stating that misrecognition ‘can inflict a grievous wound, saddling its victims with a crippling self-hatred’ (ibid: 26). Deploying a brief historical narrative, Taylor argues that the collapse of social hierarchies, which had provided the basis for bestowing honour on certain individuals (that is, those high up on the social ladder), led to the modern day notion of dignity, which rests upon universalist and egalitarian principles regarding the equal worth of all human beings. This notion of dignity lies at the core of contemporary democratic ideals, unlike the notion of honour which is, he claims, clearly incompatible with democratic culture. This picture is complicated by the fact that alongside this development of dignity there emerged also a new understanding of ‘individualised identity’, one in which the emphasis was on each person’s uniqueness, which Taylor defines as ‘being true to myself and my own particular way of being’ (ibid: 28). Taylor refers to this idea of uniqueness as the ideal of authenticity, writing ‘Being true to myself means being true to my own originality, which is something only I can articulate and discover. In articulating it, I am also defining myself’ (ibid: 31).

Taylor has been accused of adopting an essentialist view of the self, on the basis that there is some inner ‘me’ waiting to be uncovered and displayed to (recognised by) the world (see section V. b). However, he is quick to point out that the discovery of our authenticity is not simply a matter of introspection. Rather, it is through our interactions with others that we define who we are. Nor is there an end point to this dialogue. It continues throughout our entire lives and does not even depend upon the physical presence of a specific other for that person to influence us. Consider, for example, the way an imaginary conversation with a deceased partner might influence how we act or view ourselves. The importance of recognition lies precisely in the fact that how others see (might) us is a necessary step in forming an understanding of who we are. To be recognised negatively, or misrecognised, is to be thwarted in our desire for authenticity and self-esteem.

Taylor’s uses these insights to construct a politics of equal recognition. He identifies two different ways in which the idea of equal recognition has been understood. The first is a politics of equal dignity, or a politics of universalism, which aims at the equalisation of all rights and entitlements. In this instance, all individuals are to be treated as universally the same through recognition of their common citizenship or humanity. The second formulation is the politics of difference, in which the uniqueness of each individual or group is recognised. Rousseau bitterly noted that man, having shifted from a state of self-sufficiency and simplicity to one of competition and domination that characterises modern society, has come to crave the recognition of their difference (Rousseau, 1754). In this detrimental situation, man is rendered dependent upon the views of others, craving what Rousseau termed ‘amour propre’ through the admiration of those around him, leading to an endless competition for greater achievements and respect and thus robbing man of his independence. For Rousseau, this desire for individual distinction, achievement and recognition conflicts with a principle of equal respect

Returning to Taylor, he notes that there is also a universal basis to this second political model insofar as all people are entitled to have their identity recognised: ‘we give due acknowledgement only to what is universally present – everyone has an identity – through recognizing what is peculiar to each. The universal demand powers an acknowledgement of specificity’ (Taylor, 1994: 39). One consequence of this politics of difference is that certain rights will be assigned to specific groups but not others. The two approaches can be summed as follows. The politics of equal dignity is difference-blind, whereas the politics of difference is, as the name suggests, difference-friendly (this does not mean that a politics of equal dignity is not also ‘friendly’ towards difference, but rather that differences between individuals cannot be the normative foundation for the assignment of certain rights or entitlement to some individuals or groups but not others).

Taylor defends a politics of difference, arguing that the concept of equal dignity often (if not always) derives its idea of what rights and entitlement are worth having from the perspective of the hegemonic culture, thus enforcing minority groups to conform to the expectations of dominant culture and hence relinquish their particularity. Failure to conform will result in the minority culture being derided and ostracised by the dominant culture. As Taylor (ibid: 66) notes, ‘dominant groups tend to entrench their hegemony by inculcating an image of inferiority in the subjugated’. A clear instance of this can be seen in de Beauvoir’s claim that woman is always defined as man’s ‘other’ or ‘shadow’ (de Beauvoir, 1949). Woman exists as a lack; characterised through what she does not possess or exhibit (namely, male and masculine traits). Similarly, civil rights movements have frequently protested that the image of the ‘human’ was inevitably white, Western, educated, middle-class and wealthy. An example of how this plays out in everyday life is the recent, though now generally discarded, practice of labelling pink crayons ‘flesh’ coloured. Both feminist and race theorists have tried to convey the idea that the white male is simply another particular instance of humanity, rather than its ‘default’ image or constitutive, universal norm. This point was strongly made by Fanon (1952), who detailed how racism infiltrates the consciousness of the oppressed, preventing psychological health through the internalisation of subjection and otherness. This in turn alienates the black person from both their society and their own body, owing to the fact that the world is defined in terms of ‘whiteness’ and thus as something essentially irretrievably different (alien) to them.

b. Axel Honneth

Axel Honneth has produced arguably the most extensive discussion of recognition to date. He is in agreement with Taylor that recognition is essential to self-realisation. However, he draws more explicitly on Hegelian intersubjectivity in order to identify the mechanics of how this is achieved, as well as establishing the motivational and normative role recognition can play in understanding and justifying social movements. Following Hegel (1807; 1821) and Mead (1934), Honneth identifies three ‘spheres of interaction’ which are connected to the three ‘patterns of recognition’ necessary for an individual’s development of a positive relation-to-self.  These are love, rights, and solidarity (Honneth, 1995: 92ff; also Honneth 2007, 129-142).

The mode of recognition termed ‘love’ refers to our physical needs and emotions being met by others and takes the form of our primary relationships (that is, close friends, family and lovers). It provides a basic self-confidence, which can be shattered through physical abuse.  The mode of recognition termed ‘rights’ refers to the development of moral responsibility, developed through our moral relations with others. It is a mutual mode of recognition ‘in which the individual learns to see himself from the perspective of his [or her] partner in interaction as a bearer of equal rights’ (Honneth, 1992: 194). The denial of rights through social and legal exclusion can threaten one’s sense of being a fully active, equal and respected member of society. Finally, the mode of recognition termed ‘solidarity’ relates to recognition of our traits and abilities. It is essential for developing our self-esteem and for how we become ‘individualised’, for it is precisely our personal traits and abilities that define our personal difference (Honneth, 1995: 122). Consequently, unlike the relations of love and rights, which express universal features of human subjects, esteem ‘demands a social medium that must be able to express the characteristic differences between human subjects in a universal, and more specifically, intersubjectively obligatory way’ (ibid.). All three spheres of recognition are crucial to developing a positive attitude towards oneself:

For it is only due to the cumulative acquisition of basic self-confidence, of self-respect, and of self-esteem… that a person can come to see himself or herself, unconditionally, as both an autonomous and an individuated being and to identify with his or her goals and desires (ibid: 169).

According to Honneth, the denial of recognition provides the motivational and justificatory basis for social struggles. Specifically, it is through the emotional experiences generated by certain attitudes and actions of others towards us that we can come to feel we are being illegitimately denied social recognition. This argument makes use of Dewey’s theory of emotion as intentionally orientated. Certain emotional states, such as shame, anger and frustration, are generated by the failure of our actions. Conversely, more positive emotional states are generated through successful action. The experience of negative emotional states can, in theory, reveal to us that an injustice is taking place (namely, that we are not being given due and appropriate recognition). However, as Honneth points out, feelings of shame or anger need not (indeed, do not) necessarily disclose relations of disrespect (ibid: 138). What they provide is the potential for identifying the occurrence of an injustice which one is justified in opposing. The experience of disrespect is the raw material from which normatively justified social struggles can be formulated. Furthermore, it is only within certain social contexts, those in which the ‘means of articulation of a social movement are available’ (ibid: 139), that experiences of disrespect provide the motivational basis for political struggles (see Honneth, 2007). Presumably, disrespect in other contexts would lead to individual acts of retaliation or undirected violence, rather than coordinated resistance.

This phenomenological approach to recognition thus locates the source and justification of social struggles in the experiences and expectations of recognition. Of course, as noted, it requires the further steps of (a) locating these experiences within a socially-generated framework of emancipatory discourse; and (b) the establishment of common experiences amongst individuals for these individual frustrations to develop into social struggles. Therefore, it would be naïve to think that Honneth is blind to the importance of, say, ensuring the means and rights to collective political action within societies. But the fundamental component of any attempt to identify injustice and vindicate the necessary remedies must be located in the individual’s experiences of disrespect (Honneth, 2007) (for a potential problem with this position, see Rogers, 2009).

In order to justify these claims, Honneth ascribes an inherent expectation of recognition to humans, referring to demands generated from such an expectation as the ‘“quasi-transcendental interests” of the human race’ (Fraser and Honneth, 2003: 174). It is only through the failure of such expectations that recognition can be a motivational source, arising via negative emotional experiences. This assumption allows Honneth to assess societal change as a developmental process driven by moral claims arising from experiences of disrespect. Honneth (1995: 168) summarises his somewhat teleological account (a product of Honneth’s Hegelian and Aristotelian tendencies) as follows: ‘Every unique, historical struggle or conflict only reveals its position within the development of society once its role in the establishment of moral progress, in terms of recognition, has been grasped’.

The positing of an approximate and ideal end-state, presumably one in which full recognition reigns supreme, allows a distinction between progressive, emancipatory struggles and those which are reactionary and / or oppressive. Therefore, from this general position of enabling the self-realisation of one’s desires, characteristics and abilities, we can assess current socio-political struggles and analyse their future directions so as to ensure their promoting of the conditions for self-realisation. Honneth is careful to specify that he is not advocating a single, substantive set of universal values and social arrangements. Rather, his concept of ‘the good’ is concerned with the ‘structural elements of ethical life’ which enable personal integrity (ibid: 172). Therefore, the posited ‘end-point’ from which normative claims can be made must emanate from structural relations outlined in the three distinct patterns of recognition which foster a positive relation-to-self (for a discussion of Honneth’s conception of the good / ethical life, see Zurn, 2000). Here, Honneth is trying to retain a Kantian notion of respect and autonomy through identifying the necessary conditions for self-realisation and self-determination, akin to a Kantian kingdom of ends in which all individuals receive and confer recognition on one another. Simultaneously, in stressing the minimal or ‘bare’ conditions necessary for this, he aims to avoid committing himself to a singular, substantial conception of the good life and thus resists the dangers of reproducing an exclusivist and exclusionary conception of what constitutes the good life.

c. Nancy Fraser

Whereas there are broad areas of agreement between Honneth and Taylor, Nancy Fraser is keen to differentiate her theory of recognition from both of their respective positions. Fraser’s overarching theme throughout her works on recognition is the dissolving of the assumed antithesis between redistribution and recognition (arguably this assumption is a consequence of critical theory’s Marxist roots, within which framework Fraser’s work undoubtedly emerges from). Thus far, the presentation of recognition and redistribution has been presented (at least implicitly) as an either/or decision. Fraser believes that this binary opposition derives from the fact that, whereas recognition seems to promote differentiation, redistribution supposedly works to eliminate it. The recognition paradigm seems to target cultural injustice, which is rooted in the way people’s identities are positively or negatively valued. Individuals exist as members of a community based upon a shared horizon of meanings, norms and values. Conversely, the distribution paradigm targets economic injustice, which is rooted in one’s relation to the market or the means of production (Fraser and Honneth, 2003: 14). Here, individuals exist in a hierarchically-differentiated collective class system which, from the perspective of the majority class who are constituted by a lack of resources, needs abolishing.

According to Fraser, both these forms of injustice are primary and co-original, meaning that economic inequality cannot be reduced to cultural misrecognition, and vice-versa. Many social movements face this dilemma of having to balance the demand for (economic) equality with the insistence that their (cultural) specificity be met. Fraser (1997: 19) gives the example of the feminist movement by posing the question, ‘How can feminists fight simultaneously to abolish gender differentiation [through economic redistribution] and to valorize gender specificity [through cultural recognition]?’. There is a clear divergence here between the monistic models of Taylor and Honneth, in which recognition is the foundational category of social analysis and distribution is treated as derivative, and Fraser’s dualistic model. Whereas Honneth thinks a sufficiently elaborated concept of recognition can do all the work needed for a critical theory of justice, Fraser argues that recognition is but one dimension of justice, albeit a vitally important one.

The disagreement over whether or not distribution can be made to supervene on recognition arises from the differing interpretations of recognition. According to Fraser (Fraser and Honneth 2003: 29), one can understand recognition as either (a) a matter of justice, connected to with the concept of a universal ‘right’ (Fraser’s position); or (b) a matter of self-realisation, connected with historically-relative cultural conceptions of the ‘good’ (Honneth’s and Taylor’s position). In (b) Fraser draws out the Aristotelian idea of eudaimonia (flourishing), which runs throughout Honneth’s teleological account. Contra Honneth and Taylor, Fraser does not look to situate the injustice of misrecognition in the retardation of personal development. Rather, she identifies it with the fact that ‘some individuals and groups are denied the status of full partners in social interaction simply as a consequence of institutionalized patterns of cultural value in whose construction they have not equally participated and which disparage their distinctive characteristics or the distinctive characteristics assigned to them’ (ibid). Addressing injustices arising from misrecognition therefore means looking at the discursive representations of identities in order to identity how certain individuals are assigned a relatively inferior social standing. Hence, on Fraser’s model, misrecognition should not be construed as an impediment to ethical self-realization (as it is for Taylor and Honneth). Instead, it should be conceived as an institutionalised relation of subordination.

Owing to her identification of recognition with social status, the evaluative element in Fraser’s account is the notion of ‘parity of participation’. According to this principle, ‘justice requires that social arrangements permit all (adult) members of society to interact with one another as peer’ (ibid: 36). In effect, recognition is required in order to guarantee that all members of society have an equal participation in social life. Crucially, participatory parity also requires material / economic redistribution in order to guarantee that people are independent and ‘have a voice’ (ibid). Because Honneth equates recognition with self-realisation, the derivative issues of redistribution are only generated to the extent that they inhibit this personal development. For Fraser, injustice in the form of both misrecognition and maldistribution is detrimental to the extent that it inhibits participatory parity.

Fraser considers two possible remedies for injustice, which transcend the redistribution-recognition divide by being applicable to both. The first is ‘affirmation’, which incorporates any action which corrects ‘inequitable outcomes of social arrangements without disturbing the underlying framework that generates them’ (ibid: 23). The second is ‘transformation’, which refers to ‘remedies aimed at correcting inequitable outcomes precisely by restricting the underlying generative framework’ (ibid). Fraser’s concept of transformation highlights her belief that certain forms of injustice are ingrained within ‘institutionalized patterns of cultural value’ (ibid: 46). Certain forms of inequality, including those of race and gender, derive from the signifying effect of socio-cultural structures. These discursive frameworks, situated within language and social arrangements, reproduce hierarchical binary oppositions such as ‘heterosexual/homosexual’, ‘white/black’ and ‘man/woman’. Thus, the solution is not simply a matter of revaluing heterosexual, female or black identities. Rather, one must attempt to deconstruct the binary logic which situates people as inherently inferior, creating a ‘field of multiple, debinarized, fluid, ever-shifting differences’ (Fraser, 1997: 24). One key aspect of this transformative approach is that, unlike the affirmative approach which aims to alter only one particular group’s sense of worth or material situation, it would change everyone’s sense of self. The proposal made by Fraser, then, is the radical restructuring of society, achieved through transformative redistribution (that is, socialism) and recognition (cultural deconstruction). It should be noted that in her more recent work on recognition (that is, Fraser 2000; 2001), she resists offering any particular remedies, arguing instead that the required response to injustice will be dictated by the specific context. Thus, she appears to distance herself from the more ‘deconstructive’ elements of her earlier work (see Zurn, 2003).

4. Redistribution or Recognition? The Fraser-Honneth Debate

In a very important discussion, Fraser and Honneth (2003) defend their respective theories of recognition (see also Honneth, 2001). Underlying the disagreements between them is their respective positions regarding the distribution / recognition debate. As noted in Section III, Fraser believes that recognition and distribution are two irreducible elements of a satisfactory theory of justice. This is to say, they are of equal foundational importance – the one cannot be collapsed into the other. Honneth, on the other hand, contends that issues of distribution are ultimately explained and justified through issues of recognition. As he writes, ‘questions of distributive justice are better understood in terms of normative categories that come from a sufficiently differentiated theory of recognition’ (ibid: 126). He begins justifying this claim through a historical survey of political movements and unrest amongst the lower classes during the early stages of capitalism. What marked such activities was the commonly held belief that the honour and dignity of the members of the lower classes were not being adequately respected. Summarising these findings, Honneth (ibid: 132) proclaims that ‘subjects perceive institutional procedures as social injustice when they see aspects of their personality being disrespected which they believe they have a right to recognition’.

One important consequence of this view is that it undermines the received wisdom that collective identity movements are a recent ‘modern’ phenomenon. In actual fact, according to Honneth, experiences of disrespect and denigration of an individual’s or group’s identity are the constitutive feature of all instances of social discontent. Portraying ‘recognition’ as the sole preserve of cultural minorities struggling for social respect is therefore highly misleading and obscures the fact that challenges to the existing social order are always driven by the moral experience of failing to receive what is deemed to be sufficient recognition (ibid: 160). Any dispute regarding redistribution of wealth or resources is reducible to a claim over the social valorisation of specific group or individual traits. The feminist struggle over the gendered division of labour is, according to Honneth, primarily a struggle regarding the prevailing assessment of achievement and worth which has had important redistributive effects, such as a trend towards greater access to, and equality within, the workplace and the acknowledgement of ‘female’ housework. The division that Fraser makes between economic distribution and cultural recognition is, Honneth claims, an arbitrary and ultimately misleading one that ignores the fundamental role played by recognition in economic struggles, as well as implying that the cultural sphere of society can be understood as functioning independently of the economic sphere.

Fraser (ibid: 30ff.) offers four advantages of her status model over Honneth’s monistic vision of justice as due recognition (for a discussion of these, see Zurn, 2003). Arguably the most important of these is that, in locating injustice in social relations governed by cultural patterns of representations, she can move beyond both Taylor’s and Honneth’s reliance on psychology as the normative force underlying struggles for recognition. Recalling that Honneth locates the experiences of injustice in the emotional responses to frustrated expectations of due recognition, Fraser argues that she is able to ‘show that a society whose institutionalized norms impede parity of participation is morally indefensible whether or not they distort the subjectivity of the oppressed’ (ibid: 32). The ideal of participatory parity gives Fraser her normative component, for it provides the basis on which different recognition claims can be judged. Namely, a valid recognition claim is one in which subjects can show that ‘institutionalized patterns of cultural value deny them the necessary intersubjective conditions [for participatory parity]’ (ibid: 38). Honneth’s invocation of pre-political suffering, generated by the perceived withholding of recognition, as the motivating force behind social movements is thus rejected by Fraser as seriously problematic. In particular, she says, the idea that all social discontent has the same, single underlying motivation (misrecognition) is simply implausible. Honneth rejects other motivational factors such as ‘resentment of unearned privilege, abhorrence of cruelty, aversion of arbitrary power… antipathy to exploitation, dislike of supervision’ that cannot not simply be reduced down to, or subsumed by, an overarching expectation of appropriate recognition.

Another problem with Honneth’s psychological model of experiences of injustice is that, so Fraser argues, it shifts the focus away from society and onto the self, thus ‘implanting an excessively personalized sense of injury’ (ibid: 204). This can lead to the victim of oppression internalising the injustice or blaming themselves, rather than the discursive and material conditions within which they are situated as oppressed or harmed. Indeed, Fraser proceeds to point out that there can be no ‘pure’ experience of moral indignation caused by withheld or inappropriate recognition. There is no realm of personal experience that is not experienced through a particular linguistic and historical horizon, which actively shapes the experience in question (see section V. d). Thus to introduce a ‘primordial’ sense of moral suffering is, Fraser claims, simply incoherent (similar concerns are raised by McNay, 2008: 138ff.). Honneth cannot invoke psychological experiences of disrespect as the normative foundation for his theory of recognition as they cannot be treated as independent of the discursive conditions within which the subject is constituted. To do so is to rely on an ultimately unjustifiable transcendental account of the subject’s access to their sense of moral worth grounded in the right to recognition.

In his response to Fraser, Honneth points out that she can necessarily focus only on those social movements that have already become visible. By analysing the ways in which individuals and groups are socially-situated by institutionalised patterns of cultural value, Fraser limits herself to only those expressions of social discontent that have already entered the public sphere. The logic of this criticism seems to be that, if (in)justice is a matter of how society signifies subjects’ abilities and characteristics, then it can only address those collective subjectivities which are currently socially recognised. In other words, there could be a plethora of individuals and groups who are struggling for recognition which have not yet achieved public acknowledgement and thus have not been implicated within positive or negative social structures of signification. There appears some weight to this criticism, for a successful critical social theory should be able to not only critique the status quo, but identify future patterns of social resistance. If, on Fraser’s account, justice is a matter of addressing how subjects are socially-situated by existing value structures, then it seems to lack the conceptual apparatus to look beyond the present. The ability to identify social discontent must, Honneth argues, be constructed independently of social recognition, and therefore ‘requires precisely the kind of moral-psychological considerations Fraser seeks to avoid’ (ibid: 125). In ignoring the individual’s experiences of injustice as the disrespect of aspects of their personality, a social theory can only address the present situation, rather than exploring the normative directions of future social struggles. It is out of the frustration of individual expectations of due recognition that new social movements will emanate, rather than the pre-existing patterns of signification which currently hierarchically situate subjects.

5. Criticisms of Recognition

Despite its influence and popularity, there are a number of concerns regarding the concept of recognition as a foundational element in a theory of justice. This article cannot hope to present an exhaustive list, so instead offers a few of the most common critiques.

a. The Reification of Identity

Perhaps the one most frequently voiced criticism is that regarding the reification of group identity. Put simply, the concern is that, in initiating an identity politics in which one demands positive recognition for a group’s specific characteristics, specific characteristics can be seen as necessarily constitutive of this group and thus any group member who does not display these characteristics risks being ostracised. Such claims are often cloaked in a language of ‘authenticity’ which leads to demands for conformity amongst individual members of the group in order to gain acceptance and approval. This risks producing intergroup coercion and enforcing conformity at the expense of individual specificity.

To give an example, discussed by Appiah (1994) in his response to Taylor’s essay on recognition, the construction of a black politics in which black identity is celebrated can provide a sense of self-worth and dignity amongst historically denigrated black communities. However, it can also lead to a ‘proper’ way of being black, one which all members of the black community must demonstrate in order to partake in this positive self-image. Such expectations of behaviour can lead, Appiah notes (ibid: 163), to one form of tyranny being replaced by another. Specifically, individuals who fail to exemplify authentic ‘black’ identity can find themselves once again the victims of intolerance and social exclusion. Similar dynamics of exclusion can be seen in the debate within certain feminist circles about whether lesbians can be properly considered ‘women’. Extrapolating from these concerns, Markell (2003) argues that Taylor conflates individual identity with group identity with the result that agency is rendered a matter of adopting the identity one is assigned through membership of one’s community. Consequently, the critical tension between the individual and community is dissolved, which leaves little (if any) space for critiquing or resisting the dominant norms and values of one’s community (see also Habermas, 1991: 271).

The reification of group identity can also lead to separatism through generating an ‘us-and-them’ group mentality. By valorising a particular identity, those other identities which lack certain characteristics particular to the group in question can be dismissed as inferior. This isolationist policy runs counter to the ideal of social acceptability and respect for difference that a politics of recognition is meant to initiate. Reifying group identity prevents critical dialogue taking place either within or between groups. Internal group members who challenge apparently ‘authentic’ aspects of their culture or group identity can be labelled as traitors, whilst non-group members are dismissed as unqualified to comment on the characteristics of the group on the basis that they are ‘outsiders’. The result is a strong separatism and radical relativism in which intergroup dialogue is eliminated. This can mask over the ways in which various axes of identity overlap and thus ignores the commonalities between groups. For example, the focus of black feminists on ‘black culture’ and the oppression this has suffered can lead to a failure to recognise their commonality with women in other cultures. Conversely, the tendency among feminists to focus on the concept of ‘woman’ can lead them to ignore the potential alliances they might share with other oppressed groups that don’t focus on gender injustice. Underlying this critique is the idea that identity is always multilayered and that each individual is always positioned at the intersection of multiple axes of oppression. Simply reducing one’s sense of oppression to a single feature of identity (such as race or gender) fails to acknowledge the way that each feature of identity is inextricably bound up with other features, so that, for example, race and gender cannot be treated as analytically distinct modes of dominance.

b. The Accusation of Essentialism

Similar to the concerns over reification, there is a concern that recognition theories invoke an essentialist account of identity. This has particularly been the case with regards Taylor’s model of recognition (see McNay, 2008: 64ff). Critics accuse recognition theory of assuming that there is a kernel of selfhood that awaits recognition (see, for example, Heyes, 2003). The struggle for recognition thus becomes a struggle to be recognised as what one truly is. This implies that certain features of a person lie dormant, awaiting discovery by the individual who then presents this authentic self to the world and demands positive recognition for it. Although Taylor is keen to stress that his model is not committed to such an essentialist account of the self, certain remarks he makes do not help his cause. For example, in describing the modern view of how we create a sense of ‘full being’, he notes that, rather than connecting with some source outside of ourselves (such as God or the Platonic Good), ‘the source we have to connect with is deep within us. This fact is part of the massive subjective turn of modern culture, a new form of inwardness, in which we come to think our ourselves as beings with hidden depths’ (Taylor, 1994: 29). Taylor proceeds to note that ‘Being true to myself means being true to my own originality, which is something only I can articulate and discover’ (ibid: 31) and that authenticity ‘calls on me to discover my own original way of being. By definition, this way of being cannot be socially derived, but must be inwardly generated’ (ibid: 32).

A more radical account of intersubjectivity can be found in Arendt (1958). Examining the processes by which the subject reveals who they are, she shifts the focus away from a personal revelation on the part of the agent and into the social realm: ‘it is more than likely that the “who” , which appears so clearly and unmistakably to others, remains hidden from the person itself, like the daimōn in Greek religion which accompanies each man throughout his life, always looking over his shoulder from behind and thus visible only to those he encounters’ (Arendt, 1958: 179-80). One important consequence of this idea is that, in order to address the question of ‘who’ we are, we must be willing to relinquish control of any such answer. In so doing, we place ourselves into the hands of others. As Arendt writes, ‘This unpredictability of outcome [of personal disclosure] is closely related to the revelatory character of action and speech, in which one discloses one’s self without ever either knowing himself or being able to calculate beforehand whom he reveals’ (ibid: 192) (for an Arendt-inspired critique of recognition, see Markell, 2003).

c. The Danger of Subjectivism

Taylor mitigates his position and, arguably, eschews any form of essentialism, by arguing that we always work out our identity through dialogue with others. However, there is a possibility that he slips towards a subjectivist position, for it seems that it is the individual who ultimately decides what their ‘true’ identity is. For example, Taylor (1994: 32-3) states that this dialogue with others requires that we struggle with and sometimes struggle against the things that others want to see in us. However, he does not state to what we appeal to in this potential struggle with others. If it is ultimately our sense of who we are, then this would seem to undermine the very conditions of intersubjectivity that Taylor wants to introduce into the notion of personal identity. For, if one is the ultimate judge and jury on who one is, then those around us will simply be agreeing or disagreeing with our pre-existent or inwardly-generated sense of self, rather than playing an ineliminable role in its constitution.

Again, it is unlikely that Taylor would endorse any form of subjectivism. Indeed, his turn towards intersubjective recognition is precisely meant to resist the idea that one simply decides who one is and demands that others recognise oneself in such a way. Taylor would certainly seem critical of the existential tradition, which emphasised the need for one to define oneself and provide meaning to the world. Although Sartre deployed the language of intersubjectivity (see V. d) and highlighted the importance of the other, his analysis of the in-itself and the for-itself, coupled with describing how we are each born alone and must carry the weight of the world on our shoulders (with no-one able to lighten the burden), suggests an ego which negates (and hence is radically separated from) the world. This split between ‘I’ and ‘you’ renders any notion of dialogical identity construction impotent.

Recognition, contrasted with this existential picture, theories seem well equipped to resist any accusation that they slide into subjectivism. However, they must provide a criterion from which to judge whether individual and collective demands for recognition are legitimate. For example, it cannot be the case that all demands for recognition are accepted, for we are unlikely to want to recognise the claims of a racist or homophobic group for cultural protection. There is a danger that Taylor’s model does not explicitly state the conditions by which acceptable claims for recognition can be separated from unacceptable claims. His politics of difference is premised on ‘a universal respect for the human capacity to form one’s identity (Taylor, 1994: 42). Hence he seems committed to respecting difference qua difference, regardless of the particular form this difference takes. There is a sense that, as long as recourse is made to an ‘authentic’ life, then the demand for recognition should be met. But no matter how strongly the racist group insists upon their authenticity, we would be likely to resist recognising the value and worth of their identity as racists.

d. The Problem of the Other

Certain theorists have tended to cast recognition in a far more negative, conflictual light. Typically, they interpret Hegelian recognition as evolving an inescapable element of domination between, or appropriation of, subjects. Perhaps the most notable of such thinkers is Sartre (1943), whose account of intersubjectivity appears to preclude any possibility of recognition functioning as a means of attaining political solidarity or emancipation. According to Sartre, our relations with other people are always conflictual as each of us attempts to negate the other in an intersubjective dual. The realisation of our own subjectivity is dependent upon our turning the other into an object. In turn, we are made to feel like an object within the gaze of the other. Sartre’s famous example is the shameful, objectifying experience of suddenly feeling the ‘look’ or ‘gaze’ of another person upon us when carrying out a contemptible act. In this moment of shame, I feel myself as an object and am thus denied existence as a subject. My only hope is to make the other into an object. There are no equal or stable relations between people; all interactions are processes of domination.

Whereas Sartre focuses on the problem of being recognised, Levinas (1961) turns to the ethical issues attending how one recognises others. According to Levinas, Hegelian recognition involves an unavoidable appropriation or assimilation of the other into one’s own subjectivity. By this he means that in recognising the other we render them ‘knowable’ according to our own terms, thus depriving him or her of their irreducible ‘alterity’ or difference. Levinas believes that the denying of such difference is the fundamental ethical sin as it fails to respect the other in their absolute exteriority, their absolute difference to us. In effect, to recognise someone is to render them the same as us; to eliminate their inescapable, unapprehendable and absolute alterity (Yar, 2002).

An alternative perspective on the self-other relationship can be found in Merleau-Ponty who argues that the other is always instigated within oneself, and vice-versa, through the potential reversibility of the self-other dichotomy (that is, that the self is also a potential other; seeing someone necessarily involves the possibility of being seen). Merleau-Ponty explicitly rejects the Levinasian perspective that the other is an irreducible alterity. Rather, the self and other are intertwined through their bodily imbrications in the world. He describes our respective perspectives on the world as slipping into one another and thus being brought together: ‘In reality, the other is not shut up inside my perspective of the world, because this perspective itself has no definite limits, because it slips spontaneously into the other’s’ (Merleau-Ponty, 1945: 411). Consequently, there is no ‘problem’ of the other, for the other is already contained within our being, as we are within theirs. This resonates with Heidegger’s characterisation of Being (Dasein) as being-with-others. We are always already alongside others, bound up in relations of mutuality that prevent any strict ontological distinction between self, other and world.

The Levinasian and Sartrean accounts of the self-other relationship can be criticised from a hermeneutic perspective for failing to acknowledge the fact that understanding is essentially a conversation with another, and that a simple reduction of the other to a sameness with oneself, or a pure objectification of the other, would preclude the possibility of a genuine interaction from which mutual understanding could arise (Gadamer, 1960). Levinas presents a monological account of understanding, ignoring the fundamentally dialogical nature of intersubjectivity. As Taylor (1994: 67) approvingly noted, understanding according to Gadamer is always a fusion of horizons, a coming-to-understanding between two individuals who require the perspective of the other in order to make sense of their own (and vice-versa). Neither the total incorporation of the other into the perspective of the recognisee, nor the reduction of the other to pure object, is possible on a hermeneutic account of meaning and understanding.

e. The Post-Structural Challenge

Concurrent with the rise of identity politics, there has been a trend towards ‘deconstructive’ or ‘destabilising’ accounts of the individual subject. Rather than representing a single critical perspective on recognition and identity politics, the post-structural challenge can be understood as a broad term incorporating various attempts at showing how the subject is always constructed through and within networks of power and discourse (e.g. Foucault, 1980; Butler, 1990; Haraway 1991; Lloyd, 2005; McNay, 2008).

Perhaps the most notable theorist in this regard is Foucault, who develops a detailed account of the way in which the subject is constituted through discursive relations of power. Within Foucault’s theory, the individual becomes the ‘site’ where power is enacted (and, importantly, resisted or reworked). Foucault’s genealogical method was employed precisely in order to explore the conditions under which we, as subjects, exist and what causes us to exist in the way that we do. According to Foucault, not only are we controlled by truth and power, we are created by it too. Concerning his genealogical method, Foucault (1980: 117) writes, ‘One has to dispense with the constituent subject, to get rid of the subject itself, that’s to say, to arrive at an analysis which can account for the constitution of the subject within a historical framework’. This leads to a far more problematic view of the subject than is generally found within recognition theories. Specifically, issues of power, coercion and oppression are seen as coeval with identity formation and intersubjective relations.

This suggests that there can be no instances of mutual recognition that do not simultaneously transmit and reproduce relations of power. As Foucault (1988: 39) notes, ‘If I tell the truth about myself… it is in part that I am constituted as a subject across a number of power relations which are exerted over me and which I exert over others’. Critics of recognition theorists argue that they ignore the fundamental relationship between power and identity formation, assuming instead that intersubjective relations can be established which are not mediated through power relations. McNay (2008) develops this critique through a discussion of Bourdieu’s concept of habitus, arguing that Taylor assumes that language is an expressive medium that functions independently of, and antecedent to, power and thus fails to analyse how ‘self-expression is constitutively shaped by power relations’ (ibid: 69).

Another important theorist in this regard is Judith Butler, whose account of gender identity develops certain key themes of Foucauldian theory as well as insights offered up by Derrida on the re-iteration of norms as fundamental to identity formation. Butler (1988: 519) begins outlining this project by arguing that gender is ‘in no way a stable identity or locus of agency from which various acts proceed; rather it is an identity tenuously constituted in time – an identity instituted through a stylized repetition of acts’. Gender is created through acts which are ‘internally discontinuous’. These acts produce the ‘appearance of substance’, but this apparition is no more than ‘a constructed identity, a performative accomplishment which the mundane social audience, including the actors themselves, come to believe and to perform in the mode of belief’ (ibid: 520; see also Butler 1990: 141). Essentially, we internalise a set of discursive practices which enforce conformity to a set of idealised and constructed accounts of gender identity that reinforce heterosexual, patriarchal assumptions about what a man and woman is meant to be like.

Turning the commonsense view of gender on its head, Butler argues that the various acts, thoughts and physical appearances which we take to arise from our gender are actually the very things which produce our sense of gender. Gender is the consequence, rather than the cause, of these individual, isolated, norm-governed acts. Because acts which constitute gender are governed by institutional norms which enforce certain modes of behaviour, thought, speech, and even shape our bodies, all positive constructions of gender categories will be exclusionary. Consequently, not only does Butler deny any ontological justification for a feminist identity politics, but she also rejects the possibility of a political justification. Identity categories ‘are never merely descriptive, but always normative, and as such, exclusionary’ (1992: 16). As a result, ‘Any effort to give universal content or specific content to the category of women… will necessarily produce factionalization’ and so, ‘“identity” as a point of departure can never hold as the solidifying ground of a feminist political movement’ (ibid: 15).

Infusing issues of power into the recognition debate therefore presents problems for existent models of recognition. Taken to its extreme, contemporary feminist accounts of gender and identity may be seen as reason to decisively reject recognition politics. If, as Butler suggests, gender identity is intrinsically connected to power, then to demand recognition for one’s identity could seen as becoming compliant with existing power structures. Such a position would have no possibility of radically critiquing the status quo and would thus potentially forfeit any emancipatory promise. Upon the relationship between the individual and power, Foucault (1980: 98) writes: ‘[Individuals] are not only its [power’s] intent or consenting target; they are always also the elements of its articulation. In other words, individuals are the vehicles of power, not its point of application’. The concern is that there is no form of self-realisation in recognition models that does not, in some way, reproduce patterns of dominance or exclusion.

6. The Future of Recognition

Despite the above reservations regarding the concept of recognition and its political application, there is a growing interest in the value of recognition as a normative socio-political principle. The increasingly multicultural nature of societies throughout the world seems to call for a political theory which places respect for difference at its core. In this regard, recognition theories seem likely to only increase in influence. It should also be noted that they are very much in their infancy. It was only in the 1990s that theorists formulated a comprehensive account of recognition as a foundational concept within theories of justice. To this extent, they are still in the process of being fashioned and re-evaluated in the light of critical assessment from various schools of thought.

For many thinkers, the concept of recognition captures a fundamental feature of human subjectivity. It draws attention to the vital importance of our social interactions in formulating our sense of identity and self-worth as well as revealing the underlying motivations for, and justifications of, political action. It seems particularly useful in making sense of notions of authenticity and the conditions for agency, as well as mapping out the conditions for rational responsibility and authority (see Brandom, 2009). As a result, recognition can be seen as an indispensible means for analysing social movements, assessing claims for justice, thinking through issues of equality and difference, understanding our concrete relations to others, and explicating the nature of personal identity. Although there remain concerns regarding various aspects of recognition as a social and political concept, it is entirely possible that many of these will be addressed and resolved through future research.

7. References and Further Reading

  • Alexander, Jeffrey C. and Lara, Maria P., ‘Honneth’s New Critical Theory of Recognition’. New Left Review. 1/220 (1996): 126-136
  • Appiah, Kwame, A. ‘Identity, Authenticity, Survival: Multicultural Societies and Social Reproduction’. Multiculturalism: Examining the Politics of Recognition. Ed. Amy Gutmann. Princeton: Princeton University Press. 1994: 149-163
  • Arendt, Hannah. The Human Condition. Chicago: Chicago University Press, 1958
  • Brandom, Robert. Reason in Philosophy: Animating Ideas. Cambridge, Ma.: Harvard University Press, 2009
  • Brandom, Robert. ‘The Structure of Desire and Recognition: Self-consciousness and Self-Constitution’. Philosophy & Social Criticism. 33:1 (2007): 127-150
  • Butler, Judith. ‘Performative Acts and Gender Constitution: An Essay in Phenomenology and Feminist Theory’. Theatre Journal, 40:4 (1988): 519-531
  • Butler, Judith.Gender Trouble: Feminism and the Subversion of Identity. New York: Routledge, 1990
  • Butler, Judith. ‘Contingent Foundations: Feminism and the Question of “Postmodernism”’. Feminists Theorize the Political. Ed. Butler, Judith and Joan W. Scott. London: Routledge, 1992. 3-21
  • De Beauvoir, Simone. The Second Sex. Harmondsworth: Penguin, 1972 [1949]
  • Fanon, Frantz. Black Skin, White Masks. London: Pluto, 1986 [1952]
  • Fichte, Johann G. Foundations in Natural Right: According to the Principles of the Wissenschaftslehre. Cambridge: CUP, 2000 [1796/7]
  • Foucault, Michel. Power/Knowledge: Selected Interviews and Other Writings. Brighton: Harvester, 1980
  • Foucault, Michel. Politics, Philosophy, Culture: Interviews and Other Writings 1977-1984. Ed. Kritzman, L. D. London: Routledge, 1988
  • Fraser, Nancy. Justice Interruptus: Critical Reflections on the “Postsocialist” Condition. New York: Routledge, 1997
  • Fraser, Nancy. ‘Rethinking Recognition’. New Left Review. 3 (2000): 107-120
  • Fraser, Nancy. ‘Recognition Without Ethics?’. Theory Culture & Society. 18:2-3 (2001): 21-42
  • Fraser, Nancy and Axel Honneth. Redistribution or Recognition: A Political-Philosophical Exchange. London: Verso, 2003
  • Gadamer, Hans-Georg. Truth and Method. London: Sheed and Ward, 1975 [1960]
  • Haraway, Donna. Simians, Cyborgs, and Women: The Reinvention of Nature. London: Free Association Books, 1991
  • Hegel, Georg W. G. Phenomenology of the Spirit. Trans. A. V. Miller. Oxford: Clarendon Press, 1977 [1807]
  • Hegel, Georg W. G. Elements of the Philosophy of Right. Trans. H. B. Nisbet. Cambridge: CUP, 1991 [1821]
  • Habermas, Jürgen. ‘A Reply’. Communicative Action. Ed. Honneth, Axel and Hans Joas. Cambridge: Polity, 1991
  • Heyes, Cressida. ‘Can There Be a Queer Politics of Recognition?’. Recognition, Responsibility, and Rights: Feminist Ethics and Social Theory. Ed. Fiore, Robin N. and Hilde L. Nelson. Lanham: Rowman and Littlefield, 2003: 53-66
  • Honneth, Axel. ‘Integrity and Disrespect: Principles of a Conception of Morality Based on the Theory of Recognition’. Political Theory, 20:2 (1992): 187-201
  • Honneth, Axel. The Struggle for Recognition: The Grammar of Social Conflicts. Cambridge: Polity, 1995
  • Honneth, Axel. ‘Recognition or Redistribution? Changing Perspectives on a Moral Order of Society’. Theory, Culture & Society. 18:2-3 (2001): 43-44
  • Honneth, Axel. Disrespect: The Normative Foundations of Critical Theory. Cambridge: Polity, 2007
  • Ikäheimo, Heikki. ‘On the Genus and Species of Recognition.’ Inquiry, 45 (2002): 447-62
  • Ikäheimo, Heikki and Arto Laitinen. ‘Analyzing Recognition: Identification, Acknowledgement, and Recognitive Attitudes Towards Persons.’ Recognition and Power: Axel Honneth and the Tradition of Social Theory. Ed. Bert van den Brink and David Owen. New York: CUP, 2007: 33-56
  • Inwood, Michael. A Hegel Dictionary. Oxford: Blackwell, 1992
  • Jones, Peter. ‘Equality, Recognition and Difference’. Critical Review of International Social and Political Philosophy. 9:1 (2006): 23-46
  • Jones, Peter (ed.). Group Rights. Aldershot: Ashgate, 2009
  • Kymlicka, Will. Multicultural Citizenship: A Liberal Theory of Minority Rights. Oxford: Clarendon Press, 1995
  • Laden, Anthony S. ‘Reasonable Deliberation, Constructive Power, and the Struggle for Recognition’. Recognition and Power: Axel Honneth and the Tradition of Social Theory. Ed. Bert van den Brink and David Owen. New York: CUP, 2007: 270-289
  • Laitinen, Arto. ‘Interpersonal Recognition: A Response to Value or a Precondition of Personhood?’ Inquiry, 45 (2002): 463-78
  • Levinas, Emmanuel. Totality and Infinity: An Essay on Exteriority. Pittsburgh: Duquesne University Press, 1969 [1961]
  • List, Christian and Philip Pettit, Group Agency. Oxford: OUP, 2011
  • Lloyd, Moya. Beyond Identity Politics: Feminism, Power & Politics. London: Sage, 2005
  • Margalit, Avishai. ‘Recognizing the Brother and the Other.’ Proceedings of the Aristotelian Society: Supplementary Volumes, vol. 75 (2001): 127-139
  • Markell, Patchen. ‘The Recognition of Politics: A Comment on Emcke and Tully’. Constellations, 7:4 (2002): 496-506
  • Markell, Patchen. Bound By Recognition. New Jersey: Princeton University Press, 2003
  • Markell, Patchen. ‘The Potential and the Actual: Mead, Honneth, and the “I”’. Recognition and Power: Axel Honneth and the Tradition of Social Theory. Ed. Bert van den Brink and David Owen. New York: CUP, 2007: 100-134
  • McBride, Cillian. ‘Demanding Recognition: Equality, Respect, and Esteem’. European Journal of Political Theory 8:1 (2009): 96-108
  • McNay, Lois. Against Recognition. Cambridge: Polity, 2008
  • Mead, George H. Mind, Self and Society: From the Standpoint of a Social Behaviourist. Chicago: Chicago University Press, 1934
  • Nicholson, Linda. ‘To Be or Not To Be: Charles Taylor and the Politics of Recognition’. Constellations, 3:1 (1996): 1-16
  • Merleau-Ponty, Maurice. Phenomenology of Perception. Trans. C. Smith. Oxford: Routledge, 2002 [1945]
  • Pinkard, Terry. Hegel’s Phenomenology: The Sociality of Reason. Cambridge: CUP, 1996
  • Rogers, Melvin L. ‘Rereading Honneth: Exodus Politics and the Paradox of Recognition’. European Journal of Political Theory, 8:2 (2007): 183-206
  • Rousseau, Jean-Jacques. ‘A Discourse on the Origin of Inequality’. Discourses and Other Early Political Writings. Trans. Victor Gourevitch. Cambridge: CUP, 1997 [1754]
  • Sartre, Jean-Paul. Being and Nothingness: An Essay in Phenomenological Ontology. London: Methuen & Co., 1957 [1943]
  • Stern, Robert. Hegel and the Phenomenology of Spirit. London: Routledge, 2002
  • Taylor, Charles. ‘The Politics of Recognition’. Multiculturalism: Examining the Politics of Recognition. Ed. Amy Gutmann. Princeton: Princeton University Press. 1994: 25-73
  • Thompson, Simon. The Political Theory of Recognition: A Critical Introduction. Cambridge: Polity, 2006
  • Tully, James. Strange Multiplicity: Constitutionalism in the Age of Diversity. Cambridge: CUP, 1995
  • Tuomela, Raimo. The Philosophy of Sociality: The Shared Point of View. Oxford: OUP. 2007
  • Williams, Robert R. Recognition: Fichte and Hegel on the Other. Albany: State University of New York Press, 1992
  • Williams, Robert R. Hegel’s Ethics of Recognition. Albany: State University of New York Press, 1992
  • Yar, Majid. ‘Recognition and the Politics of Human(e) Desires’. Recognition and Difference: Politics, Identity, Multiculture. Ed. Featherstone, Mike and Scott Lash. London: Sage, 2002
  • Young, Iris M. Justice and the Politics of Difference. New Jersey: Princeton University Press, 1990
  • Zurn, Christopher F. ‘Anthropology and Normativity: A Critique of Axel Honneth’s “Formal Conception of Ethical Life”’. Philosophy and Social Criticism, 26:1 (2000): 115-214
  • Zurn, Christopher F. ‘Identity or Status? Struggles Over ‘Recognition’ in Fraser, Honneth and Taylor’. Constellations, 10:4 (2003): 519-537

Author Information

Paddy McQueen
Email: paddymcqueen@gmail.com
Queen’s University
Northern Ireland

Ninon de Lenclos (1620—1705)

LenclosA salonnière during the reign of Louis XIV, Ninon de Lenclos embodied libertinism in both theory and practice.  As a notorious courtesan, Lenclos scandalized France by her numerous affairs with prominent statesmen and ecclesiastics.  As a philosopher, she justified sexual license by her naturalistic theories of human nature and of morality.  In Lenclos’s perspective, the human person is demonstrably a part of material nature.  Allegedly spiritual human experiences, love in particular, are only a sophisticated variation on animal instincts.  Rather than being a pursuit of spiritualized virtue, human moral conduct is the effort to expand pleasure and to eliminate pain.  Within the hierarchy of pleasures, romantic love constitutes the pinnacle.  Lenclos condemns the ascetical ethics of monastic Christianity and of Platonism because it has exalted a spiritualized love which is illusory and impossible to practice.  In her mechanistic theory of the world, material causation is responsible for many of the intellectual and volitional activities other philosophers wrongly attribute to an immaterial soul.  Through her naturalistic metaphysics and her ethics of the primacy of pleasure, Lenclos contributed to the Epicurean revival of the French Renaissance.  In her insistence on the equal rights of women and men to the pursuit of pleasure, Lenclos developed a gendered version of Epicureanism which challenged her society’s subordination of women to men.

Table of Contents

  1. Biography
  2. Works
  3. Epicurean Philosophy
    1. Theory of Love
    2. Ethics of Pleasure
    3. Metaphysical Naturalism
    4. Gender Critique
  4. Reception and Interpretation
  5. References and Further Reading
    1. Primary Sources
    2. Secondary Sources

1. Biography

Anne de Lenclos was born in Paris on November 10, 1620, and she died on October 17, 1705. She was nicknamed “Ninon” by her father. Her family was philosophically divided.  A devout Catholic, her mother attempted to raise the precocious daughter according to the strict moral standards of Counter-Reformational Catholicism.  A neo-Epicurean, her father introduced her to a more skeptical world outlook and to a libertine code of ethics.  From an early age, Ninon clearly sided with her father’s perspective.

A child prodigy, Ninon de Lenclos enchanted Parisian salons with her mastery of the lute and clavichord.  She mastered multiple foreign languages, including Spanish and Italian.  A voracious reader, she maintained a life-long affection for her favorite philosopher, Montaigne.

During her teen years, Lenclos began her career as one of Paris’s most celebrated courtesans.  During her many public affairs, she became the mistress of numerous prominent men.  Statesmen included the Grand Condé, Gaston de Cligny; Louis de Mornay, marquis de Villarceaux; and François, duc de la Rochefoucauld.  Clergy included the Abbé de Chateauneuf and Canon Gédoyn.  Cardinal Richelieu counted among one of her spurned petitioners.

If Lenclos’s bold libertinism titillated the more skeptical salons, it scandalized the capital’s devout circles.  Queen Anne of Austria, regent of France, placed Lenclos under house arrest at the Convent of the Madelonnettes in 1656.  Thanks to the intervention of the exiled Christine, Queen of Sweden, who visited Lenclos in her convent cell, Lenclos was quickly freed from the convent and returned to her life as a courtesan.

Starting in 1667, Lenclos conducted a salon at the Hôtel de Sagonne in Paris.  Major philosophical writers participating in the salon included the Cartesian Fontenelle and a group of neo-Epicurean authors: Charles de Saint-Évremond; Antoine Gombaud, chevalier de Méré; and Damien Mitton.  The religious skepticism of this circle permeated the libertine atmosphere of Lenclos’s salon.  Prominent literary figures included the aphorist François de la Rochefoucauld; the fabulist Jean de La Fontaine; the memorialist Louis de Rouvroy, duc de Saint-Simon; the chronicler Roger de Rabutin, comte de Bussy; the poet Jean Chapelle; the dramatist Jean Racine; and the literary critic Nicolas Boileau.  Lenclos was especially close to Molière, whose anticlerical Tartuffe received an earlier reading in her salon.  Other artists included the painter Nicolas Mignard and the musical composer Jean-Baptiste Lully.  One of the central members of the salon was Henri de Sévigné, the husband of the famous epistoler; the correspondence between the Marquis de Sévigné and Lenclos would survive as Lenclos’s major piece of philosophical reflection.  Numerous women also participated in the salon.  Notable female authors included Marguerite de la Sablière; Henriette de Coligny, comtesse de la Suze; and Françoise d’Aubigné, the future Madame de Maintenon and morganatic wife of Louis XIV.

Over the decades, Lenclos’s salon became a protected meeting place for those who treated Christianity with skepticism and who championed a neo-Epicurean ethics of pleasure, with romantic love canonized as the supreme good.  Lenclos impressed her guests with her wit and her defense of a life devoted to the pursuit of refined pleasure without conventional religious restraints.  To the outrage of the devout, she presented lectures on love, with detailed counsels on romantic conquest and disentanglement.  Female students were admitted for free; male students paid tuition.  A generous patron of authors in need, she provided the young Voltaire with a substantial gift for the purchase of books in her will and testament.

2. Works

The major extant philosophical work of Lenclos is found in her correspondence.  In 1750, Crébillon fils published a posthumous collection of Lenclos’s correspondence with the Marquis de Sévigné [LMDS].  In the letters,  Lenclos details her philosophy of love and the naturalistic metaphysics and ethics underpinning it.  Lenclos conducted a later correspondence (1697-1702) with Saint-Évremond, a neo-Epicurean friend and disciple of Gassendi.  First published in 1752, these letters express her views on aging and death.

A 1659 pamphlet, The Coquette Avenged, has long been attributed to her.  This brief work defends the possibility of a virtuous life independent of all formal religious influence.  Contemporary literary scholars doubt that Lenclos is the actual author of this work, but it does express the secularist moral philosophy which characterized Lenclos and her inner salon circle.

3. Epicurean Philosophy

Throughout her career, Lenclos declared herself a disciple of Epicurus. In a letter to her, Saint-Évremond defines the neo-Epicurean creed of their milieu:

It would be useless to press the arguments, repeated a hundred times by the Epicureans, that the love of pleasure and the abolition of pain are the primary and most natural inclinations noticed in all people.  Wealth, power, honor, and virtue contribute to our happiness, but the enjoyment of pleasure, let us call it voluptuousness, to sum up everything in a word, is the true aim and purpose to which all human acts are inclined. (Letter to the modern Leontium)

For Lenclos, the activity of love is the clearest human manifestation of the fundamental inclination to maximize pleasure and to minimize pain.  Rather than the virtues, it is the passions which dominate the human will in its moral choices.  Natural inclination determines psychological interaction as surely as it does the physical interaction of bodies.  In exploring the romantic pursuit of pleasure, Lenclos argues for the fundamental equality and reciprocity between the genders.

a. Theory of Love

The sentiment of love is predominantly an instinct.  It is a passion which owes little to reason.  “The precise truth is that love is just a blind instinct which one must personally experience in order to appreciate it.  It is an appetite which one has for one object in preference to another.  One is not able to provide reasons for why one has this particular taste (LMDS no. 2).”  The instinctual and emotive nature of love precludes rational justification for its emergence and development.

Lenclos stresses the sensate nature of love.  The stereotyped practices of the lover indicate the sensual nature of the passion and its associated states.  “Now tell me honestly: If your love were not the work of the senses, would you experience such pleasure in pondering that form, those eyes you find so beguiling, that mouth which you describe to me in such glowing colors (LMDS no. 11)?”  While friendship may be contracted with many different types of people, love maintains a close connection with physical attraction.  The obsessions common to the state of love indicate its sensate framework.

The attempt to tie the phenomenon of love to human reason leads to illusion.  Especially damaging is the effort to refine love according to the canons of chivalry.

If you attempt to walk in the footsteps of our ancient heroes of the romances and attempt to develop great and restrained sentiments, you will soon discover that this alleged heroism only turns love into a sad and occasionally lethal folly.  Love in fact is fanaticism.  If you separate it from the romantic baggage public opinion has added to it, it will soon give you its proper sort of pleasure and happiness.  Trust me: If it were reason that designed the affairs of the heart, love would be insipid (LMDS no. 4).

Divorced from its distinctive emotive violence, love would disappear and deteriorate into simple affection based on convenience or calculation.

Love also eludes moral subordination.  It is basically an amoral passion.  The characterization of the moral character of an amorous experience is the responsibility of the moral agent who has undergone it.  “Love is a passion which is neither good nor bad in and of itself.  Only those who have been affected by it can decide whether it is good or evil (LMDS no. 8).”

Especially illusory is the effort to conflate love with moral virtue.  Despite efforts to subordinate love to virtue, love will govern even the most virtuous person as the natural instinct which it is.

Oh, you mortals, who rely so much on the power of your virtue!  No matter how great your strength may be, there are moments when the most virtuous person becomes the weakest.  The reason for this strange fact is that nature is always pursuing us.  It is always aiming to achieve its ends.  The desire for love in a woman is a substantial part of her natural constitution; her virtue has only been patched on (LMDS no. 10).

The effort to present love as a virtue masks the fundamental fact that love is a natural instinct, accountable to neither intellect nor will.  Ascetical efforts to eliminate the arational dimensions of love will fail, given this natural inclination’s violent claim over the human person.

Since love can be explained by natural causation, it should be the object of neither praise nor blame.  It is no more moral or immoral than the human instinct toward hydration and nutrition.  “If I were you, I would not speculate on whether it is a good or bad thing to fall in love.  I would rather have you speculate on whether it is good or bad to be thirsty, or whether it should be forbidden to give someone a drink just because some people might end up inebriated (LMDS no.9).”  The natural causation of human love justifies Lenclos’s sexual libertinism.  Like other natural impulses, sexual desire should be discussed in an objective, non-moral way.  The possible distortions of this impulse should not lead to a moral censure of it or an attempt to repress it.

Lenclos’s naturalistic theory of the origin of love often uses mechanistic rhetoric.  Echoing Descartes, Lenclos perceives amorous desire as the reflex reactions of the body-machine to external stimuli.  “We have our need to have our emotions stimulated.  Connected to our emotions is our basic physical machinery.  This machinery is the ultimate and necessary cause of love (LMDS no. 8).”  This mechanistic account of the origin and nature of human love reinforces Lenclos’s critique of efforts to rationalize or moralize what is simply a powerful natural instinct.  Attempts to locate love’s origins in the allegedly spiritual faculties of intellect and will not only obscure its actual material roots; they ground ethical codes of love which mandate an unrealistic regulation of a powerful and necessary impulse.

b. Ethics of Pleasure

According to Lenclos, love is only the highest of goods in the hierarchy of pleasures which human beings pursue.  “All that I can say in favor of love is that it gives us a greater pleasure than any of the other comforts of life.  It pulls us out of our routines and stirs us up.  It is love which satisfies one of our most urgent needs (LMDS no.8).”  An authentic moral compass gauges the greater pleasures and the greater pains in light of the natural human inclinations which demand satisfaction.  Herein lies the authentic path of human happiness.

Although love represents the highest pleasure known to the mature adult, it is a complex pleasure, often bearing concomitant pain.

When I spoke about temper I only meant the type which gives a stronger attraction, anxiety, and a little jealousy: that, in a word, which springs from love alone and not from any natural vulgarity, the vulgarity we often call ‘bad temper.’  When it is love that makes a woman difficult, when that alone is the cause of her prickliness, how can a lover be so insensitive as to complain about it?  Don’t those rough spots only demonstrate the violence of this passion?  Personally, I have always believed that those lovers who try to keep themselves within reasonable bounds are not completely in love.  Can we really be in love without permitting ourselves to be pushed on by the fire of a consuming impetuosity, without experiencing all the commotion it necessarily causes?  No, without any doubt (LMDS no.5)!

The mental and physical anguish provoked by the experience of authentic love is only a necessary dolor intrinsic to the greater pleasure of love itself.  In fact, this violent moodiness constitutes one of the darker pleasures for the connoisseurs of sophisticated passion.

Love’s mixed pleasure is especially challenging for women.  The contemporary Frenchwoman suffers from two social desires springing from romantic love: the desire to make her romantic conquest known by others and the desire to avoid censure by society.

A woman is always trying to balance two irreconcilable passions which continually agitate her soul: the desire to please and the fear of dishonor.  You can easily recognize our embarrassment.  On the one hand, we are consumed with the desire to have an audience to notice the effect of our charms….We would like the entire world to witness the favors we encounter and the homage paid to us….We enjoy the despair of our rivals and the indiscretions which betray the sentiments we inspire in others.  We are delighted in proportion to the extent of the misery they suffer….But such sweet pleasures bring great bitterness….A sober and reasonable woman always prefers her reputation to romantic celebrity (LMDS no. 23).

The social context of love in the era of the salon complicates its mixed pleasures.  The desire to be acclaimed as a triumphant courtesan, enjoying the rarified pleasure of revenge on one’s rivals, conflicts with the desire to maintain one’s moral status in a civil society which swiftly punishes the least trace of sexual transgression.  The pleasure of sexual conquest contests the more sober pleasure of social esteem.

In her defense of an ethics of pleasure, Lenclos criticizes the dominant ethics of virtue.  Led by the Platonists, philosophers have long mistakenly transformed the natural inclination of love, dominated by the axis of pain and pleasure, into the moralized field of virtue and vice.

Isn’t love a passion?  Don’t very upright people argue that the passions and the vices are the same thing?  Is vice ever more seductive than when it wears the cloak of virtue?  In order to corrupt virtuous souls, it is sufficient for the alleged vice of love to appear in potential form.  The Platonists deified love in an idealized form.  In every age, in order to justify the passions, it was necessary to apotheosize them.  What am I saying here?  Am I so forward as to play the iconoclast with a prestigious superstition?  Such temerity!  Don’t I deserve to be attacked by all women for questioning their favorite cult (LMSD no.27)?

Both the condemnation of love as a vice and the exaltation of Platonic love to a virtue mask the material origins and nature of the amatory impulse.  The contemporary morals of love occult the amoral nature of this mechanical inclination and delude moral agents into the censure of all carnal romance or into the pursuit of an impossibly rarified love emptied of all passion.

In her critique of the ethics of virtue, Lenclos attacks the moral asceticism that has long distorted the practice of love in France.  Both Christianity and philosophy bear responsibility for the refusal to accord pleasure its natural rights in the human experience of love.

How I pity our good ancestors!  What they thought was a mortal weariness and a melancholic madness we consider a joyous folly and a delightful delirium.  They were fools.  They preferred the austerities of deserts and rocks to the pleasures of a garden bursting with flowers.  The habit of reflection has visited such prejudices upon us (LMDS no.8).

Christian and philosophical asceticism has not only attempted to eradicate sexual pleasure by considering love as a vice; it has turned all the pleasures of civilization into a temptation to be shunned in favor of the hermit-like illusions of the desert.

c. Metaphysical Naturalism

Lenclos’s treatment of love as a natural impulse is part of a broader naturalist metaphysics. The workings of nature, often conceived in mechanistic terms, cause many of the human actions wrongly interpreted as the offspring of a spiritual intellect or will.

The itinerary of romantic love indicates the all-embracing power of this natural causation.  A romantic affair which begins in the illusion that it is the artifice of refined virtue quickly reveals its debt to the imperious powers of nature.

At the beginning of their affair, lovers fancy themselves inspired by the noblest and most refined sentiments.  They exhaust their ingenuity, exaggerations, and the gushing of the most exquisite metaphysics.  For a period they are intoxicated with the belief that their love is of a superior nature.  But let us follow them as their affair unfolds.  Nature quickly recovers its rights and re-assumes its influence.  Soon enough, vanity, tired by the effusions of such an exalted purpose, leaves the heart free to express its feelings without any restraint.  The day arrives when these lovers become dissatisfied with the pleasures of love.  After having traveled a long circuit, they are surprised to find themselves at the very point where a peasant, following the inclinations of nature, would have begun (LMDS no. 9).

Even the most exalted romantic itinerary ultimately reveals its debt to the same forces of nature which control the amorous urges of the most illiterate peasant.  Nature, rather than soul, controls human action.

Love is not the only area where nature manifests its blind hold over human desire and action.  Nature determines the entire spectrum of objects to which the human person is attracted in its moral deliberations.  “In forming us, nature gave us a certain range of sentiments, which must expend themselves upon some particular object (LMDS no. 2).”  The predictable objects of desire for the various human instincts indicate how determinative nature has been in the constitution of the human person.

This naturalistic metaphysics is often presented in mechanistic terms.  The phenomenon of love is only the most prominent of the activities by which the human body reveals its fundamental nature as a machine whose internal instincts are manipulated by appropriate external stimuli.  “We women enter the world with this necessity of loving undefined.  If we take one man in preference to another, let us honestly admit that we yield less to some knowledge of merit than to some mechanical instinct, which is nearly always blind (LMDS no.14).”  The mechanistic nature of human deliberation and interaction is only masked by philosophical efforts to attribute such functions to a chimerical soul.

d. Gender Critique

In her analysis of love, Lenclos criticizes the roles assigned each gender in contemporary France.  She insists that the psychological qualities of men do not differ from those of women.  Furthermore, she insists that for love to yield its maximum pleasure, men must learn to treat women with a respect they have heretofore denied them.

In the dominant social prejudices of her era, men and women were perceived as fundamentally different in psyche as in body.  In romantic intercourse, women were to be pleasant and amusing; men were to be intellectual and commanding.  Lenclos argues, however, that the arts of politeness are as necessary for men as they are for women.

It is not because you [Marquis de Sévigné and other men] possess superior qualities that you are an agreeable companion….To be embraced with outstretched arms, you must be sympathetic, amusing, important to the pleasure of others.  I warn you that you cannot succeed in any other manner, especially with women.  Now tell me, what would you like me to do with your learning, with the geometry of your mind, and with the exactitude of your memory?  Dear Marquis, if you only have such advantages, if you have no personal charm to balance your austerity, you will not please women.  I can vouch for that.  Far from pleasing them, you will seem to them like an intimidating critic.  You will so constrain them that any pleasure they might have enjoyed in your presence will be banished (LMDS no. 7).

For romantic love to flourish, reciprocity must reign between the two partners.  Men as well as women must cultivate the gentle arts of charm, flirtation, and fascination.  The sexual stereotype of the emotional woman and the rational man must be abolished in favor of a more egalitarian vision of both genders called to develop both their intellectual and affective personalities.

In her critique of masculine misperceptions of women, Lenclos argues that men must learn to treat women as they themselves would like to be treated.  She criticizes the Marquis de Sévigné for his misinterpretation of the reactions of a countess he is seeking to conquer romantically.  Men must learn to empathize with the complex emotions women experience in the adventure of romantic love; like men, women must balance physical desire with public reputation.

A man without prejudice would see in the countess only a lover as reasonable as she is tender.  Without stressing virtue, she remains attached to her reputation. In short, she is a woman who seeks to reconcile love and duty.  The difficulty in allying these two contrary qualities is not slight.  It is the source of the complexities which wound you.  You yourself should imagine the battles she must sustain, the upheavals she endures, and the embarrassment she must be suffering on account of a lover who is too easily put off by signs of resistance (LMDS no. 43).

The masculine attempt to overwhelm women by a display of force and by a refusal to negotiate patiently destroys the possibility of authentic romance.  In respecting the psychological complexity of women, a complexity more similar to that of men than men would usually imagine, men can both maximize their chance of romantic success and place the desired woman in a position of equality.

4. Reception and Interpretation

Lenclos’s long life as a bold courtesan, cheerfully mating prominent aristocrats and clerics of Parisian high society, has long overshadowed her contribution to philosophy.  Most biographies detail the remarkable beauty of this courtesan into her eightieth year, her colorful affairs, and the scandals of her libertine salon.  A series of artists have fictionalized her as the courtesan par excellence.  In the nineteenth century, Henrion Ragueneau de la Chainaye wrote the drama Ninon de l’Enclos: comédie historique en un acte and Eugène de Mirecourt published the novel Mémoires de Ninon de Lenclos.  In the twentieth century, director Abel Gance featured her in his film Cyrano et D’Artagnan and the cartoonist Patrick Cothias turned her into a comic spy in his graphic novels, especially Ninon secrète.  An exception to this emphasis on scandal is the philosopher Victor Cousin (1792-1867), who treated Ninon de Lenclos as a philosopher who continued the hedonistic brand of philosophy she had learned from Saint-Évremond.

The recent revival of feminism has altered the racy popular portrait of Lenclos the naughty courtesan.  Recent biographies have focused more extensively on the political sophistication, philosophical culture, and Epicurean theories of Lenclos.  Duchêne (1984) underscores the complex philosophy of love by which Lenclos justified her numerous romantic liaisons.  Debriffe (2002) presents Lenclos as a proto-feminist rebel against the submissive roles assigned to women in the genteel French society of the period.

Recent scholarship has focused on the actual writings and philosophical theories of Lenclos.  Arenberg (2005), for example, studies the neo-Epicurean theories of aging and death present in Lenclos’s correspondence with Saint-Évremond.  Lenclos’s theory of gender reciprocity and the role of her salon in the diffusion of Cartesianism and Epicureanism invite further investigation.

5. References and Further Reading

a. Primary Sources

  • Lenclos, Ninon de. The Coquette Avenged. 1659.
  • Lenclos, Ninon de. Lettres de Ninon de Lenclos au marquis de Sévingé, avec sa Vie, 2 vols. Paris: Lendentu, 1820.
    • An electronic version of this edition of Lenclos’s letters detailing her Epicurean philosophy of love can be found on the Gallica: Bibliothèque numérique subsection of the website of the Bibliothèque nationale de France.
  • Robinson, Charles Henry. Life, Letters, and Epicurean Philosophy of Ninon de l’Enclos. Chicago: Lion Publishing Co., 1903.
    • Despite the antiquated biography, this edition of Lenclos’s writings provides a reliable translation of her correspondence with the Marquis de Sévigné and Saint-Évremond.  An electronic version of this text is available on the website of Project Gutenberg.

b. Secondary Sources

  • Arenberg, Nancy. “Getting Old: Reflections on Aging in the Letters of Saint-Évremond and Ninon de Lenclos,” Papers on French Seventeenth-Century Literature, 2005, 32 (62), 243-56.
    • The article analyzes the Epicurean treatment of aging, illness, and death in the correspondence between Lenclos and Saint-Évremond.
  • Debriffe, Martial. Ninon de Lenclos; La belle insoumise. Paris: France-Empire, 2002.
    • This feminist interpretation of Lenclos interprets her libertinism as an act of social disobedience against the repressive mores of the period.
  • Duchêne, Roger. Ninon de Lenclos: La Courtisane du Grand Siècle. Paris: Fayard, 1984.
    • While emphasizing Lenclos’s status as courtesan, this biography details the political complexity and the intellectual sophistication of Lenclos.
  • Niderst, Alain. “Lenjouée Plotine, Madame de Maintenon, Madeleine de Scudéry et Ninon de Lenclos,” Papers on French Seventeenth-Century Literature, 2000, 27 (53), 501-08.
    • The article studies the political influence wielded by Lenclos and two contemporary salonnières.
  • Verdier, Gabrielle. “Libertine, Philanthropist, Revolutionary: Ninon’s Metamorphoses in the Age of Enlightenement,” in Libertinage and the Art of Writing, II, ed. David Rubin. New York: AMS, 1992, 101-147.
    • The article interprets Ninon’s theoretical and sexual libertinage as a forerunner of the skepticism of the lumières.

 

Author Information

John J. Conley
Email: jconley1@loyola.edu
Loyola University Maryland
U. S. A.

Sarah Grimké (1792—1873) and Angelina Grimké Weld (1805—1879)

Sarah GrimkéSarah Grimké and Angelina Grimké Weld, sisters from a South Carolina slave-holding family, were active abolitionist public speakers and pioneer women’s rights advocates in a time when American women rarely occupied the public stage. Their personal stories about the horrors of slavery made them effective agents in the Northern abolitionist movement, and their subsequent marginalization in the leadership of that movement spurred them toward an articulation of women’s rights and duties in the public arena.

By necessity and conviction, both sisters connected appeals for abolition of slavery with defenses of a woman’s right to political action, understanding that they could not be effective against slavery while they did not have a public voice. They are most often referenced together in historical and philosophical texts because they lived and worked together most of their lives, jointly developing their arguments and reading each other’s works. Angelina is best known for her original work in opposition to slavery and her brilliant oratory style, while Sarah Grimké developed a radical theory of women’s rights that pre-dated and influenced the beginning of the women’s right movement in Seneca Falls. Both women connected the oppression of African Americans with the oppression of women.

Table of Contents

  1. Biography
    1. Cultural and Formative Influences
    2. Public Activism: 1834-1837
    3. Continuing Influence and Later Projects
  2. Philosophical Writing
    1. Abolitionist Reasoning
    2. Women’s Rights
    3. Connecting Race Oppression to the Oppression of Women
  3. References and Further Reading

1. Biography

a. Cultural and Formative Influences

Sarah Grimké was the sixth child born to John and Mary Grimke, plantation owners and slave holders in Charleston, South Carolina. Her father was a well-known attorney who became the chief judge of the Supreme Court of South Carolina. Sarah loved learning and studied with her older brother, hoping to go on to college and practice law like her brother, but her father forbade her from continuing her studies.  At age 13, she was allowed to become the godmother of her baby sister Angelina, and was very much a part of her sister’s upbringing. In her mid-twenties, Sarah traveled with her seriously ill father to Philadelphia to be treated by a doctor, where they took up residence at a Quaker boarding house.  When the treatment failed, she alone took care of him at the New Jersey seashore for the months he was dying.

Sarah and Angelina Grimké lived in an era of religious revivalism and utopian experimentalism, both of which had an impact in various times on their lives. While returning to Charleston after her father’s death, Sarah experienced a religious conversion after reading Quaker literature and began having religious mystical experiences. In 1821, at the age of 29, disillusioned with life in Charleston, Sarah moved to Philadelphia and joined the Fourth and Arch Street Meeting of the Society of Friends. Women were welcomed into the ministry in Quaker congregations; she saw the example of Lucretia Mott in the ministry in her own meeting.  Sarah felt called to a role in the ministry, but her testimony in the meetings was never supported by the Quaker elders. Her conviction of a religious calling may be the reason she turned down a marriage proposal, and is evident in her early tendency to defend women’s rights using Christian theology iconography (Lerner 1998b, 4).

When Angelina Grimké came to realize the horrors of slavery, she first spoke out against it in the Presbyterian Church in Charleston where she had been an active member and teacher. She became frustrated with the Presbyterian minister who spoke privately with her against slavery but would not publicly denounce it. In 1827, Angelina, like her sister, had a religious conversion experience when Quaker minister Anna Braithwaite came to Charleston and stayed with the Grimké family. Following that event, she began worshipping in the tiny Charleston Quaker meeting. Instead of leaving Charleston, Angelina stayed on with a perceived mission to convert her family, if not to Quakerism, at least to abandon slavery. Six months of articulate and forceful discussion, intense prayer sessions, family upheavals and visits from friends and clergy did not change Angelina’s mind, nor did her arguments produce any resulting change among her family or friends. She was called to testify to Presbyterian Church leaders about her beliefs which resulted in expulsion from the church. In 1829 Angelina left South Carolina to join Sarah in Philadelphia, and began worshipping with the Fourth and Arch Street Meeting.

As women, Sarah and Angelina were sheltered and limited in both thought and action in their South Carolina culture, but joining the Society of Friends also limited their interaction with their contemporary world. In Philadelphia the only newspaper they read was The Friend, a Quaker weekly that passed along Quaker views on issues such as women’s rights. They could have benefited from larger feminist dialogue, such as in a sympathetic reading of the work of the Englishwoman, Frances Wright, who publicly campaigned for women’s rights in the U.S. in 1828-29. It seems more probable that the sisters would have known of African American activist Maria W. Stewart, the first American woman to speak publicly against slavery in 1831-33, whose work was published by William Lloyd Garrison.

In Philadelphia, Angelina taught classes at the school that her widowed sister Anna Frost had started in order to supplement her small income. Unhappy with her teaching experience there, Angelina considered attending and possibly teaching at the prestigious Hartford Female Seminary, a school founded and run by Catherine Beecher (whose views on women’s roles Angelina would later publicly criticize). Although she was impressed and excited by the opportunities she saw at Hartford, the Quaker elders would not give her permission to move there. Instead she settled into Philadelphia Quaker life, teaching at the infant school.

b. Public Activism: 1834-1837

In 1934, after the sudden death of the man Angelina expected to marry and the death of their respected brother, Thomas, who had been Sarah’s childhood companion, the two sisters found themselves growing more uncertain of the Quaker restrictions, and began looking for new ways to be “useful.” Angelina devoured news of the abolitionists’ struggles, including the persecution they suffered in nearly every northern city where they spoke. After reading about the struggles of the abolitionists, she wrote a moving letter to Garrison, which was published without her permission in his abolitionist journal, The Liberator. This letter catapulted Angelina into the public realm, and was followed in (1836) by her Appeal to the Christian Women of the Southern States. The Appeal was written in a personal tone, addressing Southern women as friends and colleagues. When copies of it reached the Grimké sisters’ home town of Charleston, they were publicly burned by the postmaster, and Mrs. Grimké was warned that her daughters would be prevented from ever visiting Charleston again.

In 1836, Angelina and Sarah moved to New York (against the advice and without the permission of the Philadelphia Quakers) to begin work as agents for the abolitionist cause. They were the only women to attend the 19-day intensive training during the Agents’ Convention of the American Anti-Slavery Convention in November. Within weeks of the training, they began offering public talks for female anti-slavery meetings in New York. In their talks they advocated practical ways that Northerners could influence slavery regulations, but also urged their audiences to locate and root out race prejudice in their own lives and communities. According to their analysis, race prejudice in the North and the South was a major support of the slave system. They understood this from their own experience with slaves and free blacks in the North, as well as through discussions with one of their mentors, the leading abolitionist Theodore Dwight Weld.

Following the Agents’ Convention, both Grimké sisters began their active political campaign for the abolitionist cause. They helped organize the New York Anti-Slavery Convention of American Women which strengthened their bonds with other women activists in the anti-slavery cause and in 1837 began touring Northern cities, giving talks to packed audiences. Their work was very successful and led to the creation of more female anti-slavery associations and thousands of signatures on anti-slavery petitions. However, in every city they visited, the fact that they were women speaking before a mixed (male and female) audience created an uproar, even among abolitionist sympathizers. Many religious leaders hotly rejected the idea that women should speak from pulpits and public stages. In mid-1837, as Angelina and Sarah were on their speaking tour, the Congregational Churches issued a “Pastoral Letter” warning their congregations of “the dangers which at present seem to threaten the female character with widespread and permanent injury” and called women to remember their “appropriate duties and influence … as clearly stated in the New Testament” (Lerner 1998a, 143).

Religious leaders were not the only ones who reacted against the sisters’ work. Catherine Beecher, whom Angelina had once hoped to study with, published a critique of their approach to abolition, specifically addressed to Angelina Grimké. In her essay Beecher advocated gradualism instead of immediate emancipation, and also called women to remember their subordinate role in society. Angelina responded in the summer of 1837, publishing Letters to Catherine Beecher, defending immediate emancipation of slaves, as well as the right and responsibility of women to participate as citizens in their society. During this same period, Sarah also began writing Letters on the Equality of the Sexes.

Their speaking tour ended in late 1837 with Angelina very ill and both sisters exhausted from their grueling traveling and lecturing schedule.

c. Continuing Influence and Later Projects

The debate over women’s participation began to split the anti-slavery movement. Even the New York Executive Committee, who trained the sisters for public speaking, was unwilling to give them official “agent” status, fearing that public energy would be diverted by the women’s right issue. In letters to the Grimké sisters, abolitionist leaders Theodore Weld and Quaker John Whittier asked them to concentrate only on the anti-slavery issue. Referencing their work on women’s rights, Whittier asked, “Is it not forgetting the great and dreadful wrongs of the slave in a selfish crusade against some paltry grievances of our own?” Angelina and Sarah responded that women needed to claim free speech and other rights in order to do the work they were called to do.  “What then can woman do for the slave when she is herself under the feet of man and shamed into silence?” (Lerner 1998a, 151-2)

Instead of withdrawing from the public stage, Angelina and Sarah went on to achieve even more notoriety when, in 1838, Angelina testified at a Committee of the Legislature of the State of Massachusetts, becoming the first American woman to testify in a legislative meeting. Later in 1838, at the age of 33, Angelina married abolitionist Theodore Dwight Weld, and they moved with Sarah to Fort Lee, New Jersey. Although it was both Angelina’s and Sarah’s intention to continue their public activism, the pressures of running a household and raising three children, as well as the results of increasing poverty required them to withdraw from the speaking in the public arena. They continued to write and work to support abolitionist causes. One of their first projects after the marriage was to comb through back issues of Southern newspapers to gather empirical data about slavery for Weld’s book, American Slavery As It Is (1839).  That text also contains essays written by the Grimké sisters which provide clear and horrifying details of the conditions of slavery from their own experiences.

Although the Grimké sisters were no longer public lecturers, the fervor they raised about the “woman question” continued to cause dissention in the abolitionist movement. That issue, joined with other divisions about utopian/anarchist withdrawal versus political action, caused the movement to split into two separate organizations in 1840. Unfortunately the organization which embraced political action – in some ways the natural choice for Weld and for the Grimké sisters – also excluded the participation of women. Weld and both of the sisters withdrew from active participation for a short time; when Theodore resumed his activist work in 1841, Angelina and Sarah were overwhelmed with caring for the young children and maintaining the farm.

In 1848, Weld and the two sisters established a co-educational boarding school out of their home, with Angelina teaching history and Sarah teaching French. Although friends and family sent their children there (two sons of Elizabeth Cady Stanton attended) the school was a struggle, and made very little money. In 1854 the Welds and Sarah joined the utopian community Raritan Bay Union, where Theodore Weld started the progressive and experiential Eagleswood School.  Angelina and Sarah both taught and assisted with administration. The school continued when the Union failed two years later, with Angelina and Sarah as teachers. In 1862 the family moved to Boston to continue their teaching careers.

Angelina and Sarah remained committed to women’s issues, but their ability to be physically involved in activism was limited. Their example as women who spoke publicly against slavery and for women’s rights continued to inspire other female activists. Recognizing Angelina’s impact on women’s rights, the women’s rights leaders elected her a member of the Central Committee of the 1850 woman’s rights convention even though she was unable to attend. Both women continued to participate in future women’s rights convention and in the women’s movement, however mostly through letters and other writings.  As late as 1870 they were both vice presidents of the Massachusetts Woman Suffrage Association, and symbolically cast ballots in a local election.

In 1868, Angelina and Sarah discovered that their brother Henry had fathered children by his female slave, Nancy Weston.  Those children, Archibald, John and Francis James Grimké had been sold into slavery by their half-brother. When Angelina and Sarah found out about these three young men, they established close relationships, and supported Archibald and Francis through college and graduate school. Archie studied law at Harvard, and Francis went to Princeton Theological Seminary.  Both men went on to national leadership among the Black communities. As pastor of the 15th Street Presbyterian Church in Washington D.C., Francis Grimké and his wife Charlotte Forten Grimké were friends and colleagues of Anna Julia Cooper. Archibald was a vice-president of the NAACP and president of the American Negro Academy.

Sarah Grimké died in 1873, Angelina Grimké Weld died in 1879. Catherine Birney, a former student at their boarding school, published a full biography of the sisters in 1885.

2. Philosophical Writing

a. Abolitionist Reasoning

Although their early education was limited, while growing up in a family of lawyers and judges the Grimké sisters became familiar with constitutional law based on the liberal political theory of the founders of the constitution. According to historian Gerda Lerner (1998b, 22) Sarah had read Locke, Jefferson, and other Enlightenment thinkers, and her writing consistently integrates these Enlightenment ideals with Biblical analysis. In Sarah’s first major writing, An Epistle to the Clergy of the Southern States (1836), she combines a scholarly interpretation of the Old Testament with language from Bill of Rights. It is written in the style of the apostolic letters in the New Testament, and as such seems odd to modern ears.  In this essay, Sarah’s first claim is that slavery is in opposition to Biblical teaching. She uses Genesis to support her claim that although humans can use animals as “means” – as food to sustain their existence, all people are created in God’s image and so cannot be used as mere means for others’ ends. She moves to a more Kantian understanding that making such a God-created “man” into a “thing” violates “God’s unchanging degree.” Given a philosophic understanding that immortality is dependent on rationality, she points out that every slave is a “rational and immortal being” and so has certain “inalienable rights” (Ceplair 92), reflective of her reading of Locke’s Natural Rights argument.  In her Epistle, Sarah called “every Christian minister” to “preach the word of immediate emancipation” (Ceplair 109).

In 1836, Angelina published her first work, the Appeal to the Christian Women of the South, written in the form of a letter to close friends. Her claim for the equality of slaves is also based on natural rights as well as God-given rights.  In the Lockean language of the Declaration of Independence she points out that “all men, every where and of every color are born equal and have an inalienable right to liberty” (Ceplair 38). She continues by dismantling claims that slavery is sanctioned under Hebrew law, pointing out that none of the ways that Hebrew men or women became servants in Old Testament law are applicable to the conditions of African American slavery.  Responding to slave-owners’ claims that Christ did not condemn slavery, she demonstrates how treating other humans as “chattel property” contradicts the commandment that “Whatsoever ye would that men should do to you, do ye even so to them” (Ceplair 50). While acknowledging that women do not have the right to vote, she evokes the image of Esther’s political action in the Bible, appealing directly to Southern women to gather signatures on petitions to their legislatures.  “Such appeals to your legislatures would be irresistible, for there is something in the heart of man which will bend under moral suasion” (Ceplair 66).

After reading Angelina’s Appeal to the Christian Women of the South, influential educator Catherine Beecher (1800-1878) responded by publishing an 1837 article addressed to Angelina Grimké, titled An Essay on Slavery and Abolitionism with reference to the Duty of American Females, critiquing the idea of immediate emancipation. Beecher was one of a number of Americans who agreed that slavery was wrong but argued for “gradualism.” Gradualists argued that slavery could be eradicated slowly though such measures as stopping the slave trade, freeing the children of slaves, banning slavery in new territories, or even re-colonizing slaves back to Africa. Beecher supported the re-colonization movement which raised funds to send black ex-slaves back to Africa. Responding in the Letters to Catherine Beecher, Angelina defended immediate abolition, showed the fallacies of gradualism, attacked racial prejudice, and defended women’s right and responsibility to do public activism. She reiterated the abolitionist principle “that no circumstances can ever justify a man holding his fellow man as property,” concluding that the slave owner is “bound immediately to cease holding (the slave)” (Ceplair 149-150). She also pointed out that although Northerners had outlawed slavery, they were guilty of racial prejudice, which was at the heart of gradualism and the re-colonization movement. And so, she said, “I am trying to talk down, and write down and live down this horrible prejudice … we must dig up this weed by the roots out of each of our hearts…” (Lerner 1998a, 141).  Later in 1837, responding to a public request to present “definite practicable means” by which Northerners could have affect on slavery, the Grimké sisters once again point to racial prejudice in the North as also responsible for making slavery possible. As she reiterated later, “Northern prejudice against color is grinding the colored man to the dust in our free states, and this is strengthening the hands of the oppressor continually” (“Letter to Clarkston” Ceplair 121). The Grimkés understood that slavery and race inequity “degrades the oppressor as well as the oppressed” (Weld, et al 1934, 790).

b. Women’s Rights

In order to claim the right to speak in public, Angelina and Sarah had to argue against the conventional philosophy that men and women were naturally meant to occupy “separate spheres” of existence – that woman’s role was in the private sphere, while men controlled the public sphere. As has often been the case in history, some of the most fervent opposition to women’s rights came from other women. Catherine Beecher was well-known as a pioneering advocate for women’s education, who established many schools for women including the Hartford Female Seminary. As such, she was a public person by virtue of the organizations that she created, and the leadership roles she played in society, yet she did not believe that women should have political roles. Although Beecher argued that child-raising and women’s work in the household required that women be educated, she did not support women’s suffrage or the idea of women petitioning Congress.  As she said, “Men are the proper persons to make appeals to the rulers whom they appoint… [women] are surely out of place in attempting to do it themselves” (Lerner 1998a, 140).

In Letters XI and XII of her response to Beecher, Angelina logically dismantles the separate spheres mentality as prescribed by Rousseau and claimed instead that as moral beings the spheres of woman and man are the same. She explicitly connects abolitionist work to women’s rights, noting that her “investigation into the rights of the slave has led me to a better understanding of my own.”  The natural corollary of her abolitionist argument that as slaves, “human beings have rights because they are moral beings” is that women too have human rights that are not dependent on their sex. Hence, “whatever is morally right for a man to do, it is morally right for a woman to do” (Ceplair 194-5).  Angelina ends these two letters with the recommendation that the reader refer to Sarah Grimké’s writing on the subject of women’s rights.

Immersed as they were in Christian culture and traditions, the Grimké sisters faced Bible-based opposition to the idea of women’s equality.  Given their religious training, they replied to their critics with careful Biblical reasoning in defense of women’s rights, against what they said was the common “perverted interpretation” of the Bible. In the first chapter of Sarah’s Letters on the Equality of the Sexes and the Condition of Women (Ceplair 104) she begins with the two creation stories in Genesis to claim women’s equality. First, she points out that women as well as men were created in God’s image, in “perfect equality.” In the second creation story, Eve is formed out a rib of Adam, to be a helpmeet, “in all respects his equal,” as she reasons that a companion created by God would necessarily be his equal. Her religious critics would then point to the story of the fall, and Eve’s role in that. Sarah points out that Eve succumbed to supernatural evil, while Adam succumbed to merely mortal temptation, and thus men could not claim moral superiority over women.  She claims that the phrase “Thou wilt be subject unto thy husband, and he will rule over thee” is a mistranslation.  The Hebrew, she claimed, “uses the same word to express shall and will.” Thus, the phrase should in fact be translated as a prophecy, not a command, in the same way that the “immediate struggle for dominion” among humans is a prophecy, not a command.  She concludes that there is no reason to conclude from the Genesis story that original sin created a necessary condition of inequality between men and women. Then, as God has not caused a condition of inequality between men and women, she asks in the next letter that her brethren “take their feet from off our necks allowing us to stand upright” (Ceplair 208).

Angelina and Sarah continued throughout their work to claim their rights as citizens of the republic, whose “honor, happiness and well-being are bound up in its politics, government and law” (Lerner 1998a, 8). This claim was echoed 10 years later in 1848 in the Declaration of Sentiments presented at Seneca Falls Convention for women’s rights.  Throughout Sarah’s and Angelina’s writing, their arguments for women’s rights is based on the moral authority of the reasoning person – similar to the arguments that they both made for natural rights for African Americans. In this they may also be reflecting some of the arguments that they had read in Mary Wollstonecraft’s 1792 Vindication of the Rights of Women.

c. Connecting Race Oppression to the Oppression of Women

The Grimké sisters were among the first to explicitly connect race oppression to women’s oppression. Sarah “thanked” John Quincy Adams in her Letters on Equality for placing women “side by side with the slave” “ranking us with the oppressed.” Using a Kantian ethical argument that opposes using humans as means rather than as ends in themselves, she noted that historically “woman has … been made a means to promote the welfare of man” (Ceplair 209).  She tied the subordination of slaves and women to educational deprivation, noting that both women and slaves been deemed mentally inferior “while being denied the privileges of liberal education” (Lerner 1998a, 122-3).  In 1863 after Lincoln’s Emancipation Proclamation, Angelina said that as women, “True, we have not felt the slaveholder’s lash; true we have not had our hands manacled, but our hearts have been crushed … I want to be identified with the negro; until he gets his rights, we shall never have ours” (Lerner 1998a, 263).  Throughout their lives the sisters also stressed their bonds of sisterhood with African American women, in both their writing and in their close friendships with African American women.

Sarah’s claim that sexual oppression was a major cause of the subordination of women was far ahead of her contemporaries.  Writing in the late 1850s Sarah used the language of bondage to speak of women’s roles. “She is your slave, the victim of your passions, the sharer willingly and unwillingly of your licentiousness” (Lerner 1998b, 81). She examined marital rape in her essay “Marriage” in an era when it would have been shocking for a woman to discuss such things. She identifies women as in a position of slavery for being unable to refuse sex to her husband. Instead of having loving sexual relations, women would often “rise in the morning oppressed with a sense of degradation from the fact that their chastity has been violated…” discovering that she is a “legal prostitute … a mere convenience” (Lerner 1998b, 113-114). She called for the right of education for women, full human rights, financial independence and a woman’s right to decide when and if she would become a mother.  In other essays she identified men, individually and as a group, as having benefited from women’s oppression in the same way that plantation owners benefited from race oppression.

When John Stuart Mill’s The Subjection of Women came out in 1869, Sarah at age 77 walked throughout her neighborhood selling 150 copies of the book to her neighbors (Lerner 40). It’s no wonder that Mill’s book appealed to Sarah – it contains arguments for equal rights for women that are very similar to those developed by the Grimké sisters.

3. References and Further Reading

  • Bartlett, Elizabeth Ann (ed.) 1988. Sarah Grimké, Letters on the Equality of the Sexes and other Essays. New Haven: Yale University Press.
    • In this collection of the writings of Sarah Grimké Bartlett argues that Sarah’s Letters is the first philosophical work on the condition of women in America.
  • Birney, Catherine. 1885. The Grimké Sisters: Sarah and Angelina Grimké: the First Women Advocates of Abolition and Women’s Rights. Boston: Lee and Sheppard.
    • The earliest biography of the Grimké sisters written by the daughter of a family friend. The full text is available in e-format from several online publishers.
  • Ceplair, Larry, (ed). 1989. The Public Years of Sarah and Angelina Grimké: Selected Writings 1836-1839. New York: Columbia University Press.
    • This valuable text contains the major works of the Grimké sisters (some of them excerpted) with short introductions by the editor.
  • Grimké, Angelina. 2003. Walking by Faith: The Diary of Angelina Grimke, 1828-1835. Ed. Charles Wilbanks. University of South Carolina Press.
  • Lerner, Gerda. 1998a. The Grimké Sisters from South Carolina: Pioneers for Women’s Rights and Abolition. New York: Oxford University Press.
    • Reprint of Lerner’s 1967 biography with a new introduction; this is the most complete intellectual biography of the Grimké sisters.
  • Lerner, Gerda. 1998b. The Feminist Thought of Sarah Grimké. New York: Oxford University Press. This companion book to Lerner’s full biography contains a short biographical introduction to Sarah Grimké’s life, as well as reprints of Sarah’s letters and essays.
  • Lumpkin, Katharine Du Pre. 1974. The Emancipation of Angelina Grimké. Chapel Hill: University of North Carolina Press.
    • Biography of Angelina Grimké that explores the forces behind Angelina’s break with the Southern Christian culture of her youth.
  • Weld, Theodore Dwight, ed. 1839. American Slavery As It Is: Testimony of a Thousand Witnesses. New York: American Anti-Slavery Society.
  • Weld, Theodore Dwight, ed. 1934. Angelina Grimké Weld, and Sarah Grimké. Letters of Theodore Dwight Weld, Angelina Grimké Weld and Sarah Grimké: 1822-1844. 2 vols. Gilbert H. Barnes and Dwight L. Dumond, eds.  New York: D. Appleton-Century Company, Inc.

 

Author Information

Judy Whipps
Email: whippsj@gvsu.edu
Grand Valley State University
U. S. A.

Empathy and Sympathy in Ethics

The distinction between “empathy” and “sympathy” in the context of ethics is a dynamic and challenging one. The eighteenth century texts of David Hume and Adam Smith used the word “sympathy,” but not “empathy,” although the conceptual distinction marked by empathy was doing essential work in their writings. After discussing the early uses of these terms, this article is organized historically. Two traditions are distinguished. The first is the Anglo-American tradition, and it extends from Hume and Smith to the twenty-first century work of Michael Slote. Stephen Darwall’s contribution is applied in engaging Hume and Smith. Finally, the interrelation of empathy, sympathy and altruism is explored in the work of John Rawls and Thomas Nagel.  The second tradition is the Continental one. It extends from the spirituality of Johann Herder to the phenomenological movement of Edmund Husserl, Martin Heidegger, Max Scheler, and Edith Stein. The intentional analysis of empathy is directly relevant to the constitution of the social community in a broad, normative relationship with the “Other.” Empathy (Einfühlung) is sui generis an intentional (mental) act that starts out in the superstructure of intersubjectivity in Husserl and steadily migrates towards the foundation of community under the influence of Heidegger, Scheler, and Stein. The choice of which philosophers and thinkers to include is also determined by the contingent facts that those chosen are most likely to be encountered in contemporary debates about empathy, sympathy, and ethics. Stein, Husserl, and Heidegger are primarily epistemological, ontological, and post-onto-theological, and are in the background of any contemporary, formal engagement with ethical theories, which is the focus of the present article. Scheler turns his phenomenological intuition of essence (wesenschau) towards the moral sentiments; and his analysis of the diversity of sympathetic forms is a lasting contribution to the topic. Contemporary Continental thinkers such as Larry Hatab and Frederick Olafson associate empathy with Heideggerian Mitsein and Mitdasein (being in the world with others) as the existential foundation of ethics). The roles of Friedrich Nietzsche, the Holocaust, and the “Other,” especially in Emmanuel Levinas, are distinguishing marks of the ethical approach on the Continent. The article ends with a discussion of how the discipline of psychoanalysis contributes to the role of empathy.

Table of Contents

  1. Introduction
  2. An Example and a Working Definition
  3. The Anglo-American Tradition
    1. Hume’s Many Meanings of “Sympathy”
    2. Adam Smith’s Philosophy of Sympathy
    3. Contractualism and Sympathy in Rawls
    4. Nagel’s Incomplete Version of Empathy
    5. Empathy as a Moral Criterion in Slote’s Ethics of Caring
  4. The Continental Tradition
    1. Nietzsche’s Empathy of Smell Complements His Suspicion
    2. The Challenge to Empathy of the Event of the Holocaust
    3. Ethics Against Empathy in Levinas
  5. Empathy in the Context of Psychoanalysis and Ethics
  6. A Common Root of Empathy and Ethics
  7. References and Further Reading

1. Introduction

The words “sympathy” and “empathy” can be distinguished in several ways. Some of these distinctions are controversial, and work is needed to make them more precise. For example, “sympathy” is frequently used to mean one person’s response to the negative affects (suffering) of another individual, leading to pro-social (helping) behavior towards the other. In contrast, “empathy” generally includes responding to positive affects as well as negative ones without, however, necessarily requiring doing anything about it (no pro-social behavior required). “Sympathy” is understood to include agreement or approbation whereas “empathy” is often, though by no means always, a relatively neutral form of data gathering about the experiences and affects of others. “Sympathy” means a specific affective response such as compassion or pity whereas “empathy” once again encompasses affects in general including negative ones such as anger, fear, or resentment.

The words “empathy” and “sympathy” both point to the ancient Greek root “pathos” in the etymological context of modern English (Partridge 1966/1977).  “Pathos” in turn means to suffer in the sense of to endure, to undergo, or to be at the effect of. A single mention in Aristotle in the original Greek of empathes occurs in Aristotle’s On Dreams in which the coward experiences intense fear upon imagining that he sees his enemy approaching. In the original Greek, the references to empathes are few and marginal, generally meaning “in a state of intense emotion,” “passionate emotion,” or “much affected by,” a distinctly different meaning than it has today. The short list of other occurrences in antiquity is filled out by a single reference each in Plutarch’s Lives, in Flavius Jospheus Antiquitates Judaica, and Polybius Histories (entry on empathes in Liddell and Scott 1940).

In contrast, the number of references to “sympathy” is hundreds of entries long and is diverse, extending from Aeschylus, Aristophanes, Aristotle, Demosthenes, and frequently breaking though to the English in Shakespeare. The meanings include the constellation of ones that we would recognize including “agreement,” “pity,” “compassion,” “transmission of affect,” and “suggestibility.”

In the English language “empathy” simply did not exist prior to Cornell University psychologist Edward Bradford Titchner’s neologism in translating the German word “Einfühlung” as “empathy” in his lectures based on his work in the laboratory of Wilhelm Wundt (E. B. Titchener (1909)). Arguably the German is best captured by a phrase such as “feeling one’s way into,” but the advantages of a single word also have merit. Thus, it is technically an error, but one with an underlying kernel of truth, when one of the foremost researchers on empathy uses “empathy” as a substitute for “sympathy” as in the following from Hoffman: “And the British version of utilitarianism represented by David Hume, Adam Smith, and others for whom empathy was a necessary social bond, finds expression in current research on empathy, compassion, and the morality of caring” (2000: 2, 123). As noted, the word “empathy” did not exist in the English language when Hume (1739) and Smith (1759) write about engaging the foundations of morality in “sympathy,” the latter being the only word they used. Yet Hoffman captures an aspect of the truth as the word “sympathy” itself as used by Hume and Smith included the communicability of affect and emotional contagion, which today we would also count as inputs to “empathy” without, however, reducing empathy to emotional contagion and low level transmission of affect without remainder.

Prior to the arrival of the word “empathy” into the English language, “sympathy” captured the distinction “communicability of affect,” onto which additional meanings were layered. Hume and Smith are the main witnesses to this development. With the arrival of the word “empathy,” the difference between a method of data gathering about the experiences (sensations, affects, emotions) of other individuals and the use of this experience for ethically relevant processing, decision making, and evaluations was able to moved into the foreground.

Meanwhile, the Continental tradition reenacts in its own terms some of the same challenges in the German language that occurred around “sympathy” in the British tradition. Starting with Herder (1772/1792; see also Forster 2010: 19), and reaching to the 20th century in the writings of the phenomenologists such as Husserl, Scheler, and Stein a group of terms around “fühlen” [to feel] was occurring. Thus: “mitfühlen,” to “feel with” or “sympathize” and “nachfühlen,” to “feel vicariously” or even “to feel after” as in an after-image of a feeling. All these semantic distinctions emerged alongside “einfühlen,” to “empathize” or “to feel one’s way into” (Scheler 1913/22; Forster 2010: 39). Wilhelm Dilthey dismissed Einfühlung in favor of nacherleben [reexperiencing, reliving], nachfühlen (and Verstehen [understanding]) (see Makkreel 1975: 6-7, 252, 290). However, the point where these two traditions intersect is precise. The German psychologist Theodor Lipps translated Hume’s Treatise of Human Nature into German (1739/1904 Hume/Lipps) even as Lipps was completing his own Aesthetik (1903). Lipps eventually published the translation of Hume in two volumes in 1904/1906. Without directly borrowing what Hume said about “sympathy,” Lipps made empathy (Einfühlung) into the foundation of his aesthetics and an account of other minds. While “sympathy” comes across into German as “sympathie,” the seed was planted for the close connection between sympathy and (aesthetic) taste that developed into an entire aesthetic (Lipps 1903) in which Einfühlung (empathy) plays the central role. An entire generation of thinkers, including Freud, Husserl, and Heidegger, was inhibited from using the precise term “empathy” [“Einfühlung”]. Further more, when they did use it in the context of overcoming otherness, they marginalized it. This was because they were reluctant to invoke echoes of Lipps’ psychology of beauty and art – as well as Lipps’s solipsistic reveries that the individual psyche is what animates and enlivens nature and other individual through projective empathy. Scheler got it accurate dismissing Lipps’ “projective empathy”.

One of the innovations in the use of “empathy” in the 1950s is by the psychoanalyst Heinz Kohut (1959, 1971, 1977, 1984; Goldberg 1999). Kohut’s use is based on his view of philosophy of science (see the Hartmann-Nagel debate (Hartmann, 1959; E. Nagel 1959)) rather than in any usage in Freud, who mostly neglected the word but not the underlying distinction (Trosman & Simmons 1972; Freud 1909 where Einfühlung is explicitly used). Kohut’s use of “empathy” is a method of data gathering oriented towards a listening-based immersion in the affective, experiential, and mental life of the other person. However, even in a relatively value neutral inquiry such as psychoanalysis, the use of empathy as a method of data gathering has turned out to be relevant to ethics. Issues arise around the coherence and integrity of character and the self as a bulwark against unethical behavior such as rampant cheating, drug abuse, gambling, moral malaise and other individual, social, and communal ills.

2. An Example and a Working Definition

In the parable of the Good Samaritan (Luke 10: 30-27), the Priest and the Levite cross the road and pass by the Jewish traveler who was robbed, beaten by thieves, and left for dead. The Samaritan (today that would be a local inhabitant, a Palestinian) stops to help the individual in need. Multiple, overlapping descriptions are available of the Samaritan as a would-be moral agent. For example: The Samaritan’s altruism was aroused. His sympathy was aroused. His empathy was aroused. In the case of those who crossed the road and passed by the victim without stopping, the experience of empathic distress was decisive (arguably). They experienced the other’s suffering and were overwhelmed by it. They handled the empathic experience of suffering by avoiding the situation. In the case of the Samaritan, the empathic distress was transformed into sympathetic distress (under one description (Hoffman 2000: 87-88)), which, in turn, motivated a pro-social, helping, altruistic intervention to aide the traveler. An entirely different description is available: ethics is fundamental in attributing the altruistic decision from the start to a fundamental recognition on the part of the Samaritan, answering the question, “Who is my neighbor?” The answer?  The neighbor is the individual in need. By the end of this article, we shall not necessarily know which description is the truth with a capital “T,” but we shall have determined the terms of the debate and defined the issues in detail.

A working definition of “empathy” will be useful. At the level of phenomenal awareness of everyday human experience in the world with other humans, the minimal essential constituents of empathy include: (i) a receptivity (“openness”) to the affects of others whether in face-to-face encounter or as artifacts of human imagination (“empathic receptivity”); (ii) an understanding of the other in which the other individual is interpreted as a possibility—a possibility of choosing, making commitments, and implementing them (“empathic understanding”) in which the aforementioned possibility is implemented; (iii) an interpretation of the other from first-, second-, and third-person perspectives (“empathic interpretation”); and (iv) an articulation in language of this receptivity, understanding and interpretation, including the form of speech known as listening that enables the other to appreciate that he or she has been the target of empathy (“empathic listening”). In terms of the example of the Good Samaritan, the Samaritan is empathically receptive to the suffering of the traveler. This openness informs his understanding of the possibility that the other is a fellow traveler like himself. The other is interpreted as a neighbor (in the second person). This neighborliness is expressed in words and deeds by his stopping and altruistically giving assistance.

This working definition includes the possibility of alternative, orthogonal definitions, for example, from the perspective of functional causality. In the latter, another’s affects are the cause of mine in the context of a self-other distinction in which a causal construct such as a “shared manifold” is deployed below the threshold of introspective awareness in our biology (“neurology”) to explain the functions of perspective taking and emotional control (Gallese 2007). It is also consistent with a neuron-computational representation that uses mirror neurons to implement the transfer of affectivity from one individual to another (Jackson, Meltzoff, and Decety 2005; Decety & Jackson). It is consistent with a hermeneutic definition that deploys a double representation of the self’s representation of the other’s intentional fulfillment and the further processing of these representations (Agosta 2010; Husserl 1929/35; Stein 1917; Zahavi 2005).  As we shall see, “sympathy” in Hume corresponds to at least two of these definitions of the communicability of affect and the ability to put oneself in the other’s place ((ideal) perspective taking). In addition, he has two definitions that do not overlap with empathy – a reactive response that is compassionate and caring towards another’s suffering and (as Hume uses it) the power of suggestion (to which we now turn).

3. The Anglo-American Tradition

a. Hume’s Many Meanings of “Sympathy”

David Hume has at least four distinct meanings of “sympathy.” These develop along with his thinking about the foundations of ethics. First, “sympathy” functions in the communicability of affect; next it encompasses what is often described as “emotional contagion,” the communicability of affect without the inclusion of the idea of the other individual as its source; additionally, it encompasses the power of suggestion; and, finally, it comes to include an element of benevolence. How this series of transformations unfolds is the topic of this story as the meaning of “sympathy” evolves from a communicability of affect to the responsive sentiment of compassion which is one of the essential ways that we regard sympathy today.

Always the astute phenomenologist, the philosopher, David Hume, witnesses the divergence of sympathy into components that will blend with the judgment of taste, taking on an irreversible dimension of evaluation, across both an ethical and aesthetic dimension. Other components identified by Hume develop into the form of human empathy known to us as the mere communicability of affect, subject to further cognitive processing. By the time Hume writes his 1741 essay “Of the delicacy of taste and passion,” he assimilates all the advantages for human interrelations of “sympathy” such as friendship, intimacy, interpersonal warmth to “delicacy of taste” (1741: 25-28; 1757: 3-24). Hume’s contribution to the transformations of sympathy has a significant subtlety and depth that deserves a substantial treatment of its own much longer than that engaged here.

By “sympathy” Hume does not initially mean the particular sentiments of pity or compassion or benevolence but rather the function of communicating affect in general. Relying on his simple psychology of ideas and impression, sympathy reverses the operation of the understanding, which converts impressions of sensation into ideas. In the case of sympathy, the operation is in the other direction – from idea to impression. Sympathy arouses ideas in the recipient that are transformed into impressions – though this time impressions of reflection – through the influence of the ideas. Thus, the operation of sympathy:

‘Tis indeed evident, that when we sympathize with the passions and sentiments of others, these movements appear at first in our mind as mere ideas, and are conceiv’d to belong to another person, as we conceive any other matter of fact. ‘Tis also evident, that the ideas of the affections of others are converted into the very impressions they represent, and that the passions arise in conformity to the images we form of them (T 2.1.11.8; SBN 319-20).

Sympathy reverses the operation of the understanding, which transforms impressions of sensation into ideas. Sympathy arouses impressions through the influence of ideas. The functional basis of this sympathetic conversion will turn out to be the imagination. In this view, sympathy is not to be mistaken with some particular affect such as pity or compassion, but is rigorously defined by Hume as “the conversion of an idea into an impression by the force of imagination” (T 2.3.6.8; SBN 427).  The other’s anger gets expressed and is apprehended sympathetically as an idea, which idea is communicated to me, and, in turn, through the sympathetic work of the imagination, arouses a corresponding impression of my own. This is an impression of reflection that is fainter and calmer than the initial idea (or impression) of anger. (An “impression of reflection” is an impression of an idea or (in some cases) of a vivid impression.) I thus experience what may be variously described as a trace affect, a counter-part feeling, or a vicarious experience—of anger.

In short, the one individual now knows what the other is experiencing because she experiences it too, not as the numerically identical impression, but as one that is qualitatively similar. This operation of sympathy, at least in this example, is also crucially distinct from emotional contagion, as in the mass behavior of crowds, since the passion and sentiments are “conceived to belong to another person.” This is crucial. This introduces the other and the distinction between one individual and the other. Significantly, the concept of the other accompanies the impression that is aroused in the one individual as a result of the other’s expression.

Hume distinguishes between sympathy and emotional contagion (T 2.1.11.2; SBN 316-7; T 3.3.2.2; SBN 592). Sympathy requires a double representation. What the other is feeling is represented in a vicarious feeling, which is what sympathy shares with emotional contagion. However, sympathy in the full sense also requires a representation of the other as the source of the first representation, “conceived to belong to the other person” (T 2.1.11.8; SBN 319-20). The distinction “other” is what is missing in the case of emotional contagion.

Hume establishes sympathy as the glue that affectively binds others to oneself and, by implication, binds a community of ethical individuals together. One of the undisputed masters of the English language in his own day (and ours), Hume asserts that “the minds of men are mirrors to one another, not only because they reflect each others emotions, but also because those rays of passions, sentiments and opinions may be often reverberated and decay away by insensible degrees” (T 2.2.5.21; SBN 365). Here one does experience an immediate resonance (“reverberation”) with the other, perceiving pleasure in the smile, pain in the grimace, or anger in the clenched teeth. In this case, a counterpart feeling – a vicarious feeling – is aroused in oneself and, in turn, becomes the experiential basis for further cognitive activity about what is going on with the other person.

However, Hume finds now that he is at risk of having undercut ethics by giving to sympathy such a central role in creating community. Experience shows that sympathy is diminished by distance of time and proximity and relatedness (“acquaintance”). We are much less affected by the pleasures and pains of those at a great distance than by those in our immediate physical vicinity or (say) close family relations. So an earthquake in China creates less sympathetic distress in me than an earthquake in Los Angeles (in my own country), even if I am perfectly safe in either case. According to Hume, my ethical approbation of (and obligations to) those at a great distance from me are no less strong than to those close at hand. The balance of impartiality needs to be restored by appealing to an unbiased ideal observer. In turn, this sets up a tension between the sympathetic observer of the moral agent and the ideal, unbiased one. “Unbiased” does not mean “unsympathetic”; yet it does not mean “wholly sympathetic” either. This is an issue. The ideal observer and the sympathetic one are complementary at best, and possibly even contrary. Being sympathetic reduces distance between individuals; being an ideal observer creates distance. Let us now look at two possible ways of resolving the tension between the ideal observer and sympathy as the basis for moral approbation and disapproval. (Slote will have a third approach considered in detail further below.)

The first is due to Stephen Darwall’s reading of Hume as going beyond moral sentiment (at least implicitly) to rule regulation in accounting for such artificial virtues as justice and related convention-based virtues like adhering to contracts. Hume says that the motivation to justice is produced through sympathy in observing the beneficial results of justice (Darwall 1995: 314-5). Indeed Hume expresses what would become a very Kantian approach, though whether he does so consistently is an issue: “[W]e have no real or universal motive for observing the laws of equity but the very equity and merit of that observance” (SBN: 483). And: “’Tis evident we have no motive leading us to the performance of promises, distinct from a sense of duty. If we thought, that promises had no moral obligation, we never shou’d feel any inclination to observe them” (SBN 518; quoted in Darwall 1995: 302). It is easy to agree with Darwall’s general conclusion that Hume points towards the result that a virtue such as justice requires a rule-based obligation without explicitly embracing it, going beyond empirical naturalism, to account for justice. Through Darwall’s argumentative force, subtlety, and mastery of the details, both sympathy and the ideal observer are undercut, resulting in a Hume that reads much like Kant. This is not Hume’s point of view, though he anticipates and inspired Kant. Hume is not a closet Kantian.  Sympathy is a source of information about the experience of the other individual. But that is not all. Hume’s commitment is that, in addition to the latter, sympathy is also a source of morality. Thus, Hume is constrained to evolve “sympathy” in the direction of “compassion” and “benevolence” to maintain his program. Darwall does not follow him there, but, as we shall see, it is a matter of controversy whether the modern account of empathy should do so.

The second approach is a reconstruction of the disinterested spectator as the sympathetic spectator. In other words, the key term “disinterested” means lacking a “conflict of interest,” not “unsympathetic” in the sense of “inhumanly cold-hearted”. The ideal spectator has to be sympathetic, not in the sense of benevolent (which “sympathy” has come to mean in part thanks to Hume’s usage), but in the sense of openness to the communicability of affect. Appreciating what the other is feeling is a useful, though not always decisive, data point in evaluating the ethical qualities of the agent being considered in the judgment of approbation. It makes a difference in contemplating the moral worth of someone making a charitable gift whether it is done with the feeling of pleasure in being better than the poor wretches who are its beneficiaries or with a trace feeling of the suffering of the other individual, which one’s gift might relieve. What the other is experiencing is useful input to the process of ethical assessment of the quality of character of the individual in question. As sympathy is enlarged in Hume beyond the narrow scope of one’s family and friends, it gives way to benevolence, an interest in the well-being of all mankind, as the basis of morality, while “sympathy” as a term used by Hume is trimmed back and reduced to emotional contagion.

Historically, what Hume does next is to develop his understanding (and definition) of “sympathy” in the direction of “benevolence.” “Sympathy” converges with benevolence as the latter supplements it in the founding of morality in an Enquiry Concerning the Principles of Morals (1751). Without appreciating the consequences for his use of “sympathy,” Hume starts developing the idea in the direction of “benevolence,” the latter being specific benefits that interest us in the good of mankind:

‘Tis true, when the cause is compleat, and a good disposition is attended with  good fortune, which renders it really beneficial to society, it gives a stronger pleasure to the spectator, and is attended with a more lively sympathy (T 3.3.1.21; SBN 585).

Virtue in rags is still virtue, as Hume famously notes, and sympathy interests us in the good of all mankind (“society”) (T 3.3.1.19; SBN 584), including communities distant from us in location or time. In answering the objection that “good intentions are not good enough for morality,” Hume argues back in so many words that good intentions are indeed good enough, granted that good intentions plus good consequences (results) are even better. However, “sympathy” has now taken on the content of benevolence, that is, an interest in furthering the well being of mankind, not just being open to man’s experiences, including suffering. By the time Hume’s Enquiry Concerning the Principles of Morals is published in 1751, “sympathy” has been downgraded to the power of suggestion, nothing more; and the basis of morality is shifted to such sentiments as benevolence, displaying qualities useful and agreeable to oneself and others.

In the following passage in Treatise, we witness Hume’s development of the meaning of “sympathy”. “Sympathy” migrates from a communicability of affect, which includes a concept of the other that aligns with the modern concept of “empathy,” towards a narrower, but not exclusive, sense of emotional contagion. Within the context of the Treatise, Hume builds a complete sense of sympathy out of the contagiousness of the passions by adding the idea of the other to the communicability of affect. In the Enquiry Concerning the Principles of Morals the contagiousness of the passions is all that will remain of sympathy:

‘Tis remarkable, that nothing touches a man of humanity more than any instance of extraordinary delicacy in love or friendship, where a person is attentive to the smallest concerns of his friend. . . The passions are so contagious, that they pass with the greatest facility from one person to another, and produce correspondent movements in all human breast. Where friendship appears in very signal instances, my heart catches the same passion, and is warmed by those warm sentiments, that display themselves me (T 3.3.3.5; SBN 604-5).

When put in context, this points to a remarkable development in Hume’s thinking. Hume moves sympathy from the center to the periphery of his account of human judgments (approbation and disapproval). This is complimented by the contrary movement of taste from the periphery to the center. The social advantages of sympathy in forming human relationships – friendship, enjoyment of the “characters of men,” fellow feeling, and sensitivity to how one’s actions have an impact on others – are shifted elsewhere, amazingly enough in the direction of the aesthetic sense of taste.

Hume explicitly writes:

Thus the distinct boundaries and offices of reason and of taste are easily ascertained. The former conveys the knowledge of truth and falsehood; the latter give the sentiment of beauty and deformity, vice and virtue (1751: 173f., 269; emphasis added).

Thus, Hume is engaging in what we might describe a journey back from morality to its foundation and infrastructure in taste. By 1751, “sympathy” has been reduced in Hume’s work to “natural sympathy,” which overlaps substantially with what we would today call “the power of suggestion”. The merit of benevolence and its utility in promoting the good of mankind through attributes useful and agreeable to oneself and others looms large in founding morality (for example, Hume 1751: 241).

By the time of the Enquiry (1751), the push down of “sympathy” behind compassion and taste is complete. The reactive aspects of “sympathy” get split off and migrate in the direction of compassion. Compassion takes on the content of qualities useful to mankind as benevolence. Taste dominates the field of fine-grained distinctions in the communicability of feelings between persons (“friends”) as well as in the appreciation of beauty.  This former point is essential. Taste gives us an enjoyment of the qualities of the characters of persons in conversation, humor, and friendship that are a super-set of what empathy does today in our current usage with its fine-grained distinctions in accessing the experiences of other persons. The prospect of “delicacy of sympathy” in the social realm of human interrelations is left without further development by Hume. The true heir to Hume’s undeveloped “delicacy of sympathy”, without, however, explicitly having any idea of it, is the psychoanalyst Heinz Kohut whose transformations of empathy include humor, appreciation of art, and wisdom (Kohut 1966).

The other main witness to the vicissitudes of sympathy is Adam Smith, to whom we now turn.

b. Adam Smith’s Philosophy of Sympathy

Adam Smith explicitly distinguishes between sympathy and compassion (pity) in his 1759 The Theory of the Moral Sentiments. He also acknowledges a traditional overlap between the two, noting, however, the generalization of sympathy:

Pity and compassion are words appropriated to signify our fellow-feeling with the sorrow of others, sympathy, though its meaning was, perhaps, originally the same, may now, however, without much impropriety, be made use of to denote our fellow-feeling with any passion whatever (1759: 49).

Here sympathy is not some separate reactive affect that occurs in witnessing the pain and suffering of another individual. Rather sympathy operates as the communicability of affect (the passions) regardless of the particular passion. “Fellow feeling” is used as a high level category that enables Smith stylistically to suggest nuances and fine-grained distinctions in his phenomenological descriptions. An argument might be made that, when all is said and done, “sympathy” and “fellow feeling” are used synonymously by Smith. For example, in defining sympathy, Smith cannot use the same term without succumbing to the logical fallacy of petitio principi:

As we have no immediate experience of what other men feel, we can form no idea of the manner in which they are affected, but by conceiving what we ourselves should feel in the like situation [. . . . ] [I]t is by the imagination only that we can form any conception of what are his sensations [. . . .] By the imagination we place ourselves in his situation, we conceive ourselves enduring all the same torments, we enter as it were into his body, and become in some measure the same person with him, and thence form some idea of his sensation, and even feel something which, though weaker in degree, is not altogether unlike them. [. . . .] That this is the source of our fellow-feeling for the misery of others, that it is by changing places in fancy with the sufferer, that we come either to conceive or to be affected by what he feels, may be demonstrated by many obvious observations . . . (1759: 47-8)

In addition to using “fellow feeling” to define “sympathy,” the mechanism by which sympathy operates is the imagination. Specifically, it is the taking of the perspective of the other in the other’s situation. This points to three results.

(1) If “sympathy” is not synonymous with “fellow feeling,” then what is the difference? Sympathy is not responsive in the sense of pity or compassion, the latter being reactions to the suffering of another. Yet sympathy has its responsive dimension. Sympathy requires a responsive approbation or disapprobation of the beneficial or mischievous conduct of the other individual. In Smith, sympathy is fellow feeling plus (dis)approbation:

That where there is no approbation of the conduct of the person who confers the benefit, there is little sympathy with the gratitude of him who receives it: and that, on the contrary, where there is no disapprobation of the motives of the person who does the mischief, there is no sort of sympathy with the resentment of him who suffers it (1759: 143; chapter abstract).

This is a definitive textual answer. Of course, “little sympathy” is perhaps distinct from “absolutely and positively no sympathy”. But this is just understatement for effect. Sympathy is simply missing in the case of an unmerited boon conferred by a would-be benefactor. The bounds of (dis)approbation align closely with those of sympathy. Ultimately, sympathy is the basis of the moral sentiments for Smith because “to sympathize with” means “to align with the estimation of right or wrong based on fellow feeling”. The nuances that arise are many and varied; but Smith is more consistent than he is generally credited in standardly using sympathy as the source of intuitions about the merit (or demerit) of other individuals. This extends not only to their conduct but in the heartfelt attitude they bring to the conduct and its consequences. When we sympathize with the other – approving or disapproving based on the other’s perspective (not one’s own) – then we are aligned with the values of the shared community, especially the community of well-bred English gentlemen. When sympathy breaks down, when we have no fellow feeling with the other, then it is a strong indication that the other has put himself outside the community and is blameworthy, lacking merit. The result is an ethics of the well-bred English gentleman, including his attachments to reputation, prudence, temperance, and so on.

In addition, Smith’s spectator-like perspective aligns with that of the second person in contrast with Hume’s which has a closer resemblance to that of the third person. Stephen Darwall is keenly aware of this and makes the point: “It is not far wrong, indeed, to think of Smith as one of the first philosophers of the ‘second person,’ if not the very first” (Darwall 2006: 46). In many contexts, especially those is which (dis)approbation is not the main issue, Smith’s use of “sympathy” is indistinguishable from “communicability of affect”. This has led some commentators to equate Smith’s use of “sympathy” with empathy pure-and-simple. Thus Darwall, keenly aware of his own second person inquiry:

I prefer to use ‘sympathy’ for feelings of concern for others that are felt, not entirely as from their own point of view, but as from a third-person perspective of one who cares for them, and to use ‘empathy’ for feelings that either imaginatively enter into the other’s standpoint or result from his feelings by contagion (Darwall 2006: 45).

Of course, Darwall’s previous point is that Smith’s usage (unlike Hume’s) is precisely to take the second person perspective. Therefore, for Darwall, Smith’s usage of “sympathy” requires revision to equate “sympathy” with the third-person perspective, leaving room to rewrite the text using “empathy”. However, Darwall arguably overlooks the point indicated in the above-cited quote that, for Smith, sympathy is fellow-feeling plus (dis)approbation, not fellow feeling pure-and-simple.

(2) Do we merely “conceive what we ourselves should feel in the like situation” or are we allowed (or even required) to take on the characteristics of the other in so far as we are able to do so? This is similar to the question “How complete is the identification with the other?” While the above-cited text suggests that the one individual carries his or her characteristics into the situation of the other, the analysis does not stop there. To be sure, a person never completely stops being himself; yet the meta-rule is to put oneself in the other’s situation with the other’s character and circumstance:

But though sympathy is very properly said to arise from an imaginary change of situations with the person principally concerned, yet this imaginary change is not supposed to happen to me in my own person and character, but in that of the person with whom I sympathize. When I condole with you for the loss of your only son, in order to enter into your grief, I do not consider what I, a person of such a character and profession, should suffer, if I had a son, and if that son was unfortunately to die; but I consider what I should suffer if I was really you; and I not only change circumstances, but I change persons and characters. My grief, therefore, is entirely upon your account, and not in the least upon my own. It is not, therefore, in the least selfish (1759: 501-2).

We imagine what it would be like to be the other person with the other’s character. The ideal spectator runs a cognitive simulation in which one may indeed begin with one’s own characteristics as input, but quarantines one’s own peculiarities in favor of those of the other. To be sure, such an exercise is bound to be imperfect and incomplete. There is a strong identification, yet it remains transient, temporary, and incomplete. At that point, sentiments of approbation or disapprobation emerge, which inform the individual’s moral assessment of the situation and the other person in it.

(3) What is involved in feeling what the other feels, yet not approving of it? Smith allows an extensive continuum of degrees of fellow feeling, reaching from a slight hint of what the other is feeling to full blown identification. Sympathy mostly falls in the middle of this spectrum, a transient and trial identification with the other, soon interrupted. The one individual feels what the other feels, yet not quite as intensely. It seems as though there ought to be a logical space for the possibility of fellow feeling without sympathetic (dis)approbation, given that these are not completely synonymous. Yet it does not seem to occur to Smith to allow it. Consider the situation of the condemned criminal about to be hanged.

When an inhuman murder is brought to the scaffold, though we have some compassion for his misery, we can have no sort of fellow-feeling with his resentment, if he should be so absurd as to express any against either his prosecutor or his judgment (1759: 145).

Indeed we adopt the sentiments of the prosecutor and judgment. Taking a hint from Smith’s language here, we can say that the criminal has placed himself outside the limits of the human community with his murderous deeds, a human community from which he is about to be ejected by being hanged. The lesson learned here is that we may have compassion for lower forms of life, but sympathy is arguably co-extensive with the human and defines the foundation of our participation in the community. For Smith, “sympathize with” is synonymous with “align with the other’s feeling in such a way as to approve or disapprove along with the other” (not a quote from Smith, but Smith’s bottom line).

c. Contractualism and Sympathy in Rawls

John Rawl’s magisterial A Theory of Justice (1971) contains sections on features of the moral sentiments and moral psychology, including a discussion of sympathy and the impartial sympathetic spectator. After the parties in a would-be society have adopted the principles of justice as fairness in the original position, the result is Kantian. Natural abilities such as strength, intelligence, inherited gifts, are unevenly distributed to individuals but are a collective asset so that the more fortunate are to benefit in ways that help those who are least well endowed. Inequalities are arranged for reciprocal advantage. By abstaining from the exploitation of the accidents of nature and social contingencies with the framework of equal liberty and the difference principle, persons express their respect for one another in the constitution of society itself (Rawls 1971: 179). Can this capture the utilitarian approach of the impartial, sympathetic spectator? At first it seems that it can:

Now while it is possible to supplement the impartial spectator definition with the contract point of view, there are other ways of giving it a deductive basis. Thus suppose that the ideal observer is thought of as a perfectly sympathetic being. Then there is a natural derivation of the classical principle of utility along the following lines. An institution is right, let us say, if an ideally sympathetic and impartial spectator would approve of it more strongly than any other institution feasible in the circumstances. For simplicity we may assume, as Hume sometimes does, that approval is a special kind of pleasure [. . . ] This special pleasure is the result of sympathy. In Hume’s account it is quire literally a reproduction in our experience of the satisfactions and pleasures which we recognize to be felt by others [. . . .] Men’s natural capacity for sympathy suitably generalized provides the perspective from which they can reach an understanding on a common conception of justice. (1971: 185-6)

In either case, contractual or utilitarian, the argument moves in the direction of fairness and equilibrium; yet the original (contractual) position more accurately captures the human condition – namely, that individuals have distinct abilities and gifts. In Rawl’s original position, the parties are behind the “veil of ignorance,” disinterested and lacking knowledge of their natural abilities or social situation. Inequalities are just only if they result in compensating benefits for everyone. In contrast, the classical utilitarians have perfect knowledge and sympathetic identification. Both result in a correct estimate of the net sum of satisfaction. But classical utilitarianism fails to distinguish between persons, in effect collapsing distinct persons with distinct abilities into a one dimensional, impartial sympathetic spectator.  The parties in the original position would not agree to the approvals of the impartial sympathetic spectator as the standard of justice, according to Rawls. Why not? Such a spectator does not have access to the concept of risk – the risk that one might be born poor and marginalized rather than (say) rich and in the main stream. It simply is not captured. The veil of ignorance is designed to yield principles of justice as fairness whereby, even if your antipathetic enemy is choosing what role you will play in society (presumably you will end up poor and with limited natural abilities), the advantages that the other party has will be distributed in such a way as to contribute to everyone’s advantage. In contrast, for the impartial sympathetic spectator to yield justice as fairness, the parties are conceived as perfect altruists, whose desires conform to the approvals of such a spectator. “The greater net balance of happiness with which to sympathize, the more a perfect altruist achieves his desire” (1971: 189). In fact, the world is filled with individuals with competing interests, who, moreover, are antipathetic (hostile) to one another, and the utilitarian aligns everyone’s interests in a most counter-intuitive way. Justice is not necessary unless individuals are antipathetic and the interests of individuals come into conflict, the actual situation of the real world. In a world of conflicting interests, sympathy still has a role to play in transmitting affects, but it is not foundational. Is there then a logical space between self-interest and duty to account for the relationship between empathy and altruism?

d. Nagel’s Incomplete Version of Empathy

In Nagel’s argument, the interest of others in the world and one’s own interest are in balance. It is not that the world comes ahead of one’s own – which, arguably, would look like utilitarianism and the greatest good of the greatest number. Although consequences are inevitably a part of the description of the ethical dilemma, the determination of interest is not exclusively reducible to the consequential calculation. Rather we are looking formally and logically at the priority of one’s own interest over against that of the other individuals in the world, and one’s own interest does not have any more priority. But it does not necessarily have any less, for the majority of the world would not be justified in inflicting pain on one individual even if it resulted in their greater good anymore than one individual would be justified in doing so to the majority. Is there a logical space available to establish a link between empathy and the austere ethics of duty (deontology) based on the structure of action? This is needed to avoid succumbing to the reactivity of moral sentiments (shame, guilt, benevolence, compassion, and so on). These are powerful motivators of moral behavior, and as such deserve cultivation, but are logically dubious founders of it.

This argument enables Thomas Nagel (1970) in effect to say “Act so as to reduce the pain (of persons) in the world.” Depending on one’s perspective, this is a special case by way of generalization of the self-interested maxim to “act so as to reduce my own pain” along with “I am in the world with others” and “we are all others (persons)”.  For example, in being altruistic, both my own pain and that of the other are regarded impersonally. Actions that reduce my pain remain self-interested in an obvious way – I am no longer in pain. Acts that reduce the pain of the other are just an impersonal version of my acting to reduce the pain experienced personally.

The next step was not taken by Nagel who elsewhere disparages a version of empathy based on an incomplete and misleading definition. Nagel calls for “an objective phenomenology not dependent on empathy or the imagination” (1974: 402); but this phenomenology may turn out to be inconsistent with his commitment to finite human understanding. Without empathy and the imagination, the bat’s experience becomes the inaccessible thing in itself (ding an sich). The bat is the one who does not know what it is like to be a bat, since the bat lacks my concept of battiness (not to mention such distinctions as echo-location, flying mouse, and mammal). In Nagel 1970 he writes: “Any justification ends finally with the rationally gratuitous presence of the emotion of sympathy; if that condition were not met, one would simply have no reason to be moral” (1970: 11). Here “sympathy” means “pity” or “compassion” or “benevolence,” rather than the possibility of communicating any possible affect or sensation, which was Hume’s initial and primary meaning of “sympathy” (see Hume 1739: 319).

Yes, I should so act to reduce the pain in the world, including the other’s and my own too. But how do I know the other is in pain? The answer is empathy. In any particular situation and with apologies to Kant, altruism without empathy (sympathy) is like a concept without intuition. The vicarious experience of the other’s pain and the processing of it in empathic receptivity and interpretation is an essential part of how the would-be altruist comes to know of the other’s distress. This does not mean I cannot be wrong. It means that I can advance from the possibility of altruism to its implementation in actual situations through marshalling, capturing, and organizing the evidence of interrelational receptivity through empathy. Having established then that empathy provides an essential input to ethical altruism, is it perhaps capable of being elaborated into a foundation for an ethics of caring?

e. Empathy as a Moral Criterion in Slote’s Ethics of Caring

Michael Slote (2007, 2010) finds in empathy the basis for an ethics of caring that provides the basis for a moral sentimentalism. Drawing mainly on Hume, with an occasional nod to Smith, Slote shifts the analysis from an evaluation of behavior to the moral worth of agents engaging in action. One’s ability to empathize defines the boundary of the human community or as Slote puts it (2010: 13) provides “cement of the moral universe”. Slote finds significant inspiration in Hutcheson’s idea of a moral sense of approval or disapproval. Slote then substitutes empathy for that moral sense, however, with significant conditions and qualifications (2010: 33-4). Slote argues that empathy provides an “understandable mechanism for moral approval and disapproval” (2010: 33), lending philosophic rigor to the mere metaphor of moral sense. Slote claims to identify a second order empathy:

In particular, if agents’ actions reflect empathic concern for (the well-being or wishes of) others, empathic beings will feel warmly or tenderly toward them, and such warmth and tenderness empathically reflect the empathic warmth or tenderness of the agents. I want to say that such (in one sense) reflective feeling, such empathy with empathy, also constitutes moral approval, and possibly admiration as well, for agents and/or their actions (2010:  34-5).

Let us return to our paradigm example of the Good Samaritan and see how this works. We are fine as long as the Good Samaritan, for example, feels warmly or tenderly towards the traveler who was waylaid by robbers and left for dead. Empathy is properly understood as requiring a communicability of affect between the rescuer and the observer – in this case Slote or the ideal observer – who approve or disapprove of the moral worth of the agent. Of course, we can contingently imagine that the Samaritan, in the act of helping the stranger, recognizes that what he is doing has significant value, acknowledging himself for his good character and producing a warm feeling, whether of self-approval or mere self-recognition. This warm feeling of the Samaritan, in turn, becomes the target for the empathy of the ideal observer, who experiences a trace affect of it. However, the example works less well if the Samaritan is too busy helping the other to consider self-recognition or –acknowledgement. Even more, the example works not at all if the Samaritan, who is actually a Palestinian, sees that the victim is a Hebrew settler, but, in spite of their being sworn enemies, he recognizes the suffering humanity through a trace affect of suffering disclosed in his empathic receptivity, and decides, in spite of his antipathic negative feeling, to come to the traveler’s aid, not because of any feeling but because that is what duty requires of him. Ambivalence is not a problem and is likely in most scenarios (see Greenspan 1980).

Further issues loom for Slote with altruism, which is considered in more detail in his first empathy book, which complements the later one consistently (2007, 2010). Thus, he distinguishes empathy as a method of accessing the experiences of the other from specific emotions such as compassion, pity, love, and so on. However, Slote then mixes empathy with altruism: “A person who is fully empathic with and concerned about others will sometimes give up something that she wants in order to help another person gain something good” (Slote 2007: 116). In this context, Slote points out that this “give up something” is not irrational. Indeed it is not. This sounds like altruism because that is what it is. Only if “rationality” is mistaken for one’s own narrow self-interest (not Slote’s problem), whether long or short term, does it become impossible to help others without falling into unqualified egoism. In general, when, on the basis of empathy, a person does something to help another person, the helping shows up as a form of altruism. Thus Slote:

The criterion offered [. . .] in terms of empathic caring was a moral criterion, a criterion of moral permissibility, and when I spoke of supererogation, I was again speaking in specifically moral terms. In that sense, too, the empathically caring individual can be characterized as possessing (a) moral virtue, and I think it is fair to say that the present book has been primarily interested in the moral aspect of the ethics of care (Slote 2007: 118).

The moral aspect of the ethics of caring is precisely empathy, according to Slote. However, contra Slote, empathy does not require that one do anything other than listen empathically and talk empathically in response, thus falling short of the practical caring (for example, serving dinner) intended here by Slote. Indeed a quiet, rich empathic silence is often sufficient. If one decides to take action on the basis of empathy, then the action may be altruistic if the beneficiary is a stranger or the action may be caring (in the narrow sense of feeding one’s own hungry child) if the beneficiary is someone “near and dear,” who one is obligated to attend to in any case. Thus, it is important to distinguish directly helping others by caring for their physical needs, feeding the hungry, binding up the wounds of the injured, sheltering the injured traveler, and so on, and empathizing with the other in such a way to allow the other to regain their emotional equilibrium when it has been lost or upset (admittedly by traumas and suffering). If a person telling a moving story is moved to tears in the course of the narrative, strictly speaking, offering the individual a tissue to dry his eyes is an action. However, it is a largely symbolic gesture that says “I recognize the human suffering and wish to comfort it” rather than an action such as that of the Roman Centurion who cuts his cloak in half and gives it to the freezing beggar.

It is a useful rule of thumb that altruism ministers to one’s physical (bodily) needs whereas empathy responds to and is aroused by a person’s emotional and affective expressions of animate life. The two can diverge dramatically. For example, after the fall of the Soviet Union Rumanian orphanages were understaffed, bare bones institutions that rigged up mechanical, assembly line-like ways of delivering bottled milk to infants, like feeders in a bird cage. The results were the production of symptoms developmentally similar to neurological damage, autism, and infantile psychosis (M. A. Diego and N. A. Jones 2007:   161; Spitz 1946). Many of these symptoms were able to be reversed by adoptive, caring, nurturing parents, but, depending on the duration of the neglect, not all.

Much is made of the short circuiting of action in and by empathy by professional practitioners of empathy. This is due to the uses of empathy in psychotherapy, counseling, and psychoanalysis. In such situations, it would be counterproductive, if not harmful, for the therapist actively to intervene altruistically in the client’s life with specific maxims and advice about what to do. Psychotherapy activates many boundaries between therapist and client, including ethical ones and ones of action. Psychotherapy provides a counter-example to Slote. Neither therapeutic empathy nor empathic distress are a motive for action, though they can clarify the context of action or provide insight into both reasons and causes. If one grasps aspects of the other and his situation through empathy, then one may discover reasons that one did not know were relevant or engaged by his character or that character in a particular situation.

Next, going beyond imperfect duties such as altruism to perfect ones, Slote offers a general criterion of right and wrong action based in the notion of empathy – specifically empathic caring or concern for others:

[. . . ] One can claim that actions are morally wrong and contrary to moral obligation if, and only if, they reflect or exhibit or express an absence (or lack) of fully developed empathic concern for (or caring about) others on the part of the agent (2007: 31).

Slote proposes that empathy – more precisely, “empathic caring” – is a moral criterion. Are we then obligated to strive to develop our empathy (empathic caring) so that we are equal to the criterion? Slote does not explicitly assert that we are obligated to develop our empathy; yet without empathy we do not flourish as humans.

Slote asserts that he does not advocate implementing an obligation to be empathic. However, this risks getting stranded on the horns of a dilemma. First of all, such an obligation (Slote asserts) would set a bar too high for most people, given that we do not get training in empathy, and so risk violating the “ought implies can” injunction. That is, we cannot create an obligation, which we know in advance that we cannot live up to. Yet if we are to advance to “fully developed empathic concern” do we not have an obligation to develop empathy even if it is not required in any particular situation? At the very least, empathy would be an “imperfect duty to self and others” similar to developing one’s talents and gifts – empathy, humor, charity, wisdom – and using them to create community and a flourishing society.

“Fully developed empathic concern” is doing a lot of the work for Slote here. It will contain all the conditions and qualifications required to restrict empathic concern from requiring supererogatory deeds. Empathic concern does not. Empathic concern includes those geographically remote, and excluding them would unfairly subject them to violations of obligations. Empathic concern allows for the development of empathy in those whose initial (“natural”) endowments of it may be less generous than the average individual (practice improves talent).  The argument is instructive and useful – as well as ad hoc.

In order to preserve empathy as a moral criterion, even against these issues, Slote argues against describing empathy as partialist (favoring those “near and dear”). Slote argues against the common sense intuition that people initially seem to have less empathy towards those who are different – different race (skin color), different religion, different gender, different diet than those to whom one is partial such as one’s family (2007: 35). Unfortunately, distrust seems to be the default attitude towards strangers, and often with good reason. Therefore, in a reaction formation meant to manage mistrust, many traditional cultures make it obligatory to shelter and protect guests. What better way than carefully to keep watch over them than to declare they are “guests”?

It is a further question of how the example of the Good Samaritan affects our own empathic receptivity as potential (ideal) observers of the would-be moral agent. In observing the suffering of the victim of the robbers as well as observing the Samaritan’s empathic suffering with the victim, we (as observers) have several empathic experiences. Like the Samaritan, we empathize directly with the distress of the victim. We also experience the suffering of the Samaritan who experiences a double pain. The Samaritan experiences first the vicarious experience (pain) of the suffering of the victim. In addition, the Samaritan experiences the pain experienced in sacrificing his own time, effort, and money in interrupting his (the Samaritan’s) trip, binding up the victims wounds, and leaving him with the Inn Keeper, promising to pay for any additional expenses upon his return. If the Samaritan gives himself credit for his good deed, in effect saying to himself “I acknowledge myself as an agent for doing the right thing in a tough spot,” then we (as observers) can arguably also have an empathic experience of the warm feeling of that self acknowledgement. Slote finds in this experience a second order empathy (Slote 2010: 39) that contains a warm feeling of approval. Slote then finds in this experience an important contribution to the moral basis of an ethics of caring. Some critics find this second order empathy to be a mis-description of the phenomenon, mistaking a response of what Hoffman calls “sympathetic distress” towards the suffering of the other for empathy (Hoffman 2000: 87-8). Indeed Hoffman, who is consistently cited by Slote with agreement, clearly distinguishes “empathic distress” from “sympathetic distress,” granted that Slote does not use such a distinction. For empathy to become an input to morality, it is first transformed from “empathic distress” into “sympathetic distress,” at which point the latter can become the motive for pro-social (altruistic) behavior such as the Samaritan’s (Hoffman 2000: 87-88).

It is a matter of controversy that empathy includes a warm feeling of approval (as asserted by Slote 2010: 36-41). Slote implicitly lines up with Rawls where approval provides a special kind of pleasure, granted that Rawls (like Hume) restricts the argument to “sympathy” (Rawls 1971: 185-6). Approval or disapproval is not the only ethical response possible. Empathy is distinct from (ethical) approval in that it gives rise to a whole host of downstream responses (reactions). As distinct from empathy, certain forms of antipathy can also be marshaled as when a (hostile) enemy of the traveler experiences Schadenfreude (delight at someone’s misfortune) upon witnessing the latter’s misfortune. Indeed what makes the parable so powerful and dramatic is that the Samaritan (Palestinian) is actually the enemy of the victim, but through empathy recognizes suffering humanity and then in a separate decision acts ethically and humanely like a neighbor, not an enemy.

Thus, the altruistic person must frequently deal with overcoming empathic distress – that is, a too intensely felt experience of the other’s pain that goes beyond a vicarious experience of pain and becomes the individual’s own pain pure-and-simple (Hoffman 2000: 87-8). Empathic distress can become so intense that one tries to flee from the situation rather than engaging other alternatives such as helping. Under this interpretation, instead of just being hard-hearted (which remains a possibility in principle), empathic distress is what happened to the first two would-be helpers who passed by the victim/traveler, crossing the road due to this empathic distress in order to put distance between themselves and the source of suffering.

However, one may object, does this not raise the bar too high on altruism? Altruism occasions a triple pain. It now produces three episodes of pain – first the initial distress (for example) of traveler waylaid and beaten by robbers; second, the vicarious experience of the victim’s pain as experienced by the would-be Good Samaritan; and finally the sacrifice (pain) incurred by the Samaritan in aiding the victim. Of course, if successful, altruism eliminates the initial suffering of the victim and by implication the vicarious pain in which the Good Samaritan is empathically connected to the target of altruism. This leaves altruism only with whatever pain is caused by the cost and effort incurred in aiding the victim. In contrast, empathy is left with the initial suffering, the vicarious experience of pain, and the question of what, if anything, to do about the suffering disclosed by one’s empathy. Of course, one possible answer is to act altruistically. Alternatively, one could also simply cross over – cross the road – and pass by. Thus, in answer to the objection that this analysis through empathy sets the bar too high for altruism, the answer is direct. Altruism is indeed a high bar; but one which we are challenged by and, with ethical effort, able to surmount. Empathy tells us what the other is experiencing; altruism what to do about it.

4. The Continental Tradition

The Anglo-American and Continental traditions (neither of which is homogeneous in themselves) have enjoyed an expanding exchange of views in comparison with past periods when each tradition tended to maintain its own island of ideas. Still, the Continental tradition has its own voice and views on empathy and ethics. Three topics have arguably have been the target of a more dedicated inquiry from the Continental side: Nietzsche’s empathic sense of smell as a compliment to his philosophy of suspicion, especially in On the Genealogy of Morals (1887); the Holocaust, especially as interpreted by Hannah Arendt; and the role of the Other (with a capital “O”), especially in Levinas (1961). None of these topics are exclusively the domain of Continental thinking, but are rather invoked here as witnesses of a distinctly Continental contribution.

a. Nietzsche’s Empathy of Smell Complements His Suspicion

Although the word “Einfühlung” does not occur in Nietzsche, his approach is arguably an empathic one, especially as empathy activates the primitive sense of smell. Nietzsche’s status as an outsider, as an individual of the limit experience (Grenzerfahrung), informs his sense of smell – and his empathy – with the dynamics of moral sentimentalism in a fundamental way. Nietzsche’s empathy informs his suspicion time-and-again. Empathy functions as a complement to the method of suspicion. Nietzsche feels – and smells – more acutely and distinctly than the various persons that are intermittently the target of his debunking – the scholar (scientist), artist, priest, last man, the mass (herd) man, even the higher man. For example, behind the moral prescription of “love of one’s neighbor” lies the ascetic priest’s antidote to depression, namely, the performance of petty pleasures of doing (minor) acts of kindness to those who are less fortunate. Nietzsche’s empathy detects the petty pleasure as a trace affect, which, in turn, feeds his suspicion. Nietzsche detects the will to power in a cautious dose of the petty pleasure of doing a good deed that costs the doer nothing and benefits the recipient equally little, but has the result of disclosing a happiness of “slight superiority” (1887: Third Essay, Section 18).

For Nietzsche, the sense of smell functions empathically in disclosing a mostly unsuspected trace of weariness. Suspicion is frequently called out as being essential to Nietzsche’s debunking of established bourgeois morals and iconoclasm towards cherished values. Christian ethics, the love of Saint Paul (“agape”), and their implementation by the Church of Rome, are not what they seem to be according to the Christian account. Once called out, Nietzsche experiences what Hoffman calls “empathic distress” at the leveling and mediocrity from which contemporary man suffers – the emptiness, apathy, malaise, and depression – in short, the nihilism of which the average man is mostly unaware but from which he too suffers. However, Nietzsche does not take the next step to “sympathetic distress” (Hoffman 2000: 87-8). Instead this underlying suffering is transmitted via emotional contagion through the sense of smell as further input to Nietzsche’s empathic processing: “Bad air, bad air! The approach of some ill-constituted thing, that I have to smell the entrails of some ill-constituted soul” (1887: First Essay, Section 11).  Behind the Christian love of the “meek that inherit the earth” Nietzsche’s empathy discovers a trace affect of ressentiment; behind the anti-Semitism of the Church of Rome lies the vengefulness and hatred of slave morality. What passes for virtue is lack of opportunity for badness. Undertaking an inquiry into the formation of ideals such as the Christian “Last Judgment,” “the Kingdom of Heaven,” and [Christian] “faith, hope, and love,” using his empathy, Nietzsche discovers an empathic trace affect of “the intoxication of sweet revenge.” His suspicion then uses these traces to unmask pretensions. All the while, rhetorically exclaiming, “I’ll open my ears again (Oh! Oh! Oh! And close my nose)” (1887: First Essay, Section 14).

Nietzsche’s empathic debunking continues. He quotes from Dante and the Church Father Tertullian that the pleasures of the blessed in heaven will be enhanced by watching from above the tortures of those who are damned in Hell. Not empathy but rather what has come to be called, even in English, “Schadenfreude” – an enjoyment (Freude) at the damages (Schaden) being done to another. Yet even Schadenfreude implies an empathic communication of affect, since the observer’s enjoyment is enhanced by a deep grasp of the suffering of the damned, an appreciation enhanced by a trace affect of the suffering. However, the relationship of Schadenfreude to empathy is one of reactive antipathy. Schaenfreude like sympathy is reactive. In addition to the communication of affect, it includes a response to what is transmitted. Instead of approval or disapproval as in the case of sympathy, the response in Schadenfreud is one of enjoyment at the suffering of the other.

Bad conscience (guilt) seems like a mark of advancing civilization. However, suspicion fulfilled by empathy discloses otherwise. “Bad conscience” is an illness. The more advanced the civilization, the more advanced the guilt. This illness (bad conscience) is hostility, cruelty, destructiveness turned against oneself. This causes “the gravest and uncanniest illness, from which humanity has not yet recovered, man’s suffering of man” (1887: Essay Two, Section 16).

Individuals who escape bad conscience are rare. We can at least envision the possibility in figures in literature such as Achilles, Faust, or Nietzsche’s own cipher Zarathustra, the latter in his better moments of recovery from the great contempt. Nietzsche drives forward his empathic inquiry into the suffering of modern persons and their ideals by a debunking of ascetic ideals. Following his nose – and his empathy – Nietzsche calls again, “And therefore let us have fresh air! fresh air! And keep clear of the madhouse and hospitals of culture!” (1887: Third Essay, Section 14). The result? The priest does not discharge the ressentiment of the modern mass of men, leading their lives of silent desperation and suffering; rather the priest alters the direction of the ressentiment and turns it against the individual: “This is brazen and false enough: but one thing at least is achieved by it, the direction of ressentiment is altered” (1887: Third Essay, Section 16). In conclusion, using a suspicion informed by empathy, Nietzsche famously asserts the position: man would rather will nothing than not will, setting the stage for nihilism. Nietzsche’s empathy points to hidden (and not so hidden) suffering in unexpected places – art, science, religion, philosophy, and, above all, morals. However, in every case, a pattern emerges. Empathy complemented by suspicion shows that it is not so much suffering that man dreads as suffering’s meaninglessness. Thus, ascetic ideals give meaning to man’s suffering by holding out asceticism as a path to something higher. However, some events defy both suffering and meaning and leave us numb, like a deer in the headlights.  We now turn to one such event.

b. The Challenge to Empathy of the Event of the Holocaust

The Holocaust represents a challenge to our empathy as we try to grasp the meaning of historic event from afar in anguished, benumbed remembrance. We can grasp the killing of a single individual as a crime and the killing of many as an even more serious crime. We are challenged to grasp the mutual slaughter of armies on fields of battle in the bloodbath known as “history,” but tentatively rise to the occasion as in Tolstoy’s account of the Battle of Borodino, Crane’s The Red Badge of Courage, or even Boden’s Black Hawk Down. But the systematic, bureaucratized, automated destruction of the Jewish population of Europe by the Nazis between 1938 and 1945 is a challenge to our empathy for so many reasons. It remains one of the defining events of post modern ethics. If we cannot empathize with it, we cannot imagine it. If we cannot imagine it, we cannot punish it. If we cannot punish it, we cannot forgive it. We are burdened with it in a way that defines, chokes, and diminishes our humanity. We are stuck with it in a way that defines, chokes, and diminishes what is possible for human beings; but in the face of which we have to go forward into possibility nonetheless.  It is important to note this was accompanied by and included the extrajudicial killing of other “life unworthy of life” such as the mentally ill and retarded, gypsies, gays, communists, uncooperative members of other religious and political parties. However, the racial laws and anti-Semitic ideology that specifically preceded the event, targeting Jewish people, make it their Holocaust in a special and unhappy way.

Grasping these ideas require putting one’s thoughts and sensibility in a place that they usually do not venture. In the early days of World War II and prior to the automation of killing in the death factories such as Auschwitz, it was difficult for soldiers and paramilitaries to kill people for eight hours a day by shooting them. However, continuous killing is what was required of the Nazis soldiers when there are so many people to kill. That was what the so-called special intervention groups [Einsatzgruppen] had to do. In addition, it is difficult to watch people suffering over so long a period of time, especially if you have insufficient bullets to shoot or gas them all immediately. This is a challenge for any approach to genocide, even after the intended victims have been marked with a yellow star or otherwise “branded,” equated with vermin, insects, and dehumanized. On the street, people still look like humans when we confront them face-to-face or even face-to-back. The misuse of the Nazi concept of duty, which only superficially resembles a deontological one, has been often noted. It occurs again here and should never be mentioned without being challenged. Briefly, the fallacy consists in making an exception for a subset of humans, thus contradicting one’s own humanness. Even formally, the good Nazi morally contradicts himself – a consistency in shooting only one or a few types of persons (in addition to Jews – gypsies, communists, Catholic converts, gays, mentally retarded, physically disabled – the list grows tellingly) – is inconsistency pure-and-simple. Arendt is worth quoting at length:

. . . The murders were not sadists or killers by nature; on the contrary, a systematic effort was made to weed out all those who derived physical pleasure from what they did. The troops of the Einsatzgruppen [responsible for shooting] had been drafted from the Armed S.S., a military unit with hardly more crimes in its record than any ordinary unit of the German Army, and their commanders had been chosen by [Chief Commander] Heydrich from the S.S. elite with academic degrees. Hence the problem was how to overcome not so much their conscience as the animal pity by which all normal men are affected in the presence of physical suffering. The trick used by Himmler – who apparently was rather strongly afflicted with these instinctive reasons himself – was very simple and probably very effective; it consisted in turning these instincts around, as it were, in directing them toward the self. So that instead of saying: What horrible things I did to people! the murderers would be able to say: What horrible things I had to watch in the pursuance of my duties, how heavily the task weighed upon my shoulders (Arendt 1971: 105-6).

While life is filled with moral ambiguities and difficult ethical choices, this is hardly one of them. What was done was wrong and to be condemned in the strongest terms. Nor is lack of empathy what represents the moral problem. It is the killing.

What made it easier for the soldiers to do their “duty” – commit murder (genocide) – was the manipulation by the leaders to deflect the individual soldier’s natural empathy for the prisoner and to increase the soldier’s empathy for himself, deflecting the natural trajectory towards the other. The “animal pity” and “instinctive reasons” against killing humans are an incomplete form of empathy, based in a mechanism like emotional contagion. In Emile, Rousseau refers to a pre-reflective sentiment of pity. “I am, so to speak, in him, it is in order not to suffer that I do not want him to suffer. I am more interested in him for love of myself, and the reason for the precept is in nature itself, which inspires in me the desire of my well-being in whatever place I feel my existence” (cited in Birmingham 2006: 42). Himmler was afflicted with what Martin L. Hoffman (2000) describes as “empathic distress.”

Without being able to engage further in the infinite sorrow and commentary invoked by the Holocaust, the relevance to empathy is direct. The issue is whether empathy can be used for harm as well as good. In invading Poland and the Netherlands in 1939 and 1940, the Nazis attached sirens to the Stuka dive bombers creating an uncanny noise that seemed to get inside the heads and hearts of the civilian population causing empathic distress. Although it may sound strange to say it, especially after reading Slote, this was based on the Nazi empathy with the victims. On the Anglo-American side, Slote’s reply is that this is not “fully developed empathic concern”. Indeed it is not.

One might try to turn such an example in the direction of an ethics of caring based on empathy by saying in effect, “Look at how fundamental empathy is.” This is accurate. However, what is missing from such a turning is use of empathy separately from its ethically informed application.

The maneuver of the Nazi (or individual psychopathic criminal) of “getting inside someone’s head” is different than being distressed by what distresses him. At the very least, the latter requires a communicability of affect or emotional contagion. Nazis and psychopaths are believed to be able to “get into your head,” but arguably they are noticeably lacking in empathy. They are also thought to lack a conscience (or at least a properly developed one), so an appeal to the example of the psychopath may not be decisive. The variables of missing empathy and missing conscience inevitably confound one another.

Such examples as psychopaths and Nazis allegedly using empathy point in the direction of multiple empathic phenomena such as emotional contagion, gut reactions, and primal pity, that are not empathy pure-and-simple but rely on the same somatic and semantic functions. The distinction between the psychopath or Nazi “getting inside one’s head” and being empathic is a fine one. Like other “diseases” of empathy such as autism, the behavior and motivations lie along a continuum between extremes. At one extreme, empathy is conspicuous by its absence. At the other extreme, low level empathic functioning is in evidence as emotional contagion along with and aspects of pathological (or criminal) behavior in a high functioning, educated individual, who also enjoys aspects of normal empathy.

The anti-foundationalist argument asserts that empathy – whether fully developed or not – does not supply its own ethical application. Empathy does not supply its own ethical justification. Empathy does indeed supply the otherness of the other – simply stated, the other. It is a separate step to care for the other, say, altruistically, or not care for the other. The empathy provides me access to the suffering of the other. It is a further step to take action to reduce that suffering in line with one’s conscience and other ethical conditions and qualifications.

Thus, the supposedly empathic Nazi spends the day shooting the helpless enemies of the Aryan race and feels a full measure of suffering (of the victims), because his mirror neurons are working normally; but instead of saying “Look how they suffer” says “Look how hard my work is – look how much I suffer.” The fall back position for the Anglo-American philosopher such as Slote is to argue that “full, adult human empathy” requires an education (along the lines of Hoffman’s inductive discipline) that leads one to experience strongly with the distress of others. Such an induction of the other’s distress with one’s own results in an inhibition of deliberate harm to the other. But wait. We already have that. It’s not induction of empathy that is needed. This hypothetical Nazis is already suffering, but continues to shoot due to a defective, misguided sense of duty to the Fuhrer. He needs instruction in ethics: Killing is wrong, regardless of what the Fuhrer says. As noted above, this example is discussed by Hannah Arendt in the context of Himmler’s animal pity for his men – under one interpretation (not necessarily Arendt’s) how he provided leadership as an “empathic” Nazi (Arendt 1971a: 105-6). The counter-counter-argument is direct. It is not empathy that inhibits one’s performing harm but rather an ethical prohibition against doing so, regardless of whether one enjoys inflicting pain or not, that stops the hurtful action. In that sense, empathy doesn’t care – it tells you how the other feels – whereas ethics tells one what one ought to do about it. In the final analysis the Nazis or psychopath exemplifies a pathological, distorted, immoral use of empathy. It is a part of the possibility of empathy to be so used and abused, though human beings with integrity and character will undertake the positive development of empathy so that the misuse does not occur or is made less likely. Of course, this has the distressing implication that we are perhaps not as different from the average, everyday Nazi in regard to our empathic capacities as we might want to imagine.

c. Ethics Against Empathy in Levinas

The third contributor in the Continental tradition is Emmanuel Levinas. The paradox of Levinas and his contribution is that someone who makes a substantial contribution to engaging the Other (as with a capital “O”) still has so little to say about empathy, fellow feeling, or sympathy. In fact, nothing. Arguably, Levinas is the anti-empath; yet much of what he says can be marshaled and read as a contribution to the conversation about empathy. For Levinas, the other is radically different. This does not just “raise the bar” on empathy and make it hard; it makes it impossible.

Still, input to a process of empathy is in evidence. For Levinas, access to the other and the infinite depth of the other’s alterity is given through the other’s face. The face is a hot spot announcing the epiphany, the arrival in force, of the other. The infinity that is available in the face is the source of the ethical power of the other in making an unconditional demand on the individual to be regarded and treated ethically as under an obligation without limitation or qualification. Receptivity to the other occurs in the embodiment of the other’s transcendence in the face. The face plays a central role in announcing the other. The other, in turn, commands responsibility. The obligation and ethical demands that the other imposes on the individual are apparent to merest inspection in the face of another human being. “The epiphany of the face is ethical” (Levinas 1961: 199). The hunger, destitution, and suffering expressed in the face of the other shows up as a demand without further comment being permitted. The face puts a stop to my egoism, my enjoyment of the world as if the world were my oyster and only mine. In this context, the word “empathy” does not occur. Yet it is implicitly able to be conceptualized en passant as “the reduction of the other to the same by interposition of a middle and neutral term” such as “cognition of [transient] identity” (1961: 43). In contrast, “we name this calling into question of my spontaneity by the presence of the other ethics” (1961: 43).

To give meaning to one’s presence is an event irreducible to evidence. It does not enter into intuition; it is a presence more direct than visible manifestation, and at the same time a remote presence – that of the other …. The eyes break through the mask – the language of the eyes, impossible to dissemble. The eye does not shine; it speaks. The alternative of truth and lying, of sincerity and dissimulation, is the prerogative of him who abides in the relation of absolute frankness, in the absolute frankness which cannot hide itself (1961: 66).

From this perspective, it is ethics against empathy. The key term here is “absolute”. The other, the face – in this passage, the eye(s) – embody the absolute. The absolute points toward and includes the infinity that is constantly in play in Totality and Infinity. In contrast, empathy totalizes the self and the other in providing evidence as a trace affect of how the self and the other are similar or even transiently identified; whereas ethics requires the other as a radically other, infinitely other, ethical demand. Yet the reader cannot help but suspect that there is an enlarged sense of empathy beyond the specific intentionality of apprehending in evidence what another feels because I feel it too. Empathy provides evidence of the other in that I know what you feel, because I feel it too, at least as a trace affect. The face is also a “hot spot” of empathic clues and receptivity. Yet, for Levinas, evidence of the other would be the ethically irrelevant, toy problem of other minds in (merely) academic philosophy. In empathy, the self and other form (or would form) a totality; but the ipseity (the self) of the I and the other go beyond totality into the infinity of absolute separation and difference in which the self and other are infinitely incommensurable. From this perspective, empathy is a regression to intentional phenomenology, a regression to Heideggerian care, a regression to inauthenticity. An attempt to reconcile the tension between empathy and the other in Levinas argues that empathy, even as a method of gathering evidence, contains at its core an irreducible respect for the other and the other’s demand that, independent of approval or disapproval, recognizes the other’s infinite authority to block my arbitrary actions towards her or him.

The one is about to lie to the other or kill the other over some trivial (or grand) matter, but is stopped up short by the recognition that one would be killing a type (though not a token) of oneself. The contingent matter of fact of nature is then institutionalized, codified, and canonized into an obligation. “Thou shalt not kill, lie, steal” and so on. The obligation is now established. However, the step from a “stopped up short” to an obligatory “Thou shalt not. . . ” is contingent and problematic. As long as the affects (and so on) disclosed through empathy are such as to support the demand of the other and of one’s obligation to the other, then we are on firm ground. However, when the demand fails or is manipulated by advertising, social pressure, or propaganda to disqualify the other and reduce the other into an subhuman entity prior to extra-judicial execution, then the lack of an ethical (moral) criterion independent of affects is sorely missed. It is necessary to transform one’s empathic distress, experiencing the suffering of the other vicariously, into a sympathetic distress that engages the aspect of feeling sorry for the other, resulting in pro-social intervention to assist the other (Hoffman 2000: 87-8). No man (individual) is an island, and narrow self-interest is readily subordinated to the imperative to reduce suffering in the world includes both the self and other on the list of agents without necessarily giving one or the other priority.

5. Empathy in the Context of Psychoanalysis and Ethics

The psychoanalyst Heinz Kohut defines empathy as the primary method of data gathering about other human beings in the discipline of psychoanalysis.  Thus, Kohut writes: “Empathy does indeed in essence define the field of our observations” (1977: 306). He offers the resonant image of empathy being the oxygen within which the developing child lives, breathes, and grows up to be a healthy human being, and, in turn, in which the adult flourishes in his or her relationships in love and work. Tactically, empathy is a method of data gathering about what is going on with the other person, and without empathy one’s appreciation of the other is incomplete. Strategically, the empathy of the other person is that which humanizes the individual, and when one individual loses the other, the individual is left apathetic, lethargic, feeling lifeless, depressed. Empathy is the oxygen breathing life into the relationship between individual and other, a metaphor introduced by Heinz Kohut (1977) without, however, Kohut extending it to the ethical dimension. In the contemporary Continental tradition, such an extension of empathy is left to Larry Hatab and John Riker, who note that empathy is a primal existential condition that makes ethical life possible (Hatab 2000; Riker 2010). Such an approach, for example, limits that of Slote, Hume, and Smith, whose use of empathy (sympathy) does indeed overlap extensively with fellow feeling in the sense of the communicability of affect yet also is extended to encompass an essential aspect of approbation or disapprobation (and the primary “warmth” (Slote 2010)) of the other’s heartfelt attitude and sentiment.

Empathy is foundational for psychoanalysis, or, more precisely, Kohut’s version of psychoanalysis that has also come to be called “self psychology”. With insufficient, misdirected, or distorted empathy, the growing, developing person (child) is incomplete. In severe cases of lack of empathy the result is hospitalism (Spitz 1946), profound emotional detachment and lack of connectivity similar to infantile autism. In cases of less severe but still traumatic failures of empathy in parent-child relatedness, the results are defects of the self, similar to but not reducible to character disorders, displaying features of narcissistic grandiosity or the seemingly compulsive pursuit of materialistic ideals of status, stuff, and the conventional success of the inauthentic mass man. The social malaise spreads, displaying ethical failings from road rage to children demanding the latest stuff to school yard bullying, precipitating suicide. Getting something for nothing, inner emptiness, immature grandiosity, and fragile self-esteem are characteristic of disorders of the self, resulting from defective and incomplete empathy (Riker 2010: 15-18). Lack of self-regulation is expressed as the numbers of people who are overweight reaches epidemic proportions, psychiatric sedatives and mood stabilizers (that is, drugs) are a growth industry in avoiding ethical responsibility and the (imperfect) duties of developing one’s talents. Addictions and chronic over-indulgence in alcohol, gambling, sex, recreational (illegal) drugs, and rampant cheating on everything from school testing to income tax to faithlessness to one’s spouse express an unhealthy sense of entitlement. Individuals strive to bandage over the pervasive feelings of inner emptiness and feelings of being a fake in spite of the external trappings of material success. The resulting image is a Nietzschian one – everywhere fragments of persons and no where a complete, whole human being, capable of engaging life with integrity (wholeness). The antidote is empathy. Empathy functions as an on-going process of distinguishing, sustaining, and strengthening the structure of the self.

Paradoxically, the structure of the self is distinguished, sustained, and maintained through failures of empathy, but failures in a phase appropriate, non-traumatic context that enable growth and development going forward. In itself, empathy provides symptom relief to emotional upset and behavioral acting out with drugs or sex, which, as symptom relief, does not last over the long run. Empathy comes into its own when, in an on-going empathic relationship, empathy breaks down and fails in a phase-appropriate, non-traumatic way. These non-traumatic failures of empathy occur within a context of successful empathy that lays down and builds psychic structure in the self. This structure enables the individual to deal with the slings and arrows of outrageous fortune, as setbacks, breakdowns, defeats as well as accomplishments inevitably arise in the course of life. Empathy becomes the foundation of an ethics of excellence through its contribution to the development of the self.

Empathy heals the self, and a well-integrated self is one able to sustain the commitments required to keep one’s word, avoid cheating and self medication with alcohol and recreational drugs, productively engage in satisfying activities and relatedness to others, and contribute to the community.

Empathy is a form of receptivity to the other; it is also a form of understanding. In the latter case, one puts oneself in the place of the other conceptually. In the former, one is open experientially to the affects, sensations, emotions that the other experiences. Undertaking an ethical inquiry without empathy – sensitivity to what is happening to and with the other – would be like engaging in an epistemological inquiry without drawing on the resources of perception. Thus, empathy is a method of access as well as a foundational structure as such.

6. A Common Root of Empathy and Ethics

In the final analysis, morality is separate from empathy and neither necessarily grounds the other, although arguably both point to a common root in human beings as the source of possibility. It will not be practical to argue here at this late point whether humans are intrinsically good or evil. Human beings are intrinsically human. Here “human” means intrinsic possibility. Human possibilities include both good and evil as well as empathy. The evidence provided by the history of the 20th century is not encouraging, yet it is not too late to turn it around as regards the present and future of humanity. Humans are also capable of great good works as demonstrated in the agricultural revolution of high yield grains that ended hunger for decades, medical “miracles” such as the eradication of small pox and other diseases, which saved many, many millions of lives. No doubt, cynics will find a flaw in every accomplishment and assure that no good deed goes unpunished. Indeed the consequence of our actions often escape us; and the propagation of forgiveness is an innovation and recommendation well counseled (Arendt 1926/65, 1971; Tutu 1999). Likewise, it is a part of the possibility of empathy to be so used and abused, although humans with integrity and character will undertake the positive development of full, adult empathy so that the misuse does not occur or is made less likely.

7. References and Further Reading

  • Lou Agosta. (1984). “Empathy and intersubjectivity,” Empathy I, ed. J. Lichtenberg et al. Hillsdale, NJ: Lawrence Erlbaum Press.
    • Engages the implicit transcendental argument that empathy is the foundation of intersubjectivity (community).
  • Lou Agosta. (2010). Empathy in the Context of Philosophy. London: Palgrave. Lou Agosta. (2010). Empathy in the Context of Philosophy. London: Palgrave.
    • A hermeneutic account of empathy, extending it by means of Heidegger (hermeneutics), Husserl (phenomenology), Searle (language analysis), and Kohut (self psychology).
  • Aristotle, “On dreams,” Loeb Classical Library: Aristotle VIII: On the Soul, Parva Naturalia, On Breath, tr. W.S. Hett. Cambridge, MA: Harvard University Press, 1936.
    • The first reference to ‘empathesteros’ in the tradition.
  • Hannah Arendt. (1926/65). Love and Saint Augustine. University of Chicago Press, 1996.
  • Hannah Arendt. (1971). Eichmann in Jerusalem. New York: Viking Press.
    • Input to the debate whether Himmler’s (Nazi) “animal pity” rises to the level of empathy and if so, so what?
  • John L. Austin. (1946). “Other Minds,” Classics in Analytic Philosophy, ed. R.R. Ammerman. New York: McGraw-Hill, 1965: 353-78.
    • Footnote about a “counter-part feeling” that sounds like the vicarious aspect of empathy.
  • Bruno Bettelheim. (1974). A Home for the Heart. New York: Bantam Paperback, 1975.
    • Many uses of empathy in the context of milieu therapy, informed empathically by Bettelheim’s survival of the Nazi concentration camps.
  • Simon Baron-Cohen. (1995). Mindblindness: An Essay on Autism and Theory of Mind. Cambridge, MA: MIT Press, 1997.
    • Extensive use of “false belief” experiments in zeroing in on diseases of empathy such as autism; empathy (absent) as a form of “mindblindness.”
  • Michael F. Basch. (1983). “Empathic understanding: a review of the concept and some theoretical considerations,” Journal of the American Psychoanalytic Association, Vol. 31, No. 1: 101-126.
    • A foundation for psychoanalysis in affective dynamics (empathy) separate from libido and drives.
  • Peg Birmingham. (2006) Hannah Arendt and Human Rights: The Predicament of Common Responsibility. Bloomington: Indiana University Press.
  • Kaj Björkqvist. (2007). “Empathy, social intelligence and aggression in adolescent boys and girls,” Empathy in Mental Illness, T. Farrow and P. Woodruff, eds. Cambridge UK: Cambridge University Press.
    • Training (induction) in empathy reduces aggression.
  • Michael Boylan. (2008). The Good, the True and the Beautiful. New York: Continuum Books.
    • Uses sympathy to ground an account of the affective good will.
  • Ted Cohen. (2008). Thinking of Others: On the Talent for Metaphor. Princeton, NJ: Princeton University Press, 2008.
    • The capacity for metaphor (arguably) underlies the capacity for empathy (that is, thinking of others).
  • Stephen Darwall. (2006). The Second-Person Standpoint: Morality, Respect and Accountability. Cambridge, MA: Harvard University Press.
    • A linguistic-grammatically informed version of the struggle for recognition (without endorsing Hegel) that you should not hurt me.
  • Stephen Darwall. (1995) The British Moralists and the Internal ‘Ought’: 1640 – 1740. Cambridge, UK: Cambridge University Press.
    • A monumental study, arguing (among other things) that Hume goes beyond sympathy (and moral sentimentalism) to rule-regulation.
  • Charles Darwin. (1872). The Expression of the Emotions in Man and Animals. Chicago: University of Chicago Press, 1965.
    • Basic emotions are continuous, arguably due to shared physiological (evolutionary) infrastructure, between man and animals.
  • Frans de Waal. (2009). The Age of Empathy. New York: Harmony Press.
    • Empathy with higher mammals breaks through to the general reading market.
  • J. Decety and T. Chaminade. (2003). “When the self represents the other: A new cognitive neuroscience view on psychological identification,” Consciousness and Cognition 12 (2003).
    • Has a fMRI machine and knows how to use it to study empathy.
  • J. Decety & P.L. Jackson. (2004). “The functional architecture of human empathy,” Behavioral and Cognitive Neuroscience Reviews, Vol 3, No. 2, June 2004, 71-100.
    • One of the most robust definitions of empathy in the literature.
  • J. Decety & C. Lamm. (2006). “Human empathy through the lens of social neuroscience,” The ScientificWorld Journal 6 (2006), 1146-1163.
    • Just as the title says.
  • M. A. Diego and N. A. Jones. (2007). “Neonatal antecedents for empathy,” Empathy in Mental Illness, T. Farrow and P. Woodruff, eds. Cambridge UK: Cambridge University Press.
    • Without empathy, the infant is damaged emotionally and behaviorally, resulting in autistic- and psychotic-like symptoms.
  • M. A. Diego and N. A. Jones. (2003). Emotions Revealed: Recognizing Faces and Feelings to Improve Communication and Emotional Life. New York: Henry Holt.
    • Same as Ekman below.
  • Paul Ekman. (1985). Telling Lies: Clues to Deceit in the Marketplace, Politics, and Marriage. New York, W.W. Norton.
    • Develops the idea of micro-expressions betraying otherwise hidden affects, which are relevant inputs to further empathic processing (the latter not discussed by Ekman). Should be read along with Hume.
  • T. Farrow and P. Woodruff, eds. (2007). Empathy in Mental Illness. Cambridge UK: Cambridge University Press.
    • Diseases of (absent) empathy such as autism, psycho-pathy, hospitalism; the neurological infrastructure in mirror neurons, extending to philosophy.
  • S. Freud. (date unknown). The Standard Edition of Freud’s Works, tr. under the supervision of James Strachey, 24 volumes. London: Hogard Press, 1955-64.
  • S. Freud. (1909). Jokes and their Relation to the Unconscious, tr. J. Strachey. New York: W. W. Norton, 1960.
    • Most of the occurrences of Einfühlung” [empathy] in Freud occur in this work, which explicitly references Lipps, who Freud owned and marked.
  • Michael N. Forster. (2010). After Herder: Philosophy of Language in the German Tradition. Oxford: Oxford University Press.
    • Herder applies “Einfühlung” [empathy] to understanding difficult texts and interpretations, obtaining clear priority in publication over Lipps and other users of the concept.
  • Vittorio Gallese. (2007). “The shared manifold hypothesis: Embodied simulation and its role in empathy and social cognition,” Empathy in Mental Illness, eds. Tom Farrow and Peter Woodruff, Cambridge, UK: Cambridge University Press.
    • Argues for a shared (empathic) manifold based on mirror neurons.
  • Arnold Goldberg. (1999). Being of two Minds: The Vertical Split in Psychoanalysis and Psychotherapy. Hillsdale, NJ: The Analytic Press.
    • Examples of empathy based on immersion in listening to the other.
  • Pat Greenspan. (1980). “A case of mixed feelings: ambivalence and the logic of emotion,” Explaining Emotions, ed., A. O. Rorty. Berkeley: University of California Press.
    • Ambivalent feelings happen, requiring revising our understanding of consistency and rationality.
  • Ralph R. Greenson. (1960). “Empathy and its vicissitudes,” International Journal of Psychoanalysis 41 (1960): 418-24.
    • Empathy as building a model of the other and using it to capture the other.
  • H. Hartmann. (1959). “Psychoanalysis as a scientific theory.” Psychoanalysis, Scientific Method, and Philosophy: A Symposium, ed. S. Hook. New York: New York University Press, 1964: 3-37.
    • This is the philosophy of science being debated at the time that Heinz Kohut was writing his first empathy article (Kohut 1959).
  • Lawrence J. Hatab. (2000). Ethics and Finitude: Heideggerian Contributions to Moral Philosophy. Lanham, MD: Rowman and Littlefield.
    • Contains a chapter on empathy and engages empathy on the critical path of the existential foundation of ethics.
  • Elaine Hatfield, John T. Cacioppo, Richard L. Rapson . (1994). Emotional Contagion. Cambridge, UK: Cambridge University Press.
    • Emotional contagion is input to affective processing by empathy (the latter not otherwise engaged).
  • Martin Heidegger. (1927). Sein und Zeit. Tübingen: Max Niemeyer, 1972.
    • See below.
  • Martin Heidegger. (1927). Being and Time, tr. J. Stambaugh. Albany, NY: State University of New York Press, 1996.
  • Martin Heidegger. (1927). Being and Time, trs. J. Macquarrie and E. Robinson. New York: Harper and Row, 1962.
    • Two references to “a special hermeneutic of empathy [Einfühlung]” as part of engagement (and dismissal) of Scheler and Stein; a significant indirect contribution to empathy.
  • Johann Gottfried von Herder. (1772/1792). Philosophical Writings, tr. and ed. M. N. Forster. Cambridge, UK: Cambridge University Press, 2002.
    • Applies Einfühlung” [empathy] to understanding difficult texts and interpretations, obtaining clear priority in publication over Lipps.
  • Martin L. Hoffman. (2000). Empathy and Moral Development: Implications for Caring and Justice. Cambridge, UK: Cambridge University Press, 2001.
    • Rich engagement with moral issues, distinguishing empathic distress and sympathetic distress.
  • David Hume. (1739). A Treatise of Human Nature, ed. L. A. Selby-Bigge (Oxford: Clarendon Press, 1973).
    • Above “SBN” refers to the Selby-Bigge edition and “T” refers to the Chapter, section, and paragraph in the Clarendon edition text. Many meanings of “sympathy” as engaged herein.
  • David Hume. (1739/1904). Ein Traktat über die menschliche Natur in 2 Bänden, tr. Theodor Lipps. Hamburg: Felix Meiner Verlag, 1904/06.
    • The point where Lipps (1903) was enlightened about the relevance of empathy to taste and beauty.
  • David Hume, “Of the delicacy of taste and passion” (1741) in Of the Standard of Taste and Other Essays, Indianapolis: Bobbs-Merrill: 1965.
    • Leaves a logical space for a “delicacy of sympathy” open and undeveloped.
  • David Hume, “Of the standard of taste” (1757) in Of the Standard of Taste and Other Essays, Indianapolis: Bobbs-Merrill: 1965.
    • The standard of taste perceives a micro impression that the ordinary person does not perceive, leaving a logical space open for an undeveloped delicacy of sympathy (that is, delicacy of empathy).
  • Edmund Husserl. (1929/35). Zur Phänomenologie der Intersubjectivität: Texte aus dem Nachlass: Dritter Teil: 1929-1935, ed. I. Kern. Den Haag: Martinus Nijhoff, 1973. Husserliana XV.
    • Dozens of references to Einfühlung [empathy] as it migrates from the periphery and superstructure of intersubjectivity to the foundation of community.
  • M. Iacoboni. (2005). “Understanding others: Imitation, language, and empathy,” Perspectives on Imitation: From Neuroscience to Social Science, S. Hurly and N. Chater, eds. Vol. 1 (76-100). Cambridge, MA: MIT Press.
    • Seeking a philosophical description of how mirror neurons bind us together in empathy
  • M. Iacoboni. (2007). “Existential empathy: the intimacy of self and other,” Empathy in Mental Illness, eds. Tom Farrow and Peter Woodruff, Cambridge, UK: Cambridge University Press.
  • Philip L. Jackson, Andrew N. Meltzoff, and Jean Decety. (2005). “How do we perceive the pain of others?
    • A window into the neural processes involved in empathy,” Neuroimage 24 (2005).
  • Hans H. Kögler and Karsten R. Stueber, eds. (2000). Empathy and Agency: The Problem of Understanding in the Human Sciences. Boulder, CO: Westview Press.
    • Emphasizes Verstehen.
  • Hans H. Kögler. (2000). “Empathy, dialogical self, and reflexive interpretation: The symbolic source of simulation,” Empathy and Agency: The Problem of Understanding in the Human Sciences, Hans H. Kögler and Karsten R. Stueber, eds. Boulder, CO: Westview Press, 2000.
    • As the title says.
  • Heinz Kohut. (1959). “Introspection, empathy, and psychoanalysis,” The Journal of the American Psychoanalytic Association 7: 459-83.
    • Empathy (“vicarious introspection”) as a data gathering method, defining psychoanalysis.
  • Heinz Kohut. (1966). “Forms and transformations of narcissism.” Journal of the American Psychoanalytic Association 14: 243-272.
    • Empathy is related to humor, appreciation of art, and wisdom, enhanced in working through the self (narcissism).
  • Heinz Kohut. (1971). The Analysis of the Self. New York: International Universities Press.
    • Identifies two new forms of transference and empathy in each in the context of the self.
  • Heinz Kohut. (1977). The Restoration of the Self. New York: International Universities Press.
    • Arguably the book that Kohut was trying to write in 1971, exploring the role of empathy as the oxygen in which the well-being of the self flourishes.
  • Heinz Kohut. (1984). How Does Analysis Cure? eds. A. Goldberg and P. E. Stepansky. Chicago: University of Chicago Press.
    • The short answer is “empathy”; the longer answer is “phase appropriate (non traumatic) failures of empathy that get worked through in therapy.”
  • Emmanuel Levinas. (1961). Totality and Infinity: An Essay on Exteriority, tr. A. Lingis. Pittsburgh, PA: Dusquesne University Press, 2007.
    • Provides empathy against ethics with so much to say about The Other; so little, about empathy, which latter falls on the side of totality, not infinity.
  • H. G. Liddell and R. Scott. (1940). A Greek-English Lexicon.
    • Revised and augmented throughout by Sir Henry Stuart Jones with the assistance of Roderick McKenzie. Oxford. Clarendon Press. 1940.
  • T. Lipps. (1903). Aesthetik. Hamburg: Leopold Voss, 1903.
    • Einfühlung” [empathy] is engaged as the basis of the experience of beauty.
  • T. Lipps . (1909). Leitfaden der Psychologie. Leipzig: Wilhelm Engelmann Verlag, 1909.
    • “Einfühlung” [empathy] is engaged as the basis of our experience of other minds [fremden Seelen Lebens].
  • Bonnie E. Litowitz. (2007). “The second person,” Journal of the American Psychoanalytic Association, 57: 1129.
    • Distinguishes between the dialogical and dyadic contexts in which empathy flourishes.
  • Rudolf Makkreel. (1975). Dilthey: Philosopher of the Human Science. Princeton, NJ: Princeton University Press.
    • Embraces re-experiencing (nacherleben) and Verstehen rather than empathy.
  • E. Nagel. (1959). “Methodological issues in psychoanalytic theory.” Psychoanalysis, Scientific Method, and Philosophy: A Symposium, ed. S. Hook. New York: New York University Press, 1964: 38-56.
    • The philosophy of science being debated at the time that Heinz Kohut was writing his first empathy article (Kohut 1959).
  • Thomas Nagel. (1970). The Possibility of Altruism. Princeton, NJ: Princeton Paperbacks, 1978.
    • Arguably, empathy implements altruism, which is (still) possible.
  • Thomas Nagel. (1974). “On What It’s Like to Be a bat,” The Mind’s I, eds. D. R. Hofstadter & D. C. Dennett. New York: Bantam Books, 1981.
    • Empathy pushed into a footnote.
  • Friedrich Nietzsche. (1887). On the Genealogy of Morals, tr. W. Kaufmann & R. J. Hollingdale. New York: Vintage/Random House, 1969.
    • Uses empathy (but not the word) to inform his sense of smell and suspicion.
  • Frederick A. Olafson. (1998). Heidegger and the Ground of Ethics: A Study of Mitsein, Cambridge, UK: Cambridge University Press.
  • Christine Olden. (1956). “On empathy with children,” The Psychoanalytic Study of the Child 8 (1956: 111-26).
  • Eric Partridge. (1966). Origins: A Short Etymological Dictionary of Modern English, 4th Edition. New York: Macmillan, 1977.
  • Adriaan Peperzak. (1997). Beyond: The Philosophy of Emmanuel Levinas. Evanston: Northwestern University Press.
    • An introduction with so much to say about The Other and so little about empathy. The latter falls on the side of totality, not infinity; it is empathy against ethics.
  • John Rawls. (1971). A Theory of Justice. Cambridge, MA: Harvard University Press.
    • Engages “sympathy” as part of his analysis of altruism.
  • John Riker. (2010). Why It’s Good to be Good. New York: Jason Aronson Press.
    • Self psychology (Kohut 1977, 1984), with its focus on empathy and restoring integrity to the self, addresses ethical issues such as rampant cheating, addiction, selfishness, and (unethical) narcissism.
  • Max Scheler. (1913). Zur Phänomenologie und Theorie der Sympathiegefühle in Scheler’s Späte Schriften in Gesammelte Werke, ed. Maria Scheler and Manfred Frings. Vol. 9, Bern: Francke Verlag 1976.
  • Max Scheler. (1913/22).  The Nature of Sympathy, tr. Peter Heath. Hamden: CN: Archon Books, 1970.
    • An insightful analysis of the distinction between vicarious feeling, shared feeling, and projective empathy.
  • Michael Slote. (2007). The Ethics of Care and Empathy. London: Routledge.
    • Empathic caring as a moral criterion and the moral aspects of the ethics of care.
  • Michael Slote. (2010). Moral Sentimentalism. Oxford: Oxford University Press.
    • A rich engagement in which empathy provides an intelligible mechanism for moral approval and disapproval, lending philosophic rigor to the mere metaphor of moral sense (and sentimentalism).
  • Adam Smith. (1759). The Theory of the Moral Sentiments. Indianapolis: Liberty Classics 1969.
    • Sympathy recruits the imagination and fellow feeling to align with a sense of (dis)approbation, defining the limits of the human (ethical) community.
  • R. A. Spitz. (1946). “Hospitalism: a follow up report.” The Psychoanalytic Study of the Child, 2: 113-117.
    • First researcher to document that without empathy (affectionate care taking), institutionalized (hospitalized) infants sustain serious emotional, behavioral damage, simulating autism and psychosis; provided significant input to B. Bettelheim.
  • Edith Stein. (1917). On the Problem of Empathy, tr. Waltraut Stein. The Hague: Martinus Nijhoff, 1970.
    • Thoroughly debunks Lipps and (arguably) taught Husserl everything he knew about empathy in Ideas II; yet fails to ‘surface’ a deep analysis of the underlying intentionality of the other in relation to the act of empathy.
  • Karsten R. Stueber. (2006). Rediscovering Empathy. Cambridge, MA: MIT Press.
    • Stueber endorses the approach of the philosopher Donald Davidson, and, if the latter had engaged empathy, he might have developed an argument similar to this one.
  • Edward B. Titchner. (1909). Lectures on the Experimental Psychology of the Thought-Processes. New York: Macmillan.
    • First translation into English of “Einfühlung” as “empathy.”
  • H. Trosman and R. Simmons. (1972). “The Freud Library,” Journal of the American Psychoanalytic Association, 21 (1973): 646-87.
    • Tracks the two dozen or so references to “Einfühlung” in Freud.
  • J.D. Trout.(2009). The Empathy Gap. New York: Viking Press.
    • Empathy falls short of reason; and reason falls short of empathy.
  • Desmond Tutu. (1999). No Future Without Forgiveness. New York: Doubleday.
  • Lauren Wispé. (1987). “History of the concept of empathy,” Empathy and its Development, N. Eisenberg & J. Strayer, eds. Cambridge: Cambridge University Press.
    • A collection of quotations.
  • Lauren Wispé. (1991). The Psychology of Sympathy. New York: Plenum Press.
    • A collection of quotations; we now know who said what and when they said it.
  • Dan Zahavi. (2005). Subjectivity and Selfhood: Investigating the First-Person Perspective. Cambridge, UK: Bradfordbook/MIT Press.
    • Lively engagement with empathy, narrative, Heidegger, Husserl, Ricoeur, and Sartre.

Author Information

Lou Agosta
Email: LAgosta@UChicago.edu
University of Chicago
U. S. A.

Care Ethics

The moral theory known as “ the ethics of care” implies that there is moral significance in the fundamental elements of relationships and dependencies in human life. Normatively, care ethics seeks to maintain relationships by contextualizing and promoting the well-being of care-givers and care-receivers in a network of social relations. Most often defined as a practice or virtue rather than a theory as such, “care” involves maintaining the world of, and meeting the needs of, ourself and others. It builds on the motivation to care for those who are dependent and vulnerable, and it is inspired by both memories of being cared for and the idealizations of self. Following in the sentimentalist tradition of moral theory, care ethics affirms the importance of caring motivation, emotion and the body in moral deliberation, as well as reasoning from particulars. One of the original works of care ethics was Milton Mayeroff’s short book, On Caring, but the emergence of care ethics as a distinct moral theory is most often attributed to the works of psychologist Carol Gilligan and philosopher Nel Noddings in the mid-1980s. Both charged traditional moral approaches with male bias, and asserted the “voice of care” as a legitimate alternative to the “justice perspective” of liberal human rights theory. Annette Baier, Virginia Held, Eva Feder Kittay, Sara Ruddick, and Joan Tronto are some of the most influential among many subsequent contributors to care ethics.

Typically contrasted with deontological/Kantian and consequentialist/utilitarian ethics, care ethics is found to have affinities with moral perspectives such as African ethics, Confucian ethics, and others. Critics fault care ethics with being a kind of slave morality, and as having serious shortcomings including essentialism, parochialism, and ambiguity. Although care ethics is not synonymous with feminist ethics, much has been written about care ethics as a feminine and feminist ethic, in relation to motherhood, international relations, and political theory. Care ethics is widely applied to a number of moral issues and ethical fields, including caring for animals and the environment, bioethics, and more recently public policy. Originally conceived as most appropriate to the private and intimate spheres of life, care ethics has branched out as a political theory and social movement aimed at broader understanding of, and public support for, care-giving activities in their breadth and variety.

Table of Contents

  1. History and Major Authors
    1. Carol Gilligan
    2. Nel Noddings
    3. Other Influential authors
      1. Annette Baier
      2. Virginia Held
      3. Eva Feder Kittay
      4. Sara Ruddick
      5. Joan Tronto
  2. Definitions of Care
  3. Criticisms
    1. Care Ethics as a Slave Morality
    2. Care Ethics as Empirically Flawed
    3. Care Ethics as Theoretically Indistinct
    4. Care Ethics as Parochial
    5. Care Ethics as Essentialist
    6. Care Ethics as Ambiguous
  4. Feminine and Feminist Ethics
  5. Relation to Other Theories
  6. Maternalism
  7. International Relations
  8. Political Theory
  9. Caring for Animals
  10. Applied Care Ethics
  11. Care Movements
  12. References and Further Reading

1. History and Major Authors

a. Carol Gilligan

While early strains of care ethics can be detected in the writings of feminist philosophers such as Mary Wollstonecraft, Catherine and Harriet Beecher, and Charlotte Perkins, it was first most explicitly articulated by Carol Gilligan and Nel Noddings in the early 1980s. While a graduate student at Harvard, Gilligan wrote her dissertation outlining a different path of moral development than the one described by Lawrence Kohlberg, her mentor. Kohlberg had posited that moral development progressively moves toward more universalized and principled thinking and had also found that girls, when later included in his studies, scored significantly lower than boys. Gilligan faulted Kohlberg’s model of moral development for being gender biased, and reported hearing a “different voice” than the voice of justice presumed in Kohlberg’s model. She found that both men and women articulated the voice of care at different times, but noted that the voice of care, without women, would nearly fall out of their studies. Refuting the charge that the moral reasoning of girls and women is immature because of its preoccupation with immediate relations, Gilligan asserted that the “care perspective” was an alternative, but equally legitimate form of moral reasoning obscured by masculine liberal justice traditions focused on autonomy and independence. She characterized this difference as one of theme, however, rather than of gender.

Gilligan articulated these thematic perspectives through the moral reasoning of “Jake” and “Amy”, two children in Kohlberg’s studies responding to the “Heinz dilemma”. In this dilemma, the children are asked whether a man, “Heinz”, should have stolen an overpriced drug to save the life of his ill wife. Jake sees the Heinz dilemma as a math problem with people wherein the right to life trumps the right to property, such that all people would reasonably judge that Heinz ought to steal the drug. Amy, on the other hand, disagrees that Heinz should steal the drug, lest he should go to prison and leave his wife in another predicament. She sees the dilemma as a narrative of relations over time, involving fractured relationships that must be mended through communication. Understanding the world as populated with networks of relationships rather than people standing alone, Amy is confident that the druggist would be willing to work with Heinz once the situation was explained. Gilligan posited that men and women often speak different languages that they think are the same, and she sought to correct the tendency to take the male perspective as the prototype for humanity in moral reasoning.

Later, Gilligan vigorously resisted readings of her work that posit care ethics as relating to gender more than theme, and even established the harmony of care and justice ethics (1986), but she never fully abandoned her thesis of an association between women and relational ethics. She further developed the idea of two distinct moral “voices”, and their relationship to gender in Mapping the Moral Domain:  A Contribution of Women’s Thinking to Psychological Theory and Education (Gilligan, Ward, and Taylor, 1988), a collection of essays that traced the predominance of the “justice perspective” within the fields of psychology and education, and the implications of the excluded “care perspective”. In Making Connections:  The Relational Worlds of Adolescent Girls at Emma Willard School, Gilligan and her co-editors argued that the time between the ages of eleven and sixteen is crucial to girls’ formation of identity, being the time when girls learn to silence their inner moral intuitions in favor of more rule bound interpretations of moral reasoning (Gilligan, Lyons, and Hamner, 1990, 3). Gilligan found that in adulthood women are encouraged to resolve the crises of adolescence by excluding themselves or others, that is, by being good/responsive, or by being selfish/independent. As a result, women’s adolescent voices of resistance become silent, and they experience a dislocation of self, mind, and body, which may be reflected in eating disorders, low leadership aspiration, and self-effacing sexual choices. Gilligan also expanded her ideas in a number of articles and reports (Gilligan, 1979; 1980; 1982; 1987).

b. Nel Noddings

In 1984 Noddings published Caring, in which she developed the idea of care as a feminine ethic, and applied it to the practice of moral education. Starting from the presumption that women “enter the practical domain of moral action…through a different door”, she ascribed to feminine ethics a preference for face-to face moral deliberation that occurs in real time, and appreciation of the uniqueness of each caring relationship. Drawing conceptually from a maternal perspective, Noddings understood caring relationships to be basic to human existence and consciousness. She identified two parties in a caring relationship—“one-caring” and the “cared-for”—and affirmed that both parties have some form of obligation to care reciprocally and meet the other morally, although not in the same manner. She characterized caring as an act of “engrossment” whereby the one-caring receives the cared-for on their own terms, resisting projection of the self onto the cared-for, and displacing selfish motives in order to act on the behalf of the cared-for. Noddings located the origin of ethical action in two motives, the human affective response that is a natural caring sentiment, and the memory of being cared-for that gives rise to an ideal self. Noddings rejected universal principles for prescribed action and judgment, arguing that care must always be contextually applied.

Noddings identified two stages of caring, “caring-for” and “caring-about”. The former stage refers to actual hands-on application of caring services, and the latter to a state of being whereby one nurtures caring ideas or intentions. She further argued that the scope of caring obligation is limited. This scope of caring is  strongest towards others who are capable of reciprocal relationship. The caring obligation is conceived of as moving outward in concentric circles so enlarged care is increasingly characterized by a diminished ability for particularity and contextual judgment, which prompted Noddings to speculate that it is impossible to care-for everyone. She maintained that while the one-caring has an obligation to care-for proximate humans and animals to the extent that they are needy and able to respond to offerings of care, there is a lesser obligation to care for distant others if there is no hope that care will be completed. These claims proved to be highly controversial, and Noddings later revised them somewhat. In her more recent book Starting From Home, Noddings endorsed a stronger obligation to care about distant humans, and affirms caring-about as an important motivational stage for inspiring local and global justice, but continued to hold that it is impossible to care-for all, especially distant others. (See 3a.iv below)

c. Other Influential authors

Although many philosophers have developed care ethics, five authors are especially notable.

i. Annette Baier

Annette Baier observes certain affinities between care ethics and the moral theory of David Hume, whom she dubs the “women’s moral theorist.” Baier suggests both deny that morality consists in obedience to a universal law, emphasizing rather the importance of cultivating virtuous sentimental character traits, including gentleness, agreeability, compassion, sympathy, and good temperedness (1987, 42). Baier specially underscores trust, a basic relation between particular persons, as the fundamental concept of morality, and notes its obfuscation within theories premised on abstract and autonomous agents. She recommends carving out room for the development of moral emotions and harmonizing the ideals of care and justice.

ii. Virginia Held

Virginia Held is the editor and author of many books pertaining to care ethics. In much of her work she seeks to move beyond ideals of liberal justice, arguing that they are not as much flawed as limited, and examines how social relations might be different when modeled after mothering persons and children. Premised on a fundamental non-contractual human need for care, Held construes care as the most basic moral value. In Feminist Morality (1993), Held explores the transformative power of creating new kinds of social persons, and the potentially distinct culture and politics of a society that sees as “its most important task the flourishing of children and the creation of human relationships”. She describes feminist ethics as committed to actual experience, with an emphasis on reason and emotion, literal rather than hypothetical persons, embodiment, actual dialogue, and contextual, lived methodologies. In The Ethics of Care (2006), Held demonstrates the relevance of care ethics to political, social and global questions. Conceptualizing care as a cluster of practices and values, she describes a caring person as one who has appropriate motivations to care for others and who participates adeptly in effective caring practices. She argues for limiting both market provisions for care and the need for legalistic thinking in ethics, asserting that care ethics has superior resources for dealing with the power and violence that imbues all relations, including those on the global level. Specifically, she recommends a view of a globally interdependent civil society increasingly dependent upon an array of caring NGOs for solving problems. She notes: “The small societies of family and friendship embedded in larger societies are formed by caring relations… A globalization of caring relations would help enable people of different states and cultures to live in peace, to respect each others’ rights, to care together for their environments, and to improve the lives of their children”(168). Ultimately, she argues that rights based moral theories presume a background of social connection, and that when fore-grounded, care ethics can help to create communities that promote healthy social relations, rather than the near boundless pursuit of self-interest.

iii. Eva Feder Kittay

Eva Feder Kittay is another prominent care ethicist. Her book, Women and Moral Theory (1987), co-edited with Diana T. Meyers, is one the most significant anthologies in care ethics to date. In  this work they map conceptual territory inspired by Gilligan’s work, both critically and supportively, by exploring major philosophical themes such as self and autonomy, ethical principles and universality, feminist moral theory, and women and politics. In Love’s Labor (1999), Kittay develops a dependency based account of equality rooted in the activity of caring for the seriously disabled. Kittay holds that the principles in egalitarian theories of justice, such as  those of John Rawls, depend upon more fundamental principles and practices of care, and that without supplementation such theories undermine themselves (108). Kittay observes that in practice some women have been able to leave behind traditional care-giving roles only because other women have filled them, but she resists the essentialist association between women and care by speaking of “dependency workers” and “dependency relations”. She argues that equality for dependency workers and the unavoidably dependent will only be achieved through conceptual and institutional reform. Employing expanded ideals of fairness and reciprocity that take interdependence as basic, Kittay poses a third principle for Rawls’ theory of justice: “To each according to his or her need, from each to his or her capacity for care, and such support from social institutions as to make available resources and opportunities to those providing care” (113). She more precisely calls for the public provision of Doulas, paid professional care-workers who care for care-givers, and uses the principle of Doula to justify welfare for all care-givers, akin to worker’s compensation or unemployment benefits.

iv. Sara Ruddick

Held identifies Sara Ruddick as the original pioneer of the theory of care ethics, citing Ruddick’s 1980 article “Maternal Thinking” as the first articulation of a distinctly feminine approach to ethics. In this article, and in her later book of the same title (1989), Ruddick uses care ethical methodology to theorize from the lived experience of mothering, rendering a unique approach to moral reasoning and a ground for a feminist politics of peace. Ruddick explains how the practices of “maternal persons” (who may be men or women), exhibit cognitive capacities or conceptions of virtue with larger moral relevance. Ruddick’s analysis, which forges strong associations between care ethics and motherhood, has been both well-received and controversial (see Section 6, below).

v. Joan Tronto

Joan Tronto is most known for exploring the intersections of care ethics, feminist theory, and political science. She sanctions a feminist care ethic designed to thwart the accretion of power to the existing powerful, and to increase value for activities that legitimize shared power. She identifies moral boundaries that have served to privatize the implications of care ethics, and highlights the political dynamics of care relations which describe, for example, the tendency of women and other minorities to perform care work in ways that benefit the social elite. She expands the phases of care to include “caring about”, “taking care of” (assuming responsibility for care), “care-giving” (the direct meeting of need), and “care-receiving”. She coins the phrase “privileged irresponsibility” to describe the phenomenon that allows the most advantaged in society to purchase caring services, delegate the work of care-giving, and avoid responsibility for the adequacy of hands-on care. (See Sections 2 and 8 below).

2. Definitions of Care

Because it depends upon contextual considerations, care is notoriously difficult to define. As Ruddick points out, at least three distinct but overlapping meanings of care have emerged in recent decades—an ethic defined in opposition to justice, a kind of labor, and a particular relationship (1998, 4). However, in care ethical literature, ‘care’ is most often defined as a practice, value, disposition, or virtue, and is frequently portrayed as an overlapping set of concepts. For example, Held notes that care is a form of labor, but also an ideal that guides normative judgment and action, and she characterizes care as “clusters” of practices and values (2006, 36, 40). One of the most popular definitions of care, offered by Tronto and Bernice Fischer, construes care as “a species of activity that includes everything we do to maintain, contain, and repair our ‘world’ so that we can live in it as well as possible. That world includes our bodies, ourselves, and our environment”. This definition posits care fundamentally as a practice, but Tronto further identifies four sub-elements of care that can be understood simultaneously as stages, virtuous dispositions, or goals. These sub-elements are: (1) attentiveness, a proclivity to become aware of need; (2) responsibility, a willingness to respond and take care of need; (3) competence, the skill of providing good and successful care; and (4) responsiveness, consideration of the position of others as they see it and recognition of the potential for abuse in care (1994, 126-136). Tronto’s definition is praised for how it admits to cultural variation and extends care beyond family and domestic spheres, but it is also criticized for being overly broad, counting nearly every human activity as care.

Other definitions of care provide more precise delineations. Diemut Bubeck narrows the definitional scope of care by emphasizing personal interaction and dependency. She describes care as an emotional state, activity, or both, that is functional, and specifically involves “the meeting of needs of one person by another where face-to-face interaction between care and cared for is a crucial element of overall activity, and where the need is of such a nature that it cannot possibly be met by the person in need herself” (129). Bubeck thus distinguishes care from “service”, by stipulating that “care” involves meeting the needs for others who cannot meet their needs themselves, whereas “service” involves meeting the needs of individuals who are capable of self-care. She also holds that one cannot care for oneself, and that care does not require any emotional attachment. While some care ethicists accept that care need not always have an emotional component, Bubeck’s definitional exclusion of self-care is rejected by other care ethicists who stress additional aspects of care.

For example, both Maurice Hamington and Daniel Engster make room for self-care in their definitions of care, but focus more precisely on special bodily features and end goals of care (Hamington, 2004; Engster, 2007). Hamington focuses on embodiment, stating that: “care denotes an approach to personal and social morality that shifts ethical considerations to context, relationships, and affective knowledge in a manner that can only be fully understood if care’s embodied dimension is recognized. Care is committed to flourishing and growth of individuals, yet acknowledges our interconnectedness and interdependence” (2004, 3). Engster develops a “basic needs” approach to care, defining care as a practice that includes “everything we do to help individuals to meet their vital biological needs, develop or maintain their basic capabilities, and avoid or alleviate unnecessary or unwanted pain and suffering, so that they can survive, develop, and function in society” (2007, 28). Although care is often unpaid, interpersonal, and emotional work, Engster’s definition does not exclude paid work or self-care, nor require the presence of affection or other emotion (32). Although these definitions emphasize care as a practice, not all moral theorists maintain this view of.

Alternatively, care is understood as a virtue or motive. James Rachels, Raja Halwani, and Margaret McLaren have argued for categorizing care ethics as a species of virtue ethics, with care as a central virtue (Rachels, 1999; McLaren, 2001; Halwani, 2003). The idea that that care is best understood as virtuous motives or communicative skills is endorsed by Michael Slote who equates care with a kind of motivational attitude of empathy, and by Selma Sevenhuijsen, who defines care as “styles of situated moral reasoning” that involves listening and responding to others on their own terms.” (Slote, 2007; Sevenhuijsen, 1998, 85).

Some ethicists prefer to understand care as a practice more fundamental than a virtue or motive because doing so resists the tendency to romanticize care as a sentiment or dispositional trait, and reveals the breadth of caring activities as globally intertwined with virtually all aspects of life. As feminist ethicists, Kittay and Held like to understand care as a practice and value rather than as a virtue because it risks “losing site of it as work” (Held, 2006, 35). Held refutes that care is best understood as a disposition such as compassion or benevolence, but defines “care” as “more a characterization of a social relation than the description of an individual disposition.”

Overall, care continues to be an essentially contested concept, containing ambiguities that Peta Bowden, finds advantageous, revealing  “the complexity and diversity of the ethical possibilities of care”(1997, 183).

3. Criticisms

A number of criticisms have been launched against care ethics, including that it is: a) a slave morality; b) empirically flawed; c) theoretically indistinct; d) parochial, e) essentialist, and f) ambiguous.

a. Care Ethics as a Slave Morality

One of the earliest objections was that care ethics is a kind of slave morality valorizing the oppression of women (Puka, 1990; Card, 1990; Davion, 1993). The concept of slave morality comes from the philosopher Frederick Nietzsche, who held that oppressed peoples tend to develop moral theories that reaffirm subservient traits as virtues. Following this tradition, the charge that care ethics is a slave morality interprets the different voice of care as emerging from patriarchal traditions characterized by rigidly enforced sexual divisions of labor. This critique issues caution against uncritically valorizing caring practices and inclinations because women who predominantly perform the work of care often do so to their own economic and political disadvantage. To the extent that care ethics encourages care without further inquiring as to who is caring for whom, and whether these relationships are just, it provides an unsatisfactory base for a fully libratory ethic. This objection further implies that the voice of care may not be an authentic or empowering expression, but a product of false consciousness that equates moral maturity with self-sacrifice and self-effacement.

b. Care Ethics as Empirically Flawed

Critics also question the empirical accuracy and validity of Gilligan’s studies. Gilligan has been faulted for basing her conclusions on too narrow a sample, and for drawing from overly homogenous groups such as students at elite colleges and women considering abortion (thereby excluding women who would not view abortion as morally permissible). It is argued that wider samples yield more diverse results and complicate  the picture of dual and gendered moral perspectives (Haan, 1976; Brabeck, 1983). For instance, Vanessa Siddle Walker and John Snarey surmise that resolution of the Heinz dilemma shifts if Heinz is identified as Black, because in the United States African-American males are disproportionately likely to be arrested for crime, and less likely to have their cases dismissed without stringent penalties (Walker and Snarey, 2004). Sandra Harding observes certain similarities between care ethics and African moralities, noting that care ethics has affinities with many other moral traditions (Harding, 1987). Sarah Lucia Hoagland identifies care as the heart of lesbian connection, but also cautions against the dangers of assuming that all care relations are ideally maternalistic (Hoagland, 1988). Thus, even if some women identify with care ethics, it is unclear whether this is a general quality of women, whether moral development is distinctly and dualistically gendered, and whether the voice of care is the only alternative moral voice. However, authors like Marilyn Friedman maintain that even if it cannot be shown that care is a distinctly female moral orientation, it is plausibly understood as a symbolically feminine approach (Friedman, 1987).

c. Care Ethics as Theoretically Indistinct

Along similar lines some critics object that care ethics is not a highly distinct moral theory, and that it rightly incorporates liberal concepts such as autonomy, equality, and justice. Some defenders of utilitarianism and deontology argue that the concerns highlighted by care ethics have been, or could be, readily addressed by existing theories (Nagl-Docekal, 1997; Ma, 2002). Others suggest that care ethics merely reduces to virtue ethics with care being one of many virtues (Rachels, 1999; Slote, 1998a; 1998b; McLaren, 2001, Halwani, 2003). Although a number of care ethicists explore the possible overlap between care ethics and other moral theories, the distinctiveness of the ethic is defended by some current advocates of care ethics, who contend that the focus on social power, identity, relationship, and interdependency are unique aspects of the theory (Sander-Staudt, 2006). Most care ethicists make room for justice concerns and for critically scrutinizing alternatives amongst justice perspectives. In some cases, care ethicists understand the perspectives of care and justice as mutual supplements to one another. Other theorists underscore the strategic potential for construing care as a right in liberal societies that place a high rhetorical value on human rights. Yet others explore the benefits of integrating care ethics with less liberal traditions of justice, such as Marxism (Bubeck, 1995).

d. Care Ethics as Parochial

Another set of criticisms center around the concern that care ethics obscures larger social dynamics and is overly parochial. These critiques aim at Noddings’ original assertion that care givers have primary obligations to proximate others over distant others (Tronto, 1995, 111-112; Robinson, 1999, 31). Critics worry that this stance privileges elite care-givers by excusing them from attending to significant differences in international standards of living and their causes. Critics also express a concern that without a broader sense of justice, care ethics may allow for cronyism and favoritism toward one’s family and friends (Friedman, 2006; Tronto, 2006). Noddings now affirms an explicit theme of justice in care ethics that resists arbitrary favoritism, and that extends to public and international domains. Yet she upholds the primacy of the domestic sphere as the originator and nurturer of justice, in the sense that the best social policies are identified, modeled, and sustained by practices in the “best families”. Other care ethicists refine Noddings’ claim by emphasizing the practical and moral connections between proximate and distant relations, by affirming a principle of care for the most vulnerable on a global level, and by explicitly weaving a political component into care theory.

e. Care Ethics as Essentialist

The objection that care ethics is essentialist stems from the more general essentialist critique made by Elizabeth Spelman (1988). Following this argument, early versions of care ethics have been faulted for failing to explore the ways in which women (and others) differ from one another, and for thereby offering a uniform picture of moral development that reinforces sex stereotypes (Tronto, 1994). Critics challenge tendencies in care ethics to theorize care based on a dyadic model of a (care-giving) mother and a (care-receiving) child, on the grounds that it overly romanticizes motherhood and does not adequately represent the vast experiences of individuals (Hoagland, 1991). The charge of essentialism in care ethics highlights ways in which women and men are differently implicated in chains of care depending on variables of class, race, age, and more. Essentialism in care ethics is problematic not only because it is conceptually facile, but also because of its political implications for social justice. For example, in the United States women of color and white women are differently situated in terms of who is more likely to give and receive care, and of what degree and quality, because the least paid care workers predominantly continue to be women of color. Likewise, lesbian and heterosexual women are differently situated in being able to claim the benefits and burdens of marriage, and are not equally presumed to be fit as care-givers. Contemporary feminist care ethicists attempt to avoid essentialism by employing several strategies, including: more thoroughly illuminating the practices of care on multiple levels and from various perspectives; situating caring practices in place and time; construing care as the symbolic rather than actual voice of women; exploring the potential of care as a gender neutral activity; and being consistently mindful of perspective and privilege in the activity of moral theorizing.

f. Care Ethics as Ambiguous

Because it eschews abstract principles and decisional procedures, care ethics is often accused of being unduly ambiguous, and for failing to offer concrete guidance for ethical action (Rachels, 1999). Some care ethicists find the non-principled nature of care ethics to be overstated, noting that because a care perspective may eschew some principles does not mean that it eschews all principles entirely (Held, 1995). Principles that could be regarded as central to care ethics might pertain to the origin and basic need of care relations, the evaluation of claims of need, the obligation to care, and the scope of care distribution. On principle, it would seem, a care ethic guides the moral agent to recognize relational interdependency, care for the self and others, cultivate the skills of attention, response, respect, and completion, and maintain just and caring relationships. However, while theorists define care ethics as a theory derived from actual practices, they simultaneously resist subjectivism and moral relativism.

4. Feminine and Feminist Ethics

Because of its association with women, care ethics is often construed as a feminine ethic. Indeed, care ethics, feminine ethics, and feminist ethics are often treated as synonymous. But although they overlap, these are discrete fields in that although care ethics connotes feminine traits, not all feminine and feminist ethics are care ethics, and the necessary connection between care ethics and femininity has been subject to rigorous challenge. The idea that there may be a distinctly woman-oriented, or a feminine approach to ethics, can be traced far back in history. Attempts to legitimate this approach gained momentum in the 18th and 19th centuries, fueled by some suffragettes, who argued that granting voting rights to (white) women would lead to moral social improvements. Central assumptions of feminine ethics are that women are similar enough to share a common perspective, rooted in the biological capacity and expectation of motherhood, and that characteristically feminine traits include compassion, empathy, nurturance, and kindness.

But once it is acknowledged that women are diverse, and that some men exhibit equally strong tendencies to care, it is not readily apparent that care ethics is solely or uniquely feminine. Many women, in actuality and in myth, in both contemporary and past times, do not exhibit care. Other factors of social identity, such as ethnicity and class, have also been found to correlate with care thinking. Nonetheless, care has pervasively been assumed to be a symbolically feminine trait and perspective, and many women resonate with a care perspective. What differentiates feminine and feminist care ethics turns on the extent to which there is critical inquiry into the empirical and symbolic association between women and care, and concern for the power-related implications of this association. Alison Jaggar characterizes a feminist ethic as one which exposes masculine and other biases in moral theory, understands individual actions in the context of social practices, illuminates differences between women, provides guidance for private, public, and international issues, and treats the experiences of women respectfully, but not uncritically (Jaggar, 1991).

While most theorists agree that it is mistaken to view care ethics as a “woman’s morality”, the best way to understand its relation to sex and gender is disputed. Slote develops a strictly gender neutral theory of care on the grounds that care ethics can be traced to the work of male as well as female philosophers. Engster endorses a “minimally feminist theory of care” that is largely gender neutral because he defines care as meeting needs that are more generally human. Although he acknowledges that women are disadvantaged in current caring distributions and are often socialized to value self-effacing care, his theory is feminist only in seeking to assure that the basic needs of women and girls are met and their capabilities developed.

In contrast, Held, Kittay, and Tronto draft more robust overlaps between care and feminist theory, retaining yet challenging the gender-laden associations of care with language like “mothering persons” or “dependency workers”. While cautious of the associations between care and femininity, they find it useful to tap the resources of the lived and embodied experiences of women, a common one which is the capacity to birth children. They tend to define care as a practice partially in order to stay mindful of the ongoing empirical (if misguided) associations between care and women, that must inform utopian visions of care as a gender-neutral activity and virtue. Complicating things further, individuals who are sexed as women may nonetheless gain social privilege when they exhibit certain perceived traits of the male gender, such as being unencumbered and competitive, suggesting that it is potentially as important to revalue feminine traits and activities, as it is to stress the gender-neutral potential of care ethics.

As it currently stands, care ethicists agree that women are positioned differently than men in relation to caring practices, but there is no clear consensus about the best way to theorize sex and gender in care ethics.

5. Relation to Other Theories

Care ethics originally developed as an alternative to the moral theories of Kantian deontology and Utilitarianism consequentialism, but it is thought to have affinities with numerous other moral theories, such as African ethics, David Hume’s sentimentalism, Aristotelian virtue ethics, the phenomenology of Merleau-Ponty, Levinasian ethics, and Confucianism. The most pre-dominant of these comparisons has been between care ethics and virtue ethics, to the extent that care ethics is sometimes categorized as a form of virtue ethics, with care being a central virtue. The identification of caring virtues fuels the tendency to classify care ethics as a virtue ethic, although this system of classification is not universally endorsed.

Some theorists move to integrate care and virtue ethics for strategic reasons. Slote seeks to form an alliance against traditional “masculine” moral theories like Kantianism, utilitarianism, and social contract theory (Slote, 1998). He argues that, in so doing, care ethics receives a way of treating our obligations to people we don’t know, without having to supplement it with more problematic theories of justice. McLaren posits that virtue theory provides a normative framework which care ethics lacks (McLaren, 2001). The perceived flaw in care ethics for both authors is a neglect of justice standards in how care is distributed and practiced, and a relegation of care to the private realm, which exacerbates the isolation and individualization of the burdens of care already prevalent in liberal societies. McLaren contends that virtue theory provides care ethics both with a standard of appropriateness and a normative framework: “The standard of appropriateness is the mean—a virtue is always the mean between two extremes…The normative framework stems from the definition of virtue as that which promotes human flourishing” (2001, 105). Feminist critics, however, resist this assimilation on the grounds that it may dilute the unique focus of care ethics (Held, 2006; Sander-Staudt, 2006). They are optimistic that feminist versions of care ethics can address the above concerns of justice, and doubt that virtue ethics provides the best normative framework.

Similar debates surround the comparison between care ethics and Confucianism. Philosophers note a number of similarities between care ethics and Confucian ethics, not least that both theories are often characterized as virtue ethics (Li, 1994, 2000; Lai Tao, 2000). Additional similarities are that both theories emphasize relationship as fundamental to being, eschew general principles, highlight the parent-child relation as paramount, view moral responses as properly graduated, and identify emotions such as empathy, compassion, and sensitivity as prerequisites for moral response. The most common comparison is between the concepts of care and the Confucian concept of jen/ren. Ren is often translated as love of humanity, or enlargement. Several authors argue that there is enough overlap between the concepts of care and ren to judge that care ethics and Confucian ethics are remarkably similar and compatible systems of thought (Li, 1994; Rosemont, 1997).

However, some philosophers object that it is better to view care ethics as distinct from Confucian ethics, because of their potentially incompatible aspects. Feminist care ethicists charge that a feminist care ethic is not compatible with the way Confucianism subordinates women. Ranjoo Seodu Herr locates the incompatibility as between the Confucian significance of li, or formal standards of ritual, and a feminist care ethics’ resistance to subjugation (2003). For similar reasons, Lijun Yuan doubts that Confucian ethics can ever be acceptable to contemporary feminists, despite its similarity to care ethics. Daniel Star categorizes Confucian ethics as a virtue ethic, and distinguishes virtue ethics and care ethics as involving different biases in moral perception (2002). According to Star, care ethics differs from Confucian ethics in not needing to be bound with any particular tradition, in downgrading the importance of principles (versus merely noting that principles may be revised or suspended), and in rejecting hierarchical, role-based categories of relationship in favor of contextual and particular responses.

There are also refutations of the belief that care ethics is conceptually incompatible with the justice perspectives of Kantian deontology and liberal human rights theory. Care ethicists dispute the inference that because care and justice have evolved as distinct practices and ideals, that they are incompatible. Some deny that Kantianism is as staunchly principled and rationalistic as often portrayed, and affirm that care ethics is compatible with Kantian deontology because it relies upon a universal injunction to care, and requires a principle of caring obligation. An adaptation of the Kantian categorical imperative can be used to ground the obligation to care in the universal necessity of care, and the inconsistency of willing a world without intent to care. Other theorists compare the compatibility between care ethics and concepts of central importance to a Kantian liberal tradition. Thus, Grace Clement argues that an ideal of individual autonomy is required by normative ideals of care, in the sense that care-givers ideally consent to and retain some degree of autonomy in caring relations, and also ideally foster the autonomy of care-receivers (Clement, 1996). Mona Harrington explores the significance of the liberal ideal of equality to care ethics by tracing how women’s inequality is linked to the low social valuing and provision of care work (Harrington, 2000). Other ways that Kantianism is thought to benefit care ethics is by serving as a supplementary check to caring practice, (denouncing caring relations that use others as mere means), and by providing a rhetorical vehicle for establishing care as a right.

6. Maternalism

As a theory rooted in practices of care, care ethics emerged in large part from analyses of the reasoning and activities associated with mothering. Although some critics caution against the tendency to construe all care relations in terms of a mother-child dyad, Ruddick and Held use a maternal perspective to expand care ethics as a moral and political theory. In particular, Ruddick argues that “maternal practice” yields specific kinds of thinking and supports a principled resistance to violence. Ruddick notes that while some mothers support violence and war, they should not because of how it threatens the goals and substance of care. Defining a mother as “a person who takes responsibility for children’s lives and for whom providing child care is a significant part of his or her working life”, Ruddick stipulates that both men and women can be mothers (40). She identifies the following metaphysical attitudes, cognitive capacities, and virtues associated with mothering: preservative love (work of protection with cheerfulness and humility), fostering growth (sponsoring or nurturing a child’s unfolding), and training for social acceptability (a process of socialization that requires conscience and a struggle for authenticity). Because children are subject to, but defy social expectations, the powers of mothers are limited by the “gaze of the others”. Loving attention helps mothers to perceive their children and themselves honestly so as to foster growth without retreating to fantasy or incurring loss of the self.

Expanding on the significance of the bodily experience of pregnancy and birth, Ruddick reasons that mothers should oppose a sharp division between masculinity and femininity as untrue to children’s sexual identities. In so doing, mothers should challenge the rigid division of male and female aspects characteristic of military ideology because it threatens the hope and promise of birth. Ruddick creates a feminist account of maternal care ethics that is rooted in the vulnerability, promise, and power of human bodies, and that by resisting cheery denial, can transform the symbols of motherhood into political speech.

But however useful the paradigm for mothering has been to care ethics, many find it to be a limited and problematic framework. Some critics reject Ruddick’s suggestion that mothering is logically peaceful, noting that mothering may demand violent protectiveness and fierce response. Although Ruddick acknowledges that many mothers support military endeavors and undermine peace movements, some critics are unconvinced that warfare is always illogical and universally contrary to maternal practice. Despite Ruddick’s recognition of violence in mothering, others object that a motherhood paradigm offers a too narrowly dyadic and romantic paradigm, and that this approach mistakenly implies that characteristics of a mother-child relationship are universal worldly qualities of relationship. For these reasons, some care ethicists, even when in agreement over the significance of the mother-child relationship, have sought to expand the scope of care ethics by exploring other paradigms of care work, such as friendship and citizenship.

7. International Relations

Care ethics was initially viewed as having little to say about international relations. With an emphasis on known persons and particular selves, care ethics did not seem to be a moral theory suited to guide relations with distant or hostile others. Fiona Robinson challenges this idea, however, by developing a critical ethics of care that attends to the relations of dependency and vulnerability that exist on a global scale (Robinson, 1999). Robinson’s analysis expands the sentiment of care to address the inequalities within current international relations by promoting a care ethic that is responsive and attentive to the difference of others, without presuming universal homogeneity. She argues that universal principles of right and wrong typically fail to generate moral responses that alleviate the suffering of real people. But she is optimistic that a feminist phenomenological version of care ethics can do so by exploring the actual nature, conditions, and possibilities of global relations. She finds that the preoccupation with the nation state in cosmopolitanism and communitarianism, and the enforced global primacy of liberal values such as autonomy, independence, self-determination, and others, has led to a ‘culture of neglect’. This culture is girded by a systemic devaluing of interdependence, relatedness, and positive interaction with distant others. A critical ethic of care understands the global order not as emerging from a unified or homogeneous humanity, but from structures that exploit differences to exclude, marginalize and dominate. While Robinson doubts the possibility of “a more caring world” where poverty and suffering are entirely eliminated, she finds that a critical care ethic may offer an alternative mode of response that can motivate global care.

Likewise, Held is hopeful that care ethics can be used to transform international relations between states, by noticing cultural constructs of masculinity in state behaviors, and by calling for cooperative values to replace hierarchy and domination based on gender, class, race and ethnicity (Held, 2006). Care ethicists continue to explore how care ethics can be applied to international relations in the context of the global need for care and in the international supply and demand for care that is served by migrant populations of women.

8. Political Theory

As a political theory, care ethics examines questions of social justice, including the distribution of social benefits and burdens, legislation, governance, and claims of entitlement. One of the earliest explorations of the implications of care ethics for feminist political theory was in Seyla Behabib’s article “The Generalized and the Concrete Other:  The Kohlberg-Gilligan Controversy and Feminist Theory” (Benhabib, 1986). Here, Benhabib traces a basic dichotomy in political and moral theory drawn between the public and private realms. Whereas the former is thought to be the realm of justice, the social and historical, and generalized others, the latter is thought to be the realm of the good life, the natural and atemporal, and concrete others. The former is captured by the favored metaphor of social contract theory and the “state of nature”, wherein men roam as adults, alone, independent, and free from the ties of birth by women. Benhabib traces this metaphor, internalized by the male ego, within the political philosophies of Thomas Hobbes, John Locke, and John Rawls, and the moral theories of Immanuel Kant and Lawrence Kohlberg. She argues that under this conception, human interdependency, difference, and questions about private life become irrelevant to politics.

The earliest substantial account of care as a political philosophy is offered by Tronto, who identifies the traditional boundary between ethics and politics as one of three boundaries  which serves to stymie the political efficacy of a woman’s care ethic, (the other two being the boundary between the particular and abstract/impersonal moral observer, and the boundary between public and private life) (Tronto, 1993). Together, these boundaries obscure how care as a political concept illuminates the interdependency of human beings, and how care could stimulate democratic and pluralistic politics in the United States by extending a platform to the politically disenfranchised. Following Tronto, a number of feminist care ethicists explore the implications of care ethics for a variety of political concepts, including Bubeck who adapts Marxist arguments to establish the social necessity and current exploitation of the work of care; Sevenhuijsen who reformulates citizenship to be more inclusive of caring need and care work; and Kittay who develops a dependency based concept of equality (Bubeck, 1995; Sevenhuijsen, 1998; Kittay, 1999). Other authors examine the relevance of care ethics to the political issues of welfare policy, restorative justice, political agency, and global business.

The most comprehensive articulation of care ethics as a political theory is given by Engster, who defends a need based account of moral obligation (Engster, 2007). Engster’s “minimal capability theory” is formed around two major premises—that all human beings are dependent upon others to develop their basic capabilities, and that in receiving care, individuals tacitly and logically become obliged to care for others. Engster understands care as a set of practices normatively informed by three virtues: attention, responsiveness, and respect. Defining care as everything we do to satisfy vital biological needs, develop and sustain basic capabilities, and avoid unnecessary suffering, Engster applies these goals to domestic politics, economic justice, international relations, and culture. Engster holds governments and businesses responsible for offering economic provisions in times of sickness, disability, frail old age, bad luck, and reversal of fortune, for providing protection, health care, and clean environments, and for upholding the basic rights of individuals. He calls for businesses to balance caring and commodity production by making work and care more compatible, although he surmises that the goals of care need not fully subordinate economic ends such as profitability.

According to Engster, care as a political theory has universal application because conditions of dependency are ubiquitous, but care need not be practiced by all groups in the same way, and has no necessary affinities with any particular political system, including Marxism and liberalism. Governments ought to primarily care for their own populations, but should also help the citizens of other nations living under abusive or neglectful regimes, within reasonable limits. International humanitarian interventions are more obligatory than military given the risk of physical harm, and the virtues of care can help the international community avoid dangers associated with humanitarian assistance. With specific reference to cultural practices in the U.S., Engster recommends a number of policy changes to education, employment, and the media.

9. Caring for Animals

While Gilligan was relatively silent about the moral status of animals in care ethics, Noddings made it clear that humans have moral obligations only to animals which are proximate, open to caring completion, and capable of reciprocity. On these grounds she surmises that while the one-caring has a moral obligation to care for a stray cat that shows up at the door and to safely transport spiders out of the house, one is under no obligation to care for a stray rat or to become a vegetarian. She further rejects Peter Singer’s claim that it is specieist to favor humans over animals. Other care ethicists, however, such as Rita Manning, point out differences in our obligations to care for companion, domesticated, and wild animals based upon “carefully listening to the creatures who are with you in [a] concrete situation” (Manning, 1992; 1996).

The application of care ethics to the moral status of animals has been most thoroughly explored by Carol Adams and Josephine Donovan (Adams and Donovan 1996; 2007). Expanding on Adams’ original analysis of the sexual politics of meat (Adams, 1990), they maintain that a feminist care tradition offers a superior foundation for animal ethics. They specifically question whether rights theory is an adequate framework for an animal defense ethic because of its rationalist roots and individualist ontology, its tendency to extend rights to animals based on human traits, its devaluing of emotion and the body, and its preference for abstract, formal, and quantifiable rules. Alternatively, they argue that a feminist care ethic is a preferable foundation for grounding moral obligations to animals because its relational ontology acknowledges love and empathy as major bases for human-animal connections, and its contextual flexibility allows for a more nuanced consideration of animals across a continuum of difference.

Engster similarly argues that the human obligation to care for non-human animals is limited by the degree to which non-human animals are dependent upon humans (Engster, 2006). Because an obligation to care is rooted in dependency, humans do not have moral obligations to care for animals that are not dependent upon humans. However, an obligation to care for animals is established when humans make them dependent by providing food or shelter. Engster surmises that neither veganism nor vegetarianism are required providing that animals live happy, mature lives, and are humanely slaughtered, but also acknowledges that the vast majority of animals live under atrocious conditions that care ethics renounces.

Empirical studies suggest interesting differences between the way that men and women think about the moral status of animals, most notably, that women are more strongly opposed to animal research and meat eating, and report being more willing to sacrifice for these causes, than men (Eldridge and Gluck, 1996). While feminist care ethicists are careful not to take such empirical correlations as an automatic endorsement of these views, eco-feminists like Marti Kheel explicate the connection between feminism, animal advocacy, environmental ethics, and holistic health movements (Kheel, 2008). Developing a more stringent obligation to care for animals, Kheel posits the uniqueness of all animals, and broadens the scope of the moral obligation of care to include all individual beings as well as larger collectives, noting that the majority of philosophies addressing animal welfare adopt masculine approaches founded on abstract rules, rational principles, and generalized perspectives.

10. Applied Care Ethics

In addition to the above topics, care ethics has been applied to a number of timely ethical debates, including reproductive technology, homosexuality and gay marriage, capital punishment, political agency, hospice care, and HIV treatment, as well as aspects of popular culture, such as the music of U-2 and The Sopranos. It increasingly informs moral analysis of the professions, such as education, medicine, nursing, and business, spurring new topics and modes of inquiry. It is used to provide moral assessment in other ethical fields, such as bioethics, business ethics, and environmental ethics. Perhaps because medicine is a profession that explicitly involves care for others, care ethics was quickly adopted in bioethics as a means for assessing relational and embodied aspects of medical practices and policies. As well as abortion, both Susan Sherwin and Rosemary Tong consider how feminist ethics, including an ethic of care, provides new insights into contraception and sterilization, artificial insemination and in vitro fertilization, surrogacy, and gene therapy. Care ethics is also applied by other authors to organ transplantation, the care of high risk patients, artificial womb technologies, advanced directives, and the ideal relationships between medical practitioners and patients.

11. Care Movements

There are a rising number of social movements organized around the concerns highlighted in care ethics. In 2000, Deborah Stone called for a national care movement in the U.S. to draw attention to the need for social programs of care such as universal health care, pre-school education, care for the elderly, improved foster care, and adequate wages for care-givers. In 2006, Hamington and Dorothy Miller compiled a number of essays concerning the theoretical understanding and application of care ethics to public life, including issues of welfare, same-sex marriage, restorative justice, corporate globalization, and the 21st century mother’s movement (Hamington and Miller, 2006). A number of formal political organizations of care exist, most of them on the internet, which variously center around themes of motherhood, fatherhood, health care, care as a profession, infant welfare, the woman’s movement, gay and lesbian rights, disability, and elder care. These organizations work to disseminate information, organize care advocates on key social issues, and form voting blocks. Of those focused around mothering, one of the most prominent is MomsRising.org, organized by Joan Blades, one of the original founders of MoveOn.org, and Kristin Rowe-Finkbeiner. Others include: The Mothers Movement Online,  Mothers Ought to Have Equal Rights, the National Association of Mothers’ Centers, and Mothers and More. Judith Stadtman Tucker notes that problems with some mother’s movements include an overly exclusive focus on the interests of white, middle class care-givers, and an occasional lack of serious-mindedness, but she is also hopeful that care movements organized around motherhood can forge cultural transitions, including shorter work weeks, universal health care unhitched from employment, care leave policies, and increased levels of care work performed by men and states (Tucker, 2001).

12. References and Further Reading

  • Adams, Carol. The Sexual Politics of Meat. New York:  Continuum, 1990.
  • Adams, C. and Donovan, J. Beyond Animal Rights:  A feminist Caring Ethic for the Treatment of Animals. New York:  Continuum, 1996.
  • Adams, C. and Donovan, J. The Feminist Care Tradition in Animal Ethics. New York:  Columbia University Press, 2007.
  • Baier, Annette. “Hume: The Woman’s Moral Theorist?” in Women and Moral Theory, Kittay, Eva Feder, and Meyers, Diana (ed.s). U.S.A.: Rowman & Littlefield, 1987.
  • Baier, Annette. Moral Prejudices: Essays on Ethics. Cambridge, MA: Harvard University Press, 1994.
  • Benhabib, Seyla. “The Generalized and Concrete Other:  The Kohlberg-Gilligan Controversy and Moral Theory”, in Praxis International (1986) 38-60.
  • Blades, Joan and Rowe-Finkbeiner, Kristin. The Motherhood Manifesto: What America’s Moms Want and What to do about It. New York, NY: Nation Books, 2006.
  • Bowden, Peta. “An ‘Ethics of Care’ in Clinical Settings: Encompassing ‘Feminine” and “Feminist” Perspectives.” Nursing Philosophy 1.1 (2001): 36-49.
  • Bowden, Peta. Caring: Gender Sensitive Ethics. New York, NY: Routledge, 1997.
  • Brabeck, Mary. “Moral Judgment:  Theory and Research on Differences between males and Females” Developmental Review 3 (1983) 274-91.
  • Bubeck, Diemut. Care, Gender and Justice. Oxford: Clarendon Press, 1995.
  • Card, Claudia. “Caring and Evil.” Hypatia 5.1 (1990) 101-8.
  • Clement, Grace. Care, Autonomy and Justice: Feminism and the Ethic of Care. Boulder, CO: Westview Press, 1996.
  • Davion, Victoria. “Autonomy, Integrity, and Care” Social Theory and Practice 19.2 (1993) 161-82.
  • Donovan, Josephine and Adams, Carol, ed. Beyond Animal Rights: A Feminist Caring Ethic for the Treatment of Animals. New York, NY: Continuum Press, 1996.
  • Engster, Daniel. “Care ethics and Animal Welfare.” Journal of Social Philosophy 37.4 (2006): 521.
  • Engster, Daniel. The Heart of Justice. Oxford:  Oxford University Press, 2007.
  • Friedman, Marilyn. “Beyond Caring:  The De-Moralization of Gender” in V. Held, Justice and Care:  Essential Readings in Feminist Ethics Boulder, CO:  Westview Press (2006): 61-77.
  • Fry, Sara T. “The Role of Caring in a Theory of Nursing Ethics.” Hypatia 4.2. (1989): 88-101.
  • Gilligan, Carol. “Women’s Place in Man’s Life Cycle.” Harvard Educational Review, 29. (1979).
  • Gilligan, C. In A Different Voice. Cambridge, Mass.: Harvard University Press, 1982.
  • Gilligan, C. “Adult Development and Women’s Development:  Arrangements for a Marriage” in J. Giele, ed. Women in the Middle Years. New York:  Wiley-Interscience Publications, John Wiley and Sons (1982).
  • Gilligan, C. “Reply” (to critics). Signs 11.2. (1986): 324-333.
  • Gilligan, C. and Belenky, M. “A Naturalist Study of Abortion Decisions.” In R. Selman and R. Yando (ed.s) New Directions in Child Development:  Clinical-Developmental Psychology. 7. San-Francisco, CA:  Jossey-Bass. (1980): 69-90.
  • Gilligan, C. Langdale, S. Lyons, N. & Murphy, J. The Contribution of Women’s Thought to Developmental Theory:  The Elimination of Sex Bias in Moral Developmental research and Education. Final Report to the National Institute of Education. Cambridge, Mass: Harvard University Press, 1982.
  • Gilligan, C. and Wiggins, G. “The Origins of Morality in Early Childhood Relationships”  in J. Kaggan and S. Lamb (ed.s) The Emergence of Morality in Early Childhood. Chicago:  University of Chicago Press (1987).
  • Gilligan, Ward, Taylor, and Bardige. Mapping the Moral Domain:  A Contribution of Women’s Thinking to Psychological Theory and Education. Cambridge, MA:  Harvard University Press, 1988.
  • Gilligan, Lyons, and Hammer. Making Connections:  The Relational Worlds of Adolescent Girls at Emma Willard School. Cambridge, MA:  Harvard University Press, 1990.
  • Haan, N. et al. “family Moral Patterns” Child Development 47.7 (1976) 1204-06.
  • Halwani, Raja. Virtuous Liaisons:  Care, Love, Sex, and Virtue Ethics. Peru, IL:  Open Court, 2003.
  • Hamington, Maurice. Embodied Care: Jane Addams, Maurice Merleau-Ponty and Feminist Ethics. Chicago, IL: University of Illinois Press, 2004.
  • Hamington, Maurice and Miller, Dorothy, ed. Socializing Care. New York: NY: Rowman & Littlefield, 2006.
  • Hanen, Marsha and Nelson, Kai, ed.s. Science, Morality, and Feminist Theory. Calgary: University of Calgary Press, 1987.
  • Harding, Sandra. “The Curious Coincidence of Feminine and African Moralities.” In Women and Moral Theory. Ed. Kittay, Eva Feder and Meyers, Diane. U.S.A: Rowman & Littlefield, 1989. 296-317.
  • Harrington, Mona. Care and Equality: Inventing a New Family Politics. New York, NY: Routledge, 2000.
  • Held, Virginia. Feminist Morality: Transforming Culture, Society, and Politics. Chicago, IL: University of Chicago Press, 1993.
  • Held, Virginia. “Feminist Moral Inquiry and the Feminist Future” in V. Held (ed.) Justice and Care. Boulder, Co:  Westview Press, 2006:  153-176.
  • Held, Virginia. The Ethics of Care. New York, NY: Oxford University Press, 2006.
  • Herr, Ranjoo Seodu. “Is Confucianism Compatible with care ethics?: A Critique.” Philosophy East and West 53.4: 471-489.
  • Hoagland, Sarah. Lesbian Ethics. Palo Alto, CA: Institute of Lesbian Studies, 1988.
  • Hoagland, Sarah Lucia. “Some Thoughts about Caring.” Feminist Ethics. Ed. Claudia Card. Lawrence, KS: University of Kansas Press, 1991.
  • Jaggar, Allison. “”Feminist Ethics: Problems, Projects, Prospects.” Feminist Ethics. Lawrence, Kansas: University of Kansas, 1991. 78- 104.
  • Kheel, Marti. Nature Ethics. New York, NY: Rowman & Littlefield, 2008.
  • Kittay, Eva Feder and Myers, Diana T., ed. Women and Moral Theory. U.S.A.: Rowman and Littlefield, 1987.
  • Kittay, Eva Feder. Love’s Labor: Essays on Women, Equality, and Dependency. New York, NY: Routledge, 1999.
  • Koehn, Daryl. Rethinking Feminist Ethics. New York, NY: Routledge, 1998.
  • Kuhse, Helga. Caring: Nurses, Women and Ethics. Oxford: Blackwell, 1997.
  • Kuhse, H. “Clinical Nursing: ‘Yes’ to Caring, ‘No’ to Female Ethics of Care.” Bioethics 9.3 (1995): 207-219.
  • Lai Tao, Julia Po-Wah. “Two Perspectives of Care: Confucian Ren and Feminst Care.” Journal of Chinese Philosophy 27.2 (2000): 215-40.
  • Larrabee, Mary Jeane, ed. An Ethic of Care: Feminist and Interdisciplinary Perspectives. New York, NY: Routledge, 1993.
  • Li, Chenyang, ed. The Sage and the Second Sex: Confucianism, Ethics, and Gender. Chicago, IL: Open Court Press, 2000.
  • Lijun, Yuan. “Ethics of Care and the Concept of Jen: A Reply to Chenyang LI.” Hypatia 17.1 (2002): 107-129.
  • Ma, John Paley. ” Virtues of Autonomy: The Kantian Ethics of Care.” Nursing Philosophy 3.2 (2002): 133-43.
  • Manning, Rita. Speaking from the Heart: A Feminist Perspective on Ethics. New York: NY: Rowman and Littlefield, 1992.
  • Manning, R. “Caring for Animals”. In Adams, C. and Donovan, J (ed.s) Beyond Animal Rights:  A feminist Caring Ethic for the Treatment of Animals. New York:  Continuum, 1996.
  • Mayeroff, Milton. On Caring. New York: Harper & Row, 1971.
  • McLaren, Margaret. “Feminist Ethics: Care as a Virtue.” In Feminists Doing Ethics. Lanham, MD: Rowman and Littlefield, 2001.
  • Miller, Sarah Clark. “A Kantian Ethic of Care?.” In Andrew, Keller and Schwartzman (ed.s) Feminist Interventions in Ethics and Politics: Feminist Ethics and Social Theory. Boulder: CO: Rowman & Littlefield, 2005. 111-127.
  • Nagl-Docekal H. (1997) Feminist ethics: how it could benefit from Kant’s moral philosophy. In: Feminist Interpretations of Immanuel Kant (ed. R.M. Schott), pp. 101124. Pennsylvania State University Press, University Park, Pennsylvania.
  • Nelson, Hilde. “Against Caring.” The Journal of Clinical Ethics (1997): 8-15.
  • Noddings, Nel. Caring: A Feminine Approach to Ethics and Moral Education. Berkeley: University of CA Press, 1982.
  • Noddings, Nel. Starting at Home: Caring and Social Policy. Berkeley, CA: University of CA Press, 2002.
  • Puka, Bill. “The Liberation of Caring: A Different Voice for Gilligan’s ‘Different Voice’.” Hypatia 55.1 (1990): 58-82.
  • Rachels, James. The Elements of Moral Philosophy. San Francisco, CA:  McGraw-Hill, 1999.
  • Robinson, Fiona. Globalizing Care: Ethics, Feminist Theory, and International Relations. Boulder, CO: West View Press, 1999.
  • Ruddick, Sara. Maternal Thinking: Toward a Politics of Peace. New York, NY: Ballentine Books, 1989.
  • Ruddick, Sara. ”Care as Labor and Relationship” in Mark S. Haflon and Joram C. Haber (ed.s) Norms and Values: Essays on the Work of Virginia Held. Lanham, MD: Rowman & Littlefield, 1998.
  • Sander-Staudt, Maureen. “The Unhappy Marriage of Care Ethics and Virtue Ethics.” Hypatia 21.4 (2006): 21-40.
  • Sevenhuijsen, Selma. Citizenship and the Ethics of Care. New York, NY: Routledge, 1998.
  • Slote, Michael. “The Justice of caring” In Virtues and Vices. Paul, Miller, and Paul (ed.s) New York:  Cambridge University Press. 1998a.
  • Slote, M. Caring in the Balance” in Norms and Values. Haber, Halfon (ed.s). Lanham, MD:  Rowman & Littlefield. 1998b.
  • Slote, M. The Ethics of Care and Empathy.” New York, NY: Routledge, 2007.
  • Star, Daniel. “Do Confucians really Care? A Defense of the Distinctiveness of care ethics: A Reply to Chenyang Li.” Hypatia 17.1 (2002): 77-106.
  • Stone, Deborah. “Why we need a Care Movement.” The Nation Feb. 25 (2000): 1-5.
  • Tronto, Joan. Moral Boundaries: A Political Argument for an Ethic of Care. New York, NY: Routledge, 1994.
  • Tronto, J. “Women and Caring:  What can Feminists learn about morality from Caring?” in V. Held, Justice and Care:  Essential Readings in Feminist Ethics Boulder, CO:  Westview Press (2006) 101-115.
  • Tucker, Judith Stadtman. “Care as a Cause: Framing the Twenty-First Century Mother’s Movement.”. In Hamington, Maurice and Miller, Dorothy (ed.s) Socializing Care, New York: NY, Rowman & Littlefield Publishers, 2006.
  • Walker, Vanessa Siddle and Snarey, John, ed. Race-Ing Moral Formation: African American Perspectives on Care and Justice. New York, NY: Teachers College Press, 2004.
  • West, Robin. Caring for Justice. New York, NY: New York University Press, 2000.

 

Author Information

Maureen Sander-Staudt
Email: Maureen.Sander-Staudt@asu.edu
Arizona State University
U. S. A.

Augustine: Political and Social Philosophy

augustineSt. Augustine (354-430 C.E.), originally named Aurelius Augustinus, was the Catholic bishop of Hippo in northern Africa.  He was a skilled Roman-trained rhetorician, a prolific writer (who produced more than 110 works over a 30-year period), and by wide acclamation, the first Christian philosopher.  Writing from a unique background and vantage point as a keen observer of society before the fall of the Roman Empire, Augustine’s views on political and social philosophy constitute an important intellectual bridge between late antiquity and the emerging medieval world.  Because of the scope and quantity of his work, many scholars consider him to have been the most influential Western philosopher.

Although Augustine certainly would not have thought of himself as a political or social philosopher per se, the record of his thoughts on such themes as the nature of human society, justice, the nature and role of the state, the relationship between church and state, just and unjust war, and peace all have played their part in the shaping of Western civilization. There is much in his work that anticipates major themes in the writings of moderns like Machiavelli, Luther, Calvin and, in particular, Hobbes.

Table of Contents

  1. Background
    1. Historical Context
    2. Augustinian Political “Theory”
    3. The Augustinian World View
  2. Foundational Political and Social Concepts
    1. Two Cities
    2. Justice and the State
    3. Church and State
  3. War and Peace
    1. War Among Nations
    2. War and Human Nature
    3. The Just War
    4. Jus ad Bellum and Jus in Bello
    5. Augustine’s Conception of Peace
  4. Conclusion
  5. References and Further Reading
    1. Primary Sources
    2. Secondary Sources

1. Background

a. Historical Context

Augustine’s political and social views flow directly from his theology.  The historical context is essential to understanding his purposes.  Augustine, more than any other figure of late antiquity, stands at the intellectual intersection of Christianity, philosophy, and politics.  As a Christian cleric, he takes it as his task to defend his flock against the unremitting assault by heresies spawned in an era uninformed by the immediate, divine revelations which had characterized the apostolic age. As a philosopher, he situates his arguments against the backdrop of Greek philosophy in the Platonic tradition, particularly as formulated by the Neo-Platonists of Alexandria.  As a prominent Roman citizen, he understands the Roman Empire to be the divinely-ordained medium through which the truths of Christianity are to be both spread and safeguarded.

Augustine died reciting the Penitential Psalms as the Vandals besieged the city of Hippo on the coast of northern Africa (now the city of Annaba, in Algeria). This occurred two decades after the sacking of Rome by Alaric.

b. Augustinian Political “Theory”

Augustine’s willingness to grapple with substantive political and social issues does not mean, however, that the presentation of his ideas comes pre-packaged as a simple system—or even as a system at all.  Quite the contrary, his political arguments are scattered throughout his voluminous writings, which include autobiography, sermons, expositions, commentaries, letters, and Christian apologetics.  Moreover, the contexts in which the political and social issues are addressed are equally varied.

Nevertheless, it would be a mistake to suggest that his arguments are not informed by a cogent theory.  Taken together, his political and social musings constitute a remarkable tapestry.  Indeed, the consistency evident in the expression of his varied but related ideas leads both fairly and directly to the assumption that Augustine’s political-philosophical statements arise from a consistent set of premises which guide him to his conclusions; in other words, they reveal the presence of an underlying, if unstated, theory.

c. The Augustinian World View

Because Augustine considers the Christian scriptures to constitute the touchstone against which philosophy—including political philosophy—must be assayed, his world view necessarily includes the Christian tenets of the Creation, the Fall of man, and the Redemption.  In stark contrast to the pagan philosophers who preceded him—who viewed the unfolding of history as a cyclical phenomenon, Augustine conceives history in strictly linear terms, with a beginning and an end.  According to Augustine, the earth was brought into existence ex nihilo by a perfectly good and just God, who created man. The earth is not eternal; the earth, as well as time, has both a beginning and an end.

Man, on the other hand, was brought into existence to endure eternally. Damnation is the just desert of all men because of the Fall of Adam, who, having been created with free will, chose to disrupt the perfectly good order established by God. As the result of Adam’s Fall, all human beings are heirs to the effects of Adam’s original sin, and all are vessels of pride, avarice, greed and self-interest.  For reasons known only to God, He has predestined some fixed number of men for salvation (as a display of His unmerited mercy—a purely gratuitous act altogether independent even of God’s foreknowledge of any good deeds those men might do while on earth), while most He has predestined for damnation as a just consequence of the Fall. The onward march of human history, then, constitutes the unfolding of the divine plan which will culminate in one or the other outcome for every member of the human family.

Within this framework of political and legal systems, the state is a divinely ordained punishment for fallen man, with its armies, its power to command, coerce, punish, and even put to death, as well as its institutions such as slavery and private property. God shapes the ultimate ends of man’s existence through it.  The state simultaneously serves the divine purposes of chastening the wicked and refining the righteous.  Also simultaneously, the state constitutes a sort of remedy for the effects of the Fall, in that it serves to maintain such modicum of peace and order as it is possible for fallen man to enjoy in the present world.

While it is not clear that God predestines every event during man’s sojourn on earth, nothing happens in contravention of His designs. In any case, predestination fixes the ultimate destination of every human being—as well as the political states to which they belong.  Hence, predestination for Augustine is the proverbial elephant in the room. Whether predestination was divinely contemplated prior or incidental to the Fall (a point which Augustine never clearly articulates), the following problem arises:  If one is to be saved or damned by divine fiat, what difference does it make whether the world possesses the social order of a state?  For those who are predestined for damnation, what is the point of their being “chastened” (or a means to encourage their reformation) by the state?  For those predestined for salvation, what is the point of their being refined by the vicissitudes of life in a political state?  In order to prevent the collapse of such a systematic account of the human condition as Augustine provides, the question simply must be set aside as a matter unknowable to finite man.  However, this means that the best Augustine can hope to accomplish is to provide a description of political life on earth, but not a prescription for how to obtain membership in the perfect society of heaven; for, even strict obedience to Christian precepts will not compensate for one’s not being gratuitously elected for salvation.

As the social fabric of the world around him unravels in the twilight years of the Roman Empire, Augustine attempts to elucidate the relationship between the eternal, invisible verities of his faith and the stark realities of the present, observable political and social conditions of humanity.  At the intersection of these two concerns, Augustine finds what for him is the central question of politics:  How do the faithful operate successfully but justly in an unjust world, , where selfish interests dominate, where the general welfare is rarely sought, and where good and evil men are inextricably (and, to human eyes, often unidentifiably) intermingled, yet search for a heavenly reward in the world hereafter?

2. Foundational Political and Social Concepts

a. Two Cities

Even though those elected for salvation and those elected for damnation are thoroughly intermingled, the distinction arising from their respective destinies gives rise to two classes of persons, to whom Augustine refers collectively and allegorically as cities—the City of God and the earthly city.  Citizens of the earthly city are the unregenerate progeny of Adam and Eve, who are justifiably damned because of Adam’s Fall.  These persons, according to Augustine, are aliens to God’s love (not because God refuses to love them, but because they refuse to love God as evidenced by their rebellious disposition inherited from the Fall).  Indeed, the object of their love—whatever it may be—is something other than God.  In particular, citizens of the “earthly city” are distinguished by their lust for material goods and for domination over others.  On the other hand, citizens of the City of God are “pilgrims and foreigners” who (because God, the object of their love, is not immediately available for their present enjoyment) are very much out of place in a world without an earthly institution sufficiently similar to the City of God.  No political state, nor even the institutional church, can be equated with the City of God.  Moreover, there is no such thing as “dual citizenship” in the two cities; every member of the human family belongs to one—and only one.

b. Justice and the State

The Augustinian notion of justice includes what by his day was a well-established definition of justice of “giving every man his due.”  However, Augustine grounds his application of the definition in distinctively Christian philosophical commitments:  “justice,” says Augustine, “is love serving God only, and therefore ruling well all else.”  Accordingly, justice becomes the crucial distinction between ideal political states (none of which actually exist on earth) and non-ideal political states—the status of every political state on earth.  For example, the Roman Empire could not be synonymous with the City of God precisely because it lacked true justice as defined above; and since, “where there is no justice there is no commonwealth,” Rome could not truly be a commonwealth, that is, an ideal state.  “Remove justice,” Augustine asks rhetorically, “and what are kingdoms but gangs of criminals on a large scale?  What are criminal gangs but petty kingdoms?”  No earthly state can claim to possess true justice, but only some relative justice by which one state is more just than another.  Likewise, the legitimacy of any earthly political regime can be understood only in relative terms:  The emperor and the pirate have equally legitimate domains if they are equally just.

Nevertheless, political states, imperfect as they are,  serve a divine purpose.  At the very least, they serve as vehicles for maintaining order and for preventing what Hobbes will later call the “war of all against all.”  In that respect, the state is a divine gift and an expression of divine mercy—especially if the state is righteously ruled.  The state maintains order by keeping wicked men in check through the fear of punishment.  Although God will eventually punish the sins of all those elected for damnation, He uses the state to levy more immediate punishments against both the damned and the saved (or against the wicked and the righteous, the former dichotomy not necessarily synonymous with the latter).  Rulers, as God’s ministers, punish the guilty and always are justified in punishing sins “against nature,” and circumstantially justified in punishing sins “against custom” or “against the laws.” The latter two categories of sins change from time to time.  In this regard, the institution of the state marks a relative return to order from the chaos of the Fall.  Rulers have the right to establish any law that does not conflict with the law of God. Citizens have the duty to obey their political leaders regardless of whether the leader is wicked or righteous.  There is no right of civil disobedience.  Citizens are always duty bound to obey God; and when the imperatives of obedience to God and obedience to civil authority conflict, citizens must choose to obey God and willingly accept the punishment of disobedience. Nevertheless, those empowered to levy punishment should take no delight in the task.  For example, the prayer of the judge who condemns a man to death should be, as Augustine’s urges, “From my necessities [of imposing judgment to a person] deliver thou me.”

c. Church and State

Even though the ostensible reason for the state’s divinely appointed existence is to assist and bless humankind, there is no just state, says Augustine, because men reject the thing that best could bring justice to an imperfect world, namely, the teachings of Christ.  Augustine does not suggest that current rejection of Christ’s teachings means that all hope for future amendment and reformation is lost.  However, Augustine’s whole tenor is that there is no reason to expect that the political jurisdictions of this world ever will be anything different than what they now are, if the past is any predictor of the future.  Hence, Augustine concludes that

Christ’s servants, whether they are kings, or princes, or judges, or soldiers . . . are bidden, if need be, to endure the wickedness of an utterly corrupt state, and by that endurance to win for themselves a place of glory . . . in the Heavenly Commonwealth, whose law is the will of God.

Augustine clearly holds that the establishment and success of the Roman Empire, along with its embracing of Christianity as its official religion, was part of the divine plan of the true God.  Indeed, he holds that the influence of Christianity upon the empire could be only salutary in its effect:

Were our religion listened to as it deserves,” says Augustine, “it would establish, consecrate, strengthen, and enlarge the commonwealth in a way beyond all that Romulus, Numa, Brutus, and all the other men of renown in Roman history achieved.

Still, while Augustine doubtless holds that it is better for Rome to be Christian than not, he clearly recognizes that officially embracing Christianity does not automatically transform an earthly state into the City of God.  Indeed, he regards Rome as “a kind of second Babylon.”  Even if the Roman Emperor and the Roman Pontiff were one and the same—even if the structures of state and church merged so as to become institutionally the same—they would not thereby become the City of God, because citizenship in the City of God is determined at the individual and not the institutional level.

Augustine does not wish ill for Rome.  Quite the contrary, he supplicates God for Rome’s welfare,  since he belongs to it, in temporal terms at least.  He sees Rome as the last bastion against the advances of the pagan barbarians, who surely must not be allowed to overrun the mortal embodiment of Christendom that Rome represents.  Nevertheless, Augustine cannot be overly optimistic about the future of the Roman state as such—not because it is Rome, but because it is a state; for any society of men other than the City of God is part and parcel of the earthly city, which is doomed to inevitable demise.  Even so, states like Rome can perform the useful purpose of championing the cause of the Church, protecting it from assault and compelling those who have fallen away from fellowship with it to return to the fold.  Indeed, it is entirely within the provinces of the state to punish heretics and schismatics.

3. War and Peace

a. War Among Nations

Inasmuch as the history of human society is largely the history of warfare, it seems quite natural for Augustine to explain war as being within God’s unfolding plan for human history.  As Augustine states, “It rests with the decision of God in his just judgment and mercy either to afflict or console mankind, so that some wars come to an end more speedily, others more slowly.”

Wars serve the function of putting mankind on notice, as it were, of the value of consistently righteous living.  Although one might feel to call upon Augustine to defend the notion that God can, with propriety, use so terrible a vehicle as war to chasten the wicked, two points must be kept in mind:  The first point is that, for Augustine, all of God’s acts are just, by definition, even if the application of that definition to specific cases of the human experience eludes human reasoning.             This point invites a somewhat more philosophically intriguing question:  Is it just to compel men to do good who, when left to their own devices, would prefer evil?  If one were forced to act righteously contrary to his or her will, is it not the case that he or she would still lack the change of heart that is necessary to produce a repentant attitude—an attitude that results in genuine reformation?  Perhaps; but Augustine is unwilling to concede that it is better, in the name of recognizing the agency of others, to let them continue to wallow in evil practices.  Augustine argues,

The aim towards which a good will compassionately devotes its efforts is to secure that a bad will be rightly directed.  For who does not know that a man is not condemned on any other ground than because his bad will deserved it, and that no man is saved who has not a good will?

Exactly how God is to bring about his good purposes through the process of war may not be clear to man in any particular case.  Any who acquire a glimpse of understanding as to why the divine economy operates as it does truly possess a good will and shall not hesitate to administer to those erring, according to God’s direction, the punitive discipline that war is intended to bring.  Moreover, those of good will shall administer discipline to those erring by moving them toward repentance and reformation.

All of this leads conveniently to a second point: War can bring the need to discipline by chastening. Those of good will do not manifest cruelty in the proper administration of punishment but, rather, in the withholding of punishment.  “It does not follow,” Augustine states, “that those who are loved should be cruelly left to yield themselves with impunity to their bad will; but in so far as power is given, they ought to be both prevented from evil and compelled to good.”  What if, however, the violence of war serves only to subdue the wrongdoings of the wicked but fails to produce the change of heart that would characterize the transition from a bad to a good will—much like the case of the criminal who is sentenced to prison but who feels no remorse for his or her actions and, given his or her freedom, would all too readily repeat the crime?  For Augustine, it is always better to restrain an evil man from the commission of evil acts than it is to permit his continued perpetration of those acts.  As for the evil but unrepentant man, it would seem that he will have failed to reap the intended benefit of God’s chastening, which, reckoned by any measure, is a great tragedy indeed.

For Augustine, even the death of the mortal body, as ultimate a penalty as it might appear from the mortal perspective, is not nearly so serious a consequence as that which would ensue if one is left to wallow in sin: “But great and holy men, although they at the time knew excellently well that that death which separates the soul from the body is not to be dreaded, yet, in accordance with the sentiment of those who might fear it, punished some sins with death, both because the living were struck with a salutary fear, and because it was not death itself that would injure those who were being punished with death, but sin, which might be increased if they continued to live.”

Writing after the time when Christianity became the official religion of the Roman Empire, Augustine holds that there is no prohibition against a Christian serving the state as a soldier in its army.  Neither is there any prohibition against taking the lives of the enemies of the state, so long as he does it in his public capacity as a soldier and not in the private capacity of a murderer.  Nevertheless, Augustine also urges that soldiers should go to war mournfully and never take delight in the shedding of blood.

b. War and Human Nature

If, however, the presence of war serves as a defining characteristic of the earthly city, why does Augustine not pursue the course taken by some of the Latin Patristic writers who precede him by labeling war and military service as merely a “worldly” institution in which true Christians have no place.  The answer seems to lie in Augustine’s world view, which differs from that of many of his predecessors in terms of his optimism for man to comprehend the ultimate verities, live in an orderly manner and find his way back to God.  He becomes quite pessimistic though in his view of human nature and of the ability and desire of humans to maintain themselves orderly, much less rightly.  Pride, vanity and the lust for domination entice men toward waging wars and committing all manner of violence, because of men’s tendency to do evil as the result of Adam’s Fall.  Augustine holds that, given the inextricable mixing of citizens of the two cities, the total avoidance of war or its effects is a practical impossibility for all men, including the righteous.  Happily, he holds that the day will come when, coincident with the end of the earthly city, wars will no longer be fought.  For, says Augustine, citing words from the Psalms to the effect that God will one day bring a cessation of all wars,

This not yet see we fulfilled:  yet are there wars, wars among nations for sovereignty; among sects, among Jews, Pagans, Christians, heretics, are wars, frequent wars, some for the truth, some for falsehood contending.  Not yet then is this fulfilled, ‘He maketh wars to cease unto the end of the earth;’ but haply it shall be fulfilled.

For the present, however, man—particularly Christian man—is left with the question of how to live in a world full of war.

c. The Just War

As the Roman Empire collapses around him, Augustine confronted the question of what justifies warfare for a Christian.  On the one hand, the wicked are not particularly concerned about just wars.  On the other hand, the righteous vainly hope to avoid being affected by wars in this life,   and at best they can hope for just wars rather than unjust ones.   This is by no means a perfect solution; but then again, this is not a perfect world.  If it were, all talk of just wars would be altogether nonsensical. Perfect solutions characterize only the heavenly City of God. Its pilgrim citizens sojourning on earth can do no better than try to cope with the present difficulties and imperfections of the earthly life.  Thus, for Augustine, the just war is a coping mechanism for use by the righteous who aspire to citizenship in the City of God.  In terms of the traditional notion of jus ad bellum (justice of war, that is, the circumstances in which wars can be justly fought), war is a coping mechanism for righteous sovereigns who would ensure that their violent international encounters are minimal, a reflection of the Divine Will to the greatest extent possible, and always justified.  In terms of the traditional notion of jus in bello (justice in war, or the moral considerations which ought to constrain the use of violence in war), war is a coping mechanism for righteous combatants who, by divine edict, have no choice but to subject themselves to their political masters and seek to ensure that they execute their war-fighting duty as justly as possible. Sometimes that duty might arise in the most trying of circumstances, or under the most wicked of regimes; for

Christ’s servants, whether they are kings, or princes, or judges, or soldiers, or provincials, whether rich or poor, freemen or slaves, men or women, are bidden, if need be, to endure the wickedness of an utterly corrupt state, and by that endurance to win for themselves a place of glory

hereafter in the heavenly City of God.

In sum, why would a man like Augustine, whose eye is fixed upon attainment of citizenship in the heavenly city, find it necessary to delineate what counts as a just war in this lost and fallen world?  In general terms, the demands of moral life are so thoroughly interwoven with social life that the individual cannot be separated from citizenship in one or the other city.  In more specific terms, the just man who walks by faith needs to understand how to cope with the injustices and contradictions of war as much as he needs to understand how to cope with all other aspects of the present world where he is a stranger and pilgrim.  Augustine takes important cues from both Cicero and Ambrose and synthesizes their traditions into a Christianized world view that still retains strong ties to the pre-Christian philosophic past.  He resolves the dilemma of just war and pacifist considerations by denying the dilemma: war is simply a part of the human experience that God Himself has ordained or permitted.  War arises from, and stands as a clear manifestation of, the nature of fallen man.  For adherents to nominal Christianity, the explanatory power of Augustine’s thoughts on just war is substantial; his approach enables Christians to understand just war as a coping mechanism for just men who are trying to get along as morally (if not as piously) as they can in an imperfect world.

However, since Augustine seeks to resolve the nature of his ethical tensions, the synthetic character of Augustine’s approach to war is important, not merely for adherents to Christianity, but also for others seeking a strictly rational account of the problem.  For example, if one were to take a de-theologized view of Augustine’s approach and focus simply upon the general theoretical problem of the morality of war, Augustine’s attempt fully deserves serious philosophical consideration. His approach explains how a morally upright citizen of a relatively just state could be justified in pursuing warfare, in prosecuting war, and ultimately, although unhappily, in taking human life.  In any case, Augustine’s just war theory arises from his most deeply rooted philosophical assumptions.  Augustine as a Christian philosopher achieves a full synthesis of the Roman and Christian values associated with war in a way that legitimizes war as an instrument of national policy which, although inferior to the perfect ideals of Christianity, is one which Christians cannot altogether avoid and with which they must in some sense make their peace.

d. Jus ad Bellum and Jus in Bello

Traditionally, the philosophical treatment of the just war is divided into two categories:  jus in bellum and jus in bello.  The former describes the necessary (and, by some accounts, sufficient) conditions for justifying engagement in war.  The latter describes the necessary conditions for conducting war in a just manner.

Augustine’s jus ad bellum prescripts enjoin that wars can be initiated justly only on the basis of:

1.  a just cause, such as to defend the state from external invasion; to defend the safety or honor of the state, with the realization that their simultaneous defense might be impossible; to avenge injuries; to punish a nation for failure to take corrective action for wrongs (legal or moral ) committed by its citizens; to come to the defense of allies; to gain the return of something that was wrongfully taken; or to obey a divine command to go to war (which, in practice, issues from the political head of state acting as God’s lieutenant on earth); and in any case, the just cause must be at least more just than the cause of one’s enemies;

2.  a rightly intended will, which has the restoration of peace as its prime objective, takes no delight in the wickedness of potential adversaries, views waging war as a stern necessity, tolerates no action calculated to provoke a war, and does not seek to conquer others merely for conquest’s sake or for territorial expansion; and

3. a declaration of war by a competent authority, and except in the most unusual of circumstances, in a public manner, and only as a last resort.

Concerning jus in bello, Augustine holds that wars, once begun, must be fought in a manner which:

1. represents a proportional response to the wrong to be avenged, with violence being constrained within the limits of military necessity;

2. discriminates between proper objects of violence (that is, combatants) and noncombatants, such as women, children, the elderly, the clergy, and so forth.; and

3. observes good faith in its interactions with the enemy, by scrupulously observing treaties and not prosecuting the war in a treacherous manner.

e. Augustine’s Conception of Peace

Both Augustine’s political world view and his approach to war incorporate his conception of peace.  According to Augustine, God designed all humans to live together in the “bond of peace.” However, fallen man lives in society  as according to the divine will or as opposing it.  Augustine distinguishes the two cities in several important ways, as well as the kind of peace they seek:

There is, in fact, one city of men who choose to live by the standard of the flesh, another of those who choose to live by the standard of the spirit.  The citizens of each of these desire their own kind of peace, and when they achieve their aim, that is the kind of peace in which they live.

Because the common choice of fallen man is a peace of his own liking—one that selfishly serves his own immediate or foreseeable ends, peace becomes, in practice, merely an interlude between ongoing states of war.  Augustine is quick to point out that this life carries with it no guarantee of peace; that blessed state is reserved for the saved in heaven.

Augustine delineates three kinds of peace:  the ultimate and perfect peace which exists exclusively in the City of God, the interior peace enjoyed by the pilgrim citizens of the City of God as they sojourn on earth, and the peace which is common to the two cities. Sadly, Augustine is abundantly clear that temporal peace is rather an anomalous condition in the totality of human history and that perfect peace is altogether unattainable on earth:

Such is the instability of human affairs that no people has ever been allowed such a degree of tranquility as to remove all dread of hostile attacks on their dwelling in this world.  That place, then, which is promised as a dwelling of such peace and security is eternal, and is reserved for eternal beings.

However, Augustine insists that, by any estimation, it is in the best interest of everyonesaint or sinner—to try to keep the peace here and now; and indeed, establishing and maintaining an earthly peace is as fundamental to the responsibilities of the state as protecting the state in times of war.

As for the church’s quest for peace, he writes, “it seems to me that no limit can be set to the number of persecutions which the Church is bound to suffer for her training;” and he opines that persecutions will continue until the final scenes of the current state of human history incidental to the second coming of Christ. Interestingly, Augustine gives no suggestion whatsoever that the rest of the earth will be at peace while this violence against the church continues.  On the contrary, the entire tenor of his argument suggests that anti-Christian violence is merely typical of the violence and disorder that will accompany the human experience until the second coming of Christ.

While men do not agree on which kind of peace to seek, all agree that peace in some form is the end they desire to achieve.  Even in war, all parties involved desire—and fight to obtain—some kind of peace.  Ironically, although peace is the end toward which wars are fought, war seems to be the more enduring, more characteristic of the two states in the human experience. War is the natural (albeit lamentable) state in which fallen man finds himself.  The flesh and the spirit of man—although both are good—are in perpetual opposition:

But what in fact, do we achieve, when we desire to be made perfect by the Highest Good?  It can, surely, only be a situation where the desires of the flesh do not oppose the spirit, and where there is in us no vice for the spirit to oppose with its desires.  Now we cannot achieve this in our present life, for all our wishing. But we can at least, with God’s help, see to it that we do not give way to the desires of the flesh which oppose the spirit to be overcome, and that we are not dragged to the perpetration of sin with our own consent.

Augustine concludes that war among men and nations cannot be avoided altogether because it is simply characteristic of the present existence.  The contention that typifies war is merely the social counterpart to the spirit-body tension that typifies every individual person.  However, man can, through the general application of divine precepts contained in scripture and through the pursuit of virtue as dictated by reason, manage that tension both on the individual and societal levels in such a way as to obtain a transitory peace.  War and peace are two sides of the same Augustinian coin.  Owing to the injustice that is inherent in the mortal state, the former is presently unavoidable, and the latter, in its perfect manifestation, is presently unattainable.

4. Conclusion

In sum, the state is an institution imposed upon fallen man for his temporal benefit, even if the majority of men will not ultimately benefit from it in light of their predestination to damnation.  However, if one can successfully set aside Augustine’s doctrine of predestination, one finds in his writings an enormously valuable descriptive account of the psychology of fallen man, which can take the reader a very great distance toward understanding social interactions among men and nations.  Although the doctrine of predestination is indispensable for understanding Augustine’s theology, its prominence does not preclude one from reaping value from his appraisal of the state of man and his political and social relationships in the fallen “earthly city,” to which all either belong or with which they have unavoidable contact.

5. References and Further Reading

All the primary sources are readily available in English.

a. Primary Sources

  • Augustine.  City of God [De civitate Dei]. Translated by Marcus Dods, in The Nicene and Post-Nicene Fathers. Edited by Philip Schaff.  First Series.  Vol. II. Grand Rapids:  Eerdmans Publishing Company, 1956.
  • Augustine.  On Christian Doctrine [De doctrina christiana]. Translated by J. F. Shaw, in The Nicene and Post-Nicene Fathers. Edited by Philip Schaff.  First Series.  Vol. II. Grand Rapids:  Eerdmans Publishing Company, 1956.
  • Augustine.  On Free Choice of the Will [De libero arbitrio libri III].  Translated by Anna S. Benjamin and L. H. Hackstaff.  New York:  Macmillan Publishing Company, 1964.
  • Augustine.  Our Lord’s Sermon on the Mount [De Sermone Domini in Monte secundum Matthaeum]. Translated by William Findlay, in The Nicene and Post-Nicene Fathers. Edited by Philip Schaff.  First Series.  Vol. VI. Grand Rapids:  Eerdmans Publishing Company, 1956.

b. Secondary Sources

  • Bainton, Roland H.  Christian Attitudes Toward War and Peace:  A Historical Survey and Critical Re-evaluation. Nashville:  Abington Press, 1960.
    • On Augustine and War.
  • Battenhouse, Roy W.  A Companion to the Study of St. Augustine.  New York:  Oxford University Press, 1969.
  • Deane, Herbert A. The Political and Social Ideas of St. Augustine. New York:  Columbia University Press, 1963.
    • The indispensable and definitive work on Augustine’s political and social philosophy.
  • Gilson, Etienne.  The Christian Philosophy of Saint Augustine.  New York:  Random House, 1960.
  • Mattox, John Mark.  St. Augustine and the Theory of Just War.  London:  Thoemmes Continuum, 2006.
    • On Augustine and War.
  • Swift, Louis J.  The Early Fathers on War and Military Service. Wilmington:  Michael Glazier, Inc., 1983.
    • On Augustine and War.

 

Author Information

John Mark Mattox
Email: john.mark.mattox@ndu.edu
U. S. A.

Friedrich Wilhelm Joseph von Schelling (1775—1854)

Schelling F. W. J. von Schelling is one of the great German philosophers of the late 18th and early 19th Century. Some historians and scholars of philosophy have classified him as a German Idealist, along with J. G. Fichte and G. W. F. Hegel. Such classifications obscure rather than illuminate the importance and singularity of Schelling’s place in the history of philosophy. This is because the dominant and most often limited understanding of Idealism as systematic metaphysics of the Subject is applicable more to Hegel’s philosophy than Schelling’s. While initiating the Post-Kantian Idealism of the Subject, Schelling went on to exhibit in his later works the limit and dissolution of such a systemic metaphysics of the Subject. Therefore, the convenient label of Schelling as one German Idealist amongst others ignores the singularity of Schelling’s philosophy and the complex relationship he had with the movement of German Idealism.

The real importance of Schelling’s later works lies in the exposure of the dominant systemic metaphysics of the Subject to its limit rather than in its confirmation. In this way, the later works of Schelling demand from the students and philosophers of German Idealism a re-assessment of the notion of German Idealism itself. In that sense, the importance and influence of Schelling’s philosophy has remained “untimely.” In the wake of Hegelian rational philosophy that was the official philosophy of that time, Schelling’s later works was not influential and fell onto deaf ears. Only in the twentieth century when the question of the legitimacy of the philosophical project of modernity had come to be the concern for philosophers and thinkers, did Schelling’s radical opening of philosophy to “post-metaphysical” thinking receive renewed attention.

This is because it is perceived that the task of philosophical thinking is no longer the foundational act of the systematic metaphysics of the Subject. In the wake of “end of philosophy,” the philosophical task is understood to be the inauguration of new thinking beyond metaphysics. In this context, Schelling has again come into prominence as someone who in the heyday of German Idealism has opened up the possibility of a philosophical thinking beyond the closure of the metaphysics of the Subject. The importance of Schelling for such post-metaphysical thinking is rightly emphasized by Martin Heidegger in his lecture on Schelling of 1936. In this manner Heidegger prepares the possibility of understanding Schelling’s works in an entirely different manner. Heidegger’s reading of Schelling in turn has immensely influenced the Post-Heideggerian French philosophical turn to the question of “the exit from metaphysics”. But this Post-Structuralist and deconstructive reading of Schelling is not the only reception of Schelling. Philosophers like Jürgen Habermas, whose doctorate work was on Schelling, would like to insist on the continuation of the philosophical project of modernity, and yet attempt to view reason beyond the instrumental functionality of reason at the service of domination and coercion. Schelling is seen from this perspective as a “post-metaphysical” thinker who has widened the concept of reason beyond its self-grounding projection. During the last half of the last century, Schelling’s works have tremendously influenced the post-Subject oriented philosophical discourses. During recent times, Schelling scholarship has remarkably increased both in the Anglo-American context and the Continental philosophical context.

Table of Contents

  1. Life
  2. Philosophy
    1. Naturphilosophie and Transcendental Philosophy
    2. Identity Philosophy
    3. The Middle period
    4. Positive Philosophy
  3. Influences
  4. References and Further Reading
    1. Primary Sources
    2. Secondary Sources

1. Life

 

Friedrich Wilhelm Joseph Schelling was born on 27 January, 1775 in Leonberg, Germany. His father was Joseph Friedrich Schelling and mother was Gottliebin Maria Cless. In 1785 Schelling attended the Latin School in Nürtingen. A precocious child, his teachers soon found nothing more to teach him. In 1790, Schelling joined the Tübingenstift, a Protestant Seminary, in Tübingen where he befriended Hölderlin who was later to become a great German poet, and Hegel who was to become a great philosopher. In 1794 Schelling published Über die Möglichkeit einer Form der Philosophie Überhaupt, in the same year of the publication of Fichte’s  Wissenshaftlehre. Fichte’s Wissenshaftlehre, along with Kant’s Critique of Judgment that was published four years before (1790), proved to be of decisive importance for Schelling’s early philosophical career. In 1798 at the age of just 23, Schelling was called to a professorship at the University of Jena where he came in contact with German Romantic poets and philosophers like the Schlegel brothers and Novalis. He also met August Wilhelm Schlegel’s wife Caroline Schlegel and there begun one of the most fascinating and scandalous romantic stories of that time, leading to Caroline’s divorce and her marriage to Schelling in 1803. In 1803 he left Jena for Würzburg where he was called to a professorship. In the Autumn of 1805 Würzburg fell to Austria. The following year Schelling left for Munich where he was to stay till 1841 apart from a break between 1820-1827 when he lived in Erlangen. In 1809 Schelling published his great treatise on human freedom, Philosophical Inquiries Concerning the Nature of Human Freedom. A few months later Caroline died.. Schelling was devastated. In 1812 Schelling married Pauline who was to remain his life long companion. In 1831 Hegel died. In 1840 Schelling was called upon to the now vacant chair in Berlin to replace Hegel where he sought to elaborate his Positivphilosophie which was attended by the likes of Søren Kierkegaard, Alexander Humboldt, Bakunin and Engels. In 1854 on 20 August Schelling died at the age of 79 in Bad Ragaz, Switzerland.

2. Philosophy

 

Encounter with the works of Schelling often baffles the scholars and historians of philosophy. Schelling’s works seem to exhibit the lack of consistent development or systematic completion which most of his contemporaries possess. As a result scholars and historians of philosophy complain of the absence of a “single” Schelling. Recent scholarship, however, while accepting the often disruptive and discontinuous movement with which Schelling’s thinking moves that defies and un-works the completion of a single definite philosophical system, finds issues that are singular to Schelling’s continuous attention and unceasing concern. Thus the absence of a systematic completion is what has become the source of fascination for recent Schelling scholarship. Schelling appears to be the mark that delineates the limit of the systematic task of philosophy, “the end of philosophy and the task of thinking” as Heidegger says. Prominent Schelling scholars like Manfred Frank and Andrew Bowie (1993) have, however, pointed out that Schelling had never abandoned the idea of ‘system’, although the idea of ‘system’ was no longer grounded on a restricted, narcissistic concept of reason as totalizing and self-grounding but as opening to that which cannot be thought in the concept.

For the sake of convenience we can roughly divide the philosophical career of Schelling into four stages:

a. Naturphilosophie and Transcendental Philosophy

b. Identity philosophy

c. The Middle period: Freedom essay and The Ages of the World

d. Positive Philosophy (Philosophy of Mythology and Philosophy of Revelation)

a. Naturphilosophie and Transcendental Philosophy

The significance of Schelling’s early philosophical works lies in its radically new understanding of nature that departs significantly from the then dominant philosophical and scientific understanding of nature. Perhaps the best the way to approach the Schelling of Naturphilosophie is to see him, on the one hand, in relation to the dominant mechanistic determination of nature at that time, that of the Newtonian mathematical determination of nature according to which nature follows certain determinable physical laws of motion and rest, and that can be grasped in the objective cognition that has universal and non-relative validity and on the other hand, as a development of post-Kantian philosophy that led to a radical revision of Kant himself. Schelling’s philosophy of nature thus arose out of the demand to respond to the mechanistic determination of nature that was dominant at that time on the one hand, and to respond to the problems that arose in Kant’s division of the phenomenal realm of nature and noumenal realm of freedom. This demanded a dynamic philosophical account of nature where nature is no longer seen as a totality of objects that are a mere inert, opaque mass, but nature that is subjected to universal laws of causality. Such a dynamic philosophy of nature must be able to resolve the abyss that is opened up in the wake of Kant’s Critique of Pure Reason. It is the abyss between the deterministic, causal, conditioned realm of understanding on the one hand, and the unconditioned realm of ethical self-determination on the other hand, between theoretical philosophy and practical philosophy. The task that the Post-Kantian philosophy has given to itself is to bridge this gap between the conceptual, constitutive realm of nature which can be grasped by causal laws that has universal validity, and the ethical spontaneity of the practical reason where the ethical subject is beyond the conditioned realm of determination and is thus a free Subject of self-determination. This Subject is the Subject of freedom that cannot be grounded in the constitutive principles of understanding but in the regulative Ideas of reason. J. G Fichte sought to unify the theoretical reason (that is “understanding”) and the practical reason by  grounding them both in the dynamic activity of the self-consciousness that posits itself as pure, unconditioned act of self-positing ‘I’. The task of accounting for the process of emergence of the world of nature, which is thus a dynamic process, is addressed by Fichte thus: nature is an essential self-limitation of the ‘I’. The unconditioned, infinite self-positing ‘I’, in order to know itself as itself, divides itself into the finite ‘I’ and its counter-movement “Not-I”. In this manner, Fichte claimed to have resolved the problem that appeared to him and to the post-Kantian philosophers as that which is left unresolved by Kant himself. This is the question of how to account for the mysterious X, “the thing-in-itself” which, according to Kant, can never be grounded in the constitutive principle of understanding. As the condition of possibility of knowledge, “the thing-in-itself” can never be known. It is irreducible to the concepts of understanding. Fichte in his Science of Knowledge accounts for the genesis of this “thing-in-itself” in the pure self-positing act of the ‘I’. Since the ‘I’ cannot be an object of outer sense like any other objects of cognition ( Kant prohibits this), ‘I’ can only emerge in a pure, primordial act of inner-self. This self-emerging ‘I’ cannot therefore be an object of conceptual cognition, of an empirical intuition. It can only be grasped in the inner sense in ‘intellectual intuition’ which is none but ‘the fact of self-consciousness.’ According to Fichte, ‘the thing-in-itself’ is this self-emerging self-consciousness which is a ‘fact’ unlike any other ‘fact’. It is a fact that only ‘intellectual intuition’ grasps in the act of pure self-intuition. This is because only a being capable of intuiting itself as simultaneously representing and represented can account for the unity of representation and object. For such a being, that is ‘I’, there is no other predicate than itself. It is its own object. This object for it appears as nature which is the self-limitation of the self-positing Subject. Fichte’s idealism later came to be known as Subjective Idealism.

Schelling’s early works flourished under the influence of Fichte’s thinking. In 1797 Schelling published an essay called Treatise Explicatory of the Idealism in the “Science of Knowledge” in Philosophisches Journal edited by Immanuel Niethammer. This essay is crucial document for understanding the transition from Kantian critical philosophy to German Idealism. While attempting to elucidate what Kant would have intended if Kant’s philosophy is to prove internally cohesive, Schelling moves to the task of unifying theoretical and practical philosophy in a single principle in such a manner that he actually moves beyond both Kantian and Fichtean philosophy. What allows this unification of theoretical and practical philosophy is the Spirit’s infinite striving to represent the universe. The Spirit is not a static entity given, something mysterious X, but infinite becoming and infinite productivity. It is in this ceaseless production lies the organic nature of human Spirit that is moved by its immanent laws and that has its purposive-ness within itself. Schelling here introduces the notion of organism which unites in its immanence its goal and purpose, its form and matter, concept and intuition. As such each organism is a system which is “an arabesque delineation of the soul” or “eternal archetype” that finds expression in every plant. As immanent unity of form and matter that orients itself towards absolute purposive-ness through successive stages, this organism is not thus mere static, lifeless entity but is said to exhibit life. The Idealist notion of the system here takes this unified world of organism as model. Intuition is the unity of form and matter, representation and object which is distinguishable only in the concept that freely repeats the originary unity. With the help of the schematic power of the imagination, concept here produces the individual object of cognition. The succession of representation occurs alternately in a circle. To move beyond this circle of theoretical knowledge, this circle where the object always returns, it is necessary to introduce an act of free self-determination which cannot be further determined. This act is the absolute act of free will which is primordial and infinite. It is with this act the theoretical and practical philosophy is united.

In the same year Schelling published his  Naturphilosophie that further elaborates the concept of organism through analysis of natural phenomena with the help of scientific studies of the day. This work responds to the dual tasks mentioned above. On the one hand it must give an account of a dynamic process of the emergence of nature as against the mechanistic, deterministic understanding of nature; and on the other hand, to resolve the problem left by Kant, that of bridging the realm of theoretical and practical philosophy by developing a dynamic philosophy of nature that takes into account Fichtean dialectical philosophy of consciousness. Like the Treatise of the same year, this new philosophy of nature is not grounded in the self-positing, unconditioned self-consciousness but by positing a “non-objective”, unconditioned in nature itself which Schelling calls “productivity”. It is this productivity that emerges through the logic of polar oppositions between subject and object that is shown to lead to a higher subject-object synthesis. For Schelling such a dialectical logic is deducted as a movement of potencies. The first potency is the movement of infinite to the finite. The second potency makes the reverse movement, while the third potency alone, which is higher than the other two, unities preceding potencies. In this manner Schelling explains magnetism as the first potency, electricity as the second and chemistry as the third potency that dialectically sublates the other two. Schelling’s philosophy of nature that attempts to develop the dynamic process of Idealism from the objective side can be seen as a parallel development to the Subjective Idealism that is elaborated by Fichte.

In the Treatise Explicatory of the Idealism in the “Science of Knowledge” of 1797 Schelling hints at the idea of “the history of self-consciousness”. The Spirit through its originary activity presents the infinite in the finite, a movement whose goal is self-consciousness that marks the unification of theoretical and practical philosophy, nature and history. Schelling perfects this model in his System of Transcendenatl Idealism.   Schelling’s publication of The System of Transcendental Idealism in 1800 brought immediate fame to the young 25 year old philosopher. Schelling here draws from Fichte’s great insight that self-consciousness is not a mere “given entity”. It is not an unknown and inaccessible X,  a mysterious transcendental “in-itself” as the formal ground of cognition, but a coming into presence of itself, a pure self-positing emergence through the dialectical process of self-positing and self-limitation. In that way a “history of self-consciousness” can be deduced from one principle that explains the coming into being of the theoretical cognition that at its limit passes into the practical realm of freedom, that is, the objective world of history . This is the task of Schelling’s System of Transcendental Idealism of 1800. Thus the axiomatic sense of Fichtean I=I is transformed into the dynamic deduction of the self-consciousness by one principle. This is emergence of the Idealist notion of System whose possibility, according to the Idealists, is already given in Kantian Critical philosophy; a possibility is denied by Kant himself.

“The history of self-consciousness” comes into being in three stages or epochs. While the first epoch manifests the coming into being of “productive intuition” from “original sensation” and the second epoch manifests the emergence of “reflection” from “productive intuition”, the third epoch recounts the emergence of “the absolute act of will” from “reflection”. At the end of the third epoch, “the history of self-consciousness” passes into the practical realm where the deduction of the concept of history is shown to be the realm of unity of freedom and necessity. This has led Schelling to ask at the end of System: how the Subject which is now a completed self-consciousness can become conscious of that moment of its origin which is now unconscious for it, a past that appears to have receded into an immemorial origin and is inaccessible? It now appears that the condition of possibility of consciousness as such remains irreducible to consciousness itself. This is the problem that has become decisive, not only for Schelling’s subsequent philosophical career, but for the fate of Idealism as such. It now appears as if our self-consciousness is driven or constituted by an unconscious ground, forever inaccessible to consciousness, which can never be grounded in consciousness itself.

For Schelling this shows the limit of philosophical cognition and at the same time the importance of works of art. By refusing the claim to say or represent the synthesis of unconscious and conscious, the work of art rather shows it. Therefore art can be said to be the “the eternal organ and document of philosophy” whose basic character is an “unconscious infinity” that arises in the work of art’s synthesis of nature and freedom. While the artist initiates a work of art with a manifest, conscious intention, she, in an unconscious and unintentional manner, depicts infinity without representing or saying it. Such an unintentional showing exceeds the representational acts of consciousness. It cannot be reduced to categorical statements. Therefore works of art cannot be understood on the basis of pre-given set of rules. Works of art are not exhausted in the normative or axiomatic definitions as to ‘what constitutes art as such’. What constitutes the ‘essence’ of art lies rather in its excess of showing over the said. In that sense works of art are more analogous with organisms by virtue of its existing as a link between unconsciousness and consciousness. Such a link can only be shown and therefore remains irreducible to the propositional character of judgment. Schelling develops such insights further in his lectures on The Philosophy of Art (1802), two years after The System of Transcendental Idealism . Unlike Hegel’s lectures on Aesthetics where Hegel argues that “the work of art is a thing of the past” in so far as it no longer has an essential relation to the Absolute even though works of art will continue to be produced, and thus pass into the sobriety of philosophy’s Absolute Knowledge, Schelling sees works of art and philosophy as manifesting the differential mode of the Absolute where art retains an essential, singular and irreducible role.

b. Identity Philosophy

In 1795,  Friedrich Hölderlin published an article called On Judgment and Being that has proved to be of decisive importance for the later development of German Idealism. In this small article Hölderlin attempts to think of an Absolute identity, a prior and originary ground of consciousness that cannot be grasped or known within the immanence of self-consciousness. Hölderlin calls this originary identity “being”( Seyn) which he distinguishes from Judgment ( das Urteil). Hölderlin here attempts to think of an originary identity that grounds the reflective judgment. According to Hölderlin this reflective judgment which is the unity of a disjunction, separation or difference between the subject and the object, must already presuppose an originary identity before judgment. In so far as judgment presupposes the difference between the subject and the object of consciousness, it must already be grounded in an identity. This identity is being (Seyn) which, because of its ground character, remains irreducible to the reflective consciousness. In order for judgment to be possible, it must be grounded in a principle that exceeds judgment itself. This originary identity is being which is before or without consciousness.

In his Identity philosophy, Schelling too attempts to move beyond the immanence of self-consciousness and the circle of reflective judgment. With this move, Schelling decisively breaks away from the Fichtean subjective Idealism. The question of ‘I’ is no longer the point of departure, unlike that of Fichte’s absolute ‘I’ that is not an inert substance but arises purely in the act of self-positing. Rather, here it is the question of consciousness as a result of a process which is to be grasped not merely from the side of the Subject of self-consciousness but from the other side as well. This relation between subject and object thus can no longer be grounded within self-consciousness itself but in an absolute indifference that is prior to this distinction and hence, that can only be presupposed but is never accessible to reflective judgment or to the categories of understanding. Unlike that of reflective philosophy, the question is no longer that of making a correspondence between the subject and the object of consciousness. Such a representational philosophy of correspondence is here abandoned. The problem is rather that of explaining the manifestation of a finite world from a ground that is forever excluded from the infinite chain of conditioned, finite, particular entities. In order not to fall into dualism, which Jacobi alludes is the dualism between the unconditioned ground on the one hand and the infinite chain of conditioned, finite entities on the other, Schelling has to explain the manifestation of the finite world out of its unconditioned ground, from an absolute indifference, without falling into the logic of reflective thinking which Hegel later uses to develop in his Phenomenology of Spirit. This is the emergence of the finite world of entities that are connected to each other in an infinite chain of predicates from an originary indifference which is unconditioned. This emergence is not a smooth transition but a qualitative leap, a diversion, a falling away (Abfall) from its originary ground. Later in his critique of Hegel, Schelling argues that such a leap cannot be understood on the basis of Hegelian modality of dialectical negativity that arrives at absolute knowledge only on the basis of the self-cancellation of the finite.

Perhaps the most lucid and systematic exposition of Schelling Identity philosophy will be found in his posthumously published lecture called The System of Philosophy in General and of the Philosophy of Nature in Particular (1804). Schelling gave this lecture during his brief years of stay at Würzburg. Schelling here begins with the proposition which according to him is the first presupposition of all knowledge, that is: “the knower and that which is known are the same”. This proposition immediately puts into question the correspondence theory of truth and knowledge that was dominant at that time. The correspondence theory of knowledge posits two principles – the subject and the object of knowledge – which are then sought to be reconciled in a higher synthetic principle. According to Schelling, once this dualism is posited, the possibility of knowledge itself becomes inexplicable. Therefore Schelling begins with an absolute identity of the known and the knower, an identity that cannot be posited within subjectivity. With this notion of absolute identity beyond subjectivity, Schelling definitely breaks with Fichte’s Subjective Idealism and Kant’s reflective philosophy. Distinguishing his Identitätssystem from both Empiricism and merely subjective Idealism, Schelling here introduces the notion of the Absolute that has proved to be of crucial importance for German Idealism in general. The absolute identity is the unconditional identity of the subject and the object, idea and Being, Ideal and Real both at once, immediately posited and not discreetly. As immediate knowledge of the absolute, this system of identity is distinguished from what Schelling calls “common sense understanding”.

The common sense understanding distinguishes conditional knowledge, which is synthetic, real knowledge from unconditional knowledge, which is analytic and thus is no real knowledge. Here common sense understanding comes to an irresolvable aporia: either I have real, objective knowledge, but then I renounce the unconditional; or, I have the unconditional in which case it is merely subjective and thus is no real knowledge. According to Schelling, this irresolvable aporia is the aporia of Kantian philosophy  which Kantian dogmatism can never resolve. This demands a move beyond Kant’s critical philosophy. This move which inaugurates German Idealism consists of going beyond the mediated knowledge of the Absolute to the immediate knowledge of the Absolute which is an immediate affirmation of this affirmation. As immediate knowledge of the absolute, Reason is Absolute Knowledge. From this idea Hegel’s notion of the Absolute is not far.  Unlike Kant’s regulative idea of Reason, Reason here is the idea of God as an immediate, absolute, unconditional identity. The immediate awareness of the Spirit of its absolute will which can never be further grounded in concept, is what Schelling calls in this essay ‘intellectual intuition’. It is intuition because it is not yet mediated by concept, and it is intellectual because it goes beyond the empirical in that it has as its predicate its self-affirmation. As the unconditional ground of all knowledge, ‘intellectual intuition’ does not belong even to inner sense. Thus what Fichte calls ‘intellectual intuition’ is no longer seen here as belonging to the inner sense but to the unconditional absolute which is beyond the circle of self-consciousness. “The fact of consciousness” is not originary, for there must already be a priori identity before differences come to manifest in consciousness. The essence of Reason can be said to be ‘intellectual intuition’ whose object is exclusively the absolute which is monolithic, one and only substance. By virtue of this affirmation, Reason recognizes “the eternal impossibility of non-being”. Being is not a predicate of God as something lying outside or exterior, but God and being is immediately, unconditionally one without duration. This absolute identity is infinite by virtue of its idea. Therefore God can neither be thought as the end result of the self-negation of difference, nor being involved in a process of emanation. The indivisibility and univocity of God is neither a numerical concept nor a concept of totality as aggregate unity of finite particulars. This is because the indivisibility and univocity of God is the ground for infinite divisibility in form or in accidents. How can the existence of finite, particulars be explained within Identitätssystem?

In regard to the absolute identity, these finite, particulars are surely non-being, non-ens, non-essentials that can neither subtract nor add anything to the essence of the being who is the absolute substance. The existence of the finite, particulars can only be understood, not as modification of essence, but as modifications in form. They are non-being in respect to the universal which is absolute identity, but considered independently, they are not completely devoid of being. They are in part being and in part non-being. As such they are “real” or “concrete” things, irreducibly finite, particular, multiple, whose ground of existence does not lie within themselves but in that absolute identity of Being and essence. Schelling here deduces the finitude of particulars which ‘common sense understanding’ calls ‘actuality’, not as a process of emanation from the absolute identity, but as negativity that adheres in all finite things. Since these finite things cannot have positivity of being within themselves, they must therefore always relate themselves to other finite things, all sensuous cognition of them can only be non-cognition. Schelling here radically departs from Kant. For Kant all cognition is cognition of the sensible but not of the supersensible. By contrast Schelling argues that all of our sensory knowledge is only a privation of knowledge, or rather, “a negation of knowledge”. Hegel argues in a similar manner in Phenomenology of Spirit (1807) where he shows in a dialectical manner, the vanity of the supposed certitude of sensuous cognition.

One can present the schema of Schelling’s Identitätssystem as follows. God as absolute identity is an essential, qualitative identity. Absolute indifference follows from this essential identity of the absolute. Therefore, absolute indifference is not in-itself essential but a quantitative identity. There is thus a difference between absolute identity and absolute indifference. The opposition between real and ideal, subject and object arises out of this indifference. This is the birth of the finite world. Schelling here introduces the theory of potencies in triplicates that are “the necessary modes of appearances of the real and ideal universes”. While the potencies in triplicates are “the necessary modes of appearances” of the finite universes, they are not applicable to the absolute identity. The absolute identity is thus without potency or devoid of power. The potencies are those modes of appearances that make manifest the non-essential. Therefore they all have equal dignity in relation to the absolute. No potency has priority over the others temporally, for they are not posited successively in a genetic sequence but simultaneously, with equal primordiality. As such, they constitute a circle where all the potencies are posited together but not in an equal manner. Each time the potencies are posited, a particular potency predominates, subjugating the others to their relative non-being. At another time another potency predominates in an alternate manner, always returning to the same and always going away, always being attracted and repulsed, always contracted and expanded in an alternate, circular manner. In this alternating,  rotatory movement of potencies the Real principle comes first as the ground or condition of the Ideal Universe. The Ideal universe then overcomes the Real principle, its conditioning and grounding factor, by relegating it to its relative non-being. Only the higher synthetic principle can unify both the Real and Ideal universes by inhering in both and yet separating each from the other. Schelling presents the theory of potency in the following formula:

A3

—————–

A2 =  (A=B)

Where:

A=B  :   The domination of the Real or affirmed. It is A1

A2     :    The domination of the Ideal

A3     :    Indifference between the other two

With the theory of potencies Schelling explains the existence of the finite universes which are originally one. Their existence is neither completely being nor nothing, but a relative being and relative non-being. As relative being and relative non-being, potencies exceed each time from the immanence of self-presence. They never arrive at the absolute equilibrium of forces without ceasing themselves to be potencies. The circle of the potencies never comes to standstill, or that they do not come out of the circle unless a will superior to this circle of the conditioned existence breaks in.

Three years after this lecture, Hegel published his magnum opus Phenomenology of Spirit. In his Phenomenology of Spirit published in 1807, Hegel apparently criticizes Schelling’s notion of the Absolute indifference as “the night where all cows are black”. In a letter to Hegel, Schelling asks Hegel to clarify in the Preface to the Phenomenology whether this criticism is applied to him or to others who misuse Schelling’s ideas. Hegel did not incorporate this clarification in the subsequent edition of Phenomenology that the criticism is applied, not to Schelling, but to others. This led to the break in the friendship between the two philosophers who shared the same room at Tübingenstift. While this friendship was profoundly important and fruitful for both of them, the bitterness proved to be equally decisive for the development of  their singular modes of thinking, one leading to the task of systematic completion of the metaphysics of the Subject, the other leading to the attempt to inaugurate a new thinking beyond such a metaphysics of the Subject.

c. The Middle period

 

Published in 1809, Philosophical Inquiries into the Nature of Human Freedom is perhaps the most important book that Schelling published in his life time. Along with Hegel’s Phenomenology of Spirit, Fichte’s Science of Knowledge, and Kant’s Critique of Judgment, this essay is one of the greatest philosophical achievements of the late 18th and 19th century Germany. Published immediately before the death of Caroline, it evokes “a deep, unappeasable melancholy” that adheres to all finite beings. Here Schelling does not pose the question concerning the essence of human freedom as the dialectical problem between nature and freedom. Freedom does not appear here as the free exercise of the rational Subject’s will to mastery over its sensuous nature, but as the capacity to do evil. The question thus posed is no longer one question amongst others but the metaphysical question concerning the possibility of a system of freedom. On the one hand, freedom appears to be that which cannot be included within a system at all; on the other hand, the demand of Idealism that there must be a system without which nothing is adequately comprehensible is not to be renounced. The essay attempts to reconcile these two incommensurable demands: the demand of the unconditionality of freedom that grounds being and the demand of the grounding act of the system. This attempt at the system of freedom arose in the wake of what came to be known as the “pantheism controversy”.

The pantheism controversy is centred on the supposedly atheistic figure of Spinoza. During the late 18th century, and early 19th century, the dominant understanding of Spinoza was that of a pantheist and consequently an atheist. It is understood that within the pantheistic system of Spinoza’s ethics wherein God is immediately identified with the world, there is no place for the affirmation of God as unconditional reality. If the world is only a totality of conditioned, finite beings, then the unconditioned existence of God cannot be understood to be immediately identifiable with the world, and consequently with any dogmatic, rational system.  In the famous pantheism controversy, Friedrich Heinrich Jacobi attempted to show that a system of rational knowledge never arrives at the unconditioned since, for such a system, the unconditioned can only arise as a result of a process where the one conditioned leads to other conditioned in an infinite chain of negativity. To be properly concerned with the unconditioned, one must begin with the unconditioned itself  which no rational knowledge ever attains. For Jacobi it is only the leap of faith beyond the system of rational knowledge that enables us to open to the unconditionality of the absolute being. Therefore all system of rational knowledge for Jacobi is nihilism. Jacobi thereby becomes the first to use the word “nihilism” that arose in the context of the pantheism controversy.

Schelling here agrees with Jacobi about the limit of purely rational attainment of the unconditioned. Schelling, however, disagrees with Jacobi’s use of a limited and restricted notion of ‘system’ and ‘freedom’, along with Jacobi’s restricted use of the metaphysical and logical notion of judgment. In the Freedom essay Schelling attempts to re-interpret the logical and metaphysical notion of judgment in such a manner that it opens up to the unconditioned character of freedom without renouncing the demand of a system. Such a system must, on the one hand, be other than a purely formal, lifeless realism of Spinoza; and on the other hand, it must be otherwise than a conventional system of idealism that reduces the dynamic character of freedom and the world into pure rational necessity. Only a dynamic notion of the system that affirms the exuberance of life and the generosity of freedom can truly be system. The formal, rational notion of freedom as the intelligible principle that overcomes sensuous impulses must be opened to the ontological question of the beings in their becoming. The question of judgment is thus no longer merely a formal logical question but the question of the jointure, or bond of beings. This bond or jointure of beings is grounded in freedom which, understood in more originary manner, is not arbitrary free will but that belongs together with highest necessity. This jointure of beings – the infinite, creative being of God and the finite, created being called ‘man’ – must be  essentially a free relation, a relation that is governed by freedom which in the highest sense is also necessity. If man is free in a certain manner, then this manner is also the manner of man’s individuation. This is to say that to the extent that man is individuated by freedom, man’s freedom is distinguishable from the absolute freedom of the infinite, eternal being called God. This peculiar essence of human freedom is the capacity to do evil.

According to Schelling, the human is distinguished from the eternal creative God by the specificity of his freedom which is essentially and inextricably a finite freedom. God is the being whose condition, though never completely immanent, can be actualized in its very existing. On the other hand, the finite being can never actualize itself completely because the ground of its existence remains inappropriable. This is the source of the fundamental melancholy of all finite beings. The distinction between the absolute freedom of the eternal being and the finite freedom of the mortal can be better understood with the help of Schelling’s distinction between the ground of existence and existence itself. This is not a formal distinction between sensuous nature and intelligible will, but a dynamic distinction of freedom. Eternal or finite, each being is a jointure of  the ground of existence and existence itself. In the eternal, creative being, this jointure is indissoluble. In the mortal, however, there can occur dissolution of this jointure. It is the possibility of the dissolution of the principles that explains the finitude of the finite being, and the freedom of this finite being. The human is essentially finite being, and only such a finite being is capable of evil. Therefore evil is neither divine nor beastly but essentially belongs to the human freedom. Evil has this peculiar, specific relation to human finitude. Unlike the beasts in whom the jointure of the principles is governed by necessity, and unlike the divine in whom the jointure of the principles is indissoluble, human freedom partakes of the divine freedom and is yet separated by an abyss. According to Schelling, this abyss is the possibility of dissolution of the principles.

In the dynamic freedom there are two oppositional principles that never reach equilibrium. In the coming to existence of the finite being there adhere these oppositional principles. There is the dark principle which is the principle of ground, and there is the ideal principle of light. The dark principle that operates in the realm of history as the principle of particularity is the principle of evil. Man is the finite being that unites in himself both of these principles in an equal measure. Since the nexus (band) of these principles in him is free and not governed by necessity, man is free to bring permutation to this nexus. Therefore what ought to remain as mere condition of existence, as mere principle of particularity, man can seek to elevate to totality or to universal domination. Out of this self-affirmation of the finite being who in this self-affirmation seeks to abnegate its very finitude, there arises evil. Thus while the possibility of evil is given to man in the coming into existence of this being, to actualize this principle of possibility is the work of human freedom. As mere ground, this principle is the very source of creative joy and affirmation of life, but elevating it into the universality or totality results into the most terrible form of evil that seeks to negate any form of its life-affirmative character. Thus the source of life and the origin of evil is grounded in the same principle. This principle is the human freedom whose origin remains unfathomable for man. According to Schelling, this unfathomable, inappropriable, unconditional freedom ought to remain inappropriable and unconditional, for the human creates a conditioned world on the basis of the unconditioned freedom. This conditioned world is history. By beginning this new “covenant”, man partakes the creativity of the divine freedom. This is the source of creative joy for the human, for through this creative act of human, the world of nature is redeemed.  But in his vain arrogance and in his self-affirmation that is pushed to the point of absolutization and totalization, the human seeks to negate the finite character of his freedom and thereby seeks to elevate the principle of particularity to the universal domination. Herein lays the evil when the non-being, which is for that matter is not completely devoid of being, seeks to attain the complete, absolute being. Evil is therefore neither being nor nothing, but non-being’s malicious hunger for being. Therefore power of evil cannot be said to be the power of being. It is rather the power of non-being that seeks to devour itself and is never satisfied at any point, because it never reaches being without a remainder of non-being. More it does not reach being, more self-consuming becomes its lust. According to Schelling such is the character of evil.

In The Ages of the World which was written between 1809-1827 and is  found in various incomplete versions, Schelling develops a narrative method that seeks to recount the stages of the world’s becoming through the agonal movement of conflictual forces. This is the germ of Schelling’s theory of potencies. The world as it exists has its ground in a dark, unfathomable past which no work of human reason can ever elevate into thought. This non-reason is not irrationality that is opposed to reason nor is it the negation of the possibility of reason but the ground of reason. Human reason thus exists only as a “regulated madness”. On account of its immanent force alone the human reason cannot attain the unconditioned which is the realm of absolute freedom. The emergence of the world-order is not seen as an immanent order ruled by the necessary principles of reason but has its source in an absolute, unconditional freedom. This freedom can arrive to the finite, mortal being as a gift. Man can never master this gift, because it opens man to his historicity. The essence of history is freedom. “The ages of the world” thus arises out of the unconditional character of freedom. This principle of freedom manifests itself in the agonal movement of contradictory forces, one repulsive and the other attractive. It is this agonal movement of oppositional forces that makes possible the emergence of “the ages of the world” out of the unconditional. This unconditional is that which cannot be further grounded in thought or in self-consciousness, it is what Schelling in his Freedom essay calls “the indivisible remainder” that constantly solicits from finite human beings ‘awe’ or ‘respect’.

Here as elsewhere Schelling’s thought wrestles with the question of the unconditioned. If there is anything that is singular to Schelling’s whole of philosophy, and that unifies Schelling’s often discontinuous philosophical career, it is this question of the unconditioned. Schelling does not explain the existence of the world with the help of logical categories. For Schelling, a rational system constitutive of logical categories cannot explicate the facticity or actuality of the world. It is the unconditional character of freedom whose ground is groundless (Abgrund), this freedom alone opens the world. Therefore there is always something excessive about freedom. In many texts, especially in his 1797 treatise, Schelling evokes a freedom which is not only a promise for the human but also a danger (Gefahr). “The ages of the world” is grounded by a condition which is excessive and unthinkable. The human belongs to the “un-pre-thinkable” ( Unvordenkliche). This is a promise as well as danger. Schelling evokes this excess to explain the possibility of the world and finite existence. This unconditional excess makes the world and being-in-the world as essentially finite and irreducibly mortal. It is this aspect of Schelling’s work that has most profoundly influenced the twentieth century philosophers like Franz Rosenzweig and Martin Heidegger.

d. Positive Philosophy

On 14 November 1831 Hegel died in Berlin. In 1840 Schelling was called to the now vacant chair in Berlin to replace Hegel. The following year Schelling began his lectures on  “positive philosophy” (Positivphilosophie) which was attended by Kierkegaard, Bakunin, Humboldt and Engels. These lectures were delivered in three phases:  Grounding of Positive Philosophy that introduces and grounds Positive Philosophy vis-à-vis the history of Negative Philosophy from Descartes onwards, followed by Philosophy of Mythology (Philosophie der Mythologie) and Philosophy of Revelation ( Philosophie der Offenbarung).

Schelling’s grounding of Positive Philosophy begins with the distinction between the “what” of being and “that being”. “What” of being is being as essence and “that” being is the contingent being’s pure actuality of existence. This actuality is not an attribute of being but its  existentiality, the very facticity of its coming into being. From here comes the distinction between a negative philosophy, that is, the rational philosophy that is essentially concerned with the essence of being (its ‘what’ character) and the positive philosophy that is concerned with the pure actuality of the existence of “that” being which comes into its being. Such a being (“that” being) is not a settled entity that is given, but that which comes into being . Schelling calls such a coming into being, existence. Since this coming into being is not a finished entity but yet becoming and always contingent, it cannot be grasped in the concept. Therefore existence and movement cannot be a logical category. There is a concept only if a being already exists, for by definition concept can only grasp the essence of being which in turn is possible if such a being already exists. Understood in this sense, negative philosophy is not concerned with the facticity of something that exists at all. Therefore it is not concerned with the question “why something exists at all?” The negative philosophy is rather concerned with the question: if and if something exists, what is its essence, what is the “being” character of this being irrespective of the problem whether such a being exists as “this” being at all.

For example, when Kant argues against the ontological proof of God, he argues neither for the existence of God nor for its non-existence. He only argues that the concept of God is not extendable to the existence of God because ‘existence’ cannot be predicated. In so far as ‘existence’ cannot be predicated, its actuality or facticity can only be for rational philosophy a presupposition. This presupposition is a point of beginning whose existence can only be deduced only if such an existence is already granted; only if such and such a being has already revealed itself. What then Kant’s philosophy shows, for Schelling, is the limit of negative philosophy, a limit that constitutes the possibility of negative philosophy. Schelling does not contest the possibility of negative philosophy, but precisely demands it however, on the condition that it recognizes this limit that is constitutive of it and does not pretend to be able to constitute itself as absolute system that includes the concept as well as existence of being. What Schelling finds problematic in Hegel is not that there should not be negative philosophy, but of Hegel’s claim to include existence in a system that is logical and purely negative system. For Schelling, Hegel’s extension of his negative notion of system to the Absolute totality without outside is without justification. For Schelling there always remains a remainder of such a system of negativity, which is the positivity of existence. Hegel’s system is founded upon purely negative relation of the finite being in relation to other finite beings where the unconditioned is supposed to be reached as a self-negation of negation. According to this conception, the unconditioned is the end result of a process of the self-cancellation of finite, conditioned entities. As early as 1804 in a lecture in Würzburg on The System of Philosophy in General Schelling contests this idea of the absolute as the end result of a process of the self-negation of finitude. According to Schelling, such a system is based upon a false premise and a presupposition. It presupposes to have reached the unity of being and thought, while it reaches such a unity merely in thought that means, only from negative side. It leaves out the pure actuality of existence whose unconditional character of its being cannot be merely the result of a dialectical process of the self-cancellation of finitude. Unlike Hegel’s claim, a purely negative philosophy cannot be presupposition-less. It presupposes what it cannot incorporate within its systemic edifice. This limitation of negative philosophy demands a positive philosophy that begins with the unconditionality of existence, with a prius whose existence can only be proved posteriori once there is a manifest world. Schelling called  such a positive philosophy, “metaphysical Empiricism”.  Hence the idea of a positive philosophy is where the ground is a presupposition. This presupposition is the unconditional existence of being whose pure actuality no rational knowledge based upon potentiality can ever attain. While the philosophical concept that is essentially concerned with essence can only elaborate the possibility of being, the actuality of being itself is beyond such categorical cognition, for the existence of this being exists as absolute freedom and not as a necessary consequence of a concept.

Here the limit of the Idealist notion of system is reached. Schelling in these lectures shows that the (Hegelian ) notion of the Subject presupposes as its condition that which cannot be further grounded in the Subject itself. One then has to begin from the pure actuality of existence, from a facticity, which is already always before self-consciousness and before thought’s ability to grasp it in the concept. This immemoriality of the origin is the “exuberance of being” that elicits from us awe or respect ( Achtung), because it exposes us to the Infinite that unconditionally and groundlessly exists. It thereby exposes us to our own finitude and mortality.

3. Influences

How deeply Schelling’s later philosophy has influenced Kierkegaard cannot be shown by quoting Kierkegaard or from Kierkegaard’s self-understanding. This can better be shown by understanding Kierkegaard’s anti-systematic notions of “existence”, “temporality” and “finitude” that he understands to be irreducible to the general order of the system. Like Schelling, Kierkegaard understands the question of existence as the highest question of philosophy. There is in existence something that cannot be grasped in the predicative. Likewise, in the realm of history there is a preponderant mass of contingencies that cannot be completely and exhaustively accounted by the speculative dialectical logic. The Post-Schellingian philosophies that are concerned with this problem have the source of their inspiration in Schelling’s later works. For Schelling neither history nor existence is a homogenous process leading straight, necessarily, to a telos of absolute knowledge by irresistible law which is auto-generative and anonymous. History is rather a field of polemos where agonal forces are at work. Kierkegaard’s The Concept of Anxiety begins with a Schellingian note. Kierkegaard here argues, in a manner that recalls Schelling’s critique of Hegel, that the notion of movement does not allow itself to be thought within the immanent speculative logic of Hegel, for the true movement presupposes transcendence which by definition a logical category cannot grasp. The task of Kierkegaard’s philosophy is to open towards an Archimedean point outside totality, or outside the general, normative order of validity. That point cannot be attained within the realm of the ethical, that is, within the homogenous order of universal norms, but in a “quantum leap” of faith. That leap of faith must pass through an existential experience of anxiety (Angst) which no phenomenology of spirit can thematize.

This anxiety has family resemblance with Schelling’s notion of anxiety of the mortal who constantly flees from the fire of the centre and takes shelter in the periphery. In Schelling as well as in Kierkegaard, especially in his Fear and Trembling, this anxiety manifests the irreducible finitude of the mortal being who is seized by the gaze of the wholly other, the divine, holding his hand, tearing him out of the totality of finite knowledge. In his Concluding Unscientific Postscript Kierkegaard attempts to open this universal order of the ethical to the notion of subjectivity, the subjectivity of that singular individual for whom transcendence of the wholly other is an existential interest. This existential interest, argues Kierkegaard, cannot be addressed within the immanent order of the system. One of the most prominent tendencies of the post-Schellingian philosophy is this question of existence from the religious point of view. For Schelling himself the question of religion remains irreducible to the rational-logical system of knowledge. The transcendence of the absolute cannot be reduced to a theodicy of history. As early as 1804, Schelling warned in his Philosophy and Religion against the danger of the acts of legitimacy by the earthly power in the name of the embodiment of the divine in the profane body. Religion for Schelling, as for Kierkegaard remains irreducible to the violence of a historical reason that constantly evokes a theological foundation for the justification of its domination. As against this theologico-political foundation, Kierkegaard evokes the whole other God. Thus religion cannot be used as the foundation of the profane in order to legitimize the power of earthly sovereignty, because religion essentially opens us to a non-foundation that eternally delegitimizes any earthly power, like the power of the State. In his 1804 lecture Philosophy and Religion and in his Stuttgart lectures of 1810, Schelling raises this important theologico-political question that has profound significance for our contemporary historical world. The recent upsurge of the question of political theology attempts to go back to Schelling to see how Schelling helps us to think of a critique of historical reason.

Such a question is pursued further by Franz Rosenzweig, a German Jewish philosopher who is contemporary of Martin Heidegger. Rosenzweig’s first scholarly work was his doctoral thesis on Hegel called Hegel and the State. In the wake of his horror of the First World War, Rosenzweig soon abandoned Hegelianism; his The Star of Redemption, which he wrote on post cards to his mother  when he was in the Balkan Front, is an anti-Hegelian work. In this book, that evokes Schelling’s later works as one of the main sources of inspiration, Rosenzweig envisions the messianic notion of history and redemption beyond the closure of a historical-speculative reason. This remarkable book begins with the question of existence which he takes from Schelling’s later works. It is the notion of the individual, finite existence whose fear of death cannot be consoled by the concept of the universal history. This demands opening up the closure of the universal historical reason to the arrival of redemption that is always to come. This eternity which is always to come, that alone can redeem the violence of a historical reason, does not itself belong to the “Philosophy of the All”. Rosenzweig’s critique of “the philosophy of the All” begins with Schellingian critique of Hegel, that existence precedes thought and thus it cannot be enclosed within the All. It is what falls outside totality or system, and in this manner opens the world to the messianic event of pure future. The messianic arrival of eternity does not allow itself to be reduced to the theological foundation of the profane order, like the power of the State. Thus the State is no longer an expression of the Absolute. Like Schelling, Rosenzweig’s later works are deeply suspicious of the theodicy of history that legitimizes the political sovereignty of the State.

The question of existence is important for Martin Heidegger’s early philosophical works. What Heidegger calls in his early works “hermeneutics of facticity” has resonance with Schelling’s notion of actuality of “that”, the pre-predicative, pre-conceptual and pre-categorical disclosure. The existential analytic of Dasein that Heidegger elaborates in his Being and Time and his deconstruction of the metaphysical foundation of logic has inspiration in Schelling’s attempt to open the system of negative philosophy to the more  originary revelation of being. Schelling’s positive philosophy seeks to release philosophy beyond its metaphysical foundation in the logic of the thinkable to a disclosure that can only be shown a posteriori . In this sense Schelling’s metaphysical empiricism is at once an exit from the metaphysics founded upon the notion of the predicative truth. What both Heidegger and Rosenzweig have sought to complete is this exit from metaphysics.  Heidegger’s 1936 lecture on Schelling shows the real importance of Schelling’s thinking for him.

The exit from metaphysics is a fundamental problem even for Marx. Ernst Bloch, whom Jürgen Habermas calls “Schellingian Marxist”, combines a certain version of Marxism and messianism that envisions a utopian fulfilment oriented towards the “not yet”. His The Spirit of Utopia and his later work The Principle of Hope evoke a notion of history that is disruptive, opening to the “not yet”, a fundamental affirmation of future which Schelling always insisted as the very creative, free task of philosophy. While Schelling has attempted to open the radical notion of future in a certain eschatological-theological manner, Bloch’s messianism is essentially an atheistic eschatology.

Schelling’s influence is seen to be growing in our contemporary philosophical world. Thus Jean Louis Chrétien, the French philosopher, has drawn on Schelling from a certain phenomenological perspective. In his Unforgettable and the Unhoped for, Chrétien is concerned with the immemoriality of a promise that arrives from the extremity of time, from an eschatos of future always to come. Chrétien draws here on Schelling’s notion of the eternal past which has not come to pass but that is always a past, an immemorial past that, being the principle of foundation, always opens the world to its futurity. Schelling indeed develops such a notion of an immemorial past in his The Ages of the World. Like Schelling in his various texts, Chrétien too evokes Plato’s notion of Anamnesis as remembrance, not of what has passed, but what has immemorially opened us to truth. What has found us, the excess that opens us to the world, is immemorially lost. For both Schelling and Chrétien, this is not the occasion of despair but the occasion of a creative freedom and the possibility of future.  In recent years the Anglophone philosophical world has been witnessing increased attention to Schelling’s works. This shows the continuing relevance of Schelling in our contemporary historical existence. Schelling’s philosophy has come to be interpreted and understood as a philosophy of affirmation and a philosophy of the exuberance of life as against petrified system of concepts. Jason Wirth’s recent work on Schelling rightly emphasizes the contemporaneity of Schelling for our concerns: our ethical concern with the primacy of Good over truth, the affirmation of life beyond the instrumental use of Reason, the affirmation of the more originary ecstatic temporality, and our deep ecological concerns. The ‘unconscious’ has psychoanalysis speaks of, evokes the notion of ‘unconscious’ in Schelling, the abyss that cannot be further grounded, and hence is unground. In Jacques Lacan’s term, it is the Real that never stops haunting, destabilizing and disturbing the symbolic order of the world. “The indivisible remainder” that Schelling speaks of in his 1809 Freedom essay  is that element of eternal nature as ground that never ceases de-constituting the cultural-historical order of totality. The symbolic order of a restrictive Reason never reaches totality, but always opens to an eternal remnant outside. This question has profound importance of Schelling for our time.

4. References and Further Reading

a. Primary Sources

  • Friedrich Wilhelm Joseph Schelling’s Sämmtliche Werke, ed. K.F.A. Schelling, I Abtheilung Vols. 1-10, II Abtheilung Vols. 1-4, Stuttgart: Cotta, 1856-61.
  • Friedrich Wilhelm Joseph von Schelling, Ausgewählte Schriften, 6 Vols., ed. Manfred   Frank, Frankfurt: Suhrkamp 1985.
  • Aus Schellings Leben. In Briefen (three volumes), Adamant Media Corporations, 2003.
  • The Unconditional in Human Knowledge: Four early essays 1794-6 , trans. F. Marti, Lewisburg: Bucknell University Press, 1980.
  • Ideas for a Philosophy of Nature: as Introduction to the Study of this Science , trans. E.E. Harris and P. Heath with an introduction R. Stern, Cambridge: Cambridge University Press, 1797/1988.
  • System of Transcendental Idealism, trans. P. Heath with an introduction by M. Vater, Charlottesville: University Press of Virginia, 1800/1978.
  • Bruno, or On the Natural and the Divine Principle of Things , trans. with an introduction by M. Vater, Albany: State University of New York Press, 1802/1984.
  • The Philosophy of Art , Minnesota: Minnesota University Press, 1802-03/1989.
  • On University Studies , trans. E.S. Morgan, ed. N. Guterman, Athens, Ohio: Ohio University Press, 1803/ 1966.
  • Philosophical Inquiries into the Nature of Human Freedom, trans. With an introduction by J. Gutmann, Chicago: Open Court, 1809/1936.
  • Clara : or On Nature’s Connection to the Spirit World, trans. Fiona Steinkamp, Albany: State University of New York Press, 1811/2002.
  • The Ages of the World, trans. Jason M. Wirth, Albany: State University of New York, 1811-15/2000.
  • The Ages of the World , trans. F. de W. Bolman, jr., New York: Columbia University Press, 1811-15/1967.
  • The Deities of Samothrace’ , trans. R.F. Brown, Missoula, Mont.: Scholars Press, 1815/1977.
  • On the History of Modern Philosophy, trans. Andrew. Bowie, Cambridge: Cambridge University Press, 1833-4/1994.
  • Philosophie der Offenbarung . ed. M. Frank, Frankfurt: Suhrkamp, 1841-2/1977.
  • Historical-Critical  Introduction  to  the  Philosophy  of Mythology,    trans. Richey, M., Albany, NY: State University of New York Press, 2007.
  • The Grounding of Positive Philosophy: the Berlin Lectures , trans. Bruce Matthews, Albany, NY: State University of New York Press, 2008.
  • Philosophy and Religion , Spring Publications, 2010.
  • Idealism and the Endgame of Theory , trans. Thomas Pfau , Albany: State University of New York, 1994.
  • Philosophy of German Idealism: Fichte, Jacobi and Schelling, ed. Ernst Behler , Contuum, 1987.

b. Secondary Sources

  • Beach, Edward Allen, The Potencies of God(s): Schelling’s Philosophy of Mythology,         Albany: State University of New York Press, 1994.
  • Behun, William A. The Historical Pivot: Philosophy of History in Hegel, Schelling and Hölderlin , Triad Press, 2006
  • Beiser, Frederick C., German Idealism: Struggle Against Subjectivism , Harvard: Harvard University Press, 2008.
  • Bowie, Andrew, Aesthetics and Subjectivity: from Kant to Nietzsche, Manchester:    Manchester University Press, 1990.
  • Bowie, Andrew, Schelling and Modern European Philosophy: An Introduction,     London: Routledge, 1993
  • Brown, Robert F., The Later Philosophy of Schelling: The Influence of Boehme in the Works of 1809-1815 , The Associated University Press, 1977
  • Courtine, Jean-Francois , Extase de la raison. Essais sur Schelling, Paris, Galilée, 1990
  • Distaso, Leonardo V., The Paradox of Existence : Philosophy and Aesthetics in the Young Schelling, Springer, 2010
  • Esposito, Josephe L., Schelling’s Idealism and Philosophy of Nature, Associated University Press, 1977
  • Fackenheim, Emil, The God Within: Kant, Schelling and Historicity , ed. John W. Burbridge, University of Toronto Press, 1996
  • Frank, Manfred, Der Unendliche Mangel an Sein, Frankfurt: Suhrkamp, 1975
  • Frank, Manfred, Eine Einführung in Schellings Philosophie, Frankfurt: Suhrkamp, 1985
  • Frank, Manfred, Selbstbewußtsein und Selbsterkenntnis, Stuttgart: Reclam, 1991
  • Frank, M. (ed).  with Kurz, G., Materialien zu Schellings philosophischen Anfängen, Frankfurt: Suhrkamp, 1975
  • Freydberg, Bernard, Schelling’s Dialogical Freedom Essay: Provocative Philosophy Then and Now , State University of New York Press, 2009
  • Geldhof, J, Revelation, Reason and Reality: Theological Encounters with Jaspers, Schelling and Baader, Peeters, 2007
  • Goudeli, Kyriaki, Challenges to German Idealism: Schelling, Fichte and Kant, Palgrave Macmillan, 2003
  • Grant, Ian Hamilton, Philosophies of Nature After Schelling, Continuum, 2008
  • Hegel, G.W. F., The Difference between Fichte’s and Schelling’s System of Philosophy, Albany: State University of New York Press, 1977
  • Heidegger, Martin, Schellings Abhandlung über das Wesen der menschlichen Freiheit, Tübingen: Niemeyer, 1971. Schelling’s Treatise on the Essence of Human Freedom, trans. Joan Stambaugh, Athens: Ohio University Press, 1985
  • Heidegger, Martin, Die Metaphysik des Deutschen Idealismus (Schelling), Frankfurt: Klostermann, 1991
  • Henrich, D. Selbstverhältnisse, Stuttgart: Reclam, 1982
  • Horn, Friedemann , Schelling and Swedenborg: Mysticism and German Idealism, trans. George F. Dole , Swedenborg Foundation Publishers, 1997
  • Jaspers, Karl, Schelling: Größe und Verhängnis, Munich: Piper, 1955
  • Kierkegaard, Søren, The Concept of Irony/Schelling Lecture Notes : Kierkegaard’s Writings Vol 2, Princeton University Press, 1992
  • Kosch, Michelle, Freedom and Reason in Kant, Schelling and Kierkegaard, Oxford University Press, 201
  • Lauer, Christopher, Suspension of Reason in Hegel and Schelling, Continuum,201
  • Limnatis, Nectarios G., German Idealism and the Problem of Knowledge: Kant, Fichte, Schelling and Hegel , Springer, 2010
  • Marx, W. , The Philosophy of F.W.J. Schelling: History, System, Freedom, Bloomington: Indiana University Press, 1984
  • Norman, Judith and Alistair Welchman , ed.  New Schelling , Continuum, 2004
  • O’Meara, Thomas, Romantic Idealism and Roman Catholicism: Schelling and the Theologians, University of Notre Dame Press, 1982
  • Shaw, Davin Jane , Freedom and Nature in Schelling’s Philosophy of Art , Continuum, 2011
  • Snow, Dale E., Schelling and the End of Idealism, Albany: SUNY Press, 1996
  • Tillich, Paul, Mysticism and Guilt Consciousness in Schelling’s Philosophical Development , Bucknell University Press, 197
  • Tilliette, X. , Schelling une philosophie en devenir, Two Volumes, Paris: Vrin, 1970
  • White, A., Absolute Knowledge: Hegel and the Problem of Metaphysics, Ohio: Ohio University Press, 1983a
  • White, A., Schelling: Introduction to the System of Freedom, New Haven and London: Yale University Press, 1983b
  • Wirth, Jason, The Conspiracy of Life: Meditations on Schelling and His Times, New York: State University of New York Press, 2003
  • Wirth , Jason, (ed.),Schelling Now: Contemporary Readings, Indiana: Indiana University Press, 2004
  • Zizek, Slavoj ,The Indivisible Remainder: An Essay on Schelling and Related Matters, London: Verso, 1996.

Author Information

Saitya Brata Das
Email: satyadx@yahoo.com
The University of Delhi
India

Feminist Standpoint Theory

Feminist standpoint theorists make three principal claims: (1) Knowledge is socially situated. (2) Marginalized groups are socially situated in ways that make it more possible for them to be aware of things and ask questions than it is for the non-marginalized. (3) Research, particularly that focused on power relations, should begin with the lives of the marginalized. Feminist standpoint theory, then, makes a contribution to epistemology, to methodological debates in the social and natural sciences, to philosophy of science, and to political activism. It has been one of the most influential and debated theories to emerge from second-wave feminist thinking. Feminist standpoint theories place relations between political and social power and knowledge center-stage. These theories are both descriptive and normative, describing and analyzing the causal effects of power structures on knowledge while also advocating a specific route for enquiry, a route that begins from standpoints emerging from shared political struggle within marginalized lives. Feminist standpoint theories emerged in the 1970s, in the first instance from Marxist feminist and feminist critical theoretical approaches within a range of social scientific disciplines. They thereby offer epistemological and methodological approaches that are specific to a variety of disciplinary frameworks, but share a commitment to acknowledging, analyzing and drawing on power/knowledge relationships, and on bringing about change which results in more just societies. Feminist scholars working within a number of disciplines—such as Dorothy Smith, Nancy Hartsock, Hilary Rose, Sandra Harding, Patricia Hill Collins, Alison Jaggar and Donna Haraway—have advocated taking women’s lived experiences, particularly experiences of (caring) work, as the beginning of scientific enquiry. Central to all these standpoint theories are feminist analyses and critiques of relations between material experience, power, and epistemology, and of the effects of power relations on the production of knowledge.

Table of Contents

  1. Introduction
  2. Historical Roots of Feminist Standpoint Theory
  3. Central Themes in Feminist Standpoint Theory
  4. What is a Standpoint?
  5. Acquiring Knowledge via Standpoints
  6. The Outsider Within
  7. Controversies
    1. False Universalism
    2. Epistemic Relativism
    3. The Bias Paradox
  8. References and Further Reading
    1. Works Cited
    2. Earlier Papers
    3. Later Contributions

1. Introduction

At first blush there appears a tension between the traditional epistemological assumption that a general, universal and abstract account of knowledge and scientific enquiry is possible, and the politically inflected feminist claim that such analyses are only properly understood in the social contexts in which they arise, and in terms of the biases and prejudices those contexts generate. From the outset, then, feminist epistemologies seem to be located within the contradictory pull of the politicized material and experiential concerns of feminism and the abstract universal concerns of epistemology. Feminist epistemological projects began as a critique of that tradition but have evolved beyond the critical to reframe and reconceptualize the problems of knowledge and the epistemological project itself. Feminist epistemology does not adopt a monolithic critical position with respect to a traditional canon of epistemological work; rather it consists of a variety of feminist epistemological approaches, of which feminist standpoint epistemologies form a strand.

Here feminist standpoint theory is examined primarily as a feminist epistemology and as a methodology for feminist researchers in the social sciences where, arguably, feminist standpoint theory has had the most influence and been the subject of most debate. As with feminist theories generally, it would be somewhat misleading to represent feminist standpoint theory as a single set of epistemological commitments or a single methodological approach. More appropriate would be to think of them in terms of ‘standpoint theories’. Nevertheless, standpoint theories share common commitments and approaches, which are taken as the focus here. Aspects of those theories that attract controversy both within and outside of the intellectual conversations in which feminist standpoint theories have been developed and employed are also briefly discussed.

2. Historical Roots of Feminist Standpoint Theory

The genealogy of feminist standpoint theory begins in Hegel’s account of the master/slave dialectic, and subsequently in Marx and, particularly, Lukacs’ development of the idea of the standpoint of the proletariat. Hegel argued that the oppressed slave can eventually reach a state of freedom of consciousness as a result of her/his realization of self-consciousness through struggles against the master, and via involvement through physical labor in projects that enable her/him to fashion the world—to affect it in various ways. Hegel’s analysis of the struggle inherent in the master/slave relationship gave rise to the insight that oppression and injustice are better analyzed and understood from the point of view of the slave than from that of the master. Marx and Engels, and, later, Lukacs developed this Hegelian idea within the framework of the dialectic of class consciousness, thereby giving rise to the notion of a standpoint of the proletariat (the producers of capital) as an epistemic position that, it was argued, provided a superior starting point for understanding and eventually changing the world than that of the controllers and owners of capital. The Hegelian and Marxist traditions, then, provide the genesis of standpoint theorists’ claim that the ‘double vision’ afforded to those who experience social relations from a position of marginality can, under certain circumstances, offer them epistemic advantage.

Although their genealogy begins in the Hegelian and Marxist traditions, some current feminist standpoint theories are also located squarely within an empiricist tradition in epistemology. These feminist epistemologies extend the traditional empiricist commitment to experience and observation as the starting points for knowledge. Following Quine and his successors, they recognize and acknowledge that observation is theory-laden and that those theories themselves are artifacts of our making. They also draw on the insight that a set of observation-based data can serve as equally credible evidence for more than one of those theories.

3. Central Themes in Feminist Standpoint Theory

Feminist standpoint theorists such as sociologists Dorothy Smith and Patricia Hill Collins, political philosophers Nancy Hartsock and Alison Jaggar, sociologist of science Hilary Rose, and philosopher of science Sandra Harding extended and reframed the idea of the standpoint of the proletariat to mark out the logical space for a feminist standpoint. Their principal claim regarding feminist standpoint theories is that certain socio-political positions occupied by women (and by extension other groups who lack social and economic privilege) can become sites of epistemic privilege and thus productive starting points for enquiry into questions about not only those who are socially and politically marginalized, but also those who, by dint of social and political privilege, occupy the positions of oppressors. This claim is captured by Sandra Harding thus: “Starting off research from women’s lives will generate less partial and distorted accounts not only of women’s lives but also of men’s lives and of the whole social order.” [1993:  56]

Following Marxist tradition in rejecting liberal assumptions that social and historical factors are irrelevant to epistemic questions, central tenets of feminist standpoint theories include their recognition of the role of social and historical location in shaping epistemic agents and their knowledge, and an embrace of that location as a potentially valuable contribution to knowledge.  Feminist standpoint theories work towards an epistemic approach that continues to value objectivity (albeit rethought and reworked) as a goal of enquiry, while at the same time accommodating, analyzing and understanding the effects of social location on epistemic agents and on knowledge. This stance is in stark contrast to the relatively pervasive traditional assumption that recognizing the effects of the socio-historical location of epistemic agents rather than abstracting them from that location disrupts enquiry. Feminist standpoint theories, then, involve a commitment to the view that all attempts to know are socially situated. The social situation of an epistemic agent—her gender, class, race, ethnicity, sexuality and physical capacities—plays a role in forming what we know and limiting what we are able to know. They can affect what we are capable of knowing and what we are permitted to know. The influence of social location on epistemic content and capacity can be felt throughout our epistemic practices, shaping not only the way in which we understand the world, but also the way in which it is presented to us via experience. Consider the following example offered by Terri Elliot:

Person A approaches a building and enters it unproblematically. As she approaches she sees something perfectly familiar which, if asked, she might call ‘The Entrance’. Person X approaches the same building and sees a great stack of stairs and the glaring lack of a ramp for her wheelchair. [1994: 424]

The experience of person A is of the entrance to a building. Whereas the experience of person X is of a barrier to entrance and (at best) an inconvenience. Person X’s social location—qua person with a disability—means that the building presents differently to her from how it does to someone without a disability.

Feminist standpoint theories seek, moreover, to go beyond analysis and description of the role played by social location in structuring and shaping knowledge. The normative aspect of feminist standpoint theories manifests firstly in a commitment to the thesis that the ways in which power relations inflect knowledge need not be understood as with a subjectivity that threatens their objectivity; rather that socially situated knowledge can be properly objective. Secondly, feminist standpoint theories’ normative weight is felt via their commitment to the claim, developed by extension of the Marxist view of the epistemic status of the standpoint of the proletariat, that some social locations, specifically marginalized locations, are epistemically superior in that they afford hitherto unrecognized epistemic privilege, thereby correcting falsehoods and revealing previously suppressed truths. Thus, as Sandra Harding puts it, “Standpoint theories map how a social and political disadvantage can be turned into an epistemic, scientific and political advantage.” [2004; 7-8]

Standpoint theories, then, move beyond a descriptive situated-knowledge thesis to a normative thesis, among the transformative objectives of which is a more socially just world.

4. What is a Standpoint?

The concept of a standpoint employed in feminist standpoint theories takes a narrow meaning, owed to Marxist theory, according to which a standpoint is an achieved collective identity or consciousness. The establishment of a standpoint is the political achievement of those whose social location forms its starting point; it is not merely ascribed from beyond that location. There is a consensus among feminist standpoint theorists that a standpoint is not merely a perspective that is occupied simply by dint of being a woman. Whereas a perspective is occupied as a matter of the fact of one’s socio-historical position and may well provide the starting point for the emergence of a standpoint, a standpoint is earned through the experience of collective political struggle, a struggle that requires, as Nancy Hartsock puts it, both science and politics [Harding 2004: p. 8]. By way of emphasis of this point, Hartsock uses the label ‘feminist standpoint’ whereas Dorothy Smith uses the label ‘women’s standpoint’, reflecting the way in which standpoint theory argues for “women’s place” as a starting point for enquiry [Harding 2004: 21].

So while both the dominant and the dominated occupy perspectives, the dominated are much more successfully placed to achieve a standpoint. Nevertheless, it is not impossible for those who occupy non-marginalized perspectives to become part of the process of helping reach a shared critical consciousness with respect to the effects of power structures on epistemic production. There are many different lives consisting of many different activities and many different social relations and, thus, potentially many different consciousnesses and many different standpoints. The ongoing political and epistemic project of achieving a standpoint offers critical insights that give rise to a new perspective on reality. Sandra Harding explains the point thus,

Only through such struggles can we begin to see beneath the appearances created by an unjust social order to the reality of how this social order is in fact constructed and maintained. This need for struggle emphasizes the fact that a feminist standpoint is not something that anyone can have simply by claiming it. It is an achievement. A standpoint differs in this respect from a perspective, which anyone can have simply by ‘opening one’s eyes’. [1991:  127]

5. Acquiring Knowledge via Standpoints

According to feminist standpoint theories, the process of achieving knowledge begins when standpoints begin to emerge. They emerge when those who are marginalized and relatively invisible from the vantage point of the epistemically privileged become conscious of their social situation with respect to socio-political power and oppression, and begin to find a voice. It is no historical accident that feminist standpoint theory emerged in academic discourses more or less contemporaneously with the feminist consciousness movement within feminist activism. This demonstrates the way in which feminist standpoint theories are grounded in feminist political practice. Contrary to the tendency of critics who perceive feminist standpoint theory via an individualist lens, mistakenly reducing the notion of a standpoint to an individual’s social location, the emergence of standpoints is a collective process occurring through the recognition and acknowledgment of others who occupy more or less the same standpoint as oneself. Although such narratives may form a starting point, the emergence of a standpoint does not consist merely in the telling of individual women’s narratives. Self-definition in terms of a standpoint provides a starting point for the self-assertion of one’s own identity, challenging those identities imposed by conventional stereotypes that form part of hegemonic ways of thinking from the point of view of the socially and politically dominant. This assertion of identity—of who I am—adds to a body of knowledge about how my life is and how I experience the world. Those truths debunk myths about me, about my relationship with the world, and about my relationships with others in that world that have heretofore been taken to be true. In this vein, Patricia Hill Collins discusses a stereotypical understanding of African American women working as domestic servants—the Mammy stereotype which objectifies black women as ‘faithful and obedient’ domestic servants, dedicated to the care of their white family—in contrast to the Sapphire, controlling and manipulative, or the Jezebel, a temptress [1990; 456].  As Collins shows, stereotypes such as these serve as ‘controlling images’ that serve to reinforce for everyone, including African American women, the ways of thinking from the point of view of the racially and sexually dominant. This way of thinking oppresses as it constrains what can be known about being an African American woman. African American women, rather than racist and sexist social structures, are blamed for that oppression. Thus the epistemic process whereby a standpoint emerges enables the occupants of that standpoint to gain an element of power and control over knowledge about their lives. In becoming occupants of a standpoint, they also become knowing subjects in their own right, rather than merely objects that are known by others.

As shown by the claim from Harding that appears at the end of the previous section, feminist standpoint theorists argue that the epistemic and political advantages of beginning enquiry from within women’s lived experiences are not limited to providing a truer account of those lives, but of all the lives and socio-political relations within which those lives are enmeshed. Initial enquiry in women’s lived experiences, mediated by the politicized consciousness that emerges within a feminist standpoint, reveals the way in which male-dominated ideologies distort reality. Standpoints make visible aspects of social relations and of the natural world that are unavailable from dominant perspectives, and in so doing they generate the kinds of questions that will lead to a more complete and true account of those relations. Feminist standpoint theorists point out that, in order to survive within social structures in which one is oppressed, one is required to understand practices of oppression, to understand both oppressed and oppressor; but, this epistemic bi-polarity is neither required of, nor available to, the dominant. For example, the colonized have to learn the language of the colonizer—the New Zealand Māori learned English while use of the Māori language was strongly discouraged, for instance—in order to survive colonization, but the colonizer need not learn the language of the colonized in order to survive. The colonized, then, have some means of entry into the world of the colonizer, and the potential for gaining some understanding of how the world works from that perspective, but the colonizer is generally shut out of the world of the colonized and restricted to a mono-visual view of how the world is. The double vision afforded via the social location of women and other marginalized groups can provide the epistemic advantage of insights into social relations that are unavailable to the non-marginalized. An illustration of the way in which the often undervalued, messy caring work (caring for the sick and the elderly, bearing and raising children, unrewarding, unpaid domestic labor, emotional labor) in which women are traditionally engaged offers productive epistemic starting points; Hartsock cites a passage from Marilyn French’s novel The Women’s Room:

Washing the toilet used by three males, and the floor and walls around it, is, Mira thought, coming face to face with necessity. And that is why women were saner than men, did not come up with the made, absurd schemes men developed; they were in touch with necessity, they had to wash the toilet bowl and the floor. [Harding 2004: 43; French, 1978: 214]

Mediated via a critical standpoint, Mira’s lived experience could form a wellspring of epistemic insights not only into the gender power relations of which her situation (cleaning up men’s mess) is a result, but also of the basic necessities of all our lives and of the need to ensure that they are met equitably. Thus, from this starting point in the material condition of women’s lives, questions arise that would not otherwise get asked, and these questions can form rich sites for research, for policy reform and, ultimately, for social change. For instance, such questions might address issues such as violence against women—why is it so prevalent in so many societies against women of all classes and races, and why are women so often blamed for it? While violence against women remains an ongoing challenge and tragedy, women have derived epistemic advantage from the conceptual resources and clearer understanding of violence that has been afforded to them within feminist standpoints. In turn, this stronger understanding has flowed into social and political discourses to the extent that, at least in some parts of the world, violence is no longer considered acceptable or part of the normal dynamics of a marriage or partnership. Moreover, campaigning by women and their male allies has resulted, in some jurisdictions, in an anti-violence policy environment, and in legal protection and redress for women. Moreover, women’s feminist stand against family violence has (among other factors) motivated researchers to look for and critically analyze the causes and conditions of family violence, such as poverty and inter-generational family violence. In so doing, they have widened understanding of, and enquiry into, family violence more generally to encompass violence perpetrated on children, on male partners, and on elders. Other sites of enquiry that emerge from women’s lived experience might include:  Gender equity in the workplace—why are women so over-represented in low-paid and under- or unvalued caring work?; Gender equity in the domestic sphere—why is it often considered normal or usual for a woman to work a double shift, one outside the home and one at home?; Bodies and their normal biological processes—why are menstruation, birthing and menopause understood as medical problems to be treated as illnesses?; Women’s bodies and objectification—why do women’s bodies continue to be used to promote and sell products that run the gamut from instant coffee powder to motorsport?

The development of a standpoint by the dominated dissipates the conceptual dissonance experienced by someone who has been forced to adopt dominant conceptual frameworks that do not truly belong to them. Conceptual frameworks emanating from patriarchal systems fail to provide cognitive tools that enable women and others who are marginalized to make sense of their experiences in and of the world. The emergence of appropriate conceptual frameworks furnishes the marginalized with the cognitive tools to become epistemic subjects, whereas previously they are merely known by others. It enables them to name and think about their experiences in ways that properly represent those experiences. That is not to say that existing conceptual frameworks have been of no use whatsoever for women, for even this conceptual dissonance has been mediated and expressed within those frameworks. Rather, thinking from within a standpoint enables the emergence of conceptual frameworks which resolve the contradictions that arise, and fill the gaps and silences that are left empty when using a conceptual framework that is not entirely fit for purpose.

Some critics of standpoint theories have charged that their central claim of epistemic advantage amounts to a claim of automatic epistemic privilege. However, as we have seen in Section 4, a standpoint is not equivalent to a social location and the standpoint theorist’s claim is not that epistemic advantage is bestowed by dint of one’s social location, but that it is rather earned through involvement in collective political struggle. Theorists argue that experiences of the marginalized reveal problems to be explained; problems that can become research agendas or policy issues/initiatives and are a source of objectivity-maximizing questions. Such questions force us to examine the beliefs, prejudices and biases of the dominant groups in society, the propositions that have previously counted as knowledge. It is in this way, feminist standpoint theorists propose, that we achieve less partial and distorted understandings of all of our lives than we do if we allow questions about those lives to originate only from the experiences of dominant groups. The realities of women’s lives, then, can provide sites of enquiry that lead to new, more complete, less partial, and more objective knowledge.

Moreover, as Alison Wylie argues [2004: 345-6], standpoint theorists’ situated-knowledge claims explicitly undermine the conventional assumption that objective epistemic agents are non-specifically located, and that they are neutral and disinterested with respect to the subject of their enquiry. In so doing, however, standpoint theorists’ approach to the method of enquiry demonstrates a commitment to what are typically taken to be its virtues when it is scientific: empirical adequacy, construed as either empirical depth or breadth; internal coherence; inferential robustness; consistency with relevant well-established bodies of knowledge; and, explanatory power. Indeed, as Wylie notes, feminist interventions in social and scientific enquiry have been successful in demonstrating how it thus far has not always manifested those virtues. Standpoint theorists move beyond this critical moment, showing how the inclusion of lived realities, not yet properly visible to enquirers, can make for better-supported hypotheses. In a case study of archaeology considered in their article “Coming to Terms with the Values of Science”, Wylie and Nelson point out the ways in which researchers taking a gender-sensitive standpoint with respect to studies on netting and basketry and on skeletal remains have resulted in (among other things) a widening of the evidential base of archaeological enquiry in these areas, leading to the re-examination of established hypotheses [2007: 64-70].

6. The Outsider Within

The epistemic advantage of the ‘double vision’ afforded to those in the position of being outsiders within is a recurring theme of feminist standpoint theories. Several theorists emphasize the epistemic advantage afforded to those forced conceptually to straddle both sides of a dichotomous social divide. That advantage is captured by black feminist critic Bell Hooks’ description of growing up in small-town Kentucky thus:

Living as we did—on the edge—we developed a particular way of seeing reality. We looked both from the outside in and from the inside out…we understood both. [1984: vii]

Patricia Hill Collins, for instance, considers black feminist academics to occupy a position of potential epistemic privilege in so far as they are, on the one hand, insiders by dint of their position as authentic academics; yet, on the other hand, outsiders in so far as they are women and black, thus remaining to some extent decentered within the context of the Academy. This places them in a unique position from which to understand how things are in the Academy from the perspective of an insider who enjoys some degree of power and privilege both professionally and personally as a result of her membership, and who at the same time has an understanding of how things are from the perspective of one who is marginalized with respect to the centre of that power as a result of her gender and race. The dual perspective available to someone in this position leaves her well-placed to recognize the underlying assumptions and evaluative commitments that drive and shape the dynamics of power within the Academy, while at the same time providing her with a critical frame of reference derived from her own experience of the Academy, within which to potentially gain a better understanding of its power structures and dynamics. A dual perspective such as this, then, could form the basis of a feminist standpoint which would generate challenging questions about the social and political structures that engender the reality that black women academics experience in their professional and personal lives. In addition, standpoint theories offer explanatory resources for understanding how this dual positioning can potentially bestow epistemic advantage.

The self-reflexivity inherent in the identification of this insider/outsider position as a potentially advantaged epistemic location connects with the broader feminist theme of the (often vexed) relationship between feminist practice and feminist theory. Several feminist standpoint theorists’ work starts in their own lives, their initial site of analysis is the material experience of women as academics and scientists. Feminist sociologist Dorothy Smith argues that women sociologists are placed at the centre of a contradiction in the relation of their discipline to their experience of the world. This contradiction leads to a ‘bifurcation of consciousness’ [Harding 2004: p. 27]. On one side of that divide is the conceptual practice of academic work conducted within the conceptual structures of the discipline of sociology; and on the other, the concrete of the domestic sphere. She argues that sociological discourse has been authored and authorized by men, noting that the frames of reference against which its discourses of enquiry and discussion take place have their origins in men’s lived experiences, not women’s. The sociologist is thus conceived of as male, and women cannot be afforded the status of full participants in the practices of sociology without suffering a ‘double estrangement’. To gain legitimacy and status as sociologists they must suspend their identities qua women.

In her “Hand, Brain and Heart: A Feminist Epistemology for the Natural Sciences”, Hilary Rose makes similar points with respect to women in the natural sciences—‘Women Scientists in the Men’s Laboratories’, as she puts it [Harding 2004: 75]. Those women also have to negotiate the contradictory demands of private and professional spheres. Drawing on physicist Evelyn Fox Keller’s accounts of her experiences as a student (which includes narrative of male students avoiding her and a male university teacher not countenancing the possibility that she could solve mathematical problems without male help), Rose, like Smith, identifies a split in the woman scientist’s consciousness: she is ‘cut in two’, her abstract, conceptual scientific labor arises in ‘painful contradiction’ with her caring labor [Harding 2004: 76]. The implicit requirement that a woman suppress part of herself in order to acquire any professional credibility is one reason, Rose argues, why women scientists were, and in some disciplines remain, comparative rarities.

Thus, while the outsider-within position can afford the epistemic advantage of ‘double vision’ in the absence of the kind of political context and consciousness of which a standpoint is constituted, those benefits can remain unrealized as women scientists suppress their identity as women and as feminists in order to pass as scientists. As Uma Narayan has argued, it should be acknowledged that this colonialized bi-culturalism has a ‘dark side’ [Harding 2004: 221-3] with which women adopt various strategies to survive. In order to negotiate and cope, the best she can, with various contexts in which she finds herself having to operate, a woman might suppress part of herself in some of those contexts while assuming the persona best suited to each. Thus some women professionals emphasize only those characteristics considered valuable in their professional context, allowing themselves to be women and feminist only in private contexts. Alternatively, a woman might simply try to imitate the traits, habits and practices of the dominant group while suppressing herself entirely. For the feminist standpoint theorist, an alternative to these strategies is to attempt to remain within the contradictory contexts, and to do so critically. This is, potentially, the most epistemically powerful response, but it is also the most challenging given the risk of alienation from oneself and from those with whom one may have the most in common.

The difficulty of surmounting such challenges might account, in part, for the tension inherent in many feminist standpoint theorists’ accounts of epistemic insight. This tension arises between, on the one hand, recognition that epistemic insights occur as a result of an individual’s insider/outsider experience and, on the other, the central claim that a standpoint is a shared, rather than an individual, achievement. Perhaps the existence of this tension reinforces the claim that, while epistemic insight is achievable on the basis of individual insider/outsider experience, it is only from the political context and shared consciousness of a standpoint that such insights can be truly advantageous and move those within it from improved understanding of the realities of their lives towards social and political change.

7. Controversies

More than three decades have passed since the publication of the first work that developed and advocated feminist standpoint theories. Yet standpoint theory remains controversial and its controversies manifest both between and beyond feminist scholars, as Alison Wylie writes,

Standpoint theory may rank as one of the most contentious theories to have been proposed and debated in the twenty-five to thirty year history of second-wave feminist thinking about knowledge and science. Its advocates, as much as its critics, disagree vehemently about its parentage, its status as a theory, and crucially, its relevance to current thinking about knowledge. [Harding 2004: 339-40]

This section outlines what are perhaps the most significant challenges to feminist standpoint theory.

a. False Universalism

Since feminist standpoint theories take the view that enquiry is best started from within women’s material experience and that epistemic advantage ensues from within standpoints that emerge from that experience, they can mistakenly be understood to espouse an essentialist universalism, according to which women are afforded automatic epistemic privilege simply for the fact of their being women. Since feminist standpoint theorists argue that enquiry is best started from women’s lives, and that standpoints emerge only when women begin to reflect upon and question the reality of those lives through a politicized framework, feminist standpoint theories can also be misunderstood as proposing a single, monolithic feminist standpoint. This misunderstanding presents this feminist standpoint as arising not from ordinary women’s lives but from the lives of relatively privileged, mostly middle-class, mostly white, women academics.

A good proportion of the work that has since built on early moments in feminist standpoint theory has focused on incorporating considerations of difference within feminist standpoint theories. Feminist standpoint theories are clearly not committed to the project of formulating a homogenous women’s or feminist standpoint. Rather, they recognize that women’s presence in many areas of the terrain of social and economic marginalization means that women occupy positions at the intersection of a number of oppressive social structures.

However, reconciliation between feminist standpoint theories and those feminist theories which prioritize difference remains problematic and presents a dilemma: The formation of a standpoint requires shared experiences of oppression and of struggle against that oppression. But the inclusion of those experiences within a standpoint, it can be argued, runs the risk of occluding epistemically significant differences between women. A feminist standpoint may be taken (implicitly) as the position of all women, but what account is taken of class, race, sexuality, and other markers of difference, which structure the power relations that generate oppression, the shared experience of which forms the basis of the standpoint? The response to this dilemma from within standpoint theory has been, firstly, to emphasize that feminist standpoint theories envisage a plurality of feminist standpoints; and, secondly, to modify feminist standpoint theories to take account of the ways in which women’s different experiences at the intersections of various oppressive social structures will engender different standpoints. Patricia Hill Collins and Bell Hooks, for example, have developed black feminist standpoint theories that take into account the role of women of color in slavery and in devalued menial and caring labor, and the way in which this oppression is experienced at the hands of other, mostly white, women.

Thirdly, some feminist standpoint theorists respond head-on to the charge that by focusing on the experiences that are common to most women, standpoint theories fail to take account of significant differences between women. Maria Mies and Vandana Shiva, for example, participate in the feminist standpoint conversation from within their own experiences as activists in the women’s and ecology movements. They argue, contrary to the charge of false universalization, that there are many examples of women’s activism in pursuit of environmental causes which demonstrate the reality of women overcoming differences and developing a shared sense of solidarity through which they begin to gain an understanding of the oppressive relations in which their lives are enmeshed [Harding, 2004: 334-5]. Thus, as standpoints emerge, some differences will be occluded, but some significant similarities will be thrown into sharper relief.

Postmodernist feminist critics argue not only that the risk of occlusion of difference remains but, more fatally with respect to the possibility of reconciliation, the categories upon which feminist standpoint theory depends—woman, feminist, knowledge—are fluid and in a state of socially influenced flux and contestation, making it impossible ever properly to capture experiences and identities within standpoints. Standpoint theorists counter that the idea that identity is fluid itself puts the political power of feminism at risk and threatens the loss of the material experience of women’s oppression. Standpoint and postmodernist feminism remain opposed in this respect: the former requires materiality as its starting point, the latter rejects the reality of that ‘real world’ outright. As Haraway puts the point from the perspective of the former position:

[T]o lose authoritative biological accounts of sex, which set up productive tensions with its binary pair, gender, seems to be to lose too much; it seems to be to lose not just analytical power within a particular Western tradition, but the body itself as anything but a blank page for social inscriptions, including those of biological discourse. [Harding 2004: p. 94]

b. Epistemic Relativism

The charge that feminist standpoint epistemologies are committed to a politically dangerous epistemic relativism ensues from the claim that all knowledge is socially situated and that some social values enhance the process of enquiry and the acquisition of knowledge.

In response to this charge, Sandra Harding reconceptualizes objectivity, arguing for the pursuit of strong objectivity [1993: passim]. Harding argues that standpoint theory imposes a rigorous logic of discovery involving a strong demand for ongoing reflection and self-critique from within a standpoint, enabling the justification of socially-situated knowledge claims. This critical approach, Harding asserts, results in a stronger notion of objectivity than that achieved by traditional approaches to enquiry. The traditional starting point for knowledge is the position of the dominant and, despite assumptions to the contrary, that position is ideologically permeated. This results in partial and distorted accounts of reality, which thereby fail to live up to modernistic standards of impartiality, neutrality and universality associated with a commitment to epistemic objectivity.

With regard to the idea that the reconceptualization of objectivity represents a retreat from modernity, rationality and science [Walby, 2001: 489], Harding labels her feminist standpoint approach ‘neo-modern’ [2001: 518]. By these lights, feminist standpoint theories remain committed to strengthening modernist commitments to truth and objectivity, but are distanced from modernity’s absolutist overtones. Strong objectivity encompasses a sense of completeness and a lack of distortion. The ultimate epistemic goal of enquiry based on this model would be the inclusion of all standpoints, enabling the revelation of different aspects of truth. This would be a dialectical process consistent with standpoint theories’ roots in the Marxist tradition. Objective enquiry modelled thus requires trust—we need to trust others truthfully to reveal aspects of reality. Conceived thus, objectivity is not a goal that is easily achievable.

c. The Bias Paradox

Some, such as Helen Longino [1993] and Susan Hekman [1997] have argued that two of the central tenets of feminist standpoint theories—the claim that knowledge is socially situated and the claim that marginalized standpoints (but not perspectives) offer epistemic advantage—are in deep tension with each other. On the one hand, it is claimed that there is no standpoint-neutral vantage point from which to make judgements about the relative epistemic superiority of certain standpoints over other ways of knowing the world; while on the other it is claimed that marginalized standpoints are, indeed, epistemically better than the epistemic positions of the non-marginalized. If this tension cannot be resolved, it is argued, the standpoint theorist is pushed back towards the relativistic embrace of ‘multiple and incompatible knowledge positions’ [Longino 1993: 107].

Harding’s ‘strong objectivity’ suggests a possible dissolution of this apparent tension. Responses to the claim of a bias paradox offered by Rebecca Kukla [2006] and Kristina Rollins can be understood as means by which Harding’s notion can be realized. Both draw upon the resources of contextual approaches to epistemic justification to show how taking account of the social location of epistemic agents can strengthen knowledge claims. Kukla draws upon and extends Wilfrid Sellars’ account of perceptual warrant to argue for an account of epistemic objectivity in which contingent, contextual factors such as gender and race are recognized as sources of justification for knowledge claims, rather than rejected as disruptors of ‘aperspectival’ objective epistemic endeavor [2006: 86-7]. Only when we are socially located in certain respects, Kukla argues, can we best perceive certain aspects of reality.

Rollins, meanwhile, argues that the bias paradox arises out of a foundationalist framework: The standard of impartiality against which standpoints would be assessed involves basic, foundational beliefs and, according to foundationalism, these cannot be socially situated; hence, the tension between standpoint theories’ epistemic advantage thesis on the one hand and situated knowledge thesis on the other. Drawing and expanding upon the resources of Michael Williams’ contextualism, Rollins argues that by offering a standard of impartiality provided by a context of default entitlements whose status as such is always context-dependent, contextualism shows how it is possible to establish standards of epistemic justification that are themselves situated knowledge claims [2006: 129]. It is against a background of a standard such as this that it would be possible to claim, without retreat to relativism, that marginalized standpoints can offer epistemic advantage.

Generally, with respect to their commitment to objectivity, then, feminist standpoint theories can be understood as attempts to synthesize the elements that usually create an inherent tension in feminist and emancipationist projects. This tension arises from acknowledging the epistemic value of the inescapable social situation and dependence of epistemic subjects and of knowledge, and yet remaining committed to the idea that we don’t make the world up. Feminist standpoint theory attempts to occupy a position that incorporates both epistemic deference to the world and acceptance of the way in which that world and the ways we experience and understand it are shaped by our material circumstances. Feminist standpoint theory is also informed by an acceptance of the way in which different experiences, needs and interests give rise to different practices, and different ways of thinking about and interacting with the world, some of which are better than others. Real knowledge on this view just is socially situated; it is interested as opposed to disinterested [Harding 2004: 24-25]. In this vein, feminist standpoint theory serves as a critique of conventional epistemic standards, arguing that what Donna Haraway dubbed ‘the God Trick’—the traditional epistemic view that knowledge is only achieved by adopting a disinterested, impartial view from nowhere—is unachievable, for knowledge is always from somewhere [Harding, 2004:  93].

8. References and Further Reading

Many of the seminal articles on feminist standpoint theories (including the papers by Dorothy Smith, Nancy Hartsock, Hilary Rose, Patricia Hill Collins and Donna Haraway) mentioned in this article are now collected together in Harding’s The Feminist Standpoint Theory Reader. Harding’s collection also includes more recent papers that make new contributions to these debates, including the papers by Mies and Shiva, Narayan and Wylie.

Susan Hekman’s article, “Truth and Method: Feminist Standpoint Theory Revisited”, Signs Vol 22, No. 2 (Winter, 1997) provides a critical response to feminist standpoint theories which manifests the tension between standpoint theory and the preoccupations of postmodernist feminism. That article and replies from Harding, Hartsock, Hill Collins and Smith all appear in Harding’s 2004 collection.

a. Works Cited

  • Terri Elliot, “Making Strange What had Appeared Familiar”, The Monist; Oct94, Vol. 77 Issue 4
  • Evelyn Fox Keller, “The Anomaly of a Woman in Physics” in Sara Ruddick and Pamela Daniels, eds., Working it Out: 23 Women, Writers, Scientists and Scholars Talk about their Lives, New York: Pantheon Books, 1977
  • Marilyn French, The Women’s Room, New York: Jove, 1978
  • Sandra Harding Whose Science/ Whose Knowledge? Milton Keynes: Open University Press, 1991
  • Sandra Harding, “Rethinking Standpoint Epistemology: What is Strong Objectivity?” in L. Alcoff and E. Potter, eds., Feminist Epistemologies, New York/London: Routledge, 1993 (also appears in Harding, 2004)
  • Sandra Harding, “Comment on Walby’s ‘Against Epistemological Chasms: The Science Question in Feminism Revisited’: Can Democratic Values and Interests Every Play a Rationally Justifiable Role in the Evaluation of Scientific Work?, Signs, Vol. 26, No. 2 (Winter 2001)
  • Sandra Harding, ed., The Feminist Standpoint Theory Reader New York and London: Routledge, 2004
  • Donna Haraway, “Situated Knowledges” in Harding 2004
  • Nancy Hartsock, “The Feminist Standpoint: Developing the Ground for a Specifically Feminist Historical Materialism” in Harding, 2004
  • Patricia Hill Collins, Black Feminist Thought: Knowledge, Consciousness and the Politics of Empowerment, New York and London: Routledge, 1990
  • Patricia Hill Collins, “Learning from the Outsider Within: The Sociological Significance of Black Feminist Thought” in Harding 2004
  • Bell Hooks, From Margin to Center, Boston: South End Press, 1984
  • Rebecca Kukla, “ Objectivity and Perspective in Empirical Knowledge”. Episteme 3(1): 80-95. 2006
  • Helen Longino, ‘Subjects, Power, and Knowledge: Description and Prescription in Feminist Philosophies of Science’ in Feminist Epistemologies, L. Alcoff and E. Potter (eds.), New York: Routledge, 1993, 101-120.
  • Maria Mies and Vandana Shiva, “The Subsistence Perspective” in Harding, 2004
  • Uma Narayan, “The Project of Feminist Epistemology: Perspectives from a Nonwestern Feminist” in Harding, 2004
  • Kristina Rolin, “ The Bias Paradox in Feminist Standpoint Epistemology” Episteme 1(2): 125-136. 2006
  • Hilary Rose, “Hand, Brain and Heart: A Feminist Epistemology for the Natural Sciences” in Harding, 2004
  • Wilfrid Sellars, Empiricism and the Philosophy of Mind, Harvard: Harvard University Press, 1997
  • Dorothy Smith, “Women’s Perspective as a Radical Critique of Sociology” in Harding, 2004
  • Sylvia Walby, “Against Epistemological Chasms: the Science Question in Feminism Revisited” Signs, Vol. 26, No. 2 (Winter 2001)
  • Michael Williams, Problems of Knowledge: A Critical Introduction to Epistemology, Oxford and New York: Oxford University Press, 2001
  • Alison Wylie, “Why Standpoint Matters” in Harding, 2004
  • Alison Wylie & Lynn Hankinson Nelson, “Coming to terms with the values of science: Insights from feminist science studies scholarship” In Value-free science: Ideals and illusions, eds. Harold Kincaid, John Dupré, and Alison Wylie. Oxford: Oxford University Press, 2007

b. Earlier Papers

  • Sally J. Kenney and Helen Kinsella, eds. Politics and Feminist Standpoint Theories, New York and London, The Haworth Press, 1997

c. Later Contributions

  • Sharon Crasnow, “Feminist anthropology and sociology: Issues for social science” In Handbook of the philosophy of science, Volume 15: Philosophy of anthropology and sociology, 2006
  • Sharon Crasnow, “Is Standpoint Theory a Resource for Feminist Epistemology? An Introduction”  Hypatia 24(4) 2009: 189-192.
  • Sandra Harding, Sciences from below: Feminisms, postcolonialities, and modernities, Raleigh: Duke University Press, 2008.
  • Sandra Harding, “Standpoint Theories: Productively Controversial”,  Hypatia 24(4) 2009: 192-200.
  • Kristen Intemann, “Standpoint empiricism: Rethinking the terrain in feminist philosophy of science”  In New waves in philosophy of science, eds. P.D. Magnus and Jacob Busch. Houndsmills, Basingstoke, Hampshire, UK: Palgrave MacMillan, 2010; 198-225.
  • Kourany, Janet, “The Place of Standpoint Theory in Feminist Science Studies”, Hypatia 24(4) 2009: 209-218.
  • Kourany, Janet, “Standpoint Theory as a Methodology for the Study of Power Relations”, Hypatia 24(4) 2009: 218-226.
  • Joseph Rouse, “Standpoint Theories Reconsidered” Hypatia 24(4) 2009: 200-209.
  • Miriam Solomon, “Standpoint and Creativity”, Hypatia 24(4) 2009: 226-237.

Author Information

T. Bowell
Email: TABOO@waikato.ac.nz
University of Waikato
New Zealand

Chrysippus (c. 280—207 B.C.E.)

Chrysippus

Chrysippus was among the most influential philosophers of the Hellenistic period. He is usually thought of as the most important influence on Stoicism. A later Stoic catchphrase ran, “If Chrysippus had not existed, neither would the Stoa.” (Lives 292). Carneades, the fourth Chair of the New Academy, modified the phrase to, “If Chrysippus had not existed, neither would I.” (Lives 438) Chrysippus defended and developed an empiricist epistemology and psychology. He offered important alternatives to the metaphysical theories of Aristotle and Plato, defending a thoroughgoing materialist ontology. His views concerning freedom and determinism continue to generate interest, and he is thought to have endorsed a form of compatibilism countenancing both freedom of the will and a deterministic cosmos. His work in logic was considerable, as he developed, as an alternative to the logic of Aristotle, a kind of propositional logic. As an ethicist, he maintained that the life of human happiness and the life of virtue are one and the same. He seems to have thought virtue is best understood as related essentially, if not entirely reducible, to wisdom. And he thought wisdom derives especially from the study of natural philosophy. That Chrysippus would take wisdom to derive primarily from the study of natural philosophy may be explained, in part, by his conviction that the cosmos exists in accordance with proper ends.

Table of Contents

  1. Life and Times
  2. Sources
  3. Epistemology
    1. Perception
    2. Epistemic Criteria
  4. Logic
    1. Arguments
    2. Propositions
    3. Modality
  5. Metaphysics
    1. Ontology
    2. Being, Being Something, and the Nature of Properties
    3. The Growing Argument
    4. Freedom and Necessity
  6. Ethics
    1. Eudaimonism
    2. The Unity of Virtues
  7. References and Further Reading
    1. Primary Sources
    2. Secondary Sources
    3. Other References

1. Life and Times

Chrysippus was born in Soli, near what is today known as Mersin, Turkey. He died in the 143rd Olympiad at the age of seventy-three (living c. 280-207 B.C.E.). Diogenes Laertius, in his Lives of the Philosophers, reports that before becoming a student of Cleanthes, Chrysippus used to practice as a long-distance runner (287 B.C.E.). He was a master dialectician. Most people thought, according to Diogenes Laertius, that if the gods took to dialectic, they would adopt no other system than that of Chrysippus (289 B.C.E.). Cleanthes had succeeded Zeno, who had founded the school at the Stoa Poikilê in 262 B.C.E. Chrysippus, having taught outside the school for a number of years, returned to succeed his former teacher in 230 B.C.E. Stoicism continued to flourish after his death, as the work begun by early Stoics was continued in the era of Panaetius and Posidonius, and later into the Roman Imperial period by thinkers such as Seneca, Epictetus, and Marcus Aurelius.

2. Sources

Chrysippus was a prolific writer. He is reported to have written more than 705 books (SVF 2.1). No single treatise remains, and we have something in the neighborhood of 475 fragments. Two doxographies stand out—Diogenes Laertius’ Lives of the Philosophers, hereafter, and above, (Lives), and pseudo-Plutarch’s Philosopher’s Opinions on Nature. Cicero writes several works that are of use in reconstructing Stoic thought: Academica, De Finibus, and De Natura Deorum, are among the most important. These contain summaries and critical evaluations of the views of the Hellenistic schools, and although Cicero identifies with the Academics, he is not without sympathy for Stoic philosophy. It is from these sources primarily that scholars have attempted to cobble together a set of fragments and testimonia that present a coherent picture of Stoicism and Stoic philosophers. The Stoicorum Veterum Fragmenta, hereafter (SVF), compiled in three volumes by H. von Arnim, is one of the more important collections of fragments in Greek and Latin, and it is customary to refer to Stoic fragments by making use of von Arnim’s volume numbers and numbering system. More recently, David Sedley and A.A. Long have produced an English translation and commentary of the principal sources of Hellenistic philosophy, The Hellenistic Philosophers, hereafter (LS). Corresponding Greek and Latin texts, with notes and bibliography, can be found in the second volume of this work.

3. Epistemology

a. Perception

Zeno, the founder of Stoicism, defined ‘presentation’ as an impression (phantasia) upon the soul (SVF I 58). That he neglected to indicate what he meant by ‘impression’ is suggested by the controversy between Cleanthes and Chrysippus concerning the exegesis of this term. Cleanthes maintained that an object impresses itself on the soul just as a signet ring impresses itself upon a wax seal—a metaphor Plato had made use of in the Theaetetus (191c8). Chrysippus rejects this model by arguing that if the soul were like a piece of wax (with only one surface exposed), it might only receive one impression at a time. Furthermore, every impression would be lost as the next image impinged upon the former. The soul, however, as Chrysippus argues, receives a number of impressions at once, and it does, in fact, retain some impression while receiving others (Lives 7.50). In place of the wax model, Chrysippus argues that the soul is more akin to air, as air is able to undergo a number of alterations simultaneously, as when a number of individuals are speaking at the same time in the same place. (In fact, the soul, according to Chrysippus, who was a thoroughgoing materialist, is a mixture of air and fire). Chrysippus seems, therefore, to accept an indirect realist view of perception, whereby an impression can represent an external object, i.e., a phantaston:

Chrysippus says that these four, [impression, impressor, imagination, and figment] are to be distinguished from one another. An impression is an affection occurring in the soul that reveals itself and its cause. Thus, when through sight we observe something white, the affection is what is engendered in the soul through vision; and it is this affection (pathos) that enables us to say there is a white object that activates us. The word ‘impression’ (phantasia) is derived from the word light (phõs); just as light reveals itself and whatever else it includes in its range, so impression reveals itself and its cause. (SVF 2.54)

Not every presentation is, however, accurate. Some presentations lack a corresponding ‘impressor’. Although we see what appears to be Clint Eastwood on the silver screen, nobody believes that Clint is actually up there (LS 236). Chrysippus considers a presentation such as this one to be the result of imagination, or a phantastikon. A presentation that is believed, but which lacks an underlying real object, as for example the dragoness of Hades as it appears to Orestes, would be classified as a figment, or a phantasma.

Imagination is an empty attraction, an affection in the soul which arises from no impressor…A figment is that to which we are attracted in the empty attraction of imagination; it occurs in people who are melancholic and mad. (SVF 2.54)

b. Epistemic Criteria

So how does the ruling element of the soul distinguish a real presentation from a mere figment? According to Diogenes Laertius, the Stoics maintained the following (Lives 155):

[W]ith respect to presentations, on the one hand there is the cataleptic, and on the other there is the non-cataleptic. The cataleptic, which they say is the criterion of things, is that which comes to be from an existent object, agrees with the real object itself, having been stamped and imprinted. The non-cataleptic does not come to be from the existent object, or if it does, fails to accord with that which exists; it is neither clear nor distinct.

A cataleptic (“apprehending” or “grasping”) presentation (i) comes from a real object (ii) accurately corresponds to that object, (iii) is imprinted on the sense faculty, and, as we might add, (iv) is clear and distinct. The Stoics, moreover, take the cataleptic impression to be the criterion by means of which one may discern fact from fiction. Needless to say, the Skeptics were eager to ask how it is that one may discern the cataleptic from the non-cataleptic. The conceptual landscape here is filled out aptly by Cicero in his Academica (2.83):

There are four headings to prove there is nothing that can be known, cognized, or grasped, which is the subject of this whole controversy. The first of these is that (i) some false impression does exist. The second is that (ii) it is not cognitive. The third is that (iii) impressions between which there is no difference cannot be such that some are cognitive and others not. The fourth is that (iv) no true impression arises from sensation that does not have alongside it another impression, no different from it, that is not cognitive. Everyone accepts the second and third of these. Epicurus does not grant the first, but you…[Stoics], with whom we are dealing, admit that one. The entire battle is about the fourth.

The Stoics seemed to have denied that a true and a false impression might nevertheless be qualitatively indistinguishable. But, as Cicero goes on to ask, what mark (notum) of a cataleptic impression might one have such that it could not be false? In response to Cicero’s question, the Stoics developed several strategies (Lives 162):

The standard of truth they claim is the cataleptic presentation, i.e., that which comes to be from a real object, according to Chrysippus in the twelfth book of his Physics, and to Antipater and Apollodorus. Boëthus, however, admits a plurality of standards, namely intelligence, sense-perception, appetency, and knowledge. But Chrysippus contradicts himself…and says that the criterion is sensation and preconception (prolepsis). And preconception is a natural concept of the universal. Again, certain others of the older Stoics make orthos logos the criterion; so also does Posidonius in his treatise On the Criterion.

The pivotal question for understanding Chrysippus’ epistemology seems to concern the meaning he attaches to prolepsis. On one reading, these preconceptions might be considered innate concepts. This reading, however, rings inconsonant with the empiricist leanings associated with the Stoics (SVF 2.83):

The Stoics say that whenever a person is born, the hegemonic part of his soul is like a sheet prepared to be written upon. Upon this he writes each one of his conceptions. The first method of inscription is via the senses. For by perceiving something, e.g., white, they have a memory of it when it has departed. And when many memories of a similar kind have occurred, we then say we have experience. For the plurality of similar experiences is experience. Some conceptions arise naturally in the previously mentioned ways without deliberation, others by means of our own instruction and attention. The latter are called merely conceptions. The former are called preconceptions (prolepseis) as well. Reason, for (possessing) which we are called rational, is said to be completed by means of our preconceptions during our first seven years.

The above passage suggests that Stoics viewed the mind as a tabula rasa. Concepts, or prolepseis, are acquired causally by the repetition of similar presentations. (See Aristotle’s Posteria Analytica II.9) Chrysippus seems to have thought that one might compare a given token presentation with the concept one has of the type thereof. When, for example, the sage rejects the presentation of a dragoness, one may note that the dragoness does not match the prolepsis one has of females. If this is his line of thinking, it is fair to say that Chrysippus is sanguine indeed with respect to the reliability of the cognitive mechanism responsible for the acquisition of prolepseis. Optimism at this level, however, accords well with the Stoic view that the world is constructed in accordance with reason (Lives VII.134):

[The Stoics] think that there are two principles of the universe—that which acts, and that which is acted upon. That which is acted upon is unqualified substance, i.e. matter; that which acts is the reason (logos) in it, i.e. god.

Understood in this way, Chrysippus’s view seems to share certain features with those more contemporary epistemologists who are keen to emphasize the proper function of one’s cognitive abilities—more so than one’s subjective relationship to a belief—in the production of beliefs that meet the requirements for warrant or knowledge. The Stoics saw the presence of the divine in terrestrial things. The epistemic faculties, like other faculties, are there for a reason.

4. Logic

The Stoics’ appreciation for logic probably originated with the school’s founder. Zeno studied with Stilpo the Megarian. And Megara was the home of a number of important philosophers engaging in a discipline that we would today call logic. Chrysippus, without question, is the philosopher most responsible for the logic associated with the Stoics. In the following section, we will consider some of the elements that set his logic apart from that of his predecessors, as well as the propositional logic that most students of philosophy are familiar with today.

a. Arguments

One way of understanding Chrysippus’ contribution to logic is to see what Stoic arguments are not. Consider the following argument:

All mammals are warm-blooded animals.
All warm-blooded animals are sanguineous.
So: All mammals are sanguineous.

Aristotle called an argument of this variety a syllogism. He took it as evident that given the truth of the premises, the conclusion is likewise true. Indeed, one need not know the meaning of the terms in the argument to determine that the conclusion follows logically from the premises. One may simply consider the form of the argument:

All A are B A = mammals
All B are C B = warm-blooded animals
So: All A are C C = sanguineous

Logicians consider arguments wherein the conclusion follows from the premises, in just this way, to be valid. It is important to recognize that an argument’s having true premises provides neither necessary nor sufficient grounds for it to be valid. The following argument is valid, as the conclusion follows from the premises, even though the premises would rightly be resisted. The argument thereafter has true premises, but it is clearly invalid, as the premises may be true although the conclusion is false.

All trout are mammals. No viviparous animal is a trout.
All mammals are oviparous. No trout is a mammal.
So: All trout are oviparous. So: No viviparous animal is a mammal.

Consider, next, the following argument. It is clearly valid, but how would one fit it into an Aristotelian syllogism?

If Plato walks, then Plato moves. p = Plato walks.
Plato walks. q = Plato moves.
So: Plato moves.

Aristotle resisted the idea that there could be a scientific demonstration that concerned individuals. (Notice that the Aristotelian arguments above made use of classes). The argument currently under discussion resists an Aristotelian formulation, for the relata herein are propositions—Stoics called these ‘sayables’—rather than classes. In Aristotelian logic, the key connectives are ‘all’, ‘some’, ‘is’, and ‘is not’. In Chrysippus’ logic, the key connectives are ‘if’, ‘or’, ‘and’, and ‘not’. In the forms below, the former represents the form of our argument above; the latter a form that is clearly fallacious:

If p, then q If p, then q p = Plato walks.
p q q = Plato moves.
So: q So: p

Stoics built arguments in this way. Following Aristotle’s lead in the Prior Analytics, they listed a number of indemonstrable modes of argument—forms whose validity is clearly not in question—to which, they were optimistic, all other valid arguments may be reduced. The following is a list of the five indemonstrable forms (the first four have names):

Modus ponens: If p, q; p; ergo, q.
Modus tollens: If p, q; not q; ergo, not p.
Modus ponendo tollens: Either p or q; p; ergo not q.
Modus tollendo ponens: Either p or q; not q; ergo p.
Not both p and q; p; ergo, not q.

b. Propositions

That the Stoics distinguished between simple and complex propositions should be clear from what has been said heretofore. The sayable Plato walks is simple, and the sayable If Plato walks, then Plato moves is complex, as the statements Plato walks and Plato moves are connected by the particles ‘if’ and ‘then’. But Chrysippus’s account of what makes complex ‘sayables’ true or false differs in interesting ways with the truth-functional accounts that many modern philosophers make use of (a sentence is truth-functional when the truth of the sentence is a function of the values of its subsentences). Consider ponendo tollens above and let the value of p be the car won’t start because of a dead battery, and the value of q be the car won’t start because of a bad alternator. When your mechanic asserts the disjunction of these claims, and subsequently finds the former to be true, an inference to the falsity of the latter is not deductively valid. (It might be that you have a bad alternator as well.) This is not the way that Chrysippus seems to have viewed the matter. The Stoics were keen to understand all legitimate disjunctions to present mutually exclusive alternatives, for example, p or q, but not both p and q:

But all the disjuncts must mutually conflict, and their contradictories…must also be mutually opposed. Of all disjuncts, one must be true, the other false. But if neither of them is true, or all or more than one of them are true, or the disjuncts do not conflict, or the contradictories of the disjuncts are not incompatible, then that is false as a disjunctive proposition (LS 301).

For Chrysippus the paradigmatically true conditional is one where the contradictory of the consequent conflicts with the antecedent. In the conditional below, the contradictory of the consequent is Plato does not move. And this claim can reasonably be said to be in conflict with our antecedent: Plato walks.

If (antecedent) Plato walks, then (consequent) Plato moves.

And although there is no detailed or precise account of how Chrysippus understood this notion of conflict, it is likely the case that the conflict in question is of the conceptual variety rather than mere empirical incompatibility (See LS 211). The type of conditional in question can be contrasted with the material conditional of the propositional calculus favored by many modern logicians, where a conditional is true provided the antecedent is false or the consequent is true (or is understood inclusively here). Consider the claim If we have pie, then we will have coffee. The contradictory of our consequent is We will not have coffee, and this claim does not present any real conflict with the antecedent of our claim, namely, We will have pie. Furthermore, a conditional of the material variety is truth-functional, that is, true just by virtue of the truth of its constituents. (The Stoics preferred to express claims of this sort as ‘it is not the case that we will have pie and not have coffee’.) Notice how a Chrysippean conditional, mutatis mutandis a Chrysippean disjunction, depends upon more than the truth-value of its constituents. Not only does it depend upon the contradictory of the consequent, it depends as well on a semantic understanding of what it is to conflict.

c. Modality

As we saw above, the Stoics classified certain kinds of sayables with reference to certain properties that are not part of linguistic form. In addition, Stoics classified sayables in terms of their modal properties, that is, possibility, necessity, impossibility, and non-necessity. (They also considered the properties of plausibility and probability as well (Bobzien 2003). For the Stoics’ account of modality, the Megarians are again an important influence (LS 231):

Diodorus defines the possible as ‘what is or will be’; the impossible as ‘what, being false, will not be true’; the necessary as ‘what being true, will not be false’; and the non-necessary as ‘what either is now, or will be false’.

An account such as this one fixes the possible to the temporal. If something is possible, say a sea-battle, then it occurs at some time. If something is impossible, say a triangle with less than three sides, then there will not be an instance of it at some time. The necessary sayable, for example, triangles have fewer than four sides, is that which will not be false at some time. And the non-necessary, Cypselus is ruling, is a claim that will be false at some time. One difficulty that accounts such as this face is that they do not explain sentences such as the following:

A golden mountain is at the top of Hannah Street.

This coat might have been shredded, but it was burned instead.

Sometimes we express a possibility that we believe will never actualize. Chrysippus offers a criticism similar in kind. He seems to argue that a jewel might never be smashed, but that insofar as one might receive credit for not smashing it, we would nevertheless consider it possible to smash the jewel (LS 235).

The set of modal definitions preferred by the Stoics is difficult to reconstruct. But there is some reason for thinking that Chyrsippus followed Philo in taking possible propositions to be those wherein the internal nature is capable of truth. Precisely what one is to make of the locution ‘internal nature is capable of truth’ is not known. But it is tempting to understand Chrysippus on the basis of his objections to Diodorus, and his account of conditionals, as saying something akin to “a proposition is possible if conceptually self-consistent, and (if) there are no external circumstances preventing it”. The remaining modalities would be defined in relation to the possible.

5. Metaphysics

For reasons similar to those of the Epicureans, the Stoics were materialists. Only that which can affect or be affected may be said to exist, so the argument runs; and only that which is corporeal may affect or be affected (Academica I.39). Nevertheless, their view does admit of a kind of dualism, as everything in the cosmos is the result of an active principle, namely God, upon a passive, or causally inert, indeterminate substrate (Lives VII.134). And for Chrysippus, divine presence in terrestrial things comes in the form of pneuma–a fiery kind of air, or ether (Lives 7.138-9), which gives form and quality to that which is per se devoid of quality. Although the Epicureans, following in the tradition of Democritus and Leucippus, were arguing that everything is made up of the discrete and the indivisible (atomon), the Stoics maintained that the prima materia upon which the active principle exercised its influence is continuous, serving as the substrate for everything from basic continua such as earth, water, air, and fire, to individual organisms such as Socrates and Secretariat. Where the Peripatetic would characterize Secretariat as an individual substance or thing, and brown as a property, or modification, thereof; Chrysippus, and the Stoics in general, would consider Secretariat to be a property or modification of the passive element, and the brown color to be, in turn, a disposition thereof.

a. Ontology

As Simplicius reports in his commentary on Aristotle’s Categories (SVF 2.369):

The Stoics see fit to reduce the number of primary genera, and others they take over with changes in the reduced list. For they make their division a fourfold one, into substrates, the qualified, the disposed, and the relatively disposed.

Thus, the Stoics recognized four main levels of existence, and, as Long and Sedley have argued, there is some reason for thinking that these categories originated with Chrysippus (165). The first category is that of substrate (hupokeimenon). To locate an entity in this first category is to assign it existence without reference to its qualities. This characterless substance is being, or ousia, in its primary sense. The next level of being is that of quality. To be more precise, this level of being is something qualified by the inseparable god, reason, or soul, that pervades the substrate, and on the account of Chrysippus, this takes the form of pneuma (LS 46A). In the second category the Stoics placed modifications or qualified substrates (poia) of the prima materia and, interestingly, it is in this category of qualities, or poia, that the Stoics include people, and organisms, in general. Organisms are, therefore in an important sense, adjectival in being. The third category concerns dispositions of the things that are qualified, or things that are qualified, disposed in some way (põs echonta). Grammatical knowledge, for example, would be a disposition of the grammarian. In the fourth category, the Stoics countenanced dispositions that are relatively so, or pros ti põs echonta. These dispositions bear a resemblance to the so called Cambridge properties that may change without any change to the item under consideration, as, for example, your being to the right of me may occur without my having changed my position.

b. Being, Being Something, and the Nature of Properties

The Stoics were, moreover, materialists. Only corporeals, or somata, exist. Existence, however, is not the highest ontological category. In the Sophist, Plato had suggested that for something x to be some thing was for x to exist. But the Stoics reject Plato’s suggestion, maintaining that reality may be divided on the one hand into corporeals (somata), which may be said to exist; and on the other hand into incorporeals (asomata), which may be said to subsist (SVF 1.65). The Stoics thought that the latter category was comprised of such things as time, place, void, and ‘sayable’ (lekta). The Stoics’ reason for saying that the incoporeals are real has, to be sure, a Meinongian ring. In order to be an object of one’s thought, a lekta, for example, must be something. Otherwise there is nothing one is ex hypothesi thinking about.

Understood in this way, the Stoic ontological classification implicitly rules out properties of a certain kind. Socrates and Glaucon, for example, both have the attribute of being a man. And this invites the question: Does this so called entity Man, under which both are said to be classified, exist? Plato on the received view (but see Grabowski 2008), and Aristotle, held that (i) the property of being a man exists independently of our recognition thereof, and, (ii) that the property, that of being a man, which belongs to Socrates, is numerically identical to the property, similarly described, which belongs to Callias. Though the philosophical literature on properties is less than univocal, in terminology and criteria, those who accept the received view, attributed here to Plato and Aristotle, are frequently referred to as Realists. Those who maintain that properties described in this way do not exist are frequently called Nominalists.

Chrysippus resolutely rejects the views of Plato and Aristotle concerning properties. But the committed Nominalist will have to say something about propositions which are taken to be true, and which, importantly, seem to refer to such entities, call them for present purposes, universals. Consider the following proposition:

(A) Man is a rational mortal animal.

Chrysippus, as we will see below, relies upon the truth of this proposition in his ethical philosophy. The advocate of this proposition, however, would seem to be committed to the universal Man. In order to avoid this ontological commitment, Chrysippus understands the claim as conditional rather than existential in form (LS 301):

(A*) If something is a man, that thing is a rational mortal animal.

Although (A*), as one might argue, is identical in meaning to (A), (A*) does not make use of the abstract term Man, and if (A*) does not require the use of the abstract term Man, Chrysippus needn’t countenance any entity to which the term may refer in order to count (A*) to be true. And, since he takes the meaning of (A) and (A*) to be one and the same, he can take (A) to be true as well. Of course, there are propositions in addition to (A) that one might consider orthodox, which seem to make reference to abstract entities. Chrysippus is optimistic that a similar kind of paraphrase will apply, mutatis mutandis, to these.

Having argued that one need not posit the existence of universals to account for certain propositions of the orthodox variety, Chrysippus developed an argument in support of the claim that the Platonic account is unintelligible (LS 30 E):

Indeed, Chrysippus too raises problems as to whether the Idea will be called a certain this (tode ti)…Namely: if someone is in Athens, he is not in Megara; <but man is in Athens; therefore man is not in Megara>. For man is not someone, since the universal is not someone; but we took him as someone in the argument. And that is why the argument has this name, being called the ‘Not-someone’ argument.

The argument is a little impressionistic, but we might begin to reconstruct it as follows:

(1) If someone (or something) is in Athens, he (or it) is not in Megara.
(2) Man is in Athens.
So: (3) Man is not in Megara.

The argument seems to be a reductio ad absurdum built upon the idea that the Platonist will accept claims (1) and (2). The acceptance of (1) and (2), however, with an application of modus ponens, entails something the Platonist will not willingly accept. At this point, the Platonist might feel pushed to claim that Man is not to be classified by the terms, that is, someone or something, in the first premise. This, however, comes dangerously close to conceding a point to which the Platonist is not entitled, that is, Man is not something. Perhaps a more appealing move for the Platonist would be to give up (1) maintaining that universals are something, and that they are things that may be found in objects that are in different places, such as Athens and Megara, at the same time. This of course flushes out the very queer nature of universals. It seems that countenancing universals countenances entities that may be located simultaneously in a number of places, if they are located at all. It is, of course, open to the Platonist to argue that the properties in question do not exist in space, so it is a mistake to say that they are ‘located’ in a number of places at the same time.

c. The Growing Argument

As Long and Sedley argue, the fourfold ontological distinction seems to have emerged in response to Academic criticism, and Chrysippus seems to have been the central respondent (172). The Academics had made use of an argument known as the Growing Argument, which is traced to the fifth-century comic poet named Epicharmus, in order to generate skepticism with respect to the idea that an organism may endure, or exist diachronically (LS 28A). The argument begins by making use of an analogy that may be adapted to suit our purposes as follows: Suppose that I have a stack of pennies in my garage on Tuesday, and I remove one of the pennies on Wednesday. The resultant stack of pennies in my garage is, arguably, not the same stack that was in my garage on Tuesday, but a new stack altogether. To drive the idea home, we might suppose that the stack is not composed, as it were, of pennies, but rather of fifty dollar bills. Suppose I promise Helena, on Tuesday, that she may have the stack of bills provided that she cleans the garage. Were I to remove one of the fifties before giving her the stack, she can fairly protest that I did not give her the same stack I had promised her. The moral to glean, then, is that when it comes to stacks or lumps, the removal (or addition) of one (or more) of the constituents results in the destruction of the old (or the generation of a new) lump or stack. As the Academics were inclined to point out, on the Stoic view, people are composed entirely of matter. And who can deny that in, say, drinking a cup of coffee, or a glass of wine, there is an influx (and later there will be an efflux) of material constituents? What is more, work central to Stoic cosmology involved the study of natural processes. Where growth and decay are among the most fundamental of these, the Academics challenged the idea that any coherent account may here be provided (Sedley 1982, 258).

Those who are familiar with John Locke’s treatment of diachronic identity, in The Essay Concerning Human Understanding, will spot a certain resemblance in Chrysippus’s response. For Chrysippus seems to believe that a single object may, under different descriptions, have incompatible properties ascribed thereto (Sedley 1982, 259). Under the description ‘this lump of matter’, one’s identity may change continually from one moment to the next. Under the description ‘this person’, this need not be the case. For Chrysippus, moreover, a thing’s constitutive matter is a hupokeimenon. And, as a substrate, as Chrysippus concedes, things do not exist diachronically. Nevertheless, lumps of matter possess unique sets of qualities, and may be classified under the second category of the Stoic ontology. These items can be said to be idiõs poioi, ‘peculiarly qualifieds’, which can indeed endure through time. It is in this way that living things are said to exist diachronically.

Having disarmed the Growing Argument in this way, Chrysippus seems to have taken his case on the offensive. The text we have to work with derives from Philo of Alexandria:

Chrysippus, the most distinguished member of their school, in his work On the Growing Argument, creates a freak of the following kind. Having first established that it is possible for two peculiarly qualified individuals to occupy the same substance jointly, he says: “For the sake of argument, let one individual be thought of as whole-limbed; the other as minus one foot. Let the whole-limbed one be called Dion, the defective one Theon. Then let one of Dion’s feet be amputated…”

Chrysippus’ thought-experiment here bears a striking resemblance to a more modern puzzle that has been discussed by David Wiggins (1968): consider a cat by the name of Tibbles, and focus for the moment on that portion of Tibbles that includes everything save Tibbles tale. Designate the portion that includes everything but Tibbles’ tale with the name Tib. Tibbles and Tib, heretofore, do not occupy the same space at the same time. Thus, Tibbles and Tib are not identical. Suppose, however, that we amputate Tibbles’ tale. Tibbles and Tib now occupy the same region of space. If Tibbles is still a cat, it will be difficult to discern a criterion by which one could deny that Tib is a cat. Nevertheless, Tibbles and Tib are distinct individuals as they have had different histories. Hence, two cats occupy, and of course don’t occupy (so runs the reductio), the same place. And the puzzle, for Wiggins, is to see where the reasoning has gone astray. Chrysippus’ puzzle is very similar: Dion and Theon are said to occupy the same substance (rather than the same space), and where Dion has a right foot, Theon lacks one. However, when Dion’s foot is amputated, Chrysippus’s approach is not that of considering the conclusion to be validly derived from faulty premises. Instead, Chrysippus rejects the idea that Theon continues to exist. By all appearances, Chrysippus seems to have argued that after the amputation, when a bystander poses the question ‘Whose foot has been severed?’ the only intelligible answer is: that of Dion. As Theon could not have lost a foot he never had, and whoever is being bandaged there just lost a foot, Dion is the better candidate for survival. Of course, the paradox cannot be said to be constructed on Stoic premises. Sedley argues that Chrysippus turns the Growing Argument back on the Academics by showing that although they assume that material growth and diminution are fatal to the idea of an enduring entity, diminution is indeed part of the condition of Dion’s enduring, while the undiminished, unchanged Theon is the one who perishes (1982, 270).

d. Freedom and Necessity

Providence plays the leading role in Stoic cosmology. An instance of causation is, after all, the operation of the active principle, namely god, on that which is passive or inert. This providence gives way, for Chrysippus, to an outlook on the cosmos that is very deterministic (SVF2.1000).

In On Providence book 4, Chrysippus says that fate is a certain natural, everlasting ordering of the whole: one set of things follows and succeeds another, and the interconnection is inviolable.

Chrysippus’s deterministic outlook invites the following question: If the cosmos is ordered in a deterministic fashion, and we are part of it, then it would seem that we are ordered in a deterministic fashion. Thus, our actions will follow the one upon the other in a succession where the nexus is never violable. And, if we may never deviate from the course ordered by fate, we never have the control to act otherwise. Chrysippus’s cosmology seems not to accommodate the view that we are sometimes morally responsible for our actions. Stoic philosophy is, however, a philosophy of moral rigor, so Chrysippus owes an answer. Epicurean philosophers maintained a minimal level of indeterminacy at the atomic level (De Rerum Natura 2.216). As we have seen, this option is not open to Chrysippus and, in any case, the indeterminacy view is not without difficulty. (As Hume argued in the Enquiry, one has no control over that which lacks a cause). The acceptance of determinism taken with an optimistic outlook concerning the possibility of free action suggests that Chrysippus countenances the latter as compatible with the former. Precisely how the two are going to be understood as compatible has, however, been the subject of some controversy.

One possibility for understanding Chrysippus’ view can be derived from the following text (Long and Sedley 62a):

They too [Zeno and Chrysippus] affirmed that everything is fated, with the following model: When a dog is tied to a cart, if it wants to follow it is pulled and follows, making its spontaneous act coincide with necessity, but if it does not want to follow it will be compelled in any case. So it is with men too: even if they do not want to, they will be compelled in any case to follow what is destined.

This passage has been interpreted in a variety of ways. One might understand the point to be as follows. The threat that fatalism or determinism poses for moral responsibility seems to enter with the idea that if we cannot act otherwise than we do, we are not free. But consider our dog Maddy, fettered as she is to her master’s cart. We may assume that she follows her master, who is pulling the cart, with a canine sense of alacrity, and that she would do so, in much the same way, even were she not tied to the cart. It seems that she freely follows her master when, nevertheless, she could not act otherwise. Read in this way, the passage is a counterexample to the claim that if one cannot act otherwise, one is not free.

Susanne Bobzien has argued that the simile was authored by neither Zeno nor Chrysippus, but instead has its source in the Roman Stoa, and although she does acknowledge that it is probably an interpretation of a number of verses belonging to Cleanthes (2001 354), she finds nothing in the passage that meets up with accommodating free will. The point of the passage, on her view, is simply that it is in vain that one revolts from what fate administers.

Another important passage for exegetes is the following:

For although assent cannot occur unless it is prompted by an impression, nevertheless, since it has that impression as its proximate, not its primary cause, Chrysippus wants it to have the rationale which I mentioned just now. He does not want assent, at least, to be able to occur without the stimulus of external force (for assent must be prompted by an impression). But he resorts to his cylinder and spinning-top; these cannot begin to move without a push; but once that has happened, he holds that it is thereafter through their own nature that the cylinder rolls and the top spins. “Hence”, he says, “just as the person who pushed the cylinder gave it its beginning of motion but not its capacity for rolling, likewise, although the impression encountered will print and, as it were, emblazon its appearance on the mind, assent will be in our power. And assent, just as we said in the case of the cylinder, although prompted from will, thereafter moves through its own force”.

The above passage is important in connection with Chrysippus’s epistemology, meeting up as it does with the issue of whether belief or acceptance is a voluntary process. But it may also hold a clue concerning Chrysippus’ account of free action in general. The idea seems to be that the cylinder needs to be pushed in order to be set in motion, and considerations of the Newtonian variety aside, the power to keep rolling is specific to its nature. And, just as the power to keep rolling is found in the nature of the cylinder, so too the power of assent or dissent issues from the agent’s power. Of course, one might object that one may have little to no control over the dispositions that drive one to assent or dissent. And, if one doesn’t haven’t any control over certain antecedent events, that necessitate certain other events, then it seems that one doesn’t have any control over the subsequent events either. That this latter claim is true is not obvious however (McKay and Johnson 1996): Suppose with fair dice I roll snake eyes. From this fact, it follows by necessity that I am playing dice. But does it follow that my playing dice was not up to me? I certainly cannot roll snake eyes without playing dice. But is my playing dice necessitated in the same way? Perhaps the way is unimpeded for Chrysippus to make use of a similar kind of reasoning in his account of free action.

6. Ethics

a. Eudaimonism

As Aristotle observed, although most people agree that happiness or eudaimonia is the goal of life, there is less agreement concerning just what happiness consists in (1095a17). The Stoics agree with Aristotle that living a happy life is the end for the sake of which all things are done, and that such an activity is not done for the sake of anything else (SVF 3.16). Having set eudaimonia or eu zên as the summum bonum, the Stoics are left with the task of indicating wherein happiness consists. And, for the Stoics, happiness consists in nothing other than virtue (De Finibus III.44):

We deem health to be deserving of a certain value, but we do not reckon it a good; at the same time we rate no value so highly as to place it above virtue. This is not the same view held by the Peripatetics, who are going to say that an action which is both moral (honesta) and without pain is more desirable than the same action with pain. We [Stoics] think otherwise.

Nothing, then, is valued more than virtue. One might take happiness to be of a higher value than virtue, as the former may be thought of as a means to the latter, but this would be a mistake, for it seems that Chrysippus was willing to simply identify the virtuous life and the happy life (LS 63h):

He (Chrysippus) maintains that vice is the essence of unhappiness, insisting in every book that he writes on ethics and physics that living viciously is identical to living unhappily.

Of course, in identifying happiness with virtue, Chrysippus is obligated to provide an account of virtue. Zeno, the founder of the school at the Stoa Poikilê, had maintained that the final end of life was to live consistently and harmoniously (SVFI.79). His successor, Cleanthes, understood living consistently and harmoniously as living consistently with nature (SVF3.12.) Chrysippus further explicated the idea with the following formulation: “To live in accordance with one’s experience of the things which come about by nature” (SVF1.12). But it is still unclear what living in accordance with nature means. Friedrich Nietzsche, a German philosopher of the 19th century, attempted to parody the idea by saying:

According to nature you want to live? O you noble Stoics, what deceptive words these are…And supposing your imperative, “live according to nature,” meant at bottom as much as “living according to life”—how could you not do that? Why make a principle out of what you yourselves are and must be?

As we have seen, a fairly thorough determinism pervades the thought of the Stoics, and the course to understanding Chrysippus’s views on action is not without difficulty. So one might fairly ask the question: How might one not live according to nature? In order to understand what the Stoics have in mind, it is important to note that a recurring debate among philosophical circles in antiquity concerned whether what is good is so in virtue of nature (physis) or convention (nomos). As Aristotle points out in the Nichomachean Ethics, there is such diversity among opinions concerning which actions are just that they are thought to exist according to convention rather than nature (1094b 14-16). So, living in conformity with nature or with what is good by nature, contrasts with living in accordance with mere opinion, or convention, concerning what is good. Now, when Chrysippus says, one ought to live in accordance with one’s experience of the things which come about by nature, there is some reason for thinking the nature he has in mind is the nature that is peculiar to human beings. The proprium or nature of the human species is the use of reason. So it seems that Chrysippus’s position is that the man who lives in accordance with reason lives in accordance with virtue. This is a position that enjoys some popularity and that rings reminiscent of Aristotle’s in Nichomachean Ethics. The problem of accounting for free action has not, thus, been cleared away. Nor has the gap between is and ought, hereby, closed. But the imperative ‘live according to nature’ may simply mean ‘live according to your nature qua human being by living in accordance with reason’. And, as an account of virtue, this view had a few considerations in its favor.

b. The Unity of Virtues

Although Chrysippus’s views on virtue share a great deal with those of Aristotle, another clear similarity concerns the connections that Chrysippus and Socrates found to obtain among the cardinal virtues. Socrates seems to have thought that virtues may be reduced to one thing (Protagoras 361a-b), namely, the ability to discern good from evil. In the Laches, for example, it is suggested that courage may be reduced to wisdom by arguing that courage is the knowledge of what inspires fear and confidence. That which issues in fear is evil, and that which inspires confidence is good. Courage is, so the argument goes, knowledge of what is good and what is evil (198b2-199e). Arguments similar in kind reduce the other virtues to knowledge of good and evil, and Chrysippus seems to have retained Socrates’s line of thinking. Prudence is the knowledge of things to be chosen; temperance is the knowledge, for example, of that for which one must be resolute (SVF3.95). Of course, understanding virtue as reducible, in some way, to knowledge dovetails nicely with the idea that virtue is living in accordance with reason.

It is usually thought that the source of knowledge requisite for virtue, according to Chrysippus, is the study of natural philosophy. However, in recent years there has been some debate concerning whether or not the science of ethics, according to Zeno, Cleanthes, and Chrysippus depends crucially upon claims about the nature of the cosmos. A number of scholars now believe that the founders of Stoic philosophy should be freed from the interpretation that ethics relies heavily upon a cosmology that is un- controversially providential. Julia Annas, for example, finds it hard to swallow the idea that ethics depends on cosmology as it “…lead[s] us…to accept counterintuitive conclusions such as nothing but vice is bad, and that emotions like regret are all mistaken”. Accepting the view, however, that the Stoics did not think ethics inextricably tied to physics has its own set of counterintuitive consequences, for it becomes difficult to take them at their word in more than a few key texts. The following is taken from Plutarch (admittedly not the most charitable reader of Stoicism), from his De Stoicorum Repugnantis (LS 60A):

Again in his Physical Postulates, [Chrysippus] says, “There is no other or more appropriate way of approaching the theory of good and bad things or the virtues or happiness than from universal nature (koinê physis), and from the administration of the world”. And later: “For the theory of good and bad, things must be attached to these, since there in no other starting point or reference for them that is better, and physical speculation (physika theoria) is to be adopted for no other purpose than for the differentiation of good and bad things”.

A straightforward reading of Plutarch’s quotation here—this is the reading, for example, of Long and Sedley (LS 374)—is that an account of the rational and providential features of god, which pervade the natural world, are indispensable when determining that which constitutes the good of man. Read in this way, Chrysippus comes across as a teleologist countenancing the good, the scientific, and the pious lives to be one and the same.

7. References and Further Reading

The following lists some important work that has been done on Chrysippus and Stoic philosophy. It is not intended to be a comprehensive bibliography.

a. Primary Sources

  • Stoicorum Veterum Fragmenta in Four Volumes (Leipzig: Teubner, 1903-1924)
    • This work, originally compiled by Hans von Arnim, consisted of three volumes of testimonia and fragments intended for individuals with a background in Latin and Greek literature. Volume one contains fragments relating to Zeno and his followers. Volume two contains fragments related to Chrysippus’s logical and natural philosophy. The third volume contains material related to Chrysippus’s ethical philosophy. A fourth volume containing general indices was added in 1924 by Maximillian Adler.
  • The Hellenistic philosophers, Volume 1: Translations of the Principal Sources, with Philo-Sophical Commentary, by A.A. Long and D.N. Sedley, Cambridge: Cambridge University Press, 1987.
    • This work is probably indispensable for anyone interested in Hellenistic philosophy. It contains English translations and astute commentary. An accompanying volume contains the Greek and Latin texts from which the translations are derived.

b. Secondary Sources

  • Josiah B. Gould, The Philosophy of Chrysippus, Albany, SUNY Press, 1970.
    • This is one of the few monographs limited to discerning the thought of Chrysippus. It is a common starting point for exegetes interested in Chrysippus’s influence on Stoicism.
  • Susanne Bobzien, Determinism and Freedom in Stoic Philosophy, Oxford: Oxford University Press, 2001.
    • Although this work treats the issue of free action in Stoic thought in general, there is a great deal that focuses upon Chrysippus’s views specifically. A number of other articles by Bobzien, including the following, will also be rewarding:
  • Susanne Bobzien, ‘Chrysippus and the Epistemic Theory of Vagueness’, Proceedings of the Aristotelian Society 102 (2002), 217-38.
  • Susanne Bobzien,‘Chrysippus’s Theory of Causes’, Topics in Stoic Philosophy, ed. K. Ierodiakonou, Oxford: Oxford University Press, 1999, 196-242.
  • Susanne Bobzien, ‘Logic’, Cambridge Companion to the Stoics, ed. Brad Inwood, Cambridge: Cambridge University Press, 2003, 85-123.
  • J. Bowin, ‘Chrysippus’s Puzzle about Identity’, Oxford Studies in Ancient Philosophy 24 (2003), 239-251.
  • God and Cosmos in Stoicism, ed. Ricardo Salles Oxford: Oxford University Press, 2009.
    • Readers interested in questions concerning nature, and how humans fit therein, will find a number of interesting essays in God and Cosmos. Recent interest in how humans fit into the environment has issued an interest in the Stoic conception of how humans fit into nature. Several of the essays in this work treat primarily Chrysippus’s views.

c. Other References

  • Annas, J. ‘Ethics in Stoic Philosophy’, Phronesis, 52, 58-87.
  • Caston, V. ‘Something and Nothing: The Stoics on Concepts and Universals’, Oxford Studies in Ancient Philosophy 17, 145-213.
  • Grabowski, F. Plato, Metaphysics and the Forms. Continuum Studies in Ancient Philosophy, New York/London: Continuum, 2008.
  • McKay, Thomas and David Johnson (1996). “A Reconsideration of an Argument against Compatibilism”, Philosophical Topics 24: 113-122.
  • Nietzsche, F. Beyond Good and Evil, in Basic Writings of Nietzsche, tr. and ed. Walter Kaufmann, New York: Modern Library, 1992: 205.

Author Information

Jeremy Kirby
Email: jkirby@albion.edu
Albion College
U. S. A.

Immortality

Immortality is the indefinite continuation of a person’s existence, even after death. In common parlance, immortality is virtually indistinguishable from afterlife, but philosophically speaking, they are not identical. Afterlife is the continuation of existence after death, regardless of whether or not that continuation is indefinite. Immortality implies a never-ending existence, regardless of whether or not the body dies (as a matter of fact, some hypothetical medical technologies offer the prospect of a bodily immortality, but not an afterlife).

Immortality has been one of mankind’s major concerns, and even though it has been traditionally mainly confined to religious traditions, it is also important to philosophy. Although a wide variety of cultures have believed in some sort of immortality, such beliefs may be reduced to basically three non-exclusive models: (1) the survival of the astral body resembling the physical body; (2) the immortality of the immaterial soul (that is an incorporeal existence); (3) resurrection of the body (or re-embodiment, in case the resurrected person does not keep the same body as at the moment of death). This article examines philosophical arguments for and against the prospect of immortality.

A substantial part of the discussion on immortality touches upon the fundamental question in the philosophy of mind: do souls exist? Dualists believe souls do exist and survive the death of the body; materialists believe mental activity is nothing but cerebral activity and thus death brings the total end of a person’s existence. However, some immortalists believe that, even if immortal souls do not exist, immortality may still be achieved through resurrection.

Discussions on immortality are also intimately related to discussions of personal identity because any account of immortality must address how the dead person could be identical to the original person that once lived. Traditionally, philosophers have considered three main criteria for personal identity: the soul criterion , the body criterion  and the psychological criterion.

Although empirical science has little to offer here, the field of parapsychology has attempted to offer empirical evidence in favor of an afterlife. More recently, secular futurists envision technologies that may suspend death indefinitely (such as Strategies for Engineered Negligible Senescence, and mind uploading), thus offering a prospect for a sort of bodily immortality.

Table of Contents

  1. Semantic Problems
  2. Three Models of Immortality
    1. The Survival of the Astral Body
    2. The Immaterial Soul
    3. The Resurrection of the Body
  3. Pragmatic Arguments for the Belief in Immortality
  4. Plato’s Arguments for Immortality
  5. Dualism
    1. Descartes’ Arguments for Dualism
    2. More Recent Dualist Arguments
    3. Arguments against Dualism
  6. A Brief Digression: Criteria for Personal Identity
    1. The Soul Criterion
    2. The Body Criterion
    3. The Psychological Criterion
    4. The Bundle Theory
  7. Problems with the Resurrection of the Body
  8. Parapsychology
    1. Reincarnation
    2. Mediums and Ghostly Apparitions
    3. Electronic-Voice Phenomena
    4. Near-Death Experiences
    5. Extrasensory Perception
  9. The Technological Prospect of Immortality
    1. Cryonics
    2. Strategies for Engineered Negligible Senescence
    3. Mind Uploading
  10. References and Further Reading

1. Semantic Problems

Discourse on immortality bears a semantic difficulty concerning the word ‘death’. We usually define it in physiological terms as the cessation of biological functions that make life possible. But, if immortality is the continuation of life even after death, a contradiction appears to come up (Rosemberg, 1998). For apparently it makes no sense to say that someone has died and yet survived death. To be immortal is, precisely, not to suffer death. Thus, whoever dies, stops existing; nobody may exist after death, precisely because death means the end of existence.

For convenience, however, we may agree that ‘death’ simply means the decomposition of the body, but not necessarily the end of a person’s existence, as  assumed in most dictionary definitions. In such a manner, a person may ‘die’ in as much as their body no longer exists (or, to be more precise, no longer holds vital signs: pulse, brain activity, and so forth), but may continue to exist, either in an incorporeal state, with an ethereal body, or with some other physical body.

Some people may think of ‘immortality’ in vague and general terms, such as the continuity of a person’s deeds and memories among their friends and relatives. Thus, baseball player Babe Ruth is immortal in a very vague sense: he is well remembered among his fans. But, philosophically speaking, immortality implies the continuation of personal identity. Babe Ruth may be immortal in the sense that he is well remembered, but unless there is someone that may legitimately claim “I am Babe Ruth,” we shall presume Babe Ruth no longer exists and hence, is not immortal.

2. Three Models of Immortality

Despite the immense variety of beliefs on immortality, they may be reduced to three basic models: the survival of the astral body, the immaterial soul and resurrection (Flew, 2000). These models are not necessarily mutually exclusive; in fact, most religions have adhered to a combination of them.

a. The Survival of the Astral Body

Much primitive religious thought conceives that human beings are made up of two body substances: a physical body, susceptible of being touched, smelt, heard and seen; and an astral body made of some sort of mysterious ethereal substance. Unlike the physical body, the astral body has no solidity (it can go through walls, for example.) and hence, it cannot be touched, but it can be seen. Its appearance is similar to the physical body, except perhaps its color tonalities are lighter and its figure is fuzzier.

Upon death, the astral body detaches itself from the physical body, and mourns in some region within time and space. Thus, even if the physical body decomposes, the astral body survives. This is the type of immortality most commonly presented in films and literature (for example, Hamlet’s ghost). Traditionally, philosophers and theologians have not privileged this model of immortality, as there appears to be two insurmountable difficulties: 1) if the astral body does exist, it should be seen depart from the physical body at the moment of death; yet there is no evidence that accounts for it; 2) ghosts usually appear with clothes; this would imply that, not only are there astral bodies, but also astral clothes – a claim simply too extravagant to be taken seriously (Edwards, 1997: 21).

b. The Immaterial Soul

The model of the immortality of the soul is similar to the ‘astral body’ model, in as much as it considers that human beings are made up of two substances. But, unlike the ‘astral body’ model, this model conceives that the substance that survives the death of the body is not a body of some other sort, but rather, an immaterial soul. In as much as the soul is immaterial, it has no extension, and thus, it cannot be perceived through the senses. A few philosophers, such as Henry More, have come to believe that for something to exist, it must occupy space (although not necessarily physical space), and hence, souls are located somewhere in space (Henry, 2007). Up until the twentieth century, the majority of philosophers believed that persons are souls, and that human beings are made up of two substances (soul and body). A good portion of philosophers believed that the body is mortal and the soul is immortal. Ever since Descartes in the seventeenth century, most philosophers have considered that the soul is identical to the mind, and, whenever a person dies, their mental contents survive in an incorporeal state.

Eastern religions (for example, Hinduism and Buddhism) and some ancient philosophers (for example, Pythagoras and Plato) believed that immortal souls abandon the body upon death, may exist temporarily in an incorporeal state, and may eventually adhere to a new body at the time of birth (in some traditions, at the time of fertilization). This is the doctrine of reincarnation.

c. The Resurrection of the Body

Whereas most Greek philosophers believed that immortality implies solely the survival of the soul, the three great monotheistic religions (Judaism, Christianity and Islam) consider that immortality is achieved through the resurrection of the body at the time of the Final Judgment. The very same bodies that once constituted persons shall rise again, in order to be judged by God. None of these great faiths has a definite position on the existence of an immortal soul. Therefore, traditionally, Jews, Christians and Muslims have believed that, at the time of death, the soul detaches from the body and continues on to exist in an intermediate incorporeal state until the moment of resurrection. Some others, however, believe that there is no intermediate state: with death, the person ceases to exist, and in a sense, resumes existence at the time of resurrection.

As we shall see, some philosophers and theologians have postulated the possibility that, upon resurrection, persons do not rise with the very same bodies with which they once lived (rather, resurrected persons would be constituted by a replica). This version of the doctrine of the resurrection would be better referred to as ‘re-embodiment’: the person dies, but, as it were, is ‘re-embodied’.

3. Pragmatic Arguments for the Belief in Immortality

Most religions adhere to the belief in immortality on the basis of faith. In other words, they provide no proof of the survival of the person after the death of the body; actually, their belief in immortality appeals to some sort of divine revelation that, allegedly, does not require rationalization.

Natural theology, however, attempts to provide rational proofs of God’s existence. Some philosophers have argued that, if we can rationally prove that God exists, then we may infer that we are immortal. For, God, being omnibenevolent, cares about us, and thus would not allow the annihilation of our existence; and being just, would bring about a Final Judgement (Swinburne, 1997). Thus, the traditional arguments in favor of the existence of God (ontological, cosmological, teleological) would indirectly prove our immortality. However, these traditional arguments have been notoriously criticized, and some arguments against the existence of God have also been raised (such as the problem of evil) (Martin, 1992; Smith, 1999).

Nevertheless, some philosophers have indeed tried to rationalize the doctrine of immortality, and have come up with a few pragmatic arguments in its favor.

Blaise Pascal proposed a famous argument in favor of the belief in the existence of God, but it may well be extended to the belief in immortality (Pascal, 2005). The so-called ‘Pascal’s Wager’ argument goes roughly as follows: if we are to decide to believe whether God exists or not, it is wiser to believe that God does exist. If we rightly believe that God exists, , we gain eternal bliss; if God does not exist, we lose nothing, in as much as there is no Final Judgment to account for our error. On the other hand, if we rightly believe God does not exist, we gain nothing, in as much as there is no Final Judgment to reward our belief. But, if we wrongly believe that God does not exist, we lose eternal bliss, and are therefore damned to everlasting Hell. By a calculation of risks and benefits, we should conclude that it is better to believe in God’s existence. This argument is easily extensible to the belief in immortality: it is better to believe that there is a life after death, because if in fact there is a life after death, we shall be rewarded for our faith, and yet lose nothing if we are wrong; on the other hand, if we do not believe in a life after death, and we are wrong, we will be punished by God, and if we are right, there will not be a Final Judgment to reward our belief.

Although this argument has remained popular among some believers, philosophers have identified too many problems in it (Martin, 1992). Pascal’s Wager does not take into account the risk of believing in a false god (What if Baal were the real God, instead of the Christian God?), or the risk of believing in the wrong model of immortality (what if God rewarded belief in reincarnation, and punished belief in resurrection?). The argument also assumes that we are able to choose our beliefs, something most philosophers think very doubtful.

Other philosophers have appealed to other pragmatic benefits of the belief in immortality. Immanuel Kant famously rejected in his Critique of Pure Reason the traditional arguments in favor of the existence of God; but in his Critique of Practical Reason he put forth a so-called ‘moral argument’. The argument goes roughly as follows: belief in God and immortality is a prerequisite for moral action; if people do not believe there is a Final Judgment administered by God to account for deeds, there will be no motivation to be good. In Kant’s opinion, human beings seek happiness. But in order for happiness to coincide with moral action, the belief in an afterlife is necessary, because moral action does not guarantee happiness. Thus, the only way that a person may be moral and yet preserve happiness, is by believing that there will be an afterlife justice that will square morality with happiness. Perhaps Kant’s argument is more eloquently expressed in Ivan Karamazov’s (a character from Dostoevsky’s The Brothers Karamazov) famous phrase: “If there is no God, then everything is permitted… if there is no immortality, there is no virtue”.

The so-called ‘moral argument’ has been subject to some criticism. Many philosophers have argued that it is indeed possible to construe secular ethics, where appeal to God is unnecessary to justify morality. The question “why be moral?” may be answered by appealing to morality itself, to the need for cooperation, or simply, to one’s own pleasure (Singer, 1995; Martin, 1992). A vigilant God does not seem to be a prime need in order for man to be good. If these philosophers are right, the lack of belief in immortality would not bring about the collapse of morality. Some contemporary philosophers, however, align with Kant and believe that secular morality is shallow, as it does not satisfactorily account for acts of sacrifice that go against self-interest; in their view, the only way to account for such acts is by appealing to a Divine Judge (Mavrodes, 1995).

Yet another pragmatic argument in favor of the belief in immortality appeals to the need to find meaning in life. Perhaps Miguel de Unamuno’s Del sentimiento tràgico de la vida is the most emblematic philosophical treatise advocating this argument: in Unamuno’s opinion, belief in immortality is irrational, but nevertheless necessary to avoid desperation in the face of life’s absurdity. Only by believing that our lives will have an ever-lasting effect, do we find motivation to continue to live. If, on the contrary, we believe that everything will ultimately come to an end and nothing will survive, it becomes pointless to carry on any activity.

Of course, not all philosophers would agree. Some philosophers would argue that, on the contrary, the awareness that life is temporal and finite makes living more meaningful, in as much as we better appreciate opportunities (Heidegger, 1978). Bernard Williams has argued that, should life continue indefinitely, it would be terribly boring, and therefore, pointless (Williams, 1976). Some philosophers, however, counter that some activities may be endlessly repeated without ever becoming boring; furthermore, a good God would ensure that we never become bored in Heaven (Fischer, 2009).

Death strikes fear and anguish in many of us, and some philosophers argue that the belief in immortality is a much needed resource to cope with that fear. But, Epicurus famously argued that it is not rational to fear death, for two main reasons: 1) in as much as death is the extinction of consciousness, we are not aware of our condition (“if death is, I am not; if I am, death is not”); 2) in the same manner that we do not worry about the time that has passed before we were born, we should not worry about the time that will pass after we die (Rist, 1972).

At any rate, pragmatic arguments in favor of the belief in immortality are also critiqued on the grounds that the pragmatic benefits of a belief bear no implications on its truth. In other words, the fact that a belief is beneficial does not make it true. In the analytic tradition, philosophers have long argued for and against the pragmatic theory of truth, and depending on how this theory is valued, it will offer a greater or lesser plausibility to the arguments presented above.

4. Plato’s Arguments for Immortality

Plato was the first philosopher to argue, not merely in favor of the convenience of accepting the belief in immortality, but for the truth of the belief itself. His Phaedo is a dramatic representation of Socrates’ final discussion with his disciples, just before drinking the hemlock. Socrates shows no sign of fear or concern, for he is certain that he will survive the death of his body. He presents three main arguments to support his position, and some of these arguments are still in use today.

First, Socrates appeals to cycles and opposites. He believes that everything  has an opposite that is implied by it. And, as in cycles, things not only come from opposites, but also go towards opposites. Thus, when something is hot, it was previously cold; or when we are awake, we were previously asleep; but when we are asleep, we shall be awake once again. In the same manner, life and death are opposites in a cycle. Being alive is opposite to being dead. And, in as much as death comes from life, life must come from death. We come from death, and we go towards death. But, again, in as much as death comes from life, it will also go towards life. Thus, we had a life before being born, and we shall have a life after we die.

Most philosophers have not been persuaded by this argument. It is very doubtful that everything has an opposite (What is the opposite of a computer?) And, even if everything had an opposite, it is doubtful that everything comes from its opposite, or even that everything goes towards its opposite.

Socrates also appeals to the theory of reminiscence, the view that learning is really a process of ‘remembering’ knowledge from past lives. The soul must already exist before the birth of the body, because we seem to know things that were not available to us. Consider the knowledge of equality. If we compare two sticks and we realize they are not equal, we form a judgment on the basis of a previous knowledge of ‘equality’ as a form. That knowledge must come from previous lives. Therefore, this is an argument in favor of the transmigration of souls (that is, reincarnation or metempsychosis).

Some philosophers would dispute the existence of the Platonic forms, upon which this argument rests. And, the existence of innate ideas does not require the appeal to previous lives. Perhaps we are hard-wired by our brains to believe certain things; thus, we may know things that were not available to us previously.

Yet another of Socrates’ arguments appeals to the affinity between the soul and the forms. In Plato’s understanding, forms are perfect, immaterial and eternal. And, in as much as the forms are intelligible, but not sensible, only the soul can apprehend them. In order to apprehend something, the thing apprehending must have the same nature as the thing apprehended. The soul, then, shares the attributes of the forms: it is immaterial and eternal, and hence, immortal.

Again, the existence of the Platonic forms should not be taken for granted, and for this reason, this is not a compelling argument. Furthermore, it is doubtful that the thing apprehending must have the same nature as the thing apprehended: a criminologist need not be a criminal in order to apprehend the nature of crime.

5. Dualism

Plato’s arguments take for granted that souls exist; he only attempts to prove that they are immortal. But, a major area of discussion in the philosophy of mind is the existence of the soul. One of the doctrines that hold that the soul does exist is called ‘dualism’; its name comes from the fact that it postulates that human beings are made up of two substances: body and soul. Arguments in favor of dualism are indirectly arguments in favor of immortality, or at least in favor of the possibility of survival of death. For, if the soul exists, it is an immaterial substance. And, in as much as it is an immaterial substance, it is not subject to the decomposition of material things; hence, it is immortal.

Most dualists agree that the soul is identical to the mind, yet different from the brain or its functions. Some dualists believe the mind may be some sort of emergent property of the brain: it depends on the brain, but it is not identical to the brain or its processes. This position is often labeled ‘property dualism’, but here we are concerned with substance dualism, that is, the doctrine that holds that the mind is a separate substance (and not merely a separate property) from the body, and therefore, may survive the death of the body (Swinburne, 1997).

a. Descartes’ Arguments for Dualism

René Descartes is usually considered the father of dualism, as he presents some very ingenuous arguments in favor of the existence of the soul as a separate substance (Descartes, 1980). In perhaps his most celebrated argument, Descartes invites a thought experiment: imagine you exist, but not your body. You wake up in the morning, but as you approach the mirror, you do not see yourself there. You try to reach your face with your hand, but it is thin air. You try to scream, but no sound comes out. And so on.

Now, Descartes believes that it is indeed possible to imagine such a scenario. But, if one can imagine the existence of a person without the existence of the body, then persons are not constituted by their bodies, and hence, mind and body are two different substances. If the mind were identical to the body, it would be impossible to imagine the existence of the mind without imagining at the same time the existence of the body.

This argument has been subject to much scrutiny. Dualists certainly believe it is a valid one, but it is not without its critics. Descartes seems to assume that everything that is imaginable is possible. Indeed, many philosophers have long agreed that imagination is a good guide as to what is possible (Hume, 2010). But, this criterion is disputed. Imagination seems to be a psychological process, and thus not strictly a logical process. Therefore, perhaps we can imagine scenarios that are not really possible. Consider the Barber Paradox. At first, it seems possible that, in a town, a man shaves only those persons that shave themselves. We may perhaps imagine such a situation, but logically there cannot be such a situation, as Bertrand Russell showed. The lesson to be learned is that imagination might not be a good guide to possibility. And, although Descartes appears to have no trouble imagining an incorporeal mind, such a scenario might not be possible. However, dualists may argue that there is no neat difference between a psychological and a logical process, as logic seems to be itself a psychological process.

Descartes presents another argument. As Leibniz would later formalize in the Principle of Identity of Indiscernibles, two entities can be considered identical, if and only if, they exhaustively share the same attributes. Descartes exploits this principle, and attempts to find a property of the mind not shared by the body (or vice versa), in order to argue that they are not identical, and hence, are separate substances.

Descartes states: “There is a great difference between a mind and a body, because the body, by its very nature, is something divisible, whereas the mind is plainly indivisible. . . insofar as I am only a thing that thinks, I cannot distinguish any parts in me. . . . Although the whole mind seems to be united to the whole body, nevertheless, were a foot or an arm or any other bodily part amputated, I know that nothing would be taken away from the mind” (Descartes, 1980: 97).

Descartes believed, then, that mind and body cannot be the same substance. Descartes put forth another similar argument: the body has extension in space, and as such, it can be attributed physical properties. We may ask, for instance, what the weight of a hand is, or what the longitude of a leg is. But the mind has no extension, and therefore, it has no physical properties. It makes no sense to ask what the color of the desire to eat strawberries is, or what the weight of Communist ideology is. If the body has extension, and the mind has no extension, then the mind can be considered a separate substance.

Yet another of Descartes’ arguments appeals to some difference between mind and body. Descartes famously contemplated the possibility that an evil demon might be deceiving him about the world. Perhaps the world is not real. In as much as that possibility exists, Descartes believed that one may be doubt the existence of one’s own body. But, Descartes argued that one cannot doubt the existence of one’s own mind. For, if one doubts, one is thinking; and if one thinks, then it can be taken for certain that one’s mind exists. Hence Descartes famous phrase: “cogito ergo sum”, I think, therefore, I exist. Now, if one may doubt the existence of one’s body, but cannot doubt the existence of one’s mind, then mind and body are different substances. For, again, they do not share exhaustively the same attributes.

These arguments are not without critics. Indeed, Leibniz’s Principle of Indiscernibles would lead us to think that, in as much as mind and body do not exhaustively share the same properties, they cannot be the same substance. But, in some contexts, it seems possible that A and B may be identical, even if that does not imply that everything predicated of A can be predicated of B.

Consider, for example, a masked man that robs a bank. If we were to ask a witness whether or not the masked man robbed the bank, the witness will answer “yes!”. But, if we were to ask the witness whether his father robbed the bank, he may answer “no”. That, however, does not imply that the witness’ father is not the bank robber: perhaps the masked man was the witness’ father, and the witness was not aware of it. This is the so-called ‘Masked Man Fallacy’.

This case forces us to reconsider Leibniz’s Law: A is identical to B, not if everything predicated of A is predicated of B, but rather, when A and B share exhaustively the same properties. And, what people believe about substances are not properties. To be an object of doubt is not, strictly speaking, a property, but rather, an intentional relation. And, in our case, to be able to doubt the body’s existence, but not the mind’s existence, does not imply that mind and body are not the same substance.

b. More Recent Dualist Arguments

In more recent times, Descartes’ strategy has been used by other dualist philosophers to account for the difference between mind and body. Some philosophers argue that the mind is private, whereas the body is not. Any person may know the state of my body, but no person, including even possibly myself, can truly know the state of my mind.

Some philosophers point ‘intentionality’ as another difference between mind and body. The mind has intentionality, whereas the body does not. Thoughts are about something, whereas body parts are not. In as much as thoughts have intentionality, they may also have truth values. Not all thoughts, of course, are true or false, but at least those thoughts that pretend to represent the world, may be. On the other hand, physical states do not have truth values: neurons activating in the brain are neither ‘true’, nor ‘false’.

Again, these arguments exploit the differences between mind and body. But, very much as with Descartes’ arguments, it is not absolutely clear that they avoid the Masked Man Fallacy.

c. Arguments against Dualism

Opponents of dualism not only reject their arguments; they also highlight conceptual and empirical problems with this doctrine. Most opponents of dualism are materialists: they believe that mental stuff is really identical to the brain, or at the most, an epiphenomenon of the brain. Materialism limits the prospects for immortality: if the mind is not a separate substance from the brain, then at the time of the brain’s death, the mind also becomes extinct, and hence, the person does not survive death. Materialism need not undermine all expectations of immortality (see resurrection below), but it does undermine the immortality of the soul.

The main difficulty with dualism is the so-called ‘interaction problem’. If the mind is an immaterial substance, how can it interact with material substances? The desire to move my hand allegedly moves my hand, but how exactly does that occur? There seems to be an inconsistency with the mind’s immateriality: some of the time, the mind is immaterial and is not affected by material states, at other times, the mind manages to be in contact with the body and cause its movement. Daniel Dennett has ridiculed this inconsistency by appealing to the comic-strip character Casper. This friendly ghost is immaterial because he is able to go through walls. But, all of a sudden, he is also able to catch a ball. The same inconsistency appears with dualism: in its interaction with the body, sometimes the mind does not interact with the body, sometimes it does (Dennett, 1992).Dualists have offered some solutions to this problem. Occasionalists hold that God directly causes material events. Thus, mind and body never interact. Likewise, parallelists hold that mental and physical events are coordinated by God so that they appear to cause each other, but in fact, they do not. These alternatives are in fact rejected by most contemporary philosophers.

Some dualists, however, may reply that the fact that we cannot fully explain how body and soul interact, does not imply that interaction does not take place. We know many things happen in the universe, although we do not know how they happen. Richard Swinburne, for instance, argues as follows: “That bodily events cause brain events and that these cause pains, images, and beliefs (where their subjects have privileged access to the latter and not the former), is one of the most obvious phenomena of human experience. If we cannot explain how that occurs, we should not try to pretend that it does not occur. We should just acknowledge that human beings are not omniscient, and cannot understand everything” (Swinburne, 1997, xii).

On the other hand, Dualism postulates the existence of an incorporeal mind, but it is not clear that this is a coherent concept. In the opinion of most dualists, the incorporeal mind does perceive. But, it is not clear how the mind can perceive without sensory organs. Descartes seemed to have no problems in imagining an incorporeal existence, in his thought experiment. However, John Hospers, for instance, believes that such a scenario is simply not imaginable:

You see with eyes? No, you have no eyes, since you have no body. But let that pass for a moment; you have experiences similar to what you would have if you had eyes to see with. But how can you look toward the foot of the bed or toward the mirror? Isn’t looking an activity that requires having a body? How can you look in one direction or another if you have no head to turn? And this isn’t all; we said that you can’t touch your body because there is no body there; how did you discover this?… Your body seems to be involved in every activity we try to describe even though we have tried to imagine existing without it. (Hospers, 1997: 280)

Furthermore, even if an incorporeal existence were in fact possible, it could be terribly lonely. For, without a body, could it be possible to communicate with other minds. In Paul Edward’s words: “so far from living on in paradise, a person deprived of his body and thus of all sense organs would, quite aside from many other gruesome deprivations, be in a state of desolate loneliness and eventually come to prefer annihilation”. (Edwards, 1997:48). However, consider that, even in the absence of a body, great pleasures may be attained. We may live in a situation the material world is an illusion (in fact, idealists inspired in Berkley lean towards such a position), and yet, enjoy existence. For, even without a body, we may enjoy sensual pleasures that, although not real, certainly feel real. However, the problems with dualism do not end there. If souls are immaterial and have no spatial extension, how can they be separate from other souls? Separation implies extension. Yet, if the soul has no extension, it is not at all clear how one soul can be distinguished from another. Perhaps souls can be distinguished based on their contents, but then again, how could we distinguish two souls with exactly the same contents? Some contemporary dualists have responded thus: in as much as souls interact with bodies, they have a spatial relationships to bodies, and in a sense, can be individuated.

Perhaps the most serious objection to dualism, and a substantial argument in favor of materialism, is the mind’s correlation with the brain. Recent developments in neuroscience increasingly confirm that mental states depend upon brain states. Neurologists have been able to identify certain regions of the brain associated with specific mental dispositions. And, in as much as there appears to be a strong correlation between mind and brain, it seems that the mind may be reducible to the brain, and would therefore not be a separate substance.

In the last recent decades, neuroscience has accumulated data that confirm that cerebral damage has a great influence on the mental constitution of persons. Phineas Gage’s case is well-known in this respect: Gage had been a responsible and kind railroad worker, but had an accident that resulted in damage to the frontal lobes of his brain. Ever since, Gage turned into an aggressive, irresponsible person, unrecognizable by his peers (Damasio, 2006).

Departing from Gage’s case, scientists have inferred that frontal regions of the brain strongly determine personality. And, if mental contents can be severely damaged by brain injuries, it does not seem right to postulate that the mind is an immaterial substance. If, as dualism postulates, Gage had an immortal immaterial soul, why didn’t his soul remain intact after his brain injury?

A similar difficulty arises when we consider degenerative neurological diseases, such as Alzheimer’s disease. As it is widely known, this disease progressively eradicates the mental contents of patients, until patients lose memory almost completely. If most memories eventually disappear, what remains of the soul? When a patient afflicted with Alzheimer dies, what is it that survives, if precisely, most of his memories have already been lost? Of course, correlation is not identity, and the fact that the brain is empirically correlated with the mind does not imply that the mind is the brain. But, many contemporary philosophers of mind adhere to the so-called ‘identity theory’: mental states are the exact same thing as the firing of specific neurons.

Dualists may respond by claiming that the brain is solely an instrument of the soul. If the brain does not work properly, the soul will not work properly, but brain damage does not imply a degeneration of the soul. Consider, for example, a violinist. If the violin does not play accurately, the violinist will not perform well. But, that does not imply that the violinist has lost their talent. In the same manner, a person may have a deficient brain, and yet, retain her soul intact. However, Occam’s Razor requires the more parsimonious alternative: in which case, unless there is any compelling evidence in its favor, there is no need to assume the existence of a soul that uses the brain as its instrument.

Dualists may also suggest that the mind is not identical to the soul. In fact, whereas many philosophers tend to consider the soul and mind identical, various religions consider that a person is actually made up of by three substances: body, mind and soul. In such a view, even if the mind degenerates, the soul remains. However, it would be far from clear what the soul exactly could be, if it is not identical to the mind.

6. A Brief Digression: Criteria for Personal Identity

Any philosophical discussion on immortality touches upon a fundamental issue concerning persons–personal identity. If we hope to survive death, we would want to be sure that the person that continues to exist after death is the same person that existed before death. And, for religions that postulate a Final Judgment, this is a crucial matter: if God wants to apply justice, the person rewarded or punished in the afterlife must be the very same person whose deeds determine the outcome.

The question of personal identity refers to the criterion upon which a person remains the same (that is, numerical identity) throughout time. Traditionally, philosophers have discussed three main criteria: soul, body and psychological continuity.

a. The Soul Criterion

According to the soul criterion for personal identity, persons remains the same throughout time, if and only if, they retain their soul (Swinburne, 2004). Philosophers who adhere to this criterion usually do not think the soul is identical to the mind. The soul criterion is favored by very few philosophers, as it faces a huge difficulty: if the soul is an immaterial non-apprehensible substance (precisely, in as much as it is not identical to the mind), how can we be sure that a person continues to be the same? We simply do not know if, in the middle of the night, our neighbor’s soul has transferred into another body. Even if our neighbor’s body and mental contents remain the same, we can never know if his soul is the same. Under this criterion, it appears that there is simply no way to make sure someone is always the same person.

However, there is a considerable argument in favor of the soul criterion. To pursue such an argument, Richard Swinburne proposes the following thought experiment: suppose John’s brain is successfully split in two, and as a result, we get two persons; one with the left hemisphere of John’s brain, the other with the right hemisphere. Now, which one is John? Both have a part of John’s brain, and both conserve part of John’s mental contents.  So, one of them must presumably be John, but which one? Unlike the body and the mind, the soul is neither divisible nor duplicable. Thus, although we do not know which would be John, we do know that only one of the two persons is John. And it would be the person that preserves John’s souls, even if we have no way of identifying it. In such a manner, although we know about John’s body and mind, we are not able to discern who is John; therefore, John’s identity is not his mind or his body, but rather, his soul (Swinburne, 2010: 68).

b. The Body Criterion

Common sense informs that persons are their bodies (in fact, that is how we recognize people ) but, although many philosophers would dispute this,  ordinary people seem generally to adhere to such a view). Thus, under this criterion, a person continues to be the same, if, and only if, they conserve the same body. Of course, the body alters, and eventually, all of its cells are replaced. This evokes the ancient philosophical riddle known as the Ship of Theseus: the planks of Theseus’ ship were gradually replaced, until none of the originals remained. Is it still the same ship? There has been much discussion on this, but most philosophers agree that, in the case of the human body, the total replacement of atoms and the slight alteration of form do not alter the numerical identity of the human body.

However, the body criterion soon runs into difficulties. Imagine two patients, Brown and Robinson, who undergo surgery simultaneously. Accidentally, their brains are swapped in placed in the wrong body. Thus, Brown’s brain is placed in Robinson’s body. Let us call this person Brownson. Naturally, in as much as he has Brown’s brain, he will have Brown’s memories, mental contents, and so forth. Now, who is Brownson? Is he Robinson with Brown’s brain; or is he Brown with Robinson’s body? Most people would think the latter (Shoemaker, 2003). After all, the brain is the seat of consciousness.

Thus, it would appear that the body criterion must give way to the brain criterion: a person continues to be the same, if and only if, she conserves the same brain. But, again, we run into difficulties. What if the brain undergoes fission, and each half is placed in a new body? (Parfit, 1984). As a result, we would have two persons pretending to be the original person, but, because of the principle of transitivity, we know that both of them cannot be the original person. And, it seems arbitrary that one of them should be the original person, and not the other (although, as we have seen, Swinburne bites the bullet, and considers that, indeed, only one would be the original person). This difficulty invites the consideration of other criteria for personal identity.

c. The Psychological Criterion

John Locke famously asked what we would think if a prince one day woke up in a cobbler’s body, and the cobbler in a prince’s body (Locke, 2009). Although the cobbler’s peers would recognize him as the cobbler, he would have the memories of the prince. Now, if before that event, the prince committed a crime, who should be punished? Should it be the man in the palace, who remembers being a cobbler; or should it be the man in the workshop, who remembers being a prince, including his memory of the crime?

It seems that the man in the workshop should be punished for the prince’s crime, because, even if that is not the prince’s original body, that person is the prince, in as much as he conserves his memories. Locke, therefore, believed that a person continues to be the same, if and only if, she conserves psychological continuity.

Although it appears to be an improvement with regards to the previous two criteria, the psychological criterion also faces some problems. Suppose someone claims today to be Guy Fawkes, and conserves intact very vividly and accurately the memories of the seventeenth century conspirator (Williams, 1976). By the psychological criterion, such a person would indeed be Guy Fawkes. But, what if, simultaneously, another person also claims to be Guy Fawkes, even with the same degree of accuracy? Obviously, both persons cannot be Guy Fawkes. Again, it would seem arbitrary to conclude that one person is Guy Fawkes, yet the other person isn’t. It seems more plausible that neither person is Guy Fawkes, and therefore, that psychological continuity is not a good criterion for personal identity.

d. The Bundle Theory

In virtue of the difficulties with the above criteria, some philosophers have argued that, in a sense, persons do not exist. Or, to be more precise, the self does not endure changes. In David Hume’s words, a person is “nothing but a bundle or collection of different perceptions, which succeed each other with an inconceivable rapidity, and are in a perpetual flux and movement” (Hume, 2010: 178). This is the so-called ‘bundle theory of the self’.

As a corollary, Derek Parfit argues that, when considering survival, personal identity is not what truly matters (Parfit, 1984). What does matter is psychological continuity. Parfit asks us to consider this example.

Suppose that you enter a cubicle in which, when you press a button, a scanner records the states of all the cells in your brain and body, destroying both while doing so. This information is then transmitted at the speed of light to some other planet, where a replicator produces a perfect organic copy of you. Since the brain of your replica is exactly like yours, it will seem to remember living your life up to the moment when you pressed the button, its character will be just like yours, it will be every other way psychologically continuous with you. (Parfit, 1997: 311)

Now, under the psychological criterion, such a replica will in fact be you. But, what if the machine does not destroy the original body, or makes more than one replica? In such a case, there will be two persons claiming to be you. As we have seen, this is a major problem for the psychological criterion. But, Parfit argues that, even if the person replicated is not the same person that entered the cubicle, it is psychologically continuous. And, that is what is indeed relevant.

Parfit’s position has an important implication for discussions of immortality. According to this view, a person in the afterlife is not the same person that lived before. But, that should not concern us. We should be concerned about the prospect that, in the afterlife, there will at least be one person that is psychologically continuous with us.

7. Problems with the Resurrection of the Body

As we have seen, the doctrine of resurrection postulates that on Judgment Day the bodies of every person who ever lived shall rise again, in order to be judged by God. Unlike the doctrine of the immortality of the soul, the doctrine of resurrection has not been traditionally defended with philosophical arguments. Most of its adherents accept it on the basis of faith. Some Christians, however, consider that the resurrection of Jesus can be historically demonstrated (Habermas, 2002; Craig, 2008). And, so the argument goes, if it can be proven that God resurrected Jesus from the dead, then we can expect that God will do the same with every human being who has ever lived.

Nevertheless, the doctrine of resurrection runs into some philosophical problems derived from considerations on personal identity; that is, how is the person resurrected identical to the person that once lived? If we were to accept dualism and the soul criterion for personal identity, then there is not much of a problem: upon the moment of death, soul and body split, the soul remains incorporeal until the moment of resurrection, and the soul becomes attached to the new resurrected body. In as much as a person is the same, if and only if, she conserves the same soul, then we may legitimately claim that the resurrected person is identical to the person that once lived.

But, if we reject dualism, or the soul criterion for personal identity, then we must face some difficulties. According to the most popular one conception of resurrection, we shall be raised with the same bodies with which we once lived. Suppose that the resurrected body is in fact made of the very same cells that made up the original body, and also, the resurrected body has the same form as the original body. Are they identical?

Peter Van Inwagen thinks not (Van Inwagen, 1997). If, for example, an original manuscript written by Augustine is destroyed, and then, God miraculously recreates a manuscript with the same atoms that made up Augustine’s original manuscript, we should not consider it the very same manuscript. It seems that, between Augustine’s original manuscript, and the manuscript recreated by God, there is no spatio-temporal continuity. And, if such continuity is lacking, then we cannot legitimately claim that the recreated object is the same original object. For the same reason, it appears that the resurrected body cannot be identical to the original body. At most, the resurrected body would be a replica.

However, our intuitions are not absolutely clear. Consider, for example, the following case: a bicycle is exhibited in a store, and a customer buys it. In order to take it home, the customer dismantles the bicycle, puts its pieces in a box, takes it home, and once there, reassembles the pieces. Is it the same bicycle? It certainly seems so, even if there is no spatio-temporal continuity.

Nevertheless, there is room to doubt that the resurrected body would be made up of the original body’s same atoms. We know that matter recycles itself, and that due to metabolism, the atoms that once constituted the human body of a person may later constitute the body of another person. How could God resurrect bodies that shared the same atoms? Consider the case of cannibalism, as ridiculed by Voltaire:

A soldier from Brittany goes into Canada; there, by a very common chance, he finds himself short of food, and is forced to eat an Iroquis whom he killed the day before. The Iroquis had fed on Jesuits for two or three months; a great part of his body had become Jesuit. Here, then, the body of a soldier is composed of Iroquis, of Jesuits, and of all that he had eaten before. How is each to take again precisely what belongs to him? And which part belongs to each? (Voltaire, 1997: 147)

However, perhaps, in the resurrection, God needn’t resurrect the body. If we accept the body criterion for personal identity, then, indeed, the resurrected body must be the same original body. But, if we accept the psychological criterion, perhaps God only needs to recreate a person psychologically continuous with the original person, regardless of whether or not that person has the same body. John Hick believes this is how God could indeed proceed (Hick, 1994).

Hick invites a thought experiment. Suppose a man disappears in London, and suddenly someone with his same looks and personality appears in New York. It seems reasonable to consider that the person that disappeared in London is the same person that appeared in New York. Now, suppose that a man dies in London, and suddenly appears in New York with the same looks and personality. Hick believes that, even if the cadaver is in London, we would be justified to claim that the person that appears in New York is the same person that died in London. Hick’s implication is that body continuity is not needed for personal identity; only psychologically continuity is necessary.

And, Hick considers that, in the same manner, if a person dies, and someone in the resurrection world appears with the same character traits, memories, and so forth, then we should conclude that such a person in the resurrected world is identical to the person who previously died. Hick admits the resurrected body would be a replica, but as long as the resurrected is psychologically continuous with the original person, then it is identical to the original person.

Yet, in as much as Hick’s model depends upon a psychological criterion for personal identity, it runs into the same problems that we have reviewed when considering the psychological criterion. It seems doubtful that a replica would be identical to the original person, because more than one replica could be recreated. And, if there is more than one replica, then they would all claim to be the original person, but obviously, they cannot all be the original person. Hick postulates that we can trust that God would only recreate exactly one replica, but it is not clear how that would solve the problem. For, the mere possibility that God could make more than one replica is enough to conclude that a replica would not be the original person.

Peter Van Inwagen has offered a somewhat extravagant solution to these problems: “Perhaps at the moment of each man’s death, God removes his corpse and replaces it with a simulacrum which is what is burned or rots. Or perhaps God is not quite so wholesale as this: perhaps He removes for ‘safekeeping’ only the ‘core person’ – the brain and central nervous system – or even some special part of it” (Van Inwagen, 1997: 246). This would seem to solve the problem of spatio-temporal continuity. The body would never cease to exist, it would only be stored somewhere else until the moment of resurrection, and therefore, it would conserve spatio-temporal continuity. However, such an alternative seems to presuppose a deceitful God (He would make us believe the corpse that rots is the original one, when in fact, it is not), and would thus contradict the divine attribute of benevolence (a good God would not lie), a major tenet of monotheistic religions that defend the doctrine of resurrection.

Some Christian philosophers are aware of all these difficulties, and have sought a more radical solution: there is no criterion for personal identity over time. Such a view is not far from the bundle theory, in the sense that it is difficult to precise how a person remains the same over time. This position is known as ‘anti-criterialism’, that is, there is no intelligible criterion for personal identity; Trenton Merricks (1998) is its foremost proponent. By doing away with criteria for personal identity, anti-criterialists purport to show that objections to resurrection based on difficulties of personal identity have little weight, precisely because we should not be concerned about criteria for personal identity.

8. Parapsychology

The discipline of parapsychology purports to prove that there is scientific evidence for the afterlife; or at least, that there is scientific evidence for the existence of paranormal abilities that would imply that the mind is not a material substance. Originally founded by J.B.S. Rhine in the 1950s, parapsychology has fallen out of favor among contemporary neuroscientists, although some universities still support parapsychology departments.

a. Reincarnation

Parapsychologists usually claim there is a good deal of evidence in favor of the doctrine of reincarnation. Two pieces of alleged evidence are especially meaningful: (1) past-life regressions; (2) cases of children who apparently remember past lives.

Under hypnosis, some patients frequently have regressions and remember events from their childhood. But, some patients have gone even further and, allegedly, have vivid memories of past lives. A few parapsychologists take these as so-called ‘past-life regressions’ as evidence for reincarnation (Sclotterbeck, 2003).

However, past-life regressions may be cases of cryptomnesia, that is, hidden memories. A person may have a memory, and yet not recognize it as such. A well-known case is illustrative: an American woman in the 1950s was hypnotized, and claimed to be Bridey Murphy, an Irishwoman of the 19th century. Under hypnosis, the woman offered a fairly good description of 19th century Ireland, although she had never been in Ireland. However, it was later discovered that, as a child, she had an Irish neighbor. Most likely, she had hidden memories of that neighbor, and under hypnosis, assumed the personality of a 20th century Irish woman.

It must also be kept in mind that hypnosis is a state of high suggestibility. The person that conducts the hypnosis may easily induce false memories on the person hypnotized; hence, alleged memories that come up in hypnosis are not trustworthy at all.

Some children have claimed to remember past lives. Parapsychologist Ian Stevenson collected more than a thousand of such cases (Stevenson, 2001). And, in a good portion of those cases, children know things about the deceased person that, allegedly, they could not have known otherwise.

However, Stevenson’s work has been severely critiqued for its methodological flaws. In most cases, the child’s family had already made contact with the deceased’s family before Stevenson’s arrival; thus, the child could pick up information and give the impression that he knows more than what he could have known. Paul Edwards has also accused Stevenson of asking leading questions towards his own preconceptions (Edwards, 1997: 14).

Moreover, reincarnation runs into conceptual problems of its own. If you do not remember past lives, then it seems that you cannot legitimately claim that you are the same person whose life you do not remember. However, a few philosophers claim this is not a good objection at all, as you do not remember being a very young child, and yet can still surely claim to be the same person as that child (Ducasse, 1997: 199).

Population growth also seems to be a problem for reincarnation: according to defenders of reincarnation, souls migrate from one body to another. This, in a sense,  presupposes that the number of souls remains stable, as no new souls are created, they only migrate from body to body. Yet, the number of bodies has consistently increased ever since the dawn of mankind. Where, one may ask, were all souls before new bodies came to exist? (Edwards, 1997: 14). Actually, this objection is not so formidable: perhaps souls exist in a disembodied form as they wait for new bodies to come up (D’Souza, 2009: 57).

b. Mediums and Ghostly Apparitions

During the heyday of Spiritualism (the religious movement that sought to make contact with the dead), some mediums gained prominence for their reputed abilities to contact the dead. These mediums were of two kinds: physical mediums invoked spirits that, allegedly, produced physical phenomena (for example, lifting tables); and mental mediums whose bodies, allegedly, were temporarily possessed by the spirits.

Most physical mediums were exposed as frauds by trained magicians. Mental mediums, however, presented more of a challenge for skeptics. During their alleged possession by a deceased person’s spirit, mediums would provide information about the deceased person that, apparently, could not have possibly known. William James was impressed by one such medium, Leonora Piper, and although he remained somewhat skeptical, he finally endorsed the view that Piper in fact made contact with the dead.

Some parapsychologists credit the legitimacy of mental mediumship (Almeder, 1992). However, most scholars believe that mental mediums work through the technique of ‘cold reading’: they ask friends and relatives of a deceased person questions at a fast pace, and infer from their body language and other indicators, information about the deceased person (Gardner, 2003).

Parapsychologists have also gathered testimonies of alleged ghost appearances, especially cases where the spirit communicates something that no person could have known (for example, the location of a hidden treasure), and yet it is corroborated. This evidence seems far too anecdotal to be taken seriously; it does not go through the rigorous control that such a claim would require.

c. Electronic-Voice Phenomena

Some parapsychologists have tried to record white noise generated by vacant radio stations, and in places where it is known that no person is present (Raudive, 1991). Allegedly, some of those recordings have produced noises similar to human voices with strange messages, and these voices are believed to come from ghosts. Skeptics claim that such messages are too vague to be taken seriously, and that the tendency of the human mind to find purpose everywhere, promotes the interpretation of simple noises as human voices.

d. Near-Death Experiences

Ever since ancient times (for example, Plato’s myth of Er in The Republic), there have been reports of people who have lost some vital signs, and yet, regained them after a brief period of time. Some people have claimed to have unique experiences in those moments: an acute noise, a peaceful and relaxed sensation; the feeling of abandoning the body, floating in the air and watching the body from above; a passage thorough a dark tunnel; a bright light at the end of the tunnel; an encounter with friends, relatives, and religious characters; a review of life’s most important moments. These are described as near death experiences (Moody, 2001).

Some parapsychologists claim near death experiences are evidence of life after death, and some sort of window revealing the nature of the afterlife. Skeptical scientists, however, have offered plausible physiological explanations for such experiences. Carl Sagan considered the possibility that near death experiences evoke memories from the moment of birth: the transit through the tunnel would evoke the birth canal, and the sensation of floating in the air would evoke the sensation of floating in amniotic acid during gestation (Sagan, 1980).

There are still other physiological explanations. These experiences can be induced by stimulating certain regions of the brain. In moments of intense crises, the brain releases endorphins, and this may account for the peaceful and relaxed sensation. The experience of going through a tunnel may be due to anoxia (lack of oxygen), or the application of anesthetics containing quetamine. The review of life’s most important moments may be due to the stimulation of neurons in the temporal lobules. Encounters with religious characters may be hallucinations as a result of anoxia (Blackmore, 2002) Some patients that have undergone near death experiences have allegedly provided verifiable information that they had no way to know. Some parapsychologists take this as evidence that patients float through the air during near death experiences and, during the ordeal, they are capable to travel to other locations. This evidence is, however, anecdotal. And there is contrary evidence: Researchers have placed computer laptops with random images in the roof of emergency rooms, so that only someone watching from above could know the content of the images, but, so far, no patient has ever accurately described such images (Roach, 2005).

e. Extrasensory Perception

Parapsychologists have designed some experiments that purport to prove that some people have the ability of extrasensory perception or ESP (Radin, 1997). If this ability does indeed exist, it would not prove immortality, but it would seem to prove dualism; that is, the mind is not reducible to the brain.

The best formulated experiment is the so-called Ganzfeld experiment. Person A relaxes in a cabin, her eyes are covered with ping-pong ball halves, and listens to white noise for fifteen minutes. This is intended to promote sense deprivation. In another cabin, person B is shown a target image. Afterwards, subject A is shown the target image, along with three other images. We should expect a 25% chance probability that subject A will choose the target image, but when experiments are performed, 32% of the time, subject A is successful. Parapsychologists claim this is evidence that something strange is going on (as it defies the expectancy of chance), and their explanation is that some people have the ability of extra sensory perception.

However, this experiment is not without critics. There may be sensory leakage (perhaps the cabins are not sufficiently isolated from each other). The experiment’s protocols have not adequately presented the images in random sequences. And, even if, indeed, the results come out 32% accurate when only 25% is expected by chance, it should not be assumed that a paranormal phenomenon is going on; at most, further research would be required to satisfactorily reach a conclusion.

9. The Technological Prospect of Immortality

Most secular scientists have little patience for parapsychology or religiously-inspired immortality. However, the exponential growth of technological innovation in our era has allowed the possibility to consider that, in a not-too-distant future, bodily immortality may become a reality. A few of these proposed technologies raise philosophical issues.

a. Cryonics

Cryonics is the preservation of corpses in low temperatures. Although it is not a technology that purports to bring persons back to life, it does purport to conserve them until some future technology might be capable of resuscitating dead bodies. If, indeed, such technology were ever developed, we would need to revise the physiological criterion for death. For, if brain death is a physiological point of no return, then bodies that are currently cryogenically preserved and will be brought back to life, were not truly dead after all.

b. Strategies for Engineered Negligible Senescence

Most scientists are skeptical of the prospect of resuscitating already dead people, but some are more enthusiastic about the prospect of indefinitely procrastinating death by stopping the aging processes. Scientist Aubrey De Grey has proposed some strategies for engineered negligible senescence: their goal is to identify the mechanisms accountable for aging, and attempt to stop, or even, reverse them (by, say cell repair) (De Grey and Rae, 2008). Some of these strategies involve genetic manipulation and nanotechnology, and hence they bring forth ethical issues. These strategies also bring concern about the ethics of immortality, that is, is immortality even desirable? (See section 3 of this article).

c. Mind Uploading

Yet other futurists consider that, even if it were not possible to indefinitely suspend the body’s death, it would at least be possible to emulate the brain with artificial intelligence (Kurzweil, 1993; Moravec, 2003). Thus, some scientists have considered the prospect of ‘mind-uploading’, that is, the transfer of the mind’s information to a machine. Hence, even if the organic brain dies, the mind could continue to exist once it is uploaded in a silicon-based machine.

Two crucial philosophical issues are raised by this prospect. First, the field of philosophy of artificial intelligence raises the question: could a machine ever really be conscious? Philosophers who adhere to a functionalist understanding of the mind would agree; but other philosophers would not (Consider Searle’s Chinese Room Argument in Searle, 1998).

Even if we were to claim that a machine could in fact be conscious, the technological prospect of mind uploading raises a second philosophical issue: would an emulation of the brain preserve personal identity? If we adhere to a soul or body criterion of personal identity, we should answer negatively. If we adhere to a psychological criterion of personal identity, then we should answer affirmatively, for the artificial brain would indeed be psychologically continuous with the original person.

10. References and Further Reading

  • Almeder, Robert. Death & Personal Survival. Rowan & Littlefield. 1992.
  • Armstrong, D. M. A Materialist Theory of the Mind. Routledge & Kegan Paul. 1968.
  • Baggini, Julian and Fosl, Peter. The Philosopher’s Toolkit. John Wiley and Sons. 2009.
  • Barnes, Jonathan. The Presocratic Philosophers: Thales to Zeno. Routledge. 1979.
  • Beloff, John. Parapsychology: A Concise History. Palgrave. 1997.
  • Bernstein, Morey. The Search for Bridey Murphy. Doubleday. 1956.
  • Blackmore, Susan. “Near-Death Experiencies”. in Shermer, Michael (Ed.). Skeptic Encyclopedia of Pseudoscience. ABC Clio. 2002, p. 150-155.
  • Blackmore, Susan. Consciousness: A Very Short Introduction. Oxford University Press. 2005.
  • Blum, Deborah. Ghost Hunters: William James and the Search for Scientific Proof of Life After Death. Penguin. 2007.
  • Broad, C.D. “On Survival Without a Body” in Edwards, Paul (Ed.) Immortality. Prometheus. 1997, pp. 276-278.
  • Carter, Matt. Minds and Computers: An Introduction to the Philosophy of Artificial Intelligence. Edinburgh University Press. 2007.
  • Chalmers, David. The Conscious Mind: In Search of a Fundamental Theory. Oxford University Press. 1996.
  • Chisholm, Roderick. Person and Object. Routledge. 2002.
  • Churchland, Paul. Neurophilosophy at Work. Cambridge University Press. 2007.
  • Craig. William. Reasonable Faith: Christian Truth and Apologetics. Crossway Books. 2008.
  • Cranston, S.L. and Williams, Carey. Reincarnation: A New Horizon in Science, Religion and Society. Julian Press, 1984.
  • Damasio, Antonio. Descartes’ Error: Emotion, Reason and The Human Brain. Vintage Books. 2006.
  • Damer, T. Edward. Attacking Faulty Reasoning. Cengage Learning. 2008.
  • De Grey, Aubrey and Rae, Michael. Ending Aging: The Rejuvenation Breakthroughs That Could Reverse Human Aging in Our Lifetime. St. Martin’s Griffin. 2008
  • Dehaene, Stanislas. The Cognitive Neuroscience of Consciousness. MIT Press. 2002.
  • Dennett, Daniel. Conciousness Explained. Back Bay Books. 1992.
  • Descartes, René. Discourse on Method and Meditations on First Philosophy, Donald A. Cress trans. Hackett Publishing Co 1980.
  • D’Souza, Dinseh. Life After Death. Regnery Publishing. 2009
  • Ducasse, C.J. “Survival as Transmigration” in Edwards, Paul (Ed.). Immortality. Prometheus. 1997, pp. 194-199.
  • Edwards, Paul. “Introduction” in Edwards, Paul (Ed.) Immortality. Prometheus. 1997, pp.1-70
  • Feser, Edward. Philosophy of Mind. Oneworld. 2008.
  • Fischer, John Martin. Our Stories: Essays on Life, Death and Free Will. Oxford University Press. 2009.
  • Flew, Antony. Merely Mortal: Can You Survive Your Own Death? Prometheus. 2000.
  • Frohock, Fred. Lives of the Psychics: The Shared Worlds of Science and Mysticism. University of Chicago Press. 2000
  • Gardner, Martin. Are Universes Thicker Than Blackberries? W.W. Norton. 2003.
  • Garrett, Brian. “Personal Identity” in Routledge Encyclopedia of Philosophy. Taylor & Francis. 1998, pp. 320-330.
  • Gazzaniga, Michael. Human. Harper Perennial. 2008.
  • Geach, Peter. “Reincarnation” in Flew, Antony (Ed.). Readings in the Philosophical Problems of Parapsychology. Prometheus. 1987, pp. 320-330.
  • Graham, George. Philosophy of Mind: An Introduction. Wiley Blackwell. 1998.
  • Guiley, Rosemary. The Guinness Encyclopedia of Ghosts and Spirits. Guinness Publishing. 1994.
  • Habermas, Gary and Moreland, J.P. Beyond Death: Exploring the Evidence for Immortality. Wipf & Stock Publishers. 2004.
  • Habermas, Gary. In Defence of Miracles. Inter Varsity Press. 2002.
  • Harman, Gilbert. Thought. Princeton University Press. 1973.
  • Harpur, Tom. Life After Death. McClelland and Stewart. 1991.
  • Hasker, William. The Emergent Self. Cornell University Press. 2001.
  • Heidegger, Martin. Being and Time. Wiley-Blackwell. 1978.
  • Henry, John. “Henry More”. Stanford Encyclopedia of Philosophy. 2007.
  • Hick, John. Death and Eternal Life. Westminster John Knox Press. 1994.
  • Hines, Terence. Pseudoscience and the Paranormal: A Critical Examination of the Evidence. Prometheus. 1988.
  • Hospers, John. “Is the Notion of Disembodied Existence Intelligible?” in Edwards, Paul (Ed.). Immortality. Prometheus. 1997, pp. 279-281.
  • Hume, David. A Treatise of Human Nature. Nabu Press. 2010.
  • Hume, David. An Enquiry Concerning Human Understanding. Nuvision Publications. 2008.
  • Irwin, Harvey. An Introduction to Parapsychology. MacFarland. 2004.
  • Jordan, Jeff. Pascal’s Wager: Pragmatic Arguments and the Existence of God. Oxford University Press. 2006.
  • Kant, Immanuel. Critique of Practical Reason. Forgotten Books. 1999.
  • Kurzweil, Raymond. Age of Spiritual Machines: When Computers Exceed Human Intelligence. Allen & Unwin. 1999.
  • Kurzweil, Raymond. The Singularity is Near: When Humans Transcend Biology. Viking. 2005.
  • Lamont, Corliss. The Illusion of Immortality. Philosophical Library. 1959.
  • Lewis, James. Encyclopedia of Afterlife Beliefs and Phenomena. Visible Ink. 1995.
  • Locke, John. An Essay Concerning Human Understanding. WLC. 2009.
  • Ludemann, Gerd. The Resurrection of Christ: A Historical Enquiry. Prometheus. 2004.
  • Martin, Michael. Atheism: A Philosophical Justification. Temple University Press. 1992.
  • Martin, Raymond and Barresi, John. The Rise and Fall of Soul and Self. Columbia University Press. 2006.
  • Mavrodes, George. “Religion and the Queerness of Morality” in Pojman, Louis (Ed.). Ethical Theory: Classical and Contemporary Readings. 1995.
  • Merricks, Trenton. “There are No Criteria of Identity Over Time.” Noûs 32: 106-124. 1998.
  • Minsky, Marvin. The Emotion Machine: Commonsense Thinking, Artificial Intelligence, and the Future of the Human Mind. Simon & Schuster. 2007.
  • Moody, Raymond. Life After Life. Rider. 2001.
  • Moravec, Hans. Robot: Mere Machine to Transcendent Mind. Oxford University Press. 2003.
  • Noonan, Harold. Personal Identity. Routledge. 2003.
  • Parfit, Derek. Reasons and Persons. Oxford University Press. 1984.
  • Parfit, Derek. “Divided Minds and the Nature of Persons” in Edwards, Paul (Ed.). Immortality. Prometheus. 1997, pp. 308-315.
  • Pascal, Blaise. Pensées. Hackett Publishing. 2005.
  • Perry, John. A Dialogue on Personal Identity and Immortality. Hackett Publishers. 1978.
  • Plato. Phaedo. Forgotten Books. 1959.
  • Putnam, Hilary. Representation and Reality. MIT Press. 1988
  • Radin, Dean. The Conscious Universe: The Scientific Truth of Psychic Phenomena. Harper Edge. 1997.
  • Raudive, Konstantin. Breakthrough: An Amazing Experiment in Electronic Communication with the Dead. Smythe. 1991.
  • Rhine, J.S.B. Extra-Sensory Perception. Forgotten books. 1964.
  • Rist, John. Epicurus: An Introduction. CUP Archive. 1972.
  • Roach, Mary. Spook. W.W. Norton Company. 2005.
  • Rosemberg, John. Thinking Clearly About Death. Hackett Publishing. 1998.
  • Ryle, Gilbert. El concepto de lo mental. Paidos. 2005.
  • Sagan, Carl. Brocca’s Brain. Newsweek Books. 1980.
  • Sclotterbeck, Karl. Living Your Past Lives: The Psychology of Past-Life Regression. Iuniverse. 2003.
  • Searle, John. The Philosophy of Mind. The Teaching Company. 1998.
  • Singer, Peter. How Are We to Live: Ethics in an Age of Self-Interest. Prometheus. 1995.
  • Shoemaker, Sydney. Identity, Cause, and Mind: Philosophical Essays. Oxford University Press. 2003.
  • Smith, G.H. Atheism: The Case Against God. Buffalo. 1999.
  • Stevenson, Ian. Children Who Remember Previous Lives: A Question of Reincarnation. McFarland. 2001
  • Strokes, Douglas. The Nature of Mind: Parapsychology and the Role of Consciousness in the Physical World. Macfarland. 1997.
  • Swinburne, Richard. The Evolution of the Soul. Oxford University Press. 1997.
  • Swinburne, Richard. The Existence of God. Oxford University Press. 2004.
  • Swinburne, Richard. Is There a God? Oxford University Press. 2010.
  • Taliaferro, Charles. Philosophy of Religion. Oneworld. 2009.
  • Tart, Charles, Huston Smith and Kendra Smith. The End of Materialism: How Evidence of the Paranormal Is Bringing Science and Spirit Together. New Harbinger Publications. 2009
  • Turing, Alan. “Computing Machinery and Intelligence” in Dawkins, Richard (Ed.). The Oxford Book of Modern Science Writing. Oxford University Press. 2008, pp. 305-314.
  • Unamuno, Miguel. Del sentimiento trágico de la vida. Ediciones Akal. 1983.
  • Van Inwagen, Peter. “The Possibility of Resurrection” in Edwards, Paul (Ed). Immortality. Prometheus. 1997 pp. 242-246.
  • Voltaire. “The Soul, Identity and Immortality” in Edwards, Paul (Ed). Immortality. Prometheus. 1997 pp. 141-147.
  • Whasker, William. “Afterlife”. Stanford Encyclopedia of Philosophy (2005 Edition). .
  • Wilkes, Kathleen V. Real People: Personal Identity without Thought Experiments. Clarendon Press. 1988.
  • Williams, Bernard. Problems of the Self: Philosophical Papers. 1956-1972. Cambridge University Press. 1976.
  • Wright, N.T. The Resurrection of the Son of God. Fortress Press. 2006.
  • Zimmermann, Dean. “The Compatibility of Materialism and Survival: The ‘Falling Elevator’ Model.” Faith and Philosophy 16; 1999, pp. 194-212.

Author Information

Gabriel Andrade
Email: Gabrielernesto2000@yahoo.com
La Universidad del Zulia
Venezuela

William James (1842—1910)

William James is considered by many to be the most insightful and stimulating of American philosophers, as well as the second of the three great pragmatists (the middle link between Charles Sanders Peirce and John Dewey).  As a professor of psychology and of philosophy at Harvard University, he became the most famous living American psychologist and later the most famous living American philosopher of his time.  Avoiding the logically tight systems typical of European rationalists, such as the German idealists, he cobbled together a psychology rich in philosophical implications and a philosophy enriched by his psychological expertise.  More specifically, his theory of the self and his view of human belief as oriented towards conscious action raised issues that required him to turn to philosophy.  There he developed his pragmatic epistemology, which considers the meaning of ideas and the truth of beliefs not abstractly, but in terms of the practical difference they can make in people’s lives.  He explored the implications of this theory in areas of religious belief, metaphysics, human freedom and moral values, and social philosophy. His contributions in these areas included critiques of long-standing philosophical positions on such issues as freedom vs. determinism, correspondence vs. coherence, and dualism vs. materialism, as well as a thorough analysis of a phenomenological understanding of the self and consciousness, a “forward-looking” conception of truth (based on validation and revisable experience), a thorough-going metaphysical pluralism, and a commitment to a full view of agency in connection with communal and social concerns. Thus he created one of the last great philosophical systems in Western thought, even if he did not live quite long enough to complete every aspect of it. The combination of his provocative ideas and his engaging writing style has contributed to the enduring impact of his work.

Table of Contents

  1. Life and Works
  2. Philosophical Psychology
    1. The Stream of Consciousness and the Self
    2. Sensation, Perception, Imagination, and Belief
    3. Emotion and Will
  3. Epistemology
    1. The Pragmatic Method
    2. The Pragmatic Theory of Truth
    3. The Pragmatic Approach to Belief
  4. Philosophy of Religion
    1. The Will to Believe in God
    2. The Varieties of Religious Experience
    3. James’s Own Religious Views
  5. Metaphysics
    1. Realms of Reality
    2. The Philosophical Importance of Metaphysics
    3. Monism vs. Pluralism
  6. Freedom and Morality
    1. Human Freedom
    2. Moral Responsibility
    3. The Meaningfulness of Life
  7. Social Philosophy
    1. Individuals and Their Communities
    2. War and Peace
    3. Democratic Tolerance and Social Progress
  8. References and Further Readings
    1. Primary Sources
    2. Secondary Sources

1. Life and Works

Born in New York City on January 11, 1842, William James was the oldest of the five children of Henry James, Sr., and Mary Walsh James.  His oldest brother, Henry James, Jr., the renowned writer of fiction, was followed by two other brothers and a sister.  The family frequently moved between America and Europe, the father having inherited an amount of money sufficient to allow him to enjoy the life of an intellectual.  While growing up, William had a passion for drawing.  Since he wanted to become a painter, the family moved to Newport, Rhode Island in 1860, where William studied with the leading American portraitist, William Morris Hunt.  Although he had talent, he gave up this career goal in less than a year.  He had decided that it was insufficient for him to do first-rate work.  All this is indicative of three things:  the family’s remarkable support for his aspirations; his own quest to achieve excellence; and his restless, indecisive difficulty in remaining committed to a career path.

In 1861, the American Civil War erupted.  In response to President Lincoln’s call for volunteers, James committed himself to a short-term enlistment.  However, already in delicate health, he left when it expired after three months.  (His younger brothers Wilky and Bob served in the Union Army.)  He then enrolled in the Lawrence Scientific School at Harvard University, his family moving to Boston.  There he studied chemistry and then physiology, prior to his entering Harvard’s Medical School in 1863.  A couple of years later, he took a year off to join a scientific expedition to Brazil, led by Louis Agassiz.  But bad health eventually forced him to quit the expedition, and he returned to medical school (the James family moving from Boston to Cambridge, Massachusetts).   Again he left, this time to study physiology and medicine in Germany and to recover his health.  He failed to find a cure for his curious back pains, but returned to Harvard, passed his medical exams, and received his medical degree in 1869.  Nevertheless, he did not plan to practice medicine and seemed lost as to what to do with the rest of his life.

By the end of that same year, James’s neurological symptoms had become worse.  His training in hard science was making it impossible for him to believe in human freedom and, thus, in the value of struggling for moral ideals; the despair of materialism was leading him to the depression of determinism.  In a barely disguised case history in his Varieties of Religious Experience, he tells of visiting an asylum while he was a medical student, and seeing an epileptic patient whose condition had reduced him to an idiotic state.  James could not dispel the realization that if universal determinism prevails, he could likewise sink into such a state, utterly incapable of preventing it (Varieties, pp. 135-136).  His dread over the sense of life’s absolute insecurity pushed him to become a virtual invalid in his parents’ home.  He considered suicide.  By the spring of 1870, when James was twenty-eight years old, he experienced a critical moment while reading a treatment of human freedom by the French neo-Kantian Charles Renouvier.  He discovered the solution to his problem in the voluntaristic act of will whereby he could commit himself to believing in his own freedom despite any lack of objective evidence.  He started down the road to recovery, though the remainder of his life would be plagued by seemingly psychosomatic troubles (serious eye strain, mysterious back pains, digestive problems, and periods of exhaustion, as well as chronic mood swings, including times of brooding depression).  Unfortunately, he still lacked a constructive career goal.

In 1872, one of James’s former chemistry professors, now Harvard’s President, offered him a job teaching physiology.  He accepted and began his career of more than a third of a century as a faculty member there.  The next year, he became an instructor of anatomy and physiology.  By the mid-eighteen-seventies, he was teaching psychology there, using the physiological approach he had learned in Germany and establishing the first psychology laboratory in America.  He met a schoolteacher named Alice Howe Gibbens, whom he married in 1878.  Like his parents, they had five children, naming the first two Henry and William.  Alice was adept at handling his neurotic obsessions and emotional moodiness, and they seem to have had a good marriage, living comfortably in Cambridge.  The year they married, James agreed to write a psychology textbook; however, by then he was already drifting away from psychology into philosophy.  He was a member of a Metaphysical Club that included Oliver Wendell Holmes, who taught law at Harvard and would go on to serve on the U. S. Supreme Court, and Charles Sanders Peirce, a philosopher of science, who would become the founder of American pragmatism.  In 1879, James began teaching philosophy at Harvard, becoming an assistant professor of philosophy the next year.  He published “The Sentiment of Rationality,” his first important article in his new discipline.  As he got deeper into philosophy, he developed a negative attitude towards psychology.  After becoming a full professor of philosophy in 1885 and of psychology in 1889, he published his Principles of Psychology in 1890.  It had taken him close to twelve years to finish it, and, though it would be extremely successful, he was dissatisfied with it and disgusted with psychology (Letters, vol. 1, pp. 294, 296, & vol. 2, pp. 2-3).  Nevertheless, he agreed to prepare an abridged version, which was published two years later as Psychology:  Briefer Course; it too would be widely used and help to establish his reputation as the foremost living American psychologist.  He resigned his directorship of Harvard’s psychology lab and committed himself to teaching and writing philosophy.

In 1897, James’s first philosophical book, The Will to Believe and Other Essays in Popular Philosophy, was published, dedicated to Charles Sanders Peirce.  The following year, at the University of California at Berkeley, he delivered a lecture, “Philosophical Conceptions and Practical Results,” which helped to launch pragmatism as a nationwide philosophical movement.  In 1899, his Talks to Teachers on Psychology and to Students on Some of Life’s Ideals was published.  Overworked at Harvard and jeopardizing his fragile health, he suffered a physical breakdown that same year.  While recovering his health, he studied a wide range of accounts of religious experience and prepared his Gifford Lectures, which he delivered at the University of Edinburgh in 1901-02.  These were published as The Varieties of Religious Experience in 1902 and proved to be quite successful, although James himself was displeased, believing them to contain too much reporting on facts and too little philosophical analysis.

For the remainder of his life, James focused on the development of his own philosophy, writing essays and lectures that would later be collected and published in four books.  In the spring of 1906, he took a leave of absence from Harvard to take a visiting professorship at Stanford University, though his lecture series in California was interrupted by the great San Francisco earthquake.  In late 1906 and early 1907, he delivered his lectures on Pragmatism in Boston and at Columbia University, publishing them in the spring of 1907.  That was also the year he resigned from Harvard, worried that he might die before being able to complete his philosophical system, as he was suffering from angina and shortness of breath.  He delivered the Hibbert Lectures in England in 1908, published the next year as A Pluralistic Universe, aimed at combating the neo-Hegelian idealism that was then prevalent in Great Britain.  Meanwhile, he was under intellectual assault by mainstream philosophers for his pragmatic treatment of truth, which he defended in a collection of essays published in 1909 as The Meaning of Truth.

By the next year, James’s heart trouble left him so plagued by fatigue that normal activities became quite difficult.  He was attempting to complete his textbook on Some Problems of Philosophy, but died on August 26, 1910.  In 1911, his textbook, edited by his son Henry, and his Memories and Studies were posthumously published.  In 1912, his Essays in Radical Empiricism was published, followed, in 1920, by some of his Collected Essays and Reviews and The Letters of William James, edited in two volumes by his son Henry.  His writings have survived in part because of the provocative honesty of his ideas, but also because of the vibrant, sometimes racy, style in which he expressed them.  In A Pluralistic Universe, he castigates philosophers who use technical jargon instead of clear, straightforward language.  He practiced the spontaneous thinking and freshness of expression he advocates there (Universe, pp. 129-130).  It has been said (by the novelist Rebecca West) that, while Henry James wrote fiction as though it were philosophy, his older brother, William, wrote philosophy in a colorful style typical of fiction.

2. Philosophical Psychology

By the early 1890s, when James published his two books on psychology, the discipline was in the process of splitting off from philosophical speculation (“psychology” literally means  “the study of the soul”) to establish itself as an empirical social science.  Despite impatience with the process of that development, he contributed significantly to moving it along, regarding psychology as the science of our mental phenomena or states of consciousness, such as thoughts, feelings, desires, volitions, and so forth.

a. The Stream of Consciousness and the Self

In analyzing what can broadly be termed human thinking, James delineates five generic characteristics:  (1) all thought is owned by some personal self; (2) all thought, as experienced by human consciousness, is constantly in flux and never static; (3) nevertheless, there is an ongoing continuity of thought for every thinker, as it moves from one object to another (like the alternating times of flight and perching in a bird’s life), constantly comprising shifting foci and the contextual fringes within which they are given; (4) thought typically deals with objects different from and independent of consciousness itself, so that two minds can experience common objects; and (5) consciousness takes an interest in particular objects, choosing to focus on them rather than on others (Principles, vol. 1, pp. 224-226, 236-237, 239, 243, 258-259, 271-272, 284; Psychology, pp. 152-154, 157-160, 166-167, 170).  The self can be viewed as an object of thought or as the subject of thought.  The former is the empirical self or “me,” while the latter is the pure ego or “I.”  The dimensions of the empirical self (“me”) include the “material” self (comprised of one’s body and such extensions of it as one’s clothing, immediate family, and home), the “social” self (or significant interpersonal relations), and the “spiritual” self (one’s personality, character, and defining values).  The pure ego (“I”), identifiable with the soul of traditional metaphysics, cannot be an object of science and should not be assumed to be a substance (Principles, vol. 1, pp. 291-294, 296, 319, 343-344, 348, 350; Psychology, pp. 176-181, 194, 196, 198, 200, 202-203, 215-216).

b. Sensation, Perception, Imagination, and Belief

James states that if we track the dynamic of mental activity, we discern a standard pattern from sensation to perception to imagination to belief.  Through sensation, we become acquainted with some given fact.  This can, but need not, lead to knowledge about that fact, achieved by perceiving its relations to other given facts.  Both sensation and perception involve an immediate intuition of some given objects.  Imagination, less immediate, retrieves mental copies of past sensations and perceptions, even when their external stimuli are no longer present.  Belief is the sense or feeling that ideas or propositions formed in the imagination correspond to reality.  Every proposition can be analyzed in terms of its object and whether that object is believed.  The object of a proposition comprises a subject (such as my horse), a predicate (wings), and a relation between them (my horse has sprouted wings).  The belief is the psychic attitude a mind has towards that object (for example, I believe it or deny it or am in doubt about it) (Principles, vol. 2, pp. 1-3, 44, 76-77, 82-83, 283-284, 287-290; Psychology, pp. 12-14, 302, 312, 316-317).

c. Emotion and Will

Like other animals, we have primitive instincts, such as fear, some desires, and certain forms of sympathy, which do not require being taught them or consciously focusing on ends.  However, we also have emotions that are learned behavior and do involve such a focus—for example, a fear of failure and the desire for an academic degree.  Instincts and emotions thus overlap, the latter tending to cover a broader range of objects than the former.  We tend to assume that perceptions trigger emotional responses, eventuating in bodily expressions—that we suddenly see a bear, become frightened, and then tremble and run away.  But James thinks the actual sequence is perception, followed by bodily expressions, followed by emotional feeling—that we see the bear, tremble and run away, then feel those physical events as what we call fear.  The idea that emotions ultimately have physical causes emphasizes the intimate relationship between our bodies and our mental life (Principles, vol. 2, pp. 383, 410, 442, 449-453, 467; Psychology, pp. 391, 375-376, 378-381).

The human will is crucial for deliberately acting on our beliefs and emotions.  Sometimes we consider alternative courses of action and seem to select one among them, as if making a voluntary decision.  James maps out five sorts of decision-making:  (1) the reasonable sort, whereby we accede to rational arguments; (2) the sort that is triggered by external circumstances, such as overhearing a rumor; (3) the sort that is prompted by our submission to something within ourselves, such as a habit formed by past actions; (4) the sort that results from a sudden change of mood such as might be caused by a feeling of grief; and (5) the rare sort that is a consequence of our own voluntary choice, which will be identified as the “will to believe.”  Whether we have free will or not is a metaphysical issue that cannot be scientifically determined (Principles, vol. 2, pp. 486-488, 528, 531-534, 572-573; Psychology, pp. 415, 419-420, 428-434, 456-457).

3. Epistemology

Even if philosophically interesting matters such as freedom vs. determinism cannot be scientifically resolved, some sort of epistemological methodology is needed if we are to avoid arbitrary conclusions.  Whatever approach is chosen, it is clear that James repudiates rationalism, with its notions of a priori existential truths.  He is particularly hostile to German idealism, which he identifies especially with Hegel and which he attacks in many of his essays (this identification leads him to be remarkably unfair to Kant, an earlier German idealist).  As he makes clear in “The Sentiment of Rationality,” the personality of the would-be knower and various practical concerns are far too relevant to allow for such abstract intellectualism.  The tradition of modern empiricism is more promising, yet too atomistic to allow us to move much beyond the knowledge of acquaintance to genuine comprehension (Will, pp. 63-67, 70, 75-77, 82-86, 89, 92).  Fortunately, James had already learned about the pragmatic approach from Peirce.

a. The Pragmatic Method

James’s book of lectures on Pragmatism is arguably the most influential book of American philosophy.  The first of its eight lectures presents pragmatism as a more attractive middle ground between the two mainstream approaches of European philosophy.  The “tender-minded” approach tends to be rationalistic, intellectualistic, idealistic, optimistic, religious, committed to freedom, monistic, and dogmatic; by contrast, the “tough-minded” approach tends to be empirical, grounded in sensations, materialistic, pessimistic, irreligious, fatalistic, pluralistic, and skeptical.  It is difficult to identify many pure types of either of these in the history of philosophy, and some thinkers (such as Kant) are deliberately mixed, as is James himself.  He thinks that most of us want a philosophical method that is firmly anchored in empirical facts, while being open to, rather than dismissive of, moral and religious values.  He offers pragmatism as a philosophy that coherently meets both demands.  James’s second lecture is committed to showing how the pragmatic method helps us establish meaning by making it a function of practical consequences (the word “pragmatic” means having to do with action and is etymologically related to our English word “practical”).  Before we invest much time or effort in seeking the meaning of anything, we should consider what practical difference it would make if we could find out.  Providing an example to illustrate his point, James refers to the Hegelian notion of God as the all-encompassing Absolute Spirit.  How should we decide whether this is what we should mean by God?  Consider the practical consequences for a believer:  on the one hand, it would provide us with the optimistic, comforting assurance that everything will work out for the best; but, on the other, it also undermines the values of human individuality, freedom, and responsibility.  From that pragmatic perspective, James rejects the Hegelian notion.  Undoubtedly, philosophy provides us with only one legitimate approach to belief, as he observes in his fifth lecture, others being common sense (with its basic concepts derived from experience) and science.  However, these others are impotent in dealing with questions of freedom and value (Pragmatism, pp. 10-13, 18, 26-28, 30-38, 79-80, 83-85).

b. The Pragmatic Theory of Truth

It seems that anything knowable must be true.  But what does it mean to call a proposition or belief “true” from the perspective of pragmatism?  This is the subject of James’s famous sixth lecture.  He begins with a standard dictionary analysis of truth as agreement with reality.  Accepting this, he warns that pragmatists and intellectualists will disagree over how to interpret the concepts of “agreement” and “reality,” the latter thinking that ideas copy what is fixed and independent of us.  By contrast, he advocates a more dynamic and practical interpretation, a true idea or belief being one we can incorporate into our ways of thinking in such a way that it can be experientially validated.  For James, the “reality” with which truths must agree has three dimensions:  (1) matters of fact, (2) relations of ideas (such as the eternal truths of mathematics), and (3) the entire set of other truths to which we are committed.  To say that our truths must “agree” with such realities pragmatically means that they must lead us to useful consequences.  He is a fallibilist, seeing all existential truths as, in theory, revisable given new experience.  They involve a relationship between facts and our ideas or beliefs.  Because the facts, and our experience of them, change we must beware of regarding such truths as absolute, as rationalists tend to do (Pragmatism, pp. 91-97, 100-101).  This relativistic theory generated a firestorm of criticism among mainstream philosophers to which he responded in The Meaning of Truth.

c. The Pragmatic Approach to Belief

Western philosophers have traditionally viewed knowledge as justified, true belief.  So long as the idea of truth is pragmatically analyzed and given a pragmatic interpretation of justification, James seems to accept that view.  His entire philosophy can be seen as fundamentally one of productive beliefs.  All inquiry must terminate in belief or disbelief or doubt; disbelief is merely a negative belief and doubt is the true opposite of both.  Believing in anything involves conceiving of it as somehow real; when we dismiss something as unreal (disbelief), it is typically because it somehow contradicts what we think of as real.  Some of our most fundamental and valuable beliefs do not seem sufficiently justified to be regarded as known.  These “postulates of rationality” include the convictions that every event is caused and that the world as a whole is rationally intelligible (Principles, vol. 2, pp. 283-284, 288-290, 670-672, 675, 677).  As he holds in “The Sentiment of Rationality,” to say that such beliefs, however crucial, are not known, is to admit that, though they involve a willingness to act on them, doubt as to their truth still seems theoretically possible.  He identifies four postulates of rationality as value-related, but unknowable, matters of belief; these are God, immortality, freedom, and moral duty (Will, pp. 90, 95).  He proceeds to deal with each of them individually.

4. Philosophy of Religion

James is arguably the most significant American philosopher of religion in intellectual history, and many of his writings, in addition to the obligatory “Will to Believe” essay and his book on The Varieties of Religious Experience, offer provocative insights into that area.

a. The Will to Believe in God

Because we do not naturally experience the supernatural, James, the radical empiricist, thinks of faith in God as falling short of knowledge.  Yet such faith is pragmatically meaningful to many people, and it is reasonable to wonder whether, how, and to what extent it can be justified.  For James, the logical philosopher trained in science, both logic and science have limits beyond which we can legitimately seek the sentiment of rationality.  His notorious “Will to Believe” essay is designed to be a defense of religious faith in the absence of conclusive logical argumentation or scientific evidence.  It focuses on what he calls a “genuine option,” which is a choice between two hypotheses, which the believer can regard as “living” (personally meaningful), “forced” (mutually exclusive), and “momentous” (involving potentially important consequences).  Whether an option is “genuine” is thus relative to the perspective of a particular believer.  James acknowledges that in our scientific age, there is something dubious about the voluntaristic view that, in some circumstances, we can legitimately choose to believe in the absence of any objective justification.  However, he claims we naturally do so all the time, our moral and political ideas being obvious examples.  When you believe that your mother loves you or in the sincerity of your best friend, you have no conclusively objective evidence.  In addition, you will never be able to secure such evidence.  Yet it often seems unreasonable to refuse to commit to believing such matters; if we did so, the pragmatic consequences would be a more impoverished social life.  Indeed, in some cases, believing and acting on that belief can help increase the chances of the belief being true.  Now let us apply this argument to religious belief.  What does religion in general propose for our belief?  The two-pronged answer is that ultimate reality is most valuable and that we are better off if we believe that.  Committing to that two-pronged belief is meaningful, as is the refusal to do so.  At any given moment, I must either make that two-pronged commitment or not; and how I experience this life, as well as prospects for a possible after-life, may be at stake.  Whether one makes that commitment or not, pragmatic consequences can be involved.  Nor should we imagine that we could avoid having to make a choice, as the commitment not to commit is itself a commitment (Will, pp. 1-4, 7-9, 11-14, 22-30; see also Problems, pp. 221-224).

b. The Varieties of Religious Experience

If religious faith is not to be reduced to arbitrary whimsy (the “will to make believe”), it must rest on some sort of personal experience.  As psychologist and philosopher, James deliberately defines “religion” broadly as the experiences of human individuals insofar as they see themselves related to whatever they regard as divine.  This definition indicates that religion does not require faith in a transcendent, monotheistic God, and that it does not mandate the social dimension of religious community.  James distinguishes between “healthy-mindedness” and the “sick soul” as two extreme types of religious consciousness, the former being characterized by optimistic joy and the latter by a morbid pessimism.  In between these extremes are “the divided self” and the stable, well-integrated believer.  James develops lengthy analyses of religious conversion, saintliness, and mysticism.  In going beyond these, he considers what philosophy might contribute to establishing “over-beliefs” regarding the existence and nature of the divine.  He critically considers traditional arguments for God—the cosmological argument, the argument from design, the moral argument, and the argument from popular consensus—finding none of them particularly cogent, but exhibiting the most respect for the argument from design.  He likewise weighs in the balance and finds wanting arguments for metaphysical and moral divine attributes, finding the latter of more pragmatic relevance to human values, choices, and behavior than the former.  In his final lecture, he draws conclusions regarding three beliefs that experience finds in religions in general:  (1) that our sensible world is part of and derives its significance from a greater spiritual order; (2) that our purpose is fulfilled by achieving harmonious union with it; and (3) that prayer and spiritual communion are efficacious.  Furthermore, religions typically involve two psychological qualities in their believers:  (1) an energetic zest for living; and (2) a sense of security, love, and peace.  Given that thought and feeling both determine conduct, James thinks that different religions are similar in feeling and conduct, their doctrines being more variable, but less essential.  Most generally, these doctrines attempt to diagnose a fundamental uneasiness about our natural state and to prescribe a solution whereby we might be saved (Varieties, pp. 42, 83, 121, 124, 137, 142, 145-147, 330, 334-341, 367, 380-383).

c. James’s Own Religious Views

Although James is somewhat vague regarding his own religious “over-beliefs,” they can be pieced together from various passages.  He believes there is more to reality than our natural world and that this unseen realm generates practical effects in this world.  If we call the supreme being “God,” then we have reason to think the interpersonal relationship between God and humans is dynamic and that God provides us with a guarantee that the moral values we strive to realize will somehow survive us.  James describes himself as a supernaturalist (rather than a materialist) of a sort less refined than idealists and as unable to subscribe to popular Christianity.  He is unwilling to assume that God is one or infinite, even contemplating the polytheistic notion that the divine is a collection of godlike selves (Varieties, pp. 384-386, 388-390, 392-393, 395-396).  In “The Dilemma of Determinism,” James depicts his image of God with a memorable analogy, comparing God to a master chess player engaged in a give-and-take with us novices.  We are free to make our own moves; yet the master knows all the moves we could possibly make, the odds of our choosing one over the others, and how best to respond to any move we choose to make.  This indicates two departures from the traditional Judeo-Christian concept of God, in that the master is interacting with us in time (rather than eternal) and does not know everything in the future, to the extent that it is freely chosen by us.  In “Reflex Action and Theism,” James subscribes to a theistic belief in a personal God with whom we can maintain interpersonal relations, who possesses the deepest power in reality (not necessarily omnipotent) and a mind (not omniscient).  We can love and respect God to the extent that we are committed to the pursuit of common values.  In “Is Life Worth Living?” James even suggests that God may derive strength and energy from our collaboration (Will, pp. 181-182, 116, 122, 141, 61).  Elsewhere, rejecting the Hegelian notion of God as an all-encompassing Absolute, he subscribes to a God that is finite in knowledge or in power or in both, one that acts in time and has a history and an environment, like us (Universe, pp. 269, 272; see Letters, vol. 2, pp. 213-215, for James’s responses to a 1904 questionnaire regarding his personal religious beliefs).

5. Metaphysics

a. Realms of Reality

In contrast to monists such as Hegel, James believes in multiple worlds, specifying seven realms of reality we can experience:  (1) the realm that serves as the touchstone of reality for most of us is the world of physical objects of sense experience; (2) the world of science, things understood in terms of physical forces and laws of nature, is available to the educated; (3) philosophy and mathematics expose us to a world of abstract truth and ideal relations; (4) as humans, we are all subject to the distortions of commonplace illusion and prejudices; (5) our cultures expose us to the realms of mythology and fiction; (6) each of us has his or her own subjective opinions, which may or may not be expressed to others; and (7) the world of madness can disconnect us from the reality in which others can readily believe.  Normally we can inhabit more than one of these and be able to discriminate among them.  What we take to be real must connect with us personally because we find it interesting and/or important, which emphasizes elements of both subjectivity and pragmatic relevance (Principles, vol. 2, pp. 292-299).

b. The Philosophical Importance of Metaphysics

Part of what makes James a great philosopher in the grand tradition is that, unlike so many post-Hegelian Western philosophers, he advocates the pivotal importance of metaphysics.  The theory of reality in general provides a crucial foundational context for philosophy of human nature, philosophy of religion, ethics, social philosophy, and so forth.  Philosophy essentially is an intellectual attempt to come to grips with reality, as he says on the first page of Pragmatism.  In its third lecture, James approaches four standard metaphysical issues using his pragmatic method, those of (1) physical and spiritual substance, (2) materialism vs. theism as explanations of our world, (3) whether the natural world indicates intelligent design, and (4) freedom vs. universal determinism.  For each of these, we cannot conclusively establish where we should stand based merely on what experience discloses about the past, but can take reasonable positions based on pragmatic anticipated future consequences.  As modern philosophy demonstrates, we can never directly and immediately experience any sort of substance; however, we do experience physical qualities and mental events and can best make sense of them by attributing them to bodies and minds.  The world is what it is, regardless of whether it is the result of divine activity or of the random interactions of atoms moving in space; whether or not it was intelligently designed in the distant past has no bearing on the fact that we experience it as we do.  But a world intelligently designed by a deity pragmatically involves the possibility of a promising future, whereas one resulting from unconscious physical forces promises nothing more than a collapse into meaningless obliteration.  On the one hand, if everything we may do or fail to do is determined, why bother doing anything?  On the other hand, if we are free to choose at least some of our actions, then effort can be meaningful.  In the fourth lecture, James states that our world can be viewed as one (monism) or as an irreducible many (pluralism).  There are certain ways in which we humans generate a unity of the objects of our experience, yet the absolute unity to which monism is committed remains a perpetually vanishing ideal.  In his seventh lecture, James identifies three dimensions of reality:  (1) the objects of factual experience; (2) relationships between our sensations and our ideas and among our ideas; and (3) the entire network of truths to which we are committed at any given time.  Again, we see here a combination of subjectivity and pragmatic relevance that views reality as a process of development, which he calls “humanism” (Pragmatism, pp. 7, 43-55, 62-69, 71, 73-74, 110-111, 115-116; see also Truth, pp. 100-101).

c. Monism vs. Pluralism

James intended Some Problems of Philosophy to be largely a textbook in metaphysics, which he defines in terms of the ultimate principles of reality, both within and beyond our human experience.  Much of it concerns the issue of the one and the many, which is arguably the oldest problem of Western philosophy and represents the split between collective monism (such as Hegel’s) and distributive pluralism (such as James himself advocates).  Monism, pursued to its logical extreme, is deterministic, setting up a sharp dichotomy between what is necessary and what is impossible, while pluralism allows for possibilities that may, but need not, be realized.  The former must be either optimistic or pessimistic in its outlook, depending on whether the future that is determined is seen as attractive or unattractive.  In contrast, pluralism’s possibilities allow for a “melioristic” view of the future as possibly better, depending on choices we freely make.  Pluralism need not specify how much unnecessary possibility there is in the world; by contrast, monism must say that everything about the future is locked in from all eternity—to which pluralism says, “Ever not quite.”  James is advocating what he calls the possibility of “novelty” in the world.  Pluralism, being melioristic, calls for our trusting in and cooperating with one another in order to realize desirable possibilities that are not assured (Problems, pp. 31, 114, 139-143, 205, 228-230).  In his Essays in Radical Empiricism, James attempts to distance himself from the philosophical dualism that sees physical reality (bodies) and spiritual reality (minds) as essentially distinct.  He claims that the “philosophy of pure experience” is more consonant with the theory of novelty, indeterminism, moralism, and humanism that he advocates, though it is less than clear why.  We never experience mind in separation from body, and he dismisses as an illusion the notion of consciousness as substantial; however, he does not want to reject the reality of mind as a materialist might do.  So after years of opposing monism, he adopts an admittedly vague sort of neutral (neither materialistic nor idealistic) monism that sees thoughts and things as fundamentally the same stuff, the further definition of which eludes us (Empiricism, pp. 48, 115-117, 120).

6. Freedom and Morality

In the eighth lecture of Pragmatism, James sees monism as tending to a passive sort of quietism rather than to a vital life of active effort.  By contrast, pluralistic pragmatism emphasizes the possibilities that may be if we work to realize them.  Monism determines the future optimistically as working out for the best, whatever we do, or pessimistically as working out for the worst, whatever we do.  By contrast, pluralistic meliorism holds that it can get better if we freely try to make it so.  Whether we embrace the option of freedom and moral responsibility or not is ultimately a matter of personal faith rather than one of objective logic or scientific evidence (Pragmatism, pp. 125, 127-128, 132).

a. Human Freedom

Like God and human immortality, the possibility of which James defends without firmly committing himself to believing in it (Immortality, pp. 3, 6-7, 10-18, 20, 23-24, 28-31, 35-37, 39-41, 43-45), freedom is a postulate of rationality, an unprovable article of faith.  James wrote an essay on the topic, called “The Dilemma of Determinism.”  After admitting that human freedom is an old and shopworn topic about which we may suspect that nothing new can be said, and that he will not pretend to be able to prove or disprove, he launches a pragmatic justification for believing in it.  Indeterminism, the belief in freedom, holds that there is some degree of possibility that is not necessitated by the rest of reality, while determinism must deny all such possibilities.  These beliefs constitute exhaustive and mutually exclusive alternatives, so that if we reject either, we logically should accept the other.  Let us consider a commonplace example such as walking home from campus.  Before the fact neither the determinist nor the indeterminist can infallibly predict which path will be taken, but after the fact the determinist can irrefutably claim that the path taken was necessary, while the indeterminist can irrefutably claim that it was freely chosen.  Thus far, there is no advantage on either side.  But now consider the example of a man gruesomely murdering his loving wife.  We hear the awful details recounted and naturally regret what the wicked man did to her.  Now, what are we to make of that regret from the perspective of determinism?  What sense can it make to regret what had to occur?  From that perspective, we logically must embrace pessimism (all of reality is determined to be bad) or optimism (everything is destined to work out for the best) or subjectivism (good and evil are merely subjective interpretations we artificially cast on things).  All of these can be logically coherent positions, but each of them minimizes the evil we experience in the world and trivializes our natural reaction of regret as pointless.  From a practical (as opposed to a logical) point of view, can we live with that?  James deliberately puts the point quite personally.  Though thoughtful and reflective pessimists, optimists, and subjectivists can live with it, he would not, because its pragmatic implications would render life not worth living.  In that sense, determinism, though logically tenable, is pragmatically unacceptable, and James commits to indeterminism (Will, pp. 145-146, 150-152, 155-156, 160-161, 175-176, 178-179).

b. Moral Responsibility

In addition to God, immortality, and freedom, moral duty is a fourth postulate of rationality.  James offers us one remarkable essay on the topic, entitled “The Moral Philosopher and the Moral Life.”  He addresses three questions:  (1) the psychological one, regarding the origins of our moral values and judgments, (2) the metaphysical one, regarding the grounds of meaning for our basic moral concepts, and (3) the casuistic one, regarding how we should order conflicting values.  First, our human nature comprises a capacity for an intuitive moral sense, but this must be developed in a context of values that socially evolve.  Second, our basic moral concepts of good and bad, right and wrong, and so forth, are all person-relative, grounded in the claims people make on their environment.  Third, when values conflict, those which would seem to satisfy as many personal demands as possible, while frustrating the fewest, should have priority regardless of the nature of those demands.  This represents a pragmatic form of moral relativism, in which no action can be absolutely good or evil in all conceivable circumstances.  Finally, James distinguishes between “the easy-going mood” that tries to avoid conflict, and “the strenuous mood” that strives to achieve ideals, apparently preferring the latter (Will, pp. 185-186, 190, 194-195, 197, 201, 205, 209, 211).

c. The Meaningfulness of Life

In answering the question of what is the primary objective of human life, James maintains that a natural answer is happiness.  It is this, which motivates us to act and endure.  Evolution is often seen as a progressive advance towards happiness (Varieties, pp. 76, 85).  The person who seems incapable of achieving it may well wonder whether life is worth living; suicide deciding that it is not.  For James there is no absolute answer, and it is relative to the life being lived.  A human life involves an ongoing series of possibilities.  Some of these “maybes” may be realized if we believe in our own capacity to realize them; others will not be, either because we do not try or because we try and fail.  Life can become worth living if we believe that it is and act on that belief, our commitment giving it meaning (Will, pp. 37, 59-62).  Our happiness seems to require that we have ideals, that we strive to achieve them, and that we think we are making some progress towards doing so (Talks, pp. 185-189).

7. Social Philosophy

a. Individuals and Their Communities

James’s philosophy is so individualistic that it does not allow for a robust theory of community.  Still, he offers us some interesting insights and one great paper.  “Great Men and Their Environment” views one’s society as not only a context in which great individuals emerge, but even as playing a selective role in allowing their greatness to develop. In turn, that social environment is affected by them.  Whether or not an individual will be able to have an impact is, to some extent, determined by society.  Thus socially significant individuals and their communities have a dynamic, correlative relationship.  In a follow-up article, “The Importance of Individuals,” he maintains that agents of social change, beyond being gifted in some way(s), tend to take greater advantage of given circumstances than more ordinary persons do (Will, pp. 225-226, 229-230, 232, 259).

b. War and Peace

In the last decade of his life, following the Spanish-American War, in which Theodore Roosevelt, his former student, was the hero, James gave a talk at the banquet of the Universal Peace Congress.  Regarding human nature as essentially antagonistic, he warns against our permanent tendencies to mass violence and the romantic idealization of war.  We have to forever be on our guard to resist those dangerous, destructive tendencies; however, he doubts that humanity will ever be able to achieve universal disarmament and peace.  What we can and should do is work to minimize conflicts and to resolve them non-violently.  The great paper James wrote in the area of social relations, written just a few years before the outbreak of World War I and first published the month he died, is “The Moral Equivalent of War.”  In it he warns of the extreme challenge of suppressing our martial tendencies.  Warfare has become so costly, in terms of treasure and carnage, thanks to modern technology, that we need to find some way of rechanneling the primitive tendencies inherited from our ancestors.  How can we create an atmosphere in which peace is the norm rather than that interim period between wars?  Identifying himself as a “pacifist,” he nevertheless admits that there are desirable human qualities—such as patriotism, loyalty, social solidarity, and national vigor—that have traditionally been nurtured by war and the preparation for conducting it.  The question at hand is whether a “moral equivalent” might be found that would generate such martial virtues without involving the horrible destructiveness of war. Since wars are the results of human choices rather than fatalistically determined, he anticipates a time when they will be formally outlawed among civilized societies.  But then how can these martial virtues still be nurtured?  His answer is that we should draft young adults into national service, as opposed to military service, fighting against adverse natural conditions rather than against fellow human beings, working for a time in coal mines, on constructing roads, and so forth.  Thus they could cultivate those desirable qualities by serving society in a way that would yield good consequences rather than more suffering (Studies, pp. 300-301, 303-306, 267, 269, 275-276, 280, 283, 286-292).

c. Democratic Tolerance and Social Progress

In “What Makes a Life Significant,” James advocates the move towards mutual non-interference with people who are not threatening us with violence.  Tolerance of others is an antidote to cruelty and injustice.  He maintains that the trend of social evolution is in the direction of democratic progress.  Yet this trend requires effort, and its continuation poses challenges for us, including that of striving for a more equitable distribution of wealth (Talks, pp. 170, 178, 189).  His commitments to individual freedom, mutual respect, peaceful interrelationships, and tolerance converge to point us in the direction of progressing towards what he calls “the intellectual republic” (Will, p. 30).  This is the pragmatically beneficial ideal towards which social progress can take us, if we have faith in it and commit ourselves to acting on that belief.

8. References and Further Readings

a. Primary Sources

  • William James, Essays in Radical Empiricism and A Pluralistic Universe (called “Empiricism” and “Universe,” respectively).  New York:  E. P. Dutton, 1971.
  • William James, The Letters of William James, Two Volumes in One, ed. Henry James (called “Letters”).  Boston:  Little, Brown, 1926.
  • William James, The Meaning of Truth (called “Truth”).  Ann Arbor:  University of Michigan Press, 1970.
  • William James, Memories and Studies (called “Studies”).  Westport:  Greenwood Press, 1968)
  • William James, Pragmatism (called “Pragmatism”), ed. Bruce Kuklick.  Indianapolis:  Hackett, 1981.
  • William James, The Principles of Psychology, Two Volumes (called “Principles”).  New York:  Dover, 1950.
  • William James, Psychology:  Briefer Course (called “Psychology”).  New York:  Henry Holt, 1910.
  • William James, Some Problems of Philosophy, ed. Henry James (called “Problems”).  New York:  Longmans, Green, 1931.
  • William James, Talks to Teachers on Psychology:  and to Students on some of Life’s Ideals (called “Talks”).  New York:  W. W. Norton, 1958.
  • William James, The Varieties of Religious Experience (called “Varieties”).  New York:  New American Library, 1958.
  • William James, The Will to Believe and Other Essays in Popular Philosophy and Human Immortality (called “Will” and ”Immortality,” respectively).  New York:  Dover, 1956.

b. Secondary Sources

  • Jacques Barzun, A Stroll with William James.  Chicago:  University of Chicago Press, 1984.
    • This is a non-technical discussion of James’s life and thought.
  • Patrick Kiaran Dooley, Pragmatism as Humanism:  The Philosophy of William James.  Totowa:  Littlefield, Adams, 1975.
    • This is a brief but good introduction to James’s philosophy of human nature.
  • Richard M. Gale, The Philosophy of William James:  An Introduction.  New York:  Cambridge University Press, 2005.
    • This is a monograph on the theory and vision of James.
  • R. W. B. Lewis, The Jameses:  A Family Narrative.  New York:  Farrar, Straus and Giroux, 1991.
    • This is a monumental collective biography of the James family.
  • Gerald E. Myers, William James:  His Life and Thought.  New Haven:  Yale University Press, 1986.
    • This is a long, comprehensive, in-depth analysis of James’s psychology and philosophy.
  • Ralph Barton Perry, The Thought and Character of William James:  Briefer Version.  Cambridge:  Harvard University Press, 1948.
    • This is a classic intellectual biography of James by one of his famous students.
  • Ruth Anna Putnam, The Cambridge Companion to William James.  New York:  Cambridge University Press, 1997.
    • This is a collection of critical analyses of important parts of James’s thought.
  • Robert D. Richardson, William James:  In the Maelstrom of American Modernism.  Boston:  Houghton Mifflin, 2006.
    • This intellectual biography of James studies the man through his work.
  • Robert J. Vanden Burgt, The Religious Philosophy of William James.  Chicago:  Nelson-Hall, 1981.
    • This is a short but illuminating treatment of James’s philosophy of religion.

Author Information

Wayne P. Pomerleau
Email: Pomerleau@calvin.gonzaga.edu
Gonzaga University
U. S. A.

Surrogate Parenting

Surrogate parenting is an arrangement in which one or more persons, typically a married infertile couple (the intended rearing parents), contract with a woman to gestate a child for them and then to relinquish it to them after birth. Surrogate parenting is also sometimes referred to as “contract pregnancy,” in part, so as not to beg the question about who is the child’s real mother, but also to refer to the non-nuclear-familial nature of the arrangement.  That is, this is a mode of parenting which allows a couple to have a child by involving a third party to their relationship who serves as birth mother, whether there is a contract or not. As will become apparent, surrogate parenting complicates the parenting terrain and, as such, raises significant philosophical and ethical issues. This article briefly examines some of the principal differences between commercial and non-commercial forms of surrogate/contract parenting arrangements, including the presentation of arguments against and for the moral appropriateness of this sort of parenting arrangement. After raising some of the principal challenges, four legal remedies for this complex mode of parenting are considered. This is followed by a brief summary of some healthcare organizations’ and professionals’ attitudes toward surrogacy/contract parenting arrangements. The article concludes with an assessment of the current availability and accessibility of surrogacy services and some observations about the future of surrogacy/contract parenting arrangements.

Table of Contents

  1. Introduction
  2. Traditional Surrogacy versus Gestational Surrogacy
  3. Commercial Surrogacy vs. Noncommercial Surrogacy
  4. Moral Arguments against Surrogacy vs. Moral Arguments for Surrogacy
  5. Legal Remedies for Surrogacy Arrangements
    1. Banning
    2. Endorsement
    3. Assimilation
    4. Hands-off
  6. Healthcare Organizations’ and Professionals’ Attitudes toward Surrogacy Arrangements
  7. Conclusion
  8. References and Further Reading
    1. Primary Sources
    2. Secondary Sources

1. Introduction

Based on available statistics, which are quite incomplete due to a lack of reporting regulations, about 1,500 to 2,000 surrogate/contracted babies are born per annum in the United States (Ali and Kelley 2008, 44). In addition, several thousands more are born each year as the result of a surrogate arrangement in a wide-variety of nations worldwide. Australia, Canada, and Brazil report numbers at least as large as those reported in the United States (Covington and Burns 2006, 371); and in India, where commercial surrogacy was legalized in 1992, poor women are recruited to gestate what may amount to hundreds (more likely thousands) of babies for couples throughout the world (Gentleman 2008, A9).

Although surrogate parenting arrangements are spreading across the globe, such arrangements apparently remain the exception rather than the rule. Most people prefer not to involve third parties in their attempts to procreate, and most people cannot afford to finance expensive third-party pregnancies. Still, the existence and use of surrogacy arrangements lead society to raise substantial questions about the nature of parenthood. What makes a parent the real parent of a child? The fact that she or he is genetically related to the child? The fact that she or he rears the child? Or the fact that she gestates the child (at present, gestation is an exclusively female task)? Is surrogate parenting just one of a series of steps towards what some term “collaborative reproduction”? In the future, will an increasing number of people form families that consist of an egg and/or sperm donor(s), a gestational mother, and one or more men and women who collaboratively produce and rear a child to adulthood? Or will most people continue to form the kind of nuclear, biologically-based families that exist today?

2. Traditional Surrogacy versus Gestational Surrogacy

There are two basic types of surrogate parenting: traditional surrogacy and gestational surrogacy. In traditional surrogacy, the surrogate is inseminated with the sperm of the man who intends to be the child’s rearing parent. Because the surrogate is both the genetic mother and the gestational mother of the child, she must legally terminate her parental rights to the child after its birth, at which point the intended female rearing parent may adopt the child as her stepchild. The intended male rearing parent does not need to adopt the child because he is the child’s genetic father.

The situation is quite different in a gestational surrogacy arrangement where both of the intended rearing parents may have a genetic connection to the child. The intended rearing mother’s eggs are fertilized with the intended rearing father’s sperm in vitro (outside the womb). The resulting embryo(s) are then implanted in the surrogate’s womb. The surrogate’s only connection to the child is gestational. In a variation on this arrangement, the intended rearing parents provide the surrogate with an embryo(s) to gestate for them. If the intended rearing parents are also the child’s genetic parents, they are not required to adopt the child. However, if one or both of the intended rearing parents do not have a genetic connection to the child, they may be required to adopt the child. For example, if a couple adopt surplus frozen embryos from an assisted reproduction clinic, they may be required to adopt any child(ren) that result from the embryo(s)’ gestation in a surrogate.

Infertile, heterosexual couples are the group most likely to contract a surrogate mother. However, single people, lesbian couples, gay couples, and even fertile people may also seek the services of a contracted mother. Collaborative reproductive and/or parenting arrangements have a long history throughout the world. For example, in the Judeo-Christian tradition, the Old Testament octogenarian couple, Abraham and Sarah, used a surrogate mother to carry a child to term for them. Much later, throughout the European continent and England, middle- and upper-class women used wet nurses to nurture their infant children. Moreover, in polygamous families, two or more wives of one husband collaboratively raise all of his children. Thus, it should not surprise us that people are increasingly entering into formal surrogacy/contract parenting arrangements.

3. Commercial Surrogacy vs. Noncommercial Surrogacy

Of particular significance in any discussion of surrogate parent arrangements is the fact that some of them are commercial while others are noncommercial. Commercial surrogate parenting arrangements involve monetary payments both to the surrogate and to other third parties. Depending on state laws regulating commercial surrogacy, the cost of such arrangements in the early 21st Century ranged anywhere from $20,000 to as high as $150,000 in the United States. Surrogates typically received about $20,000 with the rest of the costs being paid out to healthcare professionals, counselors, screeners, lawyers, and surrogate brokers. In an effort to hold their costs down, some intended parents have engaged in reproductive tourism, traveling to developing nations, where the total cost for a surrogate parenting arrangement ranges from $5,000 to $12,000 largely because women are willing to rent their wombs for a relatively low sum of money.

In contrast to commercial surrogacy, non-commercial surrogacy involves an arrangement where the intended rearing parents use the services of a family member or a friend. The surrogate’s compensation supposedly consists in the satisfaction she derives from giving the gift of a new human life to people for whom she personally cares. Typically, the surrogate does not expect or want monetary compensation for her gestational services. At the most, she will accept funds to cover costs such as physician’s bills.

4. Moral Arguments against Surrogacy vs. Moral Arguments for Surrogacy

As noted above, the number of surrogate babies born annually in the United States or elsewhere is relatively small. In large measure, it is probably failed, highly-publicized surrogate parenting arrangements such as the notorious Baby M case that continue to put a damper on intended rearing parents using surrogate parenting arrangements. In the 1980s New Jersey Baby M case, Mary Beth Whitehead contracted with William Stern to be artificially inseminated with his sperm, to get pregnant (if possible), to carry the child to term, and then to relinquish the child to Stern. Whitehead and Stern also agreed that Whitehead would receive $10,000 for a healthy child, but only $1,000 for a miscarried or still-born child. In addition, Whitehead agreed not to engage in sexual relations with her husband until she was pregnant; to abstain from harmful substances during the pregnancy; to undergo amniocentesis; and to submit to abortion if Stern so requested. Finally, Whitehead agreed that, upon the child’s birth, she would terminate her maternal rights so that Stern’s wife could adopt the baby.

Whitehead became pregnant and gave birth to a baby girl. But she did not relinquish the baby to Stern. Feeling very attached to the baby, Whitehead refused to abide by the terms of the contract. At first, lawyer Noel Keane, who had brokered the surrogacy arrangement, did not take Whitehead that seriously. He managed to persuade Whitehead to give the baby to the Sterns for an overnight stay. But when Whitehead became extremely upset the next day, the Sterns gave the baby back to her. They thought that within a short time, the $10,000 fee would start looking better to Whitehead than the responsibility of adding another child to her existing two-child family. The Sterns’ speculation turned out to be false. Whitehead soon told the Sterns she had reached a final decision and would never relinquish the child to them. Money was not nearly as important to her as love for the baby. At one point, the entire Whitehead family fled to Florida with the baby to escape the arm of the law, but the New Jersey police tracked the Whiteheads down and seized the baby. By then the Sterns had persuaded Judge Harvey Sorkow to grant them sole custody of the baby.

After a prolonged custody battle between Whitehead and Stern, presided over by none other than Judge Sorkow, the court determined it was in the best interests of “Baby M” to enforce the surrogacy contract. Judge Sorkow thought that Whitehead was unfit parent material because of her emotional instability and the fact that she had entered into the arrangement at all. Stern was given custody of the baby, Whitehead was stripped of her parental rights, and Stern’s wife was permitted to adopt the baby. Whitehead appealed the court’s decision and, after another protracted legal battle, the New Jersey Supreme Court overturned Sorkow’s decision, invalidating commercial surrogacy contracts as a disguised form of baby-selling. As the immediate consequence of its decision, the New Jersey Supreme Court voided the termination of Whitehead’s parental rights and invalidated Mrs. Stern’s adoption of the baby. However, the high court allowed the Sterns to keep the baby with the proviso that Whitehead be given visitation rights. The high court reasoned it was in the best interests of the two-year-old child to remain in the home of the only family she had ever known: the Sterns. Although many people thought this decision was fair, others complained it was an instance where privileged people, the Sterns, were presumed by virtue of their wealth and status to be better parents than poor, working-class people like the Whiteheads.

Cases like the Baby M case have prompted opponents of commercial surrogacy to emphasize that such parenting arrangements tend to exploit poor, young, single, or ethnic/minority women desperate for money, and that some surrogacy agencies instruct surrogates to view themselves as mere incubators for intended rearing parent(s)’ babies. In addition, some opponents of surrogacy object to non-commercial surrogacy for at least two reasons. First, a female family member may be pressured to demonstrate love for another female family member, for example, by serving as a surrogate mother for her. Second, a child may be deeply troubled upon discovering that not the mother who is raising her but actually her aunt is her gestational and perhaps also genetic mother. Finally, yet other opponents of surrogacy object that surrogacy arrangements risk the commoditization of babies as goods or products that can be contracted for as if they were mere things rather than human persons.

Advocates of surrogate parenting accuse its opponents of distorting empirical facts to rationalize their discomfort about breaking the formerly seamless web between genetic, gestational, and rearing forms of parentage. Although those who favor surrogacy concede that intended rearing parents are typically more affluent than most of the surrogates they hire, they deny that intended rearing parents routinely exploit surrogates. They note that, truth be told, most surrogates are white, between 20 and 30 years old, working class (not underclass), and married. Moreover, most surrogacy agencies prefer to use surrogates who have had at least one child and are altruistically as well as financially motivated to work as surrogates. Advocates of surrogate parenting also stress that when intended rearing parents and surrogates have and maintain good relationships with each other, no harm befalls the very-much wanted child. If anything, such a child generally finds him or herself in a particularly loving family. Finally, they claim that in a society that increasingly favors open adoptions and celebrates blended families, surrogate parenting is just another way for people to establish a family.

5. Legal Remedies for Surrogacy Arrangements

In the United Sates, four legal remedies (each with varying permutations) have been proposed at the state level to regulate surrogate parenting. They are: (1) banning commercial surrogate parenting arrangements; (2) legally enforcing most commercial and even non-commercial surrogate parenting arrangements; (3) assimilating most surrogate parenting arrangements into either traditional or modified adoption law; and (4) refusing to legally enforce any surrogate parenting arrangement whatsoever. Each of these remedies has its strengths and weaknesses, and none of them is the clear winner in attempts to properly regulate surrogate parenting arrangements.

a. Banning

A relatively small number of states ban commercial surrogacy, imposing civil and criminal penalties on surrogacy brokers in particular. Michigan is probably the most anti-surrogacy state in the Union. In 1993, Michigan legislators ruled it is a misdemeanor to be party to a surrogacy contract and a felony to serve as a “surrogate broker,” with a maximum penalty of a $50,000 fine and five years in prison (MICH. COMP. LAWS, 1993). Even though states like Michigan typically do not prohibit non-commercial surrogate parenting arrangements, they do refuse to enforce them as binding contracts. States that ban commercial surrogacy do so on the grounds that it is a disguised form of baby-selling that exploits women and commodifies children. They dismiss arguments that the intended rearing parents do not actually pay for the baby, but only for the surrogate’s gestational services. Such states also dismiss arguments that most women who serve as surrogates do so freely, and that the children to which they give birth are not viewed as merchandise but as very much wanted children. Whether outright bans of commercial surrogate parenting arrangements are constitutionally permissible remains an open question, however. Advocates of these arrangements argue that if the only way a married infertile couple, for example, can have child genetically-related to them is to use a surrogate mother service, then prohibiting them from doing so is probably a violation of their fundamental right to procreate.

b. Endorsement

An increasing number of states use codified law to recognize and enforce properly negotiated surrogate parenting contracts. One state, California, uses case law to enforce parenting contracts. Importantly, these states regulate the terms of the surrogacy contract. For example, in most states, intended rearing parents are not permitted to pay for more than the surrogate’s medical and ancillary expenses; and, in all states, intended rearing parents may not interfere with a surrogate’s abortion rights. Interestingly, most states that enforce surrogacy contracts are maximally supportive of intended rearing parents who are also the genetic parents of the child. These states reason that the intended rearing parents have two parental claims—one based on the intent to rear the child and the other based on genetics—which in combination trump any parental claim a surrogate might make solely on the basis of her gestational relationship to the child. In instances of gestational surrogacy where the intended rearing parents supply the surrogate with an embryo that is genetically unrelated to them, codified law and case law rely solely on the intended rearing parents’ intent to establish legal parenthood. Moreover, in some states where intent is recognized as the factor which establishes parenthood, the intended rearing parents may apply for an order, prior to the baby’s birth, directing that their names rather than the names of the surrogate and her husband (if she has one) be entered on the birth certificate.

c. Assimilation

Some states that recognize surrogate parenting arrangements maintain features of adoption law in their codified-law and case-law frameworks. Typically, these states provide the surrogate with a change-of-heart period, usually around 72 hours, during which she may decide not to revoke her parental rights to the child. In the estimation of some legal theorists, intent alone does not necessarily determine rightful parenthood. They believe that the “sweat equity” of gestation should count as establishing some sort of parental claim to a child.

d. Hands-off

Nearly half of the states do not view surrogate parenting arrangements as legally enforceable. Deeming a contract for a mother unenforceable means that, if either the surrogate mother or the intended rearing parents breach the contract, the state will not intervene. The parties to the contract will need to work out their differences in custody court. So, for example, if the intended rearing parents fail to pay the surrogate mother her fee, the state will not help her collect it. Or if the intended rearing parents refuse to take the child from the surrogate mother because they no longer want the child, the state will not force them to become rearing parents. Instead, the state will require the surrogate mother either to maintain her parental relationship with the child or to put the child up for adoption. In the former case, she may be entitled to child support from the genetic or intended rearing father.

As it so happens, a non-enforcement situation is just as risky for the intended rearing parents as it is for the surrogate. If the surrogate mother refuses to relinquish the child to the intended parents, they will not be able to secure custody of the child based solely on the contract they made with the surrogate mother. However, because of the state’s interest in the well-being of the child, a family-law court will rely on the traditional “best interests of the child” standard to resolve custody disputes between the surrogate and the intended rearing parents. In cases of traditional surrogacy, virtually all family law courts will view the custody dispute as one between two genetic parents: the surrogate mother and the intended father. In cases of gestational surrogacy, some family-law courts will view the custody dispute as a conflict between the gestational mother (the surrogate mother) and the genetic mother (most typically the intended mother but in some instances an egg donor) to be decided by appeal to the original intention of the involved parties.

6. Healthcare Organizations’ and Professionals’ Attitudes toward Surrogacy Arrangements

Largely because the legal remedies for surrogacy arrangements in many states are either non-existent or ambiguous, healthcare leaders have been cautious about either total bans of or wholesale endorsements of surrogate parenting. The two main medical societies that set the gold standards for assisted-reproduction accept with some reservations surrogate parenting arrangements. The American College of Obstetrics and Gynecology (ACOG) accepts surrogacy arrangements only when they are medically necessary, and the compensation to the surrogate or gestational mother is based on her services and not on her ability to produce a child for the intended rearing parents. Adopting the assimilationist view described above, ACOG also suggests that private nonprofit agencies, similar to adoption agencies, oversee surrogacy arrangements (ACOG 1990, 133), and that subsequent to the birth of the child, the surrogate or gestational mother be given a short period of time during which she can change her mind about giving up the child.

The other main assisted-reproduction medical society, the American Society for Reproductive Medicine (ASRM), recommends that intended rearing parents avoid a traditional surrogacy arrangement and instead use a gestational surrogacy arrangement. Although the ASRM is not enthusiastic about any widespread use of surrogate mothers, it recognizes surrogacy arrangements as a way for a limited number of people to exercise their procreative freedom. Like ACOG, the ASRM frowns on assisting surrogacy arrangements unless there is a medical reason to do so.

7. Conclusion

Because the assisted-reproduction industry is regulated primarily by non-enforceable practice guidelines, assisted reproduction centers and infertility clinics usually decide their own policies and procedures for surrogacy arrangements. Some centers and clinics limit their services to infertile married couples, whereas other centers and clinics welcome anyone who can pay for their services. Limiting one’s practice to certain groups of people is sometimes a covert form of discrimination against other groups of people, however. It may be morally and legally justifiable for a center or clinic to refuse to assist people who do not have a medical reason, specifically infertility, to contract a surrogate mother. After all, physicians and nurses have no clear obligation to use their skills and/or connections to serve people who do not suffer from a disease, disability, or medical abnormality. However, when physicians and nurses refuse to help gay or lesbian individuals or couples who need to enter into a surrogacy arrangement in order to have a child, their refusal to extend help to these individuals or couples may constitute an act of discrimination.

As surrogate/contract parenting arrangements are normalized and routinized, the U.S. public will probably press federal and state authorities to pass clear legislation governing surrogacy. People in the United States view their procreative rights as sacrosanct: too important to be left to the unpredictable rulings of courts and/or the sometimes arbitrary policies of infertility clinics and assisted reproduction centers. Developing ideal laws to govern surrogate parenting arrangements will be no easy matter, however, not when a gay man can use donor sperm and donor eggs to produce embryos that are then implanted in the womb of a gestational mother. Should the law, on the basis of this gay man’s intentions alone, deem him the legal father of the child? Perhaps so, despite the fact that it will be quite some time before the majority of the U.S. population and the feminist community is comfortable with such a novel surrogate parenting arrangement.

8. References and Further Reading

a. Primary Sources

  • Ali, Lorraine and Raina Kelley. “The Curious Lives of Surrogates.” Newsweek, April 7, 2008, vol. 151, Issue 14, pp. 44-51.
  • American College of Obstetricians and Gynecologists. “Ethical Issues in Surrogate Motherhood.” (ACOG Committee Opinion 88). Washington, DC: ACOG, 1990.
  • Andrews, Lori B. “The Aftermath of Baby M: Proposed Laws on Surrogate Motherhood.” In Richard T. Hull (ed.). Ethical Issues in the New Reproductive Technologies. Belmont: Wadsworth, 1990.
  • Andrews, Lori B. “Alternative Modes of Reproduction.” In Sherrill Cohen and Nadine Taub (eds.). Reproductive Laws for the 1990s. Clifton, NJ: Humana Press, 1988, pp. 361-403.
  • Andrews, Lori B. “Beyond Doctrinal Boundaries: A Legal Framework for Surrogate Motherhood.” Virginia Law Review, November 1995, 81, 2343.
  • Annas, George J. “Death without Dignity for Commercial Surrogacy: The Case of Baby M.” Hastings Center Report, 1988, vol. 18, no. 2, pp. 23-24.
  • ASRM. Third Party Reproduction (Sperm, Egg, and Embryo Donation and Surrogacy. American Society for Reproductive Medicine: Birmingham, Alabama, 2006.
  • Bayles, Michael D. “Genetic Choice.” In Richard T. Hull (ed.). Ethical Issues in the New Reproductive Technologies. Belmont: Wadsworth, 1990, pp. 241-253.
  • Binion, Gayle. “Baby M, Surrogate Parenting and Public Policy.” Policy Studies Review, Spring, 1992, vol. 11, Issue 1, pp. 126-140.
  • Capron, Alexander, and M.S. Radin. “Choosing Family Law over Contract Law as a Paradigm for Surrogate Motherhood.” Law, Medicine, and Health Care, 1988, vol. 16, no. 2, p. 3a.
  • Chesler, Phyllis. The Sacred Bond: The Legacy of Baby M. New York: Times Books, 1988.
  • Covington, Sharon N. and Linda Hammer Burns. Infertility Counseling: A Comprehensive Handbook for Clinicians. Cambridge University Press: Cambridge, 2006.
  • Daar, Judith F. “Assisted Reproductive Technologies and the Pregnancy Process: Developing and Equality Model to Protect Reproductive Liberties.” American Journal of Law and Medicine, vol. 25, no. 4, 1988, pp. 453-478.
  • Davis, Peggy C. “Alternative Modes of Reproduction: Determinants of Choice.” In Sherrill Cohen and Nadine Taub (eds.). Reproductive Laws for the 1990s. Clifton, NJ: Humana Press, 1988, pp. 421-431.
  • Ethics Committee of the American Fertility Society. “Ethical Considerations of Assisted Reproductive Technologies.” Fertility and Sterility, vol. 62, no. 5, Suppl.1, November 1994, pp. 15-125S.
  • Gentleman, Amelia. “India Nurtures Business of Surrogate Motherhood.” New York Times, March 10, 2008, A9.
  • Hull, Richard T. (ed.). Ethical Issues in the New Reproductive Technologies. Belmont, CA: Wadsworth, 1990.
  • In re Baby M (1988), 537 A.2d 1227, N.J.
  • Jaggar, Alison M. “Feminist Ethics.” In Lawrence Becker with Caroline Becker (eds.). Encyclopedia of Ethics. New York: Garland Press, 1993, pp. 361-370.
  • Johnson v. Calvert, 851 P. 2d., 776 (Cal. 1993), cert. denied, 1145. Ct. 206 (1993), and cert. dismissed, 114 S. Ct. 374 (1993).
  • Kass, Leon. “Making Babies Revisited.” In John Arras and Robert Hunt (eds.). Ethical Issues in Modern Medicine. Palo Alto, CA: Mayfield, 1983, pp. 407-413.
  • Ketchum, Sara Ann. “New Reproductive Technologies and the Definition of Parenthood: A Feminist Perspective.” Paper presented at Feminism and Legal Theory: Women and Intimacy, a conference sponsored by the Institute for Legal Studies at the University of Wisconsin-Madison.
  • Krimmel, Herbert T. “The Case against Surrogate Parenting.” The Hastings Center Report. October, 1983, vol. 13, no. 5, pp. 35-39.
  • Mich. Comp. Laws Ann. 722.855 (West 1993).
  • O’Brien, Mary. The Politics of Reproduction. Boston: Routledge and Kegan Paul, 1981.
  • Overall, Christine. Ethics and Human Reproduction: A Feminist Analysis. Boston: Allen and Unwin, 1987.
  • Rich, Adrienne. Of Woman Born. New York: Norton, 1979.
  • Robertson, John A. “Surrogate Mothers: Not So Novel after All.” The Hastings Center Report, vol. 13, no. 5, October 1983, pp. 28-34.
  • Robertson, John A. “Assisted Reproductive Technology and the Family.” Hastings Law Journal, vol. 47, no. 4, April 1996, pp. 911-933.
  • Rothman, Barbara Katz. Recreating Motherhood Ideology and Technology in a Patriarchal Society. New York: W. W. Norton and Co., 1989.
  • Schultz, Marjorie Maguire. “Reproductive Technology and Intention-based Parenthood: An Opportunity for Gender Neutrality.” Wisconsin Law Review, 1990, vol. 2, pp. 377-378.
  • Shannon, Thomas A. and Lisa S. Cahill. Religion and Artificial Reproduction: An Inquiry into the Vatican ‘Instruction on Respect for Human Life’. New York: Crossroad, 1988.
  • Singer, Peter and Deane Wells. Making Babies: The New Science and Ethics of Conception. New York: Charles Scribner’s Sons, 1984.
  • Spallone, Patricia and Deborah Lynn Steinberg. Made to Order: The Myth of Reproductive and Genetic Progee. Oxford: Pergamon Press, 1987.
  • Surrogacy Arrangements Act, 1985, United Kingdom, Chapter 49, p. 2. (1) (a) (b) (c).
  • United States Congress, Office of Technology Assessment. Infertility: Medical and Social Choices. Washington, DC: US Government Printing Office, 1988, OTA-BA-358.

b. Secondary Sources

  • Andrews, Lori B. Between Strangers: Surrogate Mothers, Expectant Fathers, & Brave New World Babies (New York: Harper & Row, 1989).
    • Andrews’ well-balanced and accessible examination of surrogate parenting arrangements provides many case studies, statistics, and detailed policies for readers to ponder.
  • Chesler, Phyllis. Sacred Bond: The Legacy of Baby M (New York Times Books, 1988).
    • Chesler analyzes every aspect of the Baby M case and provides a deep examination of the factors that truly make for good parents.
  • Corea, Gena. The Mother Machine: Reproductive Technologies from Artificial Insemination to Artificial Wombs (New York: Harper & Row, 1985).
    • Corea’s classic radical feminist case against all forms of assisted reproduction is accessible and spirited. Although her claims are sometimes exaggerated, she does prompt readers to consider some of the harmful consequences that might result from breaking the links between genetic, gestational, and rearing parentage.
  • Overall, Christine. Reproduction: Principles, Practices, Policies (Toronto: Oxford University Press, 1993).
    • Overall’s discussion of assisted reproduction and surrogate parenting is an excellent contribution to the literature. Its style is that of the probing philosopher with more questions than answers.
  • Robertson, John. Children of Choice: Freedom and the New Reproductive Technologies (Princeton, NJ: Princeton University Press, 1994).
    • Robertson provides the best case for a nearly absolute right to procreative liberty. Strong arguments for state enforcement of properly-framed surrogate parenting arrangements can be found in his very comprehensive survey of laws governing assisted reproduction in the United States.
  • Blythe, Eric and Ruth Landau. Third Party Assisted Conception across Cultures: Social, Legal, and Ethical Perspectives (London: Jessica Kingsley Publishers, 2004).
    • Blythe and Landau provide a survey of assisted reproductive technologies throughout the world. They help readers understand why some cultures accept and others reject surrogate parenting arrangements.
  • Markens, Susan. Surrogate Motherhood and the Politics of Reproduction (Berkeley: University of California Press, 2007). Markens compares and contrasts California’s and New York’s statutes involving surrogate contracts.
    • She explores how voices on all sides of reproductive technology at times converge with regards to creating surrogate policy.
  • Shevory, Thomas C. Body/Politics: Studies in Reproduction, Production, and (Re)Construction (Westport, CT: Praeger Publishers, 2000).
    • Shevory devotes a chapter of his book to a discussion of the complex nature of surrogacy contracts.

Author Information

Rosemarie Tong
Email: rotong@uncc.edu
University of North Carolina, Charlotte
U. S. A.

William Mitchell (1861—1962)

MitchellSir William Mitchell was the first major philosopher to live in South Australia. He worked at Adelaide University from 1895 to 1940 primarily in the area of what is now known as cognitive science. His major work: Structure and Growth of the Mind is a treatise on philosophical psychology.

Mitchell anticipated the claims of Nagel, McGinn, and Chalmers and their emphasis on the nonreductive character of subjective experience. He also anticipated the themes associated with perceptual plasticity, developmental accounts of modularity, and connectionism.

Mitchell’s non-reductive view of experience is historically awkward to place between Australia’s 19th century idealism and 20th century radical materialism. Mitchell thought the mind was a structure reacting to the environment. These reactions constitute experiences, through which objects can be known, similar to idealism. Studying these experiences provide “direct” evidence (or data) of the mind. Mitchell also recommended the study of the brain, which provides “indirect” evidence of the mind. The (then) emerging sciences, such as neuroscience, provide an important but limited explanation of the mind. This distinguishes Mitchell from most present contemporaries.

Mitchell explains the growth of the mind through three kinds of content found in experience: feelings, interests, and actions. Experience begins by sensations or by what we feel, which develop into interests of various levels of perception, which in turn may result in action. Although some psychologists and philosophers, like Piaget and Nagel, later present accounts similar to the idea of mental growth, most contemporary accounts of mind focus on the indirect methods or on representational and computational functions of the brain. Contemporary accounts sympathetic to non-symbolic modal information processing may find interest in Mitchell’s work.

Table of Contents

  1. Biographical Sketch
  2. Scottish and Australian Philosophy, The Background
    1. Idealism
    2. Realism and Materialism
    3. Psychology
    4. Neuroscience
  3. Contemporary Philosophy of Mind
  4. Mitchell’s Philosophy of Mind
  5. Contemporary Cognitive Science
  6. Why Mitchell has been Forgotten
  7. References and Further Reading

1. Biographical Sketch

William Mitchell was born in Inveravon in north Scotland in 1861, the son of a hill farmer. He was one of six children. Before he died in 1962 at the age of 101, he had distinguished himself both as Vice Chancellor (1916-1942) and later Chancellor (1942-48) at the University of Adelaide in South Australia. He held the Hughes Chair in English Language and Literature and Mental and Moral Philosophy, and was the first (and to date only) philosopher working within Australia to give the Gifford Lectures at the University of Aberdeen. This he did in 1924 and 1926. In 1927 he was knighted for his services to South Australia (Miller, 1929, p. 248).

In South Australia, Mitchell is remembered as an important figure at Adelaide University. He is certainly well-known for his contributions to scholarly life: this included obtaining grants for the University; founding the chair of biochemistry; spending large sums on library acquisitions; making many administrative contributions (the neo-Gothic Mitchell Building on North Terrace in Adelaide is named in his honour). However, he was also a first-rate philosopher. He published his first paper in Mind while still an undergraduate, and later, two discursive and wide-ranging books with MacMillan; the first entitled: Structure and Growth of the Mind (1907) ranged over issues in mind and content, philosophical psychology and neuroscience; the second The Place of Minds (1933) covered issues overlapping mind and the philosophy of physics, including the then relatively new area of quantum mechanics. The only copy of the third manuscript The Power of Mind—part of the trilogy—is said to have been lost during the London bombing raids. There are surviving manuscripts of this last book and proceedings of it as the last in the series of Gifford lectures—none of which, however, have ever reached print. There are also a number of shorter papers including: “Nature and Feeling”, “Universities and Life”, “Reform in Education”, “Christianity and the Industrial System”, “The Quality of Life”, and others, which were published as monographs by the Hassell printing company in Adelaide. Mitchell was also a regular contributor to the early editions of the Mind journal and regularly wrote shorter topical pieces for the Murdoch paper, The Advertiser, when it was a newspaper of some repute.

As a teacher and academic, Mitchell was highly regarded and something of a polymath, being engaged to teach economics and education as well as philosophy, psychology and literature. It might be disputed how much teaching he actually did in economics and literature—though a recent publication claims that he taught economics four evenings a week in addition to his other duties as professor of philosophy and a Vice Chancellor (“Economics at Adelaide”, 2003, p. 15). There is no doubt that he was a man of considerable energy. For this reason perhaps he described his chair, not as a chair but a sofa. He was also an unpretentious character. It is said, for example, that he didn’t have need for a room in his capacity of Vice Chancellor. If he wanted to see someone on an administrative matter, Mitchell would see them in his room. (Smart, 1962). Because of his considerable abilities as an academic, administrator, and intellectual/social commentator, Duncan and Leonard describe Mitchell as “the nearest approach to a philosopher-king the academic world has ever seen” (Duncan and Leonard, 1973, p. 78; Trahair, 1984, p. 52).

Mitchell always considered himself to be, first and foremost, a philosopher (Smart, 1962). He was, arguably, Australia’s first significant philosopher. Yet, curiously, he is not remembered at all as such. In academic terms, he is today a largely forgotten figure. The last serious discussion known to appear in print on Mitchell’s work was probably in Blanshard’s Nature of Thought in 1939; the last review of his books appeared in 1934 (Harvey and Acton wrote reviews in the same year; an earlier review by Hoernlé appeared in 1909); the last postgraduate dissertation in 1984 (Allen, 1984, see also Allen, 1995). No mention is made of Mitchell in contemporary philosophical writing (although see Boucher, in press). In Honderich’s Dictionary of Philosophy, Mitchell’s main work, Structure and Growth of the Mind, is described as the last remaining example of Australian idealism which “still survives” (Honderich, 1995, p. 67). If it survives at all, it certainly doesn’t survive by very much.

2. Scottish and Australian Philosophy, The Background

Although much had been written on early Scottish philosophical influences on the development of Australian philosophy, the focus of this work has centred mainly on the Sydney connection—particularly, the writing and influence of John Anderson, Challis Professor of Philosophy at the University of Sydney (1927-58). (See Anderson, et al., 1962; Anderson, 1980, 1982; Kennedy, 1995; Coombs, 1996; Baker, 1979, 1986; Mackie, 1962, 1977). In contrast to the Andersonian influence, little scholarly work had been undertaken on what impact, if any, Scottish traditions had on philosophical writing elsewhere in Australia.

Western philosophical thought made an appearance in Australia long before Anderson arrived in New South Wales, yet it may be forever overshadowed by Anderson’s legacy. From approximately 1850 a small community of scholars—mostly of Scots origin—working against the considerable difficulties of time and distance (both among themselves and also between them and their colleagues in the northern hemisphere) managed to bring together a philosophical community in Australia, add to the then dominant idealist and quasi-religious debates which occupied the intellectual scene in America and Europe, and leave behind a number of manuscripts and assorted papers which provided the basis for the metaphysical and epistemological work of those that followed. These scholars included Barzillai Quaife, John Woolley, Charles Badham and Francis Anderson in Sydney; M. H. Irving, H. A. Strong, W. E. Hearn, Richard Hodgson, Alexander Sutherland and Henry Laurie in Melbourne; William Mitchell and John McKellar-Stewart in Adelaide; Elton Mayo and Scott Fletcher in Queensland; R. L. Dunbabin in Tasmania; and P. R. Le Couteur and A. C. Fox in Western Australia.

Any systematic survey of the earliest Australian philosophers and their ideas is beyond the scope of this article. For a comprehensive review, see, Grave, 1984. However, it is necessary to mention the background of those philosophers in broad terms before turning to the subject of this article—William Mitchell. Mitchell spanned two groups of philosophers having very different concerns: the idealist and “common-sense” philosophers who worked from the mid to late 1850s until the late nineteenth century; and, what might be called the realist and materialist revolutionaries beginning in Australia in the early twentieth century with fellow-Scot John Anderson, and later dominated by the work of J. J. C. Smart, U. T. Place, D. M. Armstrong, C. B. Martin, and others—a “school” now known internationally as “Australian Materialism” (all except Armstrong were based in Adelaide). Any understanding and appreciation of Mitchell’s work, must be understood in the context of these two very different traditions.

Mitchell was the product of an old and vibrant school of philosophy which had its roots in the Scottish traditions of idealism and “common-sense” philosophy. The dead hand of idealism and the consequences it had for philosophical realism was one of the influences which gave rise to Mitchell’s work. Other early Australian philosophers before, during and after Mitchell’s time also owe their foundations to these traditions. In brief, these influences can be summarised as follows: from the common-sense philosophers such as Thomas Reid (1710-1796), Mitchell accepts the arguments advanced against solipsism and anti-realism, and the idea that the mind may exhibit different information-processing hierarchies. From T. H. Green (1836-1882), Mitchell derived the idea that an uninterpreted sense datum was simply folly. From F. H. Bradley (1846-1924), Mitchell takes the idea that experience—at least initially—is a seamless unity of knower and known. From James Ward (1843-1925), Mitchell takes the important idea that organisms grow, and that an adequate explanation of mental activity must capture this. From William James (1842-1910), Mitchell adopts a version of realism. Each of these ideas are represented in one way or another in Mitchell’s thought.

However, there was another influence on Mitchell’s philosophical development: the challenges forced by the growing relevance of the physical sciences to philosophical speculation about mind. Developments in physics, psychology and neuroscience, for example, were considerable influences at the time Mitchell was working. Both these influences conspired, not intentionally but effectively, to bring about a materialist reaction to idealism that, for better or worse, shared more of its idealist ancestry than the materialism we know today. Consequently, this flavored Mitchell’s work in Australia during the same period. The implications of them for Mitchell’s thought are mentioned below.

a. Idealism

Mitchell is not an idealist in the strict sense, though he certainly came from the idealist tradition. Some of his more shaky arguments even turn on idealist assumptions. This should not be surprising. Mitchell’s views, after all, descend from the influence of the British idealists, T. H. Green and F. H. Bradley, among others, who endeavored to push the empiricist views of Locke and Hume closer to the views of the German idealists. On the other hand, Mitchell was also impressed by the arguments of his compatriots T. Reid, D. Stewart, J. Beattie, W. Hamilton—the Scottish “common sense” theorists, who attacked idealism and tried to outline a doctrine closer to what we would now call “realism”. While it should be acknowledged that idealism is a broad church, and can encompass a wide variety of positions, on balance, Mitchell’s views are best placed at the beginning of another tradition entirely.

Mitchell’s views demonstrate cautious materialist and non-doctrinaire realist themes—themes which have more in common with contemporary philosophical work (for example, current work in cognitive science) than with the idealist tradition; views which are also indicative of the region of the world in which he worked. His writing is best described as marking a transition between the idealist tradition which arrived on Australian soil in the early part of the nineteenth century, and the more radical materialist views which followed (especially in Adelaide)—but, strictly speaking, he belonged properly to neither tradition. There is no doubt that Mitchell wrote like an idealist—sometimes argued like one—but there is an ambiguity in his work which seems to indicate that he was attempting to stake out a position that, for the time, was genuinely original. If he was an idealist, he was only a methodological idealist.

b. Realism and Materialism

There is a light-hearted reason why Mitchell should not be seen as an idealist: for were it so, it would stand as an anomalous case to the oft-quoted remark of Armstrong (and quoted by Devitt, 1984, p. vii) that realism is born only of dry countries with harsh landscapes and strong sunlight, whereas anti-realisms are born of moist countries with misty air and green landscapes where the mind is allowed to wander. (Devitt even claims that a bastion of idealism still survives in Victoria where the sun doesn’t shine quite as much.) Since Mitchell spent most of his philosophical life in Australia—and in the very harsh climate of South Australia—it would be unfitting that, if he was an idealist, he would remain one for long. J. J. C. Smart remembers Mitchell regarding himself as a staunch realist. One recollection recalls Mitchell in conversation with a solipsist: “You know, the trouble with you, is that you think only minds exist”, and adding (under his breath) “and your mind at that.” (Edgeloe, 1993). Not the kind of remark an idealist would make. And, it is certainly not like an anti-realist to make claims such as the following: “No object is made mental, nor altered, by being felt, imagined, or known in any way” (PMW, p. 33) and: “When your ideas quarrel with mine, and when they agree, it is because they….grasp the same object as mine, and to find it independent of our grasp” (PMW, p. 45). Or, finally, his claim: “The room is….not affected by my perceiving it” (SGM, p. 60). If Mitchell is an idealist, he is an unusual one indeed. However, if he is a realist, as Mitchell himself claimed, we may see his pronouncements to the contrary as mere epistemological lapses—perhaps even forgivable ones given the preoccupation of early Australian philosophers with the idealist curse.

Just as Mitchell was no idealist or antirealist, it is also clear that he was no anti-materialist. There are a number of passages which indicate this. Here’s one example (recall that is was written before 1907):

When you try to picture the structure and the action of the mind, remember you are trying to picture the structure and action of the nervous system. In this way you will avoid the usual confusion of trying to picture a hybrid process consisting partly of visible movements and partly of invisible feelings (SGM, p. 7).

It is not unreasonable, therefore, to look for evidence of realist and materialist themes in Mitchell, given that he worked here and not in the misty green landscape of Scotland, and given such pronouncements as those above. It should certainly not be automatically assumed that his views are similar to the tradition from which he descended. I shall submit that Mitchell’s work should be reconsidered in the light of contemporary philosophical debates. Perhaps J. A. Passmore was only partly right when he described Mitchell’s work as articulating “an introduction to an Idealist philosophy for which the mind is the central ontological conception” (Passmore in McLeod, 1963, p. 146). While it is certainly true that, for Mitchell, the role of the mind is a pre-eminent consideration, this doesn’t by itself make him an idealist. The common qualification for being an idealist is that what is real is in some way confined or at least related to the contents of our minds (Honderich, 1995, p. 386). And the evidence for this in Mitchell’s writing is somewhat less clear.

c. Psychology

Aside from the Scottish idealist and common sense traditions, there were other influences which complicate the picture further. These influences indicate that Mitchell was a more sophisticated philosopher than previously thought. These influences came from the discipline of psychology. Mitchell was a near contemporary of the Swiss psychologist Piaget, who argued for an epistemology which was both dynamic and materialist—setting the stage for a later cybernetic approach to epistemology. (Piaget published his first substantial works in 1923, some 16 years after Mitchell’s SGM). Mitchell articulated a kind of early dynamic process philosophy of the structure and growth of the mind which anticipated some of Piaget’s account later to receive wide acclaim in the philosophy of psychology. There are considerable differences here, of course. Whereas Piaget aimed at a strictly empirical developmental psychology underpinned by the influence of some Aristotelian, Kantian and Hegelian philosophical conceptions (with empirical work predominating), Mitchell aimed at—in Passmore’s words—”a psychology which is in turn an introduction to philosophy” (Passmore, 1963, p. 145). That is, a psychology which leads to a new way of thinking philosophically about the mind. Indeed, for Mitchell, philosophy was a kind of psychology.

While there are differences between the two thinkers, there are also similarities: unlike the focus of contemporary philosophy of mind (which deals centrally with ontological questions such as what the mind is—how a neural state can be a representational state, for instance), both Mitchell and Piaget seemed more interested in how the mind grows (how the mind of an infant is different from the mind of an adult; a learned mind differs from one which exhibits “invincible stupidity”; how the minds of lower animals differ from those of primates; and so on.) It was, in other words, an entirely different philosophical agenda. The issue of what minds are was, for Mitchell and his contemporaries, subordinate to the issue of what minds do. Structure and Growth of the Mind is, broadly speaking, an attempt to outline the precise processes undergone by minds during different stages of their growth, and under different conditions. It might be considered a conceptual psychology—or an analytic phenomenology—of the stages of mental growth. And, the central category of this “psychology” was the category of experience. This way of looking at things is currently out of favor among philosophers of mind, though it does seem to be making a come-back (see for example, Karmiloff-Smith’s amalgamation of Fodorian modularity theory and Piagetian themes) (Karmiloff-Smith, 1992).

Other psychologists to influence Mitchell were Wundt, Helmholtz and Stumpf. Additional strong influences on his work come from ethology and related disciplines. For example, Mitchell approvingly cites Lubbock’s work on the senses of insects (Lubbock, 1888, cited in Mitchell, 1907, p. 39 passim) and Preyer’s and Münsterberg’s views about the behavior of lower animals. These influences seem to discredit the claim that Mitchell was an ontological idealist. He was more interested in a naturalist account of mind and content. And he was certainly more interested in evidence from emerging sciences than the inchoate ramblings of British and German idealists (there are no references to either in his books).

d. Neuroscience

Were Mitchell an antimaterialist of some conviction, we might expect rather less of this material to feature in his writings. Yet Mitchell devotes an entire chapter reviewing the then current work in neuroscience, and much of the rest of his work is sprinkled liberally with evidence from such sources. He looks at experiments involving prosthesis and brain bisection, conjectures about differently weighted neuronal paths in animals, and so on. He called this evidence the “indirect” method of understanding mind—indirect because it relied on evidence from the brain, not “direct” evidence from experience as it seems to us, that is, not phenomenological content. Moreover, Mitchell seemed to believe that any proper understanding of mind required an analysis in which evidence from both sources was required. He didn’t think that one needed to be subordinated to the other. Mitchell “saw in psychological and neurological inquiry alternative means of explanation—the philosophical being the more “direct”—rather than attempts to describe entities of a different ontological order” (Passmore, 1963, 147).

3. Contemporary Philosophy of Mind

In contemporary cognitive science, philosophers refer to the “easy” and the “hard” problem of consciousness. The “easy” problem consists in how brains might do things such as represent perceptions in thought in a neural or computational form. The “hard” problem consists in explaining how things seem to us in experience (the “what it is like” of consciousness) (Chalmers, 1996). Many contemporary cognitive scientists believe one can’t understand mind without an understanding of the “hard” problem, as this requires an understanding of “subjectivity”, or experience “from the inside.”

This distinction approximates Mitchell’s “indirect” and “direct” distinction to this extent: While the “indirect” method offers a potentially complete understanding of “the immediate physical correlates” (SGM, p. 450) of experience, only the direct method offers an understanding of what experience is like “from the inside”. Both approaches, according to Mitchell, are essential. While Mitchell did not have the conceptual resources to understand features of mind that we have today (courtesy of the modern computer and its binary method of information storage), he did have enormous faith that the indirect method would yield considerable insights; hence his emphasis on neuroscience. In the final chapter of SGM, Mitchell even sketches what an indirect account might look like—an account which has a startling resemblance to recent “connectionist” models (McClelland, 1999; McClelland and Rumelhart, 1986).

However, while he thought this important, he also thought that this could only ever be a “correlate” of mind as it is experienced by us. Thus, he argued for a cautious, non-reductive physicalism and rejected materialist accounts which promised more. One certainly can’t understand mind without both the “direct” and “indirect” methods according to him. Mitchell’s account of mind, to the extent that it makes a contribution to such views, is thus historically relevant to the debates in present day philosophy of mind.

It could even be argued, that Mitchell anticipated the views of contemporary theorists such as Thomas Nagel, Colin McGinn and David Chalmers—the “new mysterians”, as they are sometimes disparagingly called. These theorists argue, in very different ways, for the claims that: 1. the subjective quality of experience is essentially dissimilar from objective descriptions of brain states; and 2. the current brain sciences are limited in their application. They are united in their view that, while the evidence from the neurosciences is impressive, these results don’t tell us anything about consciousness properly so-called, even though they might tell us a good deal about associated problems to do with mentality (how a propositional attitude can be a representational state, and so on). They are also united in their regard for the importance, and non-reducibility of subjective experience.

None of the “new mysterians” are dualists by fiat (although many of them openly espouse dualism); they are, rather, unconvinced that a materialist theory of mind in its present form will do the job. Materialism can’t be said to be false—indeed, Nagel states this much explicitly (Nagel, 1979, pp. 175-6). Chalmers, likewise, exhibits a reluctance to say that materialism can’t at present do the job required, and advocates a monism which is “broader”. So it seems that the new mysterians are not hostile to materialism—only unwilling to take it seriously as a complete theory of mind (this point is not often stressed in the literature). The theory of mind they argue for would have to offer an account of the subjective character of experience without attempting to eliminate, reduce or otherwise distort the “what it is like” of phenomenal experience. To paraphrase Chalmers, the right theory of consciousness will have to “feel the problem [of subjective experience] in its bones”. One can, perhaps, describe the new mysterians, in a very liberal mood, as very cautious materialists (so cautious as to support dualism or panpsychism). And, in this sense, Mitchell was one too—though he doesn’t reach such radical conclusions.

The other point worth noting is that Mitchell also anticipated the views of some contemporary cognitive scientists, especially those theorists who are somewhat sympathetic to the claims of the new mysterians but who don’t wish to be tarred with the same “new mysterian” brush.

Where is the evidence that Mitchell anticipated such views? Briefly, though not conclusive evidence on its own, some of his remarks about mind do see him articulating a position which has similarities with some of these more recent theorists:

A mind and its experience are realities that are presentable to sense as the brain and its actions. In that respect the mind and experience are not parallel with nature, but part of it. And, on the other hand, the facts of nature, including the brain, whenever they are phenomena, are not parallel with mental phenomena, but part of them (SGM, p. 23).

In one sense, it is easy to see why the American idealists in the 1930s embraced such comments (see Blanshard, 1939, for extensive reference to Mitchell’s writing). On one reading they seem to suggest that Mitchell thought the brain might be a product of minds: whenever brain states are “phenomenal” states, they are mental phenomena, he seems to say. Given his outright rejection of idealism, and his own insistence that he was a realist, other interpretations of his remarks seem called for. Another, more benign reading is that Mitchell was arguing a similar line to that of Thomas Nagel’s “Dual Aspect” theory: According to Nagel’s account, “both the mental and the physical properties of a mental event are essential properties of it—properties which it could not lack” (Nagel, 1986, p. 48). This too can be a way of interpreting Mitchell’s assertion above. This reading makes no such commitment to idealist doctrines and seems to suggest that Mitchell was trying to outline a kind of non-reductive account in which mental and physical states both feature in a more inclusive account of mind—a “fundamental” theory incorporating both. This too is the emphasis in the theories of Chalmers and McGinn (Chalmers, 1996; McGinn, 1983). Mitchell’s account also bears close similarities to Sellars’ articulation of the “manifest” and the “scientific” images (Sellars, 1963).

Gone are the days, it seems, of either being a realist and materialist, or an idealist and/or dualist, and shunning the possibility of intermediate positions. Now, it seems, empirically-minded philosophers seriously entertain alternative accounts; theories of which Anderson, no doubt, would have disapproved (Cantwell-Smith, 1996; Marshall, 2001). Chalmers is an example of an Australian who has attempted to stake out such an account, though there are others: Keith Campbell and Frank Jackson are examples of contemporary Australian dualists or qualiaphiles, as they are called; though Jackson has recently undergone a change of heart. In any case, a kinder face of Australian materialism can be seen emerging in the late twentieth century, and this probably began with Mitchell. What seems clear from Mitchell’s work is that this trend began long before Anderson’s arrival in Australia, but was overlooked. It is certainly true that Mitchell, unlike Anderson and those materialists that followed him, took consciousness as a phenomenon to be explained in its own terms, not reduced, eliminated or ignored.

I previously outlined the Scottish traditions and Australian traditions which helped to shaped Mitchell’s work. In a later section, I shall suggest that Mitchell’s work has surprising application to current trends in cognitive science. His work thus deserves serious study by contemporary philosophers of mind. I shall briefly outline the central elements of Mitchell’s ideas here before continuing.

4. Mitchell’s Philosophy of Mind

Mitchell’s philosophical contributions have, as their focus, the nature of mind and experience. Particularly, he is interested in the growth of the mind; and, to a lesser extent, its ontology. He does make contributions to the philosophy of science and education; but these fall naturally out of his philosophy of mind. It remains to introduce in general outline what these contributions are and how they differ from present-day theories.

The key elements of Mitchell’s thought are easy enough to state in general terms: experience is the crucial element of our mental lives; or, to put it another way: “mental activity is central in experience” (Miller, 1929, p. 249). As I have suggested, Mitchell is a forerunner of what we now call the “New Mysterians”, who regard conscious subjective experience as a crucial, ineliminable feature of our lives. For Mitchell, it was no different. We are happy or depressed; we worry and at other times we are elated; we feel pains and pleasures. This kind of experience is fundamental to our mental and physical lives, and cannot to be reduced or eliminated.

However Mitchell is not merely interested in such conscious experiences. He recognizes that not all experience is conscious, but is nonetheless important to the growth of the mind. Experience, for Mitchell, covers everything from qualia to high-level intentional content at various levels. There is no principled epistemic divide to be drawn between these levels on Mitchell’s account. One learns about the mind primarily by studying experience directly as we live it (the “direct” approach); and secondarily, by studying the mind indirectly by means of the emerging sciences of the mind, for example, neuroscience (the “indirect” approach). Knowledge acquired by means of the direct approach aids in directing attention to relevant features of the indirect approach (thus, an adequate neuroscience might be directed to features of interest by means of contentful phenomenal experience).

The action of mind is always action on an occasion. The occasion is the moment and conditions under which an experience happens and the content that such conditions bring about. The occasion is a stimulus property (either mental, physical or environmental). Experience is what the mind, the “reacting structure”, does in reaction to its environment (a definition which is sufficiently vague to cover all aspects of content). Not everything about the mind is always involved on an occasion, only the activity which the occasion calls forth (so, for example, low-level modular-type processing, which do not seem to involve higher level concepts, is consistent with the concept of an occasion).

The organism aims to resolve occasions in order to achieve pragmatic and experiential ends. Thus, we focus our eyes to achieve a better view, etc. However this also occurs at higher levels. So, for example, our concepts are deployed in making sense of more complex experiences. Organisms start off by resolving low-level instinctual experiences, and then move to higher, more satisfactory levels of experience, though this is not so for all creatures on which there might be evolutionary and experiential constraints. As the idea of resolving experiences is a key to Mitchell’s account, this leads to an account which demands levels of experiential content.

There are three main levels of content according to Mitchell: sensory, perceptual and cognitive intelligence. These levels are represented in the following diagram.

The sensory level is roughly equivalent to instinct. Some organisms remain at this level and advance no higher. As Mitchell defines it, the course of instinctive action is: “the power of pursuing an infinite variety of courses, directed throughout by present sensation” (SGM, p. 194). Thus, we resolve our eyes to focus; cup or fix our ears; sniff with our noses. The next level is perceptual intelligence or “interest” which is equivalent to content which already comes with the power to anticipate further experiences (for example, we simply “see” a display of objects and know how to react; we don’t have to infer our course of action). This has a number of levels (feeling, practical and cognitive interests). Some organisms—some humans—even remain at these levels. The last level is cognitive intelligence which is influenced by rules, language and principles, and it helps differentiate the expert from the non-expert. Thus, in Hanson’s sense:

There is a ‘linguistic’ factor in seeing….Unless there were this linguistic element, nothing we ever observed could have relevance for our knowledge. We could not speak of significant observations: nothing seen would make sense, and microscopy would only be a kind of kaleidoscopy. For what is it for things to make sense other than for descriptions of them to be composed of meaningful sentences? (Hanson, 1975, p. 25).

Mitchell differs from Hanson in regarding the higher level conceptual intelligence as containing features of the lower levels as well. Thus, while at higher levels there is a “linguistic factor in seeing”, this is not all there is. Cutting across this tripartite division of forms of intelligence, which constitute broad bands or levels of content, is a distinction between the functions and forms of experience: feeling, interest and action. Each of these typify the kinds of content that organisms are interested in at particular moments.

On the metaphysics of mind, Mitchell has an interesting case to put. He believes the capacity to experience allows an inference to the notion of mind (Allen, 1984, p. 7). This is rather different from some current approaches which regard to the capacity to experience as a reason to deny the existence of mind (for, example, Dennett’s 1988, 1991, and Churchland’s views, 1979, 1984, 1986). By complete contrast, Mitchell thinks that the very structure of experience is evidence that mind exists (otherwise there would be no evident structure).

However, he does not argue for a faculty-based account of mind, nor the notion of “self” as an ontologically legitimate entity. This, to Mitchell, is an invalid inference. Rather, the working of the mind is a process due to various faculties, but they themselves are not processes and not an experience; rather, the relationship defines nominal entities which stand for what experiences are produced on an occasion. A faculty means, for Mitchell, merely the capacity to produce or the capacity to have, an experience of a certain kind (Miller, 1929, p. 249). Thus, Mitchell is no defender of a literal faculty-based psychology—unlike Fodor, who has recently tried to resuscitate the idea (Fodor, 1983). Rather, his account more closely resembles a defense of some kind of early dynamic process account, recently featured in the literature as “interactivist-constructionist” models (Christensen and Hooker, 1999; van Gelder, 1998, 1999; Port and van Gelder, 1995).

What of Mitchell’s position regarding the metaphysical relation of subject and object? Mitchell claims that in every experience there is differentiation of subject and object. But it does not follow that there is always an experience of difference between two subjects of experience (for example, we can be so absorbed in an experience we can forget the object) (Jackson, 1977). Rather, this differentiation is a product of the mind’s growth. Nor can we infer from one entity to the other qua self-subsistent entities (Miller, 1929, p. 249). For Mitchell, experience involves an implicit two-factor relation: experience helps in the analysis of the two factors in relation, and experience would be impossible without these factors. But, at the same time, experience begins as mere feeling or sensation without the division into subject and object; i.e., as an undifferentiated whole. In this sense, and only this sense, Mitchell follows Bradley. Experience does not, at least initially, consist of ourselves feeling something (for this involves higher-level thought—thought which is part of the later growth of the mind); rather, it is feeling as such, or—as Mitchell calls it—mere sensation; not somebody’s feeling or a feeling of something. Experience contains diversity, but a diversity which is prior to relations (Passmore, 1984, p. 62-3).

Why develop this apparently bizarre idea of mere experience as a non-relational whole? The answer to this is possibly the same as why others, such as Bradley, developed it. Mitchell was writing at a time of considerable Humean influence. Hume, of course, took the opposite assumption to that of Bradley and Mitchell. Instead of regarding experience as an undifferentiated whole, from which distinctions between subject and object arise, Hume took the opposite assumption, a skeptical attitude. He thought of experience as comprising a disconnected “bundle” of sensations on which we impose conventions of regularity and association. On Hume’s account, the “self,” and the subject of experience and action, disappears.

Mitchell, like his Scottish forebears, rejected this assumption as irrational and counterintuitive. Like Bradley, he attempted to ground an account of experience which more closely mirrored the unity, coherence and completeness which we really do find in our conscious lives. Unlike Bradley’s Hegelian musings about the Absolute, however, Mitchell was more interested in an account of the growth of the mind from its undifferentiated feeling to the stock of mental constructions and concepts which we know in experience. In other words, he aimed to construct “a psychology which is in turn an introduction to philosophy” (Passmore, 1984, p. 145).

Thus, Mitchell’s metaphysics is complex, descended from the Scottish common-sense views, British empiricism, and idealist metaphysics. He has idealist sympathies in so far as objects can only be understood or known as the subject of experiences. However, he does not confine objects as mental products in our heads, and he sees objects qua objects as part of a dynamical exchange between organisms and the world which makes experience possible (for a recent account that is similar, see Cantwell-Smith, 1996). In this latter sense, Mitchell can be understood as a die-hard realist. Though if “idealism” is interpreted generously enough to allow for the existence of independent external material objects—as perhaps it should be—he could also be considered an idealist of some conviction.

This point is often confused in the literature. E. M. Miller points out the confusion, and Mitchell’s attitude to it, very clearly indeed:

An idealism that denies external reality is no true idealism. The experience of the real is admitted. What the idealist wants to know is the nature and meaning of reality; and as to its nature and meaning there may be and is a great variety of opinions. No one in his senses doubts the existence of material objects. What brings about endless trouble is the confusion of material existence with the assertion of the existence of a material reality independent of mind. We cannot be conscious of something which is out of consciousness, and if we are conscious of anything, we know somewhat of it. This fact is a necessity of knowledge, and to assert its independence of the relations under which it is experienced as an object of consciousness is to assert nothing. We are not aware of anything to which consciousness does not testify. In a like manner we know mental facts as distinct from physical facts or processes. We may speak of mental processes as internal and of physical processes as external; but neither internality nor externality is applicable to mental processes as such. They are entirely different from the physical. They are not coordinate, to use Mitchell’s words….and “their correlation does not mean identity of nature” (Miller, 1930, p. 10).

The latter remark, that the mental is defined in terms that are neither internal nor external, captures the point that, for Mitchell, the exchange between subject and object is crucial to the nature of mind. For convenience, we refer to the “internal” and “external” (or subject and object), but the mental is not coordinate with either; and though they are often correlated, this does not amount to a relationship of identity. (Compare, the onset of spring and bees: they are coordinate facts, and there is a high correlation between them, but they are certainly not identical.)

5. Contemporary Cognitive Science

Now let us look briefly at the kind of environment current in contemporary philosophy of mind. I shall make a few points about how Mitchell differs from the contemporary discussions, and where he has sympathies. Obviously in an article of this length I can only gesture in the direction of Mitchell’s position on the issues.

1. Contemporary accounts of mind have no account of how and why minds grow. With few notable exceptions (Karmiloff-Smith, Piaget, Vygotsky) this is true. Most philosophers are more interested in ontological questions: What is consciousness?; What is a representational state?; What is a pain?, Are representations computational states?; and so on. They are less interested in the developmental question. Mitchell, by contrast, is concerned with the growth of the mind as the primary metaphysical issue.

2. Contemporary accounts assume that the computational processes of mind are central. The computational account, or—as it is known—the representational theory of mind (RTM) is dominant in the current literature. Computations performed over amodal, structured symbolic expressions tokened in a neural form is considered to be the main processing mechanism for cognitive states. There are a number of variations on how this is supposed to be achieved, but the metaphor of the mind as a computational system is widespread. Contemporary accounts which stress the processing of non-symbolic, modal, perceptual information is now making an appearance in the cognitive science literature, but this is a minority view (Barsalou, 1999). Mitchell is sympathetic with the modal-format account, which makes him rather contemporary.

3. Contemporary accounts subordinate the phenomenal features of mind to their representational/computational features. Many cognitive scientists are principally interested in how brains represent the world in thought. Phenomenological features of experience are an infuriating problem for computational accounts because they seem to resist explanation in the terms of the RTM. If qualia occur at all—and there is much dissension on the question—they are considered to be another form of representational capacity. Thus, the RTM allows for a variety of representational formats. However, it is not clear how neurally encoding—regardless of format—can capture the “what it is like” of phenomenal experience. Mitchell’s account attempts to outline a variety of representational formats employed by the organism at various stages of its cognitive growth.

4. Contemporary accounts assume the “indirect” (neurophysiological) approach to be the best, or only, approach. Contemporary accounts generally assume that the advancing neurosciences will eventually shed insight on questions of consciousness, representation and cognition. There are some who claim that there is an “explanatory gap” and that we are cognitively prevented from crossing it (McGinn, 1991; Levine, 1983). Mitchell agrees that the indirect approach is essential but only in conjunction with the direct approach. This is in line with others who, while they regard the direct approach as valuable, claim that it plays a subordinate role to first person experiential perspectives (Nagel, 1974; Jackson, 1990; Chalmers, 1996). This kind of position is now gaining currency again, long after Mitchell originally proposed it (Edelman, 1992; Flanagan, 1992, 1995; Overgaard, 2001; van Gulick, 1993; see Davies, 2003).

5. Contemporary accounts assume that an epistemology of content is subordinate to an ontology of mind. Contemporary accounts are less interested in epistemological concerns; when they are, it is usually expressed in terms of how minds represent the world in thought in computational terms. However, this already assumes an ontology of mind. Mitchell’s approach is to construct an epistemological account from which an ontology of mind is derived as an inference. The central issue is not what minds are—the key question is how we have the experiences we do. Since experience has structure there must be minds. From the epistemological agenda an “indirect” account of the nature of mind follows.

6. Why Mitchell has been Forgotten

The reasons for the lack of interest in Mitchell’s philosophical work are fourfold: first, Mitchell’s work is historically badly poised. As I have already mentioned, he dealt with themes and ideas at the cross-over point between the death of idealism and “common-sense” philosophy, and the rise of Australian materialism and realism. This virtually ensured that his work sat uncomfortably between scholarly periods, but belonged properly to neither.

Second, his style of writing was poor. Even taking into account the stylistic conventions of the time—and allowing for the difficulty of the philosophical concepts he was engaged with—his work is badly written, often divorced of clear central themes, lacking in detailed exegesis and often ponderous in delivery. (A professor of classics at Adelaide at the time “used to say that he could never understand Mitchell’s books until he had translated them into Latin”.) (Duncan and Leonard, 1973, p. 19; Grave, 1984, p. 22). True enough, obscurity of style is no barrier to greatness (e.g., Wittgenstein). But in Mitchell’s case there were other factors in addition to stylistic obscurity that conspired to defeat him. Moreover, this estimation of Mitchell’s writing was not an individual complaint, but, by and large, consensual: reviewers of Mitchell’s first book complained about the difficulty “in focussing to a definite view the central conceptions upon which the work as a whole rests” (Kemp-Smith, 1908, p. 333). It was also criticized for its “obscurity”, its “somewhat oracular style” (Acton, 1934, p. 245) and even its “undeniable dreariness”. One reviewer pointed out that, while reading it, one always has to “retrace one’s steps and grope for the context”. The same complained that, because of “no contour or difference in emphasis”, reading the book was like “swimming under water with never a chance to come up and look about” (Perry, 1908, p. 45). Norman Kemp-Smith, a philosopher later famous for his extremely clear exposition of Kant’s Critique of Pure Reason, even had the audacity to suggest that Mitchell’s work could have been “condensed to half its present size” without loss, and complained about his “obscurity” and “constant digression into….side issues” (Kemp-Smith, 1908, p. 332). Everybody, except Mitchell himself, found his work virtually impenetrable.

Third, Mitchell’s perspective on the issues of the day was unconventional and is hard to understand even with the hindsight of trends and developments in the late twentieth century. A number of his views are simply unfashionable: for instance, the emphasis taken in both his writing and his classes was that psychology “is the proper introduction to philosophy”; a view certainly not popular today notwithstanding recent interest in a return to “philosophical psychology” (see Gold and Stoljar, 1999).

Fourth, Mitchell made no allowances for the reader: his second book was premised on the reader having read and digested the first; however the first book assumes an acquaintance with the themes and concerns of nineteenth century thought not merely in philosophy, but also in developmental psychology, neuroscience, physics and biology. Thus, for the contemporary reader Mitchell’s writing is now almost beyond reach. His second book, universally regarded as harder to read than the first, presupposes a detailed knowledge of quantum mechanics and other areas of physics very fresh for the time. Not only this, but Mitchell makes no attempt to connect his ideas with the debates which were current at the time in the literature and “never ties his reflections to a specific philosophical controversy” (Passmore, 1962; 1963, p. 145). To make matters worse, Mitchell never provided indexes to his books, and gives no summaries, recapitulations of points, nor linguistic “signposts” to aid the unwitting reader. It is this kind of inconsiderate authorship which helps explain V. A. Edgeloe’s cryptic remark that Structure and Growth of the Mind was, “for more than a quarter of a century….a textbook over which university students, in Adelaide at least, sweated” (Edgeloe, 1966, p. 536).

There is no excuse for such obscurity these days, but in the colonies during the late nineteenth century, things were different. Another reason for Mitchell’s obscurity is the factor of academic isolation to which I have already alluded. J. A Passmore has highlighted this point in relation to his two works Structure and Growth of the Mind and The Place of Minds:

Both books are, very obviously, the products of a solitary thinker. When Mitchell went to South Australia, contacts between Adelaide and the eastern states were rare, voyages to Europe or America even rarer. Few Australian philosophers as much as met Mitchell, and his influence in Australia has not been extensive (Passmore, 1963, p. 145).

There were yet further reasons for the neglect of Mitchell’s work. At around the time Mitchell’s work was beginning to be discussed, a new philosophical star was on the rise. Wittgenstein had emerged on the scene and, along with the influence of Rylean behaviorism, this presented a potent philosophical cocktail. Subjective states and discussions about sui generis conscious states fell into philosophical abeyance. Under the influence of Wittgenstein and behaviorism, issues concerning mind and consciousness began to be seen as no longer topics for fruitful philosophical discussion, but rather avoided or smothered under linguistic analysis. This remained the case well into the latter half of the twentieth century.

7. References and Further Reading

  • Acton, H. B., (1934), “The Place of Minds in the World. Gifford Lectures at the University of Aberdeen 1924-26,” (Review), Mind, Vol. 43, No. 170, pp. 243-245.
  • Allen, H. J., (1984), Mitchell’s Concept of Human Freedom. Masters Dissertation: University of Adelaide.
  • Allen, H. J. (1995), An Exposition of Selected Aspects of the Philosophy of the Late Sir William Mitchell. Unpublished manuscript: University of Adelaide.
  • Anderson, J., Cullum, G., Lycos, K., (eds.) (1962), Studies in Empirical Philosophy. Sydney: Angus and Robertson.
  • Anderson, J., (1980), Education and Inquiry. (ed.) D. Z. Phillips. Oxford: Blackwell.
  • Anderson, J. (1982), Art and Reality: John Anderson on Literature and Aesthetics. Marrickville, NSW: Hale and Iremonger.
  • Baker, A.J., (1979), Anderson’s Social Philosophy. Hong Kong: Angus and Robertson Publishers.
  • Baker, A.J., (1986), Australian Realism: The Systematic Philosophy of John Anderson. U.K.: C.U.P.
  • Barsalou, L. W., (1999), “Perceptual Symbol Systems”, Behavioral and Brain Sciences, 22, pp. 577-660.
  • Blanshard, B., (1939), The Nature of Thought. London: George Allen and Unwin Ltd. (Two Volumes).
  • Boucher, D. (in press) “Sir William Mitchell” in Dictionary of Twentieth Century British Philosophers. Bristol, UK.: Thoemmes Press. http://www.thoemmes.com/dictionaries/20entries.htm
  • Cantwell-Smith, B., (1996), On the Origin of Objects. A Bradford Book. Cambridge, Mass.: M.I.T. Press.
  • Chalmers, D., (1996), The Conscious Mind. Cambridge: Cambridge University Press.
  • Christensen, W. D. and Hooker, C.A., (2000), “An Interactivist-Constructivist Approach to Intelligence: Self Directed Anticipative Learning”, Philosophical Psychology, 13, pp. 5-45.
  • Churchland, P. M., (1979), Scientific Realism and the Plasticity of Mind. New York: Cambridge University Press.
  • Churchland, P. M., (1984), Matter and Consciousness. Cambridge, Mass: MIT Press
  • Churchland, P. M., (1986), “Some Reductive Strategies in Cognitive Neurobiology”, Mind, 95, no. 379, pp. 279-309.
  • Coombs, A., (1996), Sex and Anarchy: The Life and Death of the Sydney Push. Ringwood, Victoria: Viking.
  • Cussins, A., (1992), “Content, Embodiment and Objectivity: The Theory of Cognitive Trails”, Mind 101: pp. 651-688.
  • Davies, W. M., (1996), Experience and Content: Consequences of a Continuum Theory. Aldershot, UK: Avebury.
  • Davies, W. M. (1999) “Sir William Mitchell and the New Mysterianism“, Australasian Journal of Philosophy, Volume 77, No. 3, September 1999: pp. 253-257.
  • Davies, W. M. (2001) “Sir William Mitchell”, SA’s Greats: The Men and Women of the North Terrace Plaques. J. Healey (ed), Historical Society of South Australia Publication.
  • Davies, W. M. (2003) The Thought of Sir William Mitchell 1861-1962: A Mind’s Own Place.  Studies in the History of Philosophy, Volume 73. Edwin Mellen Press: Lewiston, USA.
  • Dennett, D. C. (1988), “Quining Qualia”, Consciousness in Contemporary Science. A. Marcel and E. Bisiach, Oxford, New York: Oxford University Press.
  • Dennett, D. C. (1991), Consciousness Explained. Boston: Little, Brown and Co.
  • Devitt, M., (1984), Realism and Truth, UK: Basil Blackwell.
  • Duncan, W. G. K., and Leonard, R. A., (1973), The University of Adelaide, 1874-1974. Rigby, The Griffin Press, Adelaide. See especially Chapter 7, “The Mitchell Era”.
  • “Economics at Adelaide” (2003), Lumen magazine, University of Adelaide: Adelaide. (URL: http://www.lumen.adelaide.edu.au/2003winter/page15/ [accessed 10/2/04]
  • Edelman, G., (1992), Bright Air, Brilliant Fire, NY: Basic Books.
  • Edgeloe, V. A., (1966), Australian Dictionary of Biography. Volume 10, 1891-1939, “Sir William Mitchell”, Melbourne University Press: Melbourne, pp. 535-537.
  • Edgeloe, V. A., (1993), Servants of Distinction: Leadership in a Young University 1874-1925. University of Adelaide Foundation: Educational Technology Unit.
  • Flanagan, O., (1992), Consciousness Reconsidered. Cambridge, Mass: MIT/Bradford.
  • Flanagan, O., (1995), The Science of the Mind. 2nd Ed., Cambridge, Mass: MIT/Bradford.
  • Fodor, J. A., (1983), Modularity of Mind: An Essay in Faculty Psychology. Cambridge, Mass: MIT/Bradford.
  • Franklin, J., (2000), Corrupting the Youth: A History of Philosophy in Australia. Paddington, Macleay Press.
  • Gold, I. and Stoljar, D., (1999), “A Neuron Doctrine in the Philosophy of Neuroscience”, Behavioral and Brain Sciences, 22, pp. 809-869.
  • Grave, S., (1984), A History of Philosophy in Australia. Hong Kong: University of Queensland Press.
  • Hanson, N. R., (1975), Patterns of Discovery: An Enquiry into the Conceptual Foundations of Science. Cambridge: Cambridge University Press.
  • Harvey, J. W., (1934), “The Place of Minds in the World: Gifford Lectures at the University of Aberdeen 1924-26,” (Review), Journal of the British Institute of Philosophical Studies, Vol. 8, No. 33, pp. 103-106.
  • Hoernlé, R. F. A., (1909), “Structure and Growth of the Mind” (Critical Notice) Mind, New Series, XVIII, pp. 255-264.
  • Honderich, T., (ed.) (1995), The Oxford Companion to Philosophy, Oxford: Oxford University Press.
  • Jackson, F., (1977), Perception: A Representative Theory. Cambridge, New York: Cambridge University Press.
  • Jackson, F., (1990), “Epiphenomenal Qualia”, Mind and Cognition: A Reader. William Lycan (ed) UK: Basil Blackwell.
  • Karmiloff-Smith, A., (1992), Beyond Modularity: A Developmental Perspective on Cognitive Science. Cambridge, Mass: MIT/Bradford.
  • Kemp-Smith, N., (1908), “Review of Structure and Growth of the Mind”, The Philosophical Review, Vol. XVII, No. 3, pp. 332-339.
  • Kennedy, B., (1995), A Passion to Oppose: John Anderson, Philosopher. Melbourne: Melbourne University Press.
  • Levine, J., (1983), “Materialism and Qualia: the Explanatory Gap”, Pacific Philosophical Quarterly, 64: pp. 354-361.
  • Lubbock, J., Sir (1888), On the Senses, Instincts and Intelligence of Animals. London: Kegan Paul.
  • Mackie, J. L., (1962), “The Philosophy of John Anderson“, Australasian Journal of Philosophy, Volume, 40, No. 3, December, pp. 265-282.
  • Mackie, J. L., (1977), “Fifty Years of John Anderson“, Quadrant .77, July.
  • McGinn, C., (1983), The Subjective View: Secondary Qualities and Indexical Thoughts. Oxford: Clarendon Press.
  • McGinn, C., (1991) The Problem of Consciousness. Oxford, Blackwell.
  • McClelland, J., Rumelhart, D. E., et al, (1986), Parallel Distributed Processing: Explorations in the MicroStructure of Cognition. Cambridge, MA: MIT Press.
  • McClelland, J., (1999), “Cognitive Modelling, Connectionist”, in R. A. Wilson and F. C. Keil, The MIT Encyclopedia of the Cognitive Sciences.
  • McLeod, A. L., (1963), The Pattern of Australian Culture. Melbourne University Press: Melbourne, (especially J. A. Passmore’s contribution: “Philosophy”, pp. 131-169).
  • Marshall, P. D., (2001), “Transforming the World Into Experience: An Idealist Experiment”, “Journal of Consciousness Studies”, 8, No. 1, pp. 59-76.
  • Miller, E. M., (1929), “The Beginnings of Philosophy in Australia and the Work of Henry Laurie“, Part 1, Australasian Association of Psychology and Philosophy, Vol VII, No. 4, pp. 241-251.
  • Miller, E. M., (1930), “The Beginnings of Philosophy in Australia and the Work of Henry Laurie“, Part 2, Australasian Association of Psychology and Philosophy, Vol VIII, No. 1, pp. 1-22.
  • Mitchell, W and Ledingham, M., (1890-1977), Papers of Sir William Mitchell and Sir Mark Ledingham. Unpublished MS.
  • Mitchell, W., (1895), The Advertiser, 19/12/1895
  • Mitchell, W., (1895), “Reform in Education”. International Journal of Ethics. October.
  • Mitchell, W., (1898), “What is Poetry?: A Lecture given to the South Australian Teachers Union”. Southern Cross Print.
  • Mitchell, W., (1903), Lectures on Materialism. (Extension lectures—Syllabus of Three) Adelaide: Thomas and Co.
  • Mitchell, W., (1907), Structure and Growth of the Mind. London: Macmillan and Co., Ltd.
  • Mitchell, W., (1908), “Discussion: Structure and Growth of the Mind”, The Journal of Philosophy, Psychology and Scientific Methods, 5, pp. 316-321.
  • Mitchell, W., (1909), Lecture on the Rate of Interest. Adelaide: Vardon and Sons. Institute of Accountants in South Australia.
  • Mitchell, W., (1912), Christianity and the Industrial System. Issued by the Methodists Social Service League. Adelaide: Hussey and Gillingham Ltd.
  • Mitchell, W., (1917), Lecture on the Two Functions of the University and their Cost. Adelaide: Hassel and Son Press.
  • Mitchell, W., (1918), “The National Spirit”, The Advertiser, 19/12/1918.
  • Mitchell, W., “Mitchell Archives”. A Collection of Newspaper clippings, photographs, articles, letters, lectures and notebooks.
  • Mitchell, W., (1925), “The Place of the Mind”, Syllabus of the Gifford Lectures. First Series. UK: University of Aberdeen.
  • Mitchell, W., (1926), “The Power of the Mind”, Syllabus of the Gifford Lectures. Second Series. UK: University of Aberdeen.
  • Mitchell, W., (1927), Jubilee Celebrations 1876-1926. Adelaide (Preface).
  • Mitchell, W., (1929), Nature and Feeling, University of Queensland John Murtagh MacCrossan Lectures. Adelaide: Hassel and Son Press.
  • Mitchell, W., (1929), “How Far Nature is Intelligible”, University of Adelaide Public Lectures, July 30th and August 6th 1929. Lecture Advertisement: Adelaide: Hassel Press.
  • Mitchell, W., (1931), Letter on the University and Education to the Committee on Public Education. Adelaide: Hassel and Son Press.
  • Mitchell, W., (1933), The Place of Minds in the World. Gifford Lectures at the University of Aberdeen, 1924-1926. First Series. London: MacMillan and Co., Ltd.
  • Mitchell, W., (1934), “The Quality of Life”, Proceedings of the British Academy, XX. Annual British Academy Henrietta Herz Lecture. Oxford: Oxford University Press.
  • Mitchell, W., (1937), Universities and Life. Introductory Address. Australian and New Zealand Universities Conference. Adelaide: The Hassell Press.
  • Nagel, T., (1974), “What is it like to be a Bat?”, Philosophical Review, LXXXIII, pp. 435-451.
  • Nagel, T., (1979), Mortal Questions. Cambridge, New York: Cambridge University Press.
  • Nagel, T., (1986), The View from Nowhere. New York: Oxford University Press.
  • Overgaard, M., (2001), “The Role of Phenomenological Reports in Experiments on Consciousness”, Psycholoquy, 12, No. 029. [http://www.cogsci.soton.ac.uk/]
  • Passmore, J., (1962), “John Anderson and Twentieth Century Philosophy”, in J. Anderson, Studies in Empirical Philosophy.
  • Passmore, J., (1977), “Fifty Years of John Anderson“, Quadrant, 21. July.
  • Passmore, J., (1984), A Hundred Years of Philosophy. UK: Penguin, Chaucer Press.
  • Perry, R. B., (1908), “Structure and Growth of the Mind” (Review), Journal of Philosophy, Psychology and Scientific Method, 5 pp. 45-48.
  • Port, R. F., and van Gelder, T., (1995), Mind as Motion: Explorations in the Dynamics of Cognition. Cambridge: MIT Press.
  • Sellars, W., (1963), “Empiricism and the Philosophy of Mind”, in Science, Perception and Reality. London: Routledge and Kegan Paul.
  • Smart, J. J. C., (1962), “Sir William Mitchell K. C. M. G. (1861-1962)”, Australasian Journal of Philosophy, Volume, 40, No. 3, December, pp. 259-263.
  • Smart, J. J. C., (1989), “Australian Philosophers of the 1950s”, Quadrant, Volume, 33, No. 6, June: pp. 35-39.
  • Sprigge, T. L. S., (1983), A Vindication of Absolute Idealism. Edinburgh: Edinburgh University Press.
  • Trahair, R. C. S., (1984), The Humanist Temper: The Life and Work of Elton Mayo, New Brunswick, NJ: Transaction Books.
  • van Gelder, T., (1998), “The Dynamical Hypothesis in Cognitive Science”, Behavioral and Brain Sciences, 21 (5), pp. 615-627.
  • van Gelder, T., (1999), “Revisiting the Dynamical Hypothesis”, Preprint Series, University of Melbourne Philosophy Department.
  • van Gulick, R., (1993), “Understanding the Phenomenal Mind: Are we all just Armadillos?”, in Consciousness: Psychological and Philosophical Essays, M. Davies and G. W. Humphreys (eds), Oxford: Blackwells.

Author Information

W. Martin Davies
Email: wmdavies@unimelb.edu.au
The University of Melbourne
Australia

Law and Economics

The law and economics movement applies economic theory and method to the practice of law. It asserts that the tools of economic reasoning offer the best possibility for justified and consistent legal practice. It is arguably one of the dominant theories of jurisprudence. The law and economics movement offers a general theory of law as well as conceptual tools for the clarification and improvement of its practices. The general theory is that law is best viewed as a social tool that promotes economic efficiency, that economic  analysis and efficiency as an ideal can guide legal practice.  It also considers how legislation should be used to improve market conditions  in return. Law and economics offers a framework with which to model legal outcomes, and common objectives with which to unify disparate areas of legal activity. The bringing together of legal theory and economic reasoning has also created new research agendas in the fields of behavioral economics: how rationality affects people’s behavior within legal scenarios; public choice theory and how collective behavior should have an effect on legislation; and game theory: understanding strategic action in a legal context.

Table of Contents

  1. Law as an Autonomous Practice
  2. Law as a Tool to Encourage Economic Efficiency
    1. Basic Concepts in Economic Reasoning
    2. How Law Can Encourage Economic Efficiency
    3. Can All Law be Explained as Economic in Nature?
  3. Economics and Normative Jurisprudence
  4. Later Developments
    1. Behavioral Economics and Law
    2. Game Theory
    3. Public Choice Theory
  5. References and Further Reading

1. Law as an Autonomous Practice

Most traditional theories of jurisprudence look to uncover the essential or definitive aspects of the institution of law. Two of the most influential are Legal Positivism and Dworkin’s Law as Integrity. While these two differ as to their definition of law and legal reasoning, they agree upon some basic central assumptions, determining the conclusions that two philosophical investigations with largely the same aims, can reach. Because of this it is important to acknowledge some of the assumptions that are held in common by these jurisprudential stances.

First, both theories agree upon the conceptual nature of jurisprudence. Both agree that it is important for a philosophical theory of law to define the core aspects of proper legal practice in order to fulfill the function of philosophical jurisprudence. In fact, much philosophical discussion of law assumes that such a characterization is the essential aim of jurisprudence. Second in order to arrive at a properly analyzed concept of law, both legal positivism and law as integrity are best constructed from specific techniques of analytic and linguistic philosophy. These techniques include the investigation and clarification of the way people commonly speak about law and careful parsing of social practice that separate the legal from the non-legal. The third common assumption is that the best way to understand legal practice is to understand the necessary and sufficient qualities that make some rule or statement into a law. Once such a set of necessary and sufficient conditions is identified (or approximated) it is thought that the essential aspects of particularly legal practices have been understood.

Instead of following this path, theorists within the law and economics movement have attacked the study of law from another angle. Rather than trying to identify unique conceptual aspects of law, what is advocated is an investigation of legal practices through the means of economic analysis. The conclusion offered is that legal practice is best understood through its function as a social tool promoting economic efficiency, in common with other social practices.

2. Law as a Tool to Encourage Economic Efficiency

So, instead of looking for the unique and defining features of law, the practitioner of law and economics looks at law as a social tool and tries to evaluate it functionally. What is emphasized is not its uniqueness as an institution, but its place within the general and common economic structure of society. The descriptive claim most often associated with law and economics is that legal practices are best characterized as tools for encouraging economically efficient social relations. To understand this claim it is important to examine some of the basic concepts used in models of economic reasoning.

a. Basic Concepts in Economic Reasoning

Essential to an understanding of the law and economics movement is a set of fundamental concepts. The most central assumption in economics is that human beings are rational maximizers of their individual satisfactions, and, in turn, respond to incentives. A rational maximizer of personal satisfaction adjusts means to ends in the most efficient way possible. It is important to realize that economics, as understood here, is not restricted to analysis of monetary issues; there are nonmonetary as well as monetary satisfactions. Every potential satisfaction is implicated in the calculus of economic satisfactions and therefore can be investigated according to economic or means-end rationality and the trade-off of costs and benefits. Normally what is aimed at through economic reasoning is the improvement of efficiency.

A more efficient allocation is one that increases the net value of resources. Efficiency in the allocation of resources is distinguished from equity, which is concerned with justice in the distribution of wealth. Because some people value specific goods higher or lower than others, economic efficiency can often be raised through voluntary transfers of goods. The most common example of a transfer promoting efficiency is that of a freely entered into contractual relationship. Because one party to the transaction values money more than the item owned, and the other values the item owned more than the asking price, the exchange produces a net gain in economic goods. Each person ends up better off than before. Some economists have gone so far as to argue that such a contractual exchange is morally optimal because it works within both Kantian and utilitarian theories of morality. They argue that it works with Kantian theories because a contract is thought to represent a good example of interaction between free and rational agents. It works with utilitarianism because the idea of wealth maximization intuitively translates into more utility.

Economists have a variety of terms to describe possible outcomes of economic exchanges. For instance Pareto optimality is defined as a point where resources are allocated such that no one is willing to trade further. Pareto optimality is the eventual endpoint of a series of Pareto superior moves. A Pareto superior change makes at least one person better of without making anyone worse off. Because no one is worse off after the trade there are no losers in Pareto improvements, although there may be many different Pareto optimal endpoints. Furthermore, economists have developed the concept of Kaldor-Hicks efficiency to compensate for obstacles to freely contracted exchanges. Kaldor-Hicks efficiency, or potential Pareto superiority, results when the overall economic gains outweigh the losses. In other words, the gains in economic efficiency are large enough that the winners could, if they had to, compensate the losers in the new allocation of goods and still remain better off.

b. How Law Can Encourage Economic Efficiency

The law and economics movement claims that law is best understood as a tool to promote economic efficiency. But how can the institution of law help encourage efficient transactions? One way is to help avoid situations that lead to market failure. One example of market failure is the existence of monopolies: a situation where one party is able to extract more profit from a good than a healthy market would allow. Law can be used as a tool to ensure that monopoly situations are hard to bring about and maintain. Another way legal systems can be used to ensure economically efficient transactions is through the enforcement of valid contracts. By ensuring compliance with contractual terms courts can give parties to a contract confidence that the other party will fulfill the agreed-to obligations. This becomes especially important in situations where the parties must complete their obligations at different times.

But some types of market failure are less obvious, and the legal means toward remedying them subtler. One problem in market transactions is that of externalities. An externality is a cost not reflected in the market price of a good. For instance, a factory may not have to internalize the costs it imposes upon the environment into the selling price of its goods. In this case the market price of the good will not reflect its real cost – and therefore some of the costs are imposed upon parties in an involuntary manner. Pigou argues in regard to this that legal means should be used to impose a marginal tax upon the offending party, to internalize any externalities. The economist Coase argued that this conclusion, while warranted in specific cases, was too global. Coase argued that in a market where transactions are costless and people do not act strategically, rights assignments are irrelevant because from any starting point the results will be economically efficient. In other words, the Coase Theorem states that if there are no transaction costs the assignment of entitlements will be irrelevant to the goal of allocative efficiency. In such a situation there will be no need for law to internalize costs because people will bargain to the most efficient possible allocation of goods. But outside of conceptually ideal markets there are always transaction costs such as information costs, opportunity costs and administrative costs. If transaction costs are somewhat high, then it does matter how property rights are assigned. Therefore the enforcement and allocation of legal entitlements will be an important factor in ensuring economically efficient exchanges. So law can be used to encourage economic efficiency. But is all law best described in economic terms?

c. Can All Law be Explained as Economic in Nature?

It may be no real surprise that law often is used to encourage efficient exchanges. But it seems a stretch to claim that law as an institution is best completely described in economic terms. It seems counterintuitive to view all law as based upon market principles. What the economic analysis of law manages, though , is to see such disparate areas as contract, tort and criminal law as all based upon economic aims, therefore giving law a more coherent basis than other theories can offer. Richard Posner argues that tort cases – those involving private harm – can be seen as contractual by looking for the hypothetical terms that the parties to an accident would have agreed to in advance in order to bring about the accident voluntarily. Also that criminals are deterred by the threat of punishment only if the likelihood of punishment multiplied by the quantity of punishment exceeds the gain offered by the  criminal act. Scholars have been quite effective in extending the tools of economic analysis into areas that seem to be anything but economic in nature. Even rules of evidence and legal ethics have proved amenable to economic analysis. However, it may be argued that an economic explanation of law fails on two counts. Firstly as a descriptive analysis it doesn’t do justice to everyday legal conceptions. Secondly  as an analytical analysis of the necessary conditions for the practice of law  it may not be able to account for the internal point of view which Hart thought so central to a proper understanding of law. More analytical approaches to economic explanation of law have considered this a fatal flaw in the project (see Coleman 2001). This may be mistakenly importing traditional philosophical aims into a drastically different project, but the truth is that it is often hard to tell what types of theoretical claims are being made within law and economics. If the claims are of exhaustive descriptive accuracy or of the necessary and sufficient conceptual foundations of law then it is more than likely a failure. But whether or not law and economics is an accurate or even conceptually necessary description of law as a social institution, and whether or not it suffices as a complete analysis of law, it could be argued that law should in any case adopt economic efficiency as the central aim guiding judicial decision-making.

3. Economics and Normative Jurisprudence

Though analytically incomplete, economic analysis models the actual results of legal institutions better than any other theory. This does not entail, however, that law ought to be consciously used for such an aim. Might not law be better used to consider issues related to  justice, duty and the like? Advocates of law and economics have argued against such a conclusion. The arguments usually are of two types. First, it is claimed that meanings of words such as justice or duty are so vague and in dispute that the use of such concepts for a basis of judicial decisions offers no guidance whatsoever. It is argued that while such concepts are unhelpfully complex, the tools of economic analysis and the concept of economic efficiency are sufficiently clear to provide the judge a solid and predictable basis of decision. Law is better able to decide according to efficiency rather than justice or duty due to limitations of institutional competence. This might be so if issues of justice are so complex as to involve information that courts are structurally unable to process. Second, it has been argued that because the paradigm case of justice is the freely entered in to contract, law is best seen as a tool to optimize contractual arrangements. If this is so, then where law can help is in situations where transaction costs are so high as to prohibit efficient contractual relationships. Here Posner argues that law can encourage economic efficiency by assigning property rights to those parties who would have secured them through market exchange if transaction costs were lower. In other words law should bring about allocations that mimic the results of a properly functioning market. In addition, advocates of economic analysis of law make a claim that other jurisprudential traditions seem to be unable to: that the analytic tools offered by law and economics has encouraged the further creation of other productive areas for analyzing law (see Posner 1998).

4. Later Developments

Another argument for the fertility of the economic analysis of law is that it has spawned a number of further tools that seem helpful in understanding legal institutions. Three of the most important of these are the results of behavioral economics, game theory and public choice theory.

a. Behavioral Economics and Law

Practitioners of behavioral law and economics examine human limits to means-end rationality. One of the outcomes of behavioral economics is the concept of bounded rationality. Bounded rationality means that information is not processed according to a model of perfect means-end rationality but, to the contrary, is distorted due to limits of our cognitive abilities. For instance the endowment effect is thought to be a behavioral limit that distorts the proper valuation of property, an important aspect of bargaining to efficient outcomes. According to the effect, the ownership of objects creates an irrational cognitive overvaluation of them. Another claim is that our cognitive abilities are distorted by the availability heuristic. According to this the availability of strong imagery may induce us to over or underestimate the actual probability of events associated with the image. For instance, graphic representations of highly improbable harms might be more influential on behavior and demand unjustified use of resources than statistical analysis showing another equally undesirable harm to be more common and easier to avoid. Jurisprudential practices could be significantly influenced by such results. For instance, judges might be as irrationally influenced by the availability heuristic as other human beings. Therefore victim impact statements might be important correctives to proceedings if a well-presented defendant’s presence in the court skews judge or jury’s decisions. An awareness of such a cognitive failure could help adjust legal reasoning and its conclusions accordingly. Finally, an awareness and exploitation of universal cognitive limits might help legislators to design more effective laws (see Sunstein 2000).

b. Game Theory

Game theory adds to economic modeling the phenomenon of strategic action. Strategic actions are those adopted because of the competitive nature of many social transactions. They are adopted due to how one individual expects another to act in response. For example, a person who wishes to buy an item cheap would act disinterested so as not to signal his or her actual desires to the seller. Addition of analytic tools dealing with strategic action greatly strengthens the economic analysis of law. For instance, the Coase theorem, to function properly, necessarily excludes strategic action; cooperation is just assumed. But it seems apparent that legal actions often are deeply implicated in and animated by strategic motives. Common sense tells us that full open cooperation is not always the best path to bringing about one’s desired results. In fact much of the bargaining invested in designing an effective contract seems to be done in the shadow of potential strategic action on the part of the contracting parties. Designing legal rules with an eye to the possibility of strategic action helps ensure that the rules will not create perverse outcomes. For instance, if a defendant’s privilege against self-incrimination could also encourage an inference of guilt from the silence the privilege would be all but useless. Therefore, courts have not only barred comment on the refusal to testify but also have required that juries, on defendant’s request, make no inference from such a choice (see Baird et al 1994). Further, the understanding that legislators might have adopted specific wording for a law based upon strategic motives may help direct the proper aims of judicial interpretation. This type of claim, though, is often better analyzed by the tools offered in public choice theory.

c. Public Choice Theory

Public choice theory is centered upon how the nature of the legislative process and collective decision making influence the nature of law. It is the application of economic models of decision-making and their results to the issues that traditionally occupy political science, for example Arrow’s Theorem. One claim made within public choice theory is that a proper understanding of collective decision processes will help judges understand their position within the system. If all collective decisions are unavoidably influenced by those who get to frame the questions debated and the order of voting – the agenda-setters – public legislation will need to be interpreted differently than if it were a more neutral recording of collective wishes. Such a theoretical result makes problematic a court’s reference to the intent of the legislature.

5. References and Further Readings

  • Ackerman, Bruce, The Economic Foundations of Property Law (Boston: Little, Brown & Co., 1975)
  • Baird, Douglas, Robert Gertner and Randal Picker, Game Theory and the Law (Cambridge: Harvard University Press, 1994)
  • Becker, Gary S., “Nobel Lecture: The Economic Way of Looking at Behavior,” 101 Journal of Political Economy 385 (1993)
  • Calabresi, Guido, The Costs of Accidents (1970)
  • Calabresi, Guido, and Douglas Melamed, “Property Rules, Liability Rules and Inalienability: One View of the Cathedral,” 85 Harvard Law Review 1089 (1972)
  • Calabresi, Guido, “Some Thoughts on Risk Distribution and the Law of Torts,” 70 Yale Law Journal 499 (1961)
  • Coase, Ronald, “The Problem of Social Cost,” 3 Journal of Law and Economics 1 (1960)
  • Coleman, Jules, “Efficiency, Auction and Exchange: Philosophic Aspects of the Economic Approach to Law,” 68 California Law Review 221 (1980)
  • Coleman, Jules, Market, Morals and the Law (Cambridge: Cambridge University Press, 1988)
  • Coleman, Jules, The Practice of Principle: In Defense of a Pragmatist Approach to Legal Theory (Oxford: Oxford University Press, 2001)
  • Cotter, Thomas F., “Legal Pragmatism and the Law and Economics Movement,” 84 Georgetown Law Journal 2071 (1996)
  • Farber, Daniel, and Philip Frickey, Law and Public Choice (Chicago: University of Chicago Press, 1991)
  • Harrison, Jeffrey L., Law and Economics (St. Paul: West Group, 1995)
  • Horwitz, Morton, “Law and Economics: Science or Politics?,” 8 Hofstra Law Review 905 (1981)
  • Katz, Avery Weiner, Foundations of the Economic Approach to Law (Oxford: Oxford University Press, 1998)
  • Landes, William and Richard Posner, The Economic Structure of Tort Law (Cambridge: Harvard University Press, 1987)
  • Leff, Arthur, “Economic Analysis of Law: Some Realism About Nominalism,” 60 Virginia Law Review 451 (1974)
  • Miceli, Thomas J., Economics of the Law: Torts, Contracts, Property, Litigation (1997)
  • Murphy, Jeffrie G. and Jules L. Coleman, Philosophy of Law (Boulder: Westview Press, 1990)
  • Polinsky, A. Mitchell, An Introduction to Law and Economics (Boston: Little, Brown & Company, 1989)
  • Posner, Richard A., Economic Analysis of Law (New York: Aspen, 5th ed., 1998)
  • Posner, Richard A., The Economics of Justice (Cambridge: Harvard University Press, 1983)
  • Posner, Richard A., Frontiers of Legal Theory (Cambridge: Harvard University Press, 2001)
  • Posner, Richard A., “Gary Becker’s Contributions to Law and Economics,” 22 Journal of Legal Studies 211 (1993)
  • “Symposium on Post-Chicago Law and Economics,” 65 Chicago-Kent Law Review 1 (1989)
  • Sunstein, Cass R., Behavioral Law and Economics (Cambridge: Cambridge University Press, 2000)

Author Information

Brian Edgar Butler
University of North Carolina at Asheville
U. S. A.

Sigmund Freud (1856—1939)

freudSigmund Freud, the father of psychoanalysis, was a physiologist, medical doctor, psychologist and influential thinker of the early twentieth century. Working initially in close collaboration with Joseph Breuer, Freud elaborated the theory that the mind is a complex energy-system, the structural investigation of which is the proper province of psychology. He articulated and refined the concepts of the unconscious, infantile sexuality and repression, and he proposed a tripartite account of the mind’s structure—all as part of a radically new conceptual and therapeutic frame of reference for the understanding of human psychological development and the treatment of abnormal mental conditions. Notwithstanding the multiple manifestations of psychoanalysis as it exists today, it can in almost all fundamental respects be traced directly back to Freud’s original work.

Freud’s innovative treatment of human actions, dreams, and indeed of cultural artifacts as invariably possessing implicit symbolic significance has proven to be extraordinarily fruitful, and has had massive implications for a wide variety of fields including psychology, anthropology, semiotics, and artistic creativity and appreciation. However, Freud’s most important and frequently re-iterated claim, that with psychoanalysis he had invented a successful science of the mind, remains the subject of much critical debate and controversy.

Table of Contents

  1. Life
  2. Backdrop to His Thought
  3. The Theory of the Unconscious
  4. Infantile Sexuality
  5. Neuroses and the Structure of the Mind
  6. Psychoanalysis as a Therapy
  7. Critical Evaluation of Freud
    1. The Claim to Scientific Status
    2. The Coherence of the Theory
    3. Freud’s Discovery
    4. The Efficacy of Psychoanalytic Therapy
  8. References and Further Reading
    1. Works by Freud
    2. Works on Freud and Freudian Psychoanalysis

1. Life

Freud was born in Frieberg, Moravia in 1856, but when he was four years old his family moved to Vienna where he was to live and work until the last years of his life. In 1938 the Nazis annexed Austria, and Freud, who was Jewish, was allowed to leave for England. For these reasons, it was above all with the city of Vienna that Freud’s name was destined to be deeply associated for posterity, founding as he did what was to become known as the first Viennese school of psychoanalysis from which flowed psychoanalysis as a movement and all subsequent developments in this field. The scope of Freud’s interests, and of his professional training, was very broad. He always considered himself first and foremost a scientist, endeavoring to extend the compass of human knowledge, and to this end (rather than to the practice of medicine) he enrolled at the medical school at the University of Vienna in 1873. He concentrated initially on biology, doing research in physiology for six years under the great German scientist Ernst Brücke, who was director of the Physiology Laboratory at the University, and thereafter specializing in neurology. He received his medical degree in 1881, and having become engaged to be married in 1882, he rather reluctantly took up more secure and financially rewarding work as a doctor at Vienna General Hospital. Shortly after his marriage in 1886, which was extremely happy and gave Freud six children—the youngest of whom, Anna, was to herself become a distinguished psychoanalyst—Freud set up a private practice in the treatment of psychological disorders, which gave him much of the clinical material that he based his theories and pioneering techniques on.

In 1885-86, Freud spent the greater part of a year in Paris, where he was deeply impressed by the work of the French neurologist Jean Charcot who was at that time using hypnotism to treat hysteria and other abnormal mental conditions. When he returned to Vienna, Freud experimented with hypnosis but found that its beneficial effects did not last. At this point he decided to adopt instead a method suggested by the work of an older Viennese colleague and friend, Josef Breuer, who had discovered that when he encouraged a hysterical patient to talk uninhibitedly about the earliest occurrences of the symptoms, they sometimes gradually abated. Working with Breuer, Freud formulated and developed the idea that many neuroses (phobias, hysterical paralysis and pains, some forms of paranoia, and so forth) had their origins in deeply traumatic experiences which had occurred in the patient’s past but which were now forgotten—hidden from consciousness. The treatment was to enable the patient to recall the experience to consciousness, to confront it in a deep way both intellectually and emotionally, and in thus discharging it, to remove the underlying psychological causes of the neurotic symptoms. This technique, and the theory from which it is derived, was given its classical expression in Studies in Hysteria, jointly published by Freud and Breuer in 1895.

Shortly thereafter, however, Breuer found that he could not agree with what he regarded as the excessive emphasis which Freud placed upon the sexual origins and content of neuroses, and the two parted company, with Freud continuing to work alone to develop and refine the theory and practice of psychoanalysis. In 1900, after a protracted period of self-analysis, he published The Interpretation of Dreams, which is generally regarded as his greatest work. This was followed in 1901 by The Psychopathology of Everyday Life; and in 1905 by Three Essays on the Theory of Sexuality. Freud’s psychoanalytic theory was initially not well received—when its existence was acknowledged at all it was usually by people who were, as Breuer had foreseen, scandalized by the emphasis placed on sexuality by Freud. It was not until 1908, when the first International Psychoanalytical Congress was held at Salzburg that Freud’s importance began to be generally recognized. This was greatly facilitated in 1909, when he was invited to give a course of lectures in the United States, which were to form the basis of his 1916 book Five Lectures on Psycho-Analysis. From this point on Freud’s reputation and fame grew enormously, and he continued to write prolifically until his death, producing in all more than twenty volumes of theoretical works and clinical studies. He was also not averse to critically revising his views, or to making fundamental alterations to his most basic principles when he considered that the scientific evidence demanded it—this was most clearly evidenced by his advancement of a completely new tripartite (id, ego, and super-ego) model of the mind in his 1923 work The Ego and the Id. He was initially greatly heartened by attracting followers of the intellectual caliber of Adler and Jung, and was correspondingly disappointed when they both went on to found rival schools of psychoanalysis—thus giving rise to the first two of many schisms in the movement—but he knew that such disagreement over basic principles had been part of the early development of every new science. After a life of remarkable vigor and creative productivity, he died of cancer while exiled in England in 1939.

2. Backdrop to His Thought

Although a highly original thinker, Freud was also deeply influenced by a number of diverse factors which overlapped and interconnected with each other to shape the development of his thought. As indicated above, both Charcot and Breuer had a direct and immediate impact upon him, but some of the other factors, though no less important than these, were of a rather different nature. First of all, Freud himself was very much a Freudian—his father had two sons by a previous marriage, Emmanuel and Philip, and the young Freud often played with Philip’s son John, who was his own age. Freud’s self-analysis, which forms the core of his masterpiece The Interpretation of Dreams, originated in the emotional crisis which he suffered on the death of his father and the series of dreams to which this gave rise. This analysis revealed to him that the love and admiration which he had felt for his father were mixed with very contrasting feelings of shame and hate (such a mixed attitude he termed ambivalence). Particularly revealing was his discovery that he had often fantasized as a youth that his half-brother Philip (who was of an age with his mother) was really his father, and certain other signs convinced him of the deep underlying meaning of this fantasy—that he had wished his real father dead because he was his rival for his mother’s affections. This was to become the personal (though by no means exclusive) basis for his theory of the Oedipus complex.

Secondly, and at a more general level, account must be taken of the contemporary scientific climate in which Freud lived and worked. In most respects, the towering scientific figure of nineteenth century science was Charles Darwin, who had published his revolutionary Origin of Species when Freud was four years old. The evolutionary doctrine radically altered the prevailing conception of man—whereas before, man had been seen as a being different in nature from the members of the animal kingdom by virtue of his possession of an immortal soul, he was now seen as being part of the natural order, different from non-human animals only in degree of structural complexity. This made it possible and plausible, for the first time, to treat man as an object of scientific investigation, and to conceive of the vast and varied range of human behavior, and the motivational causes from which it springs, as being amenable in principle to scientific explanation. Much of the creative work done in a whole variety of diverse scientific fields over the next century was to be inspired by, and derive sustenance from, this new world-view, which Freud with his enormous esteem for science, accepted implicitly.

An even more important influence on Freud however, came from the field of physics. The second fifty years of the nineteenth century saw monumental advances in contemporary physics, which were largely initiated by the formulation of the principle of the conservation of energy by Helmholz. This principle states, in effect, that the total amount of energy in any given physical system is always constant, that energy quanta can be changed but not annihilated, and that consequently when energy is moved from one part of the system, it must reappear in another part. The progressive application of this principle led to monumental discoveries in the fields of thermodynamics, electromagnetism and nuclear physics which, with their associated technologies, have so comprehensively transformed the contemporary world. As we have seen, when he first came to the University of Vienna, Freud worked under the direction of Ernst Brücke who in 1873-4 published his Lecture Notes on Physiology (Vorlesungen über Physiologie. Vienna: Wilhelm Braumüller), setting out the view that all living organisms, including humans, are essentially energy-systems to which, no less than to inanimate objects, the principle of the conservation of energy applies. Freud, who had great admiration and respect for Brücke, quickly adopted this new dynamic physiology with enthusiasm. From there it was but a short conceptual step—but one which Freud was the first to take, and on which his claim to fame is largely grounded—to the view that there is such a thing as psychic energy, that the human personality is also an energy-system, and that it is the function of psychology to investigate the modifications, transmissions and conversions of psychic energy within the personality which shape and determine it. This latter conception is the very cornerstone of Freud’s psychoanalytic theory.

3. The Theory of the Unconscious

Freud’s theory of the unconscious, then, is highly deterministic—a fact which, given the nature of nineteenth century science, should not be surprising. Freud was arguably the first thinker to apply deterministic principles systematically to the sphere of the mental, and to hold that the broad spectrum of human behavior is explicable only in terms of the (usually hidden) mental processes or states which determine it. Thus, instead of treating the behavior of the neurotic as being causally inexplicable—which had been the prevailing approach for centuries—Freud insisted, on the contrary, on treating it as behavior for which it is meaningful to seek an explanation by searching for causes in terms of the mental states of the individual concerned. Hence the significance which he attributed to slips of the tongue or pen, obsessive behavior and dreams—all these, he held, are determined by hidden causes in the person’s mind, and so they reveal in covert form what would otherwise not be known at all. This suggests the view that freedom of the will is, if not completely an illusion, certainly more tightly circumscribed than is commonly believed, for it follows from this that whenever we make a choice we are governed by hidden mental processes of which we are unaware and over which we have no control.

The postulate that there are such things as unconscious mental states at all is a direct function of Freud’s determinism, his reasoning here being simply that the principle of causality requires that such mental states should exist, for it is evident that there is frequently nothing in the conscious mind which can be said to cause neurotic or other behavior. An unconscious mental process or event, for Freud, is not one which merely happens to be out of consciousness at a given time, but is rather one which cannot, except through protracted psychoanalysis, be brought to the forefront of consciousness. The postulation of such unconscious mental states entails, of course, that the mind is not, and cannot be, either identified with consciousness, or an object of consciousness. To employ a much-used analogy, it is rather structurally akin to an iceberg, the bulk of it lying below the surface, exerting a dynamic and determining influence upon the part which is amenable to direct inspection—the conscious mind.

Deeply associated with this view of the mind is Freud’s account of instincts or drives. Instincts, for Freud, are the principal motivating forces in the mental realm, and as such they energise the mind in all of its functions. There are, he held, an indefinitely large number of such instincts, but these can be reduced to a small number of basic ones, which he grouped into two broad generic categories, Eros (the life instinct), which covers all the self-preserving and erotic instincts, and Thanatos (the death instinct), which covers all the instincts towards aggression, self-destruction, and cruelty. Thus it is a mistake to interpret Freud as asserting that all human actions spring from motivations which are sexual in their origin, since those which derive from Thanatos are not sexually motivated—indeed, Thanatos is the irrational urge to destroy the source of all sexual energy in the annihilation of the self. Having said that, it is undeniably true that Freud gave sexual drives an importance and centrality in human life, human actions, and human behavior which was new (and to many, shocking), arguing as he does that sexual drives exist and can be discerned in children from birth (the theory of infantile sexuality), and that sexual energy (libido) is the single most important motivating force in adult life. However, a crucial qualification has to be added here—Freud effectively redefined the term sexuality to make it cover any form of pleasure which is or can be derived from the body. Thus his theory of the instincts or drives is essentially that the human being is energized or driven from birth by the desire to acquire and enhance bodily pleasure.

4. Infantile Sexuality

Freud’s theory of infantile sexuality must be seen as an integral part of a broader developmental theory of human personality. This had its origins in, and was a generalization of, Breuer’s earlier discovery that traumatic childhood events could have devastating negative effects upon the adult individual, and took the form of the general thesis that early childhood sexual experiences were the crucial factors in the determination of the adult personality. From his account of the instincts or drives it followed that from the moment of birth the infant is driven in his actions by the desire for bodily/sexual pleasure, where this is seen by Freud in almost mechanical terms as the desire to release mental energy. Initially, infants gain such release, and derive such pleasure, from the act of sucking. Freud accordingly terms this the oral stage of development. This is followed by a stage in which the locus of pleasure or energy release is the anus, particularly in the act of defecation, and this is accordingly termed the anal stage. Then the young child develops an interest in its sexual organs as a site of pleasure (the phallic stage), and develops a deep sexual attraction for the parent of the opposite sex, and a hatred of the parent of the same sex (the Oedipus complex). This, however, gives rise to (socially derived) feelings of guilt in the child, who recognizes that it can never supplant the stronger parent. A male child also perceives himself to be at risk. He fears that if he persists in pursuing the sexual attraction for his mother, he may be harmed by the father; specifically, he comes to fear that he may be castrated. This is termed castration anxiety. Both the attraction for the mother and the hatred are usually repressed, and the child usually resolves the conflict of the Oedipus complex by coming to identify with the parent of the same sex. This happens at the age of five, whereupon the child enters a latency period, in which sexual motivations become much less pronounced. This lasts until puberty when mature genital development begins, and the pleasure drive refocuses around the genital area.

This, Freud believed, is the sequence or progression implicit in normal human development, and it is to be observed that at the infant level the instinctual attempts to satisfy the pleasure drive are frequently checked by parental control and social coercion. The developmental process, then, is for the child essentially a movement through a series of conflicts, the successful resolution of which is crucial to adult mental health. Many mental illnesses, particularly hysteria, Freud held, can be traced back to unresolved conflicts experienced at this stage, or to events which otherwise disrupt the normal pattern of infantile development. For example, homosexuality is seen by some Freudians as resulting from a failure to resolve the conflicts of the Oedipus complex, particularly a failure to identify with the parent of the same sex; the obsessive concern with washing and personal hygiene which characterizes the behavior of some neurotics is seen as resulting from unresolved conflicts/repressions occurring at the anal stage.

5. Neuroses and the Structure of the Mind

Freud’s account of the unconscious, and the psychoanalytic therapy associated with it, is best illustrated by his famous tripartite model of the structure of the mind or personality (although, as we have seen, he did not formulate this until 1923). This model has many points of similarity with the account of the mind offered by Plato over 2,000 years earlier. The theory is termed tripartite simply because, again like Plato, Freud distinguished three structural elements within the mind, which he called id, ego, and super-ego. The id is that part of the mind in which are situated the instinctual sexual drives which require satisfaction; the super-ego is that part which contains the conscience, namely, socially-acquired control mechanisms which have been internalized, and which are usually imparted in the first instance by the parents; while the ego is the conscious self that is created by the dynamic tensions and interactions between the id and the super-ego and has the task of reconciling their conflicting demands with the requirements of external reality. It is in this sense that the mind is to be understood as a dynamic energy-system. All objects of consciousness reside in the ego; the contents of the id belong permanently to the unconscious mind; while the super-ego is an unconscious screening-mechanism which seeks to limit the blind pleasure-seeking drives of the id by the imposition of restrictive rules. There is some debate as to how literally Freud intended this model to be taken (he appears to have taken it extremely literally himself), but it is important to note that what is being offered here is indeed a theoretical model rather than a description of an observable object, which functions as a frame of reference to explain the link between early childhood experience and the mature adult (normal or dysfunctional) personality.

Freud also followed Plato in his account of the nature of mental health or psychological well-being, which he saw as the establishment of a harmonious relationship between the three elements which constitute the mind. If the external world offers no scope for the satisfaction of the id’s pleasure drives, or more commonly, if the satisfaction of some or all of these drives would indeed transgress the moral sanctions laid down by the super-ego, then an inner conflict occurs in the mind between its constituent parts or elements. Failure to resolve this can lead to later neurosis. A key concept introduced here by Freud is that the mind possesses a number of defense mechanisms to attempt to prevent conflicts from becoming too acute, such as repression (pushing conflicts back into the unconscious), sublimation (channeling the sexual drives into the achievement socially acceptable goals, in art, science, poetry, and so forth), fixation (the failure to progress beyond one of the developmental stages), and regression (a return to the behavior characteristic of one of the stages).

Of these, repression is the most important, and Freud’s account of this is as follows: when a person experiences an instinctual impulse to behave in a manner which the super-ego deems to be reprehensible (for example, a strong erotic impulse on the part of the child towards the parent of the opposite sex), then it is possible for the mind to push this impulse away, to repress it into the unconscious. Repression is thus one of the central defense mechanisms by which the ego seeks to avoid internal conflict and pain, and to reconcile reality with the demands of both id and super-ego. As such it is completely normal and an integral part of the developmental process through which every child must pass on the way to adulthood. However, the repressed instinctual drive, as an energy-form, is not and cannot be destroyed when it is repressed—it continues to exist intact in the unconscious, from where it exerts a determining force upon the conscious mind, and can give rise to the dysfunctional behavior characteristic of neuroses. This is one reason why dreams and slips of the tongue possess such a strong symbolic significance for Freud, and why their analysis became such a key part of his treatment—they represent instances in which the vigilance of the super-ego is relaxed, and when the repressed drives are accordingly able to present themselves to the conscious mind in a transmuted form. The difference between normal repression and the kind of repression which results in neurotic illness is one of degree, not of kind—the compulsive behavior of the neurotic is itself a manifestation of an instinctual drive repressed in childhood. Such behavioral symptoms are highly irrational (and may even be perceived as such by the neurotic), but are completely beyond the control of the subject because they are driven by the now unconscious repressed impulse. Freud positioned the key repressions for both, the normal individual and the neurotic, in the first five years of childhood, and of course, held them to be essentially sexual in nature; since, as we have seen, repressions which disrupt the process of infantile sexual development in particular, according to him, lead to a strong tendency to later neurosis in adult life. The task of psychoanalysis as a therapy is to find the repressions which cause the neurotic symptoms by delving into the unconscious mind of the subject, and by bringing them to the forefront of consciousness, to allow the ego to confront them directly and thus to discharge them.

6. Psychoanalysis as a Therapy

Freud’s account of the sexual genesis and nature of neuroses led him naturally to develop a clinical treatment for treating such disorders. This has become so influential today that when people speak of psychoanalysis they frequently refer exclusively to the clinical treatment; however, the term properly designates both the clinical treatment and the theory which underlies it. The aim of the method may be stated simply in general terms—to re-establish a harmonious relationship between the three elements which constitute the mind by excavating and resolving unconscious repressed conflicts. The actual method of treatment pioneered by Freud grew out of Breuer’s earlier discovery, mentioned above, that when a hysterical patient was encouraged to talk freely about the earliest occurrences of her symptoms and fantasies, the symptoms began to abate, and were eliminated entirely when she was induced to remember the initial trauma which occasioned them. Turning away from his early attempts to explore the unconscious through hypnosis, Freud further developed this talking cure, acting on the assumption that the repressed conflicts were buried in the deepest recesses of the unconscious mind. Accordingly, he got his patients to relax in a position in which they were deprived of strong sensory stimulation, and even keen awareness of the presence of the analyst (hence the famous use of the couch, with the analyst virtually silent and out of sight), and then encouraged them to speak freely and uninhibitedly, preferably without forethought, in the belief that he could thereby discern the unconscious forces lying behind what was said. This is the method of free-association, the rationale for which is similar to that involved in the analysis of dreams—in both cases the super-ego is to some degree disarmed, its efficiency as a screening mechanism is moderated, and material is allowed to filter through to the conscious ego which would otherwise be completely repressed. The process is necessarily a difficult and protracted one, and it is therefore one of the primary tasks of the analyst to help the patient recognize, and overcome, his own natural resistances, which may exhibit themselves as hostility towards the analyst. However, Freud always took the occurrence of resistance as a sign that he was on the right track in his assessment of the underlying unconscious causes of the patient’s condition. The patient’s dreams are of particular interest, for reasons which we have already partly seen. Taking it that the super-ego functioned less effectively in sleep, as in free-association, Freud made a distinction between the manifest content of a dream (what the dream appeared to be about on the surface) and its latent content (the unconscious, repressed desires or wishes which are its real object). The correct interpretation of the patient’s dreams, slips of tongue, free-associations, and responses to carefully selected questions leads the analyst to a point where he can locate the unconscious repressions producing the neurotic symptoms, invariably in terms of the patient’s passage through the sexual developmental process, the manner in which the conflicts implicit in this process were handled, and the libidinal content of the patient’s family relationships. To create a cure, the analyst must facilitate the patient himself to become conscious of unresolved conflicts buried in the deep recesses of the unconscious mind, and to confront and engage with them directly.

In this sense, then, the object of psychoanalytic treatment may be said to be a form of self-understanding—once this is acquired it is largely up to the patient, in consultation with the analyst, to determine how he shall handle this newly-acquired understanding of the unconscious forces which motivate him. One possibility, mentioned above, is the channeling of sexual energy into the achievement of social, artistic or scientific goals—this is sublimation, which Freud saw as the motivating force behind most great cultural achievements. Another possibility would be the conscious, rational control of formerly repressed drives—this is suppression. Yet another would be the decision that it is the super-ego and the social constraints which inform it that are at fault, in which case the patient may decide in the end to satisfy the instinctual drives. But in all cases the cure is created essentially by a kind of catharsis or purgation—a release of the pent-up psychic energy, the constriction of which was the basic cause of the neurotic illness.

7. Critical Evaluation of Freud

It should be evident from the foregoing why psychoanalysis in general, and Freud in particular, have exerted such a strong influence upon the popular imagination in the Western World, and why both the theory and practice of psychoanalysis should remain the object of a great deal of controversy. In fact, the controversy which exists in relation to Freud is more heated and multi-faceted than that relating to virtually any other post-1850 thinker (a possible exception being Darwin), with criticisms ranging from the contention that Freud’s theory was generated by logical confusions arising out of his alleged long-standing addiction to cocaine (see Thornton, E.M. Freud and Cocaine: The Freudian Fallacy) to the view that he made an important, but grim, empirical discovery, which he knowingly suppressed in favour of the theory of the unconscious, knowing that the latter would be more socially acceptable (see Masson, J. The Assault on Truth).

It should be emphasized here that Freud’s genius is not (generally) in doubt, but the precise nature of his achievement is still the source of much debate. The supporters and followers of Freud (and Jung and Adler) are noted for the zeal and enthusiasm with which they espouse the doctrines of the master, to the point where many of the detractors of the movement see it as a kind of secular religion, requiring as it does an initiation process in which the aspiring psychoanalyst must himself first be analyzed. In this way, it is often alleged, the unquestioning acceptance of a set of ideological principles becomes a necessary precondition for acceptance into the movement—as with most religious groupings. In reply, the exponents and supporters of psychoanalysis frequently analyze the motivations of their critics in terms of the very theory which those critics reject. And so the debate goes on.

Here we will confine ourselves to: (a) the evaluation of Freud’s claim that his theory is a scientific one, (b) the question of the theory’s coherence, (c) the dispute concerning what, if anything, Freud really discovered, and (d) the question of the efficacy of psychoanalysis as a treatment for neurotic illnesses.

a. The Claim to Scientific Status

This is a crucially important issue since Freud saw himself first and foremost as a pioneering scientist, and repeatedly asserted that the significance of psychoanalysis is that it is a new science, incorporating a new scientific method of dealing with the mind and with mental illness. There can, moreover, be no doubt but that this has been the chief attraction of the theory for most of its advocates since then—on the face of it, it has the appearance of being not just a scientific theory but an enormously strong one, with the capacity to accommodate, and explain, every possible form of human behavior. However, it is precisely this latter which, for many commentators, undermines its claim to scientific status. On the question of what makes a theory a genuinely scientific one, Karl Popper’s criterion of demarcation, as it is called, has now gained very general acceptance: namely, that every genuine scientific theory must be testable, and therefore falsifiable, at least in principle. In other words, if a theory is incompatible with possible observations, it is scientific; conversely, a theory which is compatible with all possible observations is unscientific (see Popper, K. The Logic of Scientific Discovery). Thus the principle of the conservation of energy (physical, not psychic), which influenced Freud so greatly, is a scientific one because it is falsifiable—the discovery of a physical system in which the total amount of physical energy was not constant would conclusively show it to be false. It is argued that nothing of the kind is possible with respect to Freud’s theory—it is not falsifiable. If the question is asked: “What does this theory imply which, if false, would show the whole theory to be false?,” the answer is “Nothing” because the theory is compatible with every possible state of affairs. Hence it is concluded that the theory is not scientific, and while this does not, as some critics claim, rob it of all value, it certainly diminishes its intellectual status as projected by its strongest advocates, including Freud himself.

b. The Coherence of the Theory

A related (but perhaps more serious) point is that the coherence of the theory is, at the very least, questionable. What is attractive about the theory, even to the layman, is that it seems to offer us long sought-after and much needed causal explanations for conditions which have been a source of a great deal of human misery. The thesis that neuroses are caused by unconscious conflicts buried deep in the unconscious mind in the form of repressed libidinal energy would appear to offer us, at last, an insight in the causal mechanism underlying these abnormal psychological conditions as they are expressed in human behavior, and further show us how they are related to the psychology of the normal person. However, even this is questionable, and is a matter of much dispute. In general, when it is said that an event X causes another event Y to happen, both X and Y are, and must be, independently identifiable. It is true that this is not always a simple process, as in science causes are sometimes unobservable (sub-atomic particles, radio and electromagnetic waves, molecular structures, and so forth), but in these latter cases there are clear correspondence rules connecting the unobservable causes with observable phenomena. The difficulty with Freud’s theory is that it offers us entities (for example repressed unconscious conflicts), which are said to be the unobservable causes of certain forms of behavior But there are no correspondence rules for these alleged causes—they cannot be identified except by reference to the behavior which they are said to cause (that is, the analyst does not demonstratively assert: “This is the unconscious cause, and that is its behavioral effect;” rather he asserts: “This is the behavior, therefore its unconscious cause must exist”), and this does raise serious doubts as to whether Freud’s theory offers us genuine causal explanations at all.

c. Freud’s Discovery

At a less theoretical, but no less critical level, it has been alleged that Freud did make a genuine discovery which he was initially prepared to reveal to the world. However, the response he encountered was so ferociously hostile that he masked his findings and offered his theory of the unconscious in its place (see Masson, J. The Assault on Truth). What he discovered, it has been suggested, was the extreme prevalence of child sexual abuse, particularly of young girls (the vast majority of hysterics are women), even in respectable nineteenth century Vienna. He did in fact offer an early seduction theory of neuroses, which met with fierce animosity, and which he quickly withdrew and replaced with the theory of the unconscious. As one contemporary Freudian commentator explains it, Freud’s change of mind on this issue came about as follows:

Questions concerning the traumas suffered by his patients seemed to reveal [to Freud] that Viennese girls were extraordinarily often seduced in very early childhood by older male relatives. Doubt about the actual occurrence of these seductions was soon replaced by certainty that it was descriptions about childhood fantasy that were being offered. (MacIntyre).

In this way, it is suggested, the theory of the Oedipus complex was generated.

This statement begs a number of questions, not least, what does the expression extraordinarily often mean in this context? By what standard is this being judged? The answer can only be: By the standard of what we generally believe—or would like to believe—to be the case. But the contention of some of Freud’s critics here is that his patients were not recalling childhood fantasies, but traumatic events from their childhood which were all too real. Freud, according to them, had stumbled upon and knowingly suppressed the fact that the level of child sexual abuse in society is much higher than is generally believed or acknowledged. If this contention is true—and it must at least be contemplated seriously—then this is undoubtedly the most serious criticism that Freud and his followers have to face.

Further, this particular point has taken on an added and even more controversial significance in recent years, with the willingness of some contemporary Freudians to combine the theory of repression with an acceptance of the wide-spread social prevalence of child sexual abuse. The result has been that in the United States and Britain in particular, many thousands of people have emerged from analysis with recovered memories of alleged childhood sexual abuse by their parents; memories which, it is suggested, were hitherto repressed. On this basis, parents have been accused and repudiated, and whole families have been divided or destroyed. Unsurprisingly, this in turn has given rise to a systematic backlash in which organizations of accused parents, seeing themselves as the true victims of what they term False Memory Syndrome, have denounced all such memory-claims as falsidical — the direct product of a belief in what they see as the myth of repression. (see Pendergast, M. Victims of Memory). In this way, the concept of repression, which Freud himself termed the foundation stone upon which the structure of psychoanalysis rests, has come in for more widespread critical scrutiny than ever before. Here, the fact that, unlike some of his contemporary followers, Freud did not himself ever countenance the extension of the concept of repression to cover actual child sexual abuse, and the fact that we are not necessarily forced to choose between the views that all recovered memories are either veridical or falsidical are frequently lost sight of in the extreme heat generated by this debate, perhaps understandably.

d. The Efficacy of Psychoanalytic Therapy

It does not follow that, if Freud’s theory is unscientific, or even false, it cannot provide us with a basis for the beneficial treatment of neurotic illness because the relationship between a theory’s truth or falsity and its utility-value is far from being an isomorphic one. The theory upon which the use of leeches to bleed patients in eighteenth century medicine was based was quite spurious, but patients did sometimes actually benefit from the treatment! And of course even a true theory might be badly applied, leading to negative consequences. One of the problems here is that it is difficult to specify what counts as a cure for a neurotic illness as distinct, say, from a mere alleviation of the symptoms. In general, however, the efficiency of a given method of treatment is usually clinically measured by means of a control group—the proportion of patients suffering from a given disorder who are cured by treatment X is measured by comparison with those cured by other treatments, or by no treatment at all. Such clinical tests as have been conducted indicate that the proportion of patients who have benefited from psychoanalytic treatment does not diverge significantly from the proportion who recover spontaneously or as a result of other forms of intervention in the control groups used. So, the question of the therapeutic effectiveness of psychoanalysis remains an open and controversial one.

8. References and Further Reading

a. Works by Freud

  • The Standard Edition of the Complete Psychological Works of Sigmund Freud (Ed. J. Strachey with Anna Freud), 24 vols. London: 1953-1964.

b. Works on Freud and Freudian Psychoanalysis

  • Abramson, J.B. Liberation and Its Limits: The Moral and Political Thought of Freud. New York: Free Press, 1984.
  • Bettlelheim, B. Freud and Man’s Soul. Knopf, 1982.
  • Cavell, M. The Psychoanalytic Mind: From Freud to Philosophy. Harvard University Press, 1993.
  • Cavell, M. Becoming a Subject: Reflections in Philosophy and Psychoanalysis. New York:  Oxford University Press, 2006.
  • Chessick, R.D. Freud Teaches Psychotherapy. Hackett Publishing Company, 1980.
  • Cioffi, F. (ed.) Freud: Modern Judgements. Macmillan, 1973.
  • Deigh, J. The Sources of Moral Agency: Essays in Moral Psychology and Freudian Theory. Cambridge, UK: Cambridge University Press, 1996.
  • Dilman, I. Freud and Human Nature. Blackwell, 1983
  • Dilman, I. Freud and the Mind. Blackwell, 1984.
  • Edelson, M. Hypothesis and Evidence in Psychoanalysis. University of Chicago Press, 1984.
  • Erwin, E. A Final Accounting: Philosophical and Empirical Issues in Freudian Psychology. MIT Press, 1996.
  • Fancher, R. Psychoanalytic Psychology: The Development of Freud’s Thought. Norton, 1973.
  • Farrell, B.A. The Standing of Psychoanalysis. Oxford University Press, 1981.
  • Fingarette, H. The Self in Transformation: Psychoanalysis, Philosophy, and the Life of the Spirit. HarperCollins, 1977.
  • Freeman, L. The Story of Anna O.—The Woman who led Freud to Psychoanalysis. Paragon House, 1990.
  • Frosh, S. The Politics of Psychoanalysis: An Introduction to Freudian and Post-Freudian Theory. Yale University Press, 1987.
  • Gardner, S. Irrationality and the Philosophy of Psychoanalysis. Cambridge, Cambridge University Press, 1993.
  • Grünbaum, A. The Foundations of Psychoanalysis: A Philosophical Critique. University of California Press, 1984.
  • Gay, V.P. Freud on Sublimation: Reconsiderations. Albany, NY: State University Press, 1992.
  • Hook, S. (ed.) Psychoanalysis, Scientific Method, and Philosophy. New York University Press, 1959.
  • Jones, E. Sigmund Freud: Life and Work (3 vols), Basic Books, 1953-1957.
  • Klein, G.S. Psychoanalytic Theory: An Exploration of Essentials. International Universities Press, 1976.
  • Lear, J. Love and Its Place in Nature: A Philosophical Interpretation of Freudian Psychoanalysis. Farrar, Straus & Giroux, 1990.
  • Lear, J. Open Minded: Working Out the Logic of the Soul. Cambridge, Harvard University Press, 1998.
  • Lear, Jonathan. Happiness, Death, and the Remainder of Life. Harvard University Press, 2000.
  • Lear, Jonathan. Freud. Routledge, 2005.
  • Levine, M.P. (ed). The Analytic Freud: Philosophy and Psychoanalysis. London: Routledge, 2000.
  • Levy, D. Freud Among the Philosophers: The Psychoanalytic Unconscious and Its Philosophical Critics. New Haven, CT: Yale University Press, 1996.
  • MacIntyre, A.C. The Unconscious: A Conceptual Analysis. Routledge & Kegan Paul, 1958.
  • Mahony, P.J. Freud’s Dora: A Psychoanalytic, Historical and Textual Study. Yale University Press, 1996.
  • Masson, J. The Assault on Truth: Freud’s Suppression of the Seduction Theory. Faber & Faber, 1984.
  • Neu, J. (ed). The Cambridge Companion to Freud. Cambridge          University Press, 1994.
  • O’Neill, J. (ed). Freud and the Passions. Pennsylvania State University Press, 2004.
  • Popper, K. The Logic of Scientific Discovery. Hutchinson, 1959.
  • Pendergast, M. Victims of Memory. HarperCollins, 1997.
  • Reiser, M. Mind, Brain, Body: Towards a Convergence of Psychoanalysis and Neurobiology. Basic Books, 1984.
  • Ricoeur, P. Freud and Philosophy: An Essay in Interpretation (trans. D. Savage). Yale University Press, 1970.
  • Robinson, P. Freud and His Critics. Berkeley, University of California Press, 1993.
  • Rose, J. On Not Being Able to Sleep: Psychoanalysis and the Modern World. Princeton University Press, 2003.
  • Roth, P. The Superego. Icon Books, 2001.
  • Rudnytsky, P.L. Freud and Oedipus. Columbia University Press, 1987.
  • Said, E.W. Freud and the Non-European. Verso (in association with the Freud Museum, London), 2003.
  • Schafer, R. A New Language for Psychoanalysis. Yale University Press, 1976.
  • Sherwood, M. The Logic of Explanation in Psychoanalysis. Academic Press, 1969.
  • Smith, D.L. Freud’s Philosophy of the Unconscious. Kluwer, 1999.
  • Stewart, W. Psychoanalysis: The First Ten Years, 1888-1898. Macmillan, 1969.
  • Sulloway, F. Freud, Biologist of the Mind. Basic Books, 1979.
  • Thornton, E.M. Freud and Cocaine: The Freudian Fallacy. Blond & Briggs, 1983.
  • Tauber, A.I. Freud, the Reluctant Philosopher. Princeton University Press, 2010.
  • Wallace, E.R. Freud and Anthropology: A History and Reappraisal. International Universities Press, 1983.
  • Wallwork, E. Psychoanalysis and Ethics. Yale University Press, 1991.
  • Whitebrook, J. Perversion and Utopia: A Study in Psychoanalysis and Critical Theory. MIT Press, 1995.
  • Whyte, L.L. The Unconscious Before Freud. Basic Books, 1960.
  • Wollheim, R. Freud. Fontana, 1971.
  • Wollheim, R. (ed.) Freud: A Collection of Critical Essays. Anchor, 1974.
  • Wollheim, R. & Hopkins, J. (eds.) Philosophical Essays on Freud. Cambridge University Press, 1982.

See also the articles on Descartes’ Mind-Body DistinctionHigher-Order Theories of Consciousness and Introspection.

Author Information

Stephen P. Thornton
University of Limerick
Ireland

Elizabeth Cady Stanton (1815—1902)

StantonElizabeth Cady Stanton was one of the most influential public figures in nineteenth-century America. She was one of the nation’s first feminist theorists and certainly one of its most productive activists. She was in the tradition of Abigail Adams, who implored her husband John to “remember the ladies” as he helped form the new American nation. Along with Susan B. Anthony, Stanton fueled the movement for women’s suffrage. She advocated for change in both the public and private lives of women–regarding property rights, equal education, employment opportunity, more liberal divorce provisions, and child custody rights. By addressing such a wide range of women’s issues, she laid down the foundation for the three main branches of feminism that are in existence today: liberal feminism, which focuses on women’s similarities to men and emphasizes equality; cultural feminism, which celebrates women’s differences from men and aims for gender equity; and dominance feminism, which focuses on male power / female submission and aims to overturn all forms of gender hierarchy.

Stanton was motivated by liberal humanist ideals of egalitarianism and individual autonomy, which were an outgrowth of the Enlightenment. She was familiar with the philosophical thinkers whose works and ideas were discussed among American intellectuals at the time: Jean-Jacques Rousseau, Immanuel Kant, Mary Wollstonecraft, Johann Wolfgang Goethe, Alexis de Tocqueville, and later John Stuart Mill and Harriet Taylor Mill. Stanton’s writings and speeches demonstrate this. In addition, her years of studying and bantering with the apprentices in her father’s law office ensured that she was acquainted with the works of English legal theorists Edward Coke, William Blackstone, and Jeremy Bentham, as well as with those of her father’s influential colleagues, Joseph Story and James Kent.

Although she was theoretically minded, Stanton had no interest in living the life of an intellectual, removed from the hubbub of social and political life. In fact, her feminist theory grew out of the real-life problems women faced in her age and was developed to solve them. This places her squarely in the American tradition established by Benjamin Franklin, of developing philosophical thought that can be applied to everyday life. Stanton, then, could be termed an applied philosopher or a philosopher-practitioner. She did not seek out theory for theory’s sake, but instead put theory into practice for the purpose of improving social and political life.

Table of Contents

  1. Early Life and Education
  2. Marriage and Family in an Activist World
  3. Women’s Rights Activism
  4. Writings and Influence
    1. The Woman’s Bible
  5. Conclusion
  6. References and Further Reading
    1. Works by Elizabeth Cady Stanton
    2. Biographical and Historical Works Relating to Elizabeth Cady Stanton
    3. Papers and Articles Relating to Daniel Cady

1. Early Life and Education

Elizabeth Cady, the third surviving child and second of the five daughters of Margaret (formerly Livingston) and Daniel Cady (1773-1859), was born November 12, 1815, in Johnstown, New York. Her mother was from a well-to-do family with ties to the American Revolution. Margaret Livingston’s father had been a colonel in the Continental Army, assisting in the capture of John Andre, one of Benedict Arnold’s co-conspirators. Daniel Cady was a prominent lawyer and politician in the state of New York. He became a member of the New York State Assembly (1808-13), held office as a member of Congress (1815-17), and served on the New York Supreme Court (1847-54).

From a young age, Elizabeth was keenly aware of the gender-based power imbalances that were in place in her day. With bitterness, she later recounted the many times her father responded to her aspirations and achievements by declaiming that she should have been born a boy. After the death of her older brother, the only boy in the family to have reached adulthood, she resolved to do her best to fill the void his death left in her father’s life. She learned to play chess and ride a horse. She studied Greek with the family’s minister, the Rev. Simon Hosack. She entered Johnstown Academy and won prizes and awards. Though still unable to please and impress her father to her satisfaction, these moments clearly motivated her to achieve. In all likelihood, Daniel Cady was not quite as indifferent to his daughter’s achievements, nor as hostile to feminism as her memoir sometimes suggests. His laments that she was not a boy were perhaps more a recognition of the social constraints she would face as a grown woman than an expression of his own need for a son.

When Elizabeth was a child, she overheard a conversation her father had with a woman, a  would-be client for whom the law provided no remedy. She voiced her dismay to her father, and this is the counsel he gave her: “When you are grown up, and able to prepare a speech, you must go down to Albany and talk to the legislators; tell them all you have seen in this office–the sufferings of these Scotchwomen . . . if you can persuade them to pass new laws, the old ones will be a dead letter” (Eighty Years, pp. 31-32). Prior to their conversation, young Elizabeth’s plan had been to destroy all the laws that were unjust to women by cutting them out of his law books! His advice gave her an alternative and foreshadowed the career she would make for herself as a reformer.

Born into a world of wealth and privilege, Elizabeth benefited from a better education than most girls were granted in her day. After attending Johnstown Academy, she entered the Troy Female Seminary. She felt it unjust that she was barred from attending the more academically rigorous Union College, then an all-male institution. While she gained greater understanding of women and feminine culture at Troy, overall her experience there convinced her that male-female co-education is superior to single-sex education. Seeing and visiting with men was such a novelty at Troy that it created an almost unnatural obsession with the other sex. In Elizabeth’s view it also exaggerated any differences between the genders and intensified the deficiencies of each.

Elizabeth did not complete a degree at Troy. In part this was because of the influence of the evangelist Charles Finney, a pivotal regional figure in America’s “Second Great Awakening”. Attendance at Finney’s revival sessions was mandatory at Troy, and several of her classmates were readily converted by his “fire and brimstone” sermons. Yet his preaching left Elizabeth terrified and perplexed. She considered his calls to give her heart to Jesus irrational, if not incomprehensible, and she refused to repent. Even so, she was still disturbed by the images of hell and damnation Finney had planted in her mind. Her parents, like many traditional Protestants, rejected the heightened emotionalism of Finney’s evangelical fervor and allowed her to withdraw from Troy. They treated her to a retreat in Niagara where all talk of religion was forbidden, so that she could settle herself and regain her spiritual bearings. After this exposure to Protestant revivalism, Elizabeth remained a religious skeptic for the rest of her life.

Elizabeth continued to study on her own after her time at Troy Seminary. She read the moral philosophy of George Combe and discussed the novels of Sir Walter Scott, James Fenimore Cooper, and Charles Dickens with her brother-in-law Edward Bayard. She also spent time with her intellectual and reform-minded cousins in nearby Peterboro, New York. These were the children of her mother’s sister Elizabeth (Livingston) and her husband Peter Smith.

In the Smith household, Elizabeth was exposed to a number of new people as well as to new social and political ideas. Her aunt and uncle were egalitarians not only in the ideal, but in the everyday, sense. Their home was open to African Americans on their way to freedom in Canada as well as to Oneida Indians they had befriended. It also teemed with activists and intellectuals who discussed, debated and strategized about the social and political events of the day–chief among them abolition. Her uncle, Peter Smith, was a staunch advocate of racial equality who sought an end to American slavery. Her cousin Gerrit Smith followed closely in his father’s footsteps. Gerrit and his friends in the abolition movement would not only influence Elizabeth, but introduce lifelong challenges as she and other social reformers sought to bring full equality to all people, regardless of color, creed, or gender.

2. Marriage and Family in an Activist World

It was at the home of her cousin, Gerrit Smith, that Elizabeth Cady met Henry Stanton, a man ten years her senior. He was already an extremely prominent and influential abolitionist orator. Beginning his career as a journalist, Stanton met Theodore Weld while attending the Rochester Manual Labor Institute and Weld was touring the country to learn more about manual labor schools. Both were compelling public speakers. Both were committed to social and political reform. And both had been influenced by Charles Finney. In Rochester, Stanton first met Finney when he was serving as replacement pastor at a local church. Like Weld–and in stark contrast to his future wife–Stanton was thoroughly impressed by Finney as an orator and theological thinker. In contrast to Elizabeth Cady’s response, the excesses of Finney’s fire and brimstone approach were of no concern to Stanton. He was simply full of awe and admiration for the man.

Stanton and Weld became lifelong friends, and at Weld’s behest, Stanton began attending the newly-established Lane Theological Seminary in 1832. Lane was based on the manual labor model and initially was a great success. In 1834 however, a student-sponsored debate over slavery raised the ire of the institution’s board of trustees, and a gag order was issued:  Henceforth, no events related to political issues were to be held without prior approval from the board. Nearly half the students at the seminary–Stanton and Weld among them–withdrew from the institution in protest. Stanton then began working alongside Weld, first as an agent of the American Anti-Slavery Society, then as an officer of the organization. Studying law under Daniel Cady after he and Elizabeth married, Henry then became a lawyer and a political operative. He aspired to hold office himself, and succeeded in doing so for a short time in the early 1850s. However, he was mostly known as an orator, a social/political activist, and a journalist, writing for the Anti-Slavery Standard, The Liberator, The New York Tribune, and The New York Sun.

In the 1830s, Stanton was a frequent visitor to the Smith household and a chief contributor to their many discussions about social and political issues. When Elizabeth and Stanton met in 1839, she was under the illusion that he was already married. So her earliest interactions with him were as simply an acquaintance who shared his interest in abolition, not as a potential love interest. Once the two discovered the misunderstanding, they began courting against the wishes of Elizabeth’s father. Daniel Cady was no more fond of abolitionism than he was of social reformers in general, and Stanton’s personal cause with Cady was not helped by his questionable financial viability.

After succumbing to family pressure and breaking her engagement to Stanton, Elizabeth had a change of heart, and the two married hastily in May 1840. They then went to London, where Henry was due to serve as a delegate at the World Antislavery Convention. Well-known within women’s rights history is the fact that none of the female delegates at the meeting were allowed to take a seat on the convention floor, but were segregated behind a screen in the balcony. This enraged Elizabeth as well as other American women present, such as Lucretia Mott, Abby Southwick, and Elizabeth Neal. Significantly, Henry gave a speech in favor of full participation by the women present, but his support stopped there. He did not join William Lloyd Garrison and a handful of other male delegates when they sat in the women’s section as an act of protest against such overt inequality.

Henry Stanton’s moderate support of women’s rights in London signaled an ongoing point of disconnect between Elizabeth and her husband. His passion was for abolition. The suffragists and feminists argued that women needed more social and political freedom than they currently had. For Henry, however, the plight of slaves held in bondage, abused, oppressed, and murdered at their masters’ whim was a far greater concern than women’s liberty to fill out a ballot or to hold office. Elizabeth’s passion was for women’s rights. Certainly American slavery was cruel and unjust, but the system of oppression that permitted it was the same system that allowed men to rule over women with arbitrary and capricious authority. A woman who was married to a kind and egalitarian man was simply lucky. The legal system still maintained the power of all men over their wives, no matter how cruel and unkind they may be.

Biographers have debated whether Henry was truly an advocate of Elizabeth’s quest for women’s rights, merely moderately supportive, or actually antagonistic to both her quest and her stature as a suffragist. Like both Henry Blackwell and Theodore Weld, Stanton accepted Elizabeth’s decision to excise the word “obey” from the vows she spoke at their wedding. The minister performing the ceremony was troubled by this detour from convention, and Elizabeth was convinced that the lengthy prayer he offered after the ceremony–lasting nearly an hour–was payback for this crucial omission from their marriage vows. There is no evidence that the matter troubled her husband. Even so, others in their reform-minded circles went further to advance equality in marriage. Theodore Weld, who wed the feminist and abolitionist Angelina Grimke in 1838, vowed to treat his wife as an equal partner in their marriage. Marrying in 1855, Henry Blackwell went much further, denouncing marriage as an institution that enforced male dominance over women. In addition, Blackwell accepted Lucy Stone’s decision to keep her own surname after marriage, the first woman on record in America to have done so. Other male reformers supported or worked alongside their wives in the suffrage struggle. For example, Henry Blackwell spoke on behalf of women’s suffrage and essentially co-edited The Woman’s Journal with Stone. Clearly, Henry Stanton did not match this level of commitment to his wife’s suffrage work.

Yet the level of support that Henry Stanton gave to Elizabeth’s work as a women’s rights advocate must also be measured by the standard set by her father. Daniel Cady repeatedly lamented the fact that Elizabeth was female because he believed her intellect and forceful personality would go to waste in a woman. Women in the world they lived in were meant to attend to the hearth and home, not to go out into the world to become intellectuals or, worse still, rabble-rousing activists. At the same time, her father was not completely unmoved by seeing Elizabeth act on her convictions. After she read one of her upcoming women’s rights speeches to him, he asked if she–a woman of position, privilege, and relative power–genuinely believed that she was at a disadvantage as a woman. When Elizabeth responded by reminding him of all the laws that privileged men and harmed women, her father turned to his law books to provide her with another example that would help further illustrate her point. While never more than outwardly lukewarm to her feminist efforts, Daniel Cady often provided support in this way–giving her legal ammunition to use in her writings and speeches.

Elizabeth was accustomed to receiving only the dimmest signs of approval from her father. So as an adult, she neither expected nor needed the motivation of resounding applause for her suffrage work from Henry Stanton. She had found her calling as a women’s rights advocate, and though she was a formidable force at the podium, Henry was always able to write to his “Dear Lizzie” when the two were apart. Though not a proponent of women’s rights himself, he does not appear to have been hostile to her leadership role in the women’s movement. It simply seems that, like her father, he found it difficult to fully comprehend Elizabeth’s passion for women’s rights when he saw her as a woman of considerable privilege. Because both Elizabeth and Henry were fairly protective of their private life together, it may be impossible to know for certain how each felt about the other’s reform work. Each seems to have been impressed by the other’s achievements in their chosen reform efforts, though neither was exactly a cheerleader for the other’s cause.

After the London antislavery convention, Elizabeth and Henry toured Europe for six months, returning home in the winter and settling with Elizabeth’s parents. During this period, Henry studied law under Daniel Cady, before taking up a position in Boston in 1843. In Boston Elizabeth met prominent reformers and intellectuals, among them Lydia Maria Child, Frederick Douglass, Ralph Waldo Emerson, Margaret Fuller, Nathaniel Hawthorne, Robert Lowell, Abby Kelly, Elizabeth Palmer Peabody, John Greenleaf Whittier, and Paulina Wright. She regularly listened to the sermons of the radical Unitarian and abolitionist minister, Theodore Parker, and was eager to attend the “conversations” of Amos Bronson Alcott and Margaret Fuller. She also visited the utopian Brook Farm community, admiring its idealism, though not the spartan way of life of its inhabitants.

Elizabeth loved Boston, and the art, culture, and intellectual life it had to offer. The loss of all this made the adjustment to rural life difficult for her when, in 1847, the couple moved to Seneca Falls in upstate New York. But with a house generously provided by Daniel Cady for his daughter’s growing family, there was really no way the couple could refuse. By 1847 they had three children, and there would be more–each named in honor of a beloved family member or friend: David Cady (born 1842), Henry Brewster (1844), Gerrit Smith (1845), Theodore Weld (1851), Margaret Livingston (1852), Harriot Eaton (1856) and Robert Livingston (1859).

3. Women’s Rights Activism

Elizabeth Cady Stanton became acquainted with women’s rights activists for the first time at the antislavery convention in London. Women’s abilities, achievements, and rights had been of concern to her since her youth, when she bantered with boys at Johnstown Academy and with the young men apprenticing at her father’s law office. Yet it was her experience as a housewife in Seneca Falls that prompted her to take action on behalf of women’s rights.

In her earliest years as a wife and mother, Cady Stanton found fulfillment in managing a household. In fact, she thrived on the day-to-day challenge to do so with order and efficiency. After a time, the novelty had worn off, and she found housework mundane and depressing. She also found herself sympathizing with everyday women who did not have the same access to power and privilege that she had. Assisting victims of domestic abuse in the area on several occasions, Cady Stanton saw how the same unjust laws that she had intuitively resented and wanted to change as a child were especially burdensome to women without means. Just at this point in her life, an invitation for a visit came from Lucretia Mott, who was only eight miles away in Waterloo. Cady Stanton eagerly took the trip to meet with Mott and other reformers in the community, and the idea was born to hold a convention to discuss women’s rights. In one afternoon, the group planned and announced the two-day meeting, the first of its kind. It was to be held only five days later. The event was a success that far exceeded the expectations of Cady Stanton and her convention co-planners. While a group of about fifty devoted social reformers from nearby Rochester and Syracuse were expected to participate, over two hundred people attended. Nearly seventy signed the Declaration of Sentiments, which Cady Stanton had authored, modeling it after the American Declaration of Independence.

The Seneca Falls Women’s Rights Convention was followed a month later by another such meeting in Rochester, thus setting in motion a tradition that was to shape the nineteenth century women’s movement. Conventions to discuss women’s rights were held annually between 1850 and 1860, with Elizabeth Cady Stanton and her good friend and colleague, Susan B. Anthony, playing complementary roles. Anthony was the strategist, tactician, and all-round logistics coordinator. Cady Stanton was the philosophical thinker, writer, and theoretician. Anthony sometimes chastised Cady Stanton for letting family obligations, namely childrearing, get in the way of her women’s rights work. At the same time, however, she was known to Elizabeth’s children as “Aunt Susan”.  After they had turned seven they often went on extended visits to Anthony’s home in Rochester, a landmark in their lives later described with awe and wonder.

A network of countless brilliant and talented women worked to advance women’s rights in nineteenth-century America, among them Amelia Bloomer, Olympia Brown, Paulina Wright Davis, Abby Foster, Matilda Joslyn Gage, Frances Watkins Harper, Isabella Beecher Hooker, Julia Ward Howe, Mary Ashton Rice Livermore, Lucreita Mott, Ernestine Rose, Caroline Severance, Anna Howard Shaw, Lucy Stone, Sojourner Truth, Frances Willard and Victoria Woodhull. The lifelong friendship of two of them, Elizabeth Cady Stanton and Susan B. Anthony, makes for a unique and compelling story. On the surface, the two could not have been more different. Cady Stanton was born of privilege, had a forceful and sometimes challenging personality, was fond of luxury, was a religious skeptic, and refused to believe that women had to choose between motherhood and public activism. Anthony was not born into wealth, had a quiet and calm demeanor, eschewed self-indulgence, was a devout Quaker who later converted to Unitarianism, and felt strongly that domesticity seriously compromised women’s participation in public life. But for some reason, the contrasts between Cady Stanton and Anthony served to complement, rather than to compete with, each other. Anthony had a room in Cady Stanton’s home where she was welcome to stay at any time. When Cady Stanton was unable to attend a convention, Anthony would often read the speech Elizabeth had written. Writing with eloquence, Cady Stanton could pen an essay or speech with ease, an ability that Anthony greatly admired.

While the relationship between Stanton and Anthony remained stable, the movement they were part of was not always placid. During the Civil War, suffrage leaders agreed to focus on supporting the war effort, forming the Women’s Loyal League. Once the war had ended and slavery had been abolished, Stanton and Anthony joined Frederick Douglass and others to form the American Equal Suffrage Association. This organization was devoted to securing voting rights for newly-freed African Americans and for all women simultaneously. However, when the fifteenth amendment to the Constitution was being ratified without the word “sex” included in the text, the branch of the women’s suffrage movement that was led by Stanton and Anthony was outraged. The text read: “The right of citizens of the United States to vote shall not be denied or abridged by the United States or by any state on account of race, color, or previous condition of servitude”. It was inconceivable to Stanton and her colleagues that their male advocates had failed to bring women along in the struggle for voting rights. In response Stanton and her white female colleagues made arguments on behalf of women that today smack of elitism, if not outright racism. How, they asked, could persons who were uneducated and lacking autonomy while held for years in bondage be given the right to vote when well-educated and “cultured” white women continued to be treated like children at best, chattel at worst? In their anger, they allied themselves with race-baiters in the months prior to the amendment’s passage. Stanton, Anthony, and others felt strongly that any change to the constitution involving voting rights simply must be universal–it must include African American males and females, as well as white women and others who did not yet hold the franchise. Yet clearly the pair of activists were willing to turn a blind eye to the ways in which their arguments fueled the fires of race-hate across the nation.

The race issue contributed to a schism in the women’s movement that would last for two decades. Some women, led by Lucy Stone and joined by Julia Ward Howe and Caroline Severance, formed a new organization based in Boston, the American Woman’s Suffrage Association. The AWSA ultimately endorsed the amendment giving only African American males the vote, believing that their good will and co-operative spirit would be rewarded in time. Unfortunately the NWSA and the AWSA had to struggle to distinguish themselves from each other to the rank and file of women’s rights supporters. They printed rival publications to promote women’s rights from the perspective of the leaders of each organization. Stanton and Anthony published The Revolution (1868-1872) and Stone The Woman’s Journal (1870-1917). They also competed for members and political support. It was not until 1890 that the divided movement reunited and was renamed the National American Suffrage Association, continuing the voting rights struggle for another thirty years.

4. Writings and Influence

As noted, Elizabeth Cady Stanton was an eloquent and prolific writer. While she served as the philosopher of the suffrage movement, Susan B. Anthony served as its strategist. Historians have noted that their respective strengths complemented each other. Equally significant is the different approaches they took to securing rights for women. Anthony was single-minded in her quest for the vote as the stepping stone that would provide women access to all other rights. If only women could vote and hold public office, they would then be able to self-advocate: Women could vote for candidates with policies that empower and support women and their families. They could press for changes to laws related to marriage, divorce, and custody for children. Women could help enact any number of provisions that would give them more power and influence in society. If only they had the vote.

Stanton also believed in the power of voting rights, but she also saw no reason not to carry on the battle for women’s equality on all fronts at once. Women had needed property rights when legislatures began to pass them in the 1830s and ‘40s. They continued to need and deserve other rights related to property and financial self-determination in the 1860s, ‘70s, and ‘80s, and Stanton spoke out about how and why these rights should be granted. Similarly, laws dealing with the most intimate aspects of women’s lives–marriage, divorce, and child custody–merited attention in Stanton’s “here and now,” not at some future time when women would hold the power of the franchise.

Stanton’s published works addressed four main areas of feminist concern:  personal/social freedom, marriage/family matters, legal/political rights and religion’s role and influence. Chief among the social constraints that restricted women’s freedom in Stanton’s day was access to education. She began discussing this subject early in her work as a reformer. Prohibitions against rigorous academic training for girls and women thwarted their intellectual growth and thus the levels of personal and social development they could achieve. The tradition of single-sex education further exacerbated this problem. The respective weaknesses of men and women (which Stanton believed were not natural to each gender but nurtured by social norms and values) were reinforced when they were deprived of interaction. This perpetuated the imbalance of power based on gender. The inferior education women received when Stanton began her work also had the potential to contribute to women’s moral inferiority. Without properly exercising their intellectual powers and being challenged to make difficult academic and moral distinctions, women were unable to function as independent decision-makers. This harmed not only women as individuals, but also the social institutions they are a part of: The family, the local community, and the state. In this sense, Stanton laid the foundation for what would later be called liberal feminism, a school of feminist thought which maintains that women are more similar to men than they are different from them. Thus, it aims for equal treatment of men and women, particularly in matters related to education, employment, pay equity and political participation.

Stanton was among the more bold women’s rights advocates in her era in that she took on issues relating to marriage that other suffragists considered off-limits. She and Anthony both referred to the institution of marriage as it existed in their time as “male marriage”, that is, a civil union made for and by men, which was used to perpetuate male power and authority. Among the most controversial was divorce. She was under none of the popular illusions that marriage was a blessed institution that, fairytale-like, brought out the best in people. She resented the suggestion that a virtuous and patient woman could persuade–through her love, faith or virtue–a domineering, alcoholic or abusive man to become a more kind and considerate husband. Women were only and always harmed by their relationships with such men in Stanton’s view. Their own moral character was compromised, as was the overall moral tone of their home and family. Speaking in favor of pending legislation in New York that would liberalize divorce policies, Stanton said that rather than prevent a woman from leaving an abusive and alcoholic husband, the law should prohibit such men from getting married. Such a policy would go much farther toward protecting the institution of marriage than the laws that prevailed in the day were able to do. Here we see in Stanton’s thought a kernel of what would later become dominance feminism. This is a branch of feminist thought that focuses on the differences between men and women. It concludes that, whether natural or socially constructed, gender distinctions are used to reinforce male dominance and female submission. Like Stanton, today’s dominance feminists are concerned with marriage and divorce. They also venture into territory that Stanton and her contemporaries only dared to hint at in the age of Victorian propriety: Domestic violence, rape, incest, pornography, and prostitution. The aim of dominance feminism is to overturn the male power structure that makes these abuses of women possible.

Stanton’s discussion of the legal and political rights of women is the best known within popular discussions of women’s history today. Women’s right to participate fully in public, political life was certainly of paramount importance to her. Over and over in her lectures and essays, Stanton emphasized the ways in which men’s dominance in public life disabled and disgraced women. With men as the chief arbiters of legal and political right, women were rendered “civilly dead” as Stanton so eloquently termed it at the Seneca Falls convention. Women were powerless to stop their husbands if they chose to squander the family’s resources, even if the source of those resources was the savings or inheritance that women themselves had brought into their households. Men made all the laws regarding education, employment, marriage, divorce, child custody, property, inheritance, breaches of the civil and the criminal codes–every aspect of life that could and did affect women’s lives, but over which women were powerless to effect change. Men thus became like monarchs ruling over all classes of society, who could readily wield tyrannical force, if they willed to do so. All the other social inequalities that concerned Stanton trickled down from this one arena–that of legal and political rights.

Significantly, when Stanton spoke in favor of universal suffrage–that is, of extending voting rights to not only all African American males but also to all women–after the Civil War, she cautioned against maintaining distinctions among the various classes of people in society. The entire class of African Americans held in slavery had been prohibited from voting since the founding of the country. As legislators considered extending the franchise, Stanton implored them to erase all similar social distinctions. Women should no longer be treated as a separate class of individuals who are prohibited from voting any more than newly freed African Americans should. On American soil, Stanton said, all citizens were to be granted equal consideration in this way.

This stance, too, created some friction for Stanton as the post-Civil War discourse on voting rights got underway. She and Anthony were absolutely unyielding in their call for women’s full political equality. While other suffragists, like Lucy Stone, were willing to consider partial suffrage, which would allow women to vote on local issues of concern to them, like education or municipal budgets, Stanton and Anthony held firm: Women are equal in all ways to men and should be treated as such. At times they displayed their own class biases on this point. Why should an ignorant and uneducated man of any race be granted the right to vote in all matters when well-educated and cultured women were allowed to vote only about matters like school policies and local road construction? Most critically, as the post-War discussion advanced and feminists were being urged to accept this as “the Negro’s hour” (as Frederick Douglass so famously put it), Lucy Stone and others were willing to surrender women’s voting rights for the time being to appease their allies in Congress. Stone and others assumed the discussion of women’s voting rights would be resumed in a timely fashion, but they believed that, for the peace of the nation, they needed to step aside temporarily. Stanton and Anthony saw the writing on the wall, as such, and held to insisting on female suffrage until the very last legislators had cast their vote. Ultimately, the controversy over universal suffrage resulted in the schism between Stone and her camp and the Stanton-Anthony coalition discussed above.

a. The Woman’s Bible

 

Stanton had always been critical of the ways in which the Christian churches contributed to women’s oppression, but she addressed this topic head-on in the last decade of her life. In publishing The Woman’s Bible, Stanton was far ahead of the feminist curve. Biblical criticism of any sort was relatively new, having been initiated in mid-nineteenth-century Germany by thinkers like Johann Gottfried Eichhorn (1752–1827), Wilhelm Martin Leberecht de Wette (1780–1849) and Julius Wellhausen (1844–1918). The idea of a largely secular examination of the Bible that investigated its implications, flaws and shortcomings as related to women was virtually unthought of.

The Woman’s Bible consists of a collection of essays by a committee of women intellectuals on passages of the Judeo-Christian scriptures that discuss women. Stanton and her colleagues took a critical approach to the story of Eve in the Garden of Eden, for instance. First of all, the story is deemed an allegory, not a factual account. Darwin’s theory of evolution provides us with a more plausible account of the development of human beings over time, not in one act of creation. Secondly, Eve is praised, rather than blamed, for taking the fruit, because this act demonstrates her thirst for knowledge. As one contributor stated it, “Fearless of death if she can gain wisdom [she] takes of the fruit” (The Woman’s Bible, p. 26). The work takes a similar realist approach to the story of Jesus’ life. Cady Stanton and her contributors readily write about him as a man, not as God nor even as God’s Son. Stanton’s view on the miracles he performed is that human development is such that any human being could perform miracles, if only they have the will to make themselves as pure and good as Jesus was.

Other passages, such as those that recount the marriages of the patriarchs, are discussed so as to reify ideals of marital love. In both the Old and New Testament passages, Stanton and her contributors highlight and heighten the role of the women in question. They put particular emphasis on Miriam’s role in the quest for Jewish freedom, for instance. They also discuss the important work of Deborah the judge, as well as of female disciples in the early Christian church, noting that they were committed to their cause and were conveyors of God’s word. In this sense, Stanton sowed the seed of cultural feminism–the ideal that women bring a special perspective and set of values to their participation in society. Therefore they bring a special set of values that needs to be recognized, understood, and appreciated.

The overall objective of The Woman’s Bible was to use textual analysis and historical criticism to dismantle the traditional interpretation of any one biblical passage and replace it with a feminine, if not a feminist, perspective. The effort was a success on this level. Though it went into additional printings, The Woman’s Bible brought Elizabeth Cady Stanton a great deal of criticism, more for her audacity in taking on the project than in what she actually said about the Bible or its merits.

Although Stanton’s efforts in The Woman’s Bible did not match the increasingly rigorous standards of her contemporaries in theology who were beginning their own critical examinations of the Jewish and Christian scriptures, neither did it fail as an intellectual exercise. This was simply a groundbreaking work, which called into question a number of widely accepted claims about the nature of God, God’s esteem for and relation to women, and women’s place within faith communities. It also paved the way for future work for and by women in religion in the twentieth century. Second wave feminists in the 1960s and 1970s, who were struggling for the full ordination of women in the Christian and Jewish traditions, relied heavily on this early work of early feminist criticism by Stanton. Academic women in the same era were inspired by her example and produced more modern and academically rigorous works that scrutinized sacred texts and religious traditions from a feminist perspective as well. Despite her own religious skepticism, Stanton would have been heartened to have seen a future in which over half of the students in mainstream Protestant seminaries are women, and the ordination of women is commonplace in liberal Protestant and Jewish traditions.

5. Conclusion

Certainly Elizabeth Cady Stanton had an immense amount of influence in her day; her influence and her legacy continue even if she is not always an overtly recognized source by today’s feminists. At the heart of her ideals and advocacy lies a blurring of the distinction between the “public” world of law and politics, and the “private” world of home and family–a distinction that held sway through so much of the eighteenth and nineteenth centuries. Feminists in the 1970s and ‘80s would craft a mantra of sorts that got to the heart of this distinction-blurring: “The personal is political”. Cady Stanton’s work is truly the source of such thinking. Stanton wanted the “public” realm, particularly law and politics, to be imported into the home. She wanted to bring an end to the abuse and neglect men were allowed to impose on women because the law turned a blind eye to domestic violence as a “private” matter. Other women activists of the day, such as Frances Willard the temperance activist, wanted to see women’s “private” virtues exported into the public realm: Give us access to the vote, and we will clean up the crime and corruption of the “public” political realm. Susan B. Anthony’s position represented a middle ground. She rejected the public/private distinction as much as Cady Stanton did, but she did so from a different angle and for different reasons. Access to the vote and the ability to hold public office would allow women to speak for themselves and act on their own behalf. If they are dissatisfied with the laws governing marriage and divorce, then give them voting rights and let them change such laws.

Fond of luxury and susceptible to self-indulgence all her life, Elizabeth Cady Stanton became obese late in life and suffered from maladies that were related to her overall poor health:  Fading eyesight, decreased mobility and chronic fatigue. Even so she remained active and engaged in life, and optimistic that women would indeed succeed in winning the vote in the twentieth century. At the very least she could rest knowing that she had passed on the legacy of the suffrage struggle to her daughter’s generation of women’s rights activists. Stanton’s daughter, Harriot Stanton Blatch, was a feminist activist in her own right who helped compile the six-volume History of Woman’s Suffrage, which Stanton, Anthony, Matilda Joslyn Gage and others had begun in 1881. While her daughter was able to vote for the last twenty years of her life, Elizabeth Cady Stanton was never able to register a ballot. She died October 26, 1902, in New York City, just shy of eighteen years before the passage of the nineteenth amendment.

6. References and Further Reading

a. Works by Elizabeth Cady Stanton

  • “Address of Mrs. Elizabeth Cady Stanton”, delivered at Seneca Falls and Rochester (July and August, 1848).
  • “It is so Unladylike” (1853).
  • “I Have all the Rights I Want” (1858).
  • “The Slave’s Appeal” (1860).
  • “Appeal to the Women of New York” (1860).
  • “Address of Elizabeth Cady Stanton on the Divorce Bill”, before the Judiciary Committee of the New York Senate (February 1861).
  • “Free Speech” (1861).
  • “Address in Favor of Universal Suffrage” before the Judiciary Committees of the Legislature of New York (January 1867).
  • “Marriage/Divorce” (1871).
    • A memorial for Victoria Woodhull, so later than 1871 in actuality.
  • “Woman Suffrage,” delivered to the Judiciary Committee of the House of Representatives (1890).
  • “Patriotism and chastity” (1891).
  • “Solitude of Self,” delivered to the Judiciary Committee of the United States Congress (January 1892).
  • “Suffrage, a Natural Right” (1894).
  • The Woman’s Bible (1895). New York: European Publishing Company.
  • “Bible and Church Degrade Woman” (1898).
  • Eighty Years and More (1898). New York: European Publishing Company.
  • Elizabeth Cady Stanton Papers, Rutgers University Archives.

b. Biographical and Historical Works Relating to Elizabeth Cady Stanton

  • Baker, Jean (2005). Sisters: The Lives of America’s Suffragists. New York: Hill and Wang.
  • Davis, Sue (2009). The Political Thought of Elizabeth Cady Stanton. New York: New York University Press.
  • DuBois, Ellen Carol (1981). Elizabeth Cady Stanton, Susan B. Anthony: Correspondence, Writings, Speeches. Boston: Northeastern University Press.
  • Gage, Matilda Joslyn, and others. A History of Woman Suffrage (in six volumes, 1881-1922). New York: C. Mann.
  • Ginzberg, Lori D. (2009). Elizabeth Cady Stanton: An American Life. New York: Hill and Wang.
  • Griffith, Elisabeth (1984). In Her Own Right: The Life of Elizabeth Cady Stanton. New York: Oxford University Press.
  • Matthews, Jean V. (1997). Women’s Struggle for Equality: The First Phase, 1828-1876. Lanham, MD: Ivan R. Dee Pub.
  • Stanton, Henry B. (1885). Random Recollections. New York: Harper & Bros.
  • Stanton, Theodore and Harriot Stanton Blatch (1922). Elizabeth Cady Stanton. New York: Harper & Bros.

c. Papers and Articles Relating to Daniel Cady

  • Cady, Daniel. Letters and Papers at Syracuse University Library.
    • Also includes some of Elizabeth Cady Stanton’s letters to her relatives.
  • “Retirement of Judge Daniel Cady,” New York Times (January 25, 1855).
  • “Daniel Cady”, in Biographical Directory of the United States Congress. Washington, DC: United States Government Printing Office.
  • Raymond, William (1851). “Daniel Cady,” in Biographical Sketches of Distinguished Men of Columbia County (New York). Albany, NY: Weed, Parsons, & Co.

Author Information

Dorothy Rogers
Email: rogersd@mail.montclair.edu
Montclair State University
U.S.A.

Relational Models Theory

Relational Models Theory is a theory in cognitive anthropology positing a biologically innate set of elementary mental models and a generative computational system operating upon those models.  The computational system produces compound models, using the elementary models as a kind of lexicon.  The resulting set of models is used in understanding, motivating, and evaluating social relationships and social structures.  The elementary models are intuitively quite simple and commonsensical.  They are as follows: Communal Sharing (having something in common), Authority Ranking (arrangement into a hierarchy), Equality Matching (striving to maintain egalitarian relationships), and Market Pricing (use of ratios).  Even though Relational Models Theory is classified as anthropology, it bears on several philosophical questions.

It contributes to value theory by describing a mental faculty which plays a crucial role in generating a plurality of values.  It thus shows how a single human nature can result in conflicting systems of value.  The theory also contributes to philosophy of cognition.  The complex models evidently result from a computational operation, thus supporting the view that a part of the mind functions computationally.  The theory contributes  to metaphysics.  Formal properties posited by the theory are perhaps best understood abstractly, raising the possibility that these mental models correspond to abstract objects.  If so, then Relational Models Theory reveals a Platonist ontology.

Table of Contents

  1. The Theory
    1. The Elementary Models
    2. Resemblance to Classic Measurement Scales
    3. Self-Organization and Natural Selection
    4. Compound Models
    5. Mods and Preos
  2. Philosophical Implications
    1. Moral Psychology
    2. Computational Conceptions of Cognition
    3. Platonism
  3. References
    1. Specifically Addressing Relational Models Theory
    2. Related Issues

1. The Theory

a. The Elementary Models

The anthropologist Alan Page Fiske pioneered Relational Models Theory (RMT).  RMT was originally conceived as a synthesis of certain constructs concerning norms formulated by Max Weber, Jean Piaget, and Paul Ricoeur.  Fiske then explored the theory among the Moose people of Burkina Faso in Africa.  He soon realized that its application was far more general, giving special insight into human nature.  According to RMT, humans are naturally social, using the relational models to structure and understand social interactions, the application of these models seen as intrinsically valuable. All relational models, no matter how complex, are, according to RMT, analyzable by four elementary models: Communal Sharing, Authority Ranking, Equality Matching, Market Pricing.

Any relationship informed by Communal Sharing presupposes a bounded group, the members of which are not differentiated from each other.  Distinguishing individual identities are socially irrelevant.  Generosity within a Communal Sharing group is not usually conceived of as altruism due to this shared identity, even though there is typically much behavior which otherwise would seem like extreme altruism.  Members of a Communal Sharing relationship typically feel that they share something in common, such as blood, deep attraction, national identity, a history of suffering, or the joy of food.  Examples include nationalism, racism, intense romantic love, indiscriminately killing any member of an enemy group in retaliation for the death of someone in one’s own group, sharing a meal.

An Authority Ranking relationship is a hierarchy in which individuals or groups are placed in relative higher or  lower relations .  Those ranked higher have prestige and privilege not enjoyed by those who are lower.  Further, the higher typically have some control over the actions of those who are lower.  However, the higher also have duties of protection and pastoral care for those beneath them.  Metaphors of spatial relation, temporal relation, and magnitude are typically used to distinguish people of different rank. For example, a King having a larger audience room than a Prince, or a King arriving after a Prince for a royal banquet.  Further examples include military rankings, the authority of parents over their children especially in more traditional societies, caste systems, and God’s authority over humankind.  Brute coercive manipulation is not considered to be Authority Ranking; it is more properly categorized as the Null Relation in which people treat each other in non-social ways.

In Equality Matching, one attempts to achieve and sustain an even balance and one-to-one correspondence between individuals or groups.  When there is not a perfect balance, people try to keep track of the degree of imbalance in order to calculate how much correction is needed.  “Equality matching is like using a pan balance: People know how to assemble actions on one side to equal any given weight on the other side” (Fiske 1992, 691).  If you and I are out of balance, we know what would restore equality.  Examples include the principle of one-person/one-vote, rotating credit associations, equal starting points in a race, taking turns offering dinner invitations, and giving an equal number of minutes to each candidate to deliver an on-air speech.

Market Pricing is the application of ratios to social interaction.  This can involve maximization or minimization as in trying to maximize profit or minimize loss.  But it can also involve arriving at an intuitively fair proportion, as in a judge deciding on a punishment proportional to a crime.  In Market Pricing, all socially relevant properties of a relationship are reduced to a single measure of value, such as money or pleasure.  Most utilitarian principles involve maximization.  An exception would be Negative Utilitarianism whose principle is the minimization of suffering.  But all utilitarian principles are applications of Market Pricing, since the maximum and the minimum are both proportions.  Other examples include rents, taxes, cost-benefit analyses including military estimates of kill ratios and proportions of fighter planes potentially lost, tithing, and prostitution.

RMT has been extensively corroborated by controlled studies based on research using a great variety of methods investigating diverse phenomena, including cross-cultural studies (Haslam 2004b).  The research shows that the elementary models play an important role in cognition including perception of other persons.

b. Resemblance to Classic Measurement Scales

It may be jarring to learn that intense romantic love and racism are both categorized as Communal Sharing or that tithing and prostitution are both instances of Market Pricing.  These examples illustrate that a relational model is, at its core, a meaningless formal structure.  Implementation in interpersonal relations and attendant emotional associations enter in on a different level of mental processing.  Each model can be individuated in purely formal terms, each elementary model strongly resembling one of the classic scale types familiar from measurement theory.  (Strictly speaking, it is each mod which can be individuated in purely formal terms.  This finer point will be discussed in the next section.)

Communal Sharing resembles a nominal (categorical) scale.  A nominal scale is simply classifying things into categories.  A questionnaire may be designed to categorize people as theist, atheist, agnostic, and other.  Such a questionnaire is measuring religious belief by using a nominal scale.  The groups into which Communal Sharing sorts people is similar.  One either belongs to a pertinent group or one does not, there being no degree or any shades of gray.  Another illustration of nominal scaling is the pass/fail system of grading.  Authority Ranking resembles an ordinal scale in which items are ranked.  The ranking of students according to their performance is one example.  The ordered classification of shirts in a store as small, medium, large, and extra large is another.  Equality Matching resembles an interval scale.  On interval scales , any unit measures the same magnitude on any point in the scale.  For example, on the Celsius scale the difference between 1 degree and 2 degrees is the same as the difference between 5 degrees and 6 degrees.  Equality Matching resembles an interval scale insofar as one can measure the degree of inequality in a social relationship using equal intervals so as to judge how to correct the imbalance.  It is by use of such a scale that people in an Equality Matching interaction can specify how much one person owes another.  However, an interval scale cannot be used to express a ratio because it has no absolute zero point.  For example, the zero point on the Celsius scale is not absolute so one cannot say that 20 degrees is twice as warm as 10 degrees while on a Kelvin scale because the zero point is absolute one can express ratios.  Given that Market Pricing is the application of ratios to social interactions, it resembles a ratio scale such as the Kelvin scale.  One cannot, for example, meaningfully speak of the maximization of utility without presupposing some sort of ratio scale for measuring utility.  Maximization would correspond to 100 percent.

c. Self-Organization and Natural Selection

The four measurement scales correspond to different levels of semantic richness and precision.  The nominal scale conveys little information, being very coarse grained.  For example, pass/fail grading conveys less information than ranking students.  Giving letter grades is even more precise and semantically rich, conveying how much one student out-performs another.  This is the use of an interval scale.  The most informative and semantically rich is a percentage grade which illustrates the ratio by which one student out-performs another, hence a ratio scale.  For example, if graded accurately a student scoring 90 percent has done twice as well as a student scoring 45 percent.  Counterexamples may be apparent: two students could be ranked differently while receiving the same letter grade by using a deliberately coarse-grained letter grading system so as to minimize low grades.  To take an extreme case, a very generous instructor might award an A to every student (after all, no student was completely lost in class) while at the same time mentally ranking the students in terms of their performance.  Split grades are sometimes used to smooth out the traditional coarse-grained letter grading system .  But, if both scales are as sensitive as possible and based on the same data, the interval scale will convey more information than the ordinal scale.  The ordinal ranking will be derivable from the interval grading, but not vice versa.  This is more obvious in the case of temperature measurement, in which grade inflation is not an issue.  Simply ranking objects in terms of warmer/colder conveys less information than does Celsius measurement.

One scale is more informative than another because it is less symmetrical; greater asymmetry means that more information is conveyed.  On a measurement scale, a permutation which distorts or changes information is an asymmetry.  Analogously, a permutation in a social-relational arrangement which distorts or changes social relations is an asymmetry.  In either case, a permutation which does not carry with it such a distortion or change is symmetric.  The nominal scale type is the most symmetrical scale type, just as Communal Sharing is the most symmetrical elementary model.  In either case, the only asymmetrical permutation is one which moves an item out of a category, for example, expelling someone from the social group.  Any permutation within the category or group makes no difference; no difference to the information conveyed, no difference to the social relation.  In the case of pass/fail grading, the student’s performance could be markedly different from what it actually was.  So long as the student does well enough to pass (or poorly enough to fail), this would not have changed the grade.  Thanks to this high degree of symmetry, the nominal scale conveys relatively little information.

The ordinal scale is less symmetrical.  Any permutation that changes rankings is asymmetrical, since it distorts or changes something significant.  But items arranged could change in many respects relative to each other while their ordering remains unaffected, so a high level of symmetry remains.  Students could vary in their performance, but so long as their relative ranking remains the same, this would make no difference to grades based on an ordinal scale.

An interval scale is even less symmetrical and hence more informative, as seen in the fact that a system of letter grades conveys more information than does a mere ranking of students.  An interval scale conveys the relative degrees of difference between items.  If one student improves from doing C level work to B level work, this would register on an interval scale but would remain invisible on an ordinal scale if the change did not affect student ranking.  Analogously, in Equality Matching, if one person, and one person only, were to receive an extra five minutes to deliver their campaign speech, this would be socially significant.  By contrast, in Authority Ranking, the addition of an extra five minutes to the time taken by a Prince to deliver a speech would make no socially significant difference provided that the relative ranking remains undisturbed (for example, the King still being allotted more time than the Prince, and the Duke less than the Prince).

In Market Pricing, as in any ratio scale, the asymmetry is even greater.  Adding five years to the punishment of every convict could badly skew what should be proportionate punishments.  But giving an extra five minutes to each candidate would preserve balance in Equality Matching.

The symmetries of all the scale types have an interesting formal property.  They form a descending symmetry subgroup chain.  In other words, the symmetries of a ratio scale form a subset of the symmetries of a relevant interval scale, the symmetries of that scale form a subset of the symmetries of a relevant ordinal scale, and the symmetries of that scale form a subset of the symmetries of a relevant nominal scale.  More specifically, the scale types form a containment hierarchy.  Analogously, the symmetries of Market Pricing form a subset of the symmetries of Equality Matching which form a subset of the symmetries of Authority Ranking which form a subset of the symmetries of Communal Sharing.  Descending subgroup chains are common in nature, including inorganic nature.  The symmetries of solid matter form a subset of the symmetries of liquid matter which form a subset of the symmetries of gaseous matter which form a subset of the symmetries of plasma.

This raises interesting questions about the origins of these patterns in the mind: could they result from spontaneous symmetry breakings in brain activity rather than being genetically encoded?  Darwinian adaptations are genetically encoded, whereas spontaneous symmetry breaking is ubiquitous in nature rather than being limited to genetically constrained structures.  The appeal to spontaneous symmetry breaking suggests a non-Darwinian approach to understanding how the elementary models could be “innate” (in the sense of being neither learned nor arrived at through reason).  That is, are the elementary relational models results of self-organization rather than learning or natural selection?  If they are programmed into the genome, why would this programming imitate a pattern in nature which usually occurs without genetic encoding?  The spiral shape of a galaxy, for example, is due to spontaneous symmetry breaking, as is the transition from liquid to solid.  But these transitions are not encoded in genes, of course.  Being part of the natural world, why should the elementary models be understood any differently?

d. Compound Models

While all relational models are analyzable into four fundamental models, the number of models as such is potentially infinite.  This is because social-relational cognition is productive; any instance of a model can serve as a constituent in an even more complex instance of a model.  Consider Authority Ranking and Market Pricing; an instance of one can be embedded in or subordinated to an instance of the other.  When a judge decides on a punishment that is proportionate to the crime, the judge is using a ratio scale and hence Market Pricing.  But the judge is only authorized to do this because of her authority, hence Authority Ranking.  We have here a case of Market Pricing embedded in a superordinate (as opposed to subordinate) structure of Authority Ranking resulting in a compound model.  Now consider ordering food from a waiter.  The superordinate relationship is now Market Pricing, since one is paying for the waiter’s service.  But the service itself is Authority Ranking with the customer as the superior party.  In this case, an instance of Authority Ranking is subordinate to an instance of Market Pricing.  This is also a compound model with the same constituents but differently arranged.  The democratic election of a leader is Authority Ranking subordinated to Equality Matching.  An elementary school teacher’s supervising children to make sure they take turns is Equality Matching subordinated to Authority Ranking.

A model can also be embedded in a model of the same type.  In some complex egalitarian social arrangements, one instance of Equality Matching can be embedded in another.  Anton Pannekoek’s proposed Council Communism is one such example.  The buying and selling of options is the buying and selling of the right to buy and sell, hence recursively embedded Market Pricing.  Moose society is largely structured by a complex model involving multiple levels of Communal Sharing.  A family among the Moose is largely structured by Communal Sharing, as is the village which embeds it, as is the larger community that embeds the village, and so on.  In principle, there is no upper limit on the number of embeddings in a compound model.  Hence, the number of potential relational models is infinite.

e. Mods and Preos

A model, whether elementary or compound, is devoid of meaning when considered in isolation.  As purely abstract structures, models are sometimes known as “mods” , which is an abbreviation of, “cognitively modular but modifiable modes of interacting” (Fiske 2004, 3).  (This may be a misnomer, since, as purely formal structures devoid of semantic content, mods are not modes of social interaction any more than syntax.   is a communication system.)  In order to externalize models, that is, in order to use them to interpret or motivate or structure interactions, one needs “preos,” these being “socially transmitted prototypes, precedents, and principles that complete the mods, specifying how, when and with respect to whom the mods apply” (2004, 4).  Strictly speaking, a relational model is the union of a mod with a preo.  A mod has the formal properties of symmetry, asymmetry, and in some cases embeddedness.  But a mod requires a preo in order to have the properties intuitively identifiable as meaningful, such as social application, emotional resonance, and motivating force.

The notion of a preo updates and includes the notion of an implementation rule, from an earlier stage of relational-models theorizing.  Fiske has identified five kinds of implementation rules (1991, 142).  One kind specifies the domain to which a model applies.  For example, in some cultures Authority Ranking is used to structure and give meaning to marriage.  In other cultures, Authority Ranking does not structure marriage and may even be viewed as immoral in that context.  Another sort of implementation rule specifies the individuals or groups which are to be related by the model.  Communal Sharing, for example, can be applied to different groups of people.  Experience, and sometimes also agreement, decides who is in the Communal Sharing group.  In implementing Authority Ranking, it is not enough to specify how many ranks there are.  One must also specify who belongs to which rank.  A third sort of implementation rule defines values and categories.  In Equality Matching, each participant must give or receive the same thing.  But what counts as the same thing?  In Authority Ranking, a higher-up deserves honor from a lower-down, but what counts as honor and what constitutes showing honor?  There are no a priori or innate answers to these questions; culture and mutual agreement help settle such matters.  Consider the principle of one-person/one-vote, an example of Equality Matching.  Currently in the United States and Great Britain, whoever gets the most votes wins the election.  But it is also possible to have a system in which a two-thirds majority is necessary for there to be a winner.  Without a two-thirds majority, there may be a coalition government, a second election with the lowest performing candidates eliminated, or some other arrangement.  These are different ways of determining what counts as treating each citizen as having an equal say.  A fourth determines the code used to indicate the existence and quality of the relationship.  Authority Ranking is coded differently in different cultures, as it can be represented by the size of one’s office, the height of one’s throne, the number of bars on one’s sleeve, and so forth.  A fifth sort of implementation rule concerns a general tendency to favor some elementary models over others.  For example, Market Pricing may be highly valued in some cultures as fair and reasonable while categorized as dehumanizing in others.  The same is clearly true of Authority Ranking.  Communal Sharing is much more prominent and generally valued in some cultures than in others.  This does not mean that any culture is completely devoid of any specific elementary model but that some models are de-emphasized and marginalized in some cultures as compared to others.  So the union of mod and preo may even serve to marginalize the resulting model in relation to other models.

The fact that the same mod can be united with different preos is one source of normative plurality across cultures, to be discussed in the next section.  Another source is the generation of distinct compound mods.  Different cultures can use different mods, since there is a considerable number of potential mods to choose from.

2. Philosophical Implications

a. Moral Psychology

Each elementary model crucially enters into certain moral values.  An ethic of service to one’s group is a form of Communal Sharing.  It is an altruistic ethic in some sense, but bear in mind that all members of the group share a common identity.  So, strictly speaking, it is not true altruism.  Authority Ranking informs an ethic of obedience to authority including respect, honor, and loyalty.  Any questions of value remaining to be clarified are settled by the authority; subordinates are expected to follow the values thus dictated.  Fairness and even distribution are informed by Equality Matching.  John Rawls’ veil of ignorance exemplifies Equality Matching; a perspective in which one does not know which role one will play guarantees that one aim for equality.  Gregory Vlastos has even attempted to reduce all distributive justice to a framework that can be identified with Equality Matching.  Market Pricing informs libertarian values of freely entering into contracts and taking risks with the aim of increasing one’s own utility or the utility of one’s group.  But this also includes suffering the losses when one’s calculations prove incorrect.  Utilitarianism is a somewhat counterintuitive attempt to extend this sort of morality to all sentient life, but is still recognizable as Market Pricing.  It would be too simple, however, to say that there are only four sorts of values in RMT.  In fact, combinations of models yield complex models, resulting in a potential infinity of complex values.  Potential variety is further increased by the variability of possible preos.  This great variety of values leads to value conflicts most noticeably across cultures.

RMT strongly suggests value pluralism, in Isaiah Berlin’s sense of “pluralism”.  The pluralism in question is a cultural pluralism, different traditions producing mutually incommensurable values.  Berlin drew a distinction between relativism and pluralism, even though there are strong similarities between the two.  Relativism and pluralism both acknowledge values which are incommensurable, meaning that they cannot be reconciled and that there is no absolute or objective way to judge between them.  Pluralism, however, acknowledges empathy and emotional understanding across cultures.  Even if one does not accept the values of another culture, one still has an emotional understanding of how such values could be adopted.  This stands in contrast to relativism, as defined by Berlin.  If relativism is true, then there can be no emotional understanding of alien values.  One understands the value system of an alien culture in essentially the same manner as one understands the behavior of ants or, for that matter, the behavior of tectonic plates; it is a purely causal understanding.  It is the emotionally remote understanding of the scientist rather than the empathic understanding of someone engaging, say, with the poetry and theatre of another culture.  Adopting RMT, pluralism seems quite plausible.  Given that one has the mental capacity to generate the relevant model, one can replicate the alien value in oneself.  One is not simply thinking about the foreigner’s relational model, but using one’s shared human nature to produce that same model in oneself.  This does not, however, mean that one adopts that value, since one can also retain the conflicting model characteristic of one’s own culture.  One’s decisive motivation may still flow wholly from the latter.

But the significance of RMT for the debate over pluralism and absolutism may be more complex than indicated above.  Since RMT incorporates the view that people perceive social relationships as intrinsic values, this may indicate that a society which fosters interactions and relationships is absolutely better than one which divides and atomizes, at least along that one dimension.  This may be an element of moral absolutism in RMT, and it is interesting to see how it is to be reconciled with any pluralism also implied.

b. Computational Conceptions of Cognition

The examples of embedding in Section 1.d. not only illustrate the productivity of social-relational cognition, but also its systematicity.  To speak of the systematicity of thought means that the ability to think a given thought renders probable the ability to think a semantically close thought.  The ability to conceive of Authority Ranking embedding Market Pricing makes it highly likely that one can conceive of Market Pricing embedding Authority Ranking.  One finds productivity and systematicity in language as well.  Any phrase can be embedded in a superordinate phrase.  For example, the determiner phrase [the water] is embedded in the prepositional phrase [in [the water]], and the prepositional phrase [in [the water]] is embedded in the determiner phrase [the fish [in [the water]]].  The in-principle absence of limit here means that the number of phrases is infinite.  Further, the ability to parse (or understand) a phrase renders very probable the ability to parse (or understand) a semantically close phrase.  For example, being able to mentally process Plato did trust Socrates makes it likely that one can process Socrates did trust Plato as well as Plato did trust Plato and Socrates did trust Socrates.  Productivity and systematicity, either in language or in social-relational cognition, constitute a strong inductive argument for a combinatorial operation that respects semantic relations.  (The operation respects semantic relations, given that the meaning of a generated compound is a function of the meanings of its constituents and their arrangement.)  In other words, it is an argument for digital computation.

This is essentially Noam Chomsky’s argument for a computational procedure explaining syntax (insofar as syntax is not idiomatic).  It is also essentially Jerry Fodor’s argument for computational procedures constituting thought processes more generally.  That digital computation underlies both complex social-relational cognition and language raises important questions.  Are all mental processes largely computational or might language and social-relational cognition be special cases?  Do language and social-relational cognition share the same computational mechanism or do they each have their own?  What are the constraints on computation in either language or social-relational cognition?

c. Platonism

Chomsky has noted the discrete infinity of language.  Each phrase consists of a number of constituents which can be counted using natural numbers (discreteness), and there is no longest phrase meaning that the set of all possible phrases is infinite.  Analogous points apply to social-relational cognition.  The number of instances of an elementary mod within any mod can be counted using natural numbers.  In the case discussed earlier in which a customer is ordering food from a waiter, there is one instance of Authority Ranking embedded in one instance of Market Pricing.  The total number of instances is two, a natural number.  There is no principled upper limit on the number of embeddings, hence infinity.  The discrete infinity of language and social-relational cognition is tantamount to their productivity.

However, some philosophers, especially Jerrold Katz, have argued that nothing empirical can exhibit discrete infinity.  Something empirical may be continuously infinite, such as a volume of space containing infinitely many points.  But the indefinite addition of constituent upon constituent has no empirical exemplification.  Space-time, if it were finite in this sense, would contain only finite energy and a finite number of particles.  There are not infinitely many objects, as discrete infinity would imply.  On this reasoning, the discrete infinity of an entity can only mean that the entity exists beyond space and time, still assuming that space-time is finite.  This would mean that sentences, and by similar reasoning compound mods as well, are abstract objects rather than neural features or processes.  This would mean that mods and sentences are abstract objects like numbers.  One finds here a kind of Platonism, Platonism here defined as the view that there are abstract objects.

As a tentative reply, one could say that the symbols generated by a computational system are potentially infinite in number, but this raises questions about the nature of potentiality.  What is a merely potential mod or a merely potential sentence?  It is not something with any spatiotemporal location or any causal power.  Perhaps it is sentence types (as contrasted with tokens) that exhibit discrete infinity.  And likewise with mods, it is mod types that exhibit discrete infinity.  But here too, one is appealing to entities, namely types, that have no spatiotemporal location or causal power.  By definition, these are abstract objects.

The case for Platonism is perhaps stronger for compound mods, but one could also defend the same conclusion with regard to the elementary mods.  Each elementary mod, as noted earlier, corresponds to one of the classic measurement scales.  Different scale types are presupposed by different logics.  Classical two-valued logic presupposes a nominal scale, as illustrated by the law of excluded middle: a statement is either on the truth scale, in which case it is true, or off the scale, in which case it is false.  Alternatively, one could posit two categories, one for true and one for false, and stipulate that any statement belongs on one scale or the other.  Fuzzy logics conceive truth either in terms of interval scales, for example, it is two degrees more true that Michel is bald than that Van is bald, or in terms of ratio scales, for example, it is 80 percent true that Van is bald, 100 percent true that Michel is bald.  Even though it has perhaps not been formalized, there is intuitively a logic which presupposes an ordinal scale.  A logic, say,  in which it is more true that chess is a game than that Ring a Ring o’ Roses is a game, even though it would be meaningless to ask how much more.  If nominal, ordinal, interval, and ratio scales are more basic than various logics, then the question arises as to whether they can seriously be considered empirical or spatiotemporal.  If anything is Platonic, then something more basic than logic is likely to be Platonic.  And what is an elementary mod aside from the scale type which it “resembles”?  Is there any reason to distinguish the elementary mod from the scale type itself?  If not, then the elementary mods themselves are abstract objects, at least on this argument.

Does reflection upon language and the relational models support a Platonist metaphysic?  If so, what is one to make of the earlier discussion of RMT appealing, as it did, to neural symmetry breakings and mental computations?  If mods are abstract objects, then the symmetry breakings and computations may belong to the epistemology of RMT rather than to its metaphysics.  In other words, they may throw light on how one knows about mods rather than actually constituting the mods themselves.  Specifically, the symmetry breaking and computations may account for the production of mental representations of mods rather than the mods themselves.  But whether or not there is a good case here for Platonism is, no doubt, open to further questioning.

3. References

a. Specifically Addressing Relational Models Theory

  • Bolender, John. (2010), The Self-Organizing Social Mind (Cambridge, Mass.: MIT Press).
    • Argues that the elementary relational models are due to self-organizing brain activity.  Also contains a discussion of possible Platonist implications of RMT.
  • Bolender, John. (2011), Digital Social Mind (Exeter, UK: Imprint Academic).
    • Argues that complex relational models are due to mental computations.
  • Fiske, Alan Page. (1990), “Relativity within Moose (‘Mossi’) culture: four incommensurable models for social relationships,” Ethos, 18, pp. 180-204.
    • Fiske here argues that RMT supports moral relativism, although his “relativism” may be the same as Berlin’s “pluralism.”
  • Fiske, Alan Page. (1991), Structures of Social Life: The Four Elementary Forms of Human Relations (New York: The Free Press).
    • The classic work on RMT, containing the first full statement of the theory and a wealth of anthropological illustrations.
  • Fiske, Alan Page. (1992), “The Four Elementary Forms of Sociality: Framework for a Unified Theory of Social Relations,” Psychological Review, 99, 689-723.
    • Essentially, a shorter version of Fiske’s (1991).  Nonetheless, this is a detailed and substantial introduction to RMT.
  • Fiske, Alan Page. (2004), “Relational Models Theory 2.0,” in Haslam (2004a).
    • An updated introduction to RMT.
  • Haslam, Nick. ed. (2004a), Relational Models Theory: A Contemporary Overview (Mahwah, New Jersey and London: Lawrence Erlbaum).
    • An anthology containing an updated introduction to RMT as well as discussions of controlled empirical evidence supporting the theory.
  • Haslam, Nick. ed. (2004b), “Research on the Relational Models: An Overview,” in Haslam (2004a).
    • Reviews controlled studies corroborating that the elementary relational models play an important role in cognition including person perception.
  • Pinker, Steven. (2007), The Stuff of Thought: Language as a Window into Human Nature (London: Allen Lane).
    • Argues that Market Pricing, in contrast to the other three elementary models, is not innate and is somehow unnatural.

b. Related Issues

  • Berlin, Isaiah. (1990), The Crooked Timber of Humanity: Chapters in the History of Ideas. Edited by H. Hardy (London: Pimlico).
    • A discussion of value pluralism in the context of history of ideas.
  • Fodor, Jerry A. (1987), Psychosemantics: The Problem of Meaning in the Philosophy of Mind (Cambridge, Mass. and London: MIT Press).
    • The Appendix argues that systematicity and productivity in thought require a combinatorial system.  The point, however, is a general one, not specifically focused on social-relational cognition.
  • Katz, Jerrold J. (1996), “The unfinished Chomskyan revolution,” Mind & Language, 11 (3), pp. 270-294.
    • Argues that only an abstract object can exhibit discrete infinity.
  • Rawls, John. (1971), A Theory of Justice (Cambridge, Mass.: Harvard University Press).
    • The veil of ignorance illustrates Equality Matching.
  • Szpiro, George G. (2010), Numbers Rule: The Vexing Mathematics of Democracy, from Plato to the Present (Princeton: Princeton University Press).
    • Illustrates various ways in which Equality Matching can be implemented.
  • Stevens, S. S. (1946), “On the Theory of Scales of Measurement,” Science 103, pp. 677-680.
    • A classic discussion of the types of measurement scales.
  • Vlastos, Gregory. (1962), “Justice and Equality,” in Richard B. Brandt, ed. Social Justice (Englewood Cliffs, New Jersey: Prentice-Hall).
    • An attempt to understand all distributive justice in terms of Equality Matching.

Author Information

John Bolender
Email: bolender@metu.edu.tr
Middle East Technical University
Turkey

Global Ethics: Capabilities Approach

The capabilities approach is meant to identify a space in which we can make cross-cultural judgments about ways of life. The capabilities approach is radically different from, yet indebted to, traditional ethical theories such as virtue ethics, consequentialism and deontology.

This article begins with a background on global ethics. This situates the capabilities approach as a possible solution to the problems that arise from globalization. The second section provides Amartya Sen’s account of the basic framework of the capabilities approach. That section also shows how Martha Nussbaum develops the approach. The third section describes Nussbaum’s list of ten central capabilities. This list has been viewed by some philosophers as a definitive list, while others, notably Sen, have argued that no list is complete, because a list should always be subject to revision. The fourth section shows how the approach is similar to, yet very different from, traditional ethical theories such as virtue ethics, consequentialism and deontology. The capabilities approach is shown to add to the approaches of global ethics such as communitarianism, human rights, and the approach of John Rawls. The section compares Michael Boylan’s table of embeddedness with Nussbaum’s capabilities list. The fifth section discusses two main philosophical critiques of the capabilities approach. First, and most notably, Alison Jaggar criticizes Nussbaum for not paying closer attention to asymmetrical power relations. Second, Bernard Williams raises questions about what constitutes a capability. The sixth section shows how the capabilities approach has been applied to advance various areas of applied philosophy including the environment and disability ethics. The final section explains how the capabilities approach has been undertaken as a global endeavor by the United Nations Development Program to fight poverty and illiteracy and to empower women.

Table of Contents

  1. Background of Global Ethics
  2. The Capabilities Approach
    1. Sen
    2. Nussbaum
  3. Nussbaum’s List of Central Capabilities
  4. The Relationship between the Capabilities Approach and Other Ethical Theories
    1. Virtue Ethics
    2. Communitarianism
    3. Deontology
    4. Rawls’ The Law of Peoples
    5. Human Rights
    6. Consequentialism
    7. Boylan’s Table of Embeddedness
  5. Philosophical Criticisms of the Capabilities Approach
    1. Illiberal and Neo-Colonialist
    2. What Is a Capability?
  6. Philosophical Applications
    1. The Environment
    2. Disability Ethics
  7. United Nations Development Program
  8. References and Further Reading

1. Background of Global Ethics

Issues of globalization have sparked great controversy since the 1980s. Globalization, broadly construed, is manifested in various forms of social activity including economic, political and cultural life. Practicing global ethics entails moral reasoning across borders. Borders can entail culture, religion, ethnicity, gender, race, class, sexuality, global location, historical experience, environment, species and nations. Ethicists ask how we best address issues of globalization–that is, how we begin to address conflicts that arise when vastly different cultural norms, values, and practices collide.

 

There have been two broad philosophical approaches to address cross-border moral disagreement and conflict. The dominant approach aims to develop moral theories that are not committed to a single metaphysical world-view or religious foundation, but are compatible with various perspectives. In other words, it is a goal to develop a theory that is both ‘thick’ (that is, it has a robust conception of the good embedded within a particular context, and respects local traditions) and ‘thin’ (that is, it embraces a set of universal norms). These universalists include human rights theorists, Onora O’Neill’s deontology, Seyla Benhabib’s discourse ethics and Martha Nussbaum’s capabilities approach. They tend to be associated with constructing ‘thin’ theories of morality. The other approach, most notably advocated by Michael Walzer, is communitarianism. Communitarians deny the possibility of developing a single universal standard of flourishing that is both thick enough to be useful and thin enough to support reasonable pluralism.

 

The debate between these two approaches to global ethics has reached an impasse. Since communitarians hold that moral norms are always local and valid internal to a particular community, universalists charge the communitarians with relativism. Moreover, universalists argue that communitarians fail to provide useful methods for addressing cross-border moral conflict. However, the communitarians charge the universalists with either positing theories that are too thin to be useful or advancing theories that are substantive but covertly build in premises that are not universally shared, and so risk cultural imperialism.

 

Martha Nussbaum believes her capabilities theory resolves the impasse and offers a viable approach to global ethics that provides a universal measure of human flourishing while also respecting religious and cultural differences. The capabilities approach, she argues, is universal, but ‘of a particular type.’ That is, it is a thick (or substantive) theory of morality that accommodates pluralism. Thus, she argues that her theory avoids criticisms applied to other universalists and communitarians. Before examining her theory, we must address her predecessor, Amartya Sen.

2. The Capabilities Approach

a. Sen

Amartya Sen, an economic theorist and founder of the capabilities approach, developed his theory in order to identify a space in which we can make cross cultural judgments on the quality of life. To best understand how these judgments can be passed, we must investigate a critical distinction made by proponents of the capabilities approach–between function and capability. A function, on the one hand, according to Sen, is an achievement, but this should be broadly understood to include any ‘state of being.’ Let’s examine Sen’s bike-riding example to shed light on a ‘function.’ He says a bicyclist has achieved the purpose of what one does with a bike–namely, ride it. From this example, clearly the choice to ride a bike is a function of a human being, however, the scope of functioning is not merely limited to a person’s intention to ride the bike. A ‘function’ entails any ‘state of being’ which includes excitement, happiness and fear. For example, a child who first begins to ride her bike may display a great amount of fear as she wobbles down the road, but once she understands how to ride the bike smoothly, she can enjoy (or perhaps become excited) riding her bike. Thus, when the child rides her bike (and is excited from doing so), she has performed the functions of riding a bike, and having the emotions associated with doing so, while partaking in the capability of play.

A capability, on the other hand, is a possibility, not just any possibility, but a real one. For example, we can talk about the possibility of a person in a deeply poverty-stricken area to find employment and support a family. However, such a possibility may not be real considering external circumstances–for example, no clothing, food or shelter. Put differently, a ‘capability set’ (as Sen calls it) is the total functions available for a person to perform.  By describing it in such a way, Sen places a deep correlation between freedom and function. That is to say, the more limited one’s freedom, the less opportunities one has to fulfill one’s functions. In sum, Crocker (2008) says succinctly that, according to Sen, a capability X entails (1) having the real possibility for X which (2) depends on my powers and (3) and no external circumstances preventing me from X.

A capability and function should not be understood as mutually exclusive or completely paralleling one another. Let’s consider two people with the same capabilities. Even though they have same capabilities, they may participate in radically different functions. For example, two people may both have the opportunity to engage in play, but do so in radically different ways (for example, one may swim while the other volunteers at a homeless shelter). Proponents of the capabilities approach argue this makes the theory most attractive, that is, it accommodates various ways of life even though it puts forth a conception of the good. Now, let’s consider a situation in which people participate in the same functions, but possess different capabilities set. Consider Sen’s example of hunger. Two people may be hungry, but for radically different reasons. Consider, on the one hand, a person who seeks to fulfill her desire to eat, but cannot because of socio-economic circumstances.  On the other, a person may be hungry because she is fasting for religious reasons or protesting an injustice. In both examples, the person suffers from starvation, but for radically different reasons.

b. Nussbaum

Nussbaum begins her capabilities approach by noting her indebtedness to Aristotle and Karl Marx (and to a lesser extent, J.S. Mill). Like Sen, she embraces the capabilities/function distinction. However, she begins to part ways with Sen’s philosophy when she grounds her theory in Marx and Aristotle. In doing so she argues that a function must not be performed in just any way, but in a ‘truly human way.’ That is to say, if a person lives a life where she is unable to exercise her human powers (for example, self-expressive creativity) then she is living her life in more of an animalistic manner than as a human being.

Nussbaum seeks a capabilities approach that can fully express human powers and not just provide (real) opportunities for people to perform certain functions. In other words, she does not deny, as Sen argues, that a capability is a real possibility or opportunity for an individual to perform certain actions, but that is merely necessary and not sufficient for the capabilities approach. Sen is missing, according to Nussbaum, aspects of what is particularly unique to human beings, that is, human powers. Nussbaum understands the capabilities/function distinction as multiply realized–that is, while the capabilities are the space for the opportunity for particular actions, the way in which that space is manifested, via different actions, is a person’s functioning.

Nussbaum notes that there are three specific differences that sets her capabilities approach apart from Sen. First, Nussbaum (2000) charges Sen with not explicitly rejecting cultural relativism. She agrees with his sympathies for universal norms, she also, criticizes his inability to completely reject cultural relativism. Second, Nussbaum criticizes Sen for not grounding his theory in a Marxian/Aristotelian idea of true human functioning. This is not to say that he would reject Nussbaum’s conclusions drawn from Marx and Aristotle, but rather he is not specifically indebted to (and does not ground his theory in) them.  Third, Sen does not provide an explicit list of central capabilities As a matter of fact, Sen has been critical of attempting to provide a list of central capabilities. Nonetheless, these three points of division seem to separate Sen and Nussbaum.

Nussbaum’s two philosophical justifications are the non-Platonic substantive good approach (that is, intuitionism) and a limited role of proceduralism (that is, discourse ethics)–which are a point of contention amongst critics. According to the former, the primary justification for the capabilities approach, we test various ethical theories against our fixed intuitions and decide which theory best matches them. Nussbaum contends that the theory that best represents our intuitions is the capabilities approach. The intuition that grounds the capabilities, according to Nussbaum, is the intuition of a dignified human life whereby people have the capability to pursue their conception of the good in cooperation with others. Consider her example of a person’s fixed intuition that rape is damaging to human dignity. She claims if one matches that intuition against all ethical theories that it will be best represented by the capabilities approach.

One may have reservations for this justification in situations where a person has underdeveloped (that is, intuitions that have not been challenged by competing intuitions) or mistaken intuitions. In response, Nussbaum argues that underdeveloped and mistaken intuitions must be rejected, and replaced with diversely experienced people who have tested their intuitions against competing beliefs. Although Nussbaum notes the primacy of intuitionism, she also argues that proceduralism has an ancillary justification for the capabilities approach.

Nussbaum’s proceduralism begins not with an intuition, but with a decision procedure, and it is the procedure that confers justification on the outcome. She is sympathetic to this form of proceduralism since it is rooted in Kantian discourse ethics (adopted by Jean Hampton), and has accordingly built into it a conception of equal human worth. In that sense proceduralism is similar to the intuitionist justification. However, there are stark contrasts. What is proceduralism, then? The version Nussbaum is concerned with claims that one consults the desires or preferences of another who is impacted by the outcome of the decision at hand. Similar to the concern above, Nussbaum fears that many people’s desires (like intuitions) will be corrupt, and thus produce a morally repugnant conclusion. Therefore, she seeks not just any desires, but ‘informed desires,’ that is, desires constructed by treating people with dignity. However, because not all desires are informed, and yet proceduralism calls for us to consult all desires affected by the decision, the capabilities approach would be placed on too weak of a foundation. Thus, in virtue of all the mistaken desires, proceduralism merely plays an ancillary role. Yet, it’s fair to say that if everyone had informed desires, then Nussbaum would grant proceduralism as a primary justification for the capabilities approach.

These two justifications are meant to be mutually reinforcing. They are meant to justify both the capabilities approach qua theory and the particular list of central capabilities put forth by Nussbaum. However, due to the limitations Nussbaum places on proceduralism, we must rely on intuitionism as the main justification.

3. Nussbaum’s List of Central Capabilities

 

There is much debate over whether Nussbaum’s list of central capabilities is revisable, and thus subject to change, or whether it is a fixed set of capabilities that cannot be compromised. Earlier in her career, Nussbaum (1995) argued that her list was static, however, she has since backed off such a claim and acknowledged the possibility that they could be altered. From her book, Women and Human Development: The Capabilities Approach (WHD hereafter), here is her list of capabilities, along with a brief description of each.

1. Life – Able to live to the end of a normal length human life, and to not have one’s life reduced to not worth living.

2. Bodily Health – Able to have a good life which includes (but is not limited to) reproductive health, nourishment and shelter.

3. Bodily Integrity – Able to change locations freely, in addition to, having sovereignty over one’s body which includes being secure against assault (for example, sexual assault, child sexual abuse, domestic violence and the opportunity for sexual satisfaction).

4. Senses, Imagination and Thought – Able to use one’s senses to imagine, think and reason in a ‘truly human way’–informed by an adequate education. Furthermore, the ability to produce self-expressive works and engage in religious rituals without fear of political ramifications. The ability to have pleasurable experiences and avoid unnecessary pain. Finally, the ability to seek the meaning of life.

5. Emotions – Able to have attachments to things outside of ourselves; this includes being able to love others, grieve at the loss of loved ones and be angry when it is justified.

6. Practical Reason – Able to form a conception of the good and critically reflect on it.

7. Affiliation

A. Able to live with and show concern for others, empathize with (and show compassion for) others and the capability of justice and friendship. Institutions help develop and protect forms of affiliation.

B. Able to have self-respect and not be humiliated by others, that is, being treated with dignity and equal worth. This entails (at the very least) protections of being discriminated on the basis of race, sex, sexuality, religion, caste, ethnicity and nationality. In work, this means entering relationships of mutual recognition.

8. Other Species – Able to have concern for and live with other animals, plants and the environment at large.

9. Play – Able to laugh, play and enjoy recreational activities.

10. Control over One’s Environment

A. Political – Able to effectively participate in the political life which includes having the right to free speech and association.

B. Material – Able to own property, not just formally, but materially (that is, as a real opportunity). Furthermore, having the ability to seek employment on an equal basis as others, and the freedom from unwarranted search and seizure.

Even though Nussbaum claims each of the ten capabilities is equally important, she places special emphasis on two of them–namely, practical reason and affiliation. We see the importance when she explicitly says the core behind the intuition of human functioning is that of a dignified free person who constructs her way of life in reciprocity with others, and not merely following, or being shaped by, others. Furthermore, Nussbaum notes that these two capabilities suffuse all the others, and this in turn, constitutes a truly human pursuit.

Furthermore, Nussbaum argues that the list is ‘thick,’ but ‘vague.’ It is thick because it provides a specific conception of the good life (that is, human flourishing), however, it is not thick enough that it mandates how one ought to live one’s life. Thus, the capabilities list is ‘thick’ enough to allow us to make cross-cultural judgments (for example, identifying areas where an individual or groups of people are unable to actualize a capability), and yet ‘vague’ enough for an individual to choose whether or not (or how) she wishes to participate in a capability.

Finally, Nussbaum says that citizens should be guaranteed a social minimum whereby capabilities can be realized. It is the role of institutions to ensure that a threshold level of central capabilities is achieved. Institutions (for example, religious, labor, government, and so forth) come in many forms, and protect various interests. For example, the Self Employed Women’s Association (SEWA) helps women provide protection and benefits for work in which they have been traditionally underappreciated. However, as Nussbaum notes, achieving the threshold may not be enough for justice.

4. The Relationship between the Capabilities Approach and Other Ethical Theories

The ethical theories that have dominated Western philosophy include (in one form or another) virtue ethics, consequentialism and deontology. The capabilities cannot be reduced to any of those ethical theories, however, it is indebted more or less to each of them. This section will review Rawls and human rights, both of which have numerous deontological underpinnings, and communitarianism which is closely linked with ethics. Finally, this section will include a section on Michael Boylan’s ‘table of embeddedness’ in order to see the challenges and parallels between it and Nussbaum’s list of capabilities. This section will explore parallels and differences between the capabilities approach and the above ethical theories.

a. Virtue Ethics

Even though there are clear differences between the virtue tradition (specifically, Aristotle) and the capabilities approach, Nussbaum uses the former as a point of departure. That is, Aristotle is the foundation for the capabilities approach because Nussbaum seeks a theory that provides the opportunity for human beings to use their powers to flourish in a truly human way.

Virtue ethics, broadly speaking, like the capabilities approach, claims human beings should exercise their powers qua human in attempt in order to live well. Contemporary neo-Aristotelians strive to explicate an account of flourishing  which may entail providing a naturalistic account of flourishing or through empirical psychology. Nussbaum, however, interprets Aristotle’s account of functioning as merely a moral concept and not naturalistic). However, unlike other neo-Aristotelians (and Aristotle himself), Nussbaum has no intention of providing a comprehensive doctrine of human flourishing, although, as noted above, she believes she is providing a tentatively comprehensive list of capabilities.

There is another stark contrast between virtue ethics and the capabilities approach–namely, character building and motivation. Nussbaum is less concerned with why people perform certain actions, and building one’s character over a period of time through proper motivations, and more concerned with providing the proper space that allows an individual to use her powers to fulfill a capability, if she chooses. One should not mistake this claim to mean that Nussbaum is not concerned with motivation at all, but rather this should be viewed as a shift in emphasis. Nussbaum argues in WHD that informed desires (that is, the justification for the capabilities approach) cannot be any desire, but those which contribute to living well. For example, even though one may fulfill the capability of practical reason through education, one should not use it in such a way that coerces others. Such a desire would be condemned by Nussbaum since, on the one hand, it prevents the coerced person from participating in all the capabilities, and on the other, it does not reflect an informed one.

b. Communitarianism

Communitarianism is a critique of liberal theory, and, on the other, emphasizes the importance of political norms within a community. In brief, liberal theorists contend that a self is ahistorical, asocial and apolitical.  Thus it is not necessarily the case that it will be burdened by the practices and beliefs of its community. Michael Sandel, a nationalist-communitarian, explains that a liberal self is ‘unencumbered’–that is, it is not wedded to a particular conception of the good not of its choosing. This abstract ontology allows liberals to make certain moves in the political sphere. For example, the concept of ‘justice’ entails universal normative claims since all human beings are ontologically the same.

In contrast, Alasdair MacIntyre, a communitarian indebted to Aristotle, argues against liberal political theory beginning with their conception of the self. He says a self is embedded within a particular set of cultural beliefs, practices and history. MacIntyre, following Aristotle, claims that in order for one to live a good life, one must be virtuous. A virtue, according to MacIntyre (2007), is a character trait that allows us to achieve goods that are internal to one’s practices By ‘practice,’ he is referring to a “socially established cooperative human activity through which goods internal to that form of activity are realized in the course of trying to achieve those standards of excellence….”  Thus, living a good life entails being virtuous within the context of a given practice (or community).

Furthermore, communitarians believe justice is limited to communities rather than human beings at large. This, in turn, allows them to reject the notion that we can make universal normative judgments. Finally, MacIntyre believes we need extend our conception of virtue from the individual to the community. It’s a bit unclear what a virtuous community would look like exactly, however, we know that it would have a conception of the good life in which people strive. This is clearly contrary to the liberal project in which, , individuals pursue whatever conception of the good they wish as long as they do not interfere or harm another.

Nussbaum is sympathetic to communitarianism insofar as it acknowledges the importance of local traditions and practices that shape our lives. For example, a Hindi woman in India will have a set of beliefs that shape who she is that differs from a Protestant male in the United States. However, Nussbaum ultimately rejects communitarianism. In her section entitled “Defending Universal Values” from WHD, she says communitarians fail to recognize that there is a conception of the individual that is not indebted to a particular metaphysical tradition. She argues that each person should be treated as an end, worthy of respect, dignity and honor. As mentioned in section II, Nussbaum believes the capabilities is founded on the intuition that each person is worthy of a dignified life, and this intuition holds irrespective of one’s community.

c. Deontology

In putting forth her ancillary justification for the capabilities, Nussbaum is indebted to Jean Hampton’s Kantian proceduralism. Nussbaum (2000) believes we need a “Kantian conception of human worth that prominently includes the ideas of equal worth and nonaggregation” (Nussbaum’s italics,). There are two points to take from this claim. First, she is indebted to the Kantian notion that all human beings have intrinsic worth, and as a result, they should always be treated as an end and never merely as a means. Second, she is critiquing the consequentialist argument for aggregate utility. We saw her specific problems with this argument immediately above.

Although Nussbaum is clearly indebted to deontology since it is a justification (albeit auxiliary) for the capabilities, there remains questions to what extent Kant plays a role. David Crocker (2008) argues that her Kantian equal-worth commitment is nothing more than an addition onto her Aristotelianism since the latter justifies moral and political inequality.

d. Rawls’ The Law of Peoples

 

John Rawls uses the same methodology (and preserves the liberal ontological framework of ‘autonomy’ and ‘reason’) in The Law of Peoples as in A Theory of Justice however, he has extended justice to a global scale rather than merely nationally. Beginning with the ‘global original position,’ Rawls argues that all reasonable (or decent) persons would construct political ideals that benefit all liberal peoples; these ideals would be reached via overlapping consensus. See Daniels (1989) and Pogge (1989) for further discussion on Rawls’ original position. A liberal, democratic society, according to Rawls (1999), would include the following benefits: (1) fair equality of opportunity–including, education, (2) a decent distribution of income, (3) society as employer of last resort through general or local government, (4) basic health care for all citizens and (5) public financing of elections (p. 50).

Rawls (1999) claims that the policies constructed by liberal peoples should direct non-liberal societies to (ideally) all become liberal. Rawls deems an illiberal society which rejects the possibility of becoming liberal (for example, abiding by human rights regulations) as an ‘outlaw state.’ While liberal societies should attempt to tolerate illiberal societies initially, he contends an outlaw state eventually subjects itself to severe sanctions and possible intervention

Nussbaum is indebted to not only Rawls specifically, but often praises the values of liberalism. First, she is committed to Rawls’ method of ‘overlapping consensus’ insofar as it is politically advantageous to perform such tasks as fairly distributing primary goods. Furthermore, Nussbaum (2000) respects Rawls attentiveness to “pluralism and paternalism” while remaining committed to the importance of basic liberties Finally, Nussbaum agrees with Rawls (and liberalism more generally) that we should treat people as dignified human beings, and respect their autonomy qua individual.

Nussbaum is also critical of Rawls beginning with his reluctance to make comparisons of well-being. Rawls refuses to make comparisons since each person constructs their conception of the good, so a person may be satisfied with their way of life even though another may find it unsatisfactory. While there may be fears of paternalism, Nussbaum is clear that we should make comparisons of well-being in order to grant certain areas as needing more resources than others. From this, Nussbaum (2000) criticizes Rawls for not taking seriously enough how greatly individuals vary in their needs. Consider her example. If we are concerned with spending resources on increasing literacy rates around the world, we will have to spend much more on women than men given the discrepancy between them. However, Nussbaum argues that Rawls’ approach could not properly address the obstacles when distributing resources since he is merely concerned with resource-distribution, and not cognizant of the variations of distribution within a particular region.

e. Human Rights

The rhetoric of human rights has arguably been more powerful than any other approach to global justice. There is debate amongst human rights advocates in regards to the origin of rights, how they are manifested (that is, who possess them), their possibility of group distribution and how they ought to be enforced. Nonetheless, human rights are universal political norms that belong to every individual simply in virtue of being human. It does not matter whether one belongs to one affiliation or another; but merely in virtue of being a human being, she is guaranteed minimal norms (for example, the right to life or liberty). These are minimal insofar as they are not connected with any conception of the good life, and thus, do not preclude any groups of people (or communities). For further discussion on the nature of human rights see Griffin (2008) and Donnelly (2003).

Alan Gewirth, in The Community of Rights, attempts to make human rights compatible with communities. We can see the difficulty of such a task given the commitment the communitarianism theorists have to a common good, on the one hand, and a value-neutral approach from rights, on the other. Nonetheless, Gewirth argues that if a community does not uphold a doctrine of human rights, then it ought to be rejected as a legitimate community. Gewirth puts forth a theory of human rights while respecting the role communities play in our lives. Furthermore, Will Kymlicka (1989) extends the concept of rights by constructing a theory of rights that considers communities or group rights.

In WHD, Nussbaum directly addresses the “very close” relationship between human rights and the capabilities approach. She believes the capabilities approach has advantages over human rights insofar as it can take a clear position on issues the latter cannot in addition to providing a clear goal. For example, human rights theorists often disagree on the origin and foundation of rights, whereas the capabilities approach, according to Nussbaum, is not plagued by such criticisms. She raises two concerns for why we should reject human rights in favor of the capabilities approach, and then provides four key roles for human rights.

Nussbaum first claims that human rights proponents often make rights claims in regards to property or economic advantage (for example, they have a right to shelter). However, in converting a language of rights to capabilities, she explains that this statement becomes problematic insofar as it can be understood in many ways including resources, utility and capabilities. The human rights tradition would discuss it in terms of resources; however, merely providing resources does not necessarily raise everyone to the same level of capability in order to allow them to fulfill their function. Second, the language of capability ethics does not contain all the baggage that pertains to human rights.  Although Nussbaum rejects the understanding that human rights are often characterized as simply being Western, she also says the capabilities approach avoids the troubles surrounding this debate.

Even though Nussbaum is critical of human rights, she believes is plays an essential role in global ethics. She presents the following four roles (or advantages) of human rights. First, human rights have the advantage of showing the urgency to claims of injustice. Second, human rights (as of now) have rhetorical power. Third, human rights place value on people’s autonomy. Finally, human rights preserve a sense of agreement insofar as it purports norms that apply to everyone.

f. Consequentialism

It would be easy to mistake the capabilities approach as a consequentialist argument to increase the overall utility in the world, where ‘utility’ can be understood in many ways–including ‘happiness.’ Peter Singer (1972), in his influential work, “Famine, Affluence and Morality,” puts forth arguments fighting global poverty from a consequentialist standpoint. In sum, he argues through a series of objections and replies that those in positions of material power should donate to those in less favorable conditions in order to increase the overall utility (and ultimately decrease poverty) throughout the world. It can be said that that Singer’s consequentialism and the capabilities approach are similar insofar as they both more or less seek to directly reduce poverty, and furthermore, provide more opportunities for those who have few or none.

However, Nussbaum (2000) provides three reasons for why consequentialism is different from the capabilities approach. First, one major difference is for whom the ethical theory accounts. On the one hand, consequentialism is interested in maximizing the utility of everyone (that is, the aggregate). On the other, the capabilities approach is interested in the individual. For example, Nussbaum says that the aggregative solution does not tell us who are the bottom and top, that is, who has control over material goods and whether or not someone else deserves a share of it. Thus, by focusing on the individual, we are able to best identify who needs resources and how much.

Second, related to the above point, consequentialism tends to ignore cross-cultural differences, that is, ignoring the fact that people live vastly different lives. As consequentialism is concerned with overall utility (and not merely particular persons or groups of people), it may ignore a particular good that is minimized in one culture, but widely present in another. Put differently, there are many goods–including education and religion–that are highly important to some and relatively unimportant to others. Consequentialism aggregates all goods under the heading of ‘utility,’ and thus, we are unable to identify which goods must be properly distributed to a particular region. The capabilities approach, however, is not only interested in allowing groups of people to use their power to fulfill a capability, but in each individual person to partake in a capability.

Finally, consequentialism ignores relevant aspects of individuals including emotions (that is, how individuals feel about what is happening to them) and what they are able to do or be (that is, fulfill a capability). This critique tends to be associated with consequentialism at large (and not specifically from the capabilities approach), but it is still worth noting. Since the capabilities strive for human flourishing, which entails the ability to express emotions without fear, we can understand why Nussbaum reiterates this critique.

g. Boylan’s Table of Embeddedness

Michael Boylan, in A Just Society, presents a ‘table of embeddedness,’ which is meant to describe a hierarchy of goods. Boylan’s argument for the table can be seen as follows: if people desire to be good, and becoming good requires action, then all people desire to act; the following table presents the interconnectedness between Boylan’s preconditions for actions and a hierarchy of goods.

Boylan (2004) splits the table into two levels–basic goods and secondary goods. The former, on the one hand, is broken further into ‘most deeply embedded’ goods (for example, food, clothing, shelter and free from being harmed) and ‘deeply embedded’ goods (for example, literate, basic math skills, treated with self-respect, and so forth). On the other hand, Boylan divides the latter into ‘life enhancing’ goods (for example, societal respect, equal opportunity and equal political participation), ‘useful’ goods (for example, property, gain from one’s labor and pursue goods owned by the general public such as a cell phone) and ‘luxurious’ goods (for example, pursue pleasant goods such as vacationing and use one’s will to possess a large portion of society’s resources). Even though society has no duty to provide ‘useful’ or ‘luxurious’ goods, it has an obligation to provide basic goods and life enhancing goods (from the secondary goods) to its members. Finally, in striving for equal respect, Boylan claims society may have to spend greater resources on those who are disadvantaged; in doing so Nussbaum would be sympathetic to Boylan’s claim that some groups of people require disproportionally more resources given their unfortunate circumstances than another. This was her critique of Rawls–namely, that he did not account for the varying needs of individuals. Furthermore, Nussbaum would also grant that society has an obligation to provide its citizens with Boylan’s basic goods such as food, shelter and water. However, the roles in which each list plays will be different given how their respective authors understand its purpose.

Nussbaum’s list, unlike Boylan’s, is not hierarchal, but rather everyone ought to have equal opportunity to perform a function that fulfills a capability. In other words, no capability, according to Nussbaum, is more essential than another. Marcus Düwell (2009) provides two criticisms of this view. First, he claims a lack of hierarchy of goods (or capabilities) raises concerns about its practical guidance in “morally contested topics.” Even though Nussbaum argues that no primacy should be given to a particular capability, it’s worth noting that it would be difficult to fulfill the capability of ‘bodily integrity,’ for example, if one’s capability of life is taken away. Second, it also raises concerns to what extent the capabilities are “foundational moral obligations for others.”

5. Philosophical Criticisms of the Capabilities Approach

The capabilities approach has endured many criticisms since its inception. The primary critique is constructed from the feminist and non-Western perspective. This entry will focus on Alison Jaggar’s critique since it embodies many concerns of power relations. Meanwhile, the latter critique can be found in many theorists, but the focus of this entry will be limited to Bernard Williams since he puts forth two challenges in attempt to seek the nature of a capability. Jaggar’s criticisms are limited to Nussbaum, and Williams’ critique is directed primarily towards Sen. This will provide a greater array of criticisms for the capabilities theory in general.

a. Illiberal and Neo-Colonialist

Alison Jaggar criticizes both Nussbaum’s justifications for the capabilities approach and her list. Jaggar believes Nussbaum may have ignored power asymmetries that exist between not only men and women, but also Western and non-Western peoples. She argues that the intuitionist and proceduralist justifications seem to be neo-colonialist and illiberal.

First, Jaggar (2006) argues that Nussbaum’s theory appears to be neo-colonialist insofar as those in power have the “final authority…to assess the moral worth of…[other’s] voices”. This is problematic for the intuitionist justification since those who possess intuitions that do not match the capabilities list, for example, will be interpreted and possibly jettisoned. Put differently, there are no mechanisms in Nussbaum’s approach that allow us to encourage self-criticism from those who possess the list. Furthermore, Jaggar emphasizes that Nussbaum is committed to a politically liberal project (that is, considering everyone’s intuitions), however, the intuitionist justification paradoxically dismisses ideas that do not match the theory put forth by Nussbaum, and thus, it illiberally disregards others. In order for Nussbaum’s theory to encourage self-criticism, she must include all intuitions.

Second, the capabilities list seems to be illiberal since “other voices” (that is, mistaken or uninformed desires) are not ready for a proceduralist justification. Since Nussbaum demands only informed desires participate in the proceduralist justification for the list, desires that do not match the list will be unable to partake in the discourse. Furthermore, because these voices are silenced, there may be capabilities missing from the list or capabilities on the list that ought to be challenged. Regardless, they will be left untouched.

In sum, Jaggar criticizes Nussbaum’s justifications for the capabilities approach since they ignore asymmetrical power relationships. Jaggar believes that even though Nussbaum claims to be paying attention to such relations, she paradoxically fails to produce a theory that yields an outcome that is cognizant of power. It’s worth noting, though, that Jaggar does not believe these criticisms ultimately entail rejecting the capabilities. Rather, she believes that placing discourse ethics as the main justification for the capabilities may allow the theory to be self-critical, and thus, fully aware of power dynamics.

b. What Is a Capability?

Williams’ (1987) primary concern of the capabilities approach is trying to understand what is meant by a ‘capability.’ In pursuing this inquiry, he believes Sen in particular, but capabilities proponents in general, are unclear on the relationship between ‘choice’ and ‘capability.’ Williams does not provide knock-down arguments against the capabilities, but rather poses two challenges for the capabilities theorist to consider.

First, Williams asks what it means to have the capability to do X? Consider his example. If a person is posted once a year to a desirable holiday resort, does she have the capability to go? In a trivial sense, “yes,” but not in a meaningful way (that is, in a way that contributes to the well-being of an individual). If the term ‘capability’ is understood merely as ‘possibility,’ then it could be granted that she has the capability to go, although, there is still something missing–namely, the ability to choose whether or not to go. This example is meant to illustrate the correlation between capabilities and choice. That is, according to Williams, in this case a capability cannot exist without the option to choose it. However, consider Sen’s example where a capability exists without the ability to choose it. Sen, in his Tanner Lectures, notes that the life expectancy is higher in China than India. He believes this example shows that the higher one’s life expectancy the higher the capability of a standard of living. In response to this claim William asks, what capability is increased by a greater life expectancy? He poses this question since it might be the case that living longer only contributes to one having more time to contemplate whether to commit suicide. In this example, Williams is pointing out the problems with the relationship of a capability that completely lacks choice.

Second, and related to the above challenge, William questions the relationship of the capability of doing X to the actual ability to do X here and now. He notes that the ‘actual ability to do X’ can be understood as ‘can do X.’ In other words, if a person possesses the capability to do X, then it must be the case that she can do X. Consider Sen’s example of the capability of breathing unpolluted air. He would argue that if a person has the capability to breathe unpolluted air, then she can do so. Williams grants that a person living in Los Angeles cannot breathe unpolluted air here and now, however, that is not to say she cannot do so at all. In other words, this person has the capability to breathe unpolluted air, but she cannot do it here and now; this position is contrary, though, to Sen’s claim above that if one has the capability to do X, she can do X. Because she has the capability to breathe unpolluted air, she should move to a place where it is possible to do so. Williams argues, though, that there are large costs associated with moving to a place where she can breathe unpolluted air. Let’s assume that person does not have the economic means to do so. Does this person really have the capability, then, to breathe unpolluted air?–logically speaking, “yes,” however, certainly not in any meaningful sense. By considering the opportunity costs associated with a capability such as breathing unpolluted air, some capabilities may become nearly impossible for many to acquire. Thus, Williams argues it is not simply because one can do X that one has the capability to do X.

6. Philosophical Applications

The capabilities approach is often discussed in terms of providing opportunities (Sen) and using human powers (Nussbaum). More often than not it is an argument to reduce poverty or increase the well-being of people around the globe. Recently, it has provided the framework to further advance arguments in other areas of applied ethics including business ethics, the environment, disability ethics and animal ethics. This entry will merely focus on the environment and disability ethics because it calls attention to how far the capabilities approach can be extended.

a. The Environment

The biggest challenge facing capabilities theorists in regards to the environment is on the area of emphasis. The goal of the capabilities–whether Sen or Nussbaum–is human flourishing or well-being. It is never simply understood as non-human or ecological flourishing. Of course, this is not to say that the capabilities approach has nothing to say about the environment, or worse, that it must harm it in order for human beings to flourish, although, there are obstacles standing in the way when putting forth not only an environmentally friendly capabilities approach, but one in which environmental flourishing is taken just as seriously as human flourishing.

There seems to be two ways in which we can approach environmental ethics from a capabilities perspective. By briefly examining each solution, we will have a broader perspective of how the capabilities approach begins to asses environmental concerns. First, one may begin with the capabilities list, and show how environmental values relate to human flourishing. Recall Nussbaum’s eighth capability (out of ten): Other species have the ability to have a concern for and live with others animals, plants and the environment at large. There are two points we can take from this capability. First, Nussbaum believes the environment clearly plays a role in human flourishing otherwise she would not have included it as a capability. Even though the environment seems to be playing an instrumental role insofar as it contributes to human flourishing, it is nonetheless an essential capability. Furthermore, Nussbaum’s list is beneficial because she believes it should be implemented as public policy which would force countries that do not take the environmental capability seriously to reconsider their current policies. Second, however, Victoria Kamsler (2006) recalls that she places it eighth on the list which, she argues, is hard to deny that it is given less emphasis than on almost all the other capabilities. In defense of Nussbaum, she notes that all the capabilities are meant to be mutually reinforcing, and thus, the dignity of a human being as truly human cannot be met without taking environment flourishing seriously.

Second, rather than starting with the list and placing instrumental value on the environment, one may begin with a general account of flourishing that can be applied to non-human beings such as animals and the environment. Here, the environment is understood as being intrinsically valuable (that is, valuable independent of human beings). Kamsler notes that Nussbaum believes the “most basic intuition behind [the] capability theory… ‘wants to see each thing flourish as the sort of thing that it is'”. In other words, the environment qua capability must be treated as an entity that must flourish in its own right, and not merely for the value it provides human beings.

There still remains a lingering question about the relationship between the environment and the capabilities approach. If the capability is understood as anthropocentric insofar as it is concerned with human flourishing, what should we do when the environment impedes such flourishing? In other words, there seem to be cases in which being concerned with the environment’s flourishing will directly conflict with human flourishing (for example, the capability of work and protecting forests). Kamsler addresses this conflict when she says that the only way to overcome this seemingly tragic dilemma is through technological and political means. This is not to say that it will not be costly or conflict with other capabilities, but it is a solution that goes beyond being complacent with the dilemma.

b. Disability Ethics

A person cannot be said to flourish, according to the capabilities approach, if she is unable to perform functions that partake in the capabilities. This raises interesting questions with people who have disabilities insofar as they may be either physically or mentally impaired from having the ability to perform many functions. Nussbaum has given this topic ample discussion through her Tanner Lectures and various publications.

Nussbaum addresses the question of disabilities via the capabilities approach through her list. Her early formulation of the capabilities list excluded many people from the ability to live a truly human life since she required such a life to include using all five senses, for example. She has since retracted from such bold statements. However, Nussbaum (1995) does note that it would be difficult to imagine a person living a truly human life with total lack of the senses, imagination and reasoning.

Nussbaum (2002) has extended her account of functioning in a truly human way (that is, for human dignity) “as containing many different types of animal dignity, all of which deserve respect and even wonder”. In other words, she believes the mentally disabled can gain dignity not merely from rationality, but also through support for the “capabilities of life, health, and bodily integrity. It will also provide stimulation for senses, imagination and thought” This passage indicates a clear responsibility on the state to not only allow for such stimulation of the senses to occur, but to actually provide the resources for such stimulation to occur.

There are interesting questions about how to implement policies that provide the best opportunity for disabled peoples to perform functions that fulfill capabilities. Nussbaum heralds the Individuals with Disabilities Education Act (IDEA) as a way to understand how the capabilities can be manifested in the current education system. IDEA is a disabilities act that begins with the idea of human individuality. Instead of lumping all disabled students into one group, each student is taken on a case-by-case basis. This approach in turn, allows for each student to receive the proper care she needs. This Act does not focus on education being a ‘human right’ because that would entail the goal of merely providing an education to the student, that is, ensuring she receives an education in one form or another. What makes this Act uniquely indebted to the capabilities is its commitment to providing the opportunity for the students to use their powers qua human beings to fulfill their functions in a truly human way–for example, via their senses, imagination and thought.

7. United Nations Development Program

The UNDP is an organization built on the theoretical principles of the capabilities approach. Its goals include helping countries best address solutions pertaining to democratic governance, poverty reduction, crisis prevention and recovery, environment and energy and HIV/AIDS. The organization is clear that none of these solutions will ever come at the expense of women since they are an advocate of empowering women. The four solutions listed here are designed to assist the various challenges facing nations. However, there are eight concrete goals the UNDP is interested in achieving.

The UNDP has put forth eight Millennium Development Goals (MDGs). The MDGs include the following: (1) eradicate extreme poverty and hunger, (2) achieve universal primary education, (3) promote gender equality and empower women, (4) reduce child mortality, (5) improve maternal health, (6) combat HIV/AIDS, malaria and other diseases, (7) ensure environmental sustainability and (8) develop a global partnership for development. The success or failure of achieving these goals is based on a measurement from the Human Development Report (HDR).

The HDR is designed to measure the ways in which people can live up to their full potential in accordance with their desires and interests. Mahbub ul Haq, founder of the HDR, says “the basic purpose of development is to enlarge people’s choices…[which include] greater access to knowledge, better nutrition and health services, more secure livelihoods, security against crime and physical violence, satisfying leisure hours, political and cultural freedoms and sense of participation in community activities.” There are two points to take from this. First, it is clear that the theoretical aspects of the capabilities approach have been preserved upon measuring the MDGs. Second, the HDR is not committed to merely measuring wealth, but rather providing the opportunities for a person to fulfill any of the capabilities she is interested in pursuing.

8. References and Further Reading

  • Appiah, Kwame A. (2006) Cosmopolitanism: Ethics in a World of Strangers, W.W. Norton: NY.
  • Benhabib, Seyla (1995) “Cultural Complexity, Moral Interdependence, and the Global Dialogical Community” in Women, Culture and Development, Martha C. Nussbaum and Jonathan Glover (eds.), Clarendon Press: Oxford.
  • Boylan, Micahel (2004) A Just Society, Rowman & Littlefield Publishers, Inc: Lanham, MD.
  • Crocker, David (2008) Ethics of Global Development: Agency, Capability and Deliberative Democracy, Cambridge University Press: NY.
  • Daniels, Norman (1989) Reading Rawls: Critical Studies on Rawls’ “A Theory of Justice,” Stanford University Press: Stanford, CA.
  • Donnelly, Jack (2003) Universal Human Rights in Theory and Practice, Cornell University Press: Ithaca.
  • Düwell, Marcus (2009) “On the Possibility of a Hierarchy of Moral Goods,” in Morality and Justice: Reading Boylan’s A Just Society, John-Steward Gordon (ed.), Rowman & Littlefield Publishers, Inc: Lanham, MD.
  • Gewirth, Alan (1978) Reason and Morality, The University of Chicago Press: Chicago, IL.
  • Gewirth, Alan (1996) The Community of Rights, The University of Chicago Press: Chicago, IL.
  • Griffin, James (2008) On Human Rights, Oxford University Press, Oxford.
  • Jaggar, Alison (2006) “Reasoning About Well-Being: Nussbaum’s Methods of Justifying the Capabilities,” The Journal of Political Philosophy, 14:3, 301-322.
  • Kymlicka, Will (1989) Liberalism, Community and Culture, Oxford University Press, Oxford.
  • MacIntyre, Alasdair (1988) Whose Justice? Which Rationality?, University of Notre Dame Press, Notre Dame, IN.
  • MacIntyre, Alasdair (2007) After Virtue, Notre Dame University Press: Notre Dame, IN.
  • Nussbaum, Martha (1995), “Human Capabilities, Female Human Beings,” in Women, Culture, and Development: A Study of Human Capabilities, Martha C. Nussbaum and Jonathan Glover (eds.), Clarendon Press: Oxford.
  • Nussbaum, Martha (2000) Women and Human Development: The Capabilities Approach, Cambridge University Press: Cambridge, MA.
  • Nussbaum, Martha (2002) “Capabilities and Disabilities: Justice for Mentally Disabled Citizens,” Philosophical Topics, 30:2, 133-165.
  • O’Neill, Onora (1996) Towards Justice and Virtue: A Constructive Account of Practical Reason, Cambridge University Press: Cambridge, MA.
  • Pogge, Thomas W. (1989) Realizing Rawls, Cornell University Press, Ithaca.
  • Rawls, John (1999) A Theory of Justice, Harvard University Press: Cambridge, MA.
  • Rawls, John (1999) The Law of Peoples, Harvard University Press: Cambridge, MA.
  • Sandel, Michael (1996) Democracy’s Discontent: America in Search of a Public Philosophy, Harvard University Press: Cambridge, MA.
  • Sandel, Michael J. (1982) Liberalism and the Limits of Justice, Cambridge University Press, Cambridge.
  • Sen, A. K. (1985) “Well-being, Agency and Freedom: the Dewey Lectures,” Journal of Philosophy, 82:4, 169-221.
  • Sen, Amartya K. (1987) The Standard of Living: The Tanner Lectures, Cambridge: Cambridge University Press.
  • Sen, Amartya (2009) The Idea of Justice, Harvard University Press: Cambridge, MA.
  • Singer, Peter (1972) “Famine, Affluence, and Morality,” Philosophy and Public Affairs, 1:1, 229-243.
  • Kamsler, Victoria (2006) “Attending to nature: capabilities and the environment,” in Capabilities Equality: Basic issues and problems (ed.) Alexander Kaufman, Routledge: NY, 198-213.
  • Williams, Bernard (1987), “The Standard of Living: Interests and Capabilities,” in The Standard of Living (ed.), Geoffrey Hawthorn, Cambridge University Press: Cambridge, 94-102.

Author Information

Chad Kleist
Email: chad.kleist@marquette.edu
Marquette University
U. S. A.

Autonomy: Normative

Autonomy is variously rendered as self-law, self-government, self-rule, or self-determination. The concept first came into prominence in ancient Greece (from the Greek auto-nomos), where it characterized city states that were self governing. Only later–during the European Enlightenment–did autonomy come to be widely understood as a property of persons. Today the concept is used in both senses, although most contemporary philosophers deal with autonomy primarily as a property of persons.  This orientation will be maintained here.

Most people would agree that autonomy is normatively important. This agreement is reflected both in the presence of broad assent to the principle that autonomy deserves respect, and in the popular practice of arguing for the institution (or continuation, or discontinuation) of public policy based in some way on the value of self-determination. Many also believe that developing and cultivating autonomy is an important–indeed, on some accounts, an indispensable–part of living a good life. But although the claim that autonomy is normatively significant in some way is intuitively compelling, it is not obvious why autonomy has this significance, or what weight autonomy-based considerations should be given in relation to competing normative considerations. In order to answer these questions with sufficient rigor, it is necessary to have a more detailed understanding of what autonomy is.

This article will be devoted to canvassing the leading work done by philosophers on these two issues, beginning with the question of the nature of autonomy, and then moving to the question of the normative significance of autonomy. It will be seen that autonomy has been understood in several different ways, that it has been claimed to have normative significance of various kinds, and that it has been employed in a wide range of philosophical issues. Special attention will be paid to the question of justification of the principle of respect for autonomous choice.

Table of Contents

  1. History of the Concept of Autonomy
  2. Conceptions of Autonomy
    1. Moral Autonomy
    2. Existentialist Autonomy
    3. Personal Autonomy
    4. Autonomy as a Right
  3. The Normative Roles of Autonomy
    1. Autonomy in Ethical Theory
    2. Autonomy in Applied Ethics
    3. Autonomy in Political Philosophy
    4. Autonomy in Philosophy of Education
  4. Warrant for the Principle of Respect for Autonomous Choice
  5. References and Further Reading

1. History of the Concept of Autonomy

The concept of autonomy first came into prominence in ancient Greece, where it characterized self-governing city-states. Barring one exception (mentioned below), autonomy was not explicitly predicated of persons, although there is reason to hold that many philosophers of that time had something similar in mind when they wrote of persons being guided or ruled by reason. Plato and Aristotle, for example–as well as many of the Stoics–surely would have agreed that a person ruled by reason is a properly self-governing or self-ruling person. What one does not find, however, are ancient philosophers speaking of the ideal of autonomy as that of living according to one’s unique individuality. The one exception to this appears to be found in the thinker and orator Dio of Prusa (ca. 50–ca. 120), who, in his 80th Discourse, clearly seems to predicate autonomy of individual persons in roughly the sense in which it has come to be understood in our own day (see Cooper 2003).

Medieval philosophers made no use of the concept of autonomy that is worthy of note, although once again, many medieval philosophers would have doubtless agreed that those who live in accordance with right reason and the will of God are properly self-governing. The concept of autonomy wouldn’t be circulated in learned circles again until the Renaissance and early modern times, when it was employed both in the traditional political sense, and in an ecclesiastical sense, to refer to churches that were–or at least claimed to be –independent of the authority of the Roman Catholic Pope (see Pohlmann 1971).

The concept of autonomy came into philosophical prominence for the first time with the work of Immanuel Kant. Kant’s work on autonomy, however, was strongly influenced by the writings of Jean-Jacques Rousseau, so a brief word on Rousseau is in order.  Although Rousseau did not use the term ‘autonomy’ in his writings, his conception of moral freedom–defined as “obedience to the law one has prescribed to oneself”– has a clear relation to Kant’s understanding of autonomy (as will be shown below). Moreover, Rousseau wrote of moral freedom as a property of persons, thus presaging Kant’s predication of autonomy of persons. The connections between Rousseau and Kant cannot be taken too far, however; for Rousseau was primarily concerned with the question of how moral freedom can be achieved and sustained by individuals within society given the presence of relations of social dependency and the possibility of domination, whereas Kant was primarily concerned with the place of autonomy in accounts of the subjective conditions requisite for, and the nature of, morality. Because of the connections Kant drew between autonomy and morality, Kant’s conception of autonomy is sometimes referred to as ‘moral autonomy’.

In the nineteenth century, John Stuart Mill contributed to the discussion on the normative significance of autonomy in his work On Liberty. Although Mill did not use the term ‘autonomy’ in this work, he is widely understood as having had self-determination in mind. Mill’s work continues to have considerable influence on discussions on the normative significance of autonomy in relation to paternalism of various kinds.

A tremendous amount of research on autonomy has taken place in the last several decades in both the analytic and continental traditions. Continental philosophers speak more often of authenticity than of autonomy, but there are clear connections between the two, insofar as the ‘self’ in ‘self-determination’ is plausibly understood as the authentic self. Philosophers working in the analytic tradition have gone into great detail attempting to discern necessary and sufficient conditions for the presence of autonomy, as well as to uncover the ground and implications of its normative significance.

2. Conceptions of Autonomy

There are several different conceptions of autonomy, all of which are loosely based upon the core notions of self-government or self-determination, but which differ considerably in the details.

a. Moral Autonomy

As mentioned, moral autonomy is associated with the work of Kant, and is also referred to as ‘autonomy of the will’ or ‘Kantian autonomy.’ This form of autonomy consists in the capacity of the will of a rational being to be a law to itself, independently of the influence of any property of objects of volition. More specifically, an autonomous will is said to be free in both a negative and a positive sense. The will is negatively free in that it operates entirely independently of alien influences, including all contingent empirical determinations associated with appetite, desire-satisfaction, or happiness. The will is positively free in that it can act in accordance with its own law. Kant’s notion of autonomy of the will thus involves, as Andrews Reath has written, “not only a capacity for choice that is motivationally independent, but a lawgiving capacity that is independent of determination by external influence and is guided by its own internal principle–in other words, by a principle that is constitutive of lawgiving” (Reath 2006).  Now, because the lawgiving of the autonomous will contains no content given by contingent empirical influences, this lawgiving must be universal; and because these laws are the product of practical reason, they are necessary. Insofar, then, as Kant understood moral laws as universal and necessary practical laws, it can be seen why Kant posited an essential connection between the possession of autonomy and morality: the products of the autonomous will are universal and necessary practical laws–that is, moral laws. It is thus by virtue of our autonomy that we are capable of morality, and we are moral to the extent that we are autonomous. It is for this reason that Kant’s conception of autonomy is described as moral autonomy. Moral autonomy refers to the capacity of rational agents to impose upon themselves–to legislate for themselves–the moral law.

Furthermore, the capacity for autonomy, according to Kant, is “the basis of the dignity of human and of every rational nature;” and in accordance with this rational nature, is an end in itself. Furthermore, it “restricts freedom of action, and is an object of respect”. Many thinkers have followed Kant in grounding the dignity of persons (and respect for persons generally) in our capacity for autonomy (although it should be noted that not all of these thinkers have accepted Kant’s conception of autonomy). More will be said on this below.

Moral autonomy is said to be a bivalent property possessed by all rational beings by virtue of their rationality–although according to Kant, it is certainly possible not to live in accordance with its deliverances in practice (for more on Kant’s conception of autonomy, see Hill 1989, Guyer 2003, and Reath 2006).

One of the most common objections to this conception of autonomy is that such a robust form of independence from contingent empirical influences is not possible. Kant defended the possibility of such robust independence by arguing that human agents inhabit two realms at once: the phenomenal realm of experience, in relation to which we are determined; and a noumenal or transcendental realm of the intellect, in relation to which we are free. Given the further claim that our noumenal self can exercise efficient causality in the phenomenal realm, Kant held that our autonomy is in large part constituted by our noumenal freedom. The postulation of such a form of freedom may be criticized as metaphysically extravagant, however; and if such freedom is not possible, then neither is moral autonomy in Kant’s strict sense. Some thinkers have argued that Kant’s theorization on the noumenal realm was not meant to have metaphysical significance. Thomas Hill has argued, for example, that Kant may have been merely elaborating on the practical conditions in which we must understand ourselves insofar as we conceive ourselves as free. Objectors have insisted, however, that Kant intended to assert the more robust form of metaphysical freedom. Indeed, it could be pressed, he must have; for without this sense of freedom being operative, actual autonomy–and hence morality, by Kant’s lights–would not be possible.

b. Existentialist Autonomy

Existentialist autonomy is an extreme form of autonomy associated principally with the writings of Jean Paul Sartre. It refers to the complete freedom of subjects to determine their natures and guiding principles independently of any forms of social, anthropological or moral determination. To possess existentialist autonomy is thus to be able to choose one’s nature without constraint from any principles not of one’s own choosing. Sartre held this radical freedom to be entailed by the truth of atheism. According to Sartre, God’s nonexistence entails two key conclusions: firstly, humans cannot have a predetermined nature; and secondly, there cannot exist a realm of values possessing independent validity. Taken together, this entails that human beings are radically free: “For if indeed existence precedes essence, one will never be able to explain one’s action by reference to a given and specific human nature; in other words, there is no determinism–man is free, man is freedom. Nor, on the other hand, if God does not exist, are we provided with any values or commands that could legitimize our behavior. Thus we have neither behind us, nor before us in a luminous realm of values, any means of justification or excuse.”  Fettered neither by a predetermined nature nor by an independently existing order of values, “[m]an is nothing else but what he makes of himself” (Sartre 1946).

Like moral autonomy, existentialist autonomy is a bivalent property which all human persons are said to possess (although possibly without being aware of this). Unlike moral autonomy, however, existentialist autonomy has no necessary connections to morality or to rationality as traditionally conceived.

The primary objection to existentialist autonomy is that it is too radical to be plausible.  Even if God does not exist, it is argued, it does not follow that humans lack a nature that determines–at least to some extent–their choices, tendencies, proclivities, and guiding principles. A thoroughly naturalistic conception of human nature, informed by an understanding of the evolutionary forces operative in human psychology, seems to militate against the notion that humans are as unbounded as existentialist autonomy suggests we are. At the very least, it could be argued that empirical evidence does not speak in favor of the existence of existentialist autonomy in any robust form.

c. Personal Autonomy

Without question, the majority of contemporary work on autonomy has centered on analyses of the nature and normativity of personal autonomy. Personal autonomy (also referred to as ‘individual autonomy’) refers to a psychological property, the possession of which enables agents to reflect critically on their natures, preferences and ends, to locate their most authentic commitments, and to live consistently in accordance with these in the face of various forms of internal and external interference. Personally autonomous agents are said to possess heightened capacities for self-control, introspection, independence of judgment, and critical reflection; and to this extent personal autonomy is often put forth as an ideal of character or a virtue, the opposite of which is blind conformity, or not ‘being one’s own person.’

As mentioned above, personal autonomy has an essential relation to authenticity: the personally autonomous agent is the agent who is effective in determining her life in accordance with her authentic self. Personal autonomy is thus constituted, on the one hand, by a cluster of related capacities (often termed ‘authenticity conditions’), centered on identifying one’s authentic nature or preferences and, on the other hand, by a cluster of capacities (often termed ‘competency conditions’) that are centered on being able effectively to live in accordance with these throughout one’s life in the face of various recalcitrant foreign influences. These capacities may be possessed singly or in unison, and often require a considerable amount of life experience to assume robust forms.

One of the most intractable problems surrounding personal autonomy concerns the analysis of the authentic self (the ‘self’ in ‘self-determination’, as it were).  Some philosophers have claimed that no such self exists; and indeed, some philosophers claim that no self exists at all (for an overview of these problems, see Friedman 2003 and Mackenzie & Stoljar 2000).  Most philosophers accept the possibility of the authentic self at least as a working hypothesis, however, and concentrate attention on the question of how authenticity is secured by an agent. The most popular and influential account is based on the work of Harry Frankfurt and Gerald Dworkin. According to their ‘hierarchical’ account, agents validate the various commitments (beliefs, values, desires, and so forth) that constitute their selves as their own by a process of reflective endorsement. On this account, agents are said to possess first-order and second-order volitions. Our first-order volitions are what we want; and our second-order volitions are what we want to want. According to the hierarchical model, our first-order desires, commitments, and so on are authentic when they are validated by being in harmony with our second-order volitions: that is, when we want what we want to want. Following from this model, an agent is autonomous in relation to a given object when the agent is able to determine her first-order volitions (and corresponding behavior) by her second-order volitions. A simple example may help to illustrate the model. Say that I am a smoker. Although I enjoy lighting up, I do not reflectively endorse my smoking; I desire it, but I do not want to desire it. On the hierarchical model, smoking is not an aspect of my authentic self, because I do not reflectively endorse it; and to the extent that I am unable to change my habits, I am not autonomous in relation to smoking. Conversely, if I can bring my first-order volitions into harmony (or identity) with my second-order volition, then my desire is authentic because it is reflectively endorsed; and to the extent that I can mold my behavior in accordance with my reflective will, I am autonomous in relation to smoking.  Persons who possess the requisite capacities to form authentic desires and effectively to generally live in accordance with them are autonomous agents according to this model (see Frankfurt 1971, 1999 and Dworkin 1988).

The hierarchical model remains–in outline, at least–the leading account of authenticity undergirding most contemporary accounts of personal autonomy, although it has been attacked on many fronts. The primary objection tendered against this account is ‘the problem of origins.’ As we have seen, authentic selfhood as reflective endorsement holds that my authentic self is the self that I reflectively ratify: the self that I endorse as expressing, in a deep sense, who I fundamentally am or wish to be. The problem of origins arises when one attempts to explain how this act of reflective endorsement actually constitutes a break from other-determination (that is, from foreign influence). For, could it not be the case that what appears to me to be an independent act of reflective endorsement is itself conditioned by other-determining factors and therefore ultimately an other-determined act? If this is the case, then it doesn’t seem that the possession of autonomy or the making of autonomous choices is possible. In short, the problem is how to sustain an account of self-determination that is not threatened by the pervasive effects of other-determination (see Taylor 2005 for elaboration on the problem of origins and related sub-problems). Much work on theories of personal autonomy has been explicitly devoted to addressing precisely these sorts of difficulties.

Besides analyzing and clarifying the authenticity conditions necessary for autonomy, philosophers have also worked on providing a thorough account of the competency conditions necessary for the presence of autonomy (see Meyers 1989, Mele 1993, and Berofsky 1995). Competency conditions, as we have seen, are those capacities or conditions that need to be present in order for one to be effective in living according to one’s authentic self-conception in the face of various kinds of interference to that end.  Examples of competency conditions include self-control, logical aptitude, instrumental rationality, resolve, temperance, calmness, and a good memory.

In addition to authenticity and competency conditions, many theories of personal autonomy require the presence of certain external enabling conditions: that is, external or environmental (social, legal, familial, and so forth) conditions which are more than less out of the agent’s control, but which must be in place in order for fully autonomous living to be possible. Such enabling conditions include, for example, a modicum of social freedom, an array of substantive options for choice, the presence of authenticity-oriented social relations, and autonomy-supporting networks of social recognition and acknowledgment (see Raz 1986 and Anderson & Honneth 2005). Without these conditions, effective autonomous living is said by some to be impossible, even where authenticity and competency conditions are robustly satisfied. Different autonomy theorists place different emphases on external enabling conditions. Some contend that external enabling is a necessary condition for autonomy (see Oshana 1998). Others hold that autonomy more properly concerns agential satisfaction of authenticity and competency conditions, regardless of whether the external environment allows for actual autonomous expression (see Christman 2007). Both views can claim some intuitive support. On the one hand, it is reasonable to hold that it is only fitting to call a person ‘autonomous’ if that person is in fact effective in living according to her authentic self-conception. Yet, it also makes sense to call persons ‘autonomous’ who have formed an authentic self-conception and possess the requisite competency conditions effectively to express that self-conception, but happen to lack the contingent socio-relational conditions that allow for the expression of that authentic self. A possible solution to this impasse may be to avoid seeking hard and fast borders to the existence of autonomy, and say that autonomy is present in both cases, but is more robust where the proper external enabling conditions are in place.

The question of normative commitments associated with personal autonomy possession has also been a matter of some dispute. Many philosophers hold that autonomy is normatively content-neutral. According to this account, one (or one’s commitments) can be autonomous regardless of the values one endorses. On this account, one could commit to any kind of life–even the life of a slave–and still be autonomous (see, for example, Friedman 2003). Other philosophers hold that autonomy possession requires substantive normative constraints of some kind or other–at the very least, it is argued that one must value autonomy in order to be truly autonomous (see Oshana 2003). As with the debate just mentioned, both sides of this debate can claim some intuitive support; this can be shown through the asking of opposing but seemingly equally compelling (apparently rhetorical) questions; namely, ‘Can’t one autonomously choose whatever one wants?’, and, ‘How can we call someone autonomous who doesn’t value or seek autonomous living?’ One possible solution to this debate is to say that while almost any individual choice can be autonomous, persons cannot live autonomous lives as a whole without some commitment to the value of autonomy.

Unlike moral and existentialist autonomy, personal autonomy is possessed in degrees, depending on the presence and strength of the constellation of internal capacities and external enabling conditions that make it possible. While not all persons possess personal autonomy, it is commonly claimed that virtually everyone–with the exception of the irredeemably pathological and the handicapped–possesses the capacity for personal autonomy. In addition, the links between personal autonomy possession and moral agency are usually said to be thin at best. Even those who hold that personal autonomy possession requires substantive normative commitments of some kind (such as, for example, a commitment to the value of autonomy itself), they usually hold that it is quite possible to be an autonomous villain. Some philosophers have argued that personal autonomy possession requires the presence of normative competency conditions that effectively provide agents with the capacity to distinguish right from wrong (see Wolf 1990), but this strong account is in general disfavor, and even if the account is correct, few would argue that this means that personally autonomous agents must also always act morally. In the face of this, one may wonder why autonomy-based claims are said to generate demands of respect upon others. This question will be dealt with in more detail in section 4 below.

Lastly, a word should be given on the relation between personal autonomy and freedom (or liberty, which is here taken to be synonymous with freedom). Although it is not uncommon to find the terms ‘(personal) autonomy’ and ‘freedom’ used essentially synonymously, there are some important differences between them.

More often than not, to claim that a person is free is to claim that she is negatively free in the sense that she is not constrained by internal or external forces that hinder making a choice and executing it in action. There is a clear distinction between autonomy and negative freedom, however, given that autonomy refers to the presence of a capacity for effective authentic living, and negative freedom refers to a lack of constraints on action.  It is entirely possible for a person to be free in this negative sense but nonautonomous, or–on accounts that do not require the presence of external enabling conditions for autonomy to be present–for a person to be autonomous but not (negatively) free.

Some writers also speak of positive freedom, and here the connections with autonomy become much deeper. Speaking very generally, to be free in this sense is to possess the abilities, capacities, knowledge, entitlements or skills necessary for the achievement of a given end. For example, I am only (positively) free to win an Olympic gold medal in archery if I am extremely skilled in the sport. Here it should be clear that one can be positively free in many ways and yet not be autonomous. Some philosophers, however, following Isaiah Berlin (Berlin 1948), have described positive freedom in such a way that it becomes basically synonymous with personal autonomy. Like autonomy, the conception of freedom that is operative in a given discussion can vary considerably; but more often than not personal autonomy is distinguished from freedom by the necessary presence, in the former, of a connection to the authenticity of the agent’s self-conception and life-plan–a connection that is usually not found in conceptions of freedom.

d. Autonomy as a Right

Lastly, autonomy is sometimes spoken of in a manner that is more directly normative than descriptive. In political philosophy and bioethics especially, it is common to find references to persons as autonomous, where the autonomy referred to is understood principally as a right to self-determination. In these contexts, to say that a person is autonomous is largely to say that she has a right to determine her life without interference from social or political authorities or forms of paternalism. Importantly, this right to self-directed living is often said to be possessed by persons by virtue either of their potential for autonomous living or of their inherent dignity as persons, but not by virtue of the presence of a developed and active capacity for autonomy (see Hill 1989). Some have argued that political rights (Ingram 1994) and even human rights generally (Richards 1989) are fundamentally based upon respect for the entitlements that attend possessing the capacity for autonomy.

3. The Normative Roles of Autonomy

Although disagreements concerning the nature of autonomy are rife, almost no one disagrees that autonomy has normative significance of some kind; and this agreement is found both in relation to the claim that autonomy is normatively significant for the autonomous agent and to the claim that autonomy is normatively significant for the addressees of autonomy-based demands. Following from this, autonomy plays an important normative role in a variety of philosophical areas.

a. Autonomy in Ethical Theory

Autonomy is referenced or invoked in a number of key ways in ethical theory:

(i) Autonomy serves as a ground for the claims that persons have dignity and inherently deserve basic moral respect

(ii) Autonomy is said to have a value that grounds the claim that persons deserve to be told the truth

(iii) Autonomy is referenced as a fundamental principle of ethics in Kantian deontology

(iv) Autonomy is commonly viewed as a key component of human well-being (and is therefore significant for utilitarianism)

(v) Autonomy is defended as an important virtue

(vi) Autonomy is said to be necessary for moral responsibility

(vii) Autonomy is said to have a value that grounds the claim that autonomy-based demands are worthy of special respect

(i) Ever since Kant, autonomy (or the capacity for autonomy) has been referenced by some philosophers as that property of human beings by virtue of which they possess inherent dignity and therefore inherently deserve to be treated with basic moral respect.  Kant’s justification for the claim that autonomy grounds the inherent dignity of persons was based on the view that it is by virtue of our autonomy that we are ends-in-ourselves.  Beings that lack autonomy are, precisely because of this lack, essentially at the mercy of the determinism that characterizes the phenomenal realm: they are controlled by forces that have nothing to do with their own will. Beings that possess autonomy on the other hand, are, precisely because of this possession, free from this determination; they have the capacity for freedom through the active exercise of their autonomous wills, which allows for the legislation of universal law. Autonomous agents are not passive players in life; they are active agents, determining themselves by their own will, the authors of the laws that they follow (see Guyer 2003). As such, they are not passive means towards nature’s determined ends, but are ends-in-themselves, by virtue of which they possess inherent dignity and deserve basic moral respect. Many have followed Kant in referencing autonomy as the ground of human dignity and as the basis of the basic moral respect owed to persons, although not all have followed Kant in the details of his account (for a recent account that moves away from Kant’s conception of noumenal freedom, see Korsgaard 1996). The most common objection leveled against this account is that it runs into problems involving exclusion. Most would argue that the mentally handicapped, for example, are owed basic moral respect, even if they do not possess (even the capacity for) autonomy. And if human dignity is indexed to the presence of autonomy, it is argued, this would entail, counter-intuitively, that those who are more autonomous have more dignity, and are more worthy of respect. It may also be argued that the capacity for autonomy is a poor ground for human dignity (and respect for persons) for other reasons–for example, because autonomy has no essential connection to morality, or because better grounds are available, or because the very project of grounding human dignity on a property of some kind is ill-conceived. Despite these worries, however, appeals to autonomy as a basis for human dignity and basic moral respect remain quite popular.

(ii) Some philosophers have argued that a proper appreciation for others as autonomous (or as possessing the capacity for autonomy) requires that one not seek to deceive them.  Respect for autonomy is thus said to have an important relation to truthfulness. In Thomas Hill’s words, “Lies often reflect inadequate respect for the autonomy of the person who is deceived.” (Hill 1991) We saw above that autonomy’s value has been used to ground the basic moral respect owed to persons; and the present injunction against deception may be viewed as a specific form that autonomy-based respect for persons may take. It is easy to see why a connection between respect for autonomy and truthfulness (or what comes to the same thing–an injunction against deception) has been attractive to some philosophers, especially those in the Kantian tradition. When we deceive others for our own purposes, we bypass their reflective abilities and make them instruments in the achievement of our own ends, and in doing this we fail to treat them as persons capable and deserving of self-determination. Proper respect for persons as autonomous thus requires a commitment to truthfulness. It has been argued, however, that one may respect and value the autonomy of another while deceiving them at the same time (Buss 2005). One may, for example, use forms of deception so that another’s capacity for autonomy may flourish. The basic idea here is that one may still reason for oneself despite being deliberately influenced by the deceptive behavior of others. As Sarah Buss writes, “To put it somewhat crudely, whether an instance of practical reasoning is self-determined is a matter of whether it is really the agent herself who is doing the reasoning. And this would seem to depend on whether she determines her response to the considerations that figure in her reasoning–not on how the considerations to which she responds relate to reality, nor on how she came to be aware of these considerations.” It may be argued, however, that the conception of autonomy underlying this claim is too thin to be acceptable, and that a better conception would contain the resources necessary to judge self-determining reasoning influenced by the deliberate deception of another as nonautonomous. In this vein, some have argued that a person is autonomous in relation to a given desire or choice only if that person would not feel alienated from the causal process that gave rise to that desire or choice (Christman 2007). On the assumption that persons would feel alienated from deceptive desire- or choice-forming processes, the associated desires or choices would not count as autonomous. In response to this, however, it may be argued that autonomous agents may not feel alienated from all (or many) deceptive forms of influence upon the formation of their desires and choices, depending on the circumstances (Buss 2005). If this were the case, then a commitment to the value of autonomy may not be inconsistent with certain forms of deception or manipulation. Yet, given the traditional opposition between autonomous self-determination and agential determination rooted in deceit and manipulation, it is to be expected that resistance to the notion that they are not incompatible will continue.

(iii) Autonomy plays a key role in Kant’s deontological ethics. We have already seen this in the way in which Kant grounds human dignity in autonomy. But autonomy plays a further (and closely related) normative role for Kant. It is often said that Kant held that the Categorical Imperative can be expressed in three closely related formulas: the Formula of Universal Law, the Formula of Humanity, and the Formula of the Kingdom of Ends. It has also been claimed, however, that Kant defended a fourth formula, which may be called the Formula of Autonomy. Although Kant did not state this formula explicitly, it has been argued that it can be plausibly derived from his description of the Categorical Imperative as “the idea of the will of every rational being as a will that legislates universal law.” The corresponding Formula of Autonomy could then be expressed as an imperative in this way: act so that the maxims you will could be the legislation of universal law. According to this formula, we should act according to principles that express the autonomy of the will. This formulation is important, firstly because it suggests that Kant conceived autonomy as a normative principle (and not merely as a condition of the will that makes morality possible), and secondly because it further reinforces Kant’s claim that humans, as autonomous law-givers, are the source of the universal law that guarantees their freedom and hence marks them out as possessing inherent dignity (see Reath 2006).

(iv) Autonomy is commonly held to be a core component of well-being. This view goes back at least to Mill’s On Liberty, and has been accepted by many contemporary philosophers as well (see for example Griffin 1986 and Sumner 1996). In this connection, some argue that autonomy is an intrinsic part of well-being, and others argue that being autonomous reliably leads to well-being (and hence has instrumental prudential value).  Although thus far, the normative importance of autonomy has been described as being associated primarily with deontology, the claim that autonomy is a core component of well-being shows that it can play a key role in consequentialist moral theories as well. Indeed, as will be discussed in greater detail below (section 4), although most defenses of the principle of respect for autonomy are deontological in nature, it is also possible to defend the principle on consequentialist grounds. From this point of view, it can be argued that autonomy deserves respect because respecting autonomy is reliably conducive to well-being.

(v) Autonomy has been claimed to be an important virtue to possess. It is not difficult to see why this is the case. The autonomous person is a person possessing a constellation of widely desirable qualities such as self-control, self-knowledge, rationality and reflective maturity. To be autonomous is to be self-governing; to be free from domination by foreign influences over one’s character and values; to ‘be one’s own person’. Following from this, it is claimed by some that autonomy is a great virtue to possess – one which constitutes an important part of human flourishing. It may be objected, however, that an excessive concern with autonomy can be at odds with virtue, especially if robust autonomy entails an inability to exhibit loyalty or fidelity to projects, other persons or communities. Recent work on personal autonomy, however, has tended to support the notion that autonomy possession is not incompatible with these and similar forms of attachment (Friedman 2003).

(vi) Autonomy has been seen by some thinkers as having implications for a correct account of moral responsibility. Some accounts hold that autonomy is a necessary condition for moral responsibility. The basic defense of this claim is that it makes little sense to say that someone is morally responsible for her actions if she is not the author of those actions; and since one is the author of one’s actions only if one is autonomous, autonomy possession is necessary for moral responsibility. According to this account, the class of actions that are autonomous and the class of actions for which we are morally responsible are identical, or at least almost so (see Fischer and Ravizza 1998). Other accounts hold that although persons are certainly morally responsible for their autonomous actions, they are also morally responsible for a wider range of actions as well. An account of this sort is often made by those who hold a more demanding conception of autonomy; and defenders of this account argue that we still want to hold persons morally responsible for the many actions that do not satisfy robust autonomy conditions on the one hand, but are not constituted of sheer heteronomy (brainwashing, psychosis, coercion, and so forth) on the other (see Arpaly 2005).

(vi) Many thinkers believe that autonomous claims or demands are worthy of special normative uptake–special respect–by virtue of the fact that they are autonomous. It is important to see how this claim is different from the first point given above (viz., that autonomy is said to ground basic moral respect for persons). The former claim is that the fact that persons are autonomous (or have the capacity for autonomy) is what grounds their special dignity, by virtue of which they are owed basic moral respect. Now, it is possible to owe someone basic moral respect, but not to owe special respect to a subset of their choices. Imagine that someone is brainwashed, for example. Many would argue that although we owe that person basic moral respect (for example, we are obligated, say, not to harm them or to lie to them), we do not owe special respect to that person’s demands (say, to promote or not interfere with those demands). The current claim holds, however, that the fact that a person’s choices are autonomous generates special demands of respect for those choices over and above the basic respect owed to the chooser (whether this be conceived as being by virtue of their capacity for autonomy or not). This principle–that autonomous choice deserves special respect–may be justified in either a deontological or consequentialist manner. Because of the considerable importance of this principle, however, it deserves a more detailed discussion, which is provided in section 4 below.

b. Autonomy in Applied Ethics

The principle of respect for autonomy has had a considerable influence on applied ethics largely because of its versatility: it can be invoked in any applied ethics debate that bears (even remotely) on morally significant situations that involve the demands of self-determination, free choice, authenticity or independence. Seven of the most important of these debates–certainly not an exhaustive list–will be briefly canvassed below:

(i) Autonomy and informed consent

(ii) Autonomy and abortion

(iii) Autonomy and end-of-life decisions

(iv) Autonomy and same-sex marriage

(v) Autonomy and just war theory

(vi) Autonomy and advertising

(vii) Autonomy and environmental ethics

(i) Respect for autonomy has had a major influence on debates in medical ethics, especially those concerning the constraints that should be in place within the physician-patient relationship. Perhaps the most important such constraint is that of informed consent. According to this principle, a patient should not receive medical treatment of any sort unless she is well-informed enough as to the treatment’s nature and effects to be able to make an informed decision about it. The patient must agree to the treatment on the basis of this information. Many have argued that the requirement of informed consent is necessitated as part and parcel of a more basic imperative to respect patient autonomy (Dworkin 2006). Few argue that respecting patient autonomy has no weight at all; more commonly, objectors argue that there are cases in which overriding patient autonomy is sometimes justified by the good consequences that will likely result from doing so.

(ii) Autonomy is also referenced as an important value to be taken into consideration in the abortion debate, although it is referenced in different ways. On the one hand, it is argued that some abortions are justified as an expression of a woman’s reproductive autonomy (see Overall 1990 and Fischer 2003). On the other hand, it could be argued that abortion is morally unacceptable, among other reasons, because it fails to respect the potential future autonomy of the aborted (for a related argument, see Marquis 1989).  Assuming that both of these autonomy-based arguments have weight, adjudicating this dispute requires–among other things–establishing and defending the relative weights of actual and potential autonomy, both in relation to particular choices and in relation to lives as wholes.

(iii) Many argue that considerations of respect for autonomy are also decisive in the debates concerning the moral acceptability of euthanasia and suicide. Respect for autonomy can be viewed as a reason for accepting voluntary euthanasia. The basic argument here is one of consistency: if respect for others’ autonomy requires respecting others’ self-determining life-choices (at least when these are competently made), and if end-of-life decisions are placed within the ambit of life-choices, then end-of-life decisions made by competent, autonomous persons should be respected, even if these decisions involve voluntary euthanasia (for a related argument, see Brock 1993). Some have cast doubt, however, on whether a decision to die can be an autonomous decision at all, given the likely presence of psychological factors such as fear, hopelessness, and despair–factors which would undermine careful introspection and critical thought (Hartling 2006). Respect for autonomy can also be seen as a reason for respecting the decision to end one’s life, even when reasons of mercy are not in play–that is, in cases of suicide–at least where there is reason to hold that the agent is sufficiently competent and rational (Webber & Shulman 1987). Some argue, however, that autonomy-based defenses of voluntary euthanasia and suicide involve a contradiction insofar as they invoke the value of autonomy to justify an act that destroys autonomy (Safranak 1988 and Doerflinger 1989). If correct, these arguments do not show that voluntary euthanasia or suicide are unacceptable; they show rather that arguments to establish their acceptability cannot be based on respect for autonomy. It may be a cause of worry, however, that such arguments prove too much by rendering unacceptable autonomy-based respect for any decision that involves a subsequent lessening of free choice.

(iv) Autonomy also carries normative weight in a number of applied ethics debates relating to public policy. Respect for autonomy can be straightforwardly referenced, for example, as an argument in favor of the acceptability of same-sex marriage: respect for others’ autonomy entails respect for their autonomous decisions, and decisions regarding marriage–even same-sex marriage–fall within these parameters (when autonomous).  Objectors might argue, however, that homosexual marriage is immoral, and that the right to noninterference with autonomous choice does not exist where the object of the choice is immoral. Few would argue, for example, that there exists an obligation to respect someone’s autonomous decision to embezzle money, given that that act is immoral. The question then becomes whether same-sex marriage is morally acceptable.

(v) Respect for autonomy also plays a role in discussions of a just-war theory. Specifically, it has been referenced as the key principle determining the proper constraints and limitations that should be in place if we wish our prosecution of war to be just. It has been argued, for example–and in partial conjunction with what has been said above–that possession of autonomy (or the existence of a capacity for it) is the ground of human dignity, and hence the actions which appreciate that dignity must center on respect for autonomy. In relation to war, this suggests that while war may sometimes be morally permissible (in cases of self-defense, for example), wartime actions cannot involve violations of others’ autonomy, especially that of noncombatants (for an extended discussion, see Zupan 2004).

(vi) In business ethics, respect for autonomy has been identified as a key reason why persuasive advertising practices are morally unacceptable (Crisp 1987). The arguments given in support of this claim largely follow those mentioned above in relation to truthfulness–viz. that respect for others’ autonomy is incompatible with deception or manipulation–combined with the claim that persuasive advertising practices constitute deception or manipulation. In this vein it has also been argued that persuasive advertising undermines consumer autonomy by creating foreign desires and wants and by producing compulsive behavior in consumers. Some have argued, however, that persuasive advertising practices do not threaten consumers’ autonomy, at least not necessarily or intrinsically (Arrington 1982). From this point of view, although such deception may occur, this is the exception; usually it provides consumers with the information necessary for making informed decisions. It has also been argued that, even if persuasive advertising thwarts autonomy, it is still in consumers’ interests to be exposed to it, given that companies would likely only go to such trouble for products that will be market-winners, and hence that consumers would have desired and bought those products anyway, even after careful consideration (Nelson 1978). One obvious problem with this argument is that it assumes that heavy persuasive marketing is a sign of product quality, which is certainly debatable; but even if that premise is granted, it may still be argued that rhetorical device laden advertising, by attempting to bypass consumers’ critical thinking abilities, violates their autonomy.

(vii) Respect for autonomy has even been referenced in relation to issues in environmental ethics. Eric Katz has argued, for example, that nature as a whole constitutes an ‘autonomous subject’, which therefore deserves moral respect and should not be treated as a mere means to the satisfaction of human ends (Katz 1997). Critics of this view may wonder whether the notion of an autonomous subject operative here has been stretched to breaking point, or rendered hollow. If correct, this criticism does not, of course, entail that prohibitions against using nature as a mere means to human ends cannot be provided; it simply means that an acceptable defense cannot be based on autonomy-related considerations.

It should be clear from the breadth and diversity of the employment of the principle of respect for autonomy that it is both, extremely versatile and a mainstay of applied ethics debates. The brief sketches given above concern some of the more prominent autonomy-related discussions in applied ethics, but other debates in applied ethics–relating, for example, to injunctions against discrimination (Gardner 1992 and Doyle 2007) or against domestic abuse (Friedman 2003), to name a couple–have been approached and adjudicated in reference to the importance of respecting autonomy as well.

More often than not, however, those who reference the principle of respect for autonomy in applied ethics either take its normative force for granted, or only devote passing attention to the question of its justification. Yet, given that the principle is neither self-evident nor immune to challenge, it is very important that those who reference the principle be able to provide a robust justification of its normative weight. Because of its fundamentality, this issue will be considered separately and in more detail in section 4 below.

c. Autonomy in Political Philosophy

Autonomy is considered normatively significant for issues in political philosophy, primarily in relation to discussions of social justice and rights. It is particularly important for political liberalism (see, for example, Christman and Anderson 2005); and some have argued that autonomy is the core value of liberalism (see White 1991 and Dagger 2005).  Four of the most important issues in political philosophy that invoke the normative significance of autonomy include:

(i) The establishment and validation of just social and political principles

(ii) The legitimation of political power

(iii) The justification of political rights (both specific and general)

(iv) The acceptability of political paternalism

(i) A conception of the autonomous individual provides the perspective from which social and political principles are formulated, and validated as just, in several contractarian political theories. A classic example is provided in Rawls’ A Theory of Justice (1971).  According to Rawls, principles of social justice are best conceived and validated based on what would be acceptable to (representatives of) members of society gathering together in an ‘original position’ behind a ‘veil of ignorance’. Rawls argued that the conditions that constrain this process will ensure that those taking part in it are acting autonomously (that is, according to Rawls, as free and rational). Of key importance is the veil of ignorance because, by preventing any detailed knowledge of one’s condition or place in society, it “deprives the persons in the original position of the knowledge that would enable them to choose heteronomous principles”. Rawls concludes: “[W]e can say that by acting from these principles persons are acting autonomously: they are acting from principles that they would acknowledge under conditions that best express their nature as free and equal rational beings.” Given these key constraints in the contracting process, it  results, according to Rawls, in valid principles of social justice. Here it can be seen that autonomy has double (and mutually supporting) normative significance: it characterizes members of society in an idealized way in order to form the normatively privileged perspective from which to establish principles of social justice; and it provides the standard that validates those principles as just (viz., by being accepted by autonomous agents). One can see the influence of Kant’s conception of autonomy and its normative significance in this doctrine. Roughly put, Kant held that moral principles are those that would be accepted by persons within an idealized constraint–viz., insofar as they are autonomous. Similarly, Rawls argued that the principles of social justice are those that would be accepted by persons within an idealized constraint–viz., insofar as they are conceived as autonomous (free and rational) agents behind a veil of ignorance in the original position. Rawls explicitly acknowledged his indebtedness to Kant in this regard.

(ii) In relation, a cornerstone of political liberalism is the view that political power is legitimized by its free acceptance by a state’s subjects who are conceived (at least minimally) as autonomous persons. John Locke is recognized as one of the key progenitors of this view of the legitimation of political power. In Two Treatises on Government (1689), he wrote: “Men being, as has been said, by nature all free, equal, and independent, no one can be put out of his estate, and subjected to the political power of another, without his own consent, which is done by agreeing with other men to join and unite into a community for their comfortable, safe, and peaceable living one amongst another…” That which secures the legitimacy of government on such a view is precisely the agreement to do so amongst those who are not only naturally equal in standing, but also free and independent–that is, self-directing. The tradition of placing crucial normative weight on the autonomy of the contracting parties has continued to the present.  Referring to liberalism in political philosophy in general, John Christman has written (2005), “Liberal legitimacy…assumes that autonomous citizens can endorse the principles that shape the institutions of political power….In this way, political power is an outgrowth of autonomous personhood and choice.” As before, autonomy is fulfilling two (mutually supporting) roles: it is being used to delimit the normatively privileged perspective from which judgment is authoritative regarding political legitimacy, and it is informing (at least partly) the standard in relation to which that judgment (viz., the acceptance of a political power) is made.

(iii) Autonomy is referenced as a core ground in the justification of political rights in a broadly liberal political framework. It is argued, for example, that a theory of autonomy has to be presupposed to achieve agreement with a theory of rights that in principle is acceptable to all. It is also argued that autonomy is absolutely central in views of rights that enshrine the idea that people have the freedom equally to conceive and enjoy widely different forms of meaningful life. Attracta Ingram (Ingram 1994) provides a clear articulation of the view that autonomy deserves a central place in the defense of a scheme of political rights: “I think that the most compelling answer is that people’s most vital human interest is in living meaningful lives. This interest cannot be secured while they are at risk of slavery, social subordination, repression, persecution, and grinding poverty – conditions that history shows to be the lot of many in societies which do not recognize the value of individual freedom. So it is rational for people to want to develop the mental capacities and social environment necessary to living independent lives. Since what is at stake is the proper distribution of human freedom, we have here matters of justice and rights; the province of political morality.” (112-3) Autonomy is thus said by some to be a–or the–core unifying value in a conception of rights that is liberal (and hence pluralistic) in tenor (see also Richards 1989). In addition, the value of autonomy is referenced to justify particular rights such as the right to free speech (Brison 2000) or the right to privacy (Kupfer 1987).

(iv) Autonomy is referenced by many as the core value that militates against the acceptability of political (and informal) paternalism. According to a widely-accepted conception, an act is paternalistic if it involves direct interference with another’s actions and will for the purpose of advancing (what the interferer takes to be) that person’s own good. Paternalism bypasses the agent’s capacity to be self-directing and ignores the agent’s wishes regarding the way she would like to live her own life; and it is these factors that constitute a violation of the autonomy of the one suffering paternalistic influence. It is commonly held that possession of the capacity for autonomy gives the agent a right and an authority–at least in relation to minimally voluntary, self-regarding choices (all else being equal)–to be self-determining without interference (for a detailed account of paternalism and the defense of the claims of autonomy, see VanDeVeer 1986; see also Mill’s classic argument against paternalism in On Liberty). Supporters of paternalistic doctrines tend to argue that paternalism is justified based either on the highly beneficial consequences of such interference, or on the ground that a policy of paternalism could be hypothetically accepted by autonomous agents when the possible related consequences are severe enough (on the latter see Rawls 1999). It has also been argued that a certain degree of paternalism is unavoidable, but that such paternalism should be constrained by the goal of leading persons to welfare-promoting choices while not threatening freedom of choice (Sunstein and Thaler 2003).

It is worth mentioning in passing that J.S. Mill, who is often referenced as a champion of individual liberty and a firm critic of paternalistic policy, endorsed a strong version of paternalism, but only in relation to “those backward states of society in which the race itself may be considered as in its nonage.”  In relation to these, Mill (1971) claimed that “Despotism is a legitimate mode of government in dealing with barbarians, provided the end be their improvement and the means justified by actually effecting that end.”  Although these claims of Mill’s would find few supporters today, it is worth adding that the standard that Mill employed to ground the distinction between unjustified and justified paternalism was the presence of a kind of maturity of thought and judgment that is not greatly dissimilar to autonomy: “[A]s soon as mankind have attained the capacity of being guided to their own improvement by conviction or persuasion…compulsion, either in the direct form or in that of pains and penalties for noncompliance, is no longer admissible as a means to their own good, and justifiable only for the security of others.”

d. Autonomy in Philosophy of Education

Several philosophers have argued that autonomy development is the most important goal (or at least one of the most important goals) of a liberal education. Reasoning in support of this claim usually takes two forms. Firstly, some argue that autonomy should be the primary goal of liberal education because autonomy enhancement is the most important goal of the liberal state, and hence an education in such a state should be an education for autonomy (see White 1991, and compare with Raz 1986, ch. 14). Secondly, some argue that autonomy should be the goal of liberal education because it should be a key goal of any form of education, largely because an education for autonomy is crucial for human well-being across the board.

The latter position has been challenged by communitarians, however, who argue that there is no justification for the claim that autonomy is universally valuable, and who see autonomy as at best a parochial (Western) value (see MacIntyre 1981, White 1991, and Raz 1986). The communitarian argument has been challenged in various ways. It has been directly counter-argued, for example, that autonomy is universally intrinsic to well-being (see Norman 1994 and Ishtiyaque & Cuypers 2008). In addition, the epistemic benefits of autonomy development for forming rational judgments about one’s life have been cited as reason for allowing the state to mandate education for autonomy, even over the protests of more traditionally-minded parents (MacMullen 2007). Although communitarians continue to be suspicious of the claim that autonomy should be a goal of all education, it is widely agreed that education for autonomy is central to an education in a liberal society.

4. Warrant for the Principle of Respect for Autonomous Choice

As mentioned above (in section 3a), the idea that autonomy gives rise to demands of respect can take two forms. On the one hand, it is argued that the possession of autonomy or the capacity for it grounds human dignity and the basic moral respect for persons that attends that dignity. On the other hand, it is argued that the fact that a choice or demand is autonomous is reason to give special or added normative uptake to that choice or demand. For clarity, one might refer to the former as the principle of respect for autonomy and the latter as the principle of respect for autonomous choice. The principle of respect for autonomy has already been examined in connection with Kant’s moral philosophy, and it was shown that although this principle has been popular, it is also quite controversial, largely because of problems involving exclusion. The principle of respect for autonomous choice will be examined in the present section. As shown above, this principle plays a key role in a variety of normative debates, especially debates in applied ethics. As has been mentioned, however, the principle is often either invoked without supporting argument or is given thin justification at best. The principle is therefore worthy of further discussion, especially with regard to its normative warrant. What is the warrant for the claim that autonomous choices give rise to special demands of respect? Two views have emerged on this question. Unsurprisingly, these views can be delineated along deontological and consequentialist lines.

Firstly, many philosophers following Kant (often only roughly), contend that autonomous choices deserve special respect because persons, as capable of self-determination, are entitled, all else being equal, to be self-determining without interference. This may be termed the authority view of the justification for the principle of respect for autonomous choice. The authority view is most often allied to the view that respect for autonomy functions primarily as a side-constraint which forbids paternalistically-motivated interference in the self-regarding, minimally voluntary choices of others, even if such interference would be prudentially best for the choosing persons.

Secondly, some philosophers following Mill, contend that autonomous choices deserve special respect because a policy of such respect conduces to desirable prudential consequences, either for the choosing agent, or aggregately. On this view, autonomous choices are not to be respected merely because they are autonomous, or because those making them have a capacity for self-determination, but rather because doing so will lead to the most beneficial prudential results. This more consequentialist view of the normative warrant for the principle of respect for autonomous choice may be termed the benefit view.

Based on the literature, it is quite clear that the authority view is the dominant view in the field, and has been for some time. Many philosophers hold that persons have a right to have their self-determining choices respected even in cases where there is good reason to think that the fulfillment of their autonomous choices would lead to bad prudential results (see Wellman 2003 and Darwall 2006). Against this it may be argued that where the prudential results of respecting a person’s autonomous choice are disastrous enough, interference may be justified, thus opening the door to the salience of consequentialist considerations bearing upon the principle (see Young 1982 and Wellman 2003). It has also been argued that the relation between the fulfillment of at least minimally robust autonomous choice and the resulting expression of authentic selfhood (conceived as highly prudentially significant) suggests that the benefit view deserves to be given closer attention (Piper 2009). Given the great popularity and wide employment of the principle of respect for autonomous choice, it is safe to say that the question of its normative warrant deserves far greater attention than it has thus far received.

5. References and Further Reading

  • Anderson, Joel and Axel Honneth. “Autonomy, Vulnerability, Recognition, and Justice.”  In Autonomy and the Challenges to Liberalism: New Essays, eds. John Christman and Joel Anderson, 127-149. Cambridge: Cambridge University Press, 2005.
  • Arpaly, Nomy. “Responsibility, Applied Ethics, and Complex Autonomy Theories.” In Personal Autonomy: New Essays in Personal Autonomy and Its Role in Contemporary Moral Philosophy, ed. James Stacey Taylor, 162-180. Cambridge: Cambridge University Press, 2005.
  • Arrington, Robert L. “Advertising and Behavior Control.” Journal of Business Ethics 1, no. 1 (Feb. 1982): 3-12.
  • Berlin, Isaiah. Two Concepts of Liberty. Oxford: Oxford University Press, 1958.
  • Berofsky, Bernard. Liberation from Self: A Theory of Personal Autonomy. Cambridge; Cambridge University Press, 1995.
  • Brison, Susan J. “Relational Autonomy and the Freedom of Expression.”  In Relational Autonomy: Feminist Perspectives on Autonomy, Agency, and the Social Self, eds. Catriona Mackenzie and Natalie Stoljar, 280-299. Oxford: Oxford University Press, 2000.
  • Brock, D. “Voluntary Active Euthanasia.” The Hastings Center Report 22, no. 2 (1993): 10-22.
  • Buss, Sarah. “Valuing Autonomy and Respecting Persons: Manipulation, Seduction, and the Basis of Moral Constraints.” Ethics 115, no. 2 (Jan 2005): 195-235.
  • Christman, John and Joel Anderson, eds. Autonomy and the Challenges to Liberalism: New Essays. Cambridge: Cambridge University Press, 2005.
  • Christman, John. “Autonomy, History, and the Subject of Justice.” Social Theory and Practice 33, no. 1 (January 2007): 1-26.
  • Christman, J. “Autonomy, Self-Knowledge, and Liberal Legitimacy.” In Autonomy and the Challenges to Liberalism: New Essays, eds. John Christman and Joel Anderson, 330-357. Cambridge: Cambridge University Press, 2005.
  • Cooper, John. “Stoic Autonomy.” In Autonomy, eds. Ellen Frankel Paul, Fred Miller, and Jeffrey Paul: 1-29. Cambridge: University of Cambridge Press, 2003.
  • Crisp, Roger. “Persuasive Advertising, Autonomy, and the Creation of Desire.” Journal of Business Ethics 6, no. 5 (July 1987): 413-418.
  • Dagger, Richard. “Autonomy, Domination, and the Republican Challenge to Liberalism.”  In Autonomy and the Challenges to Liberalism: New Essays, eds. John Christman and Joel Anderson, 177-203. Cambridge: Cambridge University Press, 2005.
  • Darwall, Stephen. “The Value of Autonomy and Autonomy of the Will.” Ethics 116, no. 2 (January 2006): 263-284.
  • Doerflinger, Richard. “Assisted Suicide: Pro-Choice or Anti-Life?” The Hastings Center Report 19, no. 1 (Jan-Feb 1989): 16-19.
  • Doyle, Oran. “Direct Discrimination, Indirect Discrimination, and Autonomy.” Oxford Journal of Legal Studies 27, no. 3 (2007): 537-553.
  • Dworkin, Gerald. The Theory and Practice of Autonomy. Cambridge: Cambridge University Press, 1988.
  • Dworkin, Gerald. “Autonomy and Informed Consent.” In Ethical Health Care, eds. Patrician Illingworth and Wendy Parmet, 79-91. Upper Saddle River, NJ: Pearson Prentice-Hall, 2006.
  • Fischer, John Martin. “Abortion, Autonomy, and Control Over One’s Body.” Social Philosophy and Policy 20, no. 2 (2003): 286-306.
  • Fischer, John Martin and Mark Ravizza. Responsibility and Control: A Theory of Moral Responsibility. Cambridge: Cambridge University Press, 1998.
  • Frankfurt, Harry. “Freedom of the Will and the Concept of a Person.” In The Importance of What We Care About by Harry Frankfurt, 11-25. Cambridge: Cambridge University Press, 1988.
  • Frankfurt, Harry. Necessity, Volition, and Love. Cambridge: Cambridge University Press, 1999.
  • Friedman, Marilyn. Autonomy, Gender, Politics. Oxford: Oxford University Press, 2003.
  • Gardner, John. “Private Activities and Personal Autonomy: At the Margins of Anti-discrimination Law.” In Discrimination: The Limits of the Law, eds. Bob Hepple and Erika Szyszczak, 148-171. London: Mansell, 1992.
  • Griffin, James. Well-Being: Its Meaning, Measurement, and Moral Importance. Oxford: Oxford University Press, 1986.
  • Guyer, Paul. “Kant on the Theory and Practice of Autonomy.” In Autonomy, eds. Ellen Frankel Paul, Fred Miller, and Jeffrey Paul: 70-98. Cambridge: Cambridge University Press, 2003.
  • Hartling, O.J.  “Euthanasia–the Illusion of Autonomy.” Medicine and Law 25, no. 1 (2006): 189-99.
  • Hill, Thomas. “The Kantian Conception of Autonomy.” In The Inner Citadel: Essays on Individual Autonomy, ed. John Christman, 91-105. Oxford: Oxford University Press, 1989.
  • Hill, Thomas. “Autonomy and Benevolent Lies.” In Autonomy and Self-Respect by Thomas Hill, 25-42. Cambridge: Cambridge University Press, 1991.
  • Ingram, Attracta. A Political Theory of Rights. Oxford: Oxford University Press, 1994.
  • Ishtiyaque, Haji and Stefaan Cuypers. “Authenticity-Sensitive Preferentism and Educating for Well-Being and Autonomy.” Journal of Philosophy of Education 42, no. 1 (February 2008): 85-106.
  • Kant, Immanuel, trans. Mary Gregor. Groundwork of the Metaphysics of Morals.  Cambridge: Cambridge University Press, 1998.
  • Katz, Eric. Nature As Subject. New York: Rowman & Littlefield, 1997.
  • Korsgaard, Christine. Creating the Kingdom of Ends. New York: Cambridge University Press, 1996.
  • Kupfer, Joseph. “Privacy, Autonomy, and Self-Concept.” American Philosophical Quarterly 24, no. 1 (January 1987): 81-9.
  • MacIntyre, Alasdair. After Virtue: A Study in Moral Theory. Notre Dame: University of Notre Dame Press, 1981.
  • Mackenzie, Catriona and Natalie Stoljar, eds. Relational Autonomy: Feminist Perspectives on Autonomy, Agency, and the Social Self. Oxford: Oxford University Press, 2000.
  • MacMullen, Ian. Faith in Schools? Autonomy, Citizenship and Religious Education in the Liberal State. Princeton, NJ: Princeton University Press, 2007.
  • Marquis, Don. “Why Abortion is Immoral.” Journal of Philosophy 86 (April 1989): 183-202.
  • Mele, Alfred. Autonomous Agents: From Self-Control to Autonomy. Oxford: Oxford University Press, 1995.
  • Meyers, Diana. Self, Society, and Personal Choice. New York: Columbia University Press, 1989.
  • Mill, John Stuart. On Liberty. Edited by Curran V. Shields. New Jersey: Prentice-Hall Inc., 1997.
  • Nelson, Philip. “Advertising and Ethics.” In Ethics, Free Enterprise, and Public Policy: Original Essays on Moral Issues in Business, eds. Robert T. De George and Joseph A. Pichler. New York: Oxford University Press, 1978.
  • Norman, Richard. “‘I Did It My Way’: Some Thoughts on Autonomy.” Journal of Philosophy of Education 28, no. 1 (1994): 25-34.
  • Oshana, Marina. “Personal Autonomy and Society.” Journal of Social Philosophy 29, no. 1 (Spring 1998): 81-102.
  • Oshana, Marina. “How Much Should We Value Autonomy?” In Autonomy, eds. Ellen Frankel Paul, Fred Miller, and Jeffrey Paul, 99-126. Cambridge: Cambridge University Press, 2003.
  • Overall, Christine. “Selective Termination of Pregnancy and Women’s Reproductive Autonomy.” The Hastings Center Report 20, no. 3 (May-June 1990): 6-11.
  • Piper, Mark. “On Respect for Personal Autonomy and the Value Instantiated in Autonomous Choice.” Southwest Philosophy Review 25, no. 1 (January 2009): 189-198.
  • Pohlmann, R. “Autonomie.” In Historisches Wörterbuch der Philosophie, ed. J. Ritter, 1: 701-719. Basel: Schwabe, 1971.
  • Rawls, John. A Theory of Justice, rev. ed. Cambridge, MA: Harvard University Press, 1999.
  • Raz, Joseph. The Morality of Freedom. Oxford: Oxford University Press, 1986.
  • Reath, Andrews. “Autonomy of the Will as the Foundation of Morality.” In Agency and Autonomy in Kant’s Moral Theory by Andrews Reath, 121-172. Oxford: Oxford University Press, 2006.
  • Richards, David A.J. “Rights and Autonomy.” In The Inner Citadel: essays on Individual Autonomy, ed. John Christman, 203-220. Oxford: Oxford University Press, 1989.
  • Safranek, John. “Autonomy and Assisted Suicide: the Execution of Freedom.” The Hastings Center Report 28, no. 4 (July-Aug. 1998): 32-36.
  • Sartre, Jean-Paul, trans. Bernard Frechtman. Existentialism is a Humanism. New York: Philosophical Library, 1946.
  • Sumner, L.W. Welfare, Happiness, and Ethics. Oxford: Oxford University Press, 1996.
  • Sunstein, Cass and Richard Thaler. “Libertarian Paternalism is Not an Oxymoron.” The University of Chicago Law Review 70, no. 4 (Fall 2003): 1159-1202.
  • Taylor, James Stacey. “Introduction.” In Personal Autonomy: New Essays on Personal Autonomy and Its Role in Contemporary Moral Philosophy, ed. James Stacey Taylor, 1-29. Cambridge: Cambridge University Press, 2005.
  • VanDeVeer, Donald. Paternalistic Intervention: The Moral Bounds of Benevolence.  Princeton, NJ: Princeton University Press, 1986.
  • Webber, May and Ernest Shulman. “Personal Autonomy and Rational Suicide.” Paper given at the Annual Convention of the American Association of Suicidology/International Association for Suicide Prevention (1987).
  • Wellman, Christopher Heath. “The Paradox of Group Autonomy.” In Autonomy, eds. Ellen Frankel Paul, Fred Miller, and Jeffrey Paul, 265-285. Cambridge: Cambridge University Press, 2003.
  • White, John. Education and the Good Life: Autonomy, Altruism, and the National Curriculum. New York: Teachers College Press, 1991.
  • Wolf, Susan. Freedom and Reason. New York: Oxford University Press, 1990.
  • Young, Robert. “The Value of Autonomy.” The Philosophical Quarterly 32, no. 126 (January 1982): 35-44.
  • Zupan, Daniel S. War, Morality, and Autonomy: An Investigation in Just War Theory. Hampshire, England: Ashgate Publishing Ltd., 2004.

Author Information

Mark Piper
Email: pipermc@jmu.edu
James Madison University
U. S. A.

Cloning

In biology, the activity of cloning creates a copy of some biological entity such as a gene, a cell, or perhaps an entire organism. This article discusses the biological, historical, and moral aspects of cloning mammals. The main area of concentration is the moral dimensions of reproductive cloning, specifically the use of cloning in order to procreate.

The article summarizes the different types of cloning, such as recombinant DNA/molecular cloning, therapeutic cloning, and reproductive cloning. It explores some classic stereotypes of human clones, and it illustrates how many of these stereotypes can be traced back to media portrayals about human cloning. After a brief history of the development of cloning technology, the article considers arguments for and against reproductive cloning.

One of the most predominate themes underlying arguments for reproductive cloning is an appeal to procreative liberty. Because cloning may provide the only way for some individuals to have a child that is genetically their own, a ban on cloning interferes with their reproductive autonomy.

Arguments against cloning appeal to concerns about a clone’s lack of genetic uniqueness and what may be implied because of this. Human cloning is of special interest. There are concerns that cloned humans would lack individuality, that they would be treated in undignified ways by their creators, or that they would be damaged by society’s expectations that they should be more like those from whom they were cloned. Because they would essentially be facsimiles of the original person, there is concern that the clones might possess less moral worth. The predominate theme underlying arguments against human cloning is that the cloned child would undergo some sort of physical, social, mental, or emotional harm. Because of these and other concerns, the United Nations and many countries have banned human cloning. An important philosophical issue is whether such a response against human cloning is warranted.

Table of Contents

  1. Types of Cloning
    1. Recombinant DNA Technology / Molecular Cloning
    2. Therapeutic Cloning
    3. Reproductive Cloning
  2. Misconceptions About Cloning and Their Sources
  3. Cloning Mammals: A Brief History
  4. Arguments in Favor of Reproductive Cloning and Responses
    1. Reproductive Liberty: The Only Way to Have a Genetically Related Child
    2. Cloning and Savior Siblings
    3. Cloning In Order to “Replace” a Deceased Child
    4. The Resultant Loss of Therapeutic Cloning for Stem Cell Research and Treating Diseases
  5. Arguments Against Reproductive Cloning and Responses
    1. The Right to an Open Future
    2. The Right to a Unique Genetic Identity
    3. Cloning is Wrong because it is “Playing God” or because it is “Unnatural”
    4. The Dangers of Cloning
    5. Cloning Entails the Creation of Designer Children, or it Turns Children into Commodities
    6. Cloning and the Ambiguity of Familial Roles
  6. References and Further Reading

1. Types of Cloning

a. Recombinant DNA Technology / Molecular Cloning

DNA/Molecular cloning has been in use by molecular biologists since the early 1960s. When scientists wish to replicate a specific gene to facilitate more thorough study, molecular cloning is implemented in order to generate multiple copies of the DNA fragment of interest. In this process, the specific DNA fragment is transferred from one organism into a self-replicating genetic element, e.g., a bacterial plasmid (Allison, 2007).

Because this kind of cloning does not result in the genesis of a human organism, it has no reproductive intent or goals, and it does not result in the creation and destruction of embryos, there is little to no contention regarding its use.

b. Therapeutic Cloning

Embryonic stem cells are derived from human embryos at approximately five days post-fertilization, in the blastocyst stage of development. Because of their plasticity, embryonic stem cells can be manipulated to become any cell in the human body, e.g., neural cells, retinal cells, liver cells, pancreatic cells, or heart cells. Many scientists hope that, with proper research and application, embryonic stem cells can be used to treat a wide variety of afflictions, e.g., tissue toxicity resulting from cancer therapy (National Cancer Institute, 1999) Alzheimer’s disease (Gearhart, 1998), Parkinson’s disease (Freed et al, 1999; National Institute of Neurological Disorders and Strokes, 1999; Wager et al. 1999; Gearhart, 1998), diabetes (Voltarelli et al, 2007; Shapiro et al., 2000), heart disease (Lumelsky, 2001; Zulewski, 2001), and limb paralysis (Kay and Henderson, 2001).

One current obstacle for the successful use of embryonic stem cells for disease therapy concerns immunological rejection. If a patient were to receive stem cell therapy in order to treat some affliction, her body may reject the stem cells for the same reason human bodies have a tendency to reject donated organs: the body tends to not recognize, and therefore reject, foreign cells. One way to overcome stem cell rejection is by creating embryos through somatic cell nuclear transfer with the patient’s own DNA. In 2008, a California research team succeeded in creating embryos via SCNT and growing them to the blastocyst stage (French et al., 2008). In SCNT, an ovum is emptied of its own nucleus, its DNA, and the chromosomal DNA from another person (in the case, a patient’s) is inserted. The ovum is then artificially induced to begin dividing as if it had been naturally fertilized (usually via the use of an electrical current). Once the embryo is approximately five days old, the stem cells are removed, cultured, differentiated to the desired type of body cell, and inserted back into the patient (the genetic donor in this case). Since the embryo was a genetic duplicate of the patient, there would be no immunological rejection. One use of this technology, for example, is to help treat individuals in the aftermath of a heart attack. Using SCNT to create a genetically identical blastocyst, new healthy cells could be derived and inserted back into the genetic donor’s heart in order to replace the damaged cardiac cells (Strauer, 2009).

It may also be possible to use therapeutic cloning to repair defective genes by homologous recombination (Doetschman et al., 1987). Cellular models of diseases can be developed as well, along with the ability to test drug efficacy: “cloning a single skin cell from a patient with a disease could be used to produce inexhaustible amounts of cells and tissue with that disease. The tissue could be experimented upon to understand why disease occurs. It could be used to understand the genetic contribution to disease and to test vast arrays of new drugs which could not be tested in human people” (Savulescu, 2007, 1-2). Pluripotent stem cells can also be used to test drug toxicity which could also diminish the chances of drug-related birth defects (Boiani and Schöler, 2002, 124).

Therapeutic cloning is controversial because isolating the stem cells from the embryo destroys it. Many individuals regard the human embryo as a person with moral rights, and so they consider its destruction to be morally impermissible. Moreover, because the embryos are created with the explicit intention to destroy them, there are concerns that this treats the embryos in a purely instrumental manner (Annas et al.,, 1996). Although some ethicists are in favor of using surplus embryos from fertility treatments for research (since the embryos were slated for destruction in any case), they are simultaneously against creating embryos solely for research due to the concern that doing so treats the embryos purely as means (Outka, 2002; Peters, 2001). Indeed, it is precisely because of these ethical issues that some individuals object to the positive connotations of the term “therapeutic” and refer to this work, instead, as “research cloning.” The term “therapeutic cloning” is, however, more widely used.

c. Reproductive Cloning

SCNT can also be used for reproductive purposes. Unlike therapeutic cloning, the cloned embryo is transferred into a uterus of a female of the same species and would be, upon successful implantation, allowed to gestate as a naturally fertilized egg would. The cloned embryo would possess identical chromosomal DNA as its genetic predecessor, but, because of the use of a different ovum, its mitochondrial DNA (the genetic material inhabiting the cytoplasm of the enucleated ovum) would differ, and, consequently, it would not be 100% genetically identical (unlike monozygotic multiples who, because they are derived from the same ovum, share identical chromosomal and mitochondrial DNA). In addition to its slight genetic difference, the cloned embryo would likely be gestated in a different uterine environment, which can also have an effect in ways that may serve to distinguish it from its genetic predecessor. For example, a cloned entity’s phenotype (its appearance) may look very different than that of its genetic predecessor because the embryo can undergo epigenetic reprogramming, where nongenetic (i.e., environmental) causes influence genes to manifest themselves differently. The result is that the genes behave in ways that may lead to a difference in appearance.

In addition to somatic cell nuclear transfer, there is another, less controversial and less technologically complex, manner of reproductive cloning: artificial embryo twinning. Here, an embryo is created in a Petri dish via In Vitro Fertilization (IVF). The embryo is then induced to divide into genetic copies of itself, thereby artificially mimicking what happens when monozygotic multiples are formed (Illmensee et al., 2009). The embryos are then transferred into a womb and, upon successful implantation and gestation, are born as identical multiples. If implantation is unsuccessful, the process is repeated.

One argument in favor of artificial embryo twinning is that it provides an infertile couple, who may not have been able to produce many viable embryos through IVF, with more embryos that they can then implant for an increased chance at successful reproduction (Robertson, 1994). Because some of the embryos may be saved and implanted later, it is possible to create identical multiples who are not born at the same time. One advantage to doing this is that the later born twin could serve as a blood or bone marrow donor for her older sibling should the need arise; because they are genetically identical, the match would be guaranteed (the converse could also hold, that is, the older individual could serve as a donor for the clone should the latter ever need it. The existence of a cloned person, therefore, could be mutually beneficial, rather than asymmetrical). However, some concerns have been raised. For example, it has been argued that artificially dividing the embryo constitutes an immoral manipulation of it and that, as much as possible, a unique embryo should be allowed to develop without interference (McCormick, 1994). Concerns over individuality have also been raised; whereas naturally occurring twins are valued as individuals, one worry is that embryos created through artificial twinning, precisely because of the synthetic nature of their genesis, may not be as valued (McCormick, 1994).

2. Misconceptions About Cloning and Their Sources

The general public still seems to regard human reproductive cloning as something that can occur only in the realm of science fiction. The portrayal of cloning in movies, television, and even in journalism has spanned from comedic to dangerous. Human clones have often been depicted in movies as nothing but carbon copies of their genetic predecessor with no minds of their own (e.g., Multiplicity and Star Wars: Attack of the Clones), as products of scientific experiments that have gone horribly wrong, resulting in deformed quasi-humans (Alien Resurrection) or murderous children (Godsend), as persons created simply for spare parts for their respective genetic predecessor (The Island), or as deliberate recreations of famous persons from the past who are expected to act just like their respective predecessor (The Boys from Brazil). Even when depicting nonhuman cloning, films (such as Jurassic Park) tend to portray products of cloning as menacing, modern-day Frankensteinian monsters of sorts, which serve to teach humans a lesson about the dangers of “playing God.”

Many other media outlets, although usually shying away from the ominous representation of clones so prevalent in the movies, have usually portrayed clones as, essentially, facsimiles of their genetic predecessor. On the several occasions which Time Magazine has addressed the issue of cloning, the cover illustrates duplicate instances of the same picture. For example, the February 19, 2001 cover shows two mirror image infants staring at each other, the tagline suggesting that cloning may be used by grieving parents who wish to resurrect their dead child. Even a Discovery Channel program, meant to educate its viewers on the nature of cloning, initially portrays a clone as nothing more than a duplicate of the original person. Interestingly enough, however, a few minutes into the program, the narrator, speaking over a picture of two identical cows, says: “But even if a clone person is created, that doesn’t mean it would be an exact copy of the original.” Yet almost immediately afterwards, the same narrator calls a clone “You, version 2.0.”

As philosopher Patrick Hopkins has pointed out, media conceptions about what human cloning entails, and the type of offspring that will arise from cloning, employ the tacit premise that clones are nothing but copies. The predominate belief that fuels this conception is that genetic determinism is true, i.e., that a person’s genes are the sole determining factor of her behavior and physical appearance; essentially, that a person’s identity is solely determined by her genetic constitution. If a person were to believe that genetic determinism is true, then it follows that she believes that a cloned person would be psychologically identical with her genetic predecessor because they are (almost) genetically identical. Hopkins also points out that, like the narrator in the Discovery Channel program, many media outlets “engage in confusing, contradictory bits of double-talk (or double-show). The images and not-very-clever headlines all convey unsettling messages that clones will be exact copies, while inside the stories go to some effort to educate us that clones will not in fact be exact copies” (1998, 129-130).

3. Cloning Mammals: A Brief History

In 1894, Hans Driesch cloned a sea urchin through inducing twinning by shaking an embryonic sea urchin in a beaker full of sea water until the embryo cleaved into two distinct embryos. In 1902, Hans Spemann cloned a salamander embryo through inducing twinning as well, using a hair from his infant son as a noose to divide the embryo.  In 1928, Spemann successfully cloned a salamander using nuclear transfer. This involved enucleating a single-celled salamander embryo and inserting it with the nucleus of a differentiated salamander embryonic cell.  In 1951, Robert Briggs and Thomas Kling, using Spemann’s methods of embryonic nucleus transfer, successfully cloned frogs. In 1962, John Gurdon announced that he too had successfully cloned frogs but, unlike Briggs and Kling’s method, he did so by transferring differentiated intestinal nuclei from feeding tadpoles (Wilmut et al., 2000). Gurdon’s successful use of differentiated nuclei, rather than the embryonic nuclei used by Briggs and Kling, was particularly surprising to the scientific community. Because embryonic cells are undifferentiated, and therefore extremely malleable, it was not too surprising that transferred embryonic nuclei produced distinct embryos when inserted into an enucleated oocyte. However, inciting differentiated nuclei to behave as undifferentiated nuclei was thought to be impossible, since the conventional wisdom at the time was that once a cell was differentiated (e.g., once it became a cardiac cell, a liver cell, or a blood cell) it could never reverse into an undifferentiated state. It was for this reason that, for a long time, creating a cloned embryo from adult somatic cells was thought to be impossible – it would require taking long-time differentiated cells and getting them to behave like the totipotent cells (cells that are able to differentiate into any cell type, including the ability to form an entirely distinct organism) found in newly fertilized eggs.

In 1995, Dr. Ian Wilmut and Dr. Keith Campbell successfully cloned two mountain sheep, Megan and Morag, from embryonic sheep cells. One year later, in 1996, Wilmut and Campbell successfully cloned the first mammal to be born from an adult somatic cell, specifically an udder cell (a sheep’s mammary gland): Dolly the sheep (Wilmut et al., 1997). In other words, Wilmut and Campbell were able to take a fully differentiated adult cell and revert it back to an undifferentiated, totipotent, state. This was the first time the process had been accomplished for mammalian reproduction. Furthermore, they were able to create a viable pregnancy and produce from it a healthy lamb (however, there were 276 failed attempts before Dolly was created, which, as it will be discussed below, creates concerns over the safety and efficacy of the procedure). Dolly the sheep died in 2003 after having been euthanized due to her suffering from pulmonary adenomatosis, a disease fairly common in sheep that are kept indoors; indeed, many members of Dolly’s flock had succumbed to the same disease. Additionally, she suffered from arthritis. Before she died, she produced six healthy lambs through natural reproduction. Since Dolly, many more mammals have been cloned through the use of SCNT. Some examples are deer, ferrets (Li et al., 2006), mules (Lovgren, 2003), other sheep, goats, cows, mice, pigs, rabbits, a gaur, dogs, and cats. One possible use of reproductive cloning technology is to help save endangered species (Lanza et al., 2000). In 2005, two endangered gray wolves were cloned in Korea (Oh et al., 2008).

The successful cloning of household pets holds special significance in that, when discussing the circumstances that led to their cloning, we can begin to discuss the ethical issues that arise in human reproductive cloning. In 2001, the first feline created via somatic cell nuclear transfer was born. She was named CC, short for “Copy Cat,” and was born at the College of Veterinary Medicine at Texas A&M University. The research that led to her creation was funded by the California based company “Genetic Savings and Clone,” who, between 2004 and 2006, offered grieving pet owners a chance to clone their sick or deceased pets (they closed their doors in 2006 due to the unsustainability of their business). What is most striking about CC is not simply her mere existence, but also that CC does not look nor act like her feline progenitor, Rainbow. Whereas Rainbow, a calico, is stocky and has patches of tan, orange, and white throughout her body, CC barely resembles a calico at all. Not only is she lanky and thin, she has a grey coat over a white body and is lacking the patches of orange or tan typical to calicos. There are personality differences between Rainbow and CC as well; whereas Rainbow is described as a shy, reticent, and a more “hands-off” kind of cat, CC is described as more playful, inquisitive, and affectionate (Hays, 2003).

“Genetic Savings and Clone” was founded by Lou Hawthorne, who was seeking a means to clone his family’s beloved dog Missy. Although Missy died before she was successfully cloned, Hawthorne banked her DNA in the hopes of ultimately succeeding in this endeavor. In 2004, a Texas woman paid $50,000 to clone her deceased Maine Coone Nicky and, as a result, Little Nicky, the world’s first commercially cloned cat, was born.  This was followed, in 2005, by the birth of Snuppy, the world’s first cloned dog. In 2007, three clones from Missy’s DNA were created and returned to the Hawthorne family. All this has incited some pet owners to pay large sums of money to clone their beloved deceased pets. Alan and Kristine Wolf paid thousands of dollars to have their deceased cat, Spot, cloned from skin cells they had preserved. According to the Wolfs, preserving Spot’s skin cells, in their mind, was almost equivalent to having Spot himself preserved. In other words, the Wolfs (and the woman who cloned Nicky) were willing to spend an exorbitant amount of money to clone their pets not just in order to receive another pet, but to, rather, receive what was, in their eyes, the same pet that they had lost (Masterson, 2010).

This allows us to begin exploring the ethical issues in the reproductive cloning debate. Some questions that arise are: Why did these individuals regard the recreation of the same DNA to equate to the recreation of the same entity that had died? Will these expectations transfer over to human cloning, where people will regard cloned children as the same individuals as their genetic predecessors, and therefore treat them with this expectation in mind? Will cloning, thus, compromise a child’s identity? Are such concerns grave enough to permanently ban reproductive cloning altogether?

4. Arguments in Favor of Reproductive Cloning and Responses

a. Reproductive Liberty: The Only Way to Have a Genetically Related Child

The Argument.

Procreative liberty is a right well established in Western political culture (Dworkin, 1994). However, not everyone is physically capable of procreating through traditional modes of conception. Cloning may be the only way for an otherwise infertile couple to have a genetically related child. Therefore, providing cloning as an option contributes to a greater scope of procreative liberty (Häyry, 2003; Harris, 2004; Robertson, 1998). For example, a couple may be able to generate only a few embryos from IVF procedures; cloning via artificially induced twinning would increase the number of embryos to a quantity that is more likely to result in a live birth. In another case, the male partner in a relationship may be unable to produce viable sperm and, instead of seeking a sperm donor, the couple can choose to use SCNT in order to produce a genetic copy of the prospective father. Since the prospective mother would use her own ova, they would both contribute genetically to the child (albeit with a different proportion than a couple who conceived using gamete cells).  In yet another example, neither parent may have usable gametes, so they employ a donor ovum, clone one of the two parents, and gestate the fetus in the female’s uterus. Or, perhaps one of the prospective parents is predisposed to certain genetic disorders and, in order to completely avoid their offspring inheriting these disorders, they decide to clone the other prospective parent. A single woman may want to have a baby, and would rather clone herself instead of using donated sperm. Also, cloning may give homosexual couples the opportunity to have genetically related children (this is especially true for homosexual women where one partner provides the mitochondrial DNA and the other partner provides the chromosomal DNA). These are a few examples of how cloning may provide a genetically related child to a person otherwise unable to have one. Because cloning may be the only way some people can procreate, to deny cloning to these people would be a violation of procreative liberty (Robertson, 2006).

Response 1: Negative vs. positive right to procreate.

One response is to distinguish between a positive right to procreate and a negative right to procreate (Pearson, 2007), and argue that reproductive liberty can be fully respected in the latter sense, and only conditionally respected in the former sense. This conditional respect may support the permissibility of prohibiting human cloning for reproductive purposes.

A negative right to x means that no one has the prima facie right to interfere in your request to fulfill x.  If you possess a negative right to x, this entails only one obligation on the behalf of others: the obligation to not obstruct your obtainment of x. For example, if I have a negative right to life, what this entails is that others have an obligation to not kill me, since this obstructs or hinders my right. Another way to regard it is that a negative right only requires passive obligations (the obligation to not do something or to refrain from acting).

A positive right requires more from obligation-bearers; it requires that active steps be taken in order to provide the right-bearers with the means to fulfill that right. If I have a positive right to life, for instance, it is not just that others have an obligation to not kill me; they have a further obligation to provide me with any services that I would need to ensure my survival. That is, the obligation becomes an active one as well as a passive one: an obligation to not destroy my life and also to provide services that enable me to preserve my life.

Keeping this distinction in mind, it is possible to deny that the right to reproduce is a positive right in the first place. That is, while we ought not to prevent anyone from procreating, we are not required to provide them with any technology whatsoever in order to enable them to procreate if they cannot do so by their own means. Hence, limiting access to certain types of assisted reproductive technologies to an otherwise infertile couple would not necessarily infringe on their (negative) right to procreate (Courtwright and Doron, 2007). Some have argued the opposing side, however, and have maintained that respect for procreative liberty not only entails access to artificial reproductive technology, but also the right to employ gamete donors and surrogate mothers (Ethics Committee of the American Fertility Society, 1985).

Response 2: Procreative liberty is not categorical.

Another possible response is to stress that, even if there is a positive right to procreate, the right is a prima facie, rather than a categorical, one and it is not the case that any step taken to combat infertility is in itself ethical (McCormick, 1993).  Therefore, determining what types of services can be offered to infertile couples must be tempered with certain considerations, e.g., the safety of the offspring born as a result of these services must be taken into account. If a particular type of reproductive technology poses a health risk to the resulting children, this is grounds enough to prevent the use of that technology (Cohen, 1996). In other words, even granting that individuals have a positive right to procreate, it does not follow from this alone that they should be provided with any means necessary for successful procreation. They may not be entitled to the use of a certain technological advancement (e.g., SCNT) if that advancement is deemed to pose a danger to the resulting offspring. Robertson concedes this objection, but he responds that “if a ban on cloning is justified, then a ban on many other forms of assisted reproduction and genetic selection should be as well, yet few persons are prepared to go that far” (2006, 206).  That is, in order for advocates of this objection to be consistent, they should be equally willing to ban other forms of reproductive technology that may result in harm to potential offspring.

b. Cloning and Savior Siblings

The Argument.

The concept of a “savior sibling,” a child that is deliberately conceived so that she could provide a means (through the donation of bodily fluids, umbilical cord blood, a non-vital organ, or tissue) to save an older sibling from illness or death is not new. What is new is that cloning would ensure that the new child is an appropriate match for the existing ailing person, since they would be genetically identical. Permitting cloning, therefore, would allow for a more expedient means of creating a savior sibling, since the alternatives (using preimplantation genetic diagnosis to screen embryos to determine which are genetically compatible with the sibling, implanting into a womb only the ones that are a match and discarding the others, or creating an embryo through natural reproduction and terminating the pregnancy if it is not a genetic match) are more involved and more time consuming. Of course, the rights of the new child would have to be respected; tissue, organs, or bodily fluids should only be removed given her consent (although this would not apply to umbilical cord blood banking, since the infant lacks the capacity for giving consent) (Robertson, 2006).

Response: Violating Kant’s formula of humanity.

Such a prospect raises concerns that cloning would facilitate viewing the resulting children as objects of manufacture, rather than as individuals with value and dignity of their own. The prospect of creating a child, solely to meet the needs of another child and not for her own sake, reduces the created child to a mere means to achieve the ends of the parents and the sick child. While it is admirable that the parents wish to save their existing child, it is not ethically permissible to create another child solely as an instrument to save the life of her sibling (Quintavalle, 2001).

Another way of explaining it is that creating a child solely for the purposes of providing life-saving aid for another child violates Immanuel Kant’s second principle formulation of the categorical imperative. Kant proscribes treating persons as a mere means, rather than as ends in themselves, maintaining that persons should “act in such a way that [humanity is treated] always at the same time as an end and never simply as a means” (1981, 36). Creating a child for the sole purpose of saving another child violates the formula of humanity because the child is created specifically for this end.

It should be noted, however, that such an objection would apply to any method that is used to create a child for similar reasons, including any other type of reproductive technology or even natural procreation. It is the intention with which a child is created that is in question here, not the method that is used in order to create the child. Another response is that Kant’s dictum is misapplied. A child who is created as a “savior sibling” may still, also, be loved and respected as an individual in her own right, and therefore may not necessarily be treated solely as a means (Boyle and Savulescu, 2001).

c. Cloning In Order to “Replace” a Deceased Child

The Argument.

In his article “Even If It Worked, Cloning Won’t Bring Her Back”, ethicist Thomas Murray recounts a letter he heard read at a congressional hearing regarding human reproductive cloning. A chemist, who was presenting her views in support of reproductive cloning, read a letter by a father grieving the death of his infant son. Murray recounts as follows:

Eleven days ago, as I awaited my turn to testify at a congressional hearing on human reproductive cloning, one of five scientists on the witness list took the microphone. Brigitte Boisselier, a chemist working with couples who want to use cloning techniques to create babies, read aloud a letter from “a father (Dada).” The writer, who had unexpectedly become a parent in his late thirties, describes his despair over his 11-month-old son’s death after heart surgery and 17 days of “misery and struggle.” The room was quiet as Boisselier read the man’s words: “I decided then and there that I would never give up on my child. I would never stop until I could give his DNA – his genetic make-up – a chance” (2001).

Depriving grieving parents of this unique opportunity, the only opportunity “to get back the child that they lost,” would be morally wrong. Cloning would provide such an opportunity to grieving parents.

Response 1: Assuming genetic determinism.

Like many of the arguments against reproductive cloning listed below, this argument in favor of cloning, despite its emotional appeal, erroneously assumes that genetic determinism is true. The grieving father’s letter maintained that he would never “give up on my child”, and that the way he would achieve this is to “give his DNA – his genetic make-up – a chance.” In other words, the father equated his son as an individual person to his genetic make-up; because he could recreate his son’s genes, he could recreate his son as a person. The tacit implication here is that cloning is desirable because it somehow presents a way to cheat death. It is through cloning that his son could be, in some sense, resurrected.

Given that individuals have sought to clone their deceased pets, the idea that grieving parents would seek to clone a deceased child is not far-fetched. Thomas Murray continues his article by disclosing that he too is a grieving father, having suffered the death of his twenty-year-old daughter who was abducted from her college campus and shot. Yet cloning, Murray continues, “can neither change the fact of death nor deflect the pain of grief” (2001). Murray goes on to stress that, due to varying other influences outside of genetic duplication, a clone would not, in fact, be a mere copy of its genetic predecessor. One interesting point is that both detractors of cloning (e.g., Kass and Callahan, whose views are explored below) and supporters of cloning (like the researcher that read this letter at the congressional hearing) find convergence in committing the same fallacy. Both assume that cloning recreates identity, and they differ only as to the desirability of that consequence. Yet, given that we have evidence that the robust form of genetic determinism these arguments assume is false (Resnik and Vorhaus, 2006; Elliot, 1998), both detractors and supporters of cloning who rely on it produce faulty arguments.

Response 2: A child is not replaceable.

Given the evidence that genetic determinism is false, Murray further stresses that using cloning as a method of replacing a dead child “is unfair. No child should have to bear the oppressive expectation that he or she will live out the life denied to his or her idealized genetic avatar…. Cloning a child to be a reincarnation of someone else is a grotesque, fun-house mirror distortion of parental expectations” (2001). Dan Brock further supports the contention that cloning in order to replace a deceased child is misguided (Brock, 1997). Moreover, because parents have cloned this child with the expressed purpose of replacing a deceased child, the expectations that the new child will be just like the deceased one would be overwhelming and impede the child’s ability to develop her own individuality (Levick, 2004). It should be stressed, however, that this response targets a particular use of cloning (one based on faulty assumptions), not the actual cloning procedure.

d. The Resultant Loss of Therapeutic Cloning for Stem Cell Research and Treating Diseases

The Argument.

Although SCNT is used to create embryos for therapeutic cloning, there is no intent to implant them in order to create children. Rather, the intent is to use the cells of the embryo in order to further research that may ultimately lead to treatments or cures for certain afflictions. Therefore, a categorical ban on SCNT affects not just the prospect of reproductive cloning, but also the research that could be done with cloned embryos. At the very least, the argument concludes, SCNT should be allowed for research and therapeutic purposes (Devolder and Savulescu, 2006; American Medical Association, 2003; Maas, 2001). This was the position presented by Senator Arlen Specter in his proposed Senate Bill 2439, called the “Human Cloning Prohibition Act of 2002:  A Bill to Prohibit Human [Reproductive] Cloning While Preserving Important Areas of Medical Research, Including Stem Cell Research.”

Response 1: Therapeutic cloning leads to reproductive cloning.

The first response maintains that, because therapeutic cloning and reproductive cloning both implement SCNT, allowing the procedure to be perfected for therapeutic cloning makes it more likely that it will later be used for reproductive purposes (Rifkin, 2002; Kass, 1998)

Response 2: Embryo experimentation is unethical.

The second response applies not just to therapeutic cloning, but to any type of embryo experimentation. From the time that an ovum is fertilized and syngamy (the fusion of two gametes to form a new and distinct genetic code) has successfully taken place, there exists a subject, the embryo, which is a bearer of dignity, moral status, and moral rights. It is unethical to experiment on an embryo for the same reason it is unethical to experiment on any human being and since embryo experimentation often results in the destruction of the embryo, this equates to murdering the embryo (Deckers, 2007; Oduncu, 2003; Novak, 2001). Typically, those who offer the second response (e.g., the Catholic Church) regard the human embryo as a complete moral subject upon conception (Pope John Paul II, 1995; Pope Paul VI, 1968), and therefore any experiment that harms them or destroys them is morally tantamount to any experiment that would destroy a person.

5. Arguments Against Reproductive Cloning and Responses

a. The Right to an Open Future

The Argument.

According to some ethicists who oppose human cloning, a cloned child’s identity and individuality will be compromised given that she will be “saddled with a genotype that has already lived” (Kass, 1998, 56; see also Annas, 1998 and Kitcher, 1997). Because of the expectations that the cloned child will re-live the life of her genetic predecessor, the child would necessarily be deprived of her right to an open future. Because all children deserve to have a life and a future that is completely open to them in terms of its prospects (Feinberg, 1980), and because being the product of cloning would necessarily deprive the resulting child of these prospects, cloning is seriously immoral. In a sense, this objection maintains that a cloned child would either lack the free will to live her life according to her own desire and goals or that, at the very least, her free will would be severely restricted by her parents or the society that has certain expectations of her given her genetic lineage. The child would be destined to live in the shadows of her genetic predecessor (Holm, 1998).

Response 1: Faulting cloning for the misconceptions of others.

This argument is unsuccessful in illustrating that there is something intrinsically morally wrong with cloning. The subject of this objection is not cloning itself, but rather the erroneous attitude that parents will have in regard to their cloned child. The child’s very desire to be different from her predecessor illustrates that she is not destined to be like her predecessor. Once prospective parents, and society in general, come to understand that cloned children will possess just as much individuality as any other person, it is possible that these fears, and the attempts to control the child’s future, will largely abate (Wachbroit, 1997). Additionally, if the reason people treat cloned children unfavorably is due to their misconceptions about cloning, then the proper response is not to ban cloning at the expense of compromising procreative liberty, but rather work to rectify these prejudices and misconceptions (Burley and Harris, 1999).

Moreover, it is not just parents of cloned children that may be guilty of violating the child’s right to an open future;  many parents are, to varying degrees of severity, already guilty of violating such a right with their naturally created children, and often times those attempts are subject to failure (see Agar, 2004, 106 for such an example). If such parents are not deprived of their opportunity to have children out of concern that they will violate their child’s right to an open future, then we seem hard pressed to find a reason to deprive couples who would turn to cloning for reproductive purposes of a similar opportunity.

Response 2: Assuming genetic determinism (again).

At its core, however, this objection assumes the very controversial thesis that either a person’s genes play an almost fatalistic role in her life decisions, or that individuals in society will assume some robust version of genetic determinism to be true and will treat cloned children according to that assumption. As abovementioned, there is much evidence to suggest that genetic determinism is not true. In their article “Genetic Modification and Genetic Determinism,” David Resnik and Daniel Vorhaus state that, when it comes to genetic modification, “even if a desired trait is successfully expressed it may not actually restrict options for the child… the open future critique paints with a far broader brush, alleging that the act of modification per se impacts the child’s right to an open future. And it is this claim that we reject…” (2006, 9). The same can be said about cloning (Pence, 1998 and 2008; Wachbroit, 1997). Even if a cloned child did display certain behavioral traits belonging to her genetic predecessor, it is unclear whether the similarity in traits entails that a child’s future would be closed off. Moreover, there is much evidence that, usually, the general public rejects genetic determinism (Hopkins, 1998).

There is evidence, however, that some would regard cloning as a method for resuscitating the dead (the grieving father in Murray’s article attests to this, as well as the individuals who are willing to pay thousands of dollars in order to clone a deceased pet). This supports Kass’ claim that many people may expect a cloned child to be like her genetic predecessor. However, this misconception may quickly be rectified simply by observing the unique personality of the cloned child, especially since her experiences and her nurture, removed by at least a generation, will be substantially different than that of her genetic predecessor (Dawkins, 1998; Pence, 1998).

b. The Right to a Unique Genetic Identity

The Argument.

Because cloning entails recreating an existing person’s genetic code (with the exception of the difference in mitochondrial DNA), some argue that cloning would, necessarily, entail a violation of the cloned child’s right to a distinctive genetic identity (European Parliament, 1998). According to this objection, our DNA is what endows each human being with uniqueness and dignity (Callahan, 1993). Because cloning recreates a pre-existing DNA sequence, the cloned child would be denied that uniqueness and, therefore, her dignity would be compromised. This objection appears to be an incarnation of the objection from the Right to an Open Future. Certainly the concerns are similar: that a cloned child would be deprived of her own individual identity because of her genetic origins. However, whereas in the objection from the Right to an Open Future, the cloned child is deprived of individuality based on the perception of others (and, as is developed above, this does not seem to really be an objection to the practice of cloning simpliciter), this objection indicates that there is something inherently individuality-compromising, and therefore dignity-compromising, in recreating an existing genetic code. If this objection is successful, if recreating a pre-existing genetic code is intrinsically morally objectionable, then it would seem to present an objection to the actual cloning process.

Response 1: Genetic duplication and identical multiples.

Callahan argues that there is something intrinsically identity-depriving, and therefore dignity-depriving, in duplicating a genetic code. However, there is much evidence to counter this claim. As abovementioned, CC the cat neither looks nor acts like Rainbow, her genetic predecessor. However, the strongest evidence against this claim is the existence of identical multiples, who are, in essence, clones of nature (Pence, 2004; Gould, 1997). No one claims that identical multiples’ right to a unique genetic identity was compromised simply in virtue of their creation, which calls into question whether such a right exists in the first place (Silver, 1998; Tooley, 1998; Rhodes, 1995). If Callahan’s concerns were accurate, identical multiples would fail to be individuals in their own right, and, consequently, be harmed because of this. However, there is no evidence that identical multiples feel this way, and there does not seem to be anything inherent about sharing a genetic code that compromises individuality (Elliot, 1998). The fact that identical multiples do not seem harmed or deprived of individuality merely by virtue of not possessing a unique genetic code is evidence that Callahan’s concern against cloning in this regard is misguided.

Response 2: Forgetting nurture.

Lastly, proponents of this objection ignore the very important role that nurture has in shaping a person’s identity. A cloned child would be gestated in a different uterine environment. She would be born into either the same family, but with a different dynamic, as her genetic predecessor, or be born into a different family altogether. She would also likely be raised in a much different society (e.g., a child born in 2010 would have vastly different social influences than a child born in the 1960s or 1970s). She would have different friends, attend different schools, play different games, watch different television shows, listen to different music. The generational and historical differences between a clone and her genetic predecessor would undoubtedly go a long way when it comes to shaping the personality of the former (Pence, 1998; Dawkins, 1998; Harris, 1997; Bor, 1997).

What forms or shapes each person’s individual identity is an intricate interaction of genetics and nurture (Ridley, 2003). While being genetically identical to a pre-existing person will most likely result in some similarities, it will certainly not be strong enough to deprive a cloned child of her individuality or dignity.  A cloned child’s future would remain open, and there is no evidence that she is denied something irreplaceably unique by not having a unique genetic code. Moreover, concerns that genetic duplication compromises dignity overemphasize the role that genetics has as the source of human dignity. Human dignity, some philosophers have argued, has its source in virtue of our being persons and autonomous rational beings. Since, presumably, a clone would still be a person and an autonomous rational being, a clone would certainly retain her human dignity (Glannon, 2005; Elliot, 1998).

c. Cloning is Wrong because it is “Playing God” or because it is “Unnatural”

The Argument.

Another common concern is that cloning is morally wrong because it oversteps the boundaries of humans’ role in scientific research and development. These boundaries are set by either God (and therefore cloning is wrong because it is “playing God”) or nature (and therefore cloning is wrong because it is “unnatural”). Any method of procreation that does not implement traditional modes of conception, i.e., not involving the union of sperm and ova, is guilty of one (or both) of these infractions (Goodman, 2008; Tierney, 2007).  Moreover, advocates of this objection caution against removing God from the process of creation altogether, which, it is argued, is what reproductive cloning achieves (Rikfin, 2000).

Response 1: Clarifying the meaning of “playing God.”

Advocates of the “playing God” objection have the onus to define exactly what “playing God” means. One possible definition of “playing God” is that anything that interferes with nature, or the natural progression of life, interferes with God’s plan for humanity, and is therefore morally wrong. But this is too vague; humans constantly interfere with nature in ways that are not morally criticized. Almost all instances of medical advancements in the past 100 years (e.g., vaccines against diseases, respirators, incubators for preterm infants, pacemakers, etc.) interfere with nature in the sense that they prevent otherwise harmful or fatal afflictions from taking their toll on a human body. Would the same advocates of this objection against cloning object to artificial insulin injections to treat diabetes? (Glannon, 2005). To be more extreme, almost everything humans engage in, from wearing clothing, to using phones and computers, to indoor plumbing, all, in some sense, interfere with some aspect of nature.

Perhaps the more charitable understanding is that “playing God” is morally wrong when it comes to cloning because it is a process that artificially creates life, outside of the practice of sexual intercourse (Meilaender, 1997). Adhering to this definition of “playing God”, however, would condemn any form of artificial reproductive technology, as well as cloning, e.g., IVF, artificial insemination, or intrauterine insemination. In addition, anything that thwarts the natural process of conception (i.e., birth control) may also be morally condemned.  In the “Instruction on Respect for Human Life in Its Origin and on the Dignity of Procreation,” the Catholic Church denounces all forms of reproductive technology on the grounds that reproductive creation is strictly God’s domain (Congregation for the Doctrine of the Faith, 1987). However, most people who denounce human cloning on the grounds that it “plays God” do not denounce other forms of artificial reproduction on similar grounds.

Response 2: Knowing God’s will.

Yet another response is that this objection purports to know what God’s will is in regards to technological advancements such as cloning. However, since key religious texts (e.g., The Bible, The Torah, or the Qu’ran) make no mention of such advancements, it is presumably impossible to determine what God would have to say about them. In other words, inferences about God’s will on such matters are tenuous because we have little basis from which to draw these purported moral inferences (Pence, 2008).

Response 3: Biologism Fallacy.

One response to the “unnatural” objection is similar to the first response to the “playing God” objection; most everything humans do, from medicine to modern forms of sanitation, are “unnatural”, and most are not considered morally objectionable as a consequence. A second response is that such an objection commits what philosopher Daniel Maguire calls the “Biologism Fallacy”: “the fallacious effort to wring a moral mandate out of raw biological facts” (1983, 148). In other words, “unnatural” is not synonymous with “immoral” (and conversely, “natural” is not synonymous with “moral”). While it is true that cloning (along with other types of reproductive technologies) is not the “natural” way of conceiving a child, this alone does not render cloning immoral.

d. The Dangers of Cloning

The Argument.

Many philosophers and ethicists who would otherwise support reproductive cloning concede that concern for the safety of children born via cloning is reason to caution against its use (Harris, 2004; Glannon, 2005).  The claim is that a cloned child would be in danger of suffering from severe genetic defects as a result of being a clone, or that cloning would result in a high number of severely defective embryos before one healthy human embryo is developed. Ian Wilmut, Dolly’s creator, has denounced human reproductive cloning as too dangerous to attempt (Travis, 2001). According to Wilmut, “Dolly was derived from 277 embryos, so the other 276 didn’t make it. The previous year’s work, which led to the birth and survival of Megan and Morag, used more than 200 embryos. We have success rates of roughly one in a hundred or less” (Klotzko, 1998, 134). Even if a clone were to appear healthy at birth, there are concerns about health problems arising later in life. For example, while there is no evidence that Dolly’s respiratory issues were due to her being a clone, questions remain whether her arthritis, which is uncommon among sheep her age, could have resulted because of the nature of her genesis (Williams, 2003). Even attempting to perfect human reproductive cloning would entail a trial and error approach that would lead to the destruction of many embryos, and may produce severely disabled children before a healthy one is born.

Response 1: The nonidentity problem.

One response typically given by philosophers when concerning the ethics of preconception decisions that may lead to the birth of a disabled child involves an appeal to Derek Parfit’s nonidentity problem (Parfit, 1984, though Parfit himself does not apply this to cloning). Applied to preconception choices, Parfit’s argument can be applied as follows. Suppose I desire to get pregnant, but am currently suffering from a physical ailment that would result in conceiving and birthing an infant with developmental impairments. Yet, if I were to wait two months, my ailment would pass and I would conceive a perfectly healthy baby. Most people would agree that I should wait those two months; and, indeed, if I do not wait, many people would say that I acted wrongly. The resulting child, moreover, would most likely be identified as the victim of my actions. This intuitive response, however, is surprisingly tricky to defend.  If harm is defined as making someone worse off than she otherwise would have been, it is difficult to maintain that I harmed the resulting child by my actions, even if she were impaired. For the child that would have been born two months later would not have been the same child that is born if I do not wait; the impaired child would never have existed had I waited those two months. Unless the child’s life is so bad that her nonexistence would be preferable, I did not make the child worse off by conceiving her and giving birth to her with those impairments, and thus I did not harm her. Because I did not harm her, I did not do anything morally wrong in this circumstance. The argument can best be standardized as follows:

1. I have only harmed an individual if I had made her worse off than she otherwise would have been had it not been for my actions.

2. Only if I have harmed someone can my action be deemed morally wrong.

3. A child born with mental, physical, or developmental impairments usually does not have a life that is so bad that it renders nonexistence preferable.

4. Therefore, a child born with mental, physical or developmental impairments is not made worse off by being brought into existence.

5. Therefore, deliberate conception, gestation, and birthing of a child with mental, physical, or developmental impairments does not, usually, harm the child (unless the impairments are so bad that they make the child’s life worse than not having existed at all).

6. Therefore, I have (usually) done nothing morally wrong by deliberately bringing into existence a child who suffers from mental, physical, or developmental impairments.

Using the nonidentity problem in the context of the reproductive cloning debate yields the following result: The alternative to being born a clone is not to be born at all. Unless the cloned child’s life is made so horrible by her disabilities that it would have been better that she not been born at all, she was not harmed by being brought into existence via cloning, even if she is born with genetic defects as a result. As long as the cloned child has a life that, despite her genetic defect, is still worth living, then it would still be permissible to use cloning to bring her into being (Lane, 2006).

It is important to note, however, that the nonidentity problem is controversial, and that not all philosophers and ethicists agree with its conclusion (Weinberg, 2008; Cohen, 1996). Indeed, many argue that it would be morally impermissible to bring a child into the world who suffers, even if the child’s life has a net value that renders it worth living (Steinbock and McClamrock, 1994).

Response 2: The dangers of natural reproduction.

Natural reproduction can itself produce dangerous results. Women dispose of fertilized eggs during their menstrual cycle more often than they are aware; one study claims that as many as 73% of fertilized eggs do not survive to 6 weeks gestation (Boklage, 1990). From the ones that do implant, approximately 2% to 3% of newborn infants suffer from congenital abnormities of varying degrees of severity (Kumar et al., 2004). If safety concerns about cloning are severe enough to ban its practice, this can only be justified if cloning were more risky (that is, resulted in the birth of more children with more severe abnormalities) than natural reproduction. Some couples choose to reproduce in full knowledge that one or both of them harbor genetic disorders that may be passed along to their offspring, and some of these are rather severe, such as Huntington’s disease. Yet these parents are not prohibited from procreating because of this. Therefore, if parents are not prohibited from procreating on the grounds that they may pass along a severe genetic defect to their children, then it is difficult to deny a set of parents who can only rely on cloning for procreation the chance to do so based on safety reasons alone (unless the abnormalities that may result from cloning are more severe than the abnormalities that may result from natural conception) (Brock, 1997). Similarly, objecting to cloning on the grounds that embryos are sacrificed in order to achieve a live birth is only a valid objection if the number of embryos lost are greater in cloning than in natural reproduction.

Finally, even if safety concerns are sufficient to warrant a current ban on human reproductive cloning, such concerns would be temporary, and would abate as cloning becomes safer. Indeed, safety concerns led the National Bioethics Advisory Commission (1997) to recommend a temporary, rather than permanent, moratorium on human reproductive cloning.

e. Cloning Entails the Creation of Designer Children, or it Turns Children into Commodities

The Argument.

If we engage in cloning, this objection goes, we run the risk of inserting our will too much into our procreative decisions; we would get to choose not just to have a child, but what kind of child to have. In doing so, we run the risk of relegating children to the status of mere possessions or commodities, rather than regarding them as beings with their own intrinsic worth (Harakas, 1998; Kass, 1998; Meilaender, 1997).  When a couple engages in sexual intercourse and produces a baby, the child is an “offspring of a man and woman, but a replication of neither; their offspring but not their product whose meaning and destiny they might determine” (Meilaender, 1997, 42). Because cloning involves the artificial process of recreating a pre-existing genetic code, prospective parents could, first, choose their child’s DNA (thereby creating a “designer child”), and, second, because they are creating a “replica” of an existing person, they will consider the child more akin to property than an individual in her own right. These factors will contribute to viewing and treating the child as a mere commodity. The more “artificial” conception becomes, the more the resulting children will be seen as the possessions of the parents, rather than as persons in their own right. Rev. Stanley Harakas puts this point as follows: “Cloning would deliberately deny by design the cloned human being a set of loving and caring parents. The cloned human being would not be the product of love, but of scientific procedures. Rather than being considered persons, the likelihood is that these cloned human beings would be considered ‘objects’ to be used” (1998, 89).

Although he rejects the contention that clones would not be considered persons, Thomas Shannon expresses concerns that the increasing artificiality of conception, not just via the use of cloning, but via the use of all forms of artificial reproductive technologies, will “transform our thinking about ourselves, and the transformation will be in a mechanistic direction” (Shannon and Walter, 2003, 134). That is, the move away from natural conception towards artificial conception will lead to humans collectively regarding themselves as more machine-like rather than as organic beings.

Response 1: Cloning is not genetic modification.

Cloning does not necessarily entail the creation of “designer” children because cloning recreates a pre-existing DNA; it does not involve modifying or enhancing DNA in order to produce a child with certain desired traits. Cloning is not to be equated with genetic modification or enhancement (Wachbroit, 1997; Strong, 1998).

Response 2: Natural vs. artificial conception.

Advocates of the objection that cloning results in the transformation of procreation into manufacture seem to assume that, whereas we do not consider children that arise from natural reproduction as ours to do what we wish with, we would if they arise from artificial conception. That is, the tacit premise is that there is some trait inherent in artificial (i.e., non-sexual) conception that necessitates parents regarding their children as mere objects, and this trait is not found in “natural” conception. Yet, we can look towards the children who are products of modern day artificial reproduction in order to see that such a concern is not supported by the evidence. There are many children who are products of artificial reproductive technologies (IVF, intrauterine insemination, gender selection, and gamete intrafallopian transfer, among others) and there does not seem to be an increase of despotic control over these children on behalf of their parents. One study found that children born from IVF and DI (donor insemination) are faring as well as children born via natural conception. More importantly, given Meilaender’s concern that the quality of parenting is compromised in tandem with the artificiality of conception, the study found that “the quality of parenting in families with a child conceived by assisted conception is superior to that shown by families with a naturally conceived child, even when gamete donation is used in the child’s conception” (Golombok et al., 1995, 295; also see Golombok, 2003 and Golombok et al., 2001).

Meilaender may respond that, in these cases, the children are still a product of a unification of sperm and ovum, whereas this is not the case with cloning. However, it is unclear why generating a child via somatic cells is more likely to foster despotism than when the child is generated using germ cells. Some have argued that, on the contrary, a cloned child would feel even closer to the parent from whom she was cloned, given that they would share all their genetic information, rather than just half (Pence, 2008). Moreover, the findings of the study supported the thesis that “genetic ties are less important for family functioning than a strong desire for parenthood” (Golombok et al., 1995, 296), which suggests that the parents of cloned children would not be as caught up with the genetic origins of their offspring, and so their parenting would not be as affected by it, as Meilaender contends. According to the study, the quality of parenting increased in tandem with the amount of effort it took to achieve parenthood. It could be argued, therefore, that the quality of parenting for cloned children would be just as good, if not superior, to that of naturally conceived children.

Response 3: Clones would not be loveless creations.

Harakas claims that cloned children will be deprived of loving parents because their genesis will be one of science, rather than love. The studies conducted by Golombok certainly seem to provide evidence to the contrary. Intentionally taking steps to create a child via cloning (or any other kind of reproductive technology) could be seen, instead, as a mutual affirmation of love on behalf of the prospective parents and clear evidence that they really desired the resulting child. Whereas in sexual reproduction the child may be a product of chance, a cloned child would be a product of deliberate choice, which, according to some philosophers, could be a superior method of creation in some respects (Buchanan et al. 2000). Creating a child via cloning does not entail that there is a lack of mutual love between the parents, or that the resulting child would be any less loved (Strong, 1998). Genesis via sexual reproduction is neither a necessary nor a sufficient condition for being born to a set of loving parents and in a nurturing environment.

f. Cloning and the Ambiguity of Familial Roles

The Argument.

Genetically speaking, a cloned child would be her genetic predecessor’s identical twin sibling. If the child is cloned with the intent to serve as the social child of her genetic predecessor, she would be, genetically, her social mother’s twin sister (or his social father’s twin brother), and her social grandparents’ genetic daughter. The concern is that such a radical alteration of familial relationships would be detrimental to the cloned child (Kass, 1998; O’Neil, 2002). As Paul Ramsey puts it: “To mix the parental and the twin relation might well be psychologically disastrous for the young” (Ramsey, 1970). Wide-spread cloning would exacerbate the problem by distorting generational boundaries, which would add a layer of confusion to society’s conception of the nature of the family, and the roles of its individual members (Kass, 1998).

Response 1: No such confusion is likely to arise.

There are two responses to this response. First, doubts can be cast as to whether this confusion would really ensue. Second, even if such confusion did result, it is questionable whether it would be any more detrimental to the child than any confusion that currently exists about parental roles given certain reproductive technologies. For example, it is physically possible for a child to have as many as six distinct “parents”: three genetic parents (the mitochondrial DNA donor, the somatic cell donor used to re-nucleate an enucleated ovum, and the sperm donor), one gestational parent, and two (perhaps even more) social parents. If a cloned child would not experience any less confusion than a child in such a situation, then we would be hard pressed to show why the prospective parents of the former ought to be denied the opportunity to have a genetically related child based on these grounds alone (Harris, 2004). Moreover, doubts can be cast as to whether the ambiguity of genetic lineage caused by the cloning relationship will really result in the consequences Kass and O’Neil are fretting. A social father, for example, is not likely to suddenly rescind his responsibilities toward his daughter because the child is, genetically, his wife’s twin sister (Wachbroit, 1997). Finally, as is evident from children raised by adoptive parents, social parents usually retain the honorific role as the child’s “real” parents, even though there are no genetic ties between them and the adopted child. In other words, what defines a parent seems to have less to do with genetics and more to do with who performs the social role of mother and father (Purdy, 2005).

Response 2: Such confusion would not warrant a prohibition on cloning.

Even if there were such confusion, however, would it be so detrimental as to warrant banning reproductive cloning altogether? Moreover, even if there were a detriment, it is unclear whether that would be a result of society’s prejudice and fear of human cloning, or a result that inherently comes with being a clone. Finally, it would have to be clear that being the genetic twin to a social parent is so detrimental that it would warrant interfering with the prospective parents’ reproductive liberty. Indeed, for any purported harm that may come from cloning (whether physical, psychological, or emotional), it must be argued why those harms are sufficient for banning reproductive cloning if comparable harm would not be sufficient for banning any other kind of reproductive method, whether natural or artificial (Harris, 2004; Robertson, 2006).

6. References and Further Reading

  • Agar, Nicholas (2004), Liberal Eugenics: A Defence of Human Enhancement. Malden: Blackwell.
  • Allison, Lizabeth (2007), Fundamental Molecular Biology, chapter 8: “Recombinant DNA Technology and Molecular Cloning.” Malden: Blackwell, pp. 180-231.
  • Annas, George et  al. (1996), “The Politics of Human-Embryo Research – Avoiding Ethical Gridlock.” New England Journal of Medicine, 334.20: 293-340.
  • Annas, George (1998), “The Prospect of Human Cloning: An Opportunity for National and International Cooperation” in Human Cloning: Biomedical Ethical Review. James Humber and  Robert Almeder (eds). Totawa: Humana Press, pp. 53-63.
  • Boiani, Michele and Hans Schöler (2002), “Determinants of Pluripotency in Mammals” in Principles of Cloning. Jose Cibelli, Robert Lanza, Keith Campbell, Michael D. West (eds.) New  York: Academic Press, pp. 109-152.
  • Boklage, Charles (1990), “Survival Probability of Human Conceptions from Fertilization to Term.” International Journal of Fertility, 35.2:75-94.
  • Bor, Jonathan, “Cloning Adds a Dimension to Nature-Nurture Debate: Identical Humans are Not in the Cards.” The Baltimore Sun, March 9, 1997.
  • Boyle, Robert and Julian Savulescu (2001) “Ethics of Using Preimplantation Genetic Diagnosis to Select a Stem Cell Donor for an Existing Person.”  BMJ 323:1240-1243.
  • Brannigan, Michael (ed.) (2001), Ethical Issues in Human Cloning. New York, NY: Seven Bridges Press.
  • Brock, Dan (1997), “Cloning Human Beings: An Assessment of the Ethical Issues Pro and Con,” in Cloning Human Beings Volume II: Commissioned Papers. Rockville, MD: National Bioethics Advisory Commission.
  • Buchanan, Allen et al. (2002), From Chance to Choice: Genetics and Justice. Cambridge: Cambridge University Press.
  • Burely, Justin and John Harris (1999), “Human Cloning and Child Welfare.” Journal of Medical Ethics, 25.2:108-113.
  • Callahan, Daniel (1993), “Perspective on Cloning: A Threat to Individual Uniqueness; an Attempt to Aid Childless Couples by Engineered Conceptions Could Transform the Idea of Human Identity” in Los Angeles Times, Nov. 12, 1993.
  • Cohen, Cynthia (1996), “’Give Me Children or I’ll Shall Die!’: New Reproductive Technologies and Harm to Children.” Hastings Center Report, 26.2: 19-27.
  • Courtwright, Andrew M. and Wechsler Doron (2007), “Is Restricting Access to Assisted Reproductive Technology an Infringement of Reproductive Rights?”  American Medical Association Journal of Ethics, 9.9:  635-640.
  • Dawkins, Richard (1998), “What Wrong with Cloning?” in Clones and Cloning: Facts and Fantasies About Human Cloning. Martha Nussbaum and Cass Sunstein (eds). New York: Norton and Company Inc, pp. 54-66.
  • Deckers, Jan (2007), “Are Those Who Subscribe to the View that Early Embryos are Persons Irrational and Inconsistent?: A Reply to Brock.” Journal of Medical Ethics, 33.2: 102-106.
  • Devolder, Katrien and Julian Savulescu (2006), “The Moral Imperative to Conduct Cloning and Stem Cell Research.” Cambridge Quarterly of Healthcare Ethics, 15.1: 7-21.
  • Doetschman, R.G. et al. (1987), “Targeted Correction of a Mutant HPRT Gene in Mouse Embryonic Stem Cells.” Nature, 330: 576–578
  • Dworkin, Ronald. Life’s Dominion: An Argument About Abortion. Euthanasia, and Individual Freedom. New York: Vintage Press.
  • Elliot, David (1998), “Uniqueness, Individuality, and Human Cloning,” Journal of Applied Philosophy 15.3: 217-230.
  • Ethics Committee of the American Fertility Society (1985), “The Constitutional Aspects of Procreative Liberty” in Ethical Issues in the New Reproductive Technologies. Richard T. Hull (ed.)Belmont: Wadsworth Publishing Company, pp. 8-15.
  • Feinberg, Joel (1980), “A Child’s Right to an Open Future” in Philosophy of Education: An Anthology. Randall Curren (ed.) Malden: Blackwell, pp. 112-123.
  • Freed. Curt. R. et al. April 21, 1999. “Double-Blind Controlled Trial of Human Embryonic Dopamine Cell Transplants in Advanced Parkinson’s Disease: Study, Design, Surgical Strategy, Patient Demographics, and Pathological Outcome” (presented to the American Academy of Neurology).
  • French, Andrew J. et al. (2008), “Development of Human Cloned Blastocysts Following Somatic Cell Nuclear Transfer with Adult Fibroblasts.” Stem Cells 26.2: 485-493.
  • Gearhart, John D (1998), “New Potential for Human Embryonic Stem Cells,” Science 282: 1061.
  • Glannon, Walter (2005), Biomedical Ethics. New  York: Oxford University Press.
  • Golombok, Susan et al. (1995), “Families Created by the New Reproductive Technologies: Quality of Parenting and Social and Emotional Development of the Children.” Child Development, 66.2: 285-298.
  • Golombok, Susan (2001), “The “Test-Tube” Generation: Parent-Child Relationships and the Psychological Well-Being of In Vitro Fertilization Children at Adolescence.” Child Development, 72.2: 599-608.
  • Golombok, Susan (2003), “ Reproductive Technology and Its Impact on Child Psychosocial and Emotional Development” in: Tremblay RE, Barr RG, Peters RDeV (eds). Encyclopedia on Early Childhood Development [online]. Montreal, Quebec: Centre of Excellence for Early Childhood Development; 2003:1-7.
  • Goodman, Jim. “Cloning Animals is Unnatural, Unethical.” The Capital Times, January 25, 2008.
  • Gould, Stephen Jay (1997), “Individuality: Cloning and Discomfiting Cases of Siamese Twins.” The Sciences 37: 14-16.
  • Harakas, Stanley (1998), “To Clone or Not to Clone?” in Ethical Issues in Human Cloning. Edited by Michael C. Brannigan. New York, NY: Seven Bridges Press, pp. 89-90.
  • Harris, John (1997), “Good-bye, Dolly?: The Ethics of Human Cloning.” Journal of Medical Ethics, 23.6: 353-360)
  • Harris, John (2004), On Cloning. New York: Routledge.
  • Häyry, Matti (2003), “Philosophical Arguments For and Against Human Cloning.” Bioethics, 17.5-6: 447-459.
  • Holm, Soren (1998), “A Life in Shadows: One Reason Why We Should Not Clone Humans.” Cambridge Quarterly of Healthcare Ethics, 7.2: 160-162.
  • Hopkins, Patrick. “Bad Copies: How Popular Media Represent Cloning as an Ethical Problem” in Ethical Issues in Human Cloning. Edited by Michael C. Brannigan. New York, NY: Seven Bridges Press, pp. 128-140.
  • Illmensee, Karl et al. (2009), “Human Embryo Twinning with Applications in Reproductive Medicine.” Fertility and Sterility 93.2: 423-427.
  • Kant, Immanuel (1981), Grounding for the Metaphysics of Morals. Indianapolis: Hackett Publishing Company.
  • Kass, Leon (1998), “The Wisdom of Repugnance: Why We Should Ban the Cloning of Humans” in Ethical Issues in Human Cloning. Edited by Michael C. Brannigan. New York, NY: Seven Bridges Press, pp. 43-66.
  • Kitcher, Phillip (1997), “Who’s Self is it Anyway?” Sciences, 37.5: 58-62.
  • Klotzko, Arlene (1998), “Voices from Roslin: The Creators of Dolly Discuss Science, Ethics, and Social Responsibility.” Cambridge Quarterly of Healthcare Ethics, 7.2: 121-140.
  • Kumar, Vinay et al. (2004), Robbins & Cotran Pathologic Basis of Disease, 7th Edition. New York: Saunders.
  • Lane, Robert (2006), “Safety, Identity, and Consent: A Limited Defense of Reproductive Human Cloning.” Bioethics, 20.3: 125-135.
  • Lanza, Robert et al. (2000), “Cloning Noah’s Ark.” Scientific American, 283.5:84-89.
  • Levick, Stephen (2004), Clone Being: Exploring the Psychological and Social Dimensions. Lanham: Rowman and Littlefield Publishers, Inc.
  • Li, Ziyi (2006), “Cloned Ferrets Produced by Somatic Cell Nuclear Transfer.” Developmental Biology, 293.2: 439-448
  • Lovgren, Stefan, May 29, 2003. ”U.S. Team Produces First Mule Clone.” National Geographic News.
  • Lumelsky, Nadya et al. (2001), “Differentiation of Embryonic Stem Cells to Insulin- Secreting Structures Similar to Pancreatic Islets,” Science, 292: 1389–1394.
  • Maas, Heiko. Auigust 24, 2001. “Taking the Stand for Therapeutic Cloning.” Science Career Magazine.
  • Maguire, Daniel (1983), “The Morality of Homosexual Marriages” in Same-Sex Marriage: The Moral and Legal Debate. Amherst: Prometheus Books, pp. 147-161.
  • McCormick, Richard. “Should We Clone Humans?” The Christian Century, November 17-24, 1993.
  • McCormick, Richard (1994), “Blastomere Separation: Some Concerns.” Hastings Center Report 24.2: 14- 16.
  • Meilaender, Gilbert (1997), “Begetting and Cloning.” First Things, 74: 41-43.
  • Murray, Thomas April 8, 2001. “Even If It Worked, Cloning Won’t Bring Her Back.” The Washington Post.
  • National Bioethics Advisory Commission. “Cloning Human Beings: Reports and Recommendations.” June 9, 1997.
  • National Institute of Neurological Disorders and Stroke.  March 25, 1999, “What Would You Hope to Achieve from Stem Cell Research?” Response to Senator Arlen Specter’s Inquiry.
  • Novak, Michael. September 3, 2001. “The Stem Cell Slide: Be Alert to the Beginnings of Evil.”  National Review.
  • O’Neil, Onera (2002),  Autonomy and Trust in Bioethics. Cambridge: Cambridge University Press.
  • Oduncu Fuat S. (2003), “Stem Cell Research in Germany Ethics of Healing vs. Human Dignity.” Medicine, Healthcare, and Philosophy 6.1: 5-16.
  • Oh, H.J. (2008), “Cloning Endangered Gray Wolves (Canis lupus) from Somatic Cells Collected Postmortem.” Theriogenology, 70.4: 638-647.
  • Outka, Gene (2002), “The Ethics of Human Stem Cell Research.” Kennedy Institute of Ethics Journal 12.2: 175-213.
  • Parfit, Derek (1984), Reasons and Persons. New York: Oxford.
  • Pearson, Yvette E. (2007), “Storks, Cabbage Patches, and the Right to Procreate.” Journal of Bioethical Inquiry 4: 105-115.
  • Pence, Gregory (1998), Who’s Afraid of Human Cloning? Lanham: Rowman and Littlefield.
  • Pence, Gregory (2008), Classic Cases in Medical Ethics: Accounts of Cases That Have Shaped and Define Medical Ethics. New York: McGraw Hill.
  • Peters, Ted (2001), “The Stem Cell Controversy” in The Stem Cell Controversy. Michael Ruse and Christopher Pynes (eds.) Amherst: Prometheus Books, pp. 231-237.
  • Pope John Paul II (1995), Encyclical Evangelium Vitae. New York: Random House.
  • Pope Paul VI (1968), Encyclical Humanae Vitae, AAS 60, No.4, 1 – 11, Q. eciv.
  • President’s Council on Bioethics (2002), Human Cloning and Human Dignity. New York: Public Affairs Reports.
  • Purdy, Laura (2005), “Like Motherless Children: Fetal Eggs and Families.” Journal of Clinical Ethics 16.4: 329-334.
  • Quintavalle, Josephine. December 11, 2001. Quoted in: BBC News. “Doctor Plans UK ‘‘Designer Baby’’ clinic.  <http://news.bbc.co.uk/1/hi/health/1702854.stm>
  • Ramsey, Paul (1970), Fabricated Man: The Ethics of Genetic Control. New Haven: Yale University Press.
  • Resnik, David B and Daniel B Vorhaus (2006), “Genetic modification and genetic determinism.” Philosophy, Ethics, and Humanities in Medicine, 1:9.
  • Rhodes, Rosamond (1995), “Clones, Harms, and Rights.” Cambridge Quarterly of Healthcare Ethics, 4.3: 285-290.
  • Ridley, Matt (2003). Nature via Nurture: Genes, Experience, and What Makes Us Human. London: Fourth Estate.
  • Rifkin, Jeremy. February 3, 2000. “Cloning: What Hath Genomics Wrought”? Los Angeles Times.
  • Rifkin, Jeremy. July-August 2002. “Why I Oppose Human Cloning.” Tikkun, 17.4: 23- 32.
  • Robertson, John (1998), “Human Cloning and the Challenge of Regulation.” New England Journal of Medicine 339: 119-122.
  • Robertson, John (2006), “Reproductive Liberty and Cloning Humans.” Annals of the New York Academy of Sciences 913: 198-208.
  • Robertson John (1994), “The Question of Human Cloning.”  Hastings Center Report 24.2: 6-14.
  • Ruse, Michael and Aryne Sheppard (eds.) (2001). Cloning: Responsible Science or Technomadness? Amherst: Prometheus Books.
  • Shannon, Thomas and James Walter (2003), The New Genetic Medicine. Lanham: Rowman and Littlefield Publishers, Inc.
  • Shaprio, James et al. . (2000), “Islet Transplantation in Seven Patients with Type I Diabetes Mellitus Using a Glucocorticoid-Free Immunosuppressive Regimen.” New England Journal of Medicine, 343.4: 230-238.
  • Silver, Lee (1998), “Cloning, Ethics, and Religion” Cambridge Quarterly of Healthcare Ethics, 7.2: 168-172.
  • Steinbock, Bonnie and Ron McClamrock, (1994) “When Is Birth Unfair to the Child?”, Hastings Center Report 24.6: 15-21.
  • Strauer, B.E. (2009), “Therapeutic Potentials of Stem Cells in Cardiac Diseases.” Minerva Cardioangiol 57. 2: 249–267.
  • Strong, Carson (1998), “Cloning and Infertility.” Cambridge Quarterly of Healthcare Ethics, 7.2: 279-293.
  • Tierney, John. “Are Scientists Playing God?: It Depends on Your Religion.” The New York Times, November 20, 2007.
  • Tooley, Michael (1998), “The Moral Status of Cloning Humans” in Human Cloning: Biomedical Ethical Review. James Humber and  Robert Almeder (eds). Totawa: Humana Press, pp. 67-101.
  • Travis, John. “Dolly Was Lucky: Scientists Warn that Cloning is Too Dangerous for People.” Science News, October 20, 2001.
  • Voltarelli, Julio C. (2007), “Autologous Nonmyeloablative Hematopoietic Stem Cell Transplantation in Newly Diagnosed Type 1 Diabetes Mellitus.” Journal of the American Medical Association, 297.14: 1568-1576.
  • Wachbroit, Robert (1997), “Genetic Encores: The Ethics of Human Cloning.” Institute for Philosophy and Public Policy, 17.4: 1-7.
  • Wagner, John et al. (1999), “Induction of a Midbrain Dopaminergic Phenotype in Nurr1-Overexpressing Neural Stem Cells by Type 1 Astrocytes.” Nature Biotechnology17: 653-659.
  • Wakayama, Teruhiko et al. (1999), “Mice Cloned from Embryonic Stem Cells.” Proceedings of the National Academy of Sciences of the United States of America 96.26: 14984–14989.
  • Weinberg, Rivka (2008), “Identifying and Dissolving the Non-Identity Problem.” Philosophical Studies, 137.1: 3-18.
  • Williams, Nigel (2003). “Death of Dolly Marks Cloning Milestone.” Current Biology, 13.6: R209-R210.
  • Wilmut, Ian et  al. (1997), “Viable Offspring Derived from Fetal and Adult Mammalian Cells.” Nature 385.6619: 810-813.
  • Wilmut, Ian et al. (2000), The Second Creation: Dolly and the Age of Biological Control. London: Headline Book Publishing
  • Zulewski, Henry et al. (2001), “Multipotential nestin-positive stem cells isolated from adult pancreatic islets differentiate ex vivo into pancreatic endocrine, exocrine, and hepatic phenotypes,” Diabetes 50: 521–533.

Author Information

Bertha Alvarez Manninen
Email: bertha.manninen@asu.edu
Arizona State University at the West Campus
U. S. A.

Autonomy

Autonomy is an individual’s capacity for self-determination or self-governance. Beyond that, it is a much-contested concept that comes up in a number of different arenas. For example, there is the folk concept of autonomy, which usually operates as an inchoate desire for freedom in some area of one’s life, and which may or may not be connected with the agent’s idea of the moral good. This folk concept of autonomy blurs the distinctions that philosophers draw among personal autonomy, moral autonomy, and political autonomy. Moral autonomy, usually traced back to Kant, is the capacity to deliberate and to give oneself the moral law, rather than merely heeding the injunctions of others. Personal autonomy is the capacity to decide for oneself and pursue a course of action in one’s life, often regardless of any particular moral content. Political autonomy is the property of having one’s decisions respected, honored, and heeded within a political context.

Another distinction that can be made is between autonomy as a bare capacity to make decisions and of autonomy as an ideal. When autonomy functions as an ideal, agents who do not meet certain criteria in having reached a decision are deemed non-autonomous with respect to that decision. This can function both locally, in terms of particular actions, and globally, in terms of agents as a whole. For instance, children, agents with cognitive disabilities of a certain kind, or members of oppressed groups have been deemed non-autonomous because of their inability to fulfill certain criteria of autonomous agency, due to individual or social constraints.

There is debate over whether autonomy needs to be representative of a kind of “authentic” or “true” self. This debate is often connected to whether the autonomy theorist believes that an “authentic” or “true” self exists. In fact, conceptions of autonomy are often connected to conceptions of the nature of the self and its constitution. Theorists who hold a socially constituted view of the self will have a different idea of autonomy (sometimes even denying its existence altogether) than theorists who think that there can be some sort of core “true” self, or that selves as agents can be considered in abstraction from relational and social commitments and contexts.

Finally, autonomy has been criticized as being a bad ideal, for promoting a pernicious model of human individuality that overlooks the importance of social relationships and dependency. Responses to these criticisms have come in various forms, but for the most part philosophers of autonomy have striven to express the compatibility of the social aspects of human action within their conceptions of self-determination, arguing that there need not necessarily be an antagonism between social and relational ties, and our ability to decide our own course of action.

This article will focus primarily on autonomy at the level of the individual and the work being done on personal autonomy, but will also address the connection of autonomy to issues in bioethics and political theory.

Table of Contents

  1. The History of Autonomy
    1. Before Kant
    2. Kant
    3. The Development of Individualism in Autonomy
    4. Autonomy and Psychological Development
  2. Personal Autonomy
    1. Content-Neutral or Procedural Accounts
      1. Hierarchical Procedural Accounts
      2. Criticisms of Hierarchical Accounts
      3. Coherentist Accounts
    2. Substantive Accounts
  3. Feminist Philosophy of Autonomy
    1. Feminist Criticisms of Autonomy
    2. Relational Autonomy
  4. Autonomy in Social and Political Context
    1. Autonomy and Political Theory
    2. Autonomy and Bioethics
  5. References and Further Reading

1. The History of Autonomy

a. Before Kant

The roots of autonomy as self-determination can be found in ancient Greek philosophy, in the idea of self-mastery. For both Plato and Aristotle, the most essentially human part of the soul is the rational part, illustrated by Plato’s representation of this part as a human, rather than a lion or many-headed beast, in his description of the tripartite soul in the Republic. A just soul, for Plato, is one in which this rational human part governs over the two others. Aristotle identifies the rational part of the soul as most truly a person’s own in the Nicomachean Ethics (1166a17-19).

Plato and Aristotle also both associate the ideal for humanity with self-sufficiency and a lack of dependency on others. For Aristotle, self-sufficiency, or autarkeia, is an essential ingredient of happiness, and involves a lack of dependence upon external conditions for happiness. The best human will be one who is ruled by reason, and is not dependent upon others for his or her happiness.

This ideal continues through Stoic philosophy and can be seen in the early modern philosophy of Spinoza. The concept of autonomy itself continued to develop in the modern period with the decrease of religious authority and the increase of political liberty and emphasis on individual reason. Rousseau’s idea of moral liberty, as mastery over oneself, is connected with civil liberty and the ability to participate in legislation.

b. Kant

Kant further developed the idea of moral autonomy as having authority over one’s actions. Rather than letting the principles by which we make decisions be determined by our political leaders, pastors, or society, Kant called upon the will to determine its guiding principles for itself, thus connecting the idea of self-government to morality; instead of being obedient to an externally imposed law or religious precept, one should be obedient to one’s own self-imposed law.  The former he called heteronomy; the latter autonomy. In his “What is Enlightenment” essay, he described enlightenment as “the human being’s emergence from his self-incurred minority” and called on his readers to have the courage to use their own understanding “without direction from another” (Kant 1996, 17). This description is close to what we might acknowledge today as personal autonomy, but Kant’s account is firmly located within his moral philosophy.

In acting we are guided by maxims, which are the subjective principles by which we might personally choose to abide. If these maxims can be deemed universal, such that they would be assented to and willed by any rational being, and thus not rooted in any individual’s particular contingent experience, then they may gain the status of objective laws of morality. Each moral agent, then, is to be seen as a lawgiver in a community where others are also lawgivers in their own right, and hence are to be respected as ends in themselves; Kant calls this community the kingdom of ends.

While the will is supposed to be autonomous, for Kant, it is also not supposed to be arbitrary or particularistic in its determinations. He sees our inclinations and emotional responses as external to the process of the will’s self-legislation; consequently, letting them determine our actions is heteronomous rather than autonomous. Feelings, emotions, habits, and other non-intellectual factors are excluded from autonomous decision-making. Any circumstances that particularize us are also excluded from autonomous decision-making.

The reason for Kant’s exclusion of feelings, inclinations, and other particular aspects of our lives from the structure of autonomy is rooted in his metaphysical account of the human being, which radically separates the phenomenal human self from the noumenal human self. All empirical aspects of our selfhood — all aspects of our experience — are part of the phenomenal self, and subject to the deterministic laws of natural causality. Our freedom, on the other hand, cannot be perceived or understood; rather we must posit the freedom of the will as the basis for our ability to act morally.

Contemporary Kantians within moral theory do not adhere to Kant’s metaphysics, but seek to understand how something like Kant’s conception of autonomy can still stand today. Thomas Hill suggests, for example, that the separation of our free will from our empirical selfhood be taken less as a metaphysical idea but as a normative claim about what ought to count as reasons for acting (Hill 1989, 96-97)

There are significant differences between Kant’s conception of moral autonomy and the conceptions of personal autonomy developed within the last thirty years, which attempt to articulate how social and cultural influences can be compatible with autonomous decision-making. Further, the majority of contemporary theories of personal autonomy are content-neutral accounts of autonomy which are unconcerned with whether or not a person is acting according to moral laws; they focus more on determining whether or not a person is acting for his or her own reasons than on putting any restrictions on autonomous action.

c. The Development of Individualism in Autonomy

Between Kant’s description of moral autonomy and the recent scholarship on personal autonomy, however, there was a process of individualizing the idea of autonomy. The Romantics, reacting against the emphasis on the universality of reason put forth by the Enlightenment, of which Kant’s philosophy was a part, prized particularity and individuality. They highlighted the role of the passions and emotions over reason, and the importance of developing one’s own unique self. John Stuart Mill also praised and defended the development and cultivation of individuality as worthwhile in itself, writing that “A person whose desires and impulses are his own – are the expression of his own nature, as it has been developed and modified by his own culture – is said to have a character. One whose desires and impulses are not his own has no character, no more than a steam engine has a character” (Mill 1956, 73).

The Romantic conception of individuality was then echoed within the conception of authenticity that runs through phenomenological and existential philosophy. Heidegger posits an inner call of conscience summoning us away from ‘das Man’: in order to be authentic, we need to heed this inner call and break away from inauthentically following the crowd. This conception of authenticity became intertwined with the idea of autonomy: both involve a call to think for oneself and contain a streak of individualism (see Hinchman 1996).

Unlike the universalism espoused by Kantian autonomy, however, authenticity, like the Romantic view, involves a call to be one’s own person, not merely to think for oneself. For Kant, thinking for oneself would, if undertaken properly, lead to universalizing one’s maxims; for both the Romantics and the Existentialists, as well as for Mill, there is no such expectation. This division is still present in the contrast between conceiving of autonomy as a key feature of moral motivation, and autonomy as self-expression and development of individual practical identity.

The emphasis on autonomy within this strain of philosophy was criticized by Emmanuel Lévinas, who sees autonomy as part of our selfish and close-minded desire to strive toward our own fulfillment and self-gratification rather than being open to the disruptive call of the other’s needs (Lévinas 1969). He argues for the value of heteronomy over autonomy. For Lévinas, in heteronomy, the transcendent face of the other calls the ego into question, and the self realizes its unchosen responsibility and obligation to the other. The self is hence not self-legislating, but is determined by the call of the other. This criticism of the basic structure of autonomy has been taken up within continental ethics, which attempts to determine how or whether a practical, normative ethics could be developed within this framework (see for example Critchley 2007).

d. Autonomy and Psychological Development

The connection between autonomy and the ideal of developing one’s own individual self was adopted within the humanistic psychologies of Abraham Maslow and Carl Rogers, who saw the goal of human development as “self-actualization” and “becoming a person,” respectively. For Maslow and Rogers, the most developed person is the most autonomous, and autonomy is explicitly associated with not being dependent on others.

More recently Lawrence Kohlberg developed an account of moral psychological development, in which more developed agents display a greater amount of moral autonomy and independence in their judgments. The highest level bears a great resemblance to the Kantian moral ideal, in its reference to adopting universal values and standards as one’s own.

Kohlberg’s work was criticized by Carol Gilligan, who argued that this pattern reflected male development, but not female. Instead of taking “steps toward autonomy and independence,” in which “separation itself becomes the model and the measure of growth,” “for women, identity has as much to do with intimacy as with separation” (Gilligan 1982, 98). The trajectory is thus less about individualization and independence than toward ultimately balancing and harmonizing an agent’s interests with those of others.

Gilligan does not entirely repudiate autonomy itself as a value, but she also does not suggest how it can be distinguished from the ideals of independence and separation from others. Her critiques have been widely influential and have played a major role in provoking work on feminist ethics and, despite her criticism of the ideal of autonomy, conceptions of “relational autonomy.”

The contemporary literature on personal autonomy within philosophy tends to avoid these psychological ideas of individual development and self-actualization. For the most part, it adopts a content-neutral approach that rejects any particular developmental criteria for autonomous action, and is more concerned with articulating the structure by which particular actions can be deemed autonomous (or, conversely, the structure by which an agent can be deemed autonomous with respect to particular actions).

2. Personal Autonomy

The contemporary discussion of personal autonomy can primarily be distinguished from Kantian moral autonomy through its commitment to metaphysical neutrality. Related to this is the adherence to at least a procedural individualism: within contemporary personal autonomy accounts, an action is not judged to be autonomous because of its rootedness in universal principles, but based on features of the action and decision-making process purely internal and particular to the individual agent.

The main distinction within personal autonomy is that between content-neutral accounts, which do not specify any particular values or principles that must be endorsed by the autonomous agent, and substantive accounts which specify some particular value or values that must be included within autonomous decision-making.

a. Content-Neutral or Procedural Accounts

Content-neutral accounts, also called procedural, are those which deem a particular action autonomous if it has been endorsed by a process of critical reflection. These represent the majority of accounts of personal autonomy. Procedural accounts determine criteria by which an agent’s actions can be said to be autonomous, that do not depend on any particular conception of what kinds of actions are autonomous or what kinds of agents are autonomous. They are neutral with respect to what an agent might conceive of as good or might be trying to achieve.

i. Hierarchical Procedural Accounts

The beginning of the contemporary discussion of personal autonomy is in the 1970s works of Harry Frankfurt and Gerald Dworkin. Their concern was to give an account of what kind of individual freedom ought to be protected, and how that moral freedom may be described in the context of contemporary conceptions of free will. Their insight was that our decisions are worth protecting if they are somehow rooted in our values and overall commitments and objectives, and that they are not worth protecting if they run counter to those values, commitments, and objectives. The concept of personal autonomy, thus, can be used as a way of protecting certain decisions from paternalistic interference. We may not necessarily want to honor the decision of a weak-willed person who decides to do something against their better judgment and against their conscious desire to do otherwise, whereas we do want to protect a person’s decision to pursue an action that accords with their self-consciously held values, even if it is not what we ourselves would have done. Frankfurt and Dworkin phrase this insight in terms of a hierarchy of desires.

Frankfurt’s and Dworkin’s hierarchical accounts of autonomy form the basis upon which the mainstream discussion builds and reacts against. Roughly speaking, according to this hierarchical model, an agent is autonomous with respect to an action on the condition that his or her first-order desire to commit the act is sanctioned by a second-order volition endorsing the first-order desire (see Frankfurt 1988, 12-25). This account is neutral with respect to what the origins of the higher-order desires may be, and thus does not exclude values and desires that are socially or relationally constituted. The cause of such desires does not matter, solely the agent’s identification with them (Frankfurt 1988, 53-54). Autonomy includes our ability to consider and ask whether we do, in fact, identify with our desires or whether we might wish to override them (Dworkin 1988).  The “we”, in this case, is constituted by our higher-order preferences; Dworkin speaks of them as the agent’s “true self” (Dworkin 1989, 59).

ii. Criticisms of Hierarchical Accounts

There are several different objections to the hierarchical model, mostly revolving around the problem in locating the source of an agent’s autonomy, and questioning the idea that autonomy can be located somehow in the process of reflective endorsement itself.

First, the Problem of Manipulation criticism points out that because Frankfurt’s account is ahistorical, it does not protect against the possibility that someone, such as a hypnotist, may have interfered with the agent’s second-order desires. We would hesitate to call such a hypnotized or mind-controlled agent autonomous with respect to his or her actions under these circumstances, but since the hierarchical model does not specify where or how the second order volitions ought to be generated, it cannot adequately distinguish between an autonomous agent and a mind-controlled one. The structure of autonomous agency therefore seems to have a historical dimension to it, since the history of how we developed or generated our volitions seems to matter (see Mele 2001, 144-173).

John Christman develops a historical model of autonomy in order to rectify this problem, such that the means and historical process by which an agent reaches certain decisions is used in determining his or her status as autonomous or not (Christman 1991). This way, an agent brainwashed into having desire X would be deemed nonautonomous with respect to X.  The theory runs into difficulty in a case where an agent might freely choose to give up his or her autonomy, or conversely where an agent might endorse a desire but not endorse the means by which he or she was forced into developing the desire (see Taylor 2005, 10-12), but at least it draws attention to some of the temporal features of autonomous agency.

Another criticism of the hierarchical model is the Regress or Incompleteness Problem. According to Frankfurt and Dworkin, an agent is autonomous with respect to his or her first order desires as long as they are endorsed by second-order desires. However, this raises the question of the source of the second-order volitions; if they themselves rely on third-order volitions, and so on, then there is the danger of an infinite regress in determining the source of the autonomous endorsement (see Watson 1975). If the second order desires are autonomous for some other reason than a higher-order volition, then the hierarchical model is incomplete in its explanation of autonomy. Frankfurt, while acknowledging that there is “no theoretical limit” to the series of higher order desires, holds that the series can end with an agent’s “decisive commitment” to one of the first order desires (Frankfurt 1988, 21). However, the choice of terminating the series is itself arbitrary if there no reason behind it (Watson 1975).

Frankfurt responds to this criticism in “Identification and Wholeheartedness” by defining a decisive commitment as one which the agent makes without reservation, and where the agent feels no reason to continue deliberating (Frankfurt 1988, 168-9). To stop at this point is, Frankfurt argues, hardly arbitrary. It is possible that the agent is mistaken in his or her judgment, but that is always a possibility in deliberation, and thus not an obstacle to Frankfurt’s theory in particular. In making a decision, an agent “also seeks thereby to overcome or to supersede a condition of inner division and to make himself into an integrated whole” (Frankfurt 1988, 174). Thus, by making this decision, the agent has endorsed an intention that establishes “a constraint by which other preferences and decisions are to be guided” (Frankfurt 1988, 175), and thus is self-determining and autonomous.

The criterion of wholeheartedness and unified agency has been criticized by Diana Meyers, who argues for a decentered, fivefold notion of the subject, which includes the unitary, decision-making self, but also acknowledges the functions of the self as divided, as relational, as social, as embodied, and as unconscious (Meyers 2005). The ideal of wholeheartedness has also been criticized on the grounds that it does not reflect the agency of agents from oppressed groups or from mixed traditions. Edwina Barvosa-Carter sees ambivalence as an inescapable feature of much decision-making, especially for mixed-race individuals who have inherited conflicting values, commitments, and traditions (Barvosa-Carter 2007). Marina Oshana makes a similar point, with reference to living within a racist society (Oshana 2005).

In any case, it is a puzzle how decisive commitments or higher-order desires acquire their authority without themselves being endorsed, since deriving authority from external manipulation would seem to undermine this authority. This is the Ab Initio Problem: If the source of an agent’s autonomy is ultimately something that can’t itself be reflectively endorsed, then the agent’s autonomy seems to originate with something with respect to which he or she is non-autonomous, something that falls outside the hierarchical model.

A related objection to the Regress Problem is that this hierarchical account seems to give an unjustified ontological priority to higher versions of the self (see Thalberg 1978). Marilyn Friedman has argued that it begs the question to assume some sort of uncaused “true self” at the top of the hierarchical pyramid. In order to give a procedural account that would avoid these objections, Friedman has proposed an integration model in which desires of different orders ought to be integrated together, rather than being constructed in a pyramid (Friedman 1986).

iii. Coherentist Accounts

Part of the appeal of understanding autonomy is not simply in explaining how we make decisions, but because the idea of autonomy suggests something about how we identify ourselves, what we identify with. For Frankfurt, we identify with a lower level desire if we have a second order volition endorsing it. However, our second order volitions don’t necessarily represent us — we may have no reason for them, which Frankfurt acknowledges.

This concern drives some of the other approaches to personal autonomy, such as Laura Ekstrom’s coherentist account (Ekstrom 1993). Since autonomy is self-governance, it stands to reason that in order to understand autonomous agency, we must clarify our notion of the self and hence what counts as the self’s own reasons for acting; she argues that this will help avoid the Regress Problem and the Ab Initio Problem.

Ekstrom’s account of self is based on the endorsement of preferences. An agent has a preference if he or she holds a certain first level desire to be good; it is similar to a second order volition for Frankfurt. It presupposes higher level states since they are the result of an agent’s higher order reflection about the agent’s desires with regard to goodness. A self, then, is a particular character with certain beliefs and preferences which have been endorsed in a process of self-reflection, and the ability to reshape those beliefs and preferences in light of self-evaluation. The true self includes those beliefs and preferences which cohere together; that coherence itself gives them authorization. A preference is thus endorsed if it coheres with the agent’s character.

Michael Bratman develops a similar account, arguing that our personal identity is partly constituted by the organizing and coordinating function of our long-range plans and intentions (Bratman 2007, 5). Our decisions are autonomous or self-governing with respect to these plans.

This is, of course, only a very brief account of some of the literature on proceduralist accounts of autonomy, and it omits the various defenses of the hierarchical model and the objections to Friedman’s, Christman’s, and others’ formulations. But it should be enough to make clear the way in which theorists offering these accounts strive to ensure that no particular view of what constitutes a flourishing human life is imported into their accounts of autonomy. Autonomy is just one valued human property amongst others, and need not do all the work of describing human flourishing (Friedman 2003).

b. Substantive Accounts

Some doubt, however, that proceduralist accounts are adequate to capture autonomous motivation and action, or to rule out actions that or agents who we would hesitate to call autonomous. Substantive accounts of autonomy, of which there are both weak and strong varieties, set more requirements for autonomous actions to count as autonomous. Whether weak or strong, all substantive accounts posit some particular constraints on what can be considered autonomous; one example might be an account of autonomy that specifies that we might not autonomously be able to choose to be enslaved. Susan Wolf offers a strong substantive account, in which agents must have “normative competency,” in other words, the capacity to identify right and wrong (Wolf 1990). We do not need to be metaphysically responsible for ourselves or absolutely self-originating, but as agents we are morally responsible, and capable of revising ourselves according to our moral reasoning (Wolf 1987). Similarly, Paul Benson’s early accounts of autonomy also advocated a strong substantive account, stressing normative competence, and also the threat of oppressive or inappropriate socialization to our normative competence and thus to our autonomy (Benson 1991).

Contemporary Kantians such as Thomas Hill and Christine Korsgaard also advocate substantive accounts of autonomy. Korsgaard argues that we have practical identities which guide us and serve as the source of our normative commitments (Korsgaard 1996). We have multiple such identities, not all of which are moral, but our most general practical identity is as a member of the “kingdom of ends,” our identity as moral agents. This identity generates universal duties and obligations. Just as Kant called autonomy our capacity for self-legislation, so too Korsgaard calls autonomy our capacity to give ourselves obligations to act based on our practical identities. Since one of these is a universal moral identity, autonomy itself thus has substantive content.

Autonomy, for Hill, means that principles will not simply be accepted because of tradition or authority, but can be challenged through reason. He acknowledges that in our society we do not experience the kind of consensus about values and principles that Kant supposed ideally rational legislators might possess, but argues that it is still possible to bear in mind the perspective of a possible kingdom of ends. Human dignity, the idea of humanity as an end in itself, can represent a shared end regardless of background or tradition (Hill 2000, 43-45).

Substantive accounts have been criticized for conflating personal and moral autonomy and for setting too high a bar for autonomous action. If too much is expected of autonomous agents’ self-awareness and moral reflection, then can any of us be truly said to be autonomous (see for example Christman 2004 and Narayan 2002)? Does arguing that agents living under conditions of oppressive socialization have reduced autonomy help set a standard for promotion of justice, or does it overemphasize their diminished capacity without encouraging and promoting the capacities that they do have? This interplay between our socialization and our capacity for autonomy is highlighted in the relational autonomy literature, covered below.

In order to come to some middle ground between substantive and procedural accounts, Paul Benson has also suggested a weak substantive account, which does not specify any content, but sets the requirement that the agent must regard himself or herself as worthy to act; in other words, that the agent must have self-trust, self-respect (Benson 1991). This condition serves to limit what behavior can be deemed autonomous and to bring it in line with our intuition that a mind-controlled or utterly submissive agent is not acting autonomously, while not ruling out the agent’s ability to decide what values he or she wants to live by.

3. Feminist Philosophy of Autonomy

a. Feminist Criticisms of Autonomy

Feminist philosophers have been critical of concepts and values traditionally seen to be gender neutral, finding that when examined they reveal themselves to be masculine (see Jaggar 1983, Benjamin 1988, Grimshaw 1986, Harding and Hintikka 2003, and Lloyd 1986). Autonomy has long been coded masculine and associated with masculine ideals, despite being something which women have called for in their own right. Jessica Benjamin argues that while we are formally committed to equality, “gender polarity underlies such familiar dualisms as autonomy and dependency” (Benjamin 1988, 7). There has been some debate over whether autonomy is actually a useful value for women, or whether it has been tarnished by association.

Gilligan’s criticisms of autonomy have already been covered, but Benjamin writes along similar lines that:

The ideal of the autonomous individual could only be created by abstracting from the relationship of dependency between men and women. The relationships which people require to nurture them are considered private, and not truly relationships with outside others. Thus the other is reduced to an appendage of the subject – the mere condition of his being – not a being in her own right. The individual who cannot recognize the other or his own dependency without suffering a threat to his identity requires the formal, impersonal principle of rationalized interaction, and is required by them. (Benjamin 1988, 197)

Benjamin ultimately argues that the entire structure of recognition between men and women must be altered in order to permit an end to domination. Neither Gilligan nor Benjamin addresses the possibility of reformulating the notion of autonomy itself, but each sees it as essentially linked with individualism and separation. Sarah Hoagland is more emphatic: she openly rejects autonomy as a value, referring to it as “a thoroughly noxious concept” as it “encourages us to believe that connecting and engaging with others limits us” (Hoagland 1988, 144).

These criticisms have been countered, however, by feminists looking to retain the value of autonomy, who argue that the critics conflate the ideal of “autonomy” with that of “substantive independence.” Autonomy, while it has often been associated with individualism and independence, does not necessarily entail these. Most feminist criticism of autonomy is based on the idea that autonomy implies a particular model or expectation of the self. Marilyn Friedman and John Christman, however, point out that the proceduralist notion of autonomy which is the focus of contemporary philosophical attention does not have such an implication, but is metaphysically neutral and value neutral (Friedman 2000, 37-46; Christman 1995).

b. Relational Autonomy

A feminist attempt to rehabilitate autonomy as a value, and to further underscore the contingency of its relationship to atomistic individualism or independence, emerges in the growing research on “relational autonomy” (Nedelsky 1989, Mackenzie and Stoljar 2000). It addresses the challenge of balancing agency with social embeddedness, without promoting an excessively individualistic liberal atomism, or denying women the agency required to criticize or change their situation. The feminist work on relational autonomy attempts to capture the best of the available positions.

It is worth noting first, for clarity, that there are two levels of relationality at work within relational autonomy: social and relational sources of values, goals, and commitments, and social and relational commitments themselves. While all acknowledge that relationality at both levels is not incompatible with autonomy, not all accounts of relational autonomy require that we pursue social and relational commitments. For instance, on Marilyn Friedman’s account, a person could autonomously choose to be a hermit, despite having been brought up in a family and in a society and having been shaped by that upbringing (Friedman 2003, 94). However other relational autonomy theorists are skeptical about neatly separating the two, because they note that even our unchosen relationships still affect our self-identity and opportunities. They argue that while we need not pursue relationships, we cannot opt out entirely. Anne Donchin demonstrates this with regard to testing for genetically inherited disease (Donchin 2000).

In general, on relational autonomy accounts, autonomy is seen as an ideal by which we can measure how well an agent is able to negotiate his or her pursuit of goals and commitments, some of which may be self-chosen, and some the result of social and relational influences. Social and relational ties are examined in terms of their effect on an agent’s competency in this negotiation: some give strength, others create obstacles, and others are ambiguous. The primary focus of most relational autonomy accounts, however, tends to be less on procedure and more on changing the model of the autonomous self from an individualistic one to one embedded in a social context.

4. Autonomy in Social and Political Context

The value of autonomy can be seen in its social and political context. The idea that our decisions, if made autonomously, are to be respected and cannot be shrugged off, is a valuable one. It concerns the legitimacy of our personal decisions in a social, political, and legislative context.

a. Autonomy and Political Theory

The importance and nature of the value of autonomy is debated within political theory, but is generally intertwined with the right to pursue one’s interests without undue restriction. Discussions about the value of autonomy concern the extent of this right, and how it can be seen as compatible with social needs.

Kant described the protection of autonomy at the political level as encapsulated in the principle of right: that each person had the right to any action that can coexist with the freedom of every other person in accordance with universal law (Kant 1996, 387). Mill’s On Liberty similarly defends the rights of individuals to pursue their own personal goals, and emphasizes the need for being one’s own person (Mill 1956). On his view, this right prohibits paternalism, or restrictions or interference with a person of mature age for his or her own benefit. As Mill writes, “The only part of the conduct of anyone for which he is amenable to society is that which concerns others. In the part which merely concerns himself, his independence is, of right, absolute. Over himself, over his own body and mind, the individual is sovereign” (Mill 1956, 13).

Non-interference is generally seen as key to political autonomy; Gerald Gaus specifies that “the fundamental liberal principle” is “that all interferences with action stand in need of justification” (Gaus 2005, 272). If any paternalistic interference is to be permitted, it is generally restricted to cases where the agent is not deemed to be autonomous with respect to a decision (see for example Dworkin 1972); autonomy serves as a bar to be reached in order for an agent’s decisions to be protected (Christman 2004). The question is then how high the bar ought to be set, and thus what individual actions count as autonomous for the purposes of establishing social policy. Because of this, there is a strong connection between personal and political autonomy.

Further, there is also a connection between political liberalism and content-neutral accounts of autonomy which do not require any predetermined values for the agent to be recognized as autonomous. As Christman and Anderson point out, content-neutral accounts of autonomy accord with liberalism’s model of accommodating pluralism in ways of life, values, and traditions (Christman and Anderson 2005).

The framework of seeing the value of political autonomy in terms of protecting individual choices and decisions, however, has been criticized by those who argue that it rests on an inadequate model of the self.

Communitarians such as Michael Sandel criticize the model of the autonomous self implicit in liberal political theory, arguing that it does not provide an adequate notion of the human person as embedded within and shaped by societal values and commitments. Procedural accounts of autonomous decision-making do not adequately recognize the way our relational commitments shape us. We do not choose our values and commitments from the position of already being autonomous individuals; in other words, the autonomous self does not exist prior to the values and commitments that constitute the basis for its decisions. To deliberate in the abstract from these values and commitments is to leave out the self’s very identity, and that which gives meaning to the deliberation (Sandel 1998).

Feminist scholars have agreed with some of the communitarian criticism, but also caution that the values and commitments that communitarians appeal to may not be ones that are in line with feminist goals, in particular those values that concern the role and makeup of the family (Okin 1989 and Weiss 1995).

Another criticism of the dominant model of autonomy within political theory is made by Martha Fineman, who argues for the need to rethink the conceptions of autonomy that undergird legal and governmental policies in order to better recognize our interdependence and the dependence of all of us upon society (Fineman 2004, 28-30). While not drawing on the philosophical literature on personal autonomy or relational autonomy, but rather drawing upon sociological theories and accounts of legal and government policy, she traces the historical and cultural associations of autonomy with individuality and masculinity, and argues the need to see that real human flourishing includes dependency.

Recognizing the different levels of autonomy at play within the political sphere as a whole can help to clarify what is at stake, and to avoid one-sided accounts of autonomy or the autonomous self. Rainer Forst outlines five different conceptions of autonomy that can combine into a multidimensional account (Forst 2005). The first is moral autonomy, in which an agent can be considered autonomous as long as he or she “acts on the basis of reasons that take every other power equally into account” and which are “justifiable on the basis of reciprocally and generally binding norms” (Forst 2005, 230). Even though this is an interpersonal norm, it is relevant to the political, argues Forst, because it promotes the mutual respect needed for political liberty. Ethical autonomy concerns a person’s desires in the quest for the good life, in the context of the person’s values, commitments, relationships, and communities. Legal autonomy is thus the right not to be forced into a particular set of values and commitments, and is neutral toward them. Political autonomy concerns the right to participate in collective self-rule, exercised with the other members of the relevant community. Finally, social autonomy concerns whether an agent has the means to be an equal member of this community. Attending to social autonomy helps to demonstrate the responsibility of members of the community to consider each other’s needs, and to evaluate political and social structures in terms of whether they serve to promote the social autonomy of all of the members. Forst argues that ultimately “citizens are politically free to the extent to which they, as freedom-grantors and freedom-users, are morally, ethically, legally, politically, and socially autonomous members of a political community … Rights and liberties therefore have to be justified not only with respect to one conception of autonomy but with a complex understanding of what it means to be an autonomous person” (Forst 2005, 238).

Whether or not one agrees with this particular way of dividing the conceptions of autonomy, or with the particular explanation of the details of any of the conceptions, Forst’s account highlights the way that understanding the contribution of autonomy to political theory involves a multifaceted approach. It is of limited use to say that citizens are autonomous because they have the right to vote, if their material needs are not met, or if they are not free in their choice of values or ethical commitments.  Taking ethical autonomy into consideration can help to meet some of the concerns raised above by communitarian and feminist critics of autonomy; meanwhile, taking legal autonomy into account alongside ethical autonomy can help to provide the bulwark of protection against oppressive traditions that feminists are concerned about.

This can also be related to the work done by Martha Nussbaum and Amartya Sen on the capabilities approach to human rights, in which societies are called upon to ensure that all human beings have the opportunity to develop certain capabilities; agents then have a choice whether or not to develop them (see for example Sen 1999 and Nussbaum 2006).  The kind of political autonomy granted to subjects, then, depends on their ability to cultivate these various capabilities within a given society.

b. Autonomy and Bioethics

In applied ethics, such as bioethics, autonomy is a key value. It is appealed to by both sides of a number of debates, such as the right to free speech in hate speech versus the right to be free from hate speech (Mackenzie and Stoljar 2000, 4). There is a lack of consensus, however, on how autonomy ought to be used: how much rationality it requires, whether it merely involves the negative right against interference or whether it involves positive duties of moral reflection and self-legislation.

Autonomy has long been an important principle within biomedical ethics. For example, in the Belmont report, published in 1979 in the United States, which articulates guidelines for experimentation on human subjects, the protection of subjects’ autonomy is enshrined in the principle of “respect for persons.” One of the three key principles of the Report, it states that participants in trials ought to be treated as autonomous, and those with diminished autonomy (due to cognitive or other disabilities or illnesses) are entitled to protection. The way this principle is to be applied takes shape in the form of informed consent, as the Report presumes that this is the best way to protect autonomy.

One of the standard textbooks in biomedical ethics, Principles of Biomedical Ethics by Tom L. Beauchamp and James F. Childress, defends four principles for ethical decision-making, of which “respect for autonomy” is the first, even though it is not intended to override other moral considerations.  The principle can be seen as both a negative and a positive obligation. The negative obligation for health care professionals is that patients’ autonomous decisions should not be constrained by others. The positive obligation calls for “respectful treatment in disclosing information and fostering autonomous decision-making” (Beauchamp and Childress 2001, 64).

Beauchamp and Childress accept that a patient can autonomously choose to be guided by religious, traditional, or community norms and values. While they acknowledge that it can be difficult to negotiate diverse values and beliefs in sharing information necessary for decision-making, this does not excuse a failure to respect a patient’s autonomous decision: “respect for autonomy is not a mere ideal in health care; it is a professional obligation. Autonomous choice is a right, not a duty of patients” (Beauchamp and Childress 2001, 63).

Autonomy is also important within the disability rights movement. Within the disability rights movement, the slogan, “Nothing about us without us” is a call for autonomy or self-determination (see Charlton 1998). It goes beyond merely rejecting having decisions made for people with disabilities by others, but also speaks to the desire for empowerment and recognition as being agents capable of self-determination.

The relational approach to autonomy has become popular in the spheres of health care ethics and disability theory. The language of relational autonomy has been helpful in reframing the dichotomy between strict independence and dependence and providing a way of framing the relationship between a person with a disability and his or her caretaker or guardian. It has also been argued that a relational approach to patient autonomy provides a better model of the decision-making process.

Criticisms of a rationalistic and individualistic ideal of autonomy and the development of the idea of relational autonomy have been taken up within the mainstream of biomedical ethics. In response to criticism that early editions of their textbook on biomedical ethics had not paid adequate heed to intimate relationships and the social dimensions of patient autonomy, Beauchamp and Childress emphasize that they “aim to construct a conception of respect for autonomy that is not excessively individualistic (neglecting the social nature of individuals and the impact of individual choices and actions on others), not excessively focused on reason (neglecting the emotions), and not unduly legalistic (highlighting legal rights and downplaying social practices)” (Beauchamp and Childress, 2001, 57).

Their account of autonomy, however, has still been criticized by Anne Donchin as being a “weak concept” of relational autonomy (Donchin 2000). While they do not deny that selves are developed within a context of community and human relationships, agents are still assumed to have consciously chosen their beliefs and values and to be capable of detaching themselves from relationships at will (Donchin 2000, 238). A strong concept of relational autonomy, on the other hand, holds that “there is a social component built into the very meaning of autonomy,” and that autonomy “involves a dynamic balance among interdependent people tied to overlapping projects” (Donchin 2000, 239). The autonomous self is one “continually remaking itself in response to relationships that are seldom static,” and which “exists fundamentally in relation to others” (Donchin 2000, 239). Donchin argues that it is the strong concept of relational autonomy that offers the most helpful account of decision-making in health care.

5. References and Further Reading

  • Barvosa-Carter, Edwina. “Mestiza Autonomy as Relational Autonomy: Ambivalence and the Social Character of Free Will,” The Journal of Political Philosophy Vol. 15, no. 1 (2007), 1-21.
  • Beauchamp, Tom L. and James F. Childress. Principles of Biomedical Ethics, 5th ed, Oxford and New York: Oxford University Press, 2001.
  • Benjamin, Benjamin. The Bonds of Love: Psychoanalysis, Feminism, and the Problem of Domination, New York: Pantheon Books, 1988, 183-224.
  • Benson, Paul. “Autonomy and Oppressive Socialization,” Social Theory and Practice 17, no. 3 (1991), 385-408.
  • Bratman, Michael. Structures of Agency, Oxford: Oxford University Press, 2007.
  • Charlton, James I. Nothing About Us Without Us: Disability, Oppression and Empowerment, Berkeley and Los Angeles: University of California Press, 1998.
  • Christman, J., (ed.). The Inner Citadel: Essays on Individual Autonomy, New York and Oxford: Oxford University Press, 1989.
  • Christman, John, and Joel Anderson (ed.) Autonomy and the Challenges to Liberalism, Cambridge: Cambridge University Press, 2005.
  • Christman, John. “Autonomy and Personal History,” Canadian Journal of Philosophy, 21 no. 1(1991), 1-24.
  • Christman, John. “Autonomy, Self-Knowledge, and Liberal Legitimacy,” in Autonomy and the Challenges to Liberalism, ed. John Christman and Joel Anderson, Cambridge: Cambridge University Press, 2005.
  • Christman, John. “Feminism and Autonomy,” “Nagging” Questions: Feminist Ethics in Every Life, ed. Dana E. Bushnell. Lanham, MD: Rowman and Littlefield Publishers, Inc., 1995, 17-39.
  • Christman, John. “Relational Autonomy, Liberal Individualism, and the Social Constitution of Selves,” Philosophical Studies 117, no. 1-2 (2004), 143-164.
  • Critchley, Simon. Infinitely Demanding: Ethics of Commitment, Politics of Resistance, London: Verso, 2007.
  • Donchin, Anne. “Autonomy and Interdependence: Quandaries in Genetic Decision Making.” In Relational Autonomy: Feminist Perspectives on Autonomy, Agency, and the Social Self, edited by Catriona Mackenzie and Natalie Stoljar, 236-258. Oxford: Oxford University Press, 2000.
  • Dworkin, Gerald. “Paternalism,” The Monist, 56 no. 1 (1972), 64-84.
  • Dworkin, Gerald. The Theory and Practice of Autonomy, Cambridge: Cambridge University Press, 1988.
  • Dworkin, Gerald. “The Concept of Autonomy,” in The Inner Citadel, ed. John Christman, 54-62.
  • Ekstrom, Laura. “A Coherence Theory of Autonomy,” Philosophy and Phenomenological Research, 53 (1993), 599–616.
  • Fineman, Martha Albertson. The Autonomy Myth: A Theory of Dependency. New York: The New Press, 2004.
  • Forst, Rainer. “Political Liberty: Integrating Five Conceptions of Autonomy,” in Autonomy and the Challenges to Liberalism, 2005, 226-242.
  • Frankfurt, Harry. The Importance of What We Care About, ed. Harry Frankfurt, Cambridge: Cambridge University Press, 1988.
  • Friedman, Marilyn. “Autonomy and the Split-Level Self,” Southern Journal of Philosophy 24 (1986), 19-35.
  • Friedman, Marilyn. Autonomy, Gender, Politics, Oxford: Oxford University Press, 2003.
  • Gaus, Gerald F. “The Place of Autonomy Within Liberalism,” in Autonomy and the Challenges to Liberalism, 2005, 272-306.
  • Gilligan, Carol. In a Different Voice: Psychological Theory and Women’s Development. Cambridge, Mass.: Harvard University Press, 1982.
  • Grimshaw, Jean. Philosophy and Feminist Thinking. Minneapolis: University of Minnesota Press, 1986.
  • Harding, Sandra and Merrill B. Hintikka, eds., Discovering Reality: Feminist Perspectives on Epistemology, Metaphysics, Methodology, and Philosophy of Science, 2 ed.. Dordrecht: Kluwer Academic Publishers, 2003.
  • Hill, Thomas. “The Kantian Conception of Autonomy,” in The Inner Citadel, ed. John Christman, 91-105.
  • Hill, Thomas. “A Kantian Perspective on Moral Rules,” in Respect, Pluralism, and Justice: Kantian Perspectives (Oxford and New York: Oxford University Press, 2000), 33-55.
  • Hinchman, Lewis. “Autonomy, Individuality, and Self-Determination,” in What is Enlightenment? Eighteenth-Century Answers and Twentieth-Century Questions., ed. James Schmidt. Berkeley: University of California Press, 1996, 488-516.
  • Hoagland, Sarah L. Lesbian Ethics: Toward New Value. Palo Alto, California: Institute of Lesbian Studies, 1988, 144.
  • Jaggar, Alison M. Feminist Politics and Human Nature, Totowa, NJ: Rowman & Allenheld, 1983.
  • Lévinas, Emmanuel. Totality and Infinity, trans. Alphonso Lingis, Pittsburgh: Duquesne University Press, 1969.
  • Lloyd, Genevieve. The Man of Reason: Male and Female in Western Philosophy (London: Routledge, 1986).
  • Kant, Immanuel. Practical Philosophy. ed. and trans. Mary Gregor. 1996
  • Korsgaard, Christine. The Sources of Normativity. New York: Cambridge University Press, 1996.
  • Kymlicka, Will. Contemporary Political Philosophy: An Introduction. Oxford: Oxford University Press, 1991.
  • Mackenzie, Catriona, and Stoljar, Natalie, (eds.). Relational Autonomy, New York and Oxford: Oxford University Press, 2000.
  • Mele, Alfred R. Autonomous Agents: From Self-Control to Autonomy, New York and Oxford: Oxford University Press, 2001.
  • Meyers, Diana Tietjens. Self, Society, and Personal Choice, New York: Columbia University Press, 1989.
  • Meyers, Diana Tietjens. “Decentralizing Autonomy: Five Faces of Selfhood.” In Autonomy and the Challenges to Liberalism, edited by John Christman and Joel Anderson, 27-55. Cambridge: Cambridge University Press, 2005.
  • Mill, John Stuart. On Liberty, Indianapolis and New York: The Liberal Arts Press, 1956; originally published 1859.
  • Narayan, Uma. “Minds of Their Own: Choices, Autonomy, Cultural Practices, and Other Women,” in A Mind of One’s Own: Feminist Essays on Reason and Objectivity, ed. Louise M. Antony and Charlotte Witt (Boulder, CO: Westview Press, 2002), 418-432.
  • Nedelsky, Jennifer. “Reconceiving Autonomy: Sources, Thoughts and Possibilities.” Yale Journal of Law and Feminism, no. 1 (1989): 7-36.
  • Nussbaum, Martha. Frontiers of Justice: Disability, Nationality, Species Membership, Cambridge, MA: Belknap Press, 2006.
  • Okin, Susan Moller. Justice, Gender, and the Family, New York: Basic Books, Inc., 1989.
  • Oshana, Marina A. L. “Autonomy and Self-Identity.” In Autonomy and the Challenges to Liberalism: New Essays, edited by John Christman and Joel Anderson, 77-97. Cambridge: Cambridge University Press, 2005
  • Sandel, Michael J. Liberalism and the Limits of Justice, 2nd ed., Cambridge: Cambridge University Press, 1998.
  • Sen, Amartya. Development as Freedom, Oxford: Oxford University Press, 1999.
  • Taylor, James S. (ed.). Personal Autonomy, Cambridge: Cambridge University Press, 2005.
  • Thalberg, Irving. “Hierarchical Analyses of Unfree Action,” Canadian Journal of Philosophy, no. 8 (1978).
  • Watson, Gary. “Free Agency,” Journal of Philosophy, no. 72 (1975), 205-220.
  • Weiss, Penny A. “Feminism and Communitarianism: Comparing Critiques of Liberalism.” In Feminism and Community, edited by Penny A. Weiss and Marilyn Friedman. Philadelphia: Temple University Press, 1995, 161-186.
  • Wolf, Susan. Freedom within Reason (New York: Oxford University Press, 1990).
  • Wolf, Susan. “Sanity and the Metaphysics of Responsibility,” in Responsibility, Character and the Emotions, ed. Ferdinand Schoeman. New York: Cambridge University Press, 1987, 46-62.

Author Information

Jane Dryden
Email: jdryden@mta.ca
Mount Allison University
Canada

Metaphilosophy

What is philosophy? What is philosophy for? How should philosophy be done? These are metaphilosophical questions, metaphilosophy being the study of the nature of philosophy. Contemporary metaphilosophies within the Western philosophical tradition can be divided, rather roughly, according to whether they are associated with (1) Analytic philosophy, (2) Pragmatist philosophy, or (3) Continental philosophy.

The pioneers of the Analytic movement held that philosophy should begin with the analysis of propositions. In the hands of two of those pioneers, Russell and Wittgenstein, such analysis gives a central role to logic and aims at disclosing the deep structure of the world. But Russell and Wittgenstein thought philosophy could say little about ethics. The movement known as Logical Positivism shared the aversion to normative ethics. Nonetheless, the positivists meant to be progressive. As part of that, they intended to eliminate metaphysics. The so-called ordinary language philosophers agreed that philosophy centrally involved the analysis of propositions, but, and this recalls a third Analytic pioneer, namely Moore, their analyses remained at the level of natural language as against logic. The later Wittgenstein has an affinity with ordinary language philosophy. For Wittgenstein had come to hold that philosophy should protect us against dangerous illusions by being a kind of therapy for what normally passes for philosophy. Metaphilosophical views held by later Analytic philosophers include the idea that philosophy can be pursued as a descriptive but not a revisionary metaphysics and that philosophy is continuous with science.

The pragmatists, like those Analytic philosophers who work in practical or applied ethics, believed that philosophy should treat ‘real problems’ (although the pragmatists gave ‘real problems’ a wider scope than the ethicists tend to). The neopragmatist Rorty goes so far as to say the philosopher should fashion her philosophy so as to promote her cultural, social, and political goals. So-called post-Analytic philosophy is much influenced by pragmatism. Like the pragmatists, the post-Analyticals tend (1) to favor a broad construal of the philosophical enterprise and (2) to aim at dissolving rather than solving traditional or narrow philosophical problems.

The first Continental position considered herein is Husserl’s phenomenology. Husserl believed that his phenomenological method would enable philosophy to become a rigorous and foundational science. Still, on Husserl’s conception, philosophy is both a personal affair and something that is vital to realizing the humanitarian hopes of the Enlightenment. Husserl’s existential successors modified his method in various ways and stressed, and refashioned, the ideal of authenticity presented by his writings. Another major Continental tradition, namely Critical Theory, makes of philosophy a contributor to emancipatory social theory; and the version of Critical Theory pursued by Jürgen Habermas includes a call for ‘postmetaphysical thinking’. The later thought of Heidegger advocates a postmetaphysical thinking too, albeit a very different one; and Heidegger associates metaphysics with the ills of modernity. Heidegger strongly influenced Derrida’s metaphilosophy. Derrida’s deconstructive approach to philosophy (1) aims at clarifying, and loosening the grip of, the assumptions of previous, metaphysical philosophy, and (2) means to have an ethical and political import.

Table of Contents

  1. Introduction
    1. Some Pre-Twentieth Century Metaphilosophy
    2. Defining Metaphilosophy
    3. Explicit and Implicit Metaphilosophy
    4. The Classification of Metaphilosophies – and the Treatment that Follows
  2. Analytic Metaphilosophy
    1. The Analytic Pioneers: Russell, the Early Wittgenstein, and Moore
    2. Logical Positivism
    3. Ordinary Language Philosophy and the Later Wittgenstein
    4. Three Revivals
      1. Normative Philosophy including Rawls and Practical Ethics
      2. History of Philosophy
      3. Metaphysics: Strawson, Quine, Kripke
    5. Naturalism including Experimentalism and Its Challenge to Intuitions
  3. Pragmatism, Neopragmatism, and Post-Analytic Philosophy
    1. Pragmatism
    2. Neopragmatism: Rorty
    3. Post-Analytic Philosophy
  4. Continental Metaphilosophy
    1. Phenomenology and Related Currents
      1. Husserl’s Phenomenology
      2. Existential Phenomenology, Hermeneutics, Existentialism
    2. Critical Theory
      1. Critical Theory and the Critique of Instrumental Reason
      2. Habermas
    3. The Later Heidegger
    4. Derrida’s Post-Structuralism
  5. References and Further Reading
    1. Explicit Metaphilosophy and Works about Philosophical Movements or Traditions
    2. Analytic Philosophy including Wittgenstein, Post-Analytic Philosophy, and Logical Pragmatism
    3. Pragmatism and Neopragmatism
    4. Continental Philosophy
    5. Other

1. Introduction

The main topic of the article is the Western metaphilosophy of the last hundred years or so. But that topic is broached via a sketch of some earlier Western metaphilosophies. (In the case of the sketch, ‘Western’ means European. In the remainder of the article, ‘Western’ means European and North American. On Eastern meta­philosophy, see the entries filed under such heads as ‘Chinese philosophy’ and ‘Indian philosophy’.) Once that sketch is in hand, the article defines the notion of metaphilosophy and distinguishes between explicit and implicit metaphilosophy. Then there is a consideration of how metaphilosophies might be categorized and an outline of the course of the remainder of the article.

a. Some Pre-Twentieth Century Metaphilosophy

Socrates believed that the unexamined life – the unphilosophical life – was not worth living (Plato, Apology, 38a). Indeed, Socrates saw his role as helping to rouse people from unreflective lives. He did this by showing them, through his famous ‘Socratic method’, that in fact they knew little about, for example, justice, beauty, love or piety. Socrates’ use of that method contributed to his being condemned to death by the Athenian state. But Socrates’ politics contributed too; and here one can note that, according to the Republic (473c-d), humanity will prosper only when philosophers are kings or kings’ philosophers. It is notable too that, in Plato’s Phaedo, Socrates presents death as liberation of the soul from the tomb of the body.

According to Aristotle, philosophy begins in wonder, seeks the most fundamental causes or principles of things, and is the least necessary but thereby the most divine of sciences (Metaphysics, book alpha, sections 1–3). Despite the point about necessity, Aristotle taught ethics, a subject he conceived as ‘a kind of political science’ (Nicomachean Ethics, book 1) and which had the aim of making men good. Later philosophers continued and even intensified the stress on philosophical practicality. According to the Hellenistic philosophers – the Cynics, Sceptics, Epicureans and Stoics – philosophy revealed (1) what was valuable and what was not, and (2) how one could achieve the former and protect oneself against longing for the latter. The Roman Cicero held that to study philosophy is to prepare oneself for death. The later and neoplatonic thinker Plotinus asked, ‘What, then, is Philosophy?’ and answered, ‘Philosophy is the supremely precious’ (Enneads, I.3.v): a means to blissful contact with a mystical principle he called ‘the One’.

The idea that philosophy is the handmaiden of theology, earlier propounded by the Hellenistic thinker Philo of Alexandria, is most associated with the medieval age and particularly with Aquinas. Aquinas resumed the project of synthesizing Christianity with Greek philosophy – a project that had been pursued already by various thinkers including Augustine, Anselm, and Boethius. (Boethius was a politician inspired by philosophy – but the politics ended badly for him. In those respects he resembles the earlier Seneca. And, like Seneca, Boethius wrote of the consolations of philosophy.)

‘[T]he word “philosophy” means the study [or love – philo] of wisdom, and by “wisdom” is meant not only prudence in our everyday affairs but also a perfect knowledge of all things that mankind is capable of knowing, both for the conduct of life and for the preservation of health and the discovery of all manner of skills.’ Thus Descartes (1988: p. 179). Locke’s Essay Concerning Human Understanding (bk. 4. ch. 19, p. 697) connects philosophy with the love of truth and identifies the following as an ‘unerring mark’ of that love: ‘The not entertaining any Proposition with greater assurance than the Proofs it is built upon will warrant.’ Hume’s ‘Of Suicide’ opens thus: ‘One considerable advantage that arises from Philosophy, consists in the sovereign antidote which it affords to superstition and false religion’ (Hume 1980: 97). Kant held that ‘What can I know?’, ‘What ought I to do?’, and, ‘What may I hope?’ were the ultimate questions of human reason (Critique of Pure Reason, A805 / B33) and asserted that philosophy’s ‘peculiar dignity’ lies in ‘principles of morality, legislation, and religion’ that it can provide (A318 / B375). According to Hegel, the point of philosophy – or of ‘the dialectic’ – is to enable people to recognize the embodiment of their ideals in their social and political lives and thereby to be at home in the world. Marx’s famous eleventh ‘Thesis on Feuerbach’ declared that, while philosophers had interpreted the world, the point was to change it.

b. Defining Metaphilosophy

As the foregoing sketch begins to suggest, three very general metaphilosophical questions are (1) What is philosophy? (2) What is, or what should be, the point of philosophy? (3) How should one do philosophy? Those questions resolve into a host of more specific meta­philosophical conundra, some of which are as follows. Is philosophy a process or a product? What kind of knowledge can philosophy attain? How should one understand philosophical disagreement? Is philosophy historical in some special or deep way? Should philosophy make us better people? Happier people? Is philosophy political? What method(s) and types of evidence suit philosophy? How should philosophy be written (presuming it should be written at all)? Is philosophy, in some sense, over – or should it be?

But how might one define metaphilosophy? One definition owes to Morris Lazerowitz. (Lazerowitz claims to have invented the English word ‘metaphilosophy’ in 1940. But some foreign-language equivalents of the term ‘metaphilosophy’ antedate 1940. Note further that, in various languages including English, sometimes the term takes a hyphen before the ‘meta’.) Lazerowitz proposed (1970) that metaphilosophy is ‘the investigation of the nature of philosophy.’ If we take ‘nature’ to include both the point of philosophy and how one does (or should do) philosophy, then that definition fits with the most general meta­philosophical questions just identified above. Still: there are other definitions of metaphilosophy; and while Lazerowitz’s definition will prove best for our purposes, one needs – in order to appreciate that fact, and in order to give the definition a suitable (further) gloss – to survey the alternatives.

One alternative definition construes metaphilosophy as the philosophy of philosophy. Sometimes that definition intends this idea: metaphilosophy applies the method(s) of philosophy to philosophy itself. That idea itself comes in two versions. One is a ‘first-order’ construal. The thought here is this. Metaphilosophy, as the application of philosophy to philosophy itself, is simply one more instance of philosophy (Wittgenstein 2001: section 121; Williamson 2007: ix). The other version – the ‘second-order’ version of the idea that metaphilosophy applies philosophy to itself – is as follows. Metaphilosophy stands to philosophy as philosophy stands to its subject matter or to other disciplines (Rescher 2006), such that, as Williamson puts it (loc. cit) metaphilosophy ‘look[s] down upon philosophy from above, or beyond.’ (Williamson himself, who takes the first-order view, prefers the term ‘the philosophy of philosophy’ to ‘metaphilosophy’. For he thinks that ‘metaphilosophy’ has this connotation of looking down.) A different definition of metaphilosophy exploits the fact that ‘meta’ can mean not only about but also after. On this definition, metaphilosophy is postphilosophy. Sometimes Lazerowitz himself used ‘metaphilosophy’ in that way. What he had in mind here, more particularly, is the ‘special kind of investigation which Wittgenstein had described as one of the “heirs” of philosophy’ (Lazerowitz 1970). Some French philosophers have used the term similarly, though with reference to Heidegger and/or Marx rather than to Wittgenstein (Elden 2004: 83).

What then commends Lazerowitz’s (original) definition – the definition whereby metaphilosophy is investigation of the nature (and point) of philosophy? Two things. (1) The two ‘philosophy–of–philosophy’ construals are competing specifications of that definition. Indeed, those construals have little content until after one has a considerable idea of what philosophy is. (2) The equation of metaphilosophy and post-philosophy is narrow and tendentious; but Lazerowitz’s definition accommodates post-philosophy as a position within a more widely construed metaphilosophy. Still: Lazerowitz’s definition does require qualification, since there is a sense in which it is too broad. For ‘investigation of the nature of philosophy’ suggests that any inquiry into philosophy will count as meta­philosophical, whereas an inquiry tends to be deemed meta­philosophical only when it pertains to the essence, or very nature, of philosophy. (Such indeed is a third possible reading of the philosophy-of-philosophy construal.) Now, just what does so pertain is moot; and there is a risk of being too unaccommodating. We might want to deny the title ‘metaphilosophy’ to, say, various sociological studies of philosophy, and even, perhaps, to philosophical pedagogy (that is, to the subject of how philosophy is taught). On the other hand, we are inclined to count as meta­philosophical claims about, for instance, philosophy corrupting its students or about professionalization corrupting philosophy (on these claims one may see Stewart 1995 and Anscombe 1957).

What follows will give a moderately narrow interpretation to the term ‘nature’ within the phrase ‘the nature of philosophy’.

c. Explicit and Implicit Metaphilosophy

Explicit metaphilosophy is metaphilosophy pursued as a subfield of, or attendant field to, philosophy. Metaphilosophy so conceived has waxed and waned. In the early twenty-first century, it has waxed in Europe and in the Anglophone (English-speaking) world. Probable causes of the  increasing interest include Analytic philosophy having become more aware of itself as a tradition, the rise of philosophizing of a more empirical sort, and a softening of the divide between ‘Analytic’ and ‘Continental’ philosophy. (This article will revisit all of those topics in one way or another.) However, even when waxing, metaphilosophy generates much less activity than philosophy. Certainly the philosophical scene contains few book-length pieces of metaphilosophy. Books such as Williamson’s The Philosophy of Philosophy, Rescher’s Essay on Metaphilosophy, and What is Philosophy? by Deleuze and Guattari – these are not the rule but the exception.

There is more to metaphilosophy than explicit metaphilosophy. For there is also implicit metaphilosophy. To appreciate that point, consider, first, that philosophical positions can have meta­philosophical aspects. Many philosophical views – views about, say, knowledge, or language, or authenticity – can have implications for the task or nature of philosophy. Indeed, all philosophizing is somewhat meta­philosophical, at least in this sense: any philosophical view or orientation commits its holder to a metaphilosophy that accommodates it. Thus if one advances an ontology one must have a metaphilosophy that countenances ontology. Similarly, to adopt a method or style is to deem that approach at least passable. Moreover, a conception of the nature and point of philosophy, albeit perhaps an inchoate one, motivates and shapes much philosophy. But – and this is what allows there to be implicit metaphilosophy – sometimes none of this is emphasized, or even appreciated at all, by those who philosophize. Much of the metaphilosophy treated here is implicit, at least in the attenuated sense that its authors give philosophy much more attention than philosophy.

d. The Classification of Metaphilosophies – and the Treatment that Follows

One way of classifying metaphilosophy would be by the aim that a given metaphilosophy attributes to philosophy. Alternatively, one could consider that which is taken as the model for philosophy or for philosophical form. Science? Art? Therapy? Something else? A further alternative is to distinguish metaphilosophies according to whether or not they conceive philosophy as somehow essentially linguistic. Another criterion would be the rejection or adoption or conception of metaphysics (metaphysics being something like the study of’ the fundamental nature of reality). And many further classifications are possible.

This article will employ the Analytic–Continental distinction as its most general classificatory schema. Or rather it uses these categories: (1) Analytic philosophy; (2) Continental philosophy; (3) pragmatism, neopragmatism, and post-Analytic philosophy, these being only some of the most important of metaphilosophies of the last century or so. Those metaphilosophies are distinguished from one from another via the philosophies or philosophical movements (movements narrower than those of the three top-level headings) to which they have been conjoined. That approach, and indeed the article’s most general schema, means that this account is organized by chronology as much as by theme. One virtue of the approach is that it provides a degree of historical perspective. Another is that the approach helps to disclose some rather implicit metaphilosophy associated with well-known philosophies. But the article will be thematic to a degree because it will bring out some points of identity and difference between various metaphilosophies and will consider criticisms of the metaphilosophies treated. However, the article will not much attempt to determine, on meta­philosophical or other criteria, the respective natures of Analytic philosophy, pragmatism, or Continental philosophy. The article employs those categories solely for organizational purposes. But note the following points.

  1. The particular placing of some individual philosophers within the schema is problematic. The case of the so-called later Wittgenstein is particularly moot. Is he ‘Analytic’? Should he have his own category?
  2. The delineation of the traditions themselves is controversial. The notions of the Analytic and the Continental are particularly vexed. The difficulties here start with the fact that here a geographical category is juxtaposed to a more thematic or doctrinal one (Williams 2003). Moreover, some philosophers deny that Analytic philosophy has any substantial existence (Preston 2007; see also Rorty 1991a: 217); and some assert the same of Continental philosophy (Glendinning 2006: 13 and ff).
  3. Even only within contemporary Western history, there are significant approaches to philosophy that seem to at least somewhat warrant their own categories. Among those approaches are ‘traditionalist philosophy’, which devotes itself to the study of ‘the grand […] tradition of Western philosophy ranging from the Pre-Socratics to Kant’ (Glock 2008: 85f.), feminism, and environmental philosophy. This article does not examine those approaches.

2. Analytic Metaphilosophy

a. The Analytic Pioneers: Russell, the Early Wittgenstein, and Moore

Bertrand Russell, his pupil Ludwig Wittgenstein, and their colleague G. E. Moore – the pioneers of Analytic philosophy – shared the view that ‘all sound philosophy should begin with an analysis of propositions’ (Russell 1992: 9; first published in 1900). In Russell and Wittgenstein such analysis was centrally a matter of logic. (Note, however, that the expression ‘Analytic philosophy’ seems to have emerged only in the 1930s.)

Russellian analysis has two stages (Beaney 2007: 2–3 and 2009: section 3; Urmson 1956). First, propositions of ordinary or scientific language are transformed into what Russell regarded as their true form. This ‘logical’ or ‘transformative’ analysis draws heavily upon the new logic of Frege and finds its exemplar in Russell’s ‘theory of descriptions’ (Analytic Philosophy, section 2.a). The next step is to correlate elements within the transformed propositions with elements in the world. Commentators have called this second stage or form of analysis – which Russell counted as a matter of ‘philosophical logic’ – ‘reductive’, ‘decompositional’, and ‘metaphysical’. It is decompositional and reductive inasmuch as, like chemical analysis, it seeks to revolve its objects into their simplest elements, such an element being simple in that it itself lacks parts or constituents. The analysis is metaphysical in that it yields a metaphysics. According to the metaphysics that Russell actually derived from his analysis – the metaphysics which he called ‘logical atomism’ – the world comprises indivisible ‘atoms’ that combine, in structures limned by logic, to form the entities of science and everyday life. Russell’s empiricism inclined him to conceive the atoms as mind-independent sense-data. (See further Russell’s Metaphysics, section 4.)

Logic in the dual form of analysis just sketched was the essence of philosophy, according to Russell (2009: ch. 2). Nonetheless, Russell wrote on practical matters, advocating, and campaigning for, liberal and socialist ideas. But he tended to regard such activities as unphilosophical, believing that ethical statements were non-cognitive and hence little amenable to philosophical analysis (see Non-Cognitivism in Ethics). But he did come to hold a form of utilitarianism that allowed ethical statements a kind of truth-aptness. And he did endorse a qualified version of this venerable idea: the contemplation of profound things enlarges the self and fosters happiness. Russell held further that practicing an ethics was little use given contemporary politics, a view informed by worries about the effects of conformity and technocracy. (On all this, see Schultz 1992.)

Wittgenstein agreed with Frege and Russell that ‘the apparent logical form of a proposition need not be its real one’ (Wittgenstein 1961: section 4.0031). And he agreed with Russell that language and the world share a common, ultimately atomistic, form. But Wittgenstein’s Tractatus Logico-Philosophicus developed these ideas into a somewhat Kantian and perhaps even Schopenhauerian position. (That book, first published in 1921, is the main and arguably only work of the so-called ‘early Wittgenstein’. section 2.c treats Wittgenstein’s later views. The title of the book translates as ‘[a] schema of philosophical logic’.)

The Tractatus maintains the following.

Only some types of proposition have sense (or are propositions properly so called), namely, those that depict possible states of affairs. The cat is on the mat is one such proposition. It depicts a possible state of affairs. If that state of affairs does not obtain – if the cat in question is not on the mat in question – then the proposition is rendered false but still has sense. The same holds for most of the propositions of our everyday speech and for scientific propositions. Matters are otherwise with propositions of logic. Propositions of logic express tautologies or contradictions; they do not depict anything – and that entails that they lack sense. (Wittgenstein calls them ‘senseless’, sinnlos.) Nor do metaphysical statements make sense. (They are ‘nonsense’, Unsinn.) Such statements concern value or the meaning of life or God. Thus, they do try to depict something; but that which they try to depict is no possible state of affairs within the world.  ‘[W]henever someone […] want[s] to say something metaphysical’, one should ‘demonstrate to him that he had failed to give a meaning to certain signs in his propositions’ (section 6.53).

A complication is that the Tractatus itself tries to say something metaphysical or at least something logical. Consequently the doctrines of the book entails that it itself lacks sense. Accordingly, Wittgenstein ends the Tractatus with the following words. ‘My propositions serve as elucidations in the following way: anyone who understands me eventually recognises them as nonsensical, when he has used them – as steps – to climb up beyond them […] He must transcend these propositions, and then he will see the world aright. ¶ What we cannot speak about we must pass over in silence’ (section 6.54–7).

Here is the metaphilosophical import of all this. Philosophy is ‘a critique of language’ that exposes metaphysical talk as senseless (section 4.0031). Accordingly, and as just heard, we are to eschew such talk. Yet, Wittgenstein’s attitude to such discourse was not straightforwardly negative. For, as seen, the Tractatus itself is senseless by its own lights. Moreover, the book uses the seemingly honorific words ‘mystical’ and ‘higher’ in relation to the states of affairs that various metaphysical or metaphysico-logical statements purport to depict (section 6.42–6.522). There is an element of reverence, then, in the ‘passing over in silence’; there are some things that philosophy is to leave well enough alone.

Like Russell and Wittgenstein, Moore advocated a form of decompositional analysis. He held that ‘a thing becomes intelligible first when it is analyzed into its constituent concepts’ (Moore 1899: 182; see further Beaney 2009: section 4). But Moore uses normal language rather than logic to specify those constituents; and, in his hands, analysis often supported commonplace, pre-philosophical beliefs. Nonetheless, and despite confessing that other philosophers rather than the world prompted his philosophizing (Schilpp 1942: 14), Moore held that philosophy should give ‘a general description of the whole Universe’ (1953: 1). Accordingly, Moore tackled ethics and aesthetics as well as epistemology and metaphysics. His Principia Ethica used the not-especially-commonsensical idea that goodness was a simple, indefinable quality in order to defend the meaningfulness of ethical statements and the objectivity of moral value. Additionally, Moore advanced a normative ethic, the wider social or political implications of which are debated (Hutchinson 2001).

Russell’s tendency to exclude ethics from philosophy, and Wittgenstein’s protective version of the exclusion, are contentious and presuppose their respective versions of atomism. In turn, that atomism relies heavily upon the idea, as meta­philosophical as it is philosophical, of an ideal language (or at least of an ideal analysis of natural language). Later sections criticize that idea. Such criticism finds little target in Moore. Yet Moore is a target for those who hold that philosophy should be little concerned with words or even, perhaps, with concepts (see section 2.c and the ‘revivals’ treated in section 2.d).

b. Logical Positivism

We witness the spirit of the scientific world-conception penetrating in growing measure the forms of personal and public life, in education, upbringing, architecture, and the shaping of economic and social life according to rational principles. The scientific world-conception serves life, and life receives it. The task of philosophical work lies in […] clarification of problems and assertions, not in the propounding of special “philosophical” pronouncements. The method of this clarification is that of logical analysis.

The foregoing passages owe to a manifesto issued by the Vienna Circle (Neurath, Carnap, and Hahn 1973: 317f. and 328). Leading members of that Circle included Moritz Schlick (a physicist turned philosopher), Rudolf Carnap (primarily a logician), and Otto Neurath (economist, sociologist, and philosopher). These thinkers were inspired by the original positivist, Auguste Comte. Other influences included the empiricisms of Hume, Russell and Ernst Mach, and also the Russell–Wittgenstein idea of an ideal logical language. (Wittgenstein’s Tractatus, in particular, was a massive influence.) The Circle, in turn, gave rise to an international movement that went under several names: logical positivism, logical empiricism, neopositivism, and simply positivism.

The clarification or logical analysis advocated by positivism is two-sided. Its destructive task was the use of the so-called verifiability principle to eliminate metaphysics. According to that principle, a statement is meaningful only when either true by definition or verifiable through experience. (So there is no synthetic apriori. See Kant, Metaphysics, section 2, and A Priori and A Posteriori.) The positivists placed mathematics and logic within the true-by-definition (or analytic apriori) category, and science and most normal talk in the category of verifiable-through-experience (or synthetic aposteriori). All else was deemed meaningless. That fate befell metaphysical statements and finds its most famous illustration in Carnap’s attack (1931) on Heidegger’s ‘What is Metaphysics?’ It was the fate, too, of ethical and aesthetic statements. Hence the non-cognitivist meta-ethics that some positivists developed.

The constructive side of positivistic analysis involved epistemology and philosophy of science. The positivists wanted to know exactly how experience justified empirical knowledge. Sometimes – the positivists took various positions on the issue – the idea was to reduce all scientific statements to those of physics. (See Reductionism.) That particular effort went under the heading of ‘unified science’. So too did an idea that sought to make good on the claim that positivism ‘served life’. The idea I have in mind was this: the sciences should collaborate in order to help solve social problems. That project was championed by the so-called Left Vienna Circle and, within that, especially by Neurath (who served in a socialist Munich government and, later, was a central figure in Austrian housing movements). The positivists had close relations with the Bauhaus movement, which was itself understood by its members as socially progressive (Galison 1990).

Positivism had its problems and its detractors. The believer in ‘special philosophical pronouncements’ will think that positivism decapitates philosophy (compare section 4.a below, on Husserl). Moreover, positivism itself seemingly involved at least one ‘special’ – read: metaphysical – pronouncement, namely, the verifiability principle. Further, there is reason to distrust the very idea of providing strict criteria for nonsense (see Glendinning 2001). Further yet, the idea of an ideal logical language was attacked as unachievable, incoherent, and/or – when used as a means to certify philosophical truth – circular (Copi 1949). There were the following doubts, too, about whether positivism really ‘served life’. (1) Might positivism’s narrow notion of fact prevent it from comprehending the real nature of society? (Critical Theory leveled that objection. See O’Neill and Uebel 2004.) (2) Might positivism involve a disastrous reduction of politics to the discovery of technical solutions to depoliticized ends? (This objection owes again to Critical Theory, but also to others. See Galison 1990 and O’Neill 2003.)

Positivism retained some coherence as a movement or doctrine until the late 1960s, even though the Nazis – with whom the positivists clashed – forced the Circle into exile. In fact, that exile helped to spread the positivist creed. But, not long after the Second World War, the ascendancy that positivism had acquired in Anglophone philosophy began to diminish. It did so partly because of the developments considered by the next section.

c. Ordinary Language Philosophy and the Later Wittgenstein

Some accounts group ordinary language philosophy and the philosophy of the later Wittgenstein (and of Wittgenstein’s disciples) together – under the title ‘linguistic philosophy’. That grouping can mislead. All previous Analytic philosophy was centrally concerned with language. In that sense, all previous Analytic philosophy had taken the so-called ‘linguistic turn’ (see Rorty 1992). Nevertheless, ordinary language philosophy and the later Wittgenstein do mark a change. They twist the linguistic turn away from logical or constructed languages and towards ordinary (that is, vernacular) language, or at least towards natural (non-artificial) language. Thereby the new bodies of thought represent a movement away from Russell, the early Wittgenstein, and the positivists (and back, to an extent, towards Moore). In short – and as many accounts of the history of Analytic philosophy put it – we have here a shift from ideal language philosophy to ordinary language philosophy.

Ordinary language philosophy began with and centrally comprised a loose grouping of philosophers among whom the Oxford dons Gilbert Ryle and J. L. Austin loomed largest. The following view united these philosophers. Patient analysis of the meaning of words can tap the rich distinctions of natural languages and minimize the unclarities, equivocations and conflations to which philosophers are prone. So construed, philosophy is unlike natural science and even, insofar as it avoided systematization, unlike linguistics. The majority of ordinary language philosophers did hold, with Austin, that such analysis was not the ‘the last word’ in philosophy. Specialist knowledge and techniques can in principle everywhere augment and improve it. But natural or ordinary language ‘is the first word’ (Austin 1979: 185; see also Analytic Philosophy, section 4a).

The later Wittgenstein did hold, or at least came close to holding, that ordinary language has the last word in philosophy. This later Wittgenstein retained his earlier view that philosophy was a critique of language – of language that tried to be metaphysical or philosophical. But he abandoned the idea (itself problematically metaphysical) that there was one true form to language. He came to think, instead, that all philosophical problems owe to ‘misinterpretation of our forms of language’ (Wittgenstein 2001: section 111). They owe to misunderstanding of the ways language actually works. A principal cause of such misunderstanding, Wittgenstein thought, is misassimilation of expressions one to another. Such misassimilation can be motivated, in turn, by a ‘craving for generality’ (Wittgenstein 1975: 17ff.) that is inspired by science. The later Wittgenstein’s own philosophizing means to be a kind of therapy for philosophers, a therapy which will liberate them from their problems by showing how, in their very formulations of those problems, their words have ceased to make sense. Wittgenstein tries to show how the words that give philosophers trouble – words such as ‘know’, ‘mind’, and ‘sensation’ – become problematical only when, in philosophers’ hands, they depart from the uses and the contexts that give them meaning. Thus a sense in which philosophy ‘leaves everything as it is’ (2001: section 124). ‘[W]e must do away with all explanation, and description alone must take its place’ (section 109). Still, Wittgenstein himself once asked, ‘[W]hat is the use of studying philosophy if all that it does for you is to enable you to talk with some plausibility about some abstruse questions of logic, etc. […]’? (cited in Malcolm 1984: 35 and 93). And in one sense Wittgenstein did not want to leave everything as it was. To wit: he wanted to end the worship of science. For the view that science could express all genuine truths was, he held, barbarizing us by impoverishing our understanding of the world and of ourselves.

Much meta­philosophical flack has been aimed at the later Wittgenstein and ordinary language philosophy. They have been accused of: abolishing practical philosophy; rendering philosophy uncritical; trivializing philosophy by making it a mere matter of words; enshrining the ignorance of common speech; and, in Wittgenstein’s case – and in his own words (taken out of context) – of ‘destroy[ing] everything interesting’ (2001: section 118; on these criticisms see Russell 1995: ch. 18, Marcuse 1991: ch. 7 and Gellner 2005). Nonetheless, it is at least arguable that these movements of thought permanently changed Analytic philosophy by making it more sensitive to linguistic nuance and to the oddities of philosophical language. Moreover, some contemporary philosophers have defended more or less Wittgensteinian conceptions of philosophy. One such philosopher is Peter Strawson (on whom see section 2.d.iii). Another is Stanley Cavell. Note also that some writers have attempted to develop the more practical side of Wittgenstein’s thought (Pitkin 1993, Cavell 1979).

d. Three Revivals

Between the 1950s and the 1970s, there were three significant, and persisting, meta­philosophical developments within the Analytic tradition.

i. Normative Philosophy including Rawls and Practical Ethics

During positivism’s ascendancy, and for some time thereafter, substantive normative issues – questions about how one should live, what sort of government is best or legitimate, and so on – were widely deemed quasi-philosophical. Positivism’s non-cognitivism was a major cause. So was the distrust, in the later Wittgenstein and in ordinary language philosophy, of philosophical theorizing. This neglect of the normative had its exceptions. But the real change occurred with the appearance, in 1971, of A Theory of Justice by John Rawls.

Many took Rawls’ book to show, through its ‘systematicity and clarity’, that normative theory was possible ‘without loss of rigor’ (Weithman 2003: 6). Rawls’ procedure for justifying normative principles is of particular metaphilosophical note. That procedure, called ‘reflective equilibrium’, has three steps. (The quotations that follow are from Schroeter 2004.)

  1. ‘[W]e elicit the moral judgments of competent moral judges’ on whatever topic is at issue. (In Theories of Justice itself, distributive justice was the topic.) Thereby we obtain ‘a set of considered judgments, in which we have strong confidence’.
  2. ‘[W]e construct a scheme of explicit principles, which will ‘‘explicate’’, ‘‘fit’’, ‘‘match’’ or ‘‘account for’’ the set of considered judgments.’
  3. By moving ‘back and forth between the initial judgments and the principles, making the adjustments which seem the most plausible’, ‘we remove any discrepancy which might remain between the judgments derived from the scheme of principles and the initial considered judgments’, thereby achieving ‘a point of equilibrium, where principles and judgments coincide’.

The conception of reflective equilibrium was perhaps less philosophically orthodox than most readers of Theory of Justice believed. For Rawls came to argue that his conception of justice was, or should be construed as, ‘political not metaphysical’ (Rawls 1999b: 47–72). A political conception of justice ‘stays on the surface, philosophically speaking’ (Rawls 1999b: 395). It appeals only to that which ‘given our history and the traditions embedded in our public life […] is the most reasonable doctrine for us’ (p. 307). A metaphysical conception of justice appeals to something beyond such contingencies. However: despite advocating the political conception, Rawls appeals to an ‘overlapping consensus’ (his term) of metaphysical doctrines. The idea here, or hope, is this (Rawls, section 3; Freeman 2007: 324–415). Citizens in modern democracies hold various and not fully inter-compatible political and social ideas. But those citizens will be able to unite in supporting a liberal conception of justice.

Around the same time as Theory of Justice appeared, a parallel revival in normative philosophy begun. This was the rise of practical ethics. Here is how one prominent practical ethicist presents ‘the most plausible explanation’ for that development. ‘[L]aw, ethics, and many of the professions—including medicine, business, engineering, and scientific research—were profoundly and permanently affected by issues and concerns in the wider society regarding individual liberties, social equality, and various forms of abuse and injustice that date from the late 1950s’ (Beauchamp 2002: 133f.). Now the new ethicists, who insisted that philosophy should treat ‘real problems’ (Beauchamp 2002: 134), did something largely foreign to previous Analytic philosophy (and to that extent did not, in fact, constitute a revival). They applied moral theory to such concrete and pressing matters as racism, sexual equality, abortion, governance and war. (On those problems, see Ethics, section 3).

According to some practical ethicists, moral principles are not only applied to, but also drawn from, cases. The issue here – the relation between theory and its application – broadened out into a more thoroughly metaphilosophical debate. For, soon after Analytic philosophers had returned to normative ethics, some of them rejected a prevalent conception of normative ethical theory, and others entirely rejected such theory. The first camp rejects moral theory qua ‘decision procedure for moral reasoning’ (Williams 1981: ix-x) but does not foreclose other types of normative theory such as virtue ethics. The second and more radical camp holds that the moral world is too complex for any (prescriptive) codification that warrants the name ‘theory’. (On these positions, see Lance and Little 2006, Clarke 1987, Chappell 2009.)

ii. History of Philosophy

For a long time, most analytic philosophers held that the history of philosophy had little to do with doing philosophy. For what – they asked – was the history of philosophy save, largely, a series of mistakes? We might learn from those mistakes, and the history might contain some occasional insights. But (the line of thought continues) we should be wary of resurrecting the mistakes and beware the archive fever that leads to the idea that there is no such thing as philosophical progress. But in the 1970s a more positive attitude to the history of philosophy began to emerge, together with an attempt to reinstate or re-legitimate serious historical scholarship within philosophy (compare Analytic Philosophy section 5.c).

The newly positive attitude towards the history of philosophy was premised on the view that the study of past philosophies was of significant philosophical value. Reasons adduced for that view include the following (Sorell and Rogers 2005). History of philosophy can disclose our assumptions. It can show the strengths of positions that we find uncongenial. It can suggest rolesthat philosophy might take today by revealing ways in which philosophy has been embedded in a wider intellectual and sociocultural frameworks. A more radical view, espoused by Charles Taylor (1984: 17) is that, ‘Philosophy and the history of philosophy are one’; ‘we cannot do the first without also doing the second.’

Many Analytical philosophers continue to regard the study of philosophy’s history as very much secondary to philosophy itself. By contrast, many so-called Continental philosophers take the foregoing ideas, including the more radical view – which is associated with Hegel – as axiomatic. (See much of section 4, below.)

iii. Metaphysics: Strawson, Quine, Kripke

Positivism, the later Wittgenstein, and Ordinary Language Philosophy suppressed Analytic metaphysics. Yet it recovered, thanks especially to three figures, beginning with Peter Strawson.

Strawson had his origins in the ordinary language tradition and he declares a large debt or affinity to Wittgenstein (Strawson 2003: 12). But he is indebted, also, to Kant; and, with Strawson, ordinary language philosophy became more systematic and more ambitious. However, Strawson retained an element of what one might call, in Rae Langton’s phrase, Kantian humility. In order to understand these characterizations, one needs to appreciate that which Strawson advocated under the heading of ‘descriptive metaphysics’. In turn, descriptive metaphysics is best approached via that which Strawson called ‘connective analysis’.

Connective analysis seeks to elucidate concepts by discerning their interconnections, which is to say, the ways in which concepts variously imply, presuppose, and exclude one another. Strawson contrasts this ‘connective model’ with ‘the reductive or atomistic model’ that aims ‘to dismantle or reduce the concepts we examine to other and simpler concepts’ (all Strawson 1991: 21). The latter model is that of Russell, the Tractatus, and, indeed, Moore. Another way in which Strawson departs from Russell and the Tractatus, but not from Moore, lies in this: a principal method of connective analysis is ‘close examination of the actual use of words’ (Strawson 1959: 9). But when Strawson turns to ‘descriptive metaphysics’, such examination is not enough.

Descriptive metaphysics is, or proceeds via, a very general form of connective analysis. The goal here is ‘to lay bare the most general features of our conceptual structure’ (Strawson 1959: 9). Those most general features – our most general concepts – have a special importance. For those concepts, or at least those of them in which Strawson is most interested, are (he thinks) basic or fundamental in the following sense. They are (1) irreducible, (2) unchangeable in that they comprise ‘a massive central core of human thinking which has no history’ (1959: 10) and (3) necessary to ‘any conception of experience which we can make intelligible to ourselves’ (Strawson 1991: 26). And the structure that these concepts comprise ‘does not readily display itself on the surface of language, but lies submerged’ (1956: 9f.).

Descriptive metaphysics is considerably Kantian (see Kant, metaphysics). Strawson is Kantian, too, in rejecting what he calls ‘revisionary metaphysics’. Here we have the element of Kantian ‘humility’ within Strawson’s enterprise. Descriptive metaphysics ‘is content to describe the actual structure of our thought about the world’, whereas revisionary metaphysics aims ‘to produce a better structure’ (Strawson 1959: 9; my stress). Strawson urges several points against revisionary metaphysics.

  1. A revisionary metaphysic is apt to be an overgeneralization of some particular aspect of our conceptual scheme and/or
  2. to be a confusion between conceptions of how things really are with some Weltanschauung.
  3. Revisionary metaphysics attempts the impossible, namely, to depart from the fundamental features of our conceptual scheme. The first point shows the influence of Wittgenstein. So does the third, although it is also (as Strawson may have recognized) somewhat Heideggerian. The second point is reminiscent of Carnap’s version of logical positivism. All this notwithstanding, and consistently enough, Strawson held that systems of revisionary metaphysics can, through the ‘partial vision’ (1959: 9) that they provide, be useful to descriptive metaphysics.

Here are some worries about Strawson’s metaphilosophy. ‘[T]he conceptual system with which “we” are operating may be much more changing, relative, and culturally limited than Strawson assumes it to be’ (Burtt 1963: 35). Next: Strawson imparts very little about the method(s) of descriptive metaphysics (although one might try to discern techniques – in which imagination seems to play a central role – from his actual analyses). More serious is that Strawson imparts little by way of answer to the following questions. ‘What is a concept? How are concepts individuated? What is a conceptual scheme? How are conceptual schemes individuated? What is the relation between a language and a conceptual scheme?’ (Haack 1979: 366f.). Further: why believe that the analytic philosopher has no business providing ‘new and revealing vision[s]’ (Strawson 1992: 2)? At any rate, Strawson helped those philosophers who rejected reductive (especially Russellian and positivistic) versions of analysis but who wanted to continue to call themselves ‘analytic’. For he gave them a reasonably narrow conception of analysis to which they could adhere (Beaney 2009: section 8; compare Glock 2008: 159). Finally note that, despite his criticisms of Strawson, the contemporary philosopher Peter Hacker defends a metaphilosophy rather similar to descriptive metaphysics (Hacker 2003 and 2007).

William Van Orman Quine was a second prime mover in the metaphysical revival. Quine’s metaphysics, which is revisionary in Strawson’s terms, emerged from Quine’s attack upon ‘two dogmas of modern empiricism’. Those ostensible dogmas are: (1) ‘belief in some fundamental cleavage between truths that are analytic, or grounded in meanings independently of matters of fact, and truths that are synthetic, or grounded in fact’; (2) ‘reductionism: the belief that each meaningful statement is equivalent to some logical construction upon terms which refer to immediate experience’ (Quine 1980: 20). Against 1, Quine argues that every belief has some connection to experience. Against 2, he argues that the connection is never direct. For when experience clashes with some belief, which belief(s) must be changed is underdetermined. Beliefs ‘face the tribunal of sense experience not individually but as a corporate body’ (p. 41; see Evidence section 3.c.i). Quine expresses this holistic and radically empiricist conception by speaking of ‘the web of belief’. Some beliefs – those near the ‘edge of the web’ – are more exposed to experience than others; but the interlinking of beliefs is such that no belief is immune to experience.

Quine saves metaphysics from positivism. More judiciously put: Quine’s conception, if correct, saves metaphysics from the verifiability criterion (q.v. section 2.b). For the notion of the web of belief implies that ontological beliefs – beliefs about ‘the most general traits of reality’ (Quine 1960: 161) – are answerable to experience. And, if that is so, then ontological beliefs differ from other beliefs only in their generality. Quine infers that, ‘Ontological questions […] are on a par with questions of natural science’ (1980: 45). In fact, since Quine thinks that natural science, and in particular physics, is the best way of fitting our beliefs to reality, he infers that ontology should be determined by the best available comprehensive scientific theory. In that sense, metaphysics is ‘the metaphysics of science’ (Glock 2003a: 30).

Is the metaphysics of science actually only science? Quine asserts that ‘it is only within science itself, and not in some prior philosophy, that reality is to be identified and described’ (1981: 21). Yet he does leave a job for the philosopher. The philosopher is to translate the best available scientific theory into that which Quine called ‘canonical notation’, namely, ‘the language of modern logic as developed by Frege, Peirce, Russell and others’ (Orenstein 2002: 16). Moreover, the philosopher is to make the translation in such a way as to minimize the theory’s ontological commitments. Only after such a translation, which Quine calls ‘explication’ can one say, at a philosophical level: ‘that is What There Is’. (However, Quine cannot fully capitalize those letters, as it were. For he thinks that there is a pragmatic element to ontology. See section 3.a below.) This role for philosophy is a reduced one. For one thing, it deprives philosophy of something traditionally considered one of its greatest aspirations: necessary truth. On Quine’s conception, no truth can be absolutely necessary. (That holds even for the truths of Quine’s beloved logic, since they, too, fall within the web of belief.) By contrast, even Strawson and the positivists – the latter in the form of ‘analytic truth’ – had countenanced versions of necessary truth.

Saul Kripke – the third important reviver of metaphysics – allows the philosopher a role that is perhaps slightly more distinct than Quine does. Kripke does that precisely by propounding a new notion of necessity. (That said, some identify Ruth Barcan Marcus as the discoverer of the necessity at issue.) According to Kripke (1980), a truth T about X is necessary just when T holds in all possible worlds that contain X. To explain: science shows us that, for example, water is composed of H20; the philosophical question is whether that truth holds of all possible worlds (all possible worlds in which water exists) and is thereby necessary. Any such science-derived necessities are aposteriori just because, and in the sense that, they are (partially) derived from science.

Aposteriori necessity is a controversial idea. Kripke realizes this. But he asks why it is controversial. The notions of the apriori and aposteriori are epistemological (they are about whether or not one needs to investigate the world in order to know something), whereas – Kripke points out – his notion of necessity is ontological (that is, about whether things could be otherwise). As to how one determines whether a truth obtains in all possible worlds, Kripke’s main appeal is to the intuitions of philosophers. The next subsection somewhat scrutinizes that appeal, together with some of the other ideas of this subsection.

e. Naturalism including Experimentalism and Its Challenge to Intuitions

Kripke and especially Quine helped to create, particularly in the United States, a new orthodoxy within Analytic philosophy. That orthodoxy is naturalism or – the term used by its detractors – scientism. But naturalism (/scientism) is no one thing (Glock 2003a: 46; compare Papineau 2009). Ontological naturalism holds that the entities treated by natural science exhaust reality. Meta­philosophical naturalism – which is the focus in what follows – asserts a strong continuity between philosophy and science. A common construal of that continuity runs thus. Philosophical problems are in one way or another ‘tractable through the methods of the empirical sciences’ (Naturalism, Introduction). Now, within meta­philosophical naturalism, one can distinguish empirical philosophers from experimental philosophers (Prinz 2008). Empirical philosophers enlist science to answer, or to help answer, philosophical problems. Experimental philosophers (or ‘experimentalists’) themselves do science, or do so in collaboration with scientists. Let us start with empirical philosophy.

Quine is an empirical philosopher in his approach to metaphysics and even more so in his approach to epistemology. Quine presents and urges his epistemology thus: ‘The stimulation of his sensory receptors is all the evidence anybody has had to go on, ultimately, in arriving at his picture of the world. Why not just see how this construction really proceeds? Why not settle for psychology?’ (Quine 1977: 75). Such naturalistic epistemology – in Quine’s own formulation, ‘naturalized epistemology’ – has been extended to moral epistemology. ‘A naturalized moral epistemology is simply a naturalized epistemology that concerns itself with moral knowledge’ (Campbell and Hunter 2000: 1). There is such a thing, too, as naturalized aesthetics: the attempt to use science to solve aesthetical problems (McMahon 2007). Other forms of empirical philosophy include neurophilosophy, which applies methods from neuroscience, and sometimes computer science, to questions in the philosophy of mind.

Naturalized epistemology has been criticized for being insufficiently normative. How can descriptions of epistemic mechanisms determine license for belief? The difficulty seems especially pressing in the case of moral epistemology. Wittgenstein’s complaint against naturalistic aesthetics – a view he called ‘exceedingly stupid’ – may intend a similar point. ‘The sort of explanation one is looking for when one is puzzled by an aesthetic impression is not a causal explanation, not one corroborated by experience or by statistics as to how people react’ (all Wittgenstein 1966: 17, 21). A wider disquiet about meta­philosophical naturalism is this: it presupposes a controversial view explicitly endorsed by Quine, namely that science alone provides true or good knowledge (Glock 2003a: 28, 46). For that reason and for others, some philosophers, including Wittgenstein, are suspicious even of scientifically-informed philosophy of mind.

Now the experimentalists – the philosophers who actually do science – tend to use science not to propose new philosophical ideas or theories but rather to investigate existing philosophical claims. The philosophical claims at issue are based upon intuitions, intuitions being something like ‘seemings’ or spontaneous judgments. Sometimes philosophers have employed intuitions in support of empirical claims. For example, some ethicists have asserted, from their philosophical armchairs, that character is the most significant determinant of action. Another example: some philosophers have speculated that most people are ‘incompatibilists’ about determinism. (The claim in this second example is, though empirical, construable as a certain type of second-order intuition, namely, as a claim that is empirical, yet made from the armchair, about the intuitions that other people have.) Experimentalists have put such hunches to the test, often concluding that they are mistaken (see Levin 2009 and Levy 2009). At other times, though, the type of intuitively-based claim that experimentalists investigate is non-empirical or at least not evidently empirical. Here one finds, for instance, intuitions about what counts as knowledge, about whether some feature of something is necessary to it (recall Kripke, above), about what the best resolution of a moral dilemma is, and about whether or not we have free will. Now, experimentalists have not quite tested claims of this second sort. But they have used empirical methods in interrogating the ways in which philosophers, in considering such claims, have employed intuitions. Analytic philosophers have been wont to use their intuitions about such non-empirical matters to establish burdens of proof, to support premises, and to serve as data against which to test philosophical theories. But experimentalists have claimed to find that, at least in the case of non-philosophers, intuitions about such matters vary considerably. (See for instance Weinberg, Nichols and Stitch 2001.) So, why privilege the intuitions of some particular philosopher?

Armchair philosophers have offered various responses. One is that philosophers’ intuitions diverge from ‘folk’ intuitions only in this way: the former are more considered versions of the latter (Levin 2009). But might not such considered intuitions vary among themselves? Moreover: why at all trust even considered intuitions? Why not think – with Quine (and William James, Richard Rorty, Nietzsche, and others) that intuitions are sedimentations of culturally or biologically inherited views? A traditional response to that last question (an ‘ordinary language response’ and equally, perhaps, ‘an ideal language’ response) runs as follows. Intuitions do not convey views of the world. Rather they convey an implicit knowledge of concepts or of language. A variation upon that reply gives it a more naturalistic gloss. The idea here is that (considered) intuitions, though indeed ‘synthetic’ and, as such, defeasible, represent good prima facie evidence for the philosophical views at issue, at least if those views are about the nature of concepts (see for instance Graham and Horgan 1994).

3. Pragmatism, Neopragmatism, and Post-Analytic Philosophy

a. Pragmatism

The original or classical pragmatists are the North Americans C.S. Peirce (1839–1914), William James (1842–1910), John Dewey (1859–1952) and, perhaps, G. H. Mead. The metaphilosophy of pragmatism unfolds from that which became known as ‘the pragmatic maxim’.

Peirce invented the pragmatic maxim as a tool for clarifying ideas. His best known formulation of the maxim runs thus: ‘Consider what effects, which might conceivably have practical bearings, we conceive the object of our conception to have. Then, our conception of these effects is the whole of our conception of the object’ (Peirce 1931-58, volume 5: section 402). Sometimes the maxim reveals an idea to have no meaning. Such was the result, Peirce thought, of applying the maxim to transubstantiation, and, indeed, to many metaphysical ideas. Dewey deployed the maxim similarly. He saw it ‘as a method for inoculating ourselves against certain blind alleys in philosophy’ (Talisse and Aikin 2008: 17). James construed the maxim differently. Whereas Peirce seemed to hold that the ‘effects’ at issue were, solely, effects upon sensory experience, James extended those effects into the psychological effects of believing in the idea(s) in question. Moreover, whereas Peirce construed the maxim as a conception of meaning, James turned it into a conception of truth. ‘“The true”’ is that which, ‘in almost any fashion’, but ‘in the long run and on the whole’, is ‘expedient in the way of our thinking’ (James 1995: 86). As a consequence of these moves, James thought that many philosophical disputes were resolvable, and were only resolvable, through the pragmatic maxim.

None of the pragmatists opposed metaphysics as such or as a whole. That may be because each of them held that philosophy is not fundamentally different to other inquiries. Each of Peirce, James and Dewey elaborates the notion of inquiry, and the relative distinctiveness of philosophy, in his own way. But there is common ground on two views. (1) Inquiry is a matter of coping. Dewey, and to an extent James, understand inquiry as an organism trying to cope with its environment. Indeed Dewey was considerably influenced by Darwin. (2) Experimental science is the exemplar of inquiry. One finds this second idea in Dewey but also and especially in Peirce. The idea is that experimental science is the best method or model of inquiry, be the inquiry practical or theoretical, descriptive or normative, philosophical or non-philosophical. ‘Pragmatism as attitude represents what Mr. Peirce has happily termed the “laboratory habit of mind” extended into every area where inquiry may fruitfully be carried on’ (Dewey 1998, volume 2: 378). Each of these views (that is, both 1 and 2) may be called naturalistic (the second being a version of metaphilosophical naturalism; q.v. section 2.e).

According to pragmatism (though Peirce is perhaps an exception) pragmatism was a humanism. Its purpose was to serve humanity. Here is James (1995: 2): ‘no one of us can get along without the far-flashing beams of light it sends over the world’s perspectives’, the ‘it’ here being pragmatist philosophy and also philosophy in general. James held further that pragmatism, this time in contrast with some other philosophies, allows the universe to appear as ‘a place in which human thoughts, choices, and aspirations count for something’ (Gallie 1952: 24). As to Dewey, he held the following. ‘Ideals and values must be evaluated with respect to their social consequences, either as inhibitors or as valuable instruments for social progress’; and ‘philosophy, because of the breadth of its concern and its critical approach, can play a crucial role in this evaluation’ (Dewey, section 4). Indeed, according to Dewey, philosophy is to be ‘a social hope reduced to a working programme of action, a prophecy of the future, but one disciplined by serious thought and knowledge’ (Dewey 1998, vol. 1: 72). Dewey himself pursued such a programme, and not only in his writing – in which he championed a pervasive form of democracy – but also (and to help enable such democracy) as an educationalist.

Humanism notwithstanding, pragmatism was not hostile to religion. Dewey could endorse religion as a means of articulating our highest values. James tended to hold that the truth of religious ideas was to be determined, at the broadest level, in the same way as the truth of anything else. Peirce, for his part, was a more traditional philosophical theist. The conceptions of religion advocated by James and Dewey have been criticized for being very much reconceptions (Talisse and Aikin 2008: 90–94). A broader objection to pragmatist humanism is that its making of man the measure of all things is false and even pernicious. One finds versions of that objection in Heidegger and Critical Theory. One could level the charge, too, from the perspective of environmental ethics. Rather differently, and even more broadly, one might think that ‘moral and political ambitions’ have no place ‘within philosophy proper’ (Glock 2003a: 22 glossing Quine). Objections of a more specific kind have targeted the pragmatic maxim. Critics have faulted Peirce’s version of the pragmatic maxim for being too narrow or too indeterminate; and Russell and others have criticized James’ version as a misanalysis of what we mean by ‘true’.

Pragmatism was superseded (most notably in the United States) or occluded (in those places where it took little hold in the first place) by logical positivism. But the metaphilosophy of logical positivism has important similarities to pragmatism’s. Positivism’s verifiability principle is very similar to Peirce’s maxim. The positivists held that science is the exemplar of inquiry. And the positivists, like pragmatism, aimed at the betterment of society. Note also that positivism itself dissolved partly because its original tenets underwent a ‘“pragmaticization”’ (Rorty 1991b: xviii). That pragmaticization was the work especially of Quine and Davidson, who are ‘logical pragmatists’ in that they use logical techniques to develop some of the main ideas of pragmatists (Glock 2003a: 22–3; see also Rynin 1956). The ideas at issue include epistemological holism and the underdetermination of various type of theory by evidence. The latter is the aforementioned (section 2.d.iii) pragmatic element within Quine’s approach to ontology (on which see further Quine’s Philosophy of Science, section 3).

b. Neopragmatism: Rorty

The label ‘neopragmatism’ has been applied to Robert Brandom, Susan Haack, Nicholas Rescher, Richard Rorty, and other thinkers who, like them, identify themselves with some part(s) of classical pragmatism. (Karl-Otto Apel, Jürgen Habermas, John McDowell, and Hilary Putnam are borderline cases; each takes much from pragmatism but is wary about ‘pragmatist’ as a self-description.) This section concentrates upon the best known, most controversial, and possibly the most meta­philosophical, of the neopragmatists: Rorty.

Much of Rorty’s meta­philosophy issues from his antirepresentationalism. Antirepresentationalism is, in the first instance, this view: no representation (linguistic or mental conception) corresponds to reality in a way that exceeds our commonsensical and scientific notions of what it is to get the world right. Rorty’s arguments against the sort of privileged representations that are at issue here terminate or summarize as follows. ‘[N]othing counts as justification unless by reference to what we already accept […] [T]here is no way to get outside our beliefs and our language so as to find some test other than coherence’ (Rorty 1980: 178). Rorty infers that ‘the notion of “representation,” or that of “fact of the matter,”’ has no ‘useful role in philosophy’ (Rorty 1991b: 2). We are to conceive ourselves, or our conceptions, not as answerable to the world, but only to our fellows (see McDowell 2000: 110).

Rorty thinks that antirepresentationalism entails the rejection of a metaphilosophy which goes back to the Greeks, found a classic expression in Kant, and which is pursued in Analytic philosophy. That metaphilosophy, which Rorty calls ‘epistemological’, presents philosophy as ‘a tribunal of pure reason, upholding or denying the claims of the rest of culture’ (Rorty 1980: 4). More fully: philosophy judges discourses, be they religious, scientific, moral, political, aesthetical or metaphysical, by seeing which of them, and to what degree, disclose reality as it really is. (Clearly, though, more needs to be said if this conception is to accommodate Kant’s ‘transcendental idealism’. See Kant: Metaphysics, section 4.)

Rorty wants the philosopher to be, not a ‘cultural overseer’ adjudicating types of truth claims, but an ‘informed dilettante’ and a ‘Socratic intermediary’ (Rorty 1980: 317). That is, the philosopher is to elicit ‘agreement, or, at least, exciting and fruitful disagreement’ (Rorty 1980: 318) between or within various types or areas of discourse. Philosophy so conceived Rorty calls ‘hermeneutics’. The Rortian philosopher does not seek some schema allowing two or more discourses to be translated perfectly one to the other (an idea Rorty associates with representationalism). Instead she inhabits hermeneutic circle. ‘[W]e play back and forth between guesses about how to characterize particular statements or other events, and guesses about the point of the whole situation, until gradually we feel at ease with what was hitherto strange’ (1980: 319). Rorty connects this procedure to the ‘edification’ that consists in ‘finding new, better, more interesting, more fruitful ways of speaking’ (p. 360) and, thereby, to a goal he calls ‘existentialist’: the goal of finding new types of self-conception and, in that manner, finding new ways to be.

Rorty’s elaboration of all this introduces further notable meta­philosophical views. First: ‘Blake is as much of a philosopher Fichte and Henry Adams more of a philosopher than Frege’ (Rorty 1991a: xv). For Sellars was right, Rorty believes, to define philosophy as ‘an attempt to see how things, in the broadest possible sense of the term, hang together, in the broadest possible sense of the term’ (Sellars 1963: 1; compare section 6, Sellars’ Philosophy of Mind; presumably, though, Rorty holds that one has good philosophy when such attempts prove ‘edifying’). Second: what counts as a philosophical problem is contingent, and not just in that people only discover certain philosophical problems at certain times. Third: philosophical argument, at least when it aspires to be conclusive, requires shared assumptions; where there are no or few shared assumptions, such argument is impossible.

The last of the foregoing ideas is important for what one might call Rorty’s practical metaphilosophy. Rorty maintains that one can argue about morals and/or politics only with someone with whom one shares some assumptions. The neutral ground that philosophy has sought for debates with staunch egoists and unbending totalitarians is a fantasy. All the philosopher can do, besides point that out, is to create a conception that articulates, but does not strictly support, his or her moral or political vision. The philosopher ought to be ‘putting politics first and tailoring a philosophy to suit’ (Rorty 1991b: 178) – and similarly for morality. Rorty thinks that no less a political philosopher than John Rawls has already come close to this stance (Rorty 1991b: 191). Nor does Rorty bemoan any of this. The ‘cultural politics’ which suggests ‘changes in the vocabularies deployed in moral and political deliberation’ (Rorty 2007: ix) is more useful than the attempt to find philosophical foundations for some such vocabulary. The term ‘cultural politics’ could mislead, though. Rorty does not advocate an exclusive concentration on cultural as against social or economic issues. He deplores the sort of philosophy or cultural or literary theory that makes it ‘almost impossible to clamber back down […] to a level […] on which one might discuss the merits of a law, a treaty, a candidate, or a political strategy’ (Rorty 2007: 93).

Rorty’s metaphilosophy, and the philosophical views with which it is intertwined, have been attacked as irrationalist, self-refuting, relativist, unduly ethnocentric, complacent, anti-progressive, and even as insincere. Even Rorty’s self-identification with the pragmatist tradition has been challenged (despite the existence of at least some clear continuities). So have his readings, or appropriations, of his philosophical heroes, who include not only James and Dewey but also Wittgenstein, Heidegger and, to a lesser extent, Davidson and Derrida. For a sample of all these criticisms, see Brandom 2000 (which includes replies by Rorty) and Talisse and Aikin 2008: 140–148.

c. Post-Analytic Philosophy

‘Post-Analytic philosophy’ is a vaguely-defined term for something that is a current rather than a group or school. The term (in use as early as Rajchman and West 1985) denotes the work of philosophers who owe much to Analytic philosophy but who think that they have made some significant departure from it. Often the departures in question are motivated by pragmatist allegiance or influence. (Hence the placing of this section.) The following are all considerably pragmatist and are all counted as post-Analytic philosophers: Richard Rorty; Hilary Putnam; Robert Brandom; John McDowell. Still, those same figures exhibit, also, a turn to Hegel (a turn rendered slightly less remarkable by Hegel’s influence upon Peirce and especially upon Dewey). Some Wittgensteinians count as post-Analytic too, as might the later Wittgenstein himself. Stanley Cavell stands out here, though in one way or another Wittgenstein strongly influenced most of philosophers mentioned in this paragraph. Another common characteristic of those deemed post-Analytic is interest in a range of ‘Continental’ thinkers. Rorty looms large here. But there is also the aforementioned interest in Hegel, and, for instance, the fact that one finds McDowell citing Gadamer.

Post-Analytic philosophy is associated with various more or less meta­philosophical views. One is the rejection or severe revision of any notion of philosophical analysis. Witness Rorty, Brandom’s self-styled ‘analytic pragmatism’, and perhaps, meta­philosophical naturalism (q.v. section 2.e). (Still: only rarely – as in Graham and Horgan 1994, who advocate what they call ‘Post-Analytic Metaphilosophy’ – do naturalists call themselves ‘post-Analytic’.) Some post-Analytic philosophers go further, in that they tend, often under the influence of Wittgenstein, to attempt less to solve and more to dissolve or even discard philosophical problems. Each of Putnam, McDowell and Rorty has his own version of this approach, and each singles out for dissolution the problem of how mind or language relates to the world. A third characteristic feature of post-Analytic philosophy is the rejection of a certain kind of narrow professionalism. That sort of professionalism is preoccupied with specialized problems and tends to be indifferent to broader social and cultural questions. One finds a break from such narrow professionalism in Cavell, in Rorty, in Bernard Williams, and to an extent in Putnam (although also in such “public” Analytic philosophers as A. C. Grayling).

Moreover, innovative or heterodox style is something of a criterion of post-Analytic philosophy. One thinks here especially of Cavell. But one might mention McDowell too. Now, one critic of McDowell faults him for putting ‘barriers of jargon, convolution, and metaphor before the reader hardly less formidable than those characteristically erected by his German luminaries’ (Wright 2002: 157). The criticism betokens the way in post-Analytic philosophers are often regarded, namely as apostates. Post-Analytic philosophers tend to defend themselves by arguing either that Analytic philosophy needs to reconnect itself with the rest of culture, and/or that Analytic philosophy has itself shown the untenability of some of its most central assumptions and even perhaps ‘come to the end of its own project—the dead end’ (Putnam 1985: 28).

4. Continental Metaphilosophy

a. Phenomenology and Related Currents

i. Husserl’s Phenomenology

Phenomenology, as pursued by Edmund Husserl describes phenomena. Phenomena are things in the manner in which they appear. That definition becomes more appreciable through the technique through which Husserl means to gain access to phenomena. Husserl calls that technique the epoche (a term that owes to Ancient Greek skepticism). He designates the perspective that it achieves – the perspective that presents one with ‘phenomena’ – ‘the phenomenological reduction’. The epoche consists in suspending ‘the natural attitude’ (another term of Husserl’s coinage). The natural attitude comprises assumptions about the causes, the composition, and indeed the very existence of that which one experiences. The epoche, Husserl says, temporarily ‘brackets’ these assumptions, or puts them ‘out of play’ – allowing one to describe the world solely in the manner in which it appears. That description is phenomenology.

Phenomenology means to have epistemological and ontological import. Husserl presents the epistemological import – to begin with that – in a provocative way: ‘If “positivism” is tantamount to an absolutely unprejudiced grounding of all sciences on the “positive,” that is to say, on what can be seized upon originaliter, then we are the genuine positivists’ (Husserl 1931:  20). The idea that Husserl shares with the positivists is that experience is the sole source of knowledge. Hence Husserl’s ‘principle of all principles’: ‘whatever presents itself in “intuition” in primordial form […] is simply to be accepted as it gives itself out to be, but obviously only within the limits in which it thus presents itself’ (Husserl 1931: section 24). However, and like various other philosophers (including William James and the German Idealists), Husserl thinks that experience extends beyond what empiricism makes of it. For one thing – and this reveals phenomenology’s intended ontological import – experience can be of essences. A technique of ‘imaginative variation’ similar to Descartes’ procedure with the wax (see Descartes, section 4) allows one to distinguish that which is essential to a phenomenon and, thereby, to make discoveries about the nature of such phenomena as numbers and material things. Now, one might think that this attempt to derive essences from phenomena (from things in the manner in which they appear) must be idealist. Indeed – and despite the fact that he used the phrase ‘to the things themselves!’ as his slogan – Husserl did avow a ‘transcendental idealism’, whereby ‘transcendental subjectivity […] constitutes sense and being’ (Husserl 1999b: section 41). However, the exact content of that idealism – i.e. the exact meaning of the phrase just quoted – is a matter of some interpretative difficulty. It is evident enough, though, that Husserl’s idealism involves (at least) the following ideas. Experience necessarily involves various ‘subjective achievements’. Those achievements comprise various operations that Husserl calls ‘syntheses’ and which one might (although here one encounters difficulties) call ‘mental’. Moreover, the achievements are attributable to a subjectivity that deserves the name ‘transcendental’ in that (1) the achievements are necessary conditions for our experience, (2) the subjectivity at issue is transcendent in this sense: it exists outside the natural world (and, hence, cannot entirely be identified with what we normally construe as the mind). (On the notion of the transcendental, see further Kant’s transcendental idealism and transcendental arguments.)

Husserl argued that the denial of transcendental subjectivity ‘decapitates philosophy’ (Husserl 1970: 9). He calls such philosophy ‘objectivism’ and asserts that it confines itself to the ‘universe of mere facts’ and allies itself with the sciences. (Thus Husserl employs ‘positivism’ and ‘naturalism’ as terms with similar import to ‘objectivism’.) But objectivism cannot even understand science itself, according to Husserl; for science, he maintains, presupposes the achievements of transcendental subjectivity. Further, objectivism can make little sense of the human mind, of humanity’s place within nature, and of values. These latter failings contribute to a perceived meaninglessness to life and a ‘fall into hostility toward the spirit and into barbarity’ (Husserl 1970: 9). Consequently, and because serious investigation of science, mind, our place in nature, and of values belongs to Europe’s very raison d’être, objectivism helps to cause nothing less than a ‘crisis of European humanity’ (Husserl 1970: 299). There is even some suggestion (in the same text) that objectivism prevents us from experiencing people as people: as more than mere things.

The foregoing shows that phenomenology has a normative aspect. Husserl did make a start upon a systematic moral philosophy. But phenomenology is intrinsically ethical (D. Smith 2003: 4–6), in that the phenomenologist eschews prejudice and seeks to divine matters for him- or herself.

ii. Existential Phenomenology, Hermeneutics, Existentialism

Husserl hoped to found a unified and collaborative movement. His hope was partially fulfilled. Heidegger, Sartre and Merleau-Ponty count as heirs to Husserl because (or mainly because) they believed in the philosophical primacy of description of experience. Moreover, many of the themes of post-Husserlian phenomenology are present already, one way or other, in Husserl. But there are considerable, and indeed meta­philosophical, differences between Husserl and his successors. The meta­philosophical differences can be unfolded from this: Heidegger, Sartre and Merleau-Ponty adhere to an ‘existential’ phenomenology. ‘Existential phenomenology’ has two senses. Each construal matters meta­philosophically.

In one sense, ‘existential phenomenology’ denotes phenomenology that departs from Husserl’s self-proclaimed ‘pure’ or ‘transcendental’ phenomenology. At issue here is this view of Husserl’s: it is logically possible that a consciousness could survive the annihilation of everything else (Husserl 1999b: section 13). Existential phenomenologists deny the view. For they accept a kind of externalism whereby experience, or the self, is what it is – and not just causally – by dint of the world that is experienced. (On externalism, see Philosophy of Language, section 4a and Mental Causation, section 3.b.ii.) Various slogans and terms within the work existential phenomenologists express these views. Heidegger’s Being and Time presents the human mode of being as ‘being-in-the-world’ and speaks not of ‘the subject’ or ‘consciousness’ but of ‘Da-sein’ (‘existence’ or, more literally, ‘being-there’). Merleau-Ponty asserts that we are ‘through and through compounded of relationships with the world’, ‘destined to the world’ (2002: xi–xv). In Being and Nothingness, Sartre uses such formulations as ‘consciousness (of) a table’ (sic) in order to signal his rejection of ‘the “reificatory” idea of consciousness as some thing or container distinct from the world in the midst of which we are conscious’ (as Cooper puts – Cooper 1999: 201).

Existential phenomenology, so construed, has meta­philosophical import because it has methodological implications. Being and Nothingness holds that the inseparability of consciousness from the objects of consciousness ruins Husserl’s method of epoche (Sartre 1989: part one, chapter one; Cerbone 2006: 1989). Merleau-Ponty may not go as far. His Phenomenology of Perception has it that, because we are ‘destined to the world’, ‘The most important lesson of the reduction is the impossibility of a complete reduction’ (2002: xv). But the interpretation of this remark is debated (see J Smith 2005). At any rate, Merleau-Ponty found a greater philosophical use for the empirical sciences than did Husserl. Heidegger was more inclined to keep the sciences in their place. But he too – partly because of his existential (externalist) conception of phenomenology – differed from Husserl on the epoche. Again, however, Heidegger’s precise position is hard to discern. (Caputo 1977 describes the interpretative problem and tries to solve it.) Still, Heidegger’s principal innovation in philosophical method has little to do with the epoche. This article considers that innovation before turning to the other sense of existential phenomenology.

Heidegger’s revisions of phenomenological method place him within the hermeneutic tradition. Hermeneutics is the art or practice of interpretation. The hermeneutic tradition (sometimes just called ‘hermeneutics’) is a tradition that gives great philosophical weight to an interpretative mode of understanding. Members of this tradition include Friedrich Schleiermacher (1768–1834), Wilhelm Dilthey (1833–1911) and, after Heidegger, Hans-Georg Gadamer (1900–2002) and Paul Ricœur. Heidegger is hermeneutical in that he holds the following. All understanding is interpretative in that it always has preconceptions. One has genuine understanding insofar as one has worked through the relevant preconceptions. One starts ‘with a preliminary, general view of something; this general view can guide us to insights, which then lead – should lead – to a revised general view, and so on’ (Polt 1999: 98). This ‘hermeneutic circle’ has a special import for phenomenology. For (according to Heidegger) our initial understanding of our relations to the world involves some particularly misleading and stubborn preconceptions, some of which derive from philosophical tradition. Heidegger concludes that what is necessary is ‘a destruction—a critical process in which the traditional concepts, which at first must necessarily be employed, are deconstructed down to the [experiential, phenomenological] sources from which they were drawn’ (Heidegger 1988: 22f.). But Heidegger’s position may be insufficiently, or inconsistently, hermeneutical. The thought is that Heidegger’s own views entail a thesis that, subsequently, Gadamer propounded explicitly. Namely: ‘The very idea of a definitive interpretation [of anything] seems to be intrinsically contradictory’ (Gadamer 1981: 105). This thesis, which Gadamer reaches by conceiving understanding as inherently historical and linguistic, bodes badly for Heidegger’s aspiration to provide definitive ontological answers (an aspiration that he possessed at least as much as Husserl did). Yet arguably (compare Mulhall 1996: 192–5) that very result gels with another of Heidegger’s goals, namely, to help his readers to achieve authenticity (on which more momentarily).

The second meaning or construal of ‘existential phenomenology’ is existentialism. Gabriel Marcel invented that latter term for ideas held by Sartre and by Simone de Beauvoir. Subsequently, Heidegger, Merleau-Ponty, Camus, Karl Jaspers, Kafka, and others, got placed under the label. A term used so broadly is hard to define precisely. But the following five theses each have a good claim to be called ‘existentialist’. Indeed: each of the major existential phenomenologists held some version of at least most of the theses (although, while Sartre came to accept the label ‘existentialist’, Heidegger did not).

  1. One’s life determines ever anew the person that one is.
  2. One is free to determine one’s life and, hence, one’s identity.
  3. There is no objective moral order that can determine one’s values. One encounters values within the world (indeed, one encounters them bound up with facts); but nothing rationally compels decision between values.
  4. 1–3 perturb. Hence a tendency towards the inauthenticity (Heidegger’s term) or bad faith (Sartre’s term) which consists in the denial or refusal of those points – often by letting society determine one’s values and/or identity.
  5. The relation to one’s death – as well as to certain types of anxiety and absurdity or groundlessness – is important for disclosing possibilities of authentic existence.

These theses indicate that for the existentialist philosophy must be practical. It is not, though, that existentialism puts ethics at the heart of philosophy. That is because a further central existentialist idea is that no-one, even in principle, can legislate values for another. True, Sartre declared freedom to be ‘the foundation of all values’ (Sartre 2007: 61); and he wrote Notebooks for an Ethics. According to the ethic in question, to will one’s own freedom is to will the freedom of others. But in no further way does that ethic make much claim to objectivity. Instead, much of it turns upon the ‘good faith’ that consists in not denying the fact of one’s freedom.

What of politics? Little in Husserl fits a conventional understanding of political philosophy. Sartre came to hold that his existential ethics made sense only for a society that had been emancipated by Marxism (Sartre 1963: xxv-xxvi). Merleau-Ponty developed a phenomenologically informed political philosophy – and disagreed with Sartre on concrete political questions and on the manner in which the philosopher should be ‘engaged’ (Diprose and Reynolds: ch. 8; Carmen and Hansen 2005: ch. 12). Sartre and Merleau-Ponty give one to think, also, about the idea of artistic presentations of philosophy (Diprose and Reynolds: ch.s 9 and 18). What of Heidegger? He was, of course, a Nazi, although for how long – how long after he led the ‘Nazification’ of Freiburg University – is debated, as is the relation between his Nazism and his philosophy (Wolin 1993; Young 1997; see also section 4.c below). Now the ‘Heidegger case’ raises, or makes more urgent, some general meta­philosophical issues. Should philosophers get involved in politics? And was Gilbert Ryle right to say – as allegedly, apropos Heidegger, he did say (Cohen 2002: 337 n. 21), – that ‘a shit from the heels up can’t do good philosophy’?

The foregoing material indicates a sense in which phenomenology is its own best critic. Indeed, some reactions against phenomenology and existentialism as such – against the whole or broad conception of philosophy embodied they represent – owe to apostates or to heterodox philosophers within those camps. We saw that, in effect, Sartre came to think that existentialism was insufficient for politics. In fact, he came to hold this: ‘Every philosophy is practical, even the one which at first sight appears to be the most contemplative [. . . Every philosophy is] a social and political weapon’ (Sartre 1960: 5). Levinas accused phenomenologists prior to himself of ignoring an absolutely fundamental ethical dimension to experience (see Davis 1996). Derrida resembles Sartre and Levinas, in that, like them, he developed his own metaphilosophy (treated below) largely via internal criticism of phenomenology. Another objection to phenomenology is that it collapses philosophy into psychology or anthropology. (Husserl himself criticized Heidegger in that way.) Rather differently, some philosophers hold that, despite its attitude to naturalism, phenomenology needs to be naturalized (Petitot et al 1999). As to existentialism, it has been criticized for ruining ethics and for propounding an outlook that is not only an intellectual mistake but also – and Heidegger is taken as the prime exhibit – politically dangerous (see Adorno 1986 and ch. 8 of Wolin).

b. Critical Theory

‘Critical Theory’ names the so-called Frankfurt School – the tradition associated with the Institute of Social Research (Institutfürsozialforschung) which was founded in Frankfurt in 1924. (See Literary Theory section 1 for a wider or less historical notion of Critical Theory.) According to Critical Theory, the point of philosophy is that it can contribute to a critical and emancipatory social theory. The specification of that idea depends upon which Critical Theory is at issue; Critical Theory is an extended and somewhat diverse tradition. Its first generation included Theodor Adorno, Max Horkheimer and Herbert Marcuse. Most of the members of this generation had Jewish backgrounds. For that reason, and because the Institute was Marxist, the first generation fled the Nazis. The Institute re-opened in Frankfurt in 1950. Within the second generation, the most prominent figures are Jürgen Habermas and Albrecht Wellmer. Within the third, Axel Honneth is the best known. There is a fourth generation too. Moreover, there were stages or phases within the first generation. Following Dubiel (1985), we may distinguish, within that generation: (i) an intitial stage (1924 to around 1930) in which the school was more traditionally Marxist than it was subsequently; (ii) a ‘materialist’ stage (1930–1937); (iii) a stage (1937–1940) that began with the adoption of the label ‘Critical Theory’; and (iv) the ‘critique of instrumental reason’ (1940–1945). The treatment of first generation Critical Theory that follows confines itself to iii and iv.

i. Critical Theory and the Critique of Instrumental Reason

It was Horkheimer who introduced the term ‘the critical theory of society’ (‘Critical Theory’ for short) in 1937. He was director of the Institute at the time. He introduced the phrase partly from prudence. By 1937 the Frankfurt School was in the United States, where it was unwise to use the word ‘Marxist’ or even ‘materialist’. But prudence was not the only motive for the new name. Horkheimer meant to clarify and shape the enterprise he was leading. That enerprise, he proposed (see Horkheimer 1937), was the construction of a social theory that was, for one thing, broad. It treats society as a whole or in all its aspects. That breadth, together with the idea that society is more independent of the economy than traditional Marxism recognizes, means that Critical Theory ought to be interdisciplinary. (The expertise of the first-generation encompassed economics, sociology, law, politics, psychology, aesthetics and philosophy.) Next, Critical Theory is emancipatory. It aims at a society that is rational and free and which meets the needs of all. It is to that end that Critical Theory is critical. It means to reveal how contemporary capitalist society, in its economy and its culture and in their interplay, deceives and dominates.

Critical Theory so defined involves philosophy in several ways. (1) From its inception, it adapted philosophical ideas, especially from German Idealism, in order to analyze society. Nonetheless, and following Lukács, (2) Critical Theory thought that some parts of some philosophies could be understood as unknowing reflections of social conditions. (3) Philosophy has a role to play, not as the normative underpinning of the theory, but in justification for the lack of such underpinning. To begin to explain that third point: Horkheimer and company little specified the rational society they sought and little defended the norms by which they indicted contemporary society. With Marx, they held that one should not legislate for what should be the free creation of the future. With Hegel, they held that, anyway, knowledge is conditioned by its time and place. They held also, and again in Hegelian fashion, that there are norms that exist (largely unactualized) within capitalism – norms of justice and freedom and so forth – which suffice to indict capitalism. (4) Critical Theory conceives itself as philosophy’s inheritor. Philosophy, especially post-Kantian German Idealism, had tried to overcome various types of alienation. But only the achievement of a truly free society could actually do that, according to Critical Theory. Note lastly here that, at least after 1936, Critical Theory denied both that ostensibly Marxist regimes were such and that emancipation was anywhere nearly at hand. Consequently, this stage of Critical Theory tended to aim less at revolution and more at propagating awareness of the faults of capitalism and (to a lesser extent) of ‘actually existing socialism’.

There is a sense in which philosophy looms larger (or even larger) in the next phase of the first generation of Critical Theory. For, this phase of the movement (the ‘critique of instrumental reason’ phase) propounded that which we might call (with a nod to Lyotard) a (very!) grand narrative. Adorno and Horkheimer are the principle figures of this phase, and their co-authored Dialectic of Enlightenment its main text. That text connects enlightenment to that which Max Weber had called ‘the disenchantment of the world’. To disenchant the world is to render it calculable. The Dialectic traces disenchantment from the historical Enlightenment back to the proto-rationality of myth and forward to modern industrial capitalism (to its economy, psychology, society, politics, and even to its philosophies). Weber thought that disenchantment had yielded a world wherein individuals were trapped within an ‘iron cage’ (his term) of economy and bureaucracy. Here is the parallel idea in the Dialectic. Enlightenment has reverted to myth, in that the calculated world of contemporary capitalism is ruled, as the mythic world was ruled, by impersonal and brutish forces. Adorno and Horkheimer elaborate via the idea of instrumental reason (although, actually, the preferred term in Dialectic of Enlightenment – and in Horkheimer’s Eclipse of Reason, something of a popularization of the Dialectic – is ‘subjective reason’). Disenchantment produces a merely instrumental reason in that it pushes choice among ends outside of the purview of rationality. That said, the result – Horkheimer and Adorno argue – is a kind of instrumentalization of ends. Ends get replaced, as a kind of default, by things previously regarded merely instrumentally. Thus, at least or especially by the time of contemporary capitalism, life comes to be governed by such means-become-ends as profit, technical expertise, systematization, distraction, and self-preservation.

Do these ideas really amount to Critical Theory? Perhaps they are too abstract to count as interdisciplinary. Worse: they might seem to exclude any orientation towards emancipation. True, commentators show that Adorno offered more practical guidance than was previously thought; also, first-generation Critical Theory, including the critique of instrumental reason, did inspire the 1960s student movement. However: while Marcuse responded to that movement with some enthusiasm, Adorno and Horkheimer did not. Perhaps they could not. For though they fix their hopes upon reason (upon ‘enlightenment thinking’), they indict that very same thing. They write (2002: xvi):

We have no doubt—and herein lies our petitio principii—that freedom in society is inseparable from enlightenment thinking. We believe we have perceived with equal clarity, however, that the very concept of that thinking, no less than the concrete historical forms, the institutions of society with which it is intertwined, already contains the germ of the regression.

ii. Habermas

Habermas is a principal source of the criticisms of Adorno and Horkheimer just presented. (He expresses the last of those criticisms by speaking of a ‘performative contradiction’.) Nonetheless, or exactly because he thinks that his predecessors have failed to make good upon the conception, Habermas pursues Critical Theory as Horkheimer defined it, which is to say, as broad, interdisciplinary, critical, and emancipatory social theory.

Habermas’ Critical Theory comprises, at least centrally, his ‘critique of functionalist reason’, which is a reworking of his predecessors’ critique of instrumental reason. The central thesis of the critique of functionalist reason is that the system has colonized the lifeworld. In order to understand the thesis, one needs to understand not only the notions of system, lifeworld, and colonization but also the notion of communicative action and – this being the most philosophical notion of the ensemble – the notion of communicative rationality.

Communicative action is action that issues from communicative rationality. Communicative rationality consists, roughly, in ‘free and open discussion [of some issue] by all relevant persons, with a final decision being dependent upon the strength of better argument, and never upon any form of coercion’ (Edgar 2006: 23). The lifeworld comprises those areas of life that exhibit communicative action (or, we shall see, which could and perhaps should exhibit it). The areas at issue include the family, education, and the public sphere. A system is a social domain wherein action is determined by more or less autonomous or instrumental procedures rather than by communicative rationality. Habermas counts markets and bureaucracies as among the most significant systems. So the thesis that the lifeworld has been colonized by the system is the following claim. The extension of bureaucracy and markets into areas such as the family, education, and the public sphere prevent those spheres from being governed by free and open discussion.

Habermas uses his colonization thesis to explain alienation, social instability, and the impoverishment of democracy. He maintains, further, that even systems cannot function if colonization proceeds beyond a certain point. The thinking runs thus. Part of the way in which systems undermine communicative action is by depleting resources (social, cultural and psychological) necessary for such action. But systems themselves depend upon those resources. (Note that, sometimes, Habermas uses the term ‘lifeworld’ to refer to those resources themselves rather than to a domain that does or could exhibit communicative action.) Still: Habermas makes it relatively clear that the colonization thesis is meant not only as descriptive but also as normative. For consider the following. (1) A ‘critique’ – as in ‘critique of functional reason’ – is, at least in its modern usage, an indictment. (2) Habermas presents the creation of a ‘communicative’ lifeworld as essential to the completion – a completion that he deems desirable – of what he calls ‘the unfinished project of modernity’. (3) Habermas tells us (in his Theory of Communicative Action, which is the central text for the colonization thesis) that he means to provide the normative basis for a critical theory of society.

How far does Habermas warrant the normativity, which is to say, show that colonization is bad? It is hard to be in favour of self-undermining societies. But (some degree of?) alienation might be thought a price worth paying for certain achievements; and not everyone advocates democracy (or at least the same degree or type of it). But Habermas does have the following argument for the badness of colonization. There is ‘a normative content’ within language itself, in that ‘[r]eaching understanding is the inherent telos of human speech’; and/but a colonized lifeworld, which by definition is not a domain of communicative action, thwarts that telos. (Habermas 1992a: 109 and Habermas 1984: 287 respectively.)

The idea that language has a communicative telos is the crux of Habermas’ thought. For it is central both to his philosophy of language (or to his so-called universal pragmatics) and to his ethics. To put the second of those points more accurately: the idea of a communicative telos is central to his respective conceptions of both ethics and morality. Habermas understands morality to be a matter of norms that are mainly norms of justice and which are in all cases universally-binding. Ethics, by contrast, is a matter of values, where those values: express what is good for some individual or some group; have no authority beyond the individual or group concerned; and are trumped by morality when they conflict with it. Habermas has a principle, derived from the linguistic, communicative telos mentioned above, which he applies to both normal norms and ethical values. To wit: a norm or value is acceptable only if all those affected by it could accept it in reasonable – rational and uncoerced – discourse. This principle makes morality and ethics matters not for the philosopher but ‘for the discourse between citizens’ (Habermas 1992a: 158). (For more on Habermas’ moral philosophy – his ‘discourse ethics’, as it is known – and on his political philosophy, and also on the ways in which the various aspects of his thought fit together, see Finlayson 2005. Note, too, that in the twenty-first century Habermas has turned his attention to (1) that which religion can contribute to the public discourse of secular states and (2) bioethics.)

Habermas’ denial that philosophers have special normative privileges is part of his general (meta)­philosophical orientation. He calls that orientation ‘postmetaphysical thinking’. In rejecting metaphysics, Habermas means to reject not only a normative privilege for philosophy but also the idea that philosophy can ‘make claims about the world as a whole’ (Dews 1995: 209). Habermas connects postmetaphysical thinking to something else too. He connects it to his rejection of that which he calls ‘the philosophy of consciousness’. Habermas detects the philosophy of consciousness in Descartes, in German Idealism, and in much other philosophy besides. Seemingly a philosophy counts as a philosophy of consciousness, for Habermas, just in case it holds this: the human subject apprehends the world in an essentially individual and non-linguistic way. To take Habermas’ so-called ‘communicative turn’ is to reject that view; it is to hold, instead, that human apprehension is at root both linguistic and intersubjective. Habermas believes that Wittgenstein, Mead, and others prefigured and even somewhat accomplished this ‘paradigm shift’ (Habermas 1992a: 173, 194).

Habermasian postmetaphysical thinking has been charged both with retaining objectionable metaphysical elements and with abandoning too many of philosophy’s aspirations. (The second criticism is most associated with Karl-Otto Apel, who nonetheless has co-operated with Habermas in developing discourse ethics. On the first criticism, see for instance Geuss 1981: 94f.) Habermas has been charged, also, with making Critical Theory uncritical. The idea here is this. In allowing that it is alright for some markets and bureaucracies to be systems, Habermas allows too much. (A related but less meta­philosophical issue, touched on above, is whether Habermas has an adequate normative basis for its social criticisms. This issue is an instance of the so-called normativity problem in Critical Theory, on which see Freyenhagen 2008; Finlayson 2009.)

Here are two further meta­philosophical issues. (1) Is it really tenable or desirable for philosophy to be as intertwined with social science as Critical Theory wishes it to be? (For an affirmative answer, see Geuss 2008.) (2) Intelligibility seems particularly important for any thinker who means ‘to reduce the tension between his own insight and the oppressed humanity in whose service he thinks’ (Horkheimer 1937: 221); but Critical Theory has been criticized as culpably obscure and even as mystificatory (see especially the pieces by Popper and Albert in Adorno et al 1976). Adorno has been the principal target for such criticisms (and Adorno did defend his style; see Joll 2009). Yet Habermas, too, is very hard to interpret. That is partly because this philosopher of communication exhibits an ‘unbelievable compulsion to synthesize’ (Knödler-Bunte in Habermas 1992a: 124), which is to say, to combine seemingly disparate – and arguably incompatible – ideas.

c. The Later Heidegger

‘The later Heidegger’ is the Heidegger of, roughly, the 1940s onwards. (Some differences between ‘the two Heideggers’ will emerge below. But hereafter normally ‘Heidegger’ will mean ‘the later Heidegger’.) Heidegger’s difficult, radical, and influential metaphilosophy holds that: philosophy is metaphysics; metaphysics involves a fundamental mistake; metaphysics is complicit in modernity’s ills; metaphysics is entering into its end; and ‘thinking’ should replace metaphysics/philosophy.

Heidegger’s criterion of metaphysics – to start with that – is the identification of being with beings. To explain: metaphysics seeks something designatable as ‘being’ in that metaphysics seeks a principle or ground of beings; and metaphysics identifies being with beings in that it identifies this principle or ground (i.e. being) with something that it itself a being – or at least a cause or property of some being or beings. Heidegger’s favored examples of such construals of being include: the Idea in Plato; Aristotelian or Cartesian or Lockean ‘substance’; various construals of God; the Leibnizian ‘monad’; Husserlian subjectivity; and the Nietzschean ‘will to power’. Philosophy is co-extensive with metaphysics in that all philosophy since Plato involves such a project of grounding.

Now Heidegger himself holds that beings (das Seiende) have a dependence upon being (das Sein). Yet, being is ‘not God and not a cosmic ground’ (Heidegger 1994: 234). Indeed, being is identical to no being or being(s) or property or cause of any being(s) whatsoever. This distinction is ‘the ontological difference—the differentiation between being and beings’ (Heidegger 1982: 17; this statement is from Heidegger’s earlier work, but this idea, if not quite the term, persists). We may put the contention thus: pace metaphysics/philosophy, being is not ontic. But what, then, is being?

It may be that Heidegger employs ‘das Sein’ in two senses (Young 2002: ch. 1, Philipse 1998: section 13b; compare for instance Caputo 1993: 30). We might (as do Young and Philipse) use ‘being’, uncapitalized, to refer to the first of these sense and ‘Being’ (capitalized) to refer to the other. (Where both senses are in play, as sometimes they seem to be in Heidegger’s writing, this article resorts sometimes to the German das Sein. Note, however, that this distinction between two senses of Heideggerian Sein is interpretatively controversial.) In the first and as it were lowercase sense, being is what Heidegger calls sometimes a ‘way of revealing’.  That is, it is something – something ostensibly non-ontic – by dint of which beings are ‘revealed’ or ‘unconcealed’ or ‘come to presence’, and indeed do so in the particular way or ways in which they do. In the second and ‘uppercase’ sense,  Being is that which is responsible for unconcealment, i.e. is responsible for das Sein in the first, lowercase sense. A little more specifically, Being (in this second, uppercase sense) ‘sends’ or ‘destines’ being; accordingly, it is that from which beings are revealed, the ‘reservoir of the non-yet-uncovered, the un-uncovered’ (Heidegger 1971: 60). With this second notion of das Sein, Heidegger means to stress the following point (a point that perhaps reverses a tendency in the early Heidegger): humanity does not determine, at least not wholly, how beings are ‘unconcealed’.

One wants specification of all this. We shall see that Heidegger provides some. Nevertheless, it may be a mistake to seek an exact specification of the ideas at issue. For Heidegger may not really mean das Sein (in either sense) to explain anything. He may mean instead to stress the mysteriousness of the fact that beings are accessible to us in the form that they are and, indeed, at all.

One way in which Heidegger fills out the foregoing ideas is by posit ing ‘epochs’ of being, which is to say, a historical series of ontological regimes (and here lies another difference between the earlier and the later Heidegger). The series runs thus: (1) the ancient Greek understanding of being, with which Heidegger associates the word ‘physis’; (2) the Medieval Christian understanding of being, whereby beings (except God and artifacts) are divinely created things; (3) the modern understanding of being as resource (on which more below). That said, sometimes Heidegger gives a longer list of epochs, in which list the epochs correlate with metaphysical systems. Thus the idea of a ‘history of being [Seinsgeschichte] as metaphysics’ (Heidegger 2003: 65). It is important that this history, and indeed the simpler tripartite scheme, does not mean to be a history merely of conceptions of being. It means to be also a history of being itself, i.e. of ontological regimes. Heidegger holds, then, that beings are ‘unconcealed’ in different ways in different epochs (although he holds also that each metaphysic ‘absolutizes’ its corresponding ontological regime, i.e. that each metaphysic overlooks the fact that beings are unconcealed differently in different epochs; see Young 2002: 29, 54, 68).

Heidegger allows also for some ontological heterogeneity within epochs, too. Here one encounters Heidegger’s notion of ‘the thing’ (das Ding). Trees, hills, animals, jugs, bridges, and pictures can be Things in the emphatic sense at issue, but such Things are ‘modest in number, compared with the countless objects’. A Thing has ‘a worlding being’. It opens a world by ‘gathering’ the fourfold (das Geviert). The fourfold is a unity of ‘earth and sky, divinities and mortals’. (All Heidegger 1971: 179ff.). Some of this conception is actually fairly straightforward. Heidegger tries to show how a bridge (to take one case) can be so interwoven with human life and thereby with other entities that, via the ‘world’ that comprises those interrelations (a world not identical with any particular being), the following is the case. The Thing (the bridge), persons, and numerous other phenomena all stand in relations of mutual determination, i.e. make each other what they are.

But in modernity ontological variety is diminished, according to Heidegger. In modernity Things become mere objects. Indeed subsequently objects themselves, together with human beings, become mere resources. A resource (or ‘standing-reserve’; the German is Bestand) is something that, unlike an object, is determined wholly by a network of purposes into which we place it. Heidegger’s examples include a hydroelectric powerplant on the Rhine and an airplane, together with the electricity and fuel systems to which those artifacts are connected. Heidegger associates resources with modern science and with ‘the metaphysics of subjectivity’ within which (he argues) modern science moves. That metaphysics, which tends towards seeing man as the measure of all things, is in fact metaphysics as such, according to Heidegger. For anthropocentrism is incipient in the very beginnings of philosophy, blossoms in various later philosophers including Descartes and Kant, and reaches its apogee in Nietzsche, the extremity of whose anthropocentrism is the end of metaphysics. It is the end of metaphysics (or, pleonastically: of the metaphysics of subjectivity) in that here, in Nietzsche’s extreme anthropocentrism, metaphysics reaches  its completion or full unfolding. And that end reflects the reign of resources. ‘[T]he world of completed metaphysics can be stringently called “technology”’ (Heidegger 2003: 82). However, in Heidegger’s final analysis the ubiquity of resources owes not to science or metaphysics but to a ‘mode of revealing’; it owes to an epochal ontological regime that Heidegger calls ‘Enframing’, even if he seems to think, also, that a change in human beings could mitigate Enframing and prepare for something different and better. (More on this mitigation shortly.)

What though is wrong with the real being revealed as resource? Enframing is ‘monstrous’ (Heidegger 1994: 321). It is monstrous – Heidegger contends – because it is nihilism. Nihilism is a ‘forgetfulness’ of das Sein (Seinsvergessenheit). Some such forgetfulness is nigh inevitable. We are interested in beings as they present themselves to us. So we overlook the conditions of that presentation, namely, being and Being. But Enframing represents a more thoroughgoing form of forgetfulness. The hegemony of resources makes it very hard (harder than usual – recall above) to conceive that beings could be otherwise, which is to say, to conceive that there is something called ‘Being’ that could yield different regimes of being. In fact, Enframing actively denies being/Being. That is because Enframing, or the metaphysics/science that corresponds to it, proceeds as if humanity were the measure of all things and hence as if being, or that which grants being independently of us (Being), were nothing. Such nihilism sounds bearable. But Heidegger lays much at its door: an impoverishment of culture; a deep kind of homelessness; the devaluation of the highest values (see Young 2002: ch. 2 and passim). He goes so far as to trace ‘the events of world history in this [the twentieth] century’ to Seinsvergessenheit (Heidegger in Wolin 1993: 69).

Heidegger’s response to nihilism is ‘thinking’ (Denken). The thinking at issue is a kind of thoughtful questioning. Its object – that which it thinks about – can be the pre-Socratic ideas from which philosophy developed, or philosophy’s history, or Things, or art. Whatever its object, thinking always involves recognition that it is das Sein, albeit in some interplay with humanity, which determines how beings are. Indeed, Heideggerian thinking involves wonder and gratitude in the face of das Sein. Heidegger uses Meister Eckhart’s notion of ‘releasement’ to elaborate upon such thinking. The idea (prefigured, in fact, in Heidegger’s earlier work) is of a non-impositional comportment towards beings which lets beings be what they are. That comportment ‘grant[s] us the possibility of dwelling in the world in a totally different way’. It promises ‘a new ground and a new foundation upon which we can stand and endure in the world of technology without being imperiled by it’ (Heidegger 1966: 55). Heidegger calls the dwelling at issue ‘poetic’ and one way in which he specifies it is via various poets. Moreover, some of Heidegger’s own writing is semi-poetic. A small amount of it actually consists of poems. So it is not entirely surprising to find Heidegger claiming that,  ‘All philosophical thinking’ is ‘in itself poetic’ (Heidegger 1991, vol. 2: 73; Heidegger made this claim at a time when he still considered himself a philosopher as against a non-metaphysical, and hence non-philosophical, ‘thinker’). The claim is connected to the centrality that Heidegger gives to language, a centrality that is summed up (a little gnomically) in the statement that language is ‘the house of das Sein’ (Heidegger 1994: 217).

Heideggerian ‘thinking’ has been attacked as (some mixture of) irrationalist, quietist, reactionary, and authoritarian (see for example Adorno 1973 and Habermas 1987b: ch. 6). A related objection is that, though Heidegger claimed to leave theology alone, what he produced was an incoherent reworking of religion (Haar 1993; Philipse 1998). Of the more or less secular or (in Caputo’s term) ‘demythologized’ construals of Heidegger, many are sympathetic and, among those, many fasten upon such topics as technology, nihilism, and dwelling (Borgmann 1984, Young 2002: ch.s 7–9; Feenberg 1999: ch. 8). Other secular admirers – including, notably, Rorty and Derrida – concentrate upon Heidegger’s attempt to encapsulate and interrogate the entire philosophical tradition.

d. Derrida’s Post-Structuralism

Structuralism was an international trend in linguistics, literary theory, anthropology, political theory, and other disciplines. It sought to explain phenomena (sounds, tropes, behaviors, norms, beliefs . . .) less via the phenomena themselves, or via their genesis, and more via structures that the phenomena exist within or instantiate. The post-structuralists applied this structural priority to philosophy. They are post-structuralists less because they came after structuralism and more because, in appropriating structuralism, they distanced themselves from the determinism and scientism it often involved (Dews 1987: 1–4). The post-structuralists included Deleuze, Foucault, Lyotard and Lacan (and sometimes post-structuralism is associated with ‘post-modernism’; see Malpas 2003: 7–11). Each of these thinkers (perhaps excepting Lacan) is highly meta­philosophical. But attention is restricted to the best known and most controversial of the post-structuralists, namely, Jacques Derrida.

Derrida practiced ‘deconstruction’ (Déconstruire, la Déconstruction; Derrida adapts the notion of deconstruction from Heidegger’s idea of ‘destruction’, on which latter see section 4.a.ii above). Deconstruction is a ‘textual “operation”’ (Derrida 1987: 3). The notion of text here is a broad one. It extends from written texts to conceptions, discourses, and even practices. Nevertheless, Derrida’s early work concentrates upon actual texts and, more often than not, philosophical ones. The reason Derrida puts ‘operation’ (‘textual “operation”’) within scare-quotes is that he holds that deconstruction is no method. That in turn is for two reasons (each of which should become clearer below). First, the nature of deconstruction varies with that which is deconstructed. Second, there is a sense in which texts deconstruct themselves. Nonetheless: deconstruction, as a practice, reveals such alleged self-deconstruction; and that practice does have a degree of regularity. The practice of deconstruction has several stages. (In presenting those stages, ‘text’ is taken in the narrow sense. Moreover, it is presumed that in each case a single text is, at least centrally, at issue.)

Deconstruction begins with a commentary (Derrida 1976: 158) – with a ‘faithful’ and ‘interior’ reading of a text (Derrida 1987: 6). Within or via such commentary, the focus is upon metaphysical oppositions. Derrida understands metaphysics as ‘the metaphysics of presence’ (another notion adapted from Heidegger); and an opposition belongs to metaphysics (pleonastically, the metaphysics of presence) just in case: (i) it contains a privileged term and a subordinated term; and (ii) the privileged term has to do with presence. ‘Presence’ is presence to consciousness and/or the temporal present. The oppositions at issue include not only presence–absence (construed in either of the two ways just indicated) but also, and among others (and with the term that is privileged within each opposition given first) these: ‘normal/abnormal, standard/parasitic, fulfilled/void, serious/nonserious, literal/nonliteral’ (Derrida 1988: 93).

The next step in deconstruction is to show that the text undermines its own metaphysical oppositions. That is: the privileged terms reveal themselves to be less privileged over the subordinate terms – less privileged vis-à-vis presence, less ‘simple, intact, normal, pure, standard, self-identical’ (Derrida 1988: 93) – than they give themselves out to be. Here is a common way in which Derrida tries to establish the point. He tries to show that a privileged term essentially depends upon, or shares some crucial feature(s) with, its supposed subordinate. One of Derrida’s deconstructions of Husserl can serve as an example. Husserl distinguishes mental life, which he holds to be inherently intentional (inherently characterized by aboutness) from language, which is intentional only via contingent association with such states. Thereby Husserl privileges the mental over the linguistic. However: Husserl’s view of the temporality of experience entails that the presence he makes criterial for intrinsic intentionality – a certain presence of meanings to the mind – is always partially absent. Or so Derrida argues (Derrida, section 4). A second strategy of Derrida’s ‘is to apply a distinction onto itself reflexively and thus show that it itself is imbued with the disfavored term’ (Landau, 1992/1993: 1899). ‘For example, Derrida shows that when Aristotle and other philosophers discuss the nature of metaphors (and thereby the distinction between metaphors and non-metaphors), they use metaphors in the discussions themselves’ (idem) – and so fail in their attempts to relegate or denigrate metaphor. A further strategy involves the notion of undecidability (see Derrida, section 5).

A third stage or aspect of deconstruction is, one can say, less negative or more productive (and Derrida himself calls this the productive moment of deconstruction). Consider Derrida’s deconstruction(s) of the opposition between speech and writing. Derrida argues, initially, as follows. Speech – and even thought, understood as a kind of inner speech – shares with writing features that have often been used to present writing as only a poor descendent of speech. Those features include being variously interpretable and being derivative of something else. But there is more. Derrida posits something, which he calls archi-écriture, ‘arche-writing’, which is ‘fundamental to signifying processes in general, a “writing” that is the condition of all forms of expression, whether scriptural, vocal, or otherwise’ (Johnson 1993: 66). Indeed: as well as being a condition of possibility, arche-writing is, in Derrida’s frequent and arresting phrase, a condition of its impossibility. Arche-writing establishes or reveals a limit to any kind of expression (a limit, namely, to the semantic transparency, and the self-sufficiency, of expressions). Other deconstructions proceed similarly. A hierarchical opposition is undermined; a new term is produced through a kind of generalization of the previously subordinate term; and the new term – such as ‘supplement’, ‘trace’ and the neologism différance (Derrida, section 3.c–e) – represents a condition of possibility and impossibility for the opposition in question.

What is the status of these conditions? Sometimes Derrida calls them ‘quasi-transcendental’. That encourages this idea: here we have an account not just of concepts but of things or phenomena. Yet Derrida himself does not quite say that. He denies that we can make any simple distinction between text and world, between conceptual system and phenomena. Such may be part of the thrust of the (in)famous pronouncement, ‘There is nothing outside of the text’ (il n’y a pas de hors-texte; Derrida 1976: 158). Nor does Derrida think that, by providing such notions as arche-writing, he himself wholly evades the metaphysics of presence. ‘We have no language—no syntax and no lexicon—that is foreign to this history; we can pronounce not a single deconstructive proposition which has not already had to slip into the form, the logic, and the implicit postulations of precisely what it seeks to contest’ (Derrida 1990: 280f.). Still: ‘if no one can escape this necessity, and if no one is therefore responsible for giving in to it […] this does not mean that all the ways of giving in to it are of equal pertinence. The quality and fecundity of a discourse are perhaps measured by the critical rigor with which this relation to the history of metaphysics and to inherited concepts is thought’ (Derrida 1990: 282).

Derrida retained the foregoing views, which he had developed by the end of the 1960s. But there were developments of metaphilosophical significance. (1) In the ’70s, his style became more playful, and his approach to others’ text became more literary (and those changes more or less persisted; Derrida would want to know, however, just what we understand by ‘playful’ and ‘literary’). (2) Again from the ’70s onwards, Derrida joined with others in order to: sustain and promote the teaching of philosophy in schools; to consider philosophy’s role; and to promote philosophy that transgressed disciplinary boundaries. (3) In the ’80s, Derrida tried to show that deconstruction had an ethical and political import. He turned to themes that included cosmopolitanism, decision, forgiveness, law, mourning, racism, responsibility, religion, and terrorism – and claimed, remarkably, that ‘deconstruction is justice’ (Derrida 1999: 15). To give just a hint of this last idea: ‘Justice is what the deconstruction of the law’ – an analysis of the law’s conditions of possibility and impossibility, of its presuppositions and limits – ‘means to bring about’, where ‘law’ means ‘legality, legitimacy, or legitimation (for example)’ (Caputo 1997: 131f.). (On some of these topics, see Derrida, section 7.) (4) By the ’90s, if not earlier, Derrida held that in philosophy the nature of philosophy is always and everywhere at issue (see for instance Derrida 1995: 411).

Despite his views about the difficulty of escaping metaphysics, and despite his evident belief in the critical and exploratory value of philosophy, Derrida has been attacked for undermining philosophy. Habermas provides an instance of the criticism. Habermas argued that Derrida erases the distinction between philosophy and literature. Habermas recognizes that Derrida means to be ‘simultaneously maintaining and relativizing’ the distinction between literature and philosophy (Habermas 1987b: 192). But the result, Habermas thinks, is an effacement of the differences between literature and philosophy. Habermas adds, or infers, that ‘Derrida does not belong to those philosophers who like to argue’ (Habermas 1987b: 193). Derrida objected to being called unargumentative. He objected, also, to Habermas’ procedure of using other deconstructionists – those that Habermas deemed more argumentative – as the source for Derrida’s views.

Subsequently, Habermas and Derrida underwent something of a rapprochement. Little reconciliation was achieved in the so-called ‘Derrida affair’, wherein a collection of philosophers, angry that Derrida was to receive an honorary degree from Cambridge, alleged that Derrida ‘does not meet accepted standards of clarity or rigor’ (quoted Derrida 1995: 420; a detailed attack upon Derrida’s scholarship is Evans 1991).

There might be a sense in which Derrida is too rigorous. For he holds this: ‘Every concept that lays claim to any rigor whatsoever implies the alternative of “all or nothing”’ (Derrida 1988: 116). One might reject that view. Might it be, indeed, that Derrida insists upon rigid oppositions ‘in order to legitimate the project of calling them into question’ (Gerald Graff in Derrida 1988: 115)? One might object, also, that Derrida’s interrogation of philosophy is more abstract, more intangible, than most metaphysics. Something Levinas said apropos Derrida serves as a response. ‘The history of philosophy is probably nothing but a growing awareness of the difficulty of thinking’ (Levinas 1996: 55; compare Derrida 1995: 187f.). The following anxiety might persist. Despite Derrida’s so-called ethical and political ‘turns’, and despite the work he has inspired within he humanities, deconstruction little illuminates phenomena that are not much like anything reasonably designatable as a text (Dews 1987: 35). A more general version of the anxiety is that, for all the presentations of Derrida as ‘a philosopher of difference’, deconstruction obscures differences (Kearney 1984: 114; Habermas 1992a: 159).

5. References and Further Reading

Note that, in the case of many of the items that follow, the date given for a text is not the date of its first publication.

a. Explicit Metaphilosophy and Works about Philosophical Movements or Traditions

  • Anscombe, G. E. M. (1957) ‘Does Oxford Moral Philosophy Corrupt Youth?’ in her Human life, Action, and Ethics: Essays, pp. 161–168. Exeter, UK: Imprint Academic, 2005. Edited by Mary Geach and Luke Gormally.
  • Beaney, Michael (2007) ‘The Analytic Turn in Early Twentieth-Century Philosophy’, in Beaney, Michael ed. The Analytic Turn. Essays in Early Analytic Philosophy and Phenomenology, New York and London: Routledge, 2007.
    • Good on, especially, the notions of analysis in early Analytic philosophy and on the historical precedents of those notions.
  • Beaney, Michael (2009) ‘Conceptions of Analysis in Analytic Philosophy’: Supplement to entry on ‘Analysis’, The Stanford Encyclopedia of Philosophy (Summer 2009 Edition), Edward N. Zalta (ed.).
  • Beauchamp, Tom L. (2002) ‘Changes of Climate in the Development of Practical Ethics’, Science and Engineering Ethics 8: 131–138.
  • Bernstein, Richard J. (2010) The Pragmatic Turn. Cambridge MA and Cambridge.
    • An account of the influence and importance of pragmatism.
  • Chappell, Timothy (2009) ‘Ethics Beyond Moral Theory’ Philosophical Investigations 32: 3 206–243.
  • Chase, James, and Reynolds, Jack (2010) Analytic Versus Continental: Arguments on the Methods and Value of Philosophy. Stocksfield: Acumen.
  • Clarke, Stanley G. (1987) ‘Anti-Theory in Ethics’, American Philosophical Quarterly 24: 3 237–244.
  • Deleuze, Giles, and Guattari, Félix (1994) What is Philosophy? London and New York: Verso. Trans. Graham Birchill and Hugh Tomlinson.
    • Less of an introduction to metaphilosophy than its title might suggest.
  • Galison, Peter (1990) ‘Aufbau/Bauhaus: Logical Positivism and Architectural Modernism’, Critical Inquiry, 16(4[Summer]): 709–752.
  • Glendinning, Simon (2006) The Idea of Continental Philosophy: A Philosophical Chronicle. Edinburgh: Edinburgh University Press.
  • Glock, Hans-Johann (2008) What Is Analytic Philosophy? Cambridge and New York: Cambridge University Press.
    • Comprehensive. Illuminating. Not introductory.
  • Graham, George and Horgan, Terry (1994) ‘Southern Fundamentalism and the End of Philosophy’, Philosophical Issues 5: 219–247.
  • Lazerowitz, Morris (1970) ‘A Note on “Metaphilosophy”, Metaphilosophy, 1(1): 91–91 (sic).
    • An influential (but very short) definition of metaphilosophy.
  • Levin, Janet (2009) ‘Experimental Philosophy’, Analysis, 69(4) 2009: 761–769.
  • Levy, Neil (2009) ‘Empirically Informed Moral Theory: A Sketch of the Landscape’, Ethical Theory and Moral Practice 12:3–8.
  • McNaughton, David (2009) ‘Why Is So Much Philosophy So Tedious?’, Florida Philosophical Review IX(2): 1-13.
  • Joll, Nicholas (2009) ‘How Should Philosophy Be Clear? Loaded Clarity, Default Clarity, and Adorno’, Telos 146 (Spring): 73–95.
  • Joll, Nicholas (Forthcoming) Review of Jürgen Habermas et al, An Awareness of What Is Missing (Polity, 2010), Philosophy.
    • Tries to clarify and evaluate some of Habermas’ thinking on religion.
  • Papineau, David (2009) ‘The Poverty of Analysis’, Proceedings of the Aristotelian Society Supplementary Volume lxxxiii: 1–30.
  • Preston, Aaron (2007) Analytic Philosophy: The History of an Illusion. London and New York: Continuum.
    • Argues, controversially, that Analytic philosophy has never had any substantial philosophical or meta­philosophical unity.
  • Prinz, Jesse J. (2008) ‘Empirical Philosophy and Experimental Philosophy’ in J. Knobe and S. Nichols (eds.) Experimental Philosophy. Oxford: Oxford University Press, 2008.
  • Urmson, J. D. (1956) Philosophical Analysis: Its Development Between the Two World Wars. London: Oxford University Press.
  • Rescher, Nicholas (2006) Philosophical Dialectics. An Essay on Metaphilosophy. Albany: State University of New York Press.
    • Centres upon the notion of philosophical progress. Contains numerous, occasionally gross typographical errors.
  • Rorty, Richard ed. (1992) The Linguistic Turn: Essays in Philosophical Method, Chicago and London: University of Chicago Press. Second edition.
    • A useful study of 1930s to 1960s Analytic metaphilosophy.
  • Rorty, Richard, Schneewind, Jerome B., and Skinner, Quentin eds. (1984) Philosophy in History: Essays in the Historiography of Philosophy. Cambridge: Cambridge University Press.
  • Sorell, Tom, and Rogers, C. A. J. eds. (2005) Analytic Philosophy and History of Philosophy. Oxford and New York: Oxford.
  • Stewart, Jon (1995) ‘Schopenhauer’s Charge and Modern Academic Philosophy: Some Problems Facing Philosophical Pedagogy’, Metaphilosophy 26(3): 270–278.
  • Taylor, Charles (1984) ‘Philosophy and Its History’, in Rorty, Schneewind, and Skinner 1984.
  • Williams, Bernard (2003) ‘Contemporary Philosophy: A Second Look’ in The Blackwell Companion to Philosophy, ed. Nicholas Bunnin and E. P. Tsui-James, pp. 25–37. Oxford: Blackwell. Second edition.
  • Williamson, Timothy (2007) The Philosophy of Philosophy, Malden MA and Oxford: Blackwell.
    • A dense, rather technical work aiming to remedy what it sees as a meta­philosophical lack in Analytic philosophy. Treats, among other things, these notions: conceptual truth; intuitions; thought experiments.

b. Analytic Philosophy including Wittgenstein, Post-Analytic Philosophy, and Logical Pragmatism

  • Austin, J. L., Philosophical Papers (1979). Third edition. Oxford and New York: Oxford University Press.
  • Burtt, E. A. (1963) ‘Descriptive Metaphysics’, Mind 72(285):18–39.
  • Campbell, Richmond and Hunter, Bruce (2000) ‘Introduction’, in R. Campbell and B. Hunter eds. Moral Epistemology Naturalized, Supple. Vol., Canadian Journal of Philosophy: 1–28.
    • Campbell has a published a similar piece, under the title ‘Moral Epistemology’, in the online resource the Stanford Encyclopedia of Philosophy.
  • Carnap (1931) ‘The Elimination of Metaphysics Through Logical Analysis of Language’ in Ayer, A. J. (1959) ed. Logical Positivism. Glencoe IL: The Free Press.
  • Cavell, Stanley (1979) The Claim of Reason. Wittgenstein, Skepticism, Morality, and Tragedy. Oxford: Oxford University Press.
  • Cohen, G. A. (2002) ‘Deeper into Bullshit’, in Buss, Sarah and Overton, Lee eds. Contours of Agency: Themes from the Philosophy of Harry Frankfurt, Cambridge, MA: MIT Press.
    • Adapts Harry Frankfurt’s construal of bullshit in order to diagnose and indict much ‘bullshit in certain areas of philosophical and semi-philosophical culture’ (p. 335). Reprinted in Hardcastle, Gary L. and Reich, George A. eds. Bullshit and Philosophy, Chicago and La Salle, IL: Open Court, 2006.
  • Copi, Irving M. (1949) ‘Language Analysis and Metaphysical Inquiry’ in Rorty 1992.
  • Freeman, Samuel (2007) Rawls. Oxford and New York: Routledge.
  • Gellner, Ernest (2005) Words and Things. An Examination of, and an Attack on, Linguistic Philosophy. Second edition. Abingdon and New York: Routledge.
  • Glock, Hans-Johann (2003a) Quine and Davidson on Language, Thought and Reality. Cambridge and New York: Cambridge University Press.
  • Glock, Hans-Johann ed. (2003b) Strawson and Kant. Oxford and New York: Oxford University Press.
  • Haack, Susan (1979) ‘Descriptive and Revisionary Metaphysics’, Philosophical Studies 35: 361–371.
  • Hacker, P. M. S. (2003) ‘On Strawson’s Rehabilitation of Metaphysics’ in Glock ed. 2003b.
  • Hacker, P. M. S. (2007) Human Nature: the Categorial Framework. Oxford: Blackwell.
  • Hutchinson, Brian (2001) G. E. Moore’s Ethical Theory: Resistance and Reconciliation. Cambridge: Cambridge University Press.
  • Kripke, Saul A (1980) Naming and Necessity. Oxford: Blackwell. Revised and Enlarged edition.
  • Lance, M. and Little, M., (2006) ‘Particularism and anti-theory’, in D. Copp, ed., The Oxford handbook of ethical theory, Oxford and New York: Oxford University Press.
  • Loux, Michael J (2002) Metaphysics. A Contemporary Introduction, second ed. Routledge: London and New York.
  • Malcolm, Norman (1984) Ludwig Wittgenstein: a memoir / by Norman Malcolm; with a biographical sketch by G. H. von Wright and Wittgenstein’s Letters to Malcolm. Second ed. Oxford and New York, Oxford University Press.
  • McDowell, John (1994) Mind and World. Cambridge MA and London: Harvard University Press.
    • Perhaps the paradigmatic ‘post-Analytic’ text.
  • McDowell, John (2000) ‘Towards Rehabilitating Objectivity’ in Brandom ed. (2000).
  • McMahon, Jennifer A. (2007) Aesthetics and Material Beauty: Aesthetics Naturalized. New York and London: Routledge.
  • Moore, G. E. (1899) ‘The Nature of Judgement’, in G. E. Moore Selected Writings, London: Routledge, 1993, ed. T. Baldwin.
  • Moore, G. E. (1953) Some Main Problems of Philosophy. New York: Humanities Press.
    • From lectures given in 1910 and 1911.
  • Moore, G. E. (1993) Principia Ethica. Cambridge and New York: Cambridge University Press.
    • Second and revised edition, containing some other writings by Moore.
  • Neurath, Otto, Carnap, Rudolf, and Hahn, Hans (1996) ‘The Scientific Conception of the World: the Vienna Circle’, in Sarkar, Sahotra ed. The Emergence of Logical Empiricism: from 1900 to the Vienna Circle. New York: Garland Publishing, 1996. pp. 321–340.
    • An English translation of the manifesto issued by the Vienna Circle in 1929.
  • Orenstein, Alex (2002) W. V. Quine. Chesham, UK: Acumen.
  • Pitkin, Hanna (1993) Wittgenstein and Justice. On the Significance of Ludwig Wittgenstein for Social and Political Thought. Berkeley and London: University of California Press.
  • Putnam, Hilary (1985) ‘After Empiricism’ in Rajchman and West 1985.
  • Quine, W. V. O. (1960) Word and Object. Cambridge MA: MIT Press.
  • Quine, W. V. O. (1977) Ontological Relativity and Other Essays. New York: Columbia University Press. New edition.
  • Quine, W. V. O. (1980) From A Logical Point of View. Harvard: Harvard University Press. New edition.
  • Quine, W. V. O. (1981) Theories and Things. Cambridge, MA: Harvard University Press.
  • Rawls, John (1999a) A Theory of Justice. Revised edition. Cambridge MA: Harvard University Press.
  • Rawls, John (1999b) Collected Papers ed. Samuel Freeman. Cambridge, MA: Harvard University Press.
  • Russell, Bertrand (1992) A Critical Exposition of the Philosophy of Leibniz. London and New York: Routledge.
  • Russell, Bertrand (1995) My Philosophical Development. Abingdon, UK and New York: Routledge.
  • Russell, Bertrand (2009) Our Knowledge of the External World: As a Field for Scientific Method in Philosophy. Abingdon and New York: Routledge.
  • Rynin, David (1956) ‘The Dogma of Logical Pragmatism’, Mind 65(259): 379–391.
  • Schilpp, P. A. ed. (1942) The Philosophy of G. E. Moore Northwestern University Press, Evanston IL.
  • Schilpp, Paul Arthur ed. (1942) The Philosophy of G. E. Moore. Evanston and Chicago: Northwestern University Press.
  • Schroeter, François (2004) ‘Reflective Equilibrium and Antitheory’, Noûs, 38(1): 110–134.
  • Schultz, Bart (1992) ‘Bertrand Russell in Ethics and Politics’, Ethics, 102: 3 (April): 594–634.
  • Sellars, Wilfred (1963) Science, Perception and Reality. Routledge & Kegan Paul Ltd; London, and The Humanities Press: New York.
  • Strawson, Peter (1959) Individuals: An Essay in Descriptive Metaphysics. London: Methuen.
  • Strawson, Peter (1991) Analysis and Metaphysics. An Introduction to Philosophy. Oxford and New York: Oxford University Press.
    • Both an introduction to philosophy and an introduction to Strawson’s own philosophical and meta­philosophical views.
  • Strawson, Peter (2003) ‘A Bit of Intellectual Autobiography’ in Glock ed. 2003b.
  • Weinberg, Jonathan M., Nichols, Shaun and Stitch, Stephen (2001) ‘Normativity and Epistemic Intuitions’, Philosophical Topics, 29(1&2): 429–460.
  • Williams, Bernard (1981) Moral Luck. Cambridge: Cambridge University Press.
  • Wittgenstein, Ludwig (1961) Tractatus Logico-Philosophicus. Trans. D.F. Pears and B.F. McGuinness. Routledge: London.
  • Wittgenstein, Ludwig (1966) Lectures and Conversations on Aesthetics, Psychology and Religious Belief. Oxford: Blackwell.
  • Wittgenstein, Ludwig (1969) The Blue and Brown Books. Preliminary Studies for the “Philosophical Investigations”. Blackwell: Oxford.
  • Wittgenstein, Ludwig (2001) Philosophical Investigations. The German Text, with a Revised English Translation. Malden MA and Oxford: Blackwell. Third edition. Trans. G. E. M. Anscombe.
    • The major work of the ‘later’ Wittgenstein.
  • Wright, Crispin (2002) ‘Human Nature?’ in Nicholas H. Smith ed. Reading McDowell. On Mind and World. London and New York: Routledge.

c. Pragmatism and Neopragmatism

  • Brandom, Robert B. ed. (2000) Rorty and His Critics. Malden MA and Oxford: Blackwell.
  • Dewey, John (1998) The Essential Dewey, two volumes, Larry Hickman and Thomas M. Alexander eds. Indiana University Press.
  • James, William (1995) Pragmatism: A New Name for Some Old Ways of Thinking. New York: Dover Publications.
    • Lectures.
  • Peirce, C. S. (1931–58) The Collected Papers of Charles Sanders Peirce, eds. C. Hartshorne, P. Weiss (Vols. 1–6) and A. Burks (Vols. 7–8). Cambridge MA: Harvard University Press.
  • Rorty, Richard (1980) Philosophy and the Mirror of Nature. Oxford: Blackwell.
    • Rorty’s magnum opus.
  • Rorty, Richard (1991a) Consequences of Pragmatism (Essays: 1972–1980). Hemel Hempstead, UK: Harvester Wheatsheaf.
  • Rorty, Richard (1991b) ‘The Priority of Democracy to Philosophy’, pp. 175–196 of his Objectivity, Relativism, and Truth. Philosophical Papers, Volume 1. Cambridge, New York and Melbourne: Cambridge University Press.
  • Rorty, Richard (1998) Achieving Our Country. Leftist Thought in Twentieth-Century America. Cambridge MA and London: Harvard University Press.
  • Rorty, Richard (2007) Philosophy as Cultural Politics. Philosophical Papers, Volume 4. Cambridge: Cambridge University Press.
  • Talisse, Robert B. and Aikin, Scott F. (2008) Pragmatism: A Guide for the Perplexed. Continuum: London and New York.
    • Good and useful.

d. Continental Philosophy

  • Adorno, Theodor W. (1986) The Jargon of Authenticity. London and Henley: Routledge and Kegan Paul, 1986; trans. Knut Tarnowski and Frederic Will.
  • Adorno, Theodor W. and Horkheimer, Max (2002) Dialectic of Enlightenment. Philosophical Fragments. Stanford: Stanford University Press. Trans. Edmund Jephcott.
  • Adorno, Theodor W. (1976) with R. Dahrendorf, J. Habermas, H. Pilot, and K. Popper, The Positivist Dispute in German Sociology, trans. G. Adey and D. Frisby, London: Heinemann Educational Books.
    • Documents from debates between Popperians (who were not, in fact, positivists in any strict sense) and the Frankfurt School.
  • Baxter, Hugh (1987) ‘System and Life-World in Habermas’ Theory of Communicative ActionTheory and Society 16: 1 (January): 39–86.
  • Braver, Lee (2009) Heidegger’s Later Writings. A Reader’s Guide. London and New York: Continuum.
    • Accessible and helpful, yet perhaps somewhat superficial.
  • Caputo, John D (1977) ‘The Question of Being and Transcendental Phenomenology: Reflections on Heidegger’s relationship to Husserl’, Research in Phenomenology 7 (1):84–105.
  • Caputo, John D (1993) Demythologizing Heidegger. Bloomington and Indianapolis: Indiana University Press.
    • More ‘Continental’ than one might guess merely from the title.
  • Caputo, John, D (1997) ‘A Commentary’, Part Two of Derrida, Jacques (1997) Deconstruction in a Nutshell. A Conversation with Jacques Derrida. New York: Fordam University Press. Edited and with a commentary by John D. Caputo.
  • Carmen, Taylor, and B. N. Hansen eds. (2005) The Cambridge Companion to Merleau-Ponty. Cambridge, Cambridge University Press.
  • Cerbone, David (2006) Understanding Phenomenology. Chesham, UK: Acumen.
    • A good introduction to phenomenology.
  • Cooper, David (1999) Existentialism. A Reconstruction 2nd ed. Blackwell: Oxford and Malden, MA
    • Careful, argumentative, fairly accessible.
  • Davis, Colin (1996) Levinas. An Introduction. Cambridge: Polity.
    • Not only introduces Levinas but also mounts a strong challenge to him.
  • Derrida, Jacques (1976) Of Grammatology. Baltimore and London: Johns Hopkins University Press. Trans. G. C. Spivak.
  • Derrida, Jacques (1987) Positions. London: Althone. Trans. Alan Bass.
    • Three relatively early interviews with Derrida. Relatively accessible.
  • Derrida, Jacques (1988) Limited Inc. Evanston, IL: Northwestern University Press.
    • Contains Derrida’s side of an (acrimonious) debate with John Searle. Includes an Afterword wherein Derrida answers questions put to him by Gerald Graff.
  • Derrida, Jacques (1990) Writing and Difference. London: Routledge. Trans. Alan Bass.
  • Derrida, Jacques (1995) Points . . . : Interviews, 1974–1994. Trans. Peggy Kamuf et al. Stanford, CA: Stanford University Press.
  • Derrida, Jacques (1999) ‘Force of Law’ in Drucilla Cornell, Michel Rosenfeld, and David Gray Carlson eds. (1982) Deconstruction and the Possibility of Justice, New York: Routledge.
  • Dews, Peter (1987) Logics of Disintegration. Post-stucturalist Thought and the Claims of Critical Theory. London and New York: Verso.
  • Dews, Peter (1995) ‘Morality, Ethics and “Postmetaphysical Thinking”’ in his The Limits of Disenchantment. Essays on Contemporary European Philosophy. London and New York: Verso, 1995.
  • Diprose, Rosalyn and Reynolds, Jack eds. (2008) Merleau-Ponty: Key Concepts. Chesham, UK: Acumen.
  • Dubiel, Daniel (1985) Theory and Politics. Studies in the Development of Critical Theory. Cambridge MA: MIT Press.
  • Edgar, Andrew (2006) Habermas. The Key Concepts. Routledge. London and New York.
  • Elden, Stuart (2004) Understanding Henri Lefebvre: Theory and the Possible. London and New York: Continuum.
  • Evans, J. Claude (1991) Strategies of Deconstruction: Derrida and the Myth of the Voice. Minneapolis: University of Minnesota Press.
    • Detailed contestation of Derrida’s interpretation of, especially, Husserl.
  • Finlayson, Gordon (2005) Habermas: A Very Short Introduction. Oxford: Oxford University Press.
  • Finlayson, Gordon (2009) ‘Morality and Critical Theory. On the Normative Problem of Frankfurt School Social Criticism’, Telos (146: Spring): 7–41.
  • Freyenhagen, Fabian (2008) ‘Moral Philosophy’ in Deborah Cook (ed.) Theodor Adorno: Key Concepts. Stocksfield: Acumen.
    • A good and somewhat revisionist synopsis of Adorno’s moral philosophy.
  • Gadamer, Hans-Geog (1981) Reason in the Age of Science. Cambridge MA: MIT. Trans. Frederick Lawrence.
  • Geuss, Raymond (1981) The Idea of a Critical Theory. Cambridge and New York: Cambridge University Press.
  • Geuss, Raymond (2008) Philosophy and Real Politics. Princeton and Oxford: Princeton University Press.
  • Glendinning, Simon (2001) ‘Much Ado About Nothing (on Herman Philipse, Heidegger’s Philosophy of Being)’. Ratio 14 (3):281–288.
  • Haar, Michel (1993) Heidegger and the Essence of Man. New York: State University of New York Press. Trans. McNeill, William.
  • Habermas, Jürgen (1984) The Theory of Communicative Action, Volume 1: Reason and the Rationalization of Society. Cambridge: Polity. Trans. McCarthy, Thomas.
  • Habermas, Jürgen (1987a) Knowledge and Human Interests. Cambridge: Polity. Second edition. Trans. Jeremy Shapiro.
  • Habermas, Jürgen (1987b) The Philosophical Discourse of Modernity: Twelve Lectures. Cambridge: Polity Press in association with Blackwell Publishers. Trans. Frederick Lawrence.
    • One of Habermas’ more accessible – and more polemical – works.
  • Habermas, Jürgen (1992a) Autonomy and Solidarity. Interviews with Jürgen Habermas. Ed. Peter Dews. Revised edition.
    • A good place to start with Habermas.
  • Habermas, Jürgen (1992b) Postmetaphysical Thinking: Philosophical Essays. Oxford: Polity Press. Trans. William Mark Hohengarten.
  • Habermas, Jürgen (2008) Between Naturalism and Religion. Philosophical Essays. Cambridge and Malden Ma.: Polity. Trans. Ciaran Cronin.
  • Heidegger, Martin (1962) Being and Time. Oxford: Blackwell. Trans. John Macquarrie and Edward Robinson.
    • The ‘early’ Heidegger’s main work.
  • Heidegger, Martin (1966) Discourse on Thinking. A translation of Gelassenheit. New York: Harper & Row. Trans. John M. Anderson and E. Hans Freund.
  • Heidegger, Martin (1971) Poetry, Language, Thought. New York: Harper & Row. Trans. Albert Hofstadter.
  • Heidegger, Martin (1982) The Basic Problems of Phenomenology. Bloomington and Indianapolis: University of Indiana Press. Revised ed. Trans. Albert Hofstadter.
    • Close in its doctrines to Being and Time, but often considerably more accessible.
  • Heidegger, Martin (1991) Nietzsche, 4 volumes. New York: HarperCollins. Trans. David Farrell Krell.
  • Heidegger, Martin (1994) Basic Writings. London: Routledge. Revised and expanded edition.
    • Contains ‘What is Metaphysics?’, ‘Letter on Humanism’, and ‘The Question Concerning Technology’, among other texts.
  • Heidegger, Martin (2003) The End of Philosophy. Chicago: University of Chicago Press. Trans. Joan Stambaugh.
  • Held, David (1990) Introduction to Critical Theory. Cambridge: Polity.
    • Broad-brush and fairly accessible account of first-generation Critical Theory and of the relatively early Habermas.
  • Horkheimer, Max (1937) ‘Traditional and Critical Theory’ in Horkheimer, Critical Theory: Selected Essays. London and New York: Continuum, 1997.
  • Horkheimer, Max (1974) Eclipse of Reason. New York: Continuum.
    • Like Horkheimer and Adorno’s Dialectic of Enlightenment, but more accessible.
  • Husserl, Edmund (1931) Ideas. General Introduction to Pure Phenomenology. George Allen & Unwin Ltd / Humanities Press. Trans. W. R. Boyce Gibson.
    • Kluwer have produced a newer and more accurate version of this book; but the Boyce Gibson version is slightly more readable.
  • Husserl, Edmund (1970) The Crisis of the European Sciences and Transcendental Phenomenology. Evanston, IL: Northwestern University Press. Trans. David Carr.
  • Husserl, Edmund (1999a) The Idea of Phenomenology Dordrecht: Kluwer. Trans. Lee Hardy.
    • Probably Husserl’s most accessible (or least inaccessible) statement of phenomenology.
  • Husserl, Edmund (1999b) Cartesian Meditations. An Introduction to Phenomenology. Trans. Dorian Cairns. Dordrecht: Kluwer.
  • Johnson, Christopher (1993) System and Writing in the Philosophy of Jacques Derrida. Cambridge: Cambridge University Press.
  • Johnson, Christopher (1999) Derrida. The Scene of Writing. New York: Routledge.
    • Good, short, and orientated around Derrida’s Of Grammatology.
  • Landau, Iddo (1992/1993 [sic]) ‘Early and Later Deconstruction in the Writings of Jacques Derrida’, Cardozo Law Review, 14: 1895–1909.
    • Unusually clear.
  • Levinas, Emmanuel (1996) Proper Names. Stanford: Stanford University Press.
  • Malpas, Simon (2003) Jean-François Lyotard. Routledge. London and New York.
  • Marcuse, Herbert (1991) One-Dimensional Man. Second edition. Routledge: London.
    • A classic work of first-generation Critical Theory.
  • Merleau-Ponty, Maurice (2002) Phenomenology of Perception. New York: Routledge. Trans. Colin Smith.
    • Merleau-Ponty’s principal work.
  • Mulhall, Stephen (1996) Heidegger and Being and Time. Routledge: London and New York.
  • Outhwaite, William (1994) Habermas. A Critical Introduction. Cambridge. Polity.
  • Pattison, George (2000) The Later Heidegger. London and New York: Routledge.
    • A helpful introduction to ‘the later Heidegger’.
  • Philipse, Herman (1998) Heidegger’s Philosophy of Being: a Critical Interpretation. New Jersey: Princeton University Press.
    • A large, serious, and very controversial work that sets out to understand, but also to demolish much of, Heidegger. Q.v. Glendinning (2001) – which defends Heidegger.
  • Plant, Robert (Forthcoming) ‘This strange institution called “philosophy”: Derrida and the primacy of metaphilosophy’, Philosophy and Social Criticism.
  • Polt, Richard (1999) Heidegger: An Introduction. London: UCL Press.
    • Superb introduction, but light on the later Heidegger.
  • Russell, Matheson (2006) Husserl: A Guide for the Perplexed. London and New York: Continuum.
    • Excellent.
  • Sartre, Jean-Paul (1963) The Problem of Method. Trans. Hazel E. Barnes. London: Methuen.
  • Sartre, Jean-Paul (1989) Being and Nothingness. An Essay on Phenomenological Ontology. London: Routledge. Trans. Hazel E. Barnes.
    • The early Sartre’s major work.
  • Sartre, Jean-Paul (1992) Notebooks for an Ethics. Chicago and London: Chicago University Press. Trans. David Pellauer.
  • Sartre, Jean-Paul (2004) The Transcendence of the Ego. A Sketch for a Phenomenological Description. Abingdon, U.K.
  • Sartre, Jean-Paul (2007) Existentialism and Humanism. London: Methuen. Trans. Philip Mairet.
    • Sartre’s philosophy at its most accessible.
  • Smith, David (2003) Husserl and the Cartesian Meditations. London and New York: Routledge.
  • Smith, Joel (2005) ‘Merleau-Ponty and the Phenomenological Reduction’, Inquiry 48(6): 553–571.
  • Wolin, Richard, ed. (1993) The Heidegger Controversy: A Critical Reader. Cambridge MA and London: MIT Press.
    • The controversy in question concerns Heidegger’s Nazism. See also Young 1997.
  • Young, Julian (1997) Heidegger, Philosophy, Nazism. Cambridge: Cambridge University Press.
  • Young, Julian (2002) Heidegger’s Later Philosophy. Cambridge: Cambridge University Press.
    • A slim introduction to, and an attempt to make compelling, the thought of the later Heidegger.

e. Other

  • Borgmann, Albert (1984) Technology and the Character of Everyday Life: A Philo­sophical Inquiry. Chicago and London: University of Chicago Press.
    • Interesting and impassioned. Influenced by Heidegger.
  • Descartes, René (1988) The Philosophical Writings Of Descartes (3 vols). Cambridge: Cambridge University Press. Trans. John Cottingham, Robert Stoothoff, and Dugald Murdoch. Volume one.
  • Feenberg, Andrew (1999) Questioning Technology. London and New York: Routledge.
    • This book has at least one foot in the Critical Theory tradition but also appropriates some ideas from Heidegger.
  • Hume, David (1980) Dialogues Concerning Natural Religion and the Posthumous Essays ‘Of the Immortality of the Soul’ and ‘Of Suicide.’ Indianapolis: Hackett. Ed. Richard H. Popkin.
  • Kant, Immanuel Critique of Pure Reason. Various translations.
    • As is standard, the article above refers to this work using the ‘A’ and ‘B’ nomenclature. The number(s) following ‘A’ denote pages from Kant’s first edition of the text. Number(s) following ‘B’ denote pages from Kant’s second edition.
  • Locke, John (1975) An Essay Concerning Human Understanding. Oxford: Oxford University Press.
  • O’Neill, John (2003) ‘Unified Science as Political Philosophy: Positivism, Pluralism and Liberalism’, Studies in History and Philosophy of Science, vol. 34 (September): 575–596.
  • O’Neill, John and Uebel, Thomas (2004) ‘Horkheimer and Neurath: Restarting a Disrupted Debate’, European Journal of Philosophy, 12:1 75–105.
  • Petitot, Jean, Varela, Francisco, Pachoud, Bernard, and Roy, Jean-Michel eds. (2000) Naturalizing Phenomenology: Issues in Contemporary Phenomenology and Cognitive Science. Stanford: Stanford University Press.

Author and Article Information

Nicholas Joll
Email: joll.nicholas@gmail.com
United Kingdom

Article first published 17/11/2010. Last revised 01/08/2017.

An encyclopedia of philosophy articles written by professional philosophers.