ANALYTICITY WITHOUT SYNONYMY IN SIMPLE
COMPARATIVE LOGIC*
Theodore J. Everett
SUNY-Geneseo
In a recent article[1]
I gave an intuitive account of a new, simple logic of comparisons, CL. In this paper I provide some formal schemas
for the analysis of vague predicates in terms of a set of semantic relations
other than classical synonymy, all of which are best represented in CL. These relations include weak synonymy (as
between "large" and "huge"), antonymy (as between
"large" and "small"), relativity (as between
"large" and "large for a dog"), and a kind of supervenience
(as between "large" and "wide" or "long"). I use Carnapian meaning postulates to define
these relations as constraints on interpretations of the formal language of CL,
in accordance with the general formula: "the more something is F, the more (or less) it is G".
The
comparative logic CL.
Consider the following valid inference:
Frank is taller than Larry.
Larry is tall.
Therefore, Frank is tall.
I
have argued that a correct interpretation of such inferences requires that we
assign not just a traditional truth value, but also a "how much"
value for each object with respect to each (unary) predicate, plus a separate
"cut-off" value attached to the predicate itself. So, in the above example, if the cut-off for
tallness is set at six feet (or any particular height), and Larry is tall (i.e.
taller than six feet), and Frank is taller than Larry, then it follows
arithmetically that Frank is also taller than six feet, hence that Frank is
tall.
CL is a minimal comparative logic based
on these ideas, and different from standard treatments of vagueness such as fuzzy
logic, supervaluation theory, or Cresswell's
"semantics of degree". It has
the same syntax as a classical logical language L, except that the symbol > ("more than") serves as
a two-place logical connective for atomic sentences (the other comparisons: <, £, ³, and =, are defined in the obvious
way). In the semantics of CL, the
interval from 0 to 1 is used, not as an infinite set of truth values as in
fuzzy logic, but as an artificial scale of sub-values or extensions for
atomic sentences. Every interpreted
predicate letter in the language of CL is also assigned a minimum standard
in the same range (both extensions and standards can be represented explicitly
in CL using metric constants).
The truth-value of each atomic sentence is then determined by whether
its extension is at least as great as the standard for its predicate. The truth values of comparisons depend only
on the sameness or difference of the extensions of their component sentences.[2] Everything else is computed classically, on
the basis of ordinary truth values alone.
CL is thus much more conservative than Casari's
(1987) smallest system ("restricted" comparative logic), which allows
comparisons to be formed between non-quantified molecular statements as well as
atomics.[3]
The above inference might then be
translated into CL as follows:
Ta
Tb > Ta
\
Tb
If
the standard function for some interpretation assigned the value 0.60 to T, and the extension function assigned
the values 0.65 and 0.69 to the pairs <T,
a> and <T, b>,
respectively, then all three sentences would be satisfied by that
interpretation. It should be clear that
there are no allowable interpretations in which the premises would be true and
the conclusion false.
Here
is a list of the rules of CL:
I.
Syntax
A. Vocabulary
1. A
denumerable set of variables {x, y, z,
x1,...}
2. A denumerable set of constants {a, b,
c, a1,...}
3. A denumerable set of metric constants {i, j, k,
i1,...}
4. For each n 1, a denumerable set of n-place predicates
{F, G, H, F1,...}
5.
Logical symbols {Ø, &, " , >}
6. Parentheses
{(, )}
B.
Formation rules
1. If
f is an n-place predicate and t1,...tn are terms (variables or constants), ft1...tn is an atomic formula.
2. If
f and y are atomic formulas or metric constants, (f
> y) is a formula.
3. If
f is a formula, ¬f is a formula.
4. If
f and y are formulas, (f
& y) is a formula.
5. If
f is a formula and a is a variable, "af is a formula.
6. If
f is a formula in which no variable occurs
free, f is a sentence.
C. Definitions
1. (f
v y) =df
Ø(Øf
& Øy)
2. (f
® y) =df (Øf v Øy)
3. (f
« y) =df
((f
® y) & (y
® f))
4. $af =df Ø"aØf
5. (f
< y) =df
(y
> f)
6. (f
³ y) =df Ø(y
> f)
7. (f
£ y) =df
(y
³ f)
8. (f
= y) =df
((f
³ y) & (f
£ y))
II. Semantics
A. Interpretations
An interpretation is an
ordered triple <D, s*, e*>. D (the
domain) is a non-empty set. s* (the standard function) is a function from predicates to
members of E (the interval [0, 1]). e*
(the extension function) is a function (1) from constants to members of D, (2)
from n-place predicates to functions from n-tuples of
members of D to members of E, (3) from metric constants to members of E.
B. Extensions
The
extension rules define, for each interpreted atomic sentence or metric constant
f in the language, its extension e(f) as a function of the interpretation.
1. If f
is a sentence of the
form yt1...tn,
e(f) = e*(y)(e*(t1),...e*(tn)).
2. If
f is a metric constant, e(f) = e*(f).
C. Valuations
The
valuation rules define, for each interpreted sentence f in the language, its truth-value I(f) as a
function of the interpretation.
1.
If f
is a sentence of the
form yt1...tn,
I(f) =
1, if e(f) ³ s*(y)
0, if e(f) <
s*(y).
2. If f
is a sentence of the
form (y1
> y2),
I(f) =
1, if e(y1) > e(y2)
0, if e(y1) £ e(y2).
3. If f
is a sentence of the form Øy,
I(f) =
1, if I(y) = 0
0, if I(y) = 1.
4. If f
is a sentence of the
form (y1
& y2),
I(f) =
1, if I(y1) = I(y2) = 1
0, if I(y1) or I(y2) = 0.
5. If f
is a sentence of the form "ay,
I(f) =
1, if I b/d(ya/b) = 1, for all dÎD
0, if I b/d(ya/b) = 0,
for some dÎD, where b
is any constant.
Structural
definitions of semantic notions.
As a classical logical language L is
usually defined, and as I have described CL, all of the predicates in those
languages are primitive ones, in that there are no specific constraints
on their interpretations. Hence, there
is nothing in L or CL that corresponds to the meaning or intension of a
predicate in natural language, or to semantic relations such as synonymy among
expressions, or to the (non-tautological) analyticity of a sentence. These notions are controversial, but I think
that there is a reasonably neutral way of specifying what they are, and how
they relate to one another, and what it would take to include them in a formal
language system.
To begin with, it seems that
synonymy can be loosely defined as sameness of meaning, and analyticity as
truth based on meaning. The notion of
meaning itself is much more difficult - there are several conceptions within
the analytic tradition that do not seem to have much in common with one another. The most inclusive conception of meaning that
one could produce is probably this one, that the
meaning of a term in a language is specified by whatever there is in that
language that constrains the use of that term.
If we limit ourselves to core uses in scientific or descriptive
language, it seems that the meaning of an ordinary predicate is given by
whatever constrains its extension. I do
not intend this minimal definition to imply that meanings are exclusively
linguistic items - they might also be ideas or properties, or whatever else has
been suggested in the past. But it is
fair to require of anyone who claims that meanings are non-linguistic things
that they admit that those things are also in an important sense
linguistic. Otherwise, we would have to
conclude that synonymy, for example, is not at all a linguistic relation.
I do not even mean my definition to
entail that meanings are items in any absolute sense. They might be things like spatial locations, only
specifiable with relation to one another.
One can describe the spatial location of some object a by saying that it is between b and c, and that of c by
saying that it is to the left of d
with respect to e, and so on. Similarly, one might specify the meaning of a
predicate F by saying that it is
synonymous with G, and that of G by saying that it bears some other
semantic relation to H, and so
on. This sort of procedure is likely to
be circular, but not viciously so. What
we say about locations is that the physical world as a whole has a certain
spatiotemporal structure, and that the relativity applies only to its parts,
considered as separate individuals.
Similarly, we can say about meanings that a language has a certain whole
semantic structure, and that the relativity of meaning applies only to its
individual terms.
I do not need to claim that there is
nothing to the meaning of a term beyond that term's position in an overall
semantic structure. What is important is
that the structural definition is weak enough to be taken safely for granted,
yet still adequate to the analytic tasks at hand. As with locations, if it turns out that there
is some absolute correlate to the structural position of each term, then that
fact would do no damage to the structure, or to the relations within it.
A simple, very general way of
imposing a semantic structure on a formal language was devised by Rudolf Carnap (1952). All
one has to do is pick out a set of sentences (called "meaning
postulates") of the language in question, and require that they be
satisfied by all permissible interpretations.
For example, if we were to represent the English predicates
"bachelor", "married", and "man" in L by the
predicates F, G, and H, respectively, and we wanted to represent the synonymy
of "bachelor" and "unmarried man", then we could include
the sentence "x(Fx «
(Hx & ØGx)) in a set of meaning postulates for L. That formula would be the image of the
sentence "All and only bachelors are unmarried men," which is the
most standard example of analyticity in English.[4]
In general, given any structure
of a logical language (i.e. that language together with any set of meaning
postulates), the meaning of each non-logical term in that structure is given
implicitly by the meaning postulates that constrain its extension with respect
to the extensions of the other terms.[5] A sentence is analytic in that structure if
and only if it is entailed by the set of meaning postulates. A few distinct notions of synonymy arise,
reflecting different identities of meaning or extension. Two expressions n
and n
may be said to be strongly synonymous in a structure of L if and only if
the sentence "x1..."xn(x1...xn «
x1...xn)
is analytic. n and n
may be said to be partially strongly synonymous in that structure if
either "x1..."xn(x1...xn ®
x1...xn)
or "x1..."xn(x1...xn ®
x1...xn)
is analytic (so that the extension of one is necessarily a subset of the
extension of the other). Two expressions
may be said to be synonymous in the broadest sense in that structure
just in case their interpretations are constrained in identical ways, so that
each may be substituted for the other in any sentence of that structure, salva veritate. All primitive predicates (those not
constrained by meaning postulates at all) are synonymous in this broad
sense.
Semantic relations in L- and
CL-structures.
For a long time, analytic
philosophers believed, in effect, that natural language (or its descriptive, or
scientific, fragment) could be adequately modeled by some structure of L, at
least in principle. One of the problems
with this view stems from the fact that the only interesting semantic relations
that can be expressed in a language like L are full and partial strong synonymy
- one expression spelling out necessary and/or sufficient truth-conditions for
another. Therefore, most philosophers
who thought that natural language is fundamentally like L had to accept the
consequence that predicates of natural language must be either synonymous (at
least partially) with other expressions, or else completely primitive. So it was common to define the notion of
analyticity, not in terms of meanings or meaning-relations in general, but
directly in terms of synonymy. The most
famous instance is in Quine's "Two Dogmas of Empiricism" (1951):
The characteristic of such a statement
[as "no bachelor is married"] is that it can be turned into a logical
truth by putting synonyms for synonyms...We still lack a proper
characterization of this second class of analytic statements, and therewith of
analyticity generally, inasmuch as we have had in the above description to lean
on the notion of "synonymy", which is in no less need of
clarification than analyticity itself (page 23).
The problem is,
it turns out that there are not very many pairs of strongly synonymous
expressions in ordinary language. The
logical empiricists were never able to come up with satisfactory sets of
linguistically based truth-conditions for most interesting terms. Some more recent writers have taken this
failure to find synonyms to imply that the very notions of meaning and
analyticity, and the whole enterprise of the logical analysis of language, are
worthless.[6] But I claim that their arguments should be taken
to apply only to L-based conceptions of these things. Classical first-order logic is inadequate,
not just for the purpose of validating certain kinds of inference (as in the
example above), but also for the explication of many important semantic relations
among terms. In the language of CL, we
can do much better than mere strong synonymy.
Weak synonymy.
There is, for example, a kind of
loose synonymy that holds between such terms as "large" and
"huge", "old" and "ancient", and the like. These terms are not intersubstitutable - one
can say that whatever is huge is also large, but not the other way around. Yet there is clearly a similarity of meaning
between the two terms, over and above this partial strong synonymy. That relationship can be expressed in English
by saying that one thing is huger (or as huge, or less huge) than another just
in case the first is larger (or as large, or less large, respectively) than the
second. In other words, the larger something
is, the more it is huge. This form of
statement: the more something is ,
the more it is ,
has no formal analysis in the language of L.
But it does have one in CL, to wit:
"x"y((x
> y)
« (x > y)).[7]
When
employed as a meaning postulate in CL, such a sentence constrains the
interpretation of the predicates
and , to the
effect that the extensions of those predicates are required to rise and fall
together, as it were, but it says nothing at all about their standards. And this is what we want: if someone
understands the terms, he knows that if anything is large to a greater or
lesser extent than something else, then it is also huge to a greater or lesser
extent. But one does not know a
priori, beyond knowing that all huge things are large, how large things
have to be in order for them to be huge.
The English modifier
"very" is explicable in CL, according to the following schema for
meaning postulates:
"x"y((Vx ® x) & ((Vx
> Vy)
« (x > y))).
The
representative of "huge" should then be synonymous (in the broadest
sense) in CL with that of "very large", since they will have the same
meaning, to the extent that they are meaningful. But they will not be strongly synonymous in
CL, since their meanings, as I have defined them, do not fully determine their
extensions. This is intuitively right, I
think, because in ordinary language we can use such terms synonymously if we
want to, but nothing requires us to do so.
"Extremely" is explicable
in turn, according to:
"x"y((Ex ® Vx) & ((Ex
> Ey)
« (x > y))).
At
a popcorn booth, for example, we might find the terms "large",
"very large", and "huge" denoting three different
sizes. "Extremely large" might
denote a fourth size, possibly larger than "huge", possibly smaller -
it is up to the vendor. The form of
definition I am suggesting does not even entail that whatever is very (or even
extremely) must
also be to a
strictly greater extent than what is (just plain) . It is possible, consistent with the rules
of the language, to assign to weak synonyms exactly the same extension. Some people do so with the expressions
"bad headache", "very bad headache", and "extremely
bad headache", at least when describing their own cases, without any real
insincerity.
Another version of this sort of
semantic relation is the antonymy that holds between such pairs as
"large" and "small", "hot" and "cold",
and the like. Again, this relation
cannot correctly be modeled in the language of L. The closest one could get would be "x(x « Øx), which says too much - if someone is not
tall, that does not make him short - or "x(x
® Øx), which says only that one cannot be both
tall and short, which is too little.
What needs also to be said is that the terms are opposites, in that the more
something is one way, the less it is the other. And this can be expressed in CL, according to
the formula:
"x"y((x > y)
« (x < y)).
Again,
an instance of this formula, when employed as a meaning postulate for CL, would
act to constrain the extensions of
and relative
to each other, but not their standards.
We use the prefix "un-"
(as distinct from "non-") to turn any vague predicate into its weak
antonym: "happy" and "unhappy", "believable" and
"unbelievable", and the like.
This prefix can be defined schematically as follows:
"x"y((UNx > UNy)
« (x < y)).
Relativity or
"context-dependence" for predicates.
A semantic operation that can
readily be captured in CL is the relativization of
one predicate to another - "large for a dog", "expensive for a
toaster", etc. What is interesting
about such expressions is that many predicates, especially the most obviously
measurable-type adjectives like "tall", "hot", or
"old", do not seem to have determinate classical extensions unless
they are (at least implicitly, or contextually) relativized to other predicates
in that way. If we say that Frank is
tall, we might mean that he is a tall man, a tall center on a basketball team,
or a tall five-year-old child, but it does not seem that anyone or anything is
(objectively speaking) tall simpliciter.
However, such predicates do appear to make full sense all by themselves
for purposes of comparison: that thing over there may or may not be a tall one
(depending on what it is), but it is certainly taller than the table it is
standing next to.
These facts have a simple account in
the jargon of CL. What happens is that
some predicates have no standards associated with them on their own, but
can be used with different standards when relativized to different other
predicates. They can still make sense unrelativized when they occur in comparisons, just because
the evaluation of a comparison depends only on the extensions of the
predicates involved. CL already allows
for standardless predicates, and the extra semantics
are simple. The classical extension of /
(i. e. "
for a ")
should be a subset of the classical extension of , the
"how much" extension of /
should rise and fall with the "how much" extension of , and the
standard for /
should be whatever it is, regardless of the standard for . These things can be accomplished by adding
instances of the following schema to the class of meaning postulates for CL:
"x"y((/x
® x) &
((/x
> /y)
« (x > y))).
The
predicates and
/
are fundamentally just weak synonyms.
Since the standard function in an
interpretation of CL is defined for predicates generally (not just for
primitive predicates), the possibility of having different standards for
different relativizations of one predicate to others is already taken care of.[8]
The basic form of "cluster-concepts".
Another important semantic relation
that can be modeled in CL, but not in L, is the one that holds between the term
for a so-called cluster concept (or "family resemblance" concept) and
the set of terms that name the multiple criteria over which that concept
supervenes. Many difficult and
interesting concepts seem to have that structure: goodness, for example, with
respect to pleasantness and desirability; being a person, with respect to
various controversial criteria, perhaps including consciousness or
self-awareness, plus some kind of social standing; etc. All these concepts are applied to objects by
way of an overall judgement based on a number of
variable factors, no particular one being either necessary or sufficient.
Here is a somewhat artificial
example. Consider the predicate
"large", as it is applied to such things as boxes. It should be obvious that the largeness of a
box has something essentially to do with its length, width, and depth. We might sometimes identify the largeness of
the box with the product of those variables (i. e.
its volume) - but not always. If we are
trying to fit boxes into the trunk of a car, say, then the different dimensions
are likely to be weighted unequally, and we might end up saying something like,
"We'll have to strap the larger one onto the roof," referring to the
longer, wider, shallower box with lesser volume. There are nevertheless some
constraints on the possible orderings of boxes according to size. For one, it
should be clear that if one box would fit entirely inside another, then the
first cannot be larger than the second. That is, we cannot correctly say,
"This box is less long, less wide, less deep, and larger than that
one." At the same time, if we say
that a certain box is larger than another, then we cannot sensibly deny that
the first is either longer, wider, or deeper than the second.
Once again, there is no adequate way of
representing such constraints in the language of L. The closest we could get without employing
comparisons would be something like "x((1x
&...& nx) ®
x), together with "x(x ® (1x
v...v nx)). This would say that the factors 1...n are, in a yes-no way, conjunctively
sufficient, and disjunctively necessary, for the applicability of the cluster
term .[9]
But that cannot be right. To judge that a box is large overall does not
entail that one judges it long or wide or deep. If we are going by volume, for
example, then we might reasonably judge a cubical box with average dimensions
to be large enough to be large, without also judging that it is long enough to
be long, wide enough to be wide, or deep enough to be deep. Once again, our judgements
as to are not made on the basis of whether 1...n obtain; they are based on how
much each applies. The factors are
not chosen among; they are weighed. This
essential feature of cluster concepts can be modeled in CL, just by using the
comparative analogue to the above L-based analysis. The appropriate schemas are:
"x"y(((1x
> 1y)
&...& (nx > ny)) ®
(x
> y))
and "x"y((x
> y)
® ((1x
> 1y)
v...v (nx > ny))).
These
two formulas, taken together, entail that each k
(1 £ k £ n) counts for something in the determination of ,
without saying how much any of them counts.
There is no a priori assignment of weights.
Conclusion.
I will not attempt to
"define" any more interesting concepts here, so as not to introduce
extraneously controversial material. My purpose in this paper has been only to
provide a set of formal schemas that might prove useful to those with
substantive analysis in mind, and to those who wish to argue that a useful
analysis of meaning is still possible.
In general, it should be clear that modeling natural language in
CL-structures allows for sets of terms to be related analytically, but in a
sliding way, not pegged to any one set of standards. A whole system of such partial definitions
would provide not a rigid set of categories, but rather a flexible network of
terms, which can adjust to fit the world as it is seen from different points of
view, or as it changes, or as more of its features are discovered. This sort of partial, formalizable semantic
holism may be a step toward satisfying some of the critics of logical
empiricism since Quine, while retaining reasonably traditional notions of
meaning and analyticity.
NOTES
* I would like to thank David Brink,
Michael Byrd, Daniel Finer, Richard Fumerton, Carl Ginet, Sally
McConnell-Ginet, Alan Sidelle, Stephen Sullivan, Jack Temkin,
Winifred Wood, departmental audiences at Cornell and the University of Iowa,
and especially Robert Stalnaker for many useful comments on early drafts of
this material in 1982 and 1983; also three helpful reviewers for this
journal. I now refer to my system as simple
comparative logic in deference to Ettore Casari's (1987) independently developed,
much more complex and powerful framework.
[2] I am sorry for pretending that every
answer to the question "how much?" can be crammed into the range [0,
1]. Obviously, there is no absolute upper
limit for the applicability of such predicates as "tall", so Casari's (1997) use of unbounded sets of positive and
negative truth degrees has a big advantage here. I am also pretending that all predicates are
uniformly comparable, as if all that mattered were our overall degree of
confidence in every proposition. It
follows that my system cannot resolve the ambiguity of a sentence like
"Sally is taller than Jack is wide around", which may be true if both
sub-propositions are considered on the single scale of 0 to 1, but false if
both are to be measured in feet. Here
some representation of units of measurement, or at least of different scales of
comparison (as in Cresswell 1976) would seem to be
required, in addition to a single (bounded or unbounded) scale of
overall evaluation. I have excluded such
things for simplicity's sake, in the hope of making my main points with a
minimum of formal machinery. For a
detailed discussion of these issues, see Keefe 1998.
[3] See also Paoli 1996. I do not claim that comparisons formed
between non-atomic sentences cannot make sense.
A full treatment of the semantics of comparisons would have to
comprehend, at a minimum, such statements as "Frank is taller than Larry,
(by) more than Larry is taller than Sue".
I do, however, find it hard to make clear sense of comparisons formed
between traditional truth-functional molecules, such as "If Larry is tall
then Frank is short, more than it is not the case that Sue is tall".
[4] I realize that this use of meaning
postulates (along with talk of analyticity in general) will probably strike
some readers as an attempt to ride a long dead horse. But one reason for their moribund status is
their past failure to explain, in the too-simple language of classical logic,
those very features of vagueness that I claim can be analyzed in CL. Katz and Nagel (1974), for example, list this
as one of their main complaints against meaning postulates.
[7] That is the formula for one-place
expressions. For n-place expressions the
formula is: "x1..."xn"y1..."yn((x1...xn > y1...yn) «
(x1...xn > y1...yn)). Similar
expansions are possible for all the other formulas discussed in this section.
[8] My analysis of relativity here is
sharply different from Paoli's (1999), which is based on Casari's
arithmetic of truth degrees. Paoli
employs a new conjunction connective for /x
which assigns to this relativization the product
of the truth degrees of x
and x. Perhaps this works well enough in simple
inferences, though it is not intuitively clear just why it ought to. I take it as a virtue of my analysis that it
avoids the need for any special connectives.
Paoli also claims that simple predications like "A is tall"
are either relativized implicitly, or are equivalent to higher-order statements
like "A is tall by any (plausible) standard", meaning under any
plausible relativization. I think that such predications are often made
very loosely, without any particular (or general) relativization
in mind, though when pressed for a more determinate statement, we