On Morphosemantics: the Internal Meanings of Words

Emmon Bach
UMass(Amherst) / SOAS
Copenhagen Linguistic Circle
21 October 2008
© Emmon Bach 2009. All rights reserved.

Opening

  1. The Central Question
  2. The term "morphosemantics" in the title of this talk is intended to raise a fundamental question about linguistic expressions and their meanings. When we talk about the meanings of morphemes and their combination into words should we expect to find the same kinds of meanings and combinations of meanings that we associate with the processes of putting together words into phrases? The answers to this question vary widely or even wildly across different linguists and their schools or theories. For example, some linguists would say: "Decidedly not, since morphemes don't have any meanings at all. Meaning only begins with words and their combinations." Others would say: "Yes, of course! Complex words and their meanings are built up in syntax, so we predict that their complex meanings are made up uniformly across words and syntactic phrases."

    So the broad general question I want to address here is this:

    Are word-internal and word-external meanings the same?

    This question requires that we take up a number of preliminary questions about languages and grammars, about meaning, and about words. Various answers to the question also lead to basic issues about language diversity and typology. I will be leaning heavily on evidence from North American languages that enjoy rich systems of word-formation, especially languages of the Pacific Northwest. And I will take time to fill in some background on this linguistic area.

  3. Synthesis: poly- and other
  4. There are spectacular differences among language in how much complexity is accorded to words. This dimension of diversity is the source for some of the oldest typologies of language:

    and the like. Popular books on language as well as elementary textbooks of linguistics like to cite £100 words from languages like Nuuchahnulth (Nootka) and contrast them with languages like what we might call "mythical Chinese." English will do as well as Chinese for contrasts like these, citing a Northern Wakashan language Haisla (that I will return to):

    1. Lanis tlakemliselasuʼina.
    2. [lànis ƛàkemliselàsuʼina]
    3. Someone is paddling along in front of our (village) beach.

    (2) is a fair translation of (1), I'll unpack (1) later. (1) expresses in two (typographical, phonological, morphological -- see below) words what the English rendering takes nine to do. Sometimes the dimension we are talking about here is described as a difference in the number of "ideas" or "concepts" that can be expressed in a single word (Sapir 1921). Talking about concepts or ideas is not quite right, as I am sure Sapir would have agreed. Here's a simple root again from one of Sapir's favourite languages, Nuuchahnulth (Nootka):

    1. √yah̩ʷ- `to shout in a prescribed manner in the woods as a daily morning practice for a mother of twins for a year after their birth'

    Another nice meaning for a simple word from Kwakw'ala (Boas ms.: 231):

    1. nəxá `to be pleased at having attained one's end by plaguing someone.'
    Or again another simple root from Haisla:

    1. √k̕elt- `reluctant to go out (of harbour etc.) because of the weather.'

    We don't actually have to go far afield to find examples of this sort of conceptual complexity. Think of the English verb to bean; it requires a contextual reference point in the game of baseball, and means to hit a batter (what's a batter?) on the head with a pitched ball (what's that?) intentionally (?). Quite a few concepts there!

    So the synthesis dimension really has to do with the complexity of the formal manifestations or reflexes of concepts in "words." The scare quotes here anticipate the hard question: What counts as a word? We'll get to that a bit later.

  5. Basic Assumptions about Languages and Grammars
  6. In general, I follow the programme of formal grammar. A formal grammar is an explicit linking of the expressions and meanings of a language. The particular form of grammar that I assume is roughly that laid out in Richard Montague's paper "Universal Grammar" (1970), with some particular choices as to implementation. A language is described by a recursive specification of the signs of a language, starting with a lexicon as a listing of the basic signs of the language and a system of rules that define how the expressions are to be combined (recursively) into complex expressions. Each sign is a k-tuple of items specifying the phonology, phonetics, morphology, syntax and semantics of the expressions. The individual elements in the sign correspond to levels of representation and interpretation. In so far as possible the principles of combination conform to a requirement of compositionality:

    Compositionality: the form and meaning of a complex expression is a function of the form and meaning of its parts and the manner of their combination.

    An important point that I wish to stress is that the division drawn here between grammar and lexicon is not the same as the division between syntax and morphology:

    Grammar v Lexicon ≠ Syntax v Morphology

    The reason for this inequality is that the lexicon can contain items that are both smaller than and greater than the word. And the grammar can affect and combine words, phrases, and discontinuous sequences of items (compare Sadock's (1991) autolexical model).

    We'll come back to this point in a moment, but first we need to take up the problem of different meanings for "word."

  7. What is a word?
  8. There are several different meanings for the word "word" (di Sciullo and Williams 1987 and many other sources)

    These need to be distinguished:

    1. phonological word;
    2. Phonological words are part of a phonological hierarchy that reflects the organization of sounds in a language. Other terms in this hierarchy include: segment, onset, coda, syllable, foot, phonological phrase. Languages differ a good deal in how such hierarchies are defined and how they are interpreted.

      Example: Underlined and bracketed off pieces are:

      [Sally's a]   [friend of mine]
      [sǽlizə fréndəmayn]
      compare:

      Helena anodyne
      [hɛ́lənə ǽnədayn]

    3. morphological word;
    4. Morphological analyses of words use labels like these: Root, stem, affix, infix, etc. These are parallel to the bar-levels of X-bar syntax, perhaps. In addition categorial assignments are made. It is an open question whether these are the same as those of syntax proper -- N, V, etc -- or different, and if different, how they are related to the syntactic categories.

    5. syntactic word;
    6. A syntactic word is a terminal element in the syntax. This notion is heavily theory dependent so that its significance depends on complementary ideas about morphology, considered as theories about the word. One line (for example, di Sciullo and Williams 1987) has it that syntactic words are atoms and hence their internal structure and characteristics are invisible to the syntax, The principal cheats here arise by feature percolations: the syntax may not be able to look at a case morpheme, for example, in a syntactic word, but the relevant information can be conveyed by feature values. But in any case, information about morphological allomorphy cannot be available to the syntax.

    7. lexical word (lexeme, "listeme").

    As indicated already, I take a lexeme to be the basis for the recursive specification of form and meaning in a formal grammar. The lexeme is the point at which the famous Saussurean arbitrariness of the linguistic sign is most evident. This point is emphasized in the term "listeme."

    Now, various writers and theories or frameworks differ in what kinds of items are supposed to be listed in the lexicon, for example in whether items that are smaller than words should be listed in the lexicon. On the face of it, it seems that some items that are larger than words need to be listed. Some examples will make this clear. In each case I underline what I take to be a lexical item or lexeme:

      English:
    1. I will no longer put up with your behaviour!
    2. They looked up the word in the dictionary.
    3. They looked the word up in the dictionary.
    4. Please don't crack wise in this context!
    5. take ... to task
    6. it beats the shit out of ... `it baffles ...'
    7. keep tabs on ...
    8. get ... 's goat
    9. x hold x's tongue
    10. x hold x's breath

    Two kinds of word-internal elements

    It follows from our general picture of a language that there will be two kinds of word-internal elements, operations, relations, and so on. On the one hand we have those items (to choose a neutral term) that are part of the recursive specification of how to make complex expressions starting from a base of lexemes, and on the other those items that are used for making or analyzing lexemes themselves. This is the wellknown but perhaps not always clearly understood difference between inflectional or grammatical items and derivational items and processes (including compounding). The first type are part of a synchronic grammar, the second are diachronic in nature. This picture is implicit in Montague's general theory, it has been most forcefully enunciated early on by David Dowty (1988, 1989).

    Today, we will be looking almost exclusively at derivational processes.

    Now we can state a central claim about languages:

    The semantics of grammar is compositional, the semantics of derivation is not.

    We need to understand how this is meant and how it is not meant.

  9. Meanings of Meaning
  10. Generally, I follow the programme of model-theoretic semantics, which assumes that a central task of linguistic semantics is to assign (sets of) semantic values or denotations to the expressions of a language as laid out by formal or generative grammar. In line with the outline just given it also assigns possible phonetic interpretations to these expression as well as other elements that might be considered components of meaning in the pretheoretical sense: implicatures, presuppositions, indication of sociolinguistic level and so on. (Why "sets of semantic values"? Because natural languages are in general ambiguous.)

    The very idea of an explicit or formal or generative grammar has been under attack in recent years (as it was from the very beginning). I consider most of these critiques to be best answered by providing a formal theory to accomodate these criticisms to the extent that they are valid, thus providing better still formal theories. I believe that a great deal has been learned in the decades since this programme was initiated by Montague, Davidson, and others.

    Note that the programme of model-theoretic semantics, or more broadly named formal semantics presupposes a base of formal grammar. (Chomsky's thesis: a natural language can be characterized as a formal system. Montague's thesis: a natural language can be characterized as an interpreted formal system.)

    Here are some examples to make this a bit more concrete, stripped to essentials:

    To interpret simple natural languages we need at least this much machinery: a set of individals (entities), a set of truth values, a set of situations or worlds, functions built up out these elements, a set of assignments of values to variables.

    1. Guinevere is unhappy.
    2. Guinevere loves Lancelot.
    3. Guinevere loves Arthur.
    4. Guinevere might not love Lancelot.
    5. Every knight loves Guinevere.
    Expressions are evaluated relative to a world. (16) is true relative to a world if and only if Guinevere is in the set of unhappy entities in that world. (I ignore the complications introduced by tenses.) Similarly (17) and (18) are true iff Guinevere is in the set of Lancelot-lovers and the set of Arthur-lovers. Finally (19) is true in this world iff there is another world -- way of being -- in which Guinevere does not love Lancelot.

    Assignments of values to variables come in if we adopt a standard story about quantification: (20) is going to be true iff the open sentence x loves Guinevere is true on every assignment of values to x, where x is a knight.

  11. Some Languages
    1. Haisla
    2. Pacific Northwest Language Area

    3. Basics on Haisla: notes from a short dictionary

      So let's take a quick look at the formal elements involved in our initial example: the Haisla sentence, provided to me by the late Sampson Ross in Kitamaat Village, British Columbia. [Remark]

      1. Lanis tlakemliselasu'ina. (=1)
      2. [lànis ƛàkemliselàsuʼina]
        `somebody's paddling along in front of our village'
        more literally: `we're (incl.) being paddled down along the beach in front of'
           la- aux(with_ʼin(a))
            -nis -1pl-incl
           √tlak- paddle-
           -em -front
           -l -go
           -=is -on_beach
           -el(a) -continuative
           -su -passive
           -'in -a (with la- perfective/progressive)

      3. Kʼaˈqex̄damʼuasuxʼilapin ʼEbuˈkʷs Tlalemqʷʼaˈx̄s his tlʼaˈsiagʷemikax̄i.
      4. The wolf put its head on the knees of ʼEbuˈkʷs Tlalemqʷʼaˈx̄s
           kʼaˈqex̄damʼuasuxʼilapin
           √kʼaq- touch with head
           -x̄damʼua -on knees
           -su -passive
           -[x]ʼilap -momentaneous
           -in -then/now/perfective
           ʼE. Tl. Personal name (= `mother of Tl')
           his (= -s) oblique marker: here agentive/instrumental
           tlʼaˈsiagʷemik `wolf' [Story: etymological knowledge.]
           -a Primary deictic suffix
           -x̄i Secondary deictic suffix (-ax̄i `remote visible')

        So the sentence means more literally:
        `ʼEbukʷs Tlalemqʷʼax̄s was head-touched-on-knees by the wolf.'
        (From a Haisla text by Jeffrey L Legaic))

    4. Some derived words in Haisla:
    5. Haisla open class words are all based on a root or extended root followed by one or more derivational (`lexical,' `semantic') suffixes. Here are some typical examples (√marks roots, =, !, - for voicing, glottalizing, plain suffixes):

      1. √ʼik- `good'
      2.     ʼíxsdu `bright, white colored' -sdu `colour'
            ʼíxsduqʷia `bald-eagle' -qi(a) `head' cf. híxtʼi `head'
            ʼíxp̕a `tasty' -p̕a `smell'
            ʼíxp̕ala `smelling good'-p̓ala `sensing, tasting'

      3. √lhaxʷ- `hurt, love, strain'
      4. Lincoln and Rath (1986) gloss the root like this: `strong, valuable, painful, love, difficult, tight'    lhaxʷela tight, fit tightly, stormy, rough water, firmly fastened, wound up tight
           lhaxʷmala ache, be in pain, hurt, suffer
           lhaxʷmecʼuaqiala (have) headache (many items for hurting or aching in a particular part of the body)
           lhaˈwʼinaiˈ hard, difficult, close friend, like, care, love
           lhawʼinaiˈnukʷ love (someone), like
           lhaˈwʼenem, lhilaˈwʼenem husband
           lhaˈwad have husband

      5. -!ex̄d `arse, foundation, basis'
      6.     gélt̓ex̄d `tall' √gelt- `long'
            démk̓ʷex̄ `kick in the arse'
            k̓ʷenc̓ex̄d `lazy'
            q̓emc̓ex̄d `have a wrinkled bum' [story]

    6. Nuuchahnulth (aka Nootka Southern Wakashan )
    7. (A famous example repeated in many publications, from Sapir's Language (1921: 141):

      1. ʔinikʷiłm̓inih̩ʔisita
      2. `several small fires were burning in the house'
        analysis: (forms from Sapir and Swadesh, 1939)

            ʔink(ʷ)- `fire; burning'
           `-ił `in the house, on the floor'
          2 -m̓inh̩ plural
           -ʔis, -ʔic- diminutive
           -(m)it past
           -ma / -a (a: with past -(m)it)
        [some details omitted EB]

    8. Yup'ik (Eskimo-Aleut = Yuppik, in the practical orthography C' marks a geminate consonant: CC)
      1. Angyarpaliciqsugnarquq.
      2. [ángyárpalíciqsúgnarquq]
        S/He probably will make a big boat.
        analysis
            angyaq- boat-
           --rpak -big
           --li -make
           +ciqe -will
           ~+yugnarqe -probably
           --uq -3sgInd
      (The markings on the beginnings of the affixes signal various special morphonological effects and allomorphy), for example "--" marks affixes which delete the last segment of their hosts, "~" : delete final e of host, and so on.)

    9. English as a Wakashan language!
    10. Part of the value of studying a wide variety of languages is that they often throw light on each other. It is rare that a certain kind of formal or semantic feature does not find counterparts in other languages that seem at first sight to be very different. In this section I will draw attention to some global ways in which English is very different from languages like Haisla, but then take a look at English through Wakashan glasses, so to speak, and mention some derivational items that would seem very familiar to a Wakashan linguist.

      It should first be noted that Haisla and other Wakashan languages are strikingly different from English, in that they show no examples of "compounding" in the strict sense: putting together roots or stems in derivational ways. In this way, Wakashan lines up with the Eskimo-Aleut family, and is opposed to other groups in the area, such as Salishan, Tsimshianic. This fact has led some linguists to look to compounding as a source for the extensive lexical affixes of Wakashan. Whether that historical story is right or not, it is a fact that in Wakashan, the lexical affixes in general show no formal resemblance to independent words that might overlap semantically with the affix: compare the `head' affix -qia and the independent word hixt̓i `head.' It is in fact misleading to identify the affix semantically with the item meaning head, as the affix generally has an adverbial force meaning something like `on the head' or the like.

      That said, it is evident that many of the derived words of Wakashan languages can be matched most straightforwardly by English compounds. So we can easily reproduce the semantic structure of the Haisla word for baldheaded eagle by means of a compound like whitehead or (more exactly still) brightcolorhead (where we have to understand brightcolor as already having the conventionalized meaning `white'), and this would even be a common pattern for forming nouns for species of animals, birds, kinds of people, etc.: bluefin, redbreast, graybeard, paleface. What is different is that the Haisla word is built not by compounding (putting together two or more `free' forms) but by affixing a bound morpheme. So we might ask: does Haisla have something that English doesn't? There is a tradition that seems to say Yes to this question, namely Haisla (Wakashan, other families like Salishan) has things called variously `semantic' or `lexical' suffixes. More on this below. But first, some English things that are somewhat but not exactly like these suffixes.

      English (like many languages) shows special forms of morphemes that occur only in close combination inside words. Most of these are learned Latino-Graeco [!] morphemes, but there are some perfectly ordinary native ones as well.

      1. -mən, -lənd, -səl as in postman, Finland, mainsail (only for sealubbers)
      2. But the majority of these special combining forms occur in learned or `foreign' items (like Graeco-): we have series like:

      3. syntax, syntact-; syntactic, syntactico- (semantic) etc.
      4. Somewhat closer are items that occur only as combining forms. We can coin a word like leucocephalic (`white-headed'), but there are no free forms like leuk(os) or cephal(os). Japanese also has lots of Sino-Japenese morphemes that occur as such only in combination:

      5. Japanese: den `electricity'; niti `day' (denwa, densha; nitiyoobi etc.) [gratia M. Takahashi] (den is like English e- in email, etc.)
      6. Two things are different about these bound forms of English (and Japanese) as against the Haisla items: (1) in the former case the items seem more like roots or stems, whereas in Haisla there is no doubt about the assignment of the items to the class of suffixes (note that the English items can often occur as first members or later members of words); (2) the learned English (and Japanese) items seem to be marked off as participants in a special subsystem of the language (often signalled in linguistic discussions and analyses by features like Foreign, -Yamato, etc.) with special things to say about their combinatory potential and phonology. To approach the Haisla eagle word we would have to have some monstrosity like white-cephal or imagine that pate say could only occur as a suffix and make up a word like white-pate. STILL and all: English does have some genuine affixes with meanings that are quite comparable to the meanings of particular Haisla lexical suffixes. A few examples:

      7. -ade: orangeade, lemonade, limeade, grapefruitade ?
      8. Compare items like Haisla -bes meaning `liquid from or small particles of' (snow, milk, etc.), Kw -asdi `dried meat of' etc. (Names for foods seem to excite the energies of Amerenglish wordsmiths: -burger, -furter, -wich etc.).

      9. -(a)teria: cafeteria, washateria `place or establishment for'
      10. This is quite comparable to Ha and Kw suffixes like =as, =ilas `place (etc.) for.'

      11. -gate: Watergate, Whitewatergate, Irangate, skategate (= Tonyagate), Cherie-gate
      12. -er: not just agentive -er but also -er for tool names, person from, birdnames as in baker, thresher, thrasher, warbler, New Yorker
      13. -oid: humanoid, android, factoid etc.
      14. -ee: payee, employee, promisee, etc.
      15. Finally, English has at least one "phrasal" derivational suffix

      16. -ish X-ish: somewhat X, somewhat like an X greenish, boyish, Monday morning-ish
      17. English seems to be poor or entirely lacking in `adverbial' affixes like Haisla =is `on beach' =ilh `in house' but particles and prepositions in English seem to function in somewhat the same way, with idiomatic combinations acting as lexical derivatives.

  12. Conclusions: typology, parameters, and Sapir
  13. Look back at the traditional typology that I cited early on in this presentation:

    1. typology
      • analytic - synthetic - polysynthetic
      • isolating - agglutinative - fusional

      The traditional typology that we cited early in this essay is based exclusively on properties of words as phonological or morphological entities. Another dimension comes in from the perspective of this essay.

    2. parameters
    3. There was a time when it was hoped or claimed that differences among languages were to be explained on the basis of "parameters." For example, there was supposed to be a "null-subject" parameter. If a language tolerated sentences with no overt subject then a whole cluster of other properties was supposed to follow. This approach was characteristic of the (aptly-named) "Principles and Parameters" model of Chomsky and his followers, of roughly the eighties of the last century. (A few leading references: Chomsky 1982, Baker 1996, Chomsky 1995, for critique: Culicover and Jackendoff 2005.)

      Variation within languages approximates variation across languages (Bach 1974).

      An easy example in this context is afforded by languages that allow null subjects but only in certain circumstances (person, tense etc.) such as Modern Irish or Modern Hebrew. In the realm of word-order a clear example is afforded by Modern German which shows three different patterns of word-order: SVO, VSO, SOV, each of which is characteristic of a larger number of different languages (see Bach 1974 for discussion of this point). This is, however, not the place to enter into an extended discussion. Instead I give the

    4. last word to Edward Sapir
    5. An analytic language is one that either does not combine concepts into single words at all (Chinese) or does so economically (English, French). In an analytic language the sentence is always of prime importance, the word is of minor interest. In a synthetic language (Latin, Arabic, Finnish) the concepts cluster more thickly, the words are more richly chambered but there is a tendency, on the whole, to keep the range of concrete significance in a single word down to a moderate compass. A polysynthetic language, as its name impliies, is more than ordinarily synthetic. The elaboration of the word is extreme. Concepts which we should never dream of treating in a subordinate fashion are symbolized by derivational affixes or "symbolic" changes in the radical element, while the more abstract notions, including the syntactic relations, may also be conveyed by the word. A polysynthetic language illustrates no principles that are not already exemplified in the more familiar synthetic languages. It is related to them very much as a synthetic language is related to our own analytic English. The three terms are purely quantitative -- and relative, that is, a language may be "analytic" from one standpoint, "synthetic" from another. I believe the terms are more useful in defining certain drifts than as absolute counters. Edward Sapir (1921: p. 128 [paper bound edition])

References

Bach, Emmon. 1974. Syntactic Theory. New York: Holt, Rinehart and Winston,

Bach, Emmon and Wynn Chao. [2008] Semantic universals and typology. To appear in Chris Collins, Morten Christiansen and Shimon Edelman, eds., Language Universals (Oxford: Oxford University Press).

Bach, Emmon, and Wynn Chao. In press. Language universals from a semantic perspective. To appear in Maienborn, von Heusinger, Portner, eds, Semantics: International Handbook of Natural Language Meaning.

Baker, Mark C. 1996. The Polysynthesis Parameter. New York / Oxford: Oxford University Press

Boas, Franz. N.d. Kwakiutl Dictionary. n.p. Ms.

Chomsky, Noam. 1982. Lectures on Government and Binding. Dordrecht: Foris.

Chomsky, Noam. 1995. The Minimalist Program. Cambridge, Mass. and London: The MIT Press.

Culicover, Peter W., and Ray Jackendoff. 2005. Simpler Syntax. Oxford University Press.

Di Sciullo, Anna-Marie and Edwin Williams. 1987. On the Definition of Word. Cambridge, Massachusetts: MIT Press.

Dowty, David R. 1978. Lexically governed transformations as lexical rules in a Montague Grammar. Linguistic Inquiry 1: 45-78.

Dowty, David R. 1979. Word Meaning and Montague Grammar. Dordrecht: Reidel.

Jespersen, Otto. [1964] Language: its Nature, Development, and Origin. New York: Norton (reprint).

Montague, Richard. 1970. Universal Grammar. [Paper 7 in Montague, 1974.]

Montague, Richard. 1974. Formal Philosophy. Edited by Richmond H. Thomason. New Haven: Yale University Press.

Sapir, Edward S. 1921. Language: an Introduction to the Study of Speech. New York: Harcourt, Brace.

Greenberg, Joseph H. 1957. Essays in Linguistics. Chicago: University of Chicago Press.

Jacobson, Steven A. 1984. Yup'ik Eskimo Dictionary. Fairbanks: Alaska Native Language Center, University of Alaska.

Jacobson, Steven A. 1995. A Practical Grammar of the Central Alaskan Yup'ik Eskimo Language. With Yup'ik readings written by Anna W. Jacobson. Fairbanks: Alaska Native Language Center and Program, University of Alaska Fairbanks.

Lincoln, Neville J. and John C. Rath. 1986. Phonology, Dictionary and Listing of Roots and Lexical Derivates of the Haisla Language of Kitlope and Kitimaat. 2 vols.) Ottawa: National Museum of Man Mercury Series. Canadian Ethnology Service: Paper No. 103. WAK HAISLA NWAK

Radford, Andrew. 2004. Minimalist Syntax: Exploring the Structure of English. Cambridge: Cambridge University Press.

Sadock, Jerrold M. 1991. Autolexical Syntax. Chicago: University of Chicago Press.

Sapir, Edward and Morris Swadesh. 1939. Nootka Texts. Philadelphia: Linguistic Society of America.

Swadesh, Morris. 1939. Nootka internal syntax. International Journal of American Linguistics 9: 77-102.

network connections

link to these notes: http://www.people.umass.edu/ebach/courses/copenh.htm
email: ebach at linguist dot umass dot edu
link to notes for EB's recent course on Morphosemantics at Ohio State University:
http://www.people.umass.edu/ebach/courses/osu08-pl.htm
Homepage