Greenberg’s Universal 38 and its diachronic implications

This post is being written hastily and therefore may be more incomprehensible than usual.

Greenberg’s Universal 38 says:

Where there is a case system, the only case which ever has only zero allomorphs is the one which includes among its meanings that of the subject of the intransitive verb. (Greenberg, 1963, 59)

A slightly better way to put this (more clearly clarifying what the universal says about languages that code case by means of non-concatenative morphology) might be: if a language makes a distinction between nominative/absolutive and ergative/accusative case by means of concatenative morphology, then there is always at least one ergative/accusative suffix form with nonzero phonological substance. Roughly, there’s a preference for there to be an ergative/accusative affix rather than a nominative/absolutive affix (but it’s OK if there are phonologically substantive affixes for both cases, or if ergative/accusative is zero-coded in some but not all environments).

On the other hand, Greenberg’s statement of the universal makes clear a rather interesting property of it: if you’re thinking about which argument can be zero-coded in a transitive sentence, Universal 38 actually says that it depends on what happens in an intransitive sentence: the one which can be zero-coded is the one which takes the same case that arguments in intransitive sentences take. If the language is accusative, then the nominative, the agenty argument, can be zero-coded, and the accusative, the patienty argument, can’t. If the language is ergative, then the absolutive, the patienty argument, can be zero-coded, and the ergative argument, can’t. (I mean can’t as in can’t be zero-coded in all environments.)

This is a problem, perhaps, for those who think of overt coding preferences and other phenomena related to “markedness” (see Haspelmath, 2006, for a good discussion of the meaning of markedness in linguistics) as related to the semantics of the category values in question. Agenty vs. patienty is the semantic classification of the arguments, but depending on the morphosyntactic alignment of the language, it can be either the agenty or patienty arguments which are allowed to be zero-coded. This seems like a case where Haspelmath’s preferred explanation of all phenomena related to markedness—differences in usage frequency—is much more preferable, although I don’t think he mentions it in his paper (but I might have missed it—I’m not going to check, because I’m trying to not spend too long writing this post).

Anyway, one thing I wonder about this universal (and a thing it’s generally interesting to wonder about with respect to any universal) is how it’s diachronically preserved. For it’s quite easy to imagine ways in which a language could end up in a situation where it has a zero-coded nominative/absolutive due to natural changes. Let’s say it has both cases overtly coded to start with; let’s say the nominative suffix is -ak and the accusative suffix is -an. Now final -n gets lost, with compensatory nasalization, and then vowels in absolute word-final position get elided. (That’s a perfectly natural sequence of sound changes; it happened in the history of English, cf. Proto-Indo-European *yugóm > Proto-Germanic *juką > English yoke.) The language would then end up with nominative -ak and a zero-coded accusative, thus violating Universal 38. So… well, I don’t actually know how absolute Universal 38 is, perhaps it has some exceptions (though I don’t know of any), and if there are enough exceptions we might be able to just say that it’s these kinds of developments that are responsible for the exceptions. But if the exceptions are very few, then there’s probably some way in which languages which end up with zero-coded accusatives like this are hastily “corrected” to keep them in line with the universal. Otherwise we’d expect to see more exceptions. Here’s one interesting question: how would that correction happen? It could just be that a postposition gets re-accreted or something and the accusative ends up being overtly coded once again. But it could also be that subjects of intransitive sentences start not getting the -ak suffix added to them, so that you get a shift from accusative to ergative morphosyntactic alignment, with the zero-coded accusative becoming a perfectly Universal 38-condoned zero-coded absolutive. That’d be pretty cool: a shift in morphosyntactic alignment triggered simply by a coincidence of sound change. Is any such development attested? Somebody should have it happen in a conlang family.

According to Wichmann (2009), morphosyntactic alignment is a “stable” feature which might be a problem if alignment shifts can occur in the manner described above. But then again, I wonder how common overt coding of both nominative/absolutive and ergative/accusative is, actually—most Indo-European languages that mark cases have it, but I did a quick survey of some non-IE languages with case marking, both accusative (Finnish, Hungarian, Turkish, Tamil, Quechua) and ergative (Basque, Dyirbal) and they all seem to code nominative/absolutive by zero (well, Basque codes absolutive overtly in one of its declensions, but not in the other two). If it’s pretty rare for both to be overtly coded, then this correction doesn’t have to happen very often, but it would surely need to happen sometimes if Universal 38 is absolute or close to it.


Greenberg, J. H., 1963. Some universals of grammar with particular reference to the order of meaningful elements. In Greenberg, J. H. (ed.), Universals of Language, 73–113. MIT Press.

Haspelmath, M. (2006). Against markedness (and what to replace it with). Journal of linguistics, 42(01), 25–70.

Wichmann, S. & Holman, E. W. (2009). Temporal stability of linguistic typological features. Retrieved from


3 responses to “Greenberg’s Universal 38 and its diachronic implications

  1. Yeah, this does not seem to be a proper universal as much as a statistical one. There are at least three subtly different conterexamples already in European Uralic…

    For one, the genitive-accusative singular in Northern Sami is zero-marked in terms of suffixation, exactly due to recent loss of word-final nasals, and is expressed purely by consonant gradation. For typical bisyllabic word stems, this furthermore usually means degemination or lenition (e.g. giehta ‘hand,’ : gieđa ‘acc. sg.’; viellja ‘brother, nom. sg.’ : vielja ‘acc. sg.’) — so accusative singulars end up having less phonetic material than nominatives. (Both cases retain their suffixes in the plural, though, e.g. gieđat ‘nom. pl.’ : gieđaid ‘gen-acc. pl.’)

    For two, Estonian does this as well for nouns of the shape CVCV: gen.-acc. *-n is again lost. Also, since consonant gradation is more limited in Estonian, many words end up with entirely identical and forms.

    In longer stems, though, final vowel loss intervenes; e.g. *veljə ‘brother’ yields veli : velj-e, from earlier *velj : velje-n, showing typical stem-vowel-to-oblique-case-marker transition. So I suppose if you really wanted to defend the universal, you could argue that this has also happened in the other cases, and that wordforms like kala ‘fish, /’ are now to be analyzed as having a root √kal and, in all case forms, a suffix -a, similar to cases like päev ‘day,’ : päev-a ‘’

    For three, Livonian in some cases does this as well — except, due to the complete absense of consonant gradation, the genitive-accusative singular ends up unmarked entirely, unless some sound changes particular to the have occurred. Here some instances have moreover come about through levelling towards a lack of case marking: some nominative singulars that would have become too detached from the rest of the paradigm may have been lost, and the genitive-accusative singular has been then generalized as also the nominative singular. E.g. ‘snow,’ was still attested as *lumi > lü’m in the early 1800s, while 20th-to-21st century Livonian only has *lume-n &gt > lu’m for and both (perhaps motivated by the obfuscating recent merger ü > i).

    • Oh no, I just noticed I accidentally omitted a rather crucial word (“only”) in the quote from Greenberg’s paper. (I’ve added it in just now) So if I read you correctly, the gen./acc. sg. in Estonian and Livonian is only zero-marked for some words, and therefore these languages don’t constitute strict counterexamples to the universal… although they do show that zero-marking of the accusative can be present in quite non-marginal environments, and it would be easy to imagine it spreading by analogy to other ones. (I imagine when Greenberg makes the qualification about his universals being about presence of *only* zero allomorphs, which he also does for his universal about plural being overtly coded rather than singular, he’s thinking of cases such as the English coding of number on third-person singular verbs, or the French coding of number in a few isolated words like *os* ‘bone’, where the environment in which zero-coding of singular happens is quite a marginal one.)

  2. Since writing this I’ve discovered that counterexamples to U38 do exist, and they’re called “marked-S languages” (or “marked nominative” if they have accusative alignment, “marked absolutive” if they have ergative alignment). You can read a book about them here.

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Google+ photo

You are commenting using your Google+ account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )


Connecting to %s