On my recent trip to Nepal I came across two inscriptions of linguistic interest.
The first is an unusual inscription in Kathmandu’s Durbar Square. This was placed here by King Pratap Malla in the 17th century. The king was a linguaphile and this poem to the goddess Kali includes words from 15 scripts and languages. According to an article in the Nepali newspaper República these are Persian, Arabic, Maithili, Kiranti, Newari, Kayathinagar (the script then used in western Nepal), Devanagri, Gaudiya, Kashmiri, Sanskrit, two different Tibetan scripts, English and French.
You can clearly make out French l’hiver ‘winter’ and automne ‘autumn’ as well as English winter.
Sadly, a significant part of this inscription has already been effaced. Indeed, the same is happening to most of the inscriptions in Durbar Square, and in spite of its UNESCO World Heritage Site status nothing is being done to protect them.
The second interesting inscription is on the pillar that the Emperor Ashoka set up in the 3rd century BC in Lumbini, the birthplace of the Buddha. This Prakrit-language proclamation releasing Lumbini from tax obligations is written in the Brahmi script. The plaque standing in front of the pillar has a Latin transliteration and translations into English and Nepali.
Any student of classical languages with a linguistics bent will delight at discovering W. Sidney Allen’s books Vox Latina and Vox Graeca that reconstruct the pronunciation of Classical Latin and Greek, respectively. Cambridge University Press has published them in relatively cheap paperbacks. However, there are two more works by this scholar that that don’t get anywhere near the attention they deserve, even though they are logical next steps.
The first is Accent and Rhythm: Prosodic Features of Latin and Greek (Cambridge University Press, 1973). Here W. Sidney Allen takes the linguistic reconstruction of Greek and Latin one step further from Vox Latina and Vox Graeca to encompass suprasegmental aspects of these languages. This book does demand a greater understanding of theory (whereas the earlier books expected little more than some knowledge of IPA), and it takes some work to apply Allen’s insights to one’s own enunciation.
The second book treats what is historicaly the third important classical language for Indo-European studies, Sanskrit. Allen’s Phonetics in Ancient India (Oxford University Press, 1953) was published years before Vox Latina and Vox Graeca, and is organized somewhat differently in that it is mainly a retelling of the already very detailed ancient Indian sources for Sanskrit pronunciation. However, Allen does engage in some detective work to clarify matters obscure in the ancient grammarians, such as the pronunciation of the visarga.
I may have come across such etymologies before, but as far as I remember, this is the first proposal I’ve seen of a Uralic loanword in Proto-Indo-European. In Ananta Śāstram: Indological and Linguistic Studies in Honour of Bertil Tikkanen ed. Klaus Karttunen (Helsinki: Finnish Oriental Society, 2010), Asko Parpola has this to say on the etymology of Finnish kaivaa ‘dig’:
The Finnish words kaiva-a ‘to dig’ and kaivo ‘digging, well, pit’ have cognates in Finnic languages, in Saami and the Volgaic and Permic languages. Ante Aikio has shown that Proto-Finno-Ugric *kajwa- can be regularly connected with Proto-Samoyedic käjwa ‘spade’, as
the change *a > *ä took place in Samoyedic before a tautosyllabic palatal consonant, thereby settling an old problem, the history and material of which is fully discussed by Aikio. Hence the etymon is an archaic Uralic nomen verbum.
What I offer here is not a new etymology, but simply a reference to an old etymology proposed as early as 1920 that was not included in the indexes of etymologically treated Finnish words by Donner and Erämetsä, and so has escaped notice in SKES and SSA. K. F. Johansson had reconstructed an archaic Proto-Indo-European heteroclitic noun *kaiw-r̥-t (nom.) ~ *kaiwn̥n-eś (gen.) on the basis of Greek and Old Indo-Aryan. Hesychius records καίατα in the sense of ‘pits, excavations, trenches, ditches’ (ὀρύγματα) or ‘landslide chasms caused by earthquake’ (ἢ τὰ ὑπὸ σεισμῶν καταρραγέντα χωρία) The plural καίατα is supposed to stand for καίϝατα, from the singular καίϝαρ. Old Indo-Aryan kevaṭa- ‘pit’ is attested in a single occurence in the oldest text, Rigveda, 6,45,7; Old Indo-Aryan e goes back to Proto-Aryan *ai and *rt has often become retroflex *ṭ. Pokorny accepts the comparison and reconstructs for Proto-Indo-European *kaiwr̥t *kaiwn̥-t. Thomas Burrow and Manfred Mayrhofer have considered the scanty evidence in both Old Indo-Aryan and Greek as too uncertain for the assumption of a PIE hetercliton. Still, Mayrhofer thinks it is possible that the words are related. Herbert Petersson also emphasizes that no trace of this etymon is found in other Indo-European languages — and Frisk points out that no corresponding PIE verbal root can be traced — while the root structure too, with a diphthong following by -w-, also looks peculiar for PIE. Petersson therefore takes this to be one of the rare cases where Proto-Indo-European is likely to have borrowed from Proto-Finno-Ugric. Mayrhofer refers to Petersson’s suggestion as noteworthy but unconfirmed. However, the confirmed Uralic origin of kajwa- and the archaic appearance of the word on both sides gives new significane to Petersson’s hypothesis.
(The title of Parpola’s contribution to this volume is ‘New Etymologies for Some Finnish Words’, pp. 305–318. In quoting it here, I have slightly abridged the text and left out the parenthetical citations for the sake of readabiity.)
In Romani: a linguistic introduction (Cambridge University Press, 2005), Yaron Matras gives several examples of how the Roma people have been very inventive with names for the countries and people encountered on their westward migration (pp. 26–27):
Characteristic of Romani is – alongside replications of nations’ self-ascription (e.g. sasitko ‘German’, njamco ‘German’, valšo ‘French’) – the widespread use of inherited or internal names for nations. Thus we find das ‘Slavs’ (cf. OIA dāsa- ‘slave’), a word play based on Greek sklavos; xoraxaj/koraxaj of unclear etymology, in the Balkans generally ‘Muslim, Turk’ and elsewhere ‘foreigner’ or ‘non-Rom’; gadžo ‘non-Rom’. Other inherited words for non-Rom include xalo (‘meagre, shabby’), also in the diminutive xaloro ‘Jew’, balamo and goro ‘Greek, non-Gypsy’; biboldo ‘Jew’ (‘unbaptised’), chindo ‘Jew’ (‘cut’ = ‘circumcised’), trušulo ‘Christian’ (cf. trušul ‘cross’), džut ‘Jew’ (possibly Iranian). Names attached to foreign countries by individual Romani groups often refer to incomprehensible speech, based on either lal- ‘dumb’ or čhib ‘tongue’: lallaro-temmen ‘Finland’ and lalero them ‘Bohemia’ (= ‘dumb land’), lalero ‘Lithuanian’, čibalo/čivalo meaning ‘Albania’ among Balkan Rom, ‘Bavaria’ among German Rom, and ‘Germany’ among Yugoslav Rom. More recently, barvale thema (lit. ‘rich countries’) has emerged as a designation for ‘western Europe’, lole thema (lit. ‘red countries’) for ‘eastern, communist Europe’.
Internal creations of place names are common mostly among the northwestern dialects of Romani. They are frequently either translations, or semantic or sound associations based on the original place names: nevo foro lit. ‘new town’ for ‘Neustadt’, xačerdino them lit. ‘burned country’ for ‘Brandenburg’, čovaxanjakro them lit. ‘witches’ country’ for ‘Hessen’ (German Hexen ‘witches’), kiralengro them lit. ‘cheese country’ for ‘Switzerland’, u baro rašaj lit. ‘the big priest’ for ‘Rome’, lulo piro lit. ‘red foot’ for ‘Redford’, baro foro lit. ‘big town’ for capital cities of various countries (Helsinki, Stockholm, Belgrade).
I remember thinking how it cool it was that the Chuvash coined the name чул хула ‘stone city’ for Nižnyj Novgorod, once the closest large Russian settlement, and how disappointed I was to hear that it was no longer in use. I wonder how many of these Romani examples are still current.
Sheldon Pollock, one of the most prominent scholars of Sanskrit literature today, has contributed a jeremiad entitled ‘Crisis in the Classics’ to the journal Social Research Vol. 78 No. 1 (Spring 2011) on the decline of classical philology in India. The article is available as a PDF and its 28 pages have so much good material that I can hardly decide what to quote here, but here’s the heart of Pollock’s observations:
Indeed, there have been no successors to any of the pre-independence generation of Sanskrit scholars, the sort who mastered their discipline and thought conceptually about it and wrote for an international audience: S. N. Dasgupta, S. K. De, Mysore Hiriyanna, P. V. Kane, S. Radhakrishnan, Venkata Raghavan, C. Kunhan Raja, V. S. Sukthankar, are the first in a long and distinguished list from across India (I leave aside the loss of the great tradition of pandit learning, which is now virtually extinct). There have been no major Sanskrit projects in India since the completion of the critical edition of the Ramayana at Baroda more than 30 years ago. All the great classical series (such as Anandasrama, Trivandrum, Gaekwad, Madras) have been more or less discontinued, and as a result the manuscripts in those collections are no longer being published. Indeed, there have been few new Indian editions of complex Sanskrit texts at all from among the scores of important manuscripts that lie unpublished in archives. In the area of hermeneutics (Mimamsa), for example, I know of no one in India today capable of editing works like those edited just a generation ago by P. N. Pattabhirama Sastry or S. Subrahmanya Sastry. (The same holds for many other areas of classical studies; with the death of A. N. Upadhye in 1975 and H. C. Bhayani in 2000, the editing of Prakrit and Apabhramsha works seems to have died too.) I have not encountered a single PhD dissertation on Sanskrit in India—and I have seen many—worthy of publication by a Western university press.
The situation is no different in the other classical languages, as I learned in the late 1990s when I organized a project on the histories of South Asian literary cultures (Pollock 2003). Our core group of colleagues was looking for others to join us who possessed a deep historical understanding of a regional language, conceptual skills, and the capacity to communicate their knowledge effectively. We were able to locate only four qualified scholars in India, and identified no one for a host of languages, including Assamese, Marathi, Newari, Oriya, and Panjabi.
I suspected as much when I visited university bookshops in India: almost no publications from the last 30 years, and heaps of decaying old editions that evidently no one wanted to buy. Online language discussions are so often overrun by Hindu fundamentalist claims that Sanskrit is a divine language and India’s literature the oldest and wisest in the world, but for all the prominence of such views on the internet, this heritage is neglected in India.
In his textbook Teach Yourself Beginner’s Hindi Script Rupert Snell offers the following charming anecdote:
Legend has it that in the days of the Raj the British memsahibs, indifferent to real Hindi, would learn simple Hindi commands by assimilating them to English phrases: ‘There was a banker’ was to be interpreted by servants as representing दरवाज़ा बंद कर darvāzā band kar ‘Close the door’, and ‘There was a cold day’ meant दरवाज़ा खोल दे darvāzā khol de, ‘Open the door’. Thankfully, those days are long gone
While a Google search on the first phrase gives no results, a search on the second comes up with numerous references to this phenomenon, but only as an urban myth. If this were really a commonplace among Brits in India, one would expect it to appear in some contemporary written form, perhaps something penned as advice to recent arrivals.
It’s not often I learn anything on Wikipedia, reference to current scholarship being so rare, but an addition today to the article on Siever’s Law suggests an interesting recent development:
A second difficulty has emerged much more recently (Sihler 2006): the actual passages from the Rigveda cited in Edgerton’s two large articles in 1934 and 1943 cited as examples of the effects of his theory in action seriously misrepresent the facts in all but a handful of cases. No more than three Rigvedic passages cited in the 1934 article, and none at all in 1943, actually support the claims of Edgerton’s Law, well within the operation of pure chance. And it has been shown also that the apparent success of Lindeman’s more modest claims are not without troubling problems too, such as the limitation of the reliable examples to vocalic semivowels (the glides *y and *w) even though such alternations in the other four semivowels should have left much more persistent traces; and that the syllabified alternants (e.g. *diyēws) are very much rarer than they should be: they account for only fifteen to twenty percent of the total, when they should account for at least eighty percent. Further, only the diyēws alternants have a “distribution”: the dyēws shape shows no sensitivity to phonetic environment at all.
The work by Andrew Sihler cited here is Edgerton’s Law: The Phantom Evidence (Universitätsverlag Winter, 2006) ISBN 3-8253-5167-X. What a delightfully polemic title. I’ll have to add this to my wishlist.
LingNews.net, a conglomeration of language-related links founded by Bridget Samuels, recently linked to a Times of India article reporting the end of Sanskrit studies at Cambridge. Sad news, although I never thought before that there even was significant scholarship in Sanskrit there, since Oxford and SOAS get all the attention.
The article mentioned an online critical text of the Mahabharata, so I went ahead and searched for it. This electronic text unfortunately isn’t freely redistributable, although anyone can register to view it and save the text files to disk. Anyone interested in the Sanskrit text of the Mahabharata probably has access to a university library and can read the work more comfortably from a fine print version. What this electronic text may be useful for, I suspect, is allowing people to copy and paste needed lines into another document if they don’t want to learn the complexities of UTF-8 Devanāgarī input.
The Early Indo-European Languages Online series of short introductions put up by the A. Richard Diebold Center for Indo-European Language and Culture, Linguistics Research Center, University of Texas at Austin, has a couple of new offerings.
Introduction to the Ancient Sanskrit Texts cannot really compete with the traditional primers, but it does provide useful supporting material for the language of the Vedas. Even for those who don’t want to intend to reach proficiency in Sanskrit at this time, authors Karen Thomson and Jonathan Slocum give an engaging critique of the various interpretations and translations (including the popular Penguin edition) of the Rigveda. Their polemic tone is quite refreshing compared to the dry style of the traditional handbooks.
The innumerable texts of the Veda continue to be the subject of extensive study. However, from the point of view of understanding the Rigveda itself, this vast body of derivative material has always been, and continues to be, crucially misleading … Because the poems were put to ritual use by the ancient priests, much of their vocabulary was assumed by the authors of the later texts to refer in some way to ritual activity. The word paśú ‘beast, cattle’ came to designate a sacrificial victim in texts of the Brāhmaṇas, for example, and juhū́ ‘tongue’ was thought to mean ‘butter ladle’. Abstract words of sophisticated meaning particularly suffered. The compound puro-ḷā́ś ‘fore-worship’ (from purás ‘in front’ and √dāś ‘worship’) acquired the specific sense ‘sacrificial rice cake’, despite the fact that the word vrīhí ‘rice’, found in later texts, does not occur in the poems of the Rigveda. The complex noun krátu ‘power, intellectual ability’ was misunderstood to mean ‘sacrifice’ by the authors of the commentaries … With major pieces of the jigsaw firmly in the wrong place, the rest, inevitably, refuses to fit.
Even it doesn’t really deal with an “early IE language”, in the sense of the first language attested in a branch, there’s also Introduction to the Old French Texts. It’s a pity that while it mentions the Strasbourg Oaths, it doesn’t give the text and an analysis of it, though the other staple of early Old French, St Eulalie, is present.
Yves Codet has come up with LaTeX Unicode hyphenation patterns for Sanskrit and Prakrit in the Devanagari, Bengali, Kannada, Malayalam and Telugu scripts. These work only for the Xetex form of TeX (formerly for Mac OSX only, coming soon to Linux and Windows). Nice to see that progress continues to be made in typesetting non-European scripts with LaTeX.