1 Department of Document Studies, Linguistics, Philology and Geography, University of Rome “La Sapienza”, Rome, Italy
This paper intends to provide some data about the occurrence of 〈e〉 and 〈o〉 for Classical Latin (= CL) /ĭ/ and /ŭ/ in Latin papyri and ostraca. In order to study the incidence of some grapho-phonological phenomena within documentary texts and to examine to what extent they can be related to parameters of sociolinguistic variation, the texts have been collected in a corpus tagged for both linguistic and extralinguistic aspects. This corpus is available in the database CLaSSES, created at the FILELI Department of the University of Pisa (§ 1). The study focuses in particular on the analysis of this graphemic alternation in the Bu Njem ostraca (§ 2.1); it then turns to the qualitative analysis of three lexemes in Egyptian papyri and ostraca in which a proto-Romance merger between /ĭ/ and /ē/ in /e/ and /ŭ/ and /ō/ in /o/ in tonic position might be documented. Particular attention is paid to interference phenomena with Greek (§ 2.2).


This paper aims to add data to the framework outlined by J. N. Adams1 for the occurrence of 〈e〉 and 〈o〉 for Classical Latin (= CL) /ĭ/ and /ŭ/ in Latin papyri and ostraca. Many studies have already been devoted to this orthographic variation, especially in epigraphic texts.2 Indeed, the question may be related to the rise of the pan-Romance vowel system, with the loss of the phonological value of CL vowel length and the merger between /ĭ/ and /ē/ in /e/ and /ŭ/ and /ō/ in /o/.

According to a well-established doctrine, in the Latin vowel system the feature [±TENSE] was allophonically related to vowel length; a long vowel therefore tended to be tense and close, whereas a short vowel tended to be lax and open.3 For this reason, CL /ĭ/ and /ŭ/, probably pronounced [ɪ] and [ʊ], are often written 〈e〉 and 〈o〉 from the early inscriptions onwards.4 According to some scholars, such as E. Pulgram,5 E. Vineis6 and G. Marotta,7 the use of the graphemes 〈e〉 and 〈o〉 for CL /ĭ/ and /ŭ/ would be a clue to the phonologization of the feature [±TENSE], and subsequently of a timbre opposition, in some substandard varieties since the 3rd century BC. This phenomenon would be related to the dephonologization of vowel quantity and the emergence (in tonic position) of the pan-Romance vowel system, within a general process of drift which leads to syllabic isochrony and involves other phonological and morphoprosodic processes such as consonant gemination, reduction of super-heavy syllables and vowel shortening in final position.8 On the other hand, J. Herman9 and M. Loporcaro,10 among others, date the dephonologization of CL vowel length no earlier than the 5th century AD.

This is of course not the place to deal with such a complex debate. Nevertheless, the forms which will be discussed reveal, in our opinion, a proto-Romance merger between /ĭ/ and /ē/ in /e/ and /ŭ/ and /ō/ in /o/ in tonic position in texts dated between the 1st and the 2nd century AD. The probable phonetic spellings emerge in contexts of graphemic instability and interference, providing clues to some submerged forms.

The analyzed corpus has been tagged for both meta-linguistic11 and extra-linguistic aspects. In comparison with the corpus - available on the database CLaSSES12 - which has already been described elsewhere13 and was limited to Cugusi’s CEL,14 further data have been added: namely all the Bu Njem ostraca (O. Bu Njem), edited by R. Marichal,15 and eight Latin letters among the ostraca of Didyme (O. Did.), edited by H. Cuvigny.16 The corpus amounts to 348 documents, with 11,351 tokens in total. The definition of the textual typologies, which are shown in Table 1, mainly follows the classification put forward by P. Cugusi,17 who subdivides letters into three macro-categories: private letters, letters between private and public, and public letters. These documentary typologies have then been arranged along a continuum based on the parameter of immediacy, ranging from texts characterized by spontaneity and thematic freedom (viz. private letters) to more formal texts, characterized by a greater communicative distance (viz. official letters).18 Private letters, which belong to the so-called “ego-documents”,19 constitute the closest evidence to spoken language, and provide useful information for the analysis in a (socio)graphic and hopefully (socio)phonetic sense.20

Table 1.

Textual typologies within the corpus
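The tagging described above can be pictured as one record per attested token, carrying both the linguistic annotation (the CL vowel and the grapheme actually written) and the extra-linguistic one (textual typology, provenance). The following Python sketch is only an illustration under stated assumptions: the `Token` class, its field names and the `tally` helper are hypothetical and do not reproduce the actual CLaSSES schema, and the typology and province labels are assumed for the sake of the example. The sample forms are those discussed in § 2.2.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)
class Token:
    """One attested form with hypothetical annotation fields."""
    form: str       # spelling as attested in the document
    cl_vowel: str   # Classical Latin vowel under study: "ĭ" or "ŭ"
    grapheme: str   # grapheme actually written for that vowel
    tonic: bool     # True if the vowel is in tonic position
    typology: str   # textual typology (assumed label)
    province: str   # provenance (assumed label)


def tally(tokens):
    """Count graphemic realizations per (CL vowel, tonic?, grapheme)."""
    counts = Counter()
    for t in tokens:
        counts[(t.cl_vowel, t.tonic, t.grapheme)] += 1
    return counts


# Illustrative sample only: four forms discussed in the paper,
# with assumed typology/province labels.
tokens = [
    Token("sopera", "ŭ", "o", True, "letter", "Egypt"),
    Token("esopera", "ŭ", "o", True, "letter", "Egypt"),
    Token("supera", "ŭ", "u", True, "private letter", "Egypt"),
    Token("entro", "ĭ", "e", True, "letter", "Egypt"),
]

counts = tally(tokens)
for (vowel, tonic, grapheme), n in sorted(counts.items()):
    position = "tonic" if tonic else "non-tonic"
    print(f"CL /{vowel}/ ({position}) written <{grapheme}>: {n}")
```

Grouping by the triple (vowel, position, grapheme) mirrors the layout of a table of graphemic realizations; adding `typology` or `province` to the key would break the same counts down by extra-linguistic parameter.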



This section focuses on the analysis of the texts dated from the 1st century BC to the 3rd century AD.21 The data of the texts coming from Africa and Egypt are analyzed separately, since Latin, as is known, had a different place in the linguistic repertoire of the two provinces. While in Africa Latin was a vehicular language and probably developed into a Romance variety,22 in the eastern province of Egypt the vehicular language was mainly the Egyptian Koine23 and Latin was used as a super-high variety.24 However, recent studies on Latinisms in Egyptian Koine suggest that Latin was more widespread in everyday life than is generally supposed.25

Some preliminary remarks are necessary before the analysis of the data. The study of the grapho-phonological dimension requires a thorough evaluation of the relationship between the graphematic and the phonetic level. Two considerations should be taken into account: (a) the graphematic level is comparable to a filter, since possible variations on the (socio)phonetic level could have been normalized by the scribe, depending on his level of literacy;26 (b) some aberrant spellings could be due to an archaizing style rather than representing a phonetic spelling.27 As regards point (a), it can generally be stated that the laxer the orthographic norm (due for example to graphematic interference or poor literacy), the more easily phonetic spellings emerge.28 As to point (b), whether a form is due to a phonetic spelling or to a stylistic feature can be discerned only by considering the textual typology, the way in which the text was executed and its paleographical characteristics.

2.1 The Bu Njem ostraca

The Bu Njem ostraca represent a very interesting documentary niche, to which many scholars have paid attention.29 It is a coherent corpus composed of 151 ostraca, dated to AD 253-260, in which a situation of contact between Latin and Punic emerges. The textual typologies represented are letters (both private and official) and military documents (especially lists of soldiers). The analysis of these texts shows an overall low level of literacy,30 since misspellings are widespread in all the textual typologies.31

It has generally been pointed out that within the Bu Njem ostraca no attestations of the graphemic alternation 〈e〉 ∼ 〈i〉 and 〈o〉 ∼ 〈u〉 can be found.32 This aspect might be related to the African Latin vowel system, in which there would presumably have been a merger between long and short vowels.33 This might therefore explain the absence of graphemic clues to the merger between /ĭ/ and /ē/ in /e/ and /ŭ/ and /ō/ in /o/. Indeed, Table 2 clearly shows the absence of the allographs 〈e〉 and 〈o〉 for CL /ĭ/ and /ŭ/.

Table 2.

Graphemic realization of CL /ĭ/ and /ŭ/ in the Bu Njem ostraca

Grapheme | /ŭ/ (non-tonic) | /ŭ/ (tonic) | Grapheme | /ĭ/ (non-tonic) | /ĭ/ (tonic)

This notwithstanding, some uncertain forms in which 〈o〉 for CL /ŭ/ might be attested require further analysis. The form 〈fornus〉 for CL fŭrnus, which is attested in two texts,34 is not to be taken into account, since it is a special case.35 Even if one cannot rule out the hypothesis that 〈fornus〉 attests a merger between /ŭ/ and /ō/ in /o/ in tonic position, it is more likely that this form is due to lexical juxtaposition with formus and fornax.36 Two forms in which 〈o〉 for CL /ŭ/ in final syllable might be attested require particular attention. The first example is 〈per kamellarios〉 (O. Bu Njem 76).

The form 〈kamellarios〉 might be interpreted as a nominative singular37 rather than an accusative plural,38 in accordance with the construction per + kamellarius, well documented in these texts.39 The second one, which is generally neglected, is 〈per Pano fr(umentarium)〉 (O. Bu Njem 95). This form is interpreted by Marichal40 as an abbreviation of the genitive plural Pan(n)o(nium). P. Cugusi41 provides a more interesting interpretation: 〈Pano〉 is to be interpreted as an accusative singular (Panum), resulting from the loss of final -〈m〉 and the confusion between /ŭ/ and /o/ in final position (as in many Egyptian letters42). In fact, the name Panus (attested in Latin only in AE 2007, 1103 as a freedman’s name) is also documented in Egyptian papyri of the same period (Greek Πάνος and Coptic ⲠⲀⲚⲞⲤ43).

2.2 Letters from Roman Egypt

For reasons of brevity, it is not possible to carry out a complete analysis of the forms with ⟨e⟩ and ⟨o⟩ for CL /ĭ/ and /ŭ/ in Egyptian letters. The analysis will therefore be limited to some remarkable forms in which a possible merger between /ŭ/ and /ō/ in /o/ and /ĭ/ and /ē/ in /e/ in tonic position might be attested.

2.2.1 ⟨Σεκόνδος⟩ (P. Vindob. Lat. 135 = CEL 13)

The first noteworthy form is attested in P. Vindob. Lat. 135.44 It is a Latin letter with Greek subscriptio, dated 25 August AD 27. The Latin text is written in capitalis romana rustica, and it is characterized by constant punctuation. It deals with an act in which the soldier L. Caecilius Secundus certifies that he has contracted a debt with the soldier C. Pompeius. It is probable that L. Caecilius Secundus was not literate in Latin, so he had to turn to a scribe.45 Some morphosyntactically substandard elements emerge in the letter, such as mistakes in gender agreement:46 these could be ascribed either to Caecilius’ dictation (meaning that he did actually speak Latin but was not literate in it) or to the scribe himself. The subscriptio, written directly by Caecilius, is in Greek language and script: although the text is not easily interpretable, due to the condition of the lower part of the papyrus, the name Kα[ι]κ[ί]λιος Σεκόνδος can be traced.

It is worth paying attention to the spelling ⟨Σεκόνδος⟩. It is well known that Latin ⟨u⟩ (both /ŭ/ and /ū/) was usually transliterated via the digraph ⟨oυ⟩ in Greek texts.47 Nevertheless, the use of Greek ⟨o⟩ for transliterating Latin ⟨u⟩ (/ŭ/) is not unusual at all, and it is not generally considered evidence of the lowering of Latin /ŭ/, since the Greek vowel system had no /ŭ/, and ⟨o⟩, which represented /ŏ/, was the only grapheme which could have been used to transliterate a short vowel.48 Furthermore, the use of Greek ⟨o⟩ to transliterate Latin ⟨u⟩ (/ŭ/) is attested especially in names in which Latin /ŭ/ is followed by a consonant cluster.49 This notwithstanding, it cannot be excluded that in the spelling ⟨Σεκόνδος⟩ the use of ⟨o⟩ could be interpreted as a clue to a proto-Romance merger between /ŭ/ and /ō/ in /o/ in tonic position (cf. It. [seˈkondo]). As F. Rovai highlights,50 since the Hellenistic Greek vowel system distinguished between /o/ (⟨o⟩ / ⟨ω⟩) and /u/ (⟨oυ⟩) in qualitative rather than quantitative terms, the use of Greek ⟨o⟩ to represent Latin ⟨u⟩ (/ŭ/) could mean that Latin /ŭ/ was open enough to be perceived by a Greek speaker as /o/.

It is remarkable that the probable phonetic spellings of this personal name are documented earlier in Greek than in Latin. The form attested in this letter is one of the most ancient, together with Σεκονδίων (1st century BC)51 and Σεκόνδα (1st century AD),52 whereas the Latin documents in which the form with ⟨o⟩ is attested are dated no earlier than the 2nd century AD.

From a quantitative point of view, the spelling with ⟨o⟩ is attested more often in Greek documents than in Latin ones, as can be seen in Table 3, which covers all the attestations of the Latin lexeme SECUND- in Latin and Greek inscriptions and papyri. The percentage of presumably phonetic spellings is much higher in Greek documents (82%) than in Latin ones (18%). As a matter of fact, the adoption of a different script can bring out, as is known, phonetic phenomena that the orthographic norm would otherwise conceal.

Table 3.

Attestations of the Latin lexeme SECUND- in documentary texts

Language | ⟨Secund-⟩ / ⟨Σεκουνδ-⟩ | ⟨Second-⟩ / ⟨Σεκονδ-⟩

For this reason, in a context in which Greek and Latin interfere on the linguistic and graphematic level, the probable vulgar spelling of the Latin name Secundus emerged earlier and more conspicuously in Greek texts. In the examined letter this aspect is particularly evident: in the Latin text, written by a scribe with a good level of literacy, the name is written according to the standard orthography (⟨Secundus⟩), whereas in the subscriptio, written by Caecilius himself, who writes his own name in Greek adaptation and script, the phonetic spelling emerges.

2.2.2 ⟨sopera⟩ (P. Mich. VIII 471) and ⟨esopera⟩ (O. Did. 417)

The forms ⟨sopera⟩ and ⟨esopera⟩ can be analyzed together. The first one, already known, is attested in a letter of the well-studied archive of Claudius Tiberianus (P. Mich. VIII 471); the second one, which has been generally neglected, is documented in a Latin letter among the ostraca of Didyme (O. Did. 417).

These texts are almost contemporary, dated AD 100-125 and AD 120-125 respectively. The forms ⟨sopera⟩ and ⟨esopera⟩ can be related to CL sŭprā; it may therefore be assumed that they attest the phonetic spelling of CL /ŭ/ as [o] in tonic position (cf. It. [ˈsoːpra], [ˈsoːvra]).

Nevertheless, the presence of /e/ in ⟨sopera⟩ has raised a problem of interpretation. On the one hand, the editors Youtie and Winter53 interpreted it as a clue to the archaizing spelling of the non-syncopated ancient form (sŭpĕra > sŭprā).54 On the other, J. N. Adams argues that ⟨sopera⟩ may represent a phonetic spelling,55 and assumes that the presence of /e/ could be due to contamination with super.56 Further considerations can be made in support of this hypothesis.

First, it should be noticed that the form with /e/ is widespread: within the corpus taken into account, the CL form sŭprā is never recorded, whereas Terentianus’ ⟨sopera⟩ finds a parallel not only in the aforementioned ⟨esopera⟩ of the ostraca of Didyme,57 but also in ⟨supera⟩ of CEL 157, a private letter coming from Egypt and dated AD 167. It would therefore appear that in substandard Latin, around the 2nd century AD, the preposition supera, probably pronounced [ˈsopera], with an epenthetic /e/, was widespread. This insertion could be due both to a heterosyllabic treatment of the muta cum liquida cluster, as clearly emerges in O. Did. 417,58 and to contamination with super.

Secondly, the substandard (and phonetic) character of the spellings 〈sopera〉 and 〈esopera〉 could be inferred on the one hand from the multitude of misspellings in these texts and, on the other hand, from the function of these prepositions. Indeed, both 〈sopera〉 and 〈esopera〉 introduce the topic.59 The use of super (and consequently of supra) to introduce the topic was generally considered a typical element of the sermo familiaris (Cicero uses it only in private letters).60 Furthermore, Sextus Pompeius Festus, who was almost contemporary with these letters, claims that the use of super to introduce the topic is due to Greek influence,61 and the examined texts are clearly representative of a situation of contact between Latin and Greek.

In such a context, it is therefore conceivable that 〈sopera〉 and 〈esopera〉 are phonetic spellings, in which the merger between /ŭ/ and /ō/ in /o/ in tonic position might emerge. It is remarkable that 〈sopera〉 and 〈esopera〉 are the first attestations of the merger in this lexeme: other examples come from onomastic material in later inscriptions.62

2.2.3 〈entro〉 (CEL 74)

The form 〈entro〉 is attested in a letter of the well-known archive of Rustius Barbarus, dated to the middle of the 1st century AD.63 According to J. N. Adams,64 the presence of 〈e〉 is due to interference with Greek ἐν-. Nonetheless, it is more likely that the spelling 〈entro〉 represents a proto-Romance merger between /ĭ/ and /ē/ in /e/ in tonic position (cf. It. [ˈentro]), as E. Campanile had already highlighted.65 Further considerations can be made to support the hypothesis that 〈entro〉 represents a phonetic spelling. Note that in this letter the use of intro as a static adverb instead of intus is attested (ll. 13-14 chiloma entro / ha[b]et). Since, during the same period, Quintilianus condemns this use as a soloecismus,66 it is plausible that 〈entro〉 represents a trace of spoken Latin.


The analysis - both quantitative and qualitative - aimed at highlighting the relation between the emergence of substandard forms, which might have a Romance continuation, and contexts of interference with Greek. This is the case of the forms discussed in § 2.2, in which the merger between /ĭ/ and /ē/ in /e/ and /ŭ/ and /ō/ in /o/ in tonic position is attested in documents dated to the 1st and the 2nd century AD. From a quantitative point of view these are only a few remarkable data; nonetheless, the quality of the documentation and the internal coherence of the texts of the corpus allowed us to capture some rather significant details. For this reason, it cannot be excluded that the examined texts, full of substandard elements, show, as far as the investigated phenomenon is concerned, a tendency towards the proto-Romance merger in a particular portion of the lexicon. On the other hand, it is necessary in our opinion to be cautious about interpreting the absence of graphemic clues to vowel merger in the Bu Njem ostraca as an indicator of the African Latin vowel system. An attentive analysis of the texts in fact shows the presence of some uncertain forms (especially 〈per Pano〉).


